sequence types based: Topics by Science.gov

Sample records for sequence types based

Development of Mycoplasma synoviae (MS) core genome multilocus sequence typing (cgMLST) scheme.

PubMed

Ghanem, Mostafa; El-Gazzar, Mohamed

2018-05-01

Mycoplasma synoviae (MS) is a poultry pathogen with reported increased prevalence and virulence in recent years. MS strain identification is essential for prevention, control efforts and epidemiological outbreak investigations. Multiple multilocus based sequence typing schemes have been developed for MS, yet the resolution of these schemes could be limited for outbreak investigation. The cost of whole genome sequencing became close to that of sequencing the seven MLST targets; however, there is no standardized method for typing MS strains based on whole genome sequences. In this paper, we propose a core genome multilocus sequence typing (cgMLST) scheme as a standardized and reproducible method for typing MS based whole genome sequences. A diverse set of 25 MS whole genome sequences were used to identify 302 core genome genes as cgMLST targets (35.5% of MS genome) and 44 whole genome sequences of MS isolates from six countries in four continents were used for typing applying this scheme. cgMLST based phylogenetic trees displayed a high degree of agreement with core genome SNP based analysis and available epidemiological information. cgMLST allowed evaluation of two conventional MLST schemes of MS. The high discriminatory power of cgMLST allowed differentiation between samples of the same conventional MLST type. cgMLST represents a standardized, accurate, highly discriminatory, and reproducible method for differentiation between MS isolates. Like conventional MLST, it provides stable and expandable nomenclature, allowing for comparing and sharing the typing results between different laboratories worldwide. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Typing of canine parvovirus isolates using mini-sequencing based single nucleotide polymorphism analysis.

PubMed

Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A

2012-05-01

The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.
Reads2Type: a web application for rapid microbial taxonomy identification.

PubMed

Saputra, Dhany; Rasmussen, Simon; Larsen, Mette V; Haddad, Nizar; Sperotto, Maria Maddalena; Aarestrup, Frank M; Lund, Ole; Sicheritz-Pontén, Thomas

2015-11-25

Identification of bacteria may be based on sequencing and molecular analysis of a specific locus such as 16S rRNA, or a set of loci such as in multilocus sequence typing. In the near future, healthcare institutions and routine diagnostic microbiology laboratories may need to sequence the entire genome of microbial isolates. Therefore we have developed Reads2Type, a web-based tool for taxonomy identification based on whole bacterial genome sequence data. Raw sequencing data provided by the user are mapped against a set of marker probes that are derived from currently available bacteria complete genomes. Using a dataset of 1003 whole genome sequenced bacteria from various sequencing platforms, Reads2Type was able to identify the species with 99.5 % accuracy and on the minutes time scale. In comparison with other tools, Reads2Type offers the advantage of not needing to transfer sequencing files, as the entire computational analysis is done on the computer of whom utilizes the web application. This also prevents data privacy issues to arise. The Reads2Type tool is available at http://www.cbs.dtu.dk/~dhany/reads2type.html.
Multilocus sequence typing of total-genome-sequenced bacteria.

PubMed

Larsen, Mette V; Cosentino, Salvatore; Rasmussen, Simon; Friis, Carsten; Hasman, Henrik; Marvig, Rasmus Lykke; Jelsbak, Lars; Sicheritz-Pontén, Thomas; Ussery, David W; Aarestrup, Frank M; Lund, Ole

2012-04-01

Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the "gold standard" of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.
Typing Clostridium difficile strains based on tandem repeat sequences

PubMed Central

2009-01-01

Background Genotyping of epidemic Clostridium difficile strains is necessary to track their emergence and spread. Portability of genotyping data is desirable to facilitate inter-laboratory comparisons and epidemiological studies. Results This report presents results from a systematic screen for variation in repetitive DNA in the genome of C. difficile. We describe two tandem repeat loci, designated 'TR6' and 'TR10', which display extensive sequence variation that may be useful for sequence-based strain typing. Based on an investigation of 154 C. difficile isolates comprising 75 ribotypes, tandem repeat sequencing demonstrated excellent concordance with widely used PCR ribotyping and equal discriminatory power. Moreover, tandem repeat sequences enabled the reconstruction of the isolates' largely clonal population structure and evolutionary history. Conclusion We conclude that sequence analysis of the two repetitive loci introduced here may be highly useful for routine typing of C. difficile. Tandem repeat sequence typing resolves phylogenetic diversity to a level equivalent to PCR ribotypes. DNA sequences may be stored in databases accessible over the internet, obviating the need for the exchange of reference strains. PMID:19133124
An automated genotyping tool for enteroviruses and noroviruses.

PubMed

Kroneman, A; Vennema, H; Deforche, K; v d Avoort, H; Peñaranda, S; Oberste, M S; Vinjé, J; Koopmans, M

2011-06-01

Molecular techniques are established as routine in virological laboratories and virus typing through (partial) sequence analysis is increasingly common. Quality assurance for the use of typing data requires harmonization of genotype nomenclature, and agreement on target genes, depending on the level of resolution required, and robustness of methods. To develop and validate web-based open-access typing-tools for enteroviruses and noroviruses. An automated web-based typing algorithm was developed, starting with BLAST analysis of the query sequence against a reference set of sequences from viruses in the family Picornaviridae or Caliciviridae. The second step is phylogenetic analysis of the query sequence and a sub-set of the reference sequences, to assign the enterovirus type or norovirus genotype and/or variant, with profile alignment, construction of phylogenetic trees and bootstrap validation. Typing is performed on VP1 sequences of Human enterovirus A to D, and ORF1 and ORF2 sequences of genogroup I and II noroviruses. For validation, we used the tools to automatically type sequences in the RIVM and CDC enterovirus databases and the FBVE norovirus database. Using the typing-tools, 785(99%) of 795 Enterovirus VP1 sequences, and 8154(98.5%) of 8342 norovirus sequences were typed in accordance with previously used methods. Subtyping into variants was achieved for 4439(78.4%) of 5838 NoV GII.4 sequences. The online typing-tools reliably assign genotypes for enteroviruses and noroviruses. The use of phylogenetic methods makes these tools robust to ongoing evolution. This should facilitate standardized genotyping and nomenclature in clinical and public health laboratories, thus supporting inter-laboratory comparisons. Copyright © 2011 Elsevier B.V. All rights reserved.
Update on Pneumocystis carinii f. sp. hominis Typing Based on Nucleotide Sequence Variations in Internal Transcribed Spacer Regions of rRNA Genes

PubMed Central

Lee, Chao-Hung; Helweg-Larsen, Jannik; Tang, Xing; Jin, Shaoling; Li, Baozheng; Bartlett, Marilyn S.; Lu, Jang-Jih; Lundgren, Bettina; Lundgren, Jens D.; Olsson, Mats; Lucas, Sebastian B.; Roux, Patricia; Cargnel, Antonietta; Atzori, Chiara; Matos, Olga; Smith, James W.

1998-01-01

Pneumocystis carinii f. sp. hominis isolates from 207 clinical specimens from nine countries were typed based on nucleotide sequence variations in the internal transcribed spacer regions I and II (ITS1 and ITS2, respectively) of rRNA genes. The number of ITS1 nucleotides has been revised from the previously reported 157 bp to 161 bp. Likewise, the number of ITS2 nucleotides has been changed from 177 to 192 bp. The number of ITS1 sequence types has increased from 2 to 15, and that of ITS2 has increased from 3 to 14. The 15 ITS1 sequence types are designated types A through O, and the 14 ITS2 types are named types a through n. A total of 59 types of P. carinii f. sp. hominis were found in this study. PMID:9508304
Classification of community types, successional sequences, and landscapes of the Copper River Delta, Alaska.

Treesearch

Keith. Boggs

2000-01-01

A classification of community types, successional sequences, and landscapes is presented for the piedmont of the Copper River Delta. The classification was based on a sampling of 471 sites. A total of 75 community types, 42 successional sequences, and 6 landscapes are described. The classification of community types reflects the existing vegetation communities on the...
Molecular Strain Typing of Mycobacterium tuberculosis: a Review of Frequently Used Methods

PubMed Central

2016-01-01

Tuberculosis, caused by the bacterium Mycobacterium tuberculosis, remains one of the most serious global health problems. Molecular typing of M. tuberculosis has been used for various epidemiologic purposes as well as for clinical management. Currently, many techniques are available to type M. tuberculosis. Choosing the most appropriate technique in accordance with the existing laboratory conditions and the specific features of the geographic region is important. Insertion sequence IS6110-based restriction fragment length polymorphism (RFLP) analysis is considered the gold standard for the molecular epidemiologic investigations of tuberculosis. However, other polymerase chain reaction-based methods such as spacer oligonucleotide typing (spoligotyping), which detects 43 spacer sequence-interspersing direct repeats (DRs) in the genomic DR region; mycobacterial interspersed repetitive units–variable number tandem repeats, (MIRU-VNTR), which determines the number and size of tandem repetitive DNA sequences; repetitive-sequence-based PCR (rep-PCR), which provides high-throughput genotypic fingerprinting of multiple Mycobacterium species; and the recently developed genome-based whole genome sequencing methods demonstrate similar discriminatory power and greater convenience. This review focuses on techniques frequently used for the molecular typing of M. tuberculosis and discusses their general aspects and applications. PMID:27709842
Molecular Strain Typing of Mycobacterium tuberculosis: a Review of Frequently Used Methods.

PubMed

Ei, Phyu Win; Aung, Wah Wah; Lee, Jong Seok; Choi, Go Eun; Chang, Chulhun L

2016-11-01

Tuberculosis, caused by the bacterium Mycobacterium tuberculosis, remains one of the most serious global health problems. Molecular typing of M. tuberculosis has been used for various epidemiologic purposes as well as for clinical management. Currently, many techniques are available to type M. tuberculosis. Choosing the most appropriate technique in accordance with the existing laboratory conditions and the specific features of the geographic region is important. Insertion sequence IS6110-based restriction fragment length polymorphism (RFLP) analysis is considered the gold standard for the molecular epidemiologic investigations of tuberculosis. However, other polymerase chain reaction-based methods such as spacer oligonucleotide typing (spoligotyping), which detects 43 spacer sequence-interspersing direct repeats (DRs) in the genomic DR region; mycobacterial interspersed repetitive units-variable number tandem repeats, (MIRU-VNTR), which determines the number and size of tandem repetitive DNA sequences; repetitive-sequence-based PCR (rep-PCR), which provides high-throughput genotypic fingerprinting of multiple Mycobacterium species; and the recently developed genome-based whole genome sequencing methods demonstrate similar discriminatory power and greater convenience. This review focuses on techniques frequently used for the molecular typing of M. tuberculosis and discusses their general aspects and applications.
Direct typing of Canine parvovirus (CPV) from infected dog faeces by rapid mini sequencing technique.

PubMed

V, Pavana Jyothi; S, Akila; Selvan, Malini K; Naidu, Hariprasad; Raghunathan, Shwethaa; Kota, Sathish; Sundaram, R C Raja; Rana, Samir Kumar; Raj, G Dhinakar; Srinivasan, V A; Mohana Subramanian, B

2016-12-01

Canine parvovirus (CPV) is a non-enveloped single stranded DNA virus with an icosahedral capsid. Mini-sequencing based CPV typing was developed earlier to detect and differentiate all the CPV types and FPV in a single reaction. This technique was further evaluated in the present study by performing the mini-sequencing directly from fecal samples which avoided tedious virus isolation steps by cell culture system. Fecal swab samples were collected from 84 dogs with enteritis symptoms, suggestive of parvoviral infection from different locations across India. Seventy six of these samples were positive by PCR; the subsequent mini-sequencing reaction typed 74 of them as type 2a virus, and 2 samples as type 2b. Additionally, 25 of the positive samples were typed by cycle sequencing of PCR products. Direct CPV typing from fecal samples using mini-sequencing showed 100% correlation with CPV typing by cycle sequencing. Moreover, CPV typing was achieved by mini-sequencing even with faintly positive PCR amplicons which was not possible by cycle sequencing. Therefore, the mini-sequencing technique is recommended for regular epidemiological follow up of CPV types, since the technique is rapid, highly sensitive and high capacity method for CPV typing. Copyright © 2016. Published by Elsevier B.V.
Storage and utilization of HLA genomic data--new approaches to HLA typing.

PubMed

Helmberg, W

2000-01-01

Currently available DNA-based HLA typing assays can provide detailed information about sequence motifs of a tested sample. It is still a common practice, however, for information acquired by high-resolution sequence specific oligonucleotide probe (SSOP) typing or sequence specific priming (SSP) to be presented in a low-resolution serological format. Unfortunately, this representation can lead to significant loss of useful data in many cases. An alternative to assigning allele equivalents to suchDNA typing results is simply to store the observed typing pattern and utilize the information with the help of Virtual DNA Analysis (VDA). Interpretation of the stored typing patterns can then be updated based on newly defined alleles, assuming the sequence motifs detected by the typing reagents are known. Rather than updating reagent specificities in individual laboratories, such updates should be performed in a central, publicly available sequence database. By referring to this database, HLA genomic data can then be stored and transferred between laboratories without loss of information. The 13th International Histocompatibility Workshop offers an ideal opportunity to begin building this common database for the entire human MHC.
Evaluation of an Optimal Epidemiological Typing Scheme for Legionella pneumophila with Whole-Genome Sequence Data Using Validation Guidelines

PubMed Central

Mentasti, Massimo; Tewolde, Rediat; Aslett, Martin; Harris, Simon R.; Afshar, Baharak; Underwood, Anthony; Harrison, Timothy G.

2016-01-01

Sequence-based typing (SBT), analogous to multilocus sequence typing (MLST), is the current “gold standard” typing method for investigation of legionellosis outbreaks caused by Legionella pneumophila. However, as common sequence types (STs) cause many infections, some investigations remain unresolved. In this study, various whole-genome sequencing (WGS)-based methods were evaluated according to published guidelines, including (i) a single nucleotide polymorphism (SNP)-based method, (ii) extended MLST using different numbers of genes, (iii) determination of gene presence or absence, and (iv) a kmer-based method. L. pneumophila serogroup 1 isolates (n = 106) from the standard “typing panel,” previously used by the European Society for Clinical Microbiology Study Group on Legionella Infections (ESGLI), were tested together with another 229 isolates. Over 98% of isolates were considered typeable using the SNP- and kmer-based methods. Percentages of isolates with complete extended MLST profiles ranged from 99.1% (50 genes) to 86.8% (1,455 genes), while only 41.5% produced a full profile with the gene presence/absence scheme. Replicates demonstrated that all methods offer 100% reproducibility. Indices of discrimination range from 0.972 (ribosomal MLST) to 0.999 (SNP based), and all values were higher than that achieved with SBT (0.940). Epidemiological concordance is generally inversely related to discriminatory power. We propose that an extended MLST scheme with ∼50 genes provides optimal epidemiological concordance while substantially improving the discrimination offered by SBT and can be used as part of a hierarchical typing scheme that should maintain backwards compatibility and increase discrimination where necessary. This analysis will be useful for the ESGLI to design a scheme that has the potential to become the new gold standard typing method for L. pneumophila. PMID:27280420
Evaluation of an Optimal Epidemiological Typing Scheme for Legionella pneumophila with Whole-Genome Sequence Data Using Validation Guidelines.

PubMed

David, Sophia; Mentasti, Massimo; Tewolde, Rediat; Aslett, Martin; Harris, Simon R; Afshar, Baharak; Underwood, Anthony; Fry, Norman K; Parkhill, Julian; Harrison, Timothy G

2016-08-01

Sequence-based typing (SBT), analogous to multilocus sequence typing (MLST), is the current "gold standard" typing method for investigation of legionellosis outbreaks caused by Legionella pneumophila However, as common sequence types (STs) cause many infections, some investigations remain unresolved. In this study, various whole-genome sequencing (WGS)-based methods were evaluated according to published guidelines, including (i) a single nucleotide polymorphism (SNP)-based method, (ii) extended MLST using different numbers of genes, (iii) determination of gene presence or absence, and (iv) a kmer-based method. L. pneumophila serogroup 1 isolates (n = 106) from the standard "typing panel," previously used by the European Society for Clinical Microbiology Study Group on Legionella Infections (ESGLI), were tested together with another 229 isolates. Over 98% of isolates were considered typeable using the SNP- and kmer-based methods. Percentages of isolates with complete extended MLST profiles ranged from 99.1% (50 genes) to 86.8% (1,455 genes), while only 41.5% produced a full profile with the gene presence/absence scheme. Replicates demonstrated that all methods offer 100% reproducibility. Indices of discrimination range from 0.972 (ribosomal MLST) to 0.999 (SNP based), and all values were higher than that achieved with SBT (0.940). Epidemiological concordance is generally inversely related to discriminatory power. We propose that an extended MLST scheme with ∼50 genes provides optimal epidemiological concordance while substantially improving the discrimination offered by SBT and can be used as part of a hierarchical typing scheme that should maintain backwards compatibility and increase discrimination where necessary. This analysis will be useful for the ESGLI to design a scheme that has the potential to become the new gold standard typing method for L. pneumophila. Copyright © 2016 David et al.
Phenotypic H-Antigen Typing by Mass Spectrometry Combined with Genetic Typing of H Antigens, O Antigens, and Toxins by Whole-Genome Sequencing Enhances Identification of Escherichia coli Isolates.

PubMed

Cheng, Keding; Chui, Huixia; Domish, Larissa; Sloan, Angela; Hernandez, Drexler; McCorrister, Stuart; Robinson, Alyssia; Walker, Matthew; Peterson, Lorea A M; Majcher, Miles; Ratnam, Sam; Haldane, David J M; Bekal, Sadjia; Wylie, John; Chui, Linda; Tyler, Shaun; Xu, Bianli; Reimer, Aleisha; Nadon, Celine; Knox, J David; Wang, Gehua

2016-08-01

Mass spectrometry-based phenotypic H-antigen typing (MS-H) combined with whole-genome-sequencing-based genetic identification of H antigens, O antigens, and toxins (WGS-HOT) was used to type 60 clinical Escherichia coli isolates, 43 of which were previously identified as nonmotile, H type undetermined, or O rough by serotyping or having shown discordant MS-H and serotyping results. Whole-genome sequencing confirmed that MS-H was able to provide more accurate data regarding H antigen expression than serotyping. Further, enhanced and more confident O antigen identification resulted from gene cluster based typing in combination with conventional typing based on the gene pair comprising wzx and wzy and that comprising wzm and wzt The O antigen was identified in 94.6% of the isolates when the two genetic O typing approaches (gene pair and gene cluster) were used in conjunction, in comparison to 78.6% when the gene pair database was used alone. In addition, 98.2% of the isolates showed the existence of genes for various toxins and/or virulence factors, among which verotoxins (Shiga toxin 1 and/or Shiga toxin 2) were 100% concordant with conventional PCR based testing results. With more applications of mass spectrometry and whole-genome sequencing in clinical microbiology laboratories, this combined phenotypic and genetic typing platform (MS-H plus WGS-HOT) should be ideal for pathogenic E. coli typing. Copyright © 2016 Cheng et al.
Sequence analysis of chloroplast chlB gene of medicinal Ephedra species and its application to authentication of Ephedra Herb.

PubMed

Guo, Yahong; Tsuruga, Ayako; Yamaguchi, Shigeharu; Oba, Koji; Iwai, Kasumi; Sekita, Setsuko; Mizukami, Hajime

2006-06-01

Chloroplast chlB gene encoding subunit B of light-independent protochlorophyllide reductase was amplified from herbarium and crude drug specimens of Ephedra sinica, E. intermedia, E. equisetina, and E. przewalskii. Sequence comparison of the chlB gene indicated that all the E. sinica specimens have the same sequence type (Type S) distinctive from other species, while there are two sequence types (Type E1 and Type E2) in E. equisetina. E. intermedia and E. prezewalskii revealed an identical sequence type (Type IP). E. sinica was also identified by digesting the chlB fragment with Bcl I. A novel method for DNA authentication of Ephedra Herb based on the sequences of the chloroplast chlB gene and internal transcribed spacer of nuclear rRNA genes was developed and successfully applied for identification of the crude drugs obtained in the Chinese market.
Effectiveness of the standard and an alternative set of Streptococcus pneumoniae multi locus sequence typing primers.

PubMed

Adamiak, Paul; Vanderkooi, Otto G; Kellner, James D; Schryvers, Anthony B; Bettinger, Julie A; Alcantara, Joenel

2014-06-03

Multi-locus sequence typing (MLST) is a portable, broadly applicable method for classifying bacterial isolates at an intra-species level. This methodology provides clinical and scientific investigators with a standardized means of monitoring evolution within bacterial populations. MLST uses the DNA sequences from a set of genes such that each unique combination of sequences defines an isolate's sequence type. In order to reliably determine the sequence of a typing gene, matching sequence reads for both strands of the gene must be obtained. This study assesses the ability of both the standard, and an alternative set of, Streptococcus pneumoniae MLST primers to completely sequence, in both directions, the required typing alleles. The results demonstrated that for five (aroE, recP, spi, xpt, ddl) of the seven S. pneumoniae typing alleles, the standard primers were unable to obtain the complete forward and reverse sequences. This is due to the standard primers annealing too closely to the target regions, and current sequencing technology failing to sequence the bases that are too close to the primer. The alternative primer set described here, which includes a combination of primers proposed by the CDC and several designed as part of this study, addresses this limitation by annealing to highly conserved segments further from the target region. This primer set was subsequently employed to sequence type 105 S. pneumoniae isolates collected by the Canadian Immunization Monitoring Program ACTive (IMPACT) over a period of 18 years. The inability of several of the standard S. pneumoniae MLST primers to fully sequence the required region was consistently observed and is the result of a shift in sequencing technology occurring after the original primers were designed. The results presented here introduce clear documentation describing this phenomenon into the literature, and provide additional guidance, through the introduction of a widely validated set of alternative primers, to research groups seeking to undertake S. pneumoniae MLST based studies.
Unraveling systematic inventory of Echinops (Asteraceae) with special reference to nrDNA ITS sequence-based molecular typing of Echinops abuzinadianus.

PubMed

Ali, M A; Al-Hemaid, F M; Lee, J; Hatamleh, A A; Gyulai, G; Rahman, M O

2015-10-02

The present study explored the systematic inventory of Echinops L. (Asteraceae) of Saudi Arabia, with special reference to the molecular typing of Echinops abuzinadianus Chaudhary, an endemic species to Saudi Arabia, based on the internal transcribed spacer (ITS) sequences (ITS1-5.8S-ITS2) of nuclear ribosomal DNA. A sequence similarity search using BLAST and a phylogenetic analysis of the ITS sequence of E. abuzinadianus revealed a high level of sequence similarity with E. glaberrimus DC. (section Ritropsis). The novel primary sequence and the secondary structure of ITS2 of E. abuzinadianus could potentially be used for molecular genotyping.
Sequence of Child Care Type and Child Development: What Role Does Peer Exposure Play?

ERIC Educational Resources Information Center

Morrissey, Taryn W.

2010-01-01

Child care arrangements change as children age; in general, hours in home-based child care decrease as hours in center-based settings increase. This sequence of child care type may correspond with children's developmental needs; the small peer groups and low child-adult ratios typical of home-based care may allow for more individual child-adult…
Genetic analysis of Fasciola isolates from cattle in Korea based on second internal transcribed spacer (ITS-2) sequence of nuclear ribosomal DNA.

PubMed

Choe, Se-Eun; Nguyen, Thuy Thi-Dieu; Kang, Tae-Gyu; Kweon, Chang-Hee; Kang, Seung-Won

2011-09-01

Nuclear ribosomal DNA sequence of the second internal transcribed spacer (ITS-2) has been used efficiently to identify the liver fluke species collected from different hosts and various geographic regions. ITS-2 sequences of 19 Fasciola samples collected from Korean native cattle were determined and compared. Sequence comparison including ITS-2 sequences of isolates from this study and reference sequences from Fasciola hepatica and Fasciola gigantica and intermediate Fasciola in Genbank revealed seven identical variable sites of investigated isolates. Among 19 samples, 12 individuals had ITS-2 sequences completely identical to that of pure F. hepatica, five possessed the sequences identical to F. gigantica type, whereas two shared the sequence of both F. hepatica and F. gigantica. No variations in length and nucleotide composition of ITS-2 sequence were observed within isolates that belonged to F. hepatica or F. gigantica. At the position of 218, five Fasciola containing a single-base substitution (C>T) formed a distinct branch inside the F. gigantica-type group which was similar to those of Asian-origin isolates. The phylogenetic tree of the Fasciola spp. based on complete ITS-2 sequences from this study and other representative isolates in different locations clearly showed that pure F. hepatica, F. gigantica type and intermediate Fasciola were observed. The result also provided additional genetic evidence for the existence of three forms of Fasciola isolated from native cattle in Korea by genetic approach using ITS-2 sequence.

A PCR technique based on the Hip1 interspersed repetitive sequence distinguishes cyanobacterial species and strains.

PubMed

Smith, J K; Parry, J D; Day, J G; Smith, R J

1998-10-01

The use of primers based on the Hip1 sequence as a typing technique for cyanobacteria has been investigated. The discovery of short repetitive sequence structures in bacterial DNA during the last decade has led to the development of PCR-based methods for typing, i.e., distinguishing and identifying, bacterial species and strains. An octameric palindromic sequence known as Hip1 has been shown to be present in the chromosomal DNA of many species of cyanobacteria as a highly repetitious interspersed sequence. PCR primers were constructed that extended the Hip1 sequence at the 3' end by two bases. Five of the 16 possible extended primers were tested. Each of the five primers produced a different set of products when used to prime PCR from cyanobacterial genomic DNA. Each primer produced a distinct set of products for each of the 15 cyanobacterial species tested. The ability of Hip1-based PCR to resolve taxonomic differences was assessed by analysis of independent isolates of Anabaena flos-aquae and Nostoc ellipsosporum obtained from the CCAP (Culture Collection of Algae and Protozoa, IFE, Cumbria, UK). A PCR-based RFLP analysis of products amplified from the 23S-16S rDNA intergenic region was used to characterize the isolates and to compare with the Hip1 typing data. The RFLP and Hip1 typing yielded similar results and both techniques were able to distinguish different strains. On the basis of these results it is suggested that the Hip1 PCR technique may assist in distinguishing cyanobacterial species and strains.
Creation of a type IIS restriction endonuclease with a long recognition sequence

PubMed Central

Lippow, Shaun M.; Aha, Patti M.; Parker, Matthew H.; Blake, William J.; Baynes, Brian M.; Lipovšek, Daša

2009-01-01

Type IIS restriction endonucleases cleave DNA outside their recognition sequences, and are therefore particularly useful in the assembly of DNA from smaller fragments. A limitation of type IIS restriction endonucleases in assembly of long DNA sequences is the relative abundance of their target sites. To facilitate ligation-based assembly of extremely long pieces of DNA, we have engineered a new type IIS restriction endonuclease that combines the specificity of the homing endonuclease I-SceI with the type IIS cleavage pattern of FokI. We linked a non-cleaving mutant of I-SceI, which conveys to the chimeric enzyme its specificity for an 18-bp DNA sequence, to the catalytic domain of FokI, which cuts DNA at a defined site outside the target site. Whereas previously described chimeric endonucleases do not produce type IIS-like precise DNA overhangs suitable for ligation, our chimeric endonuclease cleaves double-stranded DNA exactly 2 and 6 nt from the target site to generate homogeneous, 5′, four-base overhangs, which can be ligated with 90% fidelity. We anticipate that these enzymes will be particularly useful in manipulation of DNA fragments larger than a thousand bases, which are very likely to contain target sites for all natural type IIS restriction endonucleases. PMID:19304757
Automated typing of red blood cell and platelet antigens: a whole-genome sequencing study.

PubMed

Lane, William J; Westhoff, Connie M; Gleadall, Nicholas S; Aguad, Maria; Smeland-Wagman, Robin; Vege, Sunitha; Simmons, Daimon P; Mah, Helen H; Lebo, Matthew S; Walter, Klaudia; Soranzo, Nicole; Di Angelantonio, Emanuele; Danesh, John; Roberts, David J; Watkins, Nick A; Ouwehand, Willem H; Butterworth, Adam S; Kaufman, Richard M; Rehm, Heidi L; Silberstein, Leslie E; Green, Robert C

2018-06-01

There are more than 300 known red blood cell (RBC) antigens and 33 platelet antigens that differ between individuals. Sensitisation to antigens is a serious complication that can occur in prenatal medicine and after blood transfusion, particularly for patients who require multiple transfusions. Although pre-transfusion compatibility testing largely relies on serological methods, reagents are not available for many antigens. Methods based on single-nucleotide polymorphism (SNP) arrays have been used, but typing for ABO and Rh-the most important blood groups-cannot be done with SNP typing alone. We aimed to develop a novel method based on whole-genome sequencing to identify RBC and platelet antigens. This whole-genome sequencing study is a subanalysis of data from patients in the whole-genome sequencing arm of the MedSeq Project randomised controlled trial (NCT01736566) with no measured patient outcomes. We created a database of molecular changes in RBC and platelet antigens and developed an automated antigen-typing algorithm based on whole-genome sequencing (bloodTyper). This algorithm was iteratively improved to address cis-trans haplotype ambiguities and homologous gene alignments. Whole-genome sequencing data from 110 MedSeq participants (30 × depth) were used to initially validate bloodTyper through comparison with conventional serology and SNP methods for typing of 38 RBC antigens in 12 blood-group systems and 22 human platelet antigens. bloodTyper was further validated with whole-genome sequencing data from 200 INTERVAL trial participants (15 × depth) with serological comparisons. We iteratively improved bloodTyper by comparing its typing results with conventional serological and SNP typing in three rounds of testing. The initial whole-genome sequencing typing algorithm was 99·5% concordant across the first 20 MedSeq genomes. Addressing discordances led to development of an improved algorithm that was 99·8% concordant for the remaining 90 MedSeq genomes. Additional modifications led to the final algorithm, which was 99·2% concordant across 200 INTERVAL genomes (or 99·9% after adjustment for the lower depth of coverage). By enabling more precise antigen-matching of patients with blood donors, antigen typing based on whole-genome sequencing provides a novel approach to improve transfusion outcomes with the potential to transform the practice of transfusion medicine. National Human Genome Research Institute, Doris Duke Charitable Foundation, National Health Service Blood and Transplant, National Institute for Health Research, and Wellcome Trust. Copyright © 2018 Elsevier Ltd. All rights reserved.
Molecular epidemiologic analysis of a Pneumocystis pneumonia outbreak among renal transplant patients.

PubMed

Urabe, N; Ishii, Y; Hyodo, Y; Aoki, K; Yoshizawa, S; Saga, T; Murayama, S Y; Sakai, K; Homma, S; Tateda, K

2016-04-01

Between 18 November and 3 December 2011, five renal transplant patients at the Department of Nephrology, Toho University Omori Medical Centre, Tokyo, were diagnosed with Pneumocystis pneumonia (PCP). We used molecular epidemiologic methods to determine whether the patients were infected with the same strain of Pneumocystis jirovecii. DNA extracted from the residual bronchoalveolar lavage fluid from the five outbreak cases and from another 20 cases of PCP between 2007 and 2014 were used for multilocus sequence typing to compare the genetic similarity of the P. jirovecii. DNA base sequencing by the Sanger method showed some regions where two bases overlapped and could not be defined. A next-generation sequencer was used to analyse the types and ratios of these overlapping bases. DNA base sequences of P. jirovecii in the bronchoalveolar lavage fluid from four of the five PCP patients in the 2011 outbreak and from another two renal transplant patients who developed PCP in 2013 were highly homologous. The Sanger method revealed 14 genomic regions where two differing DNA bases overlapped and could not be identified. Analyses of the overlapping bases by a next-generation sequencer revealed that the differing types of base were present in almost identical ratios. There is a strong possibility that the PCP outbreak at the Toho University Omori Medical Centre was caused by the same strain of P. jirovecii. Two different types of base present in some regions may be due to P. jirovecii's being a diploid species. Copyright © 2015 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
DNA-based differentiation of the Ecuadorian cocoa types CCN-51 and Arriba based on sequence differences in the chloroplast genome.

PubMed

Herrmann, Luise; Haase, Ilka; Blauhut, Maike; Barz, Nadine; Fischer, Markus

2014-12-17

Two cocoa types, Arriba and CCN-51, are being cultivated in Ecuador. With regard to the unique aroma, Arriba is considered a fine cocoa type, while CCN-51 is a bulk cocoa because of its weaker aroma. Because it is being assumed that Arriba is mixed with CCN-51, there is an interest in the analytical differentiation of the two types. Two methods to identify CCN-51 adulterations in Arriba cocoa were developed on the basis of differences in the chloroplast DNA. On the one hand, a different repeat of the sequence TAAAG in the inverted repeat region results in a different length of amplicons for the two cocoa types, which can be detected by agarose gel electrophoresis, capillary gel electrophoresis, and denaturing high-performance liquid chromatography. On the other hand, single nucleotide polymorphisms (SNPs) between the CCN-51 and Arriba sequences represent restriction sites, which can be used for restriction fragment length polymorphism analysis. A semi-quantitative analysis based on these SNPs is feasible. A method for an exact quantitation based on these results is not realizable. These sequence variations were confirmed for a comprehensive cultivar collection of Arriba and CCN-51, for both bean and leaf samples.
The Effects of CBI Lesson Sequence Type and Field Dependence on Learning from Computer-Based Cooperative Instruction in Web

ERIC Educational Resources Information Center

Ipek, Ismail

2010-01-01

The purpose of this study was to investigate the effects of CBI lesson sequence type and cognitive style of field dependence on learning from Computer-Based Cooperative Instruction (CBCI) in WEB on the dependent measures, achievement, reading comprehension and reading rate. Eighty-seven college undergraduate students were randomly assigned to…
Analysis of Pre-Analytic Factors Affecting the Success of Clinical Next-Generation Sequencing of Solid Organ Malignancies.

PubMed

Chen, Hui; Luthra, Rajyalakshmi; Goswami, Rashmi S; Singh, Rajesh R; Roy-Chowdhuri, Sinchita

2015-08-28

Application of next-generation sequencing (NGS) technology to routine clinical practice has enabled characterization of personalized cancer genomes to identify patients likely to have a response to targeted therapy. The proper selection of tumor sample for downstream NGS based mutational analysis is critical to generate accurate results and to guide therapeutic intervention. However, multiple pre-analytic factors come into play in determining the success of NGS testing. In this review, we discuss pre-analytic requirements for AmpliSeq PCR-based sequencing using Ion Torrent Personal Genome Machine (PGM) (Life Technologies), a NGS sequencing platform that is often used by clinical laboratories for sequencing solid tumors because of its low input DNA requirement from formalin fixed and paraffin embedded tissue. The success of NGS mutational analysis is affected not only by the input DNA quantity but also by several other factors, including the specimen type, the DNA quality, and the tumor cellularity. Here, we review tissue requirements for solid tumor NGS based mutational analysis, including procedure types, tissue types, tumor volume and fraction, decalcification, and treatment effects.
PHYLOViZ: phylogenetic inference and data visualization for sequence based typing methods

PubMed Central

2012-01-01

Background With the decrease of DNA sequencing costs, sequence-based typing methods are rapidly becoming the gold standard for epidemiological surveillance. These methods provide reproducible and comparable results needed for a global scale bacterial population analysis, while retaining their usefulness for local epidemiological surveys. Online databases that collect the generated allelic profiles and associated epidemiological data are available but this wealth of data remains underused and are frequently poorly annotated since no user-friendly tool exists to analyze and explore it. Results PHYLOViZ is platform independent Java software that allows the integrated analysis of sequence-based typing methods, including SNP data generated from whole genome sequence approaches, and associated epidemiological data. goeBURST and its Minimum Spanning Tree expansion are used for visualizing the possible evolutionary relationships between isolates. The results can be displayed as an annotated graph overlaying the query results of any other epidemiological data available. Conclusions PHYLOViZ is a user-friendly software that allows the combined analysis of multiple data sources for microbial epidemiological and population studies. It is freely available at http://www.phyloviz.net. PMID:22568821
Real-Time PCR Typing of Escherichia coli Based on Multiple Single Nucleotide Polymorphisms--a Convenient and Rapid Method.

PubMed

Lager, Malin; Mernelius, Sara; Löfgren, Sture; Söderman, Jan

2016-01-01

Healthcare-associated infections caused by Escherichia coli and antibiotic resistance due to extended-spectrum beta-lactamase (ESBL) production constitute a threat against patient safety. To identify, track, and control outbreaks and to detect emerging virulent clones, typing tools of sufficient discriminatory power that generate reproducible and unambiguous data are needed. A probe based real-time PCR method targeting multiple single nucleotide polymorphisms (SNP) was developed. The method was based on the multi locus sequence typing scheme of Institute Pasteur and by adaptation of previously described typing assays. An 8 SNP-panel that reached a Simpson's diversity index of 0.95 was established, based on analysis of sporadic E. coli cases (ESBL n = 27 and non-ESBL n = 53). This multi-SNP assay was used to identify the sequence type 131 (ST131) complex according to the Achtman's multi locus sequence typing scheme. However, it did not fully discriminate within the complex but provided a diagnostic signature that outperformed a previously described detection assay. Pulsed-field gel electrophoresis typing of isolates from a presumed outbreak (n = 22) identified two outbreaks (ST127 and ST131) and three different non-outbreak-related isolates. Multi-SNP typing generated congruent data except for one non-outbreak-related ST131 isolate. We consider multi-SNP real-time PCR typing an accessible primary generic E. coli typing tool for rapid and uniform type identification.
Subtyping Salmonella enterica serovar enteritidis isolates from different sources by using sequence typing based on virulence genes and clustered regularly interspaced short palindromic repeats (CRISPRs).

PubMed

Liu, Fenyun; Kariyawasam, Subhashinie; Jayarao, Bhushan M; Barrangou, Rodolphe; Gerner-Smidt, Peter; Ribot, Efrain M; Knabel, Stephen J; Dudley, Edward G

2011-07-01

Salmonella enterica subsp. enterica serovar Enteritidis is a major cause of food-borne salmonellosis in the United States. Two major food vehicles for S. Enteritidis are contaminated eggs and chicken meat. Improved subtyping methods are needed to accurately track specific strains of S. Enteritidis related to human salmonellosis throughout the chicken and egg food system. A sequence typing scheme based on virulence genes (fimH and sseL) and clustered regularly interspaced short palindromic repeats (CRISPRs)-CRISPR-including multi-virulence-locus sequence typing (designated CRISPR-MVLST)-was used to characterize 35 human clinical isolates, 46 chicken isolates, 24 egg isolates, and 63 hen house environment isolates of S. Enteritidis. A total of 27 sequence types (STs) were identified among the 167 isolates. CRISPR-MVLST identified three persistent and predominate STs circulating among U.S. human clinical isolates and chicken, egg, and hen house environmental isolates in Pennsylvania, and an ST that was found only in eggs and humans. It also identified a potential environment-specific sequence type. Moreover, cluster analysis based on fimH and sseL identified a number of clusters, of which several were found in more than one outbreak, as well as 11 singletons. Further research is needed to determine if CRISPR-MVLST might help identify the ecological origins of S. Enteritidis strains that contaminate chickens and eggs.
Integrating alignment-based and alignment-free sequence similarity measures for biological sequence classification.

PubMed

Borozan, Ivan; Watt, Stuart; Ferretti, Vincent

2015-05-01

Alignment-based sequence similarity searches, while accurate for some type of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed in many bacterial and viral genomes. Here, we propose a classification model that exploits the complementary nature of alignment-based and alignment-free similarity measures with the aim to improve the accuracy with which DNA and protein sequences are characterized. Our model classifies sequences using a combined sequence similarity score calculated by adaptively weighting the contribution of different sequence similarity measures. Weights are determined independently for each sequence in the test set and reflect the discriminatory ability of individual similarity measures in the training set. Because the similarity between some sequences is determined more accurately with one type of measure rather than another, our classifier allows different sets of weights to be associated with different sequences. Using five different similarity measures, we show that our model significantly improves the classification accuracy over the current composition- and alignment-based models, when predicting the taxonomic lineage for both short viral sequence fragments and complete viral sequences. We also show that our model can be used effectively for the classification of reads from a real metagenome dataset as well as protein sequences. All the datasets and the code used in this study are freely available at https://collaborators.oicr.on.ca/vferretti/borozan_csss/csss.html. ivan.borozan@gmail.com Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Integrating alignment-based and alignment-free sequence similarity measures for biological sequence classification

PubMed Central

Borozan, Ivan; Watt, Stuart; Ferretti, Vincent

2015-01-01

Motivation: Alignment-based sequence similarity searches, while accurate for some type of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed in many bacterial and viral genomes. Here, we propose a classification model that exploits the complementary nature of alignment-based and alignment-free similarity measures with the aim to improve the accuracy with which DNA and protein sequences are characterized. Results: Our model classifies sequences using a combined sequence similarity score calculated by adaptively weighting the contribution of different sequence similarity measures. Weights are determined independently for each sequence in the test set and reflect the discriminatory ability of individual similarity measures in the training set. Because the similarity between some sequences is determined more accurately with one type of measure rather than another, our classifier allows different sets of weights to be associated with different sequences. Using five different similarity measures, we show that our model significantly improves the classification accuracy over the current composition- and alignment-based models, when predicting the taxonomic lineage for both short viral sequence fragments and complete viral sequences. We also show that our model can be used effectively for the classification of reads from a real metagenome dataset as well as protein sequences. Availability and implementation: All the datasets and the code used in this study are freely available at https://collaborators.oicr.on.ca/vferretti/borozan_csss/csss.html. Contact: ivan.borozan@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25573913
Ribosomal DNA intergenic spacer sequence in foxtail millet, Setaria italica (L.) P. Beauv. and its characterization and application to typing of foxtail millet landraces.

PubMed

Fukunaga, Kenji; Ichitani, Katsuyuki; Taura, Satoru; Sato, Muneharu; Kawase, Makoto

2005-02-01

We determined the sequence of ribosomal DNA (rDNA) intergenic spacer (IGS) of foxtail millet isolated in our previous study, and identified subrepeats in the polymorphic region. We also developed a PCR-based method for identifying rDNA types based on sequence information and assessed 153 accessions of foxtail millet. Results were congruent with our previous works. This study provides new findings regarding the geographical distribution of rDNA variants. This new method facilitates analyses of numerous foxtail millet accessions. It is helpful for typing of foxtail millet germplasms and elucidating the evolution of this millet.
Polymorphic amplified typing sequences (PATS) and pulsed-field gel electrophoresis (PFGE) yield comparable results in the strain typing of a diverse set of bovine Escherichia coli O157 isolates

USDA-ARS?s Scientific Manuscript database

The PCR-based Escherichia coli O157 (O157) strain typing system, Polymorphic Amplified Typing Sequences (PATS), targets insertions-deletions (Indels) and single nucleotide polymorphisms (SNPs) at the XbaI and AvrII(BlnI) restriction enzyme sites, respectively, besides amplifying four known virulenc...
Comparison of Three Different Hepatitis C Virus Genotyping Methods: 5'NCR PCR-RFLP, Core Type-Specific PCR, and NS5b Sequencing in a Tertiary Care Hospital in South India.

PubMed

Daniel, Hubert D-J; David, Joel; Raghuraman, Sukanya; Gnanamony, Manu; Chandy, George M; Sridharan, Gopalan; Abraham, Priya

2017-05-01

Based on genetic heterogeneity, hepatitis C virus (HCV) is classified into seven major genotypes and 64 subtypes. In spite of the sequence heterogeneity, all genotypes share an identical complement of colinear genes within the large open reading frame. The genetic interrelationships between these genes are consistent among genotypes. Due to this property, complete sequencing of the HCV genome is not required. HCV genotypes along with subtypes are critical for planning antiviral therapy. Certain genotypes are also associated with higher progression to liver cirrhosis. In this study, 100 blood samples were collected from individuals who came for routine HCV genotype identification. These samples were used for the comparison of two different genotyping methods (5'NCR PCR-RFLP and HCV core type-specific PCR) with NS5b sequencing. Of the 100 samples genotyped using 5'NCR PCR-RFLP and HCV core type-specific PCR, 90% (κ = 0.913, P < 0.00) and 96% (κ = 0.794, P < 0.00) correlated with NS5b sequencing, respectively. Sixty percent and 75% of discordant samples by 5'NCR PCR-RFLP and HCV core type-specific PCR, respectively, belonged to genotype 6. All the HCV genotype 1 subtypes were classified accurately by both the methods. This study shows that the 5'NCR-based PCR-RFLP and the HCV core type-specific PCR-based assays correctly identified HCV genotypes except genotype 6 from this region. Direct sequencing of the HCV core region was able to identify all the genotype 6 from this region and serves as an alternative to NS5b sequencing. © 2016 Wiley Periodicals, Inc.
Noninvasive Prenatal Paternity Testing (NIPAT) through Maternal Plasma DNA Sequencing: A Pilot Study.

PubMed

Jiang, Haojun; Xie, Yifan; Li, Xuchao; Ge, Huijuan; Deng, Yongqiang; Mu, Haofang; Feng, Xiaoli; Yin, Lu; Du, Zhou; Chen, Fang; He, Nongyue

2016-01-01

Short tandem repeats (STRs) and single nucleotide polymorphisms (SNPs) have been already used to perform noninvasive prenatal paternity testing from maternal plasma DNA. The frequently used technologies were PCR followed by capillary electrophoresis and SNP typing array, respectively. Here, we developed a noninvasive prenatal paternity testing (NIPAT) based on SNP typing with maternal plasma DNA sequencing. We evaluated the influence factors (minor allele frequency (MAF), the number of total SNP, fetal fraction and effective sequencing depth) and designed three different selective SNP panels in order to verify the performance in clinical cases. Combining targeted deep sequencing of selective SNP and informative bioinformatics pipeline, we calculated the combined paternity index (CPI) of 17 cases to determine paternity. Sequencing-based NIPAT results fully agreed with invasive prenatal paternity test using STR multiplex system. Our study here proved that the maternal plasma DNA sequencing-based technology is feasible and accurate in determining paternity, which may provide an alternative in forensic application in the future.
Use of the LUS in sequence allele designations to facilitate probabilistic genotyping of NGS-based STR typing results.

PubMed

Just, Rebecca S; Irwin, Jodi A

2018-05-01

Some of the expected advantages of next generation sequencing (NGS) for short tandem repeat (STR) typing include enhanced mixture detection and genotype resolution via sequence variation among non-homologous alleles of the same length. However, at the same time that NGS methods for forensic DNA typing have advanced in recent years, many caseworking laboratories have implemented or are transitioning to probabilistic genotyping to assist the interpretation of complex autosomal STR typing results. Current probabilistic software programs are designed for length-based data, and were not intended to accommodate sequence strings as the product input. Yet to leverage the benefits of NGS for enhanced genotyping and mixture deconvolution, the sequence variation among same-length products must be utilized in some form. Here, we propose use of the longest uninterrupted stretch (LUS) in allele designations as a simple method to represent sequence variation within the STR repeat regions and facilitate - in the nearterm - probabilistic interpretation of NGS-based typing results. An examination of published population data indicated that a reference LUS region is straightforward to define for most autosomal STR loci, and that using repeat unit plus LUS length as the allele designator can represent greater than 80% of the alleles detected by sequencing. A proof of concept study performed using a freely available probabilistic software demonstrated that the LUS length can be used in allele designations when a program does not require alleles to be integers, and that utilizing sequence information improves interpretation of both single-source and mixed contributor STR typing results as compared to using repeat unit information alone. The LUS concept for allele designation maintains the repeat-based allele nomenclature that will permit backward compatibility to extant STR databases, and the LUS lengths themselves will be concordant regardless of the NGS assay or analysis tools employed. Further, these biologically based, easy-to-derive designations uphold clear relationships between parent alleles and their stutter products, enabling analysis in fully continuous probabilistic programs that model stutter while avoiding the algorithmic complexities that come with string based searches. Though using repeat unit plus LUS length as the allele designator does not capture variation that occurs outside of the core repeat regions, this straightforward approach would permit the large majority of known STR sequence variation to be used for mixture deconvolution and, in turn, result in more informative mixture statistics in the near term. Ultimately, the method could bridge the gap from current length-based probabilistic systems to facilitate broader adoption of NGS by forensic DNA testing laboratories. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Review and International Recommendation of Methods for Typing Neisseria gonorrhoeae Isolates and Their Implications for Improved Knowledge of Gonococcal Epidemiology, Treatment, and Biology

PubMed Central

Unemo, Magnus; Dillon, Jo-Anne R.

2011-01-01

Summary: Gonorrhea, which may become untreatable due to multiple resistance to available antibiotics, remains a public health problem worldwide. Precise methods for typing Neisseria gonorrhoeae, together with epidemiological information, are crucial for an enhanced understanding regarding issues involving epidemiology, test of cure and contact tracing, identifying core groups and risk behaviors, and recommending effective antimicrobial treatment, control, and preventive measures. This review evaluates methods for typing N. gonorrhoeae isolates and recommends various methods for different situations. Phenotypic typing methods, as well as some now-outdated DNA-based methods, have limited usefulness in differentiating between strains of N. gonorrhoeae. Genotypic methods based on DNA sequencing are preferred, and the selection of the appropriate genotypic method should be guided by its performance characteristics and whether short-term epidemiology (microepidemiology) or long-term and/or global epidemiology (macroepidemiology) matters are being investigated. Currently, for microepidemiological questions, the best methods for fast, objective, portable, highly discriminatory, reproducible, typeable, and high-throughput characterization are N. gonorrhoeae multiantigen sequence typing (NG-MAST) or full- or extended-length porB gene sequencing. However, pulsed-field gel electrophoresis (PFGE) and Opa typing can be valuable in specific situations, i.e., extreme microepidemiology, despite their limitations. For macroepidemiological studies and phylogenetic studies, DNA sequencing of chromosomal housekeeping genes, such as multilocus sequence typing (MLST), provides a more nuanced understanding. PMID:21734242
Enantiospecific recognition of DNA sequences by a proflavine Tröger base.

PubMed

Bailly, C; Laine, W; Demeunynck, M; Lhomme, J

2000-07-05

The DNA interaction of a chiral Tröger base derived from proflavine was investigated by DNA melting temperature measurements and complementary biochemical assays. DNase I footprinting experiments demonstrate that the binding of the proflavine-based Tröger base is both enantio- and sequence-specific. The (+)-isomer poorly interacts with DNA in a non-sequence-selective fashion. In sharp contrast, the corresponding (-)-isomer recognizes preferentially certain DNA sequences containing both A. T and G. C base pairs, such as the motifs 5'-GTT. AAC and 5'-ATGA. TCAT. This is the first experimental demonstration that acridine-type Tröger bases can be used for enantiospecific recognition of DNA sequences. Copyright 2000 Academic Press.
[Sequence-based typing of enviromental Legionella pneumophila isolates in Guangzhou].

PubMed

Zhang, Ying; Qu, Pinghua; Zhang, Jian; Chen, Shouyi

2011-03-01

To characterize the genes of Legionella pneumophila isolated from different water source in Guangzhou from 2006 to 2009. To genotype the strains by using sequence-based typing (SBT) scheme. In total 44 L. pneumophila strains were identified by SBT with 7 diversifying genes of flaA, asd, mip, pilE, mompS, proA and neuA. Analysis of the amplicons sequence was taken in the European Working Group for Legionella Infections (EWGLI) international SBT database to obtain the allelic profiles and sequence types (STs). Serogroups were typed by latex agglutination test. Data from SBT revealed a high diversity among the strains and ST01 accounts for 30% (13/ 44). Fifteen new STs were discovered from 20 STs and 2 of them were newly assigned (ST887 and ST888) by EWGLI. SBT Phylogenetic tree was generated by SplitsTree and BURST programs. High diversity and specificity were observed of the L. pneumophila strains in Guangzhou. SBT is useful for L. pneumophila genomic study and epidemiological surveillance.

An improved and validated RNA HLA class I SBT approach for obtaining full length coding sequences.

PubMed

Gerritsen, K E H; Olieslagers, T I; Groeneweg, M; Voorter, C E M; Tilanus, M G J

2014-11-01

The functional relevance of human leukocyte antigen (HLA) class I allele polymorphism beyond exons 2 and 3 is difficult to address because more than 70% of the HLA class I alleles are defined by exons 2 and 3 sequences only. For routine application on clinical samples we improved and validated the HLA sequence-based typing (SBT) approach based on RNA templates, using either a single locus-specific or two overlapping group-specific polymerase chain reaction (PCR) amplifications, with three forward and three reverse sequencing reactions for full length sequencing. Locus-specific HLA typing with RNA SBT of a reference panel, representing the major antigen groups, showed identical results compared to DNA SBT typing. Alleles encountered with unknown exons in the IMGT/HLA database and three samples, two with Null and one with a Low expressed allele, have been addressed by the group-specific RNA SBT approach to obtain full length coding sequences. This RNA SBT approach has proven its value in our routine full length definition of alleles. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
spa typing for epidemiological surveillance of Staphylococcus aureus.

PubMed

Hallin, Marie; Friedrich, Alexander W; Struelens, Marc J

2009-01-01

The spa typing method is based on sequencing of the polymorphic X region of the protein A gene (spa), present in all strains of Staphylococcus aureus. The X region is constituted of a variable number of 24-bp repeats flanked by well-conserved regions. This single-locus sequence-based typing method combines a number of technical advantages, such as rapidity, reproducibility, and portability. Moreover, due to its repeat structure, the spa locus simultaneously indexes micro- and macrovariations, enabling the use of spa typing in both local and global epidemiological studies. These studies are facilitated by the establishment of standardized spa type nomenclature and Internet shared databases.
Amplicon Sequencing of the slpH Locus Permits Culture-Independent Strain Typing of Lactobacillus helveticus in Dairy Products

PubMed Central

Moser, Aline; Wüthrich, Daniel; Bruggmann, Rémy; Eugster-Meier, Elisabeth; Meile, Leo; Irmler, Stefan

2017-01-01

The advent of massive parallel sequencing technologies has opened up possibilities for the study of the bacterial diversity of ecosystems without the need for enrichment or single strain isolation. By exploiting 78 genome data-sets from Lactobacillus helveticus strains, we found that the slpH locus that encodes a putative surface layer protein displays sufficient genetic heterogeneity to be a suitable target for strain typing. Based on high-throughput slpH gene sequencing and the detection of single-base DNA sequence variations, we established a culture-independent method to assess the biodiversity of the L. helveticus strains present in fermented dairy food. When we applied the method to study the L. helveticus strain composition in 15 natural whey cultures (NWCs) that were collected at different Gruyère, a protected designation of origin (PDO) production facilities, we detected a total of 10 sequence types (STs). In addition, we monitored the development of a three-strain mix in raclette cheese for 17 weeks. PMID:28775722
Molecular and phylogenetic characterizations of an Eimeria krijgsmanni Yakimoff & Gouseff, 1938 (Apicomplexa: Eimeriidae) mouse intestinal protozoan parasite by partial 18S ribosomal RNA gene sequence analysis.

PubMed

Takeo, Toshinori; Tanaka, Tetsuya; Matsubayashi, Makoto; Maeda, Hiroki; Kusakisako, Kodai; Matsui, Toshihiro; Mochizuki, Masami; Matsuo, Tomohide

2014-08-01

Previously, we characterized an undocumented strain of Eimeria krijgsmanni by morphological and biological features. Here, we present a detailed molecular phylogenetic analysis of this organism. Namely, 18S ribosomal RNA gene (rDNA) sequences of E. krijgsmanni were analyzed to incorporate this species into a comprehensive Eimeria phylogeny. As a result, partial 18S rDNA sequence from E. krijgsmanni was successfully determined, and two different types, Type A and Type B, that differed by 1 base pair were identified. E. krijgsmanni was originally isolated from a single oocyst, and thus the result show that the two types might have allelic sequence heterogeneity in the 18S rDNA. Based on phylogenetic analyses, the two types of E. krijgsmanni 18S rDNA formed one of two clades among murine Eimeria spp.; these Eimeria clades reflected morphological similarity among the Eimeria spp. This is the third molecular phylogenetic characterization of a murine Eimeria spp. in addition to E. falciformis and E. papillata. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
High-Resolution Melting Analysis for Rapid Detection of Sequence Type 131 Escherichia coli.

PubMed

Harrison, Lucas B; Hanson, Nancy D

2017-06-01

Escherichia coli isolates belonging to the sequence type 131 (ST131) clonal complex have been associated with the global distribution of fluoroquinolone and β-lactam resistance. Whole-genome sequencing and multilocus sequence typing identify sequence type but are expensive when evaluating large numbers of samples. This study was designed to develop a cost-effective screening tool using high-resolution melting (HRM) analysis to differentiate ST131 from non-ST131 E. coli in large sample populations in the absence of sequence analysis. The method was optimized using DNA from 12 E. coli isolates. Singleplex PCR was performed using 10 ng of DNA, Type-it HRM buffer, and multilocus sequence typing primers and was followed by multiplex PCR. The amplicon sizes ranged from 630 to 737 bp. Melt temperature peaks were determined by performing HRM analysis at 0.1°C resolution from 50 to 95°C on a Rotor-Gene Q 5-plex HRM system. Derivative melt curves were compared between sequence types and analyzed by principal component analysis. A blinded study of 191 E. coli isolates of ST131 and unknown sequence types validated this methodology. This methodology returned 99.2% specificity (124 true negatives and 1 false positive) and 100% sensitivity (66 true positives and 0 false negatives). This HRM methodology distinguishes ST131 from non-ST131 E. coli without sequence analysis. The analysis can be accomplished in about 3 h in any laboratory with an HRM-capable instrument and principal component analysis software. Therefore, this assay is a fast and cost-effective alternative to sequencing-based ST131 identification. Copyright © 2017 Harrison and Hanson.
The repeating nucleotide sequence in the repetitive mitochondrial DNA from a "low-density" petite mutant of yeast.

PubMed Central

Van Kreijl, C F; Bos, J L

1977-01-01

The repeating nucleotide sequence of 68 base pairs in the mtDNA from an ethidium-induced cytoplasmic petite mutant of yeast has been determined. For sequence analysis specifically primed and terminated RNA copies, obtained by in vitro transcription of the separated strands, were use. The sequence consists of 66 consecutive AT base pairs flanked by two GC pairs and comprises nearly all of the mutant mitochondrial genome. The sequence, moreover, also represents the first part of wild-type mtDNA sequence so far. Images PMID:198740
A novel HLA-B allele, B*5214, detected in a Taiwanese volunteer bone marrow donor using a sequence-based typing method.

PubMed

Chen, M J; Chu, C C; Shyr, M H; Lin, C L; Lin, P Y; Yang, K L

2010-02-01

HLA-B*5214, a novel rare allele of HLA-B*52 variant, was found in a Taiwanese volunteer bone marrow donor by sequence-based typing method. The sequence of B*5214 is identical to that of B*520101 in exon 2 but differs from B*520101 in exon 3 at nucleotide positions 419 A-->T and 435 A-->G. Alteration of these two nucleotides resulted an amino acid substitution at amino acid residue 116 Y-->F ( TAC-->TTC) and a silent exchange at residue 121 K-->K (AAA-->AAG).
Transcriptomic analysis of Siberian ginseng (Eleutherococcus senticosus) to discover genes involved in saponin biosynthesis.

PubMed

Hwang, Hwan-Su; Lee, Hyoshin; Choi, Yong Eui

2015-03-14

Eleutherococcus senticosus, Siberian ginseng, is a highly valued woody medicinal plant belonging to the family Araliaceae. E. senticosus produces a rich variety of saponins such as oleanane-type, noroleanane-type, 29-hydroxyoleanan-type, and lupane-type saponins. Genomic or transcriptomic approaches have not been used to investigate the saponin biosynthetic pathway in this plant. In this study, de novo sequencing was performed to select candidate genes involved in the saponin biosynthetic pathway. A half-plate 454 pyrosequencing run produced 627,923 high-quality reads with an average sequence length of 422 bases. De novo assembly generated 72,811 unique sequences, including 15,217 contigs and 57,594 singletons. Approximately 48,300 (66.3%) unique sequences were annotated using BLAST similarity searches. All of the mevalonate pathway genes for saponin biosynthesis starting from acetyl-CoA were isolated. Moreover, 206 reads of cytochrome P450 (CYP) and 145 reads of uridine diphosphate glycosyltransferase (UGT) sequences were isolated. Based on methyl jasmonate (MeJA) treatment and real-time PCR (qPCR) analysis, 3 CYPs and 3 UGTs were finally selected as candidate genes involved in the saponin biosynthetic pathway. The identified sequences associated with saponin biosynthesis will facilitate the study of the functional genomics of saponin biosynthesis and genetic engineering of E. senticosus.
Influence of laminate sequence and fabric type on the inherent acoustic nonlinearity in carbon fiber reinforced composites.

PubMed

Chakrapani, Sunil Kishore; Barnard, Daniel J; Dayal, Vinay

2016-05-01

This paper presents the study of influence of laminate sequence and fabric type on the baseline acoustic nonlinearity of fiber-reinforced composites. Nonlinear elastic wave techniques are increasingly becoming popular in detecting damage in composite materials. It was earlier observed by the authors that the non-classical nonlinear response of fiber-reinforced composite is influenced by the fiber orientation [Chakrapani, Barnard, and Dayal, J. Acoust. Soc. Am. 137(2), 617-624 (2015)]. The current study expands this effort to investigate the effect of laminate sequence and fabric type on the non-classical nonlinear response. Two hypotheses were developed using the previous results, and the theory of interlaminar stresses to investigate the influence of laminate sequence and fabric type. Each hypothesis was tested by capturing the nonlinear response by performing nonlinear resonance spectroscopy and measuring frequency shifts, loss factors, and higher harmonics. It was observed that the laminate sequence can either increase or decrease the nonlinear response based on the stacking sequence. Similarly, tests were performed to compare unidirectional fabric and woven fabric and it was observed that woven fabric exhibited a lower nonlinear response compared to the unidirectional fabric. Conjectures based on the matrix properties and interlaminar stresses were used in an attempt to explain the observed nonlinear responses for different configurations.
SeqRate: sequence-based protein folding type classification and rates prediction

PubMed Central

2010-01-01

Background Protein folding rate is an important property of a protein. Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. And most methods do not distinguish the different kinetic nature (two-state folding or multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using sequence length, amino acid composition, contact order, contact number, and secondary structure information predicted from only protein sequence with support vector machines. Results We systematically studied the contributions of individual features to folding rate prediction. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec-1) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Its performance can be further enhanced with additional information, such as structure-based geometric contacts, as inputs. Conclusions Both the web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html. PMID:20438647
Multilocus sequence typing reveals a novel subspeciation of Lactobacillus delbrueckii.

PubMed

Tanigawa, Kana; Watanabe, Koichi

2011-03-01

Currently, the species Lactobacillus delbrueckii is divided into four subspecies, L. delbrueckii subsp. delbrueckii, L. delbrueckii subsp. bulgaricus, L. delbrueckii subsp. indicus and L. delbrueckii subsp. lactis. These classifications were based mainly on phenotypic identification methods and few studies have used genotypic identification methods. As a result, these subspecies have not yet been reliably delineated. In this study, the four subspecies of L. delbrueckii were discriminated by phenotype and by genotypic identification [amplified-fragment length polymorphism (AFLP) and multilocus sequence typing (MLST)] methods. The MLST method developed here was based on the analysis of seven housekeeping genes (fusA, gyrB, hsp60, ileS, pyrG, recA and recG). The MLST method had good discriminatory ability: the 41 strains of L. delbrueckii examined were divided into 34 sequence types, with 29 sequence types represented by only a single strain. The sequence types were divided into eight groups. These groups could be discriminated as representing different subspecies. The results of the AFLP and MLST analyses were consistent. The type strain of L. delbrueckii subsp. delbrueckii, YIT 0080(T), was clearly discriminated from the other strains currently classified as members of this subspecies, which were located close to strains of L. delbrueckii subsp. lactis. The MLST scheme developed in this study should be a useful tool for the identification of strains of L. delbrueckii to the subspecies level.
Type III restriction-modification enzymes: a historical perspective.

PubMed

Rao, Desirazu N; Dryden, David T F; Bheemanaik, Shivakumara

2014-01-01

Restriction endonucleases interact with DNA at specific sites leading to cleavage of DNA. Bacterial DNA is protected from restriction endonuclease cleavage by modifying the DNA using a DNA methyltransferase. Based on their molecular structure, sequence recognition, cleavage position and cofactor requirements, restriction-modification (R-M) systems are classified into four groups. Type III R-M enzymes need to interact with two separate unmethylated DNA sequences in inversely repeated head-to-head orientations for efficient cleavage to occur at a defined location (25-27 bp downstream of one of the recognition sites). Like the Type I R-M enzymes, Type III R-M enzymes possess a sequence-specific ATPase activity for DNA cleavage. ATP hydrolysis is required for the long-distance communication between the sites before cleavage. Different models, based on 1D diffusion and/or 3D-DNA looping, exist to explain how the long-distance interaction between the two recognition sites takes place. Type III R-M systems are found in most sequenced bacteria. Genome sequencing of many pathogenic bacteria also shows the presence of a number of phase-variable Type III R-M systems, which play a role in virulence. A growing number of these enzymes are being subjected to biochemical and genetic studies, which, when combined with ongoing structural analyses, promise to provide details for mechanisms of DNA recognition and catalysis.
The Salmonella In Silico Typing Resource (SISTR): An Open Web-Accessible Tool for Rapidly Typing and Subtyping Draft Salmonella Genome Assemblies.

PubMed

Yoshida, Catherine E; Kruczkiewicz, Peter; Laing, Chad R; Lingohr, Erika J; Gannon, Victor P J; Nash, John H E; Taboada, Eduardo N

2016-01-01

For nearly 100 years serotyping has been the gold standard for the identification of Salmonella serovars. Despite the increasing adoption of DNA-based subtyping approaches, serotype information remains a cornerstone in food safety and public health activities aimed at reducing the burden of salmonellosis. At the same time, recent advances in whole-genome sequencing (WGS) promise to revolutionize our ability to perform advanced pathogen characterization in support of improved source attribution and outbreak analysis. We present the Salmonella In Silico Typing Resource (SISTR), a bioinformatics platform for rapidly performing simultaneous in silico analyses for several leading subtyping methods on draft Salmonella genome assemblies. In addition to performing serovar prediction by genoserotyping, this resource integrates sequence-based typing analyses for: Multi-Locus Sequence Typing (MLST), ribosomal MLST (rMLST), and core genome MLST (cgMLST). We show how phylogenetic context from cgMLST analysis can supplement the genoserotyping analysis and increase the accuracy of in silico serovar prediction to over 94.6% on a dataset comprised of 4,188 finished genomes and WGS draft assemblies. In addition to allowing analysis of user-uploaded whole-genome assemblies, the SISTR platform incorporates a database comprising over 4,000 publicly available genomes, allowing users to place their isolates in a broader phylogenetic and epidemiological context. The resource incorporates several metadata driven visualizations to examine the phylogenetic, geospatial and temporal distribution of genome-sequenced isolates. As sequencing of Salmonella isolates at public health laboratories around the world becomes increasingly common, rapid in silico analysis of minimally processed draft genome assemblies provides a powerful approach for molecular epidemiology in support of public health investigations. Moreover, this type of integrated analysis using multiple sequence-based methods of sub-typing allows for continuity with historical serotyping data as we transition towards the increasing adoption of genomic analyses in epidemiology. The SISTR platform is freely available on the web at https://lfz.corefacility.ca/sistr-app/.
Proposals for the classification of human rhinovirus species A, B and C into genotypically assigned types

PubMed Central

McIntyre, Chloe L.; Knowles, Nick J.

2013-01-01

Human rhinoviruses (HRVs) frequently cause mild upper respiratory tract infections and more severe disease manifestations such as bronchiolitis and asthma exacerbations. HRV is classified into three species within the genus Enterovirus of the family Picornaviridae. HRV species A and B contain 75 and 25 serotypes identified by cross-neutralization assays, although the use of such assays for routine HRV typing is hampered by the large number of serotypes, replacement of virus isolation by molecular methods in HRV diagnosis and the poor or absent replication of HRV species C in cell culture. To address these problems, we propose an alternative, genotypic classification of HRV-based genetic relatedness analogous to that used for enteroviruses. Nucleotide distances between 384 complete VP1 sequences of currently assigned HRV (sero)types identified divergence thresholds of 13, 12 and 13 % for species A, B and C, respectively, that divided inter- and intra-type comparisons. These were paralleled by 10, 9.5 and 10 % thresholds in the larger dataset of >3800 VP4 region sequences. Assignments based on VP1 sequences led to minor revisions of existing type designations (such as the reclassification of serotype pairs, e.g. A8/A95 and A29/A44, as single serotypes) and the designation of new HRV types A101–106, B101–103 and C34–C51. A protocol for assignment and numbering of new HRV types using VP1 sequences and the restriction of VP4 sequence comparisons to type identification and provisional type assignments is proposed. Genotypic assignment and identification of HRV types will be of considerable value in the future investigation of type-associated differences in disease outcomes, transmission and epidemiology. PMID:23677786
Comparative performance of high-density oligonucleotide sequencing and dideoxynucleotide sequencing of HIV type 1 pol from clinical samples.

PubMed

Günthard, H F; Wong, J K; Ignacio, C C; Havlir, D V; Richman, D D

1998-07-01

The performance of the high-density oligonucleotide array methodology (GeneChip) in detecting drug resistance mutations in HIV-1 pol was compared with that of automated dideoxynucleotide sequencing (ABI) of clinical samples, viral stocks, and plasmid-derived NL4-3 clones. Sequences from 29 clinical samples (plasma RNA, n = 17; lymph node RNA, n = 5; lymph node DNA, n = 7) from 12 patients, from 6 viral stock RNA samples, and from 13 NL4-3 clones were generated by both methods. Editing was done independently by a different investigator for each method before comparing the sequences. In addition, NL4-3 wild type (WT) and mutants were mixed in varying concentrations and sequenced by both methods. Overall, a concordance of 99.1% was found for a total of 30,865 bases compared. The comparison of clinical samples (plasma RNA and lymph node RNA and DNA) showed a slightly lower match of base calls, 98.8% for 19,831 nucleotides compared (protease region, 99.5%, n = 8272; RT region, 98.3%, n = 11,316), than for viral stocks and NL4-3 clones (protease region, 99.8%; RT region, 99.5%). Artificial mixing experiments showed a bias toward calling wild-type bases by GeneChip. Discordant base calls are most likely due to differential detection of mixtures. The concordance between GeneChip and ABI was high and appeared dependent on the nature of the templates (directly amplified versus cloned) and the complexity of mixes.
NGSCheckMate: software for validating sample identity in next-generation sequencing studies within and across data types.

PubMed

Lee, Sejoon; Lee, Soohyun; Ouellette, Scott; Park, Woong-Yang; Lee, Eunjung A; Park, Peter J

2017-06-20

In many next-generation sequencing (NGS) studies, multiple samples or data types are profiled for each individual. An important quality control (QC) step in these studies is to ensure that datasets from the same subject are properly paired. Given the heterogeneity of data types, file types and sequencing depths in a multi-dimensional study, a robust program that provides a standardized metric for genotype comparisons would be useful. Here, we describe NGSCheckMate, a user-friendly software package for verifying sample identities from FASTQ, BAM or VCF files. This tool uses a model-based method to compare allele read fractions at known single-nucleotide polymorphisms, considering depth-dependent behavior of similarity metrics for identical and unrelated samples. Our evaluation shows that NGSCheckMate is effective for a variety of data types, including exome sequencing, whole-genome sequencing, RNA-seq, ChIP-seq, targeted sequencing and single-cell whole-genome sequencing, with a minimal requirement for sequencing depth (>0.5X). An alignment-free module can be run directly on FASTQ files for a quick initial check. We recommend using this software as a QC step in NGS studies. https://github.com/parklab/NGSCheckMate. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Multi-Virulence-Locus Sequence Typing of Staphylococcus lugdunensis Generates Results Consistent with a Clonal Population Structure and Is Reliable for Epidemiological Typing

PubMed Central

Didi, Jennifer; Lemée, Ludovic; Gibert, Laure; Pons, Jean-Louis

2014-01-01

Staphylococcus lugdunensis is an emergent virulent coagulase-negative staphylococcus responsible for severe infections similar to those caused by Staphylococcus aureus. To understand its potentially pathogenic capacity and have further detailed knowledge of the molecular traits of this organism, 93 isolates from various geographic origins were analyzed by multi-virulence-locus sequence typing (MVLST), targeting seven known or putative virulence-associated loci (atlLR2, atlLR3, hlb, isdJ, SLUG_09050, SLUG_16930, and vwbl). The polymorphisms of the putative virulence-associated loci were moderate and comparable to those of the housekeeping genes analyzed by multilocus sequence typing (MLST). However, the MVLST scheme generated 43 virulence types (VTs) compared to 20 sequence types (STs) based on MLST, indicating that MVLST was significantly more discriminating (Simpson's index [D], 0.943). No hypervirulent lineage or cluster specific to carriage strains was defined. The results of multilocus sequence analysis of known and putative virulence-associated loci are consistent with a clonal population structure for S. lugdunensis, suggesting a coevolution of these genes with housekeeping genes. Indeed, the nonsynonymous to synonymous evolutionary substitutions (dN/dS) ratio, the Tajima's D test, and Single-likelihood ancestor counting (SLAC) analysis suggest that all virulence-associated loci were under negative selection, even atlLR2 (AtlL protein) and SLUG_16930 (FbpA homologue), for which the dN/dS ratios were higher. In addition, this analysis of virulence-associated loci allowed us to propose a trilocus sequence typing scheme based on the intragenic regions of atlLR3, isdJ, and SLUG_16930, which is more discriminant than MLST for studying short-term epidemiology and further characterizing the lineages of the rare but highly pathogenic S. lugdunensis. PMID:25078912
The multilocus sequence typing network: mlst.net.

PubMed

Aanensen, David M; Spratt, Brian G

2005-07-01

The unambiguous characterization of strains of a pathogen is crucial for addressing questions relating to its epidemiology, population and evolutionary biology. Multilocus sequence typing (MLST), which defines strains from the sequences at seven house-keeping loci, has become the method of choice for molecular typing of many bacterial and fungal pathogens (and non-pathogens), and MLST schemes and strain databases are available for a growing number of prokaryotic and eukaryotic organisms. Sequence data are ideal for strain characterization as they are unambiguous, meaning strains can readily be compared between laboratories via the Internet. Laboratories undertaking MLST can quickly progress from sequencing the seven gene fragments to characterizing their strains and relating them to those submitted by others and to the population as a whole. We provide the gateway to a number of MLST schemes, each of which contain a set of tools for the initial characterization of strains, and methods for relating query strains to other strains of the species, including clustering based on differences in allelic profiles, phylogenetic trees based on concatenated sequences, and a recently developed method (eBURST) for identifying clonal complexes within a species and displaying the overall structure of the population. This network of MLST websites is available at http://www.mlst.net.
Structural and sequence features of two residue turns in beta-hairpins.

PubMed

Madan, Bharat; Seo, Sung Yong; Lee, Sun-Gu

2014-09-01

Beta-turns in beta-hairpins have been implicated as important sites in protein folding. In particular, two residue β-turns, the most abundant connecting elements in beta-hairpins, have been a major target for engineering protein stability and folding. In this study, we attempted to investigate and update the structural and sequence properties of two residue turns in beta-hairpins with a large data set. For this, 3977 beta-turns were extracted from 2394 nonhomologous protein chains and analyzed. First, the distribution, dihedral angles and twists of two residue turn types were determined, and compared with previous data. The trend of turn type occurrence and most structural features of the turn types were similar to previous results, but for the first time Type II turns in beta-hairpins were identified. Second, sequence motifs for the turn types were devised based on amino acid positional potentials of two-residue turns, and their distributions were examined. From this study, we could identify code-like sequence motifs for the two residue beta-turn types. Finally, structural and sequence properties of beta-strands in the beta-hairpins were analyzed, which revealed that the beta-strands showed no specific sequence and structural patterns for turn types. The analytical results in this study are expected to be a reference in the engineering or design of beta-hairpin turn structures and sequences. © 2014 Wiley Periodicals, Inc.
Neisseria gonorrhoeae Sequence Typing for Antimicrobial Resistance, a Novel Antimicrobial Resistance Multilocus Typing Scheme for Tracking Global Dissemination of N. gonorrhoeae Strains.

PubMed

Demczuk, W; Sidhu, S; Unemo, M; Whiley, D M; Allen, V G; Dillon, J R; Cole, M; Seah, C; Trembizki, E; Trees, D L; Kersh, E N; Abrams, A J; de Vries, H J C; van Dam, A P; Medina, I; Bharat, A; Mulvey, M R; Van Domselaar, G; Martin, I

2017-05-01

A curated Web-based user-friendly sequence typing tool based on antimicrobial resistance determinants in Neisseria gonorrhoeae was developed and is publicly accessible (https://ngstar.canada.ca). The N. gonorrhoeae Sequence Typing for Antimicrobial Resistance (NG-STAR) molecular typing scheme uses the DNA sequences of 7 genes ( penA , mtrR , porB , ponA , gyrA , parC , and 23S rRNA) associated with resistance to β-lactam antimicrobials, macrolides, or fluoroquinolones. NG-STAR uses the entire penA sequence, combining the historical nomenclature for penA types I to XXXVIII with novel nucleotide sequence designations; the full mtrR sequence and a portion of its promoter region; portions of ponA , porB , gyrA , and parC ; and 23S rRNA sequences. NG-STAR grouped 768 isolates into 139 sequence types (STs) ( n = 660) consisting of 29 clonal complexes (CCs) having a maximum of a single-locus variation, and 76 NG-STAR STs ( n = 109) were identified as unrelated singletons. NG-STAR had a high Simpson's diversity index value of 96.5% (95% confidence interval [CI] = 0.959 to 0.969). The most common STs were NG-STAR ST-90 ( n = 100; 13.0%), ST-42 and ST-91 ( n = 45; 5.9%), ST-64 ( n = 44; 5.72%), and ST-139 ( n = 42; 5.5%). Decreased susceptibility to azithromycin was associated with NG-STAR ST-58, ST-61, ST-64, ST-79, ST-91, and ST-139 ( n = 156; 92.3%); decreased susceptibility to cephalosporins was associated with NG-STAR ST-90, ST-91, and ST-97 ( n = 162; 94.2%); and ciprofloxacin resistance was associated with NG-STAR ST-26, ST-90, ST-91, ST-97, ST-150, and ST-158 ( n = 196; 98.0%). All isolates of NG-STAR ST-42, ST-43, ST-63, ST-81, and ST-160 ( n = 106) were susceptible to all four antimicrobials. The standardization of nomenclature associated with antimicrobial resistance determinants through an internationally available database will facilitate the monitoring of the global dissemination of antimicrobial-resistant N. gonorrhoeae strains. © Crown copyright 2017.

BrucellaBase: Genome information resource.

PubMed

Sankarasubramanian, Jagadesan; Vishnu, Udayakumar S; Khader, L K M Abdul; Sridhar, Jayavel; Gunasekaran, Paramasamy; Rajendhran, Jeyaprakash

2016-09-01

Brucella sp. causes a major zoonotic disease, brucellosis. Brucella belongs to the family Brucellaceae under the order Rhizobiales of Alphaproteobacteria. We present BrucellaBase, a web-based platform, providing features of a genome database together with unique analysis tools. We have developed a web version of the multilocus sequence typing (MLST) (Whatmore et al., 2007) and phylogenetic analysis of Brucella spp. BrucellaBase currently contains genome data of 510 Brucella strains along with the user interfaces for BLAST, VFDB, CARD, pairwise genome alignment and MLST typing. Availability of these tools will enable the researchers interested in Brucella to get meaningful information from Brucella genome sequences. BrucellaBase will regularly be updated with new genome sequences, new features along with improvements in genome annotations. BrucellaBase is available online at http://www.dbtbrucellosis.in/brucellabase.html or http://59.99.226.203/brucellabase/homepage.html. Copyright © 2016 Elsevier B.V. All rights reserved.
The spa typing of methicillin-resistant Staphylococcus aureus isolates by High Resolution Melting (HRM) analysis.

PubMed

Fasihi, Yasser; Fooladi, Saba; Mohammadi, Mohammad Ali; Emaneini, Mohammad; Kalantar-Neyestanaki, Davood

2017-09-06

Molecular typing is an important tool for control and prevention of infection. A suitable molecular typing method for epidemiological investigation must be easy to perform, highly reproducible, inexpensive, rapid and easy to interpret. In this study, two molecular typing methods including the conventional PCR-sequencing method and high resolution melting (HRM) analysis were used for staphylococcal protein A (spa) typing of 30 Methicillin-resistant Staphylococcus aureus (MRSA) isolates recovered from clinical samples. Based on PCR-sequencing method results, 16 different spa types were identified among the 30 MRSA isolates. Among the 16 different spa types, 14 spa types separated by HRM method. Two spa types including t4718 and t2894 were not separated from each other. According to our results, spa typing based on HRM analysis method is very rapid, easy to perform and cost-effective, but this method must be standardized for different regions, spa types, and real-time machinery.
The complete genome sequence of human adenovirus 84, a highly recombinant new Human mastadenovirus D type with a unique fiber gene.

PubMed

Kaján, Győző L; Kajon, Adriana E; Pinto, Alexis Castillo; Bartha, Dániel; Arnberg, Niklas

2017-10-15

A novel human adenovirus was isolated from a pediatric case of acute respiratory disease in Panama City, Panama in 2011. The clinical isolate was initially identified as an intertypic recombinant based on hexon and fiber gene sequencing. Based on the analysis of its complete genome sequence, the novel complex recombinant Human mastadenovirus D (HAdV-D) strain was classified into a new HAdV type: HAdV-84, and it was designated Adenovirus D human/PAN/P309886/2011/84[P43H17F84]. HAdV-D types possess usually an ocular or gastrointestinal tropism, and respiratory association is scarcely reported. The virus has a novel fiber type, most closely related to, but still clearly distant from that of HAdV-36. The predicted fiber is hypothesised to bind sialic acid with lower affinity compared to HAdV-37. Bioinformatic analysis of the complete genomic sequence of HAdV-84 revealed multiple homologous recombination events and provided deeper insight into HAdV evolution. Copyright © 2017 Elsevier B.V. All rights reserved.
Are Escherichia coli Pathotypes Still Relevant in the Era of Whole-Genome Sequencing?

PubMed Central

Robins-Browne, Roy M.; Holt, Kathryn E.; Ingle, Danielle J.; Hocking, Dianna M.; Yang, Ji; Tauschek, Marija

2016-01-01

The empirical and pragmatic nature of diagnostic microbiology has given rise to several different schemes to subtype E.coli, including biotyping, serotyping, and pathotyping. These schemes have proved invaluable in identifying and tracking outbreaks, and for prognostication in individual cases of infection, but they are imprecise and potentially misleading due to the malleability and continuous evolution of E. coli. Whole genome sequencing can be used to accurately determine E. coli subtypes that are based on allelic variation or differences in gene content, such as serotyping and pathotyping. Whole genome sequencing also provides information about single nucleotide polymorphisms in the core genome of E. coli, which form the basis of sequence typing, and is more reliable than other systems for tracking the evolution and spread of individual strains. A typing scheme for E. coli based on genome sequences that includes elements of both the core and accessory genomes, should reduce typing anomalies and promote understanding of how different varieties of E. coli spread and cause disease. Such a scheme could also define pathotypes more precisely than current methods. PMID:27917373
Are Escherichia coli Pathotypes Still Relevant in the Era of Whole-Genome Sequencing?

PubMed

Robins-Browne, Roy M; Holt, Kathryn E; Ingle, Danielle J; Hocking, Dianna M; Yang, Ji; Tauschek, Marija

2016-01-01

The empirical and pragmatic nature of diagnostic microbiology has given rise to several different schemes to subtype E .coli, including biotyping, serotyping, and pathotyping. These schemes have proved invaluable in identifying and tracking outbreaks, and for prognostication in individual cases of infection, but they are imprecise and potentially misleading due to the malleability and continuous evolution of E. coli . Whole genome sequencing can be used to accurately determine E. coli subtypes that are based on allelic variation or differences in gene content, such as serotyping and pathotyping. Whole genome sequencing also provides information about single nucleotide polymorphisms in the core genome of E. coli , which form the basis of sequence typing, and is more reliable than other systems for tracking the evolution and spread of individual strains. A typing scheme for E. coli based on genome sequences that includes elements of both the core and accessory genomes, should reduce typing anomalies and promote understanding of how different varieties of E. coli spread and cause disease. Such a scheme could also define pathotypes more precisely than current methods.
Core genome conservation of Staphylococcus haemolyticus limits sequence based population structure analysis.

PubMed

Cavanagh, Jorunn Pauline; Klingenberg, Claus; Hanssen, Anne-Merethe; Fredheim, Elizabeth Aarag; Francois, Patrice; Schrenzel, Jacques; Flægstad, Trond; Sollid, Johanna Ericson

2012-06-01

The notoriously multi-resistant Staphylococcus haemolyticus is an emerging pathogen causing serious infections in immunocompromised patients. Defining the population structure is important to detect outbreaks and spread of antimicrobial resistant clones. Currently, the standard typing technique is pulsed-field gel electrophoresis (PFGE). In this study we describe novel molecular typing schemes for S. haemolyticus using multi locus sequence typing (MLST) and multi locus variable number of tandem repeats (VNTR) analysis. Seven housekeeping genes (MLST) and five VNTR loci (MLVF) were selected for the novel typing schemes. A panel of 45 human and veterinary S. haemolyticus isolates was investigated. The collection had diverse PFGE patterns (38 PFGE types) and was sampled over a 20 year-period from eight countries. MLST resolved 17 sequence types (Simpsons index of diversity [SID]=0.877) and MLVF resolved 14 repeat types (SID=0.831). We found a low sequence diversity. Phylogenetic analysis clustered the isolates in three (MLST) and one (MLVF) clonal complexes, respectively. Taken together, neither the MLST nor the MLVF scheme was suitable to resolve the population structure of this S. haemolyticus collection. Future MLVF and MLST schemes will benefit from addition of more variable core genome sequences identified by comparing different fully sequenced S. haemolyticus genomes. Copyright © 2012 Elsevier B.V. All rights reserved.
Detection of cystic fibrosis mutations in a GeneChip{trademark} assay format

DOE Office of Scientific and Technical Information (OSTI.GOV)

Miyada, C.G.; Cronin, M.T.; Kim, S.M.

1994-09-01

We are developing assays for the detection of cystic fibrosis mutations based on DNA hybridization. A DNA sample is amplified by PCR, labeled by incorporating a fluorescein-tagged dNTP, enzymatically treated to produce smaller fragments and hybridized to a series of short (13-16 bases) oligonucleotides synthesized on a glass surface via photolithography. The hybrids are detected by eqifluorescence and mutations are identified by the specific pattern of hybridization. In a GeneChip assay, the chip surface is composed of a series of subarrays, each being specific for a particular mutation. Each subarray is further subdivided into a series of probes (40 total),more » half based on the mutant sequence and the remainder based on the wild-type sequence. For each of the subarrays, there is a redundancy in the number of probes that should hybridize to either a wild-type or a mutant target. The multiple probe strategy provides sequence information for a short five base region overlapping the mutation site. In addition, homozygous wild-type and mutant as well as heterozygous samples are each identified by a specific pattern of hybridization. The small size of each probe feature (250 x 250 {mu}m{sup 2}) permits the inclusion of additional probes required to generate sequence information by hybridization.« less
Massively parallel pyrosequencing of the mitochondrial genome with the 454 methodology in forensic genetics.

PubMed

Mikkelsen, Martin; Frank-Hansen, Rune; Hansen, Anders J; Morling, Niels

2014-09-01

of sequencing of whole mitochondrial genome, HV1 and HV2 DNA with the second generation system (SGS) Roche 454 GS Junior were compared with results of Sanger sequencing and SNP typing with SNaPshot single base extension detected with MALDI-TOF and capillary electrophoresis. We investigated the performance of the software analysis of the data, reproducibility, ability to sequence homopolymeric regions, detection of mixtures and heteroplasmy as well as the implications of the depth of coverage. We found full reproducibility between samples sequenced twice with SGS. We found close to full concordance between the mtDNA sequences of 26 samples obtained with (1) the 454 SGS method using a depth of coverage above 100 and (2) Sanger sequencing and SNP typing. The discrepancies were primarily observed in homopolymeric regions. The 454 SGS method was able to sequence 95% of the reads correctly in homopolymers up to 4 bases, and up to 6 bases could be sequenced with similar success if the results were carefully, visually inspected. The 454 technology was able to detect mixtures or heteroplasmy of approximately 10%. We detected previously unreported heteroplasmy in the GM9947A component of the NIST human mitochondrial DNA SRM-2392 standard reference material. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Diversity of 16S rRNA genes of new Ehrlichia strains isolated from horses with clinical signs of Potomac horse fever.

PubMed

Wen, B; Rikihisa, Y; Fuerst, P A; Chaichanasiriwithaya, W

1995-04-01

Ehrlichia risticii is the causative agent of Potomac horse fever. Variations among the major antigens of different local E. risticii strains have been detected previously. To further assess genetic variability in this species or species complex, the sequences of the 16S rRNA genes of several isolates obtained from sick horses diagnosed as having Potomac horse fever were determined. The sequences of six isolates obtained from Ohio and three isolates obtained from Kentucky were amplified by PCR. Three groups of sequences were identified. The sequences of five of the Ohio isolates were identical to the sequence of the type strain of E. risticii, the Illinois strain. The sequence of one Ohio isolate, isolate 081, was unique; this sequence differed in 10 nucleotides from the sequence of the type strain (level of similarity, 99.3%). The sequences of the three Kentucky isolates were identical to each other, but differed by five bases from the sequence of the type strain (level of similarity, 99.6%). The levels of sequence similarity of isolate 081, the Kentucky isolates, and the type strain to the next most closely related Ehrlichia sp., Ehrlichia sennetsu, were 99.3, 99.2, and 99.2%, respectively. On the basis of the distinct antigenic profiles and the levels of 16S rRNA sequence divergence, isolate 081 is as divergent from the type strain of E. risticii as E. sennetsu is. Therefore, we suggest that strain 081 and the Kentucky isolates may represent two new distinct Ehrlichia species.
The evolution and population structure of Lactobacillus fermentum from different naturally fermented products as determined by multilocus sequence typing (MLST).

PubMed

Dan, Tong; Liu, Wenjun; Song, Yuqin; Xu, Haiyan; Menghe, Bilige; Zhang, Heping; Sun, Zhihong

2015-05-20

Lactobacillus fermentum is economically important in the production and preservation of fermented foods. A repeatable and discriminative typing method was devised to characterize L. fermentum at the molecular level. The multilocus sequence typing (MLST) scheme developed was based on analysis of the internal sequence of 11 housekeeping gene fragments (clpX, dnaA, dnaK, groEL, murC, murE, pepX, pyrG, recA, rpoB, and uvrC). MLST analysis of 203 isolates of L. fermentum from Mongolia and seven provinces/ autonomous regions in China identified 57 sequence types (ST), 27 of which were represented by only a single isolate, indicating high genetic diversity. Phylogenetic analyses based on the sequence of the 11 housekeeping gene fragments indicated that the L. fermentum isolates analyzed belonged to two major groups. A standardized index of association (I A (S)) indicated a weak clonal population structure in L. fermentum. Split decomposition analysis indicated that recombination played an important role in generating the genetic diversity observed in L. fermentum. The results from the minimum spanning tree strongly suggested that evolution of L. fermentum STs was not correlated with geography or food-type. The MLST scheme developed will be valuable for further studies on the evolution and population structure of L. fermentum isolates used in food products.
Population-scale whole genome sequencing identifies 271 highly polymorphic short tandem repeats from Japanese population.

PubMed

Hirata, Satoshi; Kojima, Kaname; Misawa, Kazuharu; Gervais, Olivier; Kawai, Yosuke; Nagasaki, Masao

2018-05-01

Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We have analyzed 843,473 STR loci with two to six basepair repeat units and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci, widely used in forensic DNA typing, with capillary electrophoresis based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to either the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. With these analyses, we have also purified the suitable 218 STR loci with four basepair repeat units and 53 loci with five basepair repeat units both for short read sequencing and PCR based technologies, which would be candidates to the actual forensic DNA typing in Japanese population.
High-resolution melting genotyping of Enterococcus faecium based on multilocus sequence typing derived single nucleotide polymorphisms.

PubMed

Tong, Steven Y C; Xie, Shirley; Richardson, Leisha J; Ballard, Susan A; Dakh, Farshid; Grabsch, Elizabeth A; Grayson, M Lindsay; Howden, Benjamin P; Johnson, Paul D R; Giffard, Philip M

2011-01-01

We have developed a single nucleotide polymorphism (SNP) nucleated high-resolution melting (HRM) technique to genotype Enterococcus faecium. Eight SNPs were derived from the E. faecium multilocus sequence typing (MLST) database and amplified fragments containing these SNPs were interrogated by HRM. We tested the HRM genotyping scheme on 85 E. faecium bloodstream isolates and compared the results with MLST, pulsed-field gel electrophoresis (PFGE) and an allele specific real-time PCR (AS kinetic PCR) SNP typing method. In silico analysis based on predicted HRM curves according to the G+C content of each fragment for all 567 sequence types (STs) in the MLST database together with empiric data from the 85 isolates demonstrated that HRM analysis resolves E. faecium into 231 "melting types" (MelTs) and provides a Simpson's Index of Diversity (D) of 0.991 with respect to MLST. This is a significant improvement on the AS kinetic PCR SNP typing scheme that resolves 61 SNP types with D of 0.95. The MelTs were concordant with the known ST of the isolates. For the 85 isolates, there were 13 PFGE patterns, 17 STs, 14 MelTs and eight SNP types. There was excellent concordance between PFGE, MLST and MelTs with Adjusted Rand Indices of PFGE to MelT 0.936 and ST to MelT 0.973. In conclusion, this HRM based method appears rapid and reproducible. The results are concordant with MLST and the MLST based population structure.
Survey of local and global biological network alignment: the need to reconcile the two sides of the same coin.

PubMed

Guzzi, Pietro Hiram; Milenkovic, Tijana

2018-05-01

Analogous to genomic sequence alignment that allows for across-species transfer of biological knowledge between conserved sequence regions, biological network alignment can be used to guide the knowledge transfer between conserved regions of molecular networks of different species. Hence, biological network alignment can be used to redefine the traditional notion of a sequence-based homology to a new notion of network-based homology. Analogous to genomic sequence alignment, there exist local and global biological network alignments. Here, we survey prominent and recent computational approaches of each network alignment type and discuss their (dis)advantages. Then, as it was recently shown that the two approach types are complementary, in the sense that they capture different slices of cellular functioning, we discuss the need to reconcile the two network alignment types and present a recent first step in this direction. We conclude with some open research problems on this topic and comment on the usefulness of network alignment in other domains besides computational biology.
Ribosomal subunit protein typing using matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) for the identification and discrimination of Aspergillus species.

PubMed

Nakamura, Sayaka; Sato, Hiroaki; Tanaka, Reiko; Kusuya, Yoko; Takahashi, Hiroki; Yaguchi, Takashi

2017-04-26

Accurate identification of Aspergillus species is a very important subject. Mass spectral fingerprinting using matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) is generally employed for the rapid identification of fungal isolates. However, the results are based on simple mass spectral pattern-matching, with no peak assignment and no taxonomic input. We propose here a ribosomal subunit protein (RSP) typing technique using MALDI-TOF MS for the identification and discrimination of Aspergillus species. The results are concluded to be phylogenetic in that they reflect the molecular evolution of housekeeping RSPs. The amino acid sequences of RSPs of genome-sequenced strains of Aspergillus species were first verified and compared to compile a reliable biomarker list for the identification of Aspergillus species. In this process, we revealed that many amino acid sequences of RSPs (about 10-60%, depending on strain) registered in the public protein databases needed to be corrected or newly added. The verified RSPs were allocated to RSP types based on their mass. Peak assignments of RSPs of each sample strain as observed by MALDI-TOF MS were then performed to set RSP type profiles, which were then further processed by means of cluster analysis. The resulting dendrogram based on RSP types showed a relatively good concordance with the tree based on β-tubulin gene sequences. RSP typing was able to further discriminate the strains belonging to Aspergillus section Fumigati. The RSP typing method could be applied to identify Aspergillus species, even for species within section Fumigati. The discrimination power of RSP typing appears to be comparable to conventional β-tubulin gene analysis. This method would therefore be suitable for species identification and discrimination at the strain to species level. Because RSP typing can characterize the strains within section Fumigati, this method has potential as a powerful and reliable tool in the field of clinical microbiology.
Routine HLA-B genotyping with PCR-sequence-specific oligonucleotides detects a B*52 variant (B*5206).

PubMed

Hoelsch, K; Lenggeler, I; Pfannes, W; Knabe, H; Klein, H-G; Woelpl, A

2005-05-01

A new human leukocyte antigen (HLA)-B allele was found during routine typing of samples for a German unrelated bone marrow donor registry, the "Aktion Knochenmarkspende Bayern". After first interpretation of data of two independent low-resolution sequence-specific oligonucleotide typing tests, a B*51 variant was suggested. Further analysis via sequence-based typing identified the sequence as new B*52 allele. This new allele officially assigned as B*5206 differs from HLA-B*520102 by one nucleotide exchange in exon 2. The mutation is located at nucleotide position 274, at which a cytosine is substituted by a thymine leading to an amino acid change at protein position 67 from serine (TCC) to phenylalanine (TTC).
Using information content and base frequencies to distinguish mutations from genetic polymorphisms in splice junction recognition sites.

PubMed

Rogan, P K; Schneider, T D

1995-01-01

Predicting the effects of nucleotide substitutions in human splice sites has been based on analysis of consensus sequences. We used a graphic representation of sequence conservation and base frequency, the sequence logo, to demonstrate that a change in a splice acceptor of hMSH2 (a gene associated with familial nonpolyposis colon cancer) probably does not reduce splicing efficiency. This confirms a population genetic study that suggested that this substitution is a genetic polymorphism. The information theory-based sequence logo is quantitative and more sensitive than the corresponding splice acceptor consensus sequence for detection of true mutations. Information analysis may potentially be used to distinguish polymorphisms from mutations in other types of transcriptional, translational, or protein-coding motifs.
Development of chemiluminescent probe hybridization, RT-PCR and nucleic acid cycle sequencing assays of Sabin type 3 isolates to identify base pair 472 Sabin type 3 mutants associated with vaccine associated paralytic poliomyelitis.

PubMed

Old, M O; Logan, L H; Maldonado, Y A

1997-11-01

Sabin type 3 polio vaccine virus is the most common cause of poliovaccine associated paralytic poliomyelitis. Vaccine associated paralytic poliomyelitis cases have been associated with Sabin type 3 revertants containing a single U to C substitution at bp 472 of Sabin type 3. A rapid method of identification of Sabin type 3 bp 472 mutants is described. An enterovirus group-specific probe for use in a chemiluminescent dot blot hybridization assay was developed to identify enterovirus positive viral lysates. A reverse transcription-polymerase chain reaction (RT-PCR) assay producing a 319 bp PCR product containing the Sabin type 3 bp 472 mutation site was then employed to identify Sabin type 3 isolates. Chemiluminescent nucleic acid cycle sequencing of the purified 319 bp PCR product was then employed to identify nucleic acid sequences at bp 472. The enterovirus group probe hybridization procedure and isolation of the Sabin type 3 PCR product were highly sensitive and specific; nucleic acid cycle sequencing corresponded to the known sequence of stock Sabin type 3 isolates. These methods will be used to identify the Sabin type 3 reversion rate from sequential stool samples of infants obtained after the first and second doses of oral poliovirus vaccine.
DNA capture and next-generation sequencing can recover whole mitochondrial genomes from highly degraded samples for human identification

PubMed Central

2013-01-01

Background Mitochondrial DNA (mtDNA) typing can be a useful aid for identifying people from compromised samples when nuclear DNA is too damaged, degraded or below detection thresholds for routine short tandem repeat (STR)-based analysis. Standard mtDNA typing, focused on PCR amplicon sequencing of the control region (HVS I and HVS II), is limited by the resolving power of this short sequence, which misses up to 70% of the variation present in the mtDNA genome. Methods We used in-solution hybridisation-based DNA capture (using DNA capture probes prepared from modern human mtDNA) to recover mtDNA from post-mortem human remains in which the majority of DNA is both highly fragmented (<100 base pairs in length) and chemically damaged. The method ‘immortalises’ the finite quantities of DNA in valuable extracts as DNA libraries, which is followed by the targeted enrichment of endogenous mtDNA sequences and characterisation by next-generation sequencing (NGS). Results We sequenced whole mitochondrial genomes for human identification from samples where standard nuclear STR typing produced only partial profiles or demonstrably failed and/or where standard mtDNA hypervariable region sequences lacked resolving power. Multiple rounds of enrichment can substantially improve coverage and sequencing depth of mtDNA genomes from highly degraded samples. The application of this method has led to the reliable mitochondrial sequencing of human skeletal remains from unidentified World War Two (WWII) casualties approximately 70 years old and from archaeological remains (up to 2,500 years old). Conclusions This approach has potential applications in forensic science, historical human identification cases, archived medical samples, kinship analysis and population studies. In particular the methodology can be applied to any case, involving human or non-human species, where whole mitochondrial genome sequences are required to provide the highest level of maternal lineage discrimination. Multiple rounds of in-solution hybridisation-based DNA capture can retrieve whole mitochondrial genome sequences from even the most challenging samples. PMID:24289217
Discriminatory usefulness of pulsed-field gel electrophoresis and sequence-based typing in Legionella outbreaks.

PubMed

Quero, Sara; García-Núñez, Marian; Párraga-Niño, Noemí; Barrabeig, Irene; Pedro-Botet, Maria L; de Simon, Mercè; Sopena, Nieves; Sabrià, Miquel

2016-06-01

To compare the discriminatory power of pulsed-field gel electrophoresis (PFGE) and sequence-based typing (SBT) in Legionella outbreaks for determining the infection source. Twenty-five investigations of Legionnaires' disease were analyzed by PFGE, SBT and Dresden monoclonal antibody. The results suggested that monoclonal antibody could reduce the number of Legionella isolates to be characterized by molecular methods. The epidemiological concordance PFGE-SBT was 100%, while the molecular concordance was 64%. Adjusted Wallace index (AW) showed that PFGE has better discriminatory power than SBT (AWSBT→PFGE = 0.767; AWPFGE→SBT = 1). The discrepancies appeared mostly in sequence type (ST) 1, a worldwide distributed ST for which PFGE discriminated different profiles. SBT discriminatory power was not sufficient verifying the infection source, especially in worldwide distributed STs, which were classified into different PFGE patterns.
HPV-QUEST: A highly customized system for automated HPV sequence analysis capable of processing Next Generation sequencing data set.

PubMed

Yin, Li; Yao, Jiqiang; Gardner, Brent P; Chang, Kaifen; Yu, Fahong; Goodenow, Maureen M

2012-01-01

Next Generation sequencing (NGS) applied to human papilloma viruses (HPV) can provide sensitive methods to investigate the molecular epidemiology of multiple type HPV infection. Currently a genotyping system with a comprehensive collection of updated HPV reference sequences and a capacity to handle NGS data sets is lacking. HPV-QUEST was developed as an automated and rapid HPV genotyping system. The web-based HPV-QUEST subtyping algorithm was developed using HTML, PHP, Perl scripting language, and MYSQL as the database backend. HPV-QUEST includes a database of annotated HPV reference sequences with updated nomenclature covering 5 genuses, 14 species and 150 mucosal and cutaneous types to genotype blasted query sequences. HPV-QUEST processes up to 10 megabases of sequences within 1 to 2 minutes. Results are reported in html, text and excel formats and display e-value, blast score, and local and coverage identities; provide genus, species, type, infection site and risk for the best matched reference HPV sequence; and produce results ready for additional analyses.

Epstein-Barr Virus Sequence Variation—Biology and Disease

PubMed Central

Tzellos, Stelios; Farrell, Paul J.

2012-01-01

Some key questions in Epstein-Barr virus (EBV) biology center on whether naturally occurring sequence differences in the virus affect infection or EBV associated diseases. Understanding the pattern of EBV sequence variation is also important for possible development of EBV vaccines. At present EBV isolates worldwide can be grouped into Type 1 and Type 2, a classification based on the EBNA2 gene sequence. Type 1 EBV is the most prevalent worldwide but Type 2 is common in parts of Africa. Type 1 transforms human B cells into lymphoblastoid cell lines much more efficiently than Type 2 EBV. Molecular mechanisms that may account for this difference in cell transformation are now becoming clearer. Advances in sequencing technology will greatly increase the amount of whole EBV genome data for EBV isolated from different parts of the world. Study of regional variation of EBV strains independent of the Type 1/Type 2 classification and systematic investigation of the relationship between viral strains, infection and disease will become possible. The recent discovery that specific mutation of the EBV EBNA3B gene may be linked to development of diffuse large B cell lymphoma illustrates the importance that mutations in the virus genome may have in infection and human disease. PMID:25436768
Over a Decade of recA and tly Gene Sequence Typing of the Skin Bacterium Propionibacterium acnes: What Have We Learnt?

PubMed Central

2017-01-01

The Gram-positive, anaerobic bacterium Propionibacterium acnes forms part of the normal microbiota on human skin and mucosal surfaces. While normally associated with skin health, P. acnes is also an opportunistic pathogen linked with a range of human infections and clinical conditions. Over the last decade, our knowledge of the intraspecies phylogenetics and taxonomy of this bacterium has increased tremendously due to the introduction of DNA typing schemes based on single and multiple gene loci, as well as whole genomes. Furthermore, this work has led to the identification of specific lineages associated with skin health and human disease. In this review we will look back at the introduction of DNA sequence typing of P. acnes based on recA and tly loci, and then describe how these methods provided a basic understanding of the population genetic structure of the bacterium, and even helped characterize the grapevine-associated lineage of P. acnes, known as P. acnes type Zappe, which appears to have undergone a host switch from humans-to-plants. Particular limitations of recA and tly sequence typing will also be presented, as well as a detailed discussion of more recent, higher resolution, DNA-based methods to type P. acnes and investigate its evolutionary history in greater detail. PMID:29267255
Evaluation of an automated repetitive sequence-based PCR system for subtyping Enterobacter sakazakii.

PubMed

Healy, B; Mullane, N; Collin, V; Mailler, S; Iversen, C; Chatellier, S; Storrs, M; Fanning, S

2008-07-01

Enterobacter sakazakii is regarded as a ubiquitous organism that can be isolated from a wide range of foods and environments. Infection in at-risk infants has been epidemiologically linked to the consumption of contaminated powdered infant formula. Preventing the dissemination of this pathogen in a powdered infant formula manufacturing facility is an important step in ensuring consumer confidence in a given brand together with the protection of the health status of a vulnerable population. In this study we report the application of a repetitive sequence-based PCR typing method to subtype a previously well-characterized collection of E. sakazakii isolates of diverse origin. While both methods successfully discriminated between the collection of isolates, repetitive sequence-based PCR identified 65 types, whereas pulsed-field gel electrophoresis identified 110 types showing > or =95% similarity. The method was quick and easy to perform, and our data demonstrated the utility and value of this approach to monitor in-process contamination, which could potentially contribute to a reduction in the transmission of E. sakazakii.
Genome-wide-analyses of Listeria monocytogenes from food-processing plants reveal clonal diversity and date the emergence of persisting sequence types.

PubMed

Knudsen, Gitte M; Nielsen, Jesper Boye; Marvig, Rasmus L; Ng, Yin; Worning, Peder; Westh, Henrik; Gram, Lone

2017-08-01

Whole genome sequencing is increasing used in epidemiology, e.g. for tracing outbreaks of food-borne diseases. This requires in-depth understanding of pathogen emergence, persistence and genomic diversity along the food production chain including in food processing plants. We sequenced the genomes of 80 isolates of Listeria monocytogenes sampled from Danish food processing plants over a time-period of 20 years, and analysed the sequences together with 10 public available reference genomes to advance our understanding of interplant and intraplant genomic diversity of L. monocytogenes. Except for three persisting sequence types (ST) based on Multi Locus Sequence Typing being ST7, ST8 and ST121, long-term persistence of clonal groups was limited, and new clones were introduced continuously, potentially from raw materials. No particular gene could be linked to the persistence phenotype. Using time-based phylogenetic analyses of the persistent STs, we estimate the L. monocytogenes evolutionary rate to be 0.18-0.35 single nucleotide polymorphisms/year, suggesting that the persistent STs emerged approximately 100 years ago, which correlates with the onset of industrialization and globalization of the food market. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.
Species-specific Typing of DNA Based on Palindrome Frequency Patterns

PubMed Central

Lamprea-Burgunder, Estelle; Ludin, Philipp; Mäser, Pascal

2011-01-01

DNA in its natural, double-stranded form may contain palindromes, sequences which read the same from either side because they are identical to their reverse complement on the sister strand. Short palindromes are underrepresented in all kinds of genomes. The frequency distribution of short palindromes exhibits more than twice the inter-species variance of non-palindromic sequences, which renders palindromes optimally suited for the typing of DNA. Here, we show that based on palindrome frequency, DNA sequences can be discriminated to the level of species of origin. By plotting the ratios of actual occurrence to expectancy, we generate palindrome frequency patterns that allow to cluster different sequences of the same genome and to assign plasmids, and in some cases even viruses to their respective host genomes. This finding will be of use in the growing field of metagenomics. PMID:21429991
[Analysis of 4 clustered high risk acute flaccid paralysis cases in Shanxi Province in 2006].

PubMed

Yan, Dong-mei; Zhang, Yong; Wang, Dong-yan

2010-04-01

Analysis of epidemiology of 4 clustered high risk acute flaccid paralysis(AFP) cases reported by Shanxi province in 2006 and VP1 gene characteristic for type III poliovirus isolated from the four AFP cases. Virus isolation and identification were conducted according to the 4th edition of WHO polio laboratory manual. The sequence of VP1 region were amplified and sequenced. The phylogenetic trees based on VP1 region were constructed. Three of four high risk AFP cases were suspected as vaccine associated paralysis poliomyelitis (VAPP), the onset date of them were close. VP1 sequencing of the four type III isolates revealed that the identity were 99.7%, 99.9%, 99.4% and 99.9% respectively compared with vaccine reference strain-BJOPV3. According to WHO criteria, the four isolates were identified as type III vaccine-related poliovirus. Phylogenetic analysis based on VP1 coding sequence showed that the four type III poliovirus were not related significantly. The type III poliovirus isolated from 3 suspected VAPP cases shared one nucleotide mutation at 2637 (C-->U), which result in the amino acid mutation from Val into Ala. The improvement of laboratory surveillance for clustered high risk AFP cases should be strengthened so as to detect and prevent poliovirus circulation timely.
Comparison and Evaluation of the Molecular Typing Methods for Toxigenic Vibrio cholerae in Southwest China.

PubMed

Liao, Feng; Mo, Zhishuo; Chen, Meiling; Pang, Bo; Fu, Xiaoqing; Xu, Wen; Jing, Huaiqi; Kan, Biao; Gu, Wenpeng

2018-01-01

Vibrio cholerae O1 strains taken from the repository of Yunnan province, southwest China, were abundant and special. We selected 70 typical toxigenic V. cholerae (69 O1 and one O139 serogroup strains) isolated from Yunnan province, performed the pulsed field gel electrophoresis (PFGE), multilocus sequence typing (MLST), and MLST of virulence gene (V-MLST) methods, and evaluated the resolution abilities for typing methods. The ctxB subunit sequence analysis for all strains have shown that cholera between 1986 and 1995 was associated with mixed infections with El Tor and El Tor variants, while infections after 1996 were all caused by El Tor variant strains. Seventy V. cholerae obtained 50 PFGE patterns, with a high resolution. The strains could be divided into three groups with predominance of strains isolated during 1980s, 1990s, and 2000s, respectively, showing a good consistency with the epidemiological investigation. We also evaluated two MLST method for V. cholerae , one was used seven housekeeping genes ( adk , gyrB , metE , pntA , mdh , purM , and pyrC ), and all the isolates belonged to ST69; another was used nine housekeeping genes ( cat , chi , dnaE , gyrB , lap , pgm , recA , rstA , and gmd ). A total of seven sequence types (STs) were found by using this method for all the strains; among them, rstA gene had five alleles, recA and gmd have two alleles, and others had only one allele. The virulence gene sequence typing method ( ctxAB , tcpA , and toxR ) showed that 70 strains were divided into nine STs; among them, tcpA gene had six alleles, toxR had five alleles, while ctxAB was identical for all the strains. The latter two sequences based typing methods also had consistency with epidemiology of the strains. PFGE had a higher resolution ability compared with the sequence based typing method, and MLST used seven housekeeping genes showed the lower resolution power than nine housekeeping genes and virulence genes methods. These two sequence typing methods could distinguish some epidemiological special strains in local area.
Nucleotide sequences of the tet(M) genes from the American and Dutch type tetracycline resistance plasmids of Neisseria gonorrhoeae.

PubMed

Gascoyne-Binzi, D M; Heritage, J; Hawkey, P M

1993-11-01

High-level tetracycline-resistant Neisseria gonorrhoeae (TRNG) has been associated with the presence of a plasmid approximately 25.2 MDa in size which carries a Tet M tetracycline resistance determinant. Two different plasmid types, American and Dutch, have previously been described, based on the restriction endonuclease digestion pattern. In this study, the tet(M) genes from the two plasmid types have been amplified by the polymerase chain reaction (PCR) and then sequenced. The gene sequences from the two plasmids shared 96.8% identity, and showed similarities with different segments of the tet(M) gene sequences from Tn1545, Tn916 and Ureaplasma urealyticum. The data suggest that it is highly likely that the Tet M determinant found in the American type plasmid has a different origin from that present in the Dutch plasmid.
Cloning, sequencing, and expression of cDNA for human. beta. -glucuronidase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Oshima, A.; Kyle, J.W.; Miller, R.D.

1987-02-01

The authors report here the cDNA sequence for human placental ..beta..-glucuronidase (..beta..-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH/sub 2/-terminal amino acid sequence determined for human spleen ..beta..-glucuronidase agreed with that inferred from the DNAmore » sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human ..beta..-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human ..beta..-glucuronidase, demonstrate the existence of two populations of mRNA for ..beta..-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length.« less
Development of an ELA-DRA gene typing method based on pyrosequencing technology.

PubMed

Díaz, S; Echeverría, M G; It, V; Posik, D M; Rogberg-Muñoz, A; Pena, N L; Peral-García, P; Vega-Pla, J L; Giovambattista, G

2008-11-01

The polymorphism of equine lymphocyte antigen (ELA) class II DRA gene had been detected by polymerase chain reaction-single-strand conformational polymorphism (PCR-SSCP) and reference strand-mediated conformation analysis. These methodologies allowed to identify 11 ELA-DRA exon 2 sequences, three of which are widely distributed among domestic horse breeds. Herein, we describe the development of a pyrosequencing-based method applicable to ELA-DRA typing, by screening samples from eight different horse breeds previously typed by PCR-SSCP. This sequence-based method would be useful in high-throughput genotyping of major histocompatibility complex genes in horses and other animal species, making this system interesting as a rapid screening method for animal genotyping of immune-related genes.
Genome Sequence of the Thermophile Bacillus coagulans Hammer, the Type Strain of the Species

PubMed Central

Su, Fei; Tao, Fei; Tang, Hongzhi

2012-01-01

Here we announce a 3.0-Mb assembly of the Bacillus coagulans Hammer strain, which is the type strain of the species within the genus Bacillus. Genomic analyses based on the sequence may provide insights into the phylogeny of the species and help to elucidate characteristics of the poorly studied strains of Bacillus coagulans. PMID:23105047
Genome sequence of the thermophile Bacillus coagulans Hammer, the type strain of the species.

PubMed

Su, Fei; Tao, Fei; Tang, Hongzhi; Xu, Ping

2012-11-01

Here we announce a 3.0-Mb assembly of the Bacillus coagulans Hammer strain, which is the type strain of the species within the genus Bacillus. Genomic analyses based on the sequence may provide insights into the phylogeny of the species and help to elucidate characteristics of the poorly studied strains of Bacillus coagulans.
Development of a Multiplex Single Base Extension Assay for Mitochondrial DNA Haplogroup Typing

PubMed Central

Nelson, Tahnee M.; Just, Rebecca S.; Loreille, Odile; Schanfield, Moses S.; Podini, Daniele

2007-01-01

Aim To provide a screening tool to reduce time and sample consumption when attempting mtDNA haplogroup typing. Methods A single base primer extension assay was developed to enable typing, in a single reaction, of twelve mtDNA haplogroup specific polymorphisms. For validation purposes a total of 147 samples were tested including 73 samples successfully haplogroup typed using mtDNA control region (CR) sequence data, 21 samples inconclusively haplogroup typed by CR data, 20 samples previously haplogroup typed using restriction fragment length polymorphism (RFLP) analysis, and 31 samples of known ancestral origin without previous haplogroup typing. Additionally, two highly degraded human bones embalmed and buried in the early 1950s were analyzed using the single nucleotide polymorphisms (SNP) multiplex. Results When the SNP multiplex was used to type the 96 previously CR sequenced specimens, an increase in haplogroup or macrohaplogroup assignment relative to conventional CR sequence analysis was observed. The single base extension assay was also successfully used to assign a haplogroup to decades-old, embalmed skeletal remains dating to World War II. Conclusion The SNP multiplex was successfully used to obtain haplogroup status of highly degraded human bones, and demonstrated the ability to eliminate possible contributors. The SNP multiplex provides a low-cost, high throughput method for typing of mtDNA haplogroups A, B, C, D, E, F, G, H, L1/L2, L3, M, and N that could be useful for screening purposes for human identification efforts and anthropological studies. PMID:17696300
Protein sequences clustering of herpes virus by using Tribe Markov clustering (Tribe-MCL)

NASA Astrophysics Data System (ADS)

Bustamam, A.; Siswantining, T.; Febriyani, N. L.; Novitasari, I. D.; Cahyaningrum, R. D.

2017-07-01

The herpes virus can be found anywhere and one of the important characteristics is its ability to cause acute and chronic infection at certain times so as a result of the infection allows severe complications occurred. The herpes virus is composed of DNA containing protein and wrapped by glycoproteins. In this work, the Herpes viruses family is classified and analyzed by clustering their protein-sequence using Tribe Markov Clustering (Tribe-MCL) algorithm. Tribe-MCL is an efficient clustering method based on the theory of Markov chains, to classify protein families from protein sequences using pre-computed sequence similarity information. We implement the Tribe-MCL algorithm using an open source program of R. We select 24 protein sequences of Herpes virus obtained from NCBI database. The dataset consists of three types of glycoprotein B, F, and H. Each type has eight herpes virus that infected humans. Based on our simulation using different inflation factor r=1.5, 2, 3 we find a various number of the clusters results. The greater the inflation factor the greater the number of their clusters. Each protein will grouped together in the same type of protein.
Variability of the protein sequences of lcrV between epidemic and atypical rhamnose-positive strains of Yersinia pestis.

PubMed

Anisimov, Andrey P; Panfertsev, Evgeniy A; Svetoch, Tat'yana E; Dentovskaya, Svetlana V

2007-01-01

Sequencing of lcrV genes and comparison of the deduced amino acid sequences from ten Y. pestis strains belonging mostly to the group of atypical rhamnose-positive isolates (non-pestis subspecies or pestoides group) showed that the LcrV proteins analyzed could be classified into five sequence types. This classification was based on major amino acid polymorphisms among LcrV proteins in the four "hot points" of the protein sequences. Some additional minor polymorphisms were found throughout these sequence types. The "hot points" corresponded to amino acids 18 (Lys --> Asn), 72 (Lys --> Arg), 273 (Cys --> Ser), and 324-326 (Ser-Gly-Lys --> Arg) in the LcrV sequence of the reference Y. pestis strain CO92. One possible explanation for polymorphism in amino acid sequences of LcrV among different strains is that strain-specific variation resulted from adaptation of the plague pathogen to different rodent and lagomorph hosts.
The nucleotide sequence of the intergenic region between the 5.8S and 26S rRNA genes of the yeast ribosomal RNA operon. Possible implications for the interaction between 5.8S and 26S rRNA and the processing of the primary transcript.

PubMed Central

Veldman, G M; Klootwijk, J; van Heerikhuizen, H; Planta, R J

1981-01-01

We have determined the nucleotide sequence of part of a cloned yeast ribosomal RNA operon extending from the 5.8S RNA gene downstream into the 5' -terminal region of the 26S RNA gene. We mapped the pertinent processing sites, viz. the 5' end of 26S rRNA and the 3'ends of 5.8S rRNA and its immediate precursor, 7S RNA. At the 3' end of 7S RNA we find the sequence UCGUUU which is very similar to the type I consensus sequence UCAUUA/U present at the 3' ends of 17S, 5.8S and 26S rRNA as well as 18S precursor rRNA in yeast. At the 5' end of the 26S RNA gene we find a sequence of thirteen nucleotides which is homologous to the type II sequence present at the 5' termini of both the 17S and the 5.8S RNA gene. These findings further support the suggestion put forward earlier (G.M. Veldman et al. (1980) Nucl. Acids Res. 8, 2907-2920) that both consensus sequences are involved in the recognition of precursor rRNA by the processing nuclease(s). We discuss a model for the processing of yeast rRNA in which a processing enzyme sequentially recognizes several combinations of a type I and a type II consensus sequence. We also describe the existence of a significant base complementarity between sequences in the 5' -terminal region of 26S rRNA and the 3' -terminal region of 5.8S rRNA. We suggest that base pairing between these sequences contributes to the binding between 5.8S and 26S rRNA. Images PMID:7312619
A RESTful application programming interface for the PubMLST molecular typing and genome databases

PubMed Central

Bray, James E.; Maiden, Martin C. J.

2017-01-01

Abstract Molecular typing is used to differentiate microorganisms at the subspecies or strain level for epidemiological investigations, infection control, public health and environmental sampling. DNA sequence-based typing methods require authoritative databases that link sequence variants to nomenclature in order to facilitate communication and comparison of identified types in national or global settings. The PubMLST website (https://pubmlst.org/) fulfils this role for over a hundred microorganisms for which it hosts curated molecular sequence typing data, providing sequence and allelic profile definitions for multi-locus sequence typing (MLST) and single-gene typing approaches. In recent years, these have expanded to cover the whole genome with schemes such as core genome MLST (cgMLST) and whole genome MLST (wgMLST) which catalogue the allelic diversity found in hundreds to thousands of genes. These approaches provide a common nomenclature for high-resolution strain characterization and comparison. Molecular typing information is linked to isolate provenance, phenotype, and increasingly genome assemblies, providing a resource for outbreak investigation and research in to population structure, gene association, global epidemiology and vaccine coverage. A Representational State Transfer (REST) Application Programming Interface (API) has been developed for the PubMLST website to make these large quantities of structured molecular typing and whole genome sequence data available for programmatic access by any third party application. The API is an integral component of the Bacterial Isolate Genome Sequence Database (BIGSdb) platform that is used to host PubMLST resources, and exposes all public data within the site. In addition to data browsing, searching and download, the API supports authentication and submission of new data to curator queues. Database URL: http://rest.pubmlst.org/ PMID:29220452
Epidemiological information is key when interpreting whole genome sequence data – lessons learned from a large Legionella pneumophila outbreak in Warstein, Germany, 2013

PubMed Central

Petzold, Markus; Prior, Karola; Moran-Gilad, Jacob; Harmsen, Dag; Lück, Christian

2017-01-01

Introduction Whole genome sequencing (WGS) is increasingly used in Legionnaires’ disease (LD) outbreak investigations, owing to its higher resolution than sequence-based typing, the gold standard typing method for Legionella pneumophila, in the analysis of endemic strains. Recently, a gene-by-gene typing approach based on 1,521 core genes called core genome multilocus sequence typing (cgMLST) was described that enables a robust and standardised typing of L. pneumophila. Methods: We applied this cgMLST scheme to isolates obtained during the largest outbreak of LD reported so far in Germany. In this outbreak, the epidemic clone ST345 had been isolated from patients and four different environmental sources. In total 42 clinical and environmental isolates were retrospectively typed. Results: Epidemiologically unrelated ST345 isolates were clearly distinguishable from the epidemic clone. Remarkably, epidemic isolates split up into two distinct clusters, ST345-A and ST345-B, each respectively containing a mix of clinical and epidemiologically-related environmental samples. Discussion/conclusion: The outbreak was therefore likely caused by both variants of the single sequence type, which pre-existed in the environmental reservoirs. The two clusters differed by 40 alleles located in two neighbouring genomic regions of ca 42 and 26 kb. Additional analysis supported horizontal gene transfer of the two regions as responsible for the difference between the variants. Both regions comprise virulence genes and have previously been reported to be involved in recombination events. This corroborates the notion that genomic outbreak investigations should always take epidemiological information into consideration when making inferences. Overall, cgMLST proved helpful in disentangling the complex genomic epidemiology of the outbreak. PMID:29162202
Epidemiological information is key when interpreting whole genome sequence data - lessons learned from a large Legionella pneumophila outbreak in Warstein, Germany, 2013.

PubMed

Petzold, Markus; Prior, Karola; Moran-Gilad, Jacob; Harmsen, Dag; Lück, Christian

2017-11-01

IntroductionWhole genome sequencing (WGS) is increasingly used in Legionnaires' disease (LD) outbreak investigations, owing to its higher resolution than sequence-based typing, the gold standard typing method for Legionella pneumophila, in the analysis of endemic strains. Recently, a gene-by-gene typing approach based on 1,521 core genes called core genome multilocus sequence typing (cgMLST) was described that enables a robust and standardised typing of L. pneumophila . Methods : We applied this cgMLST scheme to isolates obtained during the largest outbreak of LD reported so far in Germany. In this outbreak, the epidemic clone ST345 had been isolated from patients and four different environmental sources. In total 42 clinical and environmental isolates were retrospectively typed. Results : Epidemiologically unrelated ST345 isolates were clearly distinguishable from the epidemic clone. Remarkably, epidemic isolates split up into two distinct clusters, ST345-A and ST345-B, each respectively containing a mix of clinical and epidemiologically-related environmental samples. Discussion/conclusion : The outbreak was therefore likely caused by both variants of the single sequence type, which pre-existed in the environmental reservoirs. The two clusters differed by 40 alleles located in two neighbouring genomic regions of ca 42 and 26 kb. Additional analysis supported horizontal gene transfer of the two regions as responsible for the difference between the variants. Both regions comprise virulence genes and have previously been reported to be involved in recombination events. This corroborates the notion that genomic outbreak investigations should always take epidemiological information into consideration when making inferences. Overall, cgMLST proved helpful in disentangling the complex genomic epidemiology of the outbreak.
Molecular analysis of Acinetobacter baumannii strains isolated in Lebanon using four different typing methods.

PubMed

Rafei, Rayane; Dabboussi, Fouad; Hamze, Monzer; Eveillard, Matthieu; Lemarié, Carole; Gaultier, Marie-Pierre; Mallat, Hassan; Moghnieh, Rima; Husni-Samaha, Rola; Joly-Guillou, Marie-Laure; Kempf, Marie

2014-01-01

This study analyzed 42 Acinetobacter baumannii strains collected between 2009-2012 from different hospitals in Beyrouth and North Lebanon to better understand the epidemiology and carbapenem resistance mechanisms in our collection and to compare the robustness of pulsed field gel electrophoresis (PFGE), multilocus sequence typing (MLST), repetitive sequence-based PCR (rep-PCR) and blaOXA-51 sequence-based typing (SBT). Among 31 carbapenem resistant strains, we have detected three carbapenem resistance genes: 28 carried the blaOXA-23 gene, 1 the blaOXA-24 gene and 2 strains the blaOXA-58 gene. This is the first detection of blaOXA-23 and blaOXA-24 in Lebanon. PFGE identified 11 types and was the most discriminating technique followed by rep-PCR (9 types), blaOXA-51 SBT (8 types) and MLST (7 types). The PFGE type A'/ST2 was the dominant genotype in our collection present in Beyrouth and North Lebanon. The clustering agreement between all techniques was measured by adjust Wallace coefficient. An overall agreement has been demonstrated. High values of adjust Wallace coefficient were found with followed combinations: PFGE to predict MLST types = 100%, PFGE to predict blaOXA-51 SBT = 100%, blaOXA-51 SBT to predict MLST = 100%, MLST to predict blaOXA-51 SBT = 84.7%, rep-PCR to predict MLST = 81.5%, PFGE to predict rep-PCR = 69% and rep-PCR to predict blaOXA-51 SBT = 67.2%. PFGE and MLST are gold standard methods for outbreaks investigation and population structure studies respectively. Otherwise, these two techniques are technically, time and cost demanding. We recommend the use of blaOXA-51 SBT as first typing method to screen isolates and assign them to their corresponding clonal lineages. Repetitive sequence-based PCR is a rapid tool to access outbreaks but careful interpretation of results must be always performed.

Gene sequence analyses and other DNA-based methods for yeast species recognition

USDA-ARS?s Scientific Manuscript database

DNA sequence analyses, as well as other DNA-based methodologies, have transformed the way in which yeasts are identified. The focus of this chapter will be on the resolution of species using various types of DNA comparisons. In other chapters in this book, Rozpedowska, Piškur and Wolfe discuss mul...
Introduction of the hybcell-based compact sequencing technology and comparison to state-of-the-art methodologies for KRAS mutation detection.

PubMed

Zopf, Agnes; Raim, Roman; Danzer, Martin; Niklas, Norbert; Spilka, Rita; Pröll, Johannes; Gabriel, Christian; Nechansky, Andreas; Roucka, Markus

2015-03-01

The detection of KRAS mutations in codons 12 and 13 is critical for anti-EGFR therapy strategies; however, only those methodologies with high sensitivity, specificity, and accuracy as well as the best cost and turnaround balance are suitable for routine daily testing. Here we compared the performance of compact sequencing using the novel hybcell technology with 454 next-generation sequencing (454-NGS), Sanger sequencing, and pyrosequencing, using an evaluation panel of 35 specimens. A total of 32 mutations and 10 wild-type cases were reported using 454-NGS as the reference method. Specificity ranged from 100% for Sanger sequencing to 80% for pyrosequencing. Sanger sequencing and hybcell-based compact sequencing achieved a sensitivity of 96%, whereas pyrosequencing had a sensitivity of 88%. Accuracy was 97% for Sanger sequencing, 85% for pyrosequencing, and 94% for hybcell-based compact sequencing. Quantitative results were obtained for 454-NGS and hybcell-based compact sequencing data, resulting in a significant correlation (r = 0.914). Whereas pyrosequencing and Sanger sequencing were not able to detect multiple mutated cell clones within one tumor specimen, 454-NGS and the hybcell-based compact sequencing detected multiple mutations in two specimens. Our comparison shows that the hybcell-based compact sequencing is a valuable alternative to state-of-the-art methodologies used for detection of clinically relevant point mutations.
Multi-virulence-locus sequence typing of Staphylococcus lugdunensis generates results consistent with a clonal population structure and is reliable for epidemiological typing.

PubMed

Didi, Jennifer; Lemée, Ludovic; Gibert, Laure; Pons, Jean-Louis; Pestel-Caron, Martine

2014-10-01

Staphylococcus lugdunensis is an emergent virulent coagulase-negative staphylococcus responsible for severe infections similar to those caused by Staphylococcus aureus. To understand its potentially pathogenic capacity and have further detailed knowledge of the molecular traits of this organism, 93 isolates from various geographic origins were analyzed by multi-virulence-locus sequence typing (MVLST), targeting seven known or putative virulence-associated loci (atlLR2, atlLR3, hlb, isdJ, SLUG_09050, SLUG_16930, and vwbl). The polymorphisms of the putative virulence-associated loci were moderate and comparable to those of the housekeeping genes analyzed by multilocus sequence typing (MLST). However, the MVLST scheme generated 43 virulence types (VTs) compared to 20 sequence types (STs) based on MLST, indicating that MVLST was significantly more discriminating (Simpson's index [D], 0.943). No hypervirulent lineage or cluster specific to carriage strains was defined. The results of multilocus sequence analysis of known and putative virulence-associated loci are consistent with a clonal population structure for S. lugdunensis, suggesting a coevolution of these genes with housekeeping genes. Indeed, the nonsynonymous to synonymous evolutionary substitutions (dN/dS) ratio, the Tajima's D test, and Single-likelihood ancestor counting (SLAC) analysis suggest that all virulence-associated loci were under negative selection, even atlLR2 (AtlL protein) and SLUG_16930 (FbpA homologue), for which the dN/dS ratios were higher. In addition, this analysis of virulence-associated loci allowed us to propose a trilocus sequence typing scheme based on the intragenic regions of atlLR3, isdJ, and SLUG_16930, which is more discriminant than MLST for studying short-term epidemiology and further characterizing the lineages of the rare but highly pathogenic S. lugdunensis. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Novel Single Nucleotide Polymorphism-Based Assay for Genotyping Mycobacterium avium subsp. paratuberculosis

PubMed Central

Goldstone, Robert J.; McLuckie, Joyce; Smith, David G. E.

2015-01-01

Typing of Mycobacterium avium subspecies paratuberculosis strains presents a challenge, since they are genetically monomorphic and traditional molecular techniques have limited discriminatory power. The recent advances and availability of whole-genome sequencing have extended possibilities for the characterization of Mycobacterium avium subspecies paratuberculosis, and whole-genome sequencing can provide a phylogenetic context to facilitate global epidemiology studies. In this study, we developed a single nucleotide polymorphism (SNP) assay based on PCR and restriction enzyme digestion or sequencing of the amplified product. The SNP analysis was performed using genome sequence data from 133 Mycobacterium avium subspecies paratuberculosis isolates with different genotypes from 8 different host species and 17 distinct geographic regions around the world. A total of 28,402 SNPs were identified among all of the isolates. The minimum number of SNPs required to distinguish between all of the 133 genomes was 93 and between only the type C isolates was 41. To reduce the number of SNPs and PCRs required, we adopted an approach based on sequential detection of SNPs and a decision tree. By the analysis of 14 SNPs Mycobacterium avium subspecies paratuberculosis isolates can be characterized within 14 phylogenetic groups with a higher discriminatory power than mycobacterial interspersed repetitive unit–variable number tandem repeat assay and other typing methods. Continuous updating of genome sequences is needed in order to better characterize new phylogenetic groups and SNP profiles. The novel SNP assay is a discriminative, simple, reproducible method and requires only basic laboratory equipment for the large-scale global typing of Mycobacterium avium subspecies paratuberculosis isolates. PMID:26677250
Improved detection of genetic markers of antimicrobial resistance by hybridization probe-based melting curve analysis using primers to mask proximal mutations: examples include the influenza H275Y substitution.

PubMed

Whiley, David M; Jacob, Kevin; Nakos, Jennifer; Bletchly, Cheryl; Nimmo, Graeme R; Nissen, Michael D; Sloots, Theo P

2012-06-01

Numerous real-time PCR assays have been described for detection of the influenza A H275Y alteration. However, the performance of these methods can be undermined by sequence variation in the regions flanking the codon of interest. This is a problem encountered more broadly in microbial diagnostics. In this study, we developed a modification of hybridization probe-based melting curve analysis, whereby primers are used to mask proximal mutations in the sequence targets of hybridization probes, so as to limit the potential for sequence variation to interfere with typing. The approach was applied to the H275Y alteration of the influenza A (H1N1) 2009 strain, as well as a Neisseria gonorrhoeae mutation associated with antimicrobial resistance. Assay performances were assessed using influenza A and N. gonorrhoeae strains characterized by DNA sequencing. The modified hybridization probe-based approach proved successful in limiting the effects of proximal mutations, with the results of melting curve analyses being 100% consistent with the results of DNA sequencing for all influenza A and N. gonorrhoeae strains tested. Notably, these included influenza A and N. gonorrhoeae strains exhibiting additional mutations in hybridization probe targets. Of particular interest was that the H275Y assay correctly typed influenza A strains harbouring a T822C nucleotide substitution, previously shown to interfere with H275Y typing methods. Overall our modified hybridization probe-based approach provides a simple means of circumventing problems caused by sequence variation, and offers improved detection of the influenza A H275Y alteration and potentially other resistance mechanisms.
Surveillance and laboratory detection for non-polio enteroviruses in the European Union/European Economic Area, 2016

PubMed Central

Harvala, Heli; Jasir, Aftab; Penttinen, Pasi; Pastore Celentano, Lucia; Greco, Donato; Broberg, Eeva

2017-01-01

Enteroviruses (EVs) cause severe outbreaks of respiratory and neurological disease as illustrated by EV-D68 and EV-A71 outbreaks, respectively. We have mapped European laboratory capacity for identification and characterisation of non-polio EVs to improve preparedness to respond to (re)-emerging EVs linked to severe disease. An online questionnaire on non-polio EV surveillance and laboratory detection was submitted to all 30 European Union (EU)/European Economic Area (EEA) countries. Twenty-nine countries responded; 26 conducted laboratory-based non-polio EV surveillance, and 24 included neurological infections in their surveillance. Eleven countries have established specific surveillance for EV-D68 via sentinel influenza surveillance (n = 7), typing EV-positive respiratory samples (n = 10) and/or acute flaccid paralysis surveillance (n = 5). Of 26 countries performing non-polio EV characterisation/typing, 10 further characterised culture-positive EV isolates, whereas the remainder typed PCR-positive but culture-negative samples. Although 19 countries have introduced sequence-based EV typing, seven still rely entirely on virus isolation. Based on 2015 data, six countries typed over 300 specimens mostly by sequencing, whereas 11 countries characterised under 50 EV-positive samples. EV surveillance activity varied between EU/EEA countries, and did not always specifically target patients with neurological and/or respiratory infections. Introduction of sequence-based typing methods is needed throughout the EU/EEA to enhance laboratory capacity for the detection of EVs. PMID:29162204
Surveillance and laboratory detection for non-polio enteroviruses in the European Union/European Economic Area, 2016.

PubMed

Harvala, Heli; Jasir, Aftab; Penttinen, Pasi; Pastore Celentano, Lucia; Greco, Donato; Broberg, Eeva

2017-11-01

Enteroviruses (EVs) cause severe outbreaks of respiratory and neurological disease as illustrated by EV-D68 and EV-A71 outbreaks, respectively. We have mapped European laboratory capacity for identification and characterisation of non-polio EVs to improve preparedness to respond to (re)-emerging EVs linked to severe disease. An online questionnaire on non-polio EV surveillance and laboratory detection was submitted to all 30 European Union (EU)/European Economic Area (EEA) countries. Twenty-nine countries responded; 26 conducted laboratory-based non-polio EV surveillance, and 24 included neurological infections in their surveillance. Eleven countries have established specific surveillance for EV-D68 via sentinel influenza surveillance (n = 7), typing EV-positive respiratory samples (n = 10) and/or acute flaccid paralysis surveillance (n = 5). Of 26 countries performing non-polio EV characterisation/typing, 10 further characterised culture-positive EV isolates, whereas the remainder typed PCR-positive but culture-negative samples. Although 19 countries have introduced sequence-based EV typing, seven still rely entirely on virus isolation. Based on 2015 data, six countries typed over 300 specimens mostly by sequencing, whereas 11 countries characterised under 50 EV-positive samples. EV surveillance activity varied between EU/EEA countries, and did not always specifically target patients with neurological and/or respiratory infections. Introduction of sequence-based typing methods is needed throughout the EU/EEA to enhance laboratory capacity for the detection of EVs.
Whole Genome Sequencing for Genomics-Guided Investigations of Escherichia coli O157:H7 Outbreaks.

PubMed

Rusconi, Brigida; Sanjar, Fatemeh; Koenig, Sara S K; Mammel, Mark K; Tarr, Phillip I; Eppinger, Mark

2016-01-01

Multi isolate whole genome sequencing (WGS) and typing for outbreak investigations has become a reality in the post-genomics era. We applied this technology to strains from Escherichia coli O157:H7 outbreaks. These include isolates from seven North America outbreaks, as well as multiple isolates from the same patient and from different infected individuals in the same household. Customized high-resolution bioinformatics sequence typing strategies were developed to assess the core genome and mobilome plasticity. Sequence typing was performed using an in-house single nucleotide polymorphism (SNP) discovery and validation pipeline. Discriminatory power becomes of particular importance for the investigation of isolates from outbreaks in which macrogenomic techniques such as pulse-field gel electrophoresis or multiple locus variable number tandem repeat analysis do not differentiate closely related organisms. We also characterized differences in the phage inventory, allowing us to identify plasticity among outbreak strains that is not detectable at the core genome level. Our comprehensive analysis of the mobilome identified multiple plasmids that have not previously been associated with this lineage. Applied phylogenomics approaches provide strong molecular evidence for exceptionally little heterogeneity of strains within outbreaks and demonstrate the value of intra-cluster comparisons, rather than basing the analysis on archetypal reference strains. Next generation sequencing and whole genome typing strategies provide the technological foundation for genomic epidemiology outbreak investigation utilizing its significantly higher sample throughput, cost efficiency, and phylogenetic relatedness accuracy. These phylogenomics approaches have major public health relevance in translating information from the sequence-based survey to support timely and informed countermeasures. Polymorphisms identified in this work offer robust phylogenetic signals that index both short- and long-term evolution and can complement currently employed typing schemes for outbreak ex- and inclusion, diagnostics, surveillance, and forensic studies.
Whole Genome Sequencing for Genomics-Guided Investigations of Escherichia coli O157:H7 Outbreaks

PubMed Central

Rusconi, Brigida; Sanjar, Fatemeh; Koenig, Sara S. K.; Mammel, Mark K.; Tarr, Phillip I.; Eppinger, Mark

2016-01-01

Multi isolate whole genome sequencing (WGS) and typing for outbreak investigations has become a reality in the post-genomics era. We applied this technology to strains from Escherichia coli O157:H7 outbreaks. These include isolates from seven North America outbreaks, as well as multiple isolates from the same patient and from different infected individuals in the same household. Customized high-resolution bioinformatics sequence typing strategies were developed to assess the core genome and mobilome plasticity. Sequence typing was performed using an in-house single nucleotide polymorphism (SNP) discovery and validation pipeline. Discriminatory power becomes of particular importance for the investigation of isolates from outbreaks in which macrogenomic techniques such as pulse-field gel electrophoresis or multiple locus variable number tandem repeat analysis do not differentiate closely related organisms. We also characterized differences in the phage inventory, allowing us to identify plasticity among outbreak strains that is not detectable at the core genome level. Our comprehensive analysis of the mobilome identified multiple plasmids that have not previously been associated with this lineage. Applied phylogenomics approaches provide strong molecular evidence for exceptionally little heterogeneity of strains within outbreaks and demonstrate the value of intra-cluster comparisons, rather than basing the analysis on archetypal reference strains. Next generation sequencing and whole genome typing strategies provide the technological foundation for genomic epidemiology outbreak investigation utilizing its significantly higher sample throughput, cost efficiency, and phylogenetic relatedness accuracy. These phylogenomics approaches have major public health relevance in translating information from the sequence-based survey to support timely and informed countermeasures. Polymorphisms identified in this work offer robust phylogenetic signals that index both short- and long-term evolution and can complement currently employed typing schemes for outbreak ex- and inclusion, diagnostics, surveillance, and forensic studies. PMID:27446025
AgdbNet – antigen sequence database software for bacterial typing

PubMed Central

Jolley, Keith A; Maiden, Martin CJ

2006-01-01

Background Bacterial typing schemes based on the sequences of genes encoding surface antigens require databases that provide a uniform, curated, and widely accepted nomenclature of the variants identified. Due to the differences in typing schemes, imposed by the diversity of genes targeted, creating these databases has typically required the writing of one-off code to link the database to a web interface. Here we describe agdbNet, widely applicable web database software that facilitates simultaneous BLAST querying of multiple loci using either nucleotide or peptide sequences. Results Databases are described by XML files that are parsed by a Perl CGI script. Each database can have any number of loci, which may be defined by nucleotide and/or peptide sequences. The software is currently in use on at least five public databases for the typing of Neisseria meningitidis, Campylobacter jejuni and Streptococcus equi and can be set up to query internal isolate tables or suitably-configured external isolate databases, such as those used for multilocus sequence typing. The style of the resulting website can be fully configured by modifying stylesheets and through the use of customised header and footer files that surround the output of the script. Conclusion The software provides a rapid means of setting up customised Internet antigen sequence databases. The flexible configuration options enable typing schemes with differing requirements to be accommodated. PMID:16790057
Distribution of monoclonal antibody subgroups and sequence-based types among Legionella pneumophila serogroup 1 isolates derived from cooling tower water, bathwater, and soil in Japan.

PubMed

Amemura-Maekawa, Junko; Kikukawa, Kiyomi; Helbig, Jürgen H; Kaneko, Satoko; Suzuki-Hashimoto, Atsuko; Furuhata, Katsunori; Chang, Bin; Murai, Miyo; Ichinose, Masayuki; Ohnishi, Makoto; Kura, Fumiaki

2012-06-01

Legionella pneumophila serogroup (SG) 1 is the most frequent cause of legionellosis. This study analyzed environmental isolates of L. pneumophila SG 1 in Japan using monoclonal antibody (MAb) typing and sequence-based typing (SBT). Samples were analyzed from bathwater (BW; n = 50), cooling tower water (CT; n = 50), and soil (SO; n = 35). The distribution of MAb types varied by source, with the most prevalent types being Bellingham (42%), Oxford (72%), and OLDA (51%) in BW, CT, and SO, respectively. The ratios of MAb 3/1 positive isolates were 26, 2, and 14% from BW, CT, and SO, respectively. The environmental isolates from BW, CT, and SO were divided into 34 sequence types (STs; index of discrimination [IOD] = 0.973), 8 STs (IOD = 0.448), and 11 STs (IOD = 0.879), respectively. Genetic variation among CT isolates was smaller than seen in BW and SO. ST1 accounted for 74% of the CT isolates. The only common STs between (i) BW and CT, (ii) BW and SO, and (iii) CT and SO were ST1, ST129, and ST48, respectively, suggesting that each environment constitutes an independent habitat.
Characteristics and molecular phylogeny of Fasciola flukes from Bangladesh, determined based on spermatogenesis and nuclear and mitochondrial DNA analyses.

PubMed

Mohanta, Uday Kumar; Ichikawa-Seki, Madoka; Shoriki, Takuya; Katakura, Ken; Itagaki, Tadashi

2014-07-01

This study aimed to precisely discriminate Fasciola spp. based on DNA sequences of nuclear internal transcribed spacer 1 (ITS1) and mitochondrial nicotinamide adenine dinucleotide (NADH) dehydrogenase subunit 1 (nad1) gene. We collected 150 adult flukes from the bile ducts of cattle, buffaloes, sheep, and goats from six different regions of Bangladesh. Spermatogenic status was determined by analyzing stained seminal vesicles. The ITS1 types were analyzed using the polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) method. The nad1 haplotypes were identified based on PCR and direct sequencing and analyzed phylogenetically by comparing with nad1 haplotypes of Fasciola spp. from other Asian countries. Of the 127 aspermic flukes, 98 were identified as Fg type in ITS1, whereas 29 were identified as Fh/Fg type, indicating a combination of ITS1 sequences of Fasciola hepatica and Fasciola gigantica. All the 127 aspermic flukes showed Fsp-NDI-Bd11 in nad1 haplotype with nucleotide sequences identical to aspermic Fasciola sp. from Asian countries. Further, 20 spermic flukes were identified as F. gigantica based on their spermatogenic status and Fg type in ITS1. F. gigantica population was thought to be introduced into Bangladesh considerably earlier than the aspermic Fasciola sp. because 11 haplotypes with high haplotype diversity were detected from the F. gigantica population. However, three flukes from Bangladesh could not be precisely identified, because their spermatogenic status, ITS1 types, and nad1 haplotypes were ambiguous. Therefore, developing a robust method to distinguish aspermic Fasciola sp. from other Fasciola species is necessary in the future.
Population Based Assessment of MHC Class 1 Antigens Down Regulation as Marker in Increased Risk for Development and Progression of Breast Cancer From Benign Breast Lesions

DTIC Science & Technology

2006-01-01

isolated using a routine salting-out method (DNA E-Z Prepkit, Orchid Diagnostics Europe, St Katelijne Waver, Belgium). Sequence based typing In...electrophoresis using ethidiumbromide to show the single 2 KB band before sequencing. Next, sequencing reactions were performed separately for exons 2, 3...Multiplex reverse transcription-polymerase chain reaction for simultaneous screening of 29 translocations and chromosomal aberrations in acute
Non coding extremities of the seven influenza virus type C vRNA segments: effect on transcription and replication by the type C and type A polymerase complexes

PubMed Central

Crescenzo-Chaigne, Bernadette; Barbezange, Cyril; van der Werf, Sylvie

2008-01-01

Background The transcription/replication of the influenza viruses implicate the terminal nucleotide sequences of viral RNA, which comprise sequences at the extremities conserved among the genomic segments as well as variable 3' and 5' non-coding (NC) regions. The plasmid-based system for the in vivo reconstitution of functional ribonucleoproteins, upon expression of viral-like RNAs together with the nucleoprotein and polymerase proteins has been widely used to analyze transcription/replication of influenza viruses. It was thus shown that the type A polymerase could transcribe and replicate type A, B, or C vRNA templates whereas neither type B nor type C polymerases were able to transcribe and replicate type A templates efficiently. Here we studied the importance of the NC regions from the seven segments of type C influenza virus for efficient transcription/replication by the type A and C polymerases. Results The NC sequences of the seven genomic segments of the type C influenza virus C/Johannesburg/1/66 strain were found to be more variable in length than those of the type A and B viruses. The levels of transcription/replication of viral-like vRNAs harboring the NC sequences of the respective type C virus segments flanking the CAT reporter gene were comparable in the presence of either type C or type A polymerase complexes except for the NS and PB2-like vRNAs. For the NS-like vRNA, the transcription/replication level was higher after introduction of a U residue at position 6 in the 5' NC region as for all other segments. For the PB2-like vRNA the CAT expression level was particularly reduced with the type C polymerase. Analysis of mutants of the 5' NC sequence in the PB2-like vRNA, the shortest 5' NC sequence among the seven segments, showed that additional sequences within the PB2 ORF were essential for the efficiency of transcription but not replication by the type C polymerase complex. Conclusion In the context of a PB2-like reporter vRNA template, the sequence upstream the polyU stretch plays a role in the transcription/replication process by the type C polymerase complex. PMID:18973655
Study of mitochondria D-loop gene to detect the heterogeneity of gemak in Turnicidae family

NASA Astrophysics Data System (ADS)

Setiati, N.; Partaya

2018-03-01

As a part of life biodiversity, birds in Turnicidae family should be preserved from the extinction and its type heterogeneity decline. One effort for giving the strategic base of plasma nutfah conservation is through genetic heterogeneity study. The aim of the research is to analyze D-loop gen from DNA mitochondria of gemak bird in Turnicidae family molecularly. From the result of the analysis, it may be known the genetic heterogeneity of gemak bird based on the sequence of D-loop gen. The collection of both types of gemak of Turnicidae family is still easy since we can find them in ricefield area after harvest particularly for Gemakloreng (Turnix sylvatica), it means while gemak tegalan (Turnixsusciator) is getting difficult to find. Based on the above DNA quantification standard, the blood sample of Gemak in this research is mostly grouped into pure blood (ranges from 1,63 – 1,90), and it deserves to be used for PCR analysis. The sequencing analysis has not detected the sequence of nucleotide completely. However, it indicates sequence polymorphism of base as the arranger of D-loop gen. D-loop gen may identify genetic heterogeneity of gemak bird of Turnicidae family, but it is necessary to perform further sequencing analysis with PCR-RFLP technique. This complete nucleotide sequence is obtained and easy to detect after being cut restriction enzyme.
[Clustered regularly interspaced short palindromic repeats (CRISPR) site in Bacillus anthracis].

PubMed

Gao, Zhiqi; Wang, Dongshu; Feng, Erling; Wang, Bingxiang; Hui, Yiming; Han, Shaobo; Jiao, Lei; Liu, Xiankai; Wang, Hengliang

2014-11-04

To investigate the polymorphism of clustered regularly interspaced short palindromic repeats (CRISPR) in Bacillu santhracis and the application to molecular typing based on the polymorphism of CRISPR in B. anthracis. We downloaded the whole genome sequence of 6 B. anthracis strains and extracted the CRISPR sites. We designed the primers of CRISPR sites and amplified the CRISPR fragments in 193 B. anthracis strains by PCR and sequenced these fragments. In order to reveal the polymorphism of CRISPR in B. anthracis, wealigned all the extracted sequences and sequenced results by local blasting. At the same time, we also analyzed the CRISPR sites in B. cereus and B. thuringiensis. We did not find any polymorphism of CRISPR in B. anthracis. The molecular typing approach based on CRISPR polymorphism is not suitable for B. anthracis, but it is possible for us to distinguish B. anthracis from B. cereus and B. thuringiensis.
Classification of circulation type sequences applied to snow avalanches over the eastern Pyrenees (Andorra and Catalonia)

NASA Astrophysics Data System (ADS)

Esteban, Pere; Beck, Christoph; Philipp, Andreas

2010-05-01

Using data associated with accidents or damages caused by snow avalanches over the eastern Pyrenees (Andorra and Catalonia) several atmospheric circulation type catalogues have been obtained. For this purpose, different circulation type classification methods based on Principal Component Analysis (T-mode and S-mode using the extreme scores) and on optimization procedures (Improved K-means and SANDRA) were applied . Considering the characteristics of the phenomena studied, not only single day circulation patterns were taken into account but also sequences of circulation types of varying length. Thus different classifications with different numbers of types and for different sequence lengths were obtained using the different classification methods. Simple between type variability, within type variability, and outlier detection procedures have been applied for selecting the best result concerning snow avalanches type classifications. Furthermore, days without occurrence of the hazards were also related to the avalanche centroids using pattern-correlations, facilitating the calculation of the anomalies between hazardous and no hazardous days, and also frequencies of occurrence of hazardous events for each circulation type. Finally, the catalogues statistically considered the best results are evaluated using the avalanche forecaster expert knowledge. Consistent explanation of snow avalanches occurrence by means of circulation sequences is obtained, but always considering results from classifications with different sequence length. This work has been developed in the framework of the COST Action 733 (Harmonisation and Applications of Weather Type Classifications for European regions).
spa Typing and Multilocus Sequence Typing Show Comparable Performance in a Macroepidemiologic Study of Staphylococcus aureus in the United States

PubMed Central

O'Hara, F. Patrick; Suaya, Jose A.; Ray, G. Thomas; Baxter, Roger; Brown, Megan L.; Mera, Robertino M.; Close, Nicole M.; Thomas, Elizabeth

2016-01-01

A number of molecular typing methods have been developed for characterization of Staphylococcus aureus isolates. The utility of these systems depends on the nature of the investigation for which they are used. We compared two commonly used methods of molecular typing, multilocus sequence typing (MLST) (and its clustering algorithm, Based Upon Related Sequence Type [BURST]) with the staphylococcal protein A (spa) typing (and its clustering algorithm, Based Upon Repeat Pattern [BURP]), to assess the utility of these methods for macroepidemiology and evolutionary studies of S. aureus in the United States. We typed a total of 366 clinical isolates of S. aureus by these methods and evaluated indices of diversity and concordance values. Our results show that, when combined with the BURP clustering algorithm to delineate clonal lineages, spa typing produces results that are highly comparable with those produced by MLST/BURST. Therefore, spa typing is appropriate for use in macroepidemiology and evolutionary studies and, given its lower implementation cost, this method appears to be more efficient. The findings are robust and are consistent across different settings, patient ages, and specimen sources. Our results also support a model in which the methicillin-resistant S. aureus (MRSA) population in the United States comprises two major lineages (USA300 and USA100), which each consist of closely related variants. PMID:26669861
spa Typing and Multilocus Sequence Typing Show Comparable Performance in a Macroepidemiologic Study of Staphylococcus aureus in the United States.

PubMed

O'Hara, F Patrick; Suaya, Jose A; Ray, G Thomas; Baxter, Roger; Brown, Megan L; Mera, Robertino M; Close, Nicole M; Thomas, Elizabeth; Amrine-Madsen, Heather

2016-01-01

A number of molecular typing methods have been developed for characterization of Staphylococcus aureus isolates. The utility of these systems depends on the nature of the investigation for which they are used. We compared two commonly used methods of molecular typing, multilocus sequence typing (MLST) (and its clustering algorithm, Based Upon Related Sequence Type [BURST]) with the staphylococcal protein A (spa) typing (and its clustering algorithm, Based Upon Repeat Pattern [BURP]), to assess the utility of these methods for macroepidemiology and evolutionary studies of S. aureus in the United States. We typed a total of 366 clinical isolates of S. aureus by these methods and evaluated indices of diversity and concordance values. Our results show that, when combined with the BURP clustering algorithm to delineate clonal lineages, spa typing produces results that are highly comparable with those produced by MLST/BURST. Therefore, spa typing is appropriate for use in macroepidemiology and evolutionary studies and, given its lower implementation cost, this method appears to be more efficient. The findings are robust and are consistent across different settings, patient ages, and specimen sources. Our results also support a model in which the methicillin-resistant S. aureus (MRSA) population in the United States comprises two major lineages (USA300 and USA100), which each consist of closely related variants.
Whole genome sequencing options for bacterial strain typing and epidemiologic analysis based on single nucleotide polymorphism versus gene-by-gene-based approaches.

PubMed

Schürch, A C; Arredondo-Alonso, S; Willems, R J L; Goering, R V

2018-04-01

Whole genome sequence (WGS)-based strain typing finds increasing use in the epidemiologic analysis of bacterial pathogens in both public health as well as more localized infection control settings. This minireview describes methodologic approaches that have been explored for WGS-based epidemiologic analysis and considers the challenges and pitfalls of data interpretation. Personal collection of relevant publications. When applying WGS to study the molecular epidemiology of bacterial pathogens, genomic variability between strains is translated into measures of distance by determining single nucleotide polymorphisms in core genome alignments or by indexing allelic variation in hundreds to thousands of core genes, assigning types to unique allelic profiles. Interpreting isolate relatedness from these distances is highly organism specific, and attempts to establish species-specific cutoffs are unlikely to be generally applicable. In cases where single nucleotide polymorphism or core gene typing do not provide the resolution necessary for accurate assessment of the epidemiology of bacterial pathogens, inclusion of accessory gene or plasmid sequences may provide the additional required discrimination. As with all epidemiologic analysis, realizing the full potential of the revolutionary advances in WGS-based approaches requires understanding and dealing with issues related to the fundamental steps of data generation and interpretation. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.

Discovery of a bovine enterovirus in alpaca.

PubMed

McClenahan, Shasta D; Scherba, Gail; Borst, Luke; Fredrickson, Richard L; Krause, Philip R; Uhlenhaut, Christine

2013-01-01

A cytopathic virus was isolated using Madin-Darby bovine kidney (MDBK) cells from lung tissue of alpaca that died of a severe respiratory infection. To identify the virus, the infected cell culture supernatant was enriched for virus particles and a generic, PCR-based method was used to amplify potential viral sequences. Genomic sequence data of the alpaca isolate was obtained and compared with sequences of known viruses. The new alpaca virus sequence was most similar to recently designated Enterovirus species F, previously bovine enterovirus (BEVs), viruses that are globally prevalent in cattle, although they appear not to cause significant disease. Because bovine enteroviruses have not been previously reported in U.S. alpaca, we suspect that this type of infection is fairly rare, and in this case appeared not to spread beyond the original outbreak. The capsid sequence of the detected virus had greatest homology to Enterovirus F type 1 (indicating that the virus should be considered a member of serotype 1), but the virus had greater homology in 2A protease sequence to type 3, suggesting that it may have been a recombinant. Identifying pathogens that infect a new host species for the first time can be challenging. As the disease in a new host species may be quite different from that in the original or natural host, the pathogen may not be suspected based on the clinical presentation, delaying diagnosis. Although this virus replicated in MDBK cells, existing standard culture and molecular methods could not identify it. In this case, a highly sensitive generic PCR-based pathogen-detection method was used to identify this pathogen.
Discovery of a Bovine Enterovirus in Alpaca

PubMed Central

McClenahan, Shasta D.; Scherba, Gail; Borst, Luke; Fredrickson, Richard L.; Krause, Philip R.; Uhlenhaut, Christine

2013-01-01

A cytopathic virus was isolated using Madin-Darby bovine kidney (MDBK) cells from lung tissue of alpaca that died of a severe respiratory infection. To identify the virus, the infected cell culture supernatant was enriched for virus particles and a generic, PCR-based method was used to amplify potential viral sequences. Genomic sequence data of the alpaca isolate was obtained and compared with sequences of known viruses. The new alpaca virus sequence was most similar to recently designated Enterovirus species F, previously bovine enterovirus (BEVs), viruses that are globally prevalent in cattle, although they appear not to cause significant disease. Because bovine enteroviruses have not been previously reported in U.S. alpaca, we suspect that this type of infection is fairly rare, and in this case appeared not to spread beyond the original outbreak. The capsid sequence of the detected virus had greatest homology to Enterovirus F type 1 (indicating that the virus should be considered a member of serotype 1), but the virus had greater homology in 2A protease sequence to type 3, suggesting that it may have been a recombinant. Identifying pathogens that infect a new host species for the first time can be challenging. As the disease in a new host species may be quite different from that in the original or natural host, the pathogen may not be suspected based on the clinical presentation, delaying diagnosis. Although this virus replicated in MDBK cells, existing standard culture and molecular methods could not identify it. In this case, a highly sensitive generic PCR-based pathogen-detection method was used to identify this pathogen. PMID:23950875
A novel wavelet sequence based on deep bidirectional LSTM network model for ECG signal classification.

PubMed

Yildirim, Özal

2018-05-01

Long-short term memory networks (LSTMs), which have recently emerged in sequential data analysis, are the most widely used type of recurrent neural networks (RNNs) architecture. Progress on the topic of deep learning includes successful adaptations of deep versions of these architectures. In this study, a new model for deep bidirectional LSTM network-based wavelet sequences called DBLSTM-WS was proposed for classifying electrocardiogram (ECG) signals. For this purpose, a new wavelet-based layer is implemented to generate ECG signal sequences. The ECG signals were decomposed into frequency sub-bands at different scales in this layer. These sub-bands are used as sequences for the input of LSTM networks. New network models that include unidirectional (ULSTM) and bidirectional (BLSTM) structures are designed for performance comparisons. Experimental studies have been performed for five different types of heartbeats obtained from the MIT-BIH arrhythmia database. These five types are Normal Sinus Rhythm (NSR), Ventricular Premature Contraction (VPC), Paced Beat (PB), Left Bundle Branch Block (LBBB), and Right Bundle Branch Block (RBBB). The results show that the DBLSTM-WS model gives a high recognition performance of 99.39%. It has been observed that the wavelet-based layer proposed in the study significantly improves the recognition performance of conventional networks. This proposed network structure is an important approach that can be applied to similar signal processing problems. Copyright © 2018 Elsevier Ltd. All rights reserved.
Genotype-specific signal generation based on digestion of 3-way DNA junctions: application to KRAS variation detection.

PubMed

Amicarelli, Giulia; Adlerstein, Daniel; Shehi, Erlet; Wang, Fengfei; Makrigiorgos, G Mike

2006-10-01

Genotyping methods that reveal single-nucleotide differences are useful for a wide range of applications. We used digestion of 3-way DNA junctions in a novel technology, OneCutEventAmplificatioN (OCEAN) that allows sequence-specific signal generation and amplification. We combined OCEAN with peptide-nucleic-acid (PNA)-based variant enrichment to detect and simultaneously genotype v-Ki-ras2 Kirsten rat sarcoma viral oncogene homolog (KRAS) codon 12 sequence variants in human tissue specimens. We analyzed KRAS codon 12 sequence variants in 106 lung cancer surgical specimens. We conducted a PNA-PCR reaction that suppresses wild-type KRAS amplification and genotyped the product with a set of OCEAN reactions carried out in fluorescence microplate format. The isothermal OCEAN assay enabled a 3-way DNA junction to form between the specific target nucleic acid, a fluorescently labeled "amplifier", and an "anchor". The amplifier-anchor contact contains the recognition site for a restriction enzyme. Digestion produces a cleaved amplifier and generation of a fluorescent signal. The cleaved amplifier dissociates from the 3-way DNA junction, allowing a new amplifier to bind and propagate the reaction. The system detected and genotyped KRAS sequence variants down to approximately 0.3% variant-to-wild-type alleles. PNA-PCR/OCEAN had a concordance rate with PNA-PCR/sequencing of 93% to 98%, depending on the exact implementation. Concordance rate with restriction endonuclease-mediated selective-PCR/sequencing was 89%. OCEAN is a practical and low-cost novel technology for sequence-specific signal generation. Reliable analysis of KRAS sequence alterations in human specimens circumvents the requirement for sequencing. Application is expected in genotyping KRAS codon 12 sequence variants in surgical specimens or in bodily fluids, as well as single-base variations and sequence alterations in other genes.
Free energy determinants of secondary structure formation: III. beta-turns and their role in protein folding.

PubMed

Yang, A S; Hitz, B; Honig, B

1996-06-21

The stability of beta-turns is calculated as a function of sequence and turn type with a Monte Carlo sampling technique. The conformational energy of four internal hydrogen-bonded turn types, I, I', II and II', is obtained by evaluating their gas phase energy with the CHARMM force field and accounting for solvation effects with the Finite Difference Poisson-Boltzmann (FDPB) method. All four turn types are found to be less stable than the coil state, independent of the sequence in the turn. The free-energy penalties associated with turn formation vary between 1.6 kcal/mol and 7.7 kcal/mol, depending on the sequence and turn type. Differences in turn stability arise mainly from intraresidue interactions within the two central residues of the turn. For each combination of the two central residues, except for -Gly-Gly-, the most stable beta-turn type is always found to occur most commonly in native proteins. The fact that a model based on local interactions accounts for the observed preference of specific sequences suggests that long-range tertiary interactions tend to play a secondary role in determining turn conformation. In contrast, for beta-hairpins, long-range interactions appear to dominate. Specifically, due to the right-handed twist of beta-strands, type I' turns for -Gly-Gly- are found to occur with high frequency, even when local energetics would dictate otherwise. The fact that any combination of two residues is found able to adopt a relatively low-energy turn structure explains why the amino acid sequence in turns is highly variable. The calculated free-energy cost of turn formation, when combined with related numbers obtained for alpha-helices and beta-sheets, suggests a model for the initiation of protein folding based on metastable fragments of secondary structure.
Identification of Fasciola species based on mitochondrial and nuclear DNA reveals the co-existence of intermediate Fasciola and Fasciola gigantica in Thailand.

PubMed

Wannasan, Anchalee; Khositharattanakool, Pathamet; Chaiwong, Prasong; Piangjai, Somsak; Uparanukraw, Pichart; Morakote, Nimit

2014-11-01

Molecular techniques were used to identify Fasciola species collected from Chiang Mai Thailand. Morphometrically, 65 stained and 45 fresh worms collected from cattle suggested the possible occurrence of both F. gigantica and F. hepatica. Twenty-two worms comprising 15 from cattle and 7 from human patients, were identified subsequently based on three genetic markers: mitochondrial nicotinamide adenine dinucleotide dehydrogenase subunit 1 (nad1), mitochondrial cytochrome c oxidase subunit 1 (cox1) and nuclear ribosomal internal transcribed spacer 2 (ITS2). All of them presented the F. gigantica type in maternally inherited mitochondrial sequences (nad1 and cox1), with six types in each sequence (FgNDI-CM1 to FgNDI-CM6 and FgCOI-CM1 to FgCOI-CM6, respectively). Remarkably, the predominant nad1 type, FgNDI-CM6, was identical to that of aspermic Fasciola sp. formerly reported from Thailand, Japan, Korea, China, Vietnam, and Myanmar. ITS2 sequences were analyzed successfully in 20 worms. Fifteen worms showed the F. gigantica type and five (including one worm from a patient) had mixed ITS2 sequences of both F. gigantica and F. hepatica in the same worms, with additional heterogeneity within both ITS2 types. This study revealed the intermediate form of Fasciola coexisting with F. gigantica for the first time in Thailand.
Reconsideration of Protocrea (Hypocreales, Hypocreaceae)

USDA-ARS?s Scientific Manuscript database

The genus Protocrea is re-defined, based on holotype and fresh specimens of its type species P. farinosa, using morphology of teleomorph and anamorph and phylogenetic analyses of rpb2 sequences. Data based on currently available specimens suggest the existence of six species. Apart from the type, P....
Sequencing artifacts in the type A influenza databases and attempts to correct them.

PubMed

Suarez, David L; Chester, Nikki; Hatfield, Jason

2014-07-01

There are over 276 000 influenza gene sequences in public databases, with the quality of the sequences determined by the contributor. As part of a high school class project, influenza sequences with possible errors were identified in the public databases based on the size of the gene being longer than expected, with the hypothesis that these sequences would have an error. Students contacted sequence submitters alerting them of the possible sequence issue(s) and requested they the suspect sequence(s) be correct as appropriate. Type A influenza viruses were screened, and gene segments longer than the accepted size were identified for further analysis. Attention was placed on sequences with additional nucleotides upstream or downstream of the highly conserved non-coding ends of the viral segments. A total of 1081 sequences were identified that met this criterion. Three types of errors were commonly observed: non-influenza primer sequence wasn't removed from the sequence; PCR product was cloned and plasmid sequence was included in the sequence; and Taq polymerase added an adenine at the end of the PCR product. Internal insertions of nucleotide sequence were also commonly observed, but in many cases it was unclear if the sequence was correct or actually contained an error. A total of 215 sequences, or 22.8% of the suspect sequences, were corrected in the public databases in the first year of the student project. Unfortunately 138 additional sequences with possible errors were added to the databases in the second year. Additional awareness of the need for data integrity of sequences submitted to public databases is needed to fully reap the benefits of these large data sets. © 2014 The Authors. Influenza and Other Respiratory Viruses Published by John Wiley & Sons Ltd.
Identification, Classification, and Phylogeny of the Pathogenic Species Exophiala jeanselmei and Related Species by Mitochondrial Cytochrome b Gene Analysis

PubMed Central

Wang, Li; Yokoyama, Koji; Miyaji, Makoto; Nishimura, Kazuko

2001-01-01

We analyzed a 402-bp sequence of the mitochondrial cytochrome b gene of 34 strains of Exophiala jeanselmei and 16 strains representing 12 related species. The strains of E. jeanselmei were classified into 20 DNA types and 17 amino acid types. The differences between these strains were found in 1 to 60 nucleotides and 1 to 17 amino acids. On the basis of the identities and similarities of nucleotide and amino acid sequences, some strains were reidentified: i.e., two strains of E. jeanselmei var. hetermorpha and one strain of E. castellanii as E. dermatitidis (including the type strain), three strains of E. jeanselmei as E. jeanselmei var. lecanii-corni (including the type strain), three strains of E. jeanselmei as E. bergeri (including the type strain), seven strains of E. jeanselmei as E. pisciphila (including the type strain), seven strains of E. jeanselmei as E. jeanselmei var. jeanselmei (including the type strain), one strain of E. jeanselmei as Fonsecaea pedrosoi (including the type strain), and one strain of E. jeanselmei as E. spinifera (including the type strain). Some E. jeanselmei strains showed distinct nucleotide and amino acid sequences. The amino-acid-based UPGMA (unweighted pair group method with the arithmetic mean) tree exhibited nearly the same topology as those of the DNA-based trees obtained by neighbor joining, maximum parsimony, and maximum likelihood methods. PMID:11724862
KpnBI is the prototype of a new family (IE) of bacterial type I restriction-modification system

PubMed Central

Chin, V.; Valinluck, V.; Magaki, S.; Ryu, J.

2004-01-01

KpnBI is a restriction-modification (R-M) system recognized in the GM236 strain of Klebsiella pneumoniae. Here, the KpnBI modification genes were cloned into a plasmid using a modification expression screening method. The modification genes that consist of both hsdM (2631 bp) and hsdS (1344 bp) genes were identified on an 8.2 kb EcoRI chromosomal fragment. These two genes overlap by one base and share the same promoter located upstream of the hsdM gene. Using recently developed plasmid R-M tests and a computer program RM Search, the DNA recognition sequence for the KpnBI enzymes was identified as a new 8 nt sequence containing one degenerate base with a 6 nt spacer, CAAANNNNNNRTCA. From Dam methylation and HindIII sensitivity tests, the methylation loci were predicted to be the italicized third adenine in the 5′ specific region and the adenine opposite the italicized thymine in the 3′ specific region. Combined with previous sequence data for hsdR, we concluded that the KpnBI system is a typical type I R-M system. The deduced amino acid sequences of the three subunits of the KpnBI system show only limited homologies (25 to 33% identity) at best, to the four previously categorized type I families (IA, IB, IC, and ID). Furthermore, their identity scores to other uncharacterized putative genome type I sequences were 53% at maximum. Therefore, we propose that KpnBI is the prototype of a new ‘type IE’ family. PMID:15475385
Mitochondrial sequence divergence among Antarctic killer whale ecotypes is consistent with multiple species.

PubMed

LeDuc, Richard G; Robertson, Kelly M; Pitman, Robert L

2008-08-23

Recently, three visually distinct forms of killer whales (Orcinus orca) were described from Antarctic waters and designated as types A, B and C. Based on consistent differences in prey selection and habitat preferences, morphological divergence and apparent lack of interbreeding among these broadly sympatric forms, it was suggested that they may represent separate species. To evaluate this hypothesis, we compared complete sequences of the mitochondrial control region from 81 Antarctic killer whale samples, including 9 type A, 18 type B, 47 type C and 7 type-undetermined individuals. We found three fixed differences that separated type A from B and C, and a single fixed difference that separated type C from A and B. These results are consistent with reproductive isolation among the different forms, although caution is needed in drawing further conclusions. Despite dramatic differences in morphology and ecology, the relatively low levels of sequence divergence in Antarctic killer whales indicate that these evolutionary changes occurred relatively rapidly and recently.
A Pedagogical Theory and Practice for College Writing Courses and Writing across the Curriculum Courses: A Social Constructionist Perspective on Learning through Argument.

ERIC Educational Resources Information Center

Soffree-Cady, Flore

To provide a writing pedagogy grounded in theory, a teaching method was developed which sequenced certain types of assignments. The classification of types and the organizational structure of the sequences were based on a teaching model that draws upon theories from various disciplines. Although the teaching activities are not new in themselves,…
A Bioinformatics-Based Alternative mRNA Splicing Code that May Explain Some Disease Mutations Is Conserved in Animals.

PubMed

Qu, Wen; Cingolani, Pablo; Zeeberg, Barry R; Ruden, Douglas M

2017-01-01

Deep sequencing of cDNAs made from spliced mRNAs indicates that most coding genes in many animals and plants have pre-mRNA transcripts that are alternatively spliced. In pre-mRNAs, in addition to invariant exons that are present in almost all mature mRNA products, there are at least 6 additional types of exons, such as exons from alternative promoters or with alternative polyA sites, mutually exclusive exons, skipped exons, or exons with alternative 5' or 3' splice sites. Our bioinformatics-based hypothesis is that, in analogy to the genetic code, there is an "alternative-splicing code" in introns and flanking exon sequences, analogous to the genetic code, that directs alternative splicing of many of the 36 types of introns. In humans, we identified 42 different consensus sequences that are each present in at least 100 human introns. 37 of the 42 top consensus sequences are significantly enriched or depleted in at least one of the 36 types of introns. We further supported our hypothesis by showing that 96 out of 96 analyzed human disease mutations that affect RNA splicing, and change alternative splicing from one class to another, can be partially explained by a mutation altering a consensus sequence from one type of intron to that of another type of intron. Some of the alternative splicing consensus sequences, and presumably their small-RNA or protein targets, are evolutionarily conserved from 50 plant to animal species. We also noticed the set of introns within a gene usually share the same splicing codes, thus arguing that one sub-type of splicesosome might process all (or most) of the introns in a given gene. Our work sheds new light on a possible mechanism for generating the tremendous diversity in protein structure by alternative splicing of pre-mRNAs.
Identification of Trypanosoma cruzi Discrete Typing Units (DTUs) in Latin-American migrants in Barcelona (Spain).

PubMed

Abras, Alba; Gállego, Montserrat; Muñoz, Carmen; Juiz, Natalia A; Ramírez, Juan Carlos; Cura, Carolina I; Tebar, Silvia; Fernández-Arévalo, Anna; Pinazo, María-Jesús; de la Torre, Leonardo; Posada, Elizabeth; Navarro, Ferran; Espinal, Paula; Ballart, Cristina; Portús, Montserrat; Gascón, Joaquim; Schijman, Alejandro G

2017-04-01

Trypanosoma cruzi, the causative agent of Chagas disease, is divided into six Discrete Typing Units (DTUs): TcI-TcVI. We aimed to identify T. cruzi DTUs in Latin-American migrants in the Barcelona area (Spain) and to assess different molecular typing approaches for the characterization of T. cruzi genotypes. Seventy-five peripheral blood samples were analyzed by two real-time PCR methods (qPCR) based on satellite DNA (SatDNA) and kinetoplastid DNA (kDNA). The 20 samples testing positive in both methods, all belonging to Bolivian individuals, were submitted to DTU characterization using two PCR-based flowcharts: multiplex qPCR using TaqMan probes (MTq-PCR), and conventional PCR. These samples were also studied by sequencing the SatDNA and classified as type I (TcI/III), type II (TcII/IV) and type I/II hybrid (TcV/VI). Ten out of the 20 samples gave positive results in the flowcharts: TcV (5 samples), TcII/V/VI (3) and mixed infections by TcV plus TcII (1) and TcV plus TcII/VI (1). By SatDNA sequencing, we classified the 20 samples, 19 as type I/II and one as type I. The most frequent DTU identified by both flowcharts, and suggested by SatDNA sequencing in the remaining samples with low parasitic loads, TcV, is common in Bolivia and predominant in peripheral blood. The mixed infection by TcV-TcII was detected for the first time simultaneously in Bolivian migrants. PCR-based flowcharts are very useful to characterize DTUs during acute infection. SatDNA sequence analysis cannot discriminate T. cruzi populations at the level of a single DTU but it enabled us to increase the number of characterized cases in chronically infected patients. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Several Families of Sequences with Low Correlation and Large Linear Span

NASA Astrophysics Data System (ADS)

Zeng, Fanxin; Zhang, Zhenyu

In DS-CDMA systems and DS-UWB radios, low correlation of spreading sequences can greatly help to minimize multiple access interference (MAI) and large linear span of spreading sequences can reduce their predictability. In this letter, new sequence sets with low correlation and large linear span are proposed. Based on the construction Trm1[Trnm(αbt+γiαdt)]r for generating p-ary sequences of period pn-1, where n=2m, d=upm±v, b=u±v, γi∈GF(pn), and p is an arbitrary prime number, several methods to choose the parameter d are provided. The obtained sequences with family size pn are of four-valued, five-valued, six-valued or seven-valued correlation and the maximum nontrivial correlation value is (u+v-1)pm-1. The simulation by a computer shows that the linear span of the new sequences is larger than that of the sequences with Niho-type and Welch-type decimations, and similar to that of [10].
On the phylogenetic placement of human T cell leukemia virus type 1 sequences associated with an Andean mummy.

PubMed

Coulthart, Michael B; Posada, David; Crandall, Keith A; Dekaban, Gregory A

2006-03-01

Recently, the putative finding of ancient human T cell leukemia virus type 1 (HTLV-1) long terminal repeat (LTR) DNA sequences in association with a 1500-year-old Chilean mummy has stirred vigorous debate. The debate is based partly on the inherent uncertainties associated with phylogenetic reconstruction when only short sequences of closely related genotypes are available. However, a full analysis of what phylogenetic information is present in the mummy data has not previously been published, leaving open the question of what precisely is the range of admissible interpretation. To fulfill this need, we re-analyzed the mummy data in a new way. We first performed phylogenetic analysis of 188 published LTR DNA sequences from extant strains belonging to the HTLV-1 Cosmopolitan clade, using the method of statistical parsimony which is designed both to optimize phylogenetic resolution among sequences with little evolutionary divergence, and to permit precise mapping of individual sequence mutations onto branches of a divergence network. We then deduced possible phylogenetic positions for the two main categories of published Chilean mummy sequences, based on their published 157-nucleotide LTR sequences. The possible phylogenetic placements for one of the mummy sequence categories are consistent with a modern origin. However, one of these placements for the other mummy sequence category falls very close to the root of the Cosmopolitan clade, consistent with an ancient origin for both this mummy sequence and the Cosmopolitan clade.
Core Genome Multilocus Sequence Typing Scheme for High-Resolution Typing of Enterococcus faecium

PubMed Central

de Been, Mark; Pinholt, Mette; Top, Janetta; Bletz, Stefan; van Schaik, Willem; Brouwer, Ellen; Rogers, Malbert; Kraat, Yvette; Bonten, Marc; Corander, Jukka; Westh, Henrik; Harmsen, Dag

2015-01-01

Enterococcus faecium, a common inhabitant of the human gut, has emerged in the last 2 decades as an important multidrug-resistant nosocomial pathogen. Since the start of the 21st century, multilocus sequence typing (MLST) has been used to study the molecular epidemiology of E. faecium. However, due to the use of a small number of genes, the resolution of MLST is limited. Whole-genome sequencing (WGS) now allows for high-resolution tracing of outbreaks, but current WGS-based approaches lack standardization, rendering them less suitable for interlaboratory prospective surveillance. To overcome this limitation, we developed a core genome MLST (cgMLST) scheme for E. faecium. cgMLST transfers genome-wide single nucleotide polymorphism (SNP) diversity into a standardized and portable allele numbering system that is far less computationally intensive than SNP-based analysis of WGS data. The E. faecium cgMLST scheme was built using 40 genome sequences that represented the diversity of the species. The scheme consists of 1,423 cgMLST target genes. To test the performance of the scheme, we performed WGS analysis of 103 outbreak isolates from five different hospitals in the Netherlands, Denmark, and Germany. The cgMLST scheme performed well in distinguishing between epidemiologically related and unrelated isolates, even between those that had the same sequence type (ST), which denotes the higher discriminatory power of this cgMLST scheme over that of conventional MLST. We also show that in terms of resolution, the performance of the E. faecium cgMLST scheme is equivalent to that of an SNP-based approach. In conclusion, the cgMLST scheme developed in this study facilitates rapid, standardized, and high-resolution tracing of E. faecium outbreaks. PMID:26400782
Core Genome Multilocus Sequence Typing Scheme for High- Resolution Typing of Enterococcus faecium.

PubMed

de Been, Mark; Pinholt, Mette; Top, Janetta; Bletz, Stefan; Mellmann, Alexander; van Schaik, Willem; Brouwer, Ellen; Rogers, Malbert; Kraat, Yvette; Bonten, Marc; Corander, Jukka; Westh, Henrik; Harmsen, Dag; Willems, Rob J L

2015-12-01

Enterococcus faecium, a common inhabitant of the human gut, has emerged in the last 2 decades as an important multidrug-resistant nosocomial pathogen. Since the start of the 21st century, multilocus sequence typing (MLST) has been used to study the molecular epidemiology of E. faecium. However, due to the use of a small number of genes, the resolution of MLST is limited. Whole-genome sequencing (WGS) now allows for high-resolution tracing of outbreaks, but current WGS-based approaches lack standardization, rendering them less suitable for interlaboratory prospective surveillance. To overcome this limitation, we developed a core genome MLST (cgMLST) scheme for E. faecium. cgMLST transfers genome-wide single nucleotide polymorphism(SNP) diversity into a standardized and portable allele numbering system that is far less computationally intensive than SNP-based analysis of WGS data. The E. faecium cgMLST scheme was built using 40 genome sequences that represented the diversity of the species. The scheme consists of 1,423 cgMLST target genes. To test the performance of the scheme, we performed WGS analysis of 103 outbreak isolates from five different hospitals in the Netherlands, Denmark, and Germany. The cgMLST scheme performed well in distinguishing between epidemiologically related and unrelated isolates, even between those that had the same sequence type (ST), which denotes the higher discriminatory power of this cgMLST scheme over that of conventional MLST. We also show that in terms of resolution, the performance of the E. faecium cgMLST scheme is equivalent to that of an SNP-based approach. In conclusion, the cgMLST scheme developed in this study facilitates rapid, standardized, and high-resolution tracing of E. faecium outbreaks.
Microgravity

NASA Image and Video Library

1998-12-01

Type II restriction enzymes, such as Eco R1 endonulease, present a unique advantage for the study of sequence-specific recognition because they leave a record of where they have been in the form of the cleaved ends of the DNA sites where they were bound. The differential behavior of a sequence -specific protein at sites of differing base sequence is the essence of the sequence-specificity; the core question is how do these proteins discriminate between different DNA sequences especially when the two sequences are very similar. Principal Investigator: Dan Carter/New Century Pharmaceuticals
Protein Crystal Eco R1 Endonulease-DNA Complex

NASA Technical Reports Server (NTRS)

1998-01-01

Type II restriction enzymes, such as Eco R1 endonulease, present a unique advantage for the study of sequence-specific recognition because they leave a record of where they have been in the form of the cleaved ends of the DNA sites where they were bound. The differential behavior of a sequence -specific protein at sites of differing base sequence is the essence of the sequence-specificity; the core question is how do these proteins discriminate between different DNA sequences especially when the two sequences are very similar. Principal Investigator: Dan Carter/New Century Pharmaceuticals

Statistical theory for protein combinatorial libraries. Packing interactions, backbone flexibility, and the sequence variability of a main-chain structure.

PubMed

Kono, H; Saven, J G

2001-02-23

Combinatorial experiments provide new ways to probe the determinants of protein folding and to identify novel folding amino acid sequences. These types of experiments, however, are complicated both by enormous conformational complexity and by large numbers of possible sequences. Therefore, a quantitative computational theory would be helpful in designing and interpreting these types of experiment. Here, we present and apply a statistically based, computational approach for identifying the properties of sequences compatible with a given main-chain structure. Protein side-chain conformations are included in an atom-based fashion. Calculations are performed for a variety of similar backbone structures to identify sequence properties that are robust with respect to minor changes in main-chain structure. Rather than specific sequences, the method yields the likelihood of each of the amino acids at preselected positions in a given protein structure. The theory may be used to quantify the characteristics of sequence space for a chosen structure without explicitly tabulating sequences. To account for hydrophobic effects, we introduce an environmental energy that it is consistent with other simple hydrophobicity scales and show that it is effective for side-chain modeling. We apply the method to calculate the identity probabilities of selected positions of the immunoglobulin light chain-binding domain of protein L, for which many variant folding sequences are available. The calculations compare favorably with the experimentally observed identity probabilities.
Mapping Base Modifications in DNA by Transverse-Current Sequencing

NASA Astrophysics Data System (ADS)

Alvarez, Jose R.; Skachkov, Dmitry; Massey, Steven E.; Kalitsov, Alan; Velev, Julian P.

2018-02-01

Sequencing DNA modifications and lesions, such as methylation of cytosine and oxidation of guanine, is even more important and challenging than sequencing the genome itself. The traditional methods for detecting DNA modifications are either insensitive to these modifications or require additional processing steps to identify a particular type of modification. Transverse-current sequencing in nanopores can potentially identify the canonical bases and base modifications in the same run. In this work, we demonstrate that the most common DNA epigenetic modifications and lesions can be detected with any predefined accuracy based on their tunneling current signature. Our results are based on simulations of the nanopore tunneling current through DNA molecules, calculated using nonequilibrium electron-transport methodology within an effective multiorbital model derived from first-principles calculations, followed by a base-calling algorithm accounting for neighbor current-current correlations. This methodology can be integrated with existing experimental techniques to improve base-calling fidelity.
Single Assay for Simultaneous Detection and Differential Identification of Human and Avian Influenza Virus Types, Subtypes, and Emergent Variants

PubMed Central

Metzgar, David; Myers, Christopher A.; Russell, Kevin L.; Faix, Dennis; Blair, Patrick J.; Brown, Jason; Vo, Scott; Swayne, David E.; Thomas, Colleen; Stenger, David A.; Lin, Baochuan; Malanoski, Anthony P.; Wang, Zheng; Blaney, Kate M.; Long, Nina C.; Schnur, Joel M.; Saad, Magdi D.; Borsuk, Lisa A.; Lichanska, Agnieszka M.; Lorence, Matthew C.; Weslowski, Brian; Schafer, Klaus O.; Tibbetts, Clark

2010-01-01

For more than four decades the cause of most type A influenza virus infections of humans has been attributed to only two viral subtypes, A/H1N1 or A/H3N2. In contrast, avian and other vertebrate species are a reservoir of type A influenza virus genome diversity, hosting strains representing at least 120 of 144 combinations of 16 viral hemagglutinin and 9 viral neuraminidase subtypes. Viral genome segment reassortments and mutations emerging within this reservoir may spawn new influenza virus strains as imminent epidemic or pandemic threats to human health and poultry production. Traditional methods to detect and differentiate influenza virus subtypes are either time-consuming and labor-intensive (culture-based) or remarkably insensitive (antibody-based). Molecular diagnostic assays based upon reverse transcriptase-polymerase chain reaction (RT-PCR) have short assay cycle time, and high analytical sensitivity and specificity. However, none of these diagnostic tests determine viral gene nucleotide sequences to distinguish strains and variants of a detected pathogen from one specimen to the next. Decision-quality, strain- and variant-specific pathogen gene sequence information may be critical for public health, infection control, surveillance, epidemiology, or medical/veterinary treatment planning. The Resequencing Pathogen Microarray (RPM-Flu) is a robust, highly multiplexed and target gene sequencing-based alternative to both traditional culture- or biomarker-based diagnostic tests. RPM-Flu is a single, simultaneous differential diagnostic assay for all subtype combinations of type A influenza viruses and for 30 other viral and bacterial pathogens that may cause influenza-like illness. These other pathogen targets of RPM-Flu may co-infect and compound the morbidity and/or mortality of patients with influenza. The informative specificity of a single RPM-Flu test represents specimen-specific viral gene sequences as determinants of virus type, A/HN subtype, virulence, host-range, and resistance to antiviral agents. PMID:20140251
Single assay for simultaneous detection and differential identification of human and avian influenza virus types, subtypes, and emergent variants.

PubMed

Metzgar, David; Myers, Christopher A; Russell, Kevin L; Faix, Dennis; Blair, Patrick J; Brown, Jason; Vo, Scott; Swayne, David E; Thomas, Colleen; Stenger, David A; Lin, Baochuan; Malanoski, Anthony P; Wang, Zheng; Blaney, Kate M; Long, Nina C; Schnur, Joel M; Saad, Magdi D; Borsuk, Lisa A; Lichanska, Agnieszka M; Lorence, Matthew C; Weslowski, Brian; Schafer, Klaus O; Tibbetts, Clark

2010-02-03

For more than four decades the cause of most type A influenza virus infections of humans has been attributed to only two viral subtypes, A/H1N1 or A/H3N2. In contrast, avian and other vertebrate species are a reservoir of type A influenza virus genome diversity, hosting strains representing at least 120 of 144 combinations of 16 viral hemagglutinin and 9 viral neuraminidase subtypes. Viral genome segment reassortments and mutations emerging within this reservoir may spawn new influenza virus strains as imminent epidemic or pandemic threats to human health and poultry production. Traditional methods to detect and differentiate influenza virus subtypes are either time-consuming and labor-intensive (culture-based) or remarkably insensitive (antibody-based). Molecular diagnostic assays based upon reverse transcriptase-polymerase chain reaction (RT-PCR) have short assay cycle time, and high analytical sensitivity and specificity. However, none of these diagnostic tests determine viral gene nucleotide sequences to distinguish strains and variants of a detected pathogen from one specimen to the next. Decision-quality, strain- and variant-specific pathogen gene sequence information may be critical for public health, infection control, surveillance, epidemiology, or medical/veterinary treatment planning. The Resequencing Pathogen Microarray (RPM-Flu) is a robust, highly multiplexed and target gene sequencing-based alternative to both traditional culture- or biomarker-based diagnostic tests. RPM-Flu is a single, simultaneous differential diagnostic assay for all subtype combinations of type A influenza viruses and for 30 other viral and bacterial pathogens that may cause influenza-like illness. These other pathogen targets of RPM-Flu may co-infect and compound the morbidity and/or mortality of patients with influenza. The informative specificity of a single RPM-Flu test represents specimen-specific viral gene sequences as determinants of virus type, A/HN subtype, virulence, host-range, and resistance to antiviral agents.
Detection of cystic fibrosis transmembrane conductance regulator ΔF508 gene mutation using a paper-based nucleic acid hybridization assay and a smartphone camera.

PubMed

Malhotra, Karan; Noor, M Omair; Krull, Ulrich J

2018-05-29

Diagnostic technology that makes use of paper platforms in conjunction with the ubiquitous availability of digital cameras in cellular telephones and personal assistive devices offers opportunities for development of bioassays that are cost effective and widely distributed. Assays that operate effectively in aqueous solution require further development for implementation in paper substrates, overcoming issues associated with surface interactions on a matrix that offers a large surface-to-volume ratio and constraints on convective mixing. This report presents and compares two related methods for determination of oligonucleotides that serve as indicators of cystic fibrosis, differentiating between the normal wild-type sequence, and a mutant-type sequence that has a 3-base replacement. The transduction strategy operates by selective hybridization of oligonucleotide probes that are conjugated to fluorescent quantum dots, where hybridization of target sequences causes a molecular fluorophore to approach the quantum dot and become emissive through fluorescence resonance energy transfer. Detection can rely on hybridization of a target that is labelled with Cy3 fluorophore, or in the presence of an unlabelled target when a sandwich assay format is implemented with a labelled reporter oligonucleotide. Selectivity to determine the presence of mismatched sequences involves appropriate selection of nucleotide sequences to set melt temperatures, in conjunction with control of stringency conditions using formamide as a chaotrope. It was determined that both direct and sandwich assays on paper substrates are able to distinguish between wild-type and mutant-type samples.
Retrosynthetic Reaction Prediction Using Neural Sequence-to-Sequence Models

PubMed Central

2017-01-01

We describe a fully data driven model that learns to perform a retrosynthetic reaction prediction task, which is treated as a sequence-to-sequence mapping problem. The end-to-end trained model has an encoder–decoder architecture that consists of two recurrent neural networks, which has previously shown great success in solving other sequence-to-sequence prediction tasks such as machine translation. The model is trained on 50,000 experimental reaction examples from the United States patent literature, which span 10 broad reaction types that are commonly used by medicinal chemists. We find that our model performs comparably with a rule-based expert system baseline model, and also overcomes certain limitations associated with rule-based expert systems and with any machine learning approach that contains a rule-based expert system component. Our model provides an important first step toward solving the challenging problem of computational retrosynthetic analysis. PMID:29104927
A novel multi-locus sequence typing (MLST) protocol for Leuconostoc lactis isolates from traditional dairy products in China and Mongolia.

PubMed

Dan, Tong; Liu, Wenjun; Sun, Zhihong; Lv, Qiang; Xu, Haiyan; Song, Yuqin; Zhang, Heping

2014-06-09

Economically, Leuconostoc lactis is one of the most important species in the genus Leuconostoc. It plays an important role in the food industry including the production of dextrans and bacteriocins. Currently, traditional molecular typing approaches for characterisation of this species at the isolate level are either unavailable or are not sufficiently reliable for practical use. Multilocus sequence typing (MLST) is a robust and reliable method for characterising bacterial and fungal species at the molecular level. In this study, a novel MLST protocol was developed for 50 L. lactis isolates from Mongolia and China. Sequences from eight targeted genes (groEL, carB, recA, pheS, murC, pyrG, rpoB and uvrC) were obtained. Sequence analysis indicated 20 different sequence types (STs), with 13 of them being represented by a single isolate. Phylogenetic analysis based on the sequences of eight MLST loci indicated that the isolates belonged to two major groups, A (34 isolates) and B (16 isolates). Linkage disequilibrium analyses indicated that recombination occurred at a low frequency in L. lactis, indicating a clonal population structure. Split-decomposition analysis indicated that intraspecies recombination played a role in generating genotypic diversity amongst isolates. Our results indicated that MLST is a valuable tool for typing L. lactis isolates that can be used for further monitoring of evolutionary changes and population genetics.
Composition for nucleic acid sequencing

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2008-08-26

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-06-06

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-05-30

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Evaluation of the Bacterial Diversity in the Human Tongue Coating Based on Genus-Specific Primers for 16S rRNA Sequencing.

PubMed

Sun, Beili; Zhou, Dongrui; Tu, Jing; Lu, Zuhong

2017-01-01

The characteristics of tongue coating are very important symbols for disease diagnosis in traditional Chinese medicine (TCM) theory. As a habitat of oral microbiota, bacteria on the tongue dorsum have been proved to be the cause of many oral diseases. The high-throughput next-generation sequencing (NGS) platforms have been widely applied in the analysis of bacterial 16S rRNA gene. We developed a methodology based on genus-specific multiprimer amplification and ligation-based sequencing for microbiota analysis. In order to validate the efficiency of the approach, we thoroughly analyzed six tongue coating samples from lung cancer patients with different TCM types, and more than 600 genera of bacteria were detected by this platform. The results showed that ligation-based parallel sequencing combined with enzyme digestion and multiamplification could expand the effective length of sequencing reads and could be applied in the microbiota analysis.
A seminested PCR assay for detection and typing of human papillomavirus based on E1 gene sequences.

PubMed

Cavalcante, Gustavo Henrique O; de Araújo, Josélio M G; Fernandes, José Veríssimo; Lanza, Daniel C F

2018-05-01

HPV infection is considered one of the leading causes of cervical cancer in the world. To date, more than 180 types of HPV have been described and viral typing is critical for defining the prognosis of cancer. In this work, a seminested PCR which allow fast and inexpensively detection and typing of HPV is presented. The system is based on the amplification of a variable length region within the viral gene E1, using three primers that potentially anneal in all HPV genomes. The amplicons produced in the first step can be identified by high resolution electrophoresis or direct sequencing. The seminested step includes nine specific primers which can be used in multiplex or individual reactions to discriminate the main types of HPV by amplicon size differentiation using agarose electrophoresis, reducing the time spent and cost per analysis. Copyright © 2017 Elsevier Inc. All rights reserved.
Kernel based machine learning algorithm for the efficient prediction of type III polyketide synthase family of proteins.

PubMed

Mallika, V; Sivakumar, K C; Jaichand, S; Soniya, E V

2010-07-13

Type III Polyketide synthases (PKS) are family of proteins considered to have significant roles in the biosynthesis of various polyketides in plants, fungi and bacteria. As these proteins shows positive effects to human health, more researches are going on regarding this particular protein. Developing a tool to identify the probability of sequence being a type III polyketide synthase will minimize the time consumption and manpower efforts. In this approach, we have designed and implemented PKSIIIpred, a high performance prediction server for type III PKS where the classifier is Support Vector Machines (SVMs). Based on the limited training dataset, the tool efficiently predicts the type III PKS superfamily of proteins with high sensitivity and specificity. The PKSIIIpred is available at http://type3pks.in/prediction/. We expect that this tool may serve as a useful resource for type III PKS researchers. Currently work is being progressed for further betterment of prediction accuracy by including more sequence features in the training dataset.
Redesigning the type II' β-turn in green fluorescent protein to type I': implications for folding kinetics and stability.

PubMed

Madan, Bharat; Sokalingam, Sriram; Raghunathan, Govindan; Lee, Sun-Gu

2014-10-01

Both Type I' and Type II' β-turns have the same sense of the β-turn twist that is compatible with the β-sheet twist. They occur predominantly in two residue β-hairpins, but the occurrence of Type I' β-turns is two times higher than Type II' β-turns. This suggests that Type I' β-turns may be more stable than Type II' β-turns, and Type I' β-turn sequence and structure can be more favorable for protein folding than Type II' β-turns. Here, we redesigned the native Type II' β-turn in GFP to Type I' β-turn, and investigated its effect on protein folding and stability. The Type I' β-turns were designed based on the statistical analysis of residues in natural Type I' β-turns. The substitution of the native "GD" sequence of i+1 and i+2 residues with Type I' preferred "(N/D)G" sequence motif increased the folding rate by 50% and slightly improved the thermodynamic stability. Despite the enhancement of in vitro refolding kinetics and stability of the redesigned mutants, they showed poor soluble expression level compared to wild type. To overcome this problem, i and i + 3 residues of the designed Type I' β-turn were further engineered. The mutation of Thr to Lys at i + 3 could restore the in vivo soluble expression of the Type I' mutant. This study indicates that Type II' β-turns in natural β-hairpins can be further optimized by converting the sequence to Type I'. © 2014 Wiley Periodicals, Inc.
Nucleotide sequence and proposed secondary structure of Columnea latent viroid: a natural mosaic of viroid sequences.

PubMed Central

Hammond, R; Smith, D R; Diener, T O

1989-01-01

The Columnea latent viroid (CLV) occurs latently in certain Columnea erythrophae plants grown commercially. In potato and tomato, CLV causes potato spindle tuber viroid (PSTV)-like symptoms. Its nucleotide sequence and proposed secondary structure reveal that CLV consists of a single-stranded circular RNA of 370 nucleotides which can assume a rod-like structure with extensive base-pairing characteristic of all known viroids. The electrophoretic mobility of circular CLV under nondenaturing conditions suggests a potential tertiary structure. CLV contains extensive sequence homologies to the PSTV group of viroids but contains a central conserved region identical to that of hop stunt viroid (HSV). CLV also shares some biological properties with each of the two types of viroids. Most probably, CLV is the result of intracellular RNA recombination between an HSV-type and one or more PSTV-type viroids replicating in the same plant. Images PMID:2602114
Reclassification of Actinobacillus muris as Muribacter muris gen. nov., comb. nov.

PubMed

Nicklas, Werner; Bisgaard, Magne; Aalbæk, Bent; Kuhnert, Peter; Christensen, Henrik

2015-10-01

To reinvestigate the taxonomy of [Actinobacillus] muris, 474 strains, mainly from mice and rats, were characterized by phenotype and 130 strains selected for genotypic characterization by 16S rRNA and partial rpoB gene sequencing. The type strain was further investigated by whole-genome sequencing. Phylogenetic analysis of the DNA sequences showed one monophyletic group with intragroup similarities of 96.7 and 97.2 % for the 16S rRNA and rpoB genes, respectively. The highest 16S rRNA gene sequence similarity to a taxon with a validly published name outside the group was 95.9 %, to the type strain of [Pasteurella] pneumotropica. The closest related taxon based on rpoB sequence comparison was 'Haemophilus influenzae-murium', with 88.4 % similarity. A new genus and a new combination, Muribacter muris gen. nov., comb. nov., are proposed based on a distinct phylogenetic position based on 16S rRNA and rpoB gene sequence comparisons, with major divergence from the existing genera of the family Pasteurellaceae. The new genus has the characteristics of [A.] muris with the emendation that acid formation from ( - )-d-mannitol and hydrolysis of aesculin are variable, while the α-glucosidase test is positive. There is no requirement for exogenously supplied NAD (V factor) for the majority of strains investigated; however, one strain was found to require NAD. The major fatty acids of the type strain of Muribacter muris were C14 : 0, C14 : 0 3-OH/iso-C16 : 1 I, C16 : 1ω7c and C16 : 0, which is in line with most genera of the Pasteurellaceae. The type strain of Muribacter muris is CCUG 16938T ( = NCTC 12432T = ATCC 49577T).
Modeling and prediction of peptide drift times in ion mobility spectrometry using sequence-based and structure-based approaches.

PubMed

Zhang, Yiming; Jin, Quan; Wang, Shuting; Ren, Ren

2011-05-01

The mobile behavior of 1481 peptides in ion mobility spectrometry (IMS), which are generated by protease digestion of the Drosophila melanogaster proteome, is modeled and predicted based on two different types of characterization methods, i.e. sequence-based approach and structure-based approach. In this procedure, the sequence-based approach considers both the amino acid composition of a peptide and the local environment profile of each amino acid in the peptide; the structure-based approach is performed with the CODESSA protocol, which regards a peptide as a common organic compound and generates more than 200 statistically significant variables to characterize the whole structure profile of a peptide molecule. Subsequently, the nonlinear support vector machine (SVM) and Gaussian process (GP) as well as linear partial least squares (PLS) regression is employed to correlate the structural parameters of the characterizations with the IMS drift times of these peptides. The obtained quantitative structure-spectrum relationship (QSSR) models are evaluated rigorously and investigated systematically via both one-deep and two-deep cross-validations as well as the rigorous Monte Carlo cross-validation (MCCV). We also give a comprehensive comparison on the resulting statistics arising from the different combinations of variable types with modeling methods and find that the sequence-based approach can give the QSSR models with better fitting ability and predictive power but worse interpretability than the structure-based approach. In addition, though the QSSR modeling using sequence-based approach is not needed for the preparation of the minimization structures of peptides before the modeling, it would be considerably efficient as compared to that using structure-based approach. Copyright © 2011 Elsevier Ltd. All rights reserved.
GeneChip{sup {trademark}} screening assay for cystic fibrosis mutations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cronn, M.T.; Miyada, C.G.; Fucini, R.V.

1994-09-01

GeneChip{sup {trademark}} assays are based on high density, carefully designed arrays of short oligonucleotide probes (13-16 bases) built directly on derivatized silica substrates. DNA target sequence analysis is achieved by hybridizing fluorescently labeled amplification products to these arrays. Fluorescent hybridization signals located within the probe array are translated into target sequence information using the known probe sequence at each array feature. The mutation screening assay for cystic fibrosis includes sets of oligonucleotide probes designed to detect numerous different mutations that have been described in 14 exons and one intron of the CFTR gene. Each mutation site is addressed by amore » sub-array of at least 40 probe sequences, half designed to detect the wild type gene sequence and half designed to detect the reported mutant sequence. Hybridization with homozygous mutant, homozygous wild type or heterozygous targets results in distinctive hybridization patterns within a sub-array, permitting specific discrimination of each mutation. The GeneChip probe arrays are very small (approximately 1 cm{sup 2}). There miniature size coupled with their high information content make GeneChip probe arrays a useful and practical means for providing CF mutation analysis in a clinical setting.« less
Distribution of Monoclonal Antibody Subgroups and Sequence-Based Types among Legionella pneumophila Serogroup 1 Isolates Derived from Cooling Tower Water, Bathwater, and Soil in Japan

PubMed Central

Kikukawa, Kiyomi; Helbig, Jürgen H.; Kaneko, Satoko; Suzuki-Hashimoto, Atsuko; Furuhata, Katsunori; Chang, Bin; Murai, Miyo; Ichinose, Masayuki; Ohnishi, Makoto; Kura, Fumiaki

2012-01-01

Legionella pneumophila serogroup (SG) 1 is the most frequent cause of legionellosis. This study analyzed environmental isolates of L. pneumophila SG 1 in Japan using monoclonal antibody (MAb) typing and sequence-based typing (SBT). Samples were analyzed from bathwater (BW; n = 50), cooling tower water (CT; n = 50), and soil (SO; n = 35). The distribution of MAb types varied by source, with the most prevalent types being Bellingham (42%), Oxford (72%), and OLDA (51%) in BW, CT, and SO, respectively. The ratios of MAb 3/1 positive isolates were 26, 2, and 14% from BW, CT, and SO, respectively. The environmental isolates from BW, CT, and SO were divided into 34 sequence types (STs; index of discrimination [IOD] = 0.973), 8 STs (IOD = 0.448), and 11 STs (IOD = 0.879), respectively. Genetic variation among CT isolates was smaller than seen in BW and SO. ST1 accounted for 74% of the CT isolates. The only common STs between (i) BW and CT, (ii) BW and SO, and (iii) CT and SO were ST1, ST129, and ST48, respectively, suggesting that each environment constitutes an independent habitat. PMID:22492442
Identification of four novel HLA-B alleles, B*1590, B*1591, B*2726, and B*4705, from an East African population by high-resolution sequence-based typing.

PubMed

Luo, M; Mao, X; Plummer, F A

2005-02-01

We report here four novel HLA-B alleles, B*1590, B*1591, B*2726, and B*4705, identified from an East African population during sequence-based HLA-B typing. The novel alleles were confirmed by sequencing two separate polymerase chain reaction products, and by molecular cloning and sequencing multiple clones. B*1590 is identical to B*1510 at exon 2 and exon 3, except for a difference (GCCGTC) at codon 158. Sequence differences at codon 152 (GAGGTG) and codon 167 (TGGTCG) differentiate B*1591 from B*1503 at exon 3. B*2726 is identical to B*2708 at exon 2 and exon 3, except for a difference (AAGCAG) at codon 70. B*4705 was identified in three Kenyan women. The allele is identical to B*47010101/02 at exon 2 and exon 3, except for differences at codon 97 (AGGAAT) and codon 99 (TTTTAT). These new alleles have been named by the WHO Nomenclature Committee. Identification of these novel HLA-B alleles reflects the genetic diversity of this East African population.

Integrated massively parallel sequencing of 15 autosomal STRs and Amelogenin using a simplified library preparation approach.

PubMed

Xue, Jian; Wu, Riga; Pan, Yajiao; Wang, Shunxia; Qu, Baowang; Qin, Ying; Shi, Yuequn; Zhang, Chuchu; Li, Ran; Zhang, Liyan; Zhou, Cheng; Sun, Hongyu

2018-04-02

Massively parallel sequencing (MPS) technologies, also termed as next-generation sequencing (NGS), are becoming increasingly popular in study of short tandem repeats (STR). However, current library preparation methods are usually based on ligation or two-round PCR that requires more steps, making it time-consuming (about 2 days), laborious and expensive. In this study, a 16-plex STR typing system was designed with fusion primer strategy based on the Ion Torrent S5 XL platform which could effectively resolve the above challenges for forensic DNA database-type samples (bloodstains, saliva stains, etc.). The efficiency of this system was tested in 253 Han Chinese participants. The libraries were prepared without DNA isolation and adapter ligation, and the whole process only required approximately 5 h. The proportion of thoroughly genotyped samples in which all the 16 loci were successfully genotyped was 86% (220/256). Of the samples, 99.7% showed 100% concordance between NGS-based STR typing and capillary electrophoresis (CE)-based STR typing. The inconsistency might have been caused by off-ladder alleles and mutations in primer binding sites. Overall, this panel enabled the large-scale genotyping of the DNA samples with controlled quality and quantity because it is a simple, operation-friendly process flow that saves labor, time and costs. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Crash sequence based risk matrix for motorcycle crashes.

PubMed

Wu, Kun-Feng; Sasidharan, Lekshmi; Thor, Craig P; Chen, Sheng-Yin

2018-04-05

Considerable research has been conducted related to motorcycle and other powered-two-wheeler (PTW) crashes; however, it always has been controversial among practitioners concerning with types of crashes should be first targeted and how to prioritize resources for the implementation of mitigating actions. Therefore, there is a need to identify types of motorcycle crashes that constitute the greatest safety risk to riders - most frequent and most severe crashes. This pilot study seeks exhibit the efficacy of a new approach for prioritizing PTW crash causation sequences as they relate to injury severity to better inform the application of mitigating countermeasures. To accomplish this, the present study constructed a crash sequence-based risk matrix to identify most frequent and most severe motorcycle crashes in an attempt to better connect causes and countermeasures of PTW crashes. Although the frequency of each crash sequence can be computed from crash data, a crash severity model is needed to compare the levels of crash severity among different crash sequences, while controlling for other factors that also have effects on crash severity such drivers' age, use of helmet, etc. The construction of risk matrix based on crash sequences involve two tasks: formulation of crash sequence and the estimation of a mixed-effects (ME) model to adjust the levels of severities for each crash sequence to account for other crash contributing factors that would have an effect on the maximum level of crash severity in a crash. Three data elements from the National Automotive Sampling System - General Estimating System (NASS-GES) data were utilized to form a crash sequence: critical event, crash types, and sequence of events. A mixed-effects model was constructed to model the severity levels for each crash sequence while accounting for the effects of those crash contributing factors on crash severity. A total of 8039 crashes involving 8208 motorcycles occurred during 2011 and 2013 were included in this study, weighted to represent 338,655 motorcyclists involved in traffic crashes in three years (2011-2013)(NHTSA, 2013). The top five most frequent and severe types of crash sequences were identified, accounting for 23 percent of all the motorcycle crashes included in the study, and they are (1) run-off-road crashes on the right, and hitting roadside objects, (2) cross-median crashes, and rollover, (3) left-turn oncoming crashes, and head-on, (4) crossing over (passing through) or turning into opposite direction at intersections, and (5) side-impacted. In addition to crash sequences, several other factors were also identified to have effects on crash severity: use of helmet, presence of horizontal curves, alcohol consumption, road surface condition, roadway functional class, and nighttime condition. Copyright © 2018 Elsevier Ltd. All rights reserved.
Molecular characterization and epidemiology of cefoxitin resistance among Enterobacteriaceae lacking inducible chromosomal ampC genes from hospitalized and non-hospitalized patients in Algeria: description of new sequence type in Klebsiella pneumoniae isolates.

PubMed

Gharout-Sait, Alima; Touati, Abdelaziz; Guillard, Thomas; Brasme, Lucien; de Champs, Christophe

2015-01-01

In this study, 922 consecutive non-duplicate clinical isolates of Enterobacteriaceae obtained from hospitalized and non-hospitalized patients at Bejaia, Algeria were analyzed for AmpC-type β-lactamases production. The ampC genes and their genetic environment were characterized using polymerase chain reaction (PCR) and sequencing. Plasmid incompatibility groups were determined by using PCR-based replicon typing. Phylogenetic grouping and multilocus sequence typing were determined for molecular typing of the plasmid-mediated AmpC (pAmpC) isolates. Of the isolates, 15 (1.6%) were identified as AmpC producers including 14 CMY-4-producing isolates and one DHA-1-producing Klebsiella pneumoniae. All AmpC-producing isolates co-expressed the broad-spectrum TEM-1 β-lactamase and three of them co-produced CTX-M and/or SHV-12 ESBL. Phylogenetic grouping and virulence genotyping of the E. coli isolates revealed that most of them belonged to groups D and B1. Multilocus sequence typing analysis of K. pneumoniae isolates identified four different sequence types (STs) with two new sequences: ST1617 and ST1618. Plasmid replicon typing indicates that blaCMY-4 gene was located on broad host range A/C plasmid, while LVPK replicon was associated with blaDHA-1. All isolates carrying blaCMY-4 displayed the transposon-like structures ISEcp1/ΔISEcp1-blaCMY-blc-sugE. Our study showed that CMY-4 was the main pAmpC in the Enterobacteriaceae isolates in Algeria. Copyright © 2015 Elsevier Editora Ltda. All rights reserved.
Typing of Panton-Valentine Leukocidin-Encoding Phages and lukSF-PV Gene Sequence Variation in Staphylococcus aureus from China.

PubMed

Zhao, Huanqiang; Hu, Fupin; Jin, Shu; Xu, Xiaogang; Zou, Yuhan; Ding, Baixing; He, Chunyan; Gong, Fang; Liu, Qingzhong

2016-01-01

Panton-Valentine leukocidin (PVL, encoded by lukSF-PV genes), a bi-component and pore-forming toxin, is carried by different staphylococcal bacteriophages. The prevalence of PVL in Staphylococcus aureus has been reported around the globe. However, the data on PVL-encoding phage types, lukSF-PV gene variation and chromosomal phage insertion sites for PVL-positive S. aureus are limited, especially in China. In order to obtain a more complete understanding of the molecular epidemiology of PVL-positive S. aureus, an integrated and modified PCR-based scheme was applied to detect the PVL-encoding phage types. Phage insertion locus and the lukSF-PV variant were determined by PCR and sequencing. Meanwhile, the genetic background was characterized by staphylococcal cassette chromosome mec (SCCmec) typing, staphylococcal protein A (spa) gene polymorphisms typing, pulsed-field gel electrophoresis (PFGE) typing, accessory gene regulator (agr) locus typing and multilocus sequence typing (MLST). Seventy eight (78/1175, 6.6%) isolates possessed the lukSF-PV genes and 59.0% (46/78) of PVL-positive strains belonged to CC59 lineage. Eight known different PVL-encoding phage types were detected, and Φ7247PVL/ΦST5967PVL (n = 13) and ΦPVL (n = 12) were the most prevalent among them. While 25 (25/78, 32.1%) isolates, belonging to ST30, and ST59 clones, were unable to be typed by the modified PCR-based scheme. Single nucleotide polymorphisms (SNPs) were identified at five locations in the lukSF-PV genes, two of which were non-synonymous. Maximum-likelihood tree analysis of attachment sites sequences detected six SNP profiles for attR and eight for attL, respectively. In conclusion, the PVL-positive S. aureus mainly harbored Φ7247PVL/ΦST5967PVL and ΦPVL in the regions studied. lukSF-PV gene sequences, PVL-encoding phages, and phage insertion locus generally varied with lineages. Moreover, PVL-positive clones that have emerged worldwide likely carry distinct phages.
Structural characterization of copia-type retrotransposons leads to insights into the marker development in a biofuel crop, Jatropha curcas L.

PubMed Central

2013-01-01

Background Recently, Jatropha curcas L. has attracted worldwide attention for its potential as a source of biodiesel. However, most DNA markers have demonstrated high levels of genetic similarity among and within jatropha populations around the globe. Despite promising features of copia-type retrotransposons as ideal genetic tools for gene tagging, mutagenesis, and marker-assisted selection, they have not been characterized in the jatropha genome yet. Here, we examined the diversity, evolution, and genome-wide organization of copia-type retrotransposons in the Asian, African, and Mesoamerican accessions of jatropha, then introduced a retrotransposon-based marker for this biofuel crop. Results In total, 157 PCR fragments that were amplified using the degenerate primers for the reverse transcriptase (RT) domain of copia-type retroelements were sequenced and aligned to construct the neighbor-joining tree. Phylogenetic analysis demonstrated that isolated copia RT sequences were classified into ten families, which were then grouped into three lineages. An in-depth study of the jatropha genome for the RT sequences of each family led to the characterization of full consensus sequences of the jatropha copia-type families. Estimated copy numbers of target sequences were largely different among families, as was presence of genes within 5 kb flanking regions for each family. Five copia-type families were as appealing candidates for the development of DNA marker systems. A candidate marker from family Jc7 was particularly capable of detecting genetic variation among different jatropha accessions. Fluorescence in situ hybridization (FISH) to metaphase chromosomes reveals that copia-type retrotransposons are scattered across chromosomes mainly located in the distal part regions. Conclusion This is the first report on genome-wide analysis and the cytogenetic mapping of copia-type retrotransposons of jatropha, leading to the discovery of families bearing high potential as DNA markers. Distinct dynamics of individual copia-type families, feasibility of a retrotransposon-based insertion polymorphism marker system in examining genetic variability, and approaches for the development of breeding strategies in jatropha using copia-type retrotransposons are discussed. PMID:24020916
Comparison of advanced whole genome sequence-based methods to distinguish strains of Salmonella enterica serovar Heidelberg involved in foodborne outbreaks in Québec.

PubMed

Vincent, Caroline; Usongo, Valentine; Berry, Chrystal; Tremblay, Denise M; Moineau, Sylvain; Yousfi, Khadidja; Doualla-Bell, Florence; Fournier, Eric; Nadon, Céline; Goodridge, Lawrence; Bekal, Sadjia

2018-08-01

Salmonella enterica serovar Heidelberg (S. Heidelberg) is one of the top serovars causing human salmonellosis. This serovar ranks second and third among serovars that cause human infections in Québec and Canada, respectively, and has been associated with severe infections. Traditional typing methods such as PFGE do not display adequate discrimination required to resolve outbreak investigations due to the low level of genetic diversity of isolates belonging to this serovar. This study evaluates the ability of four whole genome sequence (WGS)-based typing methods to differentiate among 145 S. Heidelberg strains involved in four distinct outbreak events and sporadic cases of salmonellosis that occurred in Québec between 2007 and 2016. Isolates from all outbreaks were indistinguishable by PFGE. The core genome single nucleotide variant (SNV), core genome multilocus sequence typing (MLST) and whole genome MLST approaches were highly discriminatory and separated outbreak strains into four distinct phylogenetic clusters that were concordant with the epidemiological data. The clustered regularly interspaced short palindromic repeats (CRISPR) typing method was less discriminatory. However, CRISPR typing may be used as a secondary method to differentiate isolates of S. Heidelberg that are genetically similar but epidemiologically unrelated to outbreak events. WGS-based typing methods provide a highly discriminatory alternative to PFGE for the laboratory investigation of foodborne outbreaks. Copyright © 2018 Elsevier Ltd. All rights reserved.
Coupling detrended fluctuation analysis for multiple warehouse-out behavioral sequences

NASA Astrophysics Data System (ADS)

Yao, Can-Zhong; Lin, Ji-Nan; Zheng, Xu-Zhou

2017-01-01

Interaction patterns among different warehouses could make the warehouse-out behavioral sequences less predictable. We firstly take a coupling detrended fluctuation analysis on the warehouse-out quantity, and find that the multivariate sequences exhibit significant coupling multifractal characteristics regardless of the types of steel products. Secondly, we track the sources of multifractal warehouse-out sequences by shuffling and surrogating original ones, and we find that fat-tail distribution contributes more to multifractal features than the long-term memory, regardless of types of steel products. From perspective of warehouse contribution, some warehouses steadily contribute more to multifractal than other warehouses. Finally, based on multiscale multifractal analysis, we propose Hurst surface structure to investigate coupling multifractal, and show that multiple behavioral sequences exhibit significant coupling multifractal features that emerge and usually be restricted within relatively greater time scale interval.
Genetic diversity analysis of Leuconostoc mesenteroides from Korean vegetables and food products by multilocus sequence typing.

PubMed

Sharma, Anshul; Kaur, Jasmine; Lee, Sulhee; Park, Young-Seo

2018-06-01

In the present study, 35 Leuconostoc mesenteroides strains isolated from vegetables and food products from South Korea were studied by multilocus sequence typing (MLST) of seven housekeeping genes (atpA, groEL, gyrB, pheS, pyrG, rpoA, and uvrC). The fragment sizes of the seven amplified housekeeping genes ranged in length from 366 to 1414 bp. Sequence analysis indicated 27 different sequence types (STs) with 25 of them being represented by a single strain indicating high genetic diversity, whereas the remaining 2 were characterized by five strains each. In total, 220 polymorphic nucleotide sites were detected among seven housekeeping genes. The phylogenetic analysis based on the STs of the seven loci indicated that the 35 strains belonged to two major groups, A (28 strains) and B (7 strains). Split decomposition analysis showed that intraspecies recombination played a role in generating diversity among strains. The minimum spanning tree showed that the evolution of the STs was not correlated with food source. This study signifies that the multilocus sequence typing is a valuable tool to access the genetic diversity among L. mesenteroides strains from South Korea and can be used further to monitor the evolutionary changes.
Use of Whole-Genus Genome Sequence Data To Develop a Multilocus Sequence Typing Tool That Accurately Identifies Yersinia Isolates to the Species and Subspecies Levels

PubMed Central

Hall, Miquette; Chattaway, Marie A.; Reuter, Sandra; Savin, Cyril; Strauch, Eckhard; Carniel, Elisabeth; Connor, Thomas; Van Damme, Inge; Rajakaruna, Lakshani; Rajendram, Dunstan; Jenkins, Claire; Thomson, Nicholas R.

2014-01-01

The genus Yersinia is a large and diverse bacterial genus consisting of human-pathogenic species, a fish-pathogenic species, and a large number of environmental species. Recently, the phylogenetic and population structure of the entire genus was elucidated through the genome sequence data of 241 strains encompassing every known species in the genus. Here we report the mining of this enormous data set to create a multilocus sequence typing-based scheme that can identify Yersinia strains to the species level to a level of resolution equal to that for whole-genome sequencing. Our assay is designed to be able to accurately subtype the important human-pathogenic species Yersinia enterocolitica to whole-genome resolution levels. We also report the validation of the scheme on 386 strains from reference laboratory collections across Europe. We propose that the scheme is an important molecular typing system to allow accurate and reproducible identification of Yersinia isolates to the species level, a process often inconsistent in nonspecialist laboratories. Additionally, our assay is the most phylogenetically informative typing scheme available for Y. enterocolitica. PMID:25339391
Defining and Evaluating a Core Genome Multilocus Sequence Typing Scheme for Genome-Wide Typing of Clostridium difficile.

PubMed

Bletz, Stefan; Janezic, Sandra; Harmsen, Dag; Rupnik, Maja; Mellmann, Alexander

2018-06-01

Clostridium difficile , recently renamed Clostridioides difficile , is the most common cause of antibiotic-associated nosocomial gastrointestinal infections worldwide. To differentiate endogenous infections and transmission events, highly discriminatory subtyping is necessary. Today, methods based on whole-genome sequencing data are increasingly used to subtype bacterial pathogens; however, frequently a standardized methodology and typing nomenclature are missing. Here we report a core genome multilocus sequence typing (cgMLST) approach developed for C. difficile Initially, we determined the breadth of the C. difficile population based on all available MLST sequence types with Bayesian inference (BAPS). The resulting BAPS partitions were used in combination with C. difficile clade information to select representative isolates that were subsequently used to define cgMLST target genes. Finally, we evaluated the novel cgMLST scheme with genomes from 3,025 isolates. BAPS grouping ( n = 6 groups) together with the clade information led to a total of 11 representative isolates that were included for cgMLST definition and resulted in 2,270 cgMLST genes that were present in all isolates. Overall, 2,184 to 2,268 cgMLST targets were detected in the genome sequences of 70 outbreak-associated and reference strains, and on average 99.3% cgMLST targets (1,116 to 2,270 targets) were present in 2,954 genomes downloaded from the NCBI database, underlining the representativeness of the cgMLST scheme. Moreover, reanalyzing different cluster scenarios with cgMLST were concordant to published single nucleotide variant analyses. In conclusion, the novel cgMLST is representative for the whole C. difficile population, is highly discriminatory in outbreak situations, and provides a unique nomenclature facilitating interlaboratory exchange. Copyright © 2018 American Society for Microbiology.
Phylogenetic analysis of Mycobacterium massiliense strains having recombinant rpoB gene laterally transferred from Mycobacterium abscessus.

PubMed

Kim, Byoung-Jun; Kim, Ga-Na; Kim, Bo-Ram; Shim, Tae-Sun; Kook, Yoon-Hoh; Kim, Bum-Joon

2017-01-01

Recent multi locus sequence typing (MLST) and genome based studies indicate that lateral gene transfer (LGT) events in the rpoB gene are prevalent between Mycobacterium abscessus complex strains. To check the prevalence of the M. massiliense strains subject to rpoB LGT (Rec-mas), we applied rpoB typing (711 bp) to 106 Korean strains of M. massiliense infection that had already been identified by hsp65 sequence analysis (603 bp). The analysis indicated 6 smooth strains in M. massiliense Type I (10.0%, 6/60) genotypes but no strains in M. massiliense Type II genotypes (0%, 0/46), showing a discrepancy between the 2 typing methods. Further MLST analysis based on the partial sequencing of seven housekeeping genes, argH, cya, glpK, gnd, murC, pta and purH, as well as erm(41) PCR proved that these 6 Rec-mas strains consisted of two distinct genotypes belonging to M. massiliense and not M. abscessus. The complete rpoB sequencing analysis showed that these 6 Rec-mas strains have an identical hybrid rpoB gene, of which a 478 bp partial rpoB fragment may be laterally transferred from M. abscessus. Notably, five of the 6 Rec-mas strains showed complete identical sequences in a total of nine genes, including the seven MLST genes, hsp65, and rpoB, suggesting their clonal propagation in South Korea. In conclusion, we identified 6 M. massiliense smooth strains of 2 phylogenetically distinct genotypes with a specific hybrid rpoB gene laterally transferred from M. abscessus from Korean patients. Their clinical relevance and bacteriological traits remain to be elucidated.
Phylogenetic analysis of Mycobacterium massiliense strains having recombinant rpoB gene laterally transferred from Mycobacterium abscessus

PubMed Central

Kim, Byoung-Jun; Kim, Ga-Na; Kim, Bo-Ram; Shim, Tae-Sun; Kook, Yoon-Hoh

2017-01-01

Recent multi locus sequence typing (MLST) and genome based studies indicate that lateral gene transfer (LGT) events in the rpoB gene are prevalent between Mycobacterium abscessus complex strains. To check the prevalence of the M. massiliense strains subject to rpoB LGT (Rec-mas), we applied rpoB typing (711 bp) to 106 Korean strains of M. massiliense infection that had already been identified by hsp65 sequence analysis (603 bp). The analysis indicated 6 smooth strains in M. massiliense Type I (10.0%, 6/60) genotypes but no strains in M. massiliense Type II genotypes (0%, 0/46), showing a discrepancy between the 2 typing methods. Further MLST analysis based on the partial sequencing of seven housekeeping genes, argH, cya, glpK, gnd, murC, pta and purH, as well as erm(41) PCR proved that these 6 Rec-mas strains consisted of two distinct genotypes belonging to M. massiliense and not M. abscessus. The complete rpoB sequencing analysis showed that these 6 Rec-mas strains have an identical hybrid rpoB gene, of which a 478 bp partial rpoB fragment may be laterally transferred from M. abscessus. Notably, five of the 6 Rec-mas strains showed complete identical sequences in a total of nine genes, including the seven MLST genes, hsp65, and rpoB, suggesting their clonal propagation in South Korea. In conclusion, we identified 6 M. massiliense smooth strains of 2 phylogenetically distinct genotypes with a specific hybrid rpoB gene laterally transferred from M. abscessus from Korean patients. Their clinical relevance and bacteriological traits remain to be elucidated. PMID:28604829
Distribution of sequence-based types of legionella pneumophila serogroup 1 strains isolated from cooling towers, hot springs, and potable water systems in China.

PubMed

Qin, Tian; Zhou, Haijian; Ren, Hongyu; Guan, Hong; Li, Machao; Zhu, Bingqing; Shao, Zhujun

2014-04-01

Legionella pneumophila serogroup 1 causes Legionnaires' disease. Water systems contaminated with Legionella are the implicated sources of Legionnaires' disease. This study analyzed L. pneumophila serogroup 1 strains in China using sequence-based typing. Strains were isolated from cooling towers (n = 96), hot springs (n = 42), and potable water systems (n = 26). Isolates from cooling towers, hot springs, and potable water systems were divided into 25 sequence types (STs; index of discrimination [IOD], 0.711), 19 STs (IOD, 0.934), and 3 STs (IOD, 0.151), respectively. The genetic variation among the potable water isolates was lower than that among cooling tower and hot spring isolates. ST1 was the predominant type, accounting for 49.4% of analyzed strains (n = 81), followed by ST154. With the exception of two strains, all potable water isolates (92.3%) belonged to ST1. In contrast, 53.1% (51/96) and only 14.3% (6/42) of cooling tower and hot spring, respectively, isolates belonged to ST1. There were differences in the distributions of clone groups among the water sources. The comparisons among L. pneumophila strains isolated in China, Japan, and South Korea revealed that similar clones (ST1 complex and ST154 complex) exist in these countries. In conclusion, in China, STs had several unique allelic profiles, and ST1 was the most prevalent sequence type of environmental L. pneumophila serogroup 1 isolates, similar to its prevalence in Japan and South Korea.
A Wide Variety of Clostridium perfringens Type A Food-Borne Isolates That Carry a Chromosomal cpe Gene Belong to One Multilocus Sequence Typing Cluster

PubMed Central

Xiao, Yinghua; Wagendorp, Arjen; Moezelaar, Roy; Abee, Tjakko

2012-01-01

Of 98 suspected food-borne Clostridium perfringens isolates obtained from a nationwide survey by the Food and Consumer Product Safety Authority in The Netherlands, 59 strains were identified as C. perfringens type A. Using PCR-based techniques, the cpe gene encoding enterotoxin was detected in eight isolates, showing a chromosomal location for seven isolates and a plasmid location for one isolate. Further characterization of these strains by using (GTG)5 fingerprint repetitive sequence-based PCR analysis distinguished C. perfringens from other sulfite-reducing clostridia but did not allow for differentiation between various types of C. perfringens strains. To characterize the C. perfringens strains further, multilocus sequence typing (MLST) analysis was performed on eight housekeeping genes of both enterotoxic and non-cpe isolates, and the data were combined with a previous global survey covering strains associated with food poisoning, gas gangrene, and isolates from food or healthy individuals. This revealed that the chromosomal cpe strains (food strains and isolates from food poisoning cases) belong to a distinct cluster that is significantly distant from all the other cpe plasmid-carrying and cpe-negative strains. These results suggest that different groups of C. perfringens have undergone niche specialization and that a distinct group of food isolates has specific core genome sequences. Such findings have epidemiological and evolutionary significance. Better understanding of the origin and reservoir of enterotoxic C. perfringens may allow for improved control of this organism in foods. PMID:22865060
A small test of a sequence-based typing method: definition of the B*1520 allele.

PubMed

Domena, J D; Little, A M; Arnett, K L; Adams, E J; Marsh, S G; Parham, P

1994-10-01

Santamaria et al. (Human Immunology 1993 37: 39-50) describe a method of sequence-based typing (SBT) for HLA-A, B and C alleles said to give "unambiguous typing of any sample, heterozygous or homozygous, without requiring additional typing information". From SBT analysis, which involves determination of partial sequences of mixed alleles, these investigators reported that cell lines KT17 (HLA-B35,62) and OLGA (HLA-B62) from the reference panel of the 10th International Histocompatibility Workshop express novel variants of HLA-B15 (B1501-MN6) and HLA-B35 (B3501-MN7) respectively. To study further the novel alleles, we cloned and sequenced full-length HLA-B cDNA clones isolated from the KT17 and OLGA cell lines. We find that KT17 expresses B*3501, as assigned by SBT, and B*1501, the common allele encoding the B62 antigen. We were unable to confirm that KT17 expresses the novel B1501-MN6 variant identified by SBT. For OLGA our analysis confirms the partial sequences obtained by SBT. Thus OLGA expresses B*1501 and a novel HLA-B allele. The complete sequence of the latter shows it is a hybrid having exons 1 and 2 in common with B*1501 and other B15 subtypes and exons 3-7 in common with B*3501 and related molecules including B*5301 and B*5801. The novel allele has been designated B*1520 because of its sequence similarity with the B15 group; furthermore, serological analysis shows that the B*1520 product does not express epitopes in common with either B35, B53 or B58. The B*1520 heavy chain has a similar isoelectric point to A*3101; B*1520 was undetected by previous applications of isoelectric focusing because B*1520 and A31 are both expressed by OLGA. In conclusion, HLA-B typing of two cell lines by cDNA cloning and sequencing gives concordant results with SBT for three of the four alleles. The cause of the discrepancy for the fourth allele is unknown, however, this finding indicates that the novel HLA-A, B and C sequences emerging from SBT studies need independent verification.
Pyrosequencing-based quantitative measurement of CALR mutation allele burdens and their clinical implications in patients with myeloproliferative neoplasms.

PubMed

Oh, Yejin; Song, Ik-Chan; Kim, Jimyung; Kwon, Gye Cheol; Koo, Sun Hoe; Kim, Seon Young

2018-05-01

We developed a pyrosequencing-based method for the quantification of CALR mutations and compared the results using Sanger sequencing, fragment length analysis (FLA), digital-droplet PCR (ddPCR), and next-generation sequencing (NGS). Method validation studies were performed using cloned plasmid controls. Samples from 24 patients with myeloproliferative neoplasms were evaluated. Among the 24 patients, 15 had CALR mutations (7 type 1, 2 type 2, and 6 other mutations). The type 1 or type 2 mutation-positive results from pyrosequencing exhibited 100% concordance with the Sanger sequencing results. One novel CALR mutation was not detected by pyrosequencing. The CALR mutation allele burdens measured by pyrosequencing were slightly lower than those measured by FLA but slightly higher than the results obtained using ddPCR. Pyrosequencing exhibited high correlations with both methods. The mutation allele burdens estimated by NGS were significantly lower than those measured by pyrosequencing. An increased CALR mutation allele burden was associated with overt primary myelofibrosis. Patients with >70% mutation allele burdens in myeloid cells had a significantly longer time from diagnosis (P = 0.007), more bone marrow fibrosis (P = 0.010), and lower hemoglobin (P = 0.007). Pyrosequencing was a useful rapid sequencing method to determine the burden of CALR mutations. Copyright © 2018 Elsevier B.V. All rights reserved.
Multilocus sequence typing and pulsed-field gel electrophoresis analysis of Oenococcus oeni from different wine-producing regions of China.

PubMed

Wang, Tao; Li, Hua; Wang, Hua; Su, Jing

2015-04-16

The present study established a typing method with NotI-based pulsed-field gel electrophoresis (PFGE) and stress response gene schemed multilocus sequence typing (MLST) for 55 Oenococcus oeni strains isolated from six individual regions in China and two model strains PSU-1 (CP000411) and ATCC BAA-1163 (AAUV00000000). Seven stress response genes, cfa, clpL, clpP, ctsR, mleA, mleP and omrA, were selected for MLST testing, and positive selective pressure was detected for these genes. Furthermore, both methods separated the strains into two clusters. The PFGE clusters are correlated with the region, whereas the sequence types (STs) formed by the MLST confirm the two clusters identified by PFGE. In addition, the population structure was a mixture of evolutionary pathways, and the strains exhibited both clonal and panmictic characteristics. Copyright © 2015 Elsevier B.V. All rights reserved.
CRISPRTarget

PubMed Central

Biswas, Ambarish; Gagnon, Joshua N.; Brouns, Stan J.J.; Fineran, Peter C.; Brown, Chris M.

2013-01-01

The bacterial and archaeal CRISPR/Cas adaptive immune system targets specific protospacer nucleotide sequences in invading organisms. This requires base pairing between processed CRISPR RNA and the target protospacer. For type I and II CRISPR/Cas systems, protospacer adjacent motifs (PAM) are essential for target recognition, and for type III, mismatches in the flanking sequences are important in the antiviral response. In this study, we examine the properties of each class of CRISPR. We use this information to provide a tool (CRISPRTarget) that predicts the most likely targets of CRISPR RNAs (http://bioanalysis.otago.ac.nz/CRISPRTarget). This can be used to discover targets in newly sequenced genomic or metagenomic data. To test its utility, we discover features and targets of well-characterized Streptococcus thermophilus and Sulfolobus solfataricus type II and III CRISPR/Cas systems. Finally, in Pectobacterium species, we identify new CRISPR targets and propose a model of temperate phage exposure and subsequent inhibition by the type I CRISPR/Cas systems. PMID:23492433
Genetic diversity of pneumococcal surface protein A in invasive pneumococcal isolates from Korean children, 1991-2016.

PubMed

Yun, Ki Wook; Choi, Eun Hwa; Lee, Hoan Jong

2017-01-01

Pneumococcal surface protein A (PspA) is an important virulence factor of pneumococci and has been investigated as a primary component of a capsular serotype-independent pneumococcal vaccine. Thus, we sought to determine the genetic diversity of PspA to explore its potential as a vaccine candidate. Among the 190 invasive pneumococcal isolates collected from Korean children between 1991 and 2016, two (1.1%) isolates were found to have no pspA by multiple polymerase chain reactions. The full length pspA genes from 185 pneumococcal isolates were sequenced. The length of pspA varied, ranging from 1,719 to 2,301 base pairs with 55.7-100% nucleotide identity. Based on the sequences of the clade-defining regions, 68.7% and 49.7% were in PspA family 2 and clade 3/family 2, respectively. PspA clade types were correlated with genotypes using multilocus sequence typing and divided into several subclades based on diversity analysis of the N-terminal α-helical regions, which showed nucleotide sequence identities of 45.7-100% and amino acid sequence identities of 23.1-100%. Putative antigenicity plots were also diverse among individual clades and subclades. The differences in antigenicity patterns were concentrated within the N-terminal 120 amino acids. In conclusion, the N-terminal α-helical domain, which is known to be the major immunogenic portion of PspA, is genetically variable and should be further evaluated for antigenic differences and cross-reactivity between various PspA types from pneumococcal isolates.
Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases

PubMed Central

Schadt, Eric E.; Banerjee, Onureena; Fang, Gang; Feng, Zhixing; Wong, Wing H.; Zhang, Xuegong; Kislyuk, Andrey; Clark, Tyson A.; Luong, Khai; Keren-Paz, Alona; Chess, Andrew; Kumar, Vipin; Chen-Plotkin, Alice; Sondheimer, Neal; Korlach, Jonas; Kasarskis, Andrew

2013-01-01

Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of interest, progress has not been as dramatic regarding epigenetic changes and base-level damage to DNA, largely due to technological limitations in assaying all known and unknown types of modifications at genome scale. Recently, single-molecule real time (SMRT) sequencing has been reported to identify kinetic variation (KV) events that have been demonstrated to reflect epigenetic changes of every known type, providing a path forward for detecting base modifications as a routine part of sequencing. However, to date no statistical framework has been proposed to enhance the power to detect these events while also controlling for false-positive events. By modeling enzyme kinetics in the neighborhood of an arbitrary location in a genomic region of interest as a conditional random field, we provide a statistical framework for incorporating kinetic information at a test position of interest as well as at neighboring sites that help enhance the power to detect KV events. The performance of this and related models is explored, with the best-performing model applied to plasmid DNA isolated from Escherichia coli and mitochondrial DNA isolated from human brain tissue. We highlight widespread kinetic variation events, some of which strongly associate with known modification events, while others represent putative chemically modified sites of unknown types. PMID:23093720

Modeling kinetic rate variation in third generation DNA sequencing data to detect putative modifications to DNA bases.

PubMed

Schadt, Eric E; Banerjee, Onureena; Fang, Gang; Feng, Zhixing; Wong, Wing H; Zhang, Xuegong; Kislyuk, Andrey; Clark, Tyson A; Luong, Khai; Keren-Paz, Alona; Chess, Andrew; Kumar, Vipin; Chen-Plotkin, Alice; Sondheimer, Neal; Korlach, Jonas; Kasarskis, Andrew

2013-01-01

Current generation DNA sequencing instruments are moving closer to seamlessly sequencing genomes of entire populations as a routine part of scientific investigation. However, while significant inroads have been made identifying small nucleotide variation and structural variations in DNA that impact phenotypes of interest, progress has not been as dramatic regarding epigenetic changes and base-level damage to DNA, largely due to technological limitations in assaying all known and unknown types of modifications at genome scale. Recently, single-molecule real time (SMRT) sequencing has been reported to identify kinetic variation (KV) events that have been demonstrated to reflect epigenetic changes of every known type, providing a path forward for detecting base modifications as a routine part of sequencing. However, to date no statistical framework has been proposed to enhance the power to detect these events while also controlling for false-positive events. By modeling enzyme kinetics in the neighborhood of an arbitrary location in a genomic region of interest as a conditional random field, we provide a statistical framework for incorporating kinetic information at a test position of interest as well as at neighboring sites that help enhance the power to detect KV events. The performance of this and related models is explored, with the best-performing model applied to plasmid DNA isolated from Escherichia coli and mitochondrial DNA isolated from human brain tissue. We highlight widespread kinetic variation events, some of which strongly associate with known modification events, while others represent putative chemically modified sites of unknown types.
Somatosensory neuron types identified by high-coverage single-cell RNA-sequencing and functional heterogeneity

PubMed Central

Li, Chang-Lin; Li, Kai-Cheng; Wu, Dan; Chen, Yan; Luo, Hao; Zhao, Jing-Rong; Wang, Sa-Shuang; Sun, Ming-Ming; Lu, Ying-Jin; Zhong, Yan-Qing; Hu, Xu-Ye; Hou, Rui; Zhou, Bei-Bei; Bao, Lan; Xiao, Hua-Sheng; Zhang, Xu

2016-01-01

Sensory neurons are distinguished by distinct signaling networks and receptive characteristics. Thus, sensory neuron types can be defined by linking transcriptome-based neuron typing with the sensory phenotypes. Here we classify somatosensory neurons of the mouse dorsal root ganglion (DRG) by high-coverage single-cell RNA-sequencing (10 950 ± 1 218 genes per neuron) and neuron size-based hierarchical clustering. Moreover, single DRG neurons responding to cutaneous stimuli are recorded using an in vivo whole-cell patch clamp technique and classified by neuron-type genetic markers. Small diameter DRG neurons are classified into one type of low-threshold mechanoreceptor and five types of mechanoheat nociceptors (MHNs). Each of the MHN types is further categorized into two subtypes. Large DRG neurons are categorized into four types, including neurexophilin 1-expressing MHNs and mechanical nociceptors (MNs) expressing BAI1-associated protein 2-like 1 (Baiap2l1). Mechanoreceptors expressing trafficking protein particle complex 3-like and Baiap2l1-marked MNs are subdivided into two subtypes each. These results provide a new system for cataloging somatosensory neurons and their transcriptome databases. PMID:26691752
mirVAFC: A Web Server for Prioritizations of Pathogenic Sequence Variants from Exome Sequencing Data via Classifications.

PubMed

Li, Zhongshan; Liu, Zhenwei; Jiang, Yi; Chen, Denghui; Ran, Xia; Sun, Zhong Sheng; Wu, Jinyu

2017-01-01

Exome sequencing has been widely used to identify the genetic variants underlying human genetic disorders for clinical diagnoses, but the identification of pathogenic sequence variants among the huge amounts of benign ones is complicated and challenging. Here, we describe a new Web server named mirVAFC for pathogenic sequence variants prioritizations from clinical exome sequencing (CES) variant data of single individual or family. The mirVAFC is able to comprehensively annotate sequence variants, filter out most irrelevant variants using custom criteria, classify variants into different categories as for estimated pathogenicity, and lastly provide pathogenic variants prioritizations based on classifications and mutation effects. Case studies using different types of datasets for different diseases from publication and our in-house data have revealed that mirVAFC can efficiently identify the right pathogenic candidates as in original work in each case. Overall, the Web server mirVAFC is specifically developed for pathogenic sequence variant identifications from family-based CES variants using classification-based prioritizations. The mirVAFC Web server is freely accessible at https://www.wzgenomics.cn/mirVAFC/. © 2016 WILEY PERIODICALS, INC.
High speed nucleic acid sequencing

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2011-05-17

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.
Population structure of Lactobacillus helveticus isolates from naturally fermented dairy products based on multilocus sequence typing.

PubMed

Sun, Zhihong; Liu, Wenjun; Song, Yuqin; Xu, Haiyan; Yu, Jie; Bilige, Menghe; Zhang, Heping; Chen, Yongfu

2015-05-01

Lactobacillus helveticus is an economically important lactic acid bacterium used in industrial dairy fermentation. In the present study, the population structure of 245 isolates of L. helveticus from different naturally fermented dairy products in China and Mongolia were investigated using an multilocus sequence typing scheme with 11 housekeeping genes. A total of 108 sequence types were detected, which formed 8 clonal complexes and 27 singletons. Results from Structure, SplitsTree, and ClonalFrame software analyses demonstrated the presence of 3 subpopulations in the L. helveticus isolates used in our study, namely koumiss, kurut-tarag, and panmictic lineages. Most L. helveticus isolates from particular ecological origins had specific population structures. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
HLA-B*5808, a new HLA-B allele characterized by sequence based typing.

PubMed

Poli, F; Crespiatico, L; Frison, S; Longhi, E; Marlianici, E; Scalamogna, M

2003-12-01

This brief communication describes a new HLA-B allele (HLA-B*5808) detected in an Italian white volunteer bone marrow donor. With serology, this subject was typed as HLA-B15,17, whereas with molecular biology B*15, B*51, B*52 and/or B*58 could be assigned. In order to clarify the results, direct and cloning sequencing of exons 2, 3 and 4 were carried out. This new allele is identical to HLA-B*5801 in exon 2 except for a silent point mutation at nucleotide 141 where a C is substituted by a T; exons 3 and 4 are typical of HLA-B*51, B*52 and B*78. The peculiar sequence of B*5808 could explain the discrepancy between the serological and molecular typing results.
Evolution of EF-hand calcium-modulated proteins. IV. Exon shuffling did not determine the domain compositions of EF-hand proteins

NASA Technical Reports Server (NTRS)

Kretsinger, R. H.; Nakayama, S.

1993-01-01

In the previous three reports in this series we demonstrated that the EF-hand family of proteins evolved by a complex pattern of gene duplication, transposition, and splicing. The dendrograms based on exon sequences are nearly identical to those based on protein sequences for troponin C, the essential light chain myosin, the regulatory light chain, and calpain. This validates both the computational methods and the dendrograms for these subfamilies. The proposal of congruence for calmodulin, troponin C, essential light chain, and regulatory light chain was confirmed. There are, however, significant differences in the calmodulin dendrograms computed from DNA and from protein sequences. In this study we find that introns are distributed throughout the EF-hand domain and the interdomain regions. Further, dendrograms based on intron type and distribution bear little resemblance to those based on protein or on DNA sequences. We conclude that introns are inserted, and probably deleted, with relatively high frequency. Further, in the EF-hand family exons do not correspond to structural domains and exon shuffling played little if any role in the evolution of this widely distributed homolog family. Calmodulin has had a turbulent evolution. Its dendrograms based on protein sequence, exon sequence, 3'-tail sequence, intron sequences, and intron positions all show significant differences.
Clarification of the Concept of Ganoderma orbiforme with High Morphological Plasticity

PubMed Central

Wang, Dong-Mei; Wu, Sheng-Hua; Yao, Yi-Jian

2014-01-01

Ganoderma has been considered a very difficult genus among the polypores to classify and is currently in a state of taxonomic chaos. In a study of Ganoderma collections including numerous type specimens, we found that six species namely G. cupreum, G. densizonatum, G. limushanense, G. mastoporum, G. orbiforme, G. subtornatum, and records of G. fornicatum from Mainland China and Taiwan are very similar to one another in basidiocarp texture, pilear cuticle structure, context color, pore color and basidiospore characteristics. Further, we sequenced the nrDNA ITS region (ITS1 and ITS2) and partial mtDNA SSU region of the studied materials, and performed phylogenetic analyses based on these sequence data. The nrDNA ITS sequence analysis results show that the eight nrDNA ITS sequences derived from this study have single-nucleotide polymorphisms in ITS1 and/or ITS2 at inter- and intra-individual levels. In the nrDNA ITS phylogenetic trees, all the sequences from this study are grouped together with those of G. cupreum and G. mastoporum retrieved from GenBank to form a distinct clade. The mtDNA SSU sequence analysis results reveal that the five mtDNA SSU sequences derived from this study are clustered together with those of G. cupreum retrieved from GenBank and also form a distinct clade in the mtDNA SSU phylogenetic trees. Based on morphological and molecular data, we conclude that the studied taxa are conspecific. Among the names assigned to this species, G. fornicatum given to Asian collections has nomenclatural priority over the others. However, the type of G. fornicatum from Brazil is probably lost and a modern description based on the type lacks. The identification of the Asian collections to G. fornicatum therefore cannot be confirmed. To the best of our knowledge, G. orbiforme is the earliest valid name for use. PMID:24875218
A novel multi-locus sequence typing (MLST) protocol for Leuconostoc lactis isolates from traditional dairy products in China and Mongolia

PubMed Central

2014-01-01

Background Economically, Leuconostoc lactis is one of the most important species in the genus Leuconostoc. It plays an important role in the food industry including the production of dextrans and bacteriocins. Currently, traditional molecular typing approaches for characterisation of this species at the isolate level are either unavailable or are not sufficiently reliable for practical use. Multilocus sequence typing (MLST) is a robust and reliable method for characterising bacterial and fungal species at the molecular level. In this study, a novel MLST protocol was developed for 50 L. lactis isolates from Mongolia and China. Results Sequences from eight targeted genes (groEL, carB, recA, pheS, murC, pyrG, rpoB and uvrC) were obtained. Sequence analysis indicated 20 different sequence types (STs), with 13 of them being represented by a single isolate. Phylogenetic analysis based on the sequences of eight MLST loci indicated that the isolates belonged to two major groups, A (34 isolates) and B (16 isolates). Linkage disequilibrium analyses indicated that recombination occurred at a low frequency in L. lactis, indicating a clonal population structure. Split-decomposition analysis indicated that intraspecies recombination played a role in generating genotypic diversity amongst isolates. Conclusions Our results indicated that MLST is a valuable tool for typing L. lactis isolates that can be used for further monitoring of evolutionary changes and population genetics. PMID:24912963
Lactobacillus delbrueckii subsp. jakobsenii subsp. nov., isolated from dolo wort, an alcoholic fermented beverage in Burkina Faso.

PubMed

Adimpong, David B; Nielsen, Dennis S; Sørensen, Kim I; Vogensen, Finn K; Sawadogo-Lingani, Hagrétou; Derkx, Patrick M F; Jespersen, Lene

2013-10-01

Lactobacillus delbrueckii is divided into five subspecies based on phenotypic and genotypic differences. A novel isolate, designated ZN7a-9(T), was isolated from malted sorghum wort used for making an alcoholic beverage (dolo) in Burkina Faso. The results of 16S rRNA gene sequencing, DNA-DNA hybridization and peptidoglycan cell-wall structure type analyses indicated that it belongs to the species L. delbrueckii. The genome sequence of isolate ZN7a-9(T) was determined by Illumina-based sequencing. Multilocus sequence typing (MLST) and split-decomposition analyses were performed on seven concatenated housekeeping genes obtained from the genome sequence of strain ZN7a-9(T) together with 41 additional L. delbrueckii strains. The results of the MLST and split-decomposition analyses could not establish the exact subspecies of L. delbrueckii represented by strain ZN7a-9(T) as it clustered with L. delbrueckii strains unassigned to any of the recognized subspecies of L. delbrueckii. Strain ZN7a-9(T) additionally differed from the recognized type strains of the subspecies of L. delbrueckii with respect to its carbohydrate fermentation profile. In conclusion, the cumulative results indicate that strain ZN7a-9(T) represents a novel subspecies of L. delbrueckii closely related to Lactobacillus delbrueckii subsp. lactis and Lactobacillus delbrueckii subsp. delbrueckii for which the name Lactobacillus delbrueckii subsp. jakobsenii subsp. nov. is proposed. The type strain is ZN7a-9(T) = DSM 26046(T) = LMG 27067(T).
FlyBase: genes and gene models

PubMed Central

Drysdale, Rachel A.; Crosby, Madeline A.

2005-01-01

FlyBase (http://flybase.org) is the primary repository of genetic and molecular data of the insect family Drosophilidae. For the most extensively studied species, Drosophila melanogaster, a wide range of data are presented in integrated formats. Data types include mutant phenotypes, molecular characterization of mutant alleles and aberrations, cytological maps, wild-type expression patterns, anatomical images, transgenic constructs and insertions, sequence-level gene models and molecular classification of gene product functions. There is a growing body of data for other Drosophila species; this is expected to increase dramatically over the next year, with the completion of draft-quality genomic sequences of an additional 11 Drosphila species. PMID:15608223
Influenza virus sequence feature variant type analysis: evidence of a role for NS1 in influenza virus host range restriction.

PubMed

Noronha, Jyothi M; Liu, Mengya; Squires, R Burke; Pickett, Brett E; Hale, Benjamin G; Air, Gillian M; Galloway, Summer E; Takimoto, Toru; Schmolke, Mirco; Hunt, Victoria; Klem, Edward; García-Sastre, Adolfo; McGee, Monnie; Scheuermann, Richard H

2012-05-01

Genetic drift of influenza virus genomic sequences occurs through the combined effects of sequence alterations introduced by a low-fidelity polymerase and the varying selective pressures experienced as the virus migrates through different host environments. While traditional phylogenetic analysis is useful in tracking the evolutionary heritage of these viruses, the specific genetic determinants that dictate important phenotypic characteristics are often difficult to discern within the complex genetic background arising through evolution. Here we describe a novel influenza virus sequence feature variant type (Flu-SFVT) approach, made available through the public Influenza Research Database resource (www.fludb.org), in which variant types (VTs) identified in defined influenza virus protein sequence features (SFs) are used for genotype-phenotype association studies. Since SFs have been defined for all influenza virus proteins based on known structural, functional, and immune epitope recognition properties, the Flu-SFVT approach allows the rapid identification of the molecular genetic determinants of important influenza virus characteristics and their connection to underlying biological functions. We demonstrate the use of the SFVT approach to obtain statistical evidence for effects of NS1 protein sequence variations in dictating influenza virus host range restriction.
Fuzzy logic based on-line fault detection and classification in transmission line.

PubMed

Adhikari, Shuma; Sinha, Nidul; Dorendrajit, Thingam

2016-01-01

This study presents fuzzy logic based online fault detection and classification of transmission line using Programmable Automation and Control technology based National Instrument Compact Reconfigurable i/o (CRIO) devices. The LabVIEW software combined with CRIO can perform real time data acquisition of transmission line. When fault occurs in the system current waveforms are distorted due to transients and their pattern changes according to the type of fault in the system. The three phase alternating current, zero sequence and positive sequence current data generated by LabVIEW through CRIO-9067 are processed directly for relaying. The result shows that proposed technique is capable of right tripping action and classification of type of fault at high speed therefore can be employed in practical application.
An asparagine residue at the N-terminus affects the maturation process of low molecular weight glutenin subunits of wheat endosperm

PubMed Central

2014-01-01

Background Wheat glutenin polymers are made up of two main subunit types, the high- (HMW-GS) and low- (LMW-GS) molecular weight subunits. These latter are represented by heterogeneous proteins. The most common, based on the first amino acid of the mature sequence, are known as LMW-m and LMW-s types. The mature sequences differ as a consequence of three extra amino acids (MET-) at the N-terminus of LMW-m types. The nucleotide sequences of their encoding genes are, however, nearly identical, so that the relationship between gene and protein sequences is difficult to ascertain. It has been hypothesized that the presence of an asparagine residue in position 23 of the complete coding sequence for the LMW-s type might account for the observed three-residue shortened sequence, as a consequence of cleavage at the asparagine by an asparaginyl endopeptidase. Results We performed site-directed mutagenesis of a LMW-s gene to replace asparagine at position 23 with threonine and thus convert it to a candidate LMW-m type gene. Similarly, a candidate LMW-m type gene was mutated at position 23 to replace threonine with asparagine. Next, we produced transgenic durum wheat (cultivar Svevo) lines by introducing the mutated versions of the LMW-m and LMW-s genes, along with the wild type counterpart of the LMW-m gene. Proteomic comparisons between the transgenic and null segregant plants enabled identification of transgenic proteins by mass spectrometry analyses and Edman N-terminal sequencing. Conclusions Our results show that the formation of LMW-s type relies on the presence of an asparagine residue close to the N-terminus generated by signal peptide cleavage, and that LMW-GS can be quantitatively processed most likely by vacuolar asparaginyl endoproteases, suggesting that those accumulated in the vacuole are not sequestered into stable aggregates that would hinder the action of proteolytic enzymes. Rather, whatever is the mechanism of glutenin polymer transport to the vacuole, the proteins remain available for proteolytic processing, and can be converted to the mature form by the removal of a short N-terminal sequence. PMID:24629124
Generation of a novel next-generation sequencing-based method for the isolation of new human papillomavirus types.

PubMed

Brancaccio, Rosario N; Robitaille, Alexis; Dutta, Sankhadeep; Cuenin, Cyrille; Santare, Daiga; Skenders, Girts; Leja, Marcis; Fischer, Nicole; Giuliano, Anna R; Rollison, Dana E; Grundhoff, Adam; Tommasino, Massimo; Gheit, Tarik

2018-05-07

With the advent of new molecular tools, the discovery of new papillomaviruses (PVs) has accelerated during the past decade, enabling the expansion of knowledge about the viral populations that inhabit the human body. Human PVs (HPVs) are etiologically linked to benign or malignant lesions of the skin and mucosa. The detection of HPV types can vary widely, depending mainly on the methodology and the quality of the biological sample. Next-generation sequencing is one of the most powerful tools, enabling the discovery of novel viruses in a wide range of biological material. Here, we report a novel protocol for the detection of known and unknown HPV types in human skin and oral gargle samples using improved PCR protocols combined with next-generation sequencing. We identified 105 putative new PV types in addition to 296 known types, thus providing important information about the viral distribution in the oral cavity and skin. Copyright © 2018. Published by Elsevier Inc.
Promoter Sequences Prediction Using Relational Association Rule Mining

PubMed Central

Czibula, Gabriela; Bocicor, Maria-Iuliana; Czibula, Istvan Gergely

2012-01-01

In this paper we are approaching, from a computational perspective, the problem of promoter sequences prediction, an important problem within the field of bioinformatics. As the conditions for a DNA sequence to function as a promoter are not known, machine learning based classification models are still developed to approach the problem of promoter identification in the DNA. We are proposing a classification model based on relational association rules mining. Relational association rules are a particular type of association rules and describe numerical orderings between attributes that commonly occur over a data set. Our classifier is based on the discovery of relational association rules for predicting if a DNA sequence contains or not a promoter region. An experimental evaluation of the proposed model and comparison with similar existing approaches is provided. The obtained results show that our classifier overperforms the existing techniques for identifying promoter sequences, confirming the potential of our proposal. PMID:22563233
Stellar model chromospheres. IX - Chromospheric activity in dwarf stars

NASA Technical Reports Server (NTRS)

Kelch, W. L.; Worden, S. P.; Linsky, J. L.

1979-01-01

High-resolution Ca II K line profiles are used to model the upper photospheres and lower chromospheres of eight main-sequence stars ranging in spectral type from F0 to M0 and exhibiting different degrees of chromospheric activity. The model chromospheres are studied as a function of spectral type and activity for stars of similar spectral type in order to obtain evidence of enhanced nonradiative heating in the upper-photospheric models and in the ratio of minimum temperature at the base of the chromosphere to effective temperature, a correlation between activity and temperature in the lower chromospheres, and a correlation of the width at the base of the K-line emission core and at the K2 features with activity. Chromospheric radiative losses are estimated for the modelled stars and other previously analyzed main-sequence stars. The results obtained strengthen the argument that dMe flare stars exhibit fundamentally solar-type activity but on an increased scale.
Labeled nucleotide phosphate (NP) probes

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2009-02-03

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Evaluation of targeted exome sequencing for 28 protein-based blood group systems, including the homologous gene systems, for blood group genotyping.

PubMed

Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A

2017-04-01

Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.
Genome sequence of the mud-dwelling archaeon Methanoplanus limicola type strain (DSM 2279 T), reclassification of Methanoplanus petrolearius as Methanolacinia petrolearia and emended descriptions of the genera Methanoplanus and Methanolacinia

DOE PAGES

Goker, Markus; Lu, Megan; Fiebig, Anne; ...

2014-06-15

Methanoplanus limicola Wildgruber et al. 1984 is a mesophilic methanogen that was isolated from a swamp composed of drilling waste near Naples, Italy, shortly after the Archaea were recognized as a separate domain of life. Methanoplanus is the type genus in the family Methanoplanaceae, a taxon that felt into disuse since modern 16S rRNA gene sequences-based taxonomy was established. Methanoplanus is now placed within the Methanomicrobiaceae, a family that is so far poorly characterized at the genome level. The only other type strain of the genus with a sequenced genome, Methanoplanus petrolearius SEBR 4847 T, turned out to be misclassifiedmore » and required reclassification to Methanolacinia. Both, Methanoplanus and Methanolacinia, needed taxonomic emendations due to a significant deviation of the G+C content of their genomes from previously published (pregenome-sequence era) values. Until now genome sequences were published for only four of the 33 species with validly published names in the Methanomicrobiaceae. Here we describe the features of M. limicola, together with the improved-high-quality draft genome sequence and an notation of the type strain, M3 T. The 3,200,946 bp long chromosome (permanent draft sequence) with its 3,064 protein-coding and 65 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.« less

Laser mass spectrometry for DNA sequencing, disease diagnosis, and fingerprinting

NASA Astrophysics Data System (ADS)

Chen, C. H. Winston; Taranenko, N. I.; Zhu, Y. F.; Chung, C. N.; Allman, S. L.

1997-05-01

Since laser mass spectrometry has the potential for achieving very fast DNA analysis, we recently applied it to DNA sequencing, DNA typing for fingerprinting, and DNA screening for disease diagnosis. Two different approaches for sequencing DNA have been successfully demonstrated. One is to sequence DNA with DNA ladders produced from Sanger's enzymatic method. The other is to do direct sequencing without DNA ladders. The need for quick DNA typing for identification purposes is critical for forensic application. Our preliminary results indicate laser mass spectrometry can possible be used for rapid DNA fingerprinting applications at a much lower cost than gel electrophoresis. Population screening for certain genetic disease can be a very efficient step to reducing medical costs through prevention. Since laser mass spectrometry can provide very fast DNA analysis, we applied laser mass spectrometry to disease diagnosis. Clinical samples with both base deletion and point mutation have been tested with complete success.
Development of a new-type riboswitch using an aptazyme and an anti-RBS sequence.

PubMed

Ogawa, Atsushi; Maeda, Mizuo

2007-01-01

We constructed a new-type riboswitch, which functions in E. coli, using an aptazyme and an anti-RBS sequence. This riboswitch usually suppresses the gene expression with its anti-RBS sequence bound to the RBS of its own mRNA(OFF), while it activates the translation only when a cofactor of the aptazyme is added to release the anti-RBS sequence from itself as a result of cofactor-induced self-cleavage by the aptazyme (ON). Although this aptazyme-based riboswitch did not function at 37 degrees C in vivo in spite of its high activity at this temperature in vitro, it worked well at lower temperature (23 degrees C). We also improved the efficiency of this riboswitch by constructing a cascading system.
Incorporating information on predicted solvent accessibility to the co-evolution-based study of protein interactions.

PubMed

Ochoa, David; García-Gutiérrez, Ponciano; Juan, David; Valencia, Alfonso; Pazos, Florencio

2013-01-27

A widespread family of methods for studying and predicting protein interactions using sequence information is based on co-evolution, quantified as similarity of phylogenetic trees. Part of the co-evolution observed between interacting proteins could be due to co-adaptation caused by inter-protein contacts. In this case, the co-evolution is expected to be more evident when evaluated on the surface of the proteins or the internal layers close to it. In this work we study the effect of incorporating information on predicted solvent accessibility to three methods for predicting protein interactions based on similarity of phylogenetic trees. We evaluate the performance of these methods in predicting different types of protein associations when trees based on positions with different characteristics of predicted accessibility are used as input. We found that predicted accessibility improves the results of two recent versions of the mirrortree methodology in predicting direct binary physical interactions, while it neither improves these methods, nor the original mirrortree method, in predicting other types of interactions. That improvement comes at no cost in terms of applicability since accessibility can be predicted for any sequence. We also found that predictions of protein-protein interactions are improved when multiple sequence alignments with a richer representation of sequences (including paralogs) are incorporated in the accessibility prediction.
Formation of (DNA)2-LNA triplet with recombinant base recognition: A quantum mechanical study

NASA Astrophysics Data System (ADS)

Mall, Vijaya Shri; Tiwari, Rakesh Kumar

2018-05-01

The formation of DNA triple helix offers the verity of new possibilities in molecular biology. However its applications are limited to purine and pyrimidine rich sequences recognized by forming Hoogsteen/Reverse Hoogsteen triplets in major groove sites of DNA duplex. To overcome this drawback modification in bases backbone and glucose of nucleotide unit of DNA have been proposed so that the third strand base recognized by both the bases of DNA duplex by forming Recombinant type(R-type) of bonding in mixed sequences. Here we performed Quanrum Mechanical (Hartree-Fock and DFT) methodology on natural DNA and Locked Nucleic Acids(LNA) triplets using 6-31G and some other new advance basis sets. Study suggests energetically stable conformation has been observed for recombinant triplets in order of G-C*G > A-T*A > G-C*C > T-A*T for both type of triplets. Interestingly LNA leads to more stable conformation in all set of triplets, clearly suggests an important biological tool to overcome above mentioned drawbacks.
Streptococcus caprae sp. nov., isolated from Iberian ibex (Capra pyrenaica hispanica).

PubMed

Vela, A I; Mentaberre, G; Lavín, S; Domínguez, L; Fernández-Garayzábal, J F

2016-01-01

Biochemical and molecular genetic studies were performed on a novel Gram-stain-positive, catalase-negative, coccus-shaped organism isolated from tonsil samples of two Iberian ibexes. The micro-organism was identified as a streptococcal species based on its cellular, morphological and biochemical characteristics. 16S rRNA gene sequence comparison studies confirmed its identification as a member of the genus Streptococcus, but the organism did not correspond to any species of this genus. The nearest phylogenetic relative of the unknown coccus from ibex was Streptococcus porci 2923-03T (96.6 % 16S rRNA gene sequence similarity). Analysis based on rpoB and sodA gene sequences revealed sequence similarity values lower than 86.0 and 83.8 %, respectively, from the type strains of recognized Streptococcus species. The novel bacterial isolate was distinguished from Streptococcus porci and other Streptococcus species using biochemical tests. Based on both phenotypic and phylogenetic findings, it is proposed that the unknown bacterium be classified as representing a novel species of the genus Streptococcus, for which the name Streptococcus caprae sp. nov. is proposed. The type strain is DICM07-02790-1CT ( = CECT 8872T = CCUG 67170T).
An Assessment of Cumulative Axial and Torsional Fatigue in a Cobalt-Base Superalloy

NASA Technical Reports Server (NTRS)

Kalluri, Sreeramesh; Bonacuse, Peter J.

2010-01-01

Cumulative fatigue under axial and torsional loading conditions can include both load-order (higMow and low/high) as well as load-type sequence (axial/torsional and torsional/axial) effects. Previously reported experimental studies on a cobalt-base superalloy, Haynes 188 at 538 C, addressed these effects. These studies characterized the cumulative axial and torsional fatigue behavior under high amplitude followed by low amplitude (Kalluri, S. and Bonacuse, P. J., "Cumulative Axial and Torsional Fatigue: An Investigation of Load-Type Sequance Effects," in Multiaxial Fatigue and Deformation: Testing and Prediction, ASTM STP 1387, S. Kalluri, and P. J. Bonacuse, Eds., American Society for Testing and Materials, West Conshohocken, PA, 2000, pp. 281-301) and low amplitude followed by high amplitude (Bonacuse, P. and Kalluri, S. "Sequenced Axial and Torsional Cumulative Fatigue: Low Amplitude Followed by High Amplitude Loading," Biaxial/Multiaxial Fatigue and Fracture, ESIS Publication 31, A. Carpinteri, M. De Freitas, and A. Spagnoli, Eds., Elsevier, New York, 2003, pp. 165-182) conditions. In both studies, experiments with the following four load-type sequences were performed: (a) axial/axial, (b) torsional/torsional, (c) axial/torsional, and (d) torsional/axial. In this paper, the cumulative axial and torsional fatigue data generated in the two previous studies are combined to generate a comprehensive cumulative fatigue database on both the load-order and load-type sequence effects. This comprehensive database is used to examine applicability of the Palmgren-langer-Miner linear damage rule and a nonlinear damage curve approach for Haynes 188 subjected to the load-order and load-type sequencing described above. Summations of life fractions from the experiments are compared to the predictions from both the linear and nonlinear cumulative fatigue damage approaches. The significance of load-order versus load-type sequence effects for axial and torsional loading conditions is discussed. Possible reasons for the observed differences between the computed and observed summations of cycle fractions are rationalized in terms of the observed ever lutions of cyclic axial and shear stress ranges in the cumulative fatigue tests.
Finding needles in haystacks: linking scientific names, reference specimens and molecular data for Fungi.

PubMed

Schoch, Conrad L; Robbertse, Barbara; Robert, Vincent; Vu, Duong; Cardinali, Gianluigi; Irinyi, Laszlo; Meyer, Wieland; Nilsson, R Henrik; Hughes, Karen; Miller, Andrew N; Kirk, Paul M; Abarenkov, Kessy; Aime, M Catherine; Ariyawansa, Hiran A; Bidartondo, Martin; Boekhout, Teun; Buyck, Bart; Cai, Qing; Chen, Jie; Crespo, Ana; Crous, Pedro W; Damm, Ulrike; De Beer, Z Wilhelm; Dentinger, Bryn T M; Divakar, Pradeep K; Dueñas, Margarita; Feau, Nicolas; Fliegerova, Katerina; García, Miguel A; Ge, Zai-Wei; Griffith, Gareth W; Groenewald, Johannes Z; Groenewald, Marizeth; Grube, Martin; Gryzenhout, Marieka; Gueidan, Cécile; Guo, Liangdong; Hambleton, Sarah; Hamelin, Richard; Hansen, Karen; Hofstetter, Valérie; Hong, Seung-Beom; Houbraken, Jos; Hyde, Kevin D; Inderbitzin, Patrik; Johnston, Peter R; Karunarathna, Samantha C; Kõljalg, Urmas; Kovács, Gábor M; Kraichak, Ekaphan; Krizsan, Krisztina; Kurtzman, Cletus P; Larsson, Karl-Henrik; Leavitt, Steven; Letcher, Peter M; Liimatainen, Kare; Liu, Jian-Kui; Lodge, D Jean; Luangsa-ard, Janet Jennifer; Lumbsch, H Thorsten; Maharachchikumbura, Sajeewa S N; Manamgoda, Dimuthu; Martín, María P; Minnis, Andrew M; Moncalvo, Jean-Marc; Mulè, Giuseppina; Nakasone, Karen K; Niskanen, Tuula; Olariaga, Ibai; Papp, Tamás; Petkovits, Tamás; Pino-Bodas, Raquel; Powell, Martha J; Raja, Huzefa A; Redecker, Dirk; Sarmiento-Ramirez, J M; Seifert, Keith A; Shrestha, Bhushan; Stenroos, Soili; Stielow, Benjamin; Suh, Sung-Oui; Tanaka, Kazuaki; Tedersoo, Leho; Telleria, M Teresa; Udayanga, Dhanushka; Untereiner, Wendy A; Diéguez Uribeondo, Javier; Subbarao, Krishna V; Vágvölgyi, Csaba; Visagie, Cobus; Voigt, Kerstin; Walker, Donald M; Weir, Bevan S; Weiß, Michael; Wijayawardene, Nalin N; Wingfield, Michael J; Xu, J P; Yang, Zhu L; Zhang, Ning; Zhuang, Wen-Ying; Federhen, Scott

2014-01-01

DNA phylogenetic comparisons have shown that morphology-based species recognition often underestimates fungal diversity. Therefore, the need for accurate DNA sequence data, tied to both correct taxonomic names and clearly annotated specimen data, has never been greater. Furthermore, the growing number of molecular ecology and microbiome projects using high-throughput sequencing require fast and effective methods for en masse species assignments. In this article, we focus on selecting and re-annotating a set of marker reference sequences that represent each currently accepted order of Fungi. The particular focus is on sequences from the internal transcribed spacer region in the nuclear ribosomal cistron, derived from type specimens and/or ex-type cultures. Re-annotated and verified sequences were deposited in a curated public database at the National Center for Biotechnology Information (NCBI), namely the RefSeq Targeted Loci (RTL) database, and will be visible during routine sequence similarity searches with NR_prefixed accession numbers. A set of standards and protocols is proposed to improve the data quality of new sequences, and we suggest how type and other reference sequences can be used to improve identification of Fungi. Database URL: http://www.ncbi.nlm.nih.gov/bioproject/PRJNA177353. Published by Oxford University Press 2013. This work is written by US Government employees and is in the public domain in the US.
Predicting membrane protein types by fusing composite protein sequence features into pseudo amino acid composition.

PubMed

Hayat, Maqsood; Khan, Asifullah

2011-02-21

Membrane proteins are vital type of proteins that serve as channels, receptors, and energy transducers in a cell. Prediction of membrane protein types is an important research area in bioinformatics. Knowledge of membrane protein types provides some valuable information for predicting novel example of the membrane protein types. However, classification of membrane protein types can be both time consuming and susceptible to errors due to the inherent similarity of membrane protein types. In this paper, neural networks based membrane protein type prediction system is proposed. Composite protein sequence representation (CPSR) is used to extract the features of a protein sequence, which includes seven feature sets; amino acid composition, sequence length, 2 gram exchange group frequency, hydrophobic group, electronic group, sum of hydrophobicity, and R-group. Principal component analysis is then employed to reduce the dimensionality of the feature vector. The probabilistic neural network (PNN), generalized regression neural network, and support vector machine (SVM) are used as classifiers. A high success rate of 86.01% is obtained using SVM for the jackknife test. In case of independent dataset test, PNN yields the highest accuracy of 95.73%. These classifiers exhibit improved performance using other performance measures such as sensitivity, specificity, Mathew's correlation coefficient, and F-measure. The experimental results show that the prediction performance of the proposed scheme for classifying membrane protein types is the best reported, so far. This performance improvement may largely be credited to the learning capabilities of neural networks and the composite feature extraction strategy, which exploits seven different properties of protein sequences. The proposed Mem-Predictor can be accessed at http://111.68.99.218/Mem-Predictor. Copyright Â© 2010 Elsevier Ltd. All rights reserved.
Application of High-Throughput Next-Generation Sequencing for HLA Typing on Buccal Extracted DNA: Results from over 10,000 Donor Recruitment Samples

PubMed Central

Nguyen, David; Valenzuela, Nicole; Takemura, Ping; Bolon, Yung-Tsi; Springer, Brianna; Saito, Katsuyuki; Zheng, Ying; Hague, Tim; Pasztor, Agnes; Horvath, Gyorgy; Rigo, Krisztina; Reed, Elaine F.; Zhang, Qiuheng

2016-01-01

Background Unambiguous HLA typing is important in hematopoietic stem cell transplantation (HSCT), HLA disease association studies, and solid organ transplantation. However, current molecular typing methods only interrogate the antigen recognition site (ARS) of HLA genes, resulting in many cis-trans ambiguities that require additional typing methods to resolve. Here we report high-resolution HLA typing of 10,063 National Marrow Donor Program (NMDP) registry donors using long-range PCR by next generation sequencing (NGS) approach on buccal swab DNA. Methods Multiplex long-range PCR primers amplified the full-length of HLA class I genes (A, B, C) from promotor to 3’ UTR. Class II genes (DRB1, DQB1) were amplified from exon 2 through part of exon 4. PCR amplicons were pooled and sheared using Covaris fragmentation. Library preparation was performed using the Illumina TruSeq Nano kit on the Beckman FX automated platform. Each sample was tagged with a unique barcode, followed by 2×250 bp paired-end sequencing on the Illumina MiSeq. HLA typing was assigned using Omixon Twin software that combines two independent computational algorithms to ensure high confidence in allele calling. Consensus sequence and typing results were reported in Histoimmunogenetics Markup Language (HML) format. All homozygous alleles were confirmed by Luminex SSO typing and exon novelties were confirmed by Sanger sequencing. Results Using this automated workflow, over 10,063 NMDP registry donors were successfully typed under high-resolution by NGS. Despite known challenges of nucleic acid degradation and low DNA concentration commonly associated with buccal-based specimens, 97.8% of samples were successfully amplified using long-range PCR. Among these, 98.2% were successfully reported by NGS, with an accuracy rate of 99.84% in an independent blind Quality Control audit performed by the NDMP. In this study, NGS-HLA typing identified 23 null alleles (0.023%), 92 rare alleles (0.091%) and 42 exon novelties (0.042%). Conclusion Long-range, unambiguous HLA genotyping is achievable on clinical buccal swab-extracted DNA. Importantly, full-length gene sequencing and the ability to curate full sequence data will permit future interrogation of the impact of introns, expanded exons, and other gene regulatory sequences on clinical outcomes in transplantation. PMID:27798706
Application of High-Throughput Next-Generation Sequencing for HLA Typing on Buccal Extracted DNA: Results from over 10,000 Donor Recruitment Samples.

PubMed

Yin, Yuxin; Lan, James H; Nguyen, David; Valenzuela, Nicole; Takemura, Ping; Bolon, Yung-Tsi; Springer, Brianna; Saito, Katsuyuki; Zheng, Ying; Hague, Tim; Pasztor, Agnes; Horvath, Gyorgy; Rigo, Krisztina; Reed, Elaine F; Zhang, Qiuheng

2016-01-01

Unambiguous HLA typing is important in hematopoietic stem cell transplantation (HSCT), HLA disease association studies, and solid organ transplantation. However, current molecular typing methods only interrogate the antigen recognition site (ARS) of HLA genes, resulting in many cis-trans ambiguities that require additional typing methods to resolve. Here we report high-resolution HLA typing of 10,063 National Marrow Donor Program (NMDP) registry donors using long-range PCR by next generation sequencing (NGS) approach on buccal swab DNA. Multiplex long-range PCR primers amplified the full-length of HLA class I genes (A, B, C) from promotor to 3' UTR. Class II genes (DRB1, DQB1) were amplified from exon 2 through part of exon 4. PCR amplicons were pooled and sheared using Covaris fragmentation. Library preparation was performed using the Illumina TruSeq Nano kit on the Beckman FX automated platform. Each sample was tagged with a unique barcode, followed by 2×250 bp paired-end sequencing on the Illumina MiSeq. HLA typing was assigned using Omixon Twin software that combines two independent computational algorithms to ensure high confidence in allele calling. Consensus sequence and typing results were reported in Histoimmunogenetics Markup Language (HML) format. All homozygous alleles were confirmed by Luminex SSO typing and exon novelties were confirmed by Sanger sequencing. Using this automated workflow, over 10,063 NMDP registry donors were successfully typed under high-resolution by NGS. Despite known challenges of nucleic acid degradation and low DNA concentration commonly associated with buccal-based specimens, 97.8% of samples were successfully amplified using long-range PCR. Among these, 98.2% were successfully reported by NGS, with an accuracy rate of 99.84% in an independent blind Quality Control audit performed by the NDMP. In this study, NGS-HLA typing identified 23 null alleles (0.023%), 92 rare alleles (0.091%) and 42 exon novelties (0.042%). Long-range, unambiguous HLA genotyping is achievable on clinical buccal swab-extracted DNA. Importantly, full-length gene sequencing and the ability to curate full sequence data will permit future interrogation of the impact of introns, expanded exons, and other gene regulatory sequences on clinical outcomes in transplantation.
Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA.

PubMed

Xu, Weijia; Ozer, Stuart; Gutell, Robin R

2009-01-01

With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure.
Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA

PubMed Central

Xu, Weijia; Ozer, Stuart; Gutell, Robin R.

2010-01-01

With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure. PMID:20502534
High-sensitivity HLA typing by Saturated Tiling Capture Sequencing (STC-Seq).

PubMed

Jiao, Yang; Li, Ran; Wu, Chao; Ding, Yibin; Liu, Yanning; Jia, Danmei; Wang, Lifeng; Xu, Xiang; Zhu, Jing; Zheng, Min; Jia, Junling

2018-01-15

Highly polymorphic human leukocyte antigen (HLA) genes are responsible for fine-tuning the adaptive immune system. High-resolution HLA typing is important for the treatment of autoimmune and infectious diseases. Additionally, it is routinely performed for identifying matched donors in transplantation medicine. Although many HLA typing approaches have been developed, the complexity, low-efficiency and high-cost of current HLA-typing assays limit their application in population-based high-throughput HLA typing for donors, which is required for creating large-scale databases for transplantation and precision medicine. Here, we present a cost-efficient Saturated Tiling Capture Sequencing (STC-Seq) approach to capturing 14 HLA class I and II genes. The highly efficient capture (an approximately 23,000-fold enrichment) of these genes allows for simplified allele calling. Tests on five genes (HLA-A/B/C/DRB1/DQB1) from 31 human samples and 351 datasets using STC-Seq showed results that were 98% consistent with the known two sets of digitals (field1 and field2) genotypes. Additionally, STC can capture genomic DNA fragments longer than 3 kb from HLA loci, making the library compatible with the third-generation sequencing. STC-Seq is a highly accurate and cost-efficient method for HLA typing which can be used to facilitate the establishment of population-based HLA databases for the precision and transplantation medicine.
Systematic revision of the adeleid haemogregarines, with creation of Bartazoon n. g., reassignment of Hepatozoon argantis Garnham, 1954 to Hemolivia, and molecular data on Hemolivia stellata

PubMed Central

Karadjian, Grégory; Chavatte, Jean-Marc; Landau, Irène

2015-01-01

Life cycles and molecular data for terrestrial haemogregarines are reviewed in this article. Collection material was re-examined: Hepatozoon argantis Garnham, 1954 in Argas brumpti was reassigned to Hemolivia as Hemolivia argantis (Garnham, 1954) n. comb.; parasite DNA was extracted from a tick crush on smear of an archived slide of Hemolivia stellata in Amblyomma rotondatum, then the 18S ssrRNA gene was amplified by PCR. A systematic revision of the group is proposed, based on biological life cycles and phylogenetic reconstruction. Four types of life cycles, based on parasite vector, vertebrate host and the characteristics of their development, are defined. We propose combining species, based on their biology, into four groups (types I, II, III and IV). The characters of each type are defined and associated with a type genus and a type species. The biological characters of each type are associated with a different genus and a type species. The phylogenetic reconstruction with sequences deposited in the databases and our own new sequence of Hemolivia stellata is consistent with this classification. The classification is as follows: Type I, Hepatozoon Miller, 1908, type species H. perniciosum Miller, 1908; Type II, Karyolysus Labbé, 1894, type species K. lacertae (Danilewsky, 1886) Reichenow, 1913; Type III Hemolivia Petit et al., 1990, type species H. stellata, Petit et al., 1990; and Type IV: Bartazoon n. g., type species B. breinli (Mackerras, 1960). PMID:26551414
Sequence diversity within the reovirus S2 gene: reovirus genes reassort in nature, and their termini are predicted to form a panhandle motif.

PubMed Central

Chapell, J D; Goral, M I; Rodgers, S E; dePamphilis, C W; Dermody, T S

1994-01-01

To better understand genetic diversity within mammalian reoviruses, we determined S2 nucleotide and deduced sigma 2 amino acid sequences of nine reovirus strains and compared these sequences with those of prototype strains of the three reovirus serotypes. The S2 gene and sigma 2 protein are highly conserved among the four type 1, one type 2, and seven type 3 strains studied. Phylogenetic analyses based on S2 nucleotide sequences of the 12 reovirus strains indicate that diversity within the S2 gene is independent of viral serotype. Additionally, we found marked topological differences between phylogenetic trees generated from S1 and S2 gene nucleotide sequences of the seven type 3 strains. These results demonstrate that reovirus S1 and S2 genes have distinct evolutionary histories, thus providing phylogenetic evidence for lateral transfer of reovirus genes in nature. When variability among the 12 sigma 2-encoding S2 nucleotide sequences was analyzed at synonymous positions, we found that approximately 60 nucleotides at the 5' terminus and 30 nucleotides at the 3' terminus were markedly conserved in comparison with other sigma 2-encoding regions of S2. Predictions of RNA secondary structures indicate that the more conserved S2 sequences participate in the formation of an extended region of duplex RNA interrupted by a pair of stem-loops. Among the 12 deduced sigma 2 amino acid sequences examined, substitutions were observed at only 11% of amino acid positions. This finding suggests that constraints on the structure or function of sigma 2, perhaps in part because of its location in the virion core, have limited sequence diversity within this protein. PMID:8289378
Multiple Locus Variable-Number Tandem-Repeat and Single-Nucleotide Polymorphism-Based Brucella Typing Reveals Multiple Lineages in Brucella melitensis Currently Endemic in China.

PubMed

Sun, Mingjun; Jing, Zhigang; Di, Dongdong; Yan, Hao; Zhang, Zhicheng; Xu, Quangang; Zhang, Xiyue; Wang, Xun; Ni, Bo; Sun, Xiangxiang; Yan, Chengxu; Yang, Zhen; Tian, Lili; Li, Jinping; Fan, Weixing

2017-01-01

Brucellosis is a worldwide zoonotic disease caused by Brucella spp. In China, brucellosis is recognized as a reemerging disease mainly caused by Brucella melitensis specie. To better understand the currently endemic B. melitensis strains in China, three Brucella genotyping methods were applied to 110 B. melitensis strains obtained in past several years. By MLVA genotyping, five MLVA-8 genotypes were identified, among which genotypes 42 (1-5-3-13-2-2-3-2) was recognized as the predominant genotype, while genotype 63 (1-5-3-13-2-3-3-2) and a novel genotype of 1-5-3-13-2-4-3-2 were second frequently observed. MLVA-16 discerned a total of 57 MLVA-16 genotypes among these Brucella strains, with 41 genotypes being firstly detected and the other 16 genotypes being previously reported. By BruMLSA21 typing, six sequence types (STs) were identified, among them ST8 is the most frequently seen in China while the other five STs were firstly detected and designated as ST137, ST138, ST139, ST140, and ST141 by international multilocus sequence typing database. Whole-genome sequence (WGS)-single-nucleotide polymorphism (SNP)-based typing and phylogenetic analysis resolved Chinese B. melitensis strains into five clusters, reflecting the existence of multiple lineages among these Chinese B. melitensis strains. In phylogeny, Chinese lineages are more closely related to strains collected from East Mediterranean and Middle East countries, such as Turkey, Kuwait, and Iraq. In the next few years, MLVA typing will certainly remain an important epidemiological tool for Brucella infection analysis, as it displays a high discriminatory ability and achieves result largely in agreement with WGS-SNP-based typing. However, WGS-SNP-based typing is found to be the most powerful and reliable method in discerning Brucella strains and will be popular used in the future.
An efficient, versatile and scalable pattern growth approach to mine frequent patterns in unaligned protein sequences.

PubMed

Ye, Kai; Kosters, Walter A; Ijzerman, Adriaan P

2007-03-15

Pattern discovery in protein sequences is often based on multiple sequence alignments (MSA). The procedure can be computationally intensive and often requires manual adjustment, which may be particularly difficult for a set of deviating sequences. In contrast, two algorithms, PRATT2 (http//www.ebi.ac.uk/pratt/) and TEIRESIAS (http://cbcsrv.watson.ibm.com/) are used to directly identify frequent patterns from unaligned biological sequences without an attempt to align them. Here we propose a new algorithm with more efficiency and more functionality than both PRATT2 and TEIRESIAS, and discuss some of its applications to G protein-coupled receptors, a protein family of important drug targets. In this study, we designed and implemented six algorithms to mine three different pattern types from either one or two datasets using a pattern growth approach. We compared our approach to PRATT2 and TEIRESIAS in efficiency, completeness and the diversity of pattern types. Compared to PRATT2, our approach is faster, capable of processing large datasets and able to identify the so-called type III patterns. Our approach is comparable to TEIRESIAS in the discovery of the so-called type I patterns but has additional functionality such as mining the so-called type II and type III patterns and finding discriminating patterns between two datasets. The source code for pattern growth algorithms and their pseudo-code are available at http://www.liacs.nl/home/kosters/pg/.
On the Sequence-Directed Nature of Human Gene Mutation: The Role of Genomic Architecture and the Local DNA Sequence Environment in Mediating Gene Mutations Underlying Human Inherited Disease

PubMed Central

Cooper, David N.; Bacolla, Albino; Férec, Claude; Vasquez, Karen M.; Kehrer-Sawatzki, Hildegard; Chen, Jian-Min

2011-01-01

Different types of human gene mutation may vary in size, from structural variants (SVs) to single base-pair substitutions, but what they all have in common is that their nature, size and location are often determined either by specific characteristics of the local DNA sequence environment or by higher-order features of the genomic architecture. The human genome is now recognized to contain ‘pervasive architectural flaws’ in that certain DNA sequences are inherently mutation-prone by virtue of their base composition, sequence repetitivity and/or epigenetic modification. Here we explore how the nature, location and frequency of different types of mutation causing inherited disease are shaped in large part, and often in remarkably predictable ways, by the local DNA sequence environment. The mutability of a given gene or genomic region may also be influenced indirectly by a variety of non-canonical (non-B) secondary structures whose formation is facilitated by the underlying DNA sequence. Since these non-B DNA structures can interfere with subsequent DNA replication and repair, and may serve to increase mutation frequencies in generalized fashion (i.e. both in the context of subtle mutations and SVs), they have the potential to serve as a unifying concept in studies of mutational mechanisms underlying human inherited disease. PMID:21853507
Molecular and Cellular Mechanisms for the Interaction between Gold Nanoparticles and Neuroimmune Cells Based on Size, Shape, and Charge

DTIC Science & Technology

2014-04-25

IgG secretion. 2.3 Designing of Synthetic peptide The immunogenic peptides against the foot and mouth disease virus ( FMDV ) were designed and...synthesized based on viral protein 1 of type O FMDV . The amino acid sequence for pFMDV is NGSSKYGDTSTNNVRGDLQVLAQKAERTLC. An extra cysteine was added...peptides were synthesized based on the amino acid sequence of the VP1 coat protein of the FMDV (table 1). The peptide pFMDVD (19 amino acids in length
ProDeGe: A computational protocol for fully automated decontamination of genomes

DOE PAGES

Tennessen, Kristin; Andersen, Evan; Clingenpeel, Scott; ...

2015-06-09

Single amplified genomes and genomes assembled from metagenomes have enabled the exploration of uncultured microorganisms at an unprecedented scale. However, both these types of products are plagued by contamination. Since these genomes are now being generated in a high-throughput manner and sequences from them are propagating into public databases to drive novel scientific discoveries, rigorous quality controls and decontamination protocols are urgently needed. Here, we present ProDeGe (Protocol for fully automated Decontamination of Genomes), the first computational protocol for fully automated decontamination of draft genomes. ProDeGe classifies sequences into two classes—clean and contaminant—using a combination of homology and feature-based methodologies.more » On average, 84% of sequence from the non-target organism is removed from the data set (specificity) and 84% of the sequence from the target organism is retained (sensitivity). Lastly, the procedure operates successfully at a rate of ~0.30 CPU core hours per megabase of sequence and can be applied to any type of genome sequence.« less

ProDeGe: A computational protocol for fully automated decontamination of genomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tennessen, Kristin; Andersen, Evan; Clingenpeel, Scott

Single amplified genomes and genomes assembled from metagenomes have enabled the exploration of uncultured microorganisms at an unprecedented scale. However, both these types of products are plagued by contamination. Since these genomes are now being generated in a high-throughput manner and sequences from them are propagating into public databases to drive novel scientific discoveries, rigorous quality controls and decontamination protocols are urgently needed. Here, we present ProDeGe (Protocol for fully automated Decontamination of Genomes), the first computational protocol for fully automated decontamination of draft genomes. ProDeGe classifies sequences into two classes—clean and contaminant—using a combination of homology and feature-based methodologies.more » On average, 84% of sequence from the non-target organism is removed from the data set (specificity) and 84% of the sequence from the target organism is retained (sensitivity). Lastly, the procedure operates successfully at a rate of ~0.30 CPU core hours per megabase of sequence and can be applied to any type of genome sequence.« less
Comparison of the nucleotide and amino acid sequences of the RsrI and EcoRI restriction endonucleases.

PubMed

Stephenson, F H; Ballard, B T; Boyer, H W; Rosenberg, J M; Greene, P J

1989-12-21

The RsrI endonuclease, a type-II restriction endonuclease (ENase) found in Rhodobacter sphaeroides, is an isoschizomer of the EcoRI ENase. A clone containing an 11-kb BamHI fragment was isolated from an R. sphaeroides genomic DNA library by hybridization with synthetic oligodeoxyribonucleotide probes based on the N-terminal amino acid (aa) sequence of RsrI. Extracts of E. coli containing a subclone of the 11-kb fragment display RsrI activity. Nucleotide sequence analysis reveals an 831-bp open reading frame encoding a polypeptide of 277 aa. A 50% identity exists within a 266-aa overlap between the deduced aa sequences of RsrI and EcoRI. Regions of 75-100% aa sequence identity correspond to key structural and functional regions of EcoRI. The type-II ENases have many common properties, and a common origin might have been expected. Nevertheless, this is the first demonstration of aa sequence similarity between ENases produced by different organisms.
16S-23S rRNA gene internal transcribed spacer sequences for analysis of the phylogenetic relationships among species of the genus Porphyromonas.

PubMed

Conrads, Georg; Citron, Diane M; Tyrrell, Kerin L; Horz, Hans-Peter; Goldstein, Ellie J C

2005-03-01

The 16S-23S rRNA gene internal transcribed spacer (ITS) regions of 11 reference strains of Porphyromonas species, together with Bacteroides distasonis and Tannerella forsythensis, were analysed to examine interspecies relationships. Compared with the phylogenetic tree generated using 16S rRNA gene sequences, the resolution of the ITS sequence-based tree was higher, but species positioning and clustering were similar with both approaches. The recent separation of Porphyromonas gulae and Porphyromonas gingivalis into distinct species was confirmed by the ITS data. In addition, analysis of the ITS sequences of 24 clinical isolates of Porphyromonas asaccharolytica plus the type strain ATCC 25260(T) divided the sequences into two clusters, of which one was alpha-fucosidase-positive (like the type strain) while the other was alpha-fucosidase-negative. The latter resembled the previously studied unusual extra-oral isolates of 'Porphyromonas endodontalis-like organisms' (PELOs) which could therefore be called 'Porphyromonas asaccharolytica-like organisms' (PALOs), based on the genetic identification. Moreover, the proposal of alpha-fucosidase-negative P. asaccharolytica strains as a new species should also be considered.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-07-21

A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.
Recent progress in the research about Propionibacterium acnes strain diversity and acne: pathogen or bystander?

PubMed

Kwon, Hyuck Hoon; Suh, Dae Hun

2016-11-01

Recent progress has steadily reported the existence of the diverse strains of Propionibacterium acnes, and these studies have contributed to the elucidation of their contradictory roles between normal commensals and pathogens. In this review, the authors aimed to provide an update on the recent understanding of research about P. acnes strain diversity and acne, analyzing the potential implications for clinical applications. Before the era of genomic research, P. acnes was known to be distinguished based on serological agglutination tests, cell wall sugar analysis, or fermentation traits. Since the complete genome sequence of P. acnes was first deciphered, genetic studies based on sequence data have expanded with the introduction of more refined and precise DNA-based typing methods, including multilocus sequence typing and metagenomics. These sophisticated techniques have revealed that P. acnes consists of phylogenetically distinct cluster groups with various pathogenic traits, including elicitation of inflammation, protein secretome profile, and unique distribution patterns in various skin loci. In following large-scale studies from patients' acne samples have revealed that specific sequence types are included within the phylogenetic divisions and further suggested that particular P. acnes strains play an etiologic role in acne while others are associated with health, providing a firm platform for evidential-based research into the exact role of this organism in acne. We strongly believe that future research would provide fruitful results in not only clarifying the apparent controversy with respect to roles of P. acnes but also developing therapeutic drugs by pinpointing specific targets of the pathogenic strain only. © 2016 The International Society of Dermatology.
Single-Molecule Electrical Random Resequencing of DNA and RNA

NASA Astrophysics Data System (ADS)

Ohshiro, Takahito; Matsubara, Kazuki; Tsutsui, Makusu; Furuhashi, Masayuki; Taniguchi, Masateru; Kawai, Tomoji

2012-07-01

Two paradigm shifts in DNA sequencing technologies--from bulk to single molecules and from optical to electrical detection--are expected to realize label-free, low-cost DNA sequencing that does not require PCR amplification. It will lead to development of high-throughput third-generation sequencing technologies for personalized medicine. Although nanopore devices have been proposed as third-generation DNA-sequencing devices, a significant milestone in these technologies has been attained by demonstrating a novel technique for resequencing DNA using electrical signals. Here we report single-molecule electrical resequencing of DNA and RNA using a hybrid method of identifying single-base molecules via tunneling currents and random sequencing. Our method reads sequences of nine types of DNA oligomers. The complete sequence of 5'-UGAGGUA-3' from the let-7 microRNA family was also identified by creating a composite of overlapping fragment sequences, which was randomly determined using tunneling current conducted by single-base molecules as they passed between a pair of nanoelectrodes.
Dual signal amplification for highly sensitive electrochemical detection of uropathogens via enzyme-based catalytic target recycling.

PubMed

Su, Jiao; Zhang, Haijie; Jiang, Bingying; Zheng, Huzhi; Chai, Yaqin; Yuan, Ruo; Xiang, Yun

2011-11-15

We report an ultrasensitive electrochemical approach for the detection of uropathogen sequence-specific DNA target. The sensing strategy involves a dual signal amplification process, which combines the signal enhancement by the enzymatic target recycling technique with the sensitivity improvement by the quantum dot (QD) layer-by-layer (LBL) assembled labels. The enzyme-based catalytic target DNA recycling process results in the use of each target DNA sequence for multiple times and leads to direct amplification of the analytical signal. Moreover, the LBL assembled QD labels can further enhance the sensitivity of the sensing system. The coupling of these two effective signal amplification strategies thus leads to low femtomolar (5fM) detection of the target DNA sequences. The proposed strategy also shows excellent discrimination between the target DNA and the single-base mismatch sequences. The advantageous intrinsic sequence-independent property of exonuclease III over other sequence-dependent enzymes makes our new dual signal amplification system a general sensing platform for monitoring ultralow level of various types of target DNA sequences. Copyright © 2011 Elsevier B.V. All rights reserved.
Targeted therapy according to next generation sequencing-based panel sequencing.

PubMed

Saito, Motonobu; Momma, Tomoyuki; Kono, Koji

2018-04-17

Targeted therapy against actionable gene mutations shows a significantly higher response rate as well as longer survival compared to conventional chemotherapy, and has become a standard therapy for many cancers. Recent progress in next-generation sequencing (NGS) has enabled to identify huge number of genetic aberrations. Based on sequencing results, patients recommend to undergo targeted therapy or immunotherapy. In cases where there are no available approved drugs for the genetic mutations detected in the patients, it is recommended to be facilitate the registration for the clinical trials. For that purpose, a NGS-based sequencing panel that can simultaneously target multiple genes in a single investigation has been used in daily clinical practice. To date, various types of sequencing panels have been developed to investigate genetic aberrations with tumor somatic genome variants (gain-of-function or loss-of-function mutations, high-level copy number alterations, and gene fusions) through comprehensive bioinformatics. Because sequencing panels are efficient and cost-effective, they are quickly being adopted outside the lab, in hospitals and clinics, in order to identify personal targeted therapy for individual cancer patients.
'DNA Strider': a 'C' program for the fast analysis of DNA and protein sequences on the Apple Macintosh family of computers.

PubMed Central

Marck, C

1988-01-01

DNA Strider is a new integrated DNA and Protein sequence analysis program written with the C language for the Macintosh Plus, SE and II computers. It has been designed as an easy to learn and use program as well as a fast and efficient tool for the day-to-day sequence analysis work. The program consists of a multi-window sequence editor and of various DNA and Protein analysis functions. The editor may use 4 different types of sequences (DNA, degenerate DNA, RNA and one-letter coded protein) and can handle simultaneously 6 sequences of any type up to 32.5 kB each. Negative numbering of the bases is allowed for DNA sequences. All classical restriction and translation analysis functions are present and can be performed in any order on any open sequence or part of a sequence. The main feature of the program is that the same analysis function can be repeated several times on different sequences, thus generating multiple windows on the screen. Many graphic capabilities have been incorporated such as graphic restriction map, hydrophobicity profile and the CAI plot- codon adaptation index according to Sharp and Li. The restriction sites search uses a newly designed fast hexamer look-ahead algorithm. Typical runtime for the search of all sites with a library of 130 restriction endonucleases is 1 second per 10,000 bases. The circular graphic restriction map of the pBR322 plasmid can be therefore computed from its sequence and displayed on the Macintosh Plus screen within 2 seconds and its multiline restriction map obtained in a scrolling window within 5 seconds. PMID:2832831
Comparison of Sanger and next generation sequencing performance for genotyping Cryptosporidium isolates at the 18S rRNA and actin loci.

PubMed

Paparini, Andrea; Gofton, Alexander; Yang, Rongchang; White, Nicole; Bunce, Michael; Ryan, Una M

2015-01-01

Cryptosporidium is an important enteric pathogen that infects a wide range of humans and animals. Rapid and reliable detection and characterisation methods are essential for understanding the transmission dynamics of the parasite. Sanger sequencing, and high-throughput sequencing (HTS) on an Ion Torrent platform, were compared with each other for their sensitivity and accuracy in detecting and characterising 25 Cryptosporidium-positive human and animal faecal samples. Ion Torrent reads (n = 123,857) were obtained at both 18S rRNA and actin loci for 21 of the 25 samples. Of these, one isolate at the actin locus (Cattle 05) and three at the 18S rRNA locus (HTS 10, HTS 11 and HTS 12), suffered PCR drop-out (i.e. PCR failures) when using fusion-tagged PCR. Sanger sequences were obtained for both loci for 23 of the 25 samples and showed good agreement with Ion Torrent-based genotyping. Two samples both from pythons (SK 02 and SK 05) produced mixed 18S and actin chromatograms by Sanger sequencing but were clearly identified by Ion Torrent sequencing as C. muris. One isolate (SK 03) was typed as C. muris by Sanger sequencing but was identified as a mixed C. muris and C. tyzzeri infection by HTS. 18S rRNA Type B sequences were identified in 4/6 C. parvum isolates when deep sequenced but were undetected in Sanger sequencing. Sanger was cheaper than Ion Torrent when sequencing a small numbers of samples, but when larger numbers of samples are considered (n = 60), the costs were comparative. Fusion-tagged amplicon based approaches are a powerful way of approaching mixtures, the only draw-back being the loss of PCR efficiency on low-template samples when using primers coupled to MID tags and adaptors. Taken together these data show that HTS has excellent potential for revealing the "true" composition of species/types in a Cryptosporidium infection, but that HTS workflows need to be carefully developed to ensure sensitivity, accuracy and contamination are controlled. Copyright © 2015 Elsevier Inc. All rights reserved.
An Active RBSE Framework to Generate Optimal Stimulus Sequences in a BCI for Spelling

NASA Astrophysics Data System (ADS)

Moghadamfalahi, Mohammad; Akcakaya, Murat; Nezamfar, Hooman; Sourati, Jamshid; Erdogmus, Deniz

2017-10-01

A class of brain computer interfaces (BCIs) employs noninvasive recordings of electroencephalography (EEG) signals to enable users with severe speech and motor impairments to interact with their environment and social network. For example, EEG based BCIs for typing popularly utilize event related potentials (ERPs) for inference. Presentation paradigm design in current ERP-based letter by letter typing BCIs typically query the user with an arbitrary subset characters. However, the typing accuracy and also typing speed can potentially be enhanced with more informed subset selection and flash assignment. In this manuscript, we introduce the active recursive Bayesian state estimation (active-RBSE) framework for inference and sequence optimization. Prior to presentation in each iteration, rather than showing a subset of randomly selected characters, the developed framework optimally selects a subset based on a query function. Selected queries are made adaptively specialized for users during each intent detection. Through a simulation-based study, we assess the effect of active-RBSE on the performance of a language-model assisted typing BCI in terms of typing speed and accuracy. To provide a baseline for comparison, we also utilize standard presentation paradigms namely, row and column matrix presentation paradigm and also random rapid serial visual presentation paradigms. The results show that utilization of active-RBSE can enhance the online performance of the system, both in terms of typing accuracy and speed.
RNA interference inhibits herpes simplex virus type 1 isolated from saliva samples and mucocutaneous lesions.

PubMed

Silva, Amanda Perse da; Lopes, Juliana Freitas; Paula, Vanessa Salete de

2014-01-01

The aim of this study was to evaluate the use of RNA interference to inhibit herpes simplex virus type-1 replication in vitro. For herpes simplex virus type-1 gene silencing, three different small interfering RNAs (siRNAs) targeting the herpes simplex virus type-1 UL39 gene (sequence si-UL 39-1, si-UL 39-2, and si-UL 39-3) were used, which encode the large subunit of ribonucleotide reductase, an essential enzyme for DNA synthesis. Herpes simplex virus type-1 was isolated from saliva samples and mucocutaneous lesions from infected patients. All mucocutaneous lesions' samples were positive for herpes simplex virus type-1 by real-time PCR and by virus isolation; all herpes simplex virus type-1 from saliva samples were positive by real-time PCR and 50% were positive by virus isolation. The levels of herpes simplex virus type-1 DNA remaining after siRNA treatment were assessed by real-time PCR, whose results demonstrated that the effect of siRNAs on gene expression depends on siRNA concentration. The three siRNA sequences used were able to inhibit viral replication, assessed by real-time PCR and plaque assays and among them, the sequence si-UL 39-1 was the most effective. This sequence inhibited 99% of herpes simplex virus type-1 replication. The results demonstrate that silencing herpes simplex virus type-1 UL39 expression by siRNAs effectively inhibits herpes simplex virus type-1 replication, suggesting that siRNA based antiviral strategy may be a potential therapeutic alternative. Copyright © 2014. Published by Elsevier Editora Ltda.
Molecular and comparative analysis of Salmonella enterica Senftenberg from humans and animals using PFGE, MLST and NARMS.

PubMed

Stepan, Ryan M; Sherwood, Julie S; Petermann, Shana R; Logue, Catherine M

2011-06-27

Salmonella species are recognized worldwide as a significant cause of human and animal disease. In this study the molecular profiles and characteristics of Salmonella enterica Senftenberg isolated from human cases of illness and those recovered from healthy or diagnostic cases in animals were assessed. Included in the study was a comparison with our own sequenced strain of S. Senfteberg recovered from production turkeys in North Dakota. Isolates examined in this study were subjected to antimicrobial susceptibility profiling using the National Antimicrobial Resistance Monitoring System (NARMS) panel which tested susceptibility to 15 different antimicrobial agents. The molecular profiles of all isolates were determined using Pulsed Field Gel Electrophoresis (PFGE) and the sequence types of the strains were obtained using Multi-Locus Sequence Type (MLST) analysis based on amplification and sequence interrogation of seven housekeeping genes (aroC, dnaN, hemD, hisD, purE, sucA, and thrA). PFGE data was input into BioNumerics analysis software to generate a dendrogram of relatedness among the strains. The study found 93 profiles among 98 S. Senftenberg isolates tested and there were primarily two sequence types associated with humans and animals (ST185 and ST14) with overlap observed in all host types suggesting that the distribution of S. Senftenberg sequence types is not host dependent. Antimicrobial resistance was observed among the animal strains, however no resistance was detected in human isolates suggesting that animal husbandry has a significant influence on the selection and promotion of antimicrobial resistance. The data demonstrates the circulation of at least two strain types in both animal and human health suggesting that S. Senftenberg is relatively homogeneous in its distribution. The data generated in this study could be used towards defining a pathotype for this serovar.
Molecular Epidemiology of Carbapenem-Resistant Acinetobacter baumannii Isolates in the Gulf Cooperation Council States: Dominance of OXA-23-Type Producers

PubMed Central

Sartor, Anna L.; Sidjabat, Hanna E.; Balkhy, Hanan H.; Walsh, Timothy R.; Al Johani, Sameera M.; AlJindan, Reem Y.; Alfaresi, Mubarak; Ibrahim, Emad; Al-Jardani, Amina; Al Salman, Jameela; Dashti, Ali A.; Johani, Khalid; Paterson, David L.

2015-01-01

The molecular epidemiology and mechanisms of resistance of carbapenem-resistant Acinetobacter baumannii (CRAB) were determined in hospitals in the states of the Cooperation Council for the Arab States of the Gulf (Gulf Cooperation Council [GCC]), namely, Saudi Arabia, United Arab Emirates, Oman, Qatar, Bahrain, and Kuwait. Isolates were subjected to PCR-based detection of antibiotic resistance genes and repetitive sequence-based PCR (rep-PCR) assessments of clonality. Selected isolates were subjected to multilocus sequence typing (MLST). We investigated 117 isolates resistant to carbapenem antibiotics (either imipenem or meropenem). All isolates were positive for OXA-51. The most common carbapenemases were the OXA-23-type, found in 107 isolates, followed by OXA-40-type (OXA-24-type), found in 5 isolates; 3 isolates carried the ISAba1 element upstream of blaOXA-51-type. No OXA-58-type, NDM-type, VIM-type, or IMP-type producers were detected. Multiple clones were detected with 16 clusters of clonally related CRAB. Some clusters involved hospitals in different states. MLST analysis of 15 representative isolates from different clusters identified seven different sequence types (ST195, ST208, ST229, ST436, ST450, ST452, and ST499), as well as three novel STs. The vast majority (84%) of the isolates in this study were associated with health care exposure. Awareness of multidrug-resistant organisms in GCC states has important implications for optimizing infection control practices; establishing antimicrobial stewardship programs within hospital, community, and agricultural settings; and emphasizing the need for establishing regional active surveillance systems. This will help to control the spread of CRAB in the Middle East and in hospitals accommodating transferred patients from this region. PMID:25568439
Molecular epidemiology of carbapenem-resistant Acinetobacter baumannii isolates in the Gulf Cooperation Council States: dominance of OXA-23-type producers.

PubMed

Zowawi, Hosam M; Sartor, Anna L; Sidjabat, Hanna E; Balkhy, Hanan H; Walsh, Timothy R; Al Johani, Sameera M; AlJindan, Reem Y; Alfaresi, Mubarak; Ibrahim, Emad; Al-Jardani, Amina; Al Salman, Jameela; Dashti, Ali A; Johani, Khalid; Paterson, David L

2015-03-01

The molecular epidemiology and mechanisms of resistance of carbapenem-resistant Acinetobacter baumannii (CRAB) were determined in hospitals in the states of the Cooperation Council for the Arab States of the Gulf (Gulf Cooperation Council [GCC]), namely, Saudi Arabia, United Arab Emirates, Oman, Qatar, Bahrain, and Kuwait. Isolates were subjected to PCR-based detection of antibiotic resistance genes and repetitive sequence-based PCR (rep-PCR) assessments of clonality. Selected isolates were subjected to multilocus sequence typing (MLST). We investigated 117 isolates resistant to carbapenem antibiotics (either imipenem or meropenem). All isolates were positive for OXA-51. The most common carbapenemases were the OXA-23-type, found in 107 isolates, followed by OXA-40-type (OXA-24-type), found in 5 isolates; 3 isolates carried the ISAba1 element upstream of blaOXA-51-type. No OXA-58-type, NDM-type, VIM-type, or IMP-type producers were detected. Multiple clones were detected with 16 clusters of clonally related CRAB. Some clusters involved hospitals in different states. MLST analysis of 15 representative isolates from different clusters identified seven different sequence types (ST195, ST208, ST229, ST436, ST450, ST452, and ST499), as well as three novel STs. The vast majority (84%) of the isolates in this study were associated with health care exposure. Awareness of multidrug-resistant organisms in GCC states has important implications for optimizing infection control practices; establishing antimicrobial stewardship programs within hospital, community, and agricultural settings; and emphasizing the need for establishing regional active surveillance systems. This will help to control the spread of CRAB in the Middle East and in hospitals accommodating transferred patients from this region. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Genetic variation and DNA fingerprinting of durian types in Malaysia using simple sequence repeat (SSR) markers.

PubMed

Siew, Ging Yang; Ng, Wei Lun; Tan, Sheau Wei; Alitheen, Noorjahan Banu; Tan, Soon Guan; Yeap, Swee Keong

2018-01-01

Durian ( Durio zibethinus ) is one of the most popular tropical fruits in Asia. To date, 126 durian types have been registered with the Department of Agriculture in Malaysia based on phenotypic characteristics. Classification based on morphology is convenient, easy, and fast but it suffers from phenotypic plasticity as a direct result of environmental factors and age. To overcome the limitation of morphological classification, there is a need to carry out genetic characterization of the various durian types. Such data is important for the evaluation and management of durian genetic resources in producing countries. In this study, simple sequence repeat (SSR) markers were used to study the genetic variation in 27 durian types from the germplasm collection of Universiti Putra Malaysia. Based on DNA sequences deposited in Genbank, seven pairs of primers were successfully designed to amplify SSR regions in the durian DNA samples. High levels of variation among the 27 durian types were observed (expected heterozygosity, H E = 0.35). The DNA fingerprinting power of SSR markers revealed by the combined probability of identity (PI) of all loci was 2.3×10 -3 . Unique DNA fingerprints were generated for 21 out of 27 durian types using five polymorphic SSR markers (the other two SSR markers were monomorphic). We further tested the utility of these markers by evaluating the clonal status of shared durian types from different germplasm collection sites, and found that some were not clones. The findings in this preliminary study not only shows the feasibility of using SSR markers for DNA fingerprinting of durian types, but also challenges the current classification of durian types, e.g., on whether the different types should be called "clones", "varieties", or "cultivars". Such matters have a direct impact on the regulation and management of durian genetic resources in the region.
Genetic variation and DNA fingerprinting of durian types in Malaysia using simple sequence repeat (SSR) markers

PubMed Central

Siew, Ging Yang; Tan, Sheau Wei; Tan, Soon Guan; Yeap, Swee Keong

2018-01-01

Durian (Durio zibethinus) is one of the most popular tropical fruits in Asia. To date, 126 durian types have been registered with the Department of Agriculture in Malaysia based on phenotypic characteristics. Classification based on morphology is convenient, easy, and fast but it suffers from phenotypic plasticity as a direct result of environmental factors and age. To overcome the limitation of morphological classification, there is a need to carry out genetic characterization of the various durian types. Such data is important for the evaluation and management of durian genetic resources in producing countries. In this study, simple sequence repeat (SSR) markers were used to study the genetic variation in 27 durian types from the germplasm collection of Universiti Putra Malaysia. Based on DNA sequences deposited in Genbank, seven pairs of primers were successfully designed to amplify SSR regions in the durian DNA samples. High levels of variation among the 27 durian types were observed (expected heterozygosity, HE = 0.35). The DNA fingerprinting power of SSR markers revealed by the combined probability of identity (PI) of all loci was 2.3×10−3. Unique DNA fingerprints were generated for 21 out of 27 durian types using five polymorphic SSR markers (the other two SSR markers were monomorphic). We further tested the utility of these markers by evaluating the clonal status of shared durian types from different germplasm collection sites, and found that some were not clones. The findings in this preliminary study not only shows the feasibility of using SSR markers for DNA fingerprinting of durian types, but also challenges the current classification of durian types, e.g., on whether the different types should be called “clones”, “varieties”, or “cultivars”. Such matters have a direct impact on the regulation and management of durian genetic resources in the region. PMID:29511604
BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results.

PubMed

Worley, K C; Wiese, B A; Smith, R F

1995-09-01

BEAUTY (BLAST enhanced alignment utility) is an enhanced version of the NCBI's BLAST data base search tool that facilitates identification of the functions of matched sequences. We have created new data bases of conserved regions and functional domains for protein sequences in NCBI's Entrez data base, and BEAUTY allows this information to be incorporated directly into BLAST search results. A Conserved Regions Data Base, containing the locations of conserved regions within Entrez protein sequences, was constructed by (1) clustering the entire data base into families, (2) aligning each family using our PIMA multiple sequence alignment program, and (3) scanning the multiple alignments to locate the conserved regions within each aligned sequence. A separate Annotated Domains Data Base was constructed by extracting the locations of all annotated domains and sites from sequences represented in the Entrez, PROSITE, BLOCKS, and PRINTS data bases. BEAUTY performs a BLAST search of those Entrez sequences with conserved regions and/or annotated domains. BEAUTY then uses the information from the Conserved Regions and Annotated Domains data bases to generate, for each matched sequence, a schematic display that allows one to directly compare the relative locations of (1) the conserved regions, (2) annotated domains and sites, and (3) the locally aligned regions matched in the BLAST search. In addition, BEAUTY search results include World-Wide Web hypertext links to a number of external data bases that provide a variety of additional types of information on the function of matched sequences. This convenient integration of protein families, conserved regions, annotated domains, alignment displays, and World-Wide Web resources greatly enhances the biological informativeness of sequence similarity searches. BEAUTY searches can be performed remotely on our system using the "BCM Search Launcher" World-Wide Web pages (URL is < http:/ /gc.bcm.tmc.edu:8088/ search-launcher/launcher.html > ).
Longitudinal stability of MRI for mapping brain change using tensor-based morphometry.

PubMed

Leow, Alex D; Klunder, Andrea D; Jack, Clifford R; Toga, Arthur W; Dale, Anders M; Bernstein, Matt A; Britson, Paula J; Gunter, Jeffrey L; Ward, Chadwick P; Whitwell, Jennifer L; Borowski, Bret J; Fleisher, Adam S; Fox, Nick C; Harvey, Danielle; Kornak, John; Schuff, Norbert; Studholme, Colin; Alexander, Gene E; Weiner, Michael W; Thompson, Paul M

2006-06-01

Measures of brain change can be computed from sequential MRI scans, providing valuable information on disease progression, e.g., for patient monitoring and drug trials. Tensor-based morphometry (TBM) creates maps of these brain changes, visualizing the 3D profile and rates of tissue growth or atrophy, but its sensitivity depends on the contrast and geometric stability of the images. As part of the Alzheimer's Disease Neuroimaging Initiative (ADNI), 17 normal elderly subjects were scanned twice (at a 2-week interval) with several 3D 1.5 T MRI pulse sequences: high and low flip angle SPGR/FLASH (from which Synthetic T1 images were generated), MP-RAGE, IR-SPGR (N = 10) and MEDIC (N = 7) scans. For each subject and scan type, a 3D deformation map aligned baseline and follow-up scans, computed with a nonlinear, inverse-consistent elastic registration algorithm. Voxelwise statistics, in ICBM stereotaxic space, visualized the profile of mean absolute change and its cross-subject variance; these maps were then compared using permutation testing. Image stability depended on: (1) the pulse sequence; (2) the transmit/receive coil type (birdcage versus phased array); (3) spatial distortion corrections (using MEDIC sequence information); (4) B1-field intensity inhomogeneity correction (using N3). SPGR/FLASH images acquired using a birdcage coil had least overall deviation. N3 correction reduced coil type and pulse sequence differences and improved scan reproducibility, except for Synthetic T1 images (which were intrinsically corrected for B1-inhomogeneity). No strong evidence favored B0 correction. Although SPGR/FLASH images showed least deviation here, pulse sequence selection for the ADNI project was based on multiple additional image analyses, to be reported elsewhere.

Longitudinal stability of MRI for mapping brain change using tensor-based morphometry

PubMed Central

Leow, Alex D.; Klunder, Andrea D.; Jack, Clifford R.; Toga, Arthur W.; Dale, Anders M.; Bernstein, Matt A.; Britson, Paula J.; Gunter, Jeffrey L.; Ward, Chadwick P.; Whitwell, Jennifer L.; Borowski, Bret J.; Fleisher, Adam S.; Fox, Nick C.; Harvey, Danielle; Kornak, John; Schuff, Norbert; Studholme, Colin; Alexander, Gene E.; Weiner, Michael W.; Thompson, Paul M.

2007-01-01

Measures of brain change can be computed from sequential MRI scans, providing valuable information on disease progression, e.g., for patient monitoring and drug trials. Tensor-based morphometry (TBM) creates maps of these brain changes, visualizing the 3D profile and rates of tissue growth or atrophy, but its sensitivity depends on the contrast and geometric stability of the images. A s part of the Alzheimer’s Disease Neuroimaging Initiative (ADNI), 17 normal elderly subjects were scanned twice (at a 2-week interval) with several 3D 1.5 T MRI pulse sequences: high and low flip angle SPGR/FLASH (from which Synthetic T1 images were generated), MP-RAGE, IR-SPGR (N = 10) and MEDIC (N = 7) scans. For each subject and scan type, a 3D deformation map aligned baseline and follow-up scans, computed with a nonlinear, inverse-consistent elastic registration algorithm. Voxelwise statistics, in ICBM stereotaxic space, visualized the profile of mean absolute change and its cross-subject variance; these maps were then compared using permutation testing. Image stability depended on: (1) the pulse sequence; (2) the transmit/receive coil type (birdcage versus phased array); (3) spatial distortion corrections (using MEDIC sequence information); (4) B1-field intensity inhomogeneity correction (using N3). SPGR/FLASH images acquired using a birdcage coil had least overall deviation. N3 correction reduced coil type and pulse sequence differences and improved scan reproducibility, except for Synthetic T1 images (which were intrinsically corrected for B1-inhomogeneity). No strong evidence favored B0 correction. Although SPGR/FLASH images showed least deviation here, pulse sequence selection for the ADNI project was based on multiple additional image analyses, to be reported elsewhere. PMID:16480900
Diversity Analysis of Dairy and Nondairy Lactococcus lactis Isolates, Using a Novel Multilocus Sequence Analysis Scheme and (GTG)5-PCR Fingerprinting▿

PubMed Central

Rademaker, Jan L. W.; Herbet, Hélène; Starrenburg, Marjo J. C.; Naser, Sabri M.; Gevers, Dirk; Kelly, William J.; Hugenholtz, Jeroen; Swings, Jean; van Hylckama Vlieg, Johan E. T.

2007-01-01

The diversity of a collection of 102 lactococcus isolates including 91 Lactococcus lactis isolates of dairy and nondairy origin was explored using partial small subunit rRNA gene sequence analysis and limited phenotypic analyses. A subset of 89 strains of L. lactis subsp. cremoris and L. lactis subsp. lactis isolates was further analyzed by (GTG)5-PCR fingerprinting and a novel multilocus sequence analysis (MLSA) scheme. Two major genomic lineages within L. lactis were found. The L. lactis subsp. cremoris type-strain-like genotype lineage included both L. lactis subsp. cremoris and L. lactis subsp. lactis isolates. The other major lineage, with a L. lactis subsp. lactis type-strain-like genotype, comprised L. lactis subsp. lactis isolates only. A novel third genomic lineage represented two L. lactis subsp. lactis isolates of nondairy origin. The genomic lineages deviate from the subspecific classification of L. lactis that is based on a few phenotypic traits only. MLSA of six partial genes (atpA, encoding ATP synthase alpha subunit; pheS, encoding phenylalanine tRNA synthetase; rpoA, encoding RNA polymerase alpha chain; bcaT, encoding branched chain amino acid aminotransferase; pepN, encoding aminopeptidase N; and pepX, encoding X-prolyl dipeptidyl peptidase) revealed 363 polymorphic sites (total length, 1,970 bases) among 89 L. lactis subsp. cremoris and L. lactis subsp. lactis isolates with unique sequence types for most isolates. This allowed high-resolution cluster analysis in which dairy isolates form subclusters of limited diversity within the genomic lineages. The pheS DNA sequence analysis yielded two genetic groups dissimilar to the other genotyping analysis-based lineages, indicating a disparate acquisition route for this gene. PMID:17890345
Diversity analysis of dairy and nondairy Lactococcus lactis isolates, using a novel multilocus sequence analysis scheme and (GTG)5-PCR fingerprinting.

PubMed

Rademaker, Jan L W; Herbet, Hélène; Starrenburg, Marjo J C; Naser, Sabri M; Gevers, Dirk; Kelly, William J; Hugenholtz, Jeroen; Swings, Jean; van Hylckama Vlieg, Johan E T

2007-11-01

The diversity of a collection of 102 lactococcus isolates including 91 Lactococcus lactis isolates of dairy and nondairy origin was explored using partial small subunit rRNA gene sequence analysis and limited phenotypic analyses. A subset of 89 strains of L. lactis subsp. cremoris and L. lactis subsp. lactis isolates was further analyzed by (GTG)(5)-PCR fingerprinting and a novel multilocus sequence analysis (MLSA) scheme. Two major genomic lineages within L. lactis were found. The L. lactis subsp. cremoris type-strain-like genotype lineage included both L. lactis subsp. cremoris and L. lactis subsp. lactis isolates. The other major lineage, with a L. lactis subsp. lactis type-strain-like genotype, comprised L. lactis subsp. lactis isolates only. A novel third genomic lineage represented two L. lactis subsp. lactis isolates of nondairy origin. The genomic lineages deviate from the subspecific classification of L. lactis that is based on a few phenotypic traits only. MLSA of six partial genes (atpA, encoding ATP synthase alpha subunit; pheS, encoding phenylalanine tRNA synthetase; rpoA, encoding RNA polymerase alpha chain; bcaT, encoding branched chain amino acid aminotransferase; pepN, encoding aminopeptidase N; and pepX, encoding X-prolyl dipeptidyl peptidase) revealed 363 polymorphic sites (total length, 1,970 bases) among 89 L. lactis subsp. cremoris and L. lactis subsp. lactis isolates with unique sequence types for most isolates. This allowed high-resolution cluster analysis in which dairy isolates form subclusters of limited diversity within the genomic lineages. The pheS DNA sequence analysis yielded two genetic groups dissimilar to the other genotyping analysis-based lineages, indicating a disparate acquisition route for this gene.
Predicting turns in proteins with a unified model.

PubMed

Song, Qi; Li, Tonghua; Cong, Peisheng; Sun, Jiangming; Li, Dapeng; Tang, Shengnan

2012-01-01

Turns are a critical element of the structure of a protein; turns play a crucial role in loops, folds, and interactions. Current prediction methods are well developed for the prediction of individual turn types, including α-turn, β-turn, and γ-turn, etc. However, for further protein structure and function prediction it is necessary to develop a uniform model that can accurately predict all types of turns simultaneously. In this study, we present a novel approach, TurnP, which offers the ability to investigate all the turns in a protein based on a unified model. The main characteristics of TurnP are: (i) using newly exploited features of structural evolution information (secondary structure and shape string of protein) based on structure homologies, (ii) considering all types of turns in a unified model, and (iii) practical capability of accurate prediction of all turns simultaneously for a query. TurnP utilizes predicted secondary structures and predicted shape strings, both of which have greater accuracy, based on innovative technologies which were both developed by our group. Then, sequence and structural evolution features, which are profile of sequence, profile of secondary structures and profile of shape strings are generated by sequence and structure alignment. When TurnP was validated on a non-redundant dataset (4,107 entries) by five-fold cross-validation, we achieved an accuracy of 88.8% and a sensitivity of 71.8%, which exceeded the most state-of-the-art predictors of certain type of turn. Newly determined sequences, the EVA and CASP9 datasets were used as independent tests and the results we achieved were outstanding for turn predictions and confirmed the good performance of TurnP for practical applications.
Predicting Turns in Proteins with a Unified Model

PubMed Central

Song, Qi; Li, Tonghua; Cong, Peisheng; Sun, Jiangming; Li, Dapeng; Tang, Shengnan

2012-01-01

Motivation Turns are a critical element of the structure of a protein; turns play a crucial role in loops, folds, and interactions. Current prediction methods are well developed for the prediction of individual turn types, including α-turn, β-turn, and γ-turn, etc. However, for further protein structure and function prediction it is necessary to develop a uniform model that can accurately predict all types of turns simultaneously. Results In this study, we present a novel approach, TurnP, which offers the ability to investigate all the turns in a protein based on a unified model. The main characteristics of TurnP are: (i) using newly exploited features of structural evolution information (secondary structure and shape string of protein) based on structure homologies, (ii) considering all types of turns in a unified model, and (iii) practical capability of accurate prediction of all turns simultaneously for a query. TurnP utilizes predicted secondary structures and predicted shape strings, both of which have greater accuracy, based on innovative technologies which were both developed by our group. Then, sequence and structural evolution features, which are profile of sequence, profile of secondary structures and profile of shape strings are generated by sequence and structure alignment. When TurnP was validated on a non-redundant dataset (4,107 entries) by five-fold cross-validation, we achieved an accuracy of 88.8% and a sensitivity of 71.8%, which exceeded the most state-of-the-art predictors of certain type of turn. Newly determined sequences, the EVA and CASP9 datasets were used as independent tests and the results we achieved were outstanding for turn predictions and confirmed the good performance of TurnP for practical applications. PMID:23144872
Relationships between functional genes in Lactobacillus delbrueckii ssp. bulgaricus isolates and phenotypic characteristics associated with fermentation time and flavor production in yogurt elucidated using multilocus sequence typing.

PubMed

Liu, Wenjun; Yu, Jie; Sun, Zhihong; Song, Yuqin; Wang, Xueni; Wang, Hongmei; Wuren, Tuoya; Zha, Musu; Menghe, Bilige; Heping, Zhang

2016-01-01

Lactobacillus delbrueckii ssp. bulgaricus (L. bulgaricus) is well known for its worldwide application in yogurt production. Flavor production and acid producing are considered as the most important characteristics for starter culture screening. To our knowledge this is the first study applying functional gene sequence multilocus sequence typing technology to predict the fermentation and flavor-producing characteristics of yogurt-producing bacteria. In the present study, phenotypic characteristics of 35 L. bulgaricus strains were quantified during the fermentation of milk to yogurt and during its subsequent storage; these included fermentation time, acidification rate, pH, titratable acidity, and flavor characteristics (acetaldehyde concentration). Furthermore, multilocus sequence typing analysis of 7 functional genes associated with fermentation time, acid production, and flavor formation was done to elucidate the phylogeny and genetic evolution of the same L. bulgaricus isolates. The results showed that strains significantly differed in fermentation time, acidification rate, and acetaldehyde production. Combining functional gene sequence analysis with phenotypic characteristics demonstrated that groups of strains established using genotype data were consistent with groups identified based on their phenotypic traits. This study has established an efficient and rapid molecular genotyping method to identify strains with good fermentation traits; this has the potential to replace time-consuming conventional methods based on direct measurement of phenotypic traits. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Evaluation of the class II region of the major histocompatibility complex of the greyhound with the genomic matching technique and sequence-based typing.

PubMed

Fliegner, R A; Holloway, S A; Lester, S; McLure, C A; Dawkins, R L

2008-08-01

The class II region of the major histocompatibility complex was evaluated in 25 greyhounds by sequence-based typing and the genomic matching technique (GMT). Two new DLA-DRB1 alleles were identified. Twenty-four dogs carried the DLA-DRB1*01201/DQA1*00401/DQB1*01303/DQB1*01701 haplotype, which carries two DQB1 alleles. One haplotype was identified from which DQB1 and DQA1 appeared to be deleted. The GMT enabled detection of DQB1 copy number, discrimination of the different class II haplotypes and the identification of new, possibly biologically relevant polymorphisms.
Detection of distorted frames in retinal video-sequences via machine learning

NASA Astrophysics Data System (ADS)

Kolar, Radim; Liberdova, Ivana; Odstrcilik, Jan; Hracho, Michal; Tornow, Ralf P.

2017-07-01

This paper describes detection of distorted frames in retinal sequences based on set of global features extracted from each frame. The feature vector is consequently used in classification step, in which three types of classifiers are tested. The best classification accuracy 96% has been achieved with support vector machine approach.
Exploring DNA variant segregation types in pooled genome sequencing enables effective mapping of weeping trait in Malus

USDA-ARS?s Scientific Manuscript database

In recent years, next generation sequencing (NGS) based bulked segregant analysis (BSA) has become a powerful approach for allele discovery in non-model plant species. However, challenges remain, particular for out-crossing species with complex genomes. Here, the genetic control of a weeping bran...
Size and sequence polymorphisms in the glutamate-rich protein gene of the human malaria parasite Plasmodium falciparum in Thailand.

PubMed

Pattaradilokrat, Sittiporn; Trakoolsoontorn, Chawinya; Simpalipan, Phumin; Warrit, Natapot; Kaewthamasorn, Morakot; Harnyuttanakorn, Pongchai

2018-01-22

The glutamate-rich protein (GLURP) of the malaria parasite Plasmodium falciparum is a key surface antigen that serves as a component of a clinical vaccine. Moreover, the GLURP gene is also employed routinely as a genetic marker for malarial genotyping in epidemiological studies. While extensive size polymorphisms in GLURP are well recorded, the extent of the sequence diversity of this gene is rarely investigated. The present study aimed to explore the genetic diversity of GLURP in natural populations of P. falciparum. The polymorphic C-terminal repetitive R2 region of GLURP sequences from 65 P. falciparum isolates in Thailand were generated and combined with the data from 103 worldwide isolates to generate a GLURP database. The collection was comprised of 168 alleles, encoding 105 unique GLURP subtypes, characterized by 18 types of amino acid repeat units (AAU). Of these, 28 GLURP subtypes, formed by 10 AAU types, were detected in P. falciparum in Thailand. Among them, 19 GLURP subtypes and 2 AAU types are described for the first time in the Thai parasite population. The AAU sequences were highly conserved, which is likely due to negative selection. Standard Fst analysis revealed the shared distributions of GLURP types among the P. falciparum populations, providing evidence of gene flow among the different demographic populations. Sequence diversity causing size variations in GLURP in Thai P. falciparum populations were detected, and caused by non-synonymous substitutions in repeat units and some insertion/deletion of aspartic acid or glutamic acid codons between repeat units. The P. falciparum population structure based on GLURP showed promising implications for the development of GLURP-based vaccines and for monitoring vaccine efficacy.
International Life Science Institute North America Cronobacter (Formerly Enterobacter sakazakii) isolate set.

PubMed

Ivy, Reid A; Farber, Jeffrey M; Pagotto, Franco; Wiedmann, Martin

2013-01-01

Foodborne pathogen isolate collections are important for the development of detection methods, for validation of intervention strategies, and to develop an understanding of pathogenesis and virulence. We have assembled a publicly available Cronobacter (formerly Enterobacter sakazakii) isolate set that consists of (i) 25 Cronobacter sakazakii isolates, (ii) two Cronobacter malonaticus isolates, (iii) one Cronobacter muytjensii isolate, which displays some atypical phenotypic characteristics, biochemical profiles, and colony color on selected differential media, and (iv) two nonclinical Enterobacter asburiae isolates, which show some phenotypic characteristics similar to those of Cronobacter spp. The set consists of human (n = 10), food (n = 11), and environmental (n = 9) isolates. Analysis of partial 16S rDNA sequence and seven-gene multilocus sequence typing data allowed for reliable identification of these isolates to species and identification of 14 isolates as sequence type 4, which had previously been shown to be the most common C. sakazakii sequence type associated with neonatal meningitis. Phenotypic characterization was carried out with API 20E and API 32E test strips and streaking on two selective chromogenic agars; isolates were also assessed for sorbitol fermentation and growth at 45°C. Although these strategies typically produced the same classification as sequence-based strategies, based on a panel of four biochemical tests, one C. sakazakii isolate yielded inconclusive data and one was classified as C. malonaticus. EcoRI automated ribotyping and pulsed-field gel electrophoresis (PFGE) with XbaI separated the set into 23 unique ribotypes and 30 unique PFGE types, respectively, indicating subtype diversity within the set. Subtype and source data for the collection are publicly available in the PathogenTracker database (www. pathogentracker. net), which allows for continuous updating of information on the set, including links to publications that include information on isolates from this collection.
The tolerance to exchanges of the Watson–Crick base pair in the hammerhead ribozyme core is determined by surrounding elements

PubMed Central

Przybilski, Rita; Hammann, Christian

2007-01-01

Tertiary interacting elements are important features of functional RNA molecules, for example, in all small nucleolytic ribozymes. The recent crystal structure of a tertiary stabilized type I hammerhead ribozyme revealed a conventional Watson–Crick base pair in the catalytic core, formed between nucleotides C3 and G8. We show that any Watson–Crick base pair between these positions retains cleavage competence in two type III ribozymes. In the Arabidopsis thaliana sequence, only moderate differences in cleavage rates are observed for the different base pairs, while the peach latent mosaic viroid (PLMVd) ribozyme exhibits a preference for a pyrimidine at position 3 and a purine at position 8. To understand these differences, we created a series of chimeric ribozymes in which we swapped sequence elements that surround the catalytic core. The kinetic characterization of the resulting ribozymes revealed that the tertiary interacting loop sequences of the PLMVd ribozyme are sufficient to induce the preference for Y3–R8 base pairs in the A. thaliana hammerhead ribozyme. In contrast to this, only when the entire stem–loops I and II of the A. thaliana sequences are grafted on the PLMVd ribozyme is any Watson–Crick base pair similarly tolerated. The data provide evidence for a complex interplay of secondary and tertiary structure elements that lead, mediated by long-range effects, to an individual modulation of the local structure in the catalytic core of different hammerhead ribozymes. PMID:17666711
Palindromic Sequence Artifacts Generated during Next Generation Sequencing Library Preparation from Historic and Ancient DNA

PubMed Central

Star, Bastiaan; Nederbragt, Alexander J.; Hansen, Marianne H. S.; Skage, Morten; Gilfillan, Gregor D.; Bradbury, Ian R.; Pampoulie, Christophe; Stenseth, Nils Chr; Jakobsen, Kjetill S.; Jentoft, Sissel

2014-01-01

Degradation-specific processes and variation in laboratory protocols can bias the DNA sequence composition from samples of ancient or historic origin. Here, we identify a novel artifact in sequences from historic samples of Atlantic cod (Gadus morhua), which forms interrupted palindromes consisting of reverse complementary sequence at the 5′ and 3′-ends of sequencing reads. The palindromic sequences themselves have specific properties – the bases at the 5′-end align well to the reference genome, whereas extensive misalignments exists among the bases at the terminal 3′-end. The terminal 3′ bases are artificial extensions likely caused by the occurrence of hairpin loops in single stranded DNA (ssDNA), which can be ligated and amplified in particular library creation protocols. We propose that such hairpin loops allow the inclusion of erroneous nucleotides, specifically at the 3′-end of DNA strands, with the 5′-end of the same strand providing the template. We also find these palindromes in previously published ancient DNA (aDNA) datasets, albeit at varying and substantially lower frequencies. This artifact can negatively affect the yield of endogenous DNA in these types of samples and introduces sequence bias. PMID:24608104
Piscine reovirus: Genomic and molecular phylogenetic analysis from farmed and wild salmonids collected on the Canada/US Pacific Coast

USGS Publications Warehouse

Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul S.; Richmond, Zina; Purcell, Maureen K.; Johns, Robert; Johnson, Stewart C.; Sakasida, Sonja M.

2015-01-01

Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period.
Piscine Reovirus: Genomic and Molecular Phylogenetic Analysis from Farmed and Wild Salmonids Collected on the Canada/US Pacific Coast

PubMed Central

Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul; Richmond, Zina; Johns, Robert; Purcell, Maureen K.; Johnson, Stewart C.; Saksida, Sonja M.

2015-01-01

Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period. PMID:26536673
Volcanic Soils as Sources of Novel CO-Oxidizing Paraburkholderia and Burkholderia: Paraburkholderia hiiakae sp. nov., Paraburkholderia metrosideri sp. nov., Paraburkholderia paradisi sp. nov., Paraburkholderia peleae sp. nov., and Burkholderia alpina sp. nov. a Member of the Burkholderia cepacia Complex

PubMed Central

Weber, Carolyn F.; King, Gary M.

2017-01-01

Previous studies showed that members of the Burkholderiales were important in the succession of aerobic, molybdenum-dependent CO oxidizing-bacteria on volcanic soils. During these studies, four isolates were obtained from Kilauea Volcano (Hawai‘i, USA); one strain was isolated from Pico de Orizaba (Mexico) during a separate study. Based on 16S rRNA gene sequence similarities, the Pico de Orizaba isolate and the isolates from Kilauea Volcano were provisionally assigned to the genera Burkholderia and Paraburkholderia, respectively. Each of the isolates possessed a form I coxL gene that encoded the catalytic subunit of carbon monoxide dehydrogenase (CODH); none of the most closely related type strains possessed coxL or oxidized CO. Genome sequences for Paraburkholderia type strains facilitated an analysis of 16S rRNA gene sequence similarities and average nucleotide identities (ANI). ANI did not exceed 95% (the recommended cutoff for species differentiation) for any of the pairwise comparisons among 27 reference strains related to the new isolates. However, since the highest 16S rRNA gene sequence similarity among this set of reference strains was 98.93%, DNA-DNA hybridizations (DDH) were performed for two isolates whose 16S rRNA gene sequence similarities with their nearest phylogenetic neighbors were 98.96 and 99.11%. In both cases DDH values were <16%. Based on multiple variables, four of the isolates represent novel species within the Paraburkholderia: Paraburkholderia hiiakae sp. nov. (type strain I2T = DSM 28029T = LMG 27952T); Paraburkholderia paradisi sp. nov. (type strain WAT = DSM 28027T = LMG 27949T); Paraburkholderia peleae sp. nov. (type strain PP52-1T = DSM 28028T = LMG 27950T); and Paraburkholderia metrosideri sp. nov. (type strain DNBP6-1T = DSM 28030T = LMG 28140T). The remaining isolate represents the first CO-oxidizing member of the Burkholderia cepacia complex: Burkholderia alpina sp. nov. (type strain PO-04-17-38T = DSM 28031T = LMG 28138T). PMID:28270796
Volcanic Soils as Sources of Novel CO-Oxidizing Paraburkholderia and Burkholderia: Paraburkholderia hiiakae sp. nov., Paraburkholderia metrosideri sp. nov., Paraburkholderia paradisi sp. nov., Paraburkholderia peleae sp. nov., and Burkholderia alpina sp. nov. a Member of the Burkholderia cepacia Complex.

PubMed

Weber, Carolyn F; King, Gary M

2017-01-01

Previous studies showed that members of the Burkholderiales were important in the succession of aerobic, molybdenum-dependent CO oxidizing-bacteria on volcanic soils. During these studies, four isolates were obtained from Kilauea Volcano (Hawai'i, USA); one strain was isolated from Pico de Orizaba (Mexico) during a separate study. Based on 16S rRNA gene sequence similarities, the Pico de Orizaba isolate and the isolates from Kilauea Volcano were provisionally assigned to the genera Burkholderia and Paraburkholderia , respectively. Each of the isolates possessed a form I coxL gene that encoded the catalytic subunit of carbon monoxide dehydrogenase (CODH); none of the most closely related type strains possessed coxL or oxidized CO. Genome sequences for Paraburkholderia type strains facilitated an analysis of 16S rRNA gene sequence similarities and average nucleotide identities (ANI). ANI did not exceed 95% (the recommended cutoff for species differentiation) for any of the pairwise comparisons among 27 reference strains related to the new isolates. However, since the highest 16S rRNA gene sequence similarity among this set of reference strains was 98.93%, DNA-DNA hybridizations (DDH) were performed for two isolates whose 16S rRNA gene sequence similarities with their nearest phylogenetic neighbors were 98.96 and 99.11%. In both cases DDH values were <16%. Based on multiple variables, four of the isolates represent novel species within the Paraburkholderia : Paraburkholderia hiiakae sp. nov. (type strain I2 T = DSM 28029 T = LMG 27952 T ); Paraburkholderia paradisi sp. nov. (type strain WA T = DSM 28027 T = LMG 27949 T ); Paraburkholderia peleae sp. nov. (type strain PP52-1 T = DSM 28028 T = LMG 27950 T ); and Paraburkholderia metrosideri sp. nov. (type strain DNBP6-1 T = DSM 28030 T = LMG 28140 T ). The remaining isolate represents the first CO-oxidizing member of the Burkholderia cepacia complex: Burkholderia alpina sp. nov. (type strain PO-04-17-38 T = DSM 28031 T = LMG 28138 T ).
Nucleic acid analysis using terminal-phosphate-labeled nucleotides

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2008-04-22

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Assessing the Fidelity of Ancient DNA Sequences Amplified From Nuclear Genes

PubMed Central

Binladen, Jonas; Wiuf, Carsten; Gilbert, M. Thomas P.; Bunce, Michael; Barnett, Ross; Larson, Greger; Greenwood, Alex D.; Haile, James; Ho, Simon Y. W.; Hansen, Anders J.; Willerslev, Eske

2006-01-01

To date, the field of ancient DNA has relied almost exclusively on mitochondrial DNA (mtDNA) sequences. However, a number of recent studies have reported the successful recovery of ancient nuclear DNA (nuDNA) sequences, thereby allowing the characterization of genetic loci directly involved in phenotypic traits of extinct taxa. It is well documented that postmortem damage in ancient mtDNA can lead to the generation of artifactual sequences. However, as yet no one has thoroughly investigated the damage spectrum in ancient nuDNA. By comparing clone sequences from 23 fossil specimens, recovered from environments ranging from permafrost to desert, we demonstrate the presence of miscoding lesion damage in both the mtDNA and nuDNA, resulting in insertion of erroneous bases during amplification. Interestingly, no significant differences in the frequency of miscoding lesion damage are recorded between mtDNA and nuDNA despite great differences in cellular copy numbers. For both mtDNA and nuDNA, we find significant positive correlations between total sequence heterogeneity and the rates of type 1 transitions (adenine → guanine and thymine → cytosine) and type 2 transitions (cytosine → thymine and guanine → adenine), respectively. Type 2 transitions are by far the most dominant and increase relative to those of type 1 with damage load. The results suggest that the deamination of cytosine (and 5-methyl cytosine) to uracil (and thymine) is the main cause of miscoding lesions in both ancient mtDNA and nuDNA sequences. We argue that the problems presented by postmortem damage, as well as problems with contamination from exogenous sources of conserved nuclear genes, allelic variation, and the reliance on single nucleotide polymorphisms, call for great caution in studies relying on ancient nuDNA sequences. PMID:16299392
Dissemination of metallo-β-lactamase-producing Pseudomonas aeruginosa of sequence type 235 in Asian countries.

PubMed

Kim, Moon Jung; Bae, Il Kwon; Jeong, Seok Hoon; Kim, So Hyun; Song, Jae Hoon; Choi, Jae Young; Yoon, Sang Sun; Thamlikitkul, Visanu; Hsueh, Po-Ren; Yasin, Rohani Md; Lalitha, M K; Lee, Kyungwon

2013-12-01

To investigate the epidemiological traits of metallo-β-lactamase (MBL)-producing Pseudomonas aeruginosa (MPPA) clinical isolates collected by the Asian Network for Surveillance of Resistant Pathogens (ANSORP). A total of 16 MPPA clinical isolates were collected from six Asian countries in 2000 to 2009 by ANSORP. The MBL gene was detected by PCR amplification. The genetic organization of the class 1 integron carrying the MBL gene cassette was investigated by PCR mapping and sequencing. Southern blotting, repetitive sequence-based PCR and multilocus sequence typing (MLST) experiments were performed to characterize the isolates. PCR and sequencing experiments detected the blaVIM-2 (n = 12), blaVIM-3 (n = 1), blaIMP-6 (n = 2) and blaIMP-26 (n = 1) genes. The MBL genes were located on the chromosome in all isolates except one. Furthermore, all the MBL genes were located in a class 1 integron. All the MPPA isolates from Malaysia, Thailand, Sri Lanka and Korea were identified as sequence type (ST) 235 by MLST. Three VIM-2-producing isolates from India were identified as ST773, and one isolate harbouring VIM-3 from Taiwan was identified as ST298. P. aeruginosa ST235 might play a role in dissemination of MBL genes in Asian countries.

A 12-year molecular survey of clinical herpes simplex virus type 2 isolates demonstrates the circulation of clade A and B strains in Germany.

PubMed

Schmidt-Chanasit, Jonas; Bialonski, Alexandra; Heinemann, Patrick; Ulrich, Rainer G; Günther, Stephan; Rabenau, Holger F; Doerr, Hans Wilhelm

2010-07-01

Recently two different herpes simplex virus type 2 (HSV-2) clades (A and B) were described on DNA sequence data of the glycoprotein E (gE), G (gG) and I (gI) genes. To type the circulating HSV-2 wild-type strains in Germany by a novel approach and to monitor potential changes in the molecular epidemiology between 1997 and 2008. A total of 64 clinical HSV-2 isolates were analyzed by a novel approach using the DNA sequences of the complete open reading frames of glycoprotein B (gB) and gG. Recombination analysis of the gB and gG gene sequences was performed to reveal intragenic recombinants. Based on the phylogenetic analysis of the gB coding DNA sequence 8 of 64 (12%) isolates were classified as clade A strains and 56 of 64 (88%) isolates were classified as clade B strains. Analysis of the gG coding DNA sequence classified 4 (6%) isolates as clade A strains and 60 (94%) isolates as clade B strains. In comparison, the 8 isolates classified as clade A strains using the gB sequence data were classified as clade B strains when using the gG coding DNA sequence, suggesting intergenic recombination events. Intragenic recombination events were not detected. The first molecular survey of clinical HSV-2 isolates from Germany demonstrated the circulation of clade A and B strains and of intergenic recombinants over a period of 12 years. Copyright (c) 2010 Elsevier B.V. All rights reserved.
Pyrosequencing®-Based Identification of Low-Frequency Mutations Enriched Through Enhanced-ice-COLD-PCR.

PubMed

How-Kit, Alexandre; Tost, Jörg

2015-01-01

A number of molecular diagnostic assays have been developed in the last years for mutation detection. Although these methods have become increasingly sensitive, most of them are incompatible with a sequencing-based readout and require prior knowledge of the mutation present in the sample. Consequently, coamplification at low denaturation (COLD)-PCR-based methods have been developed and combine a high analytical sensitivity due to mutation enrichment in the sample with the identification of known or unknown mutations by downstream sequencing experiments. Among these methods, the recently developed Enhanced-ice-COLD-PCR appeared as the most powerful method as it outperformed the other COLD-PCR-based methods in terms of the mutation enrichment and due to the simplicity of the experimental setup of the assay. Indeed, E-ice-COLD-PCR is very versatile as it can be used on all types of PCR platforms and is applicable to different types of samples including fresh frozen, FFPE, and plasma samples. The technique relies on the incorporation of an LNA containing blocker probe in the PCR reaction followed by selective heteroduplex denaturation enabling amplification of the mutant allele while amplification of the wild-type allele is prevented. Combined with Pyrosequencing(®), which is a very quantitative high-resolution sequencing technology, E-ice-COLD-PCR can detect and identify mutations with a limit of detection down to 0.01 %.
SSAW: A new sequence similarity analysis method based on the stationary discrete wavelet transform.

PubMed

Lin, Jie; Wei, Jing; Adjeroh, Donald; Jiang, Bing-Hua; Jiang, Yue

2018-05-02

Alignment-free sequence similarity analysis methods often lead to significant savings in computational time over alignment-based counterparts. A new alignment-free sequence similarity analysis method, called SSAW is proposed. SSAW stands for Sequence Similarity Analysis using the Stationary Discrete Wavelet Transform (SDWT). It extracts k-mers from a sequence, then maps each k-mer to a complex number field. Then, the series of complex numbers formed are transformed into feature vectors using the stationary discrete wavelet transform. After these steps, the original sequence is turned into a feature vector with numeric values, which can then be used for clustering and/or classification. Using two different types of applications, namely, clustering and classification, we compared SSAW against the the-state-of-the-art alignment free sequence analysis methods. SSAW demonstrates competitive or superior performance in terms of standard indicators, such as accuracy, F-score, precision, and recall. The running time was significantly better in most cases. These make SSAW a suitable method for sequence analysis, especially, given the rapidly increasing volumes of sequence data required by most modern applications.
Are commercial providers a viable option for clinical bacterial sequencing?

PubMed

Raven, Kathy; Blane, Beth; Churcher, Carol; Parkhill, Julian; Peacock, Sharon J

2018-04-05

Bacterial whole-genome sequencing in the clinical setting has the potential to bring major improvements to infection control and clinical practice. Sequencing instruments are not currently available in the majority of routine microbiology laboratories worldwide, but an alternative is to use external sequencing providers. To foster discussion around this we investigated whether send-out services were a viable option. Four providers offering MiSeq sequencing were selected based on cost and evaluated based on the service provided and sequence data quality. DNA was prepared from five methicillin-resistant Staphylococcus aureus (MRSA) isolates, four of which were investigated during a previously published outbreak in the UK together with a reference MRSA isolate (ST22 HO 5096 0412). Cost of sequencing per isolate ranged from £155 to £342 and turnaround times from DNA postage to arrival of sequence data ranged from 12 to 63 days. Comparison of commercially generated genomes against the original sequence data demonstrated very high concordance, with no more than one single nucleotide polymorphism (SNP) difference on core genome mapping between the original sequences and the new sequence for all four providers. Multilocus sequence type could not be assigned based on assembly for the two cheapest sequence providers due to fragmented assemblies probably caused by a lower output of sequence data per isolate. Our results indicate that external providers returned highly accurate genome data, but that improvements are required in turnaround time to make this a viable option for use in clinical practice.
Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.

PubMed

Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami

2012-08-01

Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature ( http://hivdb.Stanford.edu ). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if with >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or <0.5% or >15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.
Myelin protein zero gene sequencing diagnoses Charcot-Marie-Tooth Type 1B disease

DOE Office of Scientific and Technical Information (OSTI.GOV)

Su, Y.; Zhang, H.; Madrid, R.

1994-09-01

Charcot-Marie-Tooth disease (CMT), the most common genetic neuropathy, affects about 1 in 2600 people in Norway and is found worldwide. CMT Type 1 (CMT1) has slow nerve conduction with demyelinated Schwann cells. Autosomal dominant CMT Type 1B (CMT1B) results from mutations in the myelin protein zero gene which directs the synthesis of more than half of all Schwann cell protein. This gene was mapped to the chromosome 1q22-1q23.1 borderline by fluorescence in situ hybridization. The first 7 of 7 reported CMT1B mutations are unique. Thus the most effective means to identify CMT1B mutations in at-risk family members and fetuses ismore » to sequence the entire coding sequence in dominant or sporadic CMT patients without the CMT1A duplication. Of the 19 primers used in 16 pars to uniquely amplify the entire MPZ coding sequence, 6 primer pairs were used to amplify and sequence the 6 exons. The DyeDeoxy Terminator cycle sequencing method used with four different color fluorescent lables was superior to manual sequencing because it sequences more bases unambiguously from extracted genomic DNA samples within 24 hours. This protocol was used to test 28 CMT and Dejerine-Sottas patients without CMT1A gene duplication. Sequencing MPZ gene-specific amplified fragments identified 9 polymorphic sites within the 6 exons that encode the 248 amino acid MPZ protein. The large number of major CMT1B mutations identified by single strand sequencing are being verified by reverse strand sequencing and when possible, by restriction enzyme analysis. This protocol can be used to distringuish CMT1B patients from othre CMT phenotypes and to determine the CMT1B status of relatives both presymptomatically and prenatally.« less
Influence of Geographical Origin and Flour Type on Diversity of Lactic Acid Bacteria in Traditional Belgian Sourdoughs▿ †

PubMed Central

Scheirlinck, Ilse; Van der Meulen, Roel; Van Schoor, Ann; Vancanneyt, Marc; De Vuyst, Luc; Vandamme, Peter; Huys, Geert

2007-01-01

A culture-based approach was used to investigate the diversity of lactic acid bacteria (LAB) in Belgian traditional sourdoughs and to assess the influence of flour type, bakery environment, geographical origin, and technological characteristics on the taxonomic composition of these LAB communities. For this purpose, a total of 714 LAB from 21 sourdoughs sampled at 11 artisan bakeries throughout Belgium were subjected to a polyphasic identification approach. The microbial composition of the traditional sourdoughs was characterized by bacteriological culture in combination with genotypic identification methods, including repetitive element sequence-based PCR fingerprinting and phenylalanyl-tRNA synthase (pheS) gene sequence analysis. LAB from Belgian sourdoughs belonged to the genera Lactobacillus, Pediococcus, Leuconostoc, Weissella, and Enterococcus, with the heterofermentative species Lactobacillus paralimentarius, Lactobacillus sanfranciscensis, Lactobacillus plantarum, and Lactobacillus pontis as the most frequently isolated taxa. Statistical analysis of the identification data indicated that the microbial composition of the sourdoughs is mainly affected by the bakery environment rather than the flour type (wheat, rye, spelt, or a mixture of these) used. In conclusion, the polyphasic approach, based on rapid genotypic screening and high-resolution, sequence-dependent identification, proved to be a powerful tool for studying the LAB diversity in traditional fermented foods such as sourdough. PMID:17675431
Influence of geographical origin and flour type on diversity of lactic acid bacteria in traditional Belgian sourdoughs.

PubMed

Scheirlinck, Ilse; Van der Meulen, Roel; Van Schoor, Ann; Vancanneyt, Marc; De Vuyst, Luc; Vandamme, Peter; Huys, Geert

2007-10-01

A culture-based approach was used to investigate the diversity of lactic acid bacteria (LAB) in Belgian traditional sourdoughs and to assess the influence of flour type, bakery environment, geographical origin, and technological characteristics on the taxonomic composition of these LAB communities. For this purpose, a total of 714 LAB from 21 sourdoughs sampled at 11 artisan bakeries throughout Belgium were subjected to a polyphasic identification approach. The microbial composition of the traditional sourdoughs was characterized by bacteriological culture in combination with genotypic identification methods, including repetitive element sequence-based PCR fingerprinting and phenylalanyl-tRNA synthase (pheS) gene sequence analysis. LAB from Belgian sourdoughs belonged to the genera Lactobacillus, Pediococcus, Leuconostoc, Weissella, and Enterococcus, with the heterofermentative species Lactobacillus paralimentarius, Lactobacillus sanfranciscensis, Lactobacillus plantarum, and Lactobacillus pontis as the most frequently isolated taxa. Statistical analysis of the identification data indicated that the microbial composition of the sourdoughs is mainly affected by the bakery environment rather than the flour type (wheat, rye, spelt, or a mixture of these) used. In conclusion, the polyphasic approach, based on rapid genotypic screening and high-resolution, sequence-dependent identification, proved to be a powerful tool for studying the LAB diversity in traditional fermented foods such as sourdough.
Molecular typing of methicillin-resistant Staphylococcus aureus: Comparison of PCR-based open reading frame typing, multilocus sequence typing, and Staphylococcus protein A gene typing.

PubMed

Ogihara, Shinji; Saito, Ryoichi; Sawabe, Etsuko; Kozakai, Takahiro; Shima, Mari; Aiso, Yoshibumi; Fujie, Toshihide; Nukui, Yoko; Koike, Ryuji; Hagihara, Michio; Tohda, Shuji

2018-04-01

The recently developed PCR-based open reading frame typing (POT) method is a useful molecular typing tool. Here, we evaluated the performance of POT for molecular typing of methicillin-resistant Staphylococcus aureus (MRSA) isolates and compared its performance to those of multilocus sequence typing (MLST) and Staphylococcus protein A gene typing (spa typing). Thirty-seven MRSA isolates were collected between July 2012 and May 2015. MLST, spa typing, and POT were performed, and their discriminatory powers were evaluated using Simpson's index analysis. The MRSA isolates were classified into 11, 18, and 33 types by MLST, spa typing, and POT, respectively. The predominant strains identified by MLST, spa typing, and POT were ST8 and ST764, t002, and 93-191-127, respectively. The discriminatory power of MLST, spa typing, and POT was 0.853, 0.875, and 0.992, respectively, indicating that POT had the highest discriminatory power. Moreover, the results of MLST and spa were available after 2 days, whereas that of POT was available in 5 h. Furthermore, POT is rapid and easy to perform and interpret. Therefore, POT is a superior molecular typing tool for monitoring nosocomial transmission of MRSA. Copyright © 2017 Japanese Society of Chemotherapy and The Japanese Association for Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Spiking neural network model for memorizing sequences with forward and backward recall.

PubMed

Borisyuk, Roman; Chik, David; Kazanovich, Yakov; da Silva Gomes, João

2013-06-01

We present an oscillatory network of conductance based spiking neurons of Hodgkin-Huxley type as a model of memory storage and retrieval of sequences of events (or objects). The model is inspired by psychological and neurobiological evidence on sequential memories. The building block of the model is an oscillatory module which contains excitatory and inhibitory neurons with all-to-all connections. The connection architecture comprises two layers. A lower layer represents consecutive events during their storage and recall. This layer is composed of oscillatory modules. Plastic excitatory connections between the modules are implemented using an STDP type learning rule for sequential storage. Excitatory neurons in the upper layer project star-like modifiable connections toward the excitatory lower layer neurons. These neurons in the upper layer are used to tag sequences of events represented in the lower layer. Computer simulations demonstrate good performance of the model including difficult cases when different sequences contain overlapping events. We show that the model with STDP type or anti-STDP type learning rules can be applied for the simulation of forward and backward replay of neural spikes respectively. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Analysis of the full genome of human group C rotaviruses reveals lineage diversification and reassortment.

PubMed

Medici, Maria Cristina; Tummolo, Fabio; Martella, Vito; Arcangeletti, Maria Cristina; De Conto, Flora; Chezzi, Carlo; Fehér, Enikő; Marton, Szilvia; Calderaro, Adriana; Bányai, Krisztián

2016-08-01

Group C rotaviruses (RVC) are enteric pathogens of humans and animals. Whole-genome sequences are available only for few RVCs, leaving gaps in our knowledge about their genetic diversity. We determined the full-length genome sequence of two human RVCs (PR2593/2004 and PR713/2012), detected in Italy from hospital-based surveillance for rotavirus infection in 2004 and 2012. In the 11 RNA genomic segments, the two Italian RVCs segregated within separate intra-genotypic lineages showed variation ranging from 1.9 % (VP6) to 15.9 % (VP3) at the nucleotide level. Comprehensive analysis of human RVC sequences available in the databases allowed us to reveal the existence of at least two major genome configurations, defined as type I and type II. Human RVCs of type I were all associated with the M3 VP3 genotype, including the Italian strain PR2593/2004. Conversely, human RVCs of type II were all associated with the M2 VP3 genotype, including the Italian strain PR713/2012. Reassortant RVC strains between these major genome configurations were identified. Although only a few full-genome sequences of human RVCs, mostly of Asian origin, are available, the analysis of human RVC sequences retrieved from the databases indicates that at least two intra-genotypic RVC lineages circulate in European countries. Gathering more sequence data is necessary to develop a standardized genotype and intra-genotypic lineage classification system useful for epidemiological investigations and avoiding confusion in the literature.
Transfusion strategy for weak D type 4.0 based on RHD alleles and RH haplotypes in Tunisia

PubMed Central

Ouchari, Mouna; Srivastava, Kshitij; Romdhane, Houda; Yacoub, Saloua Jemni; Flegel, Willy Albert

2017-01-01

Background With more than 460 RHD alleles, this gene is the most complex and polymorphic among all blood group systems. The Tunisian population has the largest known prevalence of weak D type 4.0 alleles, occurring in 1 of 105 RH haplotypes. We aimed to establish a rationale for the transfusion strategy of weak D type 4.0 in Tunisia. Study design and methods Donors were randomly screened for the serological weak D phenotype. The RHD coding sequence and parts of the introns were sequenced. To establish the RH haplotype, the RHCE gene was tested for characteristic single nucleotide positions. Results We determined all RHD alleles and the RH haplotypes coding for the serologic weak D phenotype among 13,431 Tunisian donations. A serologic weak D phenotype was found in 67 individuals (0.50%). Among them, 60 carried a weak D type 4 allele: 53 weak D type 4.0, 6 weak D type 4.2.2 (DAR), and 1 weak D type 4.1. Another 4 donors had 1 variant allele each: DVII, weak D type 1, weak D type 3, and weak D type 100, while 3 donors showed a normal RHD sequence. The weak D type 4.0 was most often linked to RHCE*ceVS.04.01, weak D type 4.2.2 to RHCE*ceAR, and weak D type 4.1 to RHCE*ceVS.02, while the other RHD alleles were linked to one of the common RHCE alleles. Conclusions Among the weak D phenotypes in Tunisia, no novel RHD allele was found and almost 90% were caused by alleles of the weak D type 4 cluster, of which 88% represented the weak D type 4.0 allele. Based on established RH haplotypes for variant RHD and RHCE alleles and the lack of adverse clinical reports, we recommend D positive transfusions for patients with weak D type 4.0 in Tunisia. PMID:29193104
Transfusion strategy for weak D Type 4.0 based on RHD alleles and RH haplotypes in Tunisia.

PubMed

Ouchari, Mouna; Srivastava, Kshitij; Romdhane, Houda; Jemni Yacoub, Saloua; Flegel, Willy Albert

2018-02-01

With more than 460 RHD alleles, this gene is the most complex and polymorphic among all blood group systems. The Tunisian population has the largest known prevalence of weak D Type 4.0 alleles, occurring in one of 105 RH haplotypes. We aimed to establish a rationale for the transfusion strategy of weak D Type 4.0 in Tunisia. Donors were randomly screened for the serologic weak D phenotype. The RHD coding sequence and parts of the introns were sequenced. To establish the RH haplotype, the RHCE gene was tested for characteristic single-nucleotide positions. We determined all RHD alleles and the RH haplotypes coding for the serologic weak D phenotype among 13,431 Tunisian donations. A serologic weak D phenotype was found in 67 individuals (0.50%). Among them, 60 carried a weak D Type 4 allele: 53 weak D Type 4.0, six weak D Type 4.2.2 (DAR), and one weak D Type 4.1. An additional four donors had one variant allele each: DVII, weak D Type 1, weak D Type 3, and weak D type 100, while three donors showed a normal RHD sequence. The weak D Type 4.0 was most often linked to RHCE*ceVS.04.01, weak D Type 4.2.2 to RHCE*ceAR, and weak D Type 4.1 to RHCE*ceVS.02, while the other RHD alleles were linked to one of the common RHCE alleles. Among the weak D phenotypes in Tunisia, no novel RHD allele was found and almost 90% were caused by alleles of the weak D Type 4 cluster, of which 88% represented the weak D Type 4.0 allele. Based on established RH haplotypes for variant RHD and RHCE alleles and the lack of adverse clinical reports, we recommend D+ transfusions for patients with weak D Type 4.0 in Tunisia. © 2017 AABB.
Novel division level bacterial diversity in a Yellowstone hot spring.

PubMed

Hugenholtz, P; Pitulle, C; Hershberger, K L; Pace, N R

1998-01-01

A culture-independent molecular phylogenetic survey was carried out for the bacterial community in Obsidian Pool (OP), a Yellowstone National Park hot spring previously shown to contain remarkable archaeal diversity (S. M. Barns, R. E. Fundyga, M. W. Jeffries, and N. R. Page, Proc. Natl. Acad. Sci. USA 91:1609-1613, 1994). Small-subunit rRNA genes (rDNA) were amplified directly from OP sediment DNA by PCR with universally conserved or Bacteria-specific rDNA primers and cloned. Unique rDNA types among > 300 clones were identified by restriction fragment length polymorphism, and 122 representative rDNA sequences were determined. These were found to represent 54 distinct bacterial sequence types or clusters (> or = 98% identity) of sequences. A majority (70%) of the sequence types were affiliated with 14 previously recognized bacterial divisions (main phyla; kingdoms); 30% were unaffiliated with recognized bacterial divisions. The unaffiliated sequence types (represented by 38 sequences) nominally comprise 12 novel, division level lineages termed candidate divisions. Several OP sequences were nearly identical to those of cultivated chemolithotrophic thermophiles, including the hydrogen-oxidizing Calderobacterium and the sulfate reducers Thermodesulfovibrio and Thermodesulfobacterium, or belonged to monophyletic assemblages recognized for a particular type of metabolism, such as the hydrogen-oxidizing Aquificales and the sulfate-reducing delta-Proteobacteria. The occurrence of such organisms is consistent with the chemical composition of OP (high in reduced iron and sulfur) and suggests a lithotrophic base for primary productivity in this hot spring, through hydrogen oxidation and sulfate reduction. Unexpectedly, no archaeal sequences were encountered in OP clone libraries made with universal primers. Hybridization analysis of amplified OP DNA with domain-specific probes confirmed that the analyzed community rDNA from OP sediment was predominantly bacterial. These results expand substantially our knowledge of the extent of bacterial diversity and call into question the commonly held notion that Archaea dominate hydrothermal environments. Finally, the currently known extent of division level bacterial phylogenetic diversity is collated and summarized.
Development of Pineapple Microsatellite Markers and Germplasm Genetic Diversity Analysis

PubMed Central

Tong, Helin; Chen, You; Wang, Jingyi; Chen, Yeyuan; Sun, Guangming; He, Junhu; Wu, Yaoting

2013-01-01

Two methods were used to develop pineapple microsatellite markers. Genomic library-based SSR development: using selectively amplified microsatellite assay, 86 sequences were generated from pineapple genomic library. 91 (96.8%) of the 94 Simple Sequence Repeat (SSR) loci were dinucleotide repeats (39 AC/GT repeats and 52 GA/TC repeats, accounting for 42.9% and 57.1%, resp.), and the other three were mononucleotide repeats. Thirty-six pairs of SSR primers were designed; 24 of them generated clear bands of expected sizes, and 13 of them showed polymorphism. EST-based SSR development: 5659 pineapple EST sequences obtained from NCBI were analyzed; among 1397 nonredundant EST sequences, 843 were found containing 1110 SSR loci (217 of them contained more than one SSR locus). Frequency of SSRs in pineapple EST sequences is 1SSR/3.73 kb, and 44 types were found. Mononucleotide, dinucleotide, and trinucleotide repeats dominate, accounting for 95.6% in total. AG/CT and AGC/GCT were the dominant type of dinucleotide and trinucleotide repeats, accounting for 83.5% and 24.1%, respectively. Thirty pairs of primers were designed for each of randomly selected 30 sequences; 26 of them generated clear and reproducible bands, and 22 of them showed polymorphism. Eighteen pairs of primers obtained by the one or the other of the two methods above that showed polymorphism were selected to carry out germplasm genetic diversity analysis for 48 breeds of pineapple; similarity coefficients of these breeds were between 0.59 and 1.00, and they can be divided into four groups accordingly. Amplification products of five SSR markers were extracted and sequenced, corresponding repeat loci were found and locus mutations are mainly in copy number of repeats and base mutations in the flanking region. PMID:24024187
Efficient engineering of a bacteriophage genome using the type I-E CRISPR-Cas system.

PubMed

Kiro, Ruth; Shitrit, Dror; Qimron, Udi

2014-01-01

The clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated (Cas) system has recently been used to engineer genomes of various organisms, but surprisingly, not those of bacteriophages (phages). Here we present a method to genetically engineer the Escherichia coli phage T7 using the type I-E CRISPR-Cas system. T7 phage genome is edited by homologous recombination with a DNA sequence flanked by sequences homologous to the desired location. Non-edited genomes are targeted by the CRISPR-Cas system, thus enabling isolation of the desired recombinant phages. This method broadens CRISPR Cas-based editing to phages and uses a CRISPR-Cas type other than type II. The method may be adjusted to genetically engineer any bacteriophage genome.
Selection and Validation of a Multilocus Variable-Number Tandem-Repeat Analysis Panel for Typing Shigella spp.▿ †

PubMed Central

Gorgé, Olivier; Lopez, Stéphanie; Hilaire, Valérie; Lisanti, Olivier; Ramisse, Vincent; Vergnaud, Gilles

2008-01-01

The Shigella genus has historically been separated into four species, based on biochemical assays. The classification within each species relies on serotyping. Recently, genome sequencing and DNA assays, in particular the multilocus sequence typing (MLST) approach, greatly improved the current knowledge of the origin and phylogenetic evolution of Shigella spp. The Shigella and Escherichia genera are now considered to belong to a unique genomospecies. Multilocus variable-number tandem-repeat (VNTR) analysis (MLVA) provides valuable polymorphic markers for genotyping and performing phylogenetic analyses of highly homogeneous bacterial pathogens. Here, we assess the capability of MLVA for Shigella typing. Thirty-two potentially polymorphic VNTRs were selected by analyzing in silico five Shigella genomic sequences and subsequently evaluated. Eventually, a panel of 15 VNTRs was selected (i.e., MLVA15 analysis). MLVA15 analysis of 78 strains or genome sequences of Shigella spp. and 11 strains or genome sequences of Escherichia coli distinguished 83 genotypes. Shigella population cluster analysis gave consistent results compared to MLST. MLVA15 analysis showed capabilities for E. coli typing, providing classification among pathogenic and nonpathogenic E. coli strains included in the study. The resulting data can be queried on our genotyping webpage (http://mlva.u-psud.fr). The MLVA15 assay is rapid, highly discriminatory, and reproducible for Shigella and Escherichia strains, suggesting that it could significantly contribute to epidemiological trace-back analysis of Shigella infections and pathogenic Escherichia outbreaks. Typing was performed on strains obtained mostly from collections. Further studies should include strains of much more diverse origins, including all pathogenic E. coli types. PMID:18216214
Penicillin-Binding Protein Transpeptidase Signatures for Tracking and Predicting β-Lactam Resistance Levels in Streptococcus pneumoniae

PubMed Central

Metcalf, Benjamin J.; Chochua, Sopio; Li, Zhongya; Gertz, Robert E.; Walker, Hollis; Hawkins, Paulina A.; Tran, Theresa; Whitney, Cynthia G.; McGee, Lesley; Beall, Bernard W.

2016-01-01

ABSTRACT β-Lactam antibiotics are the drugs of choice to treat pneumococcal infections. The spread of β-lactam-resistant pneumococci is a major concern in choosing an effective therapy for patients. Systematically tracking β-lactam resistance could benefit disease surveillance. Here we developed a classification system in which a pneumococcal isolate is assigned to a “PBP type” based on sequence signatures in the transpeptidase domains (TPDs) of the three critical penicillin-binding proteins (PBPs), PBP1a, PBP2b, and PBP2x. We identified 307 unique PBP types from 2,528 invasive pneumococcal isolates, which had known MICs to six β-lactams based on broth microdilution. We found that increased β-lactam MICs strongly correlated with PBP types containing divergent TPD sequences. The PBP type explained 94 to 99% of variation in MICs both before and after accounting for genomic backgrounds defined by multilocus sequence typing, indicating that genomic backgrounds made little independent contribution to β-lactam MICs at the population level. We further developed and evaluated predictive models of MICs based on PBP type. Compared to microdilution MICs, MICs predicted by PBP type showed essential agreement (MICs agree within 1 dilution) of >98%, category agreement (interpretive results agree) of >94%, a major discrepancy (sensitive isolate predicted as resistant) rate of <3%, and a very major discrepancy (resistant isolate predicted as sensitive) rate of <2% for all six β-lactams. Thus, the PBP transpeptidase signatures are robust indicators of MICs to different β-lactam antibiotics in clinical pneumococcal isolates and serve as an accurate alternative to phenotypic susceptibility testing. PMID:27302760
Combining one-step Sanger sequencing with phasing probe hybridization for HLA class I typing yields rapid, G-group resolution predicting 99% of unique full length protein sequences.

PubMed

Tu, Bin; Masaberg, Carly; Hou, Lihua; Behm, Daniel; Brescia, Peter; Cha, Nuri; Kariyawasam, Kanthi; Lee, Jar How; Nong, Thoa; Sells, John; Tausch, Paul; Yang, Ruyan; Ng, Jennifer; Hurley, Carolyn Katovich

2017-02-01

Sanger-based DNA sequencing of exons 2+3 of HLA class I alleles from a heterozygote frequently results in two or more alternative genotypes. This study was undertaken to reduce the time and effort required to produce a single high resolution HLA genotype. Samples were typed in parallel by Sanger sequencing and oligonucleotide probe hybridization. This workflow, together with optimization of analysis software, was tested and refined during the typing of over 42,000 volunteers for an unrelated hematopoietic progenitor cell donor registry. Next generation DNA sequencing (NGS) was applied to over 1000 of these samples to identify the alleles present within the G group designations. Single genotypes at G level resolution were obtained for over 95% of the loci without additional assays. The vast majority of alleles identified (>99%) were the primary allele giving the G groups their name. Only 0.7% of the alleles identified encoded protein variants that were not detected by a focus on the antigen recognition domain (ARD)-encoding exons. Our combined method routinely provides biologically relevant typing resolution at the level of the ARD. It can be applied to both single samples or to large volume typing supporting either bone marrow or solid organ transplantation using technologies currently available in many HLA laboratories. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Base-Calling Algorithm with Vocabulary (BCV) Method for Analyzing Population Sequencing Chromatograms

PubMed Central

Fantin, Yuri S.; Neverov, Alexey D.; Favorov, Alexander V.; Alvarez-Figueroa, Maria V.; Braslavskaya, Svetlana I.; Gordukova, Maria A.; Karandashova, Inga V.; Kuleshov, Konstantin V.; Myznikova, Anna I.; Polishchuk, Maya S.; Reshetov, Denis A.; Voiciehovskaya, Yana A.; Mironov, Andrei A.; Chulanov, Vladimir P.

2013-01-01

Sanger sequencing is a common method of reading DNA sequences. It is less expensive than high-throughput methods, and it is appropriate for numerous applications including molecular diagnostics. However, sequencing mixtures of similar DNA of pathogens with this method is challenging. This is important because most clinical samples contain such mixtures, rather than pure single strains. The traditional solution is to sequence selected clones of PCR products, a complicated, time-consuming, and expensive procedure. Here, we propose the base-calling with vocabulary (BCV) method that computationally deciphers Sanger chromatograms obtained from mixed DNA samples. The inputs to the BCV algorithm are a chromatogram and a dictionary of sequences that are similar to those we expect to obtain. We apply the base-calling function on a test dataset of chromatograms without ambiguous positions, as well as one with 3–14% sequence degeneracy. Furthermore, we use BCV to assemble a consensus sequence for an HIV genome fragment in a sample containing a mixture of viral DNA variants and to determine the positions of the indels. Finally, we detect drug-resistant Mycobacterium tuberculosis strains carrying frameshift mutations mixed with wild-type bacteria in the pncA gene, and roughly characterize bacterial communities in clinical samples by direct 16S rRNA sequencing. PMID:23382983

Synthetic peptides and fluorogenic substrates related to the reactive site sequence of Kunitz-type inhibitors isolated from Bauhinia: interaction with human plasma kallikrein.

PubMed

Oliva, M L; Santomauro-Vaz, E M; Andrade, S A; Juliano, M A; Pott, V J; Sampaio, M U; Sampaio, C A

2001-01-01

We have previously described Kunitz-type serine proteinase inhibitors purified from Bauhinia seeds. Human plasma kallikrein shows different susceptibility to those inhibitors. In this communication, we describe the interaction of human plasma kallikrein with fluorogenic and non-fluorogenic peptides based on the Bauhinia inhibitors' reactive site. The hydrolysis of the substrate based on the B. variegata inhibitor reactive site sequence, Abz-VVISALPRSVFIQ-EDDnp (Km 1.42 microM, kcat 0.06 s(-1), and kcat/Km 4.23 x 10(4) M(-1) s(-1)), is more favorable than that of Abz-VMIAALPRTMFIQ-EDDnp, related to the B. ungulata sequence (Km 0.43 microM, kcat 0.00017 s(-1), and kcat/Km 3.9 x 10(2) M(-1) s(-1)). Human plasma kallikrein does not hydrolyze the substrates Abz-RPGLPVRFESPL-EDDnp and Abz-FESPLRINIIKE-EDDnp based on the B. bauhinioides inhibitor reactive site sequence, the most effective inhibitor of the enzyme. These peptides are competitive inhibitors with Ki values in the nM range. The synthetic peptide containing 19 amino acids based on the B. bauhinioides inhibitor reactive site (RPGLPVRFESPL) is poorly cleaved by kallikrein. The given substrates are highly specific for trypsin and chymotrypsin hydrolysis. Other serine proteinases such as factor Xa, factor XII, thrombin and plasmin do not hydrolyze B. bauhinioides inhibitor related substrates.
Innovative anisotropic phantoms for calibration of diffusion tensor imaging sequences.

PubMed

Kłodowski, Krzysztof; Krzyżak, Artur Tadeusz

2016-05-01

The paper describes a novel type of anisotropic phantoms designed for b-matrix spatial distribution diffusion tensor imaging (BSD-DTI). Cubic plate anisotropic phantom, cylinder capillary phantom and water reference phantom are described as a complete set necessary for calibration, validation and normalization of BSD-DTI. An innovative design of the phantoms basing on enclosing the anisotropic cores in glass balls filled with liquid made for the first time possible BSD calibration with usage of echo planar imaging (EPI) sequence. Susceptibility artifacts prone to occur in EPI sequences were visibly reduced in the central region of the phantoms. The phantoms were designed for usage in a clinical scanner's head coil, but can be scaled for other coil or scanner types. The phantoms can be also used for a pre-calibration of imaging of other types of phantoms having more specific applications. Copyright © 2015 Elsevier Inc. All rights reserved.
Detecting atypical examples of known domain types by sequence similarity searching: the SBASE domain library approach.

PubMed

Dhir, Somdutta; Pacurar, Mircea; Franklin, Dino; Gáspári, Zoltán; Kertész-Farkas, Attila; Kocsor, András; Eisenhaber, Frank; Pongor, Sándor

2010-11-01

SBASE is a project initiated to detect known domain types and predicting domain architectures using sequence similarity searching (Simon et al., Protein Seq Data Anal, 5: 39-42, 1992, Pongor et al, Nucl. Acids. Res. 21:3111-3115, 1992). The current approach uses a curated collection of domain sequences - the SBASE domain library - and standard similarity search algorithms, followed by postprocessing which is based on a simple statistics of the domain similarity network (http://hydra.icgeb.trieste.it/sbase/). It is especially useful in detecting rare, atypical examples of known domain types which are sometimes missed even by more sophisticated methodologies. This approach does not require multiple alignment or machine learning techniques, and can be a useful complement to other domain detection methodologies. This article gives an overview of the project history as well as of the concepts and principles developed within this the project.
Molecular dynamics study of some non-hydrogen-bonding base pair DNA strands

NASA Astrophysics Data System (ADS)

Tiwari, Rakesh K.; Ojha, Rajendra P.; Tiwari, Gargi; Pandey, Vishnudatt; Mall, Vijaysree

2018-05-01

In order to elucidate the structural activity of hydrophobic modified DNA, the DMMO2-D5SICS, base pair is introduced as a constituent in different set of 12-mer and 14-mer DNA sequences for the molecular dynamics (MD) simulation in explicit water solvent. AMBER 14 force field was employed for each set of duplex during the 200ns production-dynamics simulation in orthogonal-box-water solvent by the Particle-Mesh-Ewald (PME) method in infinite periodic boundary conditions (PBC) to determine conformational parameters of the complex. The force-field parameters of modified base-pair were calculated by Gaussian-code using Hartree-Fock /ab-initio methodology. RMSD Results reveal that the conformation of the duplex is sequence dependent and the binding energy of the complex depends on the position of the modified base-pair in the nucleic acid strand. We found that non-bonding energy had a significant contribution to stabilising such type of duplex in comparison to electrostatic energy. The distortion produced within strands by such type of base-pair was local and destabilised the duplex integrity near to substitution, moreover the binding energy of duplex depends on the position of substitution of hydrophobic base-pair and the DNA sequence and strongly supports the corresponding experimental study.
Single-strand conformation polymorphism (SSCP)-based mutation scanning approaches to fingerprint sequence variation in ribosomal DNA of ascaridoid nematodes.

PubMed

Zhu, X Q; Gasser, R B

1998-06-01

In this study, we assessed single-strand conformation polymorphism (SSCP)-based approaches for their capacity to fingerprint sequence variation in ribosomal DNA (rDNA) of ascaridoid nematodes of veterinary and/or human health significance. The second internal transcribed spacer region (ITS-2) of rDNA was utilised as the target region because it is known to provide species-specific markers for this group of parasites. ITS-2 was amplified by PCR from genomic DNA derived from individual parasites and subjected to analysis. Direct SSCP analysis of amplicons from seven taxa (Toxocara vitulorum, Toxocara cati, Toxocara canis, Toxascaris leonina, Baylisascaris procyonis, Ascaris suum and Parascaris equorum) showed that the single-strand (ss) ITS-2 patterns produced allowed their unequivocal identification to species. While no variation in SSCP patterns was detected in the ITS-2 within four species for which multiple samples were available, the method allowed the direct display of four distinct sequence types of ITS-2 among individual worms of T. cati. Comparison of SSCP/sequencing with the methods of dideoxy fingerprinting (ddF) and restriction endonuclease fingerprinting (REF) revealed that also ddF allowed the definition of the four sequence types, whereas REF displayed three of four. The findings indicate the usefulness of the SSCP-based approaches for the identification of ascaridoid nematodes to species, the direct display of sequence variation in rDNA and the detection of population variation. The ability to fingerprint microheterogeneity in ITS-2 rDNA using such approaches also has implications for studying fundamental aspects relating to mutational change in rDNA.
Analysis of Typing Methods for Epidemiological Surveillance of both Methicillin-Resistant and Methicillin-Susceptible Staphylococcus aureus Strains▿ †

PubMed Central

Faria, Nuno A.; Carrico, João A.; Oliveira, Duarte C.; Ramirez, Mário; de Lencastre, Hermínia

2008-01-01

Sequence-based methods for typing Staphylococcus aureus, such as multilocus sequence typing (MLST) and spa typing, have increased interlaboratory reproducibility, portability, and speed in obtaining results, but pulsed-field gel electrophoresis (PFGE), remains the method of choice in many laboratories due to the extensive experience with this methodology and the large body of data accumulated using the technique. Comparisons between typing methods have been overwhelmingly based on a qualitative assessment of the overall agreement of results and the relative discriminatory indexes. In this study, we quantitatively assess the congruence of the major typing methods for S. aureus, using a diverse collection of 198 S. aureus strains previously characterized by PFGE, spa typing, MLST, and, in the case of methicillin-resistant S. aureus (MRSA), SCCmec typing in order to establish the quantitative congruence between the typing methods. The results of most typing methods agree in that MRSA and methicillin-susceptible S. aureus (MSSA) differ in terms of diversity of genetic backgrounds, with MSSA being more diverse. Our results show that spa typing has a very good predictive power over the clonal relationships defined by eBURST, while PFGE is less accurate for that purpose but nevertheless provides better typeability and discriminatory power. The combination of PFGE and spa typing provided even better results. Based on these observations, we suggest the use of the conjugation of spa typing and PFGE typing for epidemiological surveillance studies, since this combination provides the ability to infer long-term relationships while maintaining the discriminatory power and typeability needed in short-term studies. PMID:17989188
Strategies for the Identification and Tracking of Cronobacter Species: An Opportunistic Pathogen of Concern to Neonatal Health

PubMed Central

Yan, Qiongqiong; Fanning, Séamus

2015-01-01

Cronobacter species are emerging opportunistic food-borne pathogens, which consists of seven species, including C. sakazakii, C. malonaticus, C. muytjensii, C. turicensis, C. dublinensis, C. universalis, and C. condimenti. The organism can cause severe clinical infections, including necrotizing enterocolitis, septicemia, and meningitis, predominately among neonates <4 weeks of age. Cronobacter species can be isolated from various foods and their surrounding environments; however, powdered infant formula (PIF) is the most frequently implicated food source linked with Cronobacter infection. This review aims to provide a summary of laboratory-based strategies that can be used to identify and trace Cronobacter species. The identification of Cronobacter species using conventional culture method and immuno-based detection protocols were first presented. The molecular detection and identification at genus-, and species-level along with molecular-based serogroup approaches are also described, followed by the molecular sub-typing methods, in particular pulsed-field gel electrophoresis and multi-locus sequence typing. Next generation sequence approaches, including whole genome sequencing, DNA microarray, and high-throughput whole-transcriptome sequencing, are also highlighted. Appropriate application of these strategies would contribute to reduce the risk of Cronobacter contamination in PIF and production environments, thereby improving food safety and protecting public health. PMID:26000266
Phylogenetic analysis of Fusobacterium prausnitzii based upon the 16S rRNA gene sequence and PCR confirmation.

PubMed

Wang, R F; Cao, W W; Cerniglia, C E

1996-01-01

In order to develop a PCR method to detect Fusobacterium prausnitzii in human feces and to clarify the phylogenetic position of this species, its 16S rRNA gene sequence was determined. The sequence described in this paper is different from the 16S rRNA gene sequence is specific for F. prausnitzii, and the results of this assay confirmed that F. prausnitzii is the most common species in human feces. However, a PCR assay based on the original GenBank sequence was negative when it was performed with two strains of F. prausnitzii obtained from the American Type Culture Collection. A phylogenetic tree based on the new 16S rRNA gene sequence was constructed. On this tree F. prausnitzii was not a member of the Fusobacterium group but was closer to some Eubacterium spp. and located between Clostridium "clusters III and IV" (M.D. Collins, P.A. Lawson, A. Willems, J.J. Cordoba, J. Fernandez-Garayzabal, P. Garcia, J. Cai, H. Hippe, and J.A.E. Farrow, Int. J. Syst. Bacteriol. 44:812-826, 1994).
Reference voltage calculation method based on zero-sequence component optimisation for a regional compensation DVR

NASA Astrophysics Data System (ADS)

Jian, Le; Cao, Wang; Jintao, Yang; Yinge, Wang

2018-04-01

This paper describes the design of a dynamic voltage restorer (DVR) that can simultaneously protect several sensitive loads from voltage sags in a region of an MV distribution network. A novel reference voltage calculation method based on zero-sequence voltage optimisation is proposed for this DVR to optimise cost-effectiveness in compensation of voltage sags with different characteristics in an ungrounded neutral system. Based on a detailed analysis of the characteristics of voltage sags caused by different types of faults and the effect of the wiring mode of the transformer on these characteristics, the optimisation target of the reference voltage calculation is presented with several constraints. The reference voltages under all types of voltage sags are calculated by optimising the zero-sequence component, which can reduce the degree of swell in the phase-to-ground voltage after compensation to the maximum extent and can improve the symmetry degree of the output voltages of the DVR, thereby effectively increasing the compensation ability. The validity and effectiveness of the proposed method are verified by simulation and experimental results.
Sunflower centromeres consist of a centromere-specific LINE and a chromosome-specific tandem repeat.

PubMed

Nagaki, Kiyotaka; Tanaka, Keisuke; Yamaji, Naoki; Kobayashi, Hisato; Murata, Minoru

2015-01-01

The kinetochore is a protein complex including kinetochore-specific proteins that plays a role in chromatid segregation during mitosis and meiosis. The complex associates with centromeric DNA sequences that are usually species-specific. In plant species, tandem repeats including satellite DNA sequences and retrotransposons have been reported as centromeric DNA sequences. In this study on sunflowers, a cDNA-encoding centromere-specific histone H3 (CENH3) was isolated from a cDNA pool from a seedling, and an antibody was raised against a peptide synthesized from the deduced cDNA. The antibody specifically recognized the sunflower CENH3 (HaCENH3) and showed centromeric signals by immunostaining and immunohistochemical staining analysis. The antibody was also applied in chromatin immunoprecipitation (ChIP)-Seq to isolate centromeric DNA sequences and two different types of repetitive DNA sequences were identified. One was a long interspersed nuclear element (LINE)-like sequence, which showed centromere-specific signals on almost all chromosomes in sunflowers. This is the first report of a centromeric LINE sequence, suggesting possible centromere targeting ability. Another type of identified repetitive DNA was a tandem repeat sequence with a 187-bp unit that was found only on a pair of chromosomes. The HaCENH3 content of the tandem repeats was estimated to be much higher than that of the LINE, which implies centromere evolution from LINE-based centromeres to more stable tandem-repeat-based centromeres. In addition, the epigenetic status of the sunflower centromeres was investigated by immunohistochemical staining and ChIP, and it was found that centromeres were heterochromatic.
Molecular Analysis of Spinal Muscular Atrophy: A genotyping protocol based on TaqMan(®) real-time PCR.

PubMed

de Souza Godinho, Fernanda Marques; Bock, Hugo; Gheno, Tailise Conte; Saraiva-Pereira, Maria Luiza

2012-12-01

Spinal muscular atrophy (SMA) is an autosomal recessive inherited disorder caused by alterations in the survival motor neuron I (SMN1) gene. SMA patients are classified as type I-IV based on severity of symptoms and age of onset. About 95% of SMA cases are caused by the homozygous absence of SMN1 due to gene deletion or conversion into SMN2. PCR-based methods have been widely used in genetic testing for SMA. In this work, we introduce a new approach based on TaqMan(®)real-time PCR for research and diagnostic settings. DNA samples from 100 individuals with clinical signs and symptoms suggestive of SMA were analyzed. Mutant DNA samples as well as controls were confirmed by DNA sequencing. We detected 58 SMA cases (58.0%) by showing deletion of SMN1 exon 7. Considering clinical information available from 56 of them, the patient distribution was 26 (46.4%) SMA type I, 16 (28.6%) SMA type II and 14 (25.0%) SMA type III. Results generated by the new method was confirmed by PCR-RFLP and by DNA sequencing when required. In conclusion, a protocol based on real-time PCR was shown to be effective and specific for molecular analysis of SMA patients.
Emergence of new types of Theileria orientalis in Australian cattle and possible cause of theileriosis outbreaks

PubMed Central

2011-01-01

Theileria parasites cause a benign infection of cattle in parts of Australia where they are endemic, but have, in recent years, been suspected of being responsible for a number of outbreaks of disease in cattle near the coast of New South Wales. The objective of this study was to identify and characterize the species of Theileria in cattle on six farms in New South Wales where disease outbreaks have occurred, and compare with Theileria from three disease-free farms in Queensland that is endemic for Theileria. Special reference was made to sub-typing of T. orientalis by type-specific PCR and sequencing of the small subunit (SSU) rRNA gene, and sequence analysis of the gene encoding a polymorphic merozoite/piroplasm surface protein (MPSP) that may be under immune selection. Nucleotide sequencing of SSU rRNA and MPSP genes revealed the presence of four Theileria genotypes: T. orientalis (buffeli), T. orientalis (ikeda), T. orientalis (chitose) and T. orientalis type 4 (MPSP) or type C (SSU rRNA). The majority of animals showed mixed infections while a few showed single infection. When MPSP nucleotide sequences were translated into amino acids, base transition did not change amino acid composition of the protein product, suggesting possible silent polymorphism. The occurrence of ikeda and type 4 (type C) previously not reported to occur and silent mutation is thought to have enhanced parasite evasion of the host immune response causing the outbreak. PMID:21338493
Structural features of the rice chromosome 4 centromere.

PubMed

Zhang, Yu; Huang, Yuchen; Zhang, Lei; Li, Ying; Lu, Tingting; Lu, Yiqi; Feng, Qi; Zhao, Qiang; Cheng, Zhukuan; Xue, Yongbiao; Wing, Rod A; Han, Bin

2004-01-01

A complete sequence of a chromosome centromere is necessary for fully understanding centromere function. We reported the sequence structures of the first complete rice chromosome centromere through sequencing a large insert bacterial artificial chromosome clone-based contig, which covered the rice chromosome 4 centromere. Complete sequencing of the 124-kb rice chromosome 4 centromere revealed that it consisted of 18 tracts of 379 tandemly arrayed repeats known as CentO and a total of 19 centromeric retroelements (CRs) but no unique sequences were detected. Four tracts, composed of 65 CentO repeats, were located in the opposite orientation, and 18 CentO tracts were flanked by 19 retroelements. The CRs were classified into four types, and the type I retroelements appeared to be more specific to rice centromeres. The preferential insert of the CRs among CentO repeats indicated that the centromere-specific retroelements may contribute to centromere expansion during evolution. The presence of three intact retrotransposons in the centromere suggests that they may be responsible for functional centromere initiation through a transcription-mediated mechanism.
[Phylogenetic and diversity analysis of Acidithiobacillus spp. based on 16S rRNA and RubisCO genes homologues].

PubMed

Liu, Minrui; Lin, Pengwu; Qi, Xing'e; Ni, Yongqing

2016-04-14

The purpose of the study was to reveal geographic region-related Acidithiobacillus spp. distribution and allopatric speciation. Phylogenetic and diversity analysis was done to expand our knowledge on microbial phylogeography, diversity-maintaining mechanisms and molecular biogeography. We amplified 16S rRNA gene and RubisCO genes to construct corresponding phylogenetic trees based on the sequence homology and analyzed genetic diversity of Acidithiobacillus spp.. Thirty-five strains were isolated from three different regions in China (Yunnan, Hubei, Xinjiang). The whole isolates were classified into five groups. Four strains were identified as A. ferrivorans, six as A. ferridurans, YNTR4-15 Leptspirillum ferrooxidans and HBDY3-31 as Leptospirillum ferrodiazotrophum. The remaining strains were identified as A. ferrooxidans. Analysis of cbbL and cbbM genes sequences of representative 26 strains indicated that cbbL gene of 19 were two copies (cbbL1 and cbbL2) and 7 possessed only cbbL1. cbbM gene was single copy. In nucleotide-based trees, cbbL1 gene sequences of strains were separated into three sequence types, and the cbbL2 was similar to cbbL1 with three types. Codon bias of RubisCO genes was not obvious in Acidithiobacillus spp.. Strains isolated from three different regions in China indicated a great genetic diversity in Acidithiobacillus spp. and their 16S rRNA/RubisCO genes sequence was of significant difference. Phylogenetic tree based on 16S rRNA genes and RubisCO genes was different in Acidithiobacillus spp..
Reclassification of Pseudomonas mephitica Claydon and Hammer 1939 as a later heterotypic synonym of Janthinobacterium lividum (Eisenberg 1891) De Ley et al. 1978.

PubMed

Kämpfer, Peter; Falsen, Enevold; Busse, Hans-Jürgen

2008-01-01

Pseudomonas mephitica CCUG 2513(T) has been reinvestigated to clarify its taxonomic position. 16S rRNA gene sequence comparisons demonstrated that this strain clusters phylogenetically closely with Janthinobacterium lividum (99.8% sequence similarity to the type strain). Investigation of fatty acid patterns, polar lipid profiles, polyamine patterns and quinone systems supported this delineation. Substrate utilization profiles and biochemical characteristics displayed no differences from the type strain of J. lividum, CCUG 2344(T). Therefore, the reclassification of Pseudomonas mephitica as a later heterotypic synonym of Janthinobacterium lividum is proposed, based upon the estimated phylogenetic position derived from 16S rRNA gene sequence data and chemotaxonomic and biochemical data.
Multilocus sequence typing scheme for the Mycobacterium abscessus complex.

PubMed

Macheras, Edouard; Konjek, Julie; Roux, Anne-Laure; Thiberge, Jean-Michel; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby E; Bodmer, Thomas; Jarlier, Vincent; Cambau, Emmanuelle; Brisse, Sylvain; Caro, Valérie; Rastogi, Nalin; Gaillard, Jean-Louis; Heym, Beate

2014-01-01

We developed a multilocus sequence typing (MLST) scheme for Mycobacterium abscessus sensu lato, based on the partial sequencing of seven housekeeping genes: argH, cya, glpK, gnd, murC, pta and purH. This scheme was used to characterize a collection of 227 isolates recovered between 1994 and 2010 in France, Germany, Switzerland and Brazil. We identified 100 different sequence types (STs), which were distributed into three groups on the tree obtained by concatenating the sequences of the seven housekeeping gene fragments (3576bp): the M. abscessus sensu stricto group (44 STs), the "M. massiliense" group (31 STs) and the "M. bolletii" group (25 STs). SplitTree analysis showed a degree of intergroup lateral transfers. There was also evidence of lateral transfer events involving rpoB. The most prevalent STs in our collection were ST1 (CC5; 20 isolates) and ST23 (CC3; 31 isolates). Both STs were found in Europe and Brazil, and the latter was implicated in a large post-surgical procedure outbreak in Brazil. Respiratory isolates from patients with cystic fibrosis belonged to a large variety of STs; however, ST2 was predominant in this group of patients. Our MLST scheme, publicly available at www.pasteur.fr/mlst, offers investigators a valuable typing tool for M. abscessus sensu lato in future epidemiological studies throughout the world. Copyright © 2013 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
The first FDA marketing authorizations of next-generation sequencing technology and tests: challenges, solutions and impact for future assays.

PubMed

Bijwaard, Karen; Dickey, Jennifer S; Kelm, Kellie; Težak, Živana

2015-01-01

The rapid emergence and clinical translation of novel high-throughput sequencing technologies created a need to clarify the regulatory pathway for the evaluation and authorization of these unique technologies. Recently, the US FDA authorized for marketing four next generation sequencing (NGS)-based diagnostic devices which consisted of two heritable disease-specific assays, library preparation reagents and a NGS platform that are intended for human germline targeted sequencing from whole blood. These first authorizations can serve as a case study in how different types of NGS-based technology are reviewed by the FDA. In this manuscript we describe challenges associated with the evaluation of these novel technologies and provide an overview of what was reviewed. Besides making validated NGS-based devices available for in vitro diagnostic use, these first authorizations create a regulatory path for similar future instruments and assays.
Dynamics of actin evolution in dinoflagellates.

PubMed

Kim, Sunju; Bachvaroff, Tsvetan R; Handy, Sara M; Delwiche, Charles F

2011-04-01

Dinoflagellates have unique nuclei and intriguing genome characteristics with very high DNA content making complete genome sequencing difficult. In dinoflagellates, many genes are found in multicopy gene families, but the processes involved in the establishment and maintenance of these gene families are poorly understood. Understanding the dynamics of gene family evolution in dinoflagellates requires comparisons at different evolutionary scales. Studies of closely related species provide fine-scale information relative to species divergence, whereas comparisons of more distantly related species provides broad context. We selected the actin gene family as a highly expressed conserved gene previously studied in dinoflagellates. Of the 142 sequences determined in this study, 103 were from the two closely related species, Dinophysis acuminata and D. caudata, including full length and partial cDNA sequences as well as partial genomic amplicons. For these two Dinophysis species, at least three types of sequences could be identified. Most copies (79%) were relatively similar and in nucleotide trees, the sequences formed two bushy clades corresponding to the two species. In comparisons within species, only eight to ten nucleotide differences were found between these copies. The two remaining types formed clades containing sequences from both species. One type included the most similar sequences in between-species comparisons with as few as 12 nucleotide differences between species. The second type included the most divergent sequences in comparisons between and within species with up to 93 nucleotide differences between sequences. In all the sequences, most variation occurred in synonymous sites or the 5' UnTranslated Region (UTR), although there was still limited amino acid variation between most sequences. Several potential pseudogenes were found (approximately 10% of all sequences depending on species) with incomplete open reading frames due to frameshifts or early stop codons. Overall, variation in the actin gene family fits best with the "birth and death" model of evolution based on recent duplications, pseudogenes, and incomplete lineage sorting. Divergence between species was similar to variation within species, so that actin may be too conserved to be useful for phylogenetic estimation of closely related species.
High-resolution typing of Chlamydia trachomatis: epidemiological and clinical uses.

PubMed

de Vries, Henry J C; Schim van der Loeff, Maarten F; Bruisten, Sylvia M

2015-02-01

A state-of-the-art overview of molecular Chlamydia trachomatis typing methods that are used for routine diagnostics and scientific studies. Molecular epidemiology uses high-resolution typing techniques such as multilocus sequence typing, multilocus variable number of tandem repeats analysis, and whole-genome sequencing to identify strains based on their DNA sequence. These data can be used for cluster, network and phylogenetic analyses, and are used to unveil transmission networks, risk groups, and evolutionary pathways. High-resolution typing of C. trachomatis strains is applied to monitor treatment efficacy and re-infections, and to study the recent emergence of lymphogranuloma venereum (LGV) amongst men who have sex with men in high-income countries. Chlamydia strain typing has clinical relevance in disease management, as LGV needs longer treatment than non-LGV C. trachomatis. It has also led to the discovery of a new variant Chlamydia strain in Sweden, which was not detected by some commercial C. trachomatis diagnostic platforms. After a brief history and comparison of the various Chlamydia typing methods, the applications of the current techniques are described and future endeavors to extend scientific understanding are formulated. High-resolution typing will likely help to further unravel the pathophysiological mechanisms behind the wide clinical spectrum of chlamydial disease.
Efficacy of Pulsed-Field Gel Electrophoresis and Repetitive Element Sequence-Based PCR in Typing of Salmonella Isolates from Assam, India.

PubMed

Gogoi, Purnima; Borah, Probodh; Hussain, Iftikar; Das, Leena; Hazarika, Girin; Tamuly, Shantanu; Barkalita, Luit Moni

2018-05-01

A total of 12 Salmonella isolates belonging to different serovars, viz , Salmonella enterica serovar Enteritidis ( n = 4), Salmonella enterica serovar Weltevreden ( n = 4), Salmonella enterica serovar Newport ( n = 1), Salmonella enterica serovar Litchifield ( n = 1), and untypeable strains ( n = 2) were isolated from 332 diarrheic fecal samples collected from animals, birds, and humans. Of the two molecular typing methods applied, viz , repetitive element sequence-based PCR (REP-PCR) and pulsed-field gel electrophoresis (PFGE), PFGE could clearly differentiate the strains belonging to different serovars as well as differentiate between strains of the same serovar with respect to their source of isolation, whereas REP-PCR could not differentiate between strains of the same serovar. Thus, it can be suggested that PFGE is more useful and appropriate for molecular typing of Salmonella isolates during epidemiological investigations than REP-PCR. Copyright © 2018 American Society for Microbiology.

Optimization of sequence alignment for simple sequence repeat regions.

PubMed

Jighly, Abdulqader; Hamwieh, Aladdin; Ogbonnaya, Francis C

2011-07-20

Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSR has been used as a molecular marker because it is easy to detect and is used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. It is also very mutable because of slipping in the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDELs) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphism (SNPs).SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions, as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after determining the repeated units first, and the last SSR nucleotides positions. This results in a shifting process according to the inserted repeated unit type.When studying the phylogenic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different linage had been observed after applying the new algorithm. The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different linages. It reduces overlapping during SSR alignment, which results in a more realistic phylogenic relationship.
Comparison of Dixon Sequences for Estimation of Percent Breast Fibroglandular Tissue

PubMed Central

Ledger, Araminta E. W.; Scurr, Erica D.; Hughes, Julie; Macdonald, Alison; Wallace, Toni; Thomas, Karen; Wilson, Robin; Leach, Martin O.; Schmidt, Maria A.

2016-01-01

Objectives To evaluate sources of error in the Magnetic Resonance Imaging (MRI) measurement of percent fibroglandular tissue (%FGT) using two-point Dixon sequences for fat-water separation. Methods Ten female volunteers (median age: 31 yrs, range: 23–50 yrs) gave informed consent following Research Ethics Committee approval. Each volunteer was scanned twice following repositioning to enable an estimation of measurement repeatability from high-resolution gradient-echo (GRE) proton-density (PD)-weighted Dixon sequences. Differences in measures of %FGT attributable to resolution, T1 weighting and sequence type were assessed by comparison of this Dixon sequence with low-resolution GRE PD-weighted Dixon data, and against gradient-echo (GRE) or spin-echo (SE) based T1-weighted Dixon datasets, respectively. Results %FGT measurement from high-resolution PD-weighted Dixon sequences had a coefficient of repeatability of ±4.3%. There was no significant difference in %FGT between high-resolution and low-resolution PD-weighted data. Values of %FGT from GRE and SE T1-weighted data were strongly correlated with that derived from PD-weighted data (r = 0.995 and 0.96, respectively). However, both sequences exhibited higher mean %FGT by 2.9% (p < 0.0001) and 12.6% (p < 0.0001), respectively, in comparison with PD-weighted data; the increase in %FGT from the SE T1-weighted sequence was significantly larger at lower breast densities. Conclusion Although measurement of %FGT at low resolution is feasible, T1 weighting and sequence type impact on the accuracy of Dixon-based %FGT measurements; Dixon MRI protocols for %FGT measurement should be carefully considered, particularly for longitudinal or multi-centre studies. PMID:27011312
Minim typing--a rapid and low cost MLST based typing tool for Klebsiella pneumoniae.

PubMed

Andersson, Patiyan; Tong, Steven Y C; Bell, Jan M; Turnidge, John D; Giffard, Philip M

2012-01-01

Here we report a single nucleotide polymorphism (SNP) based genotyping method for Klebsiella pneumoniae utilising high-resolution melting (HRM) analysis of fragments within the multilocus sequence typing (MLST) loci. The approach is termed mini-MLST or Minim typing and it has previously been applied to Streptococcus pyogenes, Staphylococcus aureus and Enterococcus faecium. Six SNPs were derived from concatenated MLST sequences on the basis of maximisation of the Simpsons Index of Diversity (D). DNA fragments incorporating these SNPs and predicted to be suitable for HRM analysis were designed. Using the assumption that HRM alleles are defined by G+C content, Minim typing using six fragments was predicted to provide a D = 0.979 against known STs. The method was tested against 202 K. pneumoniae using a blinded approach in which the MLST analyses were performed after the HRM analyses. The HRM-based alleles were indeed in accordance with G+C content, and the Minim typing identified known STs and flagged new STs. The tonB MLST locus was determined to be very diverse, and the two Minim fragments located herein contribute greatly to the resolving power. However these fragments are refractory to amplification in a minority of isolates. Therefore, we assessed the performance of two additional formats: one using only the four fragments located outside the tonB gene (D = 0.929), and the other using HRM data from these four fragments in conjunction with sequencing of the tonB MLST fragment (D = 0.995). The HRM assays were developed on the Rotorgene 6000, and the method was shown to also be robust on the LightCycler 480, allowing a 384-well high through-put format. The assay provides rapid, robust and low-cost typing with fully portable results that can directly be related to current MLST data. Minim typing in combination with molecular screening for antibiotic resistance markers can be a powerful surveillance tool kit.
Minim Typing – A Rapid and Low Cost MLST Based Typing Tool for Klebsiella pneumoniae

PubMed Central

Andersson, Patiyan; Tong, Steven Y. C.; Bell, Jan M.; Turnidge, John D.; Giffard, Philip M.

2012-01-01

Here we report a single nucleotide polymorphism (SNP) based genotyping method for Klebsiella pneumoniae utilising high-resolution melting (HRM) analysis of fragments within the multilocus sequence typing (MLST) loci. The approach is termed mini-MLST or Minim typing and it has previously been applied to Streptococcus pyogenes, Staphylococcus aureus and Enterococcus faecium. Six SNPs were derived from concatenated MLST sequences on the basis of maximisation of the Simpsons Index of Diversity (D). DNA fragments incorporating these SNPs and predicted to be suitable for HRM analysis were designed. Using the assumption that HRM alleles are defined by G+C content, Minim typing using six fragments was predicted to provide a D = 0.979 against known STs. The method was tested against 202 K. pneumoniae using a blinded approach in which the MLST analyses were performed after the HRM analyses. The HRM-based alleles were indeed in accordance with G+C content, and the Minim typing identified known STs and flagged new STs. The tonB MLST locus was determined to be very diverse, and the two Minim fragments located herein contribute greatly to the resolving power. However these fragments are refractory to amplification in a minority of isolates. Therefore, we assessed the performance of two additional formats: one using only the four fragments located outside the tonB gene (D = 0.929), and the other using HRM data from these four fragments in conjunction with sequencing of the tonB MLST fragment (D = 0.995). The HRM assays were developed on the Rotorgene 6000, and the method was shown to also be robust on the LightCycler 480, allowing a 384-well high through-put format. The assay provides rapid, robust and low-cost typing with fully portable results that can directly be related to current MLST data. Minim typing in combination with molecular screening for antibiotic resistance markers can be a powerful surveillance tool kit. PMID:22428067
Stratigraphic architecture and gamma ray logs of deeper ramp carbonates (Upper Jurassic, SW Germany)

NASA Astrophysics Data System (ADS)

Pawellek, T.; Aigner, T.

2003-07-01

The objective of this paper is to contribute to the development of sequence stratigraphic models for extensive epicontinental carbonate systems deposited over cratonic areas. Epicontinental carbonates of the SW German Upper Jurassic were analysed in terms of microfacies, sedimentology and sequence stratigraphy based on 2.5 km of core, 70 borehole gamma ray logs and 24 quarries. Facies analysis revealed six major facies belts across the deeper parts of the carbonate ramp, situated generally below fair-weather wave base, and mostly below average storm wave base but in the reach of occasional storm events. Observed stratigraphic patterns differ in some aspects from widely published sequence stratigraphic models: Elementary sedimentary cycles are mostly more or less symmetrical and are, thus, referred to as "genetic sequences" or "genetic units" [AAAPG Bull. 55 (1971) 1137; Frazier, D.E., 1974. Depositional episodes: their relationship to the Quaternary stratigraphic framework in the northwestern portion of the Gulf Basin. University of Texas, Austin, Bureau of Economic Geology Geologicalo Circular 71-1; AAPG Bull. 73 (1989) 125; Galloway, W.E., Hobday, D.K., 1996. Terrigenous Clastic Depositional Systems. 489 pp., Springer; Cross, T.A., Baker, M.R., Chapin, M.S., Clark, M.S., Gardner, M.H., Hanson, M.S., Lessenger, M.A., Little, L.D., McDonough, K.J., Sonnenfeld, M.D., Valasek, D.W., Williams, M.R., Witter, D.N., 1993. Applications of high-resolution sequence stratigraphy to reservoir analysis. Edition Technip 1993, 11-33; Bull. Cent. Rech. Explor. Prod. Elf-Aquitaine 16 (1992) 357; Homewood, P., Mauriaud, P., Lafont, F., 2000. Best practices in sequence stratigraphy. Elf EP Mem. 25, 81 pp.; Homewood, P., Eberli, G.P., 2000. Genetic stratigraphy on the exploration and production scales. Elf EP Mem. 24, 290 pp.], in contrast to the asymmetrical, shallowing-upward "parasequences" of the EXXON approach. Neither sequence boundaries nor maximum flooding surfaces could be clearly delineated. Cycle boundaries are generally not represented by sharp stratal surfaces but are always transitional and, thus, referred to as "turnarounds" [Nor. Pet. Soc. Spec. Publ. 8 (1998) 171]. Several types of genetic sequences were delineated. Both major types of facies and sequences show characteristic gamma ray log signatures. Based on the cycle stacking and the gamma ray patterns, a hierarchy of sequences was recognized, probably driven in part by 100,000- and 400,000-year Milankovitch signals. The cyclicity allowed regional correlations across various depositional environments such as sponge-microbial bioherms and coeval basins. The basin-wide correlation revealed evidence for a subtle clinoform-type stratigraphic architecture along very gentle slopes, rather than a so far assumed simple "layer cake" pattern.
Estimating Exceptionally Rare Germline and Somatic Mutation Frequencies via Next Generation Sequencing

PubMed Central

Yoon, Song-Ro; Arnheim, Norman; Calabrese, Peter

2016-01-01

We used targeted next generation deep-sequencing (Safe Sequencing System) to measure ultra-rare de novo mutation frequencies in the human male germline by attaching a unique identifier code to each target DNA molecule. Segments from three different human genes (FGFR3, MECP2 and PTPN11) were studied. Regardless of the gene segment, the particular testis donor or the 73 different testis pieces used, the frequencies for any one of the six different mutation types were consistent. Averaging over the C>T/G>A and G>T/C>A mutation types the background mutation frequency was 2.6x10-5 per base pair, while for the four other mutation types the average background frequency was lower at 1.5x10-6 per base pair. These rates far exceed the well documented human genome average frequency per base pair (~10−8) suggesting a non-biological explanation for our data. By computational modeling and a new experimental procedure to distinguish between pre-mutagenic lesion base mismatches and a fully mutated base pair in the original DNA molecule, we argue that most of the base-dependent variation in background frequency is due to a mixture of deamination and oxidation during the first two PCR cycles. Finally, we looked at a previously studied disease mutation in the PTPN11 gene and could easily distinguish true mutations from the SSS background. We also discuss the limits and possibilities of this and other methods to measure exceptionally rare mutation frequencies, and we present calculations for other scientists seeking to design their own such experiments. PMID:27341568
A segmentation method for lung nodule image sequences based on superpixels and density-based spatial clustering of applications with noise

PubMed Central

Zhang, Wei; Zhang, Xiaolong; Qiang, Yan; Tian, Qi; Tang, Xiaoxian

2017-01-01

The fast and accurate segmentation of lung nodule image sequences is the basis of subsequent processing and diagnostic analyses. However, previous research investigating nodule segmentation algorithms cannot entirely segment cavitary nodules, and the segmentation of juxta-vascular nodules is inaccurate and inefficient. To solve these problems, we propose a new method for the segmentation of lung nodule image sequences based on superpixels and density-based spatial clustering of applications with noise (DBSCAN). First, our method uses three-dimensional computed tomography image features of the average intensity projection combined with multi-scale dot enhancement for preprocessing. Hexagonal clustering and morphological optimized sequential linear iterative clustering (HMSLIC) for sequence image oversegmentation is then proposed to obtain superpixel blocks. The adaptive weight coefficient is then constructed to calculate the distance required between superpixels to achieve precise lung nodules positioning and to obtain the subsequent clustering starting block. Moreover, by fitting the distance and detecting the change in slope, an accurate clustering threshold is obtained. Thereafter, a fast DBSCAN superpixel sequence clustering algorithm, which is optimized by the strategy of only clustering the lung nodules and adaptive threshold, is then used to obtain lung nodule mask sequences. Finally, the lung nodule image sequences are obtained. The experimental results show that our method rapidly, completely and accurately segments various types of lung nodule image sequences. PMID:28880916
Mitochondrial DNA typing from human axillary, pubic and head hair shafts - success rates and sequence comparisons.

PubMed

Pfeiffer, H; Hühne, J; Ortmann, C; Waterkamp, K; Brinkmann, B

1999-01-01

The analysis of mitochondrial DNA (mtDNA) from shed hairs has gained high importance in forensic casework since telogen hairs are one of the most common types of evidence left at the crime scene. In this systematic study of hair shafts from 20 individuals, the correlation of mtDNA recovery with hair morphology (length, diameter, volume, colour), with sex, and with body localisation (head, armpit, pubis) was investigated. The highest average success rate of hypervariable region 1 (HV 1) sequencing was found in head hair shafts (75%) followed by pubic (66%) and axillary hair shafts (52%). No statistically significant correlation between morphological parameters or sex and the success rate of sequencing was found. MtDNA sequences of buccal cells, head, pubic and axillary hair shafts did not show intraindividual differences. Heteroplasmic base positions were observed neither in the hair shafts nor in control samples of buccal cells.
The Use of Weighted Graphs for Large-Scale Genome Analysis

PubMed Central

Zhou, Fang; Toivonen, Hannu; King, Ross D.

2014-01-01

There is an acute need for better tools to extract knowledge from the growing flood of sequence data. For example, thousands of complete genomes have been sequenced, and their metabolic networks inferred. Such data should enable a better understanding of evolution. However, most existing network analysis methods are based on pair-wise comparisons, and these do not scale to thousands of genomes. Here we propose the use of weighted graphs as a data structure to enable large-scale phylogenetic analysis of networks. We have developed three types of weighted graph for enzymes: taxonomic (these summarize phylogenetic importance), isoenzymatic (these summarize enzymatic variety/redundancy), and sequence-similarity (these summarize sequence conservation); and we applied these types of weighted graph to survey prokaryotic metabolism. To demonstrate the utility of this approach we have compared and contrasted the large-scale evolution of metabolism in Archaea and Eubacteria. Our results provide evidence for limits to the contingency of evolution. PMID:24619061
The Staphylococcus aureus Two-Component System AgrAC Displays Four Distinct Genomic Arrangements That Delineate Genomic Virulence Factor Signatures

PubMed Central

Choudhary, Kumari S.; Mih, Nathan; Monk, Jonathan; Kavvas, Erol; Yurkovich, James T.; Sakoulas, George; Palsson, Bernhard O.

2018-01-01

Two-component systems (TCSs) consist of a histidine kinase and a response regulator. Here, we evaluated the conservation of the AgrAC TCS among 149 completely sequenced Staphylococcus aureus strains. It is composed of four genes: agrBDCA. We found that: (i) AgrAC system (agr) was found in all but one of the 149 strains, (ii) the agr positive strains were further classified into four agr types based on AgrD protein sequences, (iii) the four agr types not only specified the chromosomal arrangement of the agr genes but also the sequence divergence of AgrC histidine kinase protein, which confers signal specificity, (iv) the sequence divergence was reflected in distinct structural properties especially in the transmembrane region and second extracellular binding domain, and (v) there was a strong correlation between the agr type and the virulence genomic profile of the organism. Taken together, these results demonstrate that bioinformatic analysis of the agr locus leads to a classification system that correlates with the presence of virulence factors and protein structural properties. PMID:29887846
Levels of integration in cognitive control and sequence processing in the prefrontal cortex.

PubMed

Bahlmann, Jörg; Korb, Franziska M; Gratton, Caterina; Friederici, Angela D

2012-01-01

Cognitive control is necessary to flexibly act in changing environments. Sequence processing is needed in language comprehension to build the syntactic structure in sentences. Functional imaging studies suggest that sequence processing engages the left ventrolateral prefrontal cortex (PFC). In contrast, cognitive control processes additionally recruit bilateral rostral lateral PFC regions. The present study aimed to investigate these two types of processes in one experimental paradigm. Sequence processing was manipulated using two different sequencing rules varying in complexity. Cognitive control was varied with different cue-sets that determined the choice of a sequencing rule. Univariate analyses revealed distinct PFC regions for the two types of processing (i.e. sequence processing: left ventrolateral PFC and cognitive control processing: bilateral dorsolateral and rostral PFC). Moreover, in a common brain network (including left lateral PFC and intraparietal sulcus) no interaction between sequence and cognitive control processing was observed. In contrast, a multivariate pattern analysis revealed an interaction of sequence and cognitive control processing, such that voxels in left lateral PFC and parietal cortex showed different tuning functions for tasks involving different sequencing and cognitive control demands. These results suggest that the difference between the process of rule selection (i.e. cognitive control) and the process of rule-based sequencing (i.e. sequence processing) find their neuronal underpinnings in distinct activation patterns in lateral PFC. Moreover, the combination of rule selection and rule sequencing can shape the response of neurons in lateral PFC and parietal cortex.
Levels of Integration in Cognitive Control and Sequence Processing in the Prefrontal Cortex

PubMed Central

Bahlmann, Jörg; Korb, Franziska M.; Gratton, Caterina; Friederici, Angela D.

2012-01-01

Cognitive control is necessary to flexibly act in changing environments. Sequence processing is needed in language comprehension to build the syntactic structure in sentences. Functional imaging studies suggest that sequence processing engages the left ventrolateral prefrontal cortex (PFC). In contrast, cognitive control processes additionally recruit bilateral rostral lateral PFC regions. The present study aimed to investigate these two types of processes in one experimental paradigm. Sequence processing was manipulated using two different sequencing rules varying in complexity. Cognitive control was varied with different cue-sets that determined the choice of a sequencing rule. Univariate analyses revealed distinct PFC regions for the two types of processing (i.e. sequence processing: left ventrolateral PFC and cognitive control processing: bilateral dorsolateral and rostral PFC). Moreover, in a common brain network (including left lateral PFC and intraparietal sulcus) no interaction between sequence and cognitive control processing was observed. In contrast, a multivariate pattern analysis revealed an interaction of sequence and cognitive control processing, such that voxels in left lateral PFC and parietal cortex showed different tuning functions for tasks involving different sequencing and cognitive control demands. These results suggest that the difference between the process of rule selection (i.e. cognitive control) and the process of rule-based sequencing (i.e. sequence processing) find their neuronal underpinnings in distinct activation patterns in lateral PFC. Moreover, the combination of rule selection and rule sequencing can shape the response of neurons in lateral PFC and parietal cortex. PMID:22952762
Rather than by direct acquisition via lateral gene transfer, GHF5 cellulases were passed on from early Pratylenchidae to root-knot and cyst nematodes.

PubMed

Rybarczyk-Mydłowska, Katarzyna; Maboreke, Hazel Ruvimbo; van Megen, Hanny; van den Elsen, Sven; Mooyman, Paul; Smant, Geert; Bakker, Jaap; Helder, Johannes

2012-11-21

Plant parasitic nematodes are unusual Metazoans as they are equipped with genes that allow for symbiont-independent degradation of plant cell walls. Among the cell wall-degrading enzymes, glycoside hydrolase family 5 (GHF5) cellulases are relatively well characterized, especially for high impact parasites such as root-knot and cyst nematodes. Interestingly, ancestors of extant nematodes most likely acquired these GHF5 cellulases from a prokaryote donor by one or multiple lateral gene transfer events. To obtain insight into the origin of GHF5 cellulases among evolutionary advanced members of the order Tylenchida, cellulase biodiversity data from less distal family members were collected and analyzed. Single nematodes were used to obtain (partial) genomic sequences of cellulases from representatives of the genera Meloidogyne, Pratylenchus, Hirschmanniella and Globodera. Combined Bayesian analysis of ≈ 100 cellulase sequences revealed three types of catalytic domains (A, B, and C). Represented by 84 sequences, type B is numerically dominant, and the overall topology of the catalytic domain type shows remarkable resemblance with trees based on neutral (= pathogenicity-unrelated) small subunit ribosomal DNA sequences. Bayesian analysis further suggested a sister relationship between the lesion nematode Pratylenchus thornei and all type B cellulases from root-knot nematodes. Yet, the relationship between the three catalytic domain types remained unclear. Superposition of intron data onto the cellulase tree suggests that types B and C are related, and together distinct from type A that is characterized by two unique introns. All Tylenchida members investigated here harbored one or multiple GHF5 cellulases. Three types of catalytic domains are distinguished, and the presence of at least two types is relatively common among plant parasitic Tylenchida. Analysis of coding sequences of cellulases suggests that root-knot and cyst nematodes did not acquire this gene directly by lateral genes transfer. More likely, these genes were passed on by ancestors of a family nowadays known as the Pratylenchidae.
Identification of GATC- and CCGG- recognizing Type II REases and their putative specificity-determining positions using Scan2S—a novel motif scan algorithm with optional secondary structure constraints

PubMed Central

Niv, Masha Y.; Skrabanek, Lucy; Roberts, Richard J.; Scheraga, Harold A.; Weinstein, Harel

2008-01-01

Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering. PMID:17972284
Identification of GATC- and CCGG-recognizing Type II REases and their putative specificity-determining positions using Scan2S--a novel motif scan algorithm with optional secondary structure constraints.

PubMed

Niv, Masha Y; Skrabanek, Lucy; Roberts, Richard J; Scheraga, Harold A; Weinstein, Harel

2008-05-01

Restriction endonucleases (REases) are DNA-cleaving enzymes that have become indispensable tools in molecular biology. Type II REases are highly divergent in sequence despite their common structural core, function and, in some cases, common specificities towards DNA sequences. This makes it difficult to identify and classify them functionally based on sequence, and has hampered the efforts of specificity-engineering. Here, we define novel REase sequence motifs, which extend beyond the PD-(D/E)XK hallmark, and incorporate secondary structure information. The automated search using these motifs is carried out with a newly developed fast regular expression matching algorithm that accommodates long patterns with optional secondary structure constraints. Using this new tool, named Scan2S, motifs derived from REases with specificity towards GATC- and CGGG-containing DNA sequences successfully identify REases of the same specificity. Notably, some of these sequences are not identified by standard sequence detection tools. The new motifs highlight potential specificity-determining positions that do not fully overlap for the GATC- and the CCGG-recognizing REases and are candidates for specificity re-engineering.
New insights into Trypanosoma cruzi evolution, genotyping and molecular diagnostics from satellite DNA sequence analysis.

PubMed

Ramírez, Juan C; Torres, Carolina; Curto, María de Los A; Schijman, Alejandro G

2017-12-01

Trypanosoma cruzi has been subdivided into seven Discrete Typing Units (DTUs), TcI-TcVI and Tcbat. Two major evolutionary models have been proposed to explain the origin of hybrid lineages, but while it is widely accepted that TcV and TcVI are the result of genetic exchange between TcII and TcIII strains, the origin of TcIII and TcIV is still a matter of debate. T. cruzi satellite DNA (SatDNA), comprised of 195 bp units organized in tandem repeats, from both TcV and TcVI stocks were found to have SatDNA copies type TcI and TcII; whereas contradictory results were observed for TcIII stocks and no TcIV sequence has been analyzed yet. Herein, we have gone deeper into this matter analyzing 335 distinct SatDNA sequences from 19 T. cruzi stocks representative of DTUs TcI-TcVI for phylogenetic inference. Bayesian phylogenetic tree showed that all sequences were grouped in three major clusters, which corresponded to sequences from DTUs TcI/III, TcII and TcIV; whereas TcV and TcVI stocks had two sets of sequences distributed into TcI/III and TcII clusters. As expected, the lowest genetic distances were found between TcI and TcIII, and between TcV and TcVI sequences; whereas the highest ones were observed between TcII and TcI/III, and among TcIV sequences and those from the remaining DTUs. In addition, signature patterns associated to specific T. cruzi lineages were identified and new primers that improved SatDNA-based qPCR sensitivity were designed. Our findings support the theory that TcIII is not the result of a hybridization event between TcI and TcII, and that TcIV had an independent origin from the other DTUs, contributing to clarifying the evolutionary history of T. cruzi lineages. Moreover, this work opens the possibility of typing samples from Chagas disease patients with low parasitic loads and improving molecular diagnostic methods of T. cruzi infection based on SatDNA sequence amplification.
Molecular and morphological differentiation between the crop and weedy types in velvetleaf (Abutilon theophrasti Medik.) using a chloroplast DNA marker: seed source of the present invasive velvetleaf in Japan.

PubMed

Kurokawa, S; Shibaike, H; Akiyama, H; Yoshimura, Y

2004-12-01

A comparison of chloroplast DNA (cpDNA) sequences was carried out between the crop and weed types of Abutilon theophrasti to clarify the seed source of the present weedy velvetleaf in Japan. A sequencing analysis of approx. 6% of the chloroplast genome (ca 10 kbp) detected three nucleotide substitutions, one six-base-pair insertion/deletion (indel) and one 30-base pair inversion, which distinguish two haplotypes of cpDNA. A PCR-based survey of the indel and the inversion revealed that the 93 accessions of velvetleaf collected from the world could be divided into two groups. A morphological marker (capsule color) could be used to discriminate the crop type and the weed type, and hence, along with cpDNA haplotype, to distinguish three genotypes (Type I, II, and III). All Japanese cultivars and crop accessions from other countries were Type I. Weed types were divided into Type II and III. All of the samples from the USA, and the samples taken from grain imports to Japan were Type III. Since most of the weedy types distributed in Japan were of Type III, it is argued that they were introduced as seeds in the imported grain. We also found that the Type II plants sporadically occurred in Japan. It is suggested that they originated as hybrids, with indigenous cultivars as the maternal ancestor. Such hybrids must have survived since the cessation of velvetleaf cultivation about a century ago.
Combined use of real-time PCR and nested sequence-based typing in survey of human Legionella infection.

PubMed

Qin, T; Zhou, H; Ren, H; Shi, W; Jin, H; Jiang, X; Xu, Y; Zhou, M; Li, J; Wang, J; Shao, Z; Xu, X

2016-07-01

Legionnaires' disease (LD) is a globally distributed systemic infectious disease. The burden of LD in many regions is still unclear, especially in Asian countries including China. A survey of Legionella infection using real-time PCR and nested sequence-based typing (SBT) was performed in two hospitals in Shanghai, China. A total of 265 bronchoalveolar lavage fluid (BALF) specimens were collected from hospital A between January 2012 and December 2013, and 359 sputum specimens were collected from hospital B throughout 2012. A total of 71 specimens were positive for Legionella according to real-time PCR focusing on the 5S rRNA gene. Seventy of these specimens were identified as Legionella pneumophila as a result of real-time PCR amplification of the dotA gene. Results of nested SBT revealed high genetic polymorphism in these L. pneumophila and ST1 was the predominant sequence type. These data revealed that the burden of LD in China is much greater than that recognized previously, and real-time PCR may be a suitable monitoring technology for LD in large sample surveys in regions lacking the economic and technical resources to perform other methods, such as urinary antigen tests and culture methods.
CRISPR Diversity and Microevolution in Clostridium difficile

PubMed Central

Andersen, Joakim M.; Shoup, Madelyn; Robinson, Cathy; Britton, Robert; Olsen, Katharina E.P.; Barrangou, Rodolphe

2016-01-01

Abstract Virulent strains of Clostridium difficile have become a global health problem associated with morbidity and mortality. Traditional typing methods do not provide ideal resolution to track outbreak strains, ascertain genetic diversity between isolates, or monitor the phylogeny of this species on a global basis. Here, we investigate the occurrence and diversity of clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated genes (cas) in C. difficile to assess the potential of CRISPR-based phylogeny and high-resolution genotyping. A single Type-IB CRISPR-Cas system was identified in 217 analyzed genomes with cas gene clusters present at conserved chromosomal locations, suggesting vertical evolution of the system, assessing a total of 1,865 CRISPR arrays. The CRISPR arrays, markedly enriched (8.5 arrays/genome) compared with other species, occur both at conserved and variable locations across strains, and thus provide a basis for typing based on locus occurrence and spacer polymorphism. Clustering of strains by array composition correlated with sequence type (ST) analysis. Spacer content and polymorphism within conserved CRISPR arrays revealed phylogenetic relationship across clades and within ST. Spacer polymorphisms of conserved arrays were instrumental for differentiating closely related strains, e.g., ST1/RT027/B1 strains and pathogenicity locus encoding ST3/RT001 strains. CRISPR spacers showed sequence similarity to phage sequences, which is consistent with the native role of CRISPR-Cas as adaptive immune systems in bacteria. Overall, CRISPR-Cas sequences constitute a valuable basis for genotyping of C. difficile isolates, provide insights into the micro-evolutionary events that occur between closely related strains, and reflect the evolutionary trajectory of these genomes. PMID:27576538
Sequence Effect on the Formation of DNA Minidumbbells.

PubMed

Liu, Yuan; Lam, Sik Lok

2017-11-16

The DNA minidumbbell (MDB) is a recently identified non-B structure. The reported MDBs contain two TTTA, CCTG, or CTTG type II loops. At present, the knowledge and understanding of the sequence criteria for MDB formation are still limited. In this study, we performed a systematic high-resolution nuclear magnetic resonance (NMR) and native gel study to investigate the effect of sequence variations in tandem repeats on the formation of MDBs. Our NMR results reveal the importance of hydrogen bonds, base-base stacking, and hydrophobic interactions from each of the participating residues. We conclude that in the MDBs formed by tandem repeats, C-G loop-closing base pairs are more stabilizing than T-A loop-closing base pairs, and thymine residues in both the second and third loop positions are more stabilizing than cytosine residues. The results from this study enrich our knowledge on the sequence criteria for the formation of MDBs, paving a path for better exploring their potential roles in biological systems and DNA nanotechnology.

BATTLE: Biomarker-Based Approaches of Targeted Therapy for Lung Cancer Elimination

DTIC Science & Technology

2008-04-01

although a grade 3 neutropenia was dose-limiting in one importance. Th th ubstrate of the CYP3A4 isoenzyme and P-gp. Its metabolism is sensitive to...tratification in clinis Molecular Pathway Biomarkers Type of Analysis EGFR EGFR Mutation ( exons 18 to 21) DNA sequencing EGFR Increased Copy Number...polysomy/am 1plification) DNA FISH K-Ras/B-Raf K-RAS Mutation (codons 12,13, 61) DNA sequencing B-RAF Mutations ( exons 11 and 15) DNA sequencing
Enterobacter muelleri sp. nov., isolated from the rhizosphere of Zea mays.

PubMed

Kämpfer, Peter; McInroy, John A; Glaeser, Stefanie P

2015-11-01

A beige-pigmented, oxidase-negative bacterial strain (JM-458T), isolated from a rhizosphere sample, was studied using a polyphasic taxonomic approach. Cells of the isolate were rod-shaped and stained Gram-negative. A comparison of the 16S rRNA gene sequence of strain JM-458T with sequences of the type strains of closely related species of the genus Enterobacter showed that it shared highest sequence similarity with Enterobacter mori (98.7 %), Enterobacter hormaechei (98.3 %), Enterobacter cloacae subsp. dissolvens, Enterobacter ludwigii and Enterobacter asburiae (all 98.2 %). 16S rRNA gene sequence similarities to all other Enterobacter species were below 98 %. Multilocus sequence analysis based on concatenated partial rpoB, gyrB, infB and atpD gene sequences showed a clear distinction of strain JM-458T from its closest related type strains. The fatty acid profile of the strain consisted of C16 : 0, C17 : 0 cyclo, iso-C15 : 0 2-OH/C16 : 1ω7c and C18 : 1ω7c as major components. DNA-DNA hybridizations between strain JM-458T and the type strains of E. mori, E. hormaechei and E. ludwigii resulted in relatedness values of 29 % (reciprocal 25 %), 24 % (reciprocal 43 %) and 16 % (reciprocal 17 %), respectively. DNA-DNA hybridization results together with multilocus sequence analysis results and differential biochemical and chemotaxonomic properties showed that strain JM-458T represents a novel species of the genus Enterobacter, for which the name Enterobacter muelleri sp. nov. is proposed. The type strain is JM-458T ( = DSM 29346T = CIP 110826T = LMG 28480T = CCM 8546T).
Lactobacillus allii sp. nov. isolated from scallion kimchi.

PubMed

Jung, Min Young; Lee, Se Hee; Lee, Moeun; Song, Jung Hee; Chang, Ji Yoon

2017-12-01

A novel strain of lactic acid bacteria, WiKim39 T , was isolated from a scallion kimchi sample consisting of fermented chili peppers and vegetables. The isolate was a Gram-positive, rod-shaped, non-motile, catalase-negative and facultatively anaerobic lactic acid bacterium. Phylogenetic analysis of the 16S rRNA gene sequence showed that strain WiKim39 T belonged to the genus Lactobacillus, and shared 97.1-98.2 % pair-wise sequence similarities with related type strains, Lactobacillus nodensis, Lactobacillus insicii, Lactobacillus versmoldensis, Lactobacillus tucceti and Lactobacillus furfuricola. The G+C content of the strain based on its genome sequence was 35.3 mol%. The ANI values between WiKim39 T and the closest relatives were lower than 80 %. Based on the phenotypic, biochemical, and phylogenetic analyses, strain WiKim39 T represents a novel species of the genus Lactobacillus, for which the name Lactobacillus allii sp. nov. is proposed. The type strain is WiKim39 T (=KCTC 21077 T =JCM 31938 T ).
Lactobacillus allii sp. nov. isolated from scallion kimchi

PubMed Central

Jung, Min Young; Lee, Se Hee; Lee, Moeun; Song, Jung Hee; Chang, Ji Yoon

2017-01-01

A novel strain of lactic acid bacteria, WiKim39T, was isolated from a scallion kimchi sample consisting of fermented chili peppers and vegetables. The isolate was a Gram-positive, rod-shaped, non-motile, catalase-negative and facultatively anaerobic lactic acid bacterium. Phylogenetic analysis of the 16S rRNA gene sequence showed that strain WiKim39T belonged to the genus Lactobacillus, and shared 97.1–98.2 % pair-wise sequence similarities with related type strains, Lactobacillus nodensis, Lactobacillus insicii, Lactobacillus versmoldensis, Lactobacillus tucceti and Lactobacillus furfuricola. The G+C content of the strain based on its genome sequence was 35.3 mol%. The ANI values between WiKim39T and the closest relatives were lower than 80 %. Based on the phenotypic, biochemical, and phylogenetic analyses, strain WiKim39T represents a novel species of the genus Lactobacillus, for which the name Lactobacillus allii sp. nov. is proposed. The type strain is WiKim39T (=KCTC 21077T=JCM 31938T). PMID:29043955
Mitogenic effect contributes to increased virulence of Streptococcus suis sequence type 7 to cause streptococcal toxic shock-like syndrome.

PubMed

Zheng, H; Ye, C; Segura, M; Gottschalk, M; Xu, J

2008-09-01

Streptococcus suis serotype 2 sequence type 7 strains emerged in 1996 and caused a streptococcal toxic shock-like syndrome in 1998 and 2005 in China. Evidence indicated that the virulence of S. suis sequence type 7 had increased, but the mechanism was unknown. The sequence type 7 strain SC84, isolated from a patient with streptococcal toxic shock-like syndrome during the Sichuan outbreak, and the sequence type 1 strain 31533, a typical highly pathogenic strain isolated from a diseased pig, were used in comparative studies. In this study we show the mechanisms underlying cytokine production differed between the two types of strains. The S. suis sequence type 7 strain SC84 possesses a stronger capacity to stimulate T cells, naive T cells and peripheral blood mononuclear cell proliferation than does S. suis sequence type 1 strain 31533. The T cell response to both strains was dependent upon the presence of antigen-presenting cells. Histo-incompatible antigen-presenting cells were sufficient to provide the accessory signals to naive T cell stimulated by the two strains, indicating that both sequence type 7 and 1 strains possess mitogens; however, the mitogenic effect was different. Therefore, we propose that the difference in the mitogenic effect of sequence type 7 strain SC84 compared with the sequence type 1 strain 31533 of S. suis may be associated with the clinical, epidemiological and microbiological difference, where the ST 7 strains have a larger mitogenic effect.
Mitogenic effect contributes to increased virulence of Streptococcus suis sequence type 7 to cause streptococcal toxic shock-like syndrome

PubMed Central

Zheng, H; Ye, C; Segura, M; Gottschalk, M; Xu, J

2008-01-01

Streptococcus suis serotype 2 sequence type 7 strains emerged in 1996 and caused a streptococcal toxic shock-like syndrome in 1998 and 2005 in China. Evidence indicated that the virulence of S. suis sequence type 7 had increased, but the mechanism was unknown. The sequence type 7 strain SC84, isolated from a patient with streptococcal toxic shock-like syndrome during the Sichuan outbreak, and the sequence type 1 strain 31533, a typical highly pathogenic strain isolated from a diseased pig, were used in comparative studies. In this study we show the mechanisms underlying cytokine production differed between the two types of strains. The S. suis sequence type 7 strain SC84 possesses a stronger capacity to stimulate T cells, naive T cells and peripheral blood mononuclear cell proliferation than does S. suis sequence type 1 strain 31533. The T cell response to both strains was dependent upon the presence of antigen-presenting cells. Histo-incompatible antigen-presenting cells were sufficient to provide the accessory signals to naive T cell stimulated by the two strains, indicating that both sequence type 7 and 1 strains possess mitogens; however, the mitogenic effect was different. Therefore, we propose that the difference in the mitogenic effect of sequence type 7 strain SC84 compared with the sequence type 1 strain 31533 of S. suis may be associated with the clinical, epidemiological and microbiological difference, where the ST 7 strains have a larger mitogenic effect. PMID:18803762
Giardia telomeric sequence d(TAGGG)4 forms two intramolecular G-quadruplexes in K+ solution: effect of loop length and sequence on the folding topology.

PubMed

Hu, Lanying; Lim, Kah Wai; Bouaziz, Serge; Phan, Anh Tuân

2009-11-25

Recently, it has been shown that in K(+) solution the human telomeric sequence d[TAGGG(TTAGGG)(3)] forms a (3 + 1) intramolecular G-quadruplex, while the Bombyx mori telomeric sequence d[TAGG(TTAGG)(3)], which differs from the human counterpart only by one G deletion in each repeat, forms a chair-type intramolecular G-quadruplex, indicating an effect of G-tract length on the folding topology of G-quadruplexes. To explore the effect of loop length and sequence on the folding topology of G-quadruplexes, here we examine the structure of the four-repeat Giardia telomeric sequence d[TAGGG(TAGGG)(3)], which differs from the human counterpart only by one T deletion within the non-G linker in each repeat. We show by NMR that this sequence forms two different intramolecular G-quadruplexes in K(+) solution. The first one is a novel basket-type antiparallel-stranded G-quadruplex containing two G-tetrads, a G x (A-G) triad, and two A x T base pairs; the three loops are consecutively edgewise-diagonal-edgewise. The second one is a propeller-type parallel-stranded G-quadruplex involving three G-tetrads; the three loops are all double-chain-reversal. Recurrence of several structural elements in the observed structures suggests a "cut and paste" principle for the design and prediction of G-quadruplex topologies, for which different elements could be extracted from one G-quadruplex and inserted into another.
Analysis of the genome-wide variations among multiple strains of the plant pathogenic bacterium Xylella fastidiosa

PubMed Central

Doddapaneni, Harshavardhan; Yao, Jiqiang; Lin, Hong; Walker, M Andrew; Civerolo, Edwin L

2006-01-01

Background The Gram-negative, xylem-limited phytopathogenic bacterium Xylella fastidiosa is responsible for causing economically important diseases in grapevine, citrus and many other plant species. Despite its economic impact, relatively little is known about the genomic variations among strains isolated from different hosts and their influence on the population genetics of this pathogen. With the availability of genome sequence information for four strains, it is now possible to perform genome-wide analyses to identify and categorize such DNA variations and to understand their influence on strain functional divergence. Results There are 1,579 genes and 194 non-coding homologous sequences present in the genomes of all four strains, representing a 76. 2% conservation of the sequenced genome. About 60% of the X. fastidiosa unique sequences exist as tandem gene clusters of 6 or more genes. Multiple alignments identified 12,754 SNPs and 14,449 INDELs in the 1528 common genes and 20,779 SNPs and 10,075 INDELs in the 194 non-coding sequences. The average SNP frequency was 1.08 × 10-2 per base pair of DNA and the average INDEL frequency was 2.06 × 10-2 per base pair of DNA. On an average, 60.33% of the SNPs were synonymous type while 39.67% were non-synonymous type. The mutation frequency, primarily in the form of external INDELs was the main type of sequence variation. The relative similarity between the strains was discussed according to the INDEL and SNP differences. The number of genes unique to each strain were 60 (9a5c), 54 (Dixon), 83 (Ann1) and 9 (Temecula-1). A sub-set of the strain specific genes showed significant differences in terms of their codon usage and GC composition from the native genes suggesting their xenologous origin. Tandem repeat analysis of the genomic sequences of the four strains identified associations of repeat sequences with hypothetical and phage related functions. Conclusion INDELs and strain specific genes have been identified as the main source of variations among strains, with individual strains showing different rates of genome evolution. Based on these genome comparisons, it appears that the Pierce's disease strain Temecula-1 genome represents the ancestral genome of the X. fastidiosa. Results of this analysis are publicly available in the form of a web database. PMID:16948851
Slider--maximum use of probability information for alignment of short sequence reads and SNP detection.

PubMed

Malhis, Nawar; Butterfield, Yaron S N; Ester, Martin; Jones, Steven J M

2009-01-01

A plethora of alignment tools have been created that are designed to best fit different types of alignment conditions. While some of these are made for aligning Illumina Sequence Analyzer reads, none of these are fully utilizing its probability (prb) output. In this article, we will introduce a new alignment approach (Slider) that reduces the alignment problem space by utilizing each read base's probabilities given in the prb files. Compared with other aligners, Slider has higher alignment accuracy and efficiency. In addition, given that Slider matches bases with probabilities other than the most probable, it significantly reduces the percentage of base mismatches. The result is that its SNP predictions are more accurate than other SNP prediction approaches used today that start from the most probable sequence, including those using base quality.
Questioning short-term memory and its measurement: Why digit span measures long-term associative learning.

PubMed

Jones, Gary; Macken, Bill

2015-11-01

Traditional accounts of verbal short-term memory explain differences in performance for different types of verbal material by reference to inherent characteristics of the verbal items making up memory sequences. The role of previous experience with sequences of different types is ostensibly controlled for either by deliberate exclusion or by presenting multiple trials constructed from different random permutations. We cast doubt on this general approach in a detailed analysis of the basis for the robust finding that short-term memory for digit sequences is superior to that for other sequences of verbal material. Specifically, we show across four experiments that this advantage is not due to inherent characteristics of digits as verbal items, nor are individual digits within sequences better remembered than other types of individual verbal items. Rather, the advantage for digit sequences stems from the increased frequency, compared to other verbal material, with which digits appear in random sequences in natural language, and furthermore, relatively frequent digit sequences support better short-term serial recall than less frequent ones. We also provide corpus-based computational support for the argument that performance in a short-term memory setting is a function of basic associative learning processes operating on the linguistic experience of the rememberer. The experimental and computational results raise questions not only about the role played by measurement of digit span in cognition generally, but also about the way in which long-term memory processes impact on short-term memory functioning. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Development and application of a multilocus sequence analysis method for the identification of genotypes within genus Bradyrhizobium and for establishing nodule occupancy of soybean (Glycine max L. Merr)

USDA-ARS?s Scientific Manuscript database

A Multilocus Sequence Typing (MLST) method based on allelic variation of 7 chromosomal loci was developed for characterizing genotypes within the genus Bradyrhizobium. With the method 29 distinct multilocus genotypes (GTs) were identified among 191 culture collection soybean strains. The occupancy ...
DNA-Sequence Based Typing of the Cronobacter Genus Using MLST, CRISPR-cas Array and Capsular Profiling

PubMed Central

Ogrodzki, Pauline; Forsythe, Stephen J.

2017-01-01

The Cronobacter genus is composed of seven species, within which a number of pathovars have been described. The most notable infections by Cronobacter spp. are of infants through the consumption of contaminated infant formula. The description of the genus has greatly improved in recent years through DNA sequencing techniques, and this has led to a robust means of identification. However some species are highly clonal and this limits the ability to discriminate between unrelated strains by some methods of genotyping. This article updates the application of three genotyping methods across the Cronobacter genus. The three genotyping methods were multilocus sequence typing (MLST), capsular profiling of the K-antigen and colanic acid (CA) biosynthesis regions, and CRISPR-cas array profiling. A total of 1654 MLST profiled and 286 whole genome sequenced strains, available by open access at the PubMLST Cronobacter database, were used this analysis. The predominance of C. sakazakii and C. malonaticus in clinical infections was confirmed. The majority of clinical strains being in the C. sakazakii clonal complexes (CC) 1 and 4, sequence types (ST) 8 and 12 and C. malonaticus ST7. The capsular profile K2:CA2, previously proposed as being strongly associated with C. sakazakii and C. malonaticus isolates from severe neonatal infections, was also found in C. turicensis, C. dublinensis and C. universalis. The majority of CRISPR-cas types across the genus was the I-E (Ecoli) type. Some strains of C. dublinensis and C. muytjensii encoded the I-F (Ypseudo) type, and others lacked the cas gene loci. The significance of the expanding profiling will be of benefit to researchers as well as governmental and industrial risk assessors. PMID:29033918
Identification and functional activity of a staphylocoagulase type XI variant originating from staphylococcal food poisoning isolates.

PubMed

Suzuki, Y; Matsushita, S; Kubota, H; Kobayashi, M; Murauchi, K; Higuchi, Y; Kato, R; Hirai, A; Sadamasu, K

2016-09-01

Staphylocoagulase, an extracellular protein secreted by Staphylococcus aureus, has been used as an epidemiological marker. At least 12 serotypes and 24 genotypes subdivided on the basis of nucleotide sequence have been reported to date. In this study, we identified a novel staphylocoagulase nucleotide sequence, coa310, from staphylococcal food poisoning isolates that had the ability to coagulate plasma, but could not be typed using the conventional method. The protein encoded by coa310 contained the six fundamental conserved domains of staphylocoagulase. The full-length nucleotide sequence of coa310 shared the highest similarity (77·5%) with that of staphylocoagulase-type (SCT) XIa. The sequence of the D1 region, which would be responsible for the determination of SCT, shared the highest similarity (91·8%) with that of SCT XIa. These results suggest that coa310 is a novel variant of SCT XI. Moreover, we demonstrated that coa310 encodes a functioning coagulase, by confirming the coagulating activity of the recombinant protein expressed from coa310. This is the first study to directly demonstrate that Coa310, a putative SCT XI, has coagulating activity. These findings may be useful for the improvement of the staphylocoagulase-typing method, including serotyping and genotyping. This is the first study to identify a novel variant of staphylocoagulase type XI based on its nucleotide sequence and to demonstrate coagulating activity in the variant using a recombinant protein. Elucidation of the variety of staphylocoagulases will provide suggestions for further improvement of the staphylocoagulase-typing method and contribute to our understanding of the epidemiologic characterization of Staphylococcus aureus. © 2016 The Society for Applied Microbiology.
Component identification of electron transport chains in curdlan-producing Agrobacterium sp. ATCC 31749 and its genome-specific prediction using comparative genome and phylogenetic trees analysis.

PubMed

Zhang, Hongtao; Setubal, Joao Carlos; Zhan, Xiaobei; Zheng, Zhiyong; Yu, Lijun; Wu, Jianrong; Chen, Dingqiang

2011-06-01

Agrobacterium sp. ATCC 31749 (formerly named Alcaligenes faecalis var. myxogenes) is a non-pathogenic aerobic soil bacterium used in large scale biotechnological production of curdlan. However, little is known about its genomic information. DNA partial sequence of electron transport chains (ETCs) protein genes were obtained in order to understand the components of ETC and genomic-specificity in Agrobacterium sp. ATCC 31749. Degenerate primers were designed according to ETC conserved sequences in other reported species. DNA partial sequences of ETC genes in Agrobacterium sp. ATCC 31749 were cloned by the PCR method using degenerate primers. Based on comparative genomic analysis, nine electron transport elements were ascertained, including NADH ubiquinone oxidoreductase, succinate dehydrogenase complex II, complex III, cytochrome c, ubiquinone biosynthesis protein ubiB, cytochrome d terminal oxidase, cytochrome bo terminal oxidase, cytochrome cbb (3)-type terminal oxidase and cytochrome caa (3)-type terminal oxidase. Similarity and phylogenetic analyses of these genes revealed that among fully sequenced Agrobacterium species, Agrobacterium sp. ATCC 31749 is closest to Agrobacterium tumefaciens C58. Based on these results a comprehensive ETC model for Agrobacterium sp. ATCC 31749 is proposed.
Lactobacillus heilongjiangensis sp. nov., isolated from Chinese pickle.

PubMed

Gu, Chun Tao; Li, Chun Yan; Yang, Li Jie; Huo, Gui Cheng

2013-11-01

A Gram-stain-positive bacterial strain, S4-3(T), was isolated from traditional pickle in Heilongjiang Province, China. The bacterium was characterized by a polyphasic approach, including 16S rRNA gene sequence analysis, pheS gene sequence analysis, rpoA gene sequence analysis, dnaK gene sequence analysis, fatty acid methyl ester (FAME) analysis, determination of DNA G+C content, DNA-DNA hybridization and an analysis of phenotypic features. Strain S4-3(T) showed 97.9-98.7 % 16S rRNA gene sequence similarities, 84.4-94.1 % pheS gene sequence similarities and 94.4-96.9 % rpoA gene sequence similarities to the type strains of Lactobacillus nantensis, Lactobacillus mindensis, Lactobacillus crustorum, Lactobacillus futsaii, Lactobacillus farciminis and Lactobacillus kimchiensis. dnaK gene sequence similarities between S4-3(T) and Lactobacillus nantensis LMG 23510(T), Lactobacillus mindensis LMG 21932(T), Lactobacillus crustorum LMG 23699(T), Lactobacillus futsaii JCM 17355(T) and Lactobacillus farciminis LMG 9200(T) were 95.4, 91.5, 90.4, 91.7 and 93.1 %, respectively. Based upon the data obtained in the present study, a novel species, Lactobacillus heilongjiangensis sp. nov., is proposed and the type strain is S4-3(T) ( = LMG 26166(T) = NCIMB 14701(T)).
Vertical transmission of highly similar blaCTX-M-1-harboring IncI1 plasmids in Escherichia coli with different MLST types in the poultry production pyramid

PubMed Central

Zurfluh, Katrin; Wang, Juan; Klumpp, Jochen; Nüesch-Inderbinen, Magdalena; Fanning, Séamus; Stephan, Roger

2014-01-01

Objectives: The purpose of this study was to characterize sets of extended-spectrum β-lactamases (ESBL)-producing Enterobacteriaceae collected longitudinally from different flocks of broiler breeders, meconium of 1-day-old broilers from theses breeder flocks, as well as from these broiler flocks before slaughter. Methods: Five sets of ESBL-producing Escherichia coli were studied by multi-locus sequence typing (MLST), phylogenetic grouping, PCR-based replicon typing and resistance profiling. The blaCTX-M-1-harboring plasmids of one set (pHV295.1, pHV114.1, and pHV292.1) were fully sequenced and subjected to comparative analysis. Results: Eleven different MLST sequence types (ST) were identified with ST1056 the predominant one, isolated in all five sets either on the broiler breeder or meconium level. Plasmid sequencing revealed that blaCTX-M-1 was carried by highly similar IncI1/ST3 plasmids that were 105 076 bp, 110 997 bp, and 117 269 bp in size, respectively. Conclusions: The fact that genetically similar IncI1/ST3 plasmids were found in ESBL-producing E. coli of different MLST types isolated at the different levels in the broiler production pyramid provides strong evidence for a vertical transmission of these plasmids from a common source (nucleus poultry flocks). PMID:25324838
Vertical transmission of highly similar bla CTX-M-1-harboring IncI1 plasmids in Escherichia coli with different MLST types in the poultry production pyramid.

PubMed

Zurfluh, Katrin; Wang, Juan; Klumpp, Jochen; Nüesch-Inderbinen, Magdalena; Fanning, Séamus; Stephan, Roger

2014-01-01

The purpose of this study was to characterize sets of extended-spectrum β-lactamases (ESBL)-producing Enterobacteriaceae collected longitudinally from different flocks of broiler breeders, meconium of 1-day-old broilers from theses breeder flocks, as well as from these broiler flocks before slaughter. Five sets of ESBL-producing Escherichia coli were studied by multi-locus sequence typing (MLST), phylogenetic grouping, PCR-based replicon typing and resistance profiling. The bla CTX-M-1-harboring plasmids of one set (pHV295.1, pHV114.1, and pHV292.1) were fully sequenced and subjected to comparative analysis. Eleven different MLST sequence types (ST) were identified with ST1056 the predominant one, isolated in all five sets either on the broiler breeder or meconium level. Plasmid sequencing revealed that bla CTX-M-1 was carried by highly similar IncI1/ST3 plasmids that were 105 076 bp, 110 997 bp, and 117 269 bp in size, respectively. The fact that genetically similar IncI1/ST3 plasmids were found in ESBL-producing E. coli of different MLST types isolated at the different levels in the broiler production pyramid provides strong evidence for a vertical transmission of these plasmids from a common source (nucleus poultry flocks).
Role of Modular Polyketide Synthases in the Production of Polyether Ladder Compounds in Ciguatoxin-Producing Gambierdiscus polynesiensis and G. excentricus (Dinophyceae).

PubMed

Kohli, Gurjeet S; Campbell, Katrina; John, Uwe; Smith, Kirsty F; Fraga, Santiago; Rhodes, Lesley L; Murray, Shauna A

2017-09-01

Gambierdiscus, a benthic dinoflagellate, produces ciguatoxins that cause the human illness Ciguatera. Ciguatoxins are polyether ladder compounds that have a polyketide origin, indicating that polyketide synthases (PKS) are involved in their production. We sequenced transcriptomes of Gambierdiscus excentricus and Gambierdiscus polynesiensis and found 264 contigs encoding single domain ketoacyl synthases (KS; G. excentricus: 106, G. polynesiensis: 143) and ketoreductases (KR; G. excentricus: 7, G. polynesiensis: 8) with sequence similarity to type I PKSs, as reported in other dinoflagellates. In addition, 24 contigs (G. excentricus: 3, G. polynesiensis: 21) encoding multiple PKS domains (forming typical type I PKSs modules) were found. The proposed structure produced by one of these megasynthases resembles a partial carbon backbone of a polyether ladder compound. Seventeen contigs encoding single domain KS, KR, s-malonyltransacylase, dehydratase and enoyl reductase with sequence similarity to type II fatty acid synthases (FAS) in plants were found. Type I PKS and type II FAS genes were distinguished based on the arrangement of domains on the contigs and their sequence similarity and phylogenetic clustering with known PKS/FAS genes in other organisms. This differentiation of PKS and FAS pathways in Gambierdiscus is important, as it will facilitate approaches to investigating toxin biosynthesis pathways in dinoflagellates. © 2017 The Author(s) Journal of Eukaryotic Microbiology © 2017 International Society of Protistologists.
Methanotroph Diversity in Landfill Soil: Isolation of Novel Type I and Type II Methanotrophs Whose Presence Was Suggested by Culture-Independent 16S Ribosomal DNA Analysis

PubMed Central

Wise, Mark G.; McArthur, J Vaun; Shimkets, Lawrence J.

1999-01-01

The diversity of the methanotrophic community in mildly acidic landfill cover soil was assessed by three methods: two culture-independent molecular approaches and a traditional culture-based approach. For the first of the molecular studies, two primer pairs specific for the 16S rRNA gene of validly published type I (including the former type X) and type II methanotrophs were identified and tested. These primers were used to amplify directly extracted soil DNA, and the products were used to construct type I and type II clone libraries. The second molecular approach, based on denaturing gradient gel electrophoresis (DGGE), provided profiles of the methanotrophic community members as distinguished by sequence differences in variable region 3 of the 16S ribosomal DNA. For the culturing studies, an extinction-dilution technique was employed to isolate slow-growing but numerically dominant strains. The key variables of the series of enrichment conditions were initial pH (4.8 versus 6.8), air/CH4/CO2 headspace ratio (50:45:5 versus 90:9:1), and concentration of the medium (1× nitrate minimal salts [NMS] versus 0.2× NMS). Screening of the isolates showed that the nutrient-rich 1× NMS selected for type I methanotrophs, while the nutrient-poor 0.2× NMS tended to enrich for type II methanotrophs. Partial sequencing of the 16S rRNA gene from selected clones and isolates revealed some of the same novel sequence types. Phylogenetic analysis of the type I clone library suggested the presence of a new phylotype related to the Methylobacter-Methylomicrobium group, and this was confirmed by isolating two members of this cluster. The type II clone library also suggested the existence of a novel group of related species distinct from the validly published Methylosinus and Methylocystis genera, and two members of this cluster were also successfully cultured. Partial sequencing of the pmoA gene, which codes for the 27-kDa polypeptide of the particulate methane monooxygenase, reaffirmed the phylogenetic placement of the four isolates. Finally, not all of the bands separated by DGGE could be accounted for by the clones and isolates. This polyphasic assessment of community structure demonstrates that much diversity among the obligate methane oxidizers has yet to be formally described. PMID:10543800
Molecular cloning and nucleotide sequence of the alpha and beta subunits of allophycocyanin from the cyanelle genome of Cyanophora paradoxa.

PubMed Central

Bryant, D A; de Lorimier, R; Lambert, D H; Dubbs, J M; Stirewalt, V L; Stevens, S E; Porter, R D; Tam, J; Jay, E

1985-01-01

The genes for the alpha- and beta-subunit apoproteins of allophycocyanin (AP) were isolated from the cyanelle genome of Cyanophora paradoxa and subjected to nucleotide sequence analysis. The AP beta-subunit apoprotein gene was localized to a 7.8-kilobase-pair Pst I restriction fragment from cyanelle DNA by hybridization with a tetradecameric oligonucleotide probe. Sequence analysis using that oligonucleotide and its complement as primers for the dideoxy chain-termination sequencing method confirmed the presence of both AP alpha- and beta-subunit genes on this restriction fragment. Additional oligonucleotide primers were synthesized as sequencing progressed and were used to determine rapidly the nucleotide sequence of a 1336-base-pair region of this cloned fragment. This strategy allowed the sequencing to be completed without a detailed restriction map and without extensive and time-consuming subcloning. The sequenced region contains two open reading frames whose deduced amino acid sequences are 81-85% homologous to cyanobacterial and red algal AP subunits whose amino acid sequences have been determined. The two open reading frames are in the same orientation and are separated by 39 base pairs. AP alpha is 5' to AP beta and both coding sequences are preceded by a polypurine, Shine-Dalgarno-type sequence. Sequences upstream from AP alpha closely resemble the Escherichia coli consensus promoter sequences and also show considerable homology to promoter sequences for several chloroplast-encoded psbA genes. A 56-base-pair palindromic sequence downstream from the AP beta gene could play a role in the termination of transcription or translation. The allophycocyanin apoprotein subunit genes are located on the large single-copy region of the cyanelle genome. PMID:2987916

An enrichment, amplification, and sequence-based typing (EAST) approach for foodborne pathogen detection and surveillance

USDA-ARS?s Scientific Manuscript database

Introduction: Detection of foodborne pathogens typically involves microbiological enrichment with subsequent isolation and identification of a pure culture. This is typically followed by strain typing, which provides information critical to outbreak and source investigations. In the early 1990’s pul...
Reference karyotype and cytomolecular map for loblolly pine (Pinus taeda L.)

Treesearch

M. Nurul Islam-faridi; C. Dana Nelson; Thomas L. Kubisiak

2007-01-01

A reference karyotype is presented for loblolly pine (Pinus taeda L., subgenus Pinus , section Pinus, subsection Australes), based on fluorescent in situ hybridization (FISH), using 18s-28s rDNA, 5s rDNA, and Arabidopsis-type telomere repeat sequence (A-type TRS). Well...
HAlign-II: efficient ultra-large multiple sequence alignment and phylogenetic tree reconstruction with distributed and parallel computing.

PubMed

Wan, Shixiang; Zou, Quan

2017-01-01

Multiple sequence alignment (MSA) plays a key role in biological sequence analyses, especially in phylogenetic tree construction. Extreme increase in next-generation sequencing results in shortage of efficient ultra-large biological sequence alignment approaches for coping with different sequence types. Distributed and parallel computing represents a crucial technique for accelerating ultra-large (e.g. files more than 1 GB) sequence analyses. Based on HAlign and Spark distributed computing system, we implement a highly cost-efficient and time-efficient HAlign-II tool to address ultra-large multiple biological sequence alignment and phylogenetic tree construction. The experiments in the DNA and protein large scale data sets, which are more than 1GB files, showed that HAlign II could save time and space. It outperformed the current software tools. HAlign-II can efficiently carry out MSA and construct phylogenetic trees with ultra-large numbers of biological sequences. HAlign-II shows extremely high memory efficiency and scales well with increases in computing resource. THAlign-II provides a user-friendly web server based on our distributed computing infrastructure. HAlign-II with open-source codes and datasets was established at http://lab.malab.cn/soft/halign.
Molecular biological researches of Kuro-Koji molds, their classification and safety.

PubMed

Yamada, Osamu; Takara, Ryo; Hamada, Ryoko; Hayashi, Risa; Tsukahara, Masatoshi; Mikami, Shigeaki

2011-09-01

To assess the position of Kuro-Koji molds in black Aspergillus, we performed sequence analysis of approximately 2500 nucleotides of partial gene fragments, such as histone 3, on a total of 57 Aspergillus strains, including Aspergillus kawachii NBRC 4308, 12 Kuro-Koji molds isolated from awamori breweries in Japan, Aspergillus niger ATCC 1015, and A. tubingensis ATCC10550. Sequence results showed that all black Aspergillus strains could be classified into 3 types, type N which includes A. niger ATCC 1015, type T which includes A. tubingensis ATCC 10550, and type L which includes A. kawachii NBRC 4308. Phylogenetic analysis showed these three types belong to different clusters. All 12 Kuro-Koji molds isolated from awamori breweries were classified as type L, thus we concluded type L represents the industrial Kuro-Koji molds. We found all type L strains lack the An15g07920 gene which is required for ochratoxin A biosynthesis in black Aspergillus. This sequence is present in the genome of A. niger CBS 513.88 and has homology to the polyketide synthase fragment of A. ochraceus which is involved in ochratoxin A biosynthesis. Based on the industrial importance and the safety of Kuro-Koji molds, we propose to classify the type L strains as Aspergillus luchuensis, as initially reported by Dr. Inui. Copyright © 2011 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
An Outbreak of Streptococcus pyogenes in a Mental Health Facility: Advantage of Well-Timed Whole-Genome Sequencing Over emm Typing.

PubMed

Bergin, Sarah M; Periaswamy, Balamurugan; Barkham, Timothy; Chua, Hong Choon; Mok, Yee Ming; Fung, Daniel Shuen Sheng; Su, Alex Hsin Chuan; Lee, Yen Ling; Chua, Ming Lai Ivan; Ng, Poh Yong; Soon, Wei Jia Wendy; Chu, Collins Wenhan; Tan, Siyun Lucinda; Meehan, Mary; Ang, Brenda Sze Peng; Leo, Yee Sin; Holden, Matthew T G; De, Partha; Hsu, Li Yang; Chen, Swaine L; de Sessions, Paola Florez; Marimuthu, Kalisvar

2018-05-09

OBJECTIVEWe report the utility of whole-genome sequencing (WGS) conducted in a clinically relevant time frame (ie, sufficient for guiding management decision), in managing a Streptococcus pyogenes outbreak, and present a comparison of its performance with emm typing.SETTINGA 2,000-bed tertiary-care psychiatric hospital.METHODSActive surveillance was conducted to identify new cases of S. pyogenes. WGS guided targeted epidemiological investigations, and infection control measures were implemented. Single-nucleotide polymorphism (SNP)-based genome phylogeny, emm typing, and multilocus sequence typing (MLST) were performed. We compared the ability of WGS and emm typing to correctly identify person-to-person transmission and to guide the management of the outbreak.RESULTSThe study included 204 patients and 152 staff. We identified 35 patients and 2 staff members with S. pyogenes. WGS revealed polyclonal S. pyogenes infections with 3 genetically distinct phylogenetic clusters (C1-C3). Cluster C1 isolates were all emm type 4, sequence type 915 and had pairwise SNP differences of 0-5, which suggested recent person-to-person transmissions. Epidemiological investigation revealed that cluster C1 was mediated by dermal colonization and transmission of S. pyogenes in a male residential ward. Clusters C2 and C3 were genomically diverse, with pairwise SNP differences of 21-45 and 26-58, and emm 11 and mostly emm120, respectively. Clusters C2 and C3, which may have been considered person-to-person transmissions by emm typing, were shown by WGS to be unlikely by integrating pairwise SNP differences with epidemiology.CONCLUSIONSWGS had higher resolution than emm typing in identifying clusters with recent and ongoing person-to-person transmissions, which allowed implementation of targeted intervention to control the outbreak.Infect Control Hosp Epidemiol 2018;1-9.
TaALMT1 promoter sequence compositions, acid tolerance, and Al tolerance in wheat cultivars and landraces from Sichuan in China.

PubMed

Han, C; Dai, S F; Liu, D C; Pu, Z J; Wei, Y M; Zheng, Y L; Wen, D J; Zhao, L; Yan, Z H

2013-11-18

Previous genetic studies on wheat from various sources have indicated that aluminum (Al) tolerance may have originated independently in USA, Brazil, and China. Here, TaALMT1 promoter sequences of 92 landraces and cultivars from Sichuan, China, were sequenced. Five promoter types (I', II, III, IV, and V) were observed in 39 cultivars, and only three promoter types (I, II, and III) were observed in 53 landraces. Among the wheat collections worldwide, only the Chinese Spring (CS) landrace native to Sichuan, China, carried the TaALMT1 promoter type III. Besides CS, two other Sichuan-bred landraces and six cultivars with TaALMT1 promoter type III were identified in this study. In the phylogenetic tree constructed based on the TaALMT1 promoter sequences, type III formed a separate branch, which was supported by a high bootstrap value. It is likely that TaALMT1 promoter type III originated from Sichuan-bred wheat landraces of China. In addition, the landraces with promoter type I showed the lowest Al tolerance among all landraces and cultivars. Furthermore, the cultivars with promoter type IV showed better Al tolerance than landraces with promoter type II. A comparison of acid tolerance and Al tolerance between cultivars and landraces showed that the landraces had better acid tolerance than the cultivars, whereas the cultivars showed better Al tolerance than the landraces. Moreover, significant difference in Al tolerance was also observed between the cultivars raised by the National Ministry of Agriculture and by Sichuan Province. Among the landraces from different regions, those from the East showed better acid tolerance and Al tolerance than those from the South and West of Sichuan. Additional Al-tolerant and acid-tolerant wheat lines were also identified.
Parallel algorithms for large-scale biological sequence alignment on Xeon-Phi based clusters.

PubMed

Lan, Haidong; Chan, Yuandong; Xu, Kai; Schmidt, Bertil; Peng, Shaoliang; Liu, Weiguo

2016-07-19

Computing alignments between two or more sequences are common operations frequently performed in computational molecular biology. The continuing growth of biological sequence databases establishes the need for their efficient parallel implementation on modern accelerators. This paper presents new approaches to high performance biological sequence database scanning with the Smith-Waterman algorithm and the first stage of progressive multiple sequence alignment based on the ClustalW heuristic on a Xeon Phi-based compute cluster. Our approach uses a three-level parallelization scheme to take full advantage of the compute power available on this type of architecture; i.e. cluster-level data parallelism, thread-level coarse-grained parallelism, and vector-level fine-grained parallelism. Furthermore, we re-organize the sequence datasets and use Xeon Phi shuffle operations to improve I/O efficiency. Evaluations show that our method achieves a peak overall performance up to 220 GCUPS for scanning real protein sequence databanks on a single node consisting of two Intel E5-2620 CPUs and two Intel Xeon Phi 7110P cards. It also exhibits good scalability in terms of sequence length and size, and number of compute nodes for both database scanning and multiple sequence alignment. Furthermore, the achieved performance is highly competitive in comparison to optimized Xeon Phi and GPU implementations. Our implementation is available at https://github.com/turbo0628/LSDBS-mpi .
Bifidobacterium reuteri sp. nov., Bifidobacterium callitrichos sp. nov., Bifidobacterium saguini sp. nov., Bifidobacterium stellenboschense sp. nov. and Bifidobacterium biavatii sp. nov. isolated from faeces of common marmoset (Callithrix jacchus) and red-handed tamarin (Saguinus midas).

PubMed

Endo, Akihito; Futagawa-Endo, Yuka; Schumann, Peter; Pukall, Rüdiger; Dicks, Leon M T

2012-03-01

Five strains of bifidobacteria were isolated from faeces of a common marmoset (Callithrix jacchus) and a red-handed tamarin (Saguinus midas). The five isolates clustered inside the phylogenetic group of the genus Bifidobacterium but did not show high sequence similarities between the isolates and to known species in the genus by phylogenetic analysis based on 16S rRNA gene sequences. Sequence analyses of dnaJ1 and hsp60 also indicated their independent phylogenetic positions to each other in the Bifidobacterium cluster. DNA G+C contents of the species ranged from 57.3 to 66.3 mol%, which is within the values recorded for Bifidobacterium species. All isolates showed fructose-6-phosphate phosphoketolase activity. Based on the data provided, the five isolates represent five novel species, for which the names Bifidobacterium reuteri sp. nov. (type strain: AFB22-1(T) = JCM 17295(T) = DSM 23975(T)), Bifidobacterium callitrichos sp. nov. (type strain: AFB22-5(T) = JCM 17296(T) = DSM 23973(T)), Bifidobacterium saguini sp. nov. (type strain: AFB23-1(T) = JCM 17297(T) = DSM 23967(T)), Bifidobacterium stellenboschense sp. nov. (type strain: AFB23-3(T) = JCM 17298(T) = DSM 23968(T)) and Bifidobacterium biavatii sp. nov. (type strain: AFB23-4(T) = JCM 17299(T) = DSM 23969(T)) are proposed. Copyright © 2011 Elsevier GmbH. All rights reserved.
Molecular and comparative analysis of Salmonella enterica Senftenberg from humans and animals using PFGE, MLST and NARMS

PubMed Central

2011-01-01

Background Salmonella species are recognized worldwide as a significant cause of human and animal disease. In this study the molecular profiles and characteristics of Salmonella enterica Senftenberg isolated from human cases of illness and those recovered from healthy or diagnostic cases in animals were assessed. Included in the study was a comparison with our own sequenced strain of S. Senfteberg recovered from production turkeys in North Dakota. Isolates examined in this study were subjected to antimicrobial susceptibility profiling using the National Antimicrobial Resistance Monitoring System (NARMS) panel which tested susceptibility to 15 different antimicrobial agents. The molecular profiles of all isolates were determined using Pulsed Field Gel Electrophoresis (PFGE) and the sequence types of the strains were obtained using Multi-Locus Sequence Type (MLST) analysis based on amplification and sequence interrogation of seven housekeeping genes (aroC, dnaN, hemD, hisD, purE, sucA, and thrA). PFGE data was input into BioNumerics analysis software to generate a dendrogram of relatedness among the strains. Results The study found 93 profiles among 98 S. Senftenberg isolates tested and there were primarily two sequence types associated with humans and animals (ST185 and ST14) with overlap observed in all host types suggesting that the distribution of S. Senftenberg sequence types is not host dependent. Antimicrobial resistance was observed among the animal strains, however no resistance was detected in human isolates suggesting that animal husbandry has a significant influence on the selection and promotion of antimicrobial resistance. Conclusion The data demonstrates the circulation of at least two strain types in both animal and human health suggesting that S. Senftenberg is relatively homogeneous in its distribution. The data generated in this study could be used towards defining a pathotype for this serovar. PMID:21708021
Reconstitution of wild type viral DNA in simian cells transfected with early and late SV40 defective genomes.

PubMed

O'Neill, F J; Gao, Y; Xu, X

1993-11-01

The DNAs of polyomaviruses ordinarily exist as a single circular molecule of approximately 5000 base pairs. Variants of SV40, BKV and JCV have been described which contain two complementing defective DNA molecules. These defectives, which form a bipartite genome structure, contain either the viral early region or the late region. The defectives have the unique property of being able to tolerate variable sized reiterations of regulatory and terminus region sequences, and portions of the coding region. They can also exchange coding region sequences with other polyomaviruses. It has been suggested that the bipartite genome structure might be a stage in the evolution of polyomaviruses which can uniquely sustain genome and sequence diversity. However, it is not known if the regulatory and terminus region sequences are highly mutable. Also, it is not known if the bipartite genome structure is reversible and what the conditions might be which would favor restoration of the monomolecular genome structure. We addressed the first question by sequencing the reiterated regulatory and terminus regions of E- and L-SV40 DNAs. This revealed a large number of mutations in the regulatory regions of the defective genomes, including deletions, insertions, rearrangements and base substitutions. We also detected insertions and base substitutions in the T-antigen gene. We addressed the second question by introducing into permissive simian cells, E- and L-SV40 genomes which had been engineered to contain only a single regulatory region. Analysis of viral DNA from transfected cells demonstrated recombined genomes containing a wild type monomolecular DNA structure. However, the complete defectives, containing reiterated regulatory regions, could often compete away the wild type genomes. The recombinant monomolecular genomes were isolated, cloned and found to be infectious. All of the DNA alterations identified in one of the regulatory regions of E-SV40 DNA were present in the recombinant monomolecular genomes. These and other findings indicate that the bipartite genome state can sustain many mutations which wtSV40 cannot directly sustain. However, the mutations can later be introduced into the wild type genomes when the E- and L-SV40 DNAs recombine to generate a new monomolecular genome structure.
Rapid Threat Organism Recognition Pipeline

DOE Office of Scientific and Technical Information (OSTI.GOV)

Williams, Kelly P.; Solberg, Owen D.; Schoeniger, Joseph S.

2013-05-07

The RAPTOR computational pipeline identifies microbial nucleic acid sequences present in sequence data from clinical samples. It takes as input raw short-read genomic sequence data (in particular, the type generated by the Illumina sequencing platforms) and outputs taxonomic evaluation of detected microbes in various human-readable formats. This software was designed to assist in the diagnosis or characterization of infectious disease, by detecting pathogen sequences in nucleic acid sequence data from clinical samples. It has also been applied in the detection of algal pathogens, when algal biofuel ponds became unproductive. RAPTOR first trims and filters genomic sequence reads based on qualitymore » and related considerations, then performs a quick alignment to the human (or other host) genome to filter out host sequences, then performs a deeper search against microbial genomes. Alignment to a protein sequence database is optional. Alignment results are summarized and placed in a taxonomic framework using the Lowest Common Ancestor algorithm.« less
Identifying metabolic enzymes with multiple types of association evidence

PubMed Central

Kharchenko, Peter; Chen, Lifeng; Freund, Yoav; Vitkup, Dennis; Church, George M

2006-01-01

Background Existing large-scale metabolic models of sequenced organisms commonly include enzymatic functions which can not be attributed to any gene in that organism. Existing computational strategies for identifying such missing genes rely primarily on sequence homology to known enzyme-encoding genes. Results We present a novel method for identifying genes encoding for a specific metabolic function based on a local structure of metabolic network and multiple types of functional association evidence, including clustering of genes on the chromosome, similarity of phylogenetic profiles, gene expression, protein fusion events and others. Using E. coli and S. cerevisiae metabolic networks, we illustrate predictive ability of each individual type of association evidence and show that significantly better predictions can be obtained based on the combination of all data. In this way our method is able to predict 60% of enzyme-encoding genes of E. coli metabolism within the top 10 (out of 3551) candidates for their enzymatic function, and as a top candidate within 43% of the cases. Conclusion We illustrate that a combination of genome context and other functional association evidence is effective in predicting genes encoding metabolic enzymes. Our approach does not rely on direct sequence homology to known enzyme-encoding genes, and can be used in conjunction with traditional homology-based metabolic reconstruction methods. The method can also be used to target orphan metabolic activities. PMID:16571130
Genotyping of Salmonella enterica serovar Typhi strains isolated from 1959 to 2006 in China and analysis of genetic diversity by genomic microarray.

PubMed

Zhang, Haifang; Zhang, Xiaolei; Yan, Meiying; Pang, Bo; Kan, Biao; Xu, Huaxi; Huang, Xinxiang

2011-12-15

To determine the genotype of Salmonella enterica serovar Typhi (S. Typhi) strains in China and analyze their genetic diversity. We collected S. Typhi strains from 1959 to 2006 in five highly endemic Chinese provinces and chose 40 representative strains. Multilocus sequence typing was used to determine the genotypes or sequence types (ST) and microarray-based comparative genomic hybridization (M-CGH) to investigate the differences in gene content among these strains. Forty representative S. Typhi strains belonged to 4 sequence types (ST1, ST2, ST890, and ST892). The predominant S. Typhi genotype (31/40) was ST2 and it had a diverse geographic distribution. We discovered two novel STs - ST890 and ST892. M-CGH showed that 69 genes in these two novel STs were divergent from S. Typhi Ty2, which belongs to ST1. In addition, 5 representative Typhi strains of ST2 isolated from Guizhou province showed differences in divergent genes. We determined two novel sequence types, ST890 and ST892, and found that ST2 was the most prevalent genotype of S. Typhi in China. Genetic diversity was present even within a highly clonal bacterial population.
Bayesian clustering of DNA sequences using Markov chains and a stochastic partition model.

PubMed

Jääskinen, Väinö; Parkkinen, Ville; Cheng, Lu; Corander, Jukka

2014-02-01

In many biological applications it is necessary to cluster DNA sequences into groups that represent underlying organismal units, such as named species or genera. In metagenomics this grouping needs typically to be achieved on the basis of relatively short sequences which contain different types of errors, making the use of a statistical modeling approach desirable. Here we introduce a novel method for this purpose by developing a stochastic partition model that clusters Markov chains of a given order. The model is based on a Dirichlet process prior and we use conjugate priors for the Markov chain parameters which enables an analytical expression for comparing the marginal likelihoods of any two partitions. To find a good candidate for the posterior mode in the partition space, we use a hybrid computational approach which combines the EM-algorithm with a greedy search. This is demonstrated to be faster and yield highly accurate results compared to earlier suggested clustering methods for the metagenomics application. Our model is fairly generic and could also be used for clustering of other types of sequence data for which Markov chains provide a reasonable way to compress information, as illustrated by experiments on shotgun sequence type data from an Escherichia coli strain.
Assembly of highly repetitive genomes using short reads: the genome of discrete typing unit III Trypanosoma cruzi strain 231.

PubMed

Baptista, Rodrigo P; Reis-Cunha, Joao Luis; DeBarry, Jeremy D; Chiari, Egler; Kissinger, Jessica C; Bartholomeu, Daniella C; Macedo, Andrea M

2018-02-14

Next-generation sequencing (NGS) methods are low-cost high-throughput technologies that produce thousands to millions of sequence reads. Despite the high number of raw sequence reads, their short length, relative to Sanger, PacBio or Nanopore reads, complicates the assembly of genomic repeats. Many genome tools are available, but the assembly of highly repetitive genome sequences using only NGS short reads remains challenging. Genome assembly of organisms responsible for important neglected diseases such as Trypanosoma cruzi, the aetiological agent of Chagas disease, is known to be challenging because of their repetitive nature. Only three of six recognized discrete typing units (DTUs) of the parasite have their draft genomes published and therefore genome evolution analyses in the taxon are limited. In this study, we developed a computational workflow to assemble highly repetitive genomes via a combination of de novo and reference-based assembly strategies to better overcome the intrinsic limitations of each, based on Illumina reads. The highly repetitive genome of the human-infecting parasite T. cruzi 231 strain was used as a test subject. The combined-assembly approach shown in this study benefits from the reference-based assembly ability to resolve highly repetitive sequences and from the de novo capacity to assemble genome-specific regions, improving the quality of the assembly. The acceptable confidence obtained by analyzing our results showed that our combined approach is an attractive option to assemble highly repetitive genomes with NGS short reads. Phylogenomic analysis including the 231 strain, the first representative of DTU III whose genome was sequenced, was also performed and provides new insights into T. cruzi genome evolution.
HLA-A, -B, -DRB1 allele and haplotype frequencies of 920 cord blood units from Central Chile.

PubMed

Schäfer, Christian; Sauter, Jürgen; Riethmüller, Tobias; Kashi, Zahra Mehdizadeh; Schmidt, Alexander H; Barriga, Francisco J

2016-08-01

We present human leukocyte antigen (HLA) haplotype and allele/antigenic group frequencies derived from a data set of 920 umbilical cord blood units collected in Central Chile. HLA-A and -B genotypes were typed using sequence specific oligonucleotide probe methods while HLA-DRB1 genotypes were obtained from sequencing-based typing. The most frequent haplotype is A*29~B*44~DRB1*07:01 with an estimated frequency of 2.1%. Copyright © 2016 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
SCCmecFinder, a Web-Based Tool for Typing of Staphylococcal Cassette Chromosome mec in Staphylococcus aureus Using Whole-Genome Sequence Data.

PubMed

Kaya, Hülya; Hasman, Henrik; Larsen, Jesper; Stegger, Marc; Johannesen, Thor Bech; Allesøe, Rosa Lundbye; Lemvigh, Camilla Koldbæk; Aarestrup, Frank Møller; Lund, Ole; Larsen, Anders Rhod

2018-01-01

Typing of methicillin-resistant Staphylococcus aureus (MRSA) is important in infection control and surveillance. The current nomenclature of MRSA includes the genetic background of the S. aureus strain determined by multilocus sequence typing (MLST) or equivalent methods like spa typing and typing of the mobile genetic element staphylococcal cassette chromosome mec (SCC mec ), which carries the mecA or mecC gene. Whereas MLST and spa typing are relatively simple, typing of SCC mec is less trivial because of its heterogeneity. Whole-genome sequencing (WGS) provides the essential data for typing of the genetic background and SCC mec , but so far, no bioinformatic tools for SCC mec typing have been available. Here, we report the development and evaluation of SCC mec Finder for characterization of the SCC mec element from S. aureus WGS data. SCC mec Finder is able to identify all SCC mec element types, designated I to XIII, with subtyping of SCC mec types IV (2B) and V (5C2). SCC mec elements are characterized by two different gene prediction approaches to achieve correct annotation, a Basic Local Alignment Search Tool (BLAST)-based approach and a k -mer-based approach. Evaluation of SCC mec Finder by using a diverse collection of clinical isolates ( n = 93) showed a high typeability level of 96.7%, which increased to 98.9% upon modification of the default settings. In conclusion, SCC mec Finder can be an alternative to more laborious SCC mec typing methods and is freely available at https://cge.cbs.dtu.dk/services/SCCmecFinder. IMPORTANCE SCC mec in MRSA is acknowledged to be of importance not only because it contains the mecA or mecC gene but also for staphylococcal adaptation to different environments, e.g., in hospitals, the community, and livestock. Typing of SCC mec by PCR techniques has, because of its heterogeneity, been challenging, and whole-genome sequencing has only partially solved this since no good bioinformatic tools have been available. In this article, we describe the development of a new bioinformatic tool, SCC mec Finder, that includes most of the needs for infection control professionals and researchers regarding the interpretation of SCC mec elements. The software detects all of the SCC mec elements accepted by the International Working Group on the Classification of Staphylococcal Cassette Chromosome Elements, and users will be prompted if diverging and potential new elements are uploaded. Furthermore, SCC mec Finder will be curated and updated as new elements are found and it is easy to use and freely accessible.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-03-24

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.
Sequence-Based Typing of Legionella pneumophila Serogroup 1 Offers the Potential for True Portability in Legionellosis Outbreak Investigation

PubMed Central

Gaia, Valeria; Fry, Norman K.; Harrison, Timothy G.; Peduzzi, Raffaele

2003-01-01

Seven gene loci of Legionella pneumophila serogroup 1 were analyzed as potential epidemiological typing markers to aid in the investigation of legionella outbreaks. The genes chosen included four likely to be selectively neutral (acn, groES, groEL, and recA) and three likely to be under selective pressure (flaA, mompS, and proA). Oligonucleotide primers were designed to amplify 279- to 763-bp fragments from each gene. Initial sequence analysis of the seven loci from 10 well-characterized isolates of L. pneumophila serogroup 1 gave excellent reproducibility (R) and epidemiological concordance (E) values (R = 1.00; E = 1.00). The three loci showing greatest discrimination and nucleotide variation, flaA, mompS, and proA, were chosen for further study. Indices of discrimination (D) were calculated using a panel of 79 unrelated isolates. Single loci gave D values ranging from 0.767 to 0.857, and a combination of all three loci resulted in a D value of 0.924. When all three loci were combined with monoclonal antibody subgrouping, the D value was 0.971. Sequence-based typing of L. pneumophila serogroup 1 using only three loci is epidemiologically concordant and highly discriminatory and has the potential to become the new “gold standard” for the epidemiological typing of L. pneumophila. PMID:12843023

Effect of base sequence on the DNA cross-linking properties of pyrrolobenzodiazepine (PBD) dimers

PubMed Central

Rahman, Khondaker M.; James, Colin H.; Thurston, David E.

2011-01-01

Pyrrolo[2,1-c][1,4]benzodiazepine (PBD) dimers are synthetic sequence-selective DNA minor-groove cross-linking agents that possess two electrophilic imine moieties (or their equivalent) capable of forming covalent aminal linkages with guanine C2-NH2 functionalities. The PBD dimer SJG-136, which has a C8–O–(CH2)3–O–C8′′ central linker joining the two PBD moieties, is currently undergoing phase II clinical trials and current research is focused on developing analogues of SJG-136 with different linker lengths and substitution patterns. Using a reversed-phase ion pair HPLC/MS method to evaluate interaction with oligonucleotides of varying length and sequence, we recently reported (JACS, 2009, 131, 13 756) that SJG-136 can form three different types of adducts: inter- and intrastrand cross-linked adducts, and mono-alkylated adducts. These studies have now been extended to include PBD dimers with a longer central linker (C8–O–(CH2)5–O–C8′), demonstrating that the type and distribution of adducts appear to depend on (i) the length of the C8/C8′-linker connecting the two PBD units, (ii) the positioning of the two reactive guanine bases on the same or opposite strands, and (iii) their separation (i.e. the number of base pairs, usually ATs, between them). Based on these data, a set of rules are emerging that can be used to predict the DNA–interaction behaviour of a PBD dimer of particular C8–C8′ linker length towards a given DNA sequence. These observations suggest that it may be possible to design PBD dimers to target specific DNA sequences. PMID:21427082
On continuous user authentication via typing behavior.

PubMed

Roth, Joseph; Liu, Xiaoming; Metaxas, Dimitris

2014-10-01

We hypothesize that an individual computer user has a unique and consistent habitual pattern of hand movements, independent of the text, while typing on a keyboard. As a result, this paper proposes a novel biometric modality named typing behavior (TB) for continuous user authentication. Given a webcam pointing toward a keyboard, we develop real-time computer vision algorithms to automatically extract hand movement patterns from the video stream. Unlike the typical continuous biometrics, such as keystroke dynamics (KD), TB provides a reliable authentication with a short delay, while avoiding explicit key-logging. We collect a video database where 63 unique subjects type static text and free text for multiple sessions. For one typing video, the hands are segmented in each frame and a unique descriptor is extracted based on the shape and position of hands, as well as their temporal dynamics in the video sequence. We propose a novel approach, named bag of multi-dimensional phrases, to match the cross-feature and cross-temporal pattern between a gallery sequence and probe sequence. The experimental results demonstrate a superior performance of TB when compared with KD, which, together with our ultrareal-time demo system, warrant further investigation of this novel vision application and biometric modality.
An overview of various typing methods for clinical epidemiology of the emerging pathogen Stenotrophomonas maltophilia.

PubMed

Gherardi, Giovanni; Creti, Roberta; Pompilio, Arianna; Di Bonaventura, Giovanni

2015-03-01

Typing of bacterial isolates has been used for decades to study local outbreaks as well as in national and international surveillances for monitoring newly emerging resistant clones. Despite being recognized as a nosocomial pathogen, the precise modes of transmission of Stenotrophomonas maltophilia in health care settings are unknown. Due to the high genetic diversity observed among S. maltophilia clinical isolates, the typing results might be better interpreted if also environmental strains were included. This could help to identify preventative measures to be designed and implemented for decreasing the possibility of outbreaks and nosocomial infections. In this review, we attempt to provide an overview on the most common typing methods used for clinical epidemiology of S. maltophilia strains, such as PCR-based fingerprinting analyses, pulsed-field gel electrophoresis, multilocus variable number tandem repeat analysis, and multilocus sequence type. Application of the proteomic-based mass spectrometry by matrix-assisted laser desorption ionization-time of flight is also described. Improvements of typing methods already in use have to be achieved to facilitate S. maltophilia infection control at any level. In the near future, when novel Web-based platforms for rapid data processing and analysis will be available, whole genome sequencing technologies will likely become a highly powerful tool for outbreak investigations and surveillance studies in routine clinical practices. Copyright © 2015 Elsevier Inc. All rights reserved.
Veillonella infantium sp. nov., an anaerobic, Gram-stain-negative coccus isolated from tongue biofilm of a Thai child.

PubMed

Mashima, Izumi; Liao, Yu-Chieh; Miyakawa, Hiroshi; Theodorea, Citra F; Thawboon, Boonyanit; Thaweboon, Sroisiri; Scannapieco, Frank A; Nakazawa, Futoshi

2018-04-01

A strain of a novel anaerobic, Gram-stain-negative coccus was isolated from the tongue biofilm of a Thai child. This strain was shown, at the phenotypic level and based on 16S rRNA gene sequencing, to be a member of the genus Veillonella. Comparative analysis of the 16S rRNA, dnaK and rpoB gene sequences indicated that phylogenetically the strain comprised a distinct novel branch within the genus Veillonella. The novel strain showed 99.8, 95.1 and 95.9 % similarity to partial 16S rRNA, dnaK and rpoB gene sequences, respectively, to the type strains of the two most closely related species, Veillonelladispar ATCC 17748 T and Veillonellatobetsuensis ATCC BAA-2400 T . The novel strain could be discriminated from previously reported species of the genus Veillonella based on partial dnaK and rpoB gene sequencing and average nucleotide identity values. The major acid end-product produced by this strain was acetic acid under anaerobic conditions in trypticase-yeast extract-haemin with 1 % (w/v) glucose or fructose medium. Lactate was fermented to acetic acid and propionic acid. Based on these observations, this strain represents a novel species, for which the name Veillonella infantium sp. nov. is proposed. The type strain is T11011-4 T (=JCM 31738 T =TSD-88 T ).
Detection of BRAF mutations from solid tumors using Tumorplex™ technology

PubMed Central

Yo, Jacob; Hay, Katie S.L.; Vinayagamoorthy, Dilanthi; Maryanski, Danielle; Carter, Mark; Wiegel, Joseph; Vinayagamoorthy, Thuraiayah

2015-01-01

Allele specific multiplex sequencing (Tumorplex™) is a new molecular platform for the detection of single base mutation in tumor biopsies with high sensitivity for clinical testing. Tumorplex™ is a novel modification of Sanger sequencing technology that generates both mutant and wild type nucleotide sequences simultaneously in the same electropherogram. The molecular weight of the two sequencing primers are different such that the two sequences generated are separated, thus eliminating possible suppression of mutant signal by the more abundant wild type signal. Tumorplex™ platform technology was tested using BRAF mutation V600E. These studies were performed with cloned BRAF mutations and genomic DNA extracted from tumor cells carrying 50% mutant allele. The lower limit of detection for BRAF V600E was found to be 20 genome equivalents (GE) using genomic DNA extracted from mutation specific cell lines. Sensitivity of the assay was tested by challenging the mutant allele with wild type allele at 20 GE, and was able to detect BRAF mutant signal at a GE ration of 20:1 × 107 (mutant to wild-type). This level of sensitivity can detect low abundance of clonal mutations in tumor biopsies and eliminate the need for cell enrichment. • Tumorplex™ is a single tube assay that permits the recognition of mutant allele without suppression by wildtype signal. • Tumorplex™ provides a high level of sensitivity. • Tumorplex™ can be used with small sample size with mixed population of cells carrying heterogeneous gDNA. PMID:26258049
Effect of the reflectional symmetry on the coherent hole transport across DNA hairpins

NASA Astrophysics Data System (ADS)

Zarea, Mehdi; Berlin, Yuri; Ratner, Mark A.

2017-03-01

The coherent hole transfer in three types of DNA hairpins containing strands with adenine (A) and guanine (G) nucleobases has been studied. The investigated hairpins involve An+1GGAn, AnGAGAn, or (AG)2nA strands that connect the hole donor and hole acceptor located on opposite ends of hairpins. The positive charge transfer from the photo-excited donor to the acceptor is shown to be slower for An+1GGAn in comparison with AnGAGAn and (AG)2nA sequences. We have revealed that this is due to the reflectional symmetry of the last two sequences with respect to the axis passing through the middle base. As has been demonstrated, the symmetry of the sequence structure manifests itself in the reflectional symmetry of the energy eigenstates. In addition, it has been shown that (AG)2nA is the only symmetric sequence with a zero energy state in the middle of the LUMO tight-binding energy band. Based on our theoretical findings, we predict that the hairpin with this sequence should have the fastest coherent hole transfer rate among the class of base sequences studied.
Direct repeat sequences in the Streptomyces chitinase-63 promoter direct both glucose repression and chitin induction

PubMed Central

Ni, Xiangyang; Westpheling, Janet

1997-01-01

The chi63 promoter directs glucose-sensitive, chitin-dependent transcription of a gene involved in the utilization of chitin as carbon source. Analysis of 5′ and 3′ deletions of the promoter region revealed that a 350-bp segment is sufficient for wild-type levels of expression and regulation. The analysis of single base changes throughout the promoter region, introduced by random and site-directed mutagenesis, identified several sequences to be important for activity and regulation. Single base changes at −10, −12, −32, −33, −35, and −37 upstream of the transcription start site resulted in loss of activity from the promoter, suggesting that bases in these positions are important for RNA polymerase interaction. The sequences centered around −10 (TATTCT) and −35 (TTGACC) in this promoter are, in fact, prototypical of eubacterial promoters. Overlapping the RNA polymerase binding site is a perfect 12-bp direct repeat sequence. Some base changes within this direct repeat resulted in constitutive expression, suggesting that this sequence is an operator for negative regulation. Other base changes resulted in loss of glucose repression while retaining the requirement for chitin induction, suggesting that this sequence is also involved in glucose repression. The fact that cis-acting mutations resulted in glucose resistance but not inducer independence rules out the possibility that glucose repression acts exclusively by inducer exclusion. The fact that mutations that affect glucose repression and chitin induction fall within the same direct repeat sequence module suggests that the direct repeat sequence facilitates both chitin induction and glucose repression. PMID:9371809
Next Generation Sequencing Technology and Genomewide Data Analysis: Perspectives for Retinal Research

PubMed Central

Chaitankar, Vijender; Karakülah, Gökhan; Ratnapriya, Rinki; Giuste, Felipe O.; Brooks, Matthew J.; Swaroop, Anand

2016-01-01

The advent of high throughput next generation sequencing (NGS) has accelerated the pace of discovery of disease-associated genetic variants and genomewide profiling of expressed sequences and epigenetic marks, thereby permitting systems-based analyses of ocular development and disease. Rapid evolution of NGS and associated methodologies presents significant challenges in acquisition, management, and analysis of large data sets and for extracting biologically or clinically relevant information. Here we illustrate the basic design of commonly used NGS-based methods, specifically whole exome sequencing, transcriptome, and epigenome profiling, and provide recommendations for data analyses. We briefly discuss systems biology approaches for integrating multiple data sets to elucidate gene regulatory or disease networks. While we provide examples from the retina, the NGS guidelines reviewed here are applicable to other tissues/cell types as well. PMID:27297499
Cronobacter, the emergent bacterial pathogen Enterobacter sakazakii comes of age; MLST and whole genome sequence analysis.

PubMed

Forsythe, Stephen J; Dickins, Benjamin; Jolley, Keith A

2014-12-16

Following the association of Cronobacter spp. to several publicized fatal outbreaks in neonatal intensive care units of meningitis and necrotising enterocolitis, the World Health Organization (WHO) in 2004 requested the establishment of a molecular typing scheme to enable the international control of the organism. This paper presents the application of Next Generation Sequencing (NGS) to Cronobacter which has led to the establishment of the Cronobacter PubMLST genome and sequence definition database (http://pubmlst.org/cronobacter/) containing over 1000 isolates with metadata along with the recognition of specific clonal lineages linked to neonatal meningitis and adult infections Whole genome sequencing and multilocus sequence typing (MLST) has supports the formal recognition of the genus Cronobacter composed of seven species to replace the former single species Enterobacter sakazakii. Applying the 7-loci MLST scheme to 1007 strains revealed 298 definable sequence types, yet only C. sakazakii clonal complex 4 (CC4) was principally associated with neonatal meningitis. This clonal lineage has been confirmed using ribosomal-MLST (51-loci) and whole genome-MLST (1865 loci) to analyse 107 whole genomes via the Cronobacter PubMLST database. This database has enabled the retrospective analysis of historic cases and outbreaks following re-identification of those strains. The Cronobacter PubMLST database offers a central, open access, reliable sequence-based repository for researchers. It has the capacity to create new analysis schemes 'on the fly', and to integrate metadata (source, geographic distribution, clinical presentation). It is also expandable and adaptable to changes in taxonomy, and able to support the development of reliable detection methods of use to industry and regulatory authorities. Therefore it meets the WHO (2004) request for the establishment of a typing scheme for this emergent bacterial pathogen. Whole genome sequencing has additionally shown a range of potential virulence and environmental fitness traits which may account for the association of C. sakazakii CC4 pathogenicity, and propensity for neonatal CNS.
Next generation sequencing (NGS): a golden tool in forensic toolkit.

PubMed

Aly, S M; Sabri, D M

The DNA analysis is a cornerstone in contemporary forensic sciences. DNA sequencing technologies are powerful tools that enrich molecular sciences in the past based on Sanger sequencing and continue to glowing these sciences based on Next generation sequencing (NGS). Next generation sequencing has excellent potential to flourish and increase the molecular applications in forensic sciences by jumping over the pitfalls of the conventional method of sequencing. The main advantages of NGS compared to conventional method that it utilizes simultaneously a large number of genetic markers with high-resolution of genetic data. These advantages will help in solving several challenges such as mixture analysis and dealing with minute degraded samples. Based on these new technologies, many markers could be examined to get important biological data such as age, geographical origins, tissue type determination, external visible traits and monozygotic twins identification. It also could get data related to microbes, insects, plants and soil which are of great medico-legal importance. Despite the dozens of forensic research involving NGS, there are requirements before using this technology routinely in forensic cases. Thus, there is a great need to more studies that address robustness of these techniques. Therefore, this work highlights the applications of forensic sciences in the era of massively parallel sequencing.
[Study on the genetic difference of SEO type Hantaviruses].

PubMed

Zhang, X; Zhou, S; Wang, H; Hu, J; Guan, Z; Liu, H

2000-10-01

To understand the genetic type of Hantaviruses and the difference between them caused by rodents in Beijing and to furhter explore the source of the infectious factors. Hantavirus RNA, isolated from lungs of rodents captured in Beijing and positive with Hantavirus antigens with frozen sectioning and Immunofluorescent assay, were reverse-transcribed and amplified with PCR with Hantavirus-specific primers. Five of the PCR amplifications were discovered and sequenced with 300 bp sequence data of M segments (from 2003 - 2302nt according cDNA of seoul 8039 strain). Nucleotide sequence homology showed that they were sequences of SEO-type Hantavirus. Compared with SEO type Hantavirus, the nucleotide sequence homology of these samples was more than 94% while the homology of amonia acid sequence was more than 98%. When compared with HNT type Hantavirus, the homology of nucleotide sequence became less than 72% with the homology of amonia acid sequence less than 81%. Similar to other Hantavirus of SEO type, their nucleotide sequences and deduced amino acid sequences were highly preserved. Phylogenetic tree analysis showed that the five viruses could be divided into at least 4 branches. It was quite likely that there were at least two sub-type SEO viruses with 4 branches that were circulating in Beijing.
Aptamer-based impedimetric sensor for bacterial typing.

PubMed

Labib, Mahmoud; Zamay, Anna S; Kolovskaya, Olga S; Reshetneva, Irina T; Zamay, Galina S; Kibbee, Richard J; Sattar, Syed A; Zamay, Tatiana N; Berezovski, Maxim V

2012-10-02

The development of an aptamer-based impedimetric sensor for typing of bacteria (AIST-B) is presented. Highly specific DNA aptamers to Salmonella enteritidis were selected via Cell-SELEX technique. Twelve rounds of selection were performed; each comprises a positive selection step against S. enteritidis and a negative selection step against a mixture of related pathogens, including Salmonella typhimurium, Escherichia coli, Staphylococcus aureus, Pseudomonas aeruginosa, and Citrobacter freundii, to ensure the species-specificity of the selected aptamers. After sequencing of the pool showing the highest binding affinity to S. enteritidis, a DNA sequence of high affinity to the bacteria was integrated into an impedimetric sensor via self-assembly onto a gold nanoparticles-modified screen-printed carbon electrode (GNPs-SPCE). Remarkably, this aptasensor is highly selective and can successfully detect S. enteritidis down to 600 CFU mL(-1) (equivalent to 18 CFU in 30 μL assay volume) in 10 min and distinguish it from other Salmonella species, including S. typhimurium and S. choleraesuis. This report is envisaged to open a new venue for the aptamer-based typing of a variety of microorganisms using a rapid, economic, and label-free electrochemical platform.
Species Identification of Bovine, Ovine and Porcine Type 1 Collagen; Comparing Peptide Mass Fingerprinting and LC-Based Proteomics Methods.

PubMed

Buckley, Mike

2016-03-24

Collagen is one of the most ubiquitous proteins in the animal kingdom and the dominant protein in extracellular tissues such as bone, skin and other connective tissues in which it acts primarily as a supporting scaffold. It has been widely investigated scientifically, not only as a biomedical material for regenerative medicine, but also for its role as a food source for both humans and livestock. Due to the long-term stability of collagen, as well as its abundance in bone, it has been proposed as a source of biomarkers for species identification not only for heat- and pressure-rendered animal feed but also in ancient archaeological and palaeontological specimens, typically carried out by peptide mass fingerprinting (PMF) as well as in-depth liquid chromatography (LC)-based tandem mass spectrometric methods. Through the analysis of the three most common domesticates species, cow, sheep, and pig, this research investigates the advantages of each approach over the other, investigating sites of sequence variation with known functional properties of the collagen molecule. Results indicate that the previously identified species biomarkers through PMF analysis are not among the most variable type 1 collagen peptides present in these tissues, the latter of which can be detected by LC-based methods. However, it is clear that the highly repetitive sequence motif of collagen throughout the molecule, combined with the variability of the sites and relative abundance levels of hydroxylation, can result in high scoring false positive peptide matches using these LC-based methods. Additionally, the greater alpha 2(I) chain sequence variation, in comparison to the alpha 1(I) chain, did not appear to be specific to any particular functional properties, implying that intra-chain functional constraints on sequence variation are not as great as inter-chain constraints. However, although some of the most variable peptides were only observed in LC-based methods, until the range of publicly available collagen sequences improves, the simplicity of the PMF approach and suitable range of peptide sequence variation observed makes it the ideal method for initial taxonomic identification prior to further analysis by LC-based methods only when required.
Structure-based design of broadly protective group a streptococcal M protein-based vaccines.

PubMed

Dale, James B; Smeesters, Pierre R; Courtney, Harry S; Penfound, Thomas A; Hohn, Claudia M; Smith, Jeremy C; Baudry, Jerome Y

2017-01-03

A major obstacle to the development of broadly protective M protein-based group A streptococcal (GAS) vaccines is the variability within the N-terminal epitopes that evoke potent bactericidal antibodies. The concept of M type-specific protective immune responses has recently been challenged based on the observation that multivalent M protein vaccines elicited cross-reactive bactericidal antibodies against a number of non-vaccine M types of GAS. Additionally, a new "cluster-based" typing system of 175M proteins identified a limited number of clusters containing closely related M proteins. In the current study, we used the emm cluster typing system, in combination with computational structure-based peptide modeling, as a novel approach to the design of potentially broadly protective M protein-based vaccines. M protein sequences (AA 16-50) from the E4 cluster containing 17 emm types of GAS were analyzed using de novo 3-D structure prediction tools and the resulting structures subjected to chemical diversity analysis to identify sequences that were the most representative of the 3-D physicochemical properties of the M peptides in the cluster. Five peptides that spanned the range of physicochemical attributes of all 17 peptides were used to formulate synthetic and recombinant vaccines. Rabbit antisera were assayed for antibodies that cross-reacted with E4 peptides and whole bacteria by ELISA and for bactericidal activity against all E4GAS. The synthetic vaccine rabbit antisera reacted with all 17 E4M peptides and demonstrated bactericidal activity against 15/17 E4GAS. A recombinant hybrid vaccine containing the same E4 peptides also elicited antibodies that cross-reacted with all E4M peptides. Comprehensive studies using structure-based design may result in a broadly protective M peptide vaccine that will elicit cluster-specific and emm type-specific antibody responses against the majority of clinically relevant emm types of GAS. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
Novel insertion mutation of ABCB1 gene in an ivermectin-sensitive Border Collie.

PubMed

Han, Jae-Ik; Son, Hyoung-Won; Park, Seung-Cheol; Na, Ki-Jeong

2010-12-01

P-glycoprotein (P-gp) is encoded by the ABCB1 gene and acts as an efflux pump for xenobiotics. In the Border Collie, a nonsense mutation caused by a 4-base pair deletion in the ABCB1 gene is associated with a premature stop to P-gp synthesis. In this study, we examined the full-length coding sequence of the ABCB1 gene in an ivermectin-sensitive Border Collie that lacked the aforementioned deletion mutation. The sequence was compared to the corresponding sequences of a wild-type Beagle and seven ivermectin-tolerant family members of the Border Collie. When compared to the wild-type Beagle sequence, that of the ivermectin-sensitive Border Collie was found to have one insertion mutation and eight single nucleotide polymorphisms (SNPs) in the coding sequence of the ABCB1 gene. While the eight SNPs were also found in the family members' sequences, the insertion mutation was found only in the ivermectin-sensitive dog. These results suggest the possibility that the SNPs are species-specific features of the ABCB1 gene in Border Collies, and that the insertion mutation may be related to ivermectin intolerance.
Novel insertion mutation of ABCB1 gene in an ivermectin-sensitive Border Collie

PubMed Central

Han, Jae-Ik; Son, Hyoung-Won; Park, Seung-Cheol

2010-01-01

P-glycoprotein (P-gp) is encoded by the ABCB1 gene and acts as an efflux pump for xenobiotics. In the Border Collie, a nonsense mutation caused by a 4-base pair deletion in the ABCB1 gene is associated with a premature stop to P-gp synthesis. In this study, we examined the full-length coding sequence of the ABCB1 gene in an ivermectin-sensitive Border Collie that lacked the aforementioned deletion mutation. The sequence was compared to the corresponding sequences of a wild-type Beagle and seven ivermectin-tolerant family members of the Border Collie. When compared to the wild-type Beagle sequence, that of the ivermectin-sensitive Border Collie was found to have one insertion mutation and eight single nucleotide polymorphisms (SNPs) in the coding sequence of the ABCB1 gene. While the eight SNPs were also found in the family members' sequences, the insertion mutation was found only in the ivermectin-sensitive dog. These results suggest the possibility that the SNPs are species-specific features of the ABCB1 gene in Border Collies, and that the insertion mutation may be related to ivermectin intolerance. PMID:21113104
Scanning electron microscopy (SEM) evaluation of sealing ability of MTA and EndoSequence as root-end filling materials with chitosan and carboxymethyl chitosan (CMC) as retrograde smear layer removing agents.

PubMed

Nagesh, Bolla; Jeevani, Eppala; Sujana, Varri; Damaraju, Bharagavi; Sreeha, Kaluvakolanu; Ramesh, Penumaka

2016-01-01

The purpose of this study was to evaluate the sealing ability of mineral trioxide aggregate (MTA) and EndoSequence with chitosan and carboxymethyl chitosan (CMC) as retrograde smear layer removing agents using scanning electron microscopy (SEM). Forty human single rooted teeth were taken. Crowns were decoronated and canals were obturated. Apically roots were resected and retrograde cavities were done. Based on the type of retrograde material placed and the type of smear layer removal agent used for retrograde cavities, they were divided into four groups (N = 10): Group I chitosan with EndoSequence, group II chitosan with MTA, group III CMC with EndoSequence, and Group IV CMC with MTA. All the samples were longitudinally sectioned, and the SEM analysis was done for marginal adaptation. Kruskal-Wallis and Mann-Witney analysis tests. SEM images showed the presence of less gaps in group III, i.e., CMC with EndoSequence when compared to other groups with statistically significant difference. Within the limited scope of this study, it was concluded that EndoSequence as retrograde material showed better marginal sealing ability.
iFeature: a python package and web server for features extraction and selection from protein and peptide sequences.

PubMed

Chen, Zhen; Zhao, Pei; Li, Fuyi; Leier, André; Marquez-Lago, Tatiana T; Wang, Yanan; Webb, Geoffrey I; Smith, A Ian; Daly, Roger J; Chou, Kuo-Chen; Song, Jiangning

2018-03-08

Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional, expression and interaction profiles of proteins and peptides as well as DNAs/RNAs. Here, we present iFeature, a versatile Python-based toolkit for generating various numerical feature representation schemes for both protein and peptide sequences. iFeature is capable of calculating and extracting a comprehensive spectrum of 18 major sequence encoding schemes that encompass 53 different types of feature descriptors. It also allows users to extract specific amino acid properties from the AAindex database. Furthermore, iFeature integrates 12 different types of commonly used feature clustering, selection, and dimensionality reduction algorithms, greatly facilitating training, analysis, and benchmarking of machine-learning models. The functionality of iFeature is made freely available via an online web server and a stand-alone toolkit. http://iFeature.erc.monash.edu/; https://github.com/Superzchen/iFeature/. jiangning.song@monash.edu; kcchou@gordonlifescience.org; roger.daly@monash.edu. Supplementary data are available at Bioinformatics online.
Lactobacillus cypricasei Lawson et al. 2001 is a later heterotypic synonym of Lactobacillus acidipiscis Tanasupawat et al. 2000.

PubMed

Naser, Sabri M; Vancanneyt, Marc; Hoste, Bart; Snauwaert, Cindy; Swings, Jean

2006-07-01

The applicability of a multilocus sequence analysis (MLSA)-based identification system for lactobacilli was evaluated. Two housekeeping genes that code for the phenylalanyl-tRNA synthase alpha-subunit (pheS) and RNA polymerase alpha-subunit (rpoA) were sequenced and analysed for members of the Lactobacillus salivarius species group. The type strains of Lactobacillus acidipiscis and Lactobacillus cypricasei were investigated further using a third gene that encodes the alpha-subunit of ATP synthase (atpA). The MLSA data revealed close relatedness between L. acidipiscis and L. cypricasei, with 99.8-100 % pheS, rpoA and atpA gene sequence similarities. Comparison of the 16S rRNA gene sequences of the type strains of the two species confirmed the close relatedness (99.8 % gene sequence similarity) between the two taxa. Similar phenotypes and high DNA-DNA binding values in the range of 84 to 97.5 % confirmed that L. acidipiscis and L. cypricasei are synonymous species. On the basis of the present study, it is proposed that Lactobacillus cypricasei is a later heterotypic synonym of Lactobacillus acidipiscis.
Molecular prevalence and genetic diversity of bovine Theileria orientalis in Myanmar.

PubMed

Bawm, Saw; Shimizu, Kohei; Hirota, Jun-Ichi; Tosa, Yusuke; Htun, Lat Lat; Maw, Ni Ni; Thein, Myint; Kato, Hirotomo; Sakurai, Tatsuya; Katakura, Ken

2014-08-01

Theileria orientalis is a causative agent of benign theileriosis in cattle and distributed in mainly Asian countries. In the present study, we examined the prevalence of T. orientalis infection by PCR based on the major piroplasm surface protein gene (MPSP) sequences in cattle in Myanmar, followed by phylogenetic analysis of the MPSP genes. The MPSP gene was amplified in 258 of 713 (36.2%) cattle blood DNA samples collected from five cities in different geographical regions of Myanmar. Phylogenetic analysis of MPSP sequences from 54 T. orientalis-positive DNA samples revealed the presence of six allelic genotypes, including Types 1, 3, 4, 5, 7, and N-3. Types 5 and 7 were the predominant types detected. Sequences of the MPSP genes detected in Myanmar were closely related to those from Thailand, Vietnam or Mongolia. These findings suggest that movement of animals carrying T. orientalis parasites between Southeast Asian countries could be a reason for the similar genotype distribution of the parasites in Myanmar. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

Investigating the long-term course of schizophrenia by sequence analysis.

PubMed

An der Heiden, Wolfram; Häfner, Heinz

2015-08-30

In the present study we set out to explore the long-term clinical course of schizophrenia in a holistic manner by adopting sequence analysis. Our aim was to identify course types of illness by means of cluster analysis. The study was based on course and outcome data for 107 patients followed up over 134 months after first admission in the ABC Schizophrenia Study. Focusing on the main syndromes (positive, negative, depressive and unspecific symptoms) and their combinations we looked for similarities in individual illness courses using the 'optimal matching' method. A cluster analysis performed on the resulting similarity matrix yielded two main groups (a 'improving' and a 'chronic' group), which comprised a total of six different types of illness course. The course types differed in both quantitative (frequency of syndromes and syndrome combinations) and qualitative terms (clinical presentation, sequence of syndromes). Cluster membership was only rarely, but clearly associated with sociodemographic characteristics, treatment data and other illness variables. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Single-Cell Sequencing of the Healthy and Diseased Heart Reveals Ckap4 as a New Modulator of Fibroblasts Activation.

PubMed

Gladka, Monika M; Molenaar, Bas; de Ruiter, Hesther; van der Elst, Stefan; Tsui, Hoyee; Versteeg, Danielle; Lacraz, Grègory P A; Huibers, Manon M H; van Oudenaarden, Alexander; van Rooij, Eva

2018-01-31

Background -Genome-wide transcriptome analysis has greatly advanced our understanding of the regulatory networks underlying basic cardiac biology and mechanisms driving disease. However, so far, the resolution of studying gene expression patterns in the adult heart has been limited to the level of extracts from whole tissues. The use of tissue homogenates inherently causes the loss of any information on cellular origin or cell type-specific changes in gene expression. Recent developments in RNA amplification strategies provide a unique opportunity to use small amounts of input RNA for genome-wide sequencing of single cells. Methods -Here, we present a method to obtain high quality RNA from digested cardiac tissue from adult mice for automated single-cell sequencing of both the healthy and diseased heart. Results -After optimization, we were able to perform single-cell sequencing on adult cardiac tissue under both homeostatic conditions and after ischemic injury. Clustering analysis based on differential gene expression unveiled known and novel markers of all main cardiac cell types. Based on differential gene expression we were also able to identify multiple subpopulations within a certain cell type. Furthermore, applying single-cell sequencing on both the healthy and the injured heart indicated the presence of disease-specific cell subpopulations. As such, we identified cytoskeleton associated protein 4 ( Ckap4 ) as a novel marker for activated fibroblasts that positively correlates with known myofibroblast markers in both mouse and human cardiac tissue. Ckap4 inhibition in activated fibroblasts treated with TGFβ triggered a greater increase in the expression of genes related to activated fibroblasts compared to control, suggesting a role of Ckap4 in modulating fibroblast activation in the injured heart. Conclusions -Single-cell sequencing on both the healthy and diseased adult heart allows us to study transcriptomic differences between cardiac cells, as well as cell type-specific changes in gene expression during cardiac disease. This new approach provides a wealth of novel insights into molecular changes that underlie the cellular processes relevant for cardiac biology and pathophysiology. Applying this technology could lead to the discovery of new therapeutic targets relevant for heart disease.
Phylogenetic and environmental diversity of DsrAB-type dissimilatory (bi)sulfite reductases

PubMed Central

Müller, Albert Leopold; Kjeldsen, Kasper Urup; Rattei, Thomas; Pester, Michael; Loy, Alexander

2015-01-01

The energy metabolism of essential microbial guilds in the biogeochemical sulfur cycle is based on a DsrAB-type dissimilatory (bi)sulfite reductase that either catalyzes the reduction of sulfite to sulfide during anaerobic respiration of sulfate, sulfite and organosulfonates, or acts in reverse during sulfur oxidation. Common use of dsrAB as a functional marker showed that dsrAB richness in many environments is dominated by novel sequence variants and collectively represents an extensive, largely uncharted sequence assemblage. Here, we established a comprehensive, manually curated dsrAB/DsrAB database and used it to categorize the known dsrAB diversity, reanalyze the evolutionary history of dsrAB and evaluate the coverage of published dsrAB-targeted primers. Based on a DsrAB consensus phylogeny, we introduce an operational classification system for environmental dsrAB sequences that integrates established taxonomic groups with operational taxonomic units (OTUs) at multiple phylogenetic levels, ranging from DsrAB enzyme families that reflect reductive or oxidative DsrAB types of bacterial or archaeal origin, superclusters, uncultured family-level lineages to species-level OTUs. Environmental dsrAB sequences constituted at least 13 stable family-level lineages without any cultivated representatives, suggesting that major taxa of sulfite/sulfate-reducing microorganisms have not yet been identified. Three of these uncultured lineages occur mainly in marine environments, while specific habitat preferences are not evident for members of the other 10 uncultured lineages. In summary, our publically available dsrAB/DsrAB database, the phylogenetic framework, the multilevel classification system and a set of recommended primers provide a necessary foundation for large-scale dsrAB ecology studies with next-generation sequencing methods. PMID:25343514
Integrative analysis workflow for the structural and functional classification of C-type lectins

PubMed Central

2011-01-01

Background It is important to understand the roles of C-type lectins in the immune system due to their ubiquity and diverse range of functions in animal cells. It has been observed that currently confirmed C-type lectins share a highly conserved domain known as the C-type carbohydrate recognition domain (CRD). Using the sequence profile of the CRD, an increasing number of putative C-type lectins have been identified. Hence, it is highly needed to develop a systematic framework that enables us to elucidate their carbohydrate (glycan) recognition function, and discover their physiological and pathological roles. Results Presented herein is an integrated workflow for characterizing the sequence and structural features of novel C-type lectins. Our workflow utilizes web-based queries and available software suites to annotate features that can be found on the C-type lectin, given its amino acid sequence. At the same time, it incorporates modeling and analysis of glycans - a major class of ligands that interact with C-type lectins. Thereafter, the results are analyzed together with context-specific knowledge to filter off unlikely predictions. This allows researchers to design their subsequent experiments to confirm the functions of the C-type lectins in a systematic manner. Conclusions The efficacy and usefulness of our proposed immunoinformatics workflow was demonstrated by applying our integrated workflow to a novel C-type lectin -CLEC17A - and we report some of its possible functions that warrants further validation through wet-lab experiments. PMID:22372988
Divergent nuclear 18S rDNA paralogs in a turkey coccidium, Eimeria meleagrimitis, complicate molecular systematics and identification.

PubMed

El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R

2013-07-01

Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.
Two distinct Epichloë species symbiotic with Achnatherum inebrians, drunken horse grass.

PubMed

Chen, Li; Li, Xiuzhang; Li, Chunjie; Swoboda, Ginger A; Young, Carolyn A; Sugawara, Koya; Leuchtmann, Adrian; Schardl, Christopher L

2015-01-01

Achnatherum inebrians, colloquially known as drunken horse grass, is associated with livestock toxicity in northern China. Epichloë gansuensis (Eg) was described from endophyte isolates from A. inebrians in Sunan County, Gansu Province, whereas a morphologically distinct variety, E. gansuensis var. inebrians (Ei), was described based on two isolates from A. inebrians seeds collected in Urumqi County, Xinjiang Province. Genome sequencing and alkaloid analyses also distinguish these taxa; the Ei isolates produce neurotropic lysergic acid amides (ergot alkaloids), and an Eg isolate produces paxilline (an indole-diterpene alkaloid). To better elucidate the taxonomic diversity of Epichloë spp. symbiotic with A. inebrians, we surveyed eight populations in Xinjiang, Gansu and Inner Mongolia provinces of China and analyzed their genotypes by multiplex PCR for alkaloid biosynthesis genes and mating-type genes. Genotypes consistent with Ei were present in all eight populations, of which they dominated seven. The Ei isolates were all mating type A and tested positive for the ergot alkaloid gene, dmaW. In contrast Eg isolates were all mating type B and had the indole-diterpene gene, idtG. The genome was sequenced from an Ei isolate from seeds collected in Xiahe County, Gansu, and compared to that of the varietal ex type isolate from Urumqi. Alkaloid genes and four different housekeeping genes were nearly identical between the two sequenced Ei isolates and were distinct from a sequenced Eg isolate. Phylogenetic analysis placed Ei, Eg and Epichloë sibirica into respective subclades of a clade that emanated from the base of the Epichloë phylogeny. Given its chemotypic, genotypic, morphological and phylogenetic distinctiveness, its widespread occurrence in rangelands of northern China, and its importance in livestock toxicity, we propose raising Ei to species rank as Epichloë inebrians. © 2015 by The Mycological Society of America.
CRISPR Diversity and Microevolution in Clostridium difficile.

PubMed

Andersen, Joakim M; Shoup, Madelyn; Robinson, Cathy; Britton, Robert; Olsen, Katharina E P; Barrangou, Rodolphe

2016-09-19

Virulent strains of Clostridium difficile have become a global health problem associated with morbidity and mortality. Traditional typing methods do not provide ideal resolution to track outbreak strains, ascertain genetic diversity between isolates, or monitor the phylogeny of this species on a global basis. Here, we investigate the occurrence and diversity of clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated genes (cas) in C. difficile to assess the potential of CRISPR-based phylogeny and high-resolution genotyping. A single Type-IB CRISPR-Cas system was identified in 217 analyzed genomes with cas gene clusters present at conserved chromosomal locations, suggesting vertical evolution of the system, assessing a total of 1,865 CRISPR arrays. The CRISPR arrays, markedly enriched (8.5 arrays/genome) compared with other species, occur both at conserved and variable locations across strains, and thus provide a basis for typing based on locus occurrence and spacer polymorphism. Clustering of strains by array composition correlated with sequence type (ST) analysis. Spacer content and polymorphism within conserved CRISPR arrays revealed phylogenetic relationship across clades and within ST. Spacer polymorphisms of conserved arrays were instrumental for differentiating closely related strains, e.g., ST1/RT027/B1 strains and pathogenicity locus encoding ST3/RT001 strains. CRISPR spacers showed sequence similarity to phage sequences, which is consistent with the native role of CRISPR-Cas as adaptive immune systems in bacteria. Overall, CRISPR-Cas sequences constitute a valuable basis for genotyping of C. difficile isolates, provide insights into the micro-evolutionary events that occur between closely related strains, and reflect the evolutionary trajectory of these genomes. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genetic variation among Flavobacterium psychrophilum isolates from wild and farmed salmonids in Norway and Chile.

PubMed

Apablaza, P; Løland, A D; Brevik, Ø J; Ilardi, P; Battaglia, J; Nylund, A

2013-04-01

To aim of the study was to describe the genetic relationship between isolates of Flavobacterium psychrophilum with a main emphasis of samples from Chile and Norway. The isolates have been obtained from farmed salmonids in Norway and Chile, and from wild salmonids in Norway, but isolates from North America and European countries are also included in the analysis. The study is based on phylogenetic analysis of 16S rRNA and seven housekeeping genes (HG), gyrB, atpA, dnaK, trpB, fumC, murG and tuf, and the use of a multilocus sequence typing (MLST) system, based on nucleotide polymorphism in the HG, as an alternative to the phylogenies. The variation within the selected genes was limited, and the phylogenetic analysis gave little resolution between the isolates. The MLST gave a much better resolution resulting in 53 sequence types where the same sequences types could be found in Chile, North America and European countries, and in different host species. Multilocus sequence typing give a relatively good separation of different isolates of Fl. psychrophilum and show that there are no distinct geographical or host-specific isolates in the studied material from Chile, North America and Europe. Nor was it possible to separate between isolates from ulcers and systemic infections vs isolates from the surface of healthy salmonids. This study shows a wide geographical distribution of Fl. psychrophilum, indicating that the bacterium has a large potential for transmission over long distances, and between different salmonid hosts species. This knowledge will be important for future management of salmonids diseases connected to Fl. psychrophilum. © 2013 The Society for Applied Microbiology.
Persistence of Mycoplasma hyopneumoniae sequence types in spite of a control program for enzootic pneumonia in pigs.

PubMed

Overesch, Gudrun; Kuhnert, Peter

2017-09-15

Enzootic pneumonia (EP) in pigs caused by Mycoplasma (M.) hyopneumoniae has successfully been combatted in Switzerland. A control program was fully implemented in 2004 which is based on total depopulation strategies of affected fattening farms as well as partial depopulation on breeding farms. Thereby, the number of cases has dropped drastically from more than 200 in 2003 to two cases in 2013. Currently monitoring is done based on clinical observation and subsequent diagnostic of coughing pigs. Moreover, in case of more than 10% gross pathological lesions per slaughter batch laboratory confirmation for EP is compulsory. Despite these strict measures it was not possible to eliminate M. hyopneumoniae from Swiss pig production. In fact, during the last few years the number of EP cases has slightly increased. Therefore, genotyping of the involved M. hyopneumoniae strains was conducted in order to elucidate possible sources and routes of infection. All available and typeable samples from totally 22 cases during the period 2014-2016 were investigated by extended multilocus sequence typing (MLST). A total of 16 cases, including eight from 2014, five from 2015 and three from 2016 could thereby be included in the study. MLST revealed that the majority of cases in 2014/2015 were due to two major spread scenarios, i.e. two M. hyopneumoniae sequence types, each scenario involving six individual production farms in five to six different Cantons (states), respectively. Moreover, by comparison of archived sequences some sequence types were observed over ten years demonstrating their persistence over a long time and the possible partial failure of elimination measures in Switzerland. Insufficient sanitation on affected farms and subsequent animal transport of symptomless infected pigs could lead to recurrent cases. Wild boar harbor identical strains found with EP but solid data are missing to assign a role as reservoir to this wild animal. Implementing a monitoring scheme for M. hyopneumoniae in wild boar in combination with genotyping of all available samples from domestic pigs could direct responsible authorities to possible gaps and deficiencies of control measures taken for combating enzootic pneumonia. With the newly installed PubMLST database sequence types for M. hyopneumoniae are now available and allow tracing back strains on the international level. Copyright © 2017 Elsevier B.V. All rights reserved.
The effects of metal ions on the DNA damage induced by hydrogen peroxide.

PubMed

Kobayashi, S; Ueda, K; Komano, T

1990-01-01

The effects of metal ions on DNA damage induced by hydrogen peroxide were investigated using two methods, agarose-gel electrophoretic analysis of supercoiled DNA and sequencing-gel analysis of single end-labeled DNA fragments of defined sequences. Hydrogen peroxide induced DNA damage when iron or copper ion was present. At least two classes of DNA damage were induced, one being direct DNA-strand cleavage, and the other being base modification labile to hot piperidine. The investigation of the damaged sites and the inhibitory effects of radical scavengers revealed that hydroxyl radical was the species which attacked DNA in the reaction of H2O2/Fe(II). On the other hand, two types of DNA damage were induced by H2O2/Cu(II). Type I damage was predominant and inhibited by potassium iodide, but type II was not. The sites of the base-modification induced by type I damage were similar to those by lipid peroxidation products and by ascorbate in the presence of Cu(II), suggesting the involvement of radical species other than free hydroxyl radical in the damaging reactions.
Production of Mutated Porcine Embryos Using Zinc Finger Nucleases and a Reporter-based Cell Enrichment System.

PubMed

Koo, Ok Jae; Park, Sol Ji; Lee, Choongil; Kang, Jung Taek; Kim, Sujin; Moon, Joon Ho; Choi, Ji Yei; Kim, Hyojin; Jang, Goo; Kim, Jin-Soo; Kim, Seokjoong; Lee, Byeong-Chun

2014-03-01

To facilitate the construction of genetically-modified pigs, we produced cloned embryos derived from porcine fibroblasts transfected with a pair of engineered zinc finger nuclease (ZFN) plasmids to create targeted mutations and enriched using a reporter plasmid system. The reporter expresses RFP and eGFP simultaneously when ZFN-mediated site-specific mutations occur. Thus, double positive cells (RFP(+)/eGFP(+)) were selected and used for somatic cell nuclear transfer. Two types of reporter based enrichment systems were used in this study; the cloned embryos derived from cells enriched using a magnetic sorting-based system showed better developmental competence than did those derived from cells enriched by flow cytometry. Mutated sequences, such as insertions, deletions, or substitutions, together with the wild-type sequence, were found in the cloned porcine blastocysts. Therefore, genetic mutations can be achieved in cloned porcine embryos reconstructed with ZFN-treated cells that were enriched by a reporter-based system.
A LabVIEW based template for user created experiment automation.

PubMed

Kim, D J; Fisk, Z

2012-12-01

We have developed an expandable software template to automate user created experiments. The LabVIEW based template is easily modifiable to add together user created measurements, controls, and data logging with virtually any type of laboratory equipment. We use reentrant sequential selection to implement sequence script making it possible to wrap a long series of the user created experiments and execute them in sequence. Details of software structure and application examples for scanning probe microscope and automated transport experiments using custom built laboratory electronics and a cryostat are described.
Crystal structure and sequence-dependent conformation of the A.G mispaired oligonucleotide d(CGCAAGCTGGCG).

PubMed Central

Webster, G D; Sanderson, M R; Skelly, J V; Neidle, S; Swann, P F; Li, B F; Tickle, I J

1990-01-01

The crystal structure of the dodecanucleotide d(CGCAAGCTGGCG) has been determined to a resolution of 2.5 A and refined to an R factor of 19.3% for 1710 reflections. The sequence crystallizes as a B-type double helix, with two G(anti).A(syn) base pairs. These are stabilized by three-center hydrogen bonds to pyrimidines that induce perturbations in base-pair geometry. The central AGCT region of the helix has a wide (greater than 6 A) minor groove. PMID:2395870
Typing of artiodactyl MHC-DRB genes with the help of intronic simple repeated DNA sequences.

PubMed

Schwaiger, F W; Buitkamp, J; Weyers, E; Epplen, J T

1993-02-01

An efficient oligonucleotide typing method for the highly polymorphic MHC-DRB genes is described for artiodactyls like cattle, sheep and goat. By means of the polymerase chain reaction, the second exon of MHC-DRB is amplified as well as part of the adjacent intron containing a mixed simple repeat sequence. Using this primer combination we were able to amplify the MHC-DRB exons 2 and adjacent introns from all of the investigated 10 species of the family of Bovidae and giraffes. Therefore, the DRB genes of novel artiodactyl species can also be readily studied. Oligonucleotide probes specific for the polymorphisms of ungulate DRB genes are used with which sequences differing in at least one single base can be distinguished. Exonic polymorphism was found to be correlated with the allele lengths and the patterns of the repeat structures. Hence oligonucleotide probes specific for different simple repeats and polymorphic positions serve also for typing across species barriers. The strict correlation of sequence length and exonic polymorphism permits a preselection of specific oligonucleotides for hybridization. Thus more than 20 alleles can already be differentiated from each of the three species.
Molecular epidemiology of drug-resistant Neisseria gonorrhoeae in Russia (Current Status, 2015).

PubMed

Kubanov, Alexey; Vorobyev, Denis; Chestkov, Aleksandr; Leinsoo, Arvo; Shaskolskiy, Boris; Dementieva, Ekaterina; Solomka, Viktoria; Plakhova, Xenia; Gryadunov, Dmitry; Deryabin, Dmitriy

2016-08-09

The widespread distribution of Neisseria gonorrhoeae strains that are resistant to previously used and clinically implemented antibiotics is a significant global public health problem. In line with WHO standards, the national Gonococcal Antimicrobial Surveillance Programme (RU-GASP) has been in existence in Russia since 2004; herein, the current status (2015) is described, including associations between N. gonorrhoeae antimicrobial susceptibility, primary genetic resistance determinants and specific strain sequence types. A total of 124 N. gonorrhoeae strains obtained from 9 regions in Russia in 2015 were examined using N. gonorrhoeae Multi-Antigen Sequence Typing (NG-MAST), an antimicrobial susceptibility test according to European Committee on Antimicrobial Susceptibility Testing (EUCAST) criteria and an oligonucleotide microarray for the identification of mutations in the penA, ponA, rpsJ, gyrA and parC genes responsible for penicillin G, tetracycline, and fluoroquinolone resistance. Genogroup (G) isolates were evaluated based on their porB and tbpB sequence types (STs). NG-MAST analysis showed a diversified population of N. gonorrhoeae in Russia with 58 sequence types, 35 of which were described for the first time. The STs 807, 1544, 1993, 5714, 9476 and 12531, which were typical for some Russian Federation regions and several countries of the former Soviet Union, were represented by five or more isolates. The internationally widespread ST 1407 was represented by a single strain in the present study. Division into genogroups facilitated an exploration of the associations between N. gonorrhoeae sequence type, antimicrobial resistance spectra and genetic resistance determinant contents. Preliminarily susceptible (G-807, G-12531) and resistant (G-5714, G-9476) genogroups were revealed. The variability in the most frequently observed STs and genogroups in each participating region indicated geographically restricted antimicrobial susceptibility in N. gonorrhoeae populations. Resistance or intermediate susceptibility to previously recommended antimicrobials, such as penicillin G (60.5 %), ciprofloxacin (41.1 %) and tetracycline (25 %), is common in the N. gonorrhoeae population. Based on previous reports and current data, ceftriaxone and spectinomycin should be recommended for first-line empiric antimicrobial monotherapy for gonorrhoea in Russia.
Task-set switching under cue-based versus memory-based switching conditions in younger and older adults.

PubMed

Kray, Jutta

2006-08-11

Adult age differences in task switching and advance preparation were examined by comparing cue-based and memory-based switching conditions. Task switching was assessed by determining two types of costs that occur at the general (mixing costs) and specific (switching costs) level of switching. Advance preparation was investigated by varying the time interval until the next task (short, middle, very long). Results indicated that the implementation of task sets was different for cue-based switching with random task sequences and memory-based switching with predictable task sequences. Switching costs were strongly reduced under cue-based switching conditions, indicating that task-set cues facilitate the retrieval of the next task. Age differences were found for mixing costs and for switching costs only under cue-based conditions in which older adults showed smaller switching costs than younger adults. It is suggested that older adults adopt a less extreme bias between two tasks than younger adults in situations associated with uncertainty. For cue-based switching with random task sequences, older adults are less engaged in a complete reconfiguration of task sets because of the probability of a further task change. Furthermore, the reduction of switching costs was more pronounced for cue- than memory-based switching for short preparation intervals, whereas the reduction of switch costs was more pronounced for memory- than cue-based switching for longer preparation intervals at least for older adults. Together these findings suggest that the implementation of task sets is functionally different for the two types of task-switching conditions.
dCITE: Measuring Necessary Cladistic Information Can Help You Reduce Polytomy Artefacts in Trees.

PubMed

Wise, Michael J

2016-01-01

Biologists regularly create phylogenetic trees to better understand the evolutionary origins of their species of interest, and often use genomes as their data source. However, as more and more incomplete genomes are published, in many cases it may not be possible to compute genome-based phylogenetic trees due to large gaps in the assembled sequences. In addition, comparison of complete genomes may not even be desirable due to the presence of horizontally acquired and homologous genes. A decision must therefore be made about which gene, or gene combinations, should be used to compute a tree. Deflated Cladistic Information based on Total Entropy (dCITE) is proposed as an easily computed metric for measuring the cladistic information in multiple sequence alignments representing a range of taxa, without the need to first compute the corresponding trees. dCITE scores can be used to rank candidate genes or decide whether input sequences provide insufficient cladistic information, making artefactual polytomies more likely. The dCITE method can be applied to protein, nucleotide or encoded phenotypic data, so can be used to select which data-type is most appropriate, given the choice. In a series of experiments the dCITE method was compared with related measures. Then, as a practical demonstration, the ideas developed in the paper were applied to a dataset representing species from the order Campylobacterales; trees based on sequence combinations, selected on the basis of their dCITE scores, were compared with a tree constructed to mimic Multi-Locus Sequence Typing (MLST) combinations of fragments. We see that the greater the dCITE score the more likely it is that the computed phylogenetic tree will be free of artefactual polytomies. Secondly, cladistic information saturates, beyond which little additional cladistic information can be obtained by adding additional sequences. Finally, sequences with high cladistic information produce more consistent trees for the same taxa.
dCITE: Measuring Necessary Cladistic Information Can Help You Reduce Polytomy Artefacts in Trees

PubMed Central

2016-01-01

Biologists regularly create phylogenetic trees to better understand the evolutionary origins of their species of interest, and often use genomes as their data source. However, as more and more incomplete genomes are published, in many cases it may not be possible to compute genome-based phylogenetic trees due to large gaps in the assembled sequences. In addition, comparison of complete genomes may not even be desirable due to the presence of horizontally acquired and homologous genes. A decision must therefore be made about which gene, or gene combinations, should be used to compute a tree. Deflated Cladistic Information based on Total Entropy (dCITE) is proposed as an easily computed metric for measuring the cladistic information in multiple sequence alignments representing a range of taxa, without the need to first compute the corresponding trees. dCITE scores can be used to rank candidate genes or decide whether input sequences provide insufficient cladistic information, making artefactual polytomies more likely. The dCITE method can be applied to protein, nucleotide or encoded phenotypic data, so can be used to select which data-type is most appropriate, given the choice. In a series of experiments the dCITE method was compared with related measures. Then, as a practical demonstration, the ideas developed in the paper were applied to a dataset representing species from the order Campylobacterales; trees based on sequence combinations, selected on the basis of their dCITE scores, were compared with a tree constructed to mimic Multi-Locus Sequence Typing (MLST) combinations of fragments. We see that the greater the dCITE score the more likely it is that the computed phylogenetic tree will be free of artefactual polytomies. Secondly, cladistic information saturates, beyond which little additional cladistic information can be obtained by adding additional sequences. Finally, sequences with high cladistic information produce more consistent trees for the same taxa. PMID:27898695
Bartonella dromedarii sp. nov. isolated from domesticated camels (Camelus dromedarius) in Israel.

PubMed

Rasis, Michal; Rudoler, Nir; Schwartz, David; Giladi, Michael

2014-11-01

Bartonella spp. are fastidious, Gram-negative bacilli that cause a wide spectrum of diseases in humans. Most Bartonella spp. have adapted to a specific host, generally a domestic or wild mammal. Dromedary camels (Camelus dromedarius) have become a focus of growing public-health interest because they have been identified as a reservoir host for the Middle East respiratory syndrome coronavirus. Nevertheless, data on camel zoonoses are limited. We aimed to study the occurrence of Bartonella bacteremia among dromedaries in Israel. Nine of 51 (17.6%) camels were found to be bacteremic with Bartonella spp.; bacteremia levels ranged from five to >1000 colony-forming units/mL. Phylogenetic reconstruction based on the concatenated sequences of gltA and rpoB genes demonstrated that the dromedary Bartonella isolates are closely related to other ruminant-derived Bartonella spp., with B. bovis being the nearest relative. Using electron microscopy, the novel isolates were shown to be flagellated, whereas B. bovis is nonflagellated. Sequence comparisons analysis of the housekeeping genes ftsZ, ribC, and groEL showed the highest homology to B. chomelii, B. capreoli, and B. birtlesii, respectively. Sequence analysis of the gltA and rpoB revealed ∼96% identity to B. bovis, a previously suggested cutoff value for sequence-based differentiation of Bartonella spp., suggesting that this approach does not have sufficient discriminatory power for differentiating ruminant-related Bartonella spp. A comprehensive multilocus sequence typing (MLST) analysis based on nine genetic loci (gltA, rpoB, ftsZ, internal transcribed spacer (ITS), 16S rRNA, ribC, groEL, nuoG, and SsrA) identified seven sequence types of the new dromedary isolates. This is the first description of a Bartonella sp. from camelids. On the basis of a distinct reservoir and ecological niche, sequence analyses, and expression of flagella, we designate these isolates as a novel Bartonella sp. named Bartonella dromedarii sp. nov. Further studies are required to explore its zoonotic potential.
Comparison of double-locus sequence typing (DLST) and multilocus sequence typing (MLST) for the investigation of Pseudomonas aeruginosa populations.

PubMed

Cholley, Pascal; Stojanov, Milos; Hocquet, Didier; Thouverez, Michelle; Bertrand, Xavier; Blanc, Dominique S

2015-08-01

Reliable molecular typing methods are necessary to investigate the epidemiology of bacterial pathogens. Reference methods such as multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE) are costly and time consuming. Here, we compared our newly developed double-locus sequence typing (DLST) method for Pseudomonas aeruginosa to MLST and PFGE on a collection of 281 isolates. DLST was as discriminatory as MLST and was able to recognize "high-risk" epidemic clones. Both methods were highly congruent. Not surprisingly, a higher discriminatory power was observed with PFGE. In conclusion, being a simple method (single-strand sequencing of only 2 loci), DLST is valuable as a first-line typing tool for epidemiological investigations of P. aeruginosa. Coupled to a more discriminant method like PFGE or whole genome sequencing, it might represent an efficient typing strategy to investigate or prevent outbreaks. Copyright © 2015 Elsevier Inc. All rights reserved.

Comparative analysis of the full genome sequence of European bat lyssavirus type 1 and type 2 with other lyssaviruses and evidence for a conserved transcription termination and polyadenylation motif in the G-L 3' non-translated region.

PubMed

Marston, D A; McElhinney, L M; Johnson, N; Müller, T; Conzelmann, K K; Tordo, N; Fooks, A R

2007-04-01

We report the first full-length genomic sequences for European bat lyssavirus type-1 (EBLV-1) and type-2 (EBLV-2). The EBLV-1 genomic sequence was derived from a virus isolated from a serotine bat in Hamburg, Germany, in 1968 and the EBLV-2 sequence was derived from a virus isolate from a human case of rabies that occurred in Scotland in 2002. A long-distance PCR strategy was used to amplify the open reading frames (ORFs), followed by standard and modified RACE (rapid amplification of cDNA ends) techniques to amplify the 3' and 5' ends. The lengths of each complete viral genome for EBLV-1 and EBLV-2 were 11 966 and 11 930 base pairs, respectively, and follow the standard rhabdovirus genome organization of five viral proteins. Comparison with other lyssavirus sequences demonstrates variation in degrees of homology, with the genomic termini showing a high degree of complementarity. The nucleoprotein was the most conserved, both intra- and intergenotypically, followed by the polymerase (L), matrix and glyco- proteins, with the phosphoprotein being the most variable. In addition, we have shown that the two EBLVs utilize a conserved transcription termination and polyadenylation (TTP) motif, approximately 50 nt upstream of the L gene start codon. All available lyssavirus sequences to date, with the exception of Pasteur virus (PV) and PV-derived isolates, use the second TTP site. This observation may explain differences in pathogenicity between lyssavirus strains, dependent on the length of the untranslated region, which might affect transcriptional activity and RNA stability.
Genotyping and Source Tracking of Cronobacter sakazakii and C. malonaticus Isolates from Powdered Infant Formula and an Infant Formula Production Factory in China

PubMed Central

Fei, Peng; Man, Chaoxin; Lou, Binbin; Forsythe, Stephen J.; Chai, Yunlei; Li, Ran; Niu, Jieting

2015-01-01

Cronobacter spp. (formerly defined as Enterobacter sakazakii) are opportunistic bacterial pathogens of both infants and adults. In this study, we analyzed 70 Cronobacter isolates from powdered infant formula (PIF) and an infant formula production facility in China to determine possible contamination routes. The strains were profiled by multilocus sequence typing (MLST), pulsed-field gel electrophoresis (PFGE), PCR-based O-antigen serotyping, and ompA and rpoB sequence analyses. The isolates were primarily Cronobacter sakazakii (66/70) or Cronobacter malonaticus (4/70). The strains were divided into 38 pulsotypes (PTs) using PFGE and 19 sequence types (STs) by MLST. In contrast, rpoB and ompA sequence analyses divided the strains into 10 overlapping clusters each. PCR serotyping of the 66 C. sakazakii and 4 C. malonaticus strains resulted in the identification of four C. sakazakii serotypes (O1, O2, O4, and O7) and a single C. malonaticus serotype, O2. The dominant C. sakazakii sequence types from PIF and an infant formula production factory in China were C. sakazakii clonal complex 4 (CC4) (n = 19), ST1 (n = 14), and ST64 (n = 11). C. sakazakii CC4 is a clonal lineage strongly associated with neonatal meningitis. In the process of manufacturing PIF, the spray-drying, fluidized-bed-drying, and packing areas were the main areas with Cronobacter contamination. C. sakazakii strains with the same pulsotypes (PT3 and PT2) and sequence types (ST1 and ST64) were isolated both from processing equipment and from the PIF finished product. PMID:26048942
Genotyping and Source Tracking of Cronobacter sakazakii and C. malonaticus Isolates from Powdered Infant Formula and an Infant Formula Production Factory in China.

PubMed

Fei, Peng; Man, Chaoxin; Lou, Binbin; Forsythe, Stephen J; Chai, Yunlei; Li, Ran; Niu, Jieting; Jiang, Yujun

2015-08-15

Cronobacter spp. (formerly defined as Enterobacter sakazakii) are opportunistic bacterial pathogens of both infants and adults. In this study, we analyzed 70 Cronobacter isolates from powdered infant formula (PIF) and an infant formula production facility in China to determine possible contamination routes. The strains were profiled by multilocus sequence typing (MLST), pulsed-field gel electrophoresis (PFGE), PCR-based O-antigen serotyping, and ompA and rpoB sequence analyses. The isolates were primarily Cronobacter sakazakii (66/70) or Cronobacter malonaticus (4/70). The strains were divided into 38 pulsotypes (PTs) using PFGE and 19 sequence types (STs) by MLST. In contrast, rpoB and ompA sequence analyses divided the strains into 10 overlapping clusters each. PCR serotyping of the 66 C. sakazakii and 4 C. malonaticus strains resulted in the identification of four C. sakazakii serotypes (O1, O2, O4, and O7) and a single C. malonaticus serotype, O2. The dominant C. sakazakii sequence types from PIF and an infant formula production factory in China were C. sakazakii clonal complex 4 (CC4) (n = 19), ST1 (n = 14), and ST64 (n = 11). C. sakazakii CC4 is a clonal lineage strongly associated with neonatal meningitis. In the process of manufacturing PIF, the spray-drying, fluidized-bed-drying, and packing areas were the main areas with Cronobacter contamination. C. sakazakii strains with the same pulsotypes (PT3 and PT2) and sequence types (ST1 and ST64) were isolated both from processing equipment and from the PIF finished product. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
A glow of HLA typing in organ transplantation

PubMed Central

2013-01-01

The transplant of organs and tissues is one of the greatest curative achievements of this century. In organ transplantation, the adaptive immunity is considered the main response exerted to the transplanted tissue, since the main goal of the immune response is the MHC (major histocompatibility complex) molecules expressed on the surface of donor cells. Cell surface molecules that induce an antigenic stimulus cause the rejection immune response to grafted tissue or organ. A wide variety of transplantation antigens have been described, including the major histocompatibility molecules, minor histocompatibility antigens, ABO blood group antigens and endothelial cell antigens. The sensitization to MHC antigens may be caused by transfusions, pregnancy, or failed previous grafts leading to development of anti-human leukocyte antigen (HLA) antibodies that are important factor responsible for graft rejection in solid organ transplantation and play a role in post-transfusion complication Anti-HLA Abs may be present in healthy individuals. Methods for HLA typing are described, including serological methods, molecular techniques of sequence-specific priming (SSP), sequence-specific oligonucleotide probing (SSOP), Sequence based typing (SBT) and reference strand-based conformation analysis (RSCA) method. Problems with organ transplantation are reservoir of organs and immune suppressive treatments that used to decrease rate of rejection with less side effect and complications. PMID:23432791
Sequence-based characterization of Listeria monocytogenes strains isolated from domestic retail meat in the Tokyo metropolitan area of Japan.

PubMed

Yoshikawa, Yuko; Ochiai, Yoshitsugu; Mochizuki, Mariko; Takano, Takashi; Hondo, Ryo; Ueda, Fukiko

2018-05-31

To assess the level of Listeria monocytogenes contamination of domestic retail meat in Tokyo, Japan, we compared isolates from 2004 to 2007 with those isolated before 2003. The overall prevalence of L. monocytogenes among these samples significantly diminished over time (1998-2003, 28.0%; 2004-2007, 17.6%) reflecting a significant decrease in the frequency of contamination of beef. Serotype 1/2a was isolated most frequently, reflecting a change in the predominant serotype in pork from 1/2c to 1/2a. We performed a simple genetic subtyping method based on three genes, iap, sigB, and actA, as well as traditional multilocus sequence typing to classify the allele types (ATs). No extensive variation among sequence types was detected; however, increased genetic diversity among the ATs of the three genes in the 2004-2007 isolates was evident. We identified AT 26 of the iap gene, not previously reported in Japanese isolates, and six ATs of the sigB gene, including four with nonsense mutations not currently registered in L. monocytogenes DNA databases. sigB is an evolutionally conserved gene that plays a role in the stress response. Our results indicate that the sigB gene may be relatively unstable among L. monocytogenes strains circulating in Japan.
Complete chloroplast and ribosomal sequences for 30 accessions elucidate evolution of Oryza AA genome species

PubMed Central

Kim, Kyunghee; Lee, Sang-Choon; Lee, Junki; Yu, Yeisoo; Yang, Kiwoung; Choi, Beom-Soon; Koh, Hee-Jong; Waminal, Nomar Espinosa; Choi, Hong-Il; Kim, Nam-Hoon; Jang, Woojong; Park, Hyun-Seung; Lee, Jonghoon; Lee, Hyun Oh; Joh, Ho Jun; Lee, Hyeon Ju; Park, Jee Young; Perumal, Sampath; Jayakodi, Murukarthick; Lee, Yun Sun; Kim, Backki; Copetti, Dario; Kim, Soonok; Kim, Sunggil; Lim, Ki-Byung; Kim, Young-Dong; Lee, Jungho; Cho, Kwang-Su; Park, Beom-Seok; Wing, Rod A.; Yang, Tae-Jin

2015-01-01

Cytoplasmic chloroplast (cp) genomes and nuclear ribosomal DNA (nR) are the primary sequences used to understand plant diversity and evolution. We introduce a high-throughput method to simultaneously obtain complete cp and nR sequences using Illumina platform whole-genome sequence. We applied the method to 30 rice specimens belonging to nine Oryza species. Concurrent phylogenomic analysis using cp and nR of several of specimens of the same Oryza AA genome species provides insight into the evolution and domestication of cultivated rice, clarifying three ambiguous but important issues in the evolution of wild Oryza species. First, cp-based trees clearly classify each lineage but can be biased by inter-subspecies cross-hybridization events during speciation. Second, O. glumaepatula, a South American wild rice, includes two cytoplasm types, one of which is derived from a recent interspecies hybridization with O. longistminata. Third, the Australian O. rufipogan-type rice is a perennial form of O. meridionalis. PMID:26506948
Substitution of wild-type yellow fever Asibi sequences for 17D vaccine sequences in ChimeriVax-dengue 4 does not enhance infection of Aedes aegypti mosquitoes.

PubMed

McGee, Charles E; Tsetsarkin, Konstantin; Vanlandingham, Dana L; McElroy, Kate L; Lang, Jean; Guy, Bruno; Decelle, Thierry; Higgs, Stephen

2008-03-01

To address concerns that a flavivirus vaccine/wild-type recombinant virus might have a high mosquito infectivity phenotype, the yellow fever virus (YFV) 17D backbone of the ChimeriVax-dengue 4 virus was replaced with the corresponding gene sequences of the virulent YFV Asibi strain. Field-collected and laboratory-colonized Aedes aegypti mosquitoes were fed on blood containing each of the viruses under investigation and held for 14 days after infection. Infection and dissemination rates were based on antigen detection in titrated body or head triturates. Our data indicate that, even in the highly unlikely event of recombination or substantial backbone reversion, virulent sequences do not enhance the transmissibility of ChimeriVax viruses. In light of the low-level viremias that have been observed after vaccination in human volunteers coupled with low mosquito infectivity, it is predicted that the risk of mosquito infection and transmission of ChimeriVax vaccine recombinant/revertant viruses in nature is minimal.
Cumulative Axial and Torsional Fatigue: An Investigation of Load-Type Sequencing Effects

NASA Technical Reports Server (NTRS)

Kalluri, Sreeramesh; Bonacuse, Peter J.

2000-01-01

Cumulative fatigue behavior of a wrought cobalt-base superalloy, Haynes 188 was investigated at 538 C under various single-step sequences of axial and torsional loading conditions. Initially, fully-reversed, axial and torsional fatigue tests were conducted under strain control at 538 C on thin-walled tubular specimens to establish baseline fatigue life relationships. Subsequently, four sequences (axial/axial, torsional/torsional, axial/torsional, and torsional/axial) of two load-level fatigue tests were conducted to characterize both the load-order (high/low) and load-type sequencing effects. For the two load-level tests, summations of life fractions and the remaining fatigue lives at the second load-level were computed by the Miner's Linear Damage Rule (LDR) and a nonlinear Damage Curve Approach (DCA). In general, for all four cases predictions by LDR were unconservative. Predictions by the DCA were within a factor of two of the experimentally observed fatigue lives for a majority of the cumulative axial and torsional fatigue tests.
Winnowing DNA for rare sequences: highly specific sequence and methylation based enrichment.

PubMed

Thompson, Jason D; Shibahara, Gosuke; Rajan, Sweta; Pel, Joel; Marziali, Andre

2012-01-01

Rare mutations in cell populations are known to be hallmarks of many diseases and cancers. Similarly, differential DNA methylation patterns arise in rare cell populations with diagnostic potential such as fetal cells circulating in maternal blood. Unfortunately, the frequency of alleles with diagnostic potential, relative to wild-type background sequence, is often well below the frequency of errors in currently available methods for sequence analysis, including very high throughput DNA sequencing. We demonstrate a DNA preparation and purification method that through non-linear electrophoretic separation in media containing oligonucleotide probes, achieves 10,000 fold enrichment of target DNA with single nucleotide specificity, and 100 fold enrichment of unmodified methylated DNA differing from the background by the methylation of a single cytosine residue.
DNA extraction for streamlined metagenomics of diverse environmental samples.

PubMed

Marotz, Clarisse; Amir, Amnon; Humphrey, Greg; Gaffney, James; Gogul, Grant; Knight, Rob

2017-06-01

A major bottleneck for metagenomic sequencing is rapid and efficient DNA extraction. Here, we compare the extraction efficiencies of three magnetic bead-based platforms (KingFisher, epMotion, and Tecan) to a standardized column-based extraction platform across a variety of sample types, including feces, oral, skin, soil, and water. Replicate sample plates were extracted and prepared for 16S rRNA gene amplicon sequencing in parallel to assess extraction bias and DNA quality. The data demonstrate that any effect of extraction method on sequencing results was small compared with the variability across samples; however, the KingFisher platform produced the largest number of high-quality reads in the shortest amount of time. Based on these results, we have identified an extraction pipeline that dramatically reduces sample processing time without sacrificing bacterial taxonomic or abundance information.
Molecular Epidemiology and Phylogeny Reveal Complex Spatial Dynamics in Areas Where Canine Parvovirus Is Endemic ▿†

PubMed Central

Clegg, S. R.; Coyne, K. P.; Parker, J.; Dawson, S.; Godsall, S. A.; Pinchbeck, G.; Cripps, P. J.; Gaskell, R. M.; Radford, A. D.

2011-01-01

Canine parvovirus type 2 (CPV-2) is a severe enteric pathogen of dogs, causing high mortality in unvaccinated dogs. After emerging, CPV-2 spread rapidly worldwide. However, there is now some evidence to suggest that international transmission appears to be more restricted. In order to investigate the transmission and evolution of CPV-2 both nationally and in relation to the global situation, we have used a long-range PCR to amplify and sequence the full VP2 gene of 150 canine parvoviruses obtained from a large cross-sectional sample of dogs presenting with severe diarrhea to veterinarians in the United Kingdom, over a 2-year period. Among these 150 strains, 50 different DNA sequence types (S) were identified, and apart from one case, all appeared unique to the United Kingdom. Phylogenetic analysis provided clear evidence for spatial clustering at the international level and for the first time also at the national level, with the geographical range of some sequence types appearing to be highly restricted within the United Kingdom. Evolution of the VP2 gene in this data set was associated with a lack of positive selection. In addition, the majority of predicted amino acid sequences were identical to those found elsewhere in the world, suggesting that CPV VP2 has evolved a highly fit conformation. Based on typing systems using key amino acid mutations, 43% of viruses were CPV-2a, and 57% CPV-2b, with no type 2 or 2c found. However, phylogenetic analysis suggested complex antigenic evolution of this virus, with both type 2a and 2b viruses appearing polyphyletic. As such, typing based on specific amino acid mutations may not reflect the true epidemiology of this virus. The geographical restriction that we observed both within the United Kingdom and between the United Kingdom and other countries, together with the lack of CPV-2c in this population, strongly suggests the spread of CPV within its population may be heterogeneously subject to limiting factors. This cross-sectional study of national and global CPV phylogeographic segregation reveals a substantially more complex epidemic structure than previously described. PMID:21593180
Mitochondrial DNA variant at HVI region as a candidate of genetic markers of type 2 diabetes

NASA Astrophysics Data System (ADS)

Gumilar, Gun Gun; Purnamasari, Yunita; Setiadi, Rahmat

2016-02-01

Mitochondrial DNA (mtDNA) is maternally inherited. mtDNA mutations which can contribute to the excess of maternal inheritance of type 2 diabetes. Due to the high mutation rate, one of the areas in the mtDNA that is often associated with the disease is the hypervariable region I (HVI). Therefore, this study was conducted to determine the genetic variants of human mtDNA HVI that related to the type 2 diabetes in four samples that were taken from four generations in one lineage. Steps being taken include the lyses of hair follicles, amplification of mtDNA HVI fragment using Polymerase Chain Reaction (PCR), detection of PCR products through agarose gel electrophoresis technique, the measurement of the concentration of mtDNA using UV-Vis spectrophotometer, determination of the nucleotide sequence via direct sequencing method and analysis of the sequencing results using SeqMan DNASTAR program. Based on the comparison between nucleotide sequence of samples and revised Cambridge Reference Sequence (rCRS) obtained six same mutations that these are C16147T, T16189C, C16193del, T16127C, A16235G, and A16293C. After comparing the data obtained to the secondary data from Mitomap and NCBI, it were found that two mutations, T16189C and T16217C, become candidates as genetic markers of type 2 diabetes even the mutations were found also in the generations of undiagnosed type 2 diabetes. The results of this study are expected to give contribution to the collection of human mtDNA database of genetic variants that associated to metabolic diseases, so that in the future it can be utilized in various fields, especially in medicine.
Isolation of Propionibacterium acnes among the microbiota of primary endodontic infections with and without intraoral communication.

PubMed

Niazi, Sadia Ambreen; Al Kharusi, Hana Suleiman; Patel, Shanon; Bruce, Kenneth; Beighton, David; Foschi, Federico; Mannocci, Francesco

2016-11-01

The presence of opportunistic pathogens such as Propionibacterium acnes (P. acnes) may contribute to the endodontic pathology. The presence of P. acnes may be influenced by different endodontic conditions. The aims of the study were firstly, to identify P. acnes within the whole cultivable microbiota of primary endodontic infections, to investigate which P. acnes phylotypes predominate in such infections and secondly to determine if the presence of an "open" communication (e.g. a sinus) can be associated with the isolation of P. acnes from the root canal. The predominant cultivable microbiota of 15 primary endodontic lesions (7 without communication with the oral environment and 8 with an open communication) were identified using partial 16S ribosomal RNA (rRNA) gene sequence analysis. The identification of the organism was determined by interrogating the Human Oral Microbiome Database. The P. acnes isolates were typed on the basis of the recA gene sequence comparison. A neighbor-joining tree was constructed using MEGA 4.1 with the inclusion of known recA sequences. There was no difference in the number of species identified from lesions without communication (5.86 ± 3.7) and those with communication (5.37 ± 3.6) (P > 0.05). PCR-based 16S rRNA gene sequencing revealed P. acnes as the most prevalent isolate recovered from lesions with communication. recA gene sequencing revealed two phylogenetic lineages present in lesion with communication, with mainly type I (further split into type IA and type IB) and type II. The presence of P. acnes as opportunistic pathogens has been confirmed and may sustain the traits observed in specific clinical presentations. Clinical management of open lesions may require further disinfection to eliminate opportunistic bacteria.
Pan-genome multilocus sequence typing and outbreak-specific reference-based single nucleotide polymorphism analysis to resolve two concurrent Staphylococcus aureus outbreaks in neonatal services.

PubMed

Roisin, S; Gaudin, C; De Mendonça, R; Bellon, J; Van Vaerenbergh, K; De Bruyne, K; Byl, B; Pouseele, H; Denis, O; Supply, P

2016-06-01

We used a two-step whole genome sequencing analysis for resolving two concurrent outbreaks in two neonatal services in Belgium, caused by exfoliative toxin A-encoding-gene-positive (eta+) methicillin-susceptible Staphylococcus aureus with an otherwise sporadic spa-type t209 (ST-109). Outbreak A involved 19 neonates and one healthcare worker in a Brussels hospital from May 2011 to October 2013. After a first episode interrupted by decolonization procedures applied over 7 months, the outbreak resumed concomitantly with the onset of outbreak B in a hospital in Asse, comprising 11 neonates and one healthcare worker from mid-2012 to January 2013. Pan-genome multilocus sequence typing, defined on the basis of 42 core and accessory reference genomes, and single-nucleotide polymorphisms mapped on an outbreak-specific de novo assembly were used to compare 28 available outbreak isolates and 19 eta+/spa-type t209 isolates identified by routine or nationwide surveillance. Pan-genome multilocus sequence typing showed that the outbreaks were caused by independent clones not closely related to any of the surveillance isolates. Isolates from only ten cases with overlapping stays in outbreak A, including four pairs of twins, showed no or only a single nucleotide polymorphism variation, indicating limited sequential transmission. Detection of larger genomic variation, even from the start of the outbreak, pointed to sporadic seeding from a pre-existing exogenous source, which persisted throughout the whole course of outbreak A. Whole genome sequencing analysis can provide unique fine-tuned insights into transmission pathways of complex outbreaks even at their inception, which, with timely use, could valuably guide efforts for early source identification. Copyright © 2016 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Development of a Single Locus Sequence Typing (SLST) Scheme for Typing Bacterial Species Directly from Complex Communities.

PubMed

Scholz, Christian F P; Jensen, Anders

2017-01-01

The protocol describes a computational method to develop a Single Locus Sequence Typing (SLST) scheme for typing bacterial species. The resulting scheme can be used to type bacterial isolates as well as bacterial species directly from complex communities using next-generation sequencing technologies.
Universal sequence map (USM) of arbitrary discrete sequences

PubMed Central

2002-01-01

Background For over a decade the idea of representing biological sequences in a continuous coordinate space has maintained its appeal but not been fully realized. The basic idea is that any sequence of symbols may define trajectories in the continuous space conserving all its statistical properties. Ideally, such a representation would allow scale independent sequence analysis – without the context of fixed memory length. A simple example would consist on being able to infer the homology between two sequences solely by comparing the coordinates of any two homologous units. Results We have successfully identified such an iterative function for bijective mappingψ of discrete sequences into objects of continuous state space that enable scale-independent sequence analysis. The technique, named Universal Sequence Mapping (USM), is applicable to sequences with an arbitrary length and arbitrary number of unique units and generates a representation where map distance estimates sequence similarity. The novel USM procedure is based on earlier work by these and other authors on the properties of Chaos Game Representation (CGR). The latter enables the representation of 4 unit type sequences (like DNA) as an order free Markov Chain transition table. The properties of USM are illustrated with test data and can be verified for other data by using the accompanying web-based tool:http://bioinformatics.musc.edu/~jonas/usm/. Conclusions USM is shown to enable a statistical mechanics approach to sequence analysis. The scale independent representation frees sequence analysis from the need to assume a memory length in the investigation of syntactic rules. PMID:11895567
Candida ficus sp. nov., a novel yeast species from the gut of Apriona germari larvae.

PubMed

Hui, Feng-Li; Niu, Qiu-Hong; Ke, Tao; Liu, Zheng

2012-11-01

A novel yeast species is described based on three strains from the gut of wood-boring larvae collected in a tree trunk of Ficus carica cultivated in parks near Nanyang, central China. Phylogenetic analysis based on sequences of the D1/D2 domains of the large subunit rRNA gene showed that these strains occurred in a separate clade that was genetically distinct from all known ascomycetous yeasts. In terms of pairwise sequence divergence, the novel strains differed by 15.3% divergence from the type strain of Pichia terricola, and by 15.8% divergence from the type strains of Pichia exigua and Candida rugopelliculosa in the D1/D2 domains. All three are ascomycetous yeasts in the Pichia clade. Unlike P. terricola, P. exigua and C. rugopelliculosa, the novel isolates did not ferment glucose. The name Candida ficus sp. nov. is proposed to accommodate these highly divergent organisms, with STN-8(T) (=CICC 1980(T)=CBS 12638(T)) as the type strain.
Direct detection of a BRAF mutation in total RNA from melanoma cells using cantilever arrays

NASA Astrophysics Data System (ADS)

Huber, F.; Lang, H. P.; Backmann, N.; Rimoldi, D.; Gerber, Ch.

2013-02-01

Malignant melanoma, the deadliest form of skin cancer, is characterized by a predominant mutation in the BRAF gene. Drugs that target tumours carrying this mutation have recently entered the clinic. Accordingly, patients are routinely screened for mutations in this gene to determine whether they can benefit from this type of treatment. The current gold standard for mutation screening uses real-time polymerase chain reaction and sequencing methods. Here we show that an assay based on microcantilever arrays can detect the mutation nanomechanically without amplification in total RNA samples isolated from melanoma cells. The assay is based on a BRAF-specific oligonucleotide probe. We detected mutant BRAF at a concentration of 500 pM in a 50-fold excess of the wild-type sequence. The method was able to distinguish melanoma cells carrying the mutation from wild-type cells using as little as 20 ng µl-1 of RNA material, without prior PCR amplification and use of labels.
Development of Multilocus Sequence Typing (MLST) for Mycoplasma synoviae.

PubMed

El-Gazzar, Mohamed; Ghanem, Mostafa; McDonald, Kristina; Ferguson-Noel, Naola; Raviv, Ziv; Slemons, Richard D

2017-03-01

Mycoplasma synoviae (MS) is a poultry pathogen that has had an increasing incidence and economic impact over the past few years. Strain identification is necessary for outbreak investigation, infection source identification, and facilitating prevention and control as well as eradication efforts. Currently, a segment of the variable lipoprotein hemagglutinin A (vlhA) gene (420 bp) is the only target that is used for MS strain identification. A major limitation of this assay is that colonality of typed samples can only be inferred if their vlhA sequences are identical; however, if their sequences are different, the degree of relatedness is uncertain. In this study we propose a multilocus sequence typing (MLST) assay to further refine MS strain identification. After initial screening of 24 housekeeping genes as potential targets, seven genes were selected for the MLST assay. An internal segment (450-711 bp) from each of the seven genes was successfully amplified and sequenced from 58 different MS strains and field isolates (n = 30) or positive clinical samples (n = 28). The collective sequence of all seven gene segments (3960 bp total) was used for MS sequence typing. The 58 tested MS samples were typed into 30 different sequence types using the MLST assay and, coincidentally, all the samples were typed into 30 sequence types using the vlhA assay. However, the phylogenetic tree generated using the MLST data was more congruent to the epidemiologic information than was the tree generated by the vlhA assay. We suggest that the newly developed MLST assay and the vlhA assay could be used in tandem for MS typing. The MLST assay will be a valuable and more reliable tool for MS sequence typing, providing better understanding of the epidemiology of MS infection. This in turn will aid disease prevention, control, and eradication efforts.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Sobottka, Marcelo, E-mail: sobottka@mtm.ufsc.br; Hart, Andrew G., E-mail: ahart@dim.uchile.cl

Highlights: {yields} We propose a simple stochastic model to construct primitive DNA sequences. {yields} The model provide an explanation for Chargaff's second parity rule in primitive DNA sequences. {yields} The model is also used to predict a novel type of strand symmetry in primitive DNA sequences. {yields} We extend the results for bacterial DNA sequences and compare distributional properties intrinsic to the model to statistical estimates from 1049 bacterial genomes. {yields} We find out statistical evidences that the novel type of strand symmetry holds for bacterial DNA sequences. -- Abstract: Chargaff's second parity rule for short oligonucleotides states that themore » frequency of any short nucleotide sequence on a strand is approximately equal to the frequency of its reverse complement on the same strand. Recent studies have shown that, with the exception of organellar DNA, this parity rule generally holds for double-stranded DNA genomes and fails to hold for single-stranded genomes. While Chargaff's first parity rule is fully explained by the Watson-Crick pairing in the DNA double helix, a definitive explanation for the second parity rule has not yet been determined. In this work, we propose a model based on a hidden Markov process for approximating the distributional structure of primitive DNA sequences. Then, we use the model to provide another possible theoretical explanation for Chargaff's second parity rule, and to predict novel distributional aspects of bacterial DNA sequences.« less

Phylogenomic Study of Burkholderia glathei-like Organisms, Proposal of 13 Novel Burkholderia Species and Emended Descriptions of Burkholderia sordidicola, Burkholderia zhejiangensis, and Burkholderia grimmiae

PubMed Central

Peeters, Charlotte; Meier-Kolthoff, Jan P.; Verheyde, Bart; De Brandt, Evie; Cooper, Vaughn S.; Vandamme, Peter

2016-01-01

Partial gyrB gene sequence analysis of 17 isolates from human and environmental sources revealed 13 clusters of strains and identified them as Burkholderia glathei clade (BGC) bacteria. The taxonomic status of these clusters was examined by whole-genome sequence analysis, determination of the G+C content, whole-cell fatty acid analysis and biochemical characterization. The whole-genome sequence-based phylogeny was assessed using the Genome Blast Distance Phylogeny (GBDP) method and an extended multilocus sequence analysis (MLSA) approach. The results demonstrated that these 17 BGC isolates represented 13 novel Burkholderia species that could be distinguished by both genotypic and phenotypic characteristics. BGC strains exhibited a broad metabolic versatility and developed beneficial, symbiotic, and pathogenic interactions with different hosts. Our data also confirmed that there is no phylogenetic subdivision in the genus Burkholderia that distinguishes beneficial from pathogenic strains. We therefore propose to formally classify the 13 novel BGC Burkholderia species as Burkholderia arvi sp. nov. (type strain LMG 29317T = CCUG 68412T), Burkholderia hypogeia sp. nov. (type strain LMG 29322T = CCUG 68407T), Burkholderia ptereochthonis sp. nov. (type strain LMG 29326T = CCUG 68403T), Burkholderia glebae sp. nov. (type strain LMG 29325T = CCUG 68404T), Burkholderia pedi sp. nov. (type strain LMG 29323T = CCUG 68406T), Burkholderia arationis sp. nov. (type strain LMG 29324T = CCUG 68405T), Burkholderia fortuita sp. nov. (type strain LMG 29320T = CCUG 68409T), Burkholderia temeraria sp. nov. (type strain LMG 29319T = CCUG 68410T), Burkholderia calidae sp. nov. (type strain LMG 29321T = CCUG 68408T), Burkholderia concitans sp. nov. (type strain LMG 29315T = CCUG 68414T), Burkholderia turbans sp. nov. (type strain LMG 29316T = CCUG 68413T), Burkholderia catudaia sp. nov. (type strain LMG 29318T = CCUG 68411T) and Burkholderia peredens sp. nov. (type strain LMG 29314T = CCUG 68415T). Furthermore, we present emended descriptions of the species Burkholderia sordidicola, Burkholderia zhejiangensis and Burkholderia grimmiae. The GenBank/EMBL/DDBJ accession numbers for the 16S rRNA and gyrB gene sequences determined in this study are LT158612-LT158624 and LT158625-LT158641, respectively. PMID:27375597
Bovine Papillomavirus in Brazil: Detection of Coinfection of Unusual Types by a PCR-RFLP Method

PubMed Central

Carvalho, R. F.; Sakata, S. T.; Giovanni, D. N. S.; Mori, E.; Brandão, P. E.; Richtzenhain, L. J.; Pozzi, C. R.; Arcaro, J. R. P.; Miranda, M. S.; Mazzuchelli-de-Souza, J.; Melo, T. C.; Comenale, G.; Assaf, S. L. M. R.; Beçak, W.; Stocco, R. C.

2013-01-01

Bovine papillomavirus (BPV) is recognized as a causal agent of benign and malignant tumors in cattle. Thirteen types of BPV are currently characterized and classified into three distinct genera, associated with different pathological outcomes. The described BPV types as well as other putative ones have been demonstrated by molecular biology methods, mainly by the employment of degenerated PCR primers. Specifically, divergences in the nucleotide sequence of the L1 gene are useful for the identification and classification of new papillomavirus types. On the present work, a method based on the PCR-RFLP technique and DNA sequencing was evaluated as a screening tool, allowing for the detection of two relatively rare types of BPV in lesions samples from a six-year-old Holstein dairy cow, chronically affected with cutaneous papillomatosis. These findings point to the dissemination of BPVs with unclear pathogenic potential, since two relatively rare, new described BPV types, which were first characterized in Japan, were also detected in Brazil. PMID:23865043
Molecular Identification of Ectomycorrhizal Mycelium in Soil Horizons

PubMed Central

Landeweert, Renske; Leeflang, Paula; Kuyper, Thom W.; Hoffland, Ellis; Rosling, Anna; Wernars, Karel; Smit, Eric

2003-01-01

Molecular identification techniques based on total DNA extraction provide a unique tool for identification of mycelium in soil. Using molecular identification techniques, the ectomycorrhizal (EM) fungal community under coniferous vegetation was analyzed. Soil samples were taken at different depths from four horizons of a podzol profile. A basidiomycete-specific primer pair (ITS1F-ITS4B) was used to amplify fungal internal transcribed spacer (ITS) sequences from total DNA extracts of the soil horizons. Amplified basidiomycete DNA was cloned and sequenced, and a selection of the obtained clones was analyzed phylogenetically. Based on sequence similarity, the fungal clone sequences were sorted into 25 different fungal groups, or operational taxonomic units (OTUs). Out of 25 basidiomycete OTUs, 7 OTUs showed high nucleotide homology (≥99%) with known EM fungal sequences and 16 were found exclusively in the mineral soil. The taxonomic positions of six OTUs remained unclear. OTU sequences were compared to sequences from morphotyped EM root tips collected from the same sites. Of the 25 OTUs, 10 OTUs had ≥98% sequence similarity with these EM root tip sequences. The present study demonstrates the use of molecular techniques to identify EM hyphae in various soil types. This approach differs from the conventional method of EM root tip identification and provides a novel approach to examine EM fungal communities in soil. PMID:12514012
Rapid identification of fungal pathogens in BacT/ALERT, BACTEC, and BBL MGIT media using polymerase chain reaction and DNA sequencing of the internal transcribed spacer regions.

PubMed

Pryce, Todd M; Palladino, Silvano; Price, Diane M; Gardam, Dianne J; Campbell, Peter B; Christiansen, Keryn J; Murray, Ronan J

2006-04-01

We report a direct polymerase chain reaction/sequence (d-PCRS)-based method for the rapid identification of clinically significant fungi from 5 different types of commercial broth enrichment media inoculated with clinical specimens. Media including BacT/ALERT FA (BioMérieux, Marcy l'Etoile, France) (n = 87), BACTEC Plus Aerobic/F (Becton Dickinson, Microbiology Systems, Sparks, MD) (n = 16), BACTEC Peds Plus/F (Becton Dickinson) (n = 15), BACTEC Lytic/10 Anaerobic/F (Becton Dickinson) (n = 11) bottles, and BBL MGIT (Becton Dickinson) (n = 11) were inoculated with specimens from 138 patients. A universal DNA extraction method was used combining a novel pretreatment step to remove PCR inhibitors with a column-based DNA extraction kit. Target sequences in the noncoding internal transcribed spacer regions of the rRNA gene were amplified by PCR and sequenced using a rapid (24 h) automated capillary electrophoresis system. Using sequence alignment software, fungi were identified by sequence similarity with sequences derived from isolates identified by upper-level reference laboratories or isolates defined as ex-type strains. We identified Candida albicans (n = 14), Candida parapsilosis (n = 8), Candida glabrata (n = 7), Candida krusei (n = 2), Scedosporium prolificans (n = 4), and 1 each of Candida orthopsilosis, Candida dubliniensis, Candida kefyr, Candida tropicalis, Candida guilliermondii, Saccharomyces cerevisiae, Cryptococcus neoformans, Aspergillus fumigatus, Histoplasma capsulatum, and Malassezia pachydermatis by d-PCRS analysis. All d-PCRS identifications from positive broths were in agreement with the final species identification of the isolates grown from subculture. Earlier identification of fungi using d-PCRS may facilitate prompt and more appropriate antifungal therapy.
Identification and characterization of unrecognized viruses in stool samples of non-polio acute flaccid paralysis children by simplified VIDISCA.

PubMed

Shaukat, Shahzad; Angez, Mehar; Alam, Muhammad Masroor; Jebbink, Maarten F; Deijs, Martin; Canuti, Marta; Sharif, Salmaan; de Vries, Michel; Khurshid, Adnan; Mahmood, Tariq; van der Hoek, Lia; Zaidi, Syed Sohail Zahoor

2014-08-12

The use of sequence independent methods combined with next generation sequencing for identification purposes in clinical samples appears promising and exciting results have been achieved to understand unexplained infections. One sequence independent method, Virus Discovery based on cDNA Amplified Fragment Length Polymorphism (VIDISCA) is capable of identifying viruses that would have remained unidentified in standard diagnostics or cell cultures. VIDISCA is normally combined with next generation sequencing, however, we set up a simplified VIDISCA which can be used in case next generation sequencing is not possible. Stool samples of 10 patients with unexplained acute flaccid paralysis showing cytopathic effect in rhabdomyosarcoma cells and/or mouse cells were used to test the efficiency of this method. To further characterize the viruses, VIDISCA-positive samples were amplified and sequenced with gene specific primers. Simplified VIDISCA detected seven viruses (70%) and the proportion of eukaryotic viral sequences from each sample ranged from 8.3 to 45.8%. Human enterovirus EV-B97, EV-B100, echovirus-9 and echovirus-21, human parechovirus type-3, human astrovirus probably a type-3/5 recombinant, and tetnovirus-1 were identified. Phylogenetic analysis based on the VP1 region demonstrated that the human enteroviruses are more divergent isolates circulating in the community. Our data support that a simplified VIDISCA protocol can efficiently identify unrecognized viruses grown in cell culture with low cost, limited time without need of advanced technical expertise. Also complex data interpretation is avoided thus the method can be used as a powerful diagnostic tool in limited resources. Redesigning the routine diagnostics might lead to additional detection of previously undiagnosed viruses in clinical samples of patients.
Changing practice: red blood cell typing by molecular methods for patients with sickle cell disease.

PubMed

Casas, Jessica; Friedman, David F; Jackson, Tannoa; Vege, Sunitha; Westhoff, Connie M; Chou, Stella T

2015-06-01

Extended red blood cell (RBC) antigen matching is recommended to limit alloimmunization in patients with sickle cell disease (SCD). DNA-based testing to predict blood group phenotypes has enhanced availability of antigen-negative donor units and improved typing of transfused patients, but replacement of routine serologic typing for non-ABO antigens with molecular typing for patients has not been reported. This study compared the historical RBC antigen phenotypes obtained by hemagglutination methods with genotype predictions in 494 patients with SCD. For discrepant results, repeat serologic testing was performed and/or investigated by gene sequencing for silent or variant alleles. Seventy-one typing discrepancies were identified among 6360 antigen comparisons (1.1%). New specimens for repeat serologic testing were obtained for 66 discrepancies and retyping agreed with the genotype in 64 cases. One repeat Jk(b-) serologic phenotype, predicted Jk(b+) by genotype, was found by direct sequencing of JK to be a silenced allele, and one N typing discrepancy remains under investigation. Fifteen false-negative serologic results were associated with alleles encoding weak antigens or single-dose Fy(b) expression. DNA-based RBC typing provided improved accuracy and expanded information on RBC antigens compared to hemagglutination methods, leading to its implementation as the primary method for extended RBC typing for patients with SCD at our institution. © 2015 AABB.
Method for isolating chromosomal DNA in preparation for hybridization in suspension

DOEpatents

Lucas, Joe N.

2000-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. Chromosomal DNA in a sample containing cell debris is prepared for hybridization in suspension by treating the mixture with RNase. The treated DNA can also be fixed prior to hybridization.
Atropos: specific, sensitive, and speedy trimming of sequencing reads.

PubMed

Didion, John P; Martin, Marcel; Collins, Francis S

2017-01-01

A key step in the transformation of raw sequencing reads into biological insights is the trimming of adapter sequences and low-quality bases. Read trimming has been shown to increase the quality and reliability while decreasing the computational requirements of downstream analyses. Many read trimming software tools are available; however, no tool simultaneously provides the accuracy, computational efficiency, and feature set required to handle the types and volumes of data generated in modern sequencing-based experiments. Here we introduce Atropos and show that it trims reads with high sensitivity and specificity while maintaining leading-edge speed. Compared to other state-of-the-art read trimming tools, Atropos achieves significant increases in trimming accuracy while remaining competitive in execution times. Furthermore, Atropos maintains high accuracy even when trimming data with elevated rates of sequencing errors. The accuracy, high performance, and broad feature set offered by Atropos makes it an appropriate choice for the pre-processing of Illumina, ABI SOLiD, and other current-generation short-read sequencing datasets. Atropos is open source and free software written in Python (3.3+) and available at https://github.com/jdidion/atropos.
Atropos: specific, sensitive, and speedy trimming of sequencing reads

PubMed Central

Collins, Francis S.

2017-01-01

A key step in the transformation of raw sequencing reads into biological insights is the trimming of adapter sequences and low-quality bases. Read trimming has been shown to increase the quality and reliability while decreasing the computational requirements of downstream analyses. Many read trimming software tools are available; however, no tool simultaneously provides the accuracy, computational efficiency, and feature set required to handle the types and volumes of data generated in modern sequencing-based experiments. Here we introduce Atropos and show that it trims reads with high sensitivity and specificity while maintaining leading-edge speed. Compared to other state-of-the-art read trimming tools, Atropos achieves significant increases in trimming accuracy while remaining competitive in execution times. Furthermore, Atropos maintains high accuracy even when trimming data with elevated rates of sequencing errors. The accuracy, high performance, and broad feature set offered by Atropos makes it an appropriate choice for the pre-processing of Illumina, ABI SOLiD, and other current-generation short-read sequencing datasets. Atropos is open source and free software written in Python (3.3+) and available at https://github.com/jdidion/atropos. PMID:28875074
BIOPEP database and other programs for processing bioactive peptide sequences.

PubMed

Minkiewicz, Piotr; Dziuba, Jerzy; Iwaniak, Anna; Dziuba, Marta; Darewicz, Małgorzata

2008-01-01

This review presents the potential for application of computational tools in peptide science based on a sample BIOPEP database and program as well as other programs and databases available via the World Wide Web. The BIOPEP application contains a database of biologically active peptide sequences and a program enabling construction of profiles of the potential biological activity of protein fragments, calculation of quantitative descriptors as measures of the value of proteins as potential precursors of bioactive peptides, and prediction of bonds susceptible to hydrolysis by endopeptidases in a protein chain. Other bioactive and allergenic peptide sequence databases are also presented. Programs enabling the construction of binary and multiple alignments between peptide sequences, the construction of sequence motifs attributed to a given type of bioactivity, searching for potential precursors of bioactive peptides, and the prediction of sites susceptible to proteolytic cleavage in protein chains are available via the Internet as are other approaches concerning secondary structure prediction and calculation of physicochemical features based on amino acid sequence. Programs for prediction of allergenic and toxic properties have also been developed. This review explores the possibilities of cooperation between various programs.
Highly conserved intragenic HSV-2 sequences: Results from next-generation sequencing of HSV-2 UL and US regions from genital swabs collected from 3 continents.

PubMed

Johnston, Christine; Magaret, Amalia; Roychoudhury, Pavitra; Greninger, Alexander L; Cheng, Anqi; Diem, Kurt; Fitzgibbon, Matthew P; Huang, Meei-Li; Selke, Stacy; Lingappa, Jairam R; Celum, Connie; Jerome, Keith R; Wald, Anna; Koelle, David M

2017-10-01

Understanding the variability in circulating herpes simplex virus type 2 (HSV-2) genomic sequences is critical to the development of HSV-2 vaccines. Genital lesion swabs containing ≥ 10 7 log 10 copies HSV DNA collected from Africa, the USA, and South America underwent next-generation sequencing, followed by K-mer based filtering and de novo genomic assembly. Sites of heterogeneity within coding regions in unique long and unique short (U L _U S ) regions were identified. Phylogenetic trees were created using maximum likelihood reconstruction. Among 46 samples from 38 persons, 1468 intragenic base-pair substitutions were identified. The maximum nucleotide distance between strains for concatenated U L_ U S segments was 0.4%. Phylogeny did not reveal geographic clustering. The most variable proteins had non-synonymous mutations in < 3% of amino acids. Unenriched HSV-2 DNA can undergo next-generation sequencing to identify intragenic variability. The use of clinical swabs for sequencing expands the information that can be gathered directly from these specimens. Copyright © 2017 Elsevier Inc. All rights reserved.
Genome-wide comparative analysis reveals human-mouse regulatory landscape and evolution.

PubMed

Denas, Olgert; Sandstrom, Richard; Cheng, Yong; Beal, Kathryn; Herrero, Javier; Hardison, Ross C; Taylor, James

2015-02-14

Because species-specific gene expression is driven by species-specific regulation, understanding the relationship between sequence and function of the regulatory regions in different species will help elucidate how differences among species arise. Despite active experimental and computational research, relationships among sequence, conservation, and function are still poorly understood. We compared transcription factor occupied segments (TFos) for 116 human and 35 mouse TFs in 546 human and 125 mouse cell types and tissues from the Human and the Mouse ENCODE projects. We based the map between human and mouse TFos on a one-to-one nucleotide cross-species mapper, bnMapper, that utilizes whole genome alignments (WGA). Our analysis shows that TFos are under evolutionary constraint, but a substantial portion (25.1% of mouse and 25.85% of human on average) of the TFos does not have a homologous sequence on the other species; this portion varies among cell types and TFs. Furthermore, 47.67% and 57.01% of the homologous TFos sequence shows binding activity on the other species for human and mouse respectively. However, 79.87% and 69.22% is repurposed such that it binds the same TF in different cells or different TFs in the same cells. Remarkably, within the set of repurposed TFos, the corresponding genome regions in the other species are preferred locations of novel TFos. These events suggest exaptation of some functional regulatory sequences into new function. Despite TFos repurposing, we did not find substantial changes in their predicted target genes, suggesting that CRMs buffer evolutionary events allowing little or no change in the TFos - target gene associations. Thus, the small portion of TFos with strictly conserved occupancy underestimates the degree of conservation of regulatory interactions. We mapped regulatory sequences from an extensive number of TFs and cell types between human and mouse using WGA. A comparative analysis of this correspondence unveiled the extent of the shared regulatory sequence across TFs and cell types under study. Importantly, a large part of the shared regulatory sequence is repurposed on the other species. This sequence, fueled by turnover events, provides a strong case for exaptation in regulatory elements.
Not all transmembrane helices are born equal: Towards the extension of the sequence homology concept to membrane proteins

PubMed Central

2011-01-01

Background Sequence homology considerations widely used to transfer functional annotation to uncharacterized protein sequences require special precautions in the case of non-globular sequence segments including membrane-spanning stretches composed of non-polar residues. Simple, quantitative criteria are desirable for identifying transmembrane helices (TMs) that must be included into or should be excluded from start sequence segments in similarity searches aimed at finding distant homologues. Results We found that there are two types of TMs in membrane-associated proteins. On the one hand, there are so-called simple TMs with elevated hydrophobicity, low sequence complexity and extraordinary enrichment in long aliphatic residues. They merely serve as membrane-anchoring device. In contrast, so-called complex TMs have lower hydrophobicity, higher sequence complexity and some functional residues. These TMs have additional roles besides membrane anchoring such as intra-membrane complex formation, ligand binding or a catalytic role. Simple and complex TMs can occur both in single- and multi-membrane-spanning proteins essentially in any type of topology. Whereas simple TMs have the potential to confuse searches for sequence homologues and to generate unrelated hits with seemingly convincing statistical significance, complex TMs contain essential evolutionary information. Conclusion For extending the homology concept onto membrane proteins, we provide a necessary quantitative criterion to distinguish simple TMs (and a sufficient criterion for complex TMs) in query sequences prior to their usage in homology searches based on assessment of hydrophobicity and sequence complexity of the TM sequence segments. Reviewers This article was reviewed by Shamil Sunyaev, L. Aravind and Arcady Mushegian. PMID:22024092
Direct Repeat Unit (dru) Typing of Methicillin-Resistant Staphylococcus pseudintermedius from Dogs and Cats.

PubMed

Kadlec, Kristina; Schwarz, Stefan; Goering, Richard V; Weese, J Scott

2015-12-01

Methicillin-resistant Staphylococcus pseudintermedius (MRSP) has emerged in a remarkable manner as an important problem in dogs and cats. However, limited molecular epidemiological information is available. The aims of this study were to apply direct repeat unit (dru) typing in a large collection of well-characterized MRSP isolates and to use dru typing to analyze a collection of previously uncharacterized MRSP isolates. Two collections of MRSP isolates from dogs and cats were included in this study. The first collection comprised 115 well-characterized MRSP isolates from North America and Europe. The data for these isolates included multilocus sequence typing (MLST) and staphylococcal protein A gene (spa) typing results as well as SmaI macrorestriction patterns after pulsed-field gel electrophoresis (PFGE). The second collection was a convenience sample of 360 isolates from North America. The dru region was amplified by PCR, sequenced, and analyzed. For the first collection, the discriminatory indices of the typing methods were calculated. All isolates were successfully dru typed. The discriminatory power for dru typing (D = 0.423) was comparable to that of spa typing (D = 0.445) and of MLST (D = 0.417) in the first collection. Occasionally, dru typing was able to further discriminate between isolates that shared the same spa type. Among all 475 isolates, 26 different dru types were identified, with 2 predominant types (dt9a and dt11a) among 349 (73.4%) isolates. The results of this study underline that dru typing is a useful tool for MRSP typing, being an objective, standardized, sequence-based method that is relatively cost-efficient and easy to perform. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
The Evolution of Strain Typing in the Mycobacterium tuberculosis Complex.

PubMed

Merker, Matthias; Kohl, Thomas A; Niemann, Stefan; Supply, Philip

2017-01-01

Tuberculosis (TB) is a contagious disease with a complex epidemiology. Therefore, molecular typing (genotyping) of Mycobacterium tuberculosis complex (MTBC) strains is of primary importance to effectively guide outbreak investigations, define transmission dynamics and assist global epidemiological surveillance of the disease. Large-scale genotyping is also needed to get better insights into the biological diversity and the evolution of the pathogen. Thanks to its shorter turnaround and simple numerical nomenclature system, mycobacterial interspersed repetitive unit-variable-number tandem repeat (MIRU-VNTR) typing, based on 24 standardized plus 4 hypervariable loci, optionally combined with spoligotyping, has replaced IS6110 DNA fingerprinting over the last decade as a gold standard among classical strain typing methods for many applications. With the continuous progress and decreasing costs of next-generation sequencing (NGS) technologies, typing based on whole genome sequencing (WGS) is now increasingly performed for near complete exploitation of the available genetic information. However, some important challenges remain such as the lack of standardization of WGS analysis pipelines, the need of databases for sharing WGS data at a global level, and a better understanding of the relevant genomic distances for defining clusters of recent TB transmission in different epidemiological contexts. This chapter provides an overview of the evolution of genotyping methods over the last three decades, which culminated with the development of WGS-based methods. It addresses the relative advantages and limitations of these techniques, indicates current challenges and potential directions for facilitating standardization of WGS-based typing, and provides suggestions on what method to use depending on the specific research question.
Antiretroviral treatment sequencing strategies to overcome HIV type 1 drug resistance in adolescents and adults in low-middle-income countries.

PubMed

De Luca, Andrea; Hamers, Raphael L; Schapiro, Jonathan M

2013-06-15

Antiretroviral treatment (ART) is expanding to human immunodeficiency virus type 1 (HIV-1)-infected persons in low-middle income countries, thanks to a public health approach. With 3 available drug classes, 2 ART sequencing lines are programmatically foreseen. The emergence and transmission of viral drug resistance represents a challenge to the efficacy of ART. Knowledge of HIV-1 drug resistance selection associated with specific drugs and regimens and the consequent activity of residual drug options are essential in programming ART sequencing options aimed at preserving ART efficacy for as long as possible. This article determines optimal ART sequencing options for overcoming HIV-1 drug resistance in resource-limited settings, using currently available drugs and treatment monitoring opportunities. From the perspective of drug resistance and on the basis of limited virologic monitoring data, optimal sequencing seems to involve use of a tenofovir-containing nonnucleoside reverse-transcriptase inhibitor-based first-line regimen, followed by a zidovudine-containing, protease inhibitor (PI)-based second-line regimen. Other options and their consequences are explored by considering within-class and between-class sequencing opportunities, including boosted PI monotherapies and future options with integrase inhibitors. Nucleoside reverse-transcriptase inhibitor resistance pathways in HIV-1 subtype C suggest an additional reason for accelerating stavudine phase out. Viral load monitoring avoids the accumulation of resistance mutations that significantly reduce the activity of next-line options. Rational use of resources, including broader access to viral load monitoring, will help ensure 3 lines of fully active treatment options, thereby increasing the duration of ART success.
Leaf margin phenotype-specific restriction-site-associated DNA-derived markers for pineapple (Ananas comosus L.).

PubMed

Urasaki, Naoya; Goeku, Satoko; Kaneshima, Risa; Takamine, Tomonori; Tarora, Kazuhiko; Takeuchi, Makoto; Moromizato, Chie; Yonamine, Kaname; Hosaka, Fumiko; Terakami, Shingo; Matsumura, Hideo; Yamamoto, Toshiya; Shoda, Moriyuki

2015-06-01

To explore genome-wide DNA polymorphisms and identify DNA markers for leaf margin phenotypes, a restriction-site-associated DNA sequencing analysis was employed to analyze three bulked DNAs of F1 progeny from a cross between a 'piping-leaf-type' cultivar, 'Yugafu', and a 'spiny-tip-leaf-type' variety, 'Yonekura'. The parents were both Ananas comosus var. comosus. From the analysis, piping-leaf and spiny-tip-leaf gene-specific restriction-site-associated DNA sequencing tags were obtained and designated as PLSTs and STLSTs, respectively. The five PLSTs and two STSLTs were successfully converted to cleaved amplified polymorphic sequence (CAPS) or simple sequence repeat (SSR) markers using the sequence differences between alleles. Based on the genotyping of the F1 with two SSR and three CAPS markers, the five PLST markers were mapped in the vicinity of the P locus, with the closest marker, PLST1_SSR, being located 1.5 cM from the P locus. The two CAPS markers from STLST1 and STLST3 perfectly assessed the 'spiny-leaf type' as homozygotes of the recessive s allele of the S gene. The recombination value between the S locus and STLST loci was 2.4, and STLSTs were located 2.2 cM from the S locus. SSR and CAPS markers are applicable to marker-assisted selection of leaf margin phenotypes in pineapple breeding.
Human papillomavirus detection and typing using a nested-PCR-RFLP assay.

PubMed

Coser, Janaina; Boeira, Thaís da Rocha; Fonseca, André Salvador Kazantzi; Ikuta, Nilo; Lunge, Vagner Ricardo

2011-01-01

It is clinically important to detect and type human papillomavirus (HPV) in a sensitive and specific manner. Development of a nested-polymerase chain reaction-restriction fragment length polymorphism (nested-PCR-RFLP) assay to detect and type HPV based on the analysis of L1 gene. Analysis of published DNA sequence of mucosal HPV types to select sequences of new primers. Design of an original nested-PCR assay using the new primers pair selected and classical MY09/11 primers. HPV detection and typing in cervical samples using the nested-PCR-RFLP assay. The nested-PCR-RFLP assay detected and typed HPV in cervical samples. Of the total of 128 clinical samples submitted to simple PCR and nested-PCR for detection of HPV, 37 (28.9%) were positive for the virus by both methods and 25 samples were positive only by nested-PCR (67.5% increase in detection rate compared with single PCR). All HPV positive samples were effectively typed by RFLP assay. The method of nested-PCR proved to be an effective diagnostic tool for HPV detection and typing.
Analysis of sequencing data for probing RNA secondary structures and protein-RNA binding in studying posttranscriptional regulations.

PubMed

Hu, Xihao; Wu, Yang; Lu, Zhi John; Yip, Kevin Y

2016-11-01

High-throughput sequencing has been used to study posttranscriptional regulations, where the identification of protein-RNA binding is a major and fast-developing sub-area, which is in turn benefited by the sequencing methods for whole-transcriptome probing of RNA secondary structures. In the study of RNA secondary structures using high-throughput sequencing, bases are modified or cleaved according to their structural features, which alter the resulting composition of sequencing reads. In the study of protein-RNA binding, methods have been proposed to immuno-precipitate (IP) protein-bound RNA transcripts in vitro or in vivo By sequencing these transcripts, the protein-RNA interactions and the binding locations can be identified. For both types of data, read counts are affected by a combination of confounding factors, including expression levels of transcripts, sequence biases, mapping errors and the probing or IP efficiency of the experimental protocols. Careful processing of the sequencing data and proper extraction of important features are fundamentally important to a successful analysis. Here we review and compare different experimental methods for probing RNA secondary structures and binding sites of RNA-binding proteins (RBPs), and the computational methods proposed for analyzing the corresponding sequencing data. We suggest how these two types of data should be integrated to study the structural properties of RBP binding sites as a systematic way to better understand posttranscriptional regulations. © The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
Microbe-ID: an open source toolbox for microbial genotyping and species identification.

PubMed

Tabima, Javier F; Everhart, Sydney E; Larsen, Meredith M; Weisberg, Alexandra J; Kamvar, Zhian N; Tancos, Matthew A; Smart, Christine D; Chang, Jeff H; Grünwald, Niklaus J

2016-01-01

Development of tools to identify species, genotypes, or novel strains of invasive organisms is critical for monitoring emergence and implementing rapid response measures. Molecular markers, although critical to identifying species or genotypes, require bioinformatic tools for analysis. However, user-friendly analytical tools for fast identification are not readily available. To address this need, we created a web-based set of applications called Microbe-ID that allow for customizing a toolbox for rapid species identification and strain genotyping using any genetic markers of choice. Two components of Microbe-ID, named Sequence-ID and Genotype-ID, implement species and genotype identification, respectively. Sequence-ID allows identification of species by using BLAST to query sequences for any locus of interest against a custom reference sequence database. Genotype-ID allows placement of an unknown multilocus marker in either a minimum spanning network or dendrogram with bootstrap support from a user-created reference database. Microbe-ID can be used for identification of any organism based on nucleotide sequences or any molecular marker type and several examples are provided. We created a public website for demonstration purposes called Microbe-ID (microbe-id.org) and provided a working implementation for the genus Phytophthora (phytophthora-id.org). In Phytophthora-ID, the Sequence-ID application allows identification based on ITS or cox spacer sequences. Genotype-ID groups individuals into clonal lineages based on simple sequence repeat (SSR) markers for the two invasive plant pathogen species P. infestans and P. ramorum. All code is open source and available on github and CRAN. Instructions for installation and use are provided at https://github.com/grunwaldlab/Microbe-ID.

Genome-based classification of micromonosporae with a focus on their biotechnological and ecological potential.

PubMed

Carro, Lorena; Nouioui, Imen; Sangal, Vartul; Meier-Kolthoff, Jan P; Trujillo, Martha E; Montero-Calasanz, Maria Del Carmen; Sahin, Nevzat; Smith, Darren Lee; Kim, Kristi E; Peluso, Paul; Deshpande, Shweta; Woyke, Tanja; Shapiro, Nicole; Kyrpides, Nikos C; Klenk, Hans-Peter; Göker, Markus; Goodfellow, Michael

2018-01-11

There is a need to clarify relationships within the actinobacterial genus Micromonospora, the type genus of the family Micromonosporaceae, given its biotechnological and ecological importance. Here, draft genomes of 40 Micromonospora type strains and two non-type strains are made available through the Genomic Encyclopedia of Bacteria and Archaea project and used to generate a phylogenomic tree which showed they could be assigned to well supported phyletic lines that were not evident in corresponding trees based on single and concatenated sequences of conserved genes. DNA G+C ratios derived from genome sequences showed that corresponding data from species descriptions were imprecise. Emended descriptions include precise base composition data and approximate genome sizes of the type strains. antiSMASH analyses of the draft genomes show that micromonosporae have a previously unrealised potential to synthesize novel specialized metabolites. Close to one thousand biosynthetic gene clusters were detected, including NRPS, PKS, terpenes and siderophores clusters that were discontinuously distributed thereby opening up the prospect of prioritising gifted strains for natural product discovery. The distribution of key stress related genes provide an insight into how micromonosporae adapt to key environmental variables. Genes associated with plant interactions highlight the potential use of micromonosporae in agriculture and biotechnology.
Implementation of Objective PASC-Derived Taxon Demarcation Criteria for Official Classification of Filoviruses.

PubMed

Bào, Yīmíng; Amarasinghe, Gaya K; Basler, Christopher F; Bavari, Sina; Bukreyev, Alexander; Chandran, Kartik; Dolnik, Olga; Dye, John M; Ebihara, Hideki; Formenty, Pierre; Hewson, Roger; Kobinger, Gary P; Leroy, Eric M; Mühlberger, Elke; Netesov, Sergey V; Patterson, Jean L; Paweska, Janusz T; Smither, Sophie J; Takada, Ayato; Towner, Jonathan S; Volchkov, Viktor E; Wahl-Jensen, Victoria; Kuhn, Jens H

2017-05-11

The mononegaviral family Filoviridae has eight members assigned to three genera and seven species. Until now, genus and species demarcation were based on arbitrarily chosen filovirus genome sequence divergence values (≈50% for genera, ≈30% for species) and arbitrarily chosen phenotypic virus or virion characteristics. Here we report filovirus genome sequence-based taxon demarcation criteria using the publicly accessible PAirwise Sequencing Comparison (PASC) tool of the US National Center for Biotechnology Information (Bethesda, MD, USA). Comparison of all available filovirus genomes in GenBank using PASC revealed optimal genus demarcation at the 55-58% sequence diversity threshold range for genera and at the 23-36% sequence diversity threshold range for species. Because these thresholds do not change the current official filovirus classification, these values are now implemented as filovirus taxon demarcation criteria that may solely be used for filovirus classification in case additional data are absent. A near-complete, coding-complete, or complete filovirus genome sequence will now be required to allow official classification of any novel "filovirus." Classification of filoviruses into existing taxa or determining the need for novel taxa is now straightforward and could even become automated using a presented algorithm/flowchart rooted in RefSeq (type) sequences.
MULTILOCUS SEQUENCE TYPING OF BRUCELLA ISOLATES FROM THAILAND.

PubMed

Chawjiraphan, Wireeya; Sonthayanon, Piengchan; Chanket, Phanita; Benjathummarak, Surachet; Kerdsin, Anusak; Kalambhaheti, Thareerat

2016-11-01

Although brucellosis outbreaks in Thailand are rare, they cause abortions and infertility in animals, resulting in significant economic loss. Because Brucella spp display > 90% DNA homology, multilocus sequence typing (MLST) was employed to categorize local Brucella isolates into sequence types (STs) and to determine their genetic relatedness. Brucella samples were isolated from vaginal secretion of cows and goats, and from blood cultures of infected individuals. Brucella species were determined by multiplex PCR of eight loci, in addition to MLST based on partial DNA sequences of nine house-keeping genes. MLST analysis of 36 isolates revealed 78 distinct novel allele types and 34 novel STs, while two isolates possessed the known ST8. Sequence alignments identified polymorphic sites in each allele, ranging from 2-6%, while overall genetic diversity was 3.6%. MLST analysis of the 36 Brucella isolates classified them into three species, namely, B. melitensis, B. abortus and B. suis, in agreement with multiplex PCR results. Genetic relatedness among ST members of B. melitensis and B. abortus determined by eBURST program revealed ST2 as founder of B. abortus isolates and ST8 the founder of B. melitensis isolates. ST 36, 41 and 50 of Thai Brucella isolates were identified as single locus variants of clonal cluster (CC) 8, while the majority of STs were diverse. The genetic diversity and relatedness identified using MLST revealed hitherto unexpected diversity among Thai Brucella isolates. Genetic classification of isolates could reveal the route of brucellosis transmission among humans and farm animals and also reveal their relationship with other isolates in the region and other parts of the world.
In Silico identification of pathogenic strains of Cronobacter from Biochemical data reveals association of inositol fermentation with pathogenicity.

PubMed

Hamby, Stephen E; Joseph, Susan; Forsythe, Stephen J; Chuzhanova, Nadia

2011-09-20

Cronobacter, formerly known as Enterobacter sakazakii, is a food-borne pathogen known to cause neonatal meningitis, septicaemia and death. Current diagnostic tests for identification of Cronobacter do not differentiate between species, necessitating time consuming 16S rDNA gene sequencing or multilocus sequence typing (MLST). The organism is ubiquitous, being found in the environment and in a wide range of foods, although there is variation in pathogenicity between Cronobacter isolates and between species. Therefore to be able to differentiate between the pathogenic and non-pathogenic strains is of interest to the food industry and regulators. Here we report the use of Expectation Maximization clustering to categorise 98 strains of Cronobacter as pathogenic or non-pathogenic based on biochemical test results from standard diagnostic test kits. Pathogenicity of a strain was postulated on the basis of either pathogenic symptoms associated with strain source or corresponding MLST sequence types, allowing the clusters to be labelled as containing either pathogenic or non-pathogenic strains. The resulting clusters gave good differentiation of strains into pathogenic and non-pathogenic groups, corresponding well to isolate source and MLST sequence type. The results also revealed a potential association between pathogenicity and inositol fermentation. An investigation of the genomes of Cronobacter sakazakii and C. turicensis revealed the gene for inositol monophosphatase is associated with putative virulence factors in pathogenic strains of Cronobacter. We demonstrated a computational approach allowing existing diagnostic kits to be used to identify pathogenic strains of Cronobacter. The resulting clusters correlated well with MLST sequence types and revealed new information about the pathogenicity of Cronobacter species.
HPV Genotyping of Modified General Primer-Amplicons Is More Analytically Sensitive and Specific by Sequencing than by Hybridization

PubMed Central

Meisal, Roger; Rounge, Trine Ballestad; Christiansen, Irene Kraus; Eieland, Alexander Kirkeby; Worren, Merete Molton; Molden, Tor Faksvaag; Kommedal, Øyvind; Hovig, Eivind; Leegaard, Truls Michael

2017-01-01

Sensitive and specific genotyping of human papillomaviruses (HPVs) is important for population-based surveillance of carcinogenic HPV types and for monitoring vaccine effectiveness. Here we compare HPV genotyping by Next Generation Sequencing (NGS) to an established DNA hybridization method. In DNA isolated from urine, the overall analytical sensitivity of NGS was found to be 22% higher than that of hybridization. NGS was also found to be the most specific method and expanded the detection repertoire beyond the 37 types of the DNA hybridization assay. Furthermore, NGS provided an increased resolution by identifying genetic variants of individual HPV types. The same Modified General Primers (MGP)-amplicon was used in both methods. The NGS method is described in detail to facilitate implementation in the clinical microbiology laboratory and includes suggestions for new standards for detection and calling of types and variants with improved resolution. PMID:28045981
HPV Genotyping of Modified General Primer-Amplicons Is More Analytically Sensitive and Specific by Sequencing than by Hybridization.

PubMed

Meisal, Roger; Rounge, Trine Ballestad; Christiansen, Irene Kraus; Eieland, Alexander Kirkeby; Worren, Merete Molton; Molden, Tor Faksvaag; Kommedal, Øyvind; Hovig, Eivind; Leegaard, Truls Michael; Ambur, Ole Herman

2017-01-01

Sensitive and specific genotyping of human papillomaviruses (HPVs) is important for population-based surveillance of carcinogenic HPV types and for monitoring vaccine effectiveness. Here we compare HPV genotyping by Next Generation Sequencing (NGS) to an established DNA hybridization method. In DNA isolated from urine, the overall analytical sensitivity of NGS was found to be 22% higher than that of hybridization. NGS was also found to be the most specific method and expanded the detection repertoire beyond the 37 types of the DNA hybridization assay. Furthermore, NGS provided an increased resolution by identifying genetic variants of individual HPV types. The same Modified General Primers (MGP)-amplicon was used in both methods. The NGS method is described in detail to facilitate implementation in the clinical microbiology laboratory and includes suggestions for new standards for detection and calling of types and variants with improved resolution.
Toward the 1,000 dollars human genome.

PubMed

Bennett, Simon T; Barnes, Colin; Cox, Anthony; Davies, Lisa; Brown, Clive

2005-06-01

Revolutionary new technologies, capable of transforming the economics of sequencing, are providing an unparalleled opportunity to analyze human genetic variation comprehensively at the whole-genome level within a realistic timeframe and at affordable costs. Current estimates suggest that it would cost somewhere in the region of 30 million US dollars to sequence an entire human genome using Sanger-based sequencing, and on one machine it would take about 60 years. Solexa is widely regarded as a company with the necessary disruptive technology to be the first to achieve the ultimate goal of the so-called 1,000 dollars human genome - the conceptual cost-point needed for routine analysis of individual genomes. Solexa's technology is based on completely novel sequencing chemistry capable of sequencing billions of individual DNA molecules simultaneously, a base at a time, to enable highly accurate, low cost analysis of an entire human genome in a single experiment. When applied over a large enough genomic region, these new approaches to resequencing will enable the simultaneous detection and typing of known, as well as unknown, polymorphisms, and will also offer information about patterns of linkage disequilibrium in the population being studied. Technological progress, leading to the advent of single-molecule-based approaches, is beginning to dramatically drive down costs and increase throughput to unprecedented levels, each being several orders of magnitude better than that which is currently available. A new sequencing paradigm based on single molecules will be faster, cheaper and more sensitive, and will permit routine analysis at the whole-genome level.
DNA Sequence-Dependent Ionic Currents in Ultra-Small Solid-State Nanopores†

PubMed Central

Comer, Jeffrey

2016-01-01

Measurements of ionic currents through nanopores partially blocked by DNA have emerged as a powerful method for characterization of the DNA nucleotide sequence. Although the effect of the nucleotide sequence on the nanopore blockade current has been experimentally demonstrated, prediction and interpretation of such measurements remain a formidable challenge. Using atomic resolution computational approaches, here we show how the sequence, molecular conformation, and pore geometry affect the blockade ionic current in model solid-state nanopores. We demonstrate that the blockade current from a DNA molecule is determined by the chemical identities and conformations of at least three consecutive nucleotides. We find the blockade currents produced by the nucleotide triplets to vary considerably with their nucleotide sequence despite having nearly identical molecular conformations. Encouragingly, we find blockade current differences as large as 25% for single-base substitutions in ultra small (1.6 nm × 1.1 nm cross section; 2 nm length) solid-state nanopores. Despite the complex dependence of the blockade current on the sequence and conformation of the DNA triplets, we find that, under many conditions, the number of thymine bases is positively correlated with the current, whereas the number of purine bases and the presence of both purine and pyrimidines in the triplet are negatively correlated with the current. Based on these observations, we construct a simple theoretical model that relates the ion current to the base content of a solid-state nanopore. Furthermore, we show that compact conformations of DNA in narrow pores provide the greatest signal-to-noise ratio for single base detection, whereas reduction of the nanopore length increases the ionic current noise. Thus, the sequence dependence of nanopore blockade current can be theoretically rationalized, although the predictions will likely need to be customized for each nanopore type. PMID:27103233
Resistance-Associated NS5A Variants of Hepatitis C Virus Are Susceptible to Interferon-Based Therapy.

PubMed

Itakura, Jun; Kurosaki, Masayuki; Higuchi, Mayu; Takada, Hitomi; Nakakuki, Natsuko; Itakura, Yoshie; Tamaki, Nobuharu; Yasui, Yutaka; Suzuki, Shoko; Tsuchiya, Kaoru; Nakanishi, Hiroyuki; Takahashi, Yuka; Maekawa, Shinya; Enomoto, Nobuyuki; Izumi, Namiki

2015-01-01

The presence of resistance-associated variants (RAVs) of hepatitis C virus (HCV) attenuates the efficacy of direct acting antivirals (DAAs). The objective of this study was to characterize the susceptibility of RAVs to interferon-based therapy. Direct and deep sequencing were performed to detect Y93H RAV in the NS5A region. Twenty nine genotype 1b patients with detectable RAV at baseline were treated by a combination of simeprevir, pegylated interferon and ribavirin. The longitudinal changes in the proportion of Y93H RAV during therapy and at breakthrough or relapse were determined. By direct sequencing, Y93H RAV became undetectable or decreased in proportion at an early time point during therapy (within 7 days) in 57% of patients with both the Y93H variant and wild type virus at baseline when HCV RNA was still detectable. By deep sequencing, the proportion of Y93H RAV against Y93 wild type was 52.7% (5.8%- 97.4%) at baseline which significantly decreased to 29.7% (0.16%- 98.3%) within 7 days of initiation of treatment (p = 0.023). The proportion of Y93H RAV was reduced in 21 of 29 cases (72.4%) and a marked reduction of more than 10% was observed in 14 cases (48.7%). HCV RNA reduction was significantly greater for Y93H RAV (-3.65±1.3 logIU/mL/day) than the Y93 wild type (-3.35±1.0 logIU/mL/day) (p<0.001). Y93H RAV is more susceptible to interferon-based therapy than the Y93 wild type.
Real-time reverse transcription polymerase chain reaction method for detection of Canine distemper virus modified live vaccine shedding for differentiation from infection with wild-type strains.

PubMed

Wilkes, Rebecca P; Sanchez, Elena; Riley, Matthew C; Kennedy, Melissa A

2014-01-01

Canine distemper virus (CDV) remains a common cause of infectious disease in dogs, particularly in high-density housing situations such as shelters. Vaccination of all dogs against CDV is recommended at the time of admission to animal shelters and many use a modified live virus (MLV) vaccine. From a diagnostic standpoint for dogs with suspected CDV infection, this is problematic because highly sensitive diagnostic real-time reverse transcription polymerase chain reaction (RT-PCR) tests are able to detect MLV virus in clinical samples. Real-time PCR can be used to quantitate amount of virus shedding and can differentiate vaccine strains from wild-type strains when shedding is high. However, differentiation by quantitation is not possible in vaccinated animals during acute infection, when shedding is low and could be mistaken for low level vaccine virus shedding. While there are gel-based RT-PCR assays for differentiation of vaccine strains from field strains based on sequence differences, the sensitivity of these assays is unable to match that of the real-time RT-PCR assay currently used in the authors' laboratory. Therefore, a real-time RT-PCR assay was developed that detects CDV MLV vaccine strains and distinguishes them from wild-type strains based on nucleotide sequence differences, rather than the amount of viral RNA in the sample. The test is highly sensitive, with detection of as few as 5 virus genomic copies (corresponding to 10(-1) TCID(50)). Sequencing of the DNA real-time products also allows phylogenetic differentiation of the wild-type strains. This test will aid diagnosis during outbreaks of CDV in recently vaccinated animals.
Pyrosequencing analysis of the gyrB gene to differentiate bacteria responsible for diarrheal diseases.

PubMed

Hou, X-L; Cao, Q-Y; Jia, H-Y; Chen, Z

2008-07-01

Pathogens causing acute diarrhea include a large variety of species from Enterobacteriaceae and Vibrionaceae. A method based on pyrosequencing was used here to differentiate bacteria commonly associated with diarrhea in China; the method is targeted to a partial amplicon of the gyrB gene, which encodes the B subunit of DNA gyrase. Twenty-eight specific polymorphic positions were identified from sequence alignment of a large sequence dataset and targeted using 17 sequencing primers. Of 95 isolates tested, belonging to 13 species within 7 genera, most could be identified to the species level; O157 type could be differentiated from other E. coli types; Salmonella enterica subsp. enterica could be identified at the serotype level; the genus Shigella, except for S. boydii and S. dysenteriae, could also be identified. All these isolates were also subjected to conventional sequencing of a relatively long ( approximately1.2 kb) region of gyrB DNA; these results confirmed those with pyrosequencing. Twenty-two fecal samples were surveyed, the results of which were concordant with culture-based bacterial identification, and the pathogen detection limit with simulated stool specimens was 10(4) CFU/ml. DNA from different pathogens was also mixed to simulate a case of multibacterial infection, and the generated signals correlated well with the mix ratio. In summary, the gyrB-based pyrosequencing approach proved to have significant reliability and discriminatory power for enteropathogenic bacterial identification and provided a fast and effective method for clinical diagnosis.
Using Next Generation Sequencing for Multiplexed Trait-Linked Markers in Wheat

PubMed Central

Bernardo, Amy; Wang, Shan; St. Amand, Paul; Bai, Guihua

2015-01-01

With the advent of next generation sequencing (NGS) technologies, single nucleotide polymorphisms (SNPs) have become the major type of marker for genotyping in many crops. However, the availability of SNP markers for important traits of bread wheat ( Triticum aestivum L.) that can be effectively used in marker-assisted selection (MAS) is still limited and SNP assays for MAS are usually uniplex. A shift from uniplex to multiplex assays will allow the simultaneous analysis of multiple markers and increase MAS efficiency. We designed 33 locus-specific markers from SNP or indel-based marker sequences that linked to 20 different quantitative trait loci (QTL) or genes of agronomic importance in wheat and analyzed the amplicon sequences using an Ion Torrent Proton Sequencer and a custom allele detection pipeline to determine the genotypes of 24 selected germplasm accessions. Among the 33 markers, 27 were successfully multiplexed and 23 had 100% SNP call rates. Results from analysis of "kompetitive allele-specific PCR" (KASP) and sequence tagged site (STS) markers developed from the same loci fully verified the genotype calls of 23 markers. The NGS-based multiplexed assay developed in this study is suitable for rapid and high-throughput screening of SNPs and some indel-based markers in wheat. PMID:26625271
Streptococcus suis, an important pig pathogen and emerging zoonotic agent—an update on the worldwide distribution based on serotyping and sequence typing

PubMed Central

Goyette-Desjardins, Guillaume; Auger, Jean-Philippe; Xu, Jianguo; Segura, Mariela; Gottschalk, Marcelo

2014-01-01

Streptococcus suis is an important pathogen causing economic problems in the pig industry. Moreover, it is a zoonotic agent causing severe infections to people in close contact with infected pigs or pork-derived products. Although considered sporadic in the past, human S. suis infections have been reported during the last 45 years, with two large outbreaks recorded in China. In fact, the number of reported human cases has significantly increased in recent years. In this review, we present the worldwide distribution of serotypes and sequence types (STs), as determined by multilocus sequence typing, for pigs (between 2002 and 2013) and humans (between 1968 and 2013). The methods employed for S. suis identification and typing, the current epidemiological knowledge regarding serotypes and STs and the zoonotic potential of S. suis are discussed. Increased awareness of S. suis in both human and veterinary diagnostic laboratories and further establishment of typing methods will contribute to our knowledge of this pathogen, especially in regions where complete and/or recent data is lacking. More research is required to understand differences in virulence that occur among S. suis strains and if these differences can be associated with specific serotypes or STs. PMID:26038745
DNA barcode goes two-dimensions: DNA QR code web server.

PubMed

Liu, Chang; Shi, Linchun; Xu, Xiaolan; Li, Huan; Xing, Hang; Liang, Dong; Jiang, Kun; Pang, Xiaohui; Song, Jingyuan; Chen, Shilin

2012-01-01

The DNA barcoding technology uses a standard region of DNA sequence for species identification and discovery. At present, "DNA barcode" actually refers to DNA sequences, which are not amenable to information storage, recognition, and retrieval. Our aim is to identify the best symbology that can represent DNA barcode sequences in practical applications. A comprehensive set of sequences for five DNA barcode markers ITS2, rbcL, matK, psbA-trnH, and CO1 was used as the test data. Fifty-three different types of one-dimensional and ten two-dimensional barcode symbologies were compared based on different criteria, such as coding capacity, compression efficiency, and error detection ability. The quick response (QR) code was found to have the largest coding capacity and relatively high compression ratio. To facilitate the further usage of QR code-based DNA barcodes, a web server was developed and is accessible at http://qrfordna.dnsalias.org. The web server allows users to retrieve the QR code for a species of interests, convert a DNA sequence to and from a QR code, and perform species identification based on local and global sequence similarities. In summary, the first comprehensive evaluation of various barcode symbologies has been carried out. The QR code has been found to be the most appropriate symbology for DNA barcode sequences. A web server has also been constructed to allow biologists to utilize QR codes in practical DNA barcoding applications.
PHYSICO: An UNIX based Standalone Procedure for Computation of Individual and Group Properties of Protein Sequences.

PubMed

Gupta, Parth Sarthi Sen; Banerjee, Shyamashree; Islam, Rifat Nawaz Ul; Mondal, Sudipta; Mondal, Buddhadev; Bandyopadhyay, Amal K

2014-01-01

In the genomic and proteomic era, efficient and automated analyses of sequence properties of protein have become an important task in bioinformatics. There are general public licensed (GPL) software tools to perform a part of the job. However, computations of mean properties of large number of orthologous sequences are not possible from the above mentioned GPL sets. Further, there is no GPL software or server which can calculate window dependent sequence properties for a large number of sequences in a single run. With a view to overcome above limitations, we have developed a standalone procedure i.e. PHYSICO, which performs various stages of computation in a single run based on the type of input provided either in RAW-FASTA or BLOCK-FASTA format and makes excel output for: a) Composition, Class composition, Mean molecular weight, Isoelectic point, Aliphatic index and GRAVY, b) column based compositions, variability and difference matrix, c) 25 kinds of window dependent sequence properties. The program is fast, efficient, error free and user friendly. Calculation of mean and standard deviation of homologous sequences sets, for comparison purpose when relevant, is another attribute of the program; a property seldom seen in existing GPL softwares. PHYSICO is freely available for non-commercial/academic user in formal request to the corresponding author akbanerjee@biotech.buruniv.ac.in.
PHYSICO: An UNIX based Standalone Procedure for Computation of Individual and Group Properties of Protein Sequences

PubMed Central

Gupta, Parth Sarthi Sen; Banerjee, Shyamashree; Islam, Rifat Nawaz Ul; Mondal, Sudipta; Mondal, Buddhadev; Bandyopadhyay, Amal K

2014-01-01

In the genomic and proteomic era, efficient and automated analyses of sequence properties of protein have become an important task in bioinformatics. There are general public licensed (GPL) software tools to perform a part of the job. However, computations of mean properties of large number of orthologous sequences are not possible from the above mentioned GPL sets. Further, there is no GPL software or server which can calculate window dependent sequence properties for a large number of sequences in a single run. With a view to overcome above limitations, we have developed a standalone procedure i.e. PHYSICO, which performs various stages of computation in a single run based on the type of input provided either in RAW-FASTA or BLOCK-FASTA format and makes excel output for: a) Composition, Class composition, Mean molecular weight, Isoelectic point, Aliphatic index and GRAVY, b) column based compositions, variability and difference matrix, c) 25 kinds of window dependent sequence properties. The program is fast, efficient, error free and user friendly. Calculation of mean and standard deviation of homologous sequences sets, for comparison purpose when relevant, is another attribute of the program; a property seldom seen in existing GPL softwares. Availability PHYSICO is freely available for non-commercial/academic user in formal request to the corresponding author akbanerjee@biotech.buruniv.ac.in PMID:24616564
Preferential access to genetic information from endogenous hominin ancient DNA and accurate quantitative SNP-typing via SPEX

PubMed Central

Brotherton, Paul; Sanchez, Juan J.; Cooper, Alan; Endicott, Phillip

2010-01-01

The analysis of targeted genetic loci from ancient, forensic and clinical samples is usually built upon polymerase chain reaction (PCR)-generated sequence data. However, many studies have shown that PCR amplification from poor-quality DNA templates can create sequence artefacts at significant levels. With hominin (human and other hominid) samples, the pervasive presence of highly PCR-amplifiable human DNA contaminants in the vast majority of samples can lead to the creation of recombinant hybrids and other non-authentic artefacts. The resulting PCR-generated sequences can then be difficult, if not impossible, to authenticate. In contrast, single primer extension (SPEX)-based approaches can genotype single nucleotide polymorphisms from ancient fragments of DNA as accurately as modern DNA. A single SPEX-type assay can amplify just one of the duplex DNA strands at target loci and generate a multi-fold depth-of-coverage, with non-authentic recombinant hybrids reduced to undetectable levels. Crucially, SPEX-type approaches can preferentially access genetic information from damaged and degraded endogenous ancient DNA templates over modern human DNA contaminants. The development of SPEX-type assays offers the potential for highly accurate, quantitative genotyping from ancient hominin samples. PMID:19864251
Gradient waveform pre-emphasis based on the gradient system transfer function.

PubMed

Stich, Manuel; Wech, Tobias; Slawig, Anne; Ringler, Ralf; Dewdney, Andrew; Greiser, Andreas; Ruyters, Gudrun; Bley, Thorsten A; Köstler, Herbert

2018-02-25

The gradient system transfer function (GSTF) has been used to describe the distorted k-space trajectory for image reconstruction. The purpose of this work was to use the GSTF to determine the pre-emphasis for an undistorted gradient output and intended k-space trajectory. The GSTF of the MR system was determined using only standard MR hardware without special equipment such as field probes or a field camera. The GSTF was used for trajectory prediction in image reconstruction and for a gradient waveform pre-emphasis. As test sequences, a gradient-echo sequence with phase-encoding gradient modulation and a gradient-echo sequence with a spiral read-out trajectory were implemented and subsequently applied on a structural phantom and in vivo head measurements. Image artifacts were successfully suppressed by applying the GSTF-based pre-emphasis. Equivalent results are achieved with images acquired using GSTF-based post-correction of the trajectory as a part of image reconstruction. In contrast, the pre-emphasis approach allows reconstruction using the initially intended trajectory. The artifact suppression shown for two sequences demonstrates that the GSTF can serve for a novel pre-emphasis. A pre-emphasis based on the GSTF information can be applied to any arbitrary sequence type. © 2018 International Society for Magnetic Resonance in Medicine.
Intact long-type dupA as a marker for gastroduodenal diseases in Okinawan subpopulation, Japan.

PubMed

Takahashi, Ayaka; Shiota, Seiji; Matsunari, Osamu; Watada, Masahide; Suzuki, Rumiko; Nakachi, Saori; Kinjo, Nagisa; Kinjo, Fukunori; Yamaoka, Yoshio

2013-02-01

Helicobacter pylori dupA can be divided into two types according to the presence or absence of the mutation. In addition, full-sequenced data revealed that dupA has two types with different lengths depend on the presence of approximately 600 bp in the putative 5' region (presence; long-type and absence; short-type), which has not been taken into account in previous studies. A total of 319 strains isolated from Okinawa, the south islands of Japan, were included. The status of dupA and cagA was determined by polymerase chain reaction. The presence of mutations in long-type dupA was determined by DNA sequencing. The prevalence of long-type dupA was 26.3% (84/319). Sequence analysis showed that there were only six cases (7.1%) with point mutations lead to stop codon among 84 long-type dupA strains studied. Interestingly, intact long-type dupA without frameshift mutation, but not short-type dupA, was significantly associated with gastric ulcer and gastric cancer than gastritis (p = .001 and p = .019, respectively). After adjustment by age, gender, and cagA, the presence of intact long-type dupA was significantly associated with gastric ulcer and gastric cancer compared with gastritis (odds ratio [OR] = 3.35, 95% confidence interval [CI] = 1.55-7.24 and OR = 4.14, 95% CI = 1.23-13.94, respectively). Intact long-type dupA is a real virulence marker for severe outcomes in Okinawa, Japan. The previous information gained from PCR-based methods without taking long-type dupA into account must be interpreted with caution. © 2012 Blackwell Publishing Ltd.
Intact long-type dupA as a marker for gastroduodenal diseases in Okinawan subpopulation, Japan

PubMed Central

Takahashi, Ayaka; Shiota, Seiji; Matsunari, Osamu; Watada, Masahide; Suzuki, Rumiko; Nakachi, Saori; Kinjo, Nagisa; Kinjo, Fukunori; Yamaoka, Yoshio

2012-01-01

Background Helicobacter pylori dupA can be divided into two types according to the presence or absence of the mutation. In addition, full-sequenced data revealed that dupA has two types with different lengths depend on the presence of approximately 600 bp in the putative 5' region (presence; long-type and absence; short-type), which has not been taken into account in previous studies. Methods A total of 319 strains isolated from Okinawa, the south islands of Japan, were included. The status of dupA and cagA was determined by polymerase chain reaction. The presence of mutations in long-type dupA was determined by DNA sequencing. Results The prevalence of long-type dupA was 26.3% (84/319). Sequence analysis showed that there were only 6 cases (7.1%) with point mutations lead to stop codon among 84 long-type dupA strains studied. Interestingly, intact long-type dupA without frameshift mutation, but not short-type dupA was significantly associated with gastric ulcer and gastric cancer than gastritis (P = 0.001 and P = 0.019, respectively). After adjustment by age, gender and cagA, the presence of intact long-type dupA was significantly associated with gastric ulcer and gastric cancer compared with gastritis (odds ratio [OR] = 3.35, 95% confidence interval [CI] = 1.55–7.24 and OR = 4.14, 95% CI = 1.23–13.94, respectively). Conclusions Intact long-type dupA is a real virulence marker for severe outcomes in Okinawa, Japan. The previous information gained from PCR-based methods without taking long-type dupA into account must be interpreted with caution. PMID:23067336

Development of a genotyping microarray for Usher syndrome.

PubMed

Cremers, Frans P M; Kimberling, William J; Külm, Maigi; de Brouwer, Arjan P; van Wijk, Erwin; te Brinke, Heleen; Cremers, Cor W R J; Hoefsloot, Lies H; Banfi, Sandro; Simonelli, Francesca; Fleischhauer, Johannes C; Berger, Wolfgang; Kelley, Phil M; Haralambous, Elene; Bitner-Glindzicz, Maria; Webster, Andrew R; Saihan, Zubin; De Baere, Elfride; Leroy, Bart P; Silvestri, Giuliana; McKay, Gareth J; Koenekoop, Robert K; Millan, Jose M; Rosenberg, Thomas; Joensuu, Tarja; Sankila, Eeva-Marja; Weil, Dominique; Weston, Mike D; Wissinger, Bernd; Kremer, Hannie

2007-02-01

Usher syndrome, a combination of retinitis pigmentosa (RP) and sensorineural hearing loss with or without vestibular dysfunction, displays a high degree of clinical and genetic heterogeneity. Three clinical subtypes can be distinguished, based on the age of onset and severity of the hearing impairment, and the presence or absence of vestibular abnormalities. Thus far, eight genes have been implicated in the syndrome, together comprising 347 protein-coding exons. To improve DNA diagnostics for patients with Usher syndrome, we developed a genotyping microarray based on the arrayed primer extension (APEX) method. Allele-specific oligonucleotides corresponding to all 298 Usher syndrome-associated sequence variants known to date, 76 of which are novel, were arrayed. Approximately half of these variants were validated using original patient DNAs, which yielded an accuracy of >98%. The efficiency of the Usher genotyping microarray was tested using DNAs from 370 unrelated European and American patients with Usher syndrome. Sequence variants were identified in 64/140 (46%) patients with Usher syndrome type I, 45/189 (24%) patients with Usher syndrome type II, 6/21 (29%) patients with Usher syndrome type III and 6/20 (30%) patients with atypical Usher syndrome. The chip also identified two novel sequence variants, c.400C>T (p.R134X) in PCDH15 and c.1606T>C (p.C536S) in USH2A. The Usher genotyping microarray is a versatile and affordable screening tool for Usher syndrome. Its efficiency will improve with the addition of novel sequence variants with minimal extra costs, making it a very useful first-pass screening tool.
Development of a genotyping microarray for Usher syndrome

PubMed Central

Cremers, Frans P M; Kimberling, William J; Külm, Maigi; de Brouwer, Arjan P; van Wijk, Erwin; te Brinke, Heleen; Cremers, Cor W R J; Hoefsloot, Lies H; Banfi, Sandro; Simonelli, Francesca; Fleischhauer, Johannes C; Berger, Wolfgang; Kelley, Phil M; Haralambous, Elene; Bitner‐Glindzicz, Maria; Webster, Andrew R; Saihan, Zubin; De Baere, Elfride; Leroy, Bart P; Silvestri, Giuliana; McKay, Gareth J; Koenekoop, Robert K; Millan, Jose M; Rosenberg, Thomas; Joensuu, Tarja; Sankila, Eeva‐Marja; Weil, Dominique; Weston, Mike D; Wissinger, Bernd; Kremer, Hannie

2007-01-01

Background Usher syndrome, a combination of retinitis pigmentosa (RP) and sensorineural hearing loss with or without vestibular dysfunction, displays a high degree of clinical and genetic heterogeneity. Three clinical subtypes can be distinguished, based on the age of onset and severity of the hearing impairment, and the presence or absence of vestibular abnormalities. Thus far, eight genes have been implicated in the syndrome, together comprising 347 protein‐coding exons. Methods: To improve DNA diagnostics for patients with Usher syndrome, we developed a genotyping microarray based on the arrayed primer extension (APEX) method. Allele‐specific oligonucleotides corresponding to all 298 Usher syndrome‐associated sequence variants known to date, 76 of which are novel, were arrayed. Results Approximately half of these variants were validated using original patient DNAs, which yielded an accuracy of >98%. The efficiency of the Usher genotyping microarray was tested using DNAs from 370 unrelated European and American patients with Usher syndrome. Sequence variants were identified in 64/140 (46%) patients with Usher syndrome type I, 45/189 (24%) patients with Usher syndrome type II, 6/21 (29%) patients with Usher syndrome type III and 6/20 (30%) patients with atypical Usher syndrome. The chip also identified two novel sequence variants, c.400C>T (p.R134X) in PCDH15 and c.1606T>C (p.C536S) in USH2A. Conclusion The Usher genotyping microarray is a versatile and affordable screening tool for Usher syndrome. Its efficiency will improve with the addition of novel sequence variants with minimal extra costs, making it a very useful first‐pass screening tool. PMID:16963483
Sequence-Based Genotyping of Expressed Swine Leukocyte Antigen Class I Alleles by Next-Generation Sequencing Reveal Novel Swine Leukocyte Antigen Class I Haplotypes and Alleles in Belgian, Danish, and Kenyan Fattening Pigs and Göttingen Minipigs.

PubMed

Sørensen, Maria Rathmann; Ilsøe, Mette; Strube, Mikael Lenz; Bishop, Richard; Erbs, Gitte; Hartmann, Sofie Bruun; Jungersen, Gregers

2017-01-01

The need for typing of the swine leukocyte antigen (SLA) is increasing with the expanded use of pigs as models for human diseases and organ-transplantation experiments, their use in infection studies, and for design of veterinary vaccines. Knowledge of SLA sequences is furthermore a prerequisite for the prediction of epitope binding in pigs. The low number of known SLA class I alleles and the limited knowledge of their prevalence in different pig breeds emphasizes the need for efficient SLA typing methods. This study utilizes an SLA class I-typing method based on next-generation sequencing of barcoded PCR amplicons. The amplicons were generated with universal primers and predicted to resolve 68-88% of all known SLA class I alleles dependent on amplicon size. We analyzed the SLA profiles of 72 pigs from four different pig populations; Göttingen minipigs and Belgian, Kenyan, and Danish fattening pigs. We identified 67 alleles, nine previously described haplotypes and 15 novel haplotypes. The highest variation in SLA class I profiles was observed in the Danish pigs and the lowest among the Göttingen minipig population, which also have the highest percentage of homozygote individuals. Highlighting the fact that there are still numerous unknown SLA class I alleles to be discovered, a total of 12 novel SLA class I alleles were identified. Overall, we present new information about known and novel alleles and haplotypes and their prevalence in the tested pig populations.
DNA methylation at hepatitis B viral integrants is associated with methylation at flanking human genomic sequences

PubMed Central

Watanabe, Yoshiyuki; Yamamoto, Hiroyuki; Oikawa, Ritsuko; Toyota, Minoru; Yamamoto, Masakazu; Kokudo, Norihiro; Tanaka, Shinji; Arii, Shigeki; Yotsuyanagi, Hiroshi; Koike, Kazuhiko; Itoh, Fumio

2015-01-01

Integration of DNA viruses into the human genome plays an important role in various types of tumors, including hepatitis B virus (HBV)–related hepatocellular carcinoma. However, the molecular details and clinical impact of HBV integration on either human or HBV epigenomes are unknown. Here, we show that methylation of the integrated HBV DNA is related to the methylation status of the flanking human genome. We developed a next-generation sequencing-based method for structural methylation analysis of integrated viral genomes (denoted G-NaVI). This method is a novel approach that enables enrichment of viral fragments for sequencing using unique baits based on the sequence of the HBV genome. We detected integrated HBV sequences in the genome of the PLC/PRF/5 cell line and found variable levels of methylation within the integrated HBV genomes. Allele-specific methylation analysis revealed that the HBV genome often became significantly methylated when integrated into highly methylated host sites. After integration into unmethylated human genome regions such as promoters, however, the HBV DNA remains unmethylated and may eventually play an important role in tumorigenesis. The observed dynamic changes in DNA methylation of the host and viral genomes may functionally affect the biological behavior of HBV. These findings may impact public health given that millions of people worldwide are carriers of HBV. We also believe our assay will be a powerful tool to increase our understanding of the various types of DNA virus-associated tumorigenesis. PMID:25653310
Genome Sequence of Saccharomyces carlsbergensis, the World’s First Pure Culture Lager Yeast

PubMed Central

Walther, Andrea; Hesselbart, Ana; Wendland, Jürgen

2014-01-01

Lager yeast beer production was revolutionized by the introduction of pure culture strains. The first established lager yeast strain is known as the bottom fermenting Saccharomyces carlsbergensis, which was originally termed Unterhefe No. 1 by Emil Chr. Hansen and has been used in production in since 1883. S. carlsbergensis belongs to group I/Saaz-type lager yeast strains and is better adapted to cold growth conditions than group II/Frohberg-type lager yeasts, e.g., the Weihenstephan strain WS34/70. Here, we sequenced S. carlsbergensis using next generation sequencing technologies. Lager yeasts are descendants from hybrids formed between a S. cerevisiae parent and a parent similar to S. eubayanus. Accordingly, the S. carlsbergensis 19.5-Mb genome is substantially larger than the 12-Mb S. cerevisiae genome. Based on the sequence scaffolds, synteny to the S. cerevisae genome, and by using directed polymerase chain reaction for gap closure, we generated a chromosomal map of S. carlsbergensis consisting of 29 unique chromosomes. We present evidence for genome and chromosome evolution within S. carlsbergensis via chromosome loss and loss of heterozygosity specifically of parts derived from the S. cerevisiae parent. Based on our sequence data and via fluorescence-activated cell-sorting analysis, we determined the ploidy of S. carlsbergensis. This inferred that this strain is basically triploid with a diploid S. eubayanus and haploid S. cerevisiae genome content. In contrast the Weihenstephan strain, which we resequenced, is essentially tetraploid composed of two diploid S. cerevisiae and S. eubayanus genomes. Based on conserved translocations between the parental genomes in S. carlsbergensis and the Weihenstephan strain we propose a joint evolutionary ancestry for lager yeast strains. PMID:24578374
Molecular characterization and phylogenetic relationships among microsporidian isolates infecting silkworm, Bombyx mori using small subunit rRNA (SSU-rRNA) gene sequence analysis.

PubMed

Nath, B Surendra; Gupta, S K; Bajpai, A K

2012-12-01

The life cycle, spore morphology, pathogenicity, tissue specificity, mode of transmission and small subunit rRNA (SSU-rRNA) gene sequence analysis of the five new microsporidian isolates viz., NIWB-11bp, NIWB-12n, NIWB-13md, NIWB-14b and NIWB-15mb identified from the silkworm, Bombyx mori have been studied along with type species, NIK-1s_mys. The life cycle of the microsporidians identified exhibited the sequential developmental cycles that are similar to the general developmental cycle of the genus, Nosema. The spores showed considerable variations in their shape, length and width. The pathogenicity observed was dose-dependent and differed from each of the microsporidian isolates; the NIWB-15mb was found to be more virulent than other isolates. All of the microsporidians were found to infect most of the tissues examined and showed gonadal infection and transovarial transmission in the infected silkworms. SSU-rRNA sequence based phylogenetic tree placed NIWB-14b, NIWB-12n and NIWB-11bp in a separate branch along with other Nosema species and Nosema bombycis; while NIWB-15mb and NIWB-13md together formed another cluster along with other Nosema species. NIK-1s_mys revealed a signature sequence similar to standard type species, N. bombycis, indicating that NIK-1s_mys is similar to N. bombycis. Based on phylogenetic relationships, branch length information based on genetic distance and nucleotide differences, we conclude that the microsporidian isolates identified are distinctly different from the other known species and belonging to the genus, Nosema. This SSU-rRNA gene sequence analysis method is found to be more useful approach in detecting different and closely related microsporidians of this economically important domestic insect.
Analysis of whole genome sequences of 16 strains of rubella virus from the United States, 1961-2009.

PubMed

Abernathy, Emily; Chen, Min-hsin; Bera, Jayati; Shrivastava, Susmita; Kirkness, Ewen; Zheng, Qi; Bellini, William; Icenogle, Joseph

2013-01-25

Rubella virus is the causative agent of rubella, a mild rash illness, and a potent teratogenic agent when contracted by a pregnant woman. Global rubella control programs target the reduction and elimination of congenital rubella syndrome. Phylogenetic analysis of partial sequences of rubella viruses has contributed to virus surveillance efforts and played an important role in demonstrating that indigenous rubella viruses have been eliminated in the United States. Sixteen wild-type rubella viruses were chosen for whole genome sequencing. All 16 viruses were collected in the United States from 1961 to 2009 and are from 8 of the 13 known rubella genotypes. Phylogenetic analysis of 30 whole genome sequences produced a maximum likelihood tree giving high bootstrap values for all genotypes except provisional genotype 1a. Comparison of the 16 new complete sequences and 14 previously sequenced wild-type viruses found regions with clusters of variable amino acids. The 5' 250 nucleotides of the genome are more conserved than any other part of the genome. Genotype specific deletions in the untranslated region between the non-structural and structural open reading frames were observed for genotypes 2B and genotype 1G. No evidence was seen for recombination events among the 30 viruses. The analysis presented here is consistent with previous reports on the genetic characterization of rubella virus genomes. Conserved and variable regions were identified and additional evidence for genotype specific nucleotide deletions in the intergenic region was found. Phylogenetic analysis confirmed genotype groupings originally based on structural protein coding region sequences, which provides support for the WHO nomenclature for genetic characterization of wild-type rubella viruses.
Molecular Evolution of a Type 1 Wild-Vaccine Poliovirus Recombinant during Widespread Circulation in China

PubMed Central

Liu, Hong-Mei; Zheng, Du-Ping; Zhang, Li-Bi; Oberste, M. Steven; Pallansch, Mark A.; Kew, Olen M.

2000-01-01

Type 1 wild-vaccine recombinant polioviruses were isolated from poliomyelitis patients in China from 1991 to 1993. We compared the sequences of 34 recombinant isolates over the 1,353-nucleotide (nt) genomic interval (nt 2480 to 3832) encoding the major capsid protein, VP1, and the protease, 2A. All recombinants had a 367-nt block of sequence (nt 3271 to 3637) derived from the Sabin 1 oral poliovirus vaccine strain spanning the 3′-terminal sequences of VP1 (115 nt) and the 5′ half of 2A (252 nt). The remaining VP1 sequences were closely (up to 99.5%) related to those of a major genotype of wild type 1 poliovirus endemic to China up to 1994. In contrast, the non-vaccine-derived sequences at the 3′ half of 2A were more distantly related (<90% nucleotide sequence match) to those of other contemporary wild polioviruses from China. The vaccine-derived sequences of the earliest (April 1991) isolates completely matched those of Sabin 1. Later isolates diverged from the early isolates primarily by accumulation of synonymous base substitutions (at a rate of ∼3.7 × 10−2 substitutions per synonymous site per year) over the entire VP1-2A interval. Distinct evolutionary lineages were found in different Chinese provinces. From the combined epidemiologic and evolutionary analyses, we propose that the recombinant virus arose during mixed infection of a single individual in northern China in early 1991 and that its progeny spread by multiple independent chains of transmission into some of the most populous areas of China within a year of the initiating infection. PMID:11070012
Genetic characterization of a Coxsackie A9 virus associated with aseptic meningitis in Alberta, Canada in 2010

PubMed Central

2013-01-01

Background An unusually high incidence of aseptic meningitis caused by enteroviruses was noted in Alberta, Canada between March and October 2010. Sequence based typing was performed on the enterovirus positive samples to gain a better understanding of the molecular characteristics of the Coxsackie A9 (CVA-9) strain responsible for most cases in this outbreak. Methods Molecular typing was performed by amplification and sequencing of the VP2 region. The genomic sequence of one of the 2010 outbreak isolates was compared to a CVA-9 isolate from 2003 and the prototype sequence to study genetic drift and recombination. Results Of the 4323 samples tested, 213 were positive for enteroviruses (4.93%). The majority of the positives were detected in CSF samples (n = 157, 73.71%) and 81.94% of the sequenced isolates were typed as CVA-9. The sequenced CVA-9 positives were predominantly (94.16%) detected in patients ranging in age from 15 to 29 years and the peak months for detection were between March and October. Full genome sequence comparisons revealed that the CVA-9 viruses isolated in Alberta in 2003 and 2010 were highly homologous to the prototype CVA-9 in the structural VP1, VP2 and VP3 regions but divergent in the VP4, non-structural and non-coding regions. Conclusion The increase in cases of aseptic meningitis was associated with enterovirus CVA-9. Sequence divergence between the prototype strain of CVA-9 and the Alberta isolates suggests genetic drifting and/or recombination events, however the sequence was conserved in the antigenic regions determined by the VP1, VP2 and VP3 genes. These results suggest that the increase in CVA-9 cases likely did not result from the emergence of a radically different immune escape mutant. PMID:23521862
NetTurnP – Neural Network Prediction of Beta-turns by Use of Evolutionary Information and Predicted Protein Sequence Features

PubMed Central

Petersen, Bent; Lundegaard, Claus; Petersen, Thomas Nordahl

2010-01-01

β-turns are the most common type of non-repetitive structures, and constitute on average 25% of the amino acids in proteins. The formation of β-turns plays an important role in protein folding, protein stability and molecular recognition processes. In this work we present the neural network method NetTurnP, for prediction of two-class β-turns and prediction of the individual β-turn types, by use of evolutionary information and predicted protein sequence features. It has been evaluated against a commonly used dataset BT426, and achieves a Matthews correlation coefficient of 0.50, which is the highest reported performance on a two-class prediction of β-turn and not-β-turn. Furthermore NetTurnP shows improved performance on some of the specific β-turn types. In the present work, neural network methods have been trained to predict β-turn or not and individual β-turn types from the primary amino acid sequence. The individual β-turn types I, I', II, II', VIII, VIa1, VIa2, VIba and IV have been predicted based on classifications by PROMOTIF, and the two-class prediction of β-turn or not is a superset comprised of all β-turn types. The performance is evaluated using a golden set of non-homologous sequences known as BT426. Our two-class prediction method achieves a performance of: MCC = 0.50, Qtotal = 82.1%, sensitivity = 75.6%, PPV = 68.8% and AUC = 0.864. We have compared our performance to eleven other prediction methods that obtain Matthews correlation coefficients in the range of 0.17 – 0.47. For the type specific β-turn predictions, only type I and II can be predicted with reasonable Matthews correlation coefficients, where we obtain performance values of 0.36 and 0.31, respectively. Conclusion The NetTurnP method has been implemented as a webserver, which is freely available at http://www.cbs.dtu.dk/services/NetTurnP/. NetTurnP is the only available webserver that allows submission of multiple sequences. PMID:21152409
NetTurnP--neural network prediction of beta-turns by use of evolutionary information and predicted protein sequence features.

PubMed

Petersen, Bent; Lundegaard, Claus; Petersen, Thomas Nordahl

2010-11-30

β-turns are the most common type of non-repetitive structures, and constitute on average 25% of the amino acids in proteins. The formation of β-turns plays an important role in protein folding, protein stability and molecular recognition processes. In this work we present the neural network method NetTurnP, for prediction of two-class β-turns and prediction of the individual β-turn types, by use of evolutionary information and predicted protein sequence features. It has been evaluated against a commonly used dataset BT426, and achieves a Matthews correlation coefficient of 0.50, which is the highest reported performance on a two-class prediction of β-turn and not-β-turn. Furthermore NetTurnP shows improved performance on some of the specific β-turn types. In the present work, neural network methods have been trained to predict β-turn or not and individual β-turn types from the primary amino acid sequence. The individual β-turn types I, I', II, II', VIII, VIa1, VIa2, VIba and IV have been predicted based on classifications by PROMOTIF, and the two-class prediction of β-turn or not is a superset comprised of all β-turn types. The performance is evaluated using a golden set of non-homologous sequences known as BT426. Our two-class prediction method achieves a performance of: MCC=0.50, Qtotal=82.1%, sensitivity=75.6%, PPV=68.8% and AUC=0.864. We have compared our performance to eleven other prediction methods that obtain Matthews correlation coefficients in the range of 0.17-0.47. For the type specific β-turn predictions, only type I and II can be predicted with reasonable Matthews correlation coefficients, where we obtain performance values of 0.36 and 0.31, respectively. The NetTurnP method has been implemented as a webserver, which is freely available at http://www.cbs.dtu.dk/services/NetTurnP/. NetTurnP is the only available webserver that allows submission of multiple sequences.
Flexbar 3.0 - SIMD and multicore parallelization.

PubMed

Roehr, Johannes T; Dieterich, Christoph; Reinert, Knut

2017-09-15

High-throughput sequencing machines can process many samples in a single run. For Illumina systems, sequencing reads are barcoded with an additional DNA tag that is contained in the respective sequencing adapters. The recognition of barcode and adapter sequences is hence commonly needed for the analysis of next-generation sequencing data. Flexbar performs demultiplexing based on barcodes and adapter trimming for such data. The massive amounts of data generated on modern sequencing machines demand that this preprocessing is done as efficiently as possible. We present Flexbar 3.0, the successor of the popular program Flexbar. It employs now twofold parallelism: multi-threading and additionally SIMD vectorization. Both types of parallelism are used to speed-up the computation of pair-wise sequence alignments, which are used for the detection of barcodes and adapters. Furthermore, new features were included to cover a wide range of applications. We evaluated the performance of Flexbar based on a simulated sequencing dataset. Our program outcompetes other tools in terms of speed and is among the best tools in the presented quality benchmark. https://github.com/seqan/flexbar. johannes.roehr@fu-berlin.de or knut.reinert@fu-berlin.de. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Winnowing DNA for Rare Sequences: Highly Specific Sequence and Methylation Based Enrichment

PubMed Central

Thompson, Jason D.; Shibahara, Gosuke; Rajan, Sweta; Pel, Joel; Marziali, Andre

2012-01-01

Rare mutations in cell populations are known to be hallmarks of many diseases and cancers. Similarly, differential DNA methylation patterns arise in rare cell populations with diagnostic potential such as fetal cells circulating in maternal blood. Unfortunately, the frequency of alleles with diagnostic potential, relative to wild-type background sequence, is often well below the frequency of errors in currently available methods for sequence analysis, including very high throughput DNA sequencing. We demonstrate a DNA preparation and purification method that through non-linear electrophoretic separation in media containing oligonucleotide probes, achieves 10,000 fold enrichment of target DNA with single nucleotide specificity, and 100 fold enrichment of unmodified methylated DNA differing from the background by the methylation of a single cytosine residue. PMID:22355378
Phenotypic, Genotypic, and Antimicrobial Characteristics of Streptococcus halichoeri Isolates from Humans, Proposal To Rename Streptococcus halichoeri as Streptococcus halichoeri subsp. halichoeri, and Description of Streptococcus halichoeri subsp. hominis subsp. nov., a Bacterium Associated with Human Clinical Infections.

PubMed

Shewmaker, P L; Whitney, A M; Humrighouse, B W

2016-03-01

Phenotypic, genotypic, and antimicrobial characteristics of six phenotypically distinct human clinical isolates that most closely resembled the type strain of Streptococcus halichoeri isolated from a seal are presented. Sequencing of the 16S rRNA, rpoB, sodA, and recN genes; comparative whole-genome analysis; conventional biochemical and Rapid ID 32 Strep identification methods; and antimicrobial susceptibility testing were performed on the human isolates, the type strain of S. halichoeri, and type strains of closely related species. The six human clinical isolates were biochemically indistinguishable from each other and showed 100% 16S rRNA, rpoB, sodA, and recN gene sequence similarity. Comparative 16S rRNA gene sequencing analysis revealed 98.6% similarity to S. halichoeri CCUG 48324(T), 97.9% similarity to S. canis ATCC 43496(T), and 97.8% similarity to S. ictaluri ATCC BAA-1300(T). A 3,530-bp fragment of the rpoB gene was 98.8% similar to the S. halichoeri type strain, 84.6% to the S. canis type strain, and 83.8% to the S. ictaluri type strain. The S. halichoeri type strain and the human clinical isolates were susceptible to the antimicrobials tested based on CLSI guidelines for Streptococcus species viridans group with the exception of tetracycline and erythromycin. The human isolates were phenotypically distinct from the type strain isolated from a seal; comparative whole-genome sequence analysis confirmed that the human isolates were S. halichoeri. On the basis of these results, a novel subspecies, Streptococcus halichoeri subsp. hominis, is proposed for the human isolates and Streptococcus halichoeri subsp. halichoeri is proposed for the gray seal isolates. The type strain of the novel subspecies is SS1844(T) = CCUG 67100(T) = LMG 28801(T). Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Multiplex detection of respiratory pathogens

DOEpatents

McBride, Mary [Brentwood, CA; Slezak, Thomas [Livermore, CA; Birch, James M [Albany, CA

2012-07-31

Described are kits and methods useful for detection of respiratory pathogens (influenza A (including subtyping capability for H1, H3, H5 and H7 subtypes) influenza B, parainfluenza (type 2), respiratory syncytial virus, and adenovirus) in a sample. Genomic sequence information from the respiratory pathogens was analyzed to identify signature sequences, e.g., polynucleotide sequences useful for confirming the presence or absence of a pathogen in a sample. Primer and probe sets were designed and optimized for use in a PCR based, multiplexed Luminex assay to successfully identify the presence or absence of pathogens in a sample.
Serotype IV Sequence Type 468 Group B Streptococcus Neonatal Invasive Disease, Minnesota, USA.

PubMed

Teatero, Sarah; Ferrieri, Patricia; Fittipaldi, Nahuel

2016-11-01

To further understand the emergence of serotype IV group B Streptococcus (GBS) invasive disease, we used whole-genome sequencing to characterize 3 sequence type 468 strains isolated from neonates in Minnesota, USA. We found that strains of tetracycline-resistant sequence type 468 GBS have acquired virulence genes from a putative clonal complex 17 GBS donor by recombination.
Phylogeny of 54 representative strains of species in the family Pasteurellaceae as determined by comparison of 16S rRNA sequences.

PubMed Central

Dewhirst, F E; Paster, B J; Olsen, I; Fraser, G J

1992-01-01

Virtually complete 16S rRNA sequences were determined for 54 representative strains of species in the family Pasteurellaceae. Of these strains, 15 were Pasteurella, 16 were Actinobacillus, and 23 were Haemophilus. A phylogenetic tree was constructed based on sequence similarity, using the Neighbor-Joining method. Fifty-three of the strains fell within four large clusters. The first cluster included the type strains of Haemophilus influenzae, H. aegyptius, H. aphrophilus, H. haemolyticus, H. paraphrophilus, H. segnis, and Actinobacillus actinomycetemcomitans. This cluster also contained A. actinomycetemcomitans FDC Y4, ATCC 29522, ATCC 29523, and ATCC 29524 and H. aphrophilus NCTC 7901. The second cluster included the type strains of A. seminis and Pasteurella aerogenes and H. somnus OVCG 43826. The third cluster was composed of the type strains of Pasteurella multocida, P. anatis, P. avium, P. canis, P. dagmatis, P. gallinarum, P. langaa, P. stomatis, P. volantium, H. haemoglobinophilus, H. parasuis, H. paracuniculus, H. paragallinarum, and A. capsulatus. This cluster also contained Pasteurella species A CCUG 18782, Pasteurella species B CCUG 19974, Haemophilus taxon C CAPM 5111, H. parasuis type 5 Nagasaki, P. volantium (H. parainfluenzae) NCTC 4101, and P. trehalosi NCTC 10624. The fourth cluster included the type strains of Actinobacillus lignieresii, A. equuli, A. pleuropneumoniae, A. suis, A. ureae, H. parahaemolyticus, H. parainfluenzae, H. paraphrohaemolyticus, H. ducreyi, and P. haemolytica. This cluster also contained Actinobacillus species strain CCUG 19799 (Bisgaard taxon 11), A. suis ATCC 15557, H. ducreyi ATCC 27722 and HD 35000, Haemophilus minor group strain 202, and H. parainfluenzae ATCC 29242. The type strain of P. pneumotropica branched alone to form a fifth group. The branching of the Pasteurellaceae family tree was quite complex. The four major clusters contained multiple subclusters. The clusters contained both rapidly and slowly evolving strains (indicated by differing numbers of base changes incorporated into the 16S rRNA sequence relative to outgroup organisms). While the results presented a clear picture of the phylogenetic relationships, the complexity of the branching will make division of the family into genera a difficult and somewhat subjective task. We do not suggest any taxonomic changes at this time. PMID:1548238
Multiple alignment-free sequence comparison

PubMed Central

Ren, Jie; Song, Kai; Sun, Fengzhu; Deng, Minghua; Reinert, Gesine

2013-01-01

Motivation: Recently, a range of new statistics have become available for the alignment-free comparison of two sequences based on k-tuple word content. Here, we extend these statistics to the simultaneous comparison of more than two sequences. Our suite of statistics contains, first, and , extensions of statistics for pairwise comparison of the joint k-tuple content of all the sequences, and second, , and , averages of sums of pairwise comparison statistics. The two tasks we consider are, first, to identify sequences that are similar to a set of target sequences, and, second, to measure the similarity within a set of sequences. Results: Our investigation uses both simulated data as well as cis-regulatory module data where the task is to identify cis-regulatory modules with similar transcription factor binding sites. We find that although for real data, all of our statistics show a similar performance, on simulated data the Shepp-type statistics are in some instances outperformed by star-type statistics. The multiple alignment-free statistics are more sensitive to contamination in the data than the pairwise average statistics. Availability: Our implementation of the five statistics is available as R package named ‘multiAlignFree’ at be http://www-rcf.usc.edu/∼fsun/Programs/multiAlignFree/multiAlignFreemain.html. Contact: reinert@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23990418
Complete genome sequence of Pedobacter heparinus type strain (HIM 762-3T)

PubMed Central

Han, Cliff; Spring, Stefan; Lapidus, Alla; Del Rio, Tijana Glavina; Tice, Hope; Copeland, Alex; Cheng, Jan-Fang; Lucas, Susan; Chen, Feng; Nolan, Matt; Bruce, David; Goodwin, Lynne; Pitluck, Sam; Ivanova, Natalia; Mavromatis, Konstantinos; Mikhailova, Natalia; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam; Hauser, Loren; Chang, Yun-Juan; Jeffries, Cynthia C.; Saunders, Elizabeth; Chertkov, Olga; Brettin, Thomas; Göker, Markus; Rohde, Manfred; Bristow, Jim; Eisen, Jonathan A.; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C.; Klenk, Hans-Peter; Detter, John C.

2009-01-01

Pedobacter heparinus (Payza and Korn 1956) Steyn et al. 1998 comb. nov. is the type species of the rapidly growing genus Pedobacter within the family Sphingobacteriaceae of the phylum ‘Bacteroidetes’. P. heparinus is of interest, because it was the first isolated strain shown to grow with heparin as sole carbon and nitrogen source and because it produces several enzymes involved in the degradation of mucopolysaccharides. All available data about this species are based on a sole strain that was isolated from dry soil. Here we describe the features of this organism, together with the complete genome sequence, and annotation. This is the first report on a complete genome sequence of a member of the genus Pedobacter, and the 5,167,383 bp long single replicon genome with its 4287 protein-coding and 54 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project. PMID:21304637
High-Throughput Single-Cell RNA Sequencing and Data Analysis.

PubMed

Sagar; Herman, Josip Stefan; Pospisilik, John Andrew; Grün, Dominic

2018-01-01

Understanding biological systems at a single cell resolution may reveal several novel insights which remain masked by the conventional population-based techniques providing an average readout of the behavior of cells. Single-cell transcriptome sequencing holds the potential to identify novel cell types and characterize the cellular composition of any organ or tissue in health and disease. Here, we describe a customized high-throughput protocol for single-cell RNA-sequencing (scRNA-seq) combining flow cytometry and a nanoliter-scale robotic system. Since scRNA-seq requires amplification of a low amount of endogenous cellular RNA, leading to substantial technical noise in the dataset, downstream data filtering and analysis require special care. Therefore, we also briefly describe in-house state-of-the-art data analysis algorithms developed to identify cellular subpopulations including rare cell types as well as to derive lineage trees by ordering the identified subpopulations of cells along the inferred differentiation trajectories.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Larimer, Frank W; Chain, Patrick S. G.; Hauser, Loren John

Rhodopseudomonas palustris is among the most metabolically versatile bacteria known. It uses light, inorganic compounds, or organic compounds, for energy. It acquires carbon from many types of green plant-derived compounds or by carbon dioxide fixation, and it fixes nitrogen. Here we describe the genome sequence of R. palustris, which consists of a 5,459,213-base-pair (bp) circular chromosome with 4,836 predicted genes and a plasmid of 8,427 bp. The sequence reveals genes that confer a remarkably large number of options within a given type of metabolism, including three nitrogenases, five benzene ring cleavage pathways and four light harvesting 2 systems. R. palustrismore » encodes 63 signal transduction histidine kinases and 79 response regulator receiver domains. Almost 15% of the genome is devoted to transport. This genome sequence is a starting point to use R. palustris as a model to explore how organisms integrate metabolic modules in response to environmental perturbations.« less
Does typing of Chlamydia trachomatis using housekeeping multilocus sequence typing reveal different sexual networks among heterosexuals and men who have sex with men?

PubMed

Versteeg, Bart; Bruisten, Sylvia M; van der Ende, Arie; Pannekoek, Yvonne

2016-04-18

Chlamydia trachomatis infections remain the most common bacterial sexually transmitted infection worldwide. To gain more insight into the epidemiology and transmission of C. trachomatis, several schemes of multilocus sequence typing (MLST) have been developed. We investigated the clustering of C. trachomatis strains derived from men who have sex with men (MSM) and heterosexuals using the MLST scheme based on 7 housekeeping genes (MLST-7) adapted for clinical specimens and a high-resolution MLST scheme based on 6 polymorphic genes, including ompA (hr-MLST-6). Specimens from 100 C. trachomatis infected men who have sex with men (MSM) and 100 heterosexual women were randomly selected from previous studies and sequenced. We adapted the MLST-7 scheme to a nested assay to be suitable for direct typing of clinical specimens. All selected specimens were typed using both the adapted MLST-7 scheme and the hr-MLST-6 scheme. Clustering of C. trachomatis strains derived from MSM and heterosexuals was assessed using minimum spanning tree analysis. Sufficient chlamydial DNA was present in 188 of the 200 (94 %) selected samples. Using the adapted MLST-7 scheme, full MLST profiles were obtained for 187 of 188 tested specimens resulting in a high success rate of 99.5 %. Of these 187 specimens, 91 (48.7 %) were from MSM and 96 (51.3 %) from heterosexuals. We detected 21 sequence types (STs) using the adapted MLST-7 and 79 STs using the hr-MLST-6 scheme. Minimum spanning tree analyses was used to examine the clustering of MLST-7 data, which showed no reflection of separate transmission in MSM and heterosexual hosts. Moreover, typing using the hr-MLST-6 scheme identified genetically related clusters within each of clusters that were identified by using the MLST-7 scheme. No distinct transmission of C. trachomatis could be observed in MSM and heterosexuals using the adapted MLST-7 scheme in contrast to using the hr-MLST-6. In addition, we compared clustering of both MLST schemes and demonstrated that typing using the hr-MLST-6 scheme is able to identify genetically related clusters of C. trachomatis strains within each of the clusters that were identified by using the MLST-7 scheme.
A Three-Dimensional RNA Motif in Potato spindle tuber viroid Mediates Trafficking from Palisade Mesophyll to Spongy Mesophyll in Nicotiana benthamiana[W

PubMed Central

Takeda, Ryuta; Petrov, Anton I.; Leontis, Neocles B.; Ding, Biao

2011-01-01

Cell-to-cell trafficking of RNA is an emerging biological principle that integrates systemic gene regulation, viral infection, antiviral response, and cell-to-cell communication. A key mechanistic question is how an RNA is specifically selected for trafficking from one type of cell into another type. Here, we report the identification of an RNA motif in Potato spindle tuber viroid (PSTVd) required for trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana leaves. This motif, called loop 6, has the sequence 5′-CGA-3′...5′-GAC-3′ flanked on both sides by cis Watson-Crick G/C and G/U wobble base pairs. We present a three-dimensional (3D) structural model of loop 6 that specifies all non-Watson-Crick base pair interactions, derived by isostericity-based sequence comparisons with 3D RNA motifs from the RNA x-ray crystal structure database. The model is supported by available chemical modification patterns, natural sequence conservation/variations in PSTVd isolates and related species, and functional characterization of all possible mutants for each of the loop 6 base pairs. Our findings and approaches have broad implications for studying the 3D RNA structural motifs mediating trafficking of diverse RNA species across specific cellular boundaries and for studying the structure-function relationships of RNA motifs in other biological processes. PMID:21258006
A three-dimensional RNA motif in Potato spindle tuber viroid mediates trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana.

PubMed

Takeda, Ryuta; Petrov, Anton I; Leontis, Neocles B; Ding, Biao

2011-01-01

Cell-to-cell trafficking of RNA is an emerging biological principle that integrates systemic gene regulation, viral infection, antiviral response, and cell-to-cell communication. A key mechanistic question is how an RNA is specifically selected for trafficking from one type of cell into another type. Here, we report the identification of an RNA motif in Potato spindle tuber viroid (PSTVd) required for trafficking from palisade mesophyll to spongy mesophyll in Nicotiana benthamiana leaves. This motif, called loop 6, has the sequence 5'-CGA-3'...5'-GAC-3' flanked on both sides by cis Watson-Crick G/C and G/U wobble base pairs. We present a three-dimensional (3D) structural model of loop 6 that specifies all non-Watson-Crick base pair interactions, derived by isostericity-based sequence comparisons with 3D RNA motifs from the RNA x-ray crystal structure database. The model is supported by available chemical modification patterns, natural sequence conservation/variations in PSTVd isolates and related species, and functional characterization of all possible mutants for each of the loop 6 base pairs. Our findings and approaches have broad implications for studying the 3D RNA structural motifs mediating trafficking of diverse RNA species across specific cellular boundaries and for studying the structure-function relationships of RNA motifs in other biological processes.
Listeria costaricensis sp. nov.

PubMed

Núñez-Montero, Kattia; Leclercq, Alexandre; Moura, Alexandra; Vales, Guillaume; Peraza, Johnny; Pizarro-Cerdá, Javier; Lecuit, Marc

2018-03-01

A bacterial strain isolated from a food processing drainage system in Costa Rica fulfilled the criteria as belonging to the genus Listeria, but could not be assigned to any of the known species. Phylogenetic analysis based on the 16S rRNA gene revealed highest sequence similarity with the type strain of Listeria floridensis (98.7 %). Phylogenetic analysis based on Listeria core genomes placed the novel taxon within the Listeria fleishmannii, L. floridensis and Listeria aquatica clade (Listeria sensu lato). Whole-genome sequence analyses based on the average nucleotide blast identity (ANI<80 %) indicated that this isolate belonged to a novel species. Results of pairwise amino acid identity (AAI>70 %) and percentage of conserved proteins (POCP>68 %) with currently known Listeria species, as well as of biochemical characterization, confirmed that the strain constituted a novel species within the genus Listeria. The name Listeria costaricensis sp. nov. is proposed for the novel species, and is represented by the type strain CLIP 2016/00682 T (=CIP 111400 T =DSM 105474 T ).
The Sequencing Bead Array (SBA), a Next-Generation Digital Suspension Array

PubMed Central

Akhras, Michael S.; Pettersson, Erik; Diamond, Lisa; Unemo, Magnus; Okamoto, Jennifer; Davis, Ronald W.; Pourmand, Nader

2013-01-01

Here we describe the novel Sequencing Bead Array (SBA), a complete assay for molecular diagnostics and typing applications. SBA is a digital suspension array using Next-Generation Sequencing (NGS), to replace conventional optical readout platforms. The technology allows for reducing the number of instruments required in a laboratory setting, where the same NGS instrument could be employed from whole-genome and targeted sequencing to SBA broad-range biomarker detection and genotyping. As proof-of-concept, a model assay was designed that could distinguish ten Human Papillomavirus (HPV) genotypes associated with cervical cancer progression. SBA was used to genotype 20 cervical tumor samples and, when compared with amplicon pyrosequencing, was able to detect two additional co-infections due to increased sensitivity. We also introduce in-house software Sphix, enabling easy accessibility and interpretation of results. The technology offers a multi-parallel, rapid, robust, and scalable system that is readily adaptable for a multitude of microarray diagnostic and typing applications, e.g. genetic signatures, single nucleotide polymorphisms (SNPs), structural variations, and immunoassays. SBA has the potential to dramatically change the way we perform probe-based applications, and allow for a smooth transition towards the technology offered by genomic sequencing. PMID:24116138
Toward rules relating zinc finger protein sequences and DNA binding site preferences.

PubMed

Desjarlais, J R; Berg, J M

1992-08-15

Zinc finger proteins of the Cys2-His2 type consist of tandem arrays of domains, where each domain appears to contact three adjacent base pairs of DNA through three key residues. We have designed and prepared a series of variants of the central zinc finger within the DNA binding domain of Sp1 by using information from an analysis of a large data base of zinc finger protein sequences. Through systematic variations at two of the three contact positions (underlined), relatively specific recognition of sequences of the form 5'-GGGGN(G or T)GGG-3' has been achieved. These results provide the basis for rules that may develop into a code that will allow the design of zinc finger proteins with preselected DNA site specificity.
The fungal composition of natural biofinishes on oil-treated wood.

PubMed

van Nieuwenhuijzen, Elke J; Houbraken, Jos A M P; Punt, Peter J; Roeselers, Guus; Adan, Olaf C G; Samson, Robert A

2017-01-01

Biofinished wood is considered to be a decorative and protective material for outdoor constructions, showing advantages compared to traditional treated wood in terms of sustainability and self-repair. Natural dark wood staining fungi are essential to biofinish formation on wood. Although all sorts of outdoor situated timber are subjected to fungal staining, the homogenous dark staining called biofinish has only been detected on specific vegetable oil-treated substrates. Revealing the fungal composition of various natural biofinishes on wood is a first step to understand and control biofinish formation for industrial application. A culture-based survey of fungi in natural biofinishes on oil-treated wood samples showed the common wood stain fungus Aureobasidium and the recently described genus Superstratomyces to be predominant constituents. A culture-independent approach, based on amplification of the internal transcribed spacer regions, cloning and Sanger sequencing, resulted in clone libraries of two types of biofinishes. Aureobasidium was present in both biofinish types, but was only predominant in biofinishes on pine sapwood treated with raw linseed oil. Most cloned sequences of the other biofinish type (pine sapwood treated with olive oil) could not be identified. In addition, a more in-depth overview of the fungal composition of biofinishes was obtained with Illumina amplicon sequencing that targeted the internal transcribed spacer region 1. All investigated samples, that varied in wood species, (oil) treatments and exposure times, contained Aureobasidium and this genus was predominant in the biofinishes on pine sapwood treated with raw linseed oil. Lapidomyces was the predominant genus in most of the other biofinishes and present in all other samples. Surprisingly, Superstratomyces , which was predominantly detected by the cultivation-based approach, could not be found with the Illumina sequencing approach, while Lapidomyces was not detected in the culture-based approach. Overall, the culture-based approach and two culture-independent methods that were used in this study revealed that natural biofinishes were composed of multiple fungal genera always containing the common wood staining mould Aureobasidium . Besides Aureobasidium , the use of other fungal genera for the production of biofinished wood has to be considered.
Molecular characterisation of extended-spectrum β-lactamase (ESBL)-producing Escherichia coli isolates from hospital and ambulatory patients in Germany.

PubMed

Pietsch, Michael; Eller, Christoph; Wendt, Constanze; Holfelder, Martin; Falgenhauer, Linda; Fruth, Angelika; Grössl, Tobias; Leistner, Rasmus; Valenza, Giuseppe; Werner, Guido; Pfeifer, Yvonne

2017-02-01

The increase of Escherichia coli producing extended-spectrum β-lactamases (ESBL) in hospitals and their emergence as intestinal colonisers of healthy humans is of concern. Transmission ways and the extent of spread of distinct E. coli clones or ESBL genes among humans and animals via the food chain or the environment is a matter of debate. In this study we determined ESBL genotypes in E. coli isolates (n=233) resistant to 3rd generation cephalosporins from hospitals and medical practices using PCR and sequencing. Bacterial strain typing was performed by PCR-based phylogrouping, multilocus sequence typing (MLST) and a ST131-specific PCR. Results showed that CTX-M-15 (50.4%), CTX-M-1 (28.4%) and CTX-M-14 (5.6%) were the most common ESBL types. Especially, CTX-M-15 was associated with E. coli ST131 of phylogenetic group B2, which was the dominant sequence type among our isolates (35.8%). MLST typing revealed 40 different sequence types (STs), with ST131, ST410, ST10 and ST38 as the most prevalent ones. Our findings give an overview of the current distribution of ESBL-producing E. coli isolates from humans in Germany. E. coli O25b:H4-ST131 was confirmed to be the most common clone, which is known for its successful dissemination worldwide. Although heterogeneity among the isolates was found, several successful clones previously described in animals (ST410, ST10) also occurred in our isolate collection. Further detailed investigations of ESBL-producing isolates from different habitats are needed to evaluate possible transfer ways. Copyright © 2015 Elsevier B.V. All rights reserved.
Genetic Variation and Its Reflection on Posttranslational Modifications in Frequency Clock and Mating Type a-1 Proteins in Sordaria fimicola

PubMed Central

Arif, Rabia; Akram, Faiza; Jamil, Tazeen; Lee, Siu Fai

2017-01-01

Posttranslational modifications (PTMs) occur in all essential proteins taking command of their functions. There are many domains inside proteins where modifications take place on side-chains of amino acids through various enzymes to generate different species of proteins. In this manuscript we have, for the first time, predicted posttranslational modifications of frequency clock and mating type a-1 proteins in Sordaria fimicola collected from different sites to see the effect of environment on proteins or various amino acids pickings and their ultimate impact on consensus sequences present in mating type proteins using bioinformatics tools. Furthermore, we have also measured and walked through genomic DNA of various Sordaria strains to determine genetic diversity by genotyping the short sequence repeats (SSRs) of wild strains of S. fimicola collected from contrasting environments of two opposing slopes (harsh and xeric south facing slope and mild north facing slope) of Evolution Canyon (EC), Israel. Based on the whole genome sequence of S. macrospora, we targeted 20 genomic regions in S. fimicola which contain short sequence repeats (SSRs). Our data revealed genetic variations in strains from south facing slope and these findings assist in the hypothesis that genetic variations caused by stressful environments lead to evolution. PMID:28717646
Genetic Variation and Its Reflection on Posttranslational Modifications in Frequency Clock and Mating Type a-1 Proteins in Sordaria fimicola.

PubMed

Arif, Rabia; Akram, Faiza; Jamil, Tazeen; Mukhtar, Hamid; Lee, Siu Fai; Saleem, Muhammad

2017-01-01

Posttranslational modifications (PTMs) occur in all essential proteins taking command of their functions. There are many domains inside proteins where modifications take place on side-chains of amino acids through various enzymes to generate different species of proteins. In this manuscript we have, for the first time, predicted posttranslational modifications of frequency clock and mating type a-1 proteins in Sordaria fimicola collected from different sites to see the effect of environment on proteins or various amino acids pickings and their ultimate impact on consensus sequences present in mating type proteins using bioinformatics tools. Furthermore, we have also measured and walked through genomic DNA of various Sordaria strains to determine genetic diversity by genotyping the short sequence repeats (SSRs) of wild strains of S. fimicola collected from contrasting environments of two opposing slopes (harsh and xeric south facing slope and mild north facing slope) of Evolution Canyon (EC), Israel. Based on the whole genome sequence of S. macrospora , we targeted 20 genomic regions in S. fimicola which contain short sequence repeats (SSRs). Our data revealed genetic variations in strains from south facing slope and these findings assist in the hypothesis that genetic variations caused by stressful environments lead to evolution.
A simple method for MR elastography: a gradient-echo type multi-echo sequence.

PubMed

Numano, Tomokazu; Mizuhara, Kazuyuki; Hata, Junichi; Washio, Toshikatsu; Homma, Kazuhiro

2015-01-01

To demonstrate the feasibility of a novel MR elastography (MRE) technique based on a conventional gradient-echo type multi-echo MR sequence which does not need additional bipolar magnetic field gradients (motion encoding gradient: MEG), yet is sensitive to vibration. In a gradient-echo type multi-echo MR sequence, several images are produced from each echo of the train with different echo times (TEs). If these echoes are synchronized with the vibration, each readout's gradient lobes achieve a MEG-like effect, and the later generated echo causes a greater MEG-like effect. The sequence was tested for the tissue-mimicking agarose gel phantoms and the psoas major muscles of healthy volunteers. It was confirmed that the readout gradient lobes caused an MEG-like effect and the later TE images had higher sensitivity to vibrations. The magnitude image of later generated echo suffered the T2 decay and the susceptibility artifacts, but the wave image and elastogram of later generated echo were unaffected by these effects. In in vivo experiments, this method was able to measure the mean shear modulus of the psoas major muscle. From the results of phantom experiments and volunteer studies, it was shown that this method has clinical application potential. Copyright © 2014 Elsevier Inc. All rights reserved.
Sequence of structures in fine-grained turbidites: Comparison of recent deep-sea and ancient flysch sediments

NASA Astrophysics Data System (ADS)

Stow, Dorrik A. V.; Shanmugam, Ganapathy

1980-01-01

A comparative study of the sequence of sedimentary structures in ancient and modern fine-grained turbidites is made in three contrasting areas. They are (1) Holocene and Pleistocene deep-sea muds of the Nova Scotian Slope and Rise, (2) Middle Ordovician Sevier Shale of the Valley and Ridge Province of the Southern Appalachians, and (3) Cambro-Ordovician Halifax Slate of the Meguma Group in Nova Scotia. A standard sequence of structures is proposed for fine-grained turbidites. The complete sequence has nine sub-divisions that are here termed T 0 to T 8. "The lower subdivision (T 0) comprises a silt lamina which has a sharp, scoured and load-cast base, internal parallel-lamination and cross-lamination, and a sharp current-lineated or wavy surface with 'fading-ripples' (= Type C etc. …)." (= Type C ripple-drift cross-lamination, Jopling and Walker, 1968). The overlying sequence shows textural and compositional grading through alternating silt and mud laminae. A convolute-laminated sub-division (T 1) is overlain by low-amplitude climbing ripples (T 2), thin regular laminae (T 3), thin indistinct laminae (T 4), and thin wipsy or convolute laminae (T 5). The topmost three divisions, graded mud (T 6), ungraded mud (T 7) and bioturbated mud (T 8), do not have silt laminae but rare patchy silt lenses and silt pseudonodules and a thin zone of micro-burrowing near the upper surface. The proposed sequence is analogous to the Bouma (1962) structural scheme for sandy turbidites and is approximately equivalent to Bouma's (C)DE divisions. The repetition of partial sequences characterizes different parts of the slope/base-of-slope/basin plain environment, and represents deposition from different stages of evolution of a large, muddy, turbidity flow. Microstructural detail and sequence are well preserved in ancient and even slightly metamorphosed sediments. Their recognition is important for determining depositional processes and for palaeoenvironmental interpretation.
Charting improvements in US registry HLA typing ambiguity using a typing resolution score.

PubMed

Paunić, Vanja; Gragert, Loren; Schneider, Joel; Müller, Carlheinz; Maiers, Martin

2016-07-01

Unrelated stem cell registries have been collecting HLA typing of volunteer bone marrow donors for over 25years. Donor selection for hematopoietic stem cell transplantation is based primarily on matching the alleles of donors and patients at five polymorphic HLA loci. As HLA typing technologies have continually advanced since the beginnings of stem cell transplantation, registries have accrued typings of varied HLA typing ambiguity. We present a new typing resolution score (TRS), based on the likelihood of self-match, that allows the systematic comparison of HLA typings across different methods, data sets and populations. We apply the TRS to chart improvement in HLA typing within the Be The Match Registry of the United States from the initiation of DNA-based HLA typing to the current state of high-resolution typing using next-generation sequencing technologies. In addition, we present a publicly available online tool for evaluation of any given HLA typing. This TRS objectively evaluates HLA typing methods and can help define standards for acceptable recruitment HLA typing. Copyright © 2016 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Multiple nucleotide preferences determine cleavage-site recognition by the HIV-1 and M-MuLV RNases H.

PubMed

Schultz, Sharon J; Zhang, Miaohua; Champoux, James J

2010-03-19

The RNase H activity of reverse transcriptase is required during retroviral replication and represents a potential target in antiviral drug therapies. Sequence features flanking a cleavage site influence the three types of retroviral RNase H activity: internal, DNA 3'-end-directed, and RNA 5'-end-directed. Using the reverse transcriptases of HIV-1 (human immunodeficiency virus type 1) and Moloney murine leukemia virus (M-MuLV), we evaluated how individual base preferences at a cleavage site direct retroviral RNase H specificity. Strong test cleavage sites (designated as between nucleotide positions -1 and +1) for the HIV-1 and M-MuLV enzymes were introduced into model hybrid substrates designed to assay internal or DNA 3'-end-directed cleavage, and base substitutions were tested at specific nucleotide positions. For internal cleavage, positions +1, -2, -4, -5, -10, and -14 for HIV-1 and positions +1, -2, -6, and -7 for M-MuLV significantly affected RNase H cleavage efficiency, while positions -7 and -12 for HIV-1 and positions -4, -9, and -11 for M-MuLV had more modest effects. DNA 3'-end-directed cleavage was influenced substantially by positions +1, -2, -4, and -5 for HIV-1 and positions +1, -2, -6, and -7 for M-MuLV. Cleavage-site distance from the recessed end did not affect sequence preferences for M-MuLV reverse transcriptase. Based on the identified sequence preferences, a cleavage site recognized by both HIV-1 and M-MuLV enzymes was introduced into a sequence that was otherwise resistant to RNase H. The isolated RNase H domain of M-MuLV reverse transcriptase retained sequence preferences at positions +1 and -2 despite prolific cleavage in the absence of the polymerase domain. The sequence preferences of retroviral RNase H likely reflect structural features in the substrate that favor cleavage and represent a novel specificity determinant to consider in drug design. Copyright (c) 2010 Elsevier Ltd. All rights reserved.
Long-term excretion of vaccine-derived poliovirus by a healthy child.

PubMed

Martín, Javier; Odoom, Kofi; Tuite, Gráinne; Dunn, Glynis; Hopewell, Nicola; Cooper, Gill; Fitzharris, Catherine; Butler, Karina; Hall, William W; Minor, Philip D

2004-12-01

A child was found to be excreting type 1 vaccine-derived poliovirus (VDPV) with a 1.1% sequence drift from Sabin type 1 vaccine strain in the VP1 coding region 6 months after he was immunized with oral live polio vaccine. Seventeen type 1 poliovirus isolates were recovered from stools taken from this child during the following 4 months. Contrary to expectation, the child was not deficient in humoral immunity and showed high levels of serum neutralization against poliovirus. Selected virus isolates were characterized in terms of their antigenic properties, virulence in transgenic mice, sensitivity for growth at high temperatures, and differences in nucleotide sequence from the Sabin type 1 strain. The VDPV isolates showed mutations at key nucleotide positions that correlated with the observed reversion to biological properties typical of wild polioviruses. A number of capsid mutations mapped at known antigenic sites leading to changes in the viral antigenic structure. Estimates of sequence evolution based on the accumulation of nucleotide changes in the VP1 coding region detected a "defective" molecular clock running at an apparent faster speed of 2.05% nucleotide changes per year versus 1% shown in previous studies. Remarkably, when compared to several type 1 VDPV strains of different origins, isolates from this child showed a much higher proportion of nonsynonymous versus synonymous nucleotide changes in the capsid coding region. This anomaly could explain the high VP1 sequence drift found and the ability of these virus strains to replicate in the gut for a longer period than expected.
‘Candidatus Phytoplasma palmicola’, a novel taxon associated with a lethal yellowing-type disease (LYD) of coconut (Cocos nucifera L.) in Mozambique

USDA-ARS?s Scientific Manuscript database

In this study, the taxonomic position and group classification of the phytoplasma associated with a lethal yellowing-type disease (LYD) of coconut (Cocos nucifera L.) in Mozambique were addressed. Pairwise sequence similarity values based on alignment of near full-length 16SrRNA genes (1530 bp) reve...
GARLIC: a bioinformatic toolkit for aetiologically connecting diseases and cell type-specific regulatory maps

PubMed Central

Nikolić, Miloš; Papantonis, Argyris

2017-01-01

Abstract Genome-wide association studies (GWAS) have emerged as a powerful tool to uncover the genetic basis of human common diseases, which often show a complex, polygenic and multi-factorial aetiology. These studies have revealed that 70–90% of all single nucleotide polymorphisms (SNPs) associated with common complex diseases do not occur within genes (i.e. they are non-coding), making the discovery of disease-causative genetic variants and the elucidation of the underlying pathological mechanisms far from straightforward. Based on emerging evidences suggesting that disease-associated SNPs are frequently found within cell type-specific regulatory sequences, here we present GARLIC (GWAS-based Prediction Toolkit for Connecting Diseases and Cell Types), a user-friendly, multi-purpose software with an associated database and online viewer that, using global maps of cis-regulatory elements, can aetiologically connect human diseases with relevant cell types. Additionally, GARLIC can be used to retrieve potential disease-causative genetic variants overlapping regulatory sequences of interest. Overall, GARLIC can satisfy several important needs within the field of medical genetics, thus potentially assisting in the ultimate goal of uncovering the elusive and complex genetic basis of common human disorders. PMID:28007912
A novel method for simultaneous Enterococcus species identification/typing and van genotyping by high resolution melt analysis.

PubMed

Gurtler, Volker; Grando, Danilla; Mayall, Barrie C; Wang, Jenny; Ghaly-Derias, Shahbano

2012-09-01

In order to develop a typing and identification method for van gene containing Enterococcus faecium, two multiplex PCR reactions were developed for use in HRM-PCR (High Resolution Melt-PCR): (i) vanA, vanB, vanC, vanC23 to detect van genes from different Enterococcus species; (ii) ISR (intergenic spacer region between the 16S and 23S rRNA genes) to detect all Enterococcus species and obtain species and isolate specific HRM curves. To test and validate the method three groups of isolates were tested: (i) 1672 Enterococcus species isolates from January 2009 to December 2009; (ii) 71 isolates previously identified and typed by PFGE (pulsed-field gel electrophoresis) and MLST (multi-locus sequence typing); and (iii) 18 of the isolates from (i) for which ISR sequencing was done. As well as successfully identifying 2 common genotypes by HRM from the Austin Hospital clinical isolates, this study analysed the sequences of all the vanB genes deposited in GenBank and developed a numerical classification scheme for the standardised naming of these vanB genotypes. The identification of Enterococcus faecalis from E. faecium was reliable and stable using ISR PCR. The typing of E. faecium by ISR PCR: (i) detected two variable peaks corresponding to different copy numbers of insertion sequences I and II corresponding to peak I and II respectively; (ii) produced 7 melt profiles for E. faecium with variable copy numbers of sequences I and II; (iii) demonstrated stability and instability of peak heights with equal frequency within the patient sample (36.4±4.5 days and 38.6±5.8 days respectively for 192 patients); (iv) detected ISR-HRM types with as much discrimination as PFGE and more than MLST; and (v) detected ISR-HRM types that differentiated some isolates that were identical by PFGE and MLST. In conjunction with the rapid and accurate van genotyping method described here, this ISR-HRM typing and identification method can be used as a stable identification and typing method with predictable instability based on recombination and concerted evolution of the rrn operon that will complement existing typing methods. Crown Copyright © 2012. Published by Elsevier B.V. All rights reserved.
Uncommonly isolated clinical Pseudomonas: identification and phylogenetic assignation.

PubMed

Mulet, M; Gomila, M; Ramírez, A; Cardew, S; Moore, E R B; Lalucat, J; García-Valdés, E

2017-02-01

Fifty-two Pseudomonas strains that were difficult to identify at the species level in the phenotypic routine characterizations employed by clinical microbiology laboratories were selected for genotypic-based analysis. Species level identifications were done initially by partial sequencing of the DNA dependent RNA polymerase sub-unit D gene (rpoD). Two other gene sequences, for the small sub-unit ribosonal RNA (16S rRNA) and for DNA gyrase sub-unit B (gyrB) were added in a multilocus sequence analysis (MLSA) study to confirm the species identifications. These sequences were analyzed with a collection of reference sequences from the type strains of 161 Pseudomonas species within an in-house multi-locus sequence analysis database. Whole-cell matrix-assisted laser-desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) analyses of these strains complemented the DNA sequenced-based phylogenetic analyses and were observed to be in accordance with the results of the sequence data. Twenty-three out of 52 strains were assigned to 12 recognized species not commonly detected in clinical specimens and 29 (56 %) were considered representatives of at least ten putative new species. Most strains were distributed within the P. fluorescens and P. aeruginosa lineages. The value of rpoD sequences in species-level identifications for Pseudomonas is emphasized. The correct species identifications of clinical strains is essential for establishing the intrinsic antibiotic resistance patterns and improved treatment plans.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.