Science.gov

Sample records for acid sequence diversity

  1. Prebiotically plausible mechanisms increase compositional diversity of nucleic acid sequences

    PubMed Central

    Derr, Julien; Manapat, Michael L.; Rajamani, Sudha; Leu, Kevin; Xulvi-Brunet, Ramon; Joseph, Isaac; Nowak, Martin A.; Chen, Irene A.

    2012-01-01

    During the origin of life, the biological information of nucleic acid polymers must have increased to encode functional molecules (the RNA world). Ribozymes tend to be compositionally unbiased, as is the vast majority of possible sequence space. However, ribonucleotides vary greatly in synthetic yield, reactivity and degradation rate, and their non-enzymatic polymerization results in compositionally biased sequences. While natural selection could lead to complex sequences, molecules with some activity are required to begin this process. Was the emergence of compositionally diverse sequences a matter of chance, or could prebiotically plausible reactions counter chemical biases to increase the probability of finding a ribozyme? Our in silico simulations using a two-letter alphabet show that template-directed ligation and high concatenation rates counter compositional bias and shift the pool toward longer sequences, permitting greater exploration of sequence space and stable folding. We verified experimentally that unbiased DNA sequences are more efficient templates for ligation, thus increasing the compositional diversity of the pool. Our work suggests that prebiotically plausible chemical mechanisms of nucleic acid polymerization and ligation could predispose toward a diverse pool of longer, potentially structured molecules. Such mechanisms could have set the stage for the appearance of functional activity very early in the emergence of life. PMID:22319215

  2. tax and rex Sequences of bovine leukaemia virus from globally diverse isolates: rex amino acid sequence more variable than tax.

    PubMed

    McGirr, K M; Buehring, G C

    2005-02-01

    Bovine leukaemia virus (BLV) is an important agricultural problem with high costs to the dairy industry. Here, we examine the variation of the tax and rex genes of BLV. The tax and rex genes share 420 bases and have overlapping reading frames. The tax gene encodes a protein that functions as a transactivator of the BLV promoter, is required for viral replication, acts on cellular promoters, and is responsible for oncogenesis. The rex facilitates the export of viral mRNAs from the nucleus and regulates transcription. We have sequenced five new isolates of the tax/rex gene. We examined the five new and three previously published tax/rex DNA and predicted amino acid sequences of BLV isolates from cattle in representative regions worldwide. The highest variation among nucleic acid sequences for tax and rex was 7% and 5%, respectively; among predicted amino acid sequences for Tax and Rex, 9% and 11%, respectively. Significantly more nucleotide changes resulted in predicted amino acid changes in the rex gene than in the tax gene (P < or = 0.0006). This variability is higher than previously reported for any region of the viral genome. This research may also have implications for the development of Tax-based vaccines. PMID:15702995

  3. A computer program for the estimation of protein and nucleic acid sequence diversity in random point mutagenesis libraries

    PubMed Central

    Volles, Michael J.; Lansbury, Peter T.

    2005-01-01

    A computer program for the generation and analysis of in silico random point mutagenesis libraries is described. The program operates by mutagenizing an input nucleic acid sequence according to mutation parameters specified by the user for each sequence position and type of point mutation. The program can mimic almost any type of random mutagenesis library, including those produced via error-prone PCR (ep-PCR), mutator Escherichia coli strains, chemical mutagenesis, and doped or random oligonucleotide synthesis. The program analyzes the generated nucleic acid sequences and/or the associated protein library to produce several estimates of library diversity (number of unique sequences, point mutations, and single point mutants) and the rate of saturation of these diversities during experimental screening or selection of clones. This information allows one to select the optimal screen size for a given mutagenesis library, necessary to efficiently obtain a certain coverage of the sequence-space. The program also reports the abundance of each specific protein mutation at each sequence position, which is useful as a measure of the level and type of mutation bias in the library. Alternatively, one can use the program to evaluate the relative merits of preexisting libraries, or to examine various hypothetical mutation schemes to determine the optimal method for creating a library that serves the screen/selection of interest. Simulated libraries of at least 109 sequences are accessible by the numerical algorithm with currently available personal computers; an analytical algorithm is also available which can rapidly calculate a subset of the numerical statistics in libraries of arbitrarily large size. A multi-type double-strand stochastic model of ep-PCR is developed in an appendix to demonstrate the applicability of the algorithm to amplifying mutagenesis procedures. Estimators of DNA polymerase mutation-type-specific error rates are derived using the model. Analyses of an

  4. Diverse Bacterial PKS Sequences Derived From Okadaic Acid-Producing Dinoflagellates

    PubMed Central

    Perez, Roberto; Liu, Li; Lopez, Jose; An, Tianying; Rein, Kathleen S.

    2008-01-01

    Okadaic acid (OA) and the related dinophysistoxins are isolated from dinoflagellates of the genus Prorocentrum and Dinophysis. Bacteria of the Roseobacter group have been associated with okadaic acid producing dinoflagellates and have been previously implicated in OA production. Analysis of 16S rRNA libraries reveals that Roseobacter are the most abundant bacteria associated with OA producing dinoflagellates of the genus Prorocentrum and are not found in association with non-toxic dinoflagellates. While some polyketide synthase (PKS) genes form a highly supported Prorocentrum clade, most appear to be bacterial, but unrelated to Roseobacter or Alpha-Proteobacterial PKSs or those derived from other Alveolates Karenia brevis or Crytosporidium parvum. PMID:18728765

  5. The sequence diversity and expression among genes of the folic acid biosynthesis pathway in industrial Saccharomyces strains.

    PubMed

    Goncerzewicz, Anna; Misiewicz, Anna

    2015-01-01

    Folic acid is an important vitamin in human nutrition and its deficiency in pregnant women's diets results in neural tube defects and other neurological damage to the fetus. Additionally, DNA synthesis, cell division and intestinal absorption are inhibited in case of adults. Since this discovery, governments and health organizations worldwide have made recommendations concerning folic acid supplementation of food for women planning to become pregnant. In many countries this has led to the introduction of fortifications, where synthetic folic acid is added to flour. It is known that Saccharomyces strains (brewing and bakers' yeast) are one of the main producers of folic acid and they can be used as a natural source of this vitamin. Proper selection of the most efficient strains may enhance the folate content in bread, fermented vegetables, dairy products and beer by 100% and may be used in the food industry. The objective of this study was to select the optimal producing yeast strain by determining the differences in nucleotide sequences in the FOL2, FOL3 and DFR1 genes of folic acid biosynthesis pathway. The Multitemperature Single Strand Conformation Polymorphism (MSSCP) method and further nucleotide sequencing for selected strains were applied to indicate SNPs in selected gene fragments. The RT qPCR technique was also applied to examine relative expression of the FOL3 gene. Furthermore, this is the first time ever that industrial yeast strains were analysed regarding genes of the folic acid biosynthesis pathway. It was observed that a correlation exists between the folic acid amount produced by industrial yeast strains and changes in the nucleotide sequence of adequate genes. The most significant changes occur in the DFR1 gene, mostly in the first part, which causes major protein structure modifications in KKP 232, KKP 222 and KKP 277 strains. Our study shows that the large amount of SNP contributes to impairment of the selected enzymes and S. cerevisiae and S

  6. High Genetic Diversity among Strains of the Unindustrialized Lactic Acid Bacterium Carnobacterium maltaromaticum in Dairy Products as Revealed by Multilocus Sequence Typing

    PubMed Central

    Rahman, Abdur; Cailliez-Grimal, Catherine; Bontemps, Cyril; Payot, Sophie; Chaillou, Stéphane; Revol-Junelles, Anne-Marie

    2014-01-01

    Dairy products are colonized with three main classes of lactic acid bacteria (LAB): opportunistic bacteria, traditional starters, and industrial starters. Most of the population structure studies were previously performed with LAB species belonging to these three classes and give interesting knowledge about the population structure of LAB at the stage where they are already industrialized. However, these studies give little information about the population structure of LAB prior their use as an industrial starter. Carnobacterium maltaromaticum is a LAB colonizing diverse environments, including dairy products. Since this bacterium was discovered relatively recently, it is not yet commercialized as an industrial starter, which makes C. maltaromaticum an interesting model for the study of unindustrialized LAB population structure in dairy products. A multilocus sequence typing scheme based on an analysis of fragments of the genes dapE, ddlA, glpQ, ilvE, pyc, pyrE, and leuS was applied to a collection of 47 strains, including 28 strains isolated from dairy products. The scheme allowed detecting 36 sequence types with a discriminatory index of 0.98. The whole population was clustered in four deeply branched lineages, in which the dairy strains were spread. Moreover, the dairy strains could exhibit a high diversity within these lineages, leading to an overall dairy population with a diversity level as high as that of the nondairy population. These results are in agreement with the hypothesis according to which the industrialization of LAB leads to a diversity reduction in dairy products. PMID:24747901

  7. Insertion Sequence Diversity in Archaea

    PubMed Central

    Filée, J.; Siguier, P.; Chandler, M.

    2007-01-01

    Insertion sequences (ISs) can constitute an important component of prokaryotic (bacterial and archaeal) genomes. Over 1,500 individual ISs are included at present in the ISfinder database (www-is.biotoul.fr), and these represent only a small portion of those in the available prokaryotic genome sequences and those that are being discovered in ongoing sequencing projects. In spite of this diversity, the transposition mechanisms of only a few of these ubiquitous mobile genetic elements are known, and these are all restricted to those present in bacteria. This review presents an overview of ISs within the archaeal kingdom. We first provide a general historical summary of the known properties and behaviors of archaeal ISs. We then consider how transposition might be regulated in some cases by small antisense RNAs and by termination codon readthrough. This is followed by an extensive analysis of the IS content in the sequenced archaeal genomes present in the public databases as of June 2006, which provides an overview of their distribution among the major archaeal classes and species. We show that the diversity of archaeal ISs is very great and comparable to that of bacteria. We compare archaeal ISs to known bacterial ISs and find that most are clearly members of families first described for bacteria. Several cases of lateral gene transfer between bacteria and archaea are clearly documented, notably for methanogenic archaea. However, several archaeal ISs do not have bacterial equivalents but can be grouped into Archaea-specific groups or families. In addition to ISs, we identify and list nonautonomous IS-derived elements, such as miniature inverted-repeat transposable elements. Finally, we present a possible scenario for the evolutionary history of ISs in the Archaea. PMID:17347521

  8. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  9. High speed nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.

  10. Castor Bean Organelle Genome Sequencing and Worldwide Genetic Diversity Analysis

    PubMed Central

    Chan, Agnes P.; Williams, Amber L.; Rice, Danny W.; Liu, Xinyue; Melake-Berhan, Admasu; Huot Creasy, Heather; Puiu, Daniela; Rosovitz, M. J.; Khouri, Hoda M.; Beckstrom-Sternberg, Stephen M.; Allan, Gerard J.; Keim, Paul; Ravel, Jacques; Rabinowicz, Pablo D.

    2011-01-01

    Castor bean is an important oil-producing plant in the Euphorbiaceae family. Its high-quality oil contains up to 90% of the unusual fatty acid ricinoleate, which has many industrial and medical applications. Castor bean seeds also contain ricin, a highly toxic Type 2 ribosome-inactivating protein, which has gained relevance in recent years due to biosafety concerns. In order to gain knowledge on global genetic diversity in castor bean and to ultimately help the development of breeding and forensic tools, we carried out an extensive chloroplast sequence diversity analysis. Taking advantage of the recently published genome sequence of castor bean, we assembled the chloroplast and mitochondrion genomes extracting selected reads from the available whole genome shotgun reads. Using the chloroplast reference genome we used the methylation filtration technique to readily obtain draft genome sequences of 7 geographically and genetically diverse castor bean accessions. These sequence data were used to identify single nucleotide polymorphism markers and phylogenetic analysis resulted in the identification of two major clades that were not apparent in previous population genetic studies using genetic markers derived from nuclear DNA. Two distinct sub-clades could be defined within each major clade and large-scale genotyping of castor bean populations worldwide confirmed previously observed low levels of genetic diversity and showed a broad geographic distribution of each sub-clade. PMID:21750729

  11. Diversity of amino acids in a typical chernozem of Moldova

    NASA Astrophysics Data System (ADS)

    Frunze, N. I.

    2014-12-01

    The content and composition of the amino acids in typical chernozems were studied. The objects of the study included a reference soil under an old fallow and three variants under fodder crop rotations: not fertilized, with mineral fertilizers, and with organic fertilizers. The contents of 18 amino acids were determined in these soils. The amino acids were extracted by the method of acid hydrolysis and identified by the method of ion-exchange chromatography. The total content of most of the amino acids was maximal in the reference soil; it was much lower in the cultivated soils and decreased in the following sequence: organic background > mineral background > no fertilization. The diversity of amino acids was evaluated quantitatively using different parameters applied in ecology for estimating various aspects of the species composition of communities (Simpson, Margalef, Menhinick, and Shannon's indices). The diversity and contribution of different amino acids to the total pool of amino acids also varied significantly in the studied variants. The maximum diversity of amino acids and maximum evenness of their relative abundance indices were typical of the reference chernozem; these parameters were lower in the cultivated soils. It was concluded that the changes in the structure of the amino acids under the impact of agricultural loads are similar to those that are usually observed under stress conditions.

  12. Chip-based sequencing nucleic acids

    DOEpatents

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  13. Distinguishing Proteins From Arbitrary Amino Acid Sequences

    PubMed Central

    Yau, Stephen S.-T.; Mao, Wei-Guang; Benson, Max; He, Rong Lucy

    2015-01-01

    What kinds of amino acid sequences could possibly be protein sequences? From all existing databases that we can find, known proteins are only a small fraction of all possible combinations of amino acids. Beginning with Sanger's first detailed determination of a protein sequence in 1952, previous studies have focused on describing the structure of existing protein sequences in order to construct the protein universe. No one, however, has developed a criteria for determining whether an arbitrary amino acid sequence can be a protein. Here we show that when the collection of arbitrary amino acid sequences is viewed in an appropriate geometric context, the protein sequences cluster together. This leads to a new computational test, described here, that has proved to be remarkably accurate at determining whether an arbitrary amino acid sequence can be a protein. Even more, if the results of this test indicate that the sequence can be a protein, and it is indeed a protein sequence, then its identity as a protein sequence is uniquely defined. We anticipate our computational test will be useful for those who are attempting to complete the job of discovering all proteins, or constructing the protein universe. PMID:25609314

  14. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  15. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  16. Conformational diversity and the emergence of sequence signatures during evolution.

    PubMed

    Parisi, Gustavo; Zea, Diego Javier; Monzon, Alexander Miguel; Marino-Buslje, Cristina

    2015-06-01

    Proteins' native structure is an ensemble of conformers in equilibrium, including all their respective functional states and intermediates. The induced-fit first and the pre-equilibrium theories later, described how structural changes are required to explain the allosteric and cooperative behaviours in proteins, which are key to protein function. The conformational ensemble concept has become a key tool in explaining an endless list of essential protein properties such as function, enzyme and antibody promiscuity, signal transduction, protein-protein recognition, origin of diseases, origin of new protein functions, evolutionary rate and order-disorder transitions, among others. Conformational diversity is encoded by the amino acid sequence and such a signature can be evidenced through evolutionary studies as evolutionary rate, conservation and coevolution. PMID:25749052

  17. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

    PubMed

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D; Adir, Noam

    2016-06-28

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel. PMID:27307442

  18. Sequence diversity of wheat mosaic virus isolates.

    PubMed

    Stewart, Lucy R

    2016-02-01

    Wheat mosaic virus (WMoV), transmitted by eriophyid wheat curl mites (Aceria tosichella) is the causal agent of High Plains disease in wheat and maize. WMoV and other members of the genus Emaravirus evaded thorough molecular characterization for many years due to the experimental challenges of mite transmission and manipulating multisegmented negative sense RNA genomes. Recently, the complete genome sequence of a Nebraska isolate of WMoV revealed eight segments, plus a variant sequence of the nucleocapsid protein-encoding segment. Here, near-complete and partial consensus sequences of five more WMoV isolates are reported and compared to the Nebraska isolate: an Ohio maize isolate (GG1), a Kansas barley isolate (KS7), and three Ohio wheat isolates (H1, K1, W1). Results show two distinct groups of WMoV isolates: Ohio wheat isolate RNA segments had 84% or lower nucleotide sequence identity to the NE isolate, whereas GG1 and KS7 had 98% or higher nucleotide sequence identity to the NE isolate. Knowledge of the sequence variability of WMoV isolates is a step toward understanding virus biology, and potentially explaining observed biological variation. PMID:26590326

  19. INCREASING THE DIVERSITY OF EST SEQUENCES FOR FRAGARIA

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This project aims to substantially increase the amount of strawberry expressed sequence tag (EST) data available to the community, and increase the diversity of EST sequences for the family Rosaceae. Currently, there are approximately 19,000 Fragaria ESTs in GenBank, 50% of which have been generated...

  20. Diversity of Microbial Sialic Acid Metabolism

    PubMed Central

    Vimr, Eric R.; Kalivoda, Kathryn A.; Deszo, Eric L.; Steenbergen, Susan M.

    2004-01-01

    Sialic acids are structurally unique nine-carbon keto sugars occupying the interface between the host and commensal or pathogenic microorganisms. An important function of host sialic acid is to regulate innate immunity, and microbes have evolved various strategies for subverting this process by decorating their surfaces with sialylated oligosaccharides that mimic those of the host. These subversive strategies include a de novo synthetic pathway and at least two truncated pathways that depend on scavenging host-derived intermediates. A fourth strategy involves modification of sialidases so that instead of transferring sialic acid to water (hydrolysis), a second active site is created for binding alternative acceptors. Sialic acids also are excellent sources of carbon, nitrogen, energy, and precursors of cell wall biosynthesis. The catabolic strategies for exploiting host sialic acids as nutritional sources are as diverse as the biosynthetic mechanisms, including examples of horizontal gene transfer and multiple transport systems. Finally, as compounds coating the surfaces of virtually every vertebrate cell, sialic acids provide information about the host environment that, at least in Escherichia coli, is interpreted by the global regulator encoded by nanR. In addition to regulating the catabolism of sialic acids through the nan operon, NanR controls at least two other operons of unknown function and appears to participate in the regulation of type 1 fimbrial phase variation. Sialic acid is, therefore, a host molecule to be copied (molecular mimicry), eaten (nutrition), and interpreted (cell signaling) by diverse metabolic machinery in all major groups of mammalian pathogens and commensals. PMID:15007099

  1. Estimating the diversity of peptide populations from limited sequence data.

    SciTech Connect

    Makowski, L.; Soares, A.; Biosciences Division; BNL

    2003-03-01

    Combinatorial libraries of peptides such as those displayed on the surface of a bacteriophage particle have become widely used tools for characterizing protein-protein and protein-small molecule interactions. The quality of a library frequently depends on its completeness, or diversity -- the proportion of possible sequences actually present in the library. The diversity of these libraries is frequently quoted on the basis of phage titers that provide little information about their completeness. Here, an analytical expression for diversity is introduced and a method for estimating the diversity of a peptide library from the sequences of a limited number of the members of the library is demonstrated. The diversities of a number of computationally constructed and actual peptide libraries are estimated using this method.

  2. Sequences Of Amino Acids For Human Serum Albumin

    NASA Technical Reports Server (NTRS)

    Carter, Daniel C.

    1992-01-01

    Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.

  3. Phenolic acid esterases, coding sequences and methods

    DOEpatents

    Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.

    2002-01-01

    Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.

  4. Sequence analysis and genetic diversity of five new Indian isolates of cucumber mosaic virus.

    PubMed

    Kumar, S; Gautam, K K; Raj, S K

    2015-12-01

    Cucumber mosaic virus (CMV) is an important virus since it causes severe losses to many economically important crops worldwide. Five new isolates of CMV were isolated from naturally infected Hippeastrum hybridum, Dahlia pinnata, Hemerocallis fulva, Acorus calamus and Typhonium trilobatum plants, all exhibiting severe leaf mosaic symptoms. For molecular identification and sequence analyses, the complete coat protein (CP) gene of these isolates was amplified by RT-PCR. The resulting amplicons were cloned and sequenced and isolates were designated as HH (KP698590), DP (JF682239), HF (KP698589), AC (KP698588) and TT (JX570732). For study of genetic diversity among these isolates, the sequence data were analysed by BLASTn, multiple alignment and generating phylogenetic trees along with the respective sequences of other CMV isolates available in GenBank Database were done. The isolates under study showed 82-99% sequence diversity among them at nucleotide and amino acid levels; however they showed close relationships with CMV isolates of subgroup IB. In alignment analysis of amino acid sequences of HH and AC isolates, we have found fifteen and twelve unique substitutions, compared to HF, DP and TT isolates, suggesting the cause of high genetic diversity. PMID:26666188

  5. Amino-Acid Sequence of Porcine Pepsin

    PubMed Central

    Tang, J.; Sepulveda, P.; Marciniszyn, J.; Chen, K. C. S.; Huang, W-Y.; Tao, N.; Liu, D.; Lanier, J. P.

    1973-01-01

    As the culmination of several years of experiments, we propose a complete amino-acid sequence for porcine pepsin, an enzyme containing 327 amino-acid residues in a single polypeptide chain. In the sequence determination, the enzyme was treated with cyanogen bromide. Five resulting fragments were purified. The amino-acid sequence of four of the fragments accounted for 290 residues. Because the structure of a 37-residue carboxyl-terminal fragment was already known, it was not studied. The alignment of these fragments was determined from the sequence of methionyl-peptides we had previously reported. We also discovered the locations of activesite aspartyl residues, as well as the pairing of the three disulfide bridges. A minor component of commercial crystalline pepsin was found to contain two extra amino-acid residues, Ala-Leu-, at the amino-terminus of the molecule. This minor component was apparently derived from a different site of cleavage during the activation of porcine pepsinogen. PMID:4587252

  6. Genetic diversity of ITS sequences of Bursaphelenchus xylophilus.

    PubMed

    Cardoso, J M S; Fonseca, L; Abrantes, I

    2012-01-01

    The sequence variation of internal transcribed spacer (ITS) regions of ribosomal DNA has been routinely used for species identification and species-level phylogeny of the pinewood nematode, Bursaphelenchus xylophilus. In this study, the intraspecies ITS genetic diversity of B. xylophilus was evaluated. Three pinewood nematode isolates from the United States, Japan, and Portugal were used for polymerase chain reaction (PCR) ITS region amplification and sequencing. Multiple peaks were observed in sequencing chromatograms from ITS regions of American and Japanese isolates, suggesting the presence of more than one ribosomal sequence for each isolate. PCR products were further cloned and 10 clones of each isolate were subsequently sequenced. Additionally, the ITS regions of individual nematodes from each isolate were amplified, cloned and sequenced. Among the 3 B. xylophilus isolates analyzed, an intraspecific and intra-isolate molecular variability was found. The intra-isolate ITS molecular diversity in the American isolate was higher than that in the Japanese and Portuguese isolates. However, the level of sequence variation observed within isolates was about the same as that described among ITS repeats within individuals. PMID:23096915

  7. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

  8. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-07-21

    A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

  9. Next Generation Sequencing Reveals the Hidden Diversity of Zooplankton Assemblages

    PubMed Central

    Harmer, Rachel A.; Somerfield, Paul J.; Atkinson, Angus

    2013-01-01

    Background Zooplankton play an important role in our oceans, in biogeochemical cycling and providing a food source for commercially important fish larvae. However, difficulties in correctly identifying zooplankton hinder our understanding of their roles in marine ecosystem functioning, and can prevent detection of long term changes in their community structure. The advent of massively parallel next generation sequencing technology allows DNA sequence data to be recovered directly from whole community samples. Here we assess the ability of such sequencing to quantify richness and diversity of a mixed zooplankton assemblage from a productive time series site in the Western English Channel. Methodology/Principle Findings Plankton net hauls (200 µm) were taken at the Western Channel Observatory station L4 in September 2010 and January 2011. These samples were analysed by microscopy and metagenetic analysis of the 18S nuclear small subunit ribosomal RNA gene using the 454 pyrosequencing platform. Following quality control a total of 419,041 sequences were obtained for all samples. The sequences clustered into 205 operational taxonomic units using a 97% similarity cut-off. Allocation of taxonomy by comparison with the National Centre for Biotechnology Information database identified 135 OTUs to species level, 11 to genus level and 1 to order, <2.5% of sequences were classified as unknowns. By comparison a skilled microscopic analyst was able to routinely enumerate only 58 taxonomic groups. Conclusions Metagenetics reveals a previously hidden taxonomic richness, especially for Copepoda and hard-to-identify meroplankton such as Bivalvia, Gastropoda and Polychaeta. It also reveals rare species and parasites. We conclude that Next Generation Sequencing of 18S amplicons is a powerful tool for elucidating the true diversity and species richness of zooplankton communities. While this approach allows for broad diversity assessments of plankton it may become increasingly

  10. Methods for analyzing nucleic acid sequences

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support Fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  11. Dynamics of immunoglobulin sequence diversity in HIV-1 infected individuals.

    PubMed

    Hoehn, Kenneth B; Gall, Astrid; Bashford-Rogers, Rachael; Fidler, S J; Kaye, S; Weber, J N; McClure, M O; Kellam, Paul; Pybus, Oliver G

    2015-09-01

    Advances in immunoglobulin (Ig) sequencing technology are leading to new perspectives on immune system dynamics. Much research in this nascent field has focused on resolving immune responses to viral infection. However, the dynamics of B-cell diversity in early HIV infection, and in response to anti-retroviral therapy, are still poorly understood. Here, we investigate these dynamics through bulk Ig sequencing of samples collected over 2 years from a group of eight HIV-1 infected patients, five of whom received anti-retroviral therapy during the first half of the study period. We applied previously published methods for visualizing and quantifying B-cell sequence diversity, including the Gini index, and compared their efficacy to alternative measures. While we found significantly greater clonal structure in HIV-infected patients versus healthy controls, within HIV patients, we observed no significant relationships between statistics of B-cell clonal expansion and clinical variables such as viral load and CD4(+) count. Although there are many potential explanations for this, we suggest that important factors include poor sampling resolution and complex B-cell dynamics that are difficult to summarize using simple summary statistics. Importantly, we find a significant association between observed Gini indices and sequencing read depth, and we conclude that more robust analytical methods and a closer integration of experimental and theoretical work is needed to further our understanding of B-cell repertoire diversity during viral infection. PMID:26194755

  12. Dynamics of immunoglobulin sequence diversity in HIV-1 infected individuals

    PubMed Central

    Hoehn, Kenneth B.; Gall, Astrid; Bashford-Rogers, Rachael; Fidler, S. J.; Kaye, S.; Weber, J. N.; McClure, M. O.; Kellam, Paul; Pybus, Oliver G.

    2015-01-01

    Advances in immunoglobulin (Ig) sequencing technology are leading to new perspectives on immune system dynamics. Much research in this nascent field has focused on resolving immune responses to viral infection. However, the dynamics of B-cell diversity in early HIV infection, and in response to anti-retroviral therapy, are still poorly understood. Here, we investigate these dynamics through bulk Ig sequencing of samples collected over 2 years from a group of eight HIV-1 infected patients, five of whom received anti-retroviral therapy during the first half of the study period. We applied previously published methods for visualizing and quantifying B-cell sequence diversity, including the Gini index, and compared their efficacy to alternative measures. While we found significantly greater clonal structure in HIV-infected patients versus healthy controls, within HIV patients, we observed no significant relationships between statistics of B-cell clonal expansion and clinical variables such as viral load and CD4+ count. Although there are many potential explanations for this, we suggest that important factors include poor sampling resolution and complex B-cell dynamics that are difficult to summarize using simple summary statistics. Importantly, we find a significant association between observed Gini indices and sequencing read depth, and we conclude that more robust analytical methods and a closer integration of experimental and theoretical work is needed to further our understanding of B-cell repertoire diversity during viral infection. PMID:26194755

  13. Sequence Programmable Peptoid Polymers for Diverse Materials Applications.

    PubMed

    Knight, Abigail S; Zhou, Effie Y; Francis, Matthew B; Zuckermann, Ronald N

    2015-10-14

    Polymer sequence programmability is required for the diverse structures and complex properties that are achieved by native biological polymers, but efforts towards controlling the sequence of synthetic polymers are, by comparison, still in their infancy. Traditional polymers provide robust and chemically diverse materials, but synthetic control over their monomer sequences is limited. The modular and step-wise synthesis of peptoid polymers, on the other hand, allows for precise control over the monomer sequences, affording opportunities for these chains to fold into well-defined nanostructures. Hundreds of different side chains have been incorporated into peptoid polymers using efficient reaction chemistry, allowing for a seemingly infinite variety of possible synthetically accessible polymer sequences. Combinatorial discovery techniques have allowed the identification of functional polymers within large libraries of peptoids, and newly developed theoretical modeling tools specifically adapted for peptoids enable the future design of polymers with desired functions. Work towards controlling the three-dimensional structure of peptoids, from the conformation of the amide bond to the formation of protein-like tertiary structure, has and will continue to enable the construction of tunable and innovative nanomaterials that bridge the gap between natural and synthetic polymers. PMID:25855478

  14. Next generation sequencing technologies: tool to study avian virus diversity.

    PubMed

    Kapgate, S S; Barbuddhe, S B; Kumanan, K

    2015-03-01

    Increased globalisation, climatic changes and wildlife-livestock interface led to emergence of novel viral pathogens or zoonoses that have become serious concern to avian, animal and human health. High biodiversity and bird migration facilitate spread of the pathogen and provide reservoirs for emerging infectious diseases. Current classical diagnostic methods designed to be virus-specific or aim to be limited to group of viral agents, hinder identifying of novel viruses or viral variants. Recently developed approaches of next-generation sequencing (NGS) provide culture-independent methods that are useful for understanding viral diversity and discovery of novel virus, thereby enabling a better diagnosis and disease control. This review discusses the different possible steps of a NGS study utilizing sequence-independent amplification, high-throughput sequencing and bioinformatics approaches to identify novel avian viruses and their diversity. NGS lead to the identification of a wide range of new viruses such as picobirnavirus, picornavirus, orthoreovirus and avian gamma coronavirus associated with fulminating disease in guinea fowl and is also used in describing viral diversity among avian species. The review also briefly discusses areas of viral-host interaction and disease associated causalities with newly identified avian viruses. PMID:25790045

  15. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome.

    PubMed

    Pinto, Ameet J; Sharp, Jonathan O; Yoder, Michael J; Almstrand, Robert

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome. PMID:26769942

  16. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome

    PubMed Central

    Pinto, Ameet J.; Sharp, Jonathan O.; Yoder, Michael J.

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome. PMID:26769942

  17. Code-Time Diversity for Direct Sequence Spread Spectrum Systems

    PubMed Central

    Hassan, A. Y.

    2014-01-01

    Time diversity is achieved in direct sequence spread spectrum by receiving different faded delayed copies of the transmitted symbols from different uncorrelated channel paths when the transmission signal bandwidth is greater than the coherence bandwidth of the channel. In this paper, a new time diversity scheme is proposed for spread spectrum systems. It is called code-time diversity. In this new scheme, N spreading codes are used to transmit one data symbol over N successive symbols interval. The diversity order in the proposed scheme equals to the number of the used spreading codes N multiplied by the number of the uncorrelated paths of the channel L. The paper represents the transmitted signal model. Two demodulators structures will be proposed based on the received signal models from Rayleigh flat and frequency selective fading channels. Probability of error in the proposed diversity scheme is also calculated for the same two fading channels. Finally, simulation results are represented and compared with that of maximal ration combiner (MRC) and multiple-input and multiple-output (MIMO) systems. PMID:24982925

  18. Diverse nucleotide compositions and sequence fluctuation in Rubisco protein genes

    NASA Astrophysics Data System (ADS)

    Holden, Todd; Dehipawala, S.; Cheung, E.; Bienaime, R.; Ye, J.; Tremberger, G., Jr.; Schneider, P.; Lieberman, D.; Cheung, T.

    2011-10-01

    The Rubisco protein-enzyme is arguably the most abundance protein on Earth. The biology dogma of transcription and translation necessitates the study of the Rubisco genes and Rubisco-like genes in various species. Stronger correlation of fractal dimension of the atomic number fluctuation along a DNA sequence with Shannon entropy has been observed in the studied Rubisco-like gene sequences, suggesting a more diverse evolutionary pressure and constraints in the Rubisco sequences. The strategy of using metal for structural stabilization appears to be an ancient mechanism, with data from the porphobilinogen deaminase gene in Capsaspora owczarzaki and Monosiga brevicollis. Using the chi-square distance probability, our analysis supports the conjecture that the more ancient Rubisco-like sequence in Microcystis aeruginosa would have experienced very different evolutionary pressure and bio-chemical constraint as compared to Bordetella bronchiseptica, the two microbes occupying either end of the correlation graph. Our exploratory study would indicate that high fractal dimension Rubisco sequence would support high carbon dioxide rate via the Michaelis- Menten coefficient; with implication for the control of the whooping cough pathogen Bordetella bronchiseptica, a microbe containing a high fractal dimension Rubisco-like sequence (2.07). Using the internal comparison of chi-square distance probability for 16S rRNA (~ E-22) versus radiation repair Rec-A gene (~ E-05) in high GC content Deinococcus radiodurans, our analysis supports the conjecture that high GC content microbes containing Rubisco-like sequence are likely to include an extra-terrestrial origin, relative to Deinococcus radiodurans. Similar photosynthesis process that could utilize host star radiation would not compete with radiation resistant process from the biology dogma perspective in environments such as Mars and exoplanets.

  19. An intimate link between antimicrobial peptide sequence diversity and binding to essential components of bacterial membranes.

    PubMed

    Schmitt, Paulina; Rosa, Rafael D; Destoumieux-Garzón, Delphine

    2016-05-01

    Antimicrobial peptides and proteins (AMPs) are widespread in the living kingdom. They are key effectors of defense reactions and mediators of competitions between organisms. They are often cationic and amphiphilic, which favors their interactions with the anionic membranes of microorganisms. Several AMP families do not directly alter membrane integrity but rather target conserved components of the bacterial membranes in a process that provides them with potent and specific antimicrobial activities. Thus, lipopolysaccharides (LPS), lipoteichoic acids (LTA) and the peptidoglycan precursor Lipid II are targeted by a broad series of AMPs. Studying the functional diversity of immune effectors tells us about the essential residues involved in AMP mechanism of action. Marine invertebrates have been found to produce a remarkable diversity of AMPs. Molluscan defensins and crustacean anti-LPS factors (ALF) are diverse in terms of amino acid sequence and show contrasted phenotypes in terms of antimicrobial activity. Their activity is directed essentially against Gram-positive or Gram-negative bacteria due to their specific interactions with Lipid II or Lipid A, respectively. Through those interesting examples, we discuss here how sequence diversity generated throughout evolution informs us on residues required for essential molecular interaction at the bacterial membranes and subsequent antibacterial activity. Through the analysis of molecular variants having lost antibacterial activity or shaped novel functions, we also discuss the molecular bases of functional divergence in AMPs. This article is part of a Special Issue entitled: Antimicrobial peptides edited by Karl Lohner and Kai Hilpert. PMID:26498397

  20. Detection of nucleic acid sequences by invader-directed cleavage

    DOEpatents

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.

  1. Sequence Diversity, Reproductive Isolation and Species Concepts in Saccharomyces

    PubMed Central

    Liti, Gianni; Barton, David B. H.; Louis, Edward J.

    2006-01-01

    Using the biological species definition, yeasts of the genus Saccharomyces sensu stricto comprise six species and one natural hybrid. Previous work has shown that reproductive isolation between the species is due primarily to sequence divergence acted upon by the mismatch repair system and not due to major gene differences or chromosomal rearrangements. Sequence divergence through mismatch repair has also been shown to cause partial reproductive isolation among populations within a species. We have surveyed sequence variation in populations of Saccharomyces sensu stricto yeasts and measured meiotic sterility in hybrids. This allows us to determine the divergence necessary to produce the reproductive isolation seen among species. Rather than a sharp transition from fertility to sterility, which may have been expected, we find a smooth monotonic relationship between diversity and reproductive isolation, even as far as the well-accepted designations of S. paradoxus and S. cerevisiae as distinct species. Furthermore, we show that one species of Saccharomyces—S. cariocanus—differs from a population of S. paradoxus by four translocations, but not by sequence. There is molecular evidence of recent introgression from S. cerevisiae into the European population of S. paradoxus, supporting the idea that in nature the boundary between these species is fuzzy. PMID:16951060

  2. Correlation between fibroin amino acid sequence and physical silk properties.

    PubMed

    Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek

    2003-09-12

    The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet. PMID:12816957

  3. Next-Generation Sequencing Reveals Significant Bacterial Diversity of Botrytized Wine

    PubMed Central

    Bokulich, Nicholas A.; Joseph, C. M. Lucy; Allen, Greg; Benson, Andrew K.; Mills, David A.

    2012-01-01

    While wine fermentation has long been known to involve complex microbial communities, the composition and role of bacteria other than a select set of lactic acid bacteria (LAB) has often been assumed either negligible or detrimental. This study served as a pilot study for using barcoded amplicon next-generation sequencing to profile bacterial community structure in wines and grape musts, comparing the taxonomic depth achieved by sequencing two different domains of prokaryotic 16S rDNA (V4 and V5). This study was designed to serve two goals: 1) to empirically determine the most taxonomically informative 16S rDNA target region for barcoded amplicon sequencing of wine, comparing V4 and V5 domains of bacterial 16S rDNA to terminal restriction fragment length polymorphism (TRFLP) of LAB communities; and 2) to explore the bacterial communities of wine fermentation to better understand the biodiversity of wine at a depth previously unattainable using other techniques. Analysis of amplicons from the V4 and V5 provided similar views of the bacterial communities of botrytized wine fermentations, revealing a broad diversity of low-abundance taxa not traditionally associated with wine, as well as atypical LAB communities initially detected by TRFLP. The V4 domain was determined as the more suitable read for wine ecology studies, as it provided greater taxonomic depth for profiling LAB communities. In addition, targeted enrichment was used to isolate two species of Alphaproteobacteria from a finished fermentation. Significant differences in diversity between inoculated and uninoculated samples suggest that Saccharomyces inoculation exerts selective pressure on bacterial diversity in these fermentations, most notably suppressing abundance of acetic acid bacteria. These results determine the bacterial diversity of botrytized wines to be far higher than previously realized, providing further insight into the fermentation dynamics of these wines, and demonstrate the utility of next

  4. Phylogenetic diversity of insecticolous fusaria inferred from multilocus DNA sequence data and their molecular identification via FUSARIUM-ID and Fusarium MLST

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We constructed several multilocus Deoxyribonucleic acid (DNA) sequence datasets to assess the phylogenetic diversity of insecticolous fusaria, especially focusing on those housed in the Agricultural Research Service Collection of Entomopathogenic Fungi (ARSEF), and to facilitate molecular identifica...

  5. Hybridization and sequencing of nucleic acids using base pair mismatches

    DOEpatents

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  6. Structural and Functional Diversity of Acidic Scorpion Potassium Channel Toxins

    PubMed Central

    He, Ya-Wen; Pan, Na; Ding, Jiu-Ping; Cao, Zhi-Jian; Liu, Mai-Li; Li, Wen-Xin; Yi, Hong; Jiang, Ling; Wu, Ying-Liang

    2012-01-01

    Background Although the basic scorpion K+ channel toxins (KTxs) are well-known pharmacological tools and potential drug candidates, characterization the acidic KTxs still has the great significance for their potential selectivity towards different K+ channel subtypes. Unfortunately, research on the acidic KTxs has been ignored for several years and progressed slowly. Principal Findings Here, we describe the identification of nine new acidic KTxs by cDNA cloning and bioinformatic analyses. Seven of these toxins belong to three new α-KTx subfamilies (α-KTx28, α-KTx29, and α-KTx30), and two are new members of the known κ-KTx2 subfamily. ImKTx104 containing three disulfide bridges, the first member of the α-KTx28 subfamily, has a low sequence homology with other known KTxs, and its NMR structure suggests ImKTx104 adopts a modified cystine-stabilized α-helix-loop-β-sheet (CS-α/β) fold motif that has no apparent α-helixs and β-sheets, but still stabilized by three disulfide bridges. These newly described acidic KTxs exhibit differential pharmacological effects on potassium channels. Acidic scorpion toxin ImKTx104 was the first peptide inhibitor found to affect KCNQ1 channel, which is insensitive to the basic KTxs and is strongly associated with human cardiac abnormalities. ImKTx104 selectively inhibited KCNQ1 channel with a Kd of 11.69 µM, but was less effective against the basic KTxs-sensitive potassium channels. In addition to the ImKTx104 toxin, HeTx204 peptide, containing a cystine-stabilized α-helix-loop-helix (CS-α/α) fold scaffold motif, blocked both Kv1.3 and KCNQ1 channels. StKTx23 toxin, with a cystine-stabilized α-helix-loop-β-sheet (CS-α/β) fold motif, could inhibit Kv1.3 channel, but not the KCNQ1 channel. Conclusions/Significance These findings characterize the structural and functional diversity of acidic KTxs, and could accelerate the development and clinical use of acidic KTxs as pharmacological tools and potential drugs. PMID

  7. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of the sequence listing in accordance with the requirements in 37 CFR...

  8. In vivo generation of DNA sequence diversity for cellular barcoding

    PubMed Central

    Peikon, Ian D.; Gizatullina, Diana I.; Zador, Anthony M.

    2014-01-01

    Heterogeneity is a ubiquitous feature of biological systems. A complete understanding of such systems requires a method for uniquely identifying and tracking individual components and their interactions with each other. We have developed a novel method of uniquely tagging individual cells in vivo with a genetic ‘barcode’ that can be recovered by DNA sequencing. Our method is a two-component system comprised of a genetic barcode cassette whose fragments are shuffled by Rci, a site-specific DNA invertase. The system is highly scalable, with the potential to generate theoretical diversities in the billions. We demonstrate the feasibility of this technique in Escherichia coli. Currently, this method could be employed to track the dynamics of populations of microbes through various bottlenecks. Advances of this method should prove useful in tracking interactions of cells within a network, and/or heterogeneity within complex biological samples. PMID:25013177

  9. Mitochondrial sequences show diverse evolutionary histories of African hominoids.

    PubMed

    Gagneux, P; Wills, C; Gerloff, U; Tautz, D; Morin, P A; Boesch, C; Fruth, B; Hohmann, G; Ryder, O A; Woodruff, D S

    1999-04-27

    Phylogenetic trees for the four extant species of African hominoids are presented, based on mtDNA control region-1 sequences from 1,158 unique haplotypes. We include 83 new haplotypes of western chimpanzees and bonobos. Phylogenetic analysis of this enlarged database, which takes intraspecific geographic variability into account, reveals different patterns of evolution among species and great heterogeneity in species-level variation. Several chimpanzee and bonobo clades (and even single social groups) have retained substantially more mitochondrial variation than is seen in the entire human species. Among the 811 human haplotypes, those that branch off early are predominantly but not exclusively African. Neighbor joining trees provide strong evidence that eastern chimpanzee and human clades have experienced reduced effective population sizes, the latter apparently since the Homo sapiens-neanderthalensis split. Application of topiary pruning resolves ambiguities in the phylogenetic tree that are attributable to homoplasies in the data set. The diverse patterns of mtDNA sequence variation seen in today's hominoid taxa probably reflect historical differences in ecological plasticity, female-biased dispersal, range fragmentation over differing periods of time, and competition among social groups. These results are relevant to the origin of zoonotic diseases, including HIV-1, and call into question some aspects of the current taxonomic treatment and conservation management of gorillas and chimpanzees. PMID:10220421

  10. Predicting intrinsic disorder from amino acid sequence.

    PubMed

    Obradovic, Zoran; Peng, Kang; Vucetic, Slobodan; Radivojac, Predrag; Brown, Celeste J; Dunker, A Keith

    2003-01-01

    Blind predictions of intrinsic order and disorder were made on 42 proteins subsequently revealed to contain 9,044 ordered residues, 284 disordered residues in 26 segments of length 30 residues or less, and 281 disordered residues in 2 disordered segments of length greater than 30 residues. The accuracies of the six predictors used in this experiment ranged from 77% to 91% for the ordered regions and from 56% to 78% for the disordered segments. The average of the order and disorder predictions ranged from 73% to 77%. The prediction of disorder in the shorter segments was poor, from 25% to 66% correct, while the prediction of disorder in the longer segments was better, from 75% to 95% correct. Four of the predictors were composed of ensembles of neural networks. This enabled them to deal more efficiently with the large asymmetry in the training data through diversified sampling from the significantly larger ordered set and achieve better accuracy on ordered and long disordered regions. The exclusive use of long disordered regions for predictor training likely contributed to the disparity of the predictions on long versus short disordered regions, while averaging the output values over 61-residue windows to eliminate short predictions of order or disorder probably contributed to the even greater disparity for three of the predictors. This experiment supports the predictability of intrinsic disorder from amino acid sequence. PMID:14579347

  11. Metagenomic analyses reveal phylogenetic diversity of carboxypeptidase gene sequences in activated sludge of a wastewater treatment plant in Shanghai, China.

    PubMed

    Jin, Hao; Li, Bailin; Peng, Xu; Chen, Lanming

    2014-01-01

    Activated sludge of wastewater treatment plants carries a diverse microflora. However, up to 80-90 % of microorganisms in activated sludge cannot be cultured by current laboratory techniques, leaving an enzyme reservoir largely unexplored. In this study, we investigated carboxypeptidase diversity in activated sludge of a wastewater treatment plant in Shanghai, China, by a culture-independent metagenomic approach. Three sets of consensus degenerate hybrid oligonucleotide primers (CODEHOPs) targeting conserved domains of public carboxypeptidases have been designed to amplify carboxypeptidase gene sequences in the metagenomic DNA of activated sludge by PCR. The desired amplicons were evaluated by carboxypeptidase sequence clone libraries and phylogenetic analyses. We uncovered a significant diversity of carboxypeptidases present in the activated sludge. Deduced carboxypeptidase amino acid sequences (127-208 amino acids) were classified into three distinct clusters, α, β, and γ. Sequences belonging to clusters α and β shared 58-97 % identity to known carboxypeptidase sequences from diverse species, whereas sequences in the cluster γ were remarkably less related to public carboxypeptidase homologous in the GenBank database, strongly suggesting that novel carboxypeptidase families or microbial niches exist in the activated sludge. We also observed numerous carboxypeptidase sequences that were much closer to those from representative strains present in industrial and sewage treatment and bioremediation. Thermostable and halotolerant carboxypeptidase sequences were also detected in clusters α and β. Coexistence of various carboxypeptidases is evidence of a diverse microflora in the activated sludge, a feature suggesting a valuable gene resource to be further explored for biotechnology application. PMID:24860282

  12. Sequence diversity of small, round-structured viruses in the Norwalk virus group.

    PubMed Central

    Wang, J; Jiang, X; Madore, H P; Gray, J; Desselberger, U; Ando, T; Seto, Y; Oishi, I; Lew, J F; Green, K Y

    1994-01-01

    We have determined the nucleotide sequences of a highly conserved region of the RNA-dependent RNA polymerase of the prototype Snow Mountain agent (SMA) and of four other small, round-structured viruses (antigenically Norwalk virus [NV]-like or SMA-like) following reverse transcription-PCR amplification of viral RNA obtained from human stools. The stool samples were either from volunteers administered SMA or from sporadic cases and outbreaks of gastroenteritis that occurred in Japan and the United Kingdom between 1984 and 1992. The GLPSG and YGDD RNA polymerase motifs were in the proper locations in the sequences of the five SRSVs, but each sequence was distinct from the 8FIIa prototype NV sequence and from each other. Analysis of the sequences and reactivities in a new NV antigen enzyme-linked immunosorbent assay showed that the five viruses could be divided into two groups (serogroups) with NV and SMA, respectively, being the prototypes. The sequences of the capsid region and a nonstructural region (2C) were determined from one strain from each group. One virus (SRSV-KY-89/89/J), isolated in Japan and antigenically similar to the prototype NV (isolated 21 years earlier in Ohio), showed a remarkable level of sequence similarity to NV. KY-89 and the 8FIIa NV showed 87.2% nucleotide similarity over 2,516 continuous nucleotides amounting to 96 to 98.9% amino acid similarity in three distinct domains in two open reading frames. Between the prototype SMA and NV, the polymerase region showed 63% nucleotide and 59% amino acid similarity, respectively. Two other antigenically SMA-like isolates (SRSV-925/92/UK and SRSV-OTH-25/89/J), from the United Kingdom and Japan, showed 80% nucleotide and 88 to 92% amino acid similarity in the polymerase region to the prototype SMA isolated 16 and 13 years earlier in the United States. The capsid region of the antigenically SMA-like OTH-25 virus showed 53% nucleotide and 65% amino acid similarity to the prototype NV capsid region

  13. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2002-01-01

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  14. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2006-07-04

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  15. Kit for detecting nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2001-01-01

    A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the

  16. Marine protist diversity in European coastal waters and sediments as revealed by high-throughput sequencing.

    PubMed

    Massana, Ramon; Gobet, Angélique; Audic, Stéphane; Bass, David; Bittner, Lucie; Boutte, Christophe; Chambouvet, Aurélie; Christen, Richard; Claverie, Jean-Michel; Decelle, Johan; Dolan, John R; Dunthorn, Micah; Edvardsen, Bente; Forn, Irene; Forster, Dominik; Guillou, Laure; Jaillon, Olivier; Kooistra, Wiebe H C F; Logares, Ramiro; Mahé, Frédéric; Not, Fabrice; Ogata, Hiroyuki; Pawlowski, Jan; Pernice, Massimo C; Probert, Ian; Romac, Sarah; Richards, Thomas; Santini, Sébastien; Shalchian-Tabrizi, Kamran; Siano, Raffaele; Simon, Nathalie; Stoeck, Thorsten; Vaulot, Daniel; Zingone, Adriana; de Vargas, Colomban

    2015-10-01

    Although protists are critical components of marine ecosystems, they are still poorly characterized. Here we analysed the taxonomic diversity of planktonic and benthic protist communities collected in six distant European coastal sites. Environmental deoxyribonucleic acid (DNA) and ribonucleic acid (RNA) from three size fractions (pico-, nano- and micro/mesoplankton), as well as from dissolved DNA and surface sediments were used as templates for tag pyrosequencing of the V4 region of the 18S ribosomal DNA. Beta-diversity analyses split the protist community structure into three main clusters: picoplankton-nanoplankton-dissolved DNA, micro/mesoplankton and sediments. Within each cluster, protist communities from the same site and time clustered together, while communities from the same site but different seasons were unrelated. Both DNA and RNA-based surveys provided similar relative abundances for most class-level taxonomic groups. Yet, particular groups were overrepresented in one of the two templates, such as marine alveolates (MALV)-I and MALV-II that were much more abundant in DNA surveys. Overall, the groups displaying the highest relative contribution were Dinophyceae, Diatomea, Ciliophora and Acantharia. Also, well represented were Mamiellophyceae, Cryptomonadales, marine alveolates and marine stramenopiles in the picoplankton, and Monadofilosa and basal Fungi in sediments. Our extensive and systematic sequencing of geographically separated sites provides the most comprehensive molecular description of coastal marine protist diversity to date. PMID:26119494

  17. Solid phase sequencing of double-stranded nucleic acids

    DOEpatents

    Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

    2002-01-01

    This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probe comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.

  18. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    States, David J.

    2004-07-28

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  19. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    David J. States

    1998-08-01

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  20. Survey of duckweed diversity in Lake Chao and total fatty acid, triacylglycerol, profiles of representative strains.

    PubMed

    Tang, J; Li, Y; Ma, J; Cheng, J J

    2015-09-01

    Lemnaceae (duckweeds) are widely distributed aquatic flowering plants. Their high growth rate, starch content and suitability for bioremediation make them potential feedstock for biofuels. However, few natural duckweed resources have been investigated in China, and there is no information about total fatty acid (TFA) and triacylglycerol (TAG) composition of duckweeds from China. Here, the genetic diversity of a natural duckweed population collected from Lake Chao, China, was investigated using multilocus sequence typing (MLST). The 54 strains were categorised into four species in four genera, representing 12 distinct sequence types. Strains representing Lemna aequinoctialis and Spirodela polyrhiza were predominant. Interestingly, a surprisingly high degree of genetic diversification within L. aequinoctialis was observed. The four duckweed species revealed a uniform fatty acid composition, with three fatty acids, palmitic acid, linoleic acid and linolenic acid, accounting for more than 80% of the TFA. The TFA in biomass varied among species, ranging from 1.05% (of dry weight, DW) for L. punctata and S. polyrhiza to 1.62% for Wolffia globosa. The four duckweed species contained similar TAG contents, 0.02% mg · DW(-1). The fatty acid profiles of TAG were different from those of TFA, and also varied among the four species. The survey investigated the genetic diversity of duckweeds from Lake Chao, and provides an initial insight into TFA and TAG of four duckweed species, indicating that intraspecific and interspecific variations exist in the content and composition of both TFA and TAG in comparison with other studies. PMID:25950142

  1. The Effects of Microsatellite Selection on Linked Sequence Diversity

    PubMed Central

    Haasl, Ryan J.; Johnson, Ross C.; Payseur, Bret A.

    2014-01-01

    The genome-wide scan for selection is an important method for identifying loci involved in adaptive evolution. However, theory that underlies standard scans for selection assumes a simple mutation model. In particular, recurrent mutation of the selective target is not considered. Although this assumption is reasonable for single-nucleotide variants (SNVs), a microsatellite targeted by selection will reliably violate this assumption due to high mutation rate. Moreover, the mutation rate of microsatellites is generally high enough to ensure that recurrent mutation is pervasive rather than occasional. It is therefore unclear if positive selection targeting microsatellites can be detected using standard scanning statistics. Examples of functional variation at microsatellites underscore the significance of understanding the genomic effects of microsatellite selection. Here, we investigate the joint effects of selection and complex mutation on linked sequence diversity, comparing simulations of microsatellite selection and SNV-based selective sweeps. We find that selection on microsatellites is generally difficult to detect using popular summaries of the site frequency spectrum, and, under certain conditions, using popular methods such as the integrated haplotype statistic and SweepFinder. However, comparisons of the number of haplotypes (K) and segregating sites (S) often provide considerable power to detect selection on microsatellites. We apply this knowledge to a scan of autosomes in the human CEU population (CEPH population sampled from Utah). In addition to the most commonly reported targets of selection in European populations, we identify numerous novel genomic regions that bear highly anomalous haplotype configurations. Using one of these regions—intron 1 of MAGI2—as an example, we show that the anomalous configuration is coincident with a perfect CA repeat of length 22. We conclude that standard genome-wide scans will commonly fail to detect mutationally

  2. From Artificial Amino Acids to Sequence-Defined Targeted Oligoaminoamides.

    PubMed

    Morys, Stephan; Wagner, Ernst; Lächelt, Ulrich

    2016-01-01

    Artificial oligoamino acids with appropriate protecting groups can be used for the sequential assembly of oligoaminoamides on solid-phase. With the help of these oligoamino acids multifunctional nucleic acid (NA) carriers can be designed and produced in highly defined topologies. Here we describe the synthesis of the artificial oligoamino acid Fmoc-Stp(Boc3)-OH, the subsequent assembly into sequence-defined oligomers and the formulation of tumor-targeted plasmid DNA (pDNA) polyplexes. PMID:27436323

  3. Novel alpha-conotoxins identified by gene sequencing from cone snails native to Hainan, and their sequence diversity.

    PubMed

    Luo, Sulan; Zhangsun, Dongting; Zhang, Ben; Quan, Yaru; Wu, Yong

    2006-11-01

    Conotoxins (CTX) from the venom of marine cone snails (genus Conus) represent large families of proteins, which show a similar precursor organization with surprisingly conserved signal sequence of the precursor peptides, but highly diverse pharmacological activities. By using the conserved sequences found within the genes that encode the alpha-conotoxin precursors, a technique based on RT-PCR was used to identify, respectively, two novel peptides (LiC22, LeD2) from the two worm-hunting Conus species Conus lividus, and Conus litteratus, and one novel peptide (TeA21) from the snail-hunting Conus species Conus textile, all native to Hainan in China. The three peptides share an alpha4/7 subfamily alpha-conotoxins common cysteine pattern (CCX(4)CX(7)C, two disulfide bonds), which are competitive antagonists of nicotinic acetylcholine receptor (nAChRs). The cDNA of LiC22N encodes a precursor of 40 residues, including a propeptide of 19 residues and a mature peptide of 21 residues. The cDNA of LeD2N encodes a precursor of 41 residues, including a propeptide of 21 residues and a mature peptide of 16 residues with three additional Gly residues. The cDNA of TeA21N encodes a precursor of 38 residues, including a propeptide of 20 residues and a mature peptide of 17 residues with an additional residue Gly. The additional residue Gly of LeD2N and TeA21N is a prerequisite for the amidation of the preceding C-terminal Cys. All three sequences are processed at the common signal site -X-Arg- immediately before the mature peptide sequences. The properties of the alpha4/7 conotoxins known so far were discussed in detail. Phylogenetic analysis of the new conotoxins in the present study and the published homologue of alpha4/7 conotoxins from the other Conus species were performed systematically. Patterns of sequence divergence for the three regions of signal, proregion, and mature peptides, both nucleotide acids and residue substitutions in DNA and peptide levels, as well as Cys codon

  4. Detecting frame shifts by amino acid sequence comparison.

    PubMed

    Claverie, J M

    1993-12-20

    Various amino acid substitution scoring matrices are used in conjunction with local alignments programs to detect regions of similarity and infer potential common ancestry between proteins. The usual scoring schemes derive from the implicit hypothesis that related proteins evolve from a common ancestor by the accumulation of point mutations and that amino acids tend to be progressively substituted by others with similar properties. However, other frequent single mutation events, like nucleotide insertion or deletion and gene inversion, change the translation reading frame and cause previously encoded amino acid sequences to become unrecognizable at once. Here, I derive five new types of scoring matrix, each capable of detecting a specific frame shift (deletion, insertion and inversion in 3 frames) and use them with a regular local alignments program to detect amino acid sequences that may have derived from alternative reading frames of the same nucleotide sequence. Frame shifts are inferred from the sole comparison of the protein sequences. The five scoring matrices were used with the BLASTP program to compare all the protein sequences in the Swissprot database. Surprisingly, the searches revealed hundreds of highly significant frame shift matches, of which many are likely to represent sequencing errors. Others provide some evidence that frame shift mutations might be used in protein evolution as a way to create new amino acid sequences from pre-existing coding regions. PMID:7903399

  5. Segments of amino acid sequence similarity in beta-amylases.

    PubMed

    Friedberg, F; Rhodes, C

    1988-01-01

    In alpha-amylases from animals, plants and bacteria and in beta-amylases from plants and bacteria a number of segments exhibit amino acid sequence similarity specific to the alpha or to the beta type, respectively. In the case of the beta-amylases the similar sequence regions are extensive and they are disrupted only by short interspersed dissimilar regions. Close to the C terminus, however, no such sequence similarity exist. PMID:2464171

  6. Effects of Abiotic Factors on the Phylogenetic Diversity of Bacterial Communities in Acidic Thermal Springs▿

    PubMed Central

    Mathur, Jayanti; Bizzoco, Richard W.; Ellis, Dean G.; Lipson, David A.; Poole, Alexander W.; Levine, Richard; Kelley, Scott T.

    2007-01-01

    Acidic thermal springs offer ideal environments for studying processes underlying extremophile microbial diversity. We used a carefully designed comparative analysis of acidic thermal springs in Yellowstone National Park to determine how abiotic factors (chemistry and temperature) shape acidophile microbial communities. Small-subunit rRNA gene sequences were PCR amplified, cloned, and sequenced, by using evolutionarily conserved bacterium-specific primers, directly from environmental DNA extracted from Amphitheater Springs and Roaring Mountain sediment samples. Energy-dispersive X-ray spectroscopy, X-ray diffraction, and colorimetric assays were used to analyze sediment chemistry, while an optical emission spectrometer was used to evaluate water chemistry and electronic probes were used to measure the pH, temperature, and Eh of the spring waters. Phylogenetic-statistical analyses found exceptionally strong correlations between bacterial community composition and sediment mineral chemistry, followed by weaker but significant correlations with temperature gradients. For example, sulfur-rich sediment samples contained a high diversity of uncultured organisms related to Hydrogenobaculum spp., while iron-rich sediments were dominated by uncultured organisms related to a diverse array of gram-positive iron oxidizers. A detailed analysis of redox chemistry indicated that the available energy sources and electron acceptors were sufficient to support the metabolic potential of Hydrogenobaculum spp. and iron oxidizers, respectively. Principal-component analysis found that two factors explained 95% of the genetic diversity, with most of the variance attributable to mineral chemistry and a smaller fraction attributable to temperature. PMID:17220248

  7. Effects of abiotic factors on the phylogenetic diversity of bacterial communities in acidic thermal springs.

    PubMed

    Mathur, Jayanti; Bizzoco, Richard W; Ellis, Dean G; Lipson, David A; Poole, Alexander W; Levine, Richard; Kelley, Scott T

    2007-04-01

    Acidic thermal springs offer ideal environments for studying processes underlying extremophile microbial diversity. We used a carefully designed comparative analysis of acidic thermal springs in Yellowstone National Park to determine how abiotic factors (chemistry and temperature) shape acidophile microbial communities. Small-subunit rRNA gene sequences were PCR amplified, cloned, and sequenced, by using evolutionarily conserved bacterium-specific primers, directly from environmental DNA extracted from Amphitheater Springs and Roaring Mountain sediment samples. Energy-dispersive X-ray spectroscopy, X-ray diffraction, and colorimetric assays were used to analyze sediment chemistry, while an optical emission spectrometer was used to evaluate water chemistry and electronic probes were used to measure the pH, temperature, and E(h) of the spring waters. Phylogenetic-statistical analyses found exceptionally strong correlations between bacterial community composition and sediment mineral chemistry, followed by weaker but significant correlations with temperature gradients. For example, sulfur-rich sediment samples contained a high diversity of uncultured organisms related to Hydrogenobaculum spp., while iron-rich sediments were dominated by uncultured organisms related to a diverse array of gram-positive iron oxidizers. A detailed analysis of redox chemistry indicated that the available energy sources and electron acceptors were sufficient to support the metabolic potential of Hydrogenobaculum spp. and iron oxidizers, respectively. Principal-component analysis found that two factors explained 95% of the genetic diversity, with most of the variance attributable to mineral chemistry and a smaller fraction attributable to temperature. PMID:17220248

  8. Sequence Diversity in MIC6 Gene among Toxoplasma gondii Isolates from Different Hosts and Geographical Locations

    PubMed Central

    Li, Zhong-Yuan; Song, Hui-Qun; Chen, Jia; Zhu, Xing-Quan

    2015-01-01

    Toxoplasma gondii is an opportunistic protozoan parasite that can infect almost all warm-blooded animals including humans with a worldwide distribution. Micronemes play an important role in invasion process of T. gondii, associated with the attachment, motility, and host cell recognition. In this research, sequence diversity in microneme protein 6 (MIC6) gene among 16 T. gondii isolates from different hosts and geographical regions and 1 reference strain was examined. The results showed that the sequence of all the examined T. gondii strains was 1,050 bp in length, and their A + T content was between 45.7% and 46.1%. Sequence analysis presented 33 nucleotide mutation positions (0-1.1%), resulting in 23 amino acid substitutions (0-2.3%) aligned with T. gondii RH strain. Moreover, T. gondii strains representing the 3 classical genotypes (Type I, II, and III) were separated into different clusters based on the locus of MIC6 using phylogenetic analyses by Bayesian inference (BI), maximum parsimony (MP), and maximum likelihood (ML), but T. gondii strains belonging to ToxoDB #9 were separated into different clusters. Our results suggested that MIC6 gene is not a suitable marker for T. gondii population genetic studies. PMID:26174829

  9. Sequence Diversity in MIC6 Gene among Toxoplasma gondii Isolates from Different Hosts and Geographical Locations.

    PubMed

    Li, Zhong-Yuan; Song, Hui-Qun; Chen, Jia; Zhu, Xing-Quan

    2015-06-01

    Toxoplasma gondii is an opportunistic protozoan parasite that can infect almost all warm-blooded animals including humans with a worldwide distribution. Micronemes play an important role in invasion process of T. gondii, associated with the attachment, motility, and host cell recognition. In this research, sequence diversity in microneme protein 6 (MIC6) gene among 16 T. gondii isolates from different hosts and geographical regions and 1 reference strain was examined. The results showed that the sequence of all the examined T. gondii strains was 1,050 bp in length, and their A + T content was between 45.7% and 46.1%. Sequence analysis presented 33 nucleotide mutation positions (0-1.1%), resulting in 23 amino acid substitutions (0-2.3%) aligned with T. gondii RH strain. Moreover, T. gondii strains representing the 3 classical genotypes (Type I, II, and III) were separated into different clusters based on the locus of MIC6 using phylogenetic analyses by Bayesian inference (BI), maximum parsimony (MP), and maximum likelihood (ML), but T. gondii strains belonging to ToxoDB #9 were separated into different clusters. Our results suggested that MIC6 gene is not a suitable marker for T. gondii population genetic studies. PMID:26174829

  10. Meteoritic Amino Acids: Diversity in Compositions Reflects Parent Body Histories.

    PubMed

    Elsila, Jamie E; Aponte, José C; Blackmond, Donna G; Burton, Aaron S; Dworkin, Jason P; Glavin, Daniel P

    2016-06-22

    The analysis of amino acids in meteorites dates back over 50 years; however, it is only in recent years that research has expanded beyond investigations of a narrow set of meteorite groups (exemplified by the Murchison meteorite) into meteorites of other types and classes. These new studies have shown a wide diversity in the abundance and distribution of amino acids across carbonaceous chondrite groups, highlighting the role of parent body processes and composition in the creation, preservation, or alteration of amino acids. Although most chiral amino acids are racemic in meteorites, the enantiomeric distribution of some amino acids, particularly of the nonprotein amino acid isovaline, has also been shown to vary both within certain meteorites and across carbonaceous meteorite groups. Large l-enantiomeric excesses of some extraterrestrial protein amino acids (up to ∼60%) have also been observed in rare cases and point to nonbiological enantiomeric enrichment processes prior to the emergence of life. In this Outlook, we review these recent meteoritic analyses, focusing on variations in abundance, structural distributions, and enantiomeric distributions of amino acids and discussing possible explanations for these observations and the potential for future work. PMID:27413780

  11. Meteoritic Amino Acids: Diversity in Compositions Reflects Parent Body Histories

    PubMed Central

    2016-01-01

    The analysis of amino acids in meteorites dates back over 50 years; however, it is only in recent years that research has expanded beyond investigations of a narrow set of meteorite groups (exemplified by the Murchison meteorite) into meteorites of other types and classes. These new studies have shown a wide diversity in the abundance and distribution of amino acids across carbonaceous chondrite groups, highlighting the role of parent body processes and composition in the creation, preservation, or alteration of amino acids. Although most chiral amino acids are racemic in meteorites, the enantiomeric distribution of some amino acids, particularly of the nonprotein amino acid isovaline, has also been shown to vary both within certain meteorites and across carbonaceous meteorite groups. Large l-enantiomeric excesses of some extraterrestrial protein amino acids (up to ∼60%) have also been observed in rare cases and point to nonbiological enantiomeric enrichment processes prior to the emergence of life. In this Outlook, we review these recent meteoritic analyses, focusing on variations in abundance, structural distributions, and enantiomeric distributions of amino acids and discussing possible explanations for these observations and the potential for future work. PMID:27413780

  12. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  13. Global Genomic Diversity of Human Papillomavirus 6 Based on 724 Isolates and 190 Complete Genome Sequences

    PubMed Central

    Jelen, Mateja M.; Chen, Zigui; Kocjan, Boštjan J.; Burt, Felicity J.; Chan, Paul K. S.; Chouhy, Diego; Combrinck, Catharina E.; Coutlée, François; Estrade, Christine; Ferenczy, Alex; Fiander, Alison; Franco, Eduardo L.; Garland, Suzanne M.; Giri, Adriana A.; González, Joaquín Víctor; Gröning, Arndt; Heidrich, Kerstin; Hibbitts, Sam; Hošnjak, Lea; Luk, Tommy N. M.; Marinic, Karina; Matsukura, Toshihiko; Neumann, Anna; Oštrbenk, Anja; Picconi, Maria Alejandra; Richardson, Harriet; Sagadin, Martin; Sahli, Roland; Seedat, Riaz Y.; Seme, Katja; Severini, Alberto; Sinchi, Jessica L.; Smahelova, Jana; Tabrizi, Sepehr N.; Tachezy, Ruth; Tohme, Sarah; Uloza, Virgilijus; Vitkauskiene, Astra; Wong, Yong Wee; Židovec Lepej, Snježana; Burk, Robert D.

    2014-01-01

    ABSTRACT Human papillomavirus type 6 (HPV6) is the major etiological agent of anogenital warts and laryngeal papillomas and has been included in both the quadrivalent and nonavalent prophylactic HPV vaccines. This study investigated the global genomic diversity of HPV6, using 724 isolates and 190 complete genomes from six continents, and the association of HPV6 genomic variants with geographical location, anatomical site of infection/disease, and gender. Initially, a 2,800-bp E5a-E5b-L1-LCR fragment was sequenced from 492/530 (92.8%) HPV6-positive samples collected for this study. Among them, 130 exhibited at least one single nucleotide polymorphism (SNP), indel, or amino acid change in the E5a-E5b-L1-LCR fragment and were sequenced in full. A global alignment and maximum likelihood tree of 190 complete HPV6 genomes (130 fully sequenced in this study and 60 obtained from sequence repositories) revealed two variant lineages, A and B, and five B sublineages: B1, B2, B3, B4, and B5. HPV6 (sub)lineage-specific SNPs and a 960-bp representative region for whole-genome-based phylogenetic clustering within the L2 open reading frame were identified. Multivariate logistic regression analysis revealed that lineage B predominated globally. Sublineage B3 was more common in Africa and North and South America, and lineage A was more common in Asia. Sublineages B1 and B3 were associated with anogenital infections, indicating a potential lesion-specific predilection of some HPV6 sublineages. Females had higher odds for infection with sublineage B3 than males. In conclusion, a global HPV6 phylogenetic analysis revealed the existence of two variant lineages and five sublineages, showing some degree of ethnogeographic, gender, and/or disease predilection in their distribution. IMPORTANCE This study established the largest database of globally circulating HPV6 genomic variants and contributed a total of 130 new, complete HPV6 genome sequences to available sequence repositories. Two HPV

  14. Diversity of putative archaeal RNA viruses in metagenomic datasets of a yellowstone acidic hot spring.

    PubMed

    Wang, Hongming; Yu, Yongxin; Liu, Taigang; Pan, Yingjie; Yan, Shuling; Wang, Yongjie

    2015-01-01

    Two genomic fragments (5,662 and 1,269 nt in size, GenBank accession no. JQ756122 and JQ756123, respectively) of novel, positive-strand RNA viruses that infect archaea were first discovered in an acidic hot spring in Yellowstone National Park (Bolduc et al., 2012). To investigate the diversity of these newly identified putative archaeal RNA viruses, global metagenomic datasets were searched for sequences that were significantly similar to those of the viruses. A total of 3,757 associated reads were retrieved solely from the Yellowstone datasets and were used to assemble the genomes of the putative archaeal RNA viruses. Nine contigs with lengths ranging from 417 to 5,866 nt were obtained, 4 of which were longer than 2,200 nt; one contig was 204 nt longer than JQ756122, representing the longest genomic sequence of the putative archaeal RNA viruses. These contigs revealed more than 50% sequence similarity to JQ756122 or JQ756123 and may be partial or nearly complete genomes of novel genogroups or genotypes of the putative archaeal RNA viruses. Sequence and phylogenetic analyses indicated that the archaeal RNA viruses are genetically diverse, with at least 3 related viral lineages in the Yellowstone acidic hot spring environment. PMID:25918685

  15. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... acids are not intended to be embraced by this definition. Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated... sequence of four or more amino acids or an unbranched sequence of ten or more nucleotides....

  16. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... acids are not intended to be embraced by this definition. Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated... sequence of four or more amino acids or an unbranched sequence of ten or more nucleotides....

  17. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... acids are not intended to be embraced by this definition. Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated... sequence of four or more amino acids or an unbranched sequence of ten or more nucleotides....

  18. Molecular diversity of lactic acid bacteria from cassava sour starch (Colombia).

    PubMed

    Omar, N B; Ampe, F; Raimbault, M; Guyot, J P; Tailliez, P

    2000-06-01

    Lactic acid bacteria and more particularly lactobacilli and Leuconostoc, are widely found in a wide variety of traditional fermented foods of tropical countries, made with cereals, tubers, meat or fish. These products represent a source of bacterial diversity that cannot be accurately analysed using classical phenotypic and biochemical tests. In the present work, the identification and the molecular diversity of lactic acid bacteria isolated from cassava sour starch fermentation were assessed by using a combination of complementary molecular methods: Randomly Amplified Polymorphic DNA fingerprinting (RAPD), plasmid profiling, hybridization using rRNA phylogenetic probes and partial 16S rDNA sequencing. The results revealed a large diversity of bacterial species (Lb. manihotivorans, Lb. plantarum, Lb. casei, Lb. hilgardii, Lb. buchneri, Lb. fermentum, Ln. mesenteroides and Pediococcus sp.). However, the most frequently isolated species were Lb. plantarum and Lb. manihotivorans. The RAPD analysis revealed a large molecular diversity between Lb. manihotivorans or Lb. plantarum strains. These results, observed on a rather limited number of samples, reveal that significant bacterial diversity is generated in traditional cassava sour starch fermentations. We propose that the presence of the amylolytic Lb. manihotivorans strains could have a role in sour starch processing. PMID:10930082

  19. A method to find palindromes in nucleic acid sequences.

    PubMed

    Anjana, Ramnath; Shankar, Mani; Vaishnavi, Marthandan Kirti; Sekar, Kanagaraj

    2013-01-01

    Various types of sequences in the human genome are known to play important roles in different aspects of genomic functioning. Among these sequences, palindromic nucleic acid sequences are one such type that have been studied in detail and found to influence a wide variety of genomic characteristics. For a nucleotide sequence to be considered as a palindrome, its complementary strand must read the same in the opposite direction. For example, both the strands i.e the strand going from 5' to 3' and its complementary strand from 3' to 5' must be complementary. A typical nucleotide palindromic sequence would be TATA (5' to 3') and its complimentary sequence from 3' to 5' would be ATAT. Thus, a new method has been developed using dynamic programming to fetch the palindromic nucleic acid sequences. The new method uses less memory and thereby it increases the overall speed and efficiency. The proposed method has been tested using the bacterial (3891 KB bases) and human chromosomal sequences (Chr-18: 74366 kb and Chr-Y: 25554 kb) and the computation time for finding the palindromic sequences is in milli seconds. PMID:23515654

  20. Analysis of sequence diversity in hypervariable regions of the external glycoprotein of human immunodeficiency virus type 1.

    PubMed Central

    Simmonds, P; Balfe, P; Ludlam, C A; Bishop, J O; Brown, A J

    1990-01-01

    Nucleotide sequences in three hypervariable regions of the human immunodeficiency virus type 1 (HIV-1) env gene were obtained by sequencing provirus present in peripheral blood mononuclear cells of HIV-infected individuals. Single molecules of target sequences were isolated by limiting dilution and amplified in two stages by the polymerase chain reaction, using nested primers. The product was directly sequenced to avoid errors introduced by Taq polymerase during the amplification process. There was extensive variation between sequences from the same individual as well as between sequences from different individuals. Interpatient variability was markedly less in individuals infected from a common source. A high proportion of amino acid substitutions in the hypervariable regions altered the number and positions of potential N-linked glycosylation sites. Sequences in two hypervariable regions frequently contained short (3- to 15-bp) duplications or deletions, and by amplifying peripheral blood mononuclear cell DNA containing 10(2) or 10(3) proviral molecules and analyzing the product by high-resolution electrophoresis, the total number and abundance of distinct length variants within an individual could be estimated, providing a more comprehensive analysis of the variants present than would be obtained by sequencing alone. Sequences from many individuals showed frequent amino acid substitutions at certain key positions for neutralizing-antibody and cytotoxic T-cell recognition in the immunodominant loop. The rates of synonymous and nonsynonymous nucleotide substitution in the region of this and flanking regions indicate that strong positive selection for amino acid change is operating in the generation of antigenic diversity. Images PMID:2243378

  1. Bovine Genetic Diversity Revealed By mtDNA Sequence Variation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mitochondrial DNA single nucleotide polymorphism (SNP) data were used to determine genetic distance, nucleotide diversity, construction of haplotypes, estimation of information contents, and phylogenic relationships in bovine HapMap breeds. The Bovine International HapMap panel consists of 720 anima...

  2. Crop diversity sequencing can improve crop tolerance to weeds

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The corn-soybean rotation in eastern South Dakota has led to a weed community comprised of species with similar life cycles to the crops; subsequently, weed management is a major input cost for producers. We are exploring crop diversity in this rotation to determine if producers can reduce the need...

  3. Multimodal phylogeny for taxonomy: integrating information from nucleotide and amino acid sequences.

    PubMed

    Bicego, Manuele; Dellaglio, Franco; Felis, Giovanna E

    2007-10-01

    The crucial role played by the analysis of microbial diversity in biotechnology-based innovations has increased the interest in the microbial taxonomy research area. Phylogenetic sequence analyses have contributed significantly to the advances in this field, also in the view of the large amount of sequence data collected in recent years. Phylogenetic analyses could be realized on the basis of protein-encoding nucleotide sequences or encoded amino acid molecules: these two mechanisms present different peculiarities, still starting from two alternative representations of the same information. This complementarity could be exploited to achieve a multimodal phylogenetic scheme that is able to integrate gene and protein information in order to realize a single final tree. This aspect has been poorly addressed in the literature. In this paper, we propose to integrate the two phylogenetic analyses using basic schemes derived from the multimodality fusion theory (or multiclassifier systems theory), a well-founded and rigorous branch for which its powerfulness has already been demonstrated in other pattern recognition contexts. The proposed approach could be applied to distance matrix-based phylogenetic techniques (like neighbor joining), resulting in a smart and fast method. The proposed methodology has been tested in a real case involving sequences of some species of lactic acid bacteria. With this dataset, both nucleotide sequence- and amino acid sequence-based phylogenetic analyses present some drawbacks, which are overcome with the multimodal analysis. PMID:17933011

  4. Sequence diversity of NanA manifests in distinct enzyme kinetics and inhibitor susceptibility

    NASA Astrophysics Data System (ADS)

    Xu, Zhongli; von Grafenstein, Susanne; Walther, Elisabeth; Fuchs, Julian E.; Liedl, Klaus R.; Sauerbrei, Andreas; Schmidtke, Michaela

    2016-04-01

    Streptococcus pneumoniae is the leading pathogen causing bacterial pneumonia and meningitis. Its surface-associated virulence factor neuraminidase A (NanA) promotes the bacterial colonization by removing the terminal sialyl residues from glycoconjugates on eukaryotic cell surface. The predominant role of NanA in the pathogenesis of pneumococci renders it an attractive target for therapeutic intervention. Despite the highly conserved activity of NanA, our alignment of the 11 NanAs revealed the evolutionary diversity of this enzyme. The amino acid substitutions we identified, particularly those in the lectin domain and in the insertion domain next to the catalytic centre triggered our special interest. We synthesised the representative NanAs and the mutagenized derivatives from E. coli for enzyme kinetics study and neuraminidase inhibitor susceptibility test. Via molecular docking we got a deeper insight into the differences between the two major variants of NanA and their influence on the ligand-target interactions. In addition, our molecular dynamics simulations revealed a prominent intrinsic flexibility of the linker between the active site and the insertion domain, which influences the inhibitor binding. Our findings for the first time associated the primary sequence diversity of NanA with the biochemical properties of the enzyme and with the inhibitory efficiency of neuraminidase inhibitors.

  5. Sequence diversity of NanA manifests in distinct enzyme kinetics and inhibitor susceptibility

    PubMed Central

    Xu, Zhongli; von Grafenstein, Susanne; Walther, Elisabeth; Fuchs, Julian E.; Liedl, Klaus R.; Sauerbrei, Andreas; Schmidtke, Michaela

    2016-01-01

    Streptococcus pneumoniae is the leading pathogen causing bacterial pneumonia and meningitis. Its surface-associated virulence factor neuraminidase A (NanA) promotes the bacterial colonization by removing the terminal sialyl residues from glycoconjugates on eukaryotic cell surface. The predominant role of NanA in the pathogenesis of pneumococci renders it an attractive target for therapeutic intervention. Despite the highly conserved activity of NanA, our alignment of the 11 NanAs revealed the evolutionary diversity of this enzyme. The amino acid substitutions we identified, particularly those in the lectin domain and in the insertion domain next to the catalytic centre triggered our special interest. We synthesised the representative NanAs and the mutagenized derivatives from E. coli for enzyme kinetics study and neuraminidase inhibitor susceptibility test. Via molecular docking we got a deeper insight into the differences between the two major variants of NanA and their influence on the ligand-target interactions. In addition, our molecular dynamics simulations revealed a prominent intrinsic flexibility of the linker between the active site and the insertion domain, which influences the inhibitor binding. Our findings for the first time associated the primary sequence diversity of NanA with the biochemical properties of the enzyme and with the inhibitory efficiency of neuraminidase inhibitors. PMID:27125351

  6. Demographic history of India and mtDNA-sequence diversity.

    PubMed Central

    Mountain, J L; Hebert, J M; Bhattacharyya, S; Underhill, P A; Ottolenghi, C; Gadgil, M; Cavalli-Sforza, L L

    1995-01-01

    The demographic history of India was examined by comparing mtDNA sequences obtained from members of three culturally divergent Indian subpopulations (endogamous caste groups). While an inferred tree revealed some clustering according to caste affiliation, there was no clear separation into three genetically distinct groups along caste lines. Comparison of pairwise nucleotide difference distributions, however, did indicate a difference in growth patterns between two of the castes. The Brahmin population appears to have undergone either a rapid expansion or steady growth. The low-ranking Mukri caste, however, may have either maintained a roughly constant population size or undergone multiple bottlenecks during that period. Comparison of the Indian sequences to those obtained from other populations, using a tree, revealed that the Indian sequences, along with all other non-African samples, form a starlike cluster. This cluster may represent a major expansion, possibly originating in southern Asia, taking place at some point after modern humans initially left Africa. PMID:7717409

  7. Draft Genome Sequences of a Phylogenetically Diverse Suite of Pseudomonas syringae Strains from Multiple Source Populations

    PubMed Central

    Yourstone, Scott; Lind, Abigail; Guilbaud, Caroline; Sands, David C.; Jones, Corbin D.; Morris, Cindy E.; Dangl, Jeffrey L.

    2014-01-01

    Here, we report the draft genome sequences for 7 phylogenetically diverse isolates of Pseudomonas syringae, obtained from numerous environmental sources and geographically proximate crop species. Overall, these sequences provide a wealth of information about the differences (or lack thereof) between isolates from disease outbreaks and those from other sources. PMID:24459267

  8. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Advances in next-generation technologies have driven the costs of DNA sequencing down to the point that genotyping-by-sequencing (GBS) is now feasible for high diversity, large genome species. Here, we report a procedure for constructing GBS libraries based on reducing genome complexity with restri...

  9. On Quantum Algorithm for Multiple Alignment of Amino Acid Sequences

    NASA Astrophysics Data System (ADS)

    Iriyama, Satoshi; Ohya, Masanori

    2009-02-01

    The alignment of genome sequences or amino acid sequences is one of fundamental operations for the study of life. Usual computational complexity for the multiple alignment of N sequences with common length L by dynamic programming is O(LN). This alignment is considered as one of the NP problems, so that it is desirable to find a nice algorithm of the multiple alignment. Thus in this paper we propose the quantum algorithm for the multiple alignment based on the works12,1,2 in which the NP complete problem was shown to be the P problem by means of quantum algorithm and chaos information dynamics.

  10. The amino-acid sequence of kangaroo pancreatic ribonuclease.

    PubMed

    Gaastra, W; Welling, G W; Beintema, J J

    1978-05-01

    Red kangaroo (Macropus rufus) ribonuclease was isolated from pancreatic tissue by affinity chromatography. The amino acid sequence was determined by automatic sequencing of overlapping large fragments and by analysis of shorter peptides obtained by digestion with a number of proteolytic enzymes. The polypeptide chain consists of 122 amino acid residues. Compared to other ribonucleases, the N-terminal residue and residue 114 are deleted. In other pancreatic ribonucleases position 114 is occupied by a cis proline residue in an external loop at the surface of the molecule. Other remarkable substitutions are the presence of a tyrosine residue at position 123 instead of a serine which forms a hydrogen bond with the pyrimidine ring of a nucleotide substrate, and a number of hydrophobichydrophilic interchanges in the sequence 51-55, which forms part of an alpha-helix in bovine ribonuclease and exhibits few substitutions in the placental mammals. Kangaroo ribonuclease contains no carbohydrate, although the enzyme possesses a recognition site for carbohydrate attachment in the sequence Asn-Val-Thr (62-64). The enzyme differs at about 35-40% of the positions from all other mammalian pancreatic ribonucleases sequenced to date, which is in agreement with the early divergence between the marsupials and the placental mammals. From fragmentary data a tentative sequence of red-necked wallaby (Macropus rufogriseus) pancreatic ribonuclease has been derived. Eight differences with the kangaroo sequence were found. PMID:658039

  11. Contemporary environmental variation determines microbial diversity patterns in acid mine drainage

    PubMed Central

    Kuang, Jia-Liang; Huang, Li-Nan; Chen, Lin-Xing; Hua, Zheng-Shuang; Li, Sheng-Jin; Hu, Min; Li, Jin-Tian; Shu, Wen-Sheng

    2013-01-01

    A wide array of microorganisms survive and thrive in extreme environments. However, we know little about the patterns of, and controls over, their large-scale ecological distribution. To this end, we have applied a bar-coded 16S rRNA pyrosequencing technology to explore the phylogenetic differentiation among 59 microbial communities from physically and geochemically diverse acid mine drainage (AMD) sites across Southeast China, revealing for the first time environmental variation as the major factor explaining community differences in these harsh environments. Our data showed that overall microbial diversity estimates, including phylogenetic diversity, phylotype richness and pairwise UniFrac distance, were largely correlated with pH conditions. Furthermore, multivariate regression tree analysis also identified solution pH as a strong predictor of relative lineage abundance. Betaproteobacteria, mostly affiliated with the ‘Ferrovum' genus, were explicitly predominant in assemblages under moderate pH conditions, whereas Alphaproteobacteria, Euryarchaeota, Gammaproteobacteria and Nitrospira exhibited a strong adaptation to more acidic environments. Strikingly, such pH-dependent patterns could also be observed in a subsequent comprehensive analysis of the environmental distribution of acidophilic microorganisms based on 16S rRNA gene sequences previously retrieved from globally distributed AMD and associated environments, regardless of the long-distance isolation and the distinct substrate types. Collectively, our results suggest that microbial diversity patterns are better predicted by contemporary environmental variation rather than geographical distance in extreme AMD systems. PMID:23178673

  12. Sequence diversity under the multispecies coalescent with Yule process and constant population size.

    PubMed

    Heled, Joseph

    2012-03-01

    The study of sequence diversity under phylogenetic models is now classic. Theoretical studies of diversity under the Kingman coalescent appeared shortly after the introduction of the coalescent. In this paper we revisit this topic under the multispecies coalescent, an extension of the single population model to multiple populations. We derive exact formulas for the sequence dissimilarity of two sequences drawn at random under a basic multispecies setup. The multispecies model uses three parameters--the species tree birth rate under the pure birth process (Yule), the species effective population size and the mutation rate. We also discuss the effects of relaxing some of the model assumptions. PMID:22210390

  13. Interrogating and predicting tolerated sequence diversity in protein folds: application to E. elaterium trypsin inhibitor-II cystine-knot miniprotein.

    PubMed

    Lahti, Jennifer L; Silverman, Adam P; Cochran, Jennifer R

    2009-09-01

    Cystine-knot miniproteins (knottins) are promising molecular scaffolds for protein engineering applications. Members of the knottin family have multiple loops capable of displaying conformationally constrained polypeptides for molecular recognition. While previous studies have illustrated the potential of engineering knottins with modified loop sequences, a thorough exploration into the tolerated loop lengths and sequence space of a knottin scaffold has not been performed. In this work, we used the Ecballium elaterium trypsin inhibitor II (EETI) as a model member of the knottin family and constructed libraries of EETI loop-substituted variants with diversity in both amino acid sequence and loop length. Using yeast surface display, we isolated properly folded EETI loop-substituted clones and applied sequence analysis tools to assess the tolerated diversity of both amino acid sequence and loop length. In addition, we used covariance analysis to study the relationships between individual positions in the substituted loops, based on the expectation that correlated amino acid substitutions will occur between interacting residue pairs. We then used the results of our sequence and covariance analyses to successfully predict loop sequences that facilitated proper folding of the knottin when substituted into EETI loop 3. The sequence trends we observed in properly folded EETI loop-substituted clones will be useful for guiding future protein engineering efforts with this knottin scaffold. Furthermore, our findings demonstrate that the combination of directed evolution with sequence and covariance analyses can be a powerful tool for rational protein engineering. PMID:19730675

  14. Mapping of Genotype–Phenotype Diversity among Clinical Isolates of Mycobacterium tuberculosis by Sequence-Based Transcriptional Profiling

    PubMed Central

    Rose, Graham; Cortes, Teresa; Comas, Iñaki; Coscolla, Mireia; Gagneux, Sebastien; Young, Douglas B.

    2013-01-01

    Genome sequencing has identified an extensive repertoire of single nucleotide polymorphisms among clinical isolates of Mycobacterium tuberculosis, but the extent to which these differences influence phenotypic properties of the bacteria remains to be elucidated. To determine whether these polymorphisms give rise to phenotypic diversity, we have integrated genome data sets with RNA sequencing to assess their impact on the comparative transcriptome profiles of strains belonging to M. tuberculosis Lineages 1 and 2. We observed clear correlations between genotype and transcriptional phenotype. These arose by three mechanisms. First, lineage-specific changes in amino acid sequence of transcriptional regulators were associated with alterations in their ability to control gene expression. Second, changes in nucleotide sequence were associated with alteration of promoter activity and generation of novel transcriptional start sites in intergenic regions and within coding sequences. We show that in some cases this mechanism is expected to generate functionally active truncated proteins involved in innate immune recognition. Finally, genes showing lineage-specific patterns of differential expression not linked directly to primary mutations were characterized by a striking overrepresentation of toxin–antitoxin pairs. Taken together, these findings advance our understanding of mycobacterial evolution, contribute to a systems level understanding of this important human pathogen, and more broadly demonstrate the application of state-of-the-art techniques to provide novel insight into mechanisms by which intergenic and silent mutations contribute to diversity. PMID:24115728

  15. Exploring Genetic Diversity in Plants Using High-Throughput Sequencing Techniques.

    PubMed

    Onda, Yoshihiko; Mochida, Keiichi

    2016-08-01

    Food security has emerged as an urgent concern because of the rising world population. To meet the food demands of the near future, it is required to improve the productivity of various crops, not just of staple food crops. The genetic diversity among plant populations in a given species allows the plants to adapt to various environmental conditions. Such diversity could therefore yield valuable traits that could overcome the food-security challenges. To explore genetic diversity comprehensively and to rapidly identify useful genes and/or allele, advanced high-throughput sequencing techniques, also called next-generation sequencing (NGS) technologies, have been developed. These provide practical solutions to the challenges in crop genomics. Here, we review various sources of genetic diversity in plants, newly developed genetic diversity-mining tools synergized with NGS techniques, and related genetic approaches such as quantitative trait locus analysis and genome-wide association study. PMID:27499684

  16. Repetitive sequences: the hidden diversity of heterochromatin in prochilodontid fish

    PubMed Central

    Terencio, Maria L.; Schneider, Carlos H.; Gross, Maria C.; do Carmo, Edson Junior; Nogaroto, Viviane; de Almeida, Mara Cristina; Artoni, Roberto Ferreira; Vicari, Marcelo R.; Feldberg, Eliana

    2015-01-01

    Abstract The structure and organization of repetitive elements in fish genomes are still relatively poorly understood, although most of these elements are believed to be located in heterochromatic regions. Repetitive elements are considered essential in evolutionary processes as hotspots for mutations and chromosomal rearrangements, among other functions – thus providing new genomic alternatives and regulatory sites for gene expression. The present study sought to characterize repetitive DNA sequences in the genomes of Semaprochilodus insignis (Jardine & Schomburgk, 1841) and Semaprochilodus taeniurus (Valenciennes, 1817) and identify regions of conserved syntenic blocks in this genome fraction of three species of Prochilodontidae (Semaprochilodus insignis, Semaprochilodus taeniurus, and Prochilodus lineatus (Valenciennes, 1836) by cross-FISH using Cot-1 DNA (renaturation kinetics) probes. We found that the repetitive fractions of the genomes of Semaprochilodus insignis and Semaprochilodus taeniurus have significant amounts of conserved syntenic blocks in hybridization sites, but with low degrees of similarity between them and the genome of Prochilodus lineatus, especially in relation to B chromosomes. The cloning and sequencing of the repetitive genomic elements of Semaprochilodus insignis and Semaprochilodus taeniurus using Cot-1 DNA identified 48 fragments that displayed high similarity with repetitive sequences deposited in public DNA databases and classified as microsatellites, transposons, and retrotransposons. The repetitive fractions of the Semaprochilodus insignis and Semaprochilodus taeniurus genomes exhibited high degrees of conserved syntenic blocks in terms of both the structures and locations of hybridization sites, but a low degree of similarity with the syntenic blocks of the Prochilodus lineatus genome. Future comparative analyses of other prochilodontidae species will be needed to advance our understanding of the organization and evolution of

  17. RNA editing generates cellular subsets with diverse sequence within populations.

    PubMed

    Harjanto, Dewi; Papamarkou, Theodore; Oates, Chris J; Rayon-Estrada, Violeta; Papavasiliou, F Nina; Papavasiliou, Anastasia

    2016-01-01

    RNA editing is a mutational mechanism that specifically alters the nucleotide content in transcribed RNA. However, editing rates vary widely, and could result from equivalent editing amongst individual cells, or represent an average of variable editing within a population. Here we present a hierarchical Bayesian model that quantifies the variance of editing rates at specific sites using RNA-seq data from both single cells, and a cognate bulk sample to distinguish between these two possibilities. The model predicts high variance for specific edited sites in murine macrophages and dendritic cells, findings that we validated experimentally by using targeted amplification of specific editable transcripts from single cells. The model also predicts changes in variance in editing rates for specific sites in dendritic cells during the course of LPS stimulation. Our data demonstrate substantial variance in editing signatures amongst single cells, supporting the notion that RNA editing generates diversity within cellular populations. PMID:27418407

  18. RNA editing generates cellular subsets with diverse sequence within populations

    PubMed Central

    Harjanto, Dewi; Papamarkou, Theodore; Oates, Chris J.; Rayon-Estrada, Violeta; Papavasiliou, F. Nina; Papavasiliou, Anastasia

    2016-01-01

    RNA editing is a mutational mechanism that specifically alters the nucleotide content in transcribed RNA. However, editing rates vary widely, and could result from equivalent editing amongst individual cells, or represent an average of variable editing within a population. Here we present a hierarchical Bayesian model that quantifies the variance of editing rates at specific sites using RNA-seq data from both single cells, and a cognate bulk sample to distinguish between these two possibilities. The model predicts high variance for specific edited sites in murine macrophages and dendritic cells, findings that we validated experimentally by using targeted amplification of specific editable transcripts from single cells. The model also predicts changes in variance in editing rates for specific sites in dendritic cells during the course of LPS stimulation. Our data demonstrate substantial variance in editing signatures amongst single cells, supporting the notion that RNA editing generates diversity within cellular populations. PMID:27418407

  19. Low Diversity in the Mitogenome of Sperm Whales Revealed by Next-Generation Sequencing

    PubMed Central

    Alexander, Alana; Steel, Debbie; Slikas, Beth; Hoekzema, Kendra; Carraher, Colm; Parks, Matthew; Cronn, Richard; Baker, C. Scott

    2013-01-01

    Large population sizes and global distributions generally associate with high mitochondrial DNA control region (CR) diversity. The sperm whale (Physeter macrocephalus) is an exception, showing low CR diversity relative to other cetaceans; however, diversity levels throughout the remainder of the sperm whale mitogenome are unknown. We sequenced 20 mitogenomes from 17 sperm whales representative of worldwide diversity using Next Generation Sequencing (NGS) technologies (Illumina GAIIx, Roche 454 GS Junior). Resequencing of three individuals with both NGS platforms and partial Sanger sequencing showed low discrepancy rates (454-Illumina: 0.0071%; Sanger-Illumina: 0.0034%; and Sanger-454: 0.0023%) confirming suitability of both NGS platforms for investigating low mitogenomic diversity. Using the 17 sperm whale mitogenomes in a phylogenetic reconstruction with 41 other species, including 11 new dolphin mitogenomes, we tested two hypotheses for the low CR diversity. First, the hypothesis that CR-specific constraints have reduced diversity solely in the CR was rejected as diversity was low throughout the mitogenome, not just in the CR (overall diversity π = 0.096%; protein-coding 3rd codon = 0.22%; CR = 0.35%), and CR phylogenetic signal was congruent with protein-coding regions. Second, the hypothesis that slow substitution rates reduced diversity throughout the sperm whale mitogenome was rejected as sperm whales had significantly higher rates of CR evolution and no evidence of slow coding region evolution relative to other cetaceans. The estimated time to most recent common ancestor for sperm whale mitogenomes was 72,800 to 137,400 years ago (95% highest probability density interval), consistent with previous hypotheses of a bottleneck or selective sweep as likely causes of low mitogenome diversity. PMID:23254394

  20. Microbial diversity and metabolic networks in acid mine drainage habitats

    PubMed Central

    Méndez-García, Celia; Peláez, Ana I.; Mesa, Victoria; Sánchez, Jesús; Golyshina, Olga V.; Ferrer, Manuel

    2015-01-01

    Acid mine drainage (AMD) emplacements are low-complexity natural systems. Low-pH conditions appear to be the main factor underlying the limited diversity of the microbial populations thriving in these environments, although temperature, ionic composition, total organic carbon, and dissolved oxygen are also considered to significantly influence their microbial life. This natural reduction in diversity driven by extreme conditions was reflected in several studies on the microbial populations inhabiting the various micro-environments present in such ecosystems. Early studies based on the physiology of the autochthonous microbiota and the growing success of omics-based methodologies have enabled a better understanding of microbial ecology and function in low-pH mine outflows; however, complementary omics-derived data should be included to completely describe their microbial ecology. Furthermore, recent updates on the distribution of eukaryotes and archaea recovered through sterile filtering (herein referred to as filterable fraction) in these environments demand their inclusion in the microbial characterization of AMD systems. In this review, we present a complete overview of the bacterial, archaeal (including filterable fraction), and eukaryotic diversity in these ecosystems, and include a thorough depiction of the metabolism and element cycling in AMD habitats. We also review different metabolic network structures at the organismal level, which is necessary to disentangle the role of each member of the AMD communities described thus far. PMID:26074887

  1. Genetic Diversity and Population Differentiation of Guignardia mangiferae from “Tahiti” Acid Lime

    PubMed Central

    Wickert, Ester; Lemos, Eliana Gertrudes de Macedo; Kishi, Luciano Takeshi; de Souza, Andressa; de Goes, Antonio

    2012-01-01

    Among the citrus plants, “Tahiti” acid lime is known as a host of G. mangiferae fungi. This species is considered endophytic for citrus plants and is easily isolated from asymptomatic fruits and leaves. G. mangiferae is genetically related and sometimes confused with G. citricarpa which causes Citrus Black Spot (CBS). “Tahiti” acid lime is one of the few species that means to be resistant to this disease because it does not present symptoms. Despite the fact that it is commonly found in citric plants, little is known about the populations of G. mangiferae associated with these plants. Hence, the objective of this work was to gain insights about the genetic diversity of the G. mangiferae populations that colonize “Tahiti” acid limes by sequencing cistron ITS1-5.8S-ITS2. It was verified that “Tahiti” acid lime plants are hosts of G. mangiferae and also of G. citricarpa, without presenting symptoms of CBS. Populations of G. mangiferae present low-to-moderate genetic diversity and show little-to-moderate levels of population differentiation. As gene flow was detected among the studied populations and they share haplotypes, it is possible that all populations, from citrus plants and also from the other known hosts of this fungus, belong to one great panmictic population. PMID:22619579

  2. Amino acid sequence of Salmonella typhimurium branched-chain amino acid aminotransferase.

    PubMed

    Feild, M J; Nguyen, D C; Armstrong, F B

    1989-06-13

    The complete amino acid sequence of the subunit of branched-chain amino acid aminotransferase (transaminase B, EC 2.6.1.42) of Salmonella typhimurium was determined. An Escherichia coli recombinant containing the ilvGEDAY gene cluster of Salmonella was used as the source of the hexameric enzyme. The peptide fragments used for sequencing were generated by treatment with trypsin, Staphylococcus aureus V8 protease, endoproteinase Lys-C, and cyanogen bromide. The enzyme subunit contains 308 residues and has a molecular weight of 33,920. To determine the coenzyme-binding site, the pyridoxal 5-phosphate containing enzyme was treated with tritiated sodium borohydride prior to trypsin digestion. Peptide map comparisons with an apoenzyme tryptic digest and monitoring radioactivity incorporation allowed identification of the pyridoxylated peptide, which was then isolated and sequenced. The coenzyme-binding site is the lysyl residue at position 159. The amino acid sequence of Salmonella transaminase B is 97.4% identical with that of Escherichia coli, differing in only eight amino acid positions. Sequence comparisons of transaminase B to other known aminotransferase sequences revealed limited sequence similarity (24-33%) when conserved amino acid substitutions are allowed and alignments were forced to occur on the coenzyme-binding site. PMID:2669973

  3. Genotyping by sequencing reveals the genetic diversity of the USDA pisum diversity collection

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The USDA expanded Pisum Single Plant (PSP) core collection is a unique resource that represents the breadth of the genetic diversity of the genus in an inbred format that facilitates genetic study. The collection includes inbred accessions from the refined pea core collection, parent lines of USDA r...

  4. Genome diversity in Brachypodium distachyon: deep sequencing of highly diverse inbred lines

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Natural variation provides a powerful opportunity to study the genetic basis of biological traits. Brachypodium distachyon is a broadly distributed diploid model grass with a small genome and a large collection of diverse inbred lines. As a step towards understanding the genetic basis of the natura...

  5. Amino acid sequence of bovine heart coupling factor 6.

    PubMed Central

    Fang, J K; Jacobs, J W; Kanner, B I; Racker, E; Bradshaw, R A

    1984-01-01

    The amino acid sequence of bovine heart mitochondrial coupling factor 6 (F6) has been determined by automated Edman degradation of the whole protein and derived peptides. Preparations based on heat precipitation and ethanol extraction showed allotypic variation at three positions while material further purified by HPLC yielded only one sequence that also differed by a Phe-Thr replacement at residue 62. The mature protein contains 76 amino acids with a calculated molecular weight of 9006 and a pI of approximately equal to 5, in good agreement with experimentally measured values. The charged amino acids are mainly clustered at the termini and in one section in the middle; these three polar segments are separated by two segments relatively rich in nonpolar residues. Chou-Fasman analysis suggests three stretches of alpha-helix coinciding (or within) the high-charge-density sequences with a single beta-turn at the first polar-nonpolar junction. Comparison of the F6 sequence with those of other proteins did not reveal any homologous structures. PMID:6149548

  6. Diversity of Frankia in soil assessed by Illumina sequencing of nifH gene fragments.

    PubMed

    Rodriguez, David; Guerra, Trina M; Forstner, Michael R J; Hahn, Dittmar

    2016-09-01

    Targeted Illumina sequencing of nitrogenase reductase (nifH) gene fragments and analyses of pair-end reads through a modified QIIME pipeline were used to assess the diversity of the actinomyceteous genus Frankia in three soils. Soils were vegetated with host or non-host plants, and included locations in Illinois (ABA, host), Colorado (CoMt, non-host), and Wisconsin (FMWI, non-host). After filtering, seven unique sequences were recovered for soil ABA, six for CoMt, and four sequences for FMWI. These sequences were included in a Bayesian topology anchored by published sequence data from pure cultures of Frankia. Sequences from all three soils showed affinities to Frankia strains from both the Alnus and Elaeagnus host infection groups. Reads representing Casuarina-infective strains were not detected. Four sequences from soil CoMt and five sequences from soil ABA did not cluster, at 97% similarity, into a shared OTU that contained a cultured relative. These results demonstrate that targeted Illumina sequencing provides an efficient and economical method for assessing haplotype diversity of ecofunctional genes (e.g. nifH) at the genus level in microorganisms that perform important ecosystem functions. PMID:27485903

  7. Sequence diversity and novelty of natural assemblages of picoeukaryotes from the Indian Ocean

    PubMed Central

    Massana, Ramon; Pernice, Massimo; Bunge, John A; Campo, Javier del

    2011-01-01

    Despite the ecological importance of marine pico-size eukaryotes, the study of their in situ diversity using molecular tools started just a few years ago. These studies have revealed that marine picoeukaryotes are very diverse and include many novel taxa. However, the amount and structure of their phylogenetic diversity and the extent of their sequence novelty still remains poorly known, as a systematic analysis has been seldom attempted. In this study, we use a coherent and carefully curated data set of 500 published 18S ribosomal DNA sequences to quantify the diversity and novelty patterns of picoeukaryotes in the Indian Ocean. Our phylogenetic tree showed many distant lineages. We grouped sequences in OTUs (operational taxonomic units) at discrete values delineated by pair-wise Jukes–Cantor (JC) distances and tree patristic distances. At a distance of 0.01, the number of OTUs observed (237/242; using JC or patristic distances, respectively) was half the number of sequences analyzed, indicating the existence of microdiverse clusters of highly related sequences. At this distance level, we estimated 600–800 OTUs using several statistical methods. The number of OTUs observed was still substantial at higher distances (39/82 at 0.20 distance) suggesting a large diversity at high-taxonomic ranks. Most sequences were related to marine clones from other sites and many were distant to cultured organisms, highlighting the huge culturing gap within protists. The novelty analysis indicated the putative presence of pseudogenes and of truly novel high-rank phylogenetic lineages. The identified diversity and novelty patterns among marine picoeukaryotes are of great importance for understanding and interpreting their ecology and evolution. PMID:20631807

  8. Exploring the environmental diversity of kinetoplastid flagellates in the high-throughput DNA sequencing era

    PubMed Central

    d’Avila-Levy, Claudia Masini; Boucinha, Carolina; Kostygov, Alexei; Santos, Helena Lúcia Carneiro; Morelli, Karina Alessandra; Grybchuk-Ieremenko, Anastasiia; Duval, Linda; Votýpka, Jan; Yurchenko, Vyacheslav; Grellier, Philippe; Lukeš, Julius

    2015-01-01

    The class Kinetoplastea encompasses both free-living and parasitic species from a wide range of hosts. Several representatives of this group are responsible for severe human diseases and for economic losses in agriculture and livestock. While this group encompasses over 30 genera, most of the available information has been derived from the vertebrate pathogenic genera Leishmaniaand Trypanosoma. Recent studies of the previously neglected groups of Kinetoplastea indicated that the actual diversity is much higher than previously thought. This article discusses the known segment of kinetoplastid diversity and how gene-directed Sanger sequencing and next-generation sequencing methods can help to deepen our knowledge of these interesting protists. PMID:26602872

  9. Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data

    PubMed Central

    Beerenwinkel, Niko; Günthard, Huldrych F.; Roth, Volker; Metzner, Karin J.

    2012-01-01

    Many viruses, including the clinically relevant RNA viruses HIV (human immunodeficiency virus) and HCV (hepatitis C virus), exist in large populations and display high genetic heterogeneity within and between infected hosts. Assessing intra-patient viral genetic diversity is essential for understanding the evolutionary dynamics of viruses, for designing effective vaccines, and for the success of antiviral therapy. Next-generation sequencing (NGS) technologies allow the rapid and cost-effective acquisition of thousands to millions of short DNA sequences from a single sample. However, this approach entails several challenges in experimental design and computational data analysis. Here, we review the entire process of inferring viral diversity from sample collection to computing measures of genetic diversity. We discuss sample preparation, including reverse transcription and amplification, and the effect of experimental conditions on diversity estimates due to in vitro base substitutions, insertions, deletions, and recombination. The use of different NGS platforms and their sequencing error profiles are compared in the context of various applications of diversity estimation, ranging from the detection of single nucleotide variants (SNVs) to the reconstruction of whole-genome haplotypes. We describe the statistical and computational challenges arising from these technical artifacts, and we review existing approaches, including available software, for their solution. Finally, we discuss open problems, and highlight successful biomedical applications and potential future clinical use of NGS to estimate viral diversity. PMID:22973268

  10. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  11. Amino acid sequence of the Amur tiger prion protein.

    PubMed

    Wu, Changde; Pang, Wanyong; Zhao, Deming

    2006-10-01

    Prion diseases are fatal neurodegenerative disorders in human and animal associated with conformational conversion of a cellular prion protein (PrP(C)) into the pathologic isoform (PrP(Sc)). Various data indicate that the polymorphisms within the open reading frame (ORF) of PrP are associated with the susceptibility and control the species barrier in prion diseases. In the present study, partial Prnp from 25 Amur tigers (tPrnp) were cloned and screened for polymorphisms. Four single nucleotide polymorphisms (T423C, A501G, C511A, A610G) were found; the C511A and A610G nucleotide substitutions resulted in the amino acid changes Lysine171Glutamine and Alanine204Threoine, respectively. The tPrnp amino acid sequence is similar to house cat (Felis catus ) and sheep, but differs significantly from other two cat Prnp sequences that were previously deposited in GenBank. PMID:16780982

  12. Multilocus Sequence Typing of Genital Chlamydia trachomatis in Norway Reveals Multiple New Sequence Types and a Large Genetic Diversity

    PubMed Central

    Gravningen, Kirsten; Christerson, Linus; Furberg, Anne-Sofie; Simonsen, Gunnar Skov; Ödman, Kristina; Ståhlsten, Anna; Herrmann, Björn

    2012-01-01

    Background The Chlamydia trachomatis incidence rate in Finnmark, the most northern and sparsely populated county in Norway, has been twice the national average. This population based cross-sectional study among Finnmark high school students had the following aims: i) to examine distribution of multilocus sequence types (STs) of C. trachomatis in a previously unmapped area, ii) to compare chlamydia genetic diversity in Finnmark with that of two urban regions, and iii) to compare discriminatory capacity of multilocus sequence typing (MLST) with conventional ompA sequencing in a large number of chlamydia specimens. Methodology ompA sequencing and a high-resolution MLST system based on PCR amplification and DNA sequencing of five highly variable genetic regions were used. Eighty chlamydia specimens from adolescents aged 15–20 years in Finnmark were collected in five high schools (n = 60) and from routine clinical samples in the laboratory (n = 20). These were compared to routine clinical samples from adolescents in Tromsø (n = 80) and Trondheim (n = 88), capitals of North and Central Norway, respectively. Principal Findings ompA sequencing detected 11 genotypes in 248 specimens from all three areas. MLST displayed 50 STs providing a five-fold higher resolution. Two-thirds of all STs were novel. The common ompA E/Bour genotype comprised 46% and resolved into 24 different STs. MLST identified the Swedish new variant of C. trachomatis not discriminated by ompA sequencing. Simpson's discriminatory index (D) was 0.93 for MLST, while a corrected Dc was 0.97. There were no statistically significant differences in ST genetic diversity between geographic areas. Finnmark had an atypical genovar distribution with G being predominant. This was mainly due to expansion of specific STs of which the novel ST161 was unique for Finnmark. Conclusions/Significance MLST revealed multiple new STs and a larger genetic diversity in comparison to ompA sequencing and proved

  13. Estimating and comparing microbial diversity in the presence of sequencing errors

    PubMed Central

    Chiu, Chun-Huo

    2016-01-01

    Estimating and comparing microbial diversity are statistically challenging due to limited sampling and possible sequencing errors for low-frequency counts, producing spurious singletons. The inflated singleton count seriously affects statistical analysis and inferences about microbial diversity. Previous statistical approaches to tackle the sequencing errors generally require different parametric assumptions about the sampling model or about the functional form of frequency counts. Different parametric assumptions may lead to drastically different diversity estimates. We focus on nonparametric methods which are universally valid for all parametric assumptions and can be used to compare diversity across communities. We develop here a nonparametric estimator of the true singleton count to replace the spurious singleton count in all methods/approaches. Our estimator of the true singleton count is in terms of the frequency counts of doubletons, tripletons and quadrupletons, provided these three frequency counts are reliable. To quantify microbial alpha diversity for an individual community, we adopt the measure of Hill numbers (effective number of taxa) under a nonparametric framework. Hill numbers, parameterized by an order q that determines the measures’ emphasis on rare or common species, include taxa richness (q = 0), Shannon diversity (q = 1, the exponential of Shannon entropy), and Simpson diversity (q = 2, the inverse of Simpson index). A diversity profile which depicts the Hill number as a function of order q conveys all information contained in a taxa abundance distribution. Based on the estimated singleton count and the original non-singleton frequency counts, two statistical approaches (non-asymptotic and asymptotic) are developed to compare microbial diversity for multiple communities. (1) A non-asymptotic approach refers to the comparison of estimated diversities of standardized samples with a common finite sample size or sample completeness. This

  14. Estimating and comparing microbial diversity in the presence of sequencing errors.

    PubMed

    Chiu, Chun-Huo; Chao, Anne

    2016-01-01

    Estimating and comparing microbial diversity are statistically challenging due to limited sampling and possible sequencing errors for low-frequency counts, producing spurious singletons. The inflated singleton count seriously affects statistical analysis and inferences about microbial diversity. Previous statistical approaches to tackle the sequencing errors generally require different parametric assumptions about the sampling model or about the functional form of frequency counts. Different parametric assumptions may lead to drastically different diversity estimates. We focus on nonparametric methods which are universally valid for all parametric assumptions and can be used to compare diversity across communities. We develop here a nonparametric estimator of the true singleton count to replace the spurious singleton count in all methods/approaches. Our estimator of the true singleton count is in terms of the frequency counts of doubletons, tripletons and quadrupletons, provided these three frequency counts are reliable. To quantify microbial alpha diversity for an individual community, we adopt the measure of Hill numbers (effective number of taxa) under a nonparametric framework. Hill numbers, parameterized by an order q that determines the measures' emphasis on rare or common species, include taxa richness (q = 0), Shannon diversity (q = 1, the exponential of Shannon entropy), and Simpson diversity (q = 2, the inverse of Simpson index). A diversity profile which depicts the Hill number as a function of order q conveys all information contained in a taxa abundance distribution. Based on the estimated singleton count and the original non-singleton frequency counts, two statistical approaches (non-asymptotic and asymptotic) are developed to compare microbial diversity for multiple communities. (1) A non-asymptotic approach refers to the comparison of estimated diversities of standardized samples with a common finite sample size or sample completeness. This approach

  15. HIGH SEQUENCE DIVERSITY IN THE RNA SYNTHESIZED AT THE LAMPBRUSH STAGE OF OÖGENESIS*

    PubMed Central

    Davidson, Eric H.; Hough, Barbara R.

    1969-01-01

    Many diverse RNA's are synthesized in the lampbrush stage oöcyte of Xenopus, as shown by the presence of different nucleotide sequences in the RNA population. This fact has been established by hybridizing lampbrush stage oöcyte RNA with an isolated nonrepetitive fraction of Xenopus DNA. Images PMID:5257126

  16. High sequence diversity in the RNA synthesized at the lampbrush stage of oögenesis.

    PubMed

    Davidson, E H; Hough, B R

    1969-06-01

    Many diverse RNA's are synthesized in the lampbrush stage oöcyte of Xenopus, as shown by the presence of different nucleotide sequences in the RNA population. This fact has been established by hybridizing lampbrush stage oöcyte RNA with an isolated nonrepetitive fraction of Xenopus DNA. PMID:5257126

  17. Next generation sequencing to define prokaryotic and fungal diversity in the bovine rumen

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A combination of Sanger and 454 sequences of small subunit rRNA loci were used to interrogate the microbial diversity in the bovine rumen of 14 pasture-fed animals. The observed bacterial species richness, based on the V1-V3 region of the 15S rRNA gene, was between 1902 to 2596 species-level operati...

  18. Host-specific segregation of ribosomal nucleotide sequence diversity in the microsporidian Enterocytozoon bieneusi.

    PubMed

    Widmer, Giovanni; Akiyoshi, Donna E

    2010-01-01

    Enterocytozoon bieneusi is a unicellular enteric fungal pathogen and the most common cause of human microsporidiosis. The frequent detection of this organism in animals, including companion animals, livestock and wildlife, has raised the question of the importance of animal reservoirs in the epidemiology of this pathogen. A partial sequence of the ribosomal internal transcribed spacer (ITS) has been widely used as a genetic marker for studying the molecular epidemiology of E. bieneusi. With the aim of comparing E. bieneusi ITS genotypes originating from different host species, and assess the potential for zoonotic transmission, E. bieneusi ITS sequences retrieved from GenBank were analyzed using two metrics of diversity, rarefaction and phylogenetic distance. In spite of the human ITS sample being geographically more diverse, ITS sequence diversity in animals exceeded that of humans. In both host groups much of the ITS diversity remains to be sampled. Using quantitative phylogenetic tests we found evidence for a partial but significant segregation of E. bieneusi ITS sequences according to host species. Host-specific segregation was confirmed by hierarchical analysis of molecular variation. To improve our understanding of the epidemiology of human microsporidiosis and strengthen the study of E. bieneusi populations, efforts to genotype additional E. bieneusi isolates from wildlife and companion animals should be prioritized and the geographic and species diversify of animal samples should be increased. Due to the possibility of genetic recombination in this species, additional unlinked genetic markers need to be developed and included in future studies. PMID:19931647

  19. Evaluation of Genetic Diversity of the USDA Lablab Purpureus Germplasm Collection Using Simple Sequence Repeat Markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The genetic diversity of the USDA Lablab purpureus germplasm collection is unknown and was assessed by using polymorphic simple sequence repeat (SSR) markers derived from Medicago, soybean and cowpea. Phylogenetic analysis paritioned 47 representative accessions into two main clades (wild clade prod...

  20. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy has extensively been used in physical surface sciences to study quantum tunneling to measure electronic local density of states of nanomaterials and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecule of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique ``electronic fingerprints'' for all nucleotides on DNA and RNA using Q-Seq along their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  1. Propionibacterium acnes: Disease-Causing Agent or Common Contaminant? Detection in Diverse Patient Samples by Next-Generation Sequencing.

    PubMed

    Mollerup, Sarah; Friis-Nielsen, Jens; Vinner, Lasse; Hansen, Thomas Arn; Richter, Stine Raith; Fridholm, Helena; Herrera, Jose Alejandro Romero; Lund, Ole; Brunak, Søren; Izarzugaza, Jose M G; Mourier, Tobias; Nielsen, Lars Peter; Hansen, Anders Johannes

    2016-04-01

    Propionibacterium acnesis the most abundant bacterium on human skin, particularly in sebaceous areas.P. acnesis suggested to be an opportunistic pathogen involved in the development of diverse medical conditions but is also a proven contaminant of human clinical samples and surgical wounds. Its significance as a pathogen is consequently a matter of debate. In the present study, we investigated the presence ofP. acnesDNA in 250 next-generation sequencing data sets generated from 180 samples of 20 different sample types, mostly of cancerous origin. The samples were subjected to either microbial enrichment, involving nuclease treatment to reduce the amount of host nucleic acids, or shotgun sequencing. We detected high proportions ofP. acnesDNA in enriched samples, particularly skin tissue-derived and other tissue samples, with the levels being higher in enriched samples than in shotgun-sequenced samples.P. acnesreads were detected in most samples analyzed, though the proportions in most shotgun-sequenced samples were low. Our results show thatP. acnescan be detected in practically all sample types when molecular methods, such as next-generation sequencing, are employed. The possibility of contamination from the patient or other sources, including laboratory reagents or environment, should therefore always be considered carefully whenP. acnesis detected in clinical samples. We advocate that detection ofP. acnesalways be accompanied by experiments validating the association between this bacterium and any clinical condition. PMID:26818667

  2. Propionibacterium acnes: Disease-Causing Agent or Common Contaminant? Detection in Diverse Patient Samples by Next-Generation Sequencing

    PubMed Central

    Friis-Nielsen, Jens; Vinner, Lasse; Hansen, Thomas Arn; Richter, Stine Raith; Fridholm, Helena; Herrera, Jose Alejandro Romero; Lund, Ole; Brunak, Søren; Izarzugaza, Jose M. G.; Mourier, Tobias; Nielsen, Lars Peter

    2016-01-01

    Propionibacterium acnes is the most abundant bacterium on human skin, particularly in sebaceous areas. P. acnes is suggested to be an opportunistic pathogen involved in the development of diverse medical conditions but is also a proven contaminant of human clinical samples and surgical wounds. Its significance as a pathogen is consequently a matter of debate. In the present study, we investigated the presence of P. acnes DNA in 250 next-generation sequencing data sets generated from 180 samples of 20 different sample types, mostly of cancerous origin. The samples were subjected to either microbial enrichment, involving nuclease treatment to reduce the amount of host nucleic acids, or shotgun sequencing. We detected high proportions of P. acnes DNA in enriched samples, particularly skin tissue-derived and other tissue samples, with the levels being higher in enriched samples than in shotgun-sequenced samples. P. acnes reads were detected in most samples analyzed, though the proportions in most shotgun-sequenced samples were low. Our results show that P. acnes can be detected in practically all sample types when molecular methods, such as next-generation sequencing, are employed. The possibility of contamination from the patient or other sources, including laboratory reagents or environment, should therefore always be considered carefully when P. acnes is detected in clinical samples. We advocate that detection of P. acnes always be accompanied by experiments validating the association between this bacterium and any clinical condition. PMID:26818667

  3. Amino acid sequence of the nonsecretory ribonuclease of human urine.

    PubMed

    Beintema, J J; Hofsteenge, J; Iwama, M; Morita, T; Ohgi, K; Irie, M; Sugiyama, R H; Schieven, G L; Dekker, C A; Glitz, D G

    1988-06-14

    The amino acid sequence of a nonsecretory ribonuclease isolated from human urine was determined except for the identity of the residue at position 7. Sequence information indicates that the ribonucleases of human liver and spleen and an eosinophil-derived neurotoxin are identical or very closely related gene products. The sequence is identical at about 30% of the amino acid positions with those of all of the secreted mammalian ribonucleases for which information is available. Identical residues include active-site residues histidine-12, histidine-119, and lysine-41, other residues known to be important for substrate binding and catalytic activity, and all eight half-cystine residues common to these enzymes. Major differences include a deletion of six residues in the (so-called) S-peptide loop, insertions of two, and nine residues, respectively, in three other external loops of the molecule, and an addition of three residues at the amino terminus. The sequence shows the human nonsecretory ribonuclease to belong to the same ribonuclease superfamily as the mammalian secretory ribonucleases, turtle pancreatic ribonuclease, and human angiogenin. Sequence data suggest that a gene duplication occurred in an ancient vertebrate ancestor; one branch led to the nonsecretory ribonuclease, while the other branch led to a second duplication, with one line leading to the secretory ribonucleases (in mammals) and the second line leading to pancreatic ribonuclease in turtle and an angiogenic factor in mammals (human angiogenin). The nonsecretory ribonuclease has five short carbohydrate chains attached via asparagine residues at the surface of the molecule; these chains may have been shortened by exoglycosidase action.(ABSTRACT TRUNCATED AT 250 WORDS) PMID:3166997

  4. Bacterial diversity in Adirondack mountain lakes as revealed by 16S rRNA gene sequences.

    PubMed Central

    Hiorns, W D; Methé, B A; Nierzwicki-Bauer, S A; Zehr, J P

    1997-01-01

    Bacterial communities of seven lakes in the Adirondack Mountains of New York State were characterized by amplification and sequencing of 16S ribosomal DNA. Analysis of over 100 partial sequences revealed a diverse collection of lineages, largely of the class Proteobacteria (19% alpha subdivision, 31% beta subdivision, and 9% gamma subdivision), the phylum Cytophaga-Flavobacteria-Bacteroides (15%), and the order Actinomycetales (18%). Additionally, a number of the sequences were similar to those of the order Verrucomicrobiales. However, few of the sequence types are closely related to those of characterized species. The relative contributions of the groups of sequences differed among the lakes, suggesting that bacterial population structure varies and that it may be possible to relate aquatic bacterial community structure to water chemistry. PMID:9212443

  5. Nucleotide Sequence Diversity and Linkage Disequilibrium of Four Nuclear Loci in Foxtail Millet (Setaria italica)

    PubMed Central

    He, Shui-lian; Yang, Yang; Morrell, Peter L.; Yi, Ting-shuang

    2015-01-01

    Foxtail millet (Setaria italica (L.) Beauv) is one of the earliest domesticated grains, which has been cultivated in northern China by 8,700 years before present (YBP) and across Eurasia by 4,000 YBP. Owing to a small genome and diploid nature, foxtail millet is a tractable model crop for studying functional genomics of millets and bioenergy grasses. In this study, we examined nucleotide sequence diversity, geographic structure, and levels of linkage disequilibrium at four nuclear loci (ADH1, G3PDH, IGS1 and TPI1) in representative samples of 311 landrace accessions across its cultivated range. Higher levels of nucleotide sequence and haplotype diversity were observed in samples from China relative to other sampled regions. Genetic assignment analysis classified the accessions into seven clusters based on nucleotide sequence polymorphisms. Intralocus LD decayed rapidly to half the initial value within ~1.2 kb or less. PMID:26325578

  6. Characterization and amino acid sequence of a fatty acid-binding protein from human heart.

    PubMed

    Offner, G D; Brecher, P; Sawlivich, W B; Costello, C E; Troxler, R F

    1988-05-15

    The complete amino acid sequence of a fatty acid-binding protein from human heart was determined by automated Edman degradation of CNBr, BNPS-skatole [3'-bromo-3-methyl-2-(2-nitrobenzenesulphenyl)indolenine], hydroxylamine, Staphylococcus aureus V8 proteinase, tryptic and chymotryptic peptides, and by digestion of the protein with carboxypeptidase A. The sequence of the blocked N-terminal tryptic peptide from citraconylated protein was determined by collisionally induced decomposition mass spectrometry. The protein contains 132 amino acid residues, is enriched with respect to threonine and lysine, lacks cysteine, has an acetylated valine residue at the N-terminus, and has an Mr of 14768 and an isoelectric point of 5.25. This protein contains two short internal repeated sequences from residues 48-54 and from residues 114-119 located within regions of predicted beta-structure and decreasing hydrophobicity. These short repeats are contained within two longer repeated regions from residues 48-60 and residues 114-125, which display 62% sequence similarity. These regions could accommodate the charged and uncharged moieties of long-chain fatty acids and may represent fatty acid-binding domains consistent with the finding that human heart fatty acid-binding protein binds 2 mol of oleate or palmitate/mol of protein. Detailed evidence for the amino acid sequences of the peptides has been deposited as Supplementary Publication SUP 50143 (23 pages) at the British Library Lending Division, Boston Spa, Yorkshire LS23 7BQ, U.K., from whom copies may be obtained as indicated in Biochem. J. (1988) 249, 5. PMID:3421901

  7. Molecular cloning and amino acid sequence of human 5-lipoxygenase

    SciTech Connect

    Matsumoto, T.; Funk, C.D.; Radmark, O.; Hoeoeg, J.O.; Joernvall, H.; Samuelsson, B.

    1988-01-01

    5-Lipoxygenase (EC 1.13.11.34), a Ca/sup 2 +/- and ATP-requiring enzyme, catalyzes the first two steps in the biosynthesis of the peptidoleukotrienes and the chemotactic factor leukotriene B/sub 4/. A cDNA clone corresponding to 5-lipoxygenase was isolated from a human lung lambda gt11 expression library by immunoscreening with a polyclonal antibody. Additional clones from a human placenta lambda gt11 cDNA library were obtained by plaque hybridization with the /sup 32/P-labeled lung cDNA clone. Sequence data obtained from several overlapping clones indicate that the composite DNAs contain the complete coding region for the enzyme. From the deduced primary structure, 5-lipoxygenase encodes a 673 amino acid protein with a calculated molecular weight of 77,839. Direct analysis of the native protein and its proteolytic fragments confirmed the deduced composition, the amino-terminal amino acid sequence, and the structure of many internal segments. 5-Lipoxygenase has no apparent sequence homology with leukotriene A/sub 4/ hydrolase or Ca/sup 2 +/-binding proteins. RNA blot analysis indicated substantial amounts of an mRNA species of approx. = 2700 nucleotides in leukocytes, lung, and placenta.

  8. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    DOEpatents

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  9. The amino acid sequence of rabbit muscle triose phosphate isomerase.

    PubMed Central

    Corran, P H; Waley, S G

    1975-01-01

    The amino acid sequence of rabbit muscle triose phosphate isomerase was deduced by characterizing peptides that overlap the tryptic peptides. Thiol groups were modified by oxidation, carboxymethylation or aminoen. About 50 peptides that provided information about overlaps were isolated; the peptides were mostly characterized by their compositions and N-terminal residues. The peptide chains contain 248 amino acid residues, and no evidence for dissimilarity of the two subunits that comprise the native enzyme was found. The sequence of the rabbit muscle enzyme may be compared with that of the coelacanth enzyme (Kolb et al., 1974): 84% of the residues are in identical positions. Similarly, comparison of the sequence with that inferred for the chicken enzyme (Furth et al., 1974) shows that 87% of the residues are in identical positions. Limited though these comparisons are, they suggest that triose phosphate isomerase has one of the lowest rates of evolutionary change. An extended version of the present paper has been deposited as Supplementary Publication SUP 50040 (42 pages) at the British Library (Lending Division) (formerly the National Lending Library for Science and Technology), Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms given in Biochem. J. (1975) 145, 5. PMID:1171682

  10. The amino acid sequence of chymopapain from Carica papaya.

    PubMed Central

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-01-01

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  11. The amino acid sequence of chymopapain from Carica papaya.

    PubMed

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-02-15

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  12. Archaeon and archaeal virus diversity classification via sequence entropy and fractal dimension

    NASA Astrophysics Data System (ADS)

    Tremberger, George, Jr.; Gallardo, Victor; Espinoza, Carola; Holden, Todd; Gadura, N.; Cheung, E.; Schneider, P.; Lieberman, D.; Cheung, T.

    2010-09-01

    Archaea are important potential candidates in astrobiology as their metabolism includes solar, inorganic and organic energy sources. Archaeal viruses would also be expected to be present in a sustainable archaeal exobiological community. Genetic sequence Shannon entropy and fractal dimension can be used to establish a two-dimensional measure for classification and phylogenetic study of these organisms. A sequence fractal dimension can be calculated from a numerical series consisting of the atomic numbers of each nucleotide. Archaeal 16S and 23S ribosomal RNA sequences were studied. Outliers in the 16S rRNA fractal dimension and entropy plot were found to be halophilic archaea. Positive correlation (R-square ~ 0.75, N = 18) was observed between fractal dimension and entropy across the studied species. The 16S ribosomal RNA sequence entropy correlates with the 23S ribosomal RNA sequence entropy across species with R-square 0.93, N = 18. Entropy values correspond positively with branch lengths of a published phylogeny. The studied archaeal virus sequences have high fractal dimensions of 2.02 or more. A comparison of selected extremophile sequences with archaeal sequences from the Humboldt Marine Ecosystem database (Wood-Hull Oceanography Institute, MIT) suggests the presence of continuous sequence expression as inferred from distributions of entropy and fractal dimension, consistent with the diversity expected in an exobiological archaeal community.

  13. Soil Parameters Drive the Structure, Diversity and Metabolic Potentials of the Bacterial Communities Across Temperate Beech Forest Soil Sequences.

    PubMed

    Jeanbille, M; Buée, M; Bach, C; Cébron, A; Frey-Klett, P; Turpault, M P; Uroz, S

    2016-02-01

    Soil and climatic conditions as well as land cover and land management have been shown to strongly impact the structure and diversity of the soil bacterial communities. Here, we addressed under a same land cover the potential effect of the edaphic parameters on the soil bacterial communities, excluding potential confounding factors as climate. To do this, we characterized two natural soil sequences occurring in the Montiers experimental site. Spatially distant soil samples were collected below Fagus sylvatica tree stands to assess the effect of soil sequences on the edaphic parameters, as well as the structure and diversity of the bacterial communities. Soil analyses revealed that the two soil sequences were characterized by higher pH and calcium and magnesium contents in the lower plots. Metabolic assays based on Biolog Ecoplates highlighted higher intensity and richness in usable carbon substrates in the lower plots than in the middle and upper plots, although no significant differences occurred in the abundance of bacterial and fungal communities along the soil sequences as assessed using quantitative PCR. Pyrosequencing analysis of 16S ribosomal RNA (rRNA) gene amplicons revealed that Proteobacteria, Acidobacteria and Bacteroidetes were the most abundantly represented phyla. Acidobacteria, Proteobacteria and Chlamydiae were significantly enriched in the most acidic and nutrient-poor soils compared to the Bacteroidetes, which were significantly enriched in the soils presenting the higher pH and nutrient contents. Interestingly, aluminium, nitrogen, calcium, nutrient availability and pH appeared to be the best predictors of the bacterial community structures along the soil sequences. PMID:26370112

  14. Diversity of lactic acid bacteria in two Flemish artisan raw milk Gouda-type cheeses.

    PubMed

    Van Hoorde, Koenraad; Verstraete, Tine; Vandamme, Peter; Huys, Geert

    2008-10-01

    PCR-denaturing gradient gel electrophoresis (PCR-DGGE) was used to study the diversity of lactic acid bacteria (LAB) in two Flemish artisan raw milk Gouda-type cheeses. In parallel, conventional culturing was performed. Isolates were identified using (GTG)(5)-PCR and sequence analysis of 16S rRNA and pheS genes. Discriminant analysis revealed some differences in overall LAB diversity between the two batches and between the two cheeses. Within each batch, the diversity of 8- and 12-week-old cheeses was relatively similar. Conventional isolation mainly revealed the presence of Lactobacillus paracasei, Lactobacillus plantarum, Lactobacillus brevis, Lactobacillus rhamnosus and Pediococcus pentosaceus. PCR-DGGE revealed the presence of three species of which no isolates were recovered, i.e. Enterococcus faecalis, Lactobacillus parabuchneri and Lactobacillus gallinarum. Conversely, not all isolated bacteria were detected by PCR-DGGE. We recommend the integrated use of culture-dependent and -independent approaches to maximally encompass the taxonomic spectrum of LAB occurring in Gouda-type and other cheeses. PMID:18721684

  15. Microbial Diversity and Population Structure of Extremely Acidic Sulfur-Oxidizing Biofilms From Sulfidic Caves

    NASA Astrophysics Data System (ADS)

    Jones, D.; Stoffer, T.; Lyon, E. H.; Macalady, J. L.

    2005-12-01

    Extremely acidic (pH 0-1) microbial biofilms called snottites form on the walls of sulfidic caves where gypsum replacement crusts isolate sulfur-oxidizing microorganisms from the buffering action of limestone host rock. We investigated the phylogeny and population structure of snottites from sulfidic caves in central Italy using full cycle rRNA methods. A small subunit rRNA bacterial clone library from a Frasassi cave complex snottite sample contained a single sequence group (>60 clones) similar to Acidithiobacillus thiooxidans. Bacterial and universal rRNA clone libraries from other Frasassi snottites were only slightly more diverse, containing a maximum of 4 bacterial species and probably 2 archaeal species. Fluorescence in situ hybridization (FISH) of snottites from Frasassi and from the much warmer Rio Garrafo cave complex revealed that all of the communities are simple (low-diversity) and dominated by Acidithiobacillus and/or Ferroplasma species, with smaller populations of an Acidimicrobium species, filamentous fungi, and protists. Our results suggest that sulfidic cave snottites will be excellent model microbial ecosystems suited for ecological and metagenomic studies aimed at elucidating geochemical and ecological controls on microbial diversity, and at mapping the spatial history of microbial evolutionary events such as adaptations, recombinations and gene transfers.

  16. Deep sequencing uncovers protistan plankton diversity in the Portuguese Ria Formosa solar saltern ponds.

    PubMed

    Filker, Sabine; Gimmler, Anna; Dunthorn, Micah; Mahé, Frédéric; Stoeck, Thorsten

    2015-03-01

    We used high-throughput sequencing to unravel the genetic diversity of protistan (including fungal) plankton in hypersaline ponds of the Ria Formosa solar saltern works in Portugal. From three ponds of different salinity (4, 12 and 38 %), we obtained ca. 105,000 amplicons (V4 region of the SSU rDNA). The genetic diversity we found was higher than what has been described from solar saltern ponds thus far by microscopy or molecular studies. The obtained operational taxonomic units (OTUs) could be assigned to 14 high-rank taxonomic groups and blasted to 120 eukaryotic families. The novelty of this genetic diversity was extremely high, with 27 % of all OTUs having a sequence divergence of more than 10 % to deposited sequences of described taxa. The highest degree of novelty was found at intermediate salinity of 12 % within the ciliates, which traditionally are considered as the best known and described taxon group within the kingdom Protista. Further substantial novelty was detected within the stramenopiles and the chlorophytes. Analyses of community structures suggest a transition boundary for protistan plankton between 4 and 12 % salinity, suggesting different haloadaptation strategies in individual evolutionary lineages as a result of environmental filtering. Our study makes evident the gaps in our knowledge not only of protistan and fungal plankton diversity in hypersaline environments, but also in their ecology and their strategies to cope with these environmental conditions. It substantiates that specific future research needs to fill these gaps. PMID:25472012

  17. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    PubMed

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment. PMID:23485423

  18. Increased Bile Acid Synthesis and Deconjugation After Biliopancreatic Diversion.

    PubMed

    Ferrannini, Ele; Camastra, Stefania; Astiarraga, Brenno; Nannipieri, Monica; Castro-Perez, Jose; Xie, Dan; Wang, Liangsu; Chakravarthy, Manu; Haeusler, Rebecca A

    2015-10-01

    Biliopancreatic diversion (BPD) improves insulin sensitivity and decreases serum cholesterol out of proportion with weight loss. Mechanisms of these effects are unknown. One set of proposed contributors to metabolic improvements after bariatric surgeries is bile acids (BAs). We investigated the early and late effects of BPD on plasma BA levels, composition, and markers of BA synthesis in 15 patients with type 2 diabetes (T2D). We compared these to the early and late effects of Roux-en-Y gastric bypass (RYGB) in 22 patients with T2D and 16 with normal glucose tolerance. Seven weeks after BPD, insulin sensitivity had doubled and serum cholesterol had halved. At this time, BA synthesis markers and total plasma BAs, particularly unconjugated BAs, had markedly risen; this effect could not be entirely explained by low FGF19. In contrast, after RYGB, insulin sensitivity improved gradually with weight loss and cholesterol levels declined marginally; BA synthesis markers were decreased at an early time point (2 weeks) after surgery and returned to the normal range 1 year later. These findings indicate that BA synthesis contributes to the decreased serum cholesterol after BPD. Moreover, they suggest a potential role for altered enterohepatic circulation of BAs in improving insulin sensitivity and cholesterol metabolism after BPD. PMID:26015549

  19. Amino acid sequence prerequisites for the formation of cn ions.

    PubMed

    Downard, K M; Biemann, K

    1993-11-01

    Ammo acid sequence prerequisites are described for the formation of c, ions observed in high-energy collision-induced decomposition spectra of peptides. It is shown that the formation of cn ions is promoted by the nature of the amino acid C-terminal to the cleavage site. A propensity for cn cleavage preceding threonine, and to a lesser extent tryptophan, lysine, and serine, is demonstrated where fragmentation is directed N-terminally at these residues. In addition, the nature of the residue N-terminal to the cleavage site is shown to have little effect on cn ion formation. A mechanism for cn ion formation is proposed and its applicability to the results observed is discussed. PMID:24227531

  20. Ultrasensitive nucleic acid sequence detection by single-molecule electrophoresis

    SciTech Connect

    Castro, A; Shera, E.B.

    1996-09-01

    This is the final report of a one-year laboratory-directed research and development project at Los Alamos National Laboratory. There has been considerable interest in the development of very sensitive clinical diagnostic techniques over the last few years. Many pathogenic agents are often present in extremely small concentrations in clinical samples, especially at the initial stages of infection, making their detection very difficult. This project sought to develop a new technique for the detection and accurate quantification of specific bacterial and viral nucleic acid sequences in clinical samples. The scheme involved the use of novel hybridization probes for the detection of nucleic acids combined with our recently developed technique of single-molecule electrophoresis. This project is directly relevant to the DOE`s Defense Programs strategic directions in the area of biological warfare counter-proliferation.

  1. High Sequence Variability, Diverse Subcellular Localizations, and Ecological Implications of Alkaline Phosphatase in Dinoflagellates and Other Eukaryotic Phytoplankton

    PubMed Central

    Lin, Xin; Zhang, Huan; Cui, Yudong; Lin, Senjie

    2012-01-01

    Alkaline phosphatase (AP) is a key enzyme for phytoplankton to utilize dissolved organic phosphorus (DOP) when dissolved inorganic phosphorus is limited. While three major types of AP and their correspondingly diverse subcellular localization have been recognized in bacteria, little is known about AP in eukaryotic phytoplankton such as dinoflagellates. Here, we isolated a full-length AP cDNA from a latest-diverging dinoflagellate genus Alexandrium, and conducted comparative analyses with homologs from a relatively basal (Amphidinium carterae) and late-diverging (Karenia brevis) lineage of dinoflagellates as well as other eukaryotic algae. New data and previous studies indicate that AP is common in dinoflagellates and most other major eukaryotic groups of phytoplankton. AP sequences are more variable than many other genes studied in dinoflagellates, and are divergent among different eukaryotic phytoplankton lineages. Sequence comparison to the other characterized APs suggests that dinoflagellates and some other eukaryotic phytoplankton possess the putative AP as phoA type, but some other eukaryotic phytoplankton seem to have other types. Phylogenetic analyses based on AP amino acid sequences indicated that the “red-type” eukaryotic lineages formed a monophyletic group, suggesting a common origin of their APs. As different amino acid sequences have been found to predictably determine different spatial distribution in the cells, which may facilitate access to different pools of DOP, existing computational models were adopted to predict the subcellular localizations of putative AP in the three dinoflagellates and other eukaryotic phytoplankton. Results showed different subcellular localizations of APs in different dinoflagellates and other lineages. The linkage between AP sequence divergence, subcellular localization, and ecological niche differentiation requires rigorous experimental verification, and this study now provides a framework for such a future effort

  2. Rapid quantification of mutant fitness in diverse bacteria by sequencing randomly bar-coded transposons

    SciTech Connect

    Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.; Lamson, Jacob S.; He, Jennifer; Hoover, Cindi A.; Blow, Matthew J.; Bristow, James; Butland, Gareth; Arkin, Adam P.; Deutschbauer, Adam

    2015-05-12

    Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with any transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative D-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. A large challenge in microbiology is the functional assessment of the millions of uncharacterized genes identified by genome sequencing. Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach to assign phenotypes and functions to genes

  3. Rapid quantification of mutant fitness in diverse bacteria by sequencing randomly bar-coded transposons

    DOE PAGESBeta

    Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.; Lamson, Jacob S.; He, Jennifer; Hoover, Cindi A.; Blow, Matthew J.; Bristow, James; Butland, Gareth; Arkin, Adam P.; et al

    2015-05-12

    Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with anymore » transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative D-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. A large challenge in microbiology is the functional assessment of the millions of uncharacterized genes identified by genome sequencing. Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach to assign phenotypes and functions to genes. However, the current strategies for TnSeq are

  4. High-accuracy identification of incident HIV-1 infections using a sequence clustering based diversity measure.

    PubMed

    Xia, Xia-Yu; Ge, Meng; Hsi, Jenny H; He, Xiang; Ruan, Yu-Hua; Wang, Zhi-Xin; Shao, Yi-Ming; Pan, Xian-Ming

    2014-01-01

    Accurate estimates of HIV-1 incidence are essential for monitoring epidemic trends and evaluating intervention efforts. However, the long asymptomatic stage of HIV-1 infection makes it difficult to effectively distinguish incident infections from chronic ones. Current incidence assays based on serology or viral sequence diversity are both still lacking in accuracy. In the present work, a sequence clustering based diversity (SCBD) assay was devised by utilizing the fact that viral sequences derived from each transmitted/founder (T/F) strain tend to cluster together at early stage, and that only the intra-cluster diversity is correlated with the time since HIV-1 infection. The dot-matrix pairwise alignment was used to eliminate the disproportional impact of insertion/deletions (indels) and recombination events, and so was the proportion of clusterable sequences (Pc) as an index to identify late chronic infections with declined viral genetic diversity. Tested on a dataset containing 398 incident and 163 chronic infection cases collected from the Los Alamos HIV database (last modified 2/8/2012), our SCBD method achieved 99.5% sensitivity and 98.8% specificity, with an overall accuracy of 99.3%. Further analysis and evaluation also suggested its performance was not affected by host factors such as the viral subtypes and transmission routes. The SCBD method demonstrated the potential of sequencing based techniques to become useful for identifying incident infections. Its use may be most advantageous for settings with low to moderate incidence relative to available resources. The online service is available at http://www.bioinfo.tsinghua.edu.cn:8080/SCBD/index.jsp. PMID:24925130

  5. Detection of Diverse Novel Bat Astrovirus Sequences in the Czech Republic.

    PubMed

    Dufkova, Lucie; Straková, Petra; Širmarová, Jana; Salát, Jiří; Moutelíková, Romana; Chrudimský, Tomáš; Bartonička, Tomáš; Nowotny, Norbert; Růžek, Daniel

    2015-08-01

    Astroviruses are a major cause of gastroenteritis in humans and animals. Recently, novel groups of astroviruses were identified in apparently healthy insectivorous bats. We report the detection of diverse novel astrovirus sequences in nine different European bat species: Eptesicus serotinus, Hypsugo savii, Myotis emarginatus, M. mystacinus, Nyctalus noctula, Pipistrellus nathusii or P. pygmaeus, P. pipistrellus, Vespertilio murinus, and Rhinolophus hipposideros. In six bat species, astrovirus sequences were detected for the first time. One astrovirus strain detected in R. hipposideros clustered phylogenetically with Chinese astrovirus strains originating from bats of the families Rhinolophidae and Hipposideridae. All other Czech astrovirus sequences from vesper bats formed, together with one Hungarian sequence, a separate monophyletic lineage within the bat astrovirus group. These findings provide new insights into the molecular epidemiology, ecology, and prevalence of astroviruses in European bat populations. PMID:26273815

  6. New Tools For Understanding Microbial Diversity Using High-throughput Sequence Data

    NASA Astrophysics Data System (ADS)

    Knight, R.; Hamady, M.; Liu, Z.; Lozupone, C.

    2007-12-01

    High-throughput sequencing techniques such as 454 are straining the limits of tools traditionally used to build trees, choose OTUs, and perform other essential sequencing tasks. We have developed a workflow for phylogenetic analysis of large-scale sequence data sets that combines existing tools, such as the Arb phylogeny package and the NAST multiple sequence alignment tool, with new methods for choosing and clustering OTUs and for performing phylogenetic community analysis with UniFrac. This talk discusses the cyberinfrastructure we are developing to support the human microbiome project, and the application of these workflows to analyze very large data sets that contrast the gut microbiota with a range of physical environments. These tools will ultimately help to define core and peripheral microbiomes in a range of environments, and will allow us to understand the physical and biotic factors that contribute most to differences in microbial diversity.

  7. Increasing Sequence Diversity with Flexible Backbone Protein Design: The Complete Redesign of a Protein Hydrophobic Core

    SciTech Connect

    Murphy, Grant S.; Mills, Jeffrey L.; Miley, Michael J.; Machius, Mischa; Szyperski, Thomas; Kuhlman, Brian

    2015-10-15

    Protein design tests our understanding of protein stability and structure. Successful design methods should allow the exploration of sequence space not found in nature. However, when redesigning naturally occurring protein structures, most fixed backbone design algorithms return amino acid sequences that share strong sequence identity with wild-type sequences, especially in the protein core. This behavior places a restriction on functional space that can be explored and is not consistent with observations from nature, where sequences of low identity have similar structures. Here, we allow backbone flexibility during design to mutate every position in the core (38 residues) of a four-helix bundle protein. Only small perturbations to the backbone, 12 {angstrom}, were needed to entirely mutate the core. The redesigned protein, DRNN, is exceptionally stable (melting point >140C). An NMR and X-ray crystal structure show that the side chains and backbone were accurately modeled (all-atom RMSD = 1.3 {angstrom}).

  8. Analysis of environmental 18S ribosomal RNA sequences reveals unknown diversity of the cosmopolitan phylum Telonemia.

    PubMed

    Shalchian-Tabrizi, Kamran; Kauserud, Håvard; Massana, Ramon; Klaveness, Dag; Jakobsen, Kjetill S

    2007-04-01

    Telonemia has recently been described as a new eukaryotic phylum with uncertain evolutionary origin. So far, only two Telonemia species, Telonema subtilis and Telonema antarcticum, have been described, but there are substantial variations in size and morphology among Telonema isolates and field observations, indicating a hidden diversity of Telonemia-like species and populations. In this study, we investigated the diversity and the global distribution of this group by analyzing 18S rDNA sequences from marine environmental clone libraries published in GenBank as well as several unpublished sequences from the Indian Ocean. Phylogenetic analyses of the identified sequences suggest that the Telonemia phylum includes several undescribed 18S rDNA phylotypes, probably corresponding to a number of different species and/or populations. The Telonemia phylotypes form two main groups, here referred to as Telonemia Groups 1 and 2. Some of the closely related sequences originate from separate oceans, indicating worldwide distributions of various Telonemia phylotypes, while other phylotypes seem to have limited geographical distribution. Further investigations of the evolutionary relationships within Telonemia should be conducted on isolated cultures of Telonema-like strains using multi-locus sequencing and morphological data. PMID:17196879

  9. Antigenic and sequence diversity in gonococcal transferrin-binding protein A.

    PubMed

    Cornelissen, C N; Anderson, J E; Boulton, I C; Sparling, P F

    2000-08-01

    Neisseria gonorrhoeae is a gram-negative pathogen that is capable of satisfying its iron requirement with human iron-binding proteins such as transferrin and lactoferrin. Transferrin-iron utilization involves specific binding of human transferrin at the cell surface to what is believed to be a complex of two iron-regulated, transferrin-binding proteins, TbpA and TbpB. The genes encoding these proteins have been cloned and sequenced from a number of pathogenic, gram-negative bacteria. In the current study, we sequenced four additional tbpA genes from other N. gonorrhoeae strains to begin to assess the sequence diversity among gonococci. We compared these sequences to those from other pathogenic bacteria to identify conserved regions that might be important for the structure and function of these receptors. We generated polyclonal mouse sera against synthetic peptides deduced from the TbpA sequence from gonococcal strain FA19. Most of these synthetic peptides were predicted to correspond to surface-exposed regions of TbpA. We found that, while most reacted with denatured TbpA in Western blots, only one antipeptide serum reacted with native TbpA in the context of intact gonococci, consistent with surface exposure of the peptide to which this serum was raised. In addition, we evaluated a panel of gonococcal strains for antigenic diversity using these antipeptide sera. PMID:10899879

  10. Novel chytrid lineages dominate fungal sequences in diverse marine and freshwater habitats

    PubMed Central

    Comeau, André M.; Vincent, Warwick F.; Bernier, Louis; Lovejoy, Connie

    2016-01-01

    In aquatic environments, fungal communities remain little studied despite their taxonomic and functional diversity. To extend the ecological coverage of this group, we conducted an in-depth analysis of fungal sequences within our collection of 3.6 million V4 18S rRNA pyrosequences originating from 319 individual marine (including sea-ice) and freshwater samples from libraries generated within diverse projects studying Arctic and temperate biomes in the past decade. Among the ~1.7 million post-filtered reads of highest taxonomic and phylogenetic quality, 23,263 fungal sequences were identified. The overall mean proportion was 1.35%, but with large variability; for example, from 0.01 to 59% of total sequences for Arctic seawater samples. Almost all sample types were dominated by Chytridiomycota-like sequences, followed by moderate-to-minor contributions of Ascomycota, Cryptomycota and Basidiomycota. Species and/or strain richness was high, with many novel sequences and high niche separation. The affinity of the most common reads to phytoplankton parasites suggests that aquatic fungi deserve renewed attention for their role in algal succession and carbon cycling. PMID:27444055

  11. Novel chytrid lineages dominate fungal sequences in diverse marine and freshwater habitats

    NASA Astrophysics Data System (ADS)

    Comeau, André M.; Vincent, Warwick F.; Bernier, Louis; Lovejoy, Connie

    2016-07-01

    In aquatic environments, fungal communities remain little studied despite their taxonomic and functional diversity. To extend the ecological coverage of this group, we conducted an in-depth analysis of fungal sequences within our collection of 3.6 million V4 18S rRNA pyrosequences originating from 319 individual marine (including sea-ice) and freshwater samples from libraries generated within diverse projects studying Arctic and temperate biomes in the past decade. Among the ~1.7 million post-filtered reads of highest taxonomic and phylogenetic quality, 23,263 fungal sequences were identified. The overall mean proportion was 1.35%, but with large variability; for example, from 0.01 to 59% of total sequences for Arctic seawater samples. Almost all sample types were dominated by Chytridiomycota-like sequences, followed by moderate-to-minor contributions of Ascomycota, Cryptomycota and Basidiomycota. Species and/or strain richness was high, with many novel sequences and high niche separation. The affinity of the most common reads to phytoplankton parasites suggests that aquatic fungi deserve renewed attention for their role in algal succession and carbon cycling.

  12. Novel chytrid lineages dominate fungal sequences in diverse marine and freshwater habitats.

    PubMed

    Comeau, André M; Vincent, Warwick F; Bernier, Louis; Lovejoy, Connie

    2016-01-01

    In aquatic environments, fungal communities remain little studied despite their taxonomic and functional diversity. To extend the ecological coverage of this group, we conducted an in-depth analysis of fungal sequences within our collection of 3.6 million V4 18S rRNA pyrosequences originating from 319 individual marine (including sea-ice) and freshwater samples from libraries generated within diverse projects studying Arctic and temperate biomes in the past decade. Among the ~1.7 million post-filtered reads of highest taxonomic and phylogenetic quality, 23,263 fungal sequences were identified. The overall mean proportion was 1.35%, but with large variability; for example, from 0.01 to 59% of total sequences for Arctic seawater samples. Almost all sample types were dominated by Chytridiomycota-like sequences, followed by moderate-to-minor contributions of Ascomycota, Cryptomycota and Basidiomycota. Species and/or strain richness was high, with many novel sequences and high niche separation. The affinity of the most common reads to phytoplankton parasites suggests that aquatic fungi deserve renewed attention for their role in algal succession and carbon cycling. PMID:27444055

  13. Phylogenetic diversity in the genus Bacillus as seen by 16S rRNA sequencing studies

    NASA Technical Reports Server (NTRS)

    Rossler, D.; Ludwig, W.; Schleifer, K. H.; Lin, C.; McGill, T. J.; Wisotzkey, J. D.; Jurtshuk, P. Jr; Fox, G. E.

    1991-01-01

    Comparative sequence analysis of 16S ribosomal (r)RNAs or DNAs of Bacillus alvei, B. laterosporus, B. macerans, B. macquariensis, B. polymyxa and B. stearothermophilus revealed the phylogenetic diversity of the genus Bacillus. Based on the presently available data set of 16S rRNA sequences from bacilli and relatives at least four major "Bacillus clusters" can be defined: a "Bacillus subtilis cluster" including B. stearothermophilus, a "B. brevis cluster" including B. laterosporus, a "B. alvei cluster" including B. macerans, B. maquariensis and B. polymyxa and a "B. cycloheptanicus branch".

  14. Microbial diversity at the moderate acidic stage in three different sulfidic mine tailings dumps generating acid mine drainage.

    PubMed

    Korehi, Hananeh; Blöthe, Marco; Schippers, Axel

    2014-11-01

    In freshly deposited sulfidic mine tailings the pH is alkaline or circumneutral. Due to pyrite or pyrrhotite oxidation the pH is dropping over time to pH values <3 at which acidophilic iron- and sulfur-oxidizing prokaryotes prevail and accelerate the oxidation processes, well described for several mine waste sites. The microbial communities at the moderate acidic stage in mine tailings are only scarcely studied. Here we investigated the microbial diversity via 16S rRNA gene sequence analysis in eight samples (pH range 3.2-6.5) from three different sulfidic mine tailings dumps in Botswana, Germany and Sweden. In total 701 partial 16S rRNA gene sequences revealed a divergent microbial community between the three sites and at different tailings depths. Proteobacteria and Firmicutes were overall the most abundant phyla in the clone libraries. Acidobacteria, Actinobacteria, Bacteroidetes, and Nitrospira occurred less frequently. The found microbial communities were completely different to microbial communities in tailings at

  15. Diversity through duplication: whole-genome sequencing reveals novel gene retrocopies in the human population.

    PubMed

    Richardson, Sandra R; Salvador-Palomeque, Carmen; Faulkner, Geoffrey J

    2014-05-01

    Gene retrocopies are generated by reverse transcription and genomic integration of mRNA. As such, retrocopies present an important exception to the central dogma of molecular biology, and have substantially impacted the functional landscape of the metazoan genome. While an estimated 8,000-17,000 retrocopies exist in the human genome reference sequence, the extent of variation between individuals in terms of retrocopy content has remained largely unexplored. Three recent studies by Abyzov et al., Ewing et al. and Schrider et al. have exploited 1,000 Genomes Project Consortium data, as well as other sources of whole-genome sequencing data, to uncover novel gene retrocopies. Here, we compare the methods and results of these three studies, highlight the impact of retrocopies in human diversity and genome evolution, and speculate on the potential for somatic gene retrocopies to impact cancer etiology and genetic diversity among individual neurons in the mammalian brain. PMID:24615986

  16. Myeloma Ig heavy chain V region sequences reveal prior antigenic selection and marked somatic mutation but no intraclonal diversity

    SciTech Connect

    Vescio, R.A.; Cao, J.; Hong, C.H.

    1995-09-01

    The IgV{sub H} region sequence in 48 patients with multiple myeloma (MM) was analyzed to characterize the malignant cell of origin. The sequences were obtained after amplification of bone marrow cDNA by using V{sub H} family-specific and C{sub H} primers, then compared with either directly sequenced patient germ-line or published V{sub H} gene sequences to assay for somatic mutation. Because somatic hypermutation of the V{sub H} gene occurs late in B cell development, its presence has been helpful in determining the cell of origin in other B cell malignancies. Overall, a median of 8.2% of the nucleotides had evidence of substitution within each V{sub H} gene sequence (range = 2.7% to 16.5%), which is more prevalent than in any other reported tumor type. Strong evidence of prior antigenic selection pressure was also evident. The ratio of nucleotide substitutions that resulted in amino acid replacement was significantly higher in the complementarity-determining region than in the framework region (3.25 vs. 1.56, respectively; p < 0.00005). No V{sub H} gene intraclonal diversity was noted, despite sequencing multiple clones (3-16) from each patient, nor was there evidence of further V{sub H} gene somatic mutation over the course of three patients` disease. These findings strongly imply that the malignant clone in MM evolves from a cell late in B cell development. 63 refs., 4 figs., 2 tabs.

  17. AST: An Automated Sequence-Sampling Method for Improving the Taxonomic Diversity of Gene Phylogenetic Trees

    PubMed Central

    Zhou, Chan; Mao, Fenglou; Yin, Yanbin; Huang, Jinling; Gogarten, Johann Peter; Xu, Ying

    2014-01-01

    A challenge in phylogenetic inference of gene trees is how to properly sample a large pool of homologous sequences to derive a good representative subset of sequences. Such a need arises in various applications, e.g. when (1) accuracy-oriented phylogenetic reconstruction methods may not be able to deal with a large pool of sequences due to their high demand in computing resources; (2) applications analyzing a collection of gene trees may prefer to use trees with fewer operational taxonomic units (OTUs), for instance for the detection of horizontal gene transfer events by identifying phylogenetic conflicts; and (3) the pool of available sequences is biased towards extensively studied species. In the past, the creation of subsamples often relied on manual selection. Here we present an Automated sequence-Sampling method for improving the Taxonomic diversity of gene phylogenetic trees, AST, to obtain representative sequences that maximize the taxonomic diversity of the sampled sequences. To demonstrate the effectiveness of AST, we have tested it to solve four problems, namely, inference of the evolutionary histories of the small ribosomal subunit protein S5 of E. coli, 16 S ribosomal RNAs and glycosyl-transferase gene family 8, and a study of ancient horizontal gene transfers from bacteria to plants. Our results show that the resolution of our computational results is almost as good as that of manual inference by domain experts, hence making the tool generally useful to phylogenetic studies by non-phylogeny specialists. The program is available at http://csbl.bmb.uga.edu/~zhouchan/AST.php. PMID:24892935

  18. The pig gut microbial diversity: Understanding the pig gut microbial ecology through the next generation high throughput sequencing.

    PubMed

    Kim, Hyeun Bum; Isaacson, Richard E

    2015-06-12

    The importance of the gut microbiota of animals is widely acknowledged because of its pivotal roles in the health and well being of animals. The genetic diversity of the gut microbiota contributes to the overall development and metabolic needs of the animal, and provides the host with many beneficial functions including production of volatile fatty acids, re-cycling of bile salts, production of vitamin K, cellulose digestion, and development of immune system. Thus the intestinal microbiota of animals has been the subject of study for many decades. Although most of the older studies have used culture dependent methods, the recent advent of high throughput sequencing of 16S rRNA genes has facilitated in depth studies exploring microbial populations and their dynamics in the animal gut. These culture independent DNA based studies generate large amounts of data and as a result contribute to a more detailed understanding of the microbiota dynamics in the gut and the ecology of the microbial populations. Of equal importance, is being able to identify and quantify microbes that are difficult to grow or that have not been grown in the laboratory. Interpreting the data obtained from this type of study requires using basic principles of microbial diversity to understand importance of the composition of microbial populations. In this review, we summarize the literature on culture independent studies of the pig gut microbiota with an emphasis on its succession and alterations caused by diverse factors. PMID:25843944

  19. Comparison of a High-Resolution Melting Assay to Next-Generation Sequencing for Analysis of HIV Diversity

    PubMed Central

    Cousins, Matthew M.; Ou, San-San; Wawer, Maria J.; Munshaw, Supriya; Swan, David; Magaret, Craig A.; Mullis, Caroline E.; Serwadda, David; Porcella, Stephen F.; Gray, Ronald H.; Quinn, Thomas C.; Donnell, Deborah; Eshleman, Susan H.

    2012-01-01

    Next-generation sequencing (NGS) has recently been used for analysis of HIV diversity, but this method is labor-intensive, costly, and requires complex protocols for data analysis. We compared diversity measures obtained using NGS data to those obtained using a diversity assay based on high-resolution melting (HRM) of DNA duplexes. The HRM diversity assay provides a single numeric score that reflects the level of diversity in the region analyzed. HIV gag and env from individuals in Rakai, Uganda, were analyzed in a previous study using NGS (n = 220 samples from 110 individuals). Three sequence-based diversity measures were calculated from the NGS sequence data (percent diversity, percent complexity, and Shannon entropy). The amplicon pools used for NGS were analyzed with the HRM diversity assay. HRM scores were significantly associated with sequence-based measures of HIV diversity for both gag and env (P < 0.001 for all measures). The level of diversity measured by the HRM diversity assay and NGS increased over time in both regions analyzed (P < 0.001 for all measures except for percent complexity in gag), and similar amounts of diversification were observed with both methods (P < 0.001 for all measures except for percent complexity in gag). Diversity measures obtained using the HRM diversity assay were significantly associated with those from NGS, and similar increases in diversity over time were detected by both methods. The HRM diversity assay is faster and less expensive than NGS, facilitating rapid analysis of large studies of HIV diversity and evolution. PMID:22785188

  20. Strategies for Achieving High Sequencing Accuracy for Low Diversity Samples and Avoiding Sample Bleeding Using Illumina Platform

    PubMed Central

    Mitra, Abhishek; Skrzypczak, Magdalena; Ginalski, Krzysztof; Rowicka, Maga

    2015-01-01

    Sequencing microRNA, reduced representation sequencing, Hi-C technology and any method requiring the use of in-house barcodes result in sequencing libraries with low initial sequence diversity. Sequencing such data on the Illumina platform typically produces low quality data due to the limitations of the Illumina cluster calling algorithm. Moreover, even in the case of diverse samples, these limitations are causing substantial inaccuracies in multiplexed sample assignment (sample bleeding). Such inaccuracies are unacceptable in clinical applications, and in some other fields (e.g. detection of rare variants). Here, we discuss how both problems with quality of low-diversity samples and sample bleeding are caused by incorrect detection of clusters on the flowcell during initial sequencing cycles. We propose simple software modifications (Long Template Protocol) that overcome this problem. We present experimental results showing that our Long Template Protocol remarkably increases data quality for low diversity samples, as compared with the standard analysis protocol; it also substantially reduces sample bleeding for all samples. For comprehensiveness, we also discuss and compare experimental results from alternative approaches to sequencing low diversity samples. First, we discuss how the low diversity problem, if caused by barcodes, can be avoided altogether at the barcode design stage. Second and third, we present modified guidelines, which are more stringent than the manufacturer’s, for mixing low diversity samples with diverse samples and lowering cluster density, which in our experience consistently produces high quality data from low diversity samples. Fourth and fifth, we present rescue strategies that can be applied when sequencing results in low quality data and when there is no more biological material available. In such cases, we propose that the flowcell be re-hybridized and sequenced again using our Long Template Protocol. Alternatively, we discuss how

  1. Rapid Quantification of Mutant Fitness in Diverse Bacteria by Sequencing Randomly Bar-Coded Transposons

    PubMed Central

    Wetmore, Kelly M.; Price, Morgan N.; Waters, Robert J.; Lamson, Jacob S.; He, Jennifer; Hoover, Cindi A.; Blow, Matthew J.; Bristow, James; Butland, Gareth

    2015-01-01

    ABSTRACT Transposon mutagenesis with next-generation sequencing (TnSeq) is a powerful approach to annotate gene function in bacteria, but existing protocols for TnSeq require laborious preparation of every sample before sequencing. Thus, the existing protocols are not amenable to the throughput necessary to identify phenotypes and functions for the majority of genes in diverse bacteria. Here, we present a method, random bar code transposon-site sequencing (RB-TnSeq), which increases the throughput of mutant fitness profiling by incorporating random DNA bar codes into Tn5 and mariner transposons and by using bar code sequencing (BarSeq) to assay mutant fitness. RB-TnSeq can be used with any transposon, and TnSeq is performed once per organism instead of once per sample. Each BarSeq assay requires only a simple PCR, and 48 to 96 samples can be sequenced on one lane of an Illumina HiSeq system. We demonstrate the reproducibility and biological significance of RB-TnSeq with Escherichia coli, Phaeobacter inhibens, Pseudomonas stutzeri, Shewanella amazonensis, and Shewanella oneidensis. To demonstrate the increased throughput of RB-TnSeq, we performed 387 successful genome-wide mutant fitness assays representing 130 different bacterium-carbon source combinations and identified 5,196 genes with significant phenotypes across the five bacteria. In P. inhibens, we used our mutant fitness data to identify genes important for the utilization of diverse carbon substrates, including a putative d-mannose isomerase that is required for mannitol catabolism. RB-TnSeq will enable the cost-effective functional annotation of diverse bacteria using mutant fitness profiling. PMID:25968644

  2. Bacterial Community Structure of Acid-Impacted Lakes: What Controls Diversity?▿ †

    PubMed Central

    Percent, Sascha F.; Frischer, Marc E.; Vescio, Paul A.; Duffy, Ellen B.; Milano, Vincenzo; McLellan, Maggie; Stevens, Brett M.; Boylen, Charles W.; Nierzwicki-Bauer, Sandra A.

    2008-01-01

    Although it is recognized that acidification of freshwater systems results in decreased overall species richness of plants and animals, little is known about the response of aquatic microbial communities to acidification. In this study we examined bacterioplankton community diversity and structure in 18 lakes located in the Adirondack Park (in the state of New York in the United States) that were affected to various degrees by acidic deposition and assessed correlations with 31 physical and chemical parameters. The pH of these lakes ranged from 4.9 to 7.8. These studies were conducted as a component of the Adirondack Effects Assessment Program supported by the U.S. Environmental Protection Agency. Thirty-one independent 16S rRNA gene libraries consisting of 2,135 clones were constructed from epilimnion and hypolimnion water samples. Bacterioplankton community composition was determined by sequencing and amplified ribosomal DNA restriction analysis of the clone libraries. Nineteen bacterial classes representing 95 subclasses were observed, but clone libraries were dominated by representatives of the Actinobacteria and Betaproteobacteria classes. Although the diversity and richness of bacterioplankton communities were positively correlated with pH, the overall community composition assessed by principal component analysis was not. The strongest correlations were observed between bacterioplankton communities and lake depth, hydraulic retention time, dissolved inorganic carbon, and nonlabile monomeric aluminum concentrations. While there was not an overall correlation between bacterioplankton community structure and pH, several bacterial classes, including the Alphaproteobacteria, were directly correlated with acidity. These results indicate that unlike more identifiable correlations between acidity and species richness for higher trophic levels, controls on bacterioplankton community structure are likely more complex, involving both direct and indirect processes. PMID

  3. Mineralogical Controls on Microbial Diversity in a Sulfuric Acid Karst System

    NASA Astrophysics Data System (ADS)

    Jones, A. A.; Bennett, P.

    2011-12-01

    The role mineralogy plays on microbial community distribution, composition, niche differentiation, and accumulation is a complex and nebulous association. Microbial phylogenetic diversity and bacterial composition of communities obtained from Lower Kane Cave (LKC), WY, USA, were studied using next generation bacterial 16S rRNA sequencing techniques. The microbial consortium found within LKC was found to be primarily composed of neutrophilic sulfur-oxidizing members of the gamma- and epsilon-proteobacteria . The microbial population within LKC has been instigated in previous studies to have a significant role in the processes of sulfuric acid speleogenesis. Using a LKC biomat as the inoculant in a series of 3 nutrient limited laboratory reactor experiments, and a pure culture of Thiothrix unzii (ATCC type strain 49747) in a parallel experiment, we found that both limestone and dolostone substratum consistently had higher biomass accumulation than silicate minerals in the same reactor. At the Class level, the carbonate substratum (Calcite, Limestone, and Dolostone) had ~84% - 88.7% of phylotypes in common. Aside from Basalt (Simpson's Index, D of 0.53), the carbonate substratum produced the least diverse phylotype distributions. Feldspar and quartz were colonized by the most diverse communities with Simpson's Index values of 0.16 and 0.31. Evaluation of metabolic guild distribution shows that potential neutrophilic sulfur-oxidizers have an affinity for acid neutralizing carbonate substrata over silicate substrata. These potential sulfur-oxidizing guilds compose ~28%-38% of the total microbial community. For feldspar and chert substratum, potential sulfur-oxidizing metabolic guilds composed merely ~5% of the total microbial community. The quartz substratum, in contrast, was uniquely populated by potential acidophilic sulfur-oxidizers Acidithiobacillus and Acidithiomicrobium; composing ~19% of the total community. A quartz substratum may offer these acidophiles a

  4. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    NASA Astrophysics Data System (ADS)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  5. Fungal diversity in grape must and wine fermentation assessed by massive sequencing, quantitative PCR and DGGE.

    PubMed

    Wang, Chunxiao; García-Fernández, David; Mas, Albert; Esteve-Zarzoso, Braulio

    2015-01-01

    The diversity of fungi in grape must and during wine fermentation was investigated in this study by culture-dependent and culture-independent techniques. Carignan and Grenache grapes were harvested from three vineyards in the Priorat region (Spain) in 2012, and nine samples were selected from the grape must after crushing and during wine fermentation. From culture-dependent techniques, 362 isolates were randomly selected and identified by 5.8S-ITS-RFLP and 26S-D1/D2 sequencing. Meanwhile, genomic DNA was extracted directly from the nine samples and analyzed by qPCR, DGGE and massive sequencing. The results indicated that grape must after crushing harbored a high species richness of fungi with Aspergillus tubingensis, Aureobasidium pullulans, or Starmerella bacillaris as the dominant species. As fermentation proceeded, the species richness decreased, and yeasts such as Hanseniaspora uvarum, Starmerella bacillaris and Saccharomyces cerevisiae successively occupied the must samples. The "terroir" characteristics of the fungus population are more related to the location of the vineyard than to grape variety. Sulfur dioxide treatment caused a low effect on yeast diversity by similarity analysis. Because of the existence of large population of fungi on grape berries, massive sequencing was more appropriate to understand the fungal community in grape must after crushing than the other techniques used in this study. Suitable target sequences and databases were necessary for accurate evaluation of the community and the identification of species by the 454 pyrosequencing of amplicons. PMID:26557110

  6. Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis

    PubMed Central

    Zheng, Jin-shuang; Sun, Cheng-zhen; Zhang, Shu-ning; Hou, Xi-lin; Bonnema, Guusje

    2016-01-01

    A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis. PMID:27507974

  7. Cytogenetic Diversity of Simple Sequences Repeats in Morphotypes of Brassica rapa ssp. chinensis.

    PubMed

    Zheng, Jin-Shuang; Sun, Cheng-Zhen; Zhang, Shu-Ning; Hou, Xi-Lin; Bonnema, Guusje

    2016-01-01

    A significant fraction of the nuclear DNA of all eukaryotes is comprised of simple sequence repeats (SSRs). Although these sequences are widely used for studying genetic variation, linkage mapping and evolution, little attention had been paid to the chromosomal distribution and cytogenetic diversity of these sequences. In this paper, we report the distribution characterization of mono-, di-, and tri-nucleotide SSRs in Brassica rapa ssp. chinensis. Fluorescence in situ hybridization was used to characterize the cytogenetic diversity of SSRs among morphotypes of B. rapa ssp. chinensis. The proportion of different SSR motifs varied among morphotypes of B. rapa ssp. chinensis, with tri-nucleotide SSRs being more prevalent in the genome of B. rapa ssp. chinensis. We determined the chromosomal locations of mono-, di-, and tri-nucleotide repeat loci. The results showed that the chromosomal distribution of SSRs in the different morphotypes is non-random and motif-dependent, and allowed us to characterize the relative variability in terms of SSR numbers and similar chromosomal distributions in centromeric/peri-centromeric heterochromatin. The differences between SSR repeats with respect to abundance and distribution indicate that SSRs are a driving force in the genomic evolution of B. rapa species. Our results provide a comprehensive view of the SSR sequence distribution and evolution for comparison among morphotypes B. rapa ssp. chinensis. PMID:27507974

  8. Sequence and diversity of the rat delta T-cell receptor.

    PubMed

    Watson, D; Ando, T; Knight, J F

    2000-07-01

    The cDNA sequence of the delta T-cell receptor (TCRD) in the adult Lewis rat thymus was determined using the technique of rapid amplification of cDNA ends. Sixteen variable region genes (TCRDV), two diversity regions (TCRDD), two joining regions (TCRDJ), and a single constant region gene (TCRDC) were identified. The sixteen unique TCRDV genes identified represented eight different subfamilies in the rat and were highly conserved (>80% nucleotide identity) to corresponding mouse sequences. Extensive junctional diversity was observed in the rat, with both TCRDD regions (TCRDD1 and TCRDD2) utilized in the majority of cDNA clones identified. The two TCRDJ genes were highly conserved and corresponded to TCRDJ1 and TCRDJ2 in the mouse; the majority of clones utilized TCRDJ1. The TCRDC region in the rat was 91.1% identical to the mouse TCRDC gene and was highly conserved to other species. Although extensive sequence information about mouse gamma-delta T-cell receptor genes is available, current knowledge of rat gamma-delta T-cells is limited. The sequence analysis presented in this study adds to our understanding of gamma-delta T-cells in general, and it may be utilized to study the role of gamma-delta T-cells in immune-mediated disease and transplantation models previously established in the rat. PMID:10941843

  9. Fungal diversity in grape must and wine fermentation assessed by massive sequencing, quantitative PCR and DGGE

    PubMed Central

    Wang, Chunxiao; García-Fernández, David; Mas, Albert; Esteve-Zarzoso, Braulio

    2015-01-01

    The diversity of fungi in grape must and during wine fermentation was investigated in this study by culture-dependent and culture-independent techniques. Carignan and Grenache grapes were harvested from three vineyards in the Priorat region (Spain) in 2012, and nine samples were selected from the grape must after crushing and during wine fermentation. From culture-dependent techniques, 362 isolates were randomly selected and identified by 5.8S-ITS-RFLP and 26S-D1/D2 sequencing. Meanwhile, genomic DNA was extracted directly from the nine samples and analyzed by qPCR, DGGE and massive sequencing. The results indicated that grape must after crushing harbored a high species richness of fungi with Aspergillus tubingensis, Aureobasidium pullulans, or Starmerella bacillaris as the dominant species. As fermentation proceeded, the species richness decreased, and yeasts such as Hanseniaspora uvarum, Starmerella bacillaris and Saccharomyces cerevisiae successively occupied the must samples. The “terroir” characteristics of the fungus population are more related to the location of the vineyard than to grape variety. Sulfur dioxide treatment caused a low effect on yeast diversity by similarity analysis. Because of the existence of large population of fungi on grape berries, massive sequencing was more appropriate to understand the fungal community in grape must after crushing than the other techniques used in this study. Suitable target sequences and databases were necessary for accurate evaluation of the community and the identification of species by the 454 pyrosequencing of amplicons. PMID:26557110

  10. Phylogeny and genetic diversity of Bridgeoporus nobilissimus inferred using mitochondrial and nuclear rDNA sequences

    USGS Publications Warehouse

    Redberg, G.L.; Hibbett, D.S.; Ammirati, J.F., Jr.; Rodriguez, R.J.

    2003-01-01

    The genetic diversity and phylogeny of Bridgeoporus nobilissimus have been analyzed. DNA was extracted from spores collected from individual fruiting bodies representing six geographically distinct populations in Oregon and Washington. Spore samples collected contained low levels of bacteria, yeast and a filamentous fungal species. Using taxon-specific PCR primers, it was possible to discriminate among rDNA from bacteria, yeast, a filamentous associate and B. nobilissimus. Nuclear rDNA internal transcribed spacer (ITS) region sequences of B. nobilissimus were compared among individuals representing six populations and were found to have less than 2% variation. These sequences also were used to design dual and nested PCR primers for B. nobilissimus-specific amplification. Mitochondrial small-subunit rDNA sequences were used in a phylogenetic analysis that placed B. nobilissimus in the hymenochaetoid clade, where it was associated with Oxyporus and Schizopora.

  11. Analysis of genetic variation and diversity of Rice stripe virus populations through high-throughput sequencing.

    PubMed

    Huang, Lingzhe; Li, Zefeng; Wu, Jianxiang; Xu, Yi; Yang, Xiuling; Fan, Longjiang; Fang, Rongxiang; Zhou, Xueping

    2015-01-01

    Plant RNA viruses often generate diverse populations in their host plants through error-prone replication and recombination. Recent studies on the genetic diversity of plant RNA viruses in various host plants have provided valuable information about RNA virus evolution and emergence of new diseases caused by RNA viruses. We analyzed and compared the genetic diversity of Rice stripe virus (RSV) populations in Oryza sativa (a natural host of RSV) and compared it with that of the RSV populations generated in an infection of Nicotiana benthamiana, an experimental host of RSV, using the high-throughput sequencing technology. From infected O. sativa and N. benthamiana plants, a total of 341 and 1675 site substitutions were identified in the RSV genome, respectively, and the average substitution ratio in these sites was 1.47 and 7.05 %, respectively, indicating that the RSV populations from infected N. benthamiana plant are more diverse than those from infected O. sativa plant. Our result gives a direct evidence that virus might allow higher genetic diversity for host adaptation. PMID:25852724

  12. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... in the sequence. (4) The enumeration of amino acids may start at the first amino acid of the first..., counting backwards starting with the amino acid next to number 1. Otherwise, the enumeration of amino acids... sequence every 5 amino acids. The enumeration method for amino acid sequences that is set forth......

  13. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... in the sequence. (4) The enumeration of amino acids may start at the first amino acid of the first..., counting backwards starting with the amino acid next to number 1. Otherwise, the enumeration of amino acids... sequence every 5 amino acids. The enumeration method for amino acid sequences that is set forth......

  14. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... in the sequence. (4) The enumeration of amino acids may start at the first amino acid of the first..., counting backwards starting with the amino acid next to number 1. Otherwise, the enumeration of amino acids... sequence every 5 amino acids. The enumeration method for amino acid sequences that is set forth......

  15. Multilocus Sequence Analysis for Assessment of Phylogenetic Diversity and Biogeography in Thalassospira Bacteria from Diverse Marine Environments

    PubMed Central

    Yuan, Jun; Du, Juan; Wang, Liping; Sun, Fengqin; Shao, Zongze

    2014-01-01

    Thalassospira bacteria are widespread and have been isolated from various marine environments. Less is known about their genetic diversity and biogeography, as well as their role in marine environments, many of them cannot be discriminated merely using the 16S rRNA gene. To address these issues, in this report, the phylogenetic analysis of 58 strains from seawater and deep sea sediments were carried out using the multilocus sequence analysis (MLSA) based on acsA, aroE, gyrB, mutL, rpoD and trpB genes, and the DNA-DNA hybridization (DDH) and average nucleotide identity (ANI) based on genome sequences. The MLSA analysis demonstrated that the 58 strains were clearly separated into 15 lineages, corresponding to seven validly described species and eight potential novel species. The DDH and ANI values further confirmed the validity of the MLSA analysis and eight potential novel species. The MLSA interspecies gap of the genus Thalassospira was determined to be 96.16–97.12% sequence identity on the basis of the combined analyses of the DDH and MLSA, while the ANIm interspecies gap was 95.76–97.20% based on the in silico DDH analysis. Meanwhile, phylogenetic analyses showed that the Thalassospira bacteria exhibited distribution pattern to a certain degree according to geographic regions. Moreover, they clustered together according to the habitats depth. For short, the phylogenetic analyses and biogeography of the Thalassospira bacteria were systematically investigated for the first time. These results will be helpful to explore further their ecological role and adaptive evolution in marine environments. PMID:25198177

  16. Predicting protein disorder by analyzing amino acid sequence

    PubMed Central

    Yang, Jack Y; Yang, Mary Qu

    2008-01-01

    Background Many protein regions and some entire proteins have no definite tertiary structure, presenting instead as dynamic, disorder ensembles under different physiochemical circumstances. These proteins and regions are known as Intrinsically Unstructured Proteins (IUP). IUP have been associated with a wide range of protein functions, along with roles in diseases characterized by protein misfolding and aggregation. Results Identifying IUP is important task in structural and functional genomics. We exact useful features from sequences and develop machine learning algorithms for the above task. We compare our IUP predictor with PONDRs (mainly neural-network-based predictors), disEMBL (also based on neural networks) and Globplot (based on disorder propensity). Conclusion We find that augmenting features derived from physiochemical properties of amino acids (such as hydrophobicity, complexity etc.) and using ensemble method proved beneficial. The IUP predictor is a viable alternative software tool for identifying IUP protein regions and proteins. PMID:18831799

  17. Multilocus Sequence Typing (MLST) for Lineage Assignment and High Resolution Diversity Studies in Trypanosoma cruzi

    PubMed Central

    Yeo, Matthew; Mauricio, Isabel L.; Messenger, Louisa A.; Lewis, Michael D.; Llewellyn, Martin S.; Acosta, Nidia; Bhattacharyya, Tapan; Diosque, Patricio; Carrasco, Hernan J.; Miles, Michael A.

    2011-01-01

    Background Multilocus sequence typing (MLST) is a powerful and highly discriminatory method for analysing pathogen population structure and epidemiology. Trypanosoma cruzi, the protozoan agent of American trypanosomiasis (Chagas disease), has remarkable genetic and ecological diversity. A standardised MLST protocol that is suitable for assignment of T. cruzi isolates to genetic lineage and for higher resolution diversity studies has not been developed. Methodology/Principal Findings We have sequenced and diplotyped nine single copy housekeeping genes and assessed their value as part of a systematic MLST scheme for T. cruzi. A minimum panel of four MLST targets (Met-III, RB19, TcGPXII, and DHFR-TS) was shown to provide unambiguous assignment of isolates to the six known T. cruzi lineages (Discrete Typing Units, DTUs TcI-TcVI). In addition, we recommend six MLST targets (Met-II, Met-III, RB19, TcMPX, DHFR-TS, and TR) for more in depth diversity studies on the basis that diploid sequence typing (DST) with this expanded panel distinguished 38 out of 39 reference isolates. Phylogenetic analysis implies a subdivision between North and South American TcIV isolates. Single Nucleotide Polymorphism (SNP) data revealed high levels of heterozygosity among DTUs TcI, TcIII, TcIV and, for three targets, putative corresponding homozygous and heterozygous loci within DTUs TcI and TcIII. Furthermore, individual gene trees gave incongruent topologies at inter- and intra-DTU levels, inconsistent with a model of strict clonality. Conclusions/Significance We demonstrate the value of systematic MLST diplotyping for describing inter-DTU relationships and for higher resolution diversity studies of T. cruzi, including presence of recombination events. The high levels of heterozygosity will facilitate future population genetics analysis based on MLST haplotypes. PMID:21713026

  18. The amino acid alphabet and the architecture of the protein sequence-structure map. I. Binary alphabets.

    PubMed

    Ferrada, Evandro

    2014-12-01

    The correspondence between protein sequences and structures, or sequence-structure map, relates to fundamental aspects of structural, evolutionary and synthetic biology. The specifics of the mapping, such as the fraction of accessible sequences and structures, or the sequences' ability to fold fast, are dictated by the type of interactions between the monomers that compose the sequences. The set of possible interactions between monomers is encapsulated by the potential energy function. In this study, I explore the impact of the relative forces of the potential on the architecture of the sequence-structure map. My observations rely on simple exact models of proteins and random samples of the space of potential energy functions of binary alphabets. I adopt a graph perspective and study the distribution of viable sequences and the structures they produce, as networks of sequences connected by point mutations. I observe that the relative proportion of attractive, neutral and repulsive forces defines types of potentials, that induce sequence-structure maps of vastly different architectures. I characterize the properties underlying these differences and relate them to the structure of the potential. Among these properties are the expected number and relative distribution of sequences associated to specific structures and the diversity of structures as a function of sequence divergence. I study the types of binary potentials observed in natural amino acids and show that there is a strong bias towards only some types of potentials, a bias that seems to characterize the folding code of natural proteins. I discuss implications of these observations for the architecture of the sequence-structure map of natural proteins, the construction of random libraries of peptides, and the early evolution of the natural amino acid alphabet. PMID:25473967

  19. The Amino Acid Alphabet and the Architecture of the Protein Sequence-Structure Map. I. Binary Alphabets

    PubMed Central

    Ferrada, Evandro

    2014-01-01

    The correspondence between protein sequences and structures, or sequence-structure map, relates to fundamental aspects of structural, evolutionary and synthetic biology. The specifics of the mapping, such as the fraction of accessible sequences and structures, or the sequences' ability to fold fast, are dictated by the type of interactions between the monomers that compose the sequences. The set of possible interactions between monomers is encapsulated by the potential energy function. In this study, I explore the impact of the relative forces of the potential on the architecture of the sequence-structure map. My observations rely on simple exact models of proteins and random samples of the space of potential energy functions of binary alphabets. I adopt a graph perspective and study the distribution of viable sequences and the structures they produce, as networks of sequences connected by point mutations. I observe that the relative proportion of attractive, neutral and repulsive forces defines types of potentials, that induce sequence-structure maps of vastly different architectures. I characterize the properties underlying these differences and relate them to the structure of the potential. Among these properties are the expected number and relative distribution of sequences associated to specific structures and the diversity of structures as a function of sequence divergence. I study the types of binary potentials observed in natural amino acids and show that there is a strong bias towards only some types of potentials, a bias that seems to characterize the folding code of natural proteins. I discuss implications of these observations for the architecture of the sequence-structure map of natural proteins, the construction of random libraries of peptides, and the early evolution of the natural amino acid alphabet. PMID:25473967

  20. Exploiting genes and functional diversity of chlorogenic acid and luteolin biosyntheses in Lonicera japonica and their substitutes.

    PubMed

    Yuan, Yuan; Wang, Zhouyong; Jiang, Chao; Wang, Xumin; Huang, Luqi

    2014-01-25

    Chlorogenic acids (CGAs) and luteolin are active compounds in Lonicera japonica, a plant of high medicinal value in traditional Chinese medicine. This study provides a comprehensive overview of gene families involved in chlorogenic acid and luteolin biosynthesis in L. japonica, as well as its substitutes Lonicera hypoglauca and Lonicera macranthoides. The gene sequence feature and gene expression patterns in various tissues and buds of the species were characterized. Bioinformatics analysis revealed that 14 chlorogenic acid and luteolin biosynthesis-related genes were identified from the L. japonica transcriptome assembly. Phylogenetic analyses suggested that the function of individual gene could be differentiation and induce active compound diversity. Their orthologous genes were also recognized in L. hypoglauca and L. macranthoides genomic datasets, except for LHCHS1 and LMC4H2. The expression patterns of these genes are different in the tissues of L. japonica, L. hypoglauca and L. macranthoides. Results also showed that CGAs were controlled in the first step of biosynthesis, whereas both steps controlled luteolin in the bud of L. japonica. The expression of LJFNS2 exhibited positive correlation with luteolin levels in L. japonica. This study provides significant information for understanding the functional diversity of gene families involved in chlorogenic acid and the luteolin biosynthesis, active compound diversity of L. japonica and its substitutes, and the different usages of the three species. PMID:23085319

  1. HIV-1 intrapatient sequence diversity in the immunogenic V3 region

    SciTech Connect

    Korber, B.; Myers, G. ); Wolinsky, S.; Kunstman, K.; Levy, R.; Furtado, M.; Otto, P. . Medical School); Haynes, B. . Dept. of Medicine)

    1991-11-12

    The third hypervariable domain (V3) of the human immunodeficiency virus type-1 (HIV-1) envelope protein (env) can serve as an epitope for potent type-specific neutralizing antibodies (NAbs) -- thus short peptides predicted on the most commonly found variants of the antigenic tip of the V3 loop have been considered as potential candidates for an HIV peptide vaccine. To evaluate the extent of intrapatient variation in the immunogenic crest of the V3 loop, sequence sets were analyzed from individuals for whom multiple V3 sequences were available. Several strategies for selecting the best sets of hexapeptides to represent the variable tip of the V3 loop were considered and their effectiveness was evaluated by comparing them with the sequence sets from individuals. Most individuals carried at least one, and frequently many, variants that did not match any of the sequences from among the ten most common hexapeptides. Intrapatient viral sequence variation was increased by including sequences derived from brain biopsy specimens as well as from blood. Additionally, sequences obtained from brain specimens of different individuals had common elements which were not conserved in the corresponding blood samples, suggesting that certain amino acids in the V3 loop may be requisite for viral propagation in the CNS.

  2. A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

    PubMed Central

    2010-01-01

    Background Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the repeat may be disseminated by

  3. Hunting Down Frame Shifts: Ecological Analysis of Diverse Functional Gene Sequences

    PubMed Central

    Strejcek, Michal; Wang, Qiong; Ridl, Jakub; Uhlik, Ondrej

    2015-01-01

    Functional gene ecological analyses using amplicon sequencing can be challenging as translated sequences are often burdened with shifted reading frames. The aim of this work was to evaluate several bioinformatics tools designed to correct errors which arise during sequencing in an effort to reduce the number of frameshifts (FS). Genes encoding for alpha subunits of biphenyl (bphA) and benzoate (benA) dioxygenases were used as model sequences. FrameBot, a FS correction tool, was able to reduce the number of detected FS to zero. However, up to 44% of sequences were discarded by FrameBot as non-specific targets. Therefore, we proposed a de novo mode of FrameBot for FS correction, which works on a similar basis as common chimera identifying platforms and is not dependent on reference sequences. By nature of FrameBot de novo design, it is crucial to provide it with data as error free as possible. We tested the ability of several publicly available correction tools to decrease the number of errors in the data sets. The combination of maximum expected error filtering and single linkage pre-clustering proved to be the most efficient read processing approach. Applying FrameBot de novo on the processed data enabled analysis of BphA sequences with minimal losses of potentially functional sequences not homologous to those previously known. This experiment also demonstrated the extensive diversity of dioxygenases in soil. A script which performs FrameBot de novo is presented in the supplementary material to the study or available at https://github.com/strejcem/FBdenovo. The tool was also implemented into FunGene Pipeline available at http://fungene.cme.msu.edu/FunGenePipeline/. PMID:26635739

  4. Twenty-One Genome Sequences from Pseudomonas Species and 19 Genome Sequences from Diverse Bacteria Isolated from the Rhizosphere and Endosphere of Populus deltoides

    SciTech Connect

    Brown, Steven D; Utturkar, Sagar M; Klingeman, Dawn Marie; Johnson, Courtney M; Martin, Stanton; Land, Miriam L; Lu, Tse-Yuan; Schadt, Christopher Warren; Doktycz, Mitchel John; Pelletier, Dale A

    2012-01-01

    To aid in the investigation of the Populus deltoides microbiome we generated draft genome sequences for twenty one Pseudomonas and twenty one other diverse bacteria isolated from Populus deltoides roots. Genome sequences for isolates similar to Acidovorax, Bradyrhizobium, Brevibacillus, Burkholderia, Caulobacter, Chryseobacterium, Flavobacterium, Herbaspirillum, Novosphingobium, Pantoea, Phyllobacterium, Polaromonas, Rhizobium, Sphingobium and Variovorax were generated.

  5. Bacterial community compositions in sediment polluted by perfluoroalkyl acids (PFAAs) using Illumina high-throughput sequencing.

    PubMed

    Sun, Yajun; Wang, Tieyu; Peng, Xiawei; Wang, Pei; Lu, Yonglong

    2016-06-01

    The characterization of bacterial community compositions and the change in perfluoroalkyl acids (PFAAs) along a natural river distribution system were explored in the present study. Illumina high-throughput sequencing was used to explore bacterial community diversity and structure in sediment polluted by PFAAs from the Xiaoqing River, the area with concentrated fluorochemical facilities in China. The concentration of PFAAs was in the range of 8.44-465.60 ng/g dry weight (dw) in sediment. Perfluorooctanoic acid (PFOA) was the dominant PFAA in all samples, which accounted for 94.2 % of total PFAAs. High-level PFOA could lead to an obvious increase in relative abundance of Proteobacteria, ε-Proteobacteria, Thiobacillus, and Sulfurimonas and the decrease in relative abundance of other bacteria. Redundancy analysis revealed that PFOA played an important role in the formation of bacterial community, and PFOA at higher concentration could reduce the diversity of bacterial community. When the concentration of PFOA was below 100 ng/g dw in sediment, no significant effect on microbial community structure was observed. Thiobacillus and Sulfurimonas were positively correlated with the concentration of PFOA, suggesting that both genera were resistant to PFOA contamination. PMID:26780047

  6. Diversity and distribution of unicellular opisthokonts along the European coast analyzed using high-throughput sequencing

    PubMed Central

    del Campo, Javier; Mallo, Diego; Massana, Ramon; de Vargas, Colomban; Richards, Thomas A.; Ruiz-Trillo, Iñaki

    2015-01-01

    Summary The opisthokonts are one of the major super-groups of eukaryotes. It comprises two major clades: 1) the Metazoa and their unicellular relatives and 2) the Fungi and their unicellular relatives. There is, however, little knowledge of the role of opisthokont microbes in many natural environments, especially among non-metazoan and non-fungal opisthokonts. Here we begin to address this gap by analyzing high throughput 18S rDNA and 18S rRNA sequencing data from different European coastal sites, sampled at different size fractions and depths. In particular, we analyze the diversity and abundance of choanoflagellates, filastereans, ichthyosporeans, nucleariids, corallochytreans and their related lineages. Our results show the great diversity of choanoflagellates in coastal waters as well as a relevant role of the ichthyosporeans and the uncultured marine opisthokonts (MAOP). Furthermore, we describe a new lineage of marine fonticulids (MAFO) that appears to be abundant in sediments. Therefore, our work points to a greater potential ecological role for unicellular opisthokonts than previously appreciated in marine environments, both in water column and sediments, and also provides evidence of novel opisthokont phylogenetic lineages. This study highlights the importance of high throughput sequencing approaches to unravel the diversity and distribution of both known and novel eukaryotic lineages. PMID:25556908

  7. Genetic diversity in marine algal virus communities as revealed by sequence analysis of DNA polymerase genes.

    PubMed Central

    Chen, F; Suttle, C A; Short, S M

    1996-01-01

    Algal-virus-specific PCR primers were used to amplify DNA polymerase gene (pol) fragments (683 to 689 bp) from the virus-sized fraction (0.02 to 0.2 microns) concentrated from inshore and offshore water samples collected from the Gulf of Mexico. Algal-virus-like DNA pol genes were detected in five samples collected from the surface and deep chlorophyll maximum. PCR products from an offshore station were cloned, and the genetic diversity of 33 fragments was examined by restriction fragment length polymorphism and sequence analysis. The five different genotypes or operational taxonomic units (OTUs) that were identified on the basis of restriction fragment length polymorphism banding patterns were present in different relative abundances (9 to 34%). One clone from each OTU was sequenced, and phylogenetic analysis showed that all of the OTUs fell within the family Phycodnaviridae. Four of the OTUs fell within a group of viruses (MpV) which infect the photosynthetic picoplankter Micromonas pusilla. The genetic diversity among these genotypes was as large as that previously found for MpV isolates from different oceans. The remaining genotype formed its own clade between viruses which infect M. pusilla and Chrysochromulina brevifilum. These results imply that marine virus communities contain a diverse assemblage of MpV-like viruses, as well as other unknown members of the Phycodnaviridae. PMID:8702280

  8. Universal primers for the amplification and sequence analysis of actin-1 from diverse mosquito species.

    PubMed

    Staley, Molly; Dorman, Karin S; Bartholomay, Lyric C; Fernández-Salas, Ildefonso; Farfan-Ale, Jose A; Loroño-Pino, Maria A; Garcia-Rejon, Julian E; Ibarra-Juarez, Luis; Blitvich, Bradley J

    2010-06-01

    We report the development of universal primers for the reverse-transcription polymerase chain reaction (RT-PCR) amplification and nucleotide sequence analysis of actin cDNAs from taxonomically diverse mosquito species. Primers specific to conserved regions of the invertebrate actin-1 gene were designed after actin cDNA sequences of Anopheles gambiae, Bombyx mori, Drosophila melanogaster, and Caenorhabditis elegans. The efficacy of these primers was determined by RT-PCR with the use of total RNA from mosquitoes belonging to 30 species and 8 genera (Aedes, Anopheles, Culex, Deinocerites, Mansonia, Psorophora, Toxorhynchites, and Wyeomyia). The RT-PCR products were sequenced, and sequence data were used to design additional primers. One primer pair, denoted as Act-2F (5'-ATGGTCGGYATGGGNCAGAAGGACTC-3') and Act-8R (5'-GATTCCATACCCAGGAAGGADGG-3'), successfully amplified an RT-PCR product of the expected size (683-nt) in all mosquito spp. tested. We propose that this primer pair can be used as an internal control to test the quality of RNA from mosquitoes collected in vector surveillance studies. These primers can also be used in molecular experiments in which the detection, amplification or silencing of a ubiquitously expressed mosquito housekeeping gene is necessary. Sequence and phylogenetic data are also presented in this report. PMID:20649132

  9. Diverse and Widespread Contamination Evident in the Unmapped Depths of High Throughput Sequencing Data

    PubMed Central

    Lusk, Richard W.

    2014-01-01

    Trace quantities of contaminating DNA are widespread in the laboratory environment, but their presence has received little attention in the context of high throughput sequencing. This issue is highlighted by recent works that have rested controversial claims upon sequencing data that appear to support the presence of unexpected exogenous species. I used reads that preferentially aligned to alternate genomes to infer the distribution of potential contaminant species in a set of independent sequencing experiments. I confirmed that dilute samples are more exposed to contaminating DNA, and, focusing on four single-cell sequencing experiments, found that these contaminants appear to originate from a wide diversity of clades. Although negative control libraries prepared from ‘blank’ samples recovered the highest-frequency contaminants, low-frequency contaminants, which appeared to make heterogeneous contributions to samples prepared in parallel within a single experiment, were not well controlled for. I used these results to show that, despite heavy replication and plausible controls, contamination can explain all of the observations used to support a recent claim that complete genes pass from food to human blood. Contamination must be considered a potential source of signals of exogenous species in sequencing data, even if these signals are replicated in independent experiments, vary across conditions, or indicate a species which seems a priori unlikely to contaminate. Negative control libraries processed in parallel are essential to control for contaminant DNAs, but their limited ability to recover low-frequency contaminants must be recognized. PMID:25354084

  10. Combining genomic sequencing methods to explore viral diversity and reveal potential virus-host interactions.

    PubMed

    Chow, Cheryl-Emiliane T; Winget, Danielle M; White, Richard A; Hallam, Steven J; Suttle, Curtis A

    2015-01-01

    Viral diversity and virus-host interactions in oxygen-starved regions of the ocean, also known as oxygen minimum zones (OMZs), remain relatively unexplored. Microbial community metabolism in OMZs alters nutrient and energy flow through marine food webs, resulting in biological nitrogen loss and greenhouse gas production. Thus, viruses infecting OMZ microbes have the potential to modulate community metabolism with resulting feedback on ecosystem function. Here, we describe viral communities inhabiting oxic surface (10 m) and oxygen-starved basin (200 m) waters of Saanich Inlet, a seasonally anoxic fjord on the coast of Vancouver Island, British Columbia using viral metagenomics and complete viral fosmid sequencing on samples collected between April 2007 and April 2010. Of 6459 open reading frames (ORFs) predicted across all 34 viral fosmids, 77.6% (n = 5010) had no homology to reference viral genomes. These fosmids recruited a higher proportion of viral metagenomic sequences from Saanich Inlet than from nearby northeastern subarctic Pacific Ocean (Line P) waters, indicating differences in the viral communities between coastal and open ocean locations. While functional annotations of fosmid ORFs were limited, recruitment to NCBI's non-redundant "nr" database and publicly available single-cell genomes identified putative viruses infecting marine thaumarchaeal and SUP05 proteobacteria to provide potential host linkages with relevance to coupled biogeochemical cycling processes in OMZ waters. Taken together, these results highlight the power of coupled analyses of multiple sequence data types, such as viral metagenomic and fosmid sequence data with prokaryotic single cell genomes, to chart viral diversity, elucidate genomic and ecological contexts for previously unclassifiable viral sequences, and identify novel host interactions in natural and engineered ecosystems. PMID:25914678

  11. Combining genomic sequencing methods to explore viral diversity and reveal potential virus-host interactions

    PubMed Central

    Chow, Cheryl-Emiliane T.; Winget, Danielle M.; White, Richard A.; Hallam, Steven J.; Suttle, Curtis A.

    2015-01-01

    Viral diversity and virus-host interactions in oxygen-starved regions of the ocean, also known as oxygen minimum zones (OMZs), remain relatively unexplored. Microbial community metabolism in OMZs alters nutrient and energy flow through marine food webs, resulting in biological nitrogen loss and greenhouse gas production. Thus, viruses infecting OMZ microbes have the potential to modulate community metabolism with resulting feedback on ecosystem function. Here, we describe viral communities inhabiting oxic surface (10 m) and oxygen-starved basin (200 m) waters of Saanich Inlet, a seasonally anoxic fjord on the coast of Vancouver Island, British Columbia using viral metagenomics and complete viral fosmid sequencing on samples collected between April 2007 and April 2010. Of 6459 open reading frames (ORFs) predicted across all 34 viral fosmids, 77.6% (n = 5010) had no homology to reference viral genomes. These fosmids recruited a higher proportion of viral metagenomic sequences from Saanich Inlet than from nearby northeastern subarctic Pacific Ocean (Line P) waters, indicating differences in the viral communities between coastal and open ocean locations. While functional annotations of fosmid ORFs were limited, recruitment to NCBI's non-redundant “nr” database and publicly available single-cell genomes identified putative viruses infecting marine thaumarchaeal and SUP05 proteobacteria to provide potential host linkages with relevance to coupled biogeochemical cycling processes in OMZ waters. Taken together, these results highlight the power of coupled analyses of multiple sequence data types, such as viral metagenomic and fosmid sequence data with prokaryotic single cell genomes, to chart viral diversity, elucidate genomic and ecological contexts for previously unclassifiable viral sequences, and identify novel host interactions in natural and engineered ecosystems. PMID:25914678

  12. Diversity of prokaryotic community at a shallow marine hydrothermal site elucidated by Illumina sequencing technology.

    PubMed

    Lentini, Valeria; Gugliandolo, Concetta; Bunk, Boyke; Overmann, Jörg; Maugeri, Teresa L

    2014-10-01

    To investigate the prokaryotic community structure and composition in an active hydrothermal site, named Black Point, off Panarea Island (Eolian Islands, Italy), we examined sediment and fluid samples, differing in temperature, by a massive parallel sequencing (Illumina) technique targeting the V3 region of the 16S rRNA gene. The used technique enabled us to detect a greater prokaryotic diversity than that until now observed and to reveal also microorganisms occurring at very low abundance (≤0.01 %). Most of sequences were assigned to Bacteria while Archaea were a minor component of the microbial community in both low- and high-temperature samples. Proteobacteria (mainly consisting of Alpha-, Gamma-, and Epsilonproteobacteria) dominated among all samples followed by Actinobacteria and Bacteroidetes. Analyzed DNA obtained from samples taken at different temperatures indicated the presence of members of different dominant genera. The main differences were observed between sediment samples where Rhodovulum and Thiohalospira prevailed at high temperature, while Thalassomonas and Sulfurimonas at low temperature. Chlorobium, Acinetobacter, Sulfurimonas, and Brevundimonas were abundant in both low- and high-temperature fluid samples. Euryarchaeota dominated the archaeal community in all samples. Classes of Euryarchaeota embracing hyperthermophilic members (Thermococci and Thermoplasmata) and of Crenarchaeota (Thermoprotei) were more abundant in high-temperature samples. A great number of sequences referred to Bacteria and Archaea still remained unaffiliated, indicating that Black Point site represents a rich source of so-far uncharted prokaryotic diversity. PMID:24849732

  13. Application of mitochondrial genes sequences for measuring the genetic diversity of Arabian oryx.

    PubMed

    Khan, Haseeb A; Arif, Ibrahim A; Shobrak, Mohammad; Homaidan, Ali A Al; Farhan, Ahmad H Al; Sadoon, Mohammad Al

    2011-01-01

    Arabian oryx (Oryx leucoryx) had faced extinction in the wild more than three decades ago and was saved by the prudent efforts of captive breeding programs. A clear understanding of the molecular diversity of contemporary Arabian oryx population is important for the long term success of captive breeding and reintroduction of this potentially endangered species. We have sequenced the segments of mitochondrial DNA including12S rRNA, 16S rRNA, cytochrome b (Cyt-b) and control region (CR) genes of 24 captive-bred and reintroduced animals. Although the sequences of 12S rRNA, 16S rRNA and Cyt-b were found to be identical for all the samples, typical sequence variations in the CR gene were observed in the form of 7 haplotypes. One of these haplotypes has been reported earlier while the remaining 6 haplotypes are novel and represent different lineages from the founders. The haplotype and nucleotide diversities were found to be 0.789 and 0.009 respectively. The genetic distances among the 7 mtDNA haplotypes varied from 0.001 to 0.017. These findings are of potential relevance to the management of captive breeding programs for the conservation of Arabian oryx. PMID:21498924

  14. NEP: web server for epitope prediction based on antibody neutralization of viral strains with diverse sequences.

    PubMed

    Chuang, Gwo-Yu; Liou, David; Kwong, Peter D; Georgiev, Ivelin S

    2014-07-01

    Delineation of the antigenic site, or epitope, recognized by an antibody can provide clues about functional vulnerabilities and resistance mechanisms, and can therefore guide antibody optimization and epitope-based vaccine design. Previously, we developed an algorithm for antibody-epitope prediction based on antibody neutralization of viral strains with diverse sequences and validated the algorithm on a set of broadly neutralizing HIV-1 antibodies. Here we describe the implementation of this algorithm, NEP (Neutralization-based Epitope Prediction), as a web-based server. The users must supply as input: (i) an alignment of antigen sequences of diverse viral strains; (ii) neutralization data for the antibody of interest against the same set of antigen sequences; and (iii) (optional) a structure of the unbound antigen, for enhanced prediction accuracy. The prediction results can be downloaded or viewed interactively on the antigen structure (if supplied) from the web browser using a JSmol applet. Since neutralization experiments are typically performed as one of the first steps in the characterization of an antibody to determine its breadth and potency, the NEP server can be used to predict antibody-epitope information at no additional experimental costs. NEP can be accessed on the internet at http://exon.niaid.nih.gov/nep. PMID:24782517

  15. High diversity of picornaviruses in rats from different continents revealed by deep sequencing.

    PubMed

    Hansen, Thomas Arn; Mollerup, Sarah; Nguyen, Nam-Phuong; White, Nicole E; Coghlan, Megan; Alquezar-Planas, David E; Joshi, Tejal; Jensen, Randi Holm; Fridholm, Helena; Kjartansdóttir, Kristín Rós; Mourier, Tobias; Warnow, Tandy; Belsham, Graham J; Bunce, Michael; Willerslev, Eske; Nielsen, Lars Peter; Vinner, Lasse; Hansen, Anders Johannes

    2016-01-01

    Outbreaks of zoonotic diseases in humans and livestock are not uncommon, and an important component in containment of such emerging viral diseases is rapid and reliable diagnostics. Such methods are often PCR-based and hence require the availability of sequence data from the pathogen. Rattus norvegicus (R. norvegicus) is a known reservoir for important zoonotic pathogens. Transmission may be direct via contact with the animal, for example, through exposure to its faecal matter, or indirectly mediated by arthropod vectors. Here we investigated the viral content in rat faecal matter (n=29) collected from two continents by analyzing 2.2 billion next-generation sequencing reads derived from both DNA and RNA. Among other virus families, we found sequences from members of the Picornaviridae to be abundant in the microbiome of all the samples. Here we describe the diversity of the picornavirus-like contigs including near-full-length genomes closely related to the Boone cardiovirus and Theiler's encephalomyelitis virus. From this study, we conclude that picornaviruses within R. norvegicus are more diverse than previously recognized. The virome of R. norvegicus should be investigated further to assess the full potential for zoonotic virus transmission. PMID:27530749

  16. Genetic diversity and relationship of chicory (Cichorium intybus L.) using sequence-related amplified polymorphism markers.

    PubMed

    Liang, X Y; Zhang, X Q; Bai, S Q; Huang, L K; Luo, X M; Ji, Y; Jiang, L F

    2014-01-01

    Chicory is a crop with economically important roles and is cultivated worldwide. The genetic diversity and relationship of 80 accessions of chicories and endives were evaluated by sequence-related amplified polymorphism (SRAP) markers to provide a theoretical basis for future breeding programs in China. The polymorphic rate was 96.83%, and the average polymorphic information content was 0.323, suggesting the rich genetic diversity of chicory. The genetic diversity degree of chicory was higher (GS = 0.677) than that of endive (GS = 0.701). The accessions with the highest genetic diversity (effective number of alleles, NE = 1.609; Nei's genetic diversity, H = 0.372; Shannon information index, I = 0.556) were from Italy. The richest genetic diversity was revealed in a chicory line (NE = 1.478, H = 0.289, I = 0.443) among the 3 types (line, wild, and cultivar). The chicory genetic structure of 8 geographical groups showed that the genetic differentiation coefficient (GST) was 14.20% and the number of immigrants per generation (Nm) was 3.020. A GST of 6.80% and an Nm of 6.853 were obtained from different types. This observation suggests that these chicory lines, especially those from the Mediterranean region, have potential for providing rich genetic resources for further breeding programs, that the chicory genetic structure among different countries obviously differs with a certain amount of gene flow, and that SRAP markers could be applied to analyze genetic relationships and classifications of Cichorium intybus and C. endivia. PMID:25299087

  17. Diversity of lactic acid bacteria isolated from Brazilian water buffalo mozzarella cheese.

    PubMed

    Silva, Luana Faria; Casella, Tiago; Gomes, Elisangela Soares; Nogueira, Mara Correa Lelles; De Dea Lindner, Juliano; Penna, Ana Lúcia Barretto

    2015-02-01

    The water buffalo mozzarella cheese is a typical Italian cheese which has been introduced in the thriving Brazilian market in the last 10 y, with good acceptance by its consumers. Lactic acid bacteria (LAB) play an important role in the technological and sensory quality of mozzarella cheese. In this study, the aim was to evaluate the diversity of the autochthones viable LAB isolated from water buffalo mozzarella cheese under storage. Samples were collected in 3 independent trials in a dairy industry located in the southeast region of Brazil, on the 28th day of storage, at 4 ºC. The LAB were characterized by Gram staining, catalase test, capacity to assimilate citrate, and production of CO2 from glucose. The diversity of LAB was evaluated by RAPD-PCR (randomly amplified polymorphic DNA-polymerase chain reaction), 16S rRNA gene sequencing, and by Vitek 2 system. Twenty LAB strains were isolated and clustered into 12 different clusters, and identified as Streptococcus thermophilus, Enterococcus faecium, Enterococcus durans, Leuconostoc mesenteroides subsp. mesenteroides, Lactobacillus fermentum, Lactobacillus casei, Lactobacillus delbrueckii subsp. bulgaricus, and Lactobacillus helveticus. Enterococcus species were dominant and citrate-positive. Only the strains of L. mesenteroides subsp. mesenteroides and L. fermentum produced CO2 from glucose and were citrate-positive, while L. casei was only citrate positive. This is the first report which elucidates the LAB diversity involved in Brazilian water buffalo mozzarella cheese. Furthermore, the results show that despite the absence of natural whey cultures as starters in production, the LAB species identified are the ones typically found in mozzarella cheese. PMID:25597646

  18. Analysis of Plasmodium falciparum diversity in natural infections by deep sequencing

    PubMed Central

    Manske, Magnus; Miotto, Olivo; Campino, Susana; Auburn, Sarah; Almagro-Garcia, Jacob; Maslen, Gareth; O’Brien, Jack; Djimde, Abdoulaye; Doumbo, Ogobara; Zongo, Issaka; Ouedraogo, Jean-Bosco; Michon, Pascal; Mueller, Ivo; Siba, Peter; Nzila, Alexis; Borrmann, Steffen; Kiara, Steven M.; Marsh, Kevin; Jiang, Hongying; Su, Xin-Zhuan; Amaratunga, Chanaki; Fairhurst, Rick; Socheat, Duong; Nosten, Francois; Imwong, Mallika; White, Nicholas J.; Sanders, Mandy; Anastasi, Elisa; Alcock, Dan; Drury, Eleanor; Oyola, Samuel; Quail, Michael A.; Turner, Daniel J.; Rubio, Valentin Ruano; Jyothi, Dushyanth; Amenga-Etego, Lucas; Hubbart, Christina; Jeffreys, Anna; Rowlands, Kate; Sutherland, Colin; Roper, Cally; Mangano, Valentina; Modiano, David; Tan, John C.; Ferdig, Michael T.; Amambua-Ngwa, Alfred; Conway, David J.; Takala-Harrison, Shannon; Plowe, Christopher V.; Rayner, Julian C.; Rockett, Kirk A.; Clark, Taane G.; Newbold, Chris I.; Berriman, Matthew; MacInnis, Bronwyn; Kwiatkowski, Dominic P.

    2013-01-01

    Malaria elimination strategies require surveillance of the parasite population for genetic changes that demand a public health response, such as new forms of drug resistance. 1,2 Here we describe methods for large-scale analysis of genetic variation in Plasmodium falciparum by deep sequencing of parasite DNA obtained from the blood of patients with malaria, either directly or after short term culture. Analysis of 86,158 exonic SNPs that passed genotyping quality control in 227 samples from Africa, Asia and Oceania provides genome-wide estimates of allele frequency distribution, population structure and linkage disequilibrium. By comparing the genetic diversity of individual infections with that of the local parasite population, we derive a metric of within-host diversity that is related to the level of inbreeding in the population. An open-access web application has been established for exploration of regional differences in allele frequency and of highly differentiated loci in the P. falciparum genome. PMID:22722859

  19. Association of Novel and Highly Diverse Acid-Tolerant Denitrifiers with N2O Fluxes of an Acidic Fen▿ †

    PubMed Central

    Palmer, Katharina; Drake, Harold L.; Horn, Marcus A.

    2010-01-01

    Wetlands are sources of denitrification-derived nitrous oxide (N2O). Thus, the denitrifier community of an N2O-emitting fen (pH 4.7 to 5.2) was investigated. N2O was produced and consumed to subatmospheric concentrations in unsupplemented anoxic soil microcosms. Total cell counts and most probable numbers of denitrifiers approximated 1011 cells·gDW−1 (where DW is dry weight) and 108 cells·gDW−1, respectively, in both 0- to 10-cm and 30- to 40-cm depths. Despite this uniformity, depth-related maximum reaction rate (vmax) values for denitrification in anoxic microcosms ranged from 1 to 24 and −19 to −105 nmol N2O h−1· gDW−1, with maximal values occurring in the upper soil layers. Denitrification was enhanced by substrates that might be formed via fermentation in anoxic microzones of soil. N2O approximated 40% of total nitrogenous gases produced at in situ pH, which was likewise the optimal pH for denitrification. Gene libraries of narG and nosZ (encoding nitrate reductase and nitrous oxide reductase, respectively) from fen soil DNA yielded 15 and 18 species-level operational taxonomic units, respectively, many of which displayed phylogenetic novelty and were not closely related to cultured organisms. Although statistical analyses of narG and nosZ sequences indicated that the upper 20 cm of soil contained the highest denitrifier diversity and species richness, terminal restriction fragment length polymorphism analyses of narG and nosZ revealed only minor differences in denitrifier community composition from a soil depth of 0 to 40 cm. The collective data indicate that the regional fen harbors novel, highly diverse, acid-tolerant denitrifier communities capable of complete denitrification and consumption of atmospheric N2O at in situ pH. PMID:20023077

  20. Structural gene and complete amino acid sequence of Pseudomonas aeruginosa IFO 3455 elastase.

    PubMed Central

    Fukushima, J; Yamamoto, S; Morihara, K; Atsumi, Y; Takeuchi, H; Kawamoto, S; Okuda, K

    1989-01-01

    The DNA encoding the elastase of Pseudomonas aeruginosa IFO 3455 was cloned, and its complete nucleotide sequence was determined. When the cloned gene was ligated to pUC18, the Escherichia coli expression vector, bacteria carrying the gene exhibited high levels of both elastase activity and elastase antigens. The amino acid sequence, deduced from the nucleotide sequence, revealed that the mature elastase consisted of 301 amino acids with a relative molecular mass of 32,926 daltons. The amino acid composition predicted from the DNA sequence was quite similar to the chemically determined composition of purified elastase reported previously. We also observed nucleotide sequence encoding a signal peptide and "pro" sequence consisting of 197 amino acids upstream from the mature elastase protein gene. The amino acid sequence analysis revealed that both the N-terminal sequence of the purified elastase and the N-terminal side sequences of the C-terminal tryptic peptide as well as the internal lysyl peptide fragment were completely identical to the deduced amino acid sequences. The pattern of identity of amino acid sequences was quite evident in the regions that include structurally and functionally important residues of Bacillus subtilis thermolysin. PMID:2493453

  1. Multiple Amino Acid Sequence Alignment Nitrogenase Component 1: Insights into Phylogenetics and Structure-Function Relationships

    PubMed Central

    Howard, James B.; Kechris, Katerina J.; Rees, Douglas C.; Glazer, Alexander N.

    2013-01-01

    Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as “core” for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf) yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification provides the bases

  2. Genetic diversity and molecular evolution of arabis mosaic virus based on the CP gene sequence.

    PubMed

    Gao, Fangluan; Lin, Wuzhen; Shen, Jianguo; Liao, Furong

    2016-04-01

    Arabis mosaic virus (ArMV) is a virus with a wide host range. In this study, the genetic diversity of ArMV and the molecular mechanisms underlying its evolution were investigated using the coat protein (CP) sequence. Of the 33 ArMV isolates studied, three were found to be recombinants. The other 30 recombination-free ArMV isolates could be separated into two major lineages with a significant F ST value (0.384) and tended to cluster according to their geographical origin. Different evolutionary constraints were detected for the two linages, pointing to a role of natural selection in the differentiation of ArMV. PMID:26758729

  3. Within-Host Nucleotide Diversity of Virus Populations: Insights from Next-Generation Sequencing

    PubMed Central

    Nelson, Chase W.; Hughes, Austin L.

    2014-01-01

    Next-generation sequencing (NGS) technology offers new opportunities for understanding the evolution and dynamics of viral populations within individual hosts over the course of infection. We review simple methods for estimating synonymous and nonsynonymous nucleotide diversity in viral genes from NGS data without the need for inferring linkage. We discuss the potential usefulness of these data for addressing questions of both practical and theoretical interest, including fundamental questions regarding the effective population sizes of within-host viral populations and the modes of natural selection acting on them. PMID:25481279

  4. Heme binds to a short sequence that serves a regulatory function in diverse proteins.

    PubMed Central

    Zhang, L; Guarente, L

    1995-01-01

    Heme is a prosthetic group for numerous enzymes, cytochromes and globins, and it binds tightly, sometimes covalently, to these proteins. Interestingly, heme also potentiates binding of the yeast transcriptional activator HAP1 to DNA and inhibits mitochondrial import of the mammalian delta-aminolevulinate synthase (ALAS) and the catalytic activity of the reticulocyte kinase, HRI. All three of these proteins contain a short sequence, the heme regulatory motif (HRM), that occurs six times adjacent to the HAP1 DNA binding domain, twice in the leader targeting sequence of ALAS and twice near the catalytic domain of the HRI kinase. Here we show that a 10 amino acid peptide containing the HRM consensus binds to heme in the micromolar range, and shifts the heme absorption spectrum to a longer wavelength, a direction opposite to the change caused by cytochromes or globins. Further, we show that a single HRM regulates the acidic activation domains of HAP1 and GAL4 independently of regulation of DNA binding of the transcription factors. These findings thus establish a novel heme binding sequence which is structurally distinct from sequences in globins or cytochromes and which has a regulatory function. Images PMID:7835342

  5. Genetic diversity of Moringa peregrina species in Saudi Arabia with ITS sequences.

    PubMed

    Alaklabi, Abdullah

    2015-03-01

    The genus Moringa was the family of Moringaceae and Moringa oleifera and Moringa peregrina are the most famous species of Moringa. M. peregrina is widely grown in Saudi Arabia, Iran and India. Therefore, based on these reports, this study aimed to investigate the first systematic attempt to regulate the genetic diversity of the species M. peregrina in Saudi Arabian samples collected from several geographic locations using internal transcribed sequences. Genomic DNA was separated by CTAB extraction method and PCR was performed. Later on, DNA sequencing was performed for PCR products with ITS. In conclusion, the present study affords the first report on genetic stability of M. peregrina using ITS analysis in Saudi Arabia. Further studies are suggested in order to study in different regions. PMID:25737651

  6. Photobiont diversity in lichens from metal-rich substrata based on ITS rDNA sequences.

    PubMed

    Backor, Martin; Peksa, Ondrej; Skaloud, Pavel; Backorová, Miriam

    2010-05-01

    The photobiont is considered as the more sensitive partner of lichen symbiosis in metal pollution. For this reason the presence of a metal tolerant photobiont in lichens may be a key factor of ecological success of lichens growing on metal polluted substrata. The photobiont inventory was examined for terricolous lichen community growing in Cu mine-spoil heaps derived by historical mining. Sequences of internal transcribed spacer (ITS) were phylogenetically analyzed using maximum likelihood analyses. A total of 50 ITS algal sequences were obtained from 22 selected lichen taxa collected at three Cu mine-spoil heaps and two control localities. Algae associated with Cladonia and Stereocaulon were identified as members of several Asterochloris lineages, photobionts of cetrarioid lichens clustered with Trebouxia hypogymniae ined. We did not find close relationship between heavy metal content (in localities as well as lichen thalli) and photobiont diversity. Presence of multiple algal genotypes in single lichen thallus has been confirmed. PMID:20031214

  7. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Foley, B.; Korber, B.; Mellors, J.W.; Jeang, K.T.; Wain-Hobson, S.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nuclear Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  8. Targeted high-throughput growth hormone 1 gene sequencing reveals high within-breed genetic diversity in South African goats.

    PubMed

    Ncube, K T; Mdladla, K; Dzomba, E F; Muchadeyi, F C

    2016-06-01

    This study assessed the genetic diversity in the growth hormone 1 gene (GH1) within and between South African goat breeds. Polymerase chain reaction-targeted gene amplification together with Illumina MiSeq next-generation sequencing (NGS) was used to generate the full length (2.54 kb) of the growth hormone 1 gene and screen for SNPs in the South African Boer (SAB) (n = 17), Tankwa (n = 15) and South African village (n = 35) goat populations. A range of 27-58 SNPs per population were observed. Mutations resulting in amino acid changes were observed at exons 2 and 5. Higher within-breed diversity of 97.37% was observed within the population category consisting of SA village ecotypes and the Tankwa goats. Highest pairwise FST values ranging from 0.148 to 0.356 were observed between the SAB and both the South African village and Tankwa feral goat populations. Phylogenetic analysis indicated nine genetic clusters, which reflected close relationships between the South African populations and the other international breeds with the exception of the Italian Sarda breeds. Results imply greater potential for within-population selection programs, particularly with SA village goats. PMID:26919178

  9. Allelic Diversity and Population Structure in Oenococcus oeni as Determined from Sequence Analysis of Housekeeping Genes

    PubMed Central

    de las Rivas, Blanca; Marcobal, Ángela; Muñoz, Rosario

    2004-01-01

    Oenococcus oeni is the organism of choice for promoting malolactic fermentation in wine. The population biology of O. oeni is poorly understood and remains unclear. For a better understanding of the mode of genetic variation within this species, we investigated by using multilocus sequence typing (MLST) with the gyrB, pgm, ddl, recP, and mleA genes the genetic diversity and genetic relationships among 18 O. oeni strains isolated in various years from wines of the United States, France, Germany, Spain, and Italy. These strains have also been characterized by ribotyping and restriction fragment length polymorphism (RFLP) analysis of the PCR-amplified 16S-23S rRNA gene intergenic spacer region (ISR). Ribotyping grouped the strains into two groups; however, the RFLP analysis of the ISRs showed no differences in the strains analyzed. In contrast, MLST in oenococci had a good discriminatory ability, and we have found a higher genetic diversity than indicated by ribotyping analysis. All sequence types were represented by a single strain, and all the strains could be distinguished from each other because they had unique combinations of alleles. Strains assumed to be identical showed the same sequence type. Phylogenetic analyses indicated a panmictic population structure in O. oeni. Sequences were analyzed for evidence of recombination by split decomposition analysis and analysis of clustered polymorphisms. All results indicated that recombination plays a major role in creating the genetic heterogeneity of O. oeni. A low standardized index of association value indicated that the O. oeni genes analyzed are close to linkage equilibrium. This study constitutes the first step in the development of an MLST method for O. oeni and the first example of the application of MLST to a nonpathogenic food production bacteria. PMID:15574919

  10. Shotgun metagenomic sequencing based microbial diversity assessment of Lasundra hot spring, India.

    PubMed

    Mangrola, Amit V; Dudhagara, Pravin; Koringa, Prakash; Joshi, C G; Patel, Rajesh K

    2015-06-01

    This is the first report on the metagenomic approach for unveiling the microbial diversity of Lasundra hot spring, Gujarat State, India. High-throughput sequencing of community DNA was performed on an Ion Torrent PGM platform. Metagenome consisted of 606,867 sequences represent 98,567,305 bps size with an average length of 162 bps and 46% G + C content. Metagenome sequence information is available at EBI under EBI Metagenomic database with accession no. ERP009313. MG-RAST assisted community analysis revealed that 99.21% sequences were bacterial origin, 0.43% was fit to eukaryotes and 0.11% belongs to archaea. A total of 29 bacterial, 20 eukaryotic and 4 archaeal phyla were detected. Abundant genera were Bacillus (86.7%), Geobacillus (2.4%), Paenibacillus (1.0%), Clostridium (0.7%) and Listeria (0.5%), that represent 91.52% in metagenome. In functional analysis, Cluster of Orthologous Group (COG) based annotation revealed that 45.4% was metabolism connected and 19.6% falls in poorly characterized group. Subsystem based annotation approach suggests that the 14.0% was carbohydrates, 7.0% was protein metabolism and 3.0% genes for various stress responses together with the versatile presence of commercially useful traits. PMID:26484181

  11. Assessing Diversity of DNA Structure-Related Sequence Features in Prokaryotic Genomes

    PubMed Central

    Huang, Yongjie; Mrázek, Jan

    2014-01-01

    Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877

  12. Shotgun metagenomic sequencing based microbial diversity assessment of Lasundra hot spring, India

    PubMed Central

    Mangrola, Amit V.; Dudhagara, Pravin; Koringa, Prakash; Joshi, C.G.; Patel, Rajesh K.

    2015-01-01

    This is the first report on the metagenomic approach for unveiling the microbial diversity of Lasundra hot spring, Gujarat State, India. High-throughput sequencing of community DNA was performed on an Ion Torrent PGM platform. Metagenome consisted of 606,867 sequences represent 98,567,305 bps size with an average length of 162 bps and 46% G + C content. Metagenome sequence information is available at EBI under EBI Metagenomic database with accession no. ERP009313. MG-RAST assisted community analysis revealed that 99.21% sequences were bacterial origin, 0.43% was fit to eukaryotes and 0.11% belongs to archaea. A total of 29 bacterial, 20 eukaryotic and 4 archaeal phyla were detected. Abundant genera were Bacillus (86.7%), Geobacillus (2.4%), Paenibacillus (1.0%), Clostridium (0.7%) and Listeria (0.5%), that represent 91.52% in metagenome. In functional analysis, Cluster of Orthologous Group (COG) based annotation revealed that 45.4% was metabolism connected and 19.6% falls in poorly characterized group. Subsystem based annotation approach suggests that the 14.0% was carbohydrates, 7.0% was protein metabolism and 3.0% genes for various stress responses together with the versatile presence of commercially useful traits. PMID:26484181

  13. Assessing diversity of DNA structure-related sequence features in prokaryotic genomes.

    PubMed

    Huang, Yongjie; Mrázek, Jan

    2014-06-01

    Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches. PMID:24408877

  14. Intraspecific diversity within Diaporthe helianthi: evidence from rDNA intergenic spacer (IGS) sequence analysis.

    PubMed

    Pecchia, Susanna; Mercatelli, Elisabetta; Vannacci, Giovanni

    2004-04-01

    Diaporthe helianthi is the causal agent of sunflower stem canker, a serious pathogen of sunflower in Europe but recorded sporadically in Italy. The genetic diversity of D. helianthi isolates from different geographic origins (Argentina, France, Italy, Yugoslavia, Romania) was investigated using IGS sequences. A 400 bp fragment of the portion of the IGS region flanking the 5' end of the 18S gene was amplified from each isolate. The aligned nucleotide sequences showed intraspecific sequence homology from 99-100% among French/Yugoslavian isolates to 95-100% among Italian isolates. French/Yugoslavian isolates shared 90-92% sequence homology with Italian isolates. The phylogenetic tree obtained from the aligned data revealed three separate groups. Group 1 included all isolates from France and former Yugoslavia and one isolate from Argentina; Group 2 included all Italian isolates and one isolate from Argentina. The most distantly related isolate was that from Romania (Group 3). The average genetic distances among isolates within Group 1 and within Group 2 were 0.22 and 3.29 respectively. The analysis showed that all isolates originating from countries where severe outbreaks of the disease are reported annually (France and former Yugoslavia) form a well defined taxon characterized by relatively low variability. This group is distinct from the group formed by isolates originating from Italy, whose variability is relatively much higher. Results obtained revealed a marked differentiation among pathogen isolates, and members of Group 1 seem not yet to have spread into Italian sunflower-growing areas. PMID:15180160

  15. Analysis of diversity of chromophytic phytoplankton in a mangrove ecosystem using rbcL gene sequencing.

    PubMed

    Samanta, Brajogopal; Bhadury, Punyasloke

    2014-04-01

    Phytoplankton forms the basis of primary production in mangrove environments. The phylogeny and diversity based on the amplification and sequencing of rbcL, the large subunit encoding the key enzyme ribulose-1, 5-bisphosphate carboxylase/oxygenase was investigated for improved understanding of the community structure and temporal trends of chromophytic eukaryotic phytoplankton assemblages in Sundarbans, the world's largest continuous mangrove. Diatoms (Bacillariophyceae) were by far the most frequently detected group in clone libraries (485 out of 525 clones), consistent with their importance as a major bloom-forming group. Other major chromophytic algal groups including Cryptophyceae, Haptophyceae, Pelagophyceae, Eustigmatophyceae, and Raphidophyceae which are important component of the assemblages were detected for the first time from Sundarbans based on rbcL approach. Many of the sequences from Sundarbans rbcL clone libraries showed identity with key bloom forming diatom genera namely Thalassiosira, Skeletonema and Nitzschia. Similarly, several rbcL sequences which were diatom-like were also detected highlighting the need to explore diatom communities from the study area. Some of the rbcL sequences detected from Sundarbans were ubiquitous in distribution showing 100% identities with uncultured rbcL sequences targeted previously from the Gulf of Mexico and California upwelling system that are geographically separated from study area. Novel rbcL lineages were also detected highlighting the need to culture and sequence phytoplankton from the ecoregion. Principal component analysis revealed that nitrate is an important variable that is associated with observed variation in phytoplankton assemblages (operational taxonomic units). This study applied molecular tools to highlight the ecological significance of diatoms, in addition to other chromophytic algal groups in Sundarbans. PMID:26988190

  16. Natural vs. random protein sequences: Discovering combinatorics properties on amino acid words.

    PubMed

    Santoni, Daniele; Felici, Giovanni; Vergni, Davide

    2016-02-21

    Casual mutations and natural selection have driven the evolution of protein amino acid sequences that we observe at present in nature. The question about which is the dominant force of proteins evolution is still lacking of an unambiguous answer. Casual mutations tend to randomize protein sequences while, in order to have the correct functionality, one expects that selection mechanisms impose rigid constraints on amino acid sequences. Moreover, one also has to consider that the space of all possible amino acid sequences is so astonishingly large that it could be reasonable to have a well tuned amino acid sequence indistinguishable from a random one. In order to study the possibility to discriminate between random and natural amino acid sequences, we introduce different measures of association between pairs of amino acids in a sequence, and apply them to a dataset of 1047 natural protein sequences and 10,470 random sequences, carefully generated in order to preserve the relative length and amino acid distribution of the natural proteins. We analyze the multidimensional measures with machine learning techniques and show that, to a reasonable extent, natural protein sequences can be differentiated from random ones. PMID:26656109

  17. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza

    PubMed Central

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  18. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza.

    PubMed

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  19. Phylogenetic analysis of sequences from diverse bacteria with homology to the Escherichia coli rho gene.

    PubMed Central

    Opperman, T; Richardson, J P

    1994-01-01

    Genes from Pseudomonas fluorescens, Chromatium vinosum, Micrococcus luteus, Deinococcus radiodurans, and Thermotoga maritima with homology to the Escherichia coli rho gene were cloned and sequenced, and their sequences were compared with other available sequences. The species for all of the compared sequences are members of five bacterial phyla, including Thermotogales, the most deeply diverged phylum. This suggests that a rho-like gene is ubiquitous in the Bacteria and was present in their common ancestor. The comparative analysis revealed that the Rho homologs are highly conserved, exhibiting a minimum identity of 50% of their amino acid residues in pairwise comparisons. The ATP-binding domain had a particularly high degree of conservation, consisting of some blocks with sequences of residues that are very similar to segments of the alpha and beta subunits of F1-ATPase and of other blocks with sequences that are unique to Rho. The RNA-binding domain is more diverged than the ATP-binding domain. However, one of its most highly conserved segments includes a RNP1-like sequence, which is known to be involved in RNA binding. Overall, the degree of similarity is lowest in the first 50 residues (the first half of the RNA-binding domain), in the putative connector region between the RNA-binding and the ATP-binding domains, and in the last 50 residues of the polypeptide. Since functionally defective mutants for E. coli Rho exist in all three of these segments, they represent important parts of Rho that have undergone adaptive evolution. PMID:8051015

  20. Extracellular DNA amplicon sequencing reveals high levels of benthic eukaryotic diversity in the central Red Sea.

    PubMed

    Pearman, John K; Irigoien, Xabier; Carvalho, Susana

    2016-04-01

    The present study aims to characterize the benthic eukaryotic biodiversity patterns at a coarse taxonomic level in three areas of the central Red Sea (a lagoon, an offshore area in Thuwal and a shallow coastal area near Jeddah) based on extracellular DNA. High-throughput amplicon sequencing targeting the V9 region of the 18S rRNA gene was undertaken for 32 sediment samples. High levels of alpha-diversity were detected with 16,089 operational taxonomic units (OTUs) being identified. The majority of the OTUs were assigned to Metazoa (29.2%), Alveolata (22.4%) and Stramenopiles (17.8%). Stramenopiles (Diatomea) and Alveolata (Ciliophora) were frequent in a lagoon and in shallower coastal stations, whereas metazoans (Arthropoda: Maxillopoda) were dominant in deeper offshore stations. Only 24.6% of total OTUs were shared among all areas. Beta-diversity was generally lower between the lagoon and Jeddah (nearshore) than between either of those and the offshore area, suggesting a nearshore-offshore biodiversity gradient. The current approach allowed for a broad-range of benthic eukaryotic biodiversity to be analysed with significantly less labour than would be required by other traditional taxonomic approaches. Our findings suggest that next generation sequencing techniques have the potential to provide a fast and standardised screening of benthic biodiversity at large spatial and temporal scales. PMID:26525270

  1. High-throughput sequencing of microbial diversity in implant-associated infection.

    PubMed

    Wang, Xin; Hu, Xiaoyan; Deng, Kan; Cheng, Xigao; Wei, Junchao; Jiang, Meixiu; Wang, Xiaolei; Chen, Tingtao

    2016-09-01

    Few molecular studies have shown that the number of bacterial species in implant-associated infection may have been underestimated. To determine the actual microbial diversity in implant-associated infection, a high-throughput sequencing method was adopted to sequence the DNAs extracted from the tissues of infected and uninfected patients. Principal component analysis (PCA) and β diversity showed an obvious divergence of infected and uninfected groups, and that the overgrowth of Proteobacteria (80.87%), Firmicutes (13.41%) in the positive deep infection group (P.d, via biopsy) and Proteobacteria (91.68%) in the positive surface infection group (P.s, via swabs) might be the causative factors in implant-associated infection. Moreover, Venn results indicated that a mean of 330 common operational taxonomic units (OTUs) was obtained in all groups, of which 113, 109, 45, 20, 13 and 12 OTUs belonging to Proteobacteria, Actinobacteria, Acidobacteria, Planctomycetes, Gemmatimonadetes and Chloroflexi were identified. In conclusion, many traditional "pathogenic bacteria" were identified as the common bacteria in operation sites, and the disruption of their complex interaction caused infection; therefore, further work is need to illustrate the aetiology of implant-associated infection using in-depth systems-level analyses. PMID:27268150

  2. Abundance and genetic diversity of nifH gene sequences in anthropogenically affected Brazilian mangrove sediments.

    PubMed

    Dias, Armando Cavalcante Franco; Pereira e Silva, Michele de Cassia; Cotta, Simone Raposo; Dini-Andreote, Francisco; Soares, Fábio Lino; Salles, Joana Falcão; Azevedo, João Lúcio; van Elsas, Jan Dirk; Andreote, Fernando Dini

    2012-11-01

    Although mangroves represent ecosystems of global importance, the genetic diversity and abundance of functional genes that are key to their functioning scarcely have been explored. Here, we present a survey based on the nifH gene across transects of sediments of two mangrove systems located along the coast line of São Paulo state (Brazil) which differed by degree of disturbance, i.e., an oil-spill-affected and an unaffected mangrove. The diazotrophic communities were assessed by denaturing gradient gel electrophoresis (DGGE), quantitative PCR (qPCR), and clone libraries. The nifH gene abundance was similar across the two mangrove sediment systems, as evidenced by qPCR. However, the nifH-based PCR-DGGE profiles revealed clear differences between the mangroves. Moreover, shifts in the nifH gene diversities were noted along the land-sea transect within the previously oiled mangrove. The nifH gene diversity depicted the presence of nitrogen-fixing bacteria affiliated with a wide range of taxa, encompassing members of the Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Firmicutes, and also a group of anaerobic sulfate-reducing bacteria. We also detected a unique mangrove-specific cluster of sequences denoted Mgv-nifH. Our results indicate that nitrogen-fixing bacterial guilds can be partially endemic to mangroves, and these communities are modulated by oil contamination, which has important implications for conservation strategies. PMID:22941088

  3. Abundance and Genetic Diversity of nifH Gene Sequences in Anthropogenically Affected Brazilian Mangrove Sediments

    PubMed Central

    Dias, Armando Cavalcante Franco; Pereira e Silva, Michele de Cassia; Cotta, Simone Raposo; Dini-Andreote, Francisco; Soares, Fábio Lino; Salles, Joana Falcão; Azevedo, João Lúcio; van Elsas, Jan Dirk

    2012-01-01

    Although mangroves represent ecosystems of global importance, the genetic diversity and abundance of functional genes that are key to their functioning scarcely have been explored. Here, we present a survey based on the nifH gene across transects of sediments of two mangrove systems located along the coast line of São Paulo state (Brazil) which differed by degree of disturbance, i.e., an oil-spill-affected and an unaffected mangrove. The diazotrophic communities were assessed by denaturing gradient gel electrophoresis (DGGE), quantitative PCR (qPCR), and clone libraries. The nifH gene abundance was similar across the two mangrove sediment systems, as evidenced by qPCR. However, the nifH-based PCR-DGGE profiles revealed clear differences between the mangroves. Moreover, shifts in the nifH gene diversities were noted along the land-sea transect within the previously oiled mangrove. The nifH gene diversity depicted the presence of nitrogen-fixing bacteria affiliated with a wide range of taxa, encompassing members of the Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Firmicutes, and also a group of anaerobic sulfate-reducing bacteria. We also detected a unique mangrove-specific cluster of sequences denoted Mgv-nifH. Our results indicate that nitrogen-fixing bacterial guilds can be partially endemic to mangroves, and these communities are modulated by oil contamination, which has important implications for conservation strategies. PMID:22941088

  4. Utility of Metagenomic Next-Generation Sequencing for Characterization of HIV and Human Pegivirus Diversity

    PubMed Central

    Naccache, Samia N.; Kabre, Beniwende; Federman, Scot; Mbanya, Dora; Kaptué, Lazare; Chiu, Charles Y.; Brennan, Catherine A.; Hackett, John

    2015-01-01

    Given the dynamic changes in HIV-1 complexity and diversity, next-generation sequencing (NGS) has the potential to revolutionize strategies for effective HIV global surveillance. In this study, we explore the utility of metagenomic NGS to characterize divergent strains of HIV-1 and to simultaneously screen for other co-infecting viruses. Thirty-five HIV-1-infected Cameroonian blood donor specimens with viral loads of >4.4 log10 copies/ml were selected to include a diverse representation of group M strains. Random-primed NGS libraries, prepared from plasma specimens, resulted in greater than 90% genome coverage for 88% of specimens. Correct subtype designations based on NGS were concordant with sub-region PCR data in 31 of 35 (89%) cases. Complete genomes were assembled for 25 strains, including circulating recombinant forms with relatively limited data available (7 CRF11_cpx, 2 CRF13_cpx, 1 CRF18_cpx, and 1 CRF37_cpx), as well as 9 unique recombinant forms. HPgV (formerly designated GBV-C) co-infection was detected in 9 of 35 (25%) specimens, of which eight specimens yielded complete genomes. The recovered HPgV genomes formed a diverse cluster with genotype 1 sequences previously reported from Ghana, Uganda, and Japan. The extensive genome coverage obtained by NGS improved accuracy and confidence in phylogenetic classification of the HIV-1 strains present in the study population relative to conventional sub-region PCR. In addition, these data demonstrate the potential for metagenomic analysis to be used for routine characterization of HIV-1 and identification of other viral co-infections. PMID:26599538

  5. In vivo sequence diversity of the protease of human immunodeficiency virus type 1: presence of protease inhibitor-resistant variants in untreated subjects.

    PubMed Central

    Lech, W J; Wang, G; Yang, Y L; Chee, Y; Dorman, K; McCrae, D; Lazzeroni, L C; Erickson, J W; Sinsheimer, J S; Kaplan, A H

    1996-01-01

    We have evaluated the sequence diversity of the protease human immunodeficiency virus type 1 in vivo. Our analysis of 246 protease coding domain sequences obtained from 12 subjects indicates that amino acid substitutions predicted to give rise to protease inhibitor resistance may be present in patients who have not received protease inhibitors. In addition, we demonstrated that amino acid residues directly involved in enzyme-substrate interactions may be varied in infected individuals. Several of these substitutions occurred in combination either more or less frequently than would be expected if their appearance was independent, suggesting that one substitution may compensate for the effects of another. Taken together, our analysis indicates that the human immunodeficiency virus type 1 protease has flexibility sufficient to vary critical subsites in vivo, thereby retaining enzyme function and viral pathogenicity. PMID:8627733

  6. Rare recombination events generate sequence diversity among balancer chromosomes in Drosophila melanogaster

    PubMed Central

    Miller, Danny E.; Cook, Kevin R.; Yeganeh Kazemi, Nazanin; Smith, Clarissa B.; Cockrell, Alexandria J.; Hawley, R. Scott; Bergman, Casey M.

    2016-01-01

    Multiply inverted balancer chromosomes that suppress exchange with their homologs are an essential part of the Drosophila melanogaster genetic toolkit. Despite their widespread use, the organization of balancer chromosomes has not been characterized at the molecular level, and the degree of sequence variation among copies of balancer chromosomes is unknown. To map inversion breakpoints and study potential diversity in descendants of a structurally identical balancer chromosome, we sequenced a panel of laboratory stocks containing the most widely used X chromosome balancer, First Multiple 7 (FM7). We mapped the locations of FM7 breakpoints to precise euchromatic coordinates and identified the flanking sequence of breakpoints in heterochromatic regions. Analysis of SNP variation revealed megabase-scale blocks of sequence divergence among currently used FM7 stocks. We present evidence that this divergence arose through rare double-crossover events that replaced a female-sterile allele of the singed gene (snX2) on FM7c with a sequence from balanced chromosomes. We propose that although double-crossover events are rare in individual crosses, many FM7c chromosomes in the Bloomington Drosophila Stock Center have lost snX2 by this mechanism on a historical timescale. Finally, we characterize the original allele of the Bar gene (B1) that is carried on FM7, and validate the hypothesis that the origin and subsequent reversion of the B1 duplication are mediated by unequal exchange. Our results reject a simple nonrecombining, clonal mode for the laboratory evolution of balancer chromosomes and have implications for how balancer chromosomes should be used in the design and interpretation of genetic experiments in Drosophila. PMID:26903656

  7. Expanding the diversity of unnatural cell surface sialic acids

    SciTech Connect

    Luchansky, Sarah J.; Goon, Scarlett; Bertozzi, Carolyn R.

    2003-10-30

    Novel chemical reactivity can be introduced onto cell surfaces through metabolic oligosaccharide engineering. This technique exploits the substrate promiscuity of cellular biosynthetic enzymes to deliver unnatural monosaccharides bearing bioorthogonal functional groups into cellular glycans. For example, derivatives of N-acetylmannosamine (ManNAc) are converted by the cellular biosynthetic machinery into the corresponding sialic acids and subsequently delivered to the cell surface in the form of sialoglycoconjugates. Analogs of N-acetylglucosamine (GlcNAc) and N-acetylgalactosamine (GalNAc) are also metabolized and incorporated into cell surface glycans, likely through the sialic acid and GalNAc salvage pathways, respectively. Furthermore, GlcNAc analogs can be incorporated into nucleocytoplasmic proteins in place of {beta}-O-GlcNAc residues. These pathways have been exploited to integrate unique electrophiles such as ketones and azides into the target glycoconjugate class. These functional groups can be further elaborated in a chemoselective fashion by condensation with hydrazides and by Staudinger ligation, respectively, thereby introducing detectable probes onto the cell. In conclusion, sialic acid derivatives are efficient vehicles for delivery of bulky functional groups to cell surfaces and masking of their hydroxyl groups improves their cellular uptake and utilization. Furthermore, the successful introduction of photoactivatable aryl azides into cell surface glycans opens up new avenues for studying sialic acid-binding proteins and elucidating the role of sialic acid in essential processes such as signaling and cell adhesion.

  8. First insights into the microbial diversity in the omasum and reticulum of bovine using Illumina sequencing.

    PubMed

    Peng, Shuai; Yin, Jigang; Liu, Xiaolei; Jia, Boyin; Chang, Zhiguang; Lu, Huijun; Jiang, Ning; Chen, Qijun

    2015-08-01

    The digestive systems of mammals harbor a complex gut microbiome, comprising bacteria and other microorganisms that confer metabolic and immunological benefits to the host. Ruminants that digest plant-based foods have a four-compartment stomach consisting of the rumen, reticulum, omasum, and abomasum. The microorganisms in the stomach are essential for providing the host with critical nutrients. However, the majority of these microorganisms are unknown species. The microbiome of the stomach is diverse, and the majority of these organisms cannot be cultured. Next-generation sequencing (NGS) combined with bioinformatic analysis tools have allowed the dissection of the composition of the microbiome in samples collected from a specific environment. In this study, for the first time, the bacterial composition in two compartments, the reticulum and the omasum, of bovine were analyzed using a metagenomic approach and compared to the bacterial composition of the rumen. These data will assist in understanding the biology of ruminants and benefit the agricultural industry. The diversity and composition of the bacterial community in samples collected from the rumen, reticulum, and omasum of bovines in the Changchun Region of Northeast China were analyzed by sequencing the V3 region of the 16S rRNA gene using a barcoded Illumina paired-end sequencing technique, and the primary composition of the microbiome in the rumen, reticulum, and omasum of the bovines was determined. These microbiomes contained 17 phyla and 107 genera in all three samples. Five phyla, Bacteroidetes, Firmicutes, Proteobacteria, Spirochaetes, and Lentisphaerae, were the most abundant taxonomic groups. Additionally, the different stomach compartments harbored different compositions of the microorganisms. PMID:25604266

  9. Matrix genes of measles virus and canine distemper virus: cloning, nucleotide sequences, and deduced amino acid sequences.

    PubMed Central

    Bellini, W J; Englund, G; Richardson, C D; Rozenblatt, S; Lazzarini, R A

    1986-01-01

    The nucleotide sequences encoding the matrix (M) proteins of measles virus (MV) and canine distemper virus (CDV) were determined from cDNA clones containing these genes in their entirety. In both cases, single open reading frames specifying basic proteins of 335 amino acid residues were predicted from the nucleotide sequences. Both viral messages were composed of approximately 1,450 nucleotides and contained 400 nucleotides of presumptive noncoding sequences at their respective 3' ends. MV and CDV M-protein-coding regions were 67% homologous at the nucleotide level and 76% homologous at the amino acid level. Only chance homology was observed in the 400-nucleotide trailer sequences. Comparisons of the M protein sequences of MV and CDV with the sequence reported for Sendai virus (B. M. Blumberg, K. Rose, M. G. Simona, L. Roux, C. Giorgi, and D. Kolakofsky, J. Virol. 52:656-663; Y. Hidaka, T. Kanda, K. Iwasaki, A. Nomoto, T. Shioda, and H. Shibuta, Nucleic Acids Res. 12:7965-7973) indicated the greatest homology among these M proteins in the carboxyterminal third of the molecule. Secondary-structure analyses of this shared region indicated a structurally conserved, hydrophobic sequence which possibly interacted with the lipid bilayer. Images PMID:3754588

  10. Diversity of TRIM5α and TRIMCyp sequences in cynomolgus macaques from different geographical origins.

    PubMed

    Berry, Neil J; Marzetta, Flavia; Towers, Greg J; Rose, Nicola J

    2012-04-01

    The TRIM5α restriction factor can protect some species of monkeys, but not humans, from HIV infection. It has also emerged that some monkeys have a cyclophilin A domain retrotransposed into the TRIM5 locus resulting in the expression of a TRIMCyp protein with anti-retroviral activity. A high degree of sequence variation in the primate TRIM5 gene has been reported that varies between populations of rhesus macaques, a widely used non-human primate model of HIV/AIDS, and recently shown to correlate with susceptibility to simian immunodeficiency viruses in this species. Cynomolgus macaques are also used widely in HIV research. A non-indigenous population on Mauritius has highly restricted genetic diversity compared with macaques from Indonesia. The relative allelic diversity of TRIM5α and TRIMCyp within these two sub-populations may impact on the susceptibility of the macaques to simian immunodeficiency virus thereby influencing the outcome of studies using these monkeys. We sought to establish the genetic diversity of these alleles in cynomolgus macaques. We identified seven TRIM5α alleles in Indonesian macaques, three of which are novel, but only three in the Mauritian-origin macaques. Strikingly, 87% of Indonesian, but none of the Mauritian macaques, possessed a retrotransposed Cyp domain. A splice acceptor site single-nucleotide polymorphism that allows formation of a TRIMCyp protein was absent for the TRIM5α alleles found in the Mauritian macaques. The level of allelic diversity reported here is greater than previously proposed for cynomolgus macaque species. PMID:22124667

  11. Determining the Cellular Diversity of Hepatitis C Virus Quasispecies by Single-Cell Viral Sequencing

    PubMed Central

    McLauchlan, John

    2013-01-01

    Single-cell genomics is emerging as an important tool in cellular biology. We describe for the first time a system to investigate RNA virus quasispecies diversity at the cellular level utilizing hepatitis C virus (HCV) replicons. A high-fidelity nested reverse transcription (RT)-PCR assay was developed, and validation using control transcripts of known copy number indicated a detection limit of 3 copies of viral RNA/reaction. This system was used to determine the cellular diversity of subgenomic JFH-1 HCV replicons constitutively expressed in Huh7 cells. Each cell contained a unique quasispecies that was much less diverse than the quasispecies of the bulk cell population from which the single cells were derived, suggesting the occurrence of independent evolution at the cellular level. An assessment of the replicative fitness of the predominant single-cell quasispecies variants indicated a modest reduction in fitness compared to the wild type. Real-time RT-PCR methods capable of determining single-cell viral loads were developed and indicated an average of 113 copies of replicon RNA per cell, correlating with calculated RNA copy numbers in the bulk cell population. This study introduces a single-cell RNA viral-sequencing method with numerous potential applications to explore host-virus interactions during infection. HCV quasispecies diversity varied greatly between cells in vitro, suggesting different within-cell evolutionary pathways. Such divergent trajectories in vivo could have implications for the evolution and establishment of antiviral-resistant variants and host immune escape mutants. PMID:24049174

  12. Single molecule sequencing to track plasmid diversity of hospital-associated carbapenemase-producing Enterobacteriaceae

    PubMed Central

    Conlan, Sean; Thomas, Pamela J.; Deming, Clayton; Park, Morgan; Lau, Anna F.; Dekker, John P.; Snitkin, Evan S.; Clark, Tyson A.; Luong, Khai; Song, Yi; Tsai, Yu-Chih; Boitano, Matthew; Gupta, Jyoti; Brooks, Shelise Y.; Schmidt, Brian; Young, Alice C.; Thomas, James W.; Bouffard, Gerard G.; Blakesley, Robert W.; Mullikin, James C.; Korlach, Jonas; Henderson, David K.; Frank, Karen M.; Palmore, Tara N.; Segre, Julia A.

    2014-01-01

    Public health officials have raised concerns that plasmid transfer between Enterobacteriaceae species may spread resistance to carbapenems, an antibiotic class of last resort, thereby rendering common healthcare-associated infections nearly impossible to treat. We performed comprehensive surveillance and genomic sequencing to identify carbapenem-resistant Enterobacteriaceae in the NIH Clinical Center patient population and hospital environment in order to to articulate the diversity of carbapenemase-encoding plasmids and survey the mobility of and assess the mobility of these plasmids between bacterial species. We isolated a repertoire of carbapenemase-encoding Enterobacteriaceae, including multiple strains of Klebsiella pneumoniae, Klebsiella oxytoca, Escherichia coli, Enterobacter cloacae, Citrobacter freundii, and Pantoea species. Long-read genome sequencing with full end-to-end assembly revealed that these organisms carry the carbapenem-resistance genes on a wide array of plasmids. Klebsiella pneumoniae and Enterobacter cloacae isolated simultaneously from a single patient harbored two different carbapenemase-encoding plasmids, overriding the epidemiological scenario of plasmid transfer between organisms within this patient. We did, however, find evidence supporting horizontal transfer of carbapenemase-encoding plasmids between Klebsiella pneumoniae, Enterobacter cloacae and Citrobacter freundii in the hospital environment. Our comprehensive sequence data, with full plasmid identification, challenges assumptions about horizontal gene transfer events within patients and identified wider possible connections between patients and the hospital environment. In addition, we identified a new carbapenemase-encoding plasmid of potentially high clinical impact carried by Klebsiella pneumoniae, Escherichia coli, Enterobacter cloacae and Pantoea species, from unrelated patients and the hospital environment. PMID:25232178

  13. High-throughput nucleotide sequence analysis of diverse bacterial communities in leachates of decomposing pig carcasses

    PubMed Central

    Yang, Seung Hak; Lim, Joung Soo; Khan, Modabber Ahmed; Kim, Bong Soo; Choi, Dong Yoon; Lee, Eun Young; Ahn, Hee Kwon

    2015-01-01

    The leachate generated by the decomposition of animal carcass has been implicated as an environmental contaminant surrounding the burial site. High-throughput nucleotide sequencing was conducted to investigate the bacterial communities in leachates from the decomposition of pig carcasses. We acquired 51,230 reads from six different samples (1, 2, 3, 4, 6 and 14 week-old carcasses) and found that sequences representing the phylum Firmicutes predominated. The diversity of bacterial 16S rRNA gene sequences in the leachate was the highest at 6 weeks, in contrast to those at 2 and 14 weeks. The relative abundance of Firmicutes was reduced, while the proportion of Bacteroidetes and Proteobacteria increased from 3–6 weeks. The representation of phyla was restored after 14 weeks. However, the community structures between the samples taken at 1–2 and 14 weeks differed at the bacterial classification level. The trend in pH was similar to the changes seen in bacterial communities, indicating that the pH of the leachate could be related to the shift in the microbial community. The results indicate that the composition of bacterial communities in leachates of decomposing pig carcasses shifted continuously during the study period and might be influenced by the burial site. PMID:26500442

  14. Identification and characterization of rhizospheric microbial diversity by 16S ribosomal RNA gene sequencing.

    PubMed

    Naveed, Muhammad; Mubeen, Samavia; Khan, SamiUllah; Ahmed, Iftikhar; Khalid, Nauman; Suleria, Hafiz Ansar Rasul; Bano, Asghari; Mumtaz, Abdul Samad

    2014-01-01

    In the present study, samples of rhizosphere and root nodules were collected from different areas of Pakistan to isolate plant growth promoting rhizobacteria. Identification of bacterial isolates was made by 16S rRNA gene sequence analysis and taxonomical confirmation on EzTaxon Server. The identified bacterial strains were belonged to 5 genera i.e. Ensifer, Bacillus, Pseudomona, Leclercia and Rhizobium. Phylogenetic analysis inferred from 16S rRNA gene sequences showed the evolutionary relationship of bacterial strains with the respective genera. Based on phylogenetic analysis, some candidate novel species were also identified. The bacterial strains were also characterized for morphological, physiological, biochemical tests and glucose dehydrogenase (gdh) gene that involved in the phosphate solublization using cofactor pyrroloquinolone quinone (PQQ). Seven rhizoshperic and 3 root nodulating stains are positive for gdh gene. Furthermore, this study confirms a novel association between microbes and their hosts like field grown crops, leguminous and non-leguminous plants. It was concluded that a diverse group of bacterial population exist in the rhizosphere and root nodules that might be useful in evaluating the mechanisms behind plant microbial interactions and strains QAU-63 and QAU-68 have sequence similarity of 97 and 95% which might be declared as novel after further taxonomic characterization. PMID:25477935

  15. Identification and characterization of rhizospheric microbial diversity by 16S ribosomal RNA gene sequencing

    PubMed Central

    Naveed, Muhammad; Mubeen, Samavia; khan, SamiUllah; Ahmed, Iftikhar; Khalid, Nauman; Suleria, Hafiz Ansar Rasul; Bano, Asghari; Mumtaz, Abdul Samad

    2014-01-01

    In the present study, samples of rhizosphere and root nodules were collected from different areas of Pakistan to isolate plant growth promoting rhizobacteria. Identification of bacterial isolates was made by 16S rRNA gene sequence analysis and taxonomical confirmation on EzTaxon Server. The identified bacterial strains were belonged to 5 genera i.e. Ensifer, Bacillus, Pseudomona, Leclercia and Rhizobium. Phylogenetic analysis inferred from 16S rRNA gene sequences showed the evolutionary relationship of bacterial strains with the respective genera. Based on phylogenetic analysis, some candidate novel species were also identified. The bacterial strains were also characterized for morphological, physiological, biochemical tests and glucose dehydrogenase (gdh) gene that involved in the phosphate solublization using cofactor pyrroloquinolone quinone (PQQ). Seven rhizoshperic and 3 root nodulating stains are positive for gdh gene. Furthermore, this study confirms a novel association between microbes and their hosts like field grown crops, leguminous and non-leguminous plants. It was concluded that a diverse group of bacterial population exist in the rhizosphere and root nodules that might be useful in evaluating the mechanisms behind plant microbial interactions and strains QAU-63 and QAU-68 have sequence similarity of 97 and 95% which might be declared as novel after further taxonomic characterization. PMID:25477935

  16. Diversity of lactic acid bacteria associated with traditional fermented dairy products in Mongolia.

    PubMed

    Yu, J; Wang, W H; Menghe, B L G; Jiri, M T; Wang, H M; Liu, W J; Bao, Q H; Lu, Q; Zhang, J C; Wang, F; Xu, H Y; Sun, T S; Zhang, H P

    2011-07-01

    Spontaneous milk fermentation has a long history in Mongolia, and beneficial microorganisms have been handed down from one generation to the next for use in fermented dairy products. The objective of this study was to investigate the diversity of lactic acid bacteria (LAB) communities in fermented yak, mare, goat, and cow milk products by analyzing 189 samples collected from 13 different regions in Mongolia. The LAB counts in these samples varied from 3.41 to 9.03 log cfu/mL. Fermented yak and mare milks had almost identical mean numbers of LAB, which were significantly higher than those in fermented goat milk but slightly lower than those in fermented cow milk. In total, 668 isolates were obtained from these samples using de Man, Rogosa, and Sharpe agar and M17 agar. Each isolate was considered to be presumptive LAB based on gram-positive and catalase-negative properties, and was identified at the species level by 16S rRNA gene sequencing, multiplex PCR assay, and restriction fragment length polymorphism analysis. All isolates from Mongolian dairy products were accurately identified as Enterococcus faecalis (1 strain), Enterococcus durans (3 strains), Lactobacillus brevis (3 strains), Lactobacillus buchneri (2 strains), Lactobacillus casei (16 strains), Lactobacillus delbrueckii ssp. bulgaricus (142 strains), Lactobacillus diolivorans (17 strains), Lactobacillus fermentum (42 strains), Lactobacillus helveticus (183 strains), Lactobacillus kefiri (6 strains), Lactobacillus plantarum ssp. plantarum (7 strains), Lactococcus lactis ssp. lactis (7 strains), Leuconostoc lactis (22 strains), Leuconostoc mesenteroides (21 strains), Streptococcus thermophilus (195 strains), and Weissella cibaria (1 strain). The predominant LAB were Strep. thermophilus and Lb. helveticus, which were isolated from all sampling sites. The results demonstrate that traditional fermented dairy products from different regions of Mongolia have complex compositions of LAB species. Such diversity of

  17. Dietary supplementation of usnic acid, an antimicrobial compound in lichens, does not affect rumen bacterial diversity or density in reindeer.

    PubMed

    Glad, Trine; Barboza, Perry; Mackie, Roderick I; Wright, André-Denis G; Brusetti, Lorenzo; Mathiesen, Svein D; Sundset, Monica A

    2014-06-01

    Reindeer (Rangifer tarandus tarandus) may include large proportions of lichens in their winter diet. These dietary lichens are rich in phenolic secondary compounds, the most well-known being the antimicrobial usnic acid. Previous studies have shown that reindeer host rumen bacteria resistant to usnic acid and that usnic acid is quickly detoxified in their rumen. In the present study, reindeer (n = 3) were sampled before, during, and after usnic acid supplementation to determine the effect on their rumen microbial ecology. Ad libitum intake of usnic acid averaged up to 278 mg/kg body mass. Population densities of rumen bacteria and methanogenic archaea determined by real-time PCR, ranged from 1.36 × 10(9) to 11.8 × 10(9) and 9.0 × 10(5) to 1.35 × 10(8) cells/g wet weight, respectively, and the two populations did not change significantly during usnic acid supplementation (repeated measures ANOVA) or vary significantly between the rumen liquid and particle fraction (paired t test). Rumen bacterial community structure determined by denaturing gradient gel electrophoresis did not change in response to intake of usnic acid. Firmicutes (38.7 %) and Bacteriodetes (27.4 %) were prevalent among the 16S rRNA gene sequences (n = 62) from the DGGE gels, but representatives of the phyla Verrucomicrobia (14.5 %) and Proteobacteria (1.6 %) were also detected. Rapid detoxification of the usnic acid or resistance to usnic acid may explain why the diversity of the dominant bacterial populations and the bacterial density in the reindeer rumen does not change during usnic acid supplementation. PMID:24509720

  18. Diversity of (dihydro) hydroxycinnamic acid conjugates in Colombian potato tubers.

    PubMed

    Narváez-Cuenca, Carlos-Eduardo; Vincken, Jean-Paul; Zheng, Chaoya; Gruppen, Harry

    2013-08-15

    In potato tuber, caffeic acid (the predominant hydroxycinnamic acid (HCA)), its conjugates (HCAcs; i.e. chlorogenic acid (ChA), crypto-ChA, and neo-ChA), and anthocyanin-linked HCAs have been extensively described in the literature. In contrast, only little information is available on the occurrence of other HCAcs and didydrohydroxycinnamic acid conjugates (DHCAcs). Fifteen Colombian potato cultivars were screened for these less commonly described conjugates by reversed-phase ultrahigh performance liquid chromatography coupled to a diode array detector and a heated electrospray ionisation mass spectrometer. A total of 62 HCAs/HCAcs/DHCAcs were found in extracts from peel and flesh. Among them, only twelve compounds were common to all cultivars in both peel and flesh. The less commonly described compounds accounted for 7.1-20.1% w/w of the total amount of HCAs/HCAcs/DHCAcs in whole tubers, highlighting their contribution to the total phenolic profile of potato tubers. Among all cultivars, the abundance (mg/100 g DW whole tuber) of neo-ChA (0.8-7.4) ranged in similar quantities as the less commonly reported feruloyl octopamine (1.2-5.2), 5-O-feruloyl quinic acid (0.1-7.5), cis-ChA (1.1-2.2), caffeoyl putrescine (0.6-2.5), sinapoyl hexose (0.1-1.8), N(1),N(14)-bis-(dihydrocaffeoyl) spermine (0.2-1.7), N(1),N(10)-bis-(dihydrocaffeoyl) spermidine (1.1-2.6), and N(1),N(5),N(14)-tris-(dihydrocaffeoyl) spermine (trace - 11.1). PMID:23561213

  19. Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2000-01-01

    A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.

  20. Using Whole Genome Analysis to Examine Recombination across Diverse Sequence Types of Staphylococcus aureus

    PubMed Central

    Driebe, Elizabeth M.; Sahl, Jason W.; Roe, Chandler; Bowers, Jolene R.; Schupp, James M.; Gillece, John D.; Kelley, Erin; Price, Lance B.; Pearson, Talima R.; Hepp, Crystal M.; Brzoska, Pius M.; Cummings, Craig A.; Furtado, Manohar R.; Andersen, Paal S.; Stegger, Marc; Engelthaler, David M.; Keim, Paul S.

    2015-01-01

    Staphylococcus aureus is an important clinical pathogen worldwide and understanding this organism's phylogeny and, in particular, the role of recombination, is important both to understand the overall spread of virulent lineages and to characterize outbreaks. To further elucidate the phylogeny of S. aureus, 35 diverse strains were sequenced using whole genome sequencing. In addition, 29 publicly available whole genome sequences were included to create a single nucleotide polymorphism (SNP)-based phylogenetic tree encompassing 11 distinct lineages. All strains of a particular sequence type fell into the same clade with clear groupings of the major clonal complexes of CC8, CC5, CC30, CC45 and CC1. Using a novel analysis method, we plotted the homoplasy density and SNP density across the whole genome and found evidence of recombination throughout the entire chromosome, but when we examined individual clonal lineages we found very little recombination. However, when we analyzed three branches of multiple lineages, we saw intermediate and differing levels of recombination between them. These data demonstrate that in S. aureus, recombination occurs across major lineages that subsequently expand in a clonal manner. Estimated mutation rates for the CC8 and CC5 lineages were different from each other. While the CC8 lineage rate was similar to previous studies, the CC5 lineage was 100-fold greater. Fifty known virulence genes were screened in all genomes in silico to determine their distribution across major clades. Thirty-three genes were present variably across clades, most of which were not constrained by ancestry, indicating horizontal gene transfer or gene loss. PMID:26161978

  1. Using Whole Genome Analysis to Examine Recombination across Diverse Sequence Types of Staphylococcus aureus.

    PubMed

    Driebe, Elizabeth M; Sahl, Jason W; Roe, Chandler; Bowers, Jolene R; Schupp, James M; Gillece, John D; Kelley, Erin; Price, Lance B; Pearson, Talima R; Hepp, Crystal M; Brzoska, Pius M; Cummings, Craig A; Furtado, Manohar R; Andersen, Paal S; Stegger, Marc; Engelthaler, David M; Keim, Paul S

    2015-01-01

    Staphylococcus aureus is an important clinical pathogen worldwide and understanding this organism's phylogeny and, in particular, the role of recombination, is important both to understand the overall spread of virulent lineages and to characterize outbreaks. To further elucidate the phylogeny of S. aureus, 35 diverse strains were sequenced using whole genome sequencing. In addition, 29 publicly available whole genome sequences were included to create a single nucleotide polymorphism (SNP)-based phylogenetic tree encompassing 11 distinct lineages. All strains of a particular sequence type fell into the same clade with clear groupings of the major clonal complexes of CC8, CC5, CC30, CC45 and CC1. Using a novel analysis method, we plotted the homoplasy density and SNP density across the whole genome and found evidence of recombination throughout the entire chromosome, but when we examined individual clonal lineages we found very little recombination. However, when we analyzed three branches of multiple lineages, we saw intermediate and differing levels of recombination between them. These data demonstrate that in S. aureus, recombination occurs across major lineages that subsequently expand in a clonal manner. Estimated mutation rates for the CC8 and CC5 lineages were different from each other. While the CC8 lineage rate was similar to previous studies, the CC5 lineage was 100-fold greater. Fifty known virulence genes were screened in all genomes in silico to determine their distribution across major clades. Thirty-three genes were present variably across clades, most of which were not constrained by ancestry, indicating horizontal gene transfer or gene loss. PMID:26161978

  2. Genetic diversity of Staphylococcus equorum isolates from Saeu-jeotgal evaluated by multilocus sequence typing.

    PubMed

    Jeong, Do-Won; Kim, Hye-Rim; Lee, Jong-Hoon

    2014-10-01

    Staphylococcus equorum, the predominant bacterial species detected in Saeu-jeotgal, a Korean high-salt fermented seafood, is a candidate starter bacterium for Saeu-jeotgal fermentation. A multilocus sequence typing (MLST) scheme was developed to evaluate the genetic diversity and background of S. equorum strains isolated from Saeu-jeotgal. A total of 135 strains, including 117 isolates from Saeu-jeotgal, and others from Myeolchi-jeotgal, sausage, cheese and horse skin, were subjected to MLST, and the internal fragments of seven housekeeping genes, aroE, dnaJ, glpF, gmk, hsp60, mutS, and pta, were compared. This MLST scheme produced 45 sequence types (STs) and the eBURST algorithm clustered the STs into nine clonal groups and seven singletons. Clonal group 1, the major group, consisted of 30 isolates from cheese, Saeu-jeotgal and sausages, which were classified into 12 STs. The predominant ST, ST26, comprised 25 isolates and presented as a singleton. Most of the isolates from Myeolchi-jeotgal and sausages clustered on two different branches of a phylogenetic tree generated with a cluster analysis using the maximum likelihood algorithm. This MLST scheme established the genetic backgrounds of S. equorum strains isolated from different types of food. Among the housekeeping genes used for MLST, gmk had the fewest allele types and fairly low sequence identities (74.0-90.0%) within the Staphylococcus species. Therefore, sequence analyses of the gmk gene and 16S rRNA gene can be used for the accurate and rapid identification of S. equorum. PMID:25103946

  3. Application of culture culture-independent molecular biology based methods to evaluate acetic acid bacteria diversity during vinegar processing.

    PubMed

    Ilabaca, Carolina; Navarrete, Paola; Mardones, Pamela; Romero, Jaime; Mas, Albert

    2008-08-15

    Acetic acid bacteria (AAB) are considered fastidious microorganisms because they are difficult to isolate and cultivate. Different molecular approaches were taken to detect AAB diversity, independently of their capacity to grow in culture media. Those methods were tested in samples that originated during traditional vinegar production. Bacterial diversity was assessed by analysis of 16S rRNA gene, obtained by PCR amplifications of DNA extracted directly from the acetification container. Bacterial composition was analyzed by RFLP-PCR of 16S rRNA gene, Temporal Temperature Gradient Gel Electrophoresis (TTGE) separation of amplicons containing region V3-V5 of 16S rRNA gene and cloning of those amplicons. TTGE bands and clones were grouped based on their electrophoretic pattern similarity and sequenced to be compared with reference strains. The main microorganism identified in vinegar was Acetobacter pasteurianus, which at the end of the acetification process was considered to be the only microorganism present. The diversity was the highest at 2% acetic acid, where indefinite species of Gluconacetobacter xylinus/europaeus/intermedius were also present. PMID:18571262

  4. Characterization of fatty acid-producing wastewater microbial communities using next generation sequencing technologies

    EPA Science Inventory

    While wastewater represents a viable source of bacterial biodiesel production, very little is known on the composition of these microbial communities. We studied the taxonomic diversity and succession of microbial communities in bioreactors accumulating fatty acids using 454-pyro...

  5. Neuronal subtypes and diversity revealed by single-nucleus RNA sequencing of the human brain.

    PubMed

    Lake, Blue B; Ai, Rizi; Kaeser, Gwendolyn E; Salathia, Neeraj S; Yung, Yun C; Liu, Rui; Wildberg, Andre; Gao, Derek; Fung, Ho-Lim; Chen, Song; Vijayaraghavan, Raakhee; Wong, Julian; Chen, Allison; Sheng, Xiaoyan; Kaper, Fiona; Shen, Richard; Ronaghi, Mostafa; Fan, Jian-Bing; Wang, Wei; Chun, Jerold; Zhang, Kun

    2016-06-24

    The human brain has enormously complex cellular diversity and connectivities fundamental to our neural functions, yet difficulties in interrogating individual neurons has impeded understanding of the underlying transcriptional landscape. We developed a scalable approach to sequence and quantify RNA molecules in isolated neuronal nuclei from a postmortem brain, generating 3227 sets of single-neuron data from six distinct regions of the cerebral cortex. Using an iterative clustering and classification approach, we identified 16 neuronal subtypes that were further annotated on the basis of known markers and cortical cytoarchitecture. These data demonstrate a robust and scalable method for identifying and categorizing single nuclear transcriptomes, revealing shared genes sufficient to distinguish previously unknown and orthologous neuronal subtypes as well as regional identity and transcriptomic heterogeneity within the human brain. PMID:27339989

  6. Genetic diversity and population structure of Castanopsis eyrei based on simple sequence repeat markers.

    PubMed

    Mao, L H; Zhou, X L; Fang, Y M

    2016-01-01

    Castanopsis eyrei (Fagaceae) is one of the dominant tree species in mid-subtropical, evergreen, broad-leaved forests. We obtained 14 pairs of simple sequence repeat (SSR) primers from previous studies, which were used to analyze 90 C. eyrei individuals from three populations at different altitudes. Low heterozygosity was detected (Fis = 0.6124), and the observed heterozygosity was lower than the expected heterozygosity, possibly because of inbreeding and/or the population substructure. The genetic differentiation between populations was relatively low (Fst = 0.0645); only 7% of the total genetic variation occurred between populations. The medium-altitude population had higher genetic diversity than the low-altitude or high-altitude populations. PMID:27173332

  7. Diversity of bacteria in ships ballast water as revealed by next generation DNA sequencing.

    PubMed

    Brinkmeyer, Robin

    2016-06-15

    The bacterial diversity in ballast water from five general cargo ships calling at the Port of Houston was determined with ion semiconductor DNA sequencing (Ion Torrent PGM) of PCR amplified 16S rRNA genes. Phylogenetic analysis revealed that the composition of bacteria in ballast water did not resemble that of typical marine habitats or even open ocean waters where BWEs occur. The predominant group of bacteria in ships conducting BWEs was the Roseobacter clade within the Alphaproteobacteria. In contrast, Gammaproteobacteria were predominant in the ship that did not conduct a BWE. All the ships contained human, fish, and terrestrial plant pathogens as well as bacteria indicative of fecal or activated sludge contamination. Most of the 60 pathogens had not been detected in ballast water previously. Among these were the human pathogens Corynebacterium diptheriae and several Legionella species and the fish pathogens Francisella piscicida and Piscirickettsia salmonis. PMID:27076378

  8. Seasonal diversity and dynamics of haptophytes in the Skagerrak, Norway, explored by high-throughput sequencing.

    PubMed

    Egge, Elianne Sirnaes; Johannessen, Torill Vik; Andersen, Tom; Eikrem, Wenche; Bittner, Lucie; Larsen, Aud; Sandaa, Ruth-Anne; Edvardsen, Bente

    2015-06-01

    Microalgae in the division Haptophyta play key roles in the marine ecosystem and in global biogeochemical processes. Despite their ecological importance, knowledge on seasonal dynamics, community composition and abundance at the species level is limited due to their small cell size and few morphological features visible under the light microscope. Here, we present unique data on haptophyte seasonal diversity and dynamics from two annual cycles, with the taxonomic resolution and sampling depth obtained with high-throughput sequencing. From outer Oslofjorden, S Norway, nano- and picoplanktonic samples were collected monthly for 2 years, and the haptophytes targeted by amplification of RNA/cDNA with Haptophyta-specific 18S rDNA V4 primers. We obtained 156 operational taxonomic units (OTUs), from c. 400.000 454 pyrosequencing reads, after rigorous bioinformatic filtering and clustering at 99.5%. Most OTUs represented uncultured and/or not yet 18S rDNA-sequenced species. Haptophyte OTU richness and community composition exhibited high temporal variation and significant yearly periodicity. Richness was highest in September-October (autumn) and lowest in April-May (spring). Some taxa were detected all year, such as Chrysochromulina simplex, Emiliania huxleyi and Phaeocystis cordata, whereas most calcifying coccolithophores only appeared from summer to early winter. We also revealed the seasonal dynamics of OTUs representing putative novel classes (clades HAP-3-5) or orders (clades D, E, F). Season, light and temperature accounted for 29% of the variation in OTU composition. Residual variation may be related to biotic factors, such as competition and viral infection. This study provides new, in-depth knowledge on seasonal diversity and dynamics of haptophytes in North Atlantic coastal waters. PMID:25893259

  9. Whole mitochondrial genome sequencing of domestic horses reveals incorporation of extensive wild horse diversity during domestication

    PubMed Central

    2011-01-01

    Background DNA target enrichment by micro-array capture combined with high throughput sequencing technologies provides the possibility to obtain large amounts of sequence data (e.g. whole mitochondrial DNA genomes) from multiple individuals at relatively low costs. Previously, whole mitochondrial genome data for domestic horses (Equus caballus) were limited to only a few specimens and only short parts of the mtDNA genome (especially the hypervariable region) were investigated for larger sample sets. Results In this study we investigated whole mitochondrial genomes of 59 domestic horses from 44 breeds and a single Przewalski horse (Equus przewalski) using a recently described multiplex micro-array capture approach. We found 473 variable positions within the domestic horses, 292 of which are parsimony-informative, providing a well resolved phylogenetic tree. Our divergence time estimate suggests that the mitochondrial genomes of modern horse breeds shared a common ancestor around 93,000 years ago and no later than 38,000 years ago. A Bayesian skyline plot (BSP) reveals a significant population expansion beginning 6,000-8,000 years ago with an ongoing exponential growth until the present, similar to other domestic animal species. Our data further suggest that a large sample of wild horse diversity was incorporated into the domestic population; specifically, at least 46 of the mtDNA lineages observed in domestic horses (73%) already existed before the beginning of domestication about 5,000 years ago. Conclusions Our study provides a window into the maternal origins of extant domestic horses and confirms that modern domestic breeds present a wide sample of the mtDNA diversity found in ancestral, now extinct, wild horse populations. The data obtained allow us to detect a population expansion event coinciding with the beginning of domestication and to estimate both the minimum number of female horses incorporated into the domestic gene pool and the time depth of the

  10. Seasonal diversity and dynamics of haptophytes in the Skagerrak, Norway, explored by high-throughput sequencing

    PubMed Central

    Egge, Elianne Sirnæs; Johannessen, Torill Vik; Andersen, Tom; Eikrem, Wenche; Bittner, Lucie; Larsen, Aud; Sandaa, Ruth-Anne; Edvardsen, Bente

    2015-01-01

    Microalgae in the division Haptophyta play key roles in the marine ecosystem and in global biogeochemical processes. Despite their ecological importance, knowledge on seasonal dynamics, community composition and abundance at the species level is limited due to their small cell size and few morphological features visible under the light microscope. Here, we present unique data on haptophyte seasonal diversity and dynamics from two annual cycles, with the taxonomic resolution and sampling depth obtained with high-throughput sequencing. From outer Oslofjorden, S Norway, nano- and picoplanktonic samples were collected monthly for 2 years, and the haptophytes targeted by amplification of RNA/cDNA with Haptophyta-specific 18S rDNA V4 primers. We obtained 156 operational taxonomic units (OTUs), from c. 400.000 454 pyrosequencing reads, after rigorous bioinformatic filtering and clustering at 99.5%. Most OTUs represented uncultured and/or not yet 18S rDNA-sequenced species. Haptophyte OTU richness and community composition exhibited high temporal variation and significant yearly periodicity. Richness was highest in September–October (autumn) and lowest in April–May (spring). Some taxa were detected all year, such as Chrysochromulina simplex, Emiliania huxleyi and Phaeocystis cordata, whereas most calcifying coccolithophores only appeared from summer to early winter. We also revealed the seasonal dynamics of OTUs representing putative novel classes (clades HAP-3–5) or orders (clades D, E, F). Season, light and temperature accounted for 29% of the variation in OTU composition. Residual variation may be related to biotic factors, such as competition and viral infection. This study provides new, in-depth knowledge on seasonal diversity and dynamics of haptophytes in North Atlantic coastal waters. PMID:25893259

  11. Next generation sequencing to define prokaryotic and fungal diversity in the bovine rumen.

    PubMed

    Fouts, Derrick E; Szpakowski, Sebastian; Purushe, Janaki; Torralba, Manolito; Waterman, Richard C; MacNeil, Michael D; Alexander, Leeson J; Nelson, Karen E

    2012-01-01

    A combination of Sanger and 454 sequences of small subunit rRNA loci were used to interrogate microbial diversity in the bovine rumen of 12 cows consuming a forage diet. Observed bacterial species richness, based on the V1-V3 region of the 16S rRNA gene, was between 1,903 to 2,432 species-level operational taxonomic units (OTUs) when 5,520 reads were sampled per animal. Eighty percent of species-level OTUs were dominated by members of the order Clostridiales, Bacteroidales, Erysipelotrichales and unclassified TM7. Abundance of Prevotella species varied widely among the 12 animals. Archaeal species richness, also based on 16S rRNA, was between 8 and 13 OTUs, representing 5 genera. The majority of archaeal OTUs (84%) found in this study were previously observed in public databases with only two new OTUs discovered. Observed rumen fungal species richness, based on the 18S rRNA gene, was between 21 and 40 OTUs with 98.4-99.9% of OTUs represented by more than one read, using Good's coverage. Examination of the fungal community identified numerous novel groups. Prevotella and Tannerella were overrepresented in the liquid fraction of the rumen while Butyrivibrio and Blautia were significantly overrepresented in the solid fraction of the rumen. No statistical difference was observed between the liquid and solid fractions in biodiversity of archaea and fungi. The survey of microbial communities and analysis of cross-domain correlations suggested there is a far greater extent of microbial diversity in the bovine rumen than previously appreciated, and that next generation sequencing technologies promise to reveal novel species, interactions and pathways that can be studied further in order to better understand how rumen microbial community structure and function affects ruminant feed efficiency, biofuel production, and environmental impact. PMID:23144861

  12. Molecular Diversity Assessment Using Sequence Related Amplified Polymorphism (SRAP) Markers in Vicia faba L.

    PubMed Central

    Alghamdi, Salem S.; Al-Faifi, Sulieman A.; Migdadi, Hussein M.; Khan, Muhammad Altaf; El-Harty, Ehab H.; Ammar, Megahed H.

    2012-01-01

    Sequence-related amplified polymorphism (SRAP) markers were used to assess the genetic diversity and relationship among 58 faba bean (Vicia faba L.) genotypes. Fourteen SRAP primer combinations amplified a total of 1036 differently sized well-resolved peaks (fragments), of which all were polymorphic with a 0.96 PIC value and discriminated all of the 58 faba bean genotypes. An average pairwise similarity of 21% was revealed among the genotypes ranging from 2% to 65%. At a similarity of 28%, UPGMA clustered the genotypes into three main groups comprising 78% of the genotypes. The local landraces and most of the Egyptian genotypes in addition to the Sudan genotypes were grouped in the first main cluster. The advanced breeding lines were scattered in the second and third main clusters with breeding lines from the ICARDA and genotypes introduced from Egypt. At a similarity of 47%, all the genotypes formed separated clusters with the exceptions of Hassawi 1 and Hassawi 2. Group analysis of the genotypes according to their geographic origin and type showed that the landraces were grouped according to their origin, while others were grouped according to their seed type. To our knowledge, this is the first application of SRAP markers for the assessment of genetic diversity in faba bean. Such information will be useful to determine optimal breeding strategies to allow continued progress in faba bean breeding. PMID:23211669

  13. Molecular identification of the microbial diversity in two sequencing batch reactors with activated sludge.

    PubMed

    Denecke, Martin; Eilmus, Sascha; Röder, Nadine; Roesch, Christopher; Bothe, Hermann

    2012-02-01

    The diversity of the microbial community was identified in two lab-scale, ideally mixed sequencing batch reactors which were run for 115 days. One of the reactors was intermittently aerated (2 h aerobically/2 h anaerobically) whereas the other was consistently aerated. The amount of biomass as dry matter, the degradation of organic carbon determined by chemical oxygen demand and nitrogen-degradation activity were followed over the operation of the two reactors and did not show significant differences between the two approaches at the end of the experiment. At this point, the composition of the microbial community was determined by a terminal restriction fragment length polymorphism approach using multiple restriction enzymes by which organisms were retrieved to the lowest taxonomic level. The microbial composition was then significantly different. The species richness was at least five-fold higher in the intermittently aerated reactor than in the permanently kept aerobic approach which is in line with the observation that ecosystem disturbances result in higher diversity. PMID:21786107

  14. Genetic diversity analysis of okra (Abelmoschus esculentus L.) by inter-simple sequence repeat (ISSR) markers.

    PubMed

    Yuan, C Y; Zhang, C; Wang, P; Hu, S; Chang, H P; Xiao, W J; Lu, X T; Jiang, S B; Ye, J Z; Guo, X H

    2014-01-01

    Okra (Abelmoschus esculentus L.) is not only a nutrient-rich vegetable but also an important medicinal herb. Inter-simple sequence repeat (ISSR) markers were employed to investigate the genetic diversity and differentiation of 24 okra genotypes. In this study, the PCR products were separated by electrophoresis on 8% nondenaturing polyacrylamide gel and visualized by silver staining. The 22 ISSR primers produced 289 amplified DNA fragments, and 145 (50%) fragments were polymorphic. The 289 markers were used to construct the dendrogram based on the unweighted pair-group method with arithmetic average (UPGMA) cluster analysis. The dendrogram indicated that 24 okras were clustered into 4 geographically distinct groups. The average polymorphism information content (PIC) was 0.531929, which showed that the majority of primers were informative. The high values of allele frequency, genetic diversity, and heterozygosity showed that primer-sample combinations produced measurable fragments. The mean distances ranged from 0.045455 to 0.454545. The dendrogram indicated that the ISSR markers succeeded in distinguishing most of the 24 varieties in relation to their genetic backgrounds and geographical origins. PMID:24841648

  15. AFM studies in diverse ionic environments of nucleosomes reconstituted on the 601 positioning sequence.

    PubMed

    Nazarov, Igor; Chekliarova, Iana; Rychkov, Georgy; Ilatovskiy, Andrey V; Crane-Robinson, Colyn; Tomilin, Alexey

    2016-02-01

    Atomic force microscopy (AFM) was used to study mononucleosomes reconstituted from a DNA duplex of 353 bp containing the strong 601 octamer positioning sequence, together with recombinant human core histone octamers. Three parameters were measured: 1) the length of DNA wrapped around the core histones; 2) the number of superhelical turns, calculated from the total angle through which the DNA is bent, and 3) the volume of the DNA-histone core. This approach allowed us to define in detail the structural diversity of nucleosomes caused by disassembly of the octasome to form subnucleosomal structures containing hexasomes, tetrasomes and disomes. At low ionic strength (TE buffer) and in the presence of physiological concentrations of monovalent cations, the majority of the particles were subnucleosomal, but physiological concentrations of bivalent cations resulted in about half of the nucleosomes being canonical octasomes in which the exiting DNA duplexes cross orthogonally. The dominance of this last species explains why bivalent but not monovalent cations can induce the initial step towards compaction and convergence of neighboring nucleosomes in nucleosomal arrays to form the chromatin fiber in the absence of linker histone. The observed nucleosome structural diversity may reflect the functional plasticity of nucleosomes under physiological conditions. PMID:26586109

  16. Genetic diversity of wild soybean populations in Dongying, China, by simple sequence repeat analysis.

    PubMed

    Wang, Y H; Zhang, X J; Fan, S J

    2015-01-01

    Annual wild soybean (Glycine soja Sieb. et Zucc.), the ancestor of cultivated soybean (G. max), is believed to be a potential gene source for further improvement of soybean to cope with environmental stress. In this study, 10 simple sequence repeat (SSR) markers were used to evaluate the genetic diversity and population genetic structure in five wild soybean populations using 195 accessions collected from Dongying, China. Ten SSR markers yielded 90 bands, with an average of nine bands per marker. The percentage of polymorphic loci (P) was 97.78%, the distribution of expected heterozygosity (HE) was 0.1994-0.4460 with an average of 0.3262, and the distribution from Shannon's information index (I) was 0.3595-0.6506 with an average of 0.5386. The results showed that wild soybean had a high degree of genetic diversity at the species level. Nei's differentiation coefficient (FST) was 0.1533, and gene flow (Nm) was 1.3805, which indicated that genetic variation mainly existed within populations and that there was a certain level of gene exchange between populations. Some genetic differentiation occurred among populations, although this was not significant. Cluster analysis indicated that there was no significant correlation between the genetic structure of wild soybean populations and their geographic distribution, and the clustering results may be relatively consistent with the habitats of the accessions. In the present study, the genetic diversity of wild soybeans showed a broad genetic base and enables suggestions for the conservation of this plant to be made. PMID:26436402

  17. Determination of Microbial Diversity and Nitrogen Cycling from Kizildere Geothermal Field with Next Generation Sequencing

    NASA Astrophysics Data System (ADS)

    Gulecal, Y.; Dilek, Y.

    2012-12-01

    The deep terrestrial subsurface biosphere represents an emerging frontier for studies of biodiversity, the physiological limits to life, microbial mechanisms of adaptation, and potentially analogous environments for extraterrestrial life (1). Last decade, researches of deep boreholes in the United States, Finland, Sweden, Japan and South Africa, using molecular tools, have shown an an active biosphere composed of diverse groups of microorganisms. The microbial communities reported from different subsurface communities vary widely; such differences are due to different host rock types and varied water origins and chemistry, as well as geography. Furthermore, nitrogen cycling is studied intensely in hot springs for instance in situ nifH expression in Yellowstone National Park, is a new upper temperature limit for nitrogen fixation in alkaline, terrestrial hydrothermal environments (2). This study explores the genetic diversity of microbial communities and genes of nitrogen cycling in Kizildere Geothermal Field, Turkey. The Kizildere thermal waters are located in the northern part of the Büyük Menderes rift zone. The hydrothermal alteration includes phyllic, argillic, silicic,hematitized, and carbonatized alteration zones. The surface temperatures of Kizildere thermal waters in drill holes range from 95 to100°C and pH 9.0-9.5. Microbial communities were examined using culture independent methods, next generation sequencing. Nitrogen fixation, the diversity of nifH, ammonia oxidation (amoA), narG, nosZ genes are investigated in deeply-sourced fluids. We present field observations and interpret new data, establishing a geobiological baseline for previously undescribed sitres of subsurface ecosystems. (1)Fredrickson et al. 2006. Geomicrobial processes and biodiversity in the deep terrestrial subsurface. Geomicrobiology J. 23:345-356. (2) Loiacono et al. 2012. Evidence for high-temperature in situ nifH transcription in an alkaline hot spring of Lower Geyser Basin

  18. Partial amino acid sequence of human factor D:homology with serine proteases.

    PubMed Central

    Volanakis, J E; Bhown, A; Bennett, J C; Mole, J E

    1980-01-01

    Human factor D purified to homogeneity by a modified procedure was subjected to NH2-terminal amino acid sequence analysis by using a modified automated Beckman sequencer. We identified 48 of the first 57 NH2-terminal amino acids in a single sequencer run, using microgram quantities of factor D. The deduced amino acid sequence represents approximately 25% of the primary structure of factor D. This extended NH2-terminal amino acid sequence of factor D was compared to that of other trypsin-related serine proteases. By visual inspection, strong homologies (33--50% identity) were observed with all the serine proteases included in the comparison. Interestingly, factor D showed a higher degree of homology to serine proteases of pancreatic origin than to those of serum origin. Images PMID:6987665

  19. Evaluation of cytochrome b mtDNA sequences in genetic diversity studies of Channa marulius (Channidae: Perciformes).

    PubMed

    Habib, Maria; Lakra, W S; Mohindra, Vindhya; Khare, Praveen; Barman, A S; Singh, Akanksha; Lal, Kuldeep K; Punia, Peyush; Khan, Asif A

    2011-02-01

    Channa marulius (Hamilton, 1822) is a commercially important freshwater fish and a potential candidate species for aquaculture. The present study evaluated partial Cytochrome b gene sequence of mtDNA for determining the genetic variation in wild populations of C. marulius. Genomic DNA extracted from C. marulius samples (n = 23) belonging to 3 distant rivers; Mahanadi, Teesta and Yamuna was analyzed. Sequencing of 307 bp Cytochrome b mtDNA fragment revealed the presence of 5 haplotypes with haplotype diversity value of 0.763 and nucleotide diversity value of 0.0128. Single population specific haplotype was observed in Mahanadi and Yamuna samples and 3 haplotypes in Teesta samples. The analysis of data demonstrated the suitability of partial Cytochrome b sequence in determining the genetic diversity in C. marulius population. PMID:20443065

  20. Sequence diversity among badnavirus isolates infecting black pepper and related species in India.

    PubMed

    Bhat, A I; Sasi, Shina; Revathy, K A; Deeshma, K P; Saji, K V

    2014-01-01

    The badnavirus, piper yellow mottle virus (PYMoV) is known to infect black pepper (Piper nigrum), betelvine (P. betle) and Indian long pepper (P. longum) in India and other parts of the world. Occurrence of PYMoV or other badnaviruses in other species of Piper and its variability is not reported so far. We have analysed sequence variability in the conserved putative reverse transcriptase (RT)/ribonuclease H (RNase H) coding region of the virus using specific badnavirus primers from 13 virus isolates of black pepper collected from different cultivars and regions and one isolate each from 23 other species of Piper. Of these, four species failed to produce expected amplicon while amplicon from four other species showed more similarities to plant sequences than to badnaviruses. Of the remaining, isolates from black pepper, P. argyrophyllum, P. attenuatum, P. barberi, P. betle, P. colubrinum, P. galeatum, P. longum, P. ornatum, P. sarmentosum and P. trichostachyon showed an identity of >85 % at the nucleotide and >90 % at the amino acid level with PYMoV indicating that they are isolates of PYMoV. On the other hand high sequence variability of 21-43 % at nucleotide and 17-46 % at amino acid level compared to PYMoV was found among isolates infecting P. bababudani, P. chaba, P. peepuloides, P. mullesua and P. thomsonii suggesting the presence of new badnaviruses. Phylogenetic analyses showed close clustering of all PYMoV isolates that were well separated from other known badnaviruses. This is the first report of occurrence of PYMoV in eight Piper spp and likely occurrence of four new species in five Piper spp. PMID:25674613

  1. Amino acid sequence of Japanese quail (Coturnix japonica) and northern bobwhite (Colinus virginianus) myoglobin.

    PubMed

    Goodson, John; Beckstead, Robert B; Payne, Jason; Singh, Rakesh K; Mohan, Anand

    2015-08-15

    Myoglobin has an important physiological role in vertebrates, and as the primary sarcoplasmic pigment in meat, influences quality perception and consumer acceptability. In this study, the amino acid sequences of Japanese quail and northern bobwhite myoglobin were deduced by cDNA cloning of the coding sequence from mRNA. Japanese quail myoglobin was isolated from quail cardiac muscles, purified using ammonium sulphate precipitation and gel-filtration, and subjected to multiple enzymatic digestions. Mass spectrometry corroborated the deduced protein amino acid sequence at the protein level. Sequence analysis revealed both species' myoglobin structures consist of 153 amino acids, differing at only three positions. When compared with chicken myoglobin, Japanese quail showed 98% sequence identity, and northern bobwhite 97% sequence identity. The myoglobin in both quail species contained eight histidine residues instead of the nine present in chicken and turkey. PMID:25794748

  2. Deep sequencing reveals exceptional diversity and modes of transmission for bacterial sponge symbionts

    PubMed Central

    Webster, Nicole S; Taylor, Michael W; Behnam, Faris; Lücker, Sebastian; Rattei, Thomas; Whalan, Stephen; Horn, Matthias; Wagner, Michael

    2010-01-01

    Marine sponges contain complex bacterial communities of considerable ecological and biotechnological importance, with many of these organisms postulated to be specific to sponge hosts. Testing this hypothesis in light of the recent discovery of the rare microbial biosphere, we investigated three Australian sponges by massively parallel 16S rRNA gene tag pyrosequencing. Here we show bacterial diversity that is unparalleled in an invertebrate host, with more than 250 000 sponge-derived sequence tags being assigned to 23 bacterial phyla and revealing up to 2996 operational taxonomic units (95% sequence similarity) per sponge species. Of the 33 previously described ‘sponge-specific’ clusters that were detected in this study, 48% were found exclusively in adults and larvae – implying vertical transmission of these groups. The remaining taxa, including ‘Poribacteria’, were also found at very low abundance among the 135 000 tags retrieved from surrounding seawater. Thus, members of the rare seawater biosphere may serve as seed organisms for widely occurring symbiont populations in sponges and their host association might have evolved much more recently than previously thought. PMID:21966903

  3. Genetic diversity and relationship of Mauremys mutica and M. annamensis assessed by DNA barcoding sequences.

    PubMed

    Zhao, Jian; Li, Wei; Wen, Ping; Zhang, Dandan; Zhu, Xinping

    2016-09-01

    The mitochondrial DNA cytochrome c oxidase subunit I gene (COI) has been used as an efficient barcoding tool for species identification of animals. In this study, the barcoding sequences were used to assess the genetic diversity and relationship of Mauremy mutica and M. annamensis. Four currently recognized groups of M. mutica were classified into two groups in this study, with 6% intergroup distances, the S group and the N group, consistent to the calling of "southern turtle" and "northern turtle" in folk of China. The north population and Taiwan population formed the N group, and further, the Taiwan population was differentiated as a monophyly originated from the north population, consistent to the calling of "big green head" for the Taiwan population and "small green head" for the north population. The Vietnam, Hainan population, and M. annamensis formed the S group, and the barcoding sequences could not distinguish them from each other. Based on the molecular data and phenotypes of existing hybrids, hybrid origin of M. annamensis may be another possibility. PMID:26260182

  4. Comparison of DNA extraction methodologies used for assessing fungal diversity via ITS sequencing

    PubMed Central

    Rittenour, William R.; Park, Ju-Hyeong; Cox-Ganser, Jean M.; Beezhold, Donald H.; Green, Brett J.

    2015-01-01

    Traditional methods of assessing fungal exposure have been confounded by a number of limiting variables. The recent utilization of molecular methods such as internal transcribed spacer (ITS) sequencing of ribosomal RNA genes has provided improved insight into the diversity of fungal bioaerosols in indoor, outdoor and occupational environments. However, ITS analyses may also be confounded by a number of methodological limitations. In this study, we have optimized this technology for use in occupational or environmental studies. Three commonly used DNA extraction methodologies (UltraClean Soil kit, High Pure PCR Template kit, and EluQuik/DNeasy kit) were compared in terms of sensitivity and susceptibility to PCR inhibitors in dust for three common fungal bioaerosols, Aspergillus versicolor, Rhizopus microsporus and Wallemia sebi. Environmental dust samples were then studied using each extraction methodology and results were compared to viable culture data. The extraction methods differed in terms of their ability to efficiently extract DNA from particular species of fungi (e.g. Aspergillus versicolor). In addition, the ability to remove PCR inhibitors from dust samples was most effective using the soil DNA extraction kit. The species composition varied greatly between ITS clone libraries generated with the different DNA extraction kits. However, compared to viable culture data, ITS clone libraries included additional fungal species that are incapable of growth on solid culture medium. Collectively, our data indicated that DNA extraction methodologies used in ITS sequencing studies of occupational or environmental dust samples can greatly influence the fungal species that are detected. PMID:22230933

  5. Chromosomal Organization and Sequence Diversity of Genes Encoding Lachrymatory Factor Synthase in Allium cepa L.

    PubMed

    Masamura, Noriya; McCallum, John; Khrustaleva, Ludmila; Kenel, Fernand; Pither-Joyce, Meegham; Shono, Jinji; Suzuki, Go; Mukai, Yasuhiko; Yamauchi, Naoki; Shigyo, Masayoshi

    2012-06-01

    Lachrymatory factor synthase (LFS) catalyzes the formation of lachrymatory factor, one of the most distinctive traits of bulb onion (Allium cepa L.). Therefore, we used LFS as a model for a functional gene in a huge genome, and we examined the chromosomal organization of LFS in A. cepa by multiple approaches. The first-level analysis completed the chromosomal assignment of LFS gene to chromosome 5 of A. cepa via the use of a complete set of A. fistulosum-shallot (A. cepa L. Aggregatum group) monosomic addition lines. Subsequent use of an F(2) mapping population from the interspecific cross A. cepa × A. roylei confirmed the assignment of an LFS locus to this chromosome. Sequence comparison of two BAC clones bearing LFS genes, LFS amplicons from diverse germplasm, and expressed sequences from a doubled haploid line revealed variation consistent with duplicated LFS genes. Furthermore, the BAC-FISH study using the two BAC clones as a probe showed that LFS genes are localized in the proximal region of the long arm of the chromosome. These results suggested that LFS in A. cepa is transcribed from at least two loci and that they are localized on chromosome 5. PMID:22690373

  6. Diverse gene sequences are overexpressed in werner syndrome fibroblasts undergoing premature replicative senescence.

    PubMed Central

    Murano, S; Thweatt, R; Shmookler Reis, R J; Jones, R A; Moerman, E J; Goldstein, S

    1991-01-01

    Genes that play a role in the senescent arrest of cellular replication are likely to be overexpressed in human diploid fibroblasts (HDF) derived from subjects with Werner syndrome (WS) because these cells have a severely curtailed replicative life span. To identify some of these genes, a cDNA library was constructed from WS HDF after they had been serum depleted and repleted (5 days in medium containing 1% serum followed by 24 h in medium containing 20% serum). Differential screening of 7,500 colonies revealed 102 clones that hybridized preferentially with [32P]cDNA derived from RNA of WS cells compared with [32P]cDNA derived from normal HDF. Cross-hybridization and partial DNA sequence determination identified 18 independent gene sequences, 9 of them known and 9 unknown. The known genes included alpha 1(I) procollagen, alpha 2(I) procollagen, fibronectin, ferritin heavy chain, insulinlike growth factor-binding protein-3 (IGFBP-3), osteonectin, human tissue plasminogen activator inhibitor type I, thrombospondin, and alpha B-crystallin. The nine unknown clones included two novel gene sequences and seven additional sequences that contained both novel segments and the Alu class of repetitive short interspersed nuclear elements; five of these seven Alu+ clones also contained the long interpersed nuclear element I (KpnI) family of repetitive elements. Northern (RNA) analysis, using the 18 sequences as probes, showed higher levels of these mRNAs in WS HDF than in normal HDF. Five selected mRNAs studied in greater detail [alpha 1(I) procollagen, fibronectin, insulinlike growth factor-binding protein-3, WS3-10, and WS9-14] showed higher mRNA levels in both WS and late-passage normal HDF than in early-passage normal HDF at various intervals following serum depletion/repletion and after subculture and growth from sparse to high-density confluent arrest. These results indicate that senescence of both WS and normal HDF is accompanied by overexpression of similar sets of

  7. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.

  8. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-03-24

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.

  9. Assessing genetic diversity among Brettanomyces yeasts by DNA fingerprinting and whole-genome sequencing.

    PubMed

    Crauwels, Sam; Zhu, Bo; Steensels, Jan; Busschaert, Pieter; De Samblanx, Gorik; Marchal, Kathleen; Willems, Kris A; Verstrepen, Kevin J; Lievens, Bart

    2014-07-01

    Brettanomyces yeasts, with the species Brettanomyces (Dekkera) bruxellensis being the most important one, are generally reported to be spoilage yeasts in the beer and wine industry due to the production of phenolic off flavors. However, B. bruxellensis is also known to be a beneficial contributor in certain fermentation processes, such as the production of certain specialty beers. Nevertheless, despite its economic importance, Brettanomyces yeasts remain poorly understood at the genetic and genomic levels. In this study, the genetic relationship between more than 50 Brettanomyces strains from all presently known species and from several sources was studied using a combination of DNA fingerprinting techniques. This revealed an intriguing correlation between the B. bruxellensis fingerprints and the respective isolation source. To further explore this relationship, we sequenced a (beneficial) beer isolate of B. bruxellensis (VIB X9085; ST05.12/22) and compared its genome sequence with the genome sequences of two wine spoilage strains (AWRI 1499 and CBS 2499). ST05.12/22 was found to be substantially different from both wine strains, especially at the level of single nucleotide polymorphisms (SNPs). In addition, there were major differences in the genome structures between the strains investigated, including the presence of large duplications and deletions. Gene content analysis revealed the presence of 20 genes which were present in both wine strains but absent in the beer strain, including many genes involved in carbon and nitrogen metabolism, and vice versa, no genes that were missing in both AWRI 1499 and CBS 2499 were found in ST05.12/22. Together, this study provides tools to discriminate Brettanomyces strains and provides a first glimpse at the genetic diversity and genome plasticity of B. bruxellensis. PMID:24814796

  10. Assessing Genetic Diversity among Brettanomyces Yeasts by DNA Fingerprinting and Whole-Genome Sequencing

    PubMed Central

    Crauwels, Sam; Zhu, Bo; Steensels, Jan; Busschaert, Pieter; De Samblanx, Gorik; Marchal, Kathleen; Willems, Kris A.

    2014-01-01

    Brettanomyces yeasts, with the species Brettanomyces (Dekkera) bruxellensis being the most important one, are generally reported to be spoilage yeasts in the beer and wine industry due to the production of phenolic off flavors. However, B. bruxellensis is also known to be a beneficial contributor in certain fermentation processes, such as the production of certain specialty beers. Nevertheless, despite its economic importance, Brettanomyces yeasts remain poorly understood at the genetic and genomic levels. In this study, the genetic relationship between more than 50 Brettanomyces strains from all presently known species and from several sources was studied using a combination of DNA fingerprinting techniques. This revealed an intriguing correlation between the B. bruxellensis fingerprints and the respective isolation source. To further explore this relationship, we sequenced a (beneficial) beer isolate of B. bruxellensis (VIB X9085; ST05.12/22) and compared its genome sequence with the genome sequences of two wine spoilage strains (AWRI 1499 and CBS 2499). ST05.12/22 was found to be substantially different from both wine strains, especially at the level of single nucleotide polymorphisms (SNPs). In addition, there were major differences in the genome structures between the strains investigated, including the presence of large duplications and deletions. Gene content analysis revealed the presence of 20 genes which were present in both wine strains but absent in the beer strain, including many genes involved in carbon and nitrogen metabolism, and vice versa, no genes that were missing in both AWRI 1499 and CBS 2499 were found in ST05.12/22. Together, this study provides tools to discriminate Brettanomyces strains and provides a first glimpse at the genetic diversity and genome plasticity of B. bruxellensis. PMID:24814796

  11. The amino acid sequence of protein CM-3 from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J

    1985-01-01

    Protein CM-3 from Dendroaspis polylepis polylepis venom was purified by gel filtration and ion exchange chromatography. It comprises 65 amino acids including eight half-cystines. The complete amino acid sequence of protein CM-3 has been elucidated. The sequence (residues 1-50) resembles that of the N-terminal sequence of the subunits of a synergistic type protein and residues 51-65 that of the C-terminal sequence of an angusticeps type protein. Mixtures of protein CM-3 and angusticeps type proteins showed no apparent synergistic effect, in that their toxicity in combination was no greater than the sum of their individual toxicities. PMID:4029488

  12. Complete ecological isolation and cryptic diversity in Polynucleobacter bacteria not resolved by 16S rRNA gene sequences.

    PubMed

    Hahn, Martin W; Jezberová, Jitka; Koll, Ulrike; Saueressig-Beck, Tanja; Schmidt, Johanna

    2016-07-01

    Transplantation experiments and genome comparisons were used to determine if lineages of planktonic Polynucleobacter almost indistinguishable by their 16S ribosomal RNA (rRNA) sequences differ distinctively in their ecophysiological and genomic traits. The results of three transplantation experiments differing in complexity of biotic interactions revealed complete ecological isolation between some of the lineages. This pattern fits well to the previously detected environmental distribution of lineages along chemical gradients, as well as to differences in gene content putatively providing adaptation to chemically distinct habitats. Patterns of distribution of iron transporter genes across 209 Polynucleobacter strains obtained from freshwater systems and representing a broad pH spectrum further emphasize differences in habitat-specific adaptations. Genome comparisons of six strains sharing ⩾99% 16S rRNA similarities suggested that each strain represents a distinct species. Comparison of sequence diversity among genomes with sequence diversity among 240 cultivated Polynucleobacter strains indicated a large cryptic species complex not resolvable by 16S rRNA sequences. The revealed ecological isolation and cryptic diversity in Polynucleobacter bacteria is crucial in the interpretation of diversity studies on freshwater bacterioplankton based on ribosomal sequences. PMID:26943621

  13. Complete ecological isolation and cryptic diversity in Polynucleobacter bacteria not resolved by 16S rRNA gene sequences

    PubMed Central

    Hahn, Martin W; Jezberová, Jitka; Koll, Ulrike; Saueressig-Beck, Tanja; Schmidt, Johanna

    2016-01-01

    Transplantation experiments and genome comparisons were used to determine if lineages of planktonic Polynucleobacter almost indistinguishable by their 16S ribosomal RNA (rRNA) sequences differ distinctively in their ecophysiological and genomic traits. The results of three transplantation experiments differing in complexity of biotic interactions revealed complete ecological isolation between some of the lineages. This pattern fits well to the previously detected environmental distribution of lineages along chemical gradients, as well as to differences in gene content putatively providing adaptation to chemically distinct habitats. Patterns of distribution of iron transporter genes across 209 Polynucleobacter strains obtained from freshwater systems and representing a broad pH spectrum further emphasize differences in habitat-specific adaptations. Genome comparisons of six strains sharing ⩾99% 16S rRNA similarities suggested that each strain represents a distinct species. Comparison of sequence diversity among genomes with sequence diversity among 240 cultivated Polynucleobacter strains indicated a large cryptic species complex not resolvable by 16S rRNA sequences. The revealed ecological isolation and cryptic diversity in Polynucleobacter bacteria is crucial in the interpretation of diversity studies on freshwater bacterioplankton based on ribosomal sequences. PMID:26943621

  14. The Chinese hamster Alu-equivalent sequence: a conserved highly repetitious, interspersed deoxyribonucleic acid sequence in mammals has a structure suggestive of a transposable element.

    PubMed Central

    Haynes, S R; Toomey, T P; Leinwand, L; Jelinek, W R

    1981-01-01

    A consensus sequence has been determined for a major interspersed deoxyribonucleic acid repeat in the genome of Chinese hamster ovary cells (CHO cells). This sequence is extensively homologous to (i) the human Alu sequence (P. L. Deininger et al., J. Mol. Biol., in press), (ii) the mouse B1 interspersed repetitious sequence (Krayev et al., Nucleic Acids Res. 8:1201-1215, 1980) (iii) an interspersed repetitious sequence from African green monkey deoxyribonucleic acid (Dhruva et al., Proc. Natl. Acad. Sci. U.S.A. 77:4514-4518, 1980) and (iv) the CHO and mouse 4.5S ribonucleic acid (this report; F. Harada and N. Kato, Nucleic Acids Res. 8:1273-1285, 1980). Because the CHO consensus sequence shows significant homology to the human Alu sequence it is termed the CHO Alu-equivalent sequence. A conserved structure surrounding CHO Alu-equivalent family members can be recognized. It is similar to that surrounding the human Alu and the mouse B1 sequences, and is represented as follows: direct repeat-CHO-Alu-A-rich sequence-direct repeat. A composite interspersed repetitious sequence has been identified. Its structure is represented as follows: direct repeat-residue 47 to 107 of CHO-Alu-non-Alu repetitious sequence-A-rich sequence-direct repeat. Because the Alu flanking sequences resemble those that flank known transposable elements, we think it likely that the Alu sequence dispersed throughout the mammalian genome by transposition. Images PMID:9279371

  15. Poly (beta-L-malic acid) production by diverse phylogenetic clades of Aureobasidium pullulans

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Poly (beta-L-malic acid) (PMA) is a natural biopolyester that has pharmaceutical applications and other potential uses. Here we examine PMA production by genetically diverse phylogenetic clades of the fungus A. pullulans. Thirty-six strains of A. pullulans were isolated for this study from various...

  16. Computer Simulation of the Determination of Amino Acid Sequences in Polypeptides

    ERIC Educational Resources Information Center

    Daubert, Stephen D.; Sontum, Stephen F.

    1977-01-01

    Describes a computer program that generates a random string of amino acids and guides the student in determining the correct sequence of a given protein by using experimental analytic data for that protein. (MLH)

  17. Repulsive parallel MCMC algorithm for discovering diverse motifs from large sequence sets

    PubMed Central

    Ikebata, Hisaki; Yoshida, Ryo

    2015-01-01

    Motivation: The motif discovery problem consists of finding recurring patterns of short strings in a set of nucleotide sequences. This classical problem is receiving renewed attention as most early motif discovery methods lack the ability to handle large data of recent genome-wide ChIP studies. New ChIP-tailored methods focus on reducing computation time and pay little regard to the accuracy of motif detection. Unlike such methods, our method focuses on increasing the detection accuracy while maintaining the computation efficiency at an acceptable level. The major advantage of our method is that it can mine diverse multiple motifs undetectable by current methods. Results: The repulsive parallel Markov chain Monte Carlo (RPMCMC) algorithm that we propose is a parallel version of the widely used Gibbs motif sampler. RPMCMC is run on parallel interacting motif samplers. A repulsive force is generated when different motifs produced by different samplers near each other. Thus, different samplers explore different motifs. In this way, we can detect much more diverse motifs than conventional methods can. Through application to 228 transcription factor ChIP-seq datasets of the ENCODE project, we show that the RPMCMC algorithm can find many reliable cofactor interacting motifs that existing methods are unable to discover. Availability and implementation: A C++ implementation of RPMCMC and discovered cofactor motifs for the 228 ENCODE ChIP-seq datasets are available from http://daweb.ism.ac.jp/yoshidalab/motif. Contact: ikebata.hisaki@ism.ac.jp, yoshidar@ism.ac.jp Supplementary information: Supplementary data are available from Bioinformatics online. PMID:25583120

  18. Reduced representation genome sequencing suggests low diversity on the sex chromosomes of tonkean macaque monkeys.

    PubMed

    Evans, Ben J; Zeng, Kai; Esselstyn, Jacob A; Charlesworth, Brian; Melnick, Don J

    2014-09-01

    In species with separate sexes, social systems can differ in the relative variances of male versus female reproductive success. Papionin monkeys (macaques, mangabeys, mandrills, drills, baboons, and geladas) exhibit hallmarks of a high variance in male reproductive success, including a female-biased adult sex ratio and prominent sexual dimorphism. To explore the potential genomic consequences of such sex differences, we used a reduced representation genome sequencing approach to quantifying polymorphism at sites on autosomes and sex chromosomes of the tonkean macaque (Macaca tonkeana), a species endemic to the Indonesian island of Sulawesi. The ratio of nucleotide diversity of the X chromosome to that of the autosomes was less than the value (0.75) expected with a 1:1 sex ratio and no sex differences in the variance in reproductive success. However, the significance of this difference was dependent on which outgroup was used to standardize diversity levels. Using a new model that includes the effects of varying population size, sex differences in mutation rate between the autosomes and X chromosome, and GC-biased gene conversion (gBGC) or selection on GC content, we found that the maximum-likelihood estimate of the ratio of effective population size of the X chromosome to that of the autosomes was 0.68, which did not differ significantly from 0.75. We also found evidence for 1) a higher level of purifying selection on genic than nongenic regions, 2) gBGC or natural selection favoring increased GC content, 3) a dynamic demography characterized by population growth and contraction, 4) a higher mutation rate in males than females, and 5) a very low polymorphism level on the Y chromosome. These findings shed light on the population genomic consequences of sex differences in the variance in reproductive success, which appear to be modest in the tonkean macaque; they also suggest the occurrence of hitchhiking on the Y chromosome. PMID:24987106

  19. DNA Sequence Analyses Reveal Abundant Diversity, Endemism and Evidence for Asian Origin of the Porcini Mushrooms

    PubMed Central

    Feng, Bang; Xu, Jianping; Wu, Gang; Zeng, Nian-Kai; Li, Yan-Chun; Tolgor, Bau; Kost, Gerhard W.; Yang, Zhu L.

    2012-01-01

    The wild gourmet mushroom Boletus edulis and its close allies are of significant ecological and economic importance. They are found throughout the Northern Hemisphere, but despite their ubiquity there are still many unresolved issues with regard to the taxonomy, systematics and biogeography of this group of mushrooms. Most phylogenetic studies of Boletus so far have characterized samples from North America and Europe and little information is available on samples from other areas, including the ecologically and geographically diverse regions of China. Here we analyzed DNA sequence variation in three gene markers from samples of these mushrooms from across China and compared our findings with those from other representative regions. Our results revealed fifteen novel phylogenetic species (about one-third of the known species) and a newly identified lineage represented by Boletus sp. HKAS71346 from tropical Asia. The phylogenetic analyses support eastern Asia as the center of diversity for the porcini sensu stricto clade. Within this clade, B. edulis is the only known holarctic species. The majority of the other phylogenetic species are geographically restricted in their distributions. Furthermore, molecular dating and geological evidence suggest that this group of mushrooms originated during the Eocene in eastern Asia, followed by dispersal to and subsequent speciation in other parts of Asia, Europe, and the Americas from the middle Miocene through the early Pliocene. In contrast to the ancient dispersal of porcini in the strict sense in the Northern Hemisphere, the occurrence of B. reticulatus and B. edulis sensu lato in the Southern Hemisphere was probably due to recent human-mediated introductions. PMID:22629418

  20. Sequence, structure and functional diversity of PD-(D/E)XK phosphodiesterase superfamily

    PubMed Central

    Steczkiewicz, Kamil; Muszewska, Anna; Knizewski, Lukasz; Rychlewski, Leszek; Ginalski, Krzysztof

    2012-01-01

    Proteins belonging to PD-(D/E)XK phosphodiesterases constitute a functionally diverse superfamily with representatives involved in replication, restriction, DNA repair and tRNA–intron splicing. Their malfunction in humans triggers severe diseases, such as Fanconi anemia and Xeroderma pigmentosum. To date there have been several attempts to identify and classify new PD-(D/E)KK phosphodiesterases using remote homology detection methods. Such efforts are complicated, because the superfamily exhibits extreme sequence and structural divergence. Using advanced homology detection methods supported with superfamily-wide domain architecture and horizontal gene transfer analyses, we provide a comprehensive reclassification of proteins containing a PD-(D/E)XK domain. The PD-(D/E)XK phosphodiesterases span over 21 900 proteins, which can be classified into 121 groups of various families. Eleven of them, including DUF4420, DUF3883, DUF4263, COG5482, COG1395, Tsp45I, HaeII, Eco47II, ScaI, HpaII and Replic_Relax, are newly assigned to the PD-(D/E)XK superfamily. Some groups of PD-(D/E)XK proteins are present in all domains of life, whereas others occur within small numbers of organisms. We observed multiple horizontal gene transfers even between human pathogenic bacteria or from Prokaryota to Eukaryota. Uncommon domain arrangements greatly elaborate the PD-(D/E)XK world. These include domain architectures suggesting regulatory roles in Eukaryotes, like stress sensing and cell-cycle regulation. Our results may inspire further experimental studies aimed at identification of exact biological functions, specific substrates and molecular mechanisms of reactions performed by these highly diverse proteins. PMID:22638584

  1. Genetic diversity among air yam (Dioscorea bulbifera) varieties based on single sequence repeat markers.

    PubMed

    Silva, D M; Siqueira, M V B M; Carrasco, N F; Mantello, C C; Nascimento, W F; Veasey, E A

    2016-01-01

    Dioscorea is the largest genus in the Dioscoreaceae family, and includes a number of economically important species including the air yam, D. bulbifera L. This study aimed to develop new single sequence repeat primers and characterize the genetic diversity of local varieties that originated in several municipalities of Brazil. We developed an enriched genomic library for D. bulbifera resulting in seven primers, six of which were polymorphic, and added four polymorphic loci developed for other Dioscorea species. This resulted in 10 polymorphic primers to evaluate 42 air yam accessions. Thirty-three alleles (bands) were found, with an average of 3.3 alleles per locus. The discrimination power ranged from 0.113 to 0.834, with an average of 0.595. Both principal coordinate and cluster analyses (using the Jaccard Index) failed to clearly separate the accessions according to their origins. However, the 13 accessions from Conceição dos Ouros, Minas Gerais State were clustered above zero on the principal coordinate 2 axis, and were also clustered into one subgroup in the cluster analysis. Accessions from Ubatuba, São Paulo State were clustered below zero on the same principal coordinate 2 axis, except for one accession, although they were scattered in several subgroups in the cluster analysis. Therefore, we found little spatial structure in the accessions, although those from Conceição dos Ouros and Ubatuba exhibited some spatial structure, and that there is a considerable level of genetic diversity in D. bulbifera maintained by traditional farmers in Brazil. PMID:27323077

  2. PCR Primers to Study the Diversity of Expressed Fungal Genes Encoding Lignocellulolytic Enzymes in Soils Using High-Throughput Sequencing

    PubMed Central

    Barbi, Florian; Bragalini, Claudia; Vallon, Laurent; Prudent, Elsa; Dubost, Audrey; Fraissinet-Tachet, Laurence; Marmeisse, Roland; Luis, Patricia

    2014-01-01

    Plant biomass degradation in soil is one of the key steps of carbon cycling in terrestrial ecosystems. Fungal saprotrophic communities play an essential role in this process by producing hydrolytic enzymes active on the main components of plant organic matter. Open questions in this field regard the diversity of the species involved, the major biochemical pathways implicated and how these are affected by external factors such as litter quality or climate changes. This can be tackled by environmental genomic approaches involving the systematic sequencing of key enzyme-coding gene families using soil-extracted RNA as material. Such an approach necessitates the design and evaluation of gene family-specific PCR primers producing sequence fragments compatible with high-throughput sequencing approaches. In the present study, we developed and evaluated PCR primers for the specific amplification of fungal CAZy Glycoside Hydrolase gene families GH5 (subfamily 5) and GH11 encoding endo-β-1,4-glucanases and endo-β-1,4-xylanases respectively as well as Basidiomycota class II peroxidases, corresponding to the CAZy Auxiliary Activity family 2 (AA2), active on lignin. These primers were experimentally validated using DNA extracted from a wide range of Ascomycota and Basidiomycota species including 27 with sequenced genomes. Along with the published primers for Glycoside Hydrolase GH7 encoding enzymes active on cellulose, the newly design primers were shown to be compatible with the Illumina MiSeq sequencing technology. Sequences obtained from RNA extracted from beech or spruce forest soils showed a high diversity and were uniformly distributed in gene trees featuring the global diversity of these gene families. This high-throughput sequencing approach using several degenerate primers constitutes a robust method, which allows the simultaneous characterization of the diversity of different fungal transcripts involved in plant organic matter degradation and may lead to the

  3. Reverse taxonomy: an approach towards determining the diversity of meiobenthic organisms based on ribosomal RNA signature sequences

    PubMed Central

    Markmann, Melanie; Tautz, Diethard

    2005-01-01

    Organisms living in or on the sediment layer of water bodies constitute the benthos fauna, which is known to harbour a large number of species of diverse taxonomic groups. The benthos plays a significant role in the nutrient cycle and it is, therefore, of high ecological relevance. Here, we have explored a DNA-taxonomic approach to access the meiobenthic organismic diversity, by focusing on obtaining signature sequences from a part of the large ribosomal subunit rRNA (28S), the D3–D5 region. To obtain a broad representation of taxa, benthos samples were taken from 12 lakes in Germany, representing different ecological conditions. In a first approach, we have extracted whole DNA from these samples, amplified the respective fragment by PCR, cloned the fragments and sequenced individual clones. However, we found a relatively large number of recombinant clones that must be considered PCR artefacts. In a second approach we have, therefore, directly sequenced PCR fragments that were obtained from DNA extracts of randomly picked individual organisms. In total, we have obtained 264 new unique sequences, which can be readily placed into taxon groups, based on phylogenetic comparison with currently available database sequences. The group with the highest taxon abundance were nematodes and protozoa, followed by chironomids. However, we find also that we have by far not exhausted the diversity of organisms in the samples. Still, our data provide a framework within which a meiobenthos DNA signature sequence database can be constructed, that will allow to develop the necessary techniques for studying taxon diversity in the context of ecological analysis. Since many taxa in our analysis are initially only identified via their signature sequences, but not yet their morphology, we propose to call this approach ‘reverse taxonomy’. PMID:16214749

  4. PCR primers to study the diversity of expressed fungal genes encoding lignocellulolytic enzymes in soils using high-throughput sequencing.

    PubMed

    Barbi, Florian; Bragalini, Claudia; Vallon, Laurent; Prudent, Elsa; Dubost, Audrey; Fraissinet-Tachet, Laurence; Marmeisse, Roland; Luis, Patricia

    2014-01-01

    Plant biomass degradation in soil is one of the key steps of carbon cycling in terrestrial ecosystems. Fungal saprotrophic communities play an essential role in this process by producing hydrolytic enzymes active on the main components of plant organic matter. Open questions in this field regard the diversity of the species involved, the major biochemical pathways implicated and how these are affected by external factors such as litter quality or climate changes. This can be tackled by environmental genomic approaches involving the systematic sequencing of key enzyme-coding gene families using soil-extracted RNA as material. Such an approach necessitates the design and evaluation of gene family-specific PCR primers producing sequence fragments compatible with high-throughput sequencing approaches. In the present study, we developed and evaluated PCR primers for the specific amplification of fungal CAZy Glycoside Hydrolase gene families GH5 (subfamily 5) and GH11 encoding endo-β-1,4-glucanases and endo-β-1,4-xylanases respectively as well as Basidiomycota class II peroxidases, corresponding to the CAZy Auxiliary Activity family 2 (AA2), active on lignin. These primers were experimentally validated using DNA extracted from a wide range of Ascomycota and Basidiomycota species including 27 with sequenced genomes. Along with the published primers for Glycoside Hydrolase GH7 encoding enzymes active on cellulose, the newly design primers were shown to be compatible with the Illumina MiSeq sequencing technology. Sequences obtained from RNA extracted from beech or spruce forest soils showed a high diversity and were uniformly distributed in gene trees featuring the global diversity of these gene families. This high-throughput sequencing approach using several degenerate primers constitutes a robust method, which allows the simultaneous characterization of the diversity of different fungal transcripts involved in plant organic matter degradation and may lead to the

  5. Accuracy of sequence alignment and fold assessment using reduced amino acid alphabets.

    PubMed

    Melo, Francisco; Marti-Renom, Marc A

    2006-06-01

    Reduced or simplified amino acid alphabets group the 20 naturally occurring amino acids into a smaller number of representative protein residues. To date, several reduced amino acid alphabets have been proposed, which have been derived and optimized by a variety of methods. The resulting reduced amino acid alphabets have been applied to pattern recognition, generation of consensus sequences from multiple alignments, protein folding, and protein structure prediction. In this work, amino acid substitution matrices and statistical potentials were derived based on several reduced amino acid alphabets and their performance assessed in a large benchmark for the tasks of sequence alignment and fold assessment of protein structure models, using as a reference frame the standard alphabet of 20 amino acids. The results showed that a large reduction in the total number of residue types does not necessarily translate into a significant loss of discriminative power for sequence alignment and fold assessment. Therefore, some definitions of a few residue types are able to encode most of the relevant sequence/structure information that is present in the 20 standard amino acids. Based on these results, we suggest that the use of reduced amino acid alphabets may allow to increasing the accuracy of current substitution matrices and statistical potentials for the prediction of protein structure of remote homologs. PMID:16506243

  6. Diversity of the causal genes in hearing impaired Algerian individuals identified by whole exome sequencing.

    PubMed

    Ammar-Khodja, Fatima; Bonnet, Crystel; Dahmani, Malika; Ouhab, Sofiane; Lefèvre, Gaelle M; Ibrahim, Hassina; Hardelin, Jean-Pierre; Weil, Dominique; Louha, Malek; Petit, Christine

    2015-05-01

    The genetic heterogeneity of congenital hearing disorders makes molecular diagnosis expensive and time-consuming using conventional techniques such as Sanger sequencing of DNA. In order to design an appropriate strategy of molecular diagnosis in the Algerian population, we explored the diversity of the involved mutations by studying 65 families affected by autosomal recessive forms of nonsyndromic hearing impairment (DFNB forms), which are the most prevalent early onset forms. We first carried out a systematic screening for mutations in GJB2 and the recurrent p.(Arg34*) mutation in TMC1, which were found in 31 (47.7%) families and 1 (1.5%) family, respectively. We then performed whole exome sequencing in nine of the remaining families, and identified the causative mutations in all the patients analyzed, either in the homozygous state (eight families) or in the compound heterozygous state (one family): (c.709C>T: p.(Arg237*)) and (c.2122C>T: p.(Arg708*)) in OTOF, (c.1334T>G: p.(Leu445Trp)) in SLC26A4, (c.764T>A: p.(Met255Lys)) in GIPC3, (c.518T>A: p.(Cys173Ser)) in LHFPL5, (c.5336T>C: p.(Leu1779Pro)) in MYO15A, (c.1807G>T: p.(Val603Phe)) in OTOA, (c.6080dup: p.(Asn2027Lys*9)) in PTPRQ, and (c.6017del: p.(Gly2006Alafs*13); c.7188_7189ins14: p.(Val2397Leufs*2)) in GPR98. Notably, 7 of these 10 mutations affecting 8 different genes had not been reported previously. These results highlight for the first time the genetic heterogeneity of the early onset forms of nonsyndromic deafness in Algerian families. PMID:26029705

  7. Diversity of the causal genes in hearing impaired Algerian individuals identified by whole exome sequencing

    PubMed Central

    Ammar-Khodja, Fatima; Bonnet, Crystel; Dahmani, Malika; Ouhab, Sofiane; Lefèvre, Gaelle M; Ibrahim, Hassina; Hardelin, Jean-Pierre; Weil, Dominique; Louha, Malek; Petit, Christine

    2015-01-01

    The genetic heterogeneity of congenital hearing disorders makes molecular diagnosis expensive and time-consuming using conventional techniques such as Sanger sequencing of DNA. In order to design an appropriate strategy of molecular diagnosis in the Algerian population, we explored the diversity of the involved mutations by studying 65 families affected by autosomal recessive forms of nonsyndromic hearing impairment (DFNB forms), which are the most prevalent early onset forms. We first carried out a systematic screening for mutations in GJB2 and the recurrent p.(Arg34*) mutation in TMC1, which were found in 31 (47.7%) families and 1 (1.5%) family, respectively. We then performed whole exome sequencing in nine of the remaining families, and identified the causative mutations in all the patients analyzed, either in the homozygous state (eight families) or in the compound heterozygous state (one family): (c.709C>T: p.(Arg237*)) and (c.2122C>T: p.(Arg708*)) in OTOF, (c.1334T>G: p.(Leu445Trp)) in SLC26A4, (c.764T>A: p.(Met255Lys)) in GIPC3, (c.518T>A: p.(Cys173Ser)) in LHFPL5, (c.5336T>C: p.(Leu1779Pro)) in MYO15A, (c.1807G>T: p.(Val603Phe)) in OTOA, (c.6080dup: p.(Asn2027Lys*9)) in PTPRQ, and (c.6017del: p.(Gly2006Alafs*13); c.7188_7189ins14: p.(Val2397Leufs*2)) in GPR98. Notably, 7 of these 10 mutations affecting 8 different genes had not been reported previously. These results highlight for the first time the genetic heterogeneity of the early onset forms of nonsyndromic deafness in Algerian families. PMID:26029705

  8. Microbial Diversity in Deep-sea Methane Seep Sediments Presented by SSU rRNA Gene Tag Sequencing

    PubMed Central

    Nunoura, Takuro; Takaki, Yoshihiro; Kazama, Hiromi; Hirai, Miho; Ashi, Juichiro; Imachi, Hiroyuki; Takai, Ken

    2012-01-01

    Microbial community structures in methane seep sediments in the Nankai Trough were analyzed by tag-sequencing analysis for the small subunit (SSU) rRNA gene using a newly developed primer set. The dominant members of Archaea were Deep-sea Hydrothermal Vent Euryarchaeotic Group 6 (DHVEG 6), Marine Group I (MGI) and Deep Sea Archaeal Group (DSAG), and those in Bacteria were Alpha-, Gamma-, Delta- and Epsilonproteobacteria, Chloroflexi, Bacteroidetes, Planctomycetes and Acidobacteria. Diversity and richness were examined by 8,709 and 7,690 tag-sequences from sediments at 5 and 25 cm below the seafloor (cmbsf), respectively. The estimated diversity and richness in the methane seep sediment are as high as those in soil and deep-sea hydrothermal environments, although the tag-sequences obtained in this study were not sufficient to show whole microbial diversity in this analysis. We also compared the diversity and richness of each taxon/division between the sediments from the two depths, and found that the diversity and richness of some taxa/divisions varied significantly along with the depth. PMID:22510646

  9. Characterization of mouse cellular deoxyribonucleic acid homologous to Abelson murine leukemia virus-specific sequences.

    PubMed Central

    Dale, B; Ozanne, B

    1981-01-01

    The genome of Abelson murine leukemia virus (A-MuLV) consists of sequences derived from both BALB/c mouse deoxyribonucleic acid and the genome of Moloney murine leukemia virus. Using deoxyribonucleic acid linear intermediates as a source of retroviral deoxyribonucleic acid, we isolated a recombinant plasmid which contained 1.9 kilobases of the 3.5-kilobase mouse-derived sequences found in A-MuLV (A-MuLV-specific sequences). We used this clone, designated pSA-17, as a probe restriction enzyme and Southern blot analyses to examine the arrangement of homologous sequences in BALB/c deoxyribonucleic acid (endogenous Abelson sequences). The endogenous Abelson sequences within the mouse genome were interrupted by noncoding regions, suggesting that a rearrangement of the cell sequences was required to produce the sequence found in the virus. Endogenous Abelson sequences were arranged similarly in mice that were susceptible to A-MuLV tumors and in mice that were resistant to A-MuLV tumors. An examination of three BALB/c plasmacytomas and a BALB/c early B-cell tumor likewise revealed no alteration in the arrangement of the endogenous Abelson sequences. Homology to pSA-17 was also observed in deoxyribonucleic acids prepared from rat, hamster, chicken, and human cells. An isolate of A-MuLV which encoded a 160,000-dalton transforming protein (P160) contained 700 more base pairs of mouse sequences than the standard A-MuLV isolate, which encoded a 120,000-dalton transforming protein (P120). Images PMID:9279386

  10. Evaluation of genetic diversity and pedigree within crapemyrtle (Lagerstroemia spp.) cultivars using simple sequence repeat (SSR) markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genetic diversity was estimated for 93 crapemyrtle (Lagerstroemia spp.) cultivars (51 L. indica cultivars, 5 L. fauriei cultivars, and 37 interspecific hybrids) using 78 simple sequence repeat (SSR) markers. SSR loci were highly variable among the cultivars, detecting an average of 6.6 alleles per l...

  11. Exploring the diversity of Arcobacter spp. in cattle in the UK using MLST and whole genome sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Arcobacter butzleri is considered to be an emerging human foodborne pathogen. The completion of an A. butzleri genome sequence along with microarray analysis of 13 isolates in 2007 revealed a surprising amount of diversity amongst A. butzleri isolates from humans, animals and food. In order to furth...

  12. Sequence-Related Amplified Polymorphism (SRAP) markers for assessing interrelationships and genetic diversity among members of the Saccharum complex

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Characterization of wild germplasm provides essential information on genetic diversity that breeders utilize for crop improvement. The potential of the sequence-related amplified polymorphism (SRAP) technique, which preferentially amplifies gene-rich regions, was evaluated to assess the genetic rela...

  13. Bacterial diversity assessment of pristine mangrove microbial community from Dhulibhashani, Sundarbans using 16S rRNA gene tag sequencing.

    PubMed

    Basak, Pijush; Pramanik, Arnab; Sengupta, Sohan; Nag, Sudip; Bhattacharyya, Anish; Roy, Debojyoti; Pattanayak, Rudradip; Ghosh, Abhrajyoti; Chattopadhyay, Dhrubajyoti; Bhattacharyya, Maitree

    2016-03-01

    The global knowledge of microbial diversity and function in Sundarbans ecosystem is still scarce, despite global advancement in understanding the microbial diversity. In the present study, we have analyzed the diversity and distribution of bacteria in the tropical mangrove sediments of Sundarbans using 16S rRNA gene amplicon sequencing. Metagenome is comprised of 1,53,926 sequences with 108.8 Mbp data and with 55 ± 2% G + C content. Metagenome sequence data are available at NCBI under the Bioproject database with accession no. PRJNA245459. Bacterial community metagenome sequences were analyzed by MG-RAST software representing the presence of 56,547 species belonging to 44 different phyla. The taxonomic analysis revealed the dominance of phyla Proteobacteria within our dataset. Further taxonomic analysis revealed abundance of Bacteroidetes, Acidobactreia, Firmicutes, Actinobacteria, Nitrospirae, Cyanobacteria, Planctomycetes and Fusobacteria group as the predominant bacterial assemblages in this largely pristine mangrove habitat. The distribution of different community datasets obtained from four sediment samples originated from one sampling station at two different depths providing better understanding of the sediment bacterial diversity and its relationship to the ecosystem dynamics of this pristine mangrove sediment of Dhulibhashani in, Sundarbans. PMID:26981367

  14. Palynological composition of a Lower Cretaceous South American tropical sequence: Climatic implications and diversity comparisons with other latitudes.

    USGS Publications Warehouse

    Mejia-Velasquez, Paula J.; Dilcher, David L.; Jaramillo, Carlos A.; Fortini, Lucas B.; Manchester, Steven R.

    2012-01-01

    Premise of the study: Reconstruction of floristic patterns during the early diversification of angiosperms is impeded by the scarce fossil record, especially in tropical latitudes. Here we collected quantitative palynological data from a stratigraphic sequence in tropical South America to provide floristic and climatic insights into such tropical environments during the Early Cretaceous. Methods: We reconstructed the floristic composition of an Aptian-Albian tropical sequence from central Colombia using quantitative palynology (rarefied species richness and abundance) and used it to infer its predominant climatic conditions. Additionally, we compared our results with available quantitative data from three other sequences encompassing 70 floristic assemblages to determine latitudinal diversity patterns. Key results: Abundance of humidity indicators was higher than that of aridity indicators (61% vs. 10%). Additionally, we found an angiosperm latitudinal diversity gradient (LDG) for the Aptian, but not for the Albian, and an inverted LDG of the overall diversity for the Albian. Angiosperm species turnover during the Albian, however, was higher in humid tropics. Conclusions: There were humid climates in northwestern South America during the Aptian-Albian interval contrary to the widespread aridity expected for the tropical belt. The Albian inverted overall LDG is produced by a faster increase in per-sample angiosperm and pteridophyte diversity in temperate latitudes. However, humid tropical sequences had higher rates of floristic turnover suggesting a higher degree of morphological variation than in temperate regions.

  15. Bacterial diversity assessment of pristine mangrove microbial community from Dhulibhashani, Sundarbans using 16S rRNA gene tag sequencing

    PubMed Central

    Basak, Pijush; Pramanik, Arnab; Sengupta, Sohan; Nag, Sudip; Bhattacharyya, Anish; Roy, Debojyoti; Pattanayak, Rudradip; Ghosh, Abhrajyoti; Chattopadhyay, Dhrubajyoti; Bhattacharyya, Maitree

    2015-01-01

    The global knowledge of microbial diversity and function in Sundarbans ecosystem is still scarce, despite global advancement in understanding the microbial diversity. In the present study, we have analyzed the diversity and distribution of bacteria in the tropical mangrove sediments of Sundarbans using 16S rRNA gene amplicon sequencing. Metagenome is comprised of 1,53,926 sequences with 108.8 Mbp data and with 55 ± 2% G + C content. Metagenome sequence data are available at NCBI under the Bioproject database with accession no. PRJNA245459. Bacterial community metagenome sequences were analyzed by MG-RAST software representing the presence of 56,547 species belonging to 44 different phyla. The taxonomic analysis revealed the dominance of phyla Proteobacteria within our dataset. Further taxonomic analysis revealed abundance of Bacteroidetes, Acidobactreia, Firmicutes, Actinobacteria, Nitrospirae, Cyanobacteria, Planctomycetes and Fusobacteria group as the predominant bacterial assemblages in this largely pristine mangrove habitat. The distribution of different community datasets obtained from four sediment samples originated from one sampling station at two different depths providing better understanding of the sediment bacterial diversity and its relationship to the ecosystem dynamics of this pristine mangrove sediment of Dhulibhashani in, Sundarbans. PMID:26981367

  16. The amino acid sequence of monal pheasant lysozyme and its activity.

    PubMed

    Araki, T; Matsumoto, T; Torikata, T

    1998-10-01

    The amino acid sequence of monal pheasant lysozyme and its activity were analyzed. Carboxymethylated lysozyme was digested with trypsin and the resulting peptides were sequenced. The established amino acid sequence had one amino acid substitution at position 102 (Arg to Gly) comparing with Indian peafowl lysozyme and four amino acid substitutions at positions 3 (Phe to Tyr), 15 (His to Leu), 41 (Gln to His), and 121 (Gln to His) with chicken lysozyme. Analysis of the time-courses of reaction using N-acetylglucosamine pentamer as a substrate showed a difference of binding free energy change (-0.4 kcal/mol) at subsites A between monal pheasant and Indian peafowl lysozyme. This was assumed to be caused by the amino acid substitution at subsite A with loss of a positive charge at position 102 (Arg102 to Gly). PMID:9836434

  17. MALDI-TOF Mass Spectrometry for Multilocus Sequence Typing of Escherichia coli Reveals Diversity among Isolates Carrying blaCMY₋₂-Like Genes.

    PubMed

    Tagg, Kaitlin A; Ginn, Andrew N; Partridge, Sally R; Iredell, Jonathan R

    2015-01-01

    Effective surveillance and management of pathogenic Escherichia coli relies on robust and reproducible typing methods such as multilocus sequence typing (MLST). Typing of E. coli by MLST enables tracking of pathogenic clones that are known to carry virulence factors or spread resistance, such as the globally-prevalent ST131 lineage. Standard MLST for E. coli requires sequencing of seven alleles, or a whole genome, and can take several days. Here, we have developed and validated a nucleic-acid-based MALDI-TOF mass spectrometry (MS) method for MLST as a rapid alternative to sequencing that requires minimal operator expertise. Identification of alleles was 99.6% concordant with sequencing. We employed MLST by MALDI-TOF MS to investigate diversity among 62 E. coli isolates from Sydney, Australia, carrying a blaCMY-2-like gene on an IncI1 plasmid to determine whether any dominant clonal lineages are associated with the spread of this globally-disseminated resistance gene. Thirty-four known sequence types were identified, including lineages associated with human disease, animal and environmental sources. This suggests that the dissemination of blaCMY-2-like-genes is more complex than the simple spread of successful pathogenic clones. E. coli MLST by MALDI-TOF MS, employed here for the first time, can be utilised as an automated tool for large-scale population analyses or for targeted screening for known high-risk clones in a diagnostic setting. PMID:26588228

  18. MALDI-TOF Mass Spectrometry for Multilocus Sequence Typing of Escherichia coli Reveals Diversity among Isolates Carrying blaCMY-2-Like Genes

    PubMed Central

    Tagg, Kaitlin A.; Ginn, Andrew N.; Partridge, Sally R.; Iredell, Jonathan R.

    2015-01-01

    Effective surveillance and management of pathogenic Escherichia coli relies on robust and reproducible typing methods such as multilocus sequence typing (MLST). Typing of E. coli by MLST enables tracking of pathogenic clones that are known to carry virulence factors or spread resistance, such as the globally-prevalent ST131 lineage. Standard MLST for E. coli requires sequencing of seven alleles, or a whole genome, and can take several days. Here, we have developed and validated a nucleic-acid-based MALDI-TOF mass spectrometry (MS) method for MLST as a rapid alternative to sequencing that requires minimal operator expertise. Identification of alleles was 99.6% concordant with sequencing. We employed MLST by MALDI-TOF MS to investigate diversity among 62 E. coli isolates from Sydney, Australia, carrying a blaCMY-2-like gene on an IncI1 plasmid to determine whether any dominant clonal lineages are associated with the spread of this globally-disseminated resistance gene. Thirty-four known sequence types were identified, including lineages associated with human disease, animal and environmental sources. This suggests that the dissemination of blaCMY-2-like-genes is more complex than the simple spread of successful pathogenic clones. E. coli MLST by MALDI-TOF MS, employed here for the first time, can be utilised as an automated tool for large-scale population analyses or for targeted screening for known high-risk clones in a diagnostic setting. PMID:26588228

  19. Studies on monotreme proteins. VII. Amino acid sequence of myoglobin from the platypus, Ornithoryhynchus anatinus.

    PubMed

    Fisher, W K; Thompson, E O

    1976-03-01

    Myoglobin isolated from skeletal muscle of the platypus contains 153 amino acid residues. The complete amino acid sequence has been determined following cleavage with cyanogen bromide and further digestion of the four fragments with trypsin, chymotrypsin, pepsin and thermolysin. Sequences of the purified peptides were determined by the dansyl-Edman procedure. The amino acid sequence showed 25 differences from human myoglobin and 24 from kangaroo myoglobin. Amino acid sequences in myoglobins are more conserved than sequences in the alpha- and beta-globin chains, and platypus myoglobin shows a similar number of variations in sequence to kangaroo myoglobin when compared with myoglobin of other species. The date of divergence of the platypus from other mammals was estimated at 102 +/- 31 million years, based on the number of amino acid differences between species and allowing for mutations during the evolutionary period. This estimate differs widely from the estimate given by similar treatment of the alpha- and beta-chain sequences and a constant rate of mutation of globin chains is not supported. PMID:962722

  20. cDNA-derived amino acid sequences of myoglobins from nine species of whales and dolphins.

    PubMed

    Iwanami, Kentaro; Mita, Hajime; Yamamoto, Yasuhiko; Fujise, Yoshihiro; Yamada, Tadasu; Suzuki, Tomohiko

    2006-10-01

    We determined the myoglobin (Mb) cDNA sequences of nine cetaceans, of which six are the first reports of Mb sequences: sei whale (Balaenoptera borealis), Bryde's whale (Balaenoptera edeni), pygmy sperm whale (Kogia breviceps), Stejneger's beaked whale (Mesoplodon stejnegeri), Longman's beaked whale (Indopacetus pacificus), and melon-headed whale (Peponocephala electra), and three confirm the previously determined chemical amino acid sequences: sperm whale (Physeter macrocephalus), common minke whale (Balaenoptera acutorostrata) and pantropical spotted dolphin (Stenella attenuata). We found two types of Mb in the skeletal muscle of pantropical spotted dolphin: Mb I with the same amino acid sequence as that deposited in the protein database, and Mb II, which differs at two amino acid residues compared with Mb I. Using an alignment of the amino acid or cDNA sequences of cetacean Mb, we constructed a phylogenetic tree by the NJ method. Clustering of cetacean Mb amino acid and cDNA sequences essentially follows the classical taxonomy of cetaceans, suggesting that Mb sequence data is valid for classification of cetaceans at least to the family level. PMID:16962803

  1. Antigenic and sequence diversity at the C-terminus of the merozoite surface protein-1 from rodent malaria isolates, and the binding of protective monoclonal antibodies.

    PubMed

    Benjamin, P A; Ling, I T; Clottey, G; Valero, L M; Ogun, S A; Fleck, S L; Walliker, D; Morgan, W D; Birdsall, B; Feeney, J; Holder, A A

    1999-11-30

    Merozoite surface protein-1 (MSP-1) is a major candidate in the development of a vaccine against malaria. Immunisation with a recombinant fusion protein containing the two Plasmodium yoelii MSP-1 C-terminal epidermal growth factor-like domains (MSP-1(19)) can protect mice against homologous but not heterologous challenge, and therefore, antigenic differences resulting from sequence diversity in MSP-1(19) may be crucial in determining the potential of this protein as a vaccine. Representative sequence variants from a number of distinct P. yoelii isolates were expressed in Escherichia coli and the resulting recombinant proteins were screened for binding to a panel of monoclonal antibodies (Mabs) capable of suppressing a P. yoelii YM challenge infection in passive immunisation experiments. The sequence polymorphisms affected the binding of the antibodies to the recombinant proteins. None of the Mabs recognised MSP-1(19) of P. yoelii yoelii 2CL or 33X or P. yoelii nigeriensis N67. The epitopes recognised by the Mabs were further distinguished by their reactivity with the other fusion proteins. The extent of sequence variation in MSP-1(19) among the isolates was extensive, with differences detected at 35 out of the 96 positions compared. Using the 3-dimensional structure of the Plasmodium falciparum MSP-1(19) as a model, the locations of the amino acid substitutions that may affect Mab binding were identified. The DNA sequence of MSP-1(19) from two Plasmodium vinckei isolates was also cloned and the deduced amino acid sequence compared with that in other species. PMID:10593171

  2. Intermediary Metabolism in Protists: a Sequence-based View of Facultative Anaerobic Metabolism in Evolutionarily Diverse Eukaryotes

    PubMed Central

    Ginger, Michael L.; Fritz-Laylin, Lillian K.; Fulton, Chandler; Cande, W. Zacheus; Dawson, Scott C.

    2011-01-01

    Protists account for the bulk of eukaryotic diversity. Through studies of gene and especially genome sequences the molecular basis for this diversity can be determined. Evident from genome sequencing are examples of versatile metabolism that go far beyond the canonical pathways described for eukaryotes in textbooks. In the last 2–3 years, genome sequencing and transcript profiling has unveiled several examples of heterotrophic and phototrophic protists that are unexpectedly well-equipped for ATP production using a facultative anaerobic metabolism, including some protists that can (Chlamydomonas reinhardtii) or are predicted (Naegleria gruberi, Acanthamoeba castellanii, Amoebidium parasiticum) to produce H2 in their metabolism. It is possible that some enzymes of anaerobic metabolism were acquired and distributed among eukaryotes by lateral transfer, but it is also likely that the common ancestor of eukaryotes already had far more metabolic versatility than was widely thought a few years ago. The discussion of core energy metabolism in unicellular eukaryotes is the subject of this review. Since genomic sequencing has so far only touched the surface of protist diversity, it is anticipated that sequences of additional protists may reveal an even wider range of metabolic capabilities, while simultaneously enriching our understanding of the early evolution of eukaryotes. PMID:21036663

  3. Inheritance and diversity of simple sequence repeat (SSR) microsatellite markers in various families of Picea abies.

    PubMed

    Yazdani, Reza; Scotti, Ivan; Jansson, Gunnar; Plomion, Christophe; Mathur, Gaurav

    2003-01-01

    A large number of sequence-specific SSRs were screened by using electrophoresis on metaphore agarose gels with the bands visualized by ethidium bromide staining. Many SSRs appeared as codominant and many as dominant markers, with presence or absence of bands. A simple Mendelian inheritance pattern for most codominant and dominant SSR loci was found. For many codominant SSR markers, null alleles were detected. The proportion of dominant microsatellites detected in this study (close to 50 %) was much higher than that commonly reported in many other studies. A high proportion of dominant markers together with a high frequency of codominant markers with null alleles may represent two important limitations for the use of microsatellites in different studies. On the other hand, many polymorphic codominant SSR microsatellite markers were found to be highly repeatable, and can be used for population studies, seed certification, quality control of controlled crosses, paternity analysis, pollen contamination, and mapping of QTL in related families. In this paper, we report on the inheritance pattern and diversity of codominant and dominant SSR microsatellites in seven families of Picea abies sharing a common mother. PMID:14641487

  4. Whole-exome sequencing of pancreatic cancer defines genetic diversity and therapeutic targets

    PubMed Central

    Witkiewicz, Agnieszka K.; McMillan, Elizabeth A.; Balaji, Uthra; Baek, GuemHee; Lin, Wan-Chi; Mansour, John; Mollaee, Mehri; Wagner, Kay-Uwe; Koduru, Prasad; Yopp, Adam; Choti, Michael A.; Yeo, Charles J.; McCue, Peter; White, Michael A.; Knudsen, Erik S.

    2015-01-01

    Pancreatic ductal adenocarcinoma (PDA) has a dismal prognosis and insights into both disease etiology and targeted intervention are needed. A total of 109 micro-dissected PDA cases were subjected to whole-exome sequencing. Microdissection enriches tumour cellularity and enhances mutation calling. Here we show that environmental stress and alterations in DNA repair genes associate with distinct mutation spectra. Copy number alterations target multiple tumour suppressive/oncogenic loci; however, amplification of MYC is uniquely associated with poor outcome and adenosquamous subtype. We identify multiple novel mutated genes in PDA, with select genes harbouring prognostic significance. RBM10 mutations associate with longer survival in spite of histological features of aggressive disease. KRAS mutations are observed in >90% of cases, but codon Q61 alleles are selectively associated with improved survival. Oncogenic BRAF mutations are mutually exclusive with KRAS and define sensitivity to vemurafenib in PDA models. High-frequency alterations in Wnt signalling, chromatin remodelling, Hedgehog signalling, DNA repair and cell cycle processes are observed. Together, these data delineate new genetic diversity of PDA and provide insights into prognostic determinants and therapeutic targets. PMID:25855536

  5. The expanded diversity of methylophilaceae from Lake Washington through cultivation and genomic sequencing of novel ecotypes.

    PubMed

    Beck, David A C; McTaggart, Tami L; Setboonsarng, Usanisa; Vorobev, Alexey; Kalyuzhnaya, Marina G; Ivanova, Natalia; Goodwin, Lynne; Woyke, Tanja; Lidstrom, Mary E; Chistoserdova, Ludmila

    2014-01-01

    We describe five novel Methylophilaceae ecotypes from a single ecological niche in Lake Washington, USA, and compare them to three previously described ecotypes, in terms of their phenotype and genome sequence divergence. Two of the ecotypes appear to represent novel genera within the Methylophilaceae. Genome-based metabolic reconstruction highlights metabolic versatility of Methylophilaceae with respect to methylotrophy and nitrogen metabolism, different ecotypes possessing different combinations of primary substrate oxidation systems (MxaFI-type methanol dehydrogenase versus XoxF-type methanol dehydrogenase; methylamine dehydrogenase versus N-methylglutamate pathway) and different potentials for denitrification (assimilatory versus respiratory nitrate reduction). By comparing pairs of closely related genomes, we uncover that site-specific recombination is the main means of genomic evolution and strain divergence, including lateral transfers of genes from both closely- and distantly related taxa. The new ecotypes and the new genomes contribute significantly to our understanding of the extent of genomic and metabolic diversity among organisms of the same family inhabiting the same ecological niche. These organisms also provide novel experimental models for studying the complexity and the function of the microbial communities active in methylotrophy. PMID:25058595

  6. The Expanded Diversity of Methylophilaceae from Lake Washington through Cultivation and Genomic Sequencing of Novel Ecotypes

    PubMed Central

    Beck, David A. C.; McTaggart, Tami L.; Setboonsarng, Usanisa; Vorobev, Alexey; Kalyuzhnaya, Marina G.; Ivanova, Natalia; Goodwin, Lynne; Woyke, Tanja; Lidstrom, Mary E.; Chistoserdova, Ludmila

    2014-01-01

    We describe five novel Methylophilaceae ecotypes from a single ecological niche in Lake Washington, USA, and compare them to three previously described ecotypes, in terms of their phenotype and genome sequence divergence. Two of the ecotypes appear to represent novel genera within the Methylophilaceae. Genome-based metabolic reconstruction highlights metabolic versatility of Methylophilaceae with respect to methylotrophy and nitrogen metabolism, different ecotypes possessing different combinations of primary substrate oxidation systems (MxaFI-type methanol dehydrogenase versus XoxF-type methanol dehydrogenase; methylamine dehydrogenase versus N-methylglutamate pathway) and different potentials for denitrification (assimilatory versus respiratory nitrate reduction). By comparing pairs of closely related genomes, we uncover that site-specific recombination is the main means of genomic evolution and strain divergence, including lateral transfers of genes from both closely- and distantly related taxa. The new ecotypes and the new genomes contribute significantly to our understanding of the extent of genomic and metabolic diversity among organisms of the same family inhabiting the same ecological niche. These organisms also provide novel experimental models for studying the complexity and the function of the microbial communities active in methylotrophy. PMID:25058595

  7. Evolutionary Analysis of Sequence Divergence and Diversity of Duplicate Genes in Aspergillus fumigatus

    PubMed Central

    Yang, Ence; Hulse, Amanda M.; Cai, James J.

    2012-01-01

    Gene duplication as a major source of novel genetic material plays an important role in evolution. In this study, we focus on duplicate genes in Aspergillus fumigatus, a ubiquitous filamentous fungus causing life-threatening human infections. We characterize the extent and evolutionary patterns of the duplicate genes in the genome of A. fumigatus. Our results show that A. fumigatus contains a large amount of duplicate genes with pronounced sequence divergence between two copies, and approximately 10% of them diverge asymmetrically, i.e. two copies of a duplicate gene pair diverge at significantly different rates. We use a Bayesian approach of the McDonald-Kreitman test to infer distributions of selective coefficients γ(=2Nes) and find that (1) the values of γ for two copies of duplicate genes co-vary positively and (2) the average γ for the two copies differs between genes from different gene families. This analysis highlights the usefulness of combining divergence and diversity data in studying the evolution of duplicate genes. Taken together, our results provide further support and refinement to the theories of gene duplication. Through characterizing the duplicate genes in the genome of A. fumigatus, we establish a computational framework, including parameter settings and methods, for comparative study of genetic redundancy and gene duplication between different fungal species. PMID:23225993

  8. Genetic Diversity in Passiflora Species Assessed by Morphological and ITS Sequence Analysis

    PubMed Central

    Ramaiya, Shiamala Devi; Bujang, Japar Sidik; Zakaria, Muta Harah

    2014-01-01

    This study used morphological characterization and phylogenetic analysis of the internal transcribed spacer (ITS) region of nuclear ribosomal DNA to investigate the phylogeny of Passiflora species. The samples were collected from various regions of East Malaysia, and discriminant function analysis based on linear combinations of morphological variables was used to classify the Passiflora species. The biplots generated five distinct groups discriminated by morphological variables. The group consisted of cultivars of P. edulis with high levels of genetic similarity; in contrast, P. foetida was highly divergent from other species in the morphological biplots. The final dataset of aligned sequences from nine studied Passiflora accessions and 30 other individuals obtained from GenBank database (NCBI) yielded one most parsimonious tree with two strongly supported clades. Maximum parsimony (MP) tree showed the phylogenetic relationships within this subgenus Passiflora support the classification at the series level. The constructed phylogenic tree also confirmed the divergence of P. foetida from all other species and the closeness of wild and cultivated species. The phylogenetic relationships were consistent with results of morphological assessments. The results of this study indicate that ITS region analysis represents a useful tool for evaluating genetic diversity in Passiflora at the species level. PMID:25050402

  9. ShoRAH: estimating the genetic diversity of a mixed sample from next-generation sequencing data

    PubMed Central

    2011-01-01

    Background With next-generation sequencing technologies, experiments that were considered prohibitive only a few years ago are now possible. However, while these technologies have the ability to produce enormous volumes of data, the sequence reads are prone to error. This poses fundamental hurdles when genetic diversity is investigated. Results We developed ShoRAH, a computational method for quantifying genetic diversity in a mixed sample and for identifying the individual clones in the population, while accounting for sequencing errors. The software was run on simulated data and on real data obtained in wet lab experiments to assess its reliability. Conclusions ShoRAH is implemented in C++, Python, and Perl and has been tested under Linux and Mac OS X. Source code is available under the GNU General Public License at http://www.cbg.ethz.ch/software/shorah. PMID:21521499

  10. Multilocus Sequence Analysis for the Assessment of Phylogenetic Diversity and Biogeography in Hyphomonas Bacteria from Diverse Marine Environments

    PubMed Central

    Li, Guizhen; Liu, Yang; Sun, Fengqin; Shao, Zongze

    2014-01-01

    Hyphomonas, a genus of budding, prosthecate bacteria, are primarily found in the marine environment. Seven type strains, and 35 strains from our collections of Hyphomonas, isolated from the Pacific Ocean, Atlantic Ocean, Arctic Ocean, South China Sea and the Baltic Sea, were investigated in this study using multilocus sequence analysis (MLSA). The phylogenetic structure of these bacteria was evaluated using the 16S rRNA gene, and five housekeeping genes (leuA, clpA, pyrH, gatA and rpoD) as well as their concatenated sequences. Our results showed that each housekeeping gene and the concatenated gene sequence all yield a higher taxonomic resolution than the 16S rRNA gene. The 42 strains assorted into 12 groups. Each group represents an independent species, which was confirmed by virtual DNA-DNA hybridization (DDH) estimated from draft genome sequences. Hyphomonas MLSA interspecies and intraspecies boundaries ranged from 93.3% to 96.3%, similarity calculated using a combined DDH and MLSA approach. Furthermore, six novel species (groups I, II, III, IV, V and XII) of the genus Hyphomonas exist, based on sequence similarities of the MLSA and DDH values. Additionally, we propose that the leuA gene (93.0% sequence similarity across our dataset) alone could be used as a fast and practical means for identifying species within Hyphomonas. Finally, Hyphomonas' geographic distribution shows that strains from the same area tend to cluster together as discrete species. This study provides a framework for the discrimination and phylogenetic analysis of the genus Hyphomonas for the first time, and will contribute to a more thorough understanding of the biological and ecological roles of this genus. PMID:25019154

  11. Function and evolutionary diversity of fatty acid amino acid conjugates (FACs)in Lepidopteran caterpillars

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Fatty acid amino acid conjugates (FACs) in regurgitant of larval Spodoptera exigua1 were initially identified as plant volatile elicitors and research has been focused on this apparent ecological disadvantage rather than on possible benefit for the caterpillar itself. Recently, we demonstrated that...

  12. Correlation between microbial diversity and toxicity of sludge treating synthetic wastewater containing 4-chlorophenol in sequencing batch reactors.

    PubMed

    Zhao, Jianguo; Chen, Xiurong; Bao, Linlin; Bao, Zheng; He, Yixuan; Zhang, Yuying; Li, Jiahui

    2016-06-01

    The relationship between microbial diversity and sludge toxicity in the biotreatment of refractory wastewater was investigated. Synthetic wastewater containing 4-chlorophenol (4-CP) was treated by an activated sludge using a sequencing batch bioreactor (SBR). At the end of a single SBR cycle, a stable operation stage was reached when the 4-CP was not detected both in aqueous and sludge phases and the effluent COD was maintained at approximately 70 mg L(-1) for the blank and control sludge groups. Then, the diversity of the microorganisms and the sludge toxicity were measured. The results showed that the Microtox acute toxicity of the control sludge was higher than those of the blank sludge. The difference analysis of the microbial diversity between the blank and control sludge indicated that the sludge toxicity was closely related to microbial diversity. PMID:27016808

  13. Diversity of Δ12 fatty acid desaturases in santalaceae and their role in production of seed oil acetylenic fatty acids.

    PubMed

    Okada, Shoko; Zhou, Xue-Rong; Damcevski, Katherine; Gibb, Nerida; Wood, Craig; Hamberg, Mats; Haritos, Victoria S

    2013-11-01

    Plants in the Santalaceae family, including the native cherry Exocarpos cupressiformis and sweet quandong Santalum acuminatum, accumulate ximenynic acid (trans-11-octadecen-9-ynoic acid) in their seed oil and conjugated polyacetylenic fatty acids in root tissue. Twelve full-length genes coding for microsomal Δ12 fatty acid desaturases (FADs) from the two Santalaceae species were identified by degenerate PCR. Phylogenetic analysis of the predicted amino acid sequences placed five Santalaceae FADs with Δ12 FADs, which include Arabidopsis thaliana FAD2. When expressed in yeast, the major activity of these genes was Δ12 desaturation of oleic acid, but unusual activities were also observed: i.e. Δ15 desaturation of linoleic acid as well as trans-Δ12 and trans-Δ11 desaturations of stearolic acid (9-octadecynoic acid). The trans-12-octadecen-9-ynoic acid product was also detected in quandong seed oil. The two other FAD groups (FADX and FADY) were present in both species; in a phylogenetic tree of microsomal FAD enzymes, FADX and FADY formed a unique clade, suggesting that are highly divergent. The FADX group enzymes had no detectable Δ12 FAD activity but instead catalyzed cis-Δ13 desaturation of stearolic acid when expressed in yeast. No products were detected for the FADY group when expressed recombinantly. Quantitative PCR analysis showed that the FADY genes were expressed in leaf rather than developing seed of the native cherry. FADs with promiscuous and unique activities have been identified in Santalaceae and explain the origin of some of the unusual lipids found in this plant family. PMID:24062307

  14. Diversity of Δ12 Fatty Acid Desaturases in Santalaceae and Their Role in Production of Seed Oil Acetylenic Fatty Acids*

    PubMed Central

    Okada, Shoko; Zhou, Xue-Rong; Damcevski, Katherine; Gibb, Nerida; Wood, Craig; Hamberg, Mats; Haritos, Victoria S.

    2013-01-01

    Plants in the Santalaceae family, including the native cherry Exocarpos cupressiformis and sweet quandong Santalum acuminatum, accumulate ximenynic acid (trans-11-octadecen-9-ynoic acid) in their seed oil and conjugated polyacetylenic fatty acids in root tissue. Twelve full-length genes coding for microsomal Δ12 fatty acid desaturases (FADs) from the two Santalaceae species were identified by degenerate PCR. Phylogenetic analysis of the predicted amino acid sequences placed five Santalaceae FADs with Δ12 FADs, which include Arabidopsis thaliana FAD2. When expressed in yeast, the major activity of these genes was Δ12 desaturation of oleic acid, but unusual activities were also observed: i.e. Δ15 desaturation of linoleic acid as well as trans-Δ12 and trans-Δ11 desaturations of stearolic acid (9-octadecynoic acid). The trans-12-octadecen-9-ynoic acid product was also detected in quandong seed oil. The two other FAD groups (FADX and FADY) were present in both species; in a phylogenetic tree of microsomal FAD enzymes, FADX and FADY formed a unique clade, suggesting that are highly divergent. The FADX group enzymes had no detectable Δ12 FAD activity but instead catalyzed cis-Δ13 desaturation of stearolic acid when expressed in yeast. No products were detected for the FADY group when expressed recombinantly. Quantitative PCR analysis showed that the FADY genes were expressed in leaf rather than developing seed of the native cherry. FADs with promiscuous and unique activities have been identified in Santalaceae and explain the origin of some of the unusual lipids found in this plant family. PMID:24062307

  15. Two distinct ferredoxins from Rhodobacter capsulatus: complete amino acid sequences and molecular evolution.

    PubMed

    Saeki, K; Suetsugu, Y; Yao, Y; Horio, T; Marrs, B L; Matsubara, H

    1990-09-01

    Two distinct ferredoxins were purified from Rhodobacter capsulatus SB1003. Their complete amino acid sequences were determined by a combination of protease digestion, BrCN cleavage and Edman degradation. Ferredoxins I and II were composed of 64 and 111 amino acids, respectively, with molecular weights of 6,728 and 12,549 excluding iron and sulfur atoms. Both contained two Cys clusters in their amino acid sequences. The first cluster of ferredoxin I and the second cluster of ferredoxin II had a sequence, CxxCxxCxxxCP, in common with the ferredoxins found in Clostridia. The second cluster of ferredoxin I had a sequence, CxxCxxxxxxxxCxxxCM, with extra amino acids between the second and third Cys, which has been reported for other photosynthetic bacterial ferredoxins and putative ferredoxins (nif-gene products) from nitrogen-fixing bacteria, and with a unique occurrence of Met. The first cluster of ferredoxin II had a CxxCxxxxCxxxCP sequence, with two additional amino acids between the second and third Cys, a characteristics feature of Azotobacter-[3Fe-4S] [4Fe-4S]-ferredoxin. Ferredoxin II was also similar to Azotobacter-type ferredoxins with an extended carboxyl (C-) terminal sequence compared to the common Clostridium-type. The evolutionary relationship of the two together with a putative one recently found to be encoded in nifENXQ region in this bacterium [Moreno-Vivian et al. (1989) J. Bacteriol. 171, 2591-2598] is discussed. PMID:2277040

  16. Microbial diversity and metabolite composition of Belgian red-brown acidic ales.

    PubMed

    Snauwaert, Isabel; Roels, Sanne P; Van Nieuwerburg, Filip; Van Landschoot, Anita; De Vuyst, Luc; Vandamme, Peter

    2016-03-16

    Belgian red-brown acidic ales are sour and alcoholic fermented beers, which are produced by mixed-culture fermentation and blending. The brews are aged in oak barrels for about two years, after which mature beer is blended with young, non-aged beer to obtain the end-products. The present study evaluated the microbial community diversity of Belgian red-brown acidic ales at the end of the maturation phase of three subsequent brews of three different breweries. The microbial diversity was compared with the metabolite composition of the brews at the end of the maturation phase. Therefore, mature brew samples were subjected to 454 pyrosequencing of the 16S rRNA gene (bacteria) and the internal transcribed spacer region (yeasts) and a broad range of metabolites was quantified. The most important microbial species present in the Belgian red-brown acidic ales investigated were Pediococcus damnosus, Dekkera bruxellensis, and Acetobacter pasteurianus. In addition, this culture-independent analysis revealed operational taxonomic units that were assigned to an unclassified fungal community member, Candida, and Lactobacillus. The main metabolites present in the brew samples were L-lactic acid, D-lactic acid, and ethanol, whereas acetic acid was produced in lower quantities. The most prevailing aroma compounds were ethyl acetate, isoamyl acetate, ethyl hexanoate, and ethyl octanoate, which might be of impact on the aroma of the end-products. PMID:26802571

  17. Amino Acid Sequence of Anionic Peroxidase from the Windmill Palm Tree Trachycarpus fortunei

    PubMed Central

    2015-01-01

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications. PMID:25383699

  18. Protein chemotaxonomy. XIII. Amino acid sequence of ferredoxin from Panax ginseng.

    PubMed

    Mino, Yoshiki

    2006-08-01

    The complete amino acid sequence of [2Fe-2S] ferredoxin from Panax ginseng (Araliaceae) has been determined by automated Edman degradation of the entire S-carboxymethylcysteinyl protein and of the peptides obtained by enzymatic digestion. This ferredoxin has a unique amino acid sequence, which includes an insertion of Tyr at the 3rd position from the amino-terminus and a deletion of two amino acid residues at the carboxyl terminus. This ferredoxin had 18 differences in its amino acid sequence compared to that of Petroselinum sativum (Umbelliferae). In contrast, 23-33 differences were observed compared to other dicotyledonous plants. This suggests that Panax ginseng is related taxonomically to umbelliferous plants. PMID:16880642

  19. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    PubMed

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor. PMID:2708331

  20. Phylogenetic Diversity of Lactic Acid Bacteria Associated with Paddy Rice Silage as Determined by 16S Ribosomal DNA Analysis

    PubMed Central

    Ennahar, Saïd; Cai, Yimin; Fujita, Yasuhito

    2003-01-01

    A total of 161 low-G+C-content gram-positive bacteria isolated from whole-crop paddy rice silage were classified and subjected to phenotypic and genetic analyses. Based on morphological and biochemical characters, these presumptive lactic acid bacterium (LAB) isolates were divided into 10 groups that included members of the genera Enterococcus, Lactobacillus, Lactococcus, Leuconostoc, Pediococcus, and Weissella. Analysis of the 16S ribosomal DNA (rDNA) was used to confirm the presence of the predominant groups indicated by phenotypic analysis and to determine the phylogenetic affiliation of representative strains. The virtually complete 16S rRNA gene was PCR amplified and sequenced. The sequences from the various LAB isolates showed high degrees of similarity to those of the GenBank reference strains (between 98.7 and 99.8%). Phylogenetic trees based on the 16S rDNA sequence displayed high consistency, with nodes supported by high bootstrap values. With the exception of one species, the genetic data was in agreement with the phenotypic identification. The prevalent LAB, predominantly homofermentative (66%), consisted of Lactobacillus plantarum (24%), Lactococcus lactis (22%), Leuconostoc pseudomesenteroides (20%), Pediococcus acidilactici (11%), Lactobacillus brevis (11%), Enterococcus faecalis (7%), Weissella kimchii (3%), and Pediococcus pentosaceus (2%). The present study, the first to fully document rice-associated LAB, showed a very diverse community of LAB with a relatively high number of species involved in the fermentation process of paddy rice silage. The comprehensive 16S rDNA-based approach to describing LAB community structure was valuable in revealing the large diversity of bacteria inhabiting paddy rice silage and enabling the future design of appropriate inoculants aimed at improving its fermentation quality. PMID:12514026

  1. Molecular characterization of insulin from squamate reptiles reveals sequence diversity and possible adaptive evolution.

    PubMed

    Yamagishi, Genki; Yoshida, Ayaka; Kobayashi, Aya; Park, Min Kyun

    2016-01-01

    The Squamata are the most adaptive and prosperous group among ectothermic amniotes, reptiles, due to their species-richness and geographically wide habitat. Although the molecular mechanisms underlying their prosperity remain largely unknown, unique features have been reported from hormones that regulate energy metabolism. Insulin, a central anabolic hormone, is one such hormone, as its roles and effectiveness in regulation of blood glucose levels remain to be examined in squamates. In the present study, cDNAs coding for insulin were isolated from multiple species that represent various groups of squamates. The deduced amino acid sequences showed a high degree of divergence, with four lineages showing obviously higher number of amino acid substitutions than most of vertebrates, from teleosts to mammals. Among 18 sites presented to comprise the two receptor binding surfaces (one with 12 sites and the other with 6 sites), substitutions were observed in 13 sites. Among them was the substitution of HisB10, which results in the loss of the ability to hexamerize. Furthermore, three of these substitutions were reported to increase mitogenicity in human analogues. These substitutions were also reported from insulin of hystricomorph rodents and agnathan fishes, whose mitogenic potency have been shown to be increased. The estimated value of the non-synonymous-to-synonymous substitution ratio (ω) for the Squamata clade was larger than those of the other reptiles and aves. Even higher values were estimated for several lineages among squamates. These results, together with the regulatory mechanisms of digestion and nutrient assimilation in squamates, suggested a possible adaptive process through the molecular evolution of squamate INS. Further studies on the roles of insulin, in relation to the physiological and ecological traits of squamate species, will provide an insight into the molecular mechanisms that have led to the adaptivity and prosperity of squamates. PMID:26344944

  2. Complete cDNA and derived amino acid sequence of human factor V

    SciTech Connect

    Jenny, R.J.; Pittman, D.D.; Toole, J.J.; Kriz, R.W.; Aldape, R.A.; Hewick, R.M.; Kaufman, R.J.; Mann, K.G.

    1987-07-01

    cDNA clones encoding human factor V have been isolated from an oligo(dT)-primed human fetal liver cDNA library prepared with vector Charon 21A. The cDNA sequence of factor V from three overlapping clones includes a 6672-base-pair (bp) coding region, a 90-bp 5' untranslated region, and a 163-bp 3' untranslated region within which is a poly(A)tail. The deduced amino acid sequence consists of 2224 amino acids inclusive of a 28-amino acid leader peptide. Direct comparison with human factor VIII reveals considerable homology between proteins in amino acid sequence and domain structure: a triplicated A domain and duplicated C domain show approx. 40% identity with the corresponding domains in factor VIII. As in factor VIII, the A domains of factor V share approx. 40% amino acid-sequence homology with the three highly conserved domains in ceruloplasmin. The B domain of factor V contains 35 tandem and approx. 9 additional semiconserved repeats of nine amino acids of the form Asp-Leu-Ser-Gln-Thr-Thr/Asn-Leu-Ser-Pro and 2 additional semiconserved repeats of 17 amino acids. Factor V contains 37 potential N-linked glycosylation sites, 25 of which are in the B domain, and a total of 19 cysteine residues.

  3. N-terminal sequence of amino acids and some properties of an acid-stable alpha-amylase from citric acid-koji (Aspergillus usamii var.).

    PubMed

    Suganuma, T; Tahara, N; Kitahara, K; Nagahama, T; Inuzuka, K

    1996-01-01

    An acid-stable alpha-amylase (AA) was purified from an acidic extract of citric acid-koji (A. usamii var.). The N-terminal sequence of the first 20 amino acids of the enzyme was identical with that of AA from A. niger, but the two enzymes differed in molecular weight. HPLC analysis for identifying the anomers of products indicated that the AA hydrolyzed maltopentaose (G5) at the third glycoside bond predominantly, which differed from Taka-amylase A and the neutral alpha-amylase (NA) from the citric acid-koji. PMID:8824843

  4. Prevalence and genetic diversity of clinical Vibrio parahaemolyticus isolates from China, revealed by multilocus sequence typing scheme.

    PubMed

    Han, Dongsheng; Tang, Hui; Ren, Chuanli; Wang, Guangzhou; Zhou, Lin; Han, Chongxu

    2015-01-01

    The population structure of clinical Vibrio parahaemolyticus isolates spreading in China remains undefined. We brought 218 clinical isolates from the pubMLST database originating from different regions of China collected since the year of 1990, analyzed by multilocus sequence typing (MLST), to elucidate the prevalence and genetic diversity of V. parahaemolyticus circulating in Chinese population. The MLST scheme produced 137 sequence types (STs). These STs were clustered into six clonal complexes (CCs), six doublets, and 91 singletons, exhibiting a high level of genetic diversity. However, less diversity was displayed on the peptide level: only 46 different peptide sequence type (pST) were generated, with pST2 (44.0%, 96/218) and pST1 (15.1%, 33/218) the predominant. Further analysis confirmed all the pSTs belong to a single complex founded by pST1, pST2, pST3, and pST4. recA presented the highest degree of nucleotide diversity (0.026) and the largest number of variable sites (176) on the nucleotide level. pyrC was the most diverse locus on the peptide level, possessing the highest percentage of variable sites (9.2%, 15/163). Significant linkage disequilibrium with the alleles was detected when the Standardized Index of Association (I(S) A ) was calculated both for the entire isolates collection (0.7169, P < 0.01) and for the 137 STs (I(S) A = 0.2648, P < 0.01). In conclusion, we provide an overview of prevalence and genetic diversity of clinical V. parahaemolyticus spreading in Chinese population using MLST analysis. The results would offer genetic evidences for uncovering the microevolution relationship of V. parahaemolyticus populations. PMID:25914691

  5. Estimates of Soil Bacterial Ribosome Content and Diversity Are Significantly Affected by the Nucleic Acid Extraction Method Employed.

    PubMed

    Wüst, Pia K; Nacke, Heiko; Kaiser, Kristin; Marhan, Sven; Sikorski, Johannes; Kandeler, Ellen; Daniel, Rolf; Overmann, Jörg

    2016-05-01

    Modern sequencing technologies allow high-resolution analyses of total and potentially active soil microbial communities based on their DNA and RNA, respectively. In the present study, quantitative PCR and 454 pyrosequencing were used to evaluate the effects of different extraction methods on the abundance and diversity of 16S rRNA genes and transcripts recovered from three different types of soils (leptosol, stagnosol, and gleysol). The quality and yield of nucleic acids varied considerably with respect to both the applied extraction method and the analyzed type of soil. The bacterial ribosome content (calculated as the ratio of 16S rRNA transcripts to 16S rRNA genes) can serve as an indicator of the potential activity of bacterial cells and differed by 2 orders of magnitude between nucleic acid extracts obtained by the various extraction methods. Depending on the extraction method, the relative abundances of dominant soil taxa, in particularActinobacteriaandProteobacteria, varied by a factor of up to 10. Through this systematic approach, the present study allows guidelines to be deduced for the selection of the appropriate extraction protocol according to the specific soil properties, the nucleic acid of interest, and the target organisms. PMID:26896137

  6. Phylogenetic Diversity of the Bacillus pumilus Group and the Marine Ecotype Revealed by Multilocus Sequence Analysis

    PubMed Central

    Dong, Chunming; Sun, Fengqin; Wang, Liping; Li, Guangyu; Shao, Zongze

    2013-01-01

    Bacteria closely related to Bacillus pumilus cannot be distinguished from such other species as B. safensis, B. stratosphericus, B. altitudinis and B. aerophilus simply by 16S rRNA gene sequence. In this report, 76 marine strains were subjected to phylogenetic analysis based on 7 housekeeping genes to understand the phylogeny and biogeography in comparison with other origins. A phylogenetic tree based on the 7 housekeeping genes concatenated in the order of gyrB-rpoB-pycA-pyrE-mutL-aroE-trpB was constructed and compared with trees based on the single genes. All these trees exhibited a similar topology structure with small variations. Our 79 strains were divided into 6 groups from A to F; Group A was the largest and contained 49 strains close to B. altitudinis. Additional two large groups were presented by B. safensis and B. pumilus respectively. Among the housekeeping genes, gyrB and pyrE showed comparatively better resolution power and may serve as molecular markers to distinguish these closely related strains. Furthermore, a recombinant phylogenetic tree based on the gyrB gene and containing 73 terrestrial and our isolates was constructed to detect the relationship between marine and other sources. The tree clearly showed that the bacteria of marine origin were clustered together in all the large groups. In contrast, the cluster belonging to B. safensis was mainly composed of bacteria of terrestrial origin. Interestingly, nearly all the marine isolates were at the top of the tree, indicating the possibility of the recent divergence of this bacterial group in marine environments. We conclude that B. altitudinis bacteria are the most widely spread of the B. pumilus group in marine environments. In summary, this report provides the first evidence regarding the systematic evolution of this bacterial group, and knowledge of their phylogenetic diversity will help in the understanding of their ecological role and distribution in marine environments. PMID:24244618

  7. Fingerprinting the Asterid Species Using Subtracted Diversity Array Reveals Novel Species-Specific Sequences

    PubMed Central

    Mantri, Nitin; Olarte, Alexandra; Li, Chun Guang; Xue, Charlie; Pang, Edwin C. K.

    2012-01-01

    Background Asterids is one of the major plant clades comprising of many commercially important medicinal species. One of the major concerns in medicinal plant industry is adulteration/contamination resulting from misidentification of herbal plants. This study reports the construction and validation of a microarray capable of fingerprinting medicinally important species from the Asterids clade. Methodology/Principal Findings Pooled genomic DNA of 104 non-asterid angiosperm and non-angiosperm species was subtracted from pooled genomic DNA of 67 asterid species. Subsequently, 283 subtracted DNA fragments were used to construct an Asterid-specific array. The validation of Asterid-specific array revealed a high (99.5%) subtraction efficiency. Twenty-five Asterid species (mostly medicinal) representing 20 families and 9 orders within the clade were hybridized onto the array to reveal its level of species discrimination. All these species could be successfully differentiated using their hybridization patterns. A number of species-specific probes were identified for commercially important species like tea, coffee, dandelion, yarrow, motherwort, Japanese honeysuckle, valerian, wild celery, and yerba mate. Thirty-seven polymorphic probes were characterized by sequencing. A large number of probes were novel species-specific probes whilst some of them were from chloroplast region including genes like atpB, rpoB, and ndh that have extensively been used for fingerprinting and phylogenetic analysis of plants. Conclusions/Significance Subtracted Diversity Array technique is highly efficient in fingerprinting species with little or no genomic information. The Asterid-specific array could fingerprint all 25 species assessed including three species that were not used in constructing the array. This study validates the use of chloroplast genes for bar-coding (fingerprinting) plant species. In addition, this method allowed detection of several new loci that can be explored to solve

  8. Whole-Genome Sequencing Reveals Diverse Models of Structural Variations in Esophageal Squamous Cell Carcinoma.

    PubMed

    Cheng, Caixia; Zhou, Yong; Li, Hongyi; Xiong, Teng; Li, Shuaicheng; Bi, Yanghui; Kong, Pengzhou; Wang, Fang; Cui, Heyang; Li, Yaoping; Fang, Xiaodong; Yan, Ting; Li, Yike; Wang, Juan; Yang, Bin; Zhang, Ling; Jia, Zhiwu; Song, Bin; Hu, Xiaoling; Yang, Jie; Qiu, Haile; Zhang, Gehong; Liu, Jing; Xu, Enwei; Shi, Ruyi; Zhang, Yanyan; Liu, Haiyan; He, Chanting; Zhao, Zhenxiang; Qian, Yu; Rong, Ruizhou; Han, Zhiwei; Zhang, Yanlin; Luo, Wen; Wang, Jiaqian; Peng, Shaoliang; Yang, Xukui; Li, Xiangchun; Li, Lin; Fang, Hu; Liu, Xingmin; Ma, Li; Chen, Yunqing; Guo, Shiping; Chen, Xing; Xi, Yanfeng; Li, Guodong; Liang, Jianfang; Yang, Xiaofeng; Guo, Jiansheng; Jia, JunMei; Li, Qingshan; Cheng, Xiaolong; Zhan, Qimin; Cui, Yongping

    2016-02-01

    Comprehensive identification of somatic structural variations (SVs) and understanding their mutational mechanisms in cancer might contribute to understanding biological differences and help to identify new therapeutic targets. Unfortunately, characterization of complex SVs across the whole genome and the mutational mechanisms underlying esophageal squamous cell carcinoma (ESCC) is largely unclear. To define a comprehensive catalog of somatic SVs, affected target genes, and their underlying mechanisms in ESCC, we re-analyzed whole-genome sequencing (WGS) data from 31 ESCCs using Meerkat algorithm to predict somatic SVs and Patchwork to determine copy-number changes. We found deletions and translocations with NHEJ and alt-EJ signature as the dominant SV types, and 16% of deletions were complex deletions. SVs frequently led to disruption of cancer-associated genes (e.g., CDKN2A and NOTCH1) with different mutational mechanisms. Moreover, chromothripsis, kataegis, and breakage-fusion-bridge (BFB) were identified as contributing to locally mis-arranged chromosomes that occurred in 55% of ESCCs. These genomic catastrophes led to amplification of oncogene through chromothripsis-derived double-minute chromosome formation (e.g., FGFR1 and LETM2) or BFB-affected chromosomes (e.g., CCND1, EGFR, ERBB2, MMPs, and MYC), with approximately 30% of ESCCs harboring BFB-derived CCND1 amplification. Furthermore, analyses of copy-number alterations reveal high frequency of whole-genome duplication (WGD) and recurrent focal amplification of CDCA7 that might act as a potential oncogene in ESCC. Our findings reveal molecular defects such as chromothripsis and BFB in malignant transformation of ESCCs and demonstrate diverse models of SVs-derived target genes in ESCCs. These genome-wide SV profiles and their underlying mechanisms provide preventive, diagnostic, and therapeutic implications for ESCCs. PMID:26833333

  9. Whole-Genome Sequencing Reveals Diverse Models of Structural Variations in Esophageal Squamous Cell Carcinoma

    PubMed Central

    Cheng, Caixia; Zhou, Yong; Li, Hongyi; Xiong, Teng; Li, Shuaicheng; Bi, Yanghui; Kong, Pengzhou; Wang, Fang; Cui, Heyang; Li, Yaoping; Fang, Xiaodong; Yan, Ting; Li, Yike; Wang, Juan; Yang, Bin; Zhang, Ling; Jia, Zhiwu; Song, Bin; Hu, Xiaoling; Yang, Jie; Qiu, Haile; Zhang, Gehong; Liu, Jing; Xu, Enwei; Shi, Ruyi; Zhang, Yanyan; Liu, Haiyan; He, Chanting; Zhao, Zhenxiang; Qian, Yu; Rong, Ruizhou; Han, Zhiwei; Zhang, Yanlin; Luo, Wen; Wang, Jiaqian; Peng, Shaoliang; Yang, Xukui; Li, Xiangchun; Li, Lin; Fang, Hu; Liu, Xingmin; Ma, Li; Chen, Yunqing; Guo, Shiping; Chen, Xing; Xi, Yanfeng; Li, Guodong; Liang, Jianfang; Yang, Xiaofeng; Guo, Jiansheng; Jia, JunMei; Li, Qingshan; Cheng, Xiaolong; Zhan, Qimin; Cui, Yongping

    2016-01-01

    Comprehensive identification of somatic structural variations (SVs) and understanding their mutational mechanisms in cancer might contribute to understanding biological differences and help to identify new therapeutic targets. Unfortunately, characterization of complex SVs across the whole genome and the mutational mechanisms underlying esophageal squamous cell carcinoma (ESCC) is largely unclear. To define a comprehensive catalog of somatic SVs, affected target genes, and their underlying mechanisms in ESCC, we re-analyzed whole-genome sequencing (WGS) data from 31 ESCCs using Meerkat algorithm to predict somatic SVs and Patchwork to determine copy-number changes. We found deletions and translocations with NHEJ and alt-EJ signature as the dominant SV types, and 16% of deletions were complex deletions. SVs frequently led to disruption of cancer-associated genes (e.g., CDKN2A and NOTCH1) with different mutational mechanisms. Moreover, chromothripsis, kataegis, and breakage-fusion-bridge (BFB) were identified as contributing to locally mis-arranged chromosomes that occurred in 55% of ESCCs. These genomic catastrophes led to amplification of oncogene through chromothripsis-derived double-minute chromosome formation (e.g., FGFR1 and LETM2) or BFB-affected chromosomes (e.g., CCND1, EGFR, ERBB2, MMPs, and MYC), with approximately 30% of ESCCs harboring BFB-derived CCND1 amplification. Furthermore, analyses of copy-number alterations reveal high frequency of whole-genome duplication (WGD) and recurrent focal amplification of CDCA7 that might act as a potential oncogene in ESCC. Our findings reveal molecular defects such as chromothripsis and BFB in malignant transformation of ESCCs and demonstrate diverse models of SVs-derived target genes in ESCCs. These genome-wide SV profiles and their underlying mechanisms provide preventive, diagnostic, and therapeutic implications for ESCCs. PMID:26833333

  10. Sequence-Structure-Function Classification of a Catalytically Diverse Oxidoreductase Superfamily in Mycobacteria.

    PubMed

    Ahmed, F Hafna; Carr, Paul D; Lee, Brendon M; Afriat-Jurnou, Livnat; Mohamed, A Elaaf; Hong, Nan-Sook; Flanagan, Jack; Taylor, Matthew C; Greening, Chris; Jackson, Colin J

    2015-11-01

    The deazaflavin cofactor F420 enhances the persistence of mycobacteria during hypoxia, oxidative stress, and antibiotic treatment. However, the identities and functions of the mycobacterial enzymes that utilize F420 under these conditions have yet to be resolved. In this work, we used sequence similarity networks to analyze the distribution of the largest F420-dependent protein family in mycobacteria. We show that these enzymes are part of a larger split β-barrel enzyme superfamily (flavin/deazaflavin oxidoreductases, FDORs) that include previously characterized pyridoxamine/pyridoxine-5'-phosphate oxidases and heme oxygenases. We show that these proteins variously utilize F420, flavin mononucleotide, flavin adenine dinucleotide, and heme cofactors. Functional annotation using phylogenetic, structural, and spectroscopic methods revealed their involvement in heme degradation, biliverdin reduction, fatty acid modification, and quinone reduction. Four novel crystal structures show that plasticity in substrate binding pockets and modifications to cofactor binding motifs enabled FDORs to carry out a variety of functions. This systematic classification and analysis provides a framework for further functional analysis of the roles of FDORs in mycobacterial pathogenesis and persistence. PMID:26434506

  11. Whole-Genome Sequencing of Kaposi's Sarcoma-Associated Herpesvirus from Zambian Kaposi's Sarcoma Biopsy Specimens Reveals Unique Viral Diversity

    PubMed Central

    Olp, Landon N.; Jeanniard, Adrien; Marimo, Clemence; West, John T.

    2015-01-01

    ABSTRACT Kaposi's sarcoma-associated herpesvirus (KSHV) is the etiological agent for Kaposi's sarcoma (KS). Both KSHV and KS are endemic in sub-Saharan Africa where approximately 84% of global KS cases occur. Nevertheless, whole-genome sequencing of KSHV has only been completed using isolates from Western countries—where KS is not endemic. The lack of whole-genome KSHV sequence data from the most clinically important geographical region, sub-Saharan Africa, represents an important gap since it remains unclear whether genomic diversity has a role on KSHV pathogenesis. We hypothesized that distinct KSHV genotypes might be present in sub-Saharan Africa compared to Western countries. Using a KSHV-targeted enrichment protocol followed by Illumina deep-sequencing, we generated and analyzed 16 unique Zambian, KS-derived, KSHV genomes. We enriched KSHV DNA over cellular DNA 1,851 to 18,235-fold. Enrichment provided coverage levels up to 24,740-fold; therefore, supporting highly confident polymorphism analysis. Multiple alignment of the 16 newly sequenced KSHV genomes showed low level variability across the entire central conserved region. This variability resulted in distinct phylogenetic clustering between Zambian KSHV genomic sequences and those derived from Western countries. Importantly, the phylogenetic segregation of Zambian from Western sequences occurred irrespective of inclusion of the highly variable genes K1 and K15. We also show that four genes within the more conserved region of the KSHV genome contained polymorphisms that partially, but not fully, contributed to the unique Zambian KSHV whole-genome phylogenetic structure. Taken together, our data suggest that the whole KSHV genome should be taken into consideration for accurate viral characterization. IMPORTANCE Our results represent the largest number of KSHV whole-genomic sequences published to date and the first time that multiple genomes have been sequenced from sub-Saharan Africa, a geographic area

  12. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1997-01-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.

  13. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1997-04-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.

  14. Semiconductor Sequencing Reveals the Diversity of Bacterial Communities in an Amazon Reservoir Considered as a Methane Source

    NASA Astrophysics Data System (ADS)

    Graças, D. A.; Ramos, R. T.; Sá, P. G.; Baraúna, R. A.; Schneider, M. C.; Silva, A.

    2013-05-01

    The Amazon region has enormous hydro potential which is used for power generation. In fact, there are several hydroelectric power stations (HPS) already installed and many under construction or designed. It's in the Amazon which the HPS of Tucuruí, fifth largest in the world, is located. The construction of this hydroelectric dam flooded an area of 2,400 km2 of forest that decomposing, releasing greenhouse gases such as methane (CH4). Methane is the most abundant organic gas in the atmosphere and the second most important greenhouse gas. In this study, we use semicondutor sequencing to assess the bacterial diversity along a water column of 70 meters deep in the Tucuruí reservoir. One liter of water was collected every 10 meters along the water column for total DNA extraction. A fragment of approximately 150 base pairs of the 16S rRNA gene was amplified by polymerase chain reaction using universal primers. These fragments were then paralleled sequenced in Ion Torrent® platform using barcodes on the 316 chip. After the quality filters, about 237 thousands reads were obtained, representing more than 300 Mbp. For bacterial diversity analysis, we used only reads longer than 100 base pairs. The taxonomic diversity was obtained from the Ribosomal Database Project Classifier and alpha diversity analysis (diversity indices and rarefaction) was performed using the RDP pyrosequencing pipeline. Although it is recommended for data pyrosequencing, that pipeline is able to process data obtained from semiconductor sequencing once all of them are fasta files. Over 75% of the sequences were not classified in any phylum, which leads us to believe that there is a huge diversity in the bacterial environment whose function is still unclear. Among the sequences that could be classified, there is a predominance of proteobacteria in all layers, but in higher concentrations at the lower layers. Cyanobacteria accounted for about 3% in the layers of 0m and 10m, leading us to conclude that

  15. Distribution and diversity of Verrucomicrobia methanotrophs in geothermal and acidic environments.

    PubMed

    Sharp, Christine E; Smirnova, Angela V; Graham, Jaime M; Stott, Matthew B; Khadka, Roshan; Moore, Tim R; Grasby, Stephen E; Strack, Maria; Dunfield, Peter F

    2014-06-01

    Recently, methanotrophic members of the phylum Verrucomicrobia have been described, but little is known about their distribution in nature. We surveyed methanotrophic bacteria in geothermal springs and acidic wetlands via pyrosequencing of 16S rRNA gene amplicons. Putative methanotrophic Verrucomicrobia were found in samples covering a broad temperature range (22.5-81.6°C), but only in acidic conditions (pH 1.8-5.0) and only in geothermal environments, not in acidic bogs or fens. Phylogenetically, three 16S rRNA gene sequence clusters of putative methanotrophic Verrucomicrobia were observed. Those detected in high-temperature geothermal samples (44.1-81.6°C) grouped with known thermoacidiphilic 'Methylacidiphilum' isolates. A second group dominated in moderate-temperature geothermal samples (22.5-40.1°C) and a representative mesophilic methanotroph from this group was isolated (strain LP2A). Genome sequencing verified that strain LP2A possessed particulate methane monooxygenase, but its 16S rRNA gene sequence identity to 'Methylacidiphilum infernorum' strain V4 was only 90.6%. A third group clustered distantly with known methanotrophic Verrucomicrobia. Using pmoA-gene targeted quantitative polymerase chain reaction, two geothermal soil profiles showed a dominance of LP2A-like pmoA sequences in the cooler surface layers and 'Methylacidiphilum'-like pmoA sequences in deeper, hotter layers. Based on these results, there appears to be a thermophilic group and a mesophilic group of methanotrophic Verrucomicrobia. However, both were detected only in acidic geothermal environments. PMID:24650084

  16. Turkey fecal microbial community structure and functional gene diversity revealed by 16S rRNA gene and metagenomic sequences.

    PubMed

    Lu, Jingrang; Domingo, Jorge Santo

    2008-10-01

    The primary goal of this study was to better understand the microbial composition and functional genetic diversity associated with turkey fecal communities. To achieve this, 16S rRNA gene and metagenomic clone libraries were sequenced from turkey fecal samples. The analysis of 382 16S rRNA gene sequences showed that the most abundant bacteria were closely related to Lactobacillales (47%), Bacillales (31%), and Clostridiales (11%). Actinomycetales, Enterobacteriales, and Bacteroidales sequences were also identified, but represented a smaller part of the community. The analysis of 379 metagenomic sequences showed that most clones were similar to bacterial protein sequences (58%). Bacteriophage (10%) and avian viruses (3%) sequences were also represented. Of all metagenomic clones potentially encoding for bacterial proteins, most were similar to low G+C Gram-positive bacterial proteins, particularly from Lactobacillales (50%), Bacillales (11%), and Clostridiales (8%). Bioinformatic analyses suggested the presence of genes encoding for membrane proteins, lipoproteins, hydrolases, and functional genes associated with the metabolism of nitrogen and sulfur containing compounds. The results from this study further confirmed the predominance of Firmicutes in the avian gut and highlight the value of coupling 16S rRNA gene and metagenomic sequencing data analysis to study the microbial composition of avian fecal microbial communities. PMID:18974945

  17. Violation of an evolutionarily conserved immunoglobulin diversity gene sequence preference promotes production of dsDNA-specific IgG antibodies.

    PubMed

    Silva-Sanchez, Aaron; Liu, Cun Ren; Vale, Andre M; Khass, Mohamed; Kapoor, Pratibha; Elgavish, Ada; Ivanov, Ivaylo I; Ippolito, Gregory C; Schelonka, Robert L; Schoeb, Trenton R; Burrows, Peter D; Schroeder, Harry W

    2015-01-01

    Variability in the developing antibody repertoire is focused on the third complementarity determining region of the H chain (CDR-H3), which lies at the center of the antigen binding site where it often plays a decisive role in antigen binding. The power of VDJ recombination and N nucleotide addition has led to the common conception that the sequence of CDR-H3 is unrestricted in its variability and random in its composition. Under this view, the immune response is solely controlled by somatic positive and negative clonal selection mechanisms that act on individual B cells to promote production of protective antibodies and prevent the production of self-reactive antibodies. This concept of a repertoire of random antigen binding sites is inconsistent with the observation that diversity (DH) gene segment sequence content by reading frame (RF) is evolutionarily conserved, creating biases in the prevalence and distribution of individual amino acids in CDR-H3. For example, arginine, which is often found in the CDR-H3 of dsDNA binding autoantibodies, is under-represented in the commonly used DH RFs rearranged by deletion, but is a frequent component of rarely used inverted RF1 (iRF1), which is rearranged by inversion. To determine the effect of altering this germline bias in DH gene segment sequence on autoantibody production, we generated mice that by genetic manipulation are forced to utilize an iRF1 sequence encoding two arginines. Over a one year period we collected serial serum samples from these unimmunized, specific pathogen-free mice and found that more than one-fifth of them contained elevated levels of dsDNA-binding IgG, but not IgM; whereas mice with a wild type DH sequence did not. Thus, germline bias against the use of arginine enriched DH sequence helps to reduce the likelihood of producing self-reactive antibodies. PMID:25706374

  18. IMGT/HighV-QUEST Statistical Significance of IMGT Clonotype (AA) Diversity per Gene for Standardized Comparisons of Next Generation Sequencing Immunoprofiles of Immunoglobulins and T Cell Receptors.

    PubMed

    Aouinti, Safa; Malouche, Dhafer; Giudicelli, Véronique; Kossida, Sofia; Lefranc, Marie-Paule

    2015-01-01

    The adaptive immune responses of humans and of other jawed vertebrate species (gnasthostomata) are characterized by the B and T cells and their specific antigen receptors, the immunoglobulins (IG) or antibodies and the T cell receptors (TR) (up to 2.1012 different IG and TR per individual). IMGT, the international ImMunoGeneTics information system (http://www.imgt.org), was created in 1989 by Marie-Paule Lefranc (Montpellier University and CNRS) to manage the huge and complex diversity of these antigen receptors. IMGT built on IMGT-ONTOLOGY concepts of identification (keywords), description (labels), classification (gene and allele nomenclature) and numerotation (IMGT unique numbering), is at the origin of immunoinformatics, a science at the interface between immunogenetics and bioinformatics. IMGT/HighV-QUEST, the first web portal, and so far the only one, for the next generation sequencing (NGS) analysis of IG and TR, is the paradigm for immune repertoire standardized outputs and immunoprofiles of the adaptive immune responses. It provides the identification of the variable (V), diversity (D) and joining (J) genes and alleles, analysis of the V-(D)-J junction and complementarity determining region 3 (CDR3) and the characterization of the 'IMGT clonotype (AA)' (AA for amino acid) diversity and expression. IMGT/HighV-QUEST compares outputs of different batches, up to one million nucleotide sequencesfor the statistical module. These high throughput IG and TR repertoire immunoprofiles are of prime importance in vaccination, cancer, infectious diseases, autoimmunity and lymphoproliferative disorders, however their comparative statistical analysis still remains a challenge. We present a standardized statistical procedure to analyze IMGT/HighV-QUEST outputs for the evaluation of the significance of the IMGT clonotype (AA) diversity differences in proportions, per gene of a given group, between NGS IG and TR repertoire immunoprofiles. The procedure is generic and

  19. Diversity in the Major Polysaccharide Antigen of Acinetobacter Baumannii Assessed by DNA Sequencing, and Development of a Molecular Serotyping Scheme

    PubMed Central

    Dijkshoorn, Lenie; Wang, Lei; Reeves, Peter R.

    2013-01-01

    We have sequenced the gene clusters for type strains of the Acinetobacter baumannii serotyping scheme developed in the 1990s, and used the sequences to better understand diversity in surface polysaccharides of the genus. We obtained genome sequences for 27 available serovar type strains, and identified 25 polysaccharide gene cluster sequences. There are structures for 12 of these polysaccharides, and in general the genes present are appropriate to the structure where known. This greatly facilitates interpretation. We also find 53 different glycosyltransferase genes, and for 7 strains can provisionally allocate specific genes to all linkages. We identified primers that will distinguish the 25 sequence forms by PCR or microarray, or alternatively the genes can be used to determine serotype by “molecular serology”. We applied the latter to 190 Acinetobacter genome-derived gene-clusters, and found 76 that have one of the 25 gene-cluster forms. We also found novel gene clusters and added 52 new gene-cluster sequence forms with different wzy genes and different gene contents. Altogether, the strains that have one of the original 25 sequence forms include 98 A. baumannii (24 from our strains) and 5 A. nosocomialis (3 from our strains), whereas 32 genomes from 12 species other than A. baumannii or A. nosocomialis, all have new sequence forms. One of the 25 serovar type sequences is found to be in European clone I (EC I), 2 are in EC II but none in EC III. The public genome strains add an additional 52 new sequence forms, and also bring the number found in EC I to 5, in EC II to 9 and in EC III to 2. PMID:23922982

  20. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences

    NASA Technical Reports Server (NTRS)

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.

  1. Multiple and Diverse vsp and vlp Sequences in Borrelia miyamotoi, a Hard Tick-Borne Zoonotic Pathogen

    PubMed Central

    Barbour, Alan G.

    2016-01-01

    Based on chromosome sequences, the human pathogen Borrelia miyamotoi phylogenetically clusters with species that cause relapsing fever. But atypically for relapsing fever agents, B. miyamotoi is transmitted not by soft ticks but by hard ticks, which also are vectors of Lyme disease Borrelia species. To further assess the relationships of B. miyamotoi to species that cause relapsing fever, I investigated extrachromosomal sequences of a North American strain with specific attention on plasmid-borne vsp and vlp genes, which are the underpinnings of antigenic variation during relapsing fever. For a hybrid approach to achieve assemblies that spanned more than one of the paralogous vsp and vlp genes, a database of short-reads from next-generation sequencing was supplemented with long-reads obtained with real-time DNA sequencing from single polymerase molecules. This yielded three contigs of 31, 16, and 11 kb, which each contained multiple and diverse sequences that were homologous to vsp and vlp genes of the relapsing fever agent B. hermsii. Two plasmid fragments had coding sequences for plasmid partition proteins that differed from each other from paralogous proteins for the megaplasmid and a small plasmid of B. miyamotoi. One of 4 vsp genes, vsp1, was present at two loci, one of which was downstream of a candiate prokaryotic promoter. A limited RNA-seq analysis of a population growing in the blood of mice indicated that of the 4 different vsp genes vsp1 was the one that was expressed. The findings indicate that B. miyamotoi has at least four types of plasmids, two or more of which bear vsp and vlp gene sequences that are as numerous and diverse as those of relapsing fever Borrelia. The database and insights from these findings provide a foundation for further investigations of the immune responses to this pathogen and of the capability of B. miyamotoi for antigenic variation. PMID:26785134

  2. Genetic diversity and molecular evolution of Naga King Chili inferred from internal transcribed spacer sequence of nuclear ribosomal DNA

    PubMed Central

    Kehie, Mechuselie; Kumaria, Suman; Devi, Khumuckcham Sangeeta; Tandon, Pramod

    2015-01-01

    Sequences of the Internal Transcribed Spacer (ITS1-5.8S-ITS2) of nuclear ribosomal DNAs were explored to study the genetic diversity and molecular evolution of Naga King Chili. Our study indicated the occurrence of nucleotide polymorphism and haplotypic diversity in the ITS regions. The present study demonstrated that the variability of ITS1 with respect to nucleotide diversity and sequence polymorphism exceeded that of ITS2. Sequence analysis of 5.8S gene revealed a much conserved region in all the accessions of Naga King Chili. However, strong phylogenetic information of this species is the distinct 13 bp deletion in the 5.8S gene which discriminated Naga King Chili from the rest of the Capsicum sp. Neutrality test results implied a neutral variation, and population seems to be evolving at drift–mutation equilibrium and free from directed selection pressure. Furthermore, mismatch analysis showed multimodal curve indicating a demographic equilibrium. Phylogenetic relationships revealed by Median Joining Network (MJN) analysis denoted a clear discrimination of Naga King Chili from its closest sister species (Capsicumchinense and Capsicumfrutescens). The absence of star-like network of haplotypes suggested an ancient population expansion of this chili. PMID:26862481

  3. Genetic diversity and molecular evolution of Naga King Chili inferred from internal transcribed spacer sequence of nuclear ribosomal DNA.

    PubMed

    Kehie, Mechuselie; Kumaria, Suman; Devi, Khumuckcham Sangeeta; Tandon, Pramod

    2016-02-01

    Sequences of the Internal Transcribed Spacer (ITS1-5.8S-ITS2) of nuclear ribosomal DNAs were explored to study the genetic diversity and molecular evolution of Naga King Chili. Our study indicated the occurrence of nucleotide polymorphism and haplotypic diversity in the ITS regions. The present study demonstrated that the variability of ITS1 with respect to nucleotide diversity and sequence polymorphism exceeded that of ITS2. Sequence analysis of 5.8S gene revealed a much conserved region in all the accessions of Naga King Chili. However, strong phylogenetic information of this species is the distinct 13 bp deletion in the 5.8S gene which discriminated Naga King Chili from the rest of the Capsicum sp. Neutrality test results implied a neutral variation, and population seems to be evolving at drift-mutation equilibrium and free from directed selection pressure. Furthermore, mismatch analysis showed multimodal curve indicating a demographic equilibrium. Phylogenetic relationships revealed by Median Joining Network (MJN) analysis denoted a clear discrimination of Naga King Chili from its closest sister species (Capsicum chinense and Capsicum frutescens). The absence of star-like network of haplotypes suggested an ancient population expansion of this chili. PMID:26862481

  4. Development of Microsatellite Markers Derived from Expressed Sequence Tags of Polyporales for Genetic Diversity Analysis of Endangered Polyporus umbellatus.

    PubMed

    Zhang, Yuejin; Chen, Yuanyuan; Wang, Ruihong; Zeng, Ailin; Deyholos, Michael K; Shu, Jia; Guo, Hongbo

    2015-01-01

    A large scale of EST sequences of Polyporales was screened in this investigation in order to identify EST-SSR markers for various applications. The distribution of EST sequences and SSRs in five families of Polyporales was analyzed, respectively. Mononucleotide was the most abundant type, followed by trinucleotide. Among five families, Ganodermataceae occupied the most SSR markers, followed by Coriolaceae. Functional prediction of SSR marker-containing EST sequences in Ganoderma lucidum obtained three main groups, namely, cellular component, biological process, and molecular function. Thirty EST-SSR primers were designed to evaluate the genetic diversity of 13 natural Polyporus umbellatus accessions. Twenty one EST-SSRs were polymorphic with average PIC value of 0.33 and transferability rate of 71%. These 13 P. umbellatus accessions showed relatively high genetic diversity. The expected heterozygosity, Nei's gene diversity, and Shannon information index were 0.41, 0.39, and 0.57, respectively. Both UPGMA dendrogram and principal coordinate analysis (PCA) showed the same cluster result that divided the 13 accessions into three or four groups. PMID:26146636

  5. Massively parallel rRNA gene sequencing exacerbates the potential for biased community diversity comparisons due to variable library sizes

    SciTech Connect

    Gihring, Thomas; Green, Stefan; Schadt, Christopher Warren

    2011-01-01

    Technologies for massively parallel sequencing are revolutionizing microbial ecology and are vastly increasing the scale of ribosomal RNA (rRNA) gene studies. Although pyrosequencing has increased the breadth and depth of possible rRNA gene sampling, one drawback is that the number of reads obtained per sample is difficult to control. Pyrosequencing libraries typically vary widely in the number of sequences per sample, even within individual studies, and there is a need to revisit the behaviour of richness estimators and diversity indices with variable gene sequence library sizes. Multiple reports and review papers have demonstrated the bias in non-parametric richness estimators (e.g. Chao1 and ACE) and diversity indices when using clone libraries. However, we found that biased community comparisons are accumulating in the literature. Here we demonstrate the effects of sample size on Chao1, ACE, CatchAll, Shannon, Chao-Shen and Simpson's estimations specifically using pyrosequencing libraries. The need to equalize the number of reads being compared across libraries is reiterated, and investigators are directed towards available tools for making unbiased diversity comparisons.

  6. Deep COI sequencing of standardized benthic samples unveils overlooked diversity of Jordanian coral reefs in the northern Red Sea.

    PubMed

    Al-Rshaidat, Mamoon M D; Snider, Allison; Rosebraugh, Sydney; Devine, Amanda M; Devine, Thomas D; Plaisance, Laetitia; Knowlton, Nancy; Leray, Matthieu

    2016-09-01

    High-throughput sequencing (HTS) of DNA barcodes (metabarcoding), particularly when combined with standardized sampling protocols, is one of the most promising approaches for censusing overlooked cryptic invertebrate communities. We present biodiversity estimates based on sequencing of the cytochrome c oxidase subunit 1 (COI) gene for coral reefs of the Gulf of Aqaba, a semi-enclosed system in the northern Red Sea. Samples were obtained from standardized sampling devices (Autonomous Reef Monitoring Structures (ARMS)) deployed for 18 months. DNA barcoding of non-sessile specimens >2 mm revealed 83 OTUs in six phyla, of which only 25% matched a reference sequence in public databases. Metabarcoding of the 2 mm - 500 μm and sessile bulk fractions revealed 1197 OTUs in 15 animal phyla, of which only 4.9% matched reference barcodes. These results highlight the scarcity of COI data for cryptobenthic organisms of the Red Sea. Compared with data obtained using similar methods, our results suggest that Gulf of Aqaba reefs are less diverse than two Pacific coral reefs but much more diverse than an Atlantic oyster reef at a similar latitude. The standardized approaches used here show promise for establishing baseline data on biodiversity, monitoring the impacts of environmental change, and quantifying patterns of diversity at regional and global scales. PMID:27584940

  7. Diversity of acid stress resistant variants of Listeria monocytogenes and the potential role of ribosomal protein S21 encoded by rpsU

    PubMed Central

    Metselaar, Karin I.; den Besten, Heidy M. W.; Boekhorst, Jos; van Hijum, Sacha A. F. T.; Zwietering, Marcel H.; Abee, Tjakko

    2015-01-01

    The dynamic response of microorganisms to environmental conditions depends on the behavior of individual cells within the population. Adverse environments can select for stable stress resistant subpopulations. In this study, we aimed to get more insight in the diversity within Listeria monocytogenes LO28 populations, and the genetic basis for the increased resistance of stable resistant fractions isolated after acid exposure. Phenotypic cluster analysis of 23 variants resulted in three clusters and four individual variants and revealed multiple-stress resistance, with both unique and overlapping features related to stress resistance, growth, motility, biofilm formation, and virulence indicators. A higher glutamate decarboxylase activity correlated with increased acid resistance. Whole genome sequencing revealed mutations in rpsU, encoding ribosomal protein S21 in the largest phenotypic cluster, while mutations in ctsR, which were previously shown to be responsible for increased resistance of heat and high hydrostatic pressure resistant variants, were not found in the acid resistant variants. This underlined that large population diversity exists within one L. monocytogenes strain and that different adverse conditions drive selection for different variants. The finding that acid stress selects for rpsU variants provides potential insights in the mechanisms underlying population diversity of L. monocytogenes. PMID:26005439

  8. Genome Sequencing of Mycobacterium abscessus Isolates from Patients in the United States and Comparisons to Globally Diverse Clinical Strains

    PubMed Central

    Davidson, Rebecca M.; Hasan, Nabeeh A.; Reynolds, Paul R.; Totten, Sarah; Garcia, Benjamin; Levin, Adrah; Ramamoorthy, Preveen; Heifets, Leonid; Daley, Charles L.

    2014-01-01

    Nontuberculous mycobacterial infections caused by Mycobacterium abscessus are responsible for a range of disease manifestations from pulmonary to skin infections and are notoriously difficult to treat, due to innate resistance to many antibiotics. Previous population studies of clinical M. abscessus isolates utilized multilocus sequence typing or pulsed-field gel electrophoresis, but high-resolution examinations of genetic diversity at the whole-genome level have not been well characterized, particularly among clinical isolates derived in the United States. We performed whole-genome sequencing of 11 clinical M. abscessus isolates derived from eight U.S. patients with pulmonary nontuberculous mycobacterial infections, compared them to 30 globally diverse clinical isolates, and investigated intrapatient genomic diversity and evolution. Phylogenomic analyses revealed a cluster of closely related U.S. and Western European M. abscessus subsp. abscessus isolates that are genetically distinct from other European isolates and all Asian isolates. Large-scale variation analyses suggested genome content differences of 0.3 to 8.3%, relative to the reference strain ATCC 19977T. Longitudinally sampled isolates showed very few single-nucleotide polymorphisms and correlated genomic deletion patterns, suggesting homogeneous infection populations. Our study explores the genomic diversity of clinical M. abscessus strains from multiple continents and provides insight into the genome plasticity of an opportunistic pathogen. PMID:25056330

  9. Compare Identity By Sequence Relationships of the Ames Diversity Panel using TYPSimSelector [abstract

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Maize genetic diversity has been exploited by mankind for 10,000 years. Scientific approaches applied to it by breeders for over a century transformed it into the world’s number one crop. Maize genomic diversity provides a rich resource of interest to evolutionary and population geneticists, constit...

  10. SMRT sequencing provides insight into the diversity of the bovine immunoglobulin heavy chain repertoire

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The vertebrate immune system produces a diverse antibody repertoire capable of responding to a vast array of antigens. This diversity is generated through a multifaceted process of gene segment recombination and somatic hypermutation or gene conversion. Recent advances in high-throughput sequencin...

  11. Conversion of amino-acid sequence in proteins to classical music: search for auditory patterns

    PubMed Central

    2007-01-01

    We have converted genome-encoded protein sequences into musical notes to reveal auditory patterns without compromising musicality. We derived a reduced range of 13 base notes by pairing similar amino acids and distinguishing them using variations of three-note chords and codon distribution to dictate rhythm. The conversion will help make genomic coding sequences more approachable for the general public, young children, and vision-impaired scientists. PMID:17477882

  12. Protein location prediction using atomic composition and global features of the amino acid sequence

    SciTech Connect

    Cherian, Betsy Sheena; Nair, Achuthsankar S.

    2010-01-22

    Subcellular location of protein is constructive information in determining its function, screening for drug candidates, vaccine design, annotation of gene products and in selecting relevant proteins for further studies. Computational prediction of subcellular localization deals with predicting the location of a protein from its amino acid sequence. For a computational localization prediction method to be more accurate, it should exploit all possible relevant biological features that contribute to the subcellular localization. In this work, we extracted the biological features from the full length protein sequence to incorporate more biological information. A new biological feature, distribution of atomic composition is effectively used with, multiple physiochemical properties, amino acid composition, three part amino acid composition, and sequence similarity for predicting the subcellular location of the protein. Support Vector Machines are designed for four modules and prediction is made by a weighted voting system. Our system makes prediction with an accuracy of 100, 82.47, 88.81 for self-consistency test, jackknife test and independent data test respectively. Our results provide evidence that the prediction based on the biological features derived from the full length amino acid sequence gives better accuracy than those derived from N-terminal alone. Considering the features as a distribution within the entire sequence will bring out underlying property distribution to a greater detail to enhance the prediction accuracy.

  13. Ab initio detection of fuzzy amino acid tandem repeats in protein sequences

    PubMed Central

    2012-01-01

    Background Tandem repetitions within protein amino acid sequences often correspond to regular secondary structures and form multi-repeat 3D assemblies of varied size and function. Developing internal repetitions is one of the evolutionary mechanisms that proteins employ to adapt their structure and function under evolutionary pressure. While there is keen interest in understanding such phenomena, detection of repeating structures based only on sequence analysis is considered an arduous task, since structure and function is often preserved even under considerable sequence divergence (fuzzy tandem repeats). Results In this paper we present PTRStalker, a new algorithm for ab-initio detection of fuzzy tandem repeats in protein amino acid sequences. In the reported results we show that by feeding PTRStalker with amino acid sequences from the UniProtKB/Swiss-Prot database we detect novel tandemly repeated structures not captured by other state-of-the-art tools. Experiments with membrane proteins indicate that PTRStalker can detect global symmetries in the primary structure which are then reflected in the tertiary structure. Conclusions PTRStalker is able to detect fuzzy tandem repeating structures in protein sequences, with performance beyond the current state-of-the art. Such a tool may be a valuable support to investigating protein structural properties when tertiary X-ray data is not available. PMID:22536906

  14. Development of Genomic Microsatellite Markers in Carthamus tinctorius L. (Safflower) Using Next Generation Sequencing and Assessment of Their Cross-Species Transferability and Utility for Diversity Analysis

    PubMed Central

    Variath, Murali Tottekkad; Joshi, Gopal; Bali, Sapinder; Agarwal, Manu; Kumar, Amar; Jagannath, Arun; Goel, Shailendra

    2015-01-01

    Background Safflower (Carthamus tinctorius L.), an Asteraceae member, yields high quality edible oil rich in unsaturated fatty acids and is resilient to dry conditions. The crop holds tremendous potential for improvement through concerted molecular breeding programs due to the availability of significant genetic and phenotypic diversity. Genomic resources that could facilitate such breeding programs remain largely underdeveloped in the crop. The present study was initiated to develop a large set of novel microsatellite markers for safflower using next generation sequencing. Principal Findings Low throughput genome sequencing of safflower was performed using Illumina paired end technology providing ~3.5X coverage of the genome. Analysis of sequencing data allowed identification of 23,067 regions harboring perfect microsatellite loci. The safflower genome was found to be rich in dinucleotide repeats followed by tri-, tetra-, penta- and hexa-nucleotides. Primer pairs were designed for 5,716 novel microsatellite sequences with repeat length ≥ 20 bases and optimal flanking regions. A subset of 325 microsatellite loci was tested for amplification, of which 294 loci produced robust amplification. The validated primers were used for assessment of 23 safflower accessions belonging to diverse agro-climatic zones of the world leading to identification of 93 polymorphic primers (31.6%). The numbers of observed alleles at each locus ranged from two to four and mean polymorphism information content was found to be 0.3075. The polymorphic primers were tested for cross-species transferability on nine wild relatives of cultivated safflower. All primers except one showed amplification in at least two wild species while 25 primers amplified across all the nine species. The UPGMA dendrogram clustered C. tinctorius accessions and wild species separately into two major groups. The proposed progenitor species of safflower, C. oxyacantha and C. palaestinus were genetically closer to

  15. The amino-acid sequence of leghemoglobin component a from Phaseolus vulgaris (kidney bean).

    PubMed

    Lehtovaara, P; Ellfolk, N

    1975-06-01

    1. Leghemoglobin component a from Phaseolus vulgaris (kidney bean) was digested with trypsin; 15 tryptic peptides and free lysine were purified and the amino acid sequences of the peptides determined. 2. The internal order of the tryptic peptides was determined by the bridge peptides obtained from the thermolytic digest and the dilute acid hydrolyzate of kidney bean leghemoglobin a; 12 thermolytic peptides and two acid hydrolysis peptides were purified and the sequences were partially or completely determined. 3. The complete amino acid sequence of kidney bean leghemoglobin a is compared to that of leghemoglobin a from soybean (Glycine max) and to some animal globins. As regards sequence, the kidney bean globin has 79% identity with the soybean globin and 21% identity with human hemoglobin gamma-chain. Seven of the 14 amino acid residues common to most globins are found in the kidney bean globin. Trp-15 and Tyr-145 are evolutionarily conserved in this globin, which confirms the concept of a common origin of animal and plant globins. PMID:809270

  16. Captured metagenomics: large-scale targeting of genes based on ‘sequence capture’ reveals functional diversity in soils

    PubMed Central

    Manoharan, Lokeshwaran; Kushwaha, Sandeep K.; Hedlund, Katarina; Ahrén, Dag

    2015-01-01

    Microbial enzyme diversity is a key to understand many ecosystem processes. Whole metagenome sequencing (WMG) obtains information on functional genes, but it is costly and inefficient due to large amount of sequencing that is required. In this study, we have applied a captured metagenomics technique for functional genes in soil microorganisms, as an alternative to WMG. Large-scale targeting of functional genes, coding for enzymes related to organic matter degradation, was applied to two agricultural soil communities through captured metagenomics. Captured metagenomics uses custom-designed, hybridization-based oligonucleotide probes that enrich functional genes of interest in metagenomic libraries where only probe-bound DNA fragments are sequenced. The captured metagenomes were highly enriched with targeted genes while maintaining their target diversity and their taxonomic distribution correlated well with the traditional ribosomal sequencing. The captured metagenomes were highly enriched with genes related to organic matter degradation; at least five times more than similar, publicly available soil WMG projects. This target enrichment technique also preserves the functional representation of the soils, thereby facilitating comparative metagenomics projects. Here, we present the first study that applies the captured metagenomics approach in large scale, and this novel method allows deep investigations of central ecosystem processes by studying functional gene abundances. PMID:26490729

  17. Draft genome sequence of the docosahexaenoic acid producing thraustochytrid Aurantiochytrium sp. T66.

    PubMed

    Liu, Bin; Ertesvåg, Helga; Aasen, Inga Marie; Vadstein, Olav; Brautaset, Trygve; Heggeset, Tonje Marita Bjerkan

    2016-06-01

    Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA). Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276), with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data has been deposited at DDBJ/EMBL/Genbank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids. PMID:27222814

  18. HIV type 1 genetic diversity in Moyale, Mandera, and Turkana based on env-C2-V3 sequences.

    PubMed

    Khamadi, Samoel A; Lihana, Raphael W; Mwaniki, D L; Kinyua, Joyceline; Lagat, Nancy; Carter, Jane Y; Ichimura, Hiroshi; Oishi, Isao; Okoth, Fredrick A; Ochieng, Washington

    2008-12-01

    The genetic diversity of HIV-1 subtypes circulating in three districts of northern Kenya, i.e., Turkana, Mandera, and Moyale, was studied. DNA sequences encoding a portion of the env-C2-V3 region of the virus were amplified by PCR and sequenced directly. One hundred and fifty-nine samples were successfully sequenced in the env-C2-V3 region and analyzed. From the analysis, 57% were subtype A1, 27% were subtype C, 9% were subtype D, and the remaining 7% were unclassified. This study showed that HIV-1 subtype A1 was the dominant subtype in circulation in this region, though there was a significant percentage of HIV-1 subtype C in circulation there. PMID:19102688

  19. Adapt globally, act locally: the effect of selective sweeps on bacterial sequence diversity.

    PubMed Central

    Majewski, J; Cohan, F M

    1999-01-01

    Previous studies have shown that genetic exchange in bacteria is too rare to prevent neutral sequence divergence between ecological populations. That is, despite genetic exchange, each population should diverge into its own DNA sequence-similarity cluster. In those studies, each selective sweep was limited to acting within a single ecological population. Here we postulate the existence of globally adaptive mutations, which may confer a selective advantage to all ecological populations constituting a metapopulation. Such adaptations cause global selective sweeps, which purge the divergence both within and between populations. We found that the effect of recurrent global selective sweeps on neutral sequence divergence is highly dependent on the mechanism of genetic exchange. Global selective sweeps can prevent populations from reaching high levels of neutral sequence divergence, but they cannot cause two populations to become identical in neutral sequence characters. The model supports the earlier conclusion that each ecological population of bacteria should form its own distinct DNA sequence-similarity cluster. PMID:10430576

  20. Complete metagenome sequencing based bacterial diversity and functional insights from basaltic hot spring of Unkeshwar, Maharashtra, India.

    PubMed

    Mehetre, Gajanan T; Paranjpe, Aditi S; Dastager, Syed G; Dharne, Mahesh S

    2016-03-01

    Unkeshwar hot springs are located at geographical South East Deccan Continental basalt of India. Here, we report the microbial community analysis of this hot spring using whole metagenome shotgun sequencing approach. The analysis revealed a total of 848,096 reads with 212.87 Mbps with 50.87% G + C content. Metagenomic sequences were deposited in SRA database with accession number (SUB1242219). Community analysis revealed 99.98% sequences belonging to bacteria and 0.01% to archaea and 0.01% to Viruses. The data obtained revealed 41 phyla including bacteria and Archaea and including 719 different species. In taxonomic analysis, the dominant phyla were found as, Actinobacteria (56%), Verrucomicrobia (24%), Bacteriodes (13%), Deinococcus-Thermus (3%) and firmicutes (2%) and Viruses (2%). Furthermore, functional annotation using pathway information revealed dynamic potential of hot spring community in terms of metabolism, environmental information processing, cellular processes and other important aspects. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of each contig sequence by assigning KEGG Orthology (KO) numbers revealed contig sequences that were assigned to metabolism, organismal system, Environmental Information Processing, cellular processes and human diseases with some unclassified sequences. The Unkeshwar hot springs offer rich phylogenetic diversity and metabolic potential for biotechnological applications. PMID:26981391

  1. Complete metagenome sequencing based bacterial diversity and functional insights from basaltic hot spring of Unkeshwar, Maharashtra, India

    PubMed Central

    Mehetre, Gajanan T.; Paranjpe, Aditi S.; Dastager, Syed G.; Dharne, Mahesh S.

    2015-01-01

    Unkeshwar hot springs are located at geographical South East Deccan Continental basalt of India. Here, we report the microbial community analysis of this hot spring using whole metagenome shotgun sequencing approach. The analysis revealed a total of 848,096 reads with 212.87 Mbps with 50.87% G + C content. Metagenomic sequences were deposited in SRA database with accession number (SUB1242219). Community analysis revealed 99.98% sequences belonging to bacteria and 0.01% to archaea and 0.01% to Viruses. The data obtained revealed 41 phyla including bacteria and Archaea and including 719 different species. In taxonomic analysis, the dominant phyla were found as, Actinobacteria (56%), Verrucomicrobia (24%), Bacteriodes (13%), Deinococcus-Thermus (3%) and firmicutes (2%) and Viruses (2%). Furthermore, functional annotation using pathway information revealed dynamic potential of hot spring community in terms of metabolism, environmental information processing, cellular processes and other important aspects. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis of each contig sequence by assigning KEGG Orthology (KO) numbers revealed contig sequences that were assigned to metabolism, organismal system, Environmental Information Processing, cellular processes and human diseases with some unclassified sequences. The Unkeshwar hot springs offer rich phylogenetic diversity and metabolic potential for biotechnological applications. PMID:26981391

  2. Assessing diversity of the female urine microbiota by high throughput sequencing of 16S rDNA amplicons

    PubMed Central

    2011-01-01

    Background Urine within the urinary tract is commonly regarded as "sterile" in cultivation terms. Here, we present a comprehensive in-depth study of bacterial 16S rDNA sequences associated with urine from healthy females by means of culture-independent high-throughput sequencing techniques. Results Sequencing of the V1V2 and V6 regions of the 16S ribosomal RNA gene using the 454 GS FLX system was performed to characterize the possible bacterial composition in 8 culture-negative (<100,000 CFU/ml) healthy female urine specimens. Sequences were compared to 16S rRNA databases and showed significant diversity, with the predominant genera detected being Lactobacillus, Prevotella and Gardnerella. The bacterial profiles in the female urine samples studied were complex; considerable variation between individuals was observed and a common microbial signature was not evident. Notably, a significant amount of sequences belonging to bacteria with a known pathogenic potential was observed. The number of operational taxonomic units (OTUs) for individual samples varied substantially and was in the range of 20 - 500. Conclusions Normal female urine displays a noticeable and variable bacterial 16S rDNA sequence richness, which includes fastidious and anaerobic bacteria previously shown to be associated with female urogenital pathology. PMID:22047020

  3. DNA-binding proteins from marine bacteria expand the known sequence diversity of TALE-like repeats

    PubMed Central

    de Lange, Orlando; Wolf, Christina; Thiel, Philipp; Krüger, Jens; Kleusch, Christian; Kohlbacher, Oliver; Lahaye, Thomas

    2015-01-01

    Transcription Activator-Like Effectors (TALEs) of Xanthomonas bacteria are programmable DNA binding proteins with unprecedented target specificity. Comparative studies into TALE repeat structure and function are hindered by the limited sequence variation among TALE repeats. More sequence-diverse TALE-like proteins are known from Ralstonia solanacearum (RipTALs) and Burkholderia rhizoxinica (Bats), but RipTAL and Bat repeats are conserved with those of TALEs around the DNA-binding residue. We study two novel marine-organism TALE-like proteins (MOrTL1 and MOrTL2), the first to date of non-terrestrial origin. We have assessed their DNA-binding properties and modelled repeat structures. We found that repeats from these proteins mediate sequence specific DNA binding conforming to the TALE code, despite low sequence similarity to TALE repeats, and with novel residues around the BSR. However, MOrTL1 repeats show greater sequence discriminating power than MOrTL2 repeats. Sequence alignments show that there are only three residues conserved between repeats of all TALE-like proteins including the two new additions. This conserved motif could prove useful as an identifier for future TALE-likes. Additionally, comparing MOrTL repeats with those of other TALE-likes suggests a common evolutionary origin for the TALEs, RipTALs and Bats. PMID:26481363

  4. A classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B

    1991-01-01

    The amino acid sequences of 301 glycosyl hydrolases and related enzymes have been compared. A total of 291 sequences corresponding to 39 EC entries could be classified into 35 families. Only ten sequences (less than 5% of the sample) could not be assigned to any family. With the sequences available for this analysis, 18 families were found to be monospecific (containing only one EC number) and 17 were found to be polyspecific (containing at least two EC numbers). Implications on the folding characteristics and mechanism of action of these enzymes and on the evolution of carbohydrate metabolism are discussed. With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised. PMID:1747104

  5. New families in the classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B; Bairoch, A

    1993-01-01

    301 glycosyl hydrolases and related enzymes corresponding to 39 EC entries of the I.U.B. classification system have been classified into 35 families on the basis of amino-acid-sequence similarities [Henrissat (1991) Biochem. J. 280, 309-316]. Approximately half of the families were found to be monospecific (containing only one EC number), whereas the other half were found to be polyspecific (containing at least two EC numbers). A > 60% increase in sequence data for glycosyl hydrolases (181 additional enzymes or enzyme domains sequences have since become available) allowed us to update the classification not only by the addition of more members to already identified families, but also by the finding of ten new families. On the basis of a comparison of 482 sequences corresponding to 52 EC entries, 45 families, out of which 22 are polyspecific, can now be defined. This classification has been implemented in the SWISS-PROT protein sequence data bank. PMID:8352747

  6. Sequence-specific purification of nucleic acids by PNA-controlled hybrid selection.

    PubMed

    Orum, H; Nielsen, P E; Jørgensen, M; Larsson, C; Stanley, C; Koch, T

    1995-09-01

    Using an oligohistidine peptide nucleic acids (oligohistidine-PNA) chimera, we have developed a rapid hybrid selection method that allows efficient, sequence-specific purification of a target nucleic acid. The method exploits two fundamental features of PNA. First, that PNA binds with high affinity and specificity to its complementary nucleic acid. Second, that amino acids are easily attached to the PNA oligomer during synthesis. We show that a (His)6-PNA chimera exhibits strong binding to chelated Ni2+ ions without compromising its native PNA hybridization properties. We further show that these characteristics allow the (His)6-PNA/DNA complex to be purified by the well-established method of metal ion affinity chromatography using a Ni(2+)-NTA (nitrilotriactic acid) resin. Specificity and efficiency are the touchstones of any nucleic acid purification scheme. We show that the specificity of the (His)6-PNA selection approach is such that oligonucleotides differing by only a single nucleotide can be selectively purified. We also show that large RNAs (2224 nucleotides) can be captured with high efficiency by using multiple (His)6-PNA probes. PNA can hybridize to nucleic acids in low-salt concentrations that destabilize native nucleic acid structures. We demonstrate that this property of PNA can be utilized to purify an oligonucleotide in which the target sequence forms part of an intramolecular stem/loop structure. PMID:7495562

  7. Ancient DNA analyses reveal high mitochondrial DNA sequence diversity and parallel morphological evolution of late pleistocene cave bears.

    PubMed

    Hofreiter, Michael; Capelli, Cristian; Krings, Matthias; Waits, Lisette; Conard, Nicholas; Münzel, Susanne; Rabeder, Gernot; Nagel, Doris; Paunovic, Maja; Jambrĕsić, Gordana; Meyer, Sonja; Weiss, Gunter; Pääbo, Svante

    2002-08-01

    Cave bears (Ursus spelaeus) existed in Europe and western Asia until the end of the last glaciation some 10,000 years ago. To investigate the genetic diversity, population history, and relationship among different cave bear populations, we have determined mitochondrial DNA sequences from 12 cave bears that range in age from about 26,500 to at least 49,000 years and originate from nine caves. The samples include one individual from the type specimen population, as well as two small-sized high-Alpine bears. The results show that about 49,000 years ago, the mtDNA diversity among cave bears was about 1.8-fold lower than the current species-wide diversity of brown bears (Ursus arctos). However, the current brown bear mtDNA gene pool consists of three clades, and cave bear mtDNA diversity is similar to the diversity observed within each of these clades. The results also show that geographically separated populations of the high-Alpine cave bear form were polyphyletic with respect to their mtDNA. This suggests that small size may have been an ancestral trait in cave bears and that large size evolved at least twice independently. PMID:12140236

  8. In silico comparative analysis of DNA and amino acid sequences for prion protein gene.

    PubMed

    Kim, Y; Lee, J; Lee, C

    2008-01-01

    Genetic variability might contribute to species specificity of prion diseases in various organisms. In this study, structures of the prion protein gene (PRNP) and its amino acids were compared among species of which sequence data were available. Comparisons of PRNP DNA sequences among 12 species including human, chimpanzee, monkey, bovine, ovine, dog, mouse, rat, wallaby, opossum, chicken and zebrafish allowed us to identify candidate regulatory regions in intron 1 and 3'-untranslated region (UTR) in addition to the coding region. Highly conserved putative binding sites for transcription factors, such as heat shock factor 2 (HSF2) and myocite enhancer factor 2 (MEF2), were discovered in the intron 1. In 3'-UTR, the functional sequence (ATTAAA) for nucleus-specific polyadenylation was found in all the analysed species. The functional sequence (TTTTTAT) for maturation-specific polyadenylation was identically observed only in ovine, and one or two nucleotide mismatches in the other species. A comparison of the amino acid sequences in 53 species revealed a large sequence identity. Especially the octapeptide repeat region was observed in all the species but frog and zebrafish. Functional changes and susceptibility to prion diseases with various isoforms of prion protein could be caused by numeric variability and conformational changes discovered in the repeat sequences. PMID:18397498

  9. Genome sequence analysis of five Canadian isolates of strawberry mottle virus reveals extensive intra-species diversity and a longer RNA2 with increased coding capacity compared to a previously characterized European isolate.

    PubMed

    Bhagwat, Basdeo; Dickison, Virginia; Ding, Xinlun; Walker, Melanie; Bernardy, Michael; Bouthillier, Michel; Creelman, Alexa; DeYoung, Robyn; Li, Yinzi; Nie, Xianzhou; Wang, Aiming; Xiang, Yu; Sanfaçon, Hélène

    2016-06-01

    In this study, we report the genome sequence of five isolates of strawberry mottle virus (family Secoviridae, order Picornavirales) from strawberry field samples with decline symptoms collected in Eastern Canada. The Canadian isolates differed from the previously characterized European isolate 1134 in that they had a longer RNA2, resulting in a 239-amino-acid extension of the C-terminal region of the polyprotein. Sequence analysis suggests that reassortment and recombination occurred among the isolates. Phylogenetic analysis revealed that the Canadian isolates are diverse, grouping in two separate branches along with isolates from Europe and the Americas. PMID:26984225

  10. A High Load of Non-neutral Amino-Acid Polymorphisms Explains High Protein Diversity Despite Moderate Effective Population Size in a Marine Bivalve With Sweepstakes Reproduction

    PubMed Central

    Harrang, Estelle; Lapègue, Sylvie; Morga, Benjamin; Bierne, Nicolas

    2013-01-01

    Marine bivalves show among the greatest allozyme diversity ever reported in Eukaryotes, putting them historically at the heart of the neutralist−selectionist controversy on the maintenance of genetic variation. Although it is now acknowledged that this high diversity is most probably a simple consequence of a large population size, convincing support for this explanation would require a rigorous assessment of the silent nucleotide diversity in natural populations of marine bivalves, which has not yet been done. This study investigated DNA sequence polymorphism in a set of 37 nuclear loci in wild samples of the flat oyster Ostrea edulis. Silent diversity was found to be only moderate (0.7%), and there was no departure from demographic equilibrium under the Wright-Fisher model, suggesting that the effective population size might not be as large as might have been expected. In accordance with allozyme heterozygosity, nonsynonymous diversity was comparatively very high (0.3%), so that the nonsynonymous to silent diversity ratio reached a value rarely observed in any other organism. We estimated that one-quarter of amino acid-changing mutations behave as neutral in O. edulis, and as many as one-third are sufficiently weakly selected to segregate at low frequency in the polymorphism. Finally, we inferred that one oyster is expected to carry more than 4800 non-neutral alleles (or 4.2 cM−1). We conclude that a high load of segregating non-neutral amino-acid polymorphisms contributes to high protein diversity in O. edulis. The high fecundity of marine bivalves together with an unpredictable and highly variable success of reproduction and recruitment (sweepstakes reproduction) might produce a greater decoupling between Ne and N than in other organisms with lower fecundities, and we suggest this could explain why a higher segregating load could be maintained for a given silent mutation effective size. PMID:23390609

  11. Antibody-specific model of amino acid substitution for immunological inferences from alignments of antibody sequences.

    PubMed

    Mirsky, Alexander; Kazandjian, Linda; Anisimova, Maria

    2015-03-01

    Antibodies are glycoproteins produced by the immune system as a dynamically adaptive line of defense against invading pathogens. Very elegant and specific mutational mechanisms allow B lymphocytes to produce a large and diversified repertoire of antibodies, which is modified and enhanced throughout all adulthood. One of these mechanisms is somatic hypermutation, which stochastically mutates nucleotides in the antibody genes, forming new sequences with different properties and, eventually, higher affinity and selectivity to the pathogenic target. As somatic hypermutation involves fast mutation of antibody sequences, this process can be described using a Markov substitution model of molecular evolution. Here, using large sets of antibody sequences from mice and humans, we infer an empirical amino acid substitution model AB, which is specific to antibody sequences. Compared with existing general amino acid models, we show that the AB model provides significantly better description for the somatic evolution of mice and human antibody sequences, as demonstrated on large next generation sequencing (NGS) antibody data. General amino acid models are reflective of conservation at the protein level due to functional constraints, with most frequent amino acids exchanges taking place between residues with the same or similar physicochemical properties. In contrast, within the variable part of antibody sequences we observed an elevated frequency of exchanges between amino acids with distinct physicochemical properties. This is indicative of a sui generis mutational mechanism, specific to antibody somatic hypermutation. We illustrate this property of antibody sequences by a comparative analysis of the network modularity implied by the AB model and general amino acid substitution models. We recommend using the new model for computational studies of antibody sequence maturation, including inference of alignments and phylogenetic trees describing antibody somatic hypermutation in

  12. Using increment of diversity to predict mitochondrial proteins of malaria parasite: integrating pseudo-amino acid composition and structural alphabet.

    PubMed

    Chen, Ying-Li; Li, Qian-Zhong; Zhang, Li-Qing

    2012-04-01

    Due to the complexity of Plasmodium falciparum (PF) genome, predicting mitochondrial proteins of PF is more difficult than other species. In this study, using the n-peptide composition of reduced amino acid alphabet (RAAA) obtained from structural alphabet named Protein Blocks as feature parameter, the increment of diversity (ID) is firstly developed to predict mitochondrial proteins. By choosing the 1-peptide compositions on the N-terminal regions with 20 residues as the only input vector, the prediction performance achieves 86.86% accuracy with 0.69 Mathew's correlation coefficient (MCC) by the jackknife test. Moreover, by combining with the hydropathy distribution along protein sequence and several reduced amino acid alphabets, we achieved maximum MCC 0.82 with accuracy 92% in the jackknife test by using the developed ID model. When evaluating on an independent dataset our method performs better than existing methods. The results indicate that the ID is a simple and efficient prediction method for mitochondrial proteins of malaria parasite. PMID:21191803

  13. [Microbial diversity and ammonia-oxidizing microorganism of a soil sample near an acid mine drainage lake].

    PubMed

    Liu, Ying; Wang, Li-Hua; Hao, Chun-Bo; Li, Lu; Li, Si-Yuan; Feng, Chuan-Ping

    2014-06-01

    The main physicochemical parameters of the soil sample which was collected near an acid mine drainage reservoir in Anhui province was analyzed. The microbial diversity and community structure was studied through the construction of bacteria and archaea 16S rRNA gene clone libraries and ammonia monooxygenase gene clone library of archaea. The functional groups which were responsible for the process of ammonia oxidation were also discussed. The results indicated that the soil sample had extreme low pH value (pH < 3) and high ions concentration, which was influenced by the acid mine drainage (AMD). All the 16S rRNA gene sequences of bacteria clone library fell into 11 phyla, and Acidobacteria played the most significant role in the ecosystem followed by Verrucomicrobia. A great number of acidophilic bacteria existed in the soil sample, such as Candidatus Koribacter versatilis and Holophaga sp.. The archaea clone library consisted of 2 phyla (Thaumarchaeota and Euryarchaeota). The abundance of Thaumarchaeota was remarkably higher than Euryarchaeota. The ammonia oxidation in the soil environment was probably driven by ammonia-oxidizing archaea, and new species of ammonia-oxidizing archaea existed in the soil sample. PMID:25158511

  14. Genetic Diversity and Phylogenetic Evolution of Tibetan Sheep Based on mtDNA D-Loop Sequences.

    PubMed

    Liu, Jianbin; Ding, Xuezhi; Zeng, Yufeng; Yue, Yaojing; Guo, Xian; Guo, Tingting; Chu, Min; Wang, Fan; Han, Jilong; Feng, Ruilin; Sun, Xiaoping; Niu, Chune; Yang, Bohui; Guo, Jian; Yuan, Chao

    2016-01-01

    The molecular and population genetic evidence of the phylogenetic status of the Tibetan sheep (Ovis aries) is not well understood, and little is known about this species' genetic diversity. This knowledge gap is partly due to the difficulty of sample collection. This is the first work to address this question. Here, the genetic diversity and phylogenetic relationship of 636 individual Tibetan sheep from fifteen populations were assessed using 642 complete sequences of the mitochondrial DNA D-loop. Samples were collected from the Qinghai-Tibetan Plateau area in China, and reference data were obtained from the six reference breed sequences available in GenBank. The length of the sequences varied considerably, between 1031 and 1259 bp. The haplotype diversity and nucleotide diversity were 0.992±0.010 and 0.019±0.001, respectively. The average number of nucleotide differences was 19.635. The mean nucleotide composition of the 350 haplotypes was 32.961% A, 29.708% T, 22.892% C, 14.439% G, 62.669% A+T, and 37.331% G+C. Phylogenetic analysis showed that all four previously defined haplogroups (A, B, C, and D) were found in the 636 individuals of the fifteen Tibetan sheep populations but that only the D haplogroup was found in Linzhou sheep. Further, the clustering analysis divided the fifteen Tibetan sheep populations into at least two clusters. The estimation of the demographic parameters from the mismatch analyses showed that haplogroups A, B, and C had at least one demographic expansion in Tibetan sheep. These results contribute to the knowledge of Tibetan sheep populations and will help inform future conservation programs about the Tibetan sheep native to the Qinghai-Tibetan Plateau. PMID:27463976

  15. Genetic Diversity and Phylogenetic Evolution of Tibetan Sheep Based on mtDNA D-Loop Sequences

    PubMed Central

    Yue, Yaojing; Guo, Xian; Guo, Tingting; Chu, Min; Wang, Fan; Han, Jilong; Feng, Ruilin; Sun, Xiaoping; Niu, Chune; Yang, Bohui; Guo, Jian; Yuan, Chao

    2016-01-01

    The molecular and population genetic evidence of the phylogenetic status of the Tibetan sheep (Ovis aries) is not well understood, and little is known about this species’ genetic diversity. This knowledge gap is partly due to the difficulty of sample collection. This is the first work to address this question. Here, the genetic diversity and phylogenetic relationship of 636 individual Tibetan sheep from fifteen populations were assessed using 642 complete sequences of the mitochondrial DNA D-loop. Samples were collected from the Qinghai-Tibetan Plateau area in China, and reference data were obtained from the six reference breed sequences available in GenBank. The length of the sequences varied considerably, between 1031 and 1259 bp. The haplotype diversity and nucleotide diversity were 0.992±0.010 and 0.019±0.001, respectively. The average number of nucleotide differences was 19.635. The mean nucleotide composition of the 350 haplotypes was 32.961% A, 29.708% T, 22.892% C, 14.439% G, 62.669% A+T, and 37.331% G+C. Phylogenetic analysis showed that all four previously defined haplogroups (A, B, C, and D) were found in the 636 individuals of the fifteen Tibetan sheep populations but that only the D haplogroup was found in Linzhou sheep. Further, the clustering analysis divided the fifteen Tibetan sheep populations into at least two clusters. The estimation of the demographic parameters from the mismatch analyses showed that haplogroups A, B, and C had at least one demographic expansion in Tibetan sheep. These results contribute to the knowledge of Tibetan sheep populations and will help inform future conservation programs about the Tibetan sheep native to the Qinghai-Tibetan Plateau. PMID:27463976

  16. High conopeptide diversity in Conus tribblei revealed through analysis of venom duct transcriptome using two high-throughput sequencing platforms

    PubMed Central

    Barghi, Neda; Concepcion, Gisela P.; Olivera, Baldomero M.; Lluisma, Arturo O.

    2015-01-01

    The venom of each species of Conus contains different kinds of pharmacologically-active peptides which are mostly unique to that species. Collectively, the ~500 – 700 species of Conus produce a large number of these peptides, perhaps exceeding 140,000 different types in total. To date, however, only a small fraction of this diversity has been characterized via transcriptome sequencing. In addition, the sampling of this chemical diversity has not been uniform across the different lineages in the genus. In this study, we used high-throughput transcriptome sequencing approach to further investigate the diversity of Conus venom peptides. We chose a species, Conus tribblei, as a representative of a poorly studied clade of Conus. Using the Roche 454 and Illumina platforms, we discovered 136 unique and novel putative conopeptides belonging to 30 known gene superfamilies and 6 new conopeptide groups, the greatest diversity so far observed from a transcriptome. Most of the identified peptides exhibited divergence from the known conopeptides and some contained cysteine frameworks observed for the first time in cone snails. In addition, several enzymes involved in post-translational modification of conopeptides and also some proteins involved in efficient delivery of the conopeptides to prey were identified as well. Interestingly, a number of conopeptides highly similar to the conopeptides identified in a phylogenetically distant species, the generalist feeder Conus californicus, were observed. The high diversity of conopeptides and the presence of conopeptides similar to those in C. californicus suggest that C. tribblei may have a broad range of prey preferences. PMID:25117477

  17. Insights into the Genetic Structure and Diversity of 38 South Asian Indians from Deep Whole-Genome Sequencing

    PubMed Central

    Saw, Woei-Yuh; Ong, Rick Twee-Hee; Cheng, Anthony Youzhi; Pillai, Nisha Esakimuthu; Liu, Xuanyao; Xu, Wenting; Chen, Peng; Foo, Jia-Nee; Tan, Linda Wei-Lin; Koo, Seok-Hwee; Soong, Richie; Wenk, Markus Rene; Lim, Wei-Yen; Khor, Chiea-Chuen; Little, Peter; Chia, Kee-Seng; Teo, Yik-Ying

    2014-01-01

    South Asia possesses a significant amount of genetic diversity due to considerable intergroup differences in culture and language. There have been numerous reports on the genetic structure of Asian Indians, although these have mostly relied on genotyping microarrays or targeted sequencing of the mitochondria and Y chromosomes. Asian Indians in Singapore are primarily descendants of immigrants from Dravidian-language–speaking states in south India, and 38 individuals from the general population underwent deep whole-genome sequencing with a target coverage of 30X as part of the Singapore Sequencing Indian Project (SSIP). The genetic structure and diversity of these samples were compared against samples from the Singapore Sequencing Malay Project and populations in Phase 1 of the 1,000 Genomes Project (1 KGP). SSIP samples exhibited greater intra-population genetic diversity and possessed higher heterozygous-to-homozygous genotype ratio than other Asian populations. When compared against a panel of well-defined Asian Indians, the genetic makeup of the SSIP samples was closely related to South Indians. However, even though the SSIP samples clustered distinctly from the Europeans in the global population structure analysis with autosomal SNPs, eight samples were assigned to mitochondrial haplogroups that were predominantly present in Europeans and possessed higher European admixture than the remaining samples. An analysis of the relative relatedness between SSIP with two archaic hominins (Denisovan, Neanderthal) identified higher ancient admixture in East Asian populations than in SSIP. The data resource for these samples is publicly available and is expected to serve as a valuable complement to the South Asian samples in Phase 3 of 1 KGP. PMID:24832686

  18. Genome-wide survey of genetic diversity of apple using genotyping-by-sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    With the advent of next-generation sequencing technologies it is now possible to generate large numbers of genetic markers without the need to rely on costly microarray platforms. Genotyping-by-Sequencing (GBS) enables the simultaneous identification and genotyping of single nucleotide polymorphisms...

  19. Pretreatment sequence diversity differences in the full-length hepatitis C virus open reading frame correlate with early response to therapy.

    PubMed

    Donlin, Maureen J; Cannon, Nathan A; Yao, Ermei; Li, Jia; Wahed, Abdus; Taylor, Milton W; Belle, Steven H; Di Bisceglie, Adrian M; Aurora, Rajeev; Tavis, John E

    2007-08-01

    Pegylated alpha interferon and ribavirin therapy for hepatitis C virus (HCV) genotype 1 infection fails for half of Caucasian American patients (CA) and more often for African Americans (AA). The reasons for these low response rates are unknown. HCV is highly genetically variable, but it is unknown how this variability affects response to therapy. To assess effects of viral diversity on response to therapy, the complete pretreatment genotype 1 HCV open reading frame was sequenced using samples from 94 participants in the Virahep-C study. Sequences from patients with >3.5 log declines in viral RNA levels by day 28 (marked responders) were more variable than those from patients with declines of <1.4 log (poor responders) in NS3 and NS5A for genotype 1a and in core and NS3 for genotype 1b. These correlations remained when all T-cell epitopes were excluded, indicating that these differences were not due to differential immune selection. When the sequences were compared by race of the patients, higher diversity in CA patients was found in E2 and NS2 but only for genotype 1b. Core, NS3, and NS5A can block the action of alpha interferon in vitro; hence, these genetic patterns are consistent with multiple amino acid variations independently impairing the function of HCV proteins that counteract interferon responses in humans, resulting in HCV strains with variable sensitivity to therapy. No evidence was found for novel HCV strains in the AA population, implying that AA patients may be infected with a higher proportion of the same resistant strains that are found in CA patients. PMID:17522222

  20. Amino acid sequence of a vitamin K-dependent Ca2+-binding peptide from bovine prothrombin.

    PubMed

    Howard, J B; Fausch, M D

    1975-08-10

    The amino acid sequence of a 31-residue peptide from bovine prothrombin has been determined. This peptide has been shown to contain the vitamin K-dependent modification required for Ca2+ binding (Nelsestuen, G. L., and Suttie, J. W. (1973) Proc. Natl. Acad. Sci. U. S. A. 70, 3366-3370) and the modified amino acid, gamma-carboxyglutamic acid (Nelsestuen, G. L., Zytkovicz, T., and Howard, J. B. (1974) J. Biol. Chem. 249, 6347-6350). The peptide was shown to correspond to residues 12 to 42 of prothrombin. PMID:807581

  1. Amino acid sequences around the cysteine residues of rabbit muscle triose phosphate isomerase

    PubMed Central

    Miller, Janet C.; Waley, S. G.

    1971-01-01

    1. The nature of the subunits in rabbit muscle triose phosphate isomerase has been investigated. 2. Amino acid analyses show that there are five cysteine residues and two methionine residues/subunit. 3. The amino acid sequences around the cysteine residues have been determined; these account for about 75 residues. 4. Cleavage at the methionine residues with cyanogen bromide gave three fragments. 5. These results show that the subunits correspond to polypeptide chains, containing about 230 amino acid residues. The chains in triose phosphate isomerase seem to be shorter than those of other glycolytic enzymes. PMID:5165707

  2. Complete genome sequencing and comparative genomic analysis of functionally diverse Lysinibacillus sphaericus III(3)7.

    PubMed

    Rey, Andrés; Silva-Quintero, Laura; Dussán, Jenny

    2016-09-01

    Lysinibacillus sphaericus III(3)7 is a native Colombian strain, the first one isolated from soil samples. This strain has shown high levels of pathogenic activity against Culex quinquefaciatus larvae in laboratory assays compared to other members of the same species. Using Pacific Biosciences sequencing technology we sequenced, annotated (de novo) and described the genome of strain III(3)7, achieving a complete genome sequence status. We then performed a comparative analysis between the newly sequenced genome and the ones previously reported for Colombian isolates L. sphaericus OT4b.31, CBAM5 and OT4b.25, with the inclusion of L. sphaericus C3-41 that has been used as a reference genome for most of previous genome sequencing projects. We concluded that L. sphaericus III(3)7 is highly similar with strain OT4b.25 and shares high levels of synteny with isolates CBAM5 and C3-41. PMID:27419068

  3. Complete amino acid sequence of the Mu heavy chain of a human IgM immunoglobulin.

    PubMed

    Putnam, F W; Florent, G; Paul, C; Shinoda, T; Shimizu, A

    1973-10-19

    The amino acid sequence of the micro, chain of a human IgM immunoglobulin, including the location of all disulfide bridges and oligosaccharides, has been determined. The homology of the constant regions of immunoglobulin micro, gamma, alpha, and epsilon heavy chains reveals evolutionary relationships and suggests that two genes code for each heavy chain. PMID:4742735

  4. Draft Genome Sequence of the Butyric Acid Producer Clostridium tyrobutyricum Strain CIP I-776 (IFP923)

    PubMed Central

    Clément, Benjamin; Lopes Ferreira, Nicolas

    2016-01-01

    Here, we report the draft genome sequence of Clostridium tyrobutyricum CIP I-776 (IFP923), an efficient producer of butyric acid. The genome consists of a single chromosome of 3.19 Mb and provides useful data concerning the metabolic capacities of the strain. PMID:26941139

  5. Draft Genome Sequence of Perfluorooctane Acid-Degrading Bacterium Pseudomonas parafulva YAB-1

    PubMed Central

    Tang, Chongjian; Peng, Qingjing; Peng, Qingzhong

    2015-01-01

    Pseudomonas parafulva YAB-1, isolated from perfluorinated compound-contaminated soil, has the ability to degrade perfluorooctane acid (PFOA) compound. Here, we report the draft genome sequence and annotation of the PFOA-degrading bacterium P. parafulva YAB-1. The data provide the basis to investigate the molecular mechanism of PFOA metabolism. PMID:26337877

  6. α,β-Unsaturated monoterpene acid glucose esters: structural diversity, bioactivities and functional roles.

    PubMed

    Goodger, Jason Q D; Woodrow, Ian E

    2011-12-01

    The glycosylation of lipophilic small molecules produces many important plant secondary metabolites. The majority of these are O-glycosides with relatively fewer occurring as glucose esters of aromatic or aliphatic acids. In particular, monoterpene acid glucose esters have much lower structural diversity and distribution compared to monoterpene glycosides. Nevertheless, there have been over 20 monoterpene acid glucose esters described from trees in the genus Eucalyptus (Myrtaceae) in recent years, all based on oleuropeic acid, menthiafolic acid or both. Here we review all of the glucose esters containing these monoterpenoids identified in plants to date. Many of the compounds contain phenolic aglycones and all contain at least one α,β-unsaturated carbonyl, affording a number of important potential therapeutic reactivities such as anti-tumor promotion, carcinogenesis suppression, and anti-oxidant and anti-inflammatory activities. Additional properties such as cytotoxicity, bitterness, and repellency are suggestive of a role in plant defence, but we also discuss their localization to the exterior of foliar secretory cavity lumina, and suggest they may also protect secretory cells from toxic terpenes housed within these structures. Finally we discuss how the use of a recently developed protocol to isolate secretory cavities in a functional state could be used in conjunction with systems biology approaches to help characterize their biosynthesis and roles in plants. PMID:21945720

  7. Legionella species diversity in an acidic biofilm community in Yellowstone National Park.

    PubMed

    Sheehan, Kathy B; Henson, Joan M; Ferris, Michael J

    2005-01-01

    Legionella species are frequently detected in aquatic environments, but their occurrence in extreme, acidic, geothermal habitats has not been explored with cultivation-independent methods. We investigated a predominately eukaryotic algal mat community in a pH 2.7 geothermal stream in Yellowstone National Park for the presence of Legionella and potential host amoebae. Our analyses, using PCR amplification with Legionella-specific primers targeting 16S rRNA genes, detected four known Legionella species, as well as Legionella sequences from species that are not represented in sequence databases, in mat samples and cultivated isolates. The nonrandom occurrence of sequences detected at lower (30 degrees C) and higher (35 to 38 degrees C) temperatures suggests that natural thermal gradients in the stream influence Legionella species distributions in this mat community. We detected only one sequence, Legionella micdadei, from cultivated isolates. We cultured and sequenced partial 18S rRNA gene regions from two potential hosts, Acanthamoeba and Euglena species. PMID:15640227

  8. The amino acid sequence of cytochrome c-555 from the methane-oxidizing bacterium Methylococcus capsulatus.

    PubMed Central

    Ambler, R P; Dalton, H; Meyer, T E; Bartsch, R G; Kamen, M D

    1986-01-01

    The amino acid sequence of the cytochrome c-555 from the obligate methanotroph Methylococcus capsulatus strain Bath (N.C.I.B. 11132) was determined. It is a single polypeptide chain of 96 residues, binding a haem group through the cysteine residues at positions 19 and 22, and the only methionine residue is a position 59. The sequence does not closely resemble that of any other cytochrome c that has yet been characterized. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50131 (12 pages) at the British Library Lending Division, Boston Spa, West Yorkshire LS23 7BQ, U.K., from whom copies are available on prepayment. PMID:3006666

  9. Asymmetric Synthesis of Diverse Glycolic Acid Scaffolds via Dynamic Kinetic Resolution of α-Keto Esters

    PubMed Central

    Steward, Kimberly M.; Corbett, Michael T.; Goodman, C. Guy; Johnson, Jeffrey S.

    2012-01-01

    The dynamic kinetic resolution of α-keto esters via asymmetric transfer hydrogenation has been developed as a technique for the highly stereoselective construction of structurally diverse β-substituted-α-hydroxy carboxylic acid derivatives. Through the development of a privileged m-terphenylsulfonamide for (arene)RuCl(monosulfonamide) complexes with a high affinity for selective α-keto ester reduction, excellent levels of chemo-, diastereo-, and enantiocontrol can be realized in the reduction of β-aryl- and β-chloro-α-keto esters. PMID:23186551

  10. Tuning the Lewis acid phenol [Formula: see text]-prenylation as a molecular diversity tool.

    PubMed

    Jäger, Sebastián N; Porta, Exequiel O J; Labadie, Guillermo R

    2016-05-01

    A diversity-oriented approach for the synthesis of various structurally different prenylated alcohols from readily accessible and common precursors was developed. With varying approaches, this article describes some successful examples of a Friedel-Crafts alkylation using methoxyphenols and different prenyl alcohols (geraniol and (E,E)-farnesol). We demonstrated that just by varying the stoichiometry of the Lewis acid used, the course of the reaction can be shifted to produce the alkylated or the cyclized product. Eighteen unique products were obtained with good isolated yields by direct alkylation with or without a consecutive [Formula: see text]-cationic cyclization. PMID:26525879

  11. Allelic polymorphism in arabian camel ribonuclease and the amino acid sequence of bactrian camel ribonuclease.

    PubMed

    Welling, G W; Mulder, H; Beintema, J J

    1976-04-01

    Pancreatic ribonucleases from several species (whitetail deer, roe deer, guinea pig, and arabian camel) exhibit more than one amino acid at particular positions in their amino acid sequences. Since these enzymes were isolated from pooled pancreas, the origin of this heterogeneity is not clear. The pancreatic ribonucleases from 11 individual arabian camels (Camelus dromedarius) have been investigated with respect to the lysine-glutamine heterogeneity at position 103 (Welling et al., 1975). Six ribonucleases showed only one basic band and five showed two bands after polyacrylamide gel electrophoresis, suggesting a gene frequency of about 0.75 for the Lys gene and about 0.25 for the Gln gene. The amino acid sequence of bactrian camel (Camelus bactrianus) ribonuclease isolated from individual pancreatic tissue was determined and compared with that of arabian camel ribonuclease. The only difference was observed at position 103. In the ribonucleases from two unrelated bactrian camels, only glutamine was observed at that position. PMID:962846

  12. Comparative population genetics of the panicoid grasses: sequence polymorphism, linkage disequilibrium and selection in a diverse sample of sorghum bicolor.

    PubMed Central

    Hamblin, Martha T; Mitchell, Sharon E; White, Gemma M; Gallego, Javier; Kukatla, Rakesh; Wing, Rod A; Paterson, Andrew H; Kresovich, Stephen

    2004-01-01

    Levels of genetic variation and linkage disequilibrium (LD) are critical factors in association mapping methods as well as in identification of loci that have been targets of selection. Maize, an outcrosser, has a high level of sequence variation and a limited extent of LD. Sorghum, a closely related but largely self-pollinating panicoid grass, is expected to have higher levels of LD. As a first step in estimation of population genetic parameters in sorghum, we surveyed 27 diverse S. bicolor accessions for sequence variation at a total of 29,186 bp in 95 short regions derived from genetically mapped RFLPs located throughout the genome. Consistent with its higher level of inbreeding, the extent of LD is at least severalfold greater in sorghum than in maize. Total sequence variation in sorghum is about fourfold lower than that in maize, while synonymous variation is fivefold lower, suggesting a smaller effective population size in sorghum. Because we surveyed a species-wide sample, the mating system, which primarily affects population-level diversity, may not be primarily responsible for this difference. Comparisons of polymorphism and divergence suggest that both directional and diversifying selection have played important roles in shaping variation in the sorghum genome. PMID:15166170

  13. High-Throughput Sequencing Analysis of the Endophytic Bacterial Diversity and Dynamics in Roots of the Halophyte Salicornia europaea.

    PubMed

    Zhao, Shuai; Zhou, Na; Zhao, Zheng-Yong; Zhang, Ke; Tian, Chang-Yan

    2016-05-01

    Endophytic bacterial communities of halophyte Salicornia europaea roots were analyzed by 16S rRNA gene pyrosequencing. A total of 20,151 partial 16S rRNA gene sequences were obtained. These sequences revealed huge amounts of operational taxonomic units (OTUs), that is, 747-1405 OTUs in a root sample, at 3 % cut-off level. Root endophytes mainly comprised four phyla, among which Proteobacteria was the most represented, followed by Bacteroidetes, Actinobacteria, and Firmicutes. Gammaproteobacteria was the most abundant class of Proteobacteria, followed by Betaproteobacteria and Alphaproteobacteria. Genera Pantoea, Halomonas, Azomonas, Serpens, and Pseudomonas were shared by all growth periods. A marked difference in endophytic bacterial communities was evident in roots from different host life-history stages. Gammaproteobacteria increased during the five periods, while Betaproteobacteria decreased. The richest endophytic bacteria diversity was detected in the seedling stage. Endophytic bacteria diversity was reduced during the flowering stage and fruiting stage. The five libraries contained 2321 different OTUs with 41 OTUs in common. As a whole, this study first surveys communities of endophytic bacteria by tracing crucial stages in the process of halophyte growth using high-throughput sequencing methods. PMID:26787546

  14. Genetic diversity of Taenia asiatica from Thailand and other geographical locations as revealed by cytochrome c oxidase subunit 1 sequences.

    PubMed

    Anantaphruti, Malinee Thairungroj; Thaenkham, Urusa; Watthanakulpanich, Dorn; Phuphisut, Orawan; Maipanich, Wanna; Yoonuan, Tippayarat; Nuamtanong, Supaporn; Pubampen, Somjit; Sanguankiat, Surapol

    2013-02-01

    Twelve 924 bp cytochrome c oxidase subunit 1 (cox1) mitochondrial DNA sequences from Taenia asiatica isolates from Thailand were aligned and compared with multiple sequence isolates from Thailand and 6 other countries from the GenBank database. The genetic divergence of T. asiatica was also compared with Taenia saginata database sequences from 6 different countries in Asia, including Thailand, and 3 countries from other continents. The results showed that there were minor genetic variations within T. asiatica species, while high intraspecies variation was found in T. saginata. There were only 2 haplotypes and 1 polymorphic site found in T. asiatica, but 8 haplotypes and 9 polymorphic sites in T. saginata. Haplotype diversity was very low, 0.067, in T. asiatica and high, 0.700, in T. saginata. The very low genetic diversity suggested that T. asiatica may be at a risk due to the loss of potential adaptive alleles, resulting in reduced viability and decreased responses to environmental changes, which may endanger the species. PMID:23467439

  15. Deciphering the Diversities of Astroviruses and Noroviruses in Wastewater Treatment Plant Effluents by a High-Throughput Sequencing Method.

    PubMed

    Prevost, B; Lucas, F S; Ambert-Balay, K; Pothier, P; Moulin, L; Wurtzer, S

    2015-10-01

    Although clinical epidemiology lists human enteric viruses to be among the primary causes of acute gastroenteritis in the human population, their circulation in the environment remains poorly investigated. These viruses are excreted by the human population into sewers and may be released into rivers through the effluents of wastewater treatment plants (WWTPs). In order to evaluate the viral diversity and loads in WWTP effluents of the Paris, France, urban area, which includes about 9 million inhabitants (approximately 15% of the French population), the seasonal occurrence of astroviruses and noroviruses in 100 WWTP effluent samples was investigated over 1 year. The coupling of these measurements with a high-throughput sequencing approach allowed the specific estimation of the diversity of human astroviruses (human astrovirus genotype 1 [HAstV-1], HAstV-2, HAstV-5, and HAstV-6), 7 genotypes of noroviruses (NoVs) of genogroup I (NoV GI.1 to NoV GI.6 and NoV GI.8), and 16 genotypes of NoVs of genogroup II (NoV GII.1 to NoV GII.7, NoV GII.9, NoV GII.12 to NoV GII.17, NoV GII.20, and NoV GII.21) in effluent samples. Comparison of the viral diversity in WWTP effluents to the viral diversity found by analysis of clinical data obtained throughout France underlined the consistency between the identified genotypes. However, some genotypes were locally present in effluents and were not found in the analysis of the clinical data. These findings could highlight an underestimation of the diversity of enteric viruses circulating in the human population. Consequently, analysis of WWTP effluents could allow the exploration of viral diversity not only in environmental waters but also in a human population linked to a sewerage network in order to better comprehend viral epidemiology and to forecast seasonal outbreaks. PMID:26253673

  16. Deciphering the Diversities of Astroviruses and Noroviruses in Wastewater Treatment Plant Effluents by a High-Throughput Sequencing Method

    PubMed Central

    Prevost, B.; Lucas, F. S.; Ambert-Balay, K.; Pothier, P.; Wurtzer, S.

    2015-01-01

    Although clinical epidemiology lists human enteric viruses to be among the primary causes of acute gastroenteritis in the human population, their circulation in the environment remains poorly investigated. These viruses are excreted by the human population into sewers and may be released into rivers through the effluents of wastewater treatment plants (WWTPs). In order to evaluate the viral diversity and loads in WWTP effluents of the Paris, France, urban area, which includes about 9 million inhabitants (approximately 15% of the French population), the seasonal occurrence of astroviruses and noroviruses in 100 WWTP effluent samples was investigated over 1 year. The coupling of these measurements with a high-throughput sequencing approach allowed the specific estimation of the diversity of human astroviruses (human astrovirus genotype 1 [HAstV-1], HAstV-2, HAstV-5, and HAstV-6), 7 genotypes of noroviruses (NoVs) of genogroup I (NoV GI.1 to NoV GI.6 and NoV GI.8), and 16 genotypes of NoVs of genogroup II (NoV GII.1 to NoV GII.7, NoV GII.9, NoV GII.12 to NoV GII.17, NoV GII.20, and NoV GII.21) in effluent samples. Comparison of the viral diversity in WWTP effluents to the viral diversity found by analysis of clinical data obtained throughout France underlined the consistency between the identified genotypes. However, some genotypes were locally present in effluents and were not found in the analysis of the clinical data. These findings could highlight an underestimation of the diversity of enteric viruses circulating in the human population. Consequently, analysis of WWTP effluents could allow the exploration of viral diversity not only in environmental waters but also in a human population linked to a sewerage network in order to better comprehend viral epidemiology and to forecast seasonal outbreaks. PMID:26253673

  17. Morphological and sequence-related amplified polymorphism-based molecular diversity of local and exotic wheat genotypes.

    PubMed

    Abdelkhalik, S M; Salem, A K M; Abdelaziz, A R; Ammar, M H

    2016-01-01

    Assessing genetic diversity is a prerequisite for the genetic improvement of wheat. Molecular markers offer accurate and reproducible means for assessing genetic diversity. Field performance and sequence-related amplified polymorphism (SRAP)-based assessment of molecular diversity was carried out on a set of 10 local and introduced bread wheat (Triticum sativum L.) genotypes grown in the middle arid region of Saudi Arabia. The results revealed highly significant differences among the studied phenological traits and revealed a significant amount of genetic diversity across the tested genotypes. The overall performance revealed the superiority of KSU 102 in terms of yield and its components, with a yield potential of 8.7 tons/ha. Highly significant and positive correlations were observed among grain yield and biological yield, and also, spike length and spike weight. Thirteen SRAP primer combinations successfully amplified 954 fragments. The total number of genetic loci analyzed was 312. The overall polymorphism ratio was 99.67%, ranging from 98 to 100%. The polymorphic information content values ranged from 0.67 for ME11 x EM5 to 0.97 for ME9 x EM4 and ME11 x EM6, respectively. The wheat genotypes were clustered based on their genetic constitution and origin. The results demonstrate the power of SRAP primers for detecting molecular diversity and for varietal discrimination. The results show that high levels of genetic diversity exist, and suggest the potential of the tested materials for wheat crop improvement in the arid central region of Saudi Arabia. PMID:27173279

  18. Diversity analysis in Cannabis sativa based on large-scale development of expressed sequence tag-derived simple sequence repeat markers.

    PubMed

    Gao, Chunsheng; Xin, Pengfei; Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

    2014-01-01

    Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551

  19. Diversity Analysis in Cannabis sativa Based on Large-Scale Development of Expressed Sequence Tag-Derived Simple Sequence Repeat Markers

    PubMed Central

    Cheng, Chaohua; Tang, Qing; Chen, Ping; Wang, Changbiao; Zang, Gonggu; Zhao, Lining

    2014-01-01

    Cannabis sativa L. is an important economic plant for the production of food, fiber, oils, and intoxicants. However, lack of sufficient simple sequence repeat (SSR) markers has limited the development of cannabis genetic research. Here, large-scale development of expressed sequence tag simple sequence repeat (EST-SSR) markers was performed to obtain more informative genetic markers, and to assess genetic diversity in cannabis (Cannabis sativa L.). Based on the cannabis transcriptome, 4,577 SSRs were identified from 3,624 ESTs. From there, a total of 3,442 complementary primer pairs were designed as SSR markers. Among these markers, trinucleotide repeat motifs (50.99%) were the most abundant, followed by hexanucleotide (25.13%), dinucleotide (16.34%), tetranucloetide (3.8%), and pentanucleotide (3.74%) repeat motifs, respectively. The AAG/CTT trinucleotide repeat (17.96%) was the most abundant motif detected in the SSRs. One hundred and seventeen EST-SSR markers were randomly selected to evaluate primer quality in 24 cannabis varieties. Among these 117 markers, 108 (92.31%) were successfully amplified and 87 (74.36%) were polymorphic. Forty-five polymorphic primer pairs were selected to evaluate genetic diversity and relatedness among the 115 cannabis genotypes. The results showed that 115 varieties could be divided into 4 groups primarily based on geography: Northern China, Europe, Central China, and Southern China. Moreover, the coefficient of similarity when comparing cannabis from Northern China with the European group cannabis was higher than that when comparing with cannabis from the other two groups, owing to a similar climate. This study outlines the first large-scale development of SSR markers for cannabis. These data may serve as a foundation for the development of genetic linkage, quantitative trait loci mapping, and marker-assisted breeding of cannabis. PMID:25329551

  20. Use of a structural alphabet to find compatible folds for amino acid sequences

    PubMed Central

    Mahajan, Swapnil; de Brevern, Alexandre G; Sanejouand, Yves-Henri; Srinivasan, Narayanaswamy; Offmann, Bernard

    2015-01-01

    The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence-search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino-acid sequence of a protein to fit to known protein 3D structures using a structural alphabet, known as “Protein Blocks” (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. It is used to encode 3D protein structures into 1D PB sequences and to capture sequence to structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence-search methods or explicit incorporation of hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z-score cutoff that achieved 95% specificity (i.e., less than 5% false positives), global and local threading showed sensitivity of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in PDB: 38 compatible templates were identified by our approach and 66% of these hits yielded correctly predicted structures. This method scales-up well and offers promising perspectives for structural annotations at genomic level. It has been implemented in the form of a web-server that is freely available at http://www.bo-protscience.fr/forsa. PMID:25297700

  1. Use of a structural alphabet to find compatible folds for amino acid sequences.

    PubMed

    Mahajan, Swapnil; de Brevern, Alexandre G; Sanejouand, Yves-Henri; Srinivasan, Narayanaswamy; Offmann, Bernard

    2015-01-01

    The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence-search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino-acid sequence of a protein to fit to known protein 3D structures using a structural alphabet, known as "Protein Blocks" (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. It is used to encode 3D protein structures into 1D PB sequences and to capture sequence to structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence-search methods or explicit incorporation of hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z-score cutoff that achieved 95% specificity (i.e., less than 5% false positives), global and local threading showed sensitivity of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in PDB: 38 compatible templates were identified by our approach and 66% of these hits yielded correctly predicted structures. This method scales-up well and offers promising perspectives for structural annotations at genomic level. It has been implemented in the form of a web-server that is freely available at http://www.bo-protscience.fr/forsa. PMID:25297700

  2. Diverse Array of New Viral Sequences Identified in Worldwide Populations of the Asian Citrus Psyllid (Diaphorina citri) Using Viral Metagenomics

    PubMed Central

    Nouri, Shahideh; Salem, Nidá; Nigg, Jared C.

    2015-01-01

    ABSTRACT The Asian citrus psyllid, Diaphorina citri, is the natural vector of the causal agent of Huanglongbing (HLB), or citrus greening disease. Together; HLB and D. citri represent a major threat to world citrus production. As there is no cure for HLB, insect vector management is considered one strategy to help control the disease, and D. citri viruses might be useful. In this study, we used a metagenomic approach to analyze viral sequences associated with the global population of D. citri. By sequencing small RNAs and the transcriptome coupled with bioinformatics analysis, we showed that the virus-like sequences of D. citri are diverse. We identified novel viral sequences belonging to the picornavirus superfamily, the Reoviridae, Parvoviridae, and Bunyaviridae families, and an unclassified positive-sense single-stranded RNA virus. Moreover, a Wolbachia prophage-related sequence was identified. This is the first comprehensive survey to assess the viral community from worldwide populations of an agricultural insect pest. Our results provide valuable information on new putative viruses, some of which may have the potential to be used as biocontrol agents. IMPORTANCE Insects have the most species of all animals, and are hosts to, and vectors of, a great variety of known and unknown viruses. Some of these most likely have the potential to be important fundamental and/or practical resources. In this study, we used high-throughput next-generation sequencing (NGS) technology and bioinformatics analysis to identify putative viruses associated with Diaphorina citri, the Asian citrus psyllid. D. citri is the vector of the bacterium causing Huanglongbing (HLB), currently the most serious threat to citrus worldwide. Here, we report several novel viral sequences associated with D. citri. PMID:26676774

  3. On the use of high-throughput sequencing for the study of cyanobacterial diversity in Antarctic aquatic mats.

    PubMed

    Pessi, Igor Stelmach; Maalouf, Pedro De Carvalho; Laughinghouse, Haywood Dail; Baurain, Denis; Wilmotte, Annick

    2016-06-01

    The study of Antarctic cyanobacterial diversity has been mostly limited to morphological identification and traditional molecular techniques. High-throughput sequencing (HTS) allows a much better understanding of microbial distribution in the environment, but its application is hampered by several methodological and analytical challenges. In this work, we explored the use of HTS as a tool for the study of cyanobacterial diversity in Antarctic aquatic mats. Our results highlight the importance of using artificial communities to validate the parameters of the bioinformatics procedure used to analyze natural communities, since pipeline-dependent biases had a strong effect on the observed community structures. Analysis of microbial mats from five Antarctic lakes and an aquatic biofilm from the Sub-Antarctic showed that HTS is a valuable tool for the assessment of cyanobacterial diversity. The majority of the operational taxonomic units retrieved were related to filamentous taxa such as Leptolyngbya and Phormidium, which are common genera in Antarctic lacustrine microbial mats. However, other phylotypes related to different taxa such as Geitlerinema, Pseudanabaena, Synechococcus, Chamaesiphon, Calothrix, and Coleodesmium were also found. Results revealed a much higher diversity than what had been reported using traditional methods and also highlighted remarkable differences between the cyanobacterial communities of the studied lakes. The aquatic biofilm from the Sub-Antarctic had a distinct cyanobacterial community from the Antarctic lakes, which in turn displayed a salinity-dependent community structure at the phylotype level. PMID:27273529

  4. Genetic Diversity of Arabica Coffee (Coffea arabica L.) in Nicaragua as Estimated by Simple Sequence Repeat Markers

    PubMed Central

    Geleta, Mulatu; Herrera, Isabel; Monzón, Arnulfo; Bryngelsson, Tomas

    2012-01-01

    Coffea arabica L. (arabica coffee), the only tetraploid species in the genus Coffea, represents the majority of the world's coffee production and has a significant contribution to Nicaragua's economy. The present paper was conducted to determine the genetic diversity of arabica coffee in Nicaragua for its conservation and breeding values. Twenty-six populations that represent eight varieties in Nicaragua were investigated using simple sequence repeat (SSR) markers. A total of 24 alleles were obtained from the 12 loci investigated across 260 individual plants. The total Nei's gene diversity (HT) and the within-population gene diversity (HS) were 0.35 and 0.29, respectively, which is comparable with that previously reported from other countries and regions. Among the varieties, the highest diversity was recorded in the variety Catimor. Analysis of variance (AMOVA) revealed that about 87% of the total genetic variation was found within populations and the remaining 13% differentiate the populations (FST = 0.13; P < 0.001). The variation among the varieties was also significant. The genetic variation in Nicaraguan coffee is significant enough to be used in the breeding programs, and most of this variation can be conserved through ex situ conservation of a low number of populations from each variety. PMID:22701376

  5. Natural variation in Brachypodium disctachyon: Deep Sequencing of Highly Diverse Natural Accessions (2013 DOE JGI Genomics of Energy and Environment 8th Annual User Meeting)

    SciTech Connect

    Gordon, Sean

    2013-03-01

    Sean Gordon of the USDA on "Natural variation in Brachypodium disctachyon: Deep Sequencing of Highly Diverse Natural Accessions" at the 8th Annual Genomics of Energy & Environment Meeting on March 27, 2013 in Walnut Creek, Calif.

  6. Diversity of host species and strains of Pneumocystis carinii is based on rRNA sequences.

    PubMed Central

    Shah, J S; Pieciak, W; Liu, J; Buharin, A; Lane, D J

    1996-01-01

    We have amplified by PCR Pneumocystis carinii cytoplasmic small-subunit rRNA (variously referred to as 16S-like or 18S-like rRNA) genes from DNA extracted from bronchoalveolar lavage and induced sputum specimens from patients positive for P. carinii and from infected ferret lung tissue. The amplification products were cloned into pUC18, and individual clones were sequenced. Comparison of the determined sequences with each other and with published rat and partial human P.carinii small-subunit rRNA gene sequences reveals that, although all P. carinii small-subunit rRNAs are closely related (approximately 96% identity), small-subunit rRNA genes isolated from different host species (human, rat, and ferret) exhibit distinctive patterns of sequence variation. Two types of sequences were isolated from the infected ferret lung tissue, one as a predominant species and the other as a minor species. There was 96% identity between the two types. In situ hybridization of the infected ferret lung tissue with oligonucleotide probes specific for each type revealed that there were two distinct strains of P. carinii present in the ferret lung tissue. Unlike the ferret P. carinii isolates, the small-subunit rRNA gene sequences from different human P. carinii isolates have greater than 99% identity and are distinct from all rat and ferret sequences so far inspected or reported in the literature. Southern blot hybridization analysis of PCR amplification products from several additional bronchoalveolar lavage or induced sputum specimens from P. carinii-infected patients, using a 32P-labeled oligonucleotide probe specific for human P. carinii, also suggests that all of the human P. carinii isolates are identical. These findings indicate that human P. carinii isolates may represent a distinct species of P. carinii distinguishable from rat and ferret P. carinii on the basis of characterization of small-subunit rRNA gene sequences. PMID:8770515

  7. Unbiased analysis by high throughput sequencing of the viral diversity in fetal bovine serum and trypsin used in cell culture.

    PubMed

    Gagnieur, Léa; Cheval, Justine; Gratigny, Marlène; Hébert, Charles; Muth, Erika; Dumarest, Marine; Eloit, Marc

    2014-05-01

    Fetal bovine serum (FBS) and trypsin are reagents used in cell culture and have been the source of viral contamination of pharmaceutical products. We performed high throughput sequencing (HTS) of two pools of commercial batches of FBS and three commercial batches of trypsin. Taxonomies were assigned by comparing sequences of contigs and singletons to the entire NCBI nucleic acid and protein databases. The same major viral species were evidenced between batches of a given reagent but the proportion of viral reads among total reads varied markedly between samples (from 0.002% to 22.7%). In FBS, the sequences found were mainly from bovine viral diarrhea virus (BVDV) 1 to 3 and bovine parvovirus 3 (BPV3). The BVDV sequences derived from FBS showed only minor discrepancies with primers generally used for the screening of BVDV. Viral sequences in trypsin were mainly from porcine circovirus type 2. Other known viral sequences at lower read counts and potential new viral species (bovine parvovirus and bovine pegivirus) were evidenced. The load of some known and new viruses detected by HTS could be quantified by qPCR. Results of HTS provide a framework for evaluating the pertinence of control measures including the design of PCRs, bioassays and inactivation procedures. PMID:24661556

  8. Software scripts for quality checking of high-throughput nucleic acid sequencers.

    PubMed

    Lazo, G R; Tong, J; Miller, R; Hsia, C; Rausch, C; Kang, Y; Anderson, O D

    2001-06-01

    We have developed a graphical interface to allow the researcher to view and assess the quality of sequencing results using a series of program scripts developed to process data generated by automated sequencers. The scripts are written in Perl programming language and are executable under the cgibin directory of a Web server environment. The scripts direct nucleic acid sequencing trace file data output from automated sequencers to be analyzed by the phred molecular biology program and are displayed as graphical hypertext mark-up language (HTML) pages. The scripts are mainly designed to handle 96-well microtiter dish samples, but the scripts are also able to read data from 384-well microtiter dishes 96 samples at a time. The scripts may be customized for different laboratory environments and computer configurations. Web links to the sources and discussion page are provided. PMID:11414222

  9. Diversity and dynamics of dominant and rare bacterial taxa in replicate sequencing batch reactors operated under different solids retention time.

    PubMed

    Bagchi, Samik; Tellez, Berenice G; Rao, Hari Ananda; Lamendella, Regina; Saikaly, Pascal E

    2015-03-01

    In this study, 16S rRNA gene pyrosequencing was applied in order to provide a better insight on the diversity and dynamics of total, dominant, and rare bacterial taxa in replicate lab-scale sequencing batch reactors (SBRs) operated at different solids retention time (SRT). Rank-abundance curves showed few dominant operational taxonomic units (OTUs) and a long tail of rare OTUs in all reactors. Results revealed that there was no detectable effect of SRT (2 vs. 10 days) on Shannon diversity index and OTU richness of both dominant and rare taxa. Nonmetric multidimensional scaling analysis showed that the total, dominant, and rare bacterial taxa were highly dynamic during the entire period of stable reactor performance. Also, the rare taxa were more dynamic than the dominant taxa despite expected low invasion rates because of the use of sterile synthetic media. PMID:25326778

  10. Assessment of genetic diversity by simple sequence repeat markers among forty elite varieties in the germplasm for malting barley breeding*

    PubMed Central

    Wang, Jun-mei; Yang, Jian-ming; Zhu, Jing-huan; Jia, Qiao-jun; Tao, Yue-zhi

    2010-01-01

    The genetic diversity and relationship among 40 elite barley varieties were analyzed based on simple sequence repeat (SSR) genotyping data. The amplified fragments from SSR primers were highly polymorphic in the barley accessions investigated. A total of 85 alleles were detected at 35 SSR loci, and allelic variations existed at 29 SSR loci. The allele number per locus ranged from 1 to 5 with an average of 2.4 alleles per locus detected from the 40 barley accessions. A cluster analysis based on the genetic similarity coefficients was conducted and the 40 varieties were classified into two groups. Seven malting barley varieties from China fell into the same subgroup. It was found that the genetic diversity within the Chinese malting barley varieties was narrower than that in other barley germplasm sources, suggesting the importance and feasibility of introducing elite genotypes from different origins for malting barley breeding in China. PMID:20872987

  11. Multilocus sequence typing approach for a broader range of species of Leishmania genus: describing parasite diversity in Argentina.

    PubMed

    Marco, Jorge D; Barroso, Paola A; Locatelli, Fabricio M; Cajal, S Pamela; Hoyos, Carlos L; Nevot, M Cecilia; Lauthier, Juan J; Tomasini, Nicolás; Juarez, Marisa; Estévez, J Octavio; Korenaga, Masataka; Nasser, Julio R; Hashiguchi, Yoshihisa; Ruybal, Paula

    2015-03-01

    Leishmaniasis is a vector-borne protozoan infection affecting over 350 million people around the world. In Argentina cutaneous leishmaniasis is endemic in nine provinces and visceral leishmaniasis is spreading from autochthonous transmission foci in seven provinces. However, there is limited information about the diversity of the parasite in this country. Implementation of molecular strategies for parasite typing, particularly multilocus sequence typing (MLST), represents an improved approach for genetic variability and population dynamics analyses. We selected six loci as candidates implemented in reference strains and Argentinean isolates. Phylogenetic analysis showed high correlation with taxonomic classification of the parasite. Autochthonous Leishmania (Viannia) braziliensis showed higher genetic diversity than L. (Leishmania) infantum but low support was obtained for intra-L. braziliensis complex variants suggesting the need of new loci that contribute to phylogenetic resolution for an improved MLST or nested-MLST scheme. This study represents the first characterization of genetic variability of Leishmania spp. in Argentina. PMID:25558029

  12. Staphylococcus epidermidis pan-genome sequence analysis reveals diversity of skin commensal and hospital infection-associated isolates

    PubMed Central

    2012-01-01

    Background While Staphylococcus epidermidis is commonly isolated from healthy human skin, it is also the most frequent cause of nosocomial infections on indwelling medical devices. Despite its importance, few genome sequences existed and the most frequent hospital-associated lineage, ST2, had not been fully sequenced. Results We cultivated 71 commensal S. epidermidis isolates from 15 skin sites and compared them with 28 nosocomial isolates from venous catheters and blood cultures. We produced 21 commensal and 9 nosocomial draft genomes, and annotated and compared their gene content, phylogenetic relatedness and biochemical functions. The commensal strains had an open pan-genome with 80% core genes and 20% variable genes. The variable genome was characterized by an overabundance of transposable elements, transcription factors and transporters. Biochemical diversity, as assayed by antibiotic resistance and in vitro biofilm formation, demonstrated the varied phenotypic consequences of this genomic diversity. The nosocomial isolates exhibited both large-scale rearrangements and single-nucleotide variation. We showed that S. epidermidis genomes separate into two phylogenetic groups, one consisting only of commensals. The formate dehydrogenase gene, present only in commensals, is a discriminatory marker between the two groups. Conclusions Commensal skin S. epidermidis have an open pan-genome and show considerable diversity between isolates, even when derived from a single individual or body site. For ST2, the most common nosocomial lineage, we detect variation between three independent isolates sequenced. Finally, phylogenetic analyses revealed a previously unrecognized group of S. epidermidis strains characterized by reduced virulence and formate dehydrogenase, which we propose as a clinical molecular marker. PMID:22830599

  13. Genetic diversity, genetic structure and demographic history of Cycas simplicipinna (Cycadaceae) assessed by DNA sequences and SSR markers

    PubMed Central

    2014-01-01

    Background Cycas simplicipinna (T. Smitinand) K. Hill. (Cycadaceae) is an endangered species in China. There were seven populations and 118 individuals that we could collect were genotyped in this study. Here, we assessed the genetic diversity, genetic structure and demographic history of this species. Results Analyses of data of DNA sequences (two maternally inherited intergenic spacers of chloroplast, cpDNA and one biparentally inherited internal transcribed spacer region ITS4-ITS5, nrDNA) and sixteen microsatellite loci (SSR) were conducted in the species. Of the 118 samples, 86 individuals from the seven populations were used for DNA sequencing and 115 individuals from six populations were used for the microsatellite study. We found high genetic diversity at the species level, low genetic diversity within each of the seven populations and high genetic differentiation among the populations. There was a clear genetic structure within populations of C. simplicipinna. A demographic history inferred from DNA sequencing data indicates that C. simplicipinna experienced a recent population contraction without retreating to a common refugium during the last glacial period. The results derived from SSR data also showed that C. simplicipinna underwent past effective population contraction, likely during the Pleistocene. Conclusions Some genetic features of C. simplicipinna such as having high genetic differentiation among the populations, a clear genetic structure and a recent population contraction could provide guidelines for protecting this endangered species from extinction. Furthermore, the genetic features with population dynamics of the species in our study would help provide insights and guidelines for protecting other endangered species effectively. PMID:25016306

  14. Genetic diversity in wild and cultivated black raspberry evaluated by simple sequence repeat markers

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Breeding progress in black raspberry (Rubus occidentalis L.) has been limited by a lack of genetic diversity in elite germplasm. Black raspberry cultivars have been noted for showing very few phenotypic differences and seedlings from crosses between cultivars for a lack of segregation for important ...

  15. Genetic Diversity in Lens Species Revealed by EST and Genomic Simple Sequence Repeat Analysis.

    PubMed

    Dikshit, Harsh Kumar; Singh, Akanksha; Singh, Dharmendra; Aski, Muraleedhar Sidaram; Prakash, Prapti; Jain, Neelu; Meena, Suresh; Kumar, Shiv; Sarker, Ashutosh

    2015-01-01

    Low productivity of pilosae type lentils grown in South Asia is attributed to narrow genetic base of the released cultivars which results in susceptibility to biotic and abiotic stresses. For enhancement of productivity and production, broadening of genetic base is essentially required. The genetic base of released cultivars can be broadened by using diverse types including bold seeded and early maturing lentils from Mediterranean region and related wild species. Genetic diversity in eighty six accessions of three species of genus Lens was assessed based on twelve genomic and thirty one EST-SSR markers. The evaluated set of genotypes included diverse lentil varieties and advanced breeding lines from Indian programme, two early maturing ICARDA lines and five related wild subspecies/species endemic to the Mediterranean region. Genomic SSRs exhibited higher polymorphism in comparison to EST SSRs. GLLC 598 produced 5 alleles with highest gene diversity value of 0.80. Among the studied subspecies/species 43 SSRs detected maximum number of alleles in L. orientalis. Based on Nei's genetic distance cultivated lentil L. culinaris subsp. culinaris was found to be close to its wild progenitor L. culinaris subsp. orientalis. The Prichard's structure of 86 genotypes distinguished different subspecies/species. Higher variability was recorded among individuals within population than among populations. PMID:26381889

  16. NextGen sequencing reveals short double crossovers contribute disproportionately to genetic diversity in Toxoplasma gondii

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Toxoplasma gondii is a widespread protozoan parasite of animals that causes zoonotic disease in humans. Three clonal variants predominate in North America and Europe, while South American strains are genetically diverse, and undergo more frequent recombination. All three northern clonal variants s...

  17. Unusually high genetic diversity in COI sequences of Chimarra obscura (Trichoptera: Philopotamidae)

    EPA Science Inventory

    Chimarra obscura (Walker 1852) is a philopotamid caddisfly found throughout much of North America. Using the COI DNA barcode locus, we have found unexpectedly high amounts of genetic diversity and distances within C. obscura. Of the approximately 150 specimens sampled, we have fo...

  18. Genome Sequence of a Diverse Goose Circovirus Recovered from Greylag Goose

    PubMed Central

    Stenzel, Tomasz; Farkas, Kata

    2015-01-01

    A diverse goose circovirus (GoCV) genome was recovered from a wild hunted greylag goose (Anser anser) in Poland. The genome shares 83% pairwise identity with other GoCV genomes recovered from various geese from China, Germany, and Taiwan. PMID:26227589

  19. Genetic Diversity in Lens Species Revealed by EST and Genomic Simple Sequence Repeat Analysis

    PubMed Central

    Dikshit, Harsh Kumar; Singh, Akanksha; Singh, Dharmendra; Aski, Muraleedhar Sidaram; Prakash, Prapti; Jain, Neelu; Meena, Suresh; Kumar, Shiv; Sarker, Ashutosh

    2015-01-01

    Low productivity of pilosae type lentils grown in South Asia is attributed to narrow genetic base of the released cultivars which results in susceptibility to biotic and abiotic stresses. For enhancement of productivity and production, broadening of genetic base is essentially required. The genetic base of released cultivars can be broadened by using diverse types including bold seeded and early maturing lentils from Mediterranean region and related wild species. Genetic diversity in eighty six accessions of three species of genus Lens was assessed based on twelve genomic and thirty one EST-SSR markers. The evaluated set of genotypes included diverse lentil varieties and advanced breeding lines from Indian programme, two early maturing ICARDA lines and five related wild subspecies/species endemic to the Mediterranean region. Genomic SSRs exhibited higher polymorphism in comparison to EST SSRs. GLLC 598 produced 5 alleles with highest gene diversity value of 0.80. Among the studied subspecies/species 43 SSRs detected maximum number of alleles in L. orientalis. Based on Nei’s genetic distance cultivated lentil L. culinaris subsp. culinaris was found to be close to its wild progenitor L. culinaris subsp. orientalis. The Prichard’s structure of 86 genotypes distinguished different subspecies/species. Higher variability was recorded among individuals within population than among populations. PMID:26381889

  20. Association of simple sequence repeat (SSR) markers with submergence tolerance in diverse population of perennial ryegrass

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Submergence stress can cause the death of turfgrass plants. Identification of association between molecular markers and submergence tolerance-related traits facilitates an efficient selection of the tolerant cultivars for commercial production. A global collection of 99 diverse perennial ryegrass (L...

  1. Genetic diversity of human immunodeficiency virus type 2: evidence for distinct sequence subtypes with differences in virus biology.

    PubMed Central

    Gao, F; Yue, L; Robertson, D L; Hill, S C; Hui, H; Biggar, R J; Neequaye, A E; Whelan, T M; Ho, D D; Shaw, G M

    1994-01-01

    The virulence properties of human immunodeficiency virus type 2 (HIV-2) are known to vary significantly and to range from relative attenuation in certain individuals to high-level pathogenicity in others. These differences in clinical manifestations may, at least in part, be determined by genetic differences among infecting virus strains. Evaluation of the full spectrum of HIV-2 genetic diversity is thus a necessary first step towards understanding its molecular epidemiology, natural history of infection, and biological diversity. In this study, we have used nested PCR techniques to amplify viral sequences from the DNA of uncultured peripheral blood mononuclear cells from 12 patients with HIV-2 seroreactivity. Sequence analysis of four nonoverlapping genomic regions allowed a comprehensive analysis of HIV-2 phylogeny. The results revealed (i) the existence of five distinct and roughly equidistant evolutionary lineages of HIV-2 which, by analogy with HIV-1, have been termed sequence subtypes A to E; (ii) evidence for a mosaic HIV-2 genome, indicating that coinfection with genetically divergent strains and recombination can occur in HIV-2-infected individuals; and (iii) evidence supporting the conclusion that some of the HIV-2 subtypes may have arisen from independent introductions of genetically diverse sooty mangabey viruses into the human population. Importantly, only a subset of HIV-2 strains replicated in culture: all subtype A viruses grew to high titers, but attempts to isolate representatives of subtypes C, D, and E, as well as the majority of subtype B viruses, remained unsuccessful. Infection with all five viral subtypes was detectable by commercially available serological (Western immunoblot) assays, despite intersubtype sequence differences of up to 25% in the gag, pol, and env regions. These results indicate that the genetic and biological diversity of HIV-2 is far greater than previously appreciated and suggest that there may be subtype

  2. Diversion of phagosome trafficking by pathogenic Rhodococcus equi depends on mycolic acid chain length.

    PubMed

    Sydor, Tobias; von Bargen, Kristine; Hsu, Fong-Fu; Huth, Gitta; Holst, Otto; Wohlmann, Jens; Becken, Ulrike; Dykstra, Tobias; Söhl, Kristina; Lindner, Buko; Prescott, John F; Schaible, Ulrich E; Utermöhlen, Olaf; Haas, Albert

    2013-03-01

    Rhodococcus equi is a close relative of Mycobacterium spp. and a facultative intracellular pathogen which arrests phagosome maturation in macrophages before the late endocytic stage. We have screened a transposon mutant library of R. equi for mutants with decreased capability to prevent phagolysosome formation. This screen yielded a mutant in the gene for β-ketoacyl-(acyl carrier protein)-synthase A (KasA), a key enzyme of the long-chain mycolic acid synthesizing FAS-II system. The longest kasA mutant mycolic acid chains were 10 carbon units shorter than those of wild-type bacteria. Coating of non-pathogenic E. coli with purified wild-type trehalose dimycolate reduced phagolysosome formation substantially which was not the case with shorter kasA mutant-derived trehalose dimycolate. The mutant was moderately attenuated in macrophages and in a mouse infection model, but was fully cytotoxic.Whereas loss of KasA is lethal in mycobacteria, R. equi kasA mutant multiplication in broth was normal proving that long-chain mycolic acid compounds are not necessarily required for cellular integrity and viability of the bacteria that typically produce them. This study demonstrates a central role of mycolic acid chain length in diversion of trafficking by R. equi. PMID:23078612

  3. Nucleotide and predicted amino acid sequences of cloned human and mouse preprocathepsin B cDNAs.

    PubMed Central

    Chan, S J; San Segundo, B; McCormick, M B; Steiner, D F

    1986-01-01

    Cathepsin B is a lysosomal thiol proteinase that may have additional extralysosomal functions. To further our investigations on the structure, mode of biosynthesis, and intracellular sorting of this enzyme, we have determined the complete coding sequences for human and mouse preprocathepsin B by using cDNA clones isolated from human hepatoma and kidney phage libraries. The nucleotide sequences predict that the primary structure of preprocathepsin B contains 339 amino acids organized as follows: a 17-residue NH2-terminal prepeptide sequence followed by a 62-residue propeptide region, 254 residues in mature (single chain) cathepsin B, and a 6-residue extension at the COOH terminus. A comparison of procathepsin B sequences from three species (human, mouse, and rat) reveals that the homology between the propeptides is relatively conserved with a minimum of 68% sequence identity. In particular, two conserved sequences in the propeptide that may be functionally significant include a potential glycosylation site and the presence of a single cysteine at position 59. Comparative analysis of the three sequences also suggests that processing of procathepsin B is a multistep process, during which enzymatically active intermediate forms may be generated. The availability of the cDNA clones will facilitate the identification of possible active or inactive intermediate processive forms as well as studies on the transcriptional regulation of the cathepsin B gene. PMID:3463996

  4. Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization.

    PubMed

    Anahtar, Melis N; Bowman, Brittany A; Kwon, Douglas S

    2016-01-01

    There is a growing appreciation for the role of microbial communities as critical modulators of human health and disease. High throughput sequencing technologies have allowed for the rapid and efficient characterization of bacterial communities using 16S rRNA gene sequencing from a variety of sources. Although readily available tools for 16S rRNA sequence analysis have standardized computational workflows, sample processing for DNA extraction remains a continued source of variability across studies. Here we describe an efficient, robust, and cost effective method for extracting nucleic acid from swabs. We also delineate downstream methods for 16S rRNA gene sequencing, including generation of sequencing libraries, data quality control, and sequence analysis. The workflow can accommodate multiple samples types, including stool and swabs collected from a variety of anatomical locations and host species. Additionally, recovered DNA and RNA can be separated and used for other applications, including whole genome sequencing or RNA-seq. The method described allows for a common processing approach for multiple sample types and accommodates downstream analysis of genomic, metagenomic and transcriptional information. PMID:27168460

  5. Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization

    PubMed Central

    Anahtar, Melis N.; Bowman, Brittany A.; Kwon, Douglas S.

    2016-01-01

    There is a growing appreciation for the role of microbial communities as critical modulators of human health and disease. High throughput sequencing technologies have allowed for the rapid and efficient characterization of bacterial communities using 16S rRNA gene sequencing from a variety of sources. Although readily available tools for 16S rRNA sequence analysis have standardized computational workflows, sample processing for DNA extraction remains a continued source of variability across studies. Here we describe an efficient, robust, and cost effective method for extracting nucleic acid from swabs. We also delineate downstream methods for 16S rRNA gene sequencing, including generation of sequencing libraries, data quality control, and sequence analysis. The workflow can accommodate multiple samples types, including stool and swabs collected from a variety of anatomical locations and host species. Additionally, recovered DNA and RNA can be separated and used for other applications, including whole genome sequencing or RNA-seq. The method described allows for a common processing approach for multiple sample types and accommodates downstream analysis of genomic, metagenomic and transcriptional information. PMID:27168460

  6. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Patel, Kamlesh D [Ken]; SNL,

    2013-01-25

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology " at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  7. The amino acid sequence of ribonuclease U2 from Ustilago sphaerogena.

    PubMed Central

    Sato, S; Uchida, T

    1975-01-01

    1. RNAase (ribonuclease) U2, a purine-specific RNAase, was reduced, aminoethylated and hydrolysed with trypsin, chymotrypsin and thermolysin. On the basis of the analyses of the resulting peptides, the complete amino acid sequence of RNAase U2 was determined, 2. When the sequence was compared with the amino acid sequence of RNAase T1 (EC 3.1.4.8), the following regions were found to be similar in the two enzymes; Tyr-Pro-His-Gln-Tyr (38-42) in RNAase U2 and Tyr-Pro-His-Lys-Tyr (38-42) in RNAase T1, Glu-Phe-Pro-Leu-Val (61-65) in RNAase U2 and Glu-Trp-Pro-Ile-Leu (58-62) in RNAase T1, Asp-Arg-Val-Ile-Tyr-Gln (83-88) in RNAase U2 and Asp-Arg-Val-Phe-Asn (76-81) in RNAase T1 and Val-Thr-His-Thr-Gly-Ala (98-103) in RNAase U2 and Ile-Thr-His-Thr-Gly-Ala (90-95) in RNAase T1. All of the amino acid residues, histidine-40, glutamate-58, arginine-77 and histidine-92, which were found to play a crucial role in the biological activity of RNAase T1, were included in the regions cited here. 3. Detailed evidence for the amino acid sequence of the sequence of the proteins has been deposited as Supplementary Publication SUP 50041 (33 PAGES) AT THE British Library (Lending Division)(formerly the National Lending Library for Science and Technology), Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1975), 145, 5. PMID:1156364

  8. Deduced amino acid sequence of human pulmonary surfactant proteolipid: SPL(pVal)

    SciTech Connect

    Whitsett, J.A.; Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.; Clark, J.; Pilot-Matias, T.; Meuth, J.; Fox, J.L.

    1987-05-01

    Hydrophobic, proteolipid-like protein of Mr 6500 was isolated from ether/ethanol extracts of human, canine and bovine pulmonary surfactant. Amino acid composition of the protein demonstrated a remarkable abundance of hydrophobic residues, particularly valine and leucine. The N-terminal amino acid sequence of the human protein was determined: N-Leu-Ile-Pro-Cys-Cys-Pro-Val-Asn-Leu-Lys-Arg-Leu-Leu-Ile-Val4... An oligonucleotide probe was used to screen an adult human lung cDNA library and resulted in detection of cDNA clones with predicted amino acid sequence with close identity to the N-terminal amino acid sequence of the human peptide. SPL(pVal) was found within the reading frame of a larger peptide. SPL(pVal) results from proteolytic processing of a larger preprotein. Northern blot analysis detected in a single 1.0 kilobase SPL(pVal) RNA which was less abundant in fetal than in adult lung. Mixtures of purified canine and bovine SPL(pVal) and synthetic phospholipids display properties of rapid adsorption and surface tension lowering activity characteristic of surfactant. Human SPL(pVal) is a pulmonary surfactant proteolipid which may therefore be useful in combination with phospholipids and/or other surfactant proteins for the treatment of surfactant deficiency such as hyaline membrane disease in newborn infants.

  9. Complete nucleic acid sequence of Penaeus stylirostris densovirus (PstDNV) from India.

    PubMed

    Rai, Praveen; Safeena, Muhammed P; Karunasagar, Iddya; Karunasagar, Indrani

    2011-06-01

    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) of shrimp, recently been classified as Penaeus stylirostris densovirus (PstDNV). The complete nucleic acid sequence of PstDNV from India was obtained by cloning and sequencing of different DNA fragment of the virus. The genome organisation of PstDNV revealed that there were three major coding domains: a left ORF (NS1) of 2001 bp, a mid ORF (NS2) of 1092 bp and a right ORF (VP) of 990 bp. The complete genome and amino acid sequences of three proteins viz., NS1, NS2 and VP were compared with the genomes of the virus reported from Hawaii, China and Mexico and with partial sequence available from isolates from different regions. The phylogenetic analysis of shrimp, insect and vertebrate parvovirus sequences showed that the Indian PstDNV isolate is phylogenetically more closely related to one of the three isolates from Taiwan (AY355307), and two isolates (AY362547 and AY102034) from Thailand. PMID:21402111

  10. Human liver type pyruvate kinase: complete amino acid sequence and the expression in mammalian cells.

    PubMed Central

    Tani, K; Fujii, H; Nagata, S; Miwa, S

    1988-01-01

    Pyruvate kinase (PK) has four isozymes (L, R, M1, M2) that are encoded by two different genes. Among these isozymes, abnormalities of liver (L)-type PK is considered to be associated with hereditary nonspherocytic hemolytic anemia in humans. We isolated and determined the full-length sequence of human L-type PK cDNA. The cDNA contains 1629 base pairs encoding 543 amino acids, 68 base pairs of 5'-noncoding sequence, and 734 base pairs of 3'-noncoding sequence. The similarity between human and rat L-type PK was 86.9% at the nucleotide sequence level and 92.4% at the amino acid sequence level. The full-length L-type PK cDNA was placed under the promoter of simian virus 40 and introduced into monkey COS cells. Human L-type PK activity was detected in the extract of COS cells by the classical PK electrophoresis method. Images PMID:3126495

  11. Human liver type pyruvate kinase: Complete amino acid sequence and the expression in mammalian cells

    SciTech Connect

    Tani, Kenzaburo; Nagata, Shigekazu ); Fujii, Hisaichi ); Miwa, Shiro )

    1988-03-01

    Pyruvate kinase (PK) has four isozymes (L, R, M{sub 1}, M{sub 2}) that are encoded by two different genes. Among these isozymes, abnormalities of liver (L)-type PK is considered to be associated with hereditary nonspherocytic hemolytic anemia in humans. The authors isolated and determined the full-length sequence of human L-type PK cDNA. The cDNA contains 1,629 base pairs encoding 543 amino acids, 68 base pairs of 5{prime}-noncoding sequence, and 734 base pairs of 3{prime}-noncoding sequence. The similarity between human and rat L-type PK was 86.9% at the nucleotide sequence level and 92.4% at the amino acid sequence level. The full-length L-type PK cDNA was placed under the promoter of simian virus 40 and introduced into monkey COS cells. Human L-type PK activity was detected in the extract of COS cells by the classical PK electrophoresis method.

  12. Application of Ion Torrent Sequencing to the Assessment of the Effect of Alkali Ballast Water Treatment on Microbial Community Diversity

    PubMed Central

    Fujimoto, Masanori; Moyerbrailean, Gregory A.; Noman, Sifat; Gizicki, Jason P.; Ram, Michal L.; Green, Phyllis A.; Ram, Jeffrey L.

    2014-01-01

    The impact of NaOH as a ballast water treatment (BWT) on microbial community diversity was assessed using the 16S rRNA gene based Ion Torrent sequencing with its new 400 base chemistry. Ballast water samples from a Great Lakes ship were collected from the intake and discharge of both control and NaOH (pH 12) treated tanks and were analyzed in duplicates. One set of duplicates was treated with the membrane-impermeable DNA cross-linking reagent propidium mono-azide (PMA) prior to PCR amplification to differentiate between live and dead microorganisms. Ion Torrent sequencing generated nearly 580,000 reads for 31 bar-coded samples and revealed alterations of the microbial community structure in ballast water that had been treated with NaOH. Rarefaction analysis of the Ion Torrent sequencing data showed that BWT using NaOH significantly decreased microbial community diversity relative to control discharge (p<0.001). UniFrac distance based principal coordinate analysis (PCoA) plots and UPGMA tree analysis revealed that NaOH-treated ballast water microbial communities differed from both intake communities and control discharge communities. After NaOH treatment, bacteria from the genus Alishewanella became dominant in the NaOH-treated samples, accounting for <0.5% of the total reads in intake samples but more than 50% of the reads in the treated discharge samples. The only apparent difference in microbial community structure between PMA-processed and non-PMA samples occurred in intake water samples, which exhibited a significantly higher amount of PMA-sensitive cyanobacteria/chloroplast 16S rRNA than their corresponding non-PMA total DNA samples. The community assembly obtained using Ion Torrent sequencing was comparable to that obtained from a subset of samples that were also subjected to 454 pyrosequencing. This study showed the efficacy of alkali ballast water treatment in reducing ballast water microbial diversity and demonstrated the application of new Ion Torrent

  13. Application of ion torrent sequencing to the assessment of the effect of alkali ballast water treatment on microbial community diversity.

    PubMed

    Fujimoto, Masanori; Moyerbrailean, Gregory A; Noman, Sifat; Gizicki, Jason P; Ram, Michal L; Green, Phyllis A; Ram, Jeffrey L

    2014-01-01

    The impact of NaOH as a ballast water treatment (BWT) on microbial community diversity was assessed using the 16S rRNA gene based Ion Torrent sequencing with its new 400 base chemistry. Ballast water samples from a Great Lakes ship were collected from the intake and discharge of both control and NaOH (pH 12) treated tanks and were analyzed in duplicates. One set of duplicates was treated with the membrane-impermeable DNA cross-linking reagent propidium mono-azide (PMA) prior to PCR amplification to differentiate between live and dead microorganisms. Ion Torrent sequencing generated nearly 580,000 reads for 31 bar-coded samples and revealed alterations of the microbial community structure in ballast water that had been treated with NaOH. Rarefaction analysis of the Ion Torrent sequencing data showed that BWT using NaOH significantly decreased microbial community diversity relative to control discharge (p<0.001). UniFrac distance based principal coordinate analysis (PCoA) plots and UPGMA tree analysis revealed that NaOH-treated ballast water microbial communities differed from both intake communities and control discharge communities. After NaOH treatment, bacteria from the genus Alishewanella became dominant in the NaOH-treated samples, accounting for <0.5% of the total reads in intake samples but more than 50% of the reads in the treated discharge samples. The only apparent difference in microbial community structure between PMA-processed and non-PMA samples occurred in intake water samples, which exhibited a significantly higher amount of PMA-sensitive cyanobacteria/chloroplast 16S rRNA than their corresponding non-PMA total DNA samples. The community assembly obtained using Ion Torrent sequencing was comparable to that obtained from a subset of samples that were also subjected to 454 pyrosequencing. This study showed the efficacy of alkali ballast water treatment in reducing ballast water microbial diversity and demonstrated the application of new Ion Torrent

  14. Simultaneous Presence of Insertion Sequence Excision Enhancer and Insertion Sequence IS629 Correlates with Increased Diversity and Virulence in Shiga Toxin-Producing Escherichia coli

    PubMed Central

    Toro, M.; Rump, L. V.; Cao, G.; Meng, J.; Brown, E. W.

    2015-01-01

    Although new serotypes of enterohemorrhagic Escherichia coli (EHEC) emerge constantly, the mechanisms by which these new pathogens arise and the reasons emerging serotypes tend to carry more virulence genes than other E. coli are not understood. An insertion sequence (IS) excision enhancer (IEE) was discovered in EHEC O157:H7 that promoted the excision of IS3 family members and generating various genomic deletions. One IS3 family member, IS629, actively transposes and proliferates in EHEC O157:H7 and enterotoxigenic E. coli (ETEC) O139 and O149. The simultaneous presence of the IEE and IS629 (and other IS3 family members) may be part of a system promoting not only adaptation and genome diversification in E. coli O157:H7 but also contributing to the development of pathogenicity among predominant serotypes. Prevalence comparisons of these elements in 461 strains, representing 72 different serotypes and 5 preassigned seropathotypes (SPT) A to E, showed that the presence of these two elements simultaneously was serotype specific and associated with highly pathogenic serotypes (O157 and top non-O157 Shiga toxin-producing Escherichia coli [STEC]) implicated in outbreaks and sporadic cases of human illness (SPT A and B). Serotypes lacking one or both elements were less likely to have been isolated from clinical cases. Our comparisons of IEE sequences showed sequence variations that could be divided into at least three clusters. Interestingly, the IEE sequences from O157 and the top 10 non-O157 STEC serotypes fell into clusters I and II, while less commonly isolated serotypes O5 and O174 fell into cluster III. These results suggest that IS629 and IEE elements may be acting synergistically to promote genome plasticity and genetic diversity among STEC strains, enhancing their abilities to adapt to hostile environments and rapidly take up virulence factors. PMID:26292302

  15. Diversity in neural firing dynamics supports both rigid and learned hippocampal sequences.

    PubMed

    Grosmark, Andres D; Buzsáki, György

    2016-03-25

    Cell assembly sequences during learning are "replayed" during hippocampal ripples and contribute to the consolidation of episodic memories. However, neuronal sequences may also reflect preexisting dynamics. We report that sequences of place-cell firing in a novel environment are formed from a combination of the contributions of a rigid, predominantly fast-firing subset of pyramidal neurons with low spatial specificity and limited change across sleep-experience-sleep and a slow-firing plastic subset. Slow-firing cells, rather than fast-firing cells, gained high place specificity during exploration, elevated their association with ripples, and showed increased bursting and temporal coactivation during postexperience sleep. Thus, slow- and fast-firing neurons, although forming a continuous distribution, have different coding and plastic properties. PMID:27013730

  16. Diversity in neural firing dynamics supports both rigid and learned hippocampal sequences

    PubMed Central

    Grosmark, Andres D.; Buzsáki, György

    2016-01-01

    Cell assembly sequences during learning are “replayed” during hippocampal ripples and contribute to the consolidation of episodic memories. However, neuronal sequences may also reflect preexisting dynamics. We report that sequences of place-cell firing in a novel environment are formed from a combination of the contributions of a rigid, predominantly fast-firing subset of pyramidal neurons with low spatial specificity and limited change across sleep-experience-sleep and a slow-firing plastic subset. Slow-firing cells, rather than fast-firing cells, gained high place specificity during exploration, elevated their association with ripples, and showed increased bursting and temporal coactivation during postexperience sleep. Thus, slow- and fast-firing neurons, although forming a continuous distribution, have different coding and plastic properties. PMID:27013730

  17. Conservation of the C-type lectin fold for massive sequence variation in a Treponema diversity-generating retroelement

    SciTech Connect

    Le Coq, Johanne; Ghosh, Partho

    2012-06-19

    Anticipatory ligand binding through massive protein sequence variation is rare in biological systems, having been observed only in the vertebrate adaptive immune response and in a phage diversity-generating retroelement (DGR). Earlier work has demonstrated that the prototypical DGR variable protein, major tropism determinant (Mtd), meets the demands of anticipatory ligand binding by novel means through the C-type lectin (CLec) fold. However, because of the low sequence identity among DGR variable proteins, it has remained unclear whether the CLec fold is a general solution for DGRs. We have addressed this problem by determining the structure of a second DGR variable protein, TvpA, from the pathogenic oral spirochete Treponema denticola. Despite its weak sequence identity to Mtd ({approx}16%), TvpA was found to also have a CLec fold, with predicted variable residues exposed in a ligand-binding site. However, this site in TvpA was markedly more variable than the one in Mtd, reflecting the unprecedented approximate 10{sup 20} potential variability of TvpA. In addition, similarity between TvpA and Mtd with formylglycine-generating enzymes was detected. These results provide strong evidence for the conservation of the formylglycine-generating enzyme-type CLec fold among DGRs as a means of accommodating massive sequence variation.

  18. Cytogenetic Diversity and the Evolutionary Dynamics of rDNA Genes and Telomeric Sequences in the Ancistrus Genus (Loricariidae: Ancistrini).

    PubMed

    Favarato, Ramon Marin; Silva, Maelin da; Oliveira, Renildo Ribeiro de; Artoni, Roberto Ferreira; Feldberg, Eliana; Matoso, Daniele Aparecida

    2016-04-01

    The Ancistrus genus differs from other Ancistrini due to its wide karyotypic diversity, varied diploid numbers, differences in sex chromosomes, and large number of species, as well as its tendency to form small populations with low vagility. This study investigated the role of 5S and 18S rDNA and telomeric repetitive sequences in the evolution of the karyotypic macrostructure of seven species of the genus Ancistrus from the Central Amazon. The results indicate a strong correlation between the location of ribosomal sites and fragile sites in the genome, particularly of 5S rDNA sequences, which are associated, in some species, with telomeric sequences at the sites of chromosomal healing. Moreover, the occurrence of two lineages was observed with regard to the synteny of ribosomal genes. The species of the genus Ancistrus showed high chromosomal lability associated with breakpoints, which was characterized by the presence of repetitive DNA sequences and this process is suggested to be an evolutionary model for the rapid fixation of structural rearrangements. PMID:26829587

  19. Multilocus sequence analysis of Brazilian Rhizobium microsymbionts of common bean (Phaseolus vulgaris L.) reveals unexpected taxonomic diversity.

    PubMed

    Ribeiro, Renan Augusto; Barcellos, Fernando Gomes; Thompson, Fabiano L; Hungria, Mariangela

    2009-05-01

    The diazotrophic bacteria collectively known as "rhizobia" are important for establishing symbiotic N(2)-fixing associations with many legumes. These microbes have been used for over a century as an environmentally beneficial and cost-effective means of ensuring acceptable yields of agricultural legumes. The most widely used phylogenetic marker for identification and classification of rhizobia has been the 16S rRNA gene; however, this marker fails to discriminate some closely related species. In this study, we established the first multilocus sequence analysis (MLSA) scheme for the identification and classification of rhizobial microsymbionts of common bean (Phaseolus vulgaris L.). We analyzed 12 Brazilian strains representative of a collection of over 850 isolates in addition to type and reference rhizobial strains, by sequencing recA, dnaK, gltA, glnII and rpoA genes. Gene sequence similarities among the five type/reference Rhizobium strains which are symbionts of common bean ranged from 95 to 100% for 16S rRNA, and from 83 to 99% for the other five genes. Rhizobial species described as symbionts of common bean also formed separate groups upon analysis of single and concatenated gene sequences, and clusters formed in each tree were in good mutual agreement. The five additional loci may thus be considered useful markers of the genus Rhizobium; in addition, MLSA also revealed broad genetic diversity among strains classified as Rhizobium tropici, providing evidence of new species. PMID:19403105

  20. Analysis of the sequence diversity of the P1, HC, P3, NIb and CP genomic regions of several yam mosaic potyvirus isolates: implications for the intraspecies molecular diversity of potyviruses.

    PubMed

    Aleman-Verdaguer, M E; Goudou-Urbino, C; Dubern, J; Beachy, R N; Fauquet, C

    1997-06-01

    Partial sequences from serologically characterized yam mosaic potyvirus (YMV) isolates were determined in conserved (helper-component proteinase, HC; nuclear inclusion b, NIb) and variable (first protein, P1; third protein, P3; and coat protein, CP) regions of the potyviral genome in order to investigate the intraspecies molecular diversity of YMV. Multiple sequence alignments and pairwise comparisons were used to quantify the sequence polymorphism in these regions. Two levels of diversity were observed among YMV isolates: above 90% nucleotide (nt) sequence identities were found between YMV isolates of the same group (intragroup) regardless of the region considered, whereas identities between isolates from different groups (intergroup) were lower and depended upon the protein chosen. For instance, the average intergroup nt sequence identity between YMV isolates was about 65% in the P1 protein and the N terminus of the CP while there was more than 80% nt identity in the HC, P3 and NIb proteins. Thus P3 appeared to be conserved between YMV isolates even though this region was variable between potyvirus species. Similar analysis of the intraspecies molecular diversity of other potyviruses (potato virus Y, zucchini yellow mosaic virus, plum pox virus, pea seed-borne mosaic virus) led to the same results: (i) two levels of intraspecies molecular diversity were found (intragroup and intergroup); (ii) intraspecies molecular diversity differed from interspecies molecular diversity in the P3, P1 and N-terminal regions. PMID:9191916

  1. Molecular cytogenetics by polymerase catalyzed amplification or in situ labelling of specific nucleic acid sequences

    SciTech Connect

    Bolund, L.; Brandt, C.; Hindkjaer, J.; Koch, J.; Koelvraa, S.; Pedersen, S. )

    1993-01-01

    The Polymerase Chain Reaction (PCR) can be performed on isolated cells or chromosomes and the product can be analyzed by DNA technology or by FISH to test metaphases. The authors have good experiences analyzing aberrant chromosomes by FACS sorting, PCR with degenerated primers and painting of test metaphases with the PCR product. They also utilize polymerases for PRimed IN Situ labelling (PRINS) of specific nucleic acid sequences. In PRINS oligonucleotides are hybridized to their target sequences and labeled nucleotides are incorporated at the site of hybridization with the oligonucleotide as primer. PRINS may eventually allow the study of individual genes, gene expression and even somatic mutations (in mRNA) in single cells.

  2. DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

    NASA Astrophysics Data System (ADS)

    Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

    1984-08-01

    A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.

  3. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.

  4. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

  5. Partial amino acid sequence of apolipoprotein(a) shows that it is homologous to plasminogen

    SciTech Connect

    Eaton, D.L.; Fless, G.M.; Kohr, W.J.; McLean, J.W.; Xu, Q.T.; Miller, C.G.; Lawn, R.M.; Scanu, A.M.

    1987-05-01

    Apolipoprotein(a) (apo(a)) is a glycoprotein with M/sub r/ approx. 280,000 that is disulfide linked to apolipoprotein B in lipoprotein(a) particles. Elevated plasma levels of lipoprotein(a) are correlated with atherosclerosis. Partial amino acid sequence of apo(a) shows that it has striking homology to plasminogen. Plasminogen is a plasma serine protease zymogen that consists of five homologous and tandemly repeated domains called kringles and a trypsin-like protease domain. The amino-terminal sequence obtained for apo(a) is homologous to the beginning of kringle 4 but not the amino terminus of plasminogen. Apo(a) was subjected to limited proteolysis by trypsin or V8 protease, and fragments generated were isolated and sequenced. Sequences obtained from several of these fragments are highly (77-100%) homologous to plasminogen residues 391-421, which reside within kringle 4. Analysis of these internal apo(a) sequences revealed that apo(a) may contain at least two kringle 4-like domains. A sequence obtained from another tryptic fragment also shows homology to the end of kringle 4 and the beginning of kringle 5. Sequence data obtained from the two tryptic fragments shows homology with the protease domain of plasminogen. One of these sequences is homologous to the sequences surrounding the activation site of plasminogen. Plasminogen is activated by the cleavage of a specific arginine residue by urokinase and tissue plasminogen activator; however, the corresponding site in apo(a) is a serine that would not be cleaved by tissue plasminogen activator or urokinase. Using a plasmin-specific assay, no proteolytic activity could be demonstrated for lipoprotein(a) particles. These results suggest that apo(a) contains kringle-like domains and an inactive protease domain.

  6. Microbial Diversity and Its Relationship to Physicochemical Characteristics of the Water in Two Extreme Acidic Pit Lakes from the Iberian Pyrite Belt (SW Spain)

    PubMed Central

    López-Pamo, Enrique; Gomariz, María; Amils, Ricardo; Aguilera, Ángeles

    2013-01-01

    The Iberian Pyrite Belt (IPB) hosts one of the world’s largest accumulations of acidic mine wastes and pit lakes. The mineralogical and textural characteristics of the IPB ores have favored the oxidation and dissolution of metallic sulfides, mainly pyrite, and the subsequent formation of acidic mining drainages. This work reports the physical properties, hydrogeochemical characteristics, and microbial diversity of two pit lakes located in the IPB. Both pit lakes are acidic and showed high concentrations of sulfate and dissolved metals. Concentrations of sulfate and heavy metals were higher in the Nuestra Señora del Carmen lake (NSC) by one order of magnitude than in the Concepción (CN) lake. The hydrochemical characteristics of NSC were typical of acid mine waters and can be compared with other acidic environments. When compared to other IPB acidic pit lakes, the superficial water of CN is more diluted than that of any of the others due, probably, to the strong influence of runoff water. Both pit lakes showed chemical and thermal stratification with well defined chemoclines. One particular characteristic of NSC is that it has developed a chemocline very close to the surface (2 m depth). Microbial community composition of the water column was analyzed by 16S and 18S rRNA gene cloning and sequencing. The microorganisms detected in NSC were characteristic of acid mine drainage (AMD), including iron oxidizing bacteria (Leptospirillum, Acidithiobacillus ferrooxidans) and facultative iron reducing bacteria and archaea (Acidithiobacillus ferrooxidans, Acidiphilium, Actinobacteria, Acidimicrobiales, Ferroplasma) detected in the bottom layer. Diversity in CN was higher than in NSC. Microorganisms known from AMD systems (Acidiphilium, Acidobacteria and Ferrovum) and microorganisms never reported from AMD systems were identified. Taking into consideration the hydrochemical characteristics of these pit lakes and the spatial distribution of the identified microorganisms, a

  7. Sequence and diversity of T-cell receptor alpha V, J, and C genes of the owl monkey Aotus nancymaae.

    PubMed

    Favre, N; Daubenberger, C; Marfurt, J; Moreno, A; Patarroyo, M; Pluschke, G

    1998-09-01

    We cloned and sequenced TcR alpha chain cDNA of three healthy Aotus nancymaae monkeys. Fifteen different TRAJ segments and 9 different TRAV genes were identified in the 29 rearrangements analyzed. As expected from the greater phylogenetic distance, A. nancymaae TRA gene sequences diverged more from the human sequences than those of the chimpanzee or the rhesus macaque. However, no Aotus TRAJ segment or TRAV gene was found which lacked a human counterpart. These counterparts were AJ02, AJ05, AJ09, AJ15, AJ22, AJ23, AJ28, AJ30, AJ32, AJ34, AJ37, AJ40, AJ42, AJ45, AJ52 and AV2S1, AV2S3, AV3S1, AV8S1, AV12S1, AV15S1, ADV21S1/DV5, AV22S1S and AV23S1, respectively. In most cases the identity of amino acid sequences between corresponding Aotus and human genes was greater than 80%. This marked conservation of TRA gene sequences indicates a close structural relationship of Aotus and human TcR and demonstrates that the TcR repertoire in primates is remarkably stable. The results support the concept of using Aotus monkeys, which are susceptible to infection with the human malaria parasite Plasmodium falciparum, as an animal model for the evaluation of molecularly defined malaria vaccine candidates. PMID:9716644

  8. Whole-genome sequencing reveals the diversity of cattle copy number variations and multicopy genes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Structural and functional impacts of copy number variations (CNVs) on livestock genomes are not yet well understood. We identified 1853 CNV regions using population-scale sequencing data generated from 75 cattle representing 8 breeds (Angus, Brahman, Gir, Holstein, Jersey, Limousin, Nelore, Romagnol...

  9. Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes—a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-o...

  10. Sequence diversity in haloalkane dehalogenases, as revealed by PCR using family-specific primers.

    PubMed

    Kotik, Michael; Faměrová, Veronika

    2012-02-01

    Haloalkane dehalogenases (HLDs) are hydrolytic enzymes that cleave carbon-halogen bonds in various halogenated compounds. Interest initially grew in HLDs as biocatalysts for bioremediation and later for biotransformation applications; each specific HLD within the HLD family has its own substrate specificity, enantioselectivity and product inhibition characteristics. We developed degenerate oligonucleotide primers for HLD-encoding genes and used these to PCR-amplify large hld gene fragments using genomic DNA from the microbial community of a chlorinated-solvent-contaminated aquifer as a template. An analysis of small subunit ribosomal RNA genes revealed a high complexity in the eubacterial population, dominated by α-, β- and γ-Proteobacteria, and Acidobacteria. Using HLD-family-specific primers, we also retrieved transcribed hld homologues from the microbial consortium of this contaminated site. The DNA-derived hld sequences were phylogenetically broadly distributed over both HLD subclasses I and II. Most hld sequences of the environmental RNA data set clustered in three groups within both HLD subclasses, indicating that a considerable proportion of the microbial consortium carrying hld genes was actively involved in haloalkane dehalogenation. The small sequence variation in hld genes and transcripts within each HLD cluster inferred the presence of a substantial pool of highly related HLD genes. The sequence variability appeared to be unevenly distributed over the HLD genes, however, with no apparent preference for a particular protein segment or domain. PMID:22155739

  11. Yersinia spp. Identification Using Copy Diversity in the Chromosomal 16S rRNA Gene Sequence

    PubMed Central

    Chen, Yuhuang; Liu, Chang; Xiao, Yuchun; Li, Xu; Su, Mingming; Jing, Huaiqi; Wang, Xin

    2016-01-01

    API 20E strip test, the standard for Enterobacteriaceae identification, is not sufficient to discriminate some Yersinia species for some unstable biochemical reactions and the same biochemical profile presented in some species, e.g. Yersinia ferderiksenii and Yersinia intermedia, which need a variety of molecular biology methods as auxiliaries for identification. The 16S rRNA gene is considered a valuable tool for assigning bacterial strains to species. However, the resolution of the 16S rRNA gene may be insufficient for discrimination because of the high similarity of sequences between some species and heterogeneity within copies at the intra-genomic level. In this study, for each strain we randomly selected five 16S rRNA gene clones from 768 Yersinia strains, and collected 3,840 sequences of the 16S rRNA gene from 10 species, which were divided into 439 patterns. The similarity among the five clones of 16S rRNA gene is over 99% for most strains. Identical sequences were found in strains of different species. A phylogenetic tree was constructed using the five 16S rRNA gene sequences for each strain where the phylogenetic classifications are consistent with biochemical tests; and species that are difficult to identify by biochemical phenotype can be differentiated. Most Yersinia strains form distinct groups within each species. However Yersinia kristensenii, a heterogeneous species, clusters with some Yersinia enterocolitica and Yersinia ferderiksenii/intermedia strains, while not affecting the overall efficiency of this species classification. In conclusion, through analysis derived from integrated information from multiple 16S rRNA gene sequences, the discrimination ability of Yersinia species is improved using our method. PMID:26808495

  12. Yersinia spp. Identification Using Copy Diversity in the Chromosomal 16S rRNA Gene Sequence.

    PubMed

    Hao, Huijing; Liang, Junrong; Duan, Ran; Chen, Yuhuang; Liu, Chang; Xiao, Yuchun; Li, Xu; Su, Mingming; Jing, Huaiqi; Wang, Xin

    2016-01-01

    API 20E strip test, the standard for Enterobacteriaceae identification, is not sufficient to discriminate some Yersinia species for some unstable biochemical reactions and the same biochemical profile presented in some species, e.g. Yersinia ferderiksenii and Yersinia intermedia, which need a variety of molecular biology methods as auxiliaries for identification. The 16S rRNA gene is considered a valuable tool for assigning bacterial strains to species. However, the resolution of the 16S rRNA gene may be insufficient for discrimination because of the high similarity of sequences between some species and heterogeneity within copies at the intra-genomic level. In this study, for each strain we randomly selected five 16S rRNA gene clones from 768 Yersinia strains, and collected 3,840 sequences of the 16S rRNA gene from 10 species, which were divided into 439 patterns. The similarity among the five clones of 16S rRNA gene is over 99% for most strains. Identical sequences were found in strains of different species. A phylogenetic tree was constructed using the five 16S rRNA gene sequences for each strain where the phylogenetic classifications are consistent with biochemical tests; and species that are difficult to identify by biochemical phenotype can be differentiated. Most Yersinia strains form distinct groups within each species. However Yersinia kristensenii, a heterogeneous species, clusters with some Yersinia enterocolitica and Yersinia ferderiksenii/intermedia strains, while not affecting the overall efficiency of this species classification. In conclusion, through analysis derived from integrated information from multiple 16S rRNA gene sequences, the discrimination ability of Yersinia species is improved using our method. PMID:26808495

  13. The Complete Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis ssp. lactis IL1403

    PubMed Central

    Bolotin, Alexander; Wincker, Patrick; Mauger, Stéphane; Jaillon, Olivier; Malarme, Karine; Weissenbach, Jean; Ehrlich, S. Dusko; Sorokin, Alexei

    2001-01-01

    Lactococcus lactis is a nonpathogenic AT-rich gram-positive bacterium closely related to the genus Streptococcus and is the most commonly used cheese starter. It is also the best-characterized lactic acid bacterium. We sequenced the genome of the laboratory strain IL1403, using a novel two-step strategy that comprises diagnostic sequencing of the entire genome and a shotgun polishing step. The genome contains 2,365,589 base pairs and encodes 2310 proteins, including 293 protein-coding genes belonging to six prophages and 43 insertion sequence (IS) elements. Nonrandom distribution of IS elements indicates that the chromosome of the sequenced strain may be a product of recent recombination between two closely related genomes. A complete set of late competence genes is present, indicating the ability of L. lactis to undergo DNA transformation. Genomic sequence revealed new possibilities for fermentation pathways and for aerobic respiration. It also indicated a horizontal transfer of genetic information from Lactococcus to gram-negative enteric bacteria of Salmonella-Escherichia group. [The sequence data described in this paper has been submitted to the GenBank data library under accession no. AE005176.] PMID:11337471

  14. On human disease-causing amino acid variants: statistical study of sequence and structural patterns

    PubMed Central

    Alexov, Emil

    2015-01-01

    Statistical analysis was carried out on large set of naturally occurring human amino acid variations and it was demonstrated that there is a preference for some amino acid substitutions to be associated with diseases. At an amino acid sequence level, it was shown that the disease-causing variants frequently involve drastic changes of amino acid physico-chemical properties of proteins such as charge, hydrophobicity and geometry. Structural analysis of variants involved in diseases and being frequently observed in human population showed similar trends: disease-causing variants tend to cause more changes of hydrogen bond network and salt bridges as compared with harmless amino acid mutations. Analysis of thermodynamics data reported in literature, both experimental and computational, indicated that disease-causing variants tend to destabilize proteins and their interactions, which prompted us to investigate the effects of amino acid mutations on large databases of experimentally measured energy changes in unrelated proteins. Although the experimental datasets were linked neither to diseases nor exclusory to human proteins, the observed trends were the same: amino acid mutations tend to destabilize proteins and their interactions. Having in mind that structural and thermodynamics properties are interrelated, it is pointed out that any large change of any of them is anticipated to cause a disease. PMID:25689729

  15. Self-sequencing of amino acids and origins of polyfunctional protocells.

    PubMed

    Fox, S W

    1984-01-01

    The primal role of the origins of proteins in molecular evolution is discussed. On the basis of this premise, the significance of the experimentally established self-sequencing of amino acids under simulated geological conditions is explained as due to the fact that the products are highly nonrandom and accordingly contain many kinds of information. When such thermal proteins are aggregated into laboratory protocells, an action that occurs readily, the resultant protocells also contain many kinds of information. Residue-by-residue order, enzymic activities, and lipid quality accordingly occur within each preparation of proteinoid (thermal protein). In this paper are reviewed briefly the phenomenon of self-sequencing of amino acids, its relationship to evolutionary processes, other significance of such self-ordering, and the experimental evidence for original polyfunctional protocells. PMID:6462684

  16. Self-Sequencing of Amino Acids and Origins of Polyfunctional Protocells

    NASA Astrophysics Data System (ADS)

    Fox, Sidney W.

    1984-12-01

    The primal role of the origins of proteins in molecular evolution is discussed. On the basis of this premise, the significance of the experimentally established self-sequencing of amino acids under simulated geological conditions is explained as due to the fact that the products are highly nonrandom and accordingly contain many kinds of information. When such thermal proteins are aggregated into laboratory protocells, an action that occurs readily, the resultant protocells also contain many kinds of information. Residue-by-residue order, enzymic activities, and lipid quality accordingly occur within each preparation of proteinoid (thermal protein). In this paper are reviewed briefly the phenomenon of self-sequencing of amino acids, its relationship to evolutionary processes, other significance of such self-ordering, and the experimental evidence for original polyfunctional protocells.

  17. Species diversity and relative abundance of lactic acid bacteria in the milk of rhesus monkeys (Macaca mulatta)

    PubMed Central

    Jin, L.; Hinde, K.; Tao, L.

    2013-01-01

    Background Mother’s milk is a source of bacteria that influences the development of the infant commensal gut microbiota. To date, the species diversity and relative abundance of lactic acid bacteria in the milk of non-human primates have not been described. Methods Milk samples were aseptically obtained from 54 female rhesus monkeys (Macaca mulatta) at peak lactation. Following GM17 and MRS agar plating, single bacterial colonies were isolated based on difference in morphotypes, then grouped based on whole-cell protein profiles on SDS–PAGE. Bacterial DNA was isolated and the sequence the 16S rRNA gene was analyzed. Results A total of 106 strains of 19 distinct bacterial species, belonging to five genera, Bacillus, Enterococcus, Lactobacillus, Pediococcus, and Streptococcus, were identified. Conclusions Maternal gut and oral commensal bacteria may be translocated to the mammary gland during lactation and present in milk. This pathway can be an important source of commensal bacteria to the infant gut and oral cavity. PMID:20946146

  18. Next-generation sequencing reveals cryptic mtDNA diversity of Plasmodium relictum in the Hawaiian Islands

    USGS Publications Warehouse

    Jarvi, S.I.; Farias, M.E.; Lapointe, D.A.; Belcaid, M.; Atkinson, C.T.

    2013-01-01

    Next-generation 454 sequencing techniques were used to re-examine diversity of mitochondrial cytochrome b lineages of avian malaria (Plasmodium relictum) in Hawaii. We document a minimum of 23 variant lineages of the parasite based on single nucleotide transitional changes, in addition to the previously reported single lineage (GRW4). A new, publicly available portal (Integroomer) was developed for initial parsing of 454 datasets. Mean variant prevalence and frequency was higher in low elevation Hawaii Amakihi (Hemignathus virens) with Avipoxvirus-like lesions (P = 0·001), suggesting that the variants may be biologically distinct. By contrast, variant prevalence and frequency did not differ significantly among mid-elevation Apapane (Himatione sanguinea) with or without lesions (P = 0·691). The low frequency and the lack of detection of variants independent of GRW4 suggest that multiple independent introductions of P. relictum to Hawaii are unlikely. Multiple variants may have been introduced in heteroplasmy with GRW4 or exist within the tandem repeat structure of the mitochondrial genome. The discovery of multiple mitochondrial lineages of P. relictum in Hawaii provides a measure of genetic diversity within a geographically isolated population of this parasite and suggests the origins and evolution of parasite diversity may be more complicated than previously recognized.

  19. Empirical Comparison of Simple Sequence Repeats and Single Nucleotide Polymorphisms in Assessment of Maize Diversity and Relatedness

    PubMed Central

    Hamblin, Martha T.; Warburton, Marilyn L.; Buckler, Edward S.

    2007-01-01

    While Simple Sequence Repeats (SSRs) are extremely useful genetic markers, recent advances in technology have produced a shift toward use of single nucleotide polymorphisms (SNPs). The different mutational properties of these two classes of markers result in differences in heterozygosities and allele frequencies that may have implications for their use in assessing relatedness and evaluation of genetic diversity. We compared analyses based on 89 SSRs (primarily dinucleotide repeats) to analyses based on 847 SNPs in individuals from the same 259 inbred maize lines, which had been chosen to represent the diversity available among current and historic lines used in breeding. The SSRs performed better at clustering germplasm into populations than did a set of 847 SNPs or 554 SNP haplotypes, and SSRs provided more resolution in measuring genetic distance based on allele-sharing. Except for closely related pairs of individuals, measures of distance based on SSRs were only weakly correlated with measures of distance based on SNPs. Our results suggest that 1) large numbers of SNP loci will be required to replace highly polymorphic SSRs in studies of diversity and relatedness and 2) relatedness among highly-diverged maize lines is difficult to measure accurately regardless of the marker system. PMID:18159250

  20. Microbial community structure of two freshwater sponges using Illumina MiSeq sequencing revealed high microbial diversity.

    PubMed

    Gaikwad, Swapnil; Shouche, Yogesh S; Gade, Wasudev N

    2016-12-01

    Sponges are primitive metazoans that are known to harbour diverse and abundant microbes. All over the world attempts are being made to exploit these microbes for their biotechnological potential to produce, bioactive compounds and antimicrobial peptides. However, the majority of the studies are focussed on the marine sponges and studies on the freshwater sponges have been neglected so far. To increase our understanding of the microbial community structure of freshwater sponges, microbiota of two fresh water sponges namely, Eunapius carteri and Corvospongilla lapidosa is explored for the first time using Next Generation Sequencing (NGS) technology. Overall the microbial composition of these sponges comprises of 14 phyla and on an average, more than 2900 OTUs were obtained from C. lapidosa while E. carteri showed 980 OTUs which is higher than OTUs obtained in the marine sponges. Thus, our study showed that, fresh water sponges also posses highly diverse microbial community than previously thought and it is distinct from the marine sponge microbiota. The present study also revealed that microbial community structure of both the sponges is significantly different from each other and their respective water samples. In the present study, we have detected many bacterial lineages belonging to Firmicutes, Actinobacteria, Proteobacteria, Planctomycetes, etc. that are known to produce compounds of biotechnological importance. Overall, this study gives insight into the microbial composition of the freshwater sponges which is highly diverse and needs to be studied further to exploit their biotechnological capabilities. PMID:27299740

  1. Assessment of genetic diversity in Mucuna species of India using randomly amplified polymorphic DNA and inter simple sequence repeat markers.

    PubMed

    Patil, Ravishankar R; Pawar, Kiran D; Rane, Manali R; Yadav, Shrirang R; Bapat, Vishwas A; Jadhav, Jyoti P

    2016-04-01

    Genus Mucuna which is native to China and Eastern India comprises of perennial climbing legume with long slender branches, trifoliate leaves and bear green or brown pod covered with soft or rigid hairs that cause intense irritation. The plants of this genus are agronomically and economically important and commercially cultivated in India, China and other regions of the world. The high degrees of taxonomical confusions exist in Mucuna species that make authentic identification and classification difficult. In the present study, the genetic diversity among the 59 accessions of six species and three varieties of M. pruriens has been assessed using DNA fingerprinting based molecular markers techniques namely randomly amplified polymorphic DNA (RAPD), inter simple sequence repeats (ISSR) and combined dataset of RAPD and ISSR. Also, genetic relationship among two endemic species of Mucuna namely M. imbricata and M. macrocarpa and two varieties namely IIHR hybrid (MHR) and Dhanwantari (MD) with other species under study was investigated by using cluster analysis and principal coordinate analysis. The cluster analysis of RAPD, ISSR and combined dataset of RAPD and ISSR clearly demonstrated the existence of high interspecific variation than intra-specific variation in genus Mucuna. The utility and efficacy of RAPD and ISSR for the study of intra species and interspecies genetic diversity was evident from AMOVA and PCoA analysis. This study demonstrates the genetic diversity in Mucuna species and indicates that these markers could be successfully used to assess genetic variation among the accessions of Mucuna species. PMID:27436912

  2. Genetic diversity across natural populations of three montane plant species from the Western Ghats, India revealed by intersimple sequence repeats.

    PubMed

    Deshpande, A U; Apte, G S; Bahulikar, R A; Lagu, M D; Kulkarni, B G; Suresh, H S; Singh, N P; Rao, M K; Gupta, V S; Pant, A; Ranjekar, P K

    2001-10-01

    We analysed genetic diversity across the natural populations of three montane plant species in the Western Ghats, India; Symplocos laurina, Gaultheria fragrantissima and Eurya nitida using intersimple sequence repeat (ISSR) markers. These markers revealed genetic diversity within the populations of these plants from Nilgiri and also between two populations of S. laurina from Nilgiri and Amboli. Genetic variation within and between populations was analysed using various parameters such as total heterozygosity (HT), heterozygosity within population (HS), diversity between populations (DST), coefficient of population differentiation (GST), genetic distance (D) and gene flow (Nm). Total heterozygosity (HT) was higher for S. laurina (0.238) than for G. fragrantissima (0.172) and E. nitida (0.182). Two populations of S. laurina, separated by > 1000 km, showed a high within-population variation (53.7%) and a low gene flow (Nm = 0.447). upgma phenograms depicted a tendency of accessions to group according to their geographical locations in all the three plant species. The insight gained into the genetic structure of these plant populations might have implications in developing in situ and ex situ conservation strategies. PMID:11742544

  3. A not-so-big crisis: re-reading Silurian conodont diversity in a sequence-stratigraphic framework

    NASA Astrophysics Data System (ADS)

    Jarochowska, Emilia; Munnecke, Axel

    2016-04-01

    Conodonts are extensively used in Ordovician through Triassic biostratigraphy and fossil-based geochemistry. However, their distribution in rock successions is commonly taken at face value, without taking into account their diverse and poorly understood ecology. Multielement taxonomy, ontogenetic and environmental variability, difficulties in extraction, and relative rarity all contribute to the general lack of quantitative studies on conodont stratigraphic distribution and temporal turnover. With respect to Silurian conodonts, the concept of recurrent conodont extinction events - the so called Ireviken, Mulde and Lau events - has become a standard in the stratigraphic literature. The concept has been proposed based on qualitative observations of local extirpations of open-marine pelagic or nekto-benthic taxa and temporary dominance of shallow-water species in the Silurian succession of the Swedish island of Gotland. These changes coincided with positive carbon isotope excursions, abrupt facies shifts, "blooms" of benthic fauna, and changes in reef communities, which have all been combined into a general view of Silurian bio-geochemical events. This view posits a deterministic, reproducible pattern in Silurian conodont diversity, attributed to recurrent ecological or geochemical conditions. The growing body of sequence-stratigraphic interpretations across these events in Gotland and other sections worldwide indicate that in all cases the Silurian "events" are associated with rapid global regressions. This suggests that faunal changes such as the dominance of shallow-water, low-diversity conodont fauna and the increase of benthic invertebrate diversity and abundance represent predictable consequences of the variation in the completeness of the rock record and preservation potential of different environments. Our studies in Poland and Ukraine indicate that the magnitude of change in the taxonomic composition of conodont assemblages across the middle Silurian global

  4. Genetic diversity in two Japanese flounder populations from China seas inferred using microsatellite markers and COI sequences

    NASA Astrophysics Data System (ADS)

    Xu, Dongdong; Li, Sanlei; Lou, Bao; Zhang, Yurong; Zhan, Wei; Shi, Huilai

    2012-07-01

    Japanese flounder is one of the most important commercial species in China; however, information on the genetic background of natural populations in China seas is scarce. The lack of genetic data has hampered fishery management and aquaculture development programs for this species. In the present study, we have analyzed the genetic diversity in natural populations of Japanese flounder sampled from the Yellow Sea (Qingdao population, QD) and East China Sea (Zhoushan population, ZS) using 10 polymorphic microsatellite loci and cytochrome c oxidase subunit I (COI) sequencing data. A total of 68 different alleles were observed over 10 microsatellite loci. The total number of alleles per locus ranged from 2 to 9, and the number of genotypes per locus ranged from 3 to 45. The observed heterozygosity and expected heterozygosity in QD were 0.733 and 0.779, respectively, and in ZS the heterozygosity values were 0.708 and 0.783, respectively. Significant departures from Hardy-Weinberg equilibrium were observed in 7 of the 10 microsatellite loci in each of the two populations. The COI sequencing analysis revealed 25 polymorphic sites and 15 haplotypes in the two populations. The haplotype diversity and nucleotide diversity in the QD population were 0.746±0.072 8 and 0.003 34±0.001 03 respectively, and in ZS population the genetic diversity values were 0.712±0.047 0 and 0.003 18±0.000 49, respectively. The microsatellite data ( F st =0.048 7, P <0.001) and mitochondrial DNA data ( F st =0.128, P <0.001) both revealed significant genetic differentiation between the two populations. The information on the genetic variation and differentiation in Japanese flounder obtained in this study could be used to set up suitable guidelines for the management and conservation of this species, as well as for managing artificial selection programs. In future studies, more geographically diverse stocks should be used to obtain a deeper understanding of the population structure of Japanese

  5. Sequence of morphological transitions in two-dimensional pattern growth from aqueous ascorbic Acid solutions.

    PubMed

    Paranjpe, A S

    2002-08-12

    A sequence of morphological transitions in two-dimensional dehydration patterns of aqueous solutions of ascorbic acid is observed with humidity as a control parameter. Change in morphology occurs due to humidity induced variation in the concentration of the metastable supersaturated solution phase formed after initial solvent evaporation. As percent humidity is varied from 40 to 80, patterns change from compact circular --> radial --> density modulated radial (a new morphology) --> density modulated circular --> density modulated dendritic (a new morphology) --> dense branching. PMID:12190528

  6. Snake venom. The amino acid sequence of protein A from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1980-12-01

    Protein A from Dendroaspis polylepis polylepis venom comprises 81 amino acids, including ten half-cystine residues. The complete primary structures of protein A and its variant A' were elucidated. The sequences of proteins A and A', which differ in a single position, show no homology with various neurotoxins and non-neurotoxic proteins and represent a new type of elapid venom protein. PMID:7461607

  7. Sequence of the rhesus monkey T-cell receptor {beta} chain diversity and joining loci

    SciTech Connect

    Cheynier, R.; Henrichwark, S.; Wain-Hobson, S.

    1996-06-01

    Rhesus monkeys are frequently used as animal models for human diseases, most noticeably for simian immunodeficiency virus (SIV) infection and simian AIDS. An analysis of HIV proviruses and HIV-specific cytotoxic T cells in splenic white pulps relied heavily on the analysis of rearranged TCRBV sequences. The spleens were derived from patients with drug-insensitive idiopathic thrombocytopenia purpura and frequently taken at an advanced stage of disease. In order to obtain some insight into the balance of forces between the virus and the immune system during earlier stages of infection, one must inevitably turn to the SIV/macaque AIDS model. As a prerequisite to undertaking similar virological and immunological studies the nucleotide sequence of the macaque TCRBJ loci had to be established. 9 refs., 4 figs., 1 tab.

  8. BLAST: at the core of a powerful and diverse set of sequence analysis tools.

    PubMed

    McGinnis, Scott; Madden, Thomas L

    2004-07-01

    Basic Local Alignment Search Tool (BLAST) is one of the most heavily used sequence analysis tools available in the public domain. There is now a wide choice of BLAST algorithms that can be used to search many different sequence databases via the BLAST web pages (http://www.ncbi.nlm.nih.gov/BLAST/). All the algorithm-database combinations can be executed with default parameters or with customized settings, and the results can be viewed in a variety of ways. A new online resource, the BLAST Program Selection Guide, has been created to assist in the definition of search strategies. This article discusses optimal search strategies and highlights some BLAST features that can make your searches more powerful. PMID:15215342

  9. Investigation of bacterial and fungal diversity in tarag using high-throughput sequencing.

    PubMed

    Sun, Zhihong; Liu, Wenjun; Bao, Qiuhua; Zhang, Jiachao; Hou, Qiangchuan; Kwok, Laiyu; Sun, Tiansong; Zhang, Heping

    2014-10-01

    This is the first study on the bacterial and fungal community diversity in 17 tarag samples (naturally fermented dairy products) through a metagenomic approach involving high-throughput pyrosequencing. Our results revealed the presence of a total of 47 bacterial and 43 fungal genera in all tarag samples, in which Lactobacillus and Galactomyces were the predominant genera of bacteria and fungi, respectively. The number of some microbial genera, such as Lactococcus, Acetobacter, Saccharomyces, Trichosporon, and Kluyveromyces, among others, was found to vary between different samples. Altogether, our results showed that the microbial flora in different samples may be stratified by geographic region. PMID:25129502

  10. Sequence Assembly of Yarrowia lipolytica Strain W29/CLIB89 Shows Transposable Element Diversity.

    PubMed

    Magnan, Christophe; Yu, James; Chang, Ivan; Jahn, Ethan; Kanomata, Yuzo; Wu, Jenny; Zeller, Michael; Oakes, Melanie; Baldi, Pierre; Sandmeyer, Suzanne

    2016-01-01

    Yarrowia lipolytica, an oleaginous yeast, is capable of accumulating significant cellular mass in lipid making it an important source of biosustainable hydrocarbon-based chemicals. In spite of a similar number of protein-coding genes to that in other Hemiascomycetes, the Y. lipolytica genome is almost double that of model yeasts. Despite its economic importance and several distinct strains in common use, an independent genome assembly exists for only one strain. We report here a de novo annotated assembly of the chromosomal genome of an industrially-relevant strain, W29/CLIB89, determined by hybrid next-generation sequencing. For the first time, each Y. lipolytica chromosome is represented by a single contig. The telomeric rDNA repeats were localized by Irys long-range genome mapping and one complete copy of the rDNA sequence is reported. Two large structural variants and retroelement differences with reference strain CLIB122 including a full-length, novel Ty3/Gypsy long terminal repeat (LTR) retrotransposon and multiple LTR-like sequences are described. Strikingly, several of these are adjacent to RNA polymerase III-transcribed genes, which are almost double in number in Y. lipolytica compared to other Hemiascomycetes. In addition to previously-reported dimeric RNA polymerase III-transcribed genes, tRNA pseudogenes were identified. Multiple full-length and truncated LINE elements are also present. Therefore, although identified transposons do not constitute a significant fraction of the Y. lipolytica genome, they could have played an active role in its evolution. Differences between the sequence of this strain and of the existing reference strain underscore the utility of an additional independent genome assembly for this economically important organism. PMID:27603307

  11. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... approved by the Director of the Federal Register in accordance with 5 U.S.C. 552(a) and 1 CFR part 51... base or modified or unusual amino acid may be presented in a given sequence as the corresponding unmodified base or amino acid if the modified base or modified or unusual amino acid is one of those...