Science.gov

Sample records for acid sequence analyses

  1. Homology analyses of the protein sequences of fatty acid synthases from chicken liver, rat mammary gland, and yeast

    SciTech Connect

    Chang, Soo-Ik; Hammes, G.G.

    1989-11-01

    Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homology exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The NADPH-binding dinucleotide folds of the β-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the NADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution.
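    The two figures quoted above (percent identity, and percent matched when conservative substitutions are allowed) can be illustrated with a minimal Python sketch. It assumes two sequences that are already aligned, with '-' marking gaps. The conservative-substitution groups are hypothetical and chosen only for illustration; the abstract does not say which substitutions the authors counted as conservative.

```python
# Hypothetical conservative-substitution groups (an assumption for this sketch;
# the abstract does not list the groups that were actually used).
CONSERVATIVE_GROUPS = [
    set("ILVM"), set("FYW"), set("KRH"), set("DE"),
    set("ST"), set("NQ"), set("AG"),
]


def identity_and_similarity(a: str, b: str) -> tuple[float, float]:
    """Return (percent identity, percent similarity) for two aligned sequences.

    Columns with a gap ('-') in either sequence are skipped, so both
    percentages are relative to the number of ungapped columns.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = identical = similar = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            identical += 1
            similar += 1  # identical residues also count as matched
        elif any(x in group and y in group for group in CONSERVATIVE_GROUPS):
            similar += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared


ident, sim = identity_and_similarity("ILKDA-", "IVKEGA")
print(f"identity {ident:.1f}%, with conservative substitutions {sim:.1f}%")
```

    Whether gapped columns belong in the denominator is a design choice; this sketch leaves them out.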

  2. Analyses of mitochondrial amino acid sequence datasets support the proposal that specimens of Hypodontus macropi from three species of macropodid hosts represent distinct species

    PubMed Central

    2013-01-01

    Background Hypodontus macropi is a common intestinal nematode of a range of kangaroos and wallabies (macropodid marsupials). Based on previous multilocus enzyme electrophoresis (MEE) and nuclear ribosomal DNA sequence data sets, H. macropi has been proposed to be a complex of species. To test this proposal using independent molecular data, we sequenced the whole mitochondrial (mt) genomes of individuals of H. macropi from three different species of hosts (Macropus robustus robustus, Thylogale billardierii and Macropus [Wallabia] bicolor) as well as that of Macropicola ocydromi (a related nematode), and undertook a comparative analysis of the amino acid sequence datasets derived from these genomes. Results The mt genomes sequenced by next-generation (454) technology from H. macropi from the three host species varied from 13,634 bp to 13,699 bp in size. Pairwise comparisons of the amino acid sequences predicted from these three mt genomes revealed differences of 5.8% to 18%. Phylogenetic analysis of the amino acid sequence data sets using Bayesian Inference (BI) showed that H. macropi from the three different host species formed distinct, well-supported clades. In addition, sliding window analysis of the mt genomes defined variable regions for future population genetic studies of H. macropi in different macropodid hosts and geographical regions around Australia. Conclusions The present analyses of inferred mt protein sequence datasets clearly supported the hypothesis that H. macropi from M. robustus robustus, M. bicolor and T. billardierii represent distinct species. PMID:24261823
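    As a rough illustration of the sliding window analysis mentioned above, the following Python sketch scores each window of an alignment by the fraction of columns that vary between sequences. This is a simplified stand-in: the abstract does not give the window size, step, or statistic used, so the defaults and the scoring rule here are assumptions.

```python
def sliding_window_variability(alignment, window=300, step=10):
    """Return a list of (start, fraction_variable) for each window.

    `alignment` is a list of equal-length aligned sequences. A column counts
    as variable when it holds more than one distinct character (gaps are
    treated like any other character in this sketch).
    """
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have equal length")
    # 1 for a variable column, 0 for a conserved one
    variable = [
        1 if len({seq[i] for seq in alignment}) > 1 else 0
        for i in range(length)
    ]
    results = []
    for start in range(0, length - window + 1, step):
        score = sum(variable[start:start + window]) / window
        results.append((start, score))
    return results


toy_alignment = ["AAAAAA", "AAATTA", "AAATAA"]
for start, score in sliding_window_variability(toy_alignment, window=3, step=1):
    print(start, round(score, 2))
```

    Windows with the highest scores would be the candidate variable regions.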

  3. Assignment of fatty acid-beta-oxidizing syntrophic bacteria to Syntrophomonadaceae fam. nov. on the basis of 16S rRNA sequence analyses

    NASA Technical Reports Server (NTRS)

    Zhao, H.; Yang, D.; Woese, C. R.; Bryant, M. P.

    1993-01-01

    After enrichment from Chinese rural anaerobic digestor sludge, anaerobic, sporing and nonsporing, saturated fatty acid-beta-oxidizing syntrophic bacteria were isolated as cocultures with H2- and formate-utilizing Methanospirillum hungatei or Desulfovibrio sp. strain G-11. The syntrophs degraded C4 to C8 saturated fatty acids, including isobutyrate and 2-methylbutyrate. They were adapted to grow on crotonate and were isolated as pure cultures. The crotonate-grown pure cultures alone did not grow on butyrate in either the presence or the absence of some common electron acceptors. However, when they were reconstituted with M. hungatei, growth on butyrate again occurred. In contrast, crotonate-grown Clostridium kluyveri and Clostridium sticklandii, as well as Clostridium sporogenes, failed to grow on butyrate when these organisms were cocultured with M. hungatei. The crotonate-grown pure subcultures of the syntrophs described above were subjected to 16S rRNA sequence analysis. Several previously documented fatty acid-beta-oxidizing syntrophs grown in pure cultures with crotonate were also subjected to comparative sequence analyses. The sequence analyses revealed that the new sporing and nonsporing isolates and other syntrophs that we sequenced, which had either gram-negative or gram-positive cell wall ultrastructure, all belonged to the phylogenetically gram-positive phylum. They were not closely related to any of the previously known subdivisions in the gram-positive phylum with which they were compared, but were closely related to each other, forming a new subdivision in the phylum. We recommend that this group be designated Syntrophomonadaceae fam. nov.; a description is given.

  4. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
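    The idea of deducing the sequence from the temporal order of base additions can be sketched in Python. The event format (a timestamp paired with the base identified from its label) is hypothetical and is not taken from the patent; the sketch only shows that sorting the incorporation events by time gives the growing strand, and that the template region is its reverse complement.

```python
# Watson-Crick pairing for DNA bases
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def deduce_sequences(events):
    """Return (growing_strand, template_region), both written 5'->3'.

    `events` is an iterable of (time, base) pairs, one per detected
    incorporation (a hypothetical record format used for illustration).
    """
    ordered = sorted(events, key=lambda event: event[0])
    growing = "".join(base for _, base in ordered)
    # The template is antiparallel to the growing strand, so reverse it
    # before complementing each base.
    template = "".join(COMPLEMENT[base] for base in reversed(growing))
    return growing, template


growing, template = deduce_sequences([(0.9, "G"), (0.1, "A"), (0.5, "C")])
print(growing, template)
```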

  5. Comparative studies on tree pollen allergens. X. Further purification and N-terminal amino acid sequence analyses of the major allergen of birch pollen (Betula verrucosa).

    PubMed

    Vik, H; Elsayed, S

    1986-01-01

    The previously isolated major allergen of birch pollen (fraction BV45), Int. Archs Allergy appl. Immun. 68: 70-78 (1982), was further purified by recycling chromatography. The purified preparation was run on a high-performance liquid chromatography (HPLC) TSK-G-2000 gel filtration chromatography column and, finally, on paper high-volt electrophoresis. The protein recovered met the homogeneity criteria required for performing the N-terminal sequence analysis. The allergenic and antigenic reactivities of the HPLC-purified protein, designated BV45B, were examined. A single homogeneous precipitation line in crossed immunoelectrophoresis (CIE) was shown. Specific IgE-inhibition tests and immuno-autoradiographic prints indicated that this allergen could bind reaginic IgE specifically and with good affinity. The homogeneity of BV45B was examined by isoelectric focusing (IEF). Several minor bands with pI differences of less than 0.1 units were visible, demonstrating the existence of some molecular variants of this protein. The N-terminal sequence analysis of the molecule was performed, and the following four amino acids were tentatively shown by sequential cleavage: NH2-Ala-Gly-Ile-Val-. The demonstration of one dominant N-terminal 1-dimethyl-amino-5-naphthalene sulphonyl (DNS)-amino acid by polyamide thin-layer chromatography at each sequence step confirmed that the N-terminal residue of the protein was not blocked, that the heterogeneity shown by the IEF system was merely due to the presence of several homologous polymorphic proteins with an identical N-terminal amino acid, and that the purification repertoire used was adequate. PMID:3957444

  6. Comparative sequence analyses of sixteen reptilian paramyxoviruses

    USGS Publications Warehouse

    Ahne, W.; Batts, W.N.; Kurath, G.; Winton, J.R.

    1999-01-01

    Viral genomic RNA of Fer-de-Lance virus (FDLV), a paramyxovirus highly pathogenic for reptiles, was reverse transcribed and cloned. Plasmids with significant sequence similarities to the hemagglutinin-neuraminidase (HN) and polymerase (L) genes of mammalian paramyxoviruses were identified by BLAST search. Partial sequences of the FDLV genes were used to design primers for amplification by nested polymerase chain reaction (PCR) and sequencing of 518-bp L gene and 352-bp HN gene fragments from a collection of 15 previously uncharacterized reptilian paramyxoviruses. Phylogenetic analyses of the partial L and HN sequences produced similar trees in which there were two distinct subgroups of isolates that were supported with maximum bootstrap values, and several intermediate isolates. Within each subgroup the nucleotide divergence values were less than 2.5%, while the divergence between the two subgroups was 20-22%. This indicated that the two subgroups represent distinct virus species containing multiple virus strains. The five intermediate isolates had nucleotide divergence values of 11-20% and may represent additional distinct species. In addition to establishing diversity among reptilian paramyxoviruses, the phylogenetic groupings showed some correlation with geographic location, and clearly demonstrated a low level of host species-specificity within these viruses. Copyright (C) 1999 Elsevier Science B.V.

  7. Analyses of Intestinal Microbiota: Culture versus Sequencing.

    PubMed

    Hiergeist, Andreas; Gläsner, Joachim; Reischl, Udo; Gessner, André

    2015-01-01

    Analyzing human as well as animal microbiota composition has gained growing interest because structural components and metabolites of microorganisms fundamentally influence all aspects of host physiology. Although culture-dependent methods originally dominated the exploration of these ecosystems, the development of molecular techniques such as high-throughput sequencing has dramatically increased our knowledge. Because many studies of the microbiota are based on the bacterial 16S ribosomal RNA (rRNA) gene targets, they can, at least in principle, be compared to determine the role of the microbiome composition for developmental processes, host metabolism, and physiology as well as different diseases. In our review, we will summarize differences and pitfalls in current experimental protocols, including all steps from nucleic acid extraction to bioinformatical analysis, which may produce variation that outweighs subtle biological differences. Future developments, such as integration of metabolomic, transcriptomic, and metagenomic data sets and standardization of the procedures, will be discussed. PMID:26323632

  8. High speed nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.

  9. Amino acid analyses of Apollo 14 samples.

    NASA Technical Reports Server (NTRS)

    Gehrke, C. W.; Zumwalt, R. W.; Kuo, K.; Aue, W. A.; Stalling, D. L.; Kvenvolden, K. A.; Ponnamperuma, C.

    1972-01-01

    Detection limits were between 300 pg and 1 ng for different amino acids, in an analysis by gas-liquid chromatography of water extracts from Apollo 14 lunar fines in which amino acids were converted to their N-trifluoro-acetyl-n-butyl esters. Initial analyses of water and HCl extracts of samples 14240 and 14298 showed no amino acids above background levels.

  10. Chip-based sequencing nucleic acids

    DOEpatents

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  11. Distinguishing Proteins From Arbitrary Amino Acid Sequences

    PubMed Central

    Yau, Stephen S.-T.; Mao, Wei-Guang; Benson, Max; He, Rong Lucy

    2015-01-01

    What kinds of amino acid sequences could possibly be protein sequences? From all existing databases that we can find, known proteins are only a small fraction of all possible combinations of amino acids. Beginning with Sanger's first detailed determination of a protein sequence in 1952, previous studies have focused on describing the structure of existing protein sequences in order to construct the protein universe. No one, however, has developed a criterion for determining whether an arbitrary amino acid sequence can be a protein. Here we show that when the collection of arbitrary amino acid sequences is viewed in an appropriate geometric context, the protein sequences cluster together. This leads to a new computational test, described here, that has proved to be remarkably accurate at determining whether an arbitrary amino acid sequence can be a protein. Even more, if the results of this test indicate that the sequence can be a protein, and it is indeed a protein sequence, then its identity as a protein sequence is uniquely defined. We anticipate our computational test will be useful for those who are attempting to complete the job of discovering all proteins, or constructing the protein universe. PMID:25609314

  12. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  13. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  14. p53-Regulated Networks of Protein, mRNA, miRNA, and lncRNA Expression Revealed by Integrated Pulsed Stable Isotope Labeling With Amino Acids in Cell Culture (pSILAC) and Next Generation Sequencing (NGS) Analyses.

    PubMed

    Hünten, Sabine; Kaller, Markus; Drepper, Friedel; Oeljeklaus, Silke; Bonfert, Thomas; Erhard, Florian; Dueck, Anne; Eichner, Norbert; Friedel, Caroline C; Meister, Gunter; Zimmer, Ralf; Warscheid, Bettina; Hermeking, Heiko

    2015-10-01

    We determined the effect of p53 activation on de novo protein synthesis using quantitative proteomics (pulsed stable isotope labeling with amino acids in cell culture/pSILAC) in the colorectal cancer cell line SW480. This was combined with mRNA and noncoding RNA expression analyses by next generation sequencing (RNA-, miR-Seq). Furthermore, genome-wide DNA binding of p53 was analyzed by chromatin-immunoprecipitation (ChIP-Seq). Thereby, we identified differentially regulated proteins (542 up, 569 down), mRNAs (1258 up, 415 down), miRNAs (111 up, 95 down) and lncRNAs (270 up, 123 down). Changes in protein and mRNA expression levels showed a positive correlation (r = 0.50, p < 0.0001). In total, we detected 133 direct p53 target genes that were differentially expressed and displayed p53 occupancy in the vicinity of their promoter. More transcriptionally induced genes displayed occupied p53 binding sites (4.3% mRNAs, 7.2% miRNAs, 6.3% lncRNAs, 5.9% proteins) than repressed genes (2.4% mRNAs, 3.2% miRNAs, 0.8% lncRNAs, 1.9% proteins), suggesting indirect mechanisms of repression. Around 50% of the down-regulated proteins displayed seed-matching sequences of p53-induced miRNAs in the corresponding 3'-UTRs. Moreover, proteins repressed by p53 significantly overlapped with those previously shown to be repressed by miR-34a. We confirmed up-regulation of the novel direct p53 target genes LINC01021, MDFI, ST14 and miR-486 and showed that ectopic LINC01021 expression inhibits proliferation in SW480 cells. Furthermore, KLF12, HMGB1 and CIT mRNAs were confirmed as direct targets of the p53-induced miR-34a, miR-205 and miR-486-5p, respectively. In line with the loss of p53 function during tumor progression, elevated expression of KLF12, HMGB1 and CIT was detected in advanced stages of cancer. In conclusion, the integration of multiple omics methods allowed the comprehensive identification of direct and indirect effectors of p53 that provide new insights and leads into the

  15. Pegasys: software for executing and integrating analyses of biological sequences

    PubMed Central

    Shah, Sohrab P; He, David YM; Sawkins, Jessica N; Druce, Jeffrey C; Quon, Gerald; Lett, Drew; Zheng, Grace XY; Xu, Tao; Ouellette, BF Francis

    2004-01-01

    Background We present Pegasys – a flexible, modular and customizable software system that facilitates the execution and data integration from heterogeneous biological sequence analysis tools. Results The Pegasys system includes numerous tools for pair-wise and multiple sequence alignment, ab initio gene prediction, RNA gene detection, masking repetitive sequences in genomic DNA as well as filters for database formatting and processing raw output from various analysis tools. We introduce a novel data structure for creating workflows of sequence analyses and a unified data model to store its results. The software allows users to dynamically create analysis workflows at run-time by manipulating a graphical user interface. All non-serial dependent analyses are executed in parallel on a compute cluster for efficiency of data generation. The uniform data model and backend relational database management system of Pegasys allow for results of heterogeneous programs included in the workflow to be integrated and exported into General Feature Format for further analyses in GFF-dependent tools, or GAME XML for import into the Apollo genome editor. The modularity of the design allows for new tools to be added to the system with little programmer overhead. The database application programming interface allows programmatic access to the data stored in the backend through SQL queries. Conclusions The Pegasys system enables biologists and bioinformaticians to create and manage sequence analysis workflows. The software is released under the Open Source GNU General Public License. All source code and documentation is available for download at . PMID:15096276
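    The scheduling idea described above, where analyses that do not depend on one another run in parallel, can be sketched in Python as a grouping of a dependency graph into batches. This is not the Pegasys data structure or API; the function, the dictionary format, and the step names are hypothetical and serve only to illustrate the concept.

```python
def parallel_batches(dependencies):
    """Group workflow steps into batches that could run concurrently.

    `dependencies` maps each step name to the set of steps it depends on.
    Every step in a batch has all of its dependencies in earlier batches.
    """
    remaining = {step: set(deps) for step, deps in dependencies.items()}
    # Make sure steps that appear only as dependencies are also tracked.
    for deps in dependencies.values():
        for dep in deps:
            remaining.setdefault(dep, set())
    batches = []
    while remaining:
        ready = sorted(step for step, deps in remaining.items() if not deps)
        if not ready:
            raise ValueError("cycle detected in workflow dependencies")
        batches.append(ready)
        for step in ready:
            del remaining[step]
        for deps in remaining.values():
            deps.difference_update(ready)
    return batches


# Hypothetical workflow: repeat masking feeds two independent analyses,
# whose results are then exported together.
workflow = {
    "mask_repeats": set(),
    "gene_prediction": {"mask_repeats"},
    "blast_search": {"mask_repeats"},
    "export_gff": {"gene_prediction", "blast_search"},
}
print(parallel_batches(workflow))
```

    Each inner list could be dispatched to a compute cluster at once, with the next batch starting after the current one finishes.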

  16. Comparative analyses of lysophosphatidic acid receptor-mediated signaling.

    PubMed

    Fukushima, Nobuyuki; Ishii, Shoichi; Tsujiuchi, Toshifumi; Kagawa, Nao; Katoh, Kazutaka

    2015-06-01

    Lysophosphatidic acid (LPA) is a bioactive lipid mediator that activates G protein-coupled LPA receptors to exert fundamental cellular functions. Six LPA receptor genes have been identified in vertebrates and are classified into two subfamilies, the endothelial differentiation genes (edg) and the non-edg family. Studies using genetically engineered mice, frogs, and zebrafish have demonstrated that LPA receptor-mediated signaling has biological, developmental, and pathophysiological functions. Computational analyses have also identified several amino acids (aa) critical for LPA recognition by human LPA receptors. This review focuses on the evolutionary aspects of LPA receptor-mediated signaling by comparing the aa sequences of vertebrate LPA receptors and LPA-producing enzymes; it also summarizes the LPA receptor-dependent effects commonly observed in mouse, frog, and fish. PMID:25732591

  17. Amino Acid Analyses of Acid Hydrolysates in Desert Varnish

    NASA Technical Reports Server (NTRS)

    Perry, Randall S.; Staley, James T.; Dworkin, Jason P.; Engel, Mike

    2001-01-01

    There has long been a debate as to whether rock varnish deposits are microbially mediated or are deposited by inorganic processes. Varnished rocks are found throughout the world primarily in arid and semi-arid regions. The varnish coats are typically up to 200 microns thick and are composed of clays and alternating layers enriched in manganese and iron oxides. The individual layers range in thickness from 1 micron to greater than 10 microns and may continue laterally for more than 100 microns. Overlapping botryoidal structures are visible in thin section and scanning electron micrographs. The coatings also include small amounts of organic matter and detrital grains. Amino-acid hydrolysates offer a means of assessing the organic composition of rock varnish collected from the Sonoran Desert, near Phoenix, AZ. Chromatographic analyses of hydrolysates from powdered samples of rock varnish suggest that the interior of rock varnish is relatively enriched in amino acids and specifically in d-alanine and glutamic acid. Peptidoglycan (murein) is the main structural component of gram-positive bacterial cell walls. The d-enantiomers of alanine and glutamic acid are specific to peptidoglycan and are consequently an indicator for the presence of bacteria. D-alanine is also found in teichoic acid, which is only found in gram-positive bacteria. Several researchers have cultured bacteria from the surface of rock varnish and most have been gram-positive, suggesting that gram-positive bacteria are intimately associated with varnish coatings and may play a role in the formation of varnish coatings.

  18. Sequence and structural analyses of interleukin-8-like chemokine superfamily.

    PubMed

    Kanagarajadurai, Karuppiah; Sowdhamini, Ramanathan

    2008-01-01

    Interleukin-8 and related chemokines are small proteins that bind to receptors belonging to the large family of G-protein-coupled receptors. They can cause migration of cells like neutrophils and eosinophils and some of them are implicated in angiogenic diseases. More than 40 subfamilies of these ligands are known that share poor sequence similarity and display receptor specificity. There is very little structural information about the mode of binding between ligands and the receptors. We have employed multi-fold sensitive sequence search methods to provide a repertoire of 252 putative interleukin-8 proteins and homologues, which are shared across humans, aves and fish. The sequences can be organized into five major known clusters. The propensity of occurrence of certain amino acid alphabets is found to be specific in different locations of the polypeptide fold. The sequence dispersion is also observed to be cluster-specific when examined by Evolutionary Trace procedure. Amino acid alphabet analysis and Evolutionary Trace procedure reveal cluster-specific amino acid distribution that provide clues about how the small fold of the ligand could display remarkable receptor specificity. We notice regions, like the beta1-beta2 loop of the fold, that are potentially involved in receptor recognition and specificity that could be potential sites for residue mutations. Systematic studies of the distribution patterns enable better understanding of the evolution and molecular recognition of this important and diverse protein superfamily. PMID:19032164

  19. Sequence and Structural Analyses for Functional Non-coding RNAs

    NASA Astrophysics Data System (ADS)

    Sakakibara, Yasubumi; Sato, Kengo

    Analysis and detection of functional RNAs are currently important topics in both molecular biology and bioinformatics research. Several computational methods based on stochastic context-free grammars (SCFGs) have been developed for modeling and analysing functional RNA sequences. These grammatical methods have succeeded in modeling typical secondary structures of RNAs and are used for structural alignments of RNA sequences. Such stochastic models, however, are not sufficient to discriminate member sequences of an RNA family from non-members, and hence to detect non-coding RNA regions from genome sequences. Recently, the support vector machine (SVM) and kernel function techniques have been actively studied and proposed as a solution to various problems in bioinformatics. SVMs are trained from positive and negative samples and have strong, accurate discrimination abilities, and hence are more appropriate for the discrimination tasks. A few kernel functions that extend the string kernel to measure the similarity of two RNA sequences from the viewpoint of secondary structures have been proposed. In this article, we give an overview of recent progress in SCFG-based methods for RNA sequence analysis and novel kernel functions tailored to measure the similarity of two RNA sequences and developed for use with support vector machines (SVM) in discriminating members of an RNA family from non-members.
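    To make the notion of a string kernel concrete, here is a minimal Python sketch of the plain k-mer spectrum kernel, which scores two sequences by the k-mers they share. It is only the basic sequence-level kernel; the structure-aware kernels the article surveys extend this idea and are not reproduced here. The function names and the default k are assumptions made for the sketch.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count every overlapping substring of length k in `seq`."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(a, b, k=3):
    """Inner product of the k-mer count vectors of sequences a and b."""
    counts_a, counts_b = kmer_counts(a, k), kmer_counts(b, k)
    # A Counter returns 0 for k-mers it has not seen, so unshared
    # k-mers contribute nothing to the sum.
    return sum(count * counts_b[kmer] for kmer, count in counts_a.items())


print(spectrum_kernel("GGAUC", "GGAUC", k=2))
print(spectrum_kernel("ACGU", "UGCA", k=2))
```

    A kernel of this kind can be supplied to an SVM implementation as a precomputed similarity matrix over the training sequences.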

  20. Sequencing and comparative analyses of the genomes of zoysiagrasses.

    PubMed

    Tanaka, Hidenori; Hirakawa, Hideki; Kosugi, Shunichi; Nakayama, Shinobu; Ono, Akiko; Watanabe, Akiko; Hashiguchi, Masatsugu; Gondo, Takahiro; Ishigaki, Genki; Muguerza, Melody; Shimizu, Katsuya; Sawamura, Noriko; Inoue, Takayasu; Shigeki, Yuichi; Ohno, Naoki; Tabata, Satoshi; Akashi, Ryo; Sato, Shusei

    2016-04-01

    Zoysia is a warm-season turfgrass, which comprises 11 allotetraploid species (2n = 4x = 40), each possessing different morphological and physiological traits. To characterize the genetic systems of Zoysia plants and to analyse their structural and functional differences in individual species and accessions, we sequenced the genomes of Zoysia species using HiSeq and MiSeq platforms. As a reference sequence of Zoysia species, we generated a high-quality draft sequence of the genome of Z. japonica accession 'Nagirizaki' (334 Mb) in which 59,271 protein-coding genes were predicted. In parallel, draft genome sequences of Z. matrella 'Wakaba' and Z. pacifica 'Zanpa' were also generated for comparative analyses. To investigate the genetic diversity among the Zoysia species, genome sequence reads of three additional accessions, Z. japonica 'Kyoto', Z. japonica 'Miyagi' and Z. matrella 'Chiba Fair Green', were accumulated, and aligned against the reference genome of 'Nagirizaki' along with those from 'Wakaba' and 'Zanpa'. As a result, we detected 7,424,163 single-nucleotide polymorphisms and 852,488 short indels among these species. The information obtained in this study will be valuable for basic studies on zoysiagrass evolution and genetics as well as for the breeding of zoysiagrasses, and is made available in the 'Zoysia Genome Database' at http://zoysia.kazusa.or.jp. PMID:26975196

  1. Sequencing and comparative analyses of the genomes of zoysiagrasses

    PubMed Central

    Tanaka, Hidenori; Hirakawa, Hideki; Kosugi, Shunichi; Nakayama, Shinobu; Ono, Akiko; Watanabe, Akiko; Hashiguchi, Masatsugu; Gondo, Takahiro; Ishigaki, Genki; Muguerza, Melody; Shimizu, Katsuya; Sawamura, Noriko; Inoue, Takayasu; Shigeki, Yuichi; Ohno, Naoki; Tabata, Satoshi; Akashi, Ryo; Sato, Shusei

    2016-01-01

    Zoysia is a warm-season turfgrass, which comprises 11 allotetraploid species (2n = 4x = 40), each possessing different morphological and physiological traits. To characterize the genetic systems of Zoysia plants and to analyse their structural and functional differences in individual species and accessions, we sequenced the genomes of Zoysia species using HiSeq and MiSeq platforms. As a reference sequence of Zoysia species, we generated a high-quality draft sequence of the genome of Z. japonica accession ‘Nagirizaki’ (334 Mb) in which 59,271 protein-coding genes were predicted. In parallel, draft genome sequences of Z. matrella ‘Wakaba’ and Z. pacifica ‘Zanpa’ were also generated for comparative analyses. To investigate the genetic diversity among the Zoysia species, genome sequence reads of three additional accessions, Z. japonica ‘Kyoto’, Z. japonica ‘Miyagi’ and Z. matrella ‘Chiba Fair Green’, were accumulated, and aligned against the reference genome of ‘Nagirizaki’ along with those from ‘Wakaba’ and ‘Zanpa’. As a result, we detected 7,424,163 single-nucleotide polymorphisms and 852,488 short indels among these species. The information obtained in this study will be valuable for basic studies on zoysiagrass evolution and genetics as well as for the breeding of zoysiagrasses, and is made available in the ‘Zoysia Genome Database’ at http://zoysia.kazusa.or.jp. PMID:26975196

  2. Phenolic acid esterases, coding sequences and methods

    DOEpatents

    Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.

    2002-01-01

    Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.

  3. Amino-Acid Sequence of Porcine Pepsin

    PubMed Central

    Tang, J.; Sepulveda, P.; Marciniszyn, J.; Chen, K. C. S.; Huang, W-Y.; Tao, N.; Liu, D.; Lanier, J. P.

    1973-01-01

    As the culmination of several years of experiments, we propose a complete amino-acid sequence for porcine pepsin, an enzyme containing 327 amino-acid residues in a single polypeptide chain. In the sequence determination, the enzyme was treated with cyanogen bromide. Five resulting fragments were purified. The amino-acid sequence of four of the fragments accounted for 290 residues. Because the structure of a 37-residue carboxyl-terminal fragment was already known, it was not studied. The alignment of these fragments was determined from the sequence of methionyl-peptides we had previously reported. We also discovered the locations of active-site aspartyl residues, as well as the pairing of the three disulfide bridges. A minor component of commercial crystalline pepsin was found to contain two extra amino-acid residues, Ala-Leu-, at the amino-terminus of the molecule. This minor component was apparently derived from a different site of cleavage during the activation of porcine pepsinogen. PMID:4587252

  4. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

  5. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-07-21

    A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

  6. Whale song analyses using bioinformatics sequence analysis approaches

    NASA Astrophysics Data System (ADS)

    Chen, Yian A.; Almeida, Jonas S.; Chou, Lien-Siang

    2005-04-01

    Animal songs are frequently analyzed using discrete hierarchical units, such as units, themes and songs. Because animal songs and bio-sequences may be understood as analogous, bioinformatics analysis tools (DNA/protein sequence alignment and alignment-free methods) are proposed to quantify the theme similarities of the songs of false killer whales recorded off northeast Taiwan. The eighteen themes with discrete units that were identified in an earlier study [Y. A. Chen, master's thesis, University of Charleston, 2001] were compared quantitatively using several distance metrics. These metrics included the scores calculated using the Smith-Waterman algorithm with the repeated procedure, the standardized Euclidean distance, and the angle metrics based on word frequencies. The theme classifications based on different metrics were summarized and compared in dendrograms using cluster analyses. The results agree qualitatively with earlier classifications derived by human observation. These methods further quantify the similarities among themes. These methods could be applied to the analyses of other animal songs on a larger scale. For instance, these techniques could be used to investigate song evolution and cultural transmission by quantifying the dissimilarities of humpback whale songs across different seasons, years, populations, and geographic regions. [Work supported by SC Sea Grant, and Ilan County Government, Taiwan.]
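
    Two of the metrics named above can be sketched briefly. The example below is only an illustration under stated assumptions: themes are treated as sequences of unit labels, the scoring values (match = 2, mismatch = -1, gap = -1) and the word length n = 2 are arbitrary choices, and neither the "repeated procedure" nor the standardization of the Euclidean distance used in the study is reproduced.

```python
from collections import Counter
from math import acos, sqrt


def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-1):
    """Best local alignment score of a and b with a linear gap penalty."""
    prev = [0] * (len(b) + 1)
    best = 0
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            diag = prev[j - 1] + (match if x == y else mismatch)
            # Local alignment: a cell never drops below zero.
            curr.append(max(0, diag, prev[j] + gap, curr[j - 1] + gap))
        best = max(best, max(curr))
        prev = curr
    return best


def word_frequencies(units, n=2):
    """Relative frequency of each n-unit word; assumes len(units) >= n."""
    words = [tuple(units[i:i + n]) for i in range(len(units) - n + 1)]
    return {w: c / len(words) for w, c in Counter(words).items()}


def angle_between(f, g):
    """Angle in radians between two non-empty word-frequency vectors."""
    dot = sum(v * g.get(w, 0.0) for w, v in f.items())
    norm = sqrt(sum(v * v for v in f.values())) * sqrt(sum(v * v for v in g.values()))
    # Clamp to guard against floating-point drift outside [-1, 1].
    return acos(max(-1.0, min(1.0, dot / norm)))
```

    Pairwise scores or angles computed this way could then be passed to a hierarchical clustering routine to draw dendrograms like those described in the abstract.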

  7. Methods for analyzing nucleic acid sequences

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support. A fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  8. Note on the chromatographic analyses of marine polyunsaturated fatty acids

    USGS Publications Warehouse

    Schultz, D.M.; Quinn, J.G.

    1977-01-01

    Gas-liquid chromatography was used to study the effects of saponification/methylation and thin-layer chromatographic isolation on the analyses of polyunsaturated fatty acids. Using selected procedures, the qualitative and quantitative distribution of these acids in marine organisms can be determined with a high degree of accuracy. © 1977 Springer-Verlag.

  9. Detection of nucleic acid sequences by invader-directed cleavage

    DOEpatents

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge.

  10. Complete chloroplast genome sequences of Solanum bulbocastanum, Solanum lycopersicum and comparative analyses with other Solanaceae genomes.

    PubMed

    Daniell, Henry; Lee, Seung-Bum; Grevich, Justin; Saski, Christopher; Quesada-Vargas, Tania; Guda, Chittibabu; Tomkins, Jeffrey; Jansen, Robert K

    2006-05-01

    Despite the agricultural importance of both potato and tomato, very little is known about their chloroplast genomes. Analysis of the complete sequences of tomato, potato, tobacco, and Atropa chloroplast genomes reveals significant insertions and deletions within certain coding regions or regulatory sequences (e.g., deletion of repeated sequences within 16S rRNA, ycf2 or ribosomal binding sites in ycf2). RNA, photosynthesis, and atp synthase genes are the least divergent and the most divergent genes are clpP, cemA, ccsA, and matK. Repeat analyses identified 33-45 direct and inverted repeats ≥30 bp with a sequence identity of at least 90%; all but five of the repeats shared by all four Solanaceae genomes are located in the same genes or intergenic regions, suggesting a functional role. A comprehensive genome-wide analysis of all coding sequences and intergenic spacer regions was done for the first time in chloroplast genomes. Only four spacer regions are fully conserved (100% sequence identity) among all genomes; deletions or insertions within some intergenic spacer regions result in less than 25% sequence identity, underscoring the importance of choosing appropriate intergenic spacers for plastid transformation and providing valuable new information for phylogenetic utility of the chloroplast intergenic spacer regions. Comparison of coding sequences with expressed sequence tags showed a considerable amount of variation, resulting in amino acid changes; none of the C-to-U conversions observed in potato and tomato were conserved in tobacco and Atropa. It is possible that there has been a loss of conserved editing sites in potato and tomato. PMID:16575560

  11. Sequence and structural analyses of nuclear export signals in the NESdb database

    PubMed Central

    Xu, Darui; Farmer, Alicia; Collett, Garen; Grishin, Nick V.; Chook, Yuh Min

    2012-01-01

    We compiled >200 nuclear export signal (NES)–containing CRM1 cargoes in a database named NESdb. We analyzed the sequences and three-dimensional structures of natural, experimentally identified NESs and of false-positive NESs that were generated from the database in order to identify properties that might distinguish the two groups of sequences. Analyses of amino acid frequencies, sequence logos, and agreement with existing NES consensus sequences revealed strong preferences for the Φ1-X3-Φ2-X2-Φ3-X-Φ4 pattern and for negatively charged amino acids in the nonhydrophobic positions of experimentally identified NESs but not of false positives. Strong preferences against certain hydrophobic amino acids in the hydrophobic positions were also revealed. These findings led to a new and more precise NES consensus. More important, three-dimensional structures are now available for 68 NESs within 56 different cargo proteins. Analyses of these structures showed that experimentally identified NESs are more likely than the false positives to adopt α-helical conformations that transition to loops at their C-termini and more likely to be surface accessible within their protein domains or be present in disordered or unobserved parts of the structures. Such distinguishing features for real NESs might be useful in future NES prediction efforts. Finally, we also tested CRM1-binding of 40 NESs that were found in the 56 structures. We found that 16 of the NES peptides did not bind CRM1, hence illustrating how NESs are easily misidentified. PMID:22833565
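
    The Φ1-X3-Φ2-X2-Φ3-X-Φ4 spacing quoted above translates directly into a regular expression. The sketch below is a naive pattern scan, not the refined consensus the authors derived; the hydrophobic residue set (L, I, V, F, M) is an assumption for illustration, and, as the abstract itself stresses, matches of this kind are prone to false positives.

```python
import re

# Assumed hydrophobic set for the phi positions; the abstract does not list one.
HYDROPHOBIC = "LIVFM"

# phi1-X3-phi2-X2-phi3-X-phi4, wrapped in a lookahead so that
# overlapping candidates are all reported.
NES_PATTERN = re.compile(
    rf"(?=([{HYDROPHOBIC}].{{3}}[{HYDROPHOBIC}].{{2}}[{HYDROPHOBIC}].[{HYDROPHOBIC}]))"
)


def find_nes_candidates(protein):
    """Return (start, matched 10-residue segment) for every pattern hit."""
    return [(m.start(), m.group(1)) for m in NES_PATTERN.finditer(protein)]
```

    For a toy peptide, find_nes_candidates("GSLALKLAGLDIGS") reports a single hit starting at index 2. Structural filters such as helicity and surface accessibility would still be needed to separate real NESs from sequence-only matches.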

  12. Genomic sequencing and analyses of Lymantria xylina multiple nucleopolyhedrovirus

    PubMed Central

    2010-01-01

    Background Outbreaks of the casuarina moth, Lymantria xylina Swinhoe (Lepidoptera: Lymantriidae), which is a very important forest pest in Taiwan, have occurred every five to 10 years. This moth has expanded its range of host plants to include more than 65 species of broadleaf trees. LyxyMNPV (L. xylina multiple nucleopolyhedrovirus) is highly virulent to the casuarina moth and has been investigated as a possible biopesticide for controlling this moth. LdMNPV-like virus has also been isolated from Lymantria xylina larvae but LyxyMNPV was more virulent than LdMNPV-like virus both in NTU-LY and IPLB-LD-652Y cell lines. To better understand LyxyMNPV, the nucleotide sequence of the LyxyMNPV DNA genome was determined and analysed. Results The genome of LyxyMNPV consists of 156,344 bases, has a G+C content of 53.4% and contains 157 putative open reading frames (ORFs). The gene content and gene order of LyxyMNPV were similar to those of LdMNPV, with 151 ORFs identified as homologous to those reported in the LdMNPV genome. Two genes (Lyxy49 and Lyxy123) were homologous to other baculoviruses, and four unique LyxyMNPV ORFs (Lyxy11, Lyxy19, Lyxy130 and Lyxy131) were identified in the LyxyMNPV genome, including a gag-like gene that was not reported in baculoviruses. LdMNPV contains 23 ORFs that are absent in LyxyMNPV. Readily identifiable homologues of the gene host range factor-1 (hrf-1), which appears to be involved in the susceptibility of L. dispar to NPV infection, were not present in LyxyMNPV. Additionally, two putative odv-e27 homologues were identified in LyxyMNPV. The LyxyMNPV genome encoded 14 bro genes compared with 16 in LdMNPV, which occupied more than 8% of the LyxyMNPV genome. Thirteen homologous regions (hrs) were identified containing 48 repeated sequences composed of 30-bp imperfect palindromes. However, they differed in the relative positions, number of repeats and orientation in the genome compared to LdMNPV. Conclusion The gene parity plot analysis

  13. Hybridization and sequencing of nucleic acids using base pair mismatches

    DOEpatents

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  14. Reclassification of ascomycetous yeasts from gene sequence analyses

    Technology Transfer Automated Retrieval System (TEKTRAN)

    During the past decade, identification of yeasts and their classification has been based almost exclusively on gene sequence analysis. Primarily as a result of using diagnostic gene sequences, such as D1/D2 LSU and ITS ribosomal RNAs, the number of known species has doubled. With the faster sequen...

  15. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of the sequence listing in accordance with the requirements in 37 CFR...

  16. Nucleic acid renaturation and restriction endonuclease cleavage analyses show that the DNAs of a transforming and a nontransforming strain of Epstein-Barr virus share approximately 90% of their nucleotide sequences.

    PubMed Central

    Sugden, B; Summers, W C; Klein, G

    1976-01-01

    Viral DNA molecules were purified from a nontransforming and a transforming strain of Epstein-Barr virus. Each viral DNA was labeled in vitro and renatured in the presence of an excess of either one or the other unlabeled viral DNA. Both viral DNAs were also digested with the EcoR1 restriction endonuclease and subsequently labeled by using avian myeloblastosis virus DNA polymerase to repair either the EcoR1 nuclease-generated single-stranded ends of the DNAs or their single-stranded ends produced by a second digestion with exonuclease III after the first EcoR1 nuclease digestion. The results of these experiments support three general conclusions: (i) the DNAs of these two strains of Epstein-Barr virus share approximately 90% of their nucleotide sequences; (ii) both viral DNA populations are reasonably homogeneous; and (iii) both DNAs contain repetitions or inverted repetitions of some of their nucleotide sequences. PMID:178907

  17. Analyses of response-stimulus sequences in descriptive observations.

    PubMed

    Samaha, Andrew L; Vollmer, Timothy R; Borrero, Carrie; Sloman, Kimberly; Pipkin, Claire St Peter; Bourret, Jason

    2009-01-01

    Descriptive observations were conducted to record problem behavior displayed by participants and to record antecedents and consequences delivered by caregivers. Next, functional analyses were conducted to identify reinforcers for problem behavior. Then, using data from the descriptive observations, lag-sequential analyses were conducted to examine changes in the probability of environmental events across time in relation to occurrences of problem behavior. The results of the lag-sequential analyses were interpreted in light of the results of functional analyses. Results suggested that events identified as reinforcers in a functional analysis followed behavior in idiosyncratic ways: after a range of delays and frequencies. Thus, it is possible that naturally occurring reinforcement contingencies are arranged in ways different from those typically evaluated in applied research. Further, these complex response-stimulus relations can be represented by lag-sequential analyses. However, limitations to the lag-sequential analysis are evident. PMID:19949537
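
    The core quantity in a lag-sequential analysis is a conditional probability: how often a given event follows the behavior at a fixed lag. The sketch below is a minimal version under stated assumptions. It treats the observation record as a list of one coded event per interval, and the event codes and function name are hypothetical rather than taken from the study.

```python
def lag_probability(events, given, target, lag):
    """Estimate P(target at position i + lag | given at position i).

    events is a list of one coded event per observation interval and lag
    is a non-negative number of intervals. Returns None when `given`
    never occurs early enough in the record to be followed at that lag.
    """
    starts = [i for i in range(len(events) - lag) if events[i] == given]
    if not starts:
        return None
    hits = sum(1 for i in starts if events[i + lag] == target)
    return hits / len(starts)
```

    With the hypothetical codes B (problem behavior), A (caregiver attention) and N (no event), the record ["B", "A", "N", "B", "N", "A", "B", "A"] gives a probability of 2/3 for attention at lag 1 and 1/2 at lag 2. Comparing such values across lags is what shows whether a consequence tends to follow behavior immediately or after a delay.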

  18. Predicting intrinsic disorder from amino acid sequence.

    PubMed

    Obradovic, Zoran; Peng, Kang; Vucetic, Slobodan; Radivojac, Predrag; Brown, Celeste J; Dunker, A Keith

    2003-01-01

    Blind predictions of intrinsic order and disorder were made on 42 proteins subsequently revealed to contain 9,044 ordered residues, 284 disordered residues in 26 segments of length 30 residues or less, and 281 disordered residues in 2 disordered segments of length greater than 30 residues. The accuracies of the six predictors used in this experiment ranged from 77% to 91% for the ordered regions and from 56% to 78% for the disordered segments. The average of the order and disorder predictions ranged from 73% to 77%. The prediction of disorder in the shorter segments was poor, from 25% to 66% correct, while the prediction of disorder in the longer segments was better, from 75% to 95% correct. Four of the predictors were composed of ensembles of neural networks. This enabled them to deal more efficiently with the large asymmetry in the training data through diversified sampling from the significantly larger ordered set and achieve better accuracy on ordered and long disordered regions. The exclusive use of long disordered regions for predictor training likely contributed to the disparity of the predictions on long versus short disordered regions, while averaging the output values over 61-residue windows to eliminate short predictions of order or disorder probably contributed to the even greater disparity for three of the predictors. This experiment supports the predictability of intrinsic disorder from amino acid sequence. PMID:14579347
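
    The effect of window averaging on short segments can be made concrete. The sketch below applies a centered moving average to per-residue disorder scores. It is an assumption-laden illustration: the predictors' actual smoothing code is not described in the abstract, and truncating the window at the chain ends is a choice made here for simplicity.

```python
def smooth_scores(scores, window=61):
    """Centered moving average; the window is truncated at the sequence ends."""
    half = window // 2
    smoothed = []
    for i in range(len(scores)):
        lo, hi = max(0, i - half), min(len(scores), i + half + 1)
        smoothed.append(sum(scores[lo:hi]) / (hi - lo))
    return smoothed
```

    For example, a 10-residue segment scored 1.0 inside a long stretch of zeros peaks at only 10/61 (about 0.16) after 61-residue smoothing, well below a 0.5 decision threshold. This is consistent with the abstract's remark that the averaging probably eliminated short predictions of disorder.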

  19. Sequence and intramolecular distance scoring analyses of microbial rhodopsins

    PubMed Central

    Asano, Miki; Ide, Shunta; Kamata, Atsushi; Takahasi, Kiyohiro; Okada, Tetsuji

    2016-01-01

    Recent accumulation of sequence and structural data, in conjunction with systematic classification into a set of families, has significantly advanced our understanding of diverse and specific protein functions. Analysis and interpretation of protein family data requires comprehensive sequence and structural alignments. Here, we present a simple scheme for analyzing a set of experimental structures of a given protein or family of proteins, using microbial rhodopsins as an example. For a data set comprising around a dozen structures highly similar to each other (overall pairwise root-mean-squared deviation < 2.3 Å), intramolecular distance scoring analysis yielded valuable information with respect to structural properties, such as differences in the relative variability of transmembrane helices. Furthermore, a comparison with recent results for G protein-coupled receptors demonstrates how the results of the present analysis can be interpreted and effectively utilized for structural characterization of diverse protein families in general. PMID:26998236

  20. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2002-01-01

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  1. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2006-07-04

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  2. Kit for detecting nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2001-01-01

    A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the

  3. Evidence for Balancing Selection from Nucleotide Sequence Analyses of Human G6PD

    PubMed Central

    Verrelli, Brian C.; McDonald, John H.; Argyropoulos, George; Destro-Bisol, Giovanni; Froment, Alain; Drousiotou, Anthi; Lefranc, Gerard; Helal, Ahmed N.; Loiselet, Jacques; Tishkoff, Sarah A.

    2002-01-01

    Glucose-6-phosphate dehydrogenase (G6PD) mutations that result in reduced enzyme activity have been implicated in malarial resistance and constitute one of the best examples of selection in the human genome. In the present study, we characterize the nucleotide diversity across a 5.2-kb region of G6PD in a sample of 160 Africans and 56 non-Africans, to determine how selection has shaped patterns of DNA variation at this gene. Our global sample of enzymatically normal B alleles and A, A−, and Med alleles with reduced enzyme activities reveals many previously uncharacterized silent-site polymorphisms. In comparison with the absence of amino acid divergence between human and chimpanzee G6PD sequences, we find that the number of G6PD amino acid polymorphisms in human populations is significantly high. Unlike many other G6PD-activity alleles with reduced activity, we find that the age of the A variant, which is common in Africa, may not be consistent with the recent emergence of severe malaria and therefore may have originally had a historically different adaptive function. Overall, our observations strongly support previous genotype-phenotype association studies that proposed that balancing selection maintains G6PD deficiencies within human populations. The present study demonstrates that nucleotide sequence analyses can reveal signatures of both historical and recent selection in the genome and may elucidate the impact that infectious disease has had during human evolution. PMID:12378426
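
    Nucleotide diversity, the summary statistic characterized above, is conventionally the average number of pairwise differences per site among sampled sequences. The sketch below is a minimal version that assumes pre-aligned, equal-length sequences with no gaps or missing data; it is not the authors' pipeline, and the function name is made up for this example.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Average pairwise differences per site (pi) for aligned sequences.

    Assumes at least two sequences, all of the same length, with no gaps.
    """
    length = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    # Count mismatching sites for every unordered pair of sequences.
    diffs = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return diffs / (len(pairs) * length)
```

    Applied to the three toy alleles "ACGT", "ACGT" and "ACGA", the function returns 1/6: two of the three pairs differ at one of four sites.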

  4. Sequence analyses of type IV pili from Vibrio cholerae, Vibrio parahaemolyticus, and Vibrio vulnificus.

    PubMed

    Aagesen, Alisha M; Häse, Claudia C

    2012-08-01

    Bacterial surface structures called pili have been studied extensively for their role as possible colonization factors. Most sequenced Vibrio genomes predict a variety of pili genes in these organisms, including several types of type IV pili. In particular, the mannose-sensitive hemagglutinin (MSHA) and the PilA pili, also known as the chitin-regulated pilus (ChiRP), are type IVa pili commonly found in Vibrio genomes and have been shown to play a role in the colonization of Vibrio species in the environment and/or host tissue. Here, we report sequence comparisons of two type IVa pilin subunit genes, mshA and pilA, and their corresponding amino acid sequences, for several strains from the three main human pathogenic Vibrio species, V. cholerae, V. parahaemolyticus, and V. vulnificus. We identified specific groupings of these two genes in V. cholerae, whereas V. parahaemolyticus and V. vulnificus strains had no apparent allelic clusters, and these genes were strikingly divergent. These results were compared with other genes from the MSHA and PilA operons as well as another Vibrio pilus from the type IVb group, the toxin co-regulated pilus (TCP) from V. cholerae. Our data suggest that a selective pressure exists to cause these strains to vary their MSHA and PilA pilin subunits. Interestingly, V. cholerae strains possessing TCP have the same allele for both mshA and pilA. In contrast, V. cholerae isolates without TCP have polymorphisms in their mshA and pilA sequences similar to what was observed for both V. parahaemolyticus and V. vulnificus. These data suggest a possible linkage between host interactions and maintaining a highly conserved type IV pili sequence in V. cholerae. Although the mechanism underlying this intriguing diversity has yet to be elucidated, our analyses are an important first step towards gaining insights into the various aspects of Vibrio ecology. PMID:22383120

  5. Comparative ribosomal protein sequence analyses of a phylogenetically defined genus, Pseudomonas, and its relatives.

    PubMed

    Ochi, K

    1995-04-01

    I analyzed various families of ribosomal proteins obtained from selected species belonging to the genus Pseudomonas sensu stricto and allied organisms which were previously classified in the genus Pseudomonas. Partial amino acid sequencing of L30 preparations revealed that the strains which I examined could be divided into three clusters. The first cluster, which was assigned to the genus Pseudomonas sensu stricto, included Pseudomonas aeruginosa, Pseudomonas putida, Pseudomonas mendocina, and Pseudomonas fluorescens. The second cluster included Burkholderia pickettii and Burkholderia plantarii. The third cluster, which was a deeply branching cluster in the stem of gram-negative bacteria, included Brevundimonas diminuta and Brevundimonas vesicularis. Despite the different levels of conservation of the N-terminal sequences of ribosomal protein families (the highest level of similarity was 74% for L27 proteins and the lowest level of similarity was 42% for L30 proteins), similar phylogenetic trees were constructed by using data obtained from sequence analyses of various ribosomal protein families, including the S20, S21, L27, L29, L31, L32, and L33 protein families. Thus, I demonstrated the efficacy of ribosomal protein analysis in bacterial taxonomy. PMID:7727274

  6. Solid phase sequencing of double-stranded nucleic acids

    DOEpatents

    Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

    2002-01-01

    This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.

  7. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    States, David J.

    2004-07-28

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  8. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    David J. States

    1998-08-01

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  9. Matrix genes of measles virus and canine distemper virus: cloning, nucleotide sequences, and deduced amino acid sequences.

    PubMed Central

    Bellini, W J; Englund, G; Richardson, C D; Rozenblatt, S; Lazzarini, R A

    1986-01-01

    The nucleotide sequences encoding the matrix (M) proteins of measles virus (MV) and canine distemper virus (CDV) were determined from cDNA clones containing these genes in their entirety. In both cases, single open reading frames specifying basic proteins of 335 amino acid residues were predicted from the nucleotide sequences. Both viral messages were composed of approximately 1,450 nucleotides and contained 400 nucleotides of presumptive noncoding sequences at their respective 3' ends. MV and CDV M-protein-coding regions were 67% homologous at the nucleotide level and 76% homologous at the amino acid level. Only chance homology was observed in the 400-nucleotide trailer sequences. Comparisons of the M protein sequences of MV and CDV with the sequence reported for Sendai virus (B. M. Blumberg, K. Rose, M. G. Simona, L. Roux, C. Giorgi, and D. Kolakofsky, J. Virol. 52:656-663; Y. Hidaka, T. Kanda, K. Iwasaki, A. Nomoto, T. Shioda, and H. Shibuta, Nucleic Acids Res. 12:7965-7973) indicated the greatest homology among these M proteins in the carboxyterminal third of the molecule. Secondary-structure analyses of this shared region indicated a structurally conserved, hydrophobic sequence which possibly interacted with the lipid bilayer. PMID:3754588
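
    The percentages quoted above (67% at the nucleotide level, 76% at the amino acid level) are the kind of figure obtained by counting matching columns in a pairwise alignment. The following is a minimal sketch, assuming that "homologous" in the abstract means identical aligned positions and that gap columns are skipped; the function name and the toy sequences are hypothetical and are not taken from the MV or CDV genes.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of ungapped alignment columns in which both sequences agree.

    Assumption: both inputs come from the same pairwise alignment, so they
    have equal length and use `gap` to mark insertions/deletions.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # skip columns that contain a gap in either sequence
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy example (hypothetical sequences): 3 of 4 columns match.
print(percent_identity("ACGT", "ACGA"))
```

    Whether gap columns count as mismatches is a design choice; this sketch ignores them, which tends to give a higher value than counting them against the score.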

  10. Structural and biochemical analyses reveal how ornithine acetyl transferase binds acidic and basic amino acid substrates.

    PubMed

    Iqbal, Aman; Clifton, Ian J; Chowdhury, Rasheduzzaman; Ivison, David; Domene, Carmen; Schofield, Christopher J

    2011-09-21

    Structural and biochemical analyses reveal how ornithine acetyl-transferases catalyse the reversible transfer of an acetyl-group from a basic (ornithine) to an acidic (glutamate) amino acid by employing a common mechanism involving an acetyl-enzyme intermediate but using different side chain binding modes. PMID:21796301

  11. Effects of fixed versus random condition sequencing during multielement functional analyses.

    PubMed

    Hammond, Jennifer L; Iwata, Brian A; Rooker, Griffin W; Fritz, Jennifer N; Bloom, Sarah E

    2013-01-01

    It has been suggested that a fixed condition sequence might facilitate differential responding during multielement functional analyses (FAs) by capitalizing on or limiting sequence effects (Iwata, Pace, et al., 1994); however, the effects of condition sequence have not been examined empirically. We conducted fixed- and random-sequence FAs for 7 individuals with developmental disabilities to determine the relative effects that sequence may have on assessment outcomes. Experimental conditions during the fixed sequence were conducted in the following order: ignore, attention, play, and demand; condition order during the random sequence was determined randomly. Results showed that sequence had no influence on the FA outcomes for 3 subjects, whereas differential responding emerged either faster (1 subject) or only (3 subjects) under the fixed sequence for the remaining subjects. These results suggest that the fixed sequence, a simple modification, should be used when conducting multielement FAs to accommodate the influence of establishing operations across assessment conditions. PMID:24114082

  12. Phosphatidylinositol transfer proteins: sequence motifs in structural and evolutionary analyses

    PubMed Central

    Wyckoff, Gerald J.; Solidar, Ada; Yoden, Marilyn D.

    2016-01-01

    Phosphatidylinositol transfer proteins (PITP) are a family of monomeric proteins that bind and transfer phosphatidylinositol and phosphatidylcholine between membrane compartments. They are required for production of inositol and diacylglycerol second messengers, and are found in most metazoan organisms. While PITPs are known to carry out crucial cell-signaling roles in many organisms, the structure, function and evolution of the majority of family members remain unexplored, primarily because the ubiquity and diversity of the family thwarts traditional methods of global alignment. To surmount this obstacle, we instead took a novel approach, using MEME and a parsimony-based analysis to create a cladogram of conserved sequence motifs in 56 PITP family proteins from 26 species. In keeping with previous functional annotations, three clades were supported within our evolutionary analysis: two classes of soluble proteins and a class of membrane-associated proteins. By focusing on conserved regions, the analysis allowed for in-depth queries regarding possible functional roles of PITP proteins in both intra- and extracellular signaling.

  13. Multimodal phylogeny for taxonomy: integrating information from nucleotide and amino acid sequences.

    PubMed

    Bicego, Manuele; Dellaglio, Franco; Felis, Giovanna E

    2007-10-01

    The crucial role played by the analysis of microbial diversity in biotechnology-based innovations has increased the interest in the microbial taxonomy research area. Phylogenetic sequence analyses have contributed significantly to the advances in this field, particularly in view of the large amount of sequence data collected in recent years. Phylogenetic analyses can be based on protein-encoding nucleotide sequences or on the encoded amino acid sequences: the two approaches have different characteristics, although they start from two alternative representations of the same information. This complementarity could be exploited to achieve a multimodal phylogenetic scheme that is able to integrate gene and protein information in order to produce a single final tree. This aspect has been poorly addressed in the literature. In this paper, we propose to integrate the two phylogenetic analyses using basic schemes derived from the multimodality fusion theory (or multiclassifier systems theory), a well-founded and rigorous branch whose power has already been demonstrated in other pattern recognition contexts. The proposed approach could be applied to distance matrix-based phylogenetic techniques (like neighbor joining), resulting in a smart and fast method. The proposed methodology has been tested in a real case involving sequences of some species of lactic acid bacteria. With this dataset, both nucleotide sequence- and amino acid sequence-based phylogenetic analyses present some drawbacks, which are overcome with the multimodal analysis. PMID:17933011
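
    One way to picture the fusion of a nucleotide-based and an amino acid-based distance matrix is a weighted average taken after each matrix is rescaled to a common range. The sketch below assumes that rule (a simple "sum rule" from the multiclassifier literature); the abstract does not state which fusion schemes the authors actually used, and the function names, the weight parameter and the toy matrices are hypothetical.

```python
def normalize(matrix):
    """Rescale a distance matrix so that its largest entry equals 1."""
    largest = max(max(row) for row in matrix)
    if largest == 0:
        return [row[:] for row in matrix]  # all-zero matrix: nothing to scale
    return [[d / largest for d in row] for row in matrix]


def fuse_distances(nucleotide_dist, protein_dist, weight=0.5):
    """Weighted average of two normalized distance matrices.

    Assumption: both matrices are square, list the same taxa in the same
    order, and `weight` is the share given to the nucleotide matrix.
    """
    if len(nucleotide_dist) != len(protein_dist):
        raise ValueError("matrices must describe the same set of taxa")
    a = normalize(nucleotide_dist)
    b = normalize(protein_dist)
    n = len(a)
    return [
        [weight * a[i][j] + (1.0 - weight) * b[i][j] for j in range(n)]
        for i in range(n)
    ]


# Toy example with three taxa (hypothetical distances).
nt = [[0, 2, 4], [2, 0, 4], [4, 4, 0]]
aa = [[0, 1, 1], [1, 0, 2], [1, 2, 0]]
fused = fuse_distances(nt, aa)
print(fused[0][2])
```

    The fused matrix can then be handed to any distance-based tree builder such as neighbor joining, which is the setting the abstract mentions.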

  14. From Artificial Amino Acids to Sequence-Defined Targeted Oligoaminoamides.

    PubMed

    Morys, Stephan; Wagner, Ernst; Lächelt, Ulrich

    2016-01-01

    Artificial oligoamino acids with appropriate protecting groups can be used for the sequential assembly of oligoaminoamides on solid-phase. With the help of these oligoamino acids multifunctional nucleic acid (NA) carriers can be designed and produced in highly defined topologies. Here we describe the synthesis of the artificial oligoamino acid Fmoc-Stp(Boc3)-OH, the subsequent assembly into sequence-defined oligomers and the formulation of tumor-targeted plasmid DNA (pDNA) polyplexes. PMID:27436323

  15. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Foley, B.; Korber, B.; Mellors, J.W.; Jeang, K.T.; Wain-Hobson, S.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nucleic Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  16. Analyses of the Sequence and Structural Properties Corresponding to Pentapeptide and Large Palindromes in Proteins.

    PubMed

    Sridhar, Settu; Nagamruta, Mallapragada; Guruprasad, Kunchur

    2015-01-01

    The analyses of 3967 representative proteins selected from the Protein Data Bank revealed the presence of 2803 pentapeptide and large palindrome sequences with known secondary structure conformation. These represent 2014 unique palindrome sequences. 60% of the palindromes are not associated with any regular secondary structure, 28% are in helix conformation, 11% in strand conformation and 1% in the coil conformation. The average solvent accessibility values range from 0 to 155.28 Å², suggesting that the palindromes in proteins can be either buried, exposed to the solvent or share an intermittent property. The number of residue neighborhood contacts defined by interactions ≤ 3.2 Å ranges from 0 to 29 residues. Palindromes of the same length in helix, strand and coil conformation are associated with different amino acid residue preferences at the individual positions. Nearly 20% of the palindromes interact with catalytic/active site residues, ligand or metal ions in proteins and may therefore be important for function in the corresponding protein. The average hydrophobicity values for the pentapeptide and large palindromes range from -4.3 to +4.32 and the number of palindromes is almost equally distributed between the negative and positive hydrophobicity values. The palindromes represent 107 different protein families and the hydrolases, transferases, oxidoreductases and lyases contain a relatively large number of palindromes. PMID:26465610
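
    A protein palindrome of the kind counted above can be located with a simple expand-around-center scan. This is a minimal sketch, assuming that a palindrome is a stretch of one-letter residue codes that reads the same in both directions and that "pentapeptide and large" means a length of five or more; the function name and the example sequence are hypothetical, and the abstract does not describe the authors' own search procedure.

```python
def find_palindromes(sequence: str, min_length: int = 5):
    """Return (start, palindrome) pairs for palindromic stretches.

    Each center (a residue or the boundary between two residues) is expanded
    as far as the flanking residues keep matching, so every reported hit is
    the longest palindrome around its center. Positions are 0-based.
    """
    seq = sequence.upper()
    n = len(seq)
    hits = []
    for center in range(2 * n - 1):
        left = center // 2
        right = left + center % 2  # same index for odd length, next for even
        while left >= 0 and right < n and seq[left] == seq[right]:
            left -= 1
            right += 1
        start, end = left + 1, right  # half-open interval [start, end)
        if end - start >= min_length:
            hits.append((start, seq[start:end]))
    return hits


# Hypothetical sequence containing the heptapeptide palindrome KAVLVAK.
print(find_palindromes("MKAVLVAKG"))
```

    Shorter palindromes nested inside a reported hit (here AVLVA inside KAVLVAK) share its center and are not listed separately.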

  17. Analyses of the Sequence and Structural Properties Corresponding to Pentapeptide and Large Palindromes in Proteins

    PubMed Central

    Sridhar, Settu; Nagamruta, Mallapragada; Guruprasad, Kunchur

    2015-01-01

    The analyses of 3967 representative proteins selected from the Protein Data Bank revealed the presence of 2803 pentapeptide and large palindrome sequences with known secondary structure conformation. These represent 2014 unique palindrome sequences. 60% of the palindromes are not associated with any regular secondary structure, 28% are in helix conformation, 11% in strand conformation and 1% in the coil conformation. The average solvent accessibility values range from 0 to 155.28 Å², suggesting that the palindromes in proteins can be either buried, exposed to the solvent or share an intermittent property. The number of residue neighborhood contacts defined by interactions ≤ 3.2 Å ranges from 0 to 29 residues. Palindromes of the same length in helix, strand and coil conformation are associated with different amino acid residue preferences at the individual positions. Nearly 20% of the palindromes interact with catalytic/active site residues, ligand or metal ions in proteins and may therefore be important for function in the corresponding protein. The average hydrophobicity values for the pentapeptide and large palindromes range from -4.3 to +4.32 and the number of palindromes is almost equally distributed between the negative and positive hydrophobicity values. The palindromes represent 107 different protein families and the hydrolases, transferases, oxidoreductases and lyases contain a relatively large number of palindromes. PMID:26465610

  18. Detecting frame shifts by amino acid sequence comparison.

    PubMed

    Claverie, J M

    1993-12-20

    Various amino acid substitution scoring matrices are used in conjunction with local alignments programs to detect regions of similarity and infer potential common ancestry between proteins. The usual scoring schemes derive from the implicit hypothesis that related proteins evolve from a common ancestor by the accumulation of point mutations and that amino acids tend to be progressively substituted by others with similar properties. However, other frequent single mutation events, like nucleotide insertion or deletion and gene inversion, change the translation reading frame and cause previously encoded amino acid sequences to become unrecognizable at once. Here, I derive five new types of scoring matrix, each capable of detecting a specific frame shift (deletion, insertion and inversion in 3 frames) and use them with a regular local alignments program to detect amino acid sequences that may have derived from alternative reading frames of the same nucleotide sequence. Frame shifts are inferred from the sole comparison of the protein sequences. The five scoring matrices were used with the BLASTP program to compare all the protein sequences in the Swissprot database. Surprisingly, the searches revealed hundreds of highly significant frame shift matches, of which many are likely to represent sequencing errors. Others provide some evidence that frame shift mutations might be used in protein evolution as a way to create new amino acid sequences from pre-existing coding regions. PMID:7903399
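
    The reason a frame shift makes an encoded protein "unrecognizable at once" can be seen by translating one nucleotide sequence in its three forward reading frames. The sketch below only illustrates that effect with the standard genetic code; it is not the scoring-matrix method derived in the paper, and the function name and the example sequence are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, frame: int = 0) -> str:
    """Translate a DNA string starting at offset `frame` (0, 1 or 2).

    Trailing bases that do not fill a codon are ignored; stop codons are
    written as '*'. Assumes the input contains only A, C, G and T.
    """
    if frame not in (0, 1, 2):
        raise ValueError("frame must be 0, 1 or 2")
    seq = dna.upper()
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# The same 12 nucleotides give three unrelated peptides.
for frame in range(3):
    print(frame, translate("ATGGCCAAAGGG", frame))
```

    A single-base insertion or deletion moves all downstream codons from one of these frames to another, which is why conventional substitution matrices no longer detect the relationship.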

  19. Life in hot acid: pathway analyses in extremely thermoacidophilic archaea.

    PubMed

    Auernik, Kathryne S; Cooper, Charlotte R; Kelly, Robert M

    2008-10-01

    The extremely thermoacidophilic archaea are a particularly intriguing group of microorganisms that must simultaneously cope with biologically extreme pHs (≤ 4) and temperatures (Topt ≥ 60 °C) in their natural environments. Their expanding biotechnological significance relates to their role in biomining of base and precious metals and their unique mechanisms of survival in hot acid, at both the cellular and biomolecular levels. Recent developments, such as advances in understanding of heavy metal tolerance mechanisms, implementation of a genetic system, and discovery of a new carbon fixation pathway, have been facilitated by the availability of genome sequence data and molecular genetic systems. As a result, new insights into the metabolic pathways and physiological features that define extreme thermoacidophily have been obtained, in some cases suggesting prospects for biotechnological opportunities. PMID:18760359

  20. Segments of amino acid sequence similarity in beta-amylases.

    PubMed

    Friedberg, F; Rhodes, C

    1988-01-01

    In alpha-amylases from animals, plants and bacteria and in beta-amylases from plants and bacteria a number of segments exhibit amino acid sequence similarity specific to the alpha or to the beta type, respectively. In the case of the beta-amylases the similar sequence regions are extensive and they are disrupted only by short interspersed dissimilar regions. Close to the C terminus, however, no such sequence similarity exists. PMID:2464171

  1. Deciphering Clostridium tyrobutyricum Metabolism Based on the Whole-Genome Sequence and Proteome Analyses

    PubMed Central

    Lee, Joungmin; Jang, Yu-Sin; Han, Mee-Jung; Kim, Jin Young

    2016-01-01

    ABSTRACT Clostridium tyrobutyricum is a Gram-positive anaerobic bacterium that efficiently produces butyric acid and is considered a promising host for anaerobic production of bulk chemicals. Due to limited knowledge on the genetic and metabolic characteristics of this strain, however, little progress has been made in metabolic engineering of this strain. Here we report the complete genome sequence of C. tyrobutyricum KCTC 5387 (ATCC 25755), which consists of a 3.07-Mbp chromosome and a 63-kbp plasmid. The results of genomic analyses suggested that C. tyrobutyricum produces butyrate from butyryl-coenzyme A (butyryl-CoA) through acetate reassimilation by CoA transferase, differently from Clostridium acetobutylicum, which uses the phosphotransbutyrylase-butyrate kinase pathway; this was validated by reverse transcription-PCR (RT-PCR) of related genes, protein expression levels, in vitro CoA transferase assay, and fed-batch fermentation. In addition, the changes in protein expression levels during the course of batch fermentations on glucose were examined by shotgun proteomics. Unlike C. acetobutylicum, the expression levels of proteins involved in glycolytic and fermentative pathways in C. tyrobutyricum did not decrease even at the stationary phase. Proteins related to energy conservation mechanisms, including Rnf complex, NfnAB, and pyruvate-phosphate dikinase that are absent in C. acetobutylicum, were identified. Such features explain why this organism can produce butyric acid to a much higher titer and better tolerate toxic metabolites. This study presenting the complete genome sequence, global protein expression profiles, and genome-based metabolic characteristics during the batch fermentation of C. tyrobutyricum will be valuable in designing strategies for metabolic engineering of this strain. PMID:27302759

  2. Use of gene sequence analyses and genome comparisons for yeast systematics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Detection, identification, and classification of yeasts has undergone a major transformation in the past decade and a half following application of gene sequence analyses and genome comparisons. Development of a database (barcode) of easily determined gene sequences from domains 1 and 2 of large sub...

  3. Identification of food and beverage spoilage yeasts from DNA sequence analyses

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Detection, identification, and classification of yeasts has undergone a major transformation in the last decade and a half following application of gene sequence analyses and genome comparisons. Development of a database (barcode) of easily determined DNA sequences from domains 1 and 2 (D1/D2) of th...

  4. Characterization of bud emergence 46 (BEM46) protein: sequence, structural, phylogenetic and subcellular localization analyses.

    PubMed

    Kumar, Abhishek; Kollath-Leiß, Krisztina; Kempken, Frank

    2013-08-30

    The bud emergence 46 (BEM46) protein from Neurospora crassa belongs to the α/β-hydrolase superfamily. Recently, we have reported that the BEM46 protein is localized in the perinuclear ER and also forms spots close by the plasma membrane. The protein appears to be required for cell type-specific polarity formation in N. crassa. Furthermore, initial studies suggested that the BEM46 amino acid sequence is conserved in eukaryotes and is considered to be one of the widespread conserved "known unknown" eukaryotic genes. This warrants a comprehensive phylogenetic analysis of this superfamily to unravel the origin and molecular evolution of these genes in different eukaryotes. Herein, we observe that all eukaryotes have at least a single copy of a bem46 ortholog. Upon scanning of these proteins in various genomes, we find that there are expansions leading into several paralogs in vertebrates. Using comparative genomic analyses, we identified insertion/deletions (indels) in the conserved domain of BEM46 protein, which allow differentiation of fungal classes such as ascomycetes from basidiomycetes. We also find that exonic indels are able to differentiate BEM46 homologs of different eukaryotic lineages. Furthermore, using GFP-fusion tagging experiments, we show that BEM46 protein from N. crassa possesses a novel endoplasmic-retention signal (PEKK). We propose that three residues, namely a serine 188S, a histidine 292H and an aspartic acid 262D, are the most critical residues, forming a catalytic triad in BEM46 protein from N. crassa. We carried out a comprehensive study on bem46 genes from a molecular evolution perspective in combination with functional analyses. The evolutionary history of BEM46 proteins is characterized by exonic indels in a lineage-specific manner. PMID:23916612

  5. Characterization of mouse cellular deoxyribonucleic acid homologous to Abelson murine leukemia virus-specific sequences.

    PubMed Central

    Dale, B; Ozanne, B

    1981-01-01

    The genome of Abelson murine leukemia virus (A-MuLV) consists of sequences derived from both BALB/c mouse deoxyribonucleic acid and the genome of Moloney murine leukemia virus. Using deoxyribonucleic acid linear intermediates as a source of retroviral deoxyribonucleic acid, we isolated a recombinant plasmid which contained 1.9 kilobases of the 3.5-kilobase mouse-derived sequences found in A-MuLV (A-MuLV-specific sequences). We used this clone, designated pSA-17, as a probe in restriction enzyme and Southern blot analyses to examine the arrangement of homologous sequences in BALB/c deoxyribonucleic acid (endogenous Abelson sequences). The endogenous Abelson sequences within the mouse genome were interrupted by noncoding regions, suggesting that a rearrangement of the cell sequences was required to produce the sequence found in the virus. Endogenous Abelson sequences were arranged similarly in mice that were susceptible to A-MuLV tumors and in mice that were resistant to A-MuLV tumors. An examination of three BALB/c plasmacytomas and a BALB/c early B-cell tumor likewise revealed no alteration in the arrangement of the endogenous Abelson sequences. Homology to pSA-17 was also observed in deoxyribonucleic acids prepared from rat, hamster, chicken, and human cells. An isolate of A-MuLV which encoded a 160,000-dalton transforming protein (P160) contained 700 more base pairs of mouse sequences than the standard A-MuLV isolate, which encoded a 120,000-dalton transforming protein (P120). PMID:9279386

  6. Quantitative Estimates of Sequence Divergence for Comparative Analyses of Mammalian Genomes

    PubMed Central

    Cooper, Gregory M.; Brudno, Michael; Program, NISC Comparative Sequencing; Green, Eric D.; Batzoglou, Serafim; Sidow, Arend

    2003-01-01

    Comparative sequence analyses on a collection of carefully chosen mammalian genomes could facilitate identification of functional elements within the human genome and allow quantification of evolutionary constraint at the single nucleotide level. High-resolution quantification would be informative for determining the distribution of important positions within functional elements and for evaluating the relative importance of nucleotide sites that carry single nucleotide polymorphisms (SNPs). Because the level of resolution in comparative sequence analyses is a direct function of sequence diversity, we propose that the information content of a candidate mammalian genome be defined as the sequence divergence it would add relative to already-sequenced genomes. We show that reliable estimates of genomic sequence divergence can be obtained from small genomic regions. On the basis of a multiple sequence alignment of ∼1.4 megabases each from eight mammals, we generate such estimates for five unsequenced mammals. Estimates of the neutral divergence in these data suggest that a small number of diverse mammalian genomes in addition to human, mouse, and rat would allow single nucleotide resolution in comparative sequence analyses. [The multiple sequence alignment of the CFTR region and a spreadsheet with the calculations performed, will be available as supplementary information online at www.genome.org.] PMID:12727901
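
    Sequence divergence between two aligned genomic regions is commonly expressed as substitutions per site. The sketch below assumes the Jukes-Cantor correction, d = -(3/4) ln(1 - 4p/3), applied to the observed fraction p of differing sites; the abstract does not say which substitution model the authors used, so this is an illustration of the idea rather than their procedure, and the function names and toy sequences are hypothetical.

```python
import math


def p_distance(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Observed fraction of differing sites among ungapped alignment columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != gap and y != gap
    ]
    if not pairs:
        raise ValueError("no ungapped columns to compare")
    return sum(x != y for x, y in pairs) / len(pairs)


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor estimate of substitutions per site from a p-distance.

    The correction is undefined once p reaches 0.75, the value expected for
    two unrelated sequences with equal base frequencies.
    """
    if not 0.0 <= p < 0.75:
        raise ValueError("p must satisfy 0 <= p < 0.75")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


# Toy alignment (hypothetical): 1 difference in 10 sites.
p = p_distance("ACGTACGTAC", "ACGTACGTAA")
print(p, jukes_cantor(p))
```

    The corrected value is always at least as large as p because it accounts for multiple substitutions at the same site.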

  7. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... acids are not intended to be embraced by this definition. Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated... sequence of four or more amino acids or an unbranched sequence of ten or more nucleotides....

  8. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... acids are not intended to be embraced by this definition. Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated... sequence of four or more amino acids or an unbranched sequence of ten or more nucleotides....

  9. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... acids are not intended to be embraced by this definition. Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated... sequence of four or more amino acids or an unbranched sequence of ten or more nucleotides....

  10. Characterization of bud emergence 46 (BEM46) protein: Sequence, structural, phylogenetic and subcellular localization analyses

    SciTech Connect

    Kumar, Abhishek; Kollath-Leiß, Krisztina; Kempken, Frank

    2013-08-30

    Highlights: •All eukaryotes have at least a single copy of a bem46 ortholog. •The catalytic triad of BEM46 is illustrated using sequence and structural analysis. •We identified indels in the conserved domain of BEM46 protein. •Localization studies of BEM46 protein were carried out using GFP-fusion tagging. -- Abstract: The bud emergence 46 (BEM46) protein from Neurospora crassa belongs to the α/β-hydrolase superfamily. Recently, we have reported that the BEM46 protein is localized in the perinuclear ER and also forms spots close by the plasma membrane. The protein appears to be required for cell type-specific polarity formation in N. crassa. Furthermore, initial studies suggested that the BEM46 amino acid sequence is conserved in eukaryotes and is considered to be one of the widespread conserved “known unknown” eukaryotic genes. This warrants a comprehensive phylogenetic analysis of this superfamily to unravel the origin and molecular evolution of these genes in different eukaryotes. Herein, we observe that all eukaryotes have at least a single copy of a bem46 ortholog. Upon scanning of these proteins in various genomes, we find that there are expansions leading into several paralogs in vertebrates. Using comparative genomic analyses, we identified insertion/deletions (indels) in the conserved domain of BEM46 protein, which allow differentiation of fungal classes such as ascomycetes from basidiomycetes. We also find that exonic indels are able to differentiate BEM46 homologs of different eukaryotic lineages. Furthermore, using GFP-fusion tagging experiments, we show that BEM46 protein from N. crassa possesses a novel endoplasmic-retention signal (PEKK). We propose that three residues, namely a serine 188S, a histidine 292H and an aspartic acid 262D, are the most critical residues, forming a catalytic triad in BEM46 protein from N. crassa. We carried out a comprehensive study on bem46 genes from a molecular evolution perspective with combination of functional

  11. A method to find palindromes in nucleic acid sequences.

    PubMed

    Anjana, Ramnath; Shankar, Mani; Vaishnavi, Marthandan Kirti; Sekar, Kanagaraj

    2013-01-01

    Various types of sequences in the human genome are known to play important roles in different aspects of genomic functioning. Among these sequences, palindromic nucleic acid sequences are one such type that have been studied in detail and found to influence a wide variety of genomic characteristics. For a nucleotide sequence to be considered as a palindrome, its complementary strand must read the same in the opposite direction. For example, both strands, i.e., the strand going from 5' to 3' and its complementary strand from 3' to 5', must be complementary. A typical nucleotide palindromic sequence would be TATA (5' to 3') and its complementary sequence from 3' to 5' would be ATAT. Thus, a new method has been developed using dynamic programming to fetch the palindromic nucleic acid sequences. The new method uses less memory and thereby it increases the overall speed and efficiency. The proposed method has been tested using the bacterial (3891 KB bases) and human chromosomal sequences (Chr-18: 74366 kb and Chr-Y: 25554 kb) and the computation time for finding the palindromic sequences is in milliseconds. PMID:23515654
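
    The definition given above, that a palindromic sequence equals its own reverse complement (as with TATA), can be checked directly. This is a minimal sketch using a plain window scan, not the dynamic programming method the authors describe; the function names, the length limits and the example sequence are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse, giving the other strand 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_palindrome(seq: str) -> bool:
    """True if the sequence reads the same on both strands (5' to 3')."""
    s = seq.upper()
    return len(s) > 0 and s == reverse_complement(s)


def find_dna_palindromes(seq: str, min_length: int = 4, max_length: int = 12):
    """Return (start, palindrome) pairs for every palindromic window.

    Only even lengths are scanned: no base is its own complement, so a
    reverse-complement palindrome cannot have a middle base. Positions are
    0-based. Assumes the input contains only A, C, G and T.
    """
    s = seq.upper()
    hits = []
    first = min_length + (min_length % 2)  # round up to an even length
    for length in range(first, max_length + 1, 2):
        for start in range(len(s) - length + 1):
            window = s[start:start + length]
            if window == reverse_complement(window):
                hits.append((start, window))
    return hits


print(is_palindrome("TATA"))
print(find_dna_palindromes("CCGAATTCGG", min_length=6, max_length=6))
```

    This scan costs time proportional to sequence length times the number of window lengths tried, which is acceptable for short inputs but is not claimed to match the memory or speed figures reported in the abstract.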

  12. Metagenomic analyses reveal phylogenetic diversity of carboxypeptidase gene sequences in activated sludge of a wastewater treatment plant in Shanghai, China.

    PubMed

    Jin, Hao; Li, Bailin; Peng, Xu; Chen, Lanming

    2014-01-01

    Activated sludge of wastewater treatment plants carries a diverse microflora. However, up to 80-90 % of microorganisms in activated sludge cannot be cultured by current laboratory techniques, leaving an enzyme reservoir largely unexplored. In this study, we investigated carboxypeptidase diversity in activated sludge of a wastewater treatment plant in Shanghai, China, by a culture-independent metagenomic approach. Three sets of consensus degenerate hybrid oligonucleotide primers (CODEHOPs) targeting conserved domains of public carboxypeptidases were designed to amplify carboxypeptidase gene sequences in the metagenomic DNA of activated sludge by PCR. The desired amplicons were evaluated by carboxypeptidase sequence clone libraries and phylogenetic analyses. We uncovered a significant diversity of carboxypeptidases present in the activated sludge. Deduced carboxypeptidase amino acid sequences (127-208 amino acids) were classified into three distinct clusters, α, β, and γ. Sequences belonging to clusters α and β shared 58-97 % identity to known carboxypeptidase sequences from diverse species, whereas sequences in cluster γ were remarkably less related to public carboxypeptidase homologs in the GenBank database, strongly suggesting that novel carboxypeptidase families or microbial niches exist in the activated sludge. We also observed numerous carboxypeptidase sequences that were much closer to those from representative strains present in industrial and sewage treatment and bioremediation. Thermostable and halotolerant carboxypeptidase sequences were also detected in clusters α and β. Coexistence of various carboxypeptidases is evidence of a diverse microflora in the activated sludge, a feature suggesting a valuable gene resource to be further explored for biotechnology application. PMID:24860282

  13. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

    PubMed

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D; Adir, Noam

    2016-06-28

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel. PMID:27307442
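
    The comparison of observed and expected triplet frequencies described above can be sketched as follows. This is a minimal illustration and not the authors' pipeline: the abstract does not say how expected frequencies were computed, so the sketch assumes a simple independence model (expected count = number of triplet windows times the product of the three residue frequencies). The function name and the toy input are hypothetical.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def triplet_observed_expected(proteins):
    """Return {triplet: (observed, expected)} for a list of protein strings.

    Assumption: the expected count comes from an independence model, i.e.
    (number of triplet windows) * f(a) * f(b) * f(c), where f is the overall
    residue frequency. The source abstract does not specify its null model.
    """
    residue_counts = Counter()
    triplet_counts = Counter()
    n_windows = 0
    for seq in proteins:
        residue_counts.update(seq)
        # Slide a window of width 3 along each protein.
        for i in range(len(seq) - 2):
            triplet_counts[seq[i:i + 3]] += 1
            n_windows += 1

    total = sum(residue_counts[aa] for aa in AMINO_ACIDS)
    if total == 0:
        raise ValueError("no standard amino acids found in the input")
    freqs = {aa: residue_counts[aa] / total for aa in AMINO_ACIDS}

    result = {}
    for a, b, c in product(AMINO_ACIDS, repeat=3):  # all 8000 triplets
        expected = n_windows * freqs[a] * freqs[b] * freqs[c]
        result[a + b + c] = (triplet_counts[a + b + c], expected)
    return result


if __name__ == "__main__":
    table = triplet_observed_expected(["MKKLA", "AKKLM"])  # toy input
    print(table["KKL"])
```

    Triplets whose observed count falls far below the expected count would be candidates for underrepresentation; a real analysis would also need a significance test, which this sketch omits.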

  14. Amino acid sequences around the cysteine residues of rabbit muscle triose phosphate isomerase

    PubMed Central

    Miller, Janet C.; Waley, S. G.

    1971-01-01

    1. The nature of the subunits in rabbit muscle triose phosphate isomerase has been investigated. 2. Amino acid analyses show that there are five cysteine residues and two methionine residues/subunit. 3. The amino acid sequences around the cysteine residues have been determined; these account for about 75 residues. 4. Cleavage at the methionine residues with cyanogen bromide gave three fragments. 5. These results show that the subunits correspond to polypeptide chains, containing about 230 amino acid residues. The chains in triose phosphate isomerase seem to be shorter than those of other glycolytic enzymes. PMID:5165707

  15. Correlation between Serological and Sequencing Analyses of the PorB Outer Membrane Protein in the Neisseria meningitidis Serotyping System

    PubMed Central

    Sacchi, Claudio T.; Lemos, Ana P. S.; Whitney, Anne M.; Solari, Claude A.; Brandt, Mary E.; Melles, Carmo E. A.; Frasch, Carl E.; Mayer, Leonard W.

    1998-01-01

    The current serological typing scheme for Neisseria meningitidis is not comprehensive; a proportion of isolates are not serotypeable. DNA sequence analysis and predicted amino acid sequences were used to characterize the structures of variable-region (VR) epitopes on N. meningitidis PorB proteins (PorB VR typing). Twenty-six porB gene sequences were obtained from GenBank and aligned with 41 new sequences. Primary amino acid structures predicted from those genes were grouped into 30 VR families of related variants that displayed at least 60% similarity. We correlated VR families with monoclonal antibody (MAb) reactivities, establishing a relationship between VR families and epitope locations for 15 serotype-defining MAbs. The current panel of serotype-defining MAbs underestimates by at least 50% the PorB VR variability because reagents for several major VR families are lacking or because a number of VR variants within some families are not recognized by serotype-defining MAbs. These difficulties, also reported for serosubtyping based on the PorA protein, are shown as inconsistent results between serological and sequence analyses, leading to inaccurate strain identification and incomplete epidemiological data. The information from this study enabled the expansion of the panel of MAbs currently available for serotyping, by including MAbs of previously undetermined specificities. Use of the expanded serotype panel enabled us to improve the sensitivity of serotyping by resolving a number of formerly nonserotypeable strains. In most cases, this information can be used to predict the VR family placement of unknown PorB proteins without sequencing the entire porB gene. PorB VR typing complements serotyping, and a combination of both techniques may be used for full characterization of meningococcal strains. The present work represents the most complete and integrated data set of PorB VR sequences and MAb reactivities of serogroup B and C meningococci produced to date. 

  16. On Quantum Algorithm for Multiple Alignment of Amino Acid Sequences

    NASA Astrophysics Data System (ADS)

    Iriyama, Satoshi; Ohya, Masanori

    2009-02-01

    The alignment of genome sequences or amino acid sequences is one of the fundamental operations for the study of life. The usual computational complexity of the multiple alignment of N sequences with common length L by dynamic programming is O(L^N). This alignment is considered to be one of the NP problems, so it is desirable to find an efficient algorithm for multiple alignment. In this paper we therefore propose a quantum algorithm for multiple alignment based on the works [12, 1, 2], in which an NP-complete problem was shown to be a P problem by means of a quantum algorithm and chaos information dynamics.
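
    To make the cost of the classical dynamic-programming approach concrete, the sketch below fills the score table for the pairwise case (N = 2), which has (L+1)^2 cells. With N sequences the same recurrence needs a table of (L+1)^N cells, which is where the exponential growth comes from. This is the standard Needleman-Wunsch recurrence, not the quantum algorithm the paper proposes; the scoring values (match +1, mismatch -1, gap -1) are illustrative assumptions.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Optimal global alignment score of two sequences (Needleman-Wunsch).

    The table has (len(a) + 1) * (len(b) + 1) cells, so time and memory
    are O(L^2) for two sequences of length L.
    """
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]

    # Aligning a prefix against the empty string costs one gap per residue.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    for i in range(1, rows):
        for j in range(1, cols):
            substitution = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + substitution,  # align a[i-1] with b[j-1]
                score[i - 1][j] + gap,               # gap in b
                score[i][j - 1] + gap,               # gap in a
            )
    return score[-1][-1]


if __name__ == "__main__":
    print(global_alignment_score("GATTACA", "GATTACA"))
    print(global_alignment_score("ACGT", "AGT"))
```

    In the N-sequence generalisation each cell has up to 2^N - 1 predecessor cells instead of three, so exact dynamic programming quickly becomes impractical as N grows.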

  17. Comparative Sequence Analyses of La Crosse Virus Strain Isolated from Patient with Fatal Encephalitis, Tennessee, USA

    PubMed Central

    Fryxell, Rebecca Trout; Freyman, Kimberly; Ulloa, Armando; Velez, Jason O.; Paulsen, Dave; Lanciotti, Robert S.; Moncayo, Abelardo

    2015-01-01

    We characterized a La Crosse virus (LACV) isolate from the brain of a child who died of encephalitis-associated complications in eastern Tennessee, USA, during summer 2012. We compared the isolate with LACV sequences from mosquitoes collected near the child’s home just after his postmortem diagnosis. In addition, we conducted phylogenetic analyses of these and other sequences derived from LACV strains representing varied temporal, geographic, and ecologic origins. Consistent with historical findings, results of these analyses indicate that a limited range of LACV lineage I genotypes is associated with severe clinical outcomes. PMID:25898269

  18. Prebiotically plausible mechanisms increase compositional diversity of nucleic acid sequences

    PubMed Central

    Derr, Julien; Manapat, Michael L.; Rajamani, Sudha; Leu, Kevin; Xulvi-Brunet, Ramon; Joseph, Isaac; Nowak, Martin A.; Chen, Irene A.

    2012-01-01

    During the origin of life, the biological information of nucleic acid polymers must have increased to encode functional molecules (the RNA world). Ribozymes tend to be compositionally unbiased, as is the vast majority of possible sequence space. However, ribonucleotides vary greatly in synthetic yield, reactivity and degradation rate, and their non-enzymatic polymerization results in compositionally biased sequences. While natural selection could lead to complex sequences, molecules with some activity are required to begin this process. Was the emergence of compositionally diverse sequences a matter of chance, or could prebiotically plausible reactions counter chemical biases to increase the probability of finding a ribozyme? Our in silico simulations using a two-letter alphabet show that template-directed ligation and high concatenation rates counter compositional bias and shift the pool toward longer sequences, permitting greater exploration of sequence space and stable folding. We verified experimentally that unbiased DNA sequences are more efficient templates for ligation, thus increasing the compositional diversity of the pool. Our work suggests that prebiotically plausible chemical mechanisms of nucleic acid polymerization and ligation could predispose toward a diverse pool of longer, potentially structured molecules. Such mechanisms could have set the stage for the appearance of functional activity very early in the emergence of life. PMID:22319215

  19. The amino-acid sequence of kangaroo pancreatic ribonuclease.

    PubMed

    Gaastra, W; Welling, G W; Beintema, J J

    1978-05-01

    Red kangaroo (Macropus rufus) ribonuclease was isolated from pancreatic tissue by affinity chromatography. The amino acid sequence was determined by automatic sequencing of overlapping large fragments and by analysis of shorter peptides obtained by digestion with a number of proteolytic enzymes. The polypeptide chain consists of 122 amino acid residues. Compared to other ribonucleases, the N-terminal residue and residue 114 are deleted. In other pancreatic ribonucleases position 114 is occupied by a cis proline residue in an external loop at the surface of the molecule. Other remarkable substitutions are the presence of a tyrosine residue at position 123 instead of a serine which forms a hydrogen bond with the pyrimidine ring of a nucleotide substrate, and a number of hydrophobic-hydrophilic interchanges in the sequence 51-55, which forms part of an alpha-helix in bovine ribonuclease and exhibits few substitutions in the placental mammals. Kangaroo ribonuclease contains no carbohydrate, although the enzyme possesses a recognition site for carbohydrate attachment in the sequence Asn-Val-Thr (62-64). The enzyme differs at about 35-40% of the positions from all other mammalian pancreatic ribonucleases sequenced to date, which is in agreement with the early divergence between the marsupials and the placental mammals. From fragmentary data a tentative sequence of red-necked wallaby (Macropus rufogriseus) pancreatic ribonuclease has been derived. Eight differences with the kangaroo sequence were found. PMID:658039

  20. Accumulated analyses of amino acid precursors in returned lunar samples

    NASA Technical Reports Server (NTRS)

    Fox, S. W.; Harada, K.; Hare, P. E.

    1973-01-01

    Six amino acids (glycine, alanine, aspartic acid, glutamic acid, serine, and threonine) obtained by hydrolysis of extracts have been quantitatively determined in ten collections of fines from five Apollo missions. Although the amounts found, 7-45 ng/g, are small, the lunar amino acid/carbon ratios are comparable to those of the carbonaceous chondrites, Murchison and Murray, as analyzed by the same procedures. Since both the ratios of amino acid to carbon, and the four or five most common types of proteinous amino acid found, are comparable for the two extraterrestrial sources despite different cosmophysical histories of the moon and meteorites, common cosmochemical processes are suggested.

  1. Amino acid analyses of R and CK chondrites

    NASA Astrophysics Data System (ADS)

    Burton, Aaron S.; McLain, Hannah; Glavin, Daniel P.; Elsila, Jamie E.; Davidson, Jemma; Miller, Kelly E.; Andronikov, Alexander V.; Lauretta, Dante; Dworkin, Jason P.

    2015-03-01

    Exogenous delivery of amino acids and other organic molecules to planetary surfaces may have played an important role in the origins of life on Earth and other solar system bodies. Previous studies have revealed the presence of indigenous amino acids in a wide range of carbon-rich meteorites, with the abundances and structural distributions differing significantly depending on parent body mineralogy and alteration conditions. Here we report on the amino acid abundances of seven type 3-6 CK chondrites and two Rumuruti (R) chondrites. Amino acid measurements were made on hot water extracts from these meteorites by ultrahigh-performance liquid chromatography with fluorescence detection and time-of-flight mass spectrometry. Of the nine meteorites analyzed, four were depleted in amino acids, and one had experienced significant amino acid contamination by terrestrial biology. The remaining four, comprised of two R and two CK chondrites, contained low levels of amino acids that were predominantly the straight chain, amino-terminal (n-ω-amino) acids β-alanine, and γ-amino-n-butyric acid. This amino acid distribution is similar to what we reported previously for thermally altered ureilites and CV and CO chondrites, and these n-ω-amino acids appear to be indigenous to the meteorites and not the result of terrestrial contamination. The amino acids may have been formed by Fischer-Tropsch-type reactions, although this hypothesis needs further testing.

  2. Gene sequence analyses and other DNA-based methods for yeast species recognition

    Technology Transfer Automated Retrieval System (TEKTRAN)

    DNA sequence analyses, as well as other DNA-based methodologies, have transformed the way in which yeasts are identified. The focus of this chapter will be on the resolution of species using various types of DNA comparisons. In other chapters in this book, Rozpedowska, Piškur and Wolfe discuss mul...

  3. Amino acid sequence of Salmonella typhimurium branched-chain amino acid aminotransferase.

    PubMed

    Feild, M J; Nguyen, D C; Armstrong, F B

    1989-06-13

    The complete amino acid sequence of the subunit of branched-chain amino acid aminotransferase (transaminase B, EC 2.6.1.42) of Salmonella typhimurium was determined. An Escherichia coli recombinant containing the ilvGEDAY gene cluster of Salmonella was used as the source of the hexameric enzyme. The peptide fragments used for sequencing were generated by treatment with trypsin, Staphylococcus aureus V8 protease, endoproteinase Lys-C, and cyanogen bromide. The enzyme subunit contains 308 residues and has a molecular weight of 33,920. To determine the coenzyme-binding site, the pyridoxal 5-phosphate containing enzyme was treated with tritiated sodium borohydride prior to trypsin digestion. Peptide map comparisons with an apoenzyme tryptic digest and monitoring radioactivity incorporation allowed identification of the pyridoxylated peptide, which was then isolated and sequenced. The coenzyme-binding site is the lysyl residue at position 159. The amino acid sequence of Salmonella transaminase B is 97.4% identical with that of Escherichia coli, differing in only eight amino acid positions. Sequence comparisons of transaminase B to other known aminotransferase sequences revealed limited sequence similarity (24-33%) when conserved amino acid substitutions are allowed and alignments were forced to occur on the coenzyme-binding site. PMID:2669973
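
    The reported identity can be checked with simple arithmetic: with 308 residues and 8 differing positions, (308 - 8) / 308 = 0.974, i.e. 97.4%. The sketch below computes percent identity for two pre-aligned sequences. It assumes an ungapped alignment of equal-length sequences, which the abstract does not state explicitly; the function name and the synthetic test sequences are hypothetical.

```python
def percent_identity(a, b):
    """Percent of positions that are identical in two pre-aligned sequences.

    Assumption: the sequences are already aligned, have equal length, and
    contain no gap characters.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        raise ValueError("sequences must not be empty")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


if __name__ == "__main__":
    # Synthetic stand-ins: 308 positions, 8 of which differ.
    reference = "A" * 308
    variant = "G" * 8 + "A" * 300
    print(round(percent_identity(reference, variant), 1))
```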

  4. GBSA: a comprehensive software for analysing whole genome bisulfite sequencing data

    PubMed Central

    Benoukraf, Touati; Wongphayak, Sarawut; Hadi, Luqman Hakim Abdul; Wu, Mengchu; Soong, Richie

    2013-01-01

    High-throughput sequencing is increasingly being used in combination with bisulfite (BS) assays to study DNA methylation at nucleotide resolution. Although several programmes provide genome-wide alignment of BS-treated reads, the resulting information is not readily interpretable and often requires further bioinformatic steps for meaningful analysis. Current post-alignment BS-sequencing programmes are generally focused on the gene-specific level, a restrictive feature when analysis in the non-coding regions, such as enhancers and intergenic microRNAs, is required. Here, we present Genome Bisulfite Sequencing Analyser (GBSA—http://ctrad-csi.nus.edu.sg/gbsa), a free open-source software capable of analysing whole-genome bisulfite sequencing data with either a gene-centric or gene-independent focus. Through analysis of the largest published data sets to date, we demonstrate GBSA’s features in providing sequencing quality assessment, methylation scoring, functional data management and visualization of genomic methylation at nucleotide resolution. Additionally, we show that GBSA’s output can be easily integrated with other high-throughput sequencing data, such as RNA-Seq or ChIP-seq, to elucidate the role of methylated intergenic regions in gene regulation. In essence, GBSA allows an investigator to explore not only known loci but also all the genomic regions, for which methylation studies could lead to the discovery of new regulatory mechanisms. PMID:23268441

  5. Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses

    PubMed Central

    Liu, Bo; Madduri, Ravi K; Sotomayor, Borja; Chard, Kyle; Lacinski, Lukasz; Dave, Utpal J; Li, Jianqiang; Liu, Chunchen; Foster, Ian T

    2014-01-01

    Due to the upcoming data deluge of genome data, the need for storing and processing large-scale genome data, easy access to biomedical analyses tools, efficient data sharing and retrieval has presented significant challenges. The variability in data volume results in variable computing and storage requirements, therefore biomedical researchers are pursuing more reliable, dynamic and convenient methods for conducting sequencing analyses. This paper proposes a Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses, which enables reliable and highly scalable execution of sequencing analyses workflows in a fully automated manner. Our platform extends the existing Galaxy workflow system by adding data management capabilities for transferring large quantities of data efficiently and reliably (via Globus Transfer), domain-specific analyses tools preconfigured for immediate use by researchers (via user-specific tools integration), automatic deployment on Cloud for on-demand resource allocation and pay-as-you-go pricing (via Globus Provision), a Cloud provisioning tool for auto-scaling (via HTCondor scheduler), and the support for validating the correctness of workflows (via semantic verification tools). Two bioinformatics workflow use cases as well as performance evaluation are presented to validate the feasibility of the proposed approach. PMID:24462600

  6. In silico comparative analysis of DNA and amino acid sequences for prion protein gene.

    PubMed

    Kim, Y; Lee, J; Lee, C

    2008-01-01

    Genetic variability might contribute to species specificity of prion diseases in various organisms. In this study, structures of the prion protein gene (PRNP) and its amino acids were compared among species for which sequence data were available. Comparisons of PRNP DNA sequences among 12 species including human, chimpanzee, monkey, bovine, ovine, dog, mouse, rat, wallaby, opossum, chicken and zebrafish allowed us to identify candidate regulatory regions in intron 1 and the 3'-untranslated region (UTR) in addition to the coding region. Highly conserved putative binding sites for transcription factors, such as heat shock factor 2 (HSF2) and myocyte enhancer factor 2 (MEF2), were discovered in intron 1. In the 3'-UTR, the functional sequence (ATTAAA) for nucleus-specific polyadenylation was found in all the analysed species. The functional sequence (TTTTTAT) for maturation-specific polyadenylation was observed in identical form only in ovine, with one or two nucleotide mismatches in the other species. A comparison of the amino acid sequences in 53 species revealed a large sequence identity. In particular, the octapeptide repeat region was observed in all the species except frog and zebrafish. Functional changes and susceptibility to prion diseases with various isoforms of prion protein could be caused by numeric variability and conformational changes discovered in the repeat sequences. PMID:18397498

  7. Amino acid sequence of bovine heart coupling factor 6.

    PubMed Central

    Fang, J K; Jacobs, J W; Kanner, B I; Racker, E; Bradshaw, R A

    1984-01-01

    The amino acid sequence of bovine heart mitochondrial coupling factor 6 (F6) has been determined by automated Edman degradation of the whole protein and derived peptides. Preparations based on heat precipitation and ethanol extraction showed allotypic variation at three positions while material further purified by HPLC yielded only one sequence that also differed by a Phe-Thr replacement at residue 62. The mature protein contains 76 amino acids with a calculated molecular weight of 9006 and a pI of approximately 5, in good agreement with experimentally measured values. The charged amino acids are mainly clustered at the termini and in one section in the middle; these three polar segments are separated by two segments relatively rich in nonpolar residues. Chou-Fasman analysis suggests three stretches of alpha-helix coinciding with (or within) the high-charge-density sequences, with a single beta-turn at the first polar-nonpolar junction. Comparison of the F6 sequence with those of other proteins did not reveal any homologous structures. PMID:6149548

  8. Sequences Of Amino Acids For Human Serum Albumin

    NASA Technical Reports Server (NTRS)

    Carter, Daniel C.

    1992-01-01

    Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.

  9. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  10. Amino acid sequence of the Amur tiger prion protein.

    PubMed

    Wu, Changde; Pang, Wanyong; Zhao, Deming

    2006-10-01

    Prion diseases are fatal neurodegenerative disorders in humans and animals associated with conformational conversion of a cellular prion protein (PrP(C)) into the pathologic isoform (PrP(Sc)). Various data indicate that the polymorphisms within the open reading frame (ORF) of PrP are associated with the susceptibility and control the species barrier in prion diseases. In the present study, partial Prnp from 25 Amur tigers (tPrnp) were cloned and screened for polymorphisms. Four single nucleotide polymorphisms (T423C, A501G, C511A, A610G) were found; the C511A and A610G nucleotide substitutions resulted in the amino acid changes Lysine171Glutamine and Alanine204Threonine, respectively. The tPrnp amino acid sequence is similar to house cat (Felis catus) and sheep, but differs significantly from two other cat Prnp sequences that were previously deposited in GenBank. PMID:16780982

  11. Identification of a Herbal Powder by Deoxyribonucleic Acid Barcoding and Structural Analyses

    PubMed Central

    Sheth, Bhavisha P.; Thaker, Vrinda S.

    2015-01-01

    Background: Authentic identification of plants is essential for exploiting their medicinal properties as well as for stopping adulteration and malpractice in their trade. Objective: To identify a herbal powder obtained from a herbalist in the local vicinity of Rajkot, Gujarat, using deoxyribonucleic acid (DNA) barcoding and molecular tools. Materials and Methods: The DNA was extracted from a herbal powder and selected Cassia species, followed by the polymerase chain reaction (PCR) and sequencing of the rbcL barcode locus. Thereafter the sequences were subjected to National Center for Biotechnology Information (NCBI) basic local alignment search tool (BLAST) analysis, followed by the protein three-dimensional structure determination of the rbcL protein from the herbal powder and Cassia species, namely Cassia fistula, Cassia tora and Cassia javanica (sequences obtained in the present study), Cassia roxburghii, and Cassia abbreviata (sequences retrieved from GenBank). Further, the multiple and pairwise structural alignments were carried out in order to identify the herbal powder. Results: The nucleotide sequences obtained from the selected species of Cassia were submitted to GenBank (Accession No. JX141397, JX141405, JX141420). The NCBI BLAST analysis of the rbcL protein from the herbal powder showed an equal sequence similarity (with reference to different parameters like E value, maximum identity, total score, query coverage) to C. javanica and C. roxburghii. In order to solve the ambiguities of the BLAST result, a protein structural approach was implemented. The protein homology models obtained in the present study were submitted to the protein model database (PM0079748-PM0079753). The pairwise structural alignment of the herbal powder (as template) and C. javanica and C. roxburghii (as targets individually) revealed a close similarity of the herbal powder with C. javanica. Conclusion: A strategy as used here, incorporating the integrated use of DNA

  12. Genome sequencing elucidates Sardinian genetic architecture and augments association analyses for lipid and blood inflammatory markers

    PubMed Central

    Zoledziewska, Magdalena; Mulas, Antonella; Pistis, Giorgio; Steri, Maristella; Danjou, Fabrice; Kwong, Alan; Ortega del Vecchyo, Vicente Diego; Chiang, Charleston W. K.; Bragg-Gresham, Jennifer; Pitzalis, Maristella; Nagaraja, Ramaiah; Tarrier, Brendan; Brennan, Christine; Uzzau, Sergio; Fuchsberger, Christian; Atzeni, Rossano; Reinier, Frederic; Berutti, Riccardo; Huang, Jie; Timpson, Nicholas J; Toniolo, Daniela; Gasparini, Paolo; Malerba, Giovanni; Dedoussis, George; Zeggini, Eleftheria; Soranzo, Nicole; Jones, Chris; Lyons, Robert; Angius, Andrea; Kang, Hyun M.; Novembre, John; Sanna, Serena; Schlessinger, David; Cucca, Francesco; Abecasis, Gonçalo R

    2015-01-01

    We report ~17.6M genetic variants from whole-genome sequencing of 2,120 Sardinians; 22% are absent from prior sequencing-based compilations and enriched for predicted functional consequence. Furthermore, ~76K variants common in our sample (frequency >5%) are rare elsewhere (<0.5% in the 1000 Genomes Project). We assessed the impact of these variants on circulating lipid levels and five inflammatory biomarkers. Fourteen signals, including two major new loci, were observed for lipid levels, and 19, including two novel loci, for inflammatory markers. New associations would be missed in analyses based on 1000 Genomes data, underlining the advantages of large-scale sequencing in this founder population. PMID:26366554

  13. Genome sequencing elucidates Sardinian genetic architecture and augments association analyses for lipid and blood inflammatory markers.

    PubMed

    Sidore, Carlo; Busonero, Fabio; Maschio, Andrea; Porcu, Eleonora; Naitza, Silvia; Zoledziewska, Magdalena; Mulas, Antonella; Pistis, Giorgio; Steri, Maristella; Danjou, Fabrice; Kwong, Alan; Ortega Del Vecchyo, Vicente Diego; Chiang, Charleston W K; Bragg-Gresham, Jennifer; Pitzalis, Maristella; Nagaraja, Ramaiah; Tarrier, Brendan; Brennan, Christine; Uzzau, Sergio; Fuchsberger, Christian; Atzeni, Rossano; Reinier, Frederic; Berutti, Riccardo; Huang, Jie; Timpson, Nicholas J; Toniolo, Daniela; Gasparini, Paolo; Malerba, Giovanni; Dedoussis, George; Zeggini, Eleftheria; Soranzo, Nicole; Jones, Chris; Lyons, Robert; Angius, Andrea; Kang, Hyun M; Novembre, John; Sanna, Serena; Schlessinger, David; Cucca, Francesco; Abecasis, Gonçalo R

    2015-11-01

    We report ∼17.6 million genetic variants from whole-genome sequencing of 2,120 Sardinians; 22% are absent from previous sequencing-based compilations and are enriched for predicted functional consequences. Furthermore, ∼76,000 variants common in our sample (frequency >5%) are rare elsewhere (<0.5% in the 1000 Genomes Project). We assessed the impact of these variants on circulating lipid levels and five inflammatory biomarkers. We observe 14 signals, including 2 major new loci, for lipid levels and 19 signals, including 2 new loci, for inflammatory markers. The new associations would have been missed in analyses based on 1000 Genomes Project data, underlining the advantages of large-scale sequencing in this founder population. PMID:26366554

  14. Power Spectrum and Mutual Information Analyses of DNA Base (Nucleotide) Sequences

    NASA Astrophysics Data System (ADS)

    Isohata, Yasuhiko; Hayashi, Masaki

    2003-03-01

    On the basis of the power spectrum analyses for the base (nucleotide) sequences of various genes, we have studied long-range correlations in total base sequences, which are expressed as $1/f^{\alpha}$, the behaviour of the exponent $\alpha$ for the accumulated base sequences, as well as periodicities at short range. In particular, from the analysis of content rate distributions of $\alpha$ we have obtained the average values $\bar{\alpha} = 0.40 \pm 0.01$ and $\bar{\alpha} = 0.20 \pm 0.01$ for the human genes and S. cerevisiae genes, respectively. We have also performed the analyses using the mutual information function. We show that there exists a clear difference between the content rate distributions of correlation lengths for the sample human genes and the S. cerevisiae genes. We are led to a conjecture that the elongation of the correlation length in the base sequences of genes from the early eukaryote (S. cerevisiae) to the late eukaryote (human) should be the definite reflection of the evolutionary process.
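
    The mutual information function mentioned above can be sketched as below. The abstract does not give the authors' exact definition, so this sketch assumes the standard Shannon form: for a separation of k bases, I(k) is the sum over base pairs (a, b) of p_ab(k) * log2(p_ab(k) / (p_a * p_b)), where p_ab(k) is the probability of finding base a and base b exactly k positions apart. The function name and test sequences are hypothetical.

```python
import math
from collections import Counter


def mutual_information(seq, k):
    """Mutual information (in bits) between bases separated by k positions.

    Assumption: standard Shannon mutual information with probabilities
    estimated from simple counts; no finite-size correction is applied.
    """
    n = len(seq) - k  # number of (base, base-k-steps-later) pairs
    if k < 1 or n < 1:
        raise ValueError("k must be >= 1 and smaller than the sequence length")

    first = seq[:n]   # left members of the pairs
    second = seq[k:]  # right members of the pairs
    pair_counts = Counter(zip(first, second))
    first_counts = Counter(first)
    second_counts = Counter(second)

    mi = 0.0
    for (a, b), count in pair_counts.items():
        p_ab = count / n
        p_a = first_counts[a] / n
        p_b = second_counts[b] / n
        mi += p_ab * math.log2(p_ab / (p_a * p_b))
    return mi


if __name__ == "__main__":
    periodic = "ACGT" * 25
    for k in (1, 2, 4):
        print(k, round(mutual_information(periodic, k), 3))
```

    Plotting I(k) against k and noting where it decays to the background level is one common way to read off a correlation length; how the authors defined correlation length is not stated in the abstract.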

  15. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy have been used extensively in physical surface sciences to study quantum tunneling, to measure the electronic local density of states of nanomaterials, and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecules of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique "electronic fingerprints" for all nucleotides on DNA and RNA using Q-Seq, along with their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition, we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  16. Correlation between fibroin amino acid sequence and physical silk properties.

    PubMed

    Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek

    2003-09-12

    The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet. PMID:12816957

  17. A Weighted U Statistic for Genetic Association Analyses of Sequencing Data

    PubMed Central

    Wei, Changshuai; Li, Ming; He, Zihuai; Vsevolozhskaya, Olga; Schaid, Daniel J.; Lu, Qing

    2014-01-01

    With advancements in next generation sequencing technology, a massive amount of sequencing data are generated, which offers a great opportunity to comprehensively investigate the role of rare variants in the genetic etiology of complex diseases. Nevertheless, the high-dimensional sequencing data poses a great challenge for statistical analysis. The association analyses based on traditional statistical methods suffer substantial power loss because of the low frequency of genetic variants and the extremely high dimensionality of the data. We developed a weighted U statistic, referred to as WU-SEQ, for the high-dimensional association analysis of sequencing data. Based on a non-parametric U statistic, WU-SEQ makes no assumption of the underlying disease model and phenotype distribution, and can be applied to a variety of phenotypes. Through simulation studies and an empirical study, we showed that WU-SEQ outperformed a commonly used SKAT method when the underlying assumptions were violated (e.g., the phenotype followed a heavy-tailed distribution). Even when the assumptions were satisfied, WU-SEQ still attained comparable performance to SKAT. Finally, we applied WU-SEQ to sequencing data from the Dallas Heart Study (DHS), and detected an association between ANGPTL 4 and very low density lipoprotein cholesterol. PMID:25331574

  18. Amino acid sequence of the nonsecretory ribonuclease of human urine.

    PubMed

    Beintema, J J; Hofsteenge, J; Iwama, M; Morita, T; Ohgi, K; Irie, M; Sugiyama, R H; Schieven, G L; Dekker, C A; Glitz, D G

    1988-06-14

    The amino acid sequence of a nonsecretory ribonuclease isolated from human urine was determined except for the identity of the residue at position 7. Sequence information indicates that the ribonucleases of human liver and spleen and an eosinophil-derived neurotoxin are identical or very closely related gene products. The sequence is identical at about 30% of the amino acid positions with those of all of the secreted mammalian ribonucleases for which information is available. Identical residues include active-site residues histidine-12, histidine-119, and lysine-41, other residues known to be important for substrate binding and catalytic activity, and all eight half-cystine residues common to these enzymes. Major differences include a deletion of six residues in the (so-called) S-peptide loop, insertions of two, and nine residues, respectively, in three other external loops of the molecule, and an addition of three residues at the amino terminus. The sequence shows the human nonsecretory ribonuclease to belong to the same ribonuclease superfamily as the mammalian secretory ribonucleases, turtle pancreatic ribonuclease, and human angiogenin. Sequence data suggest that a gene duplication occurred in an ancient vertebrate ancestor; one branch led to the nonsecretory ribonuclease, while the other branch led to a second duplication, with one line leading to the secretory ribonucleases (in mammals) and the second line leading to pancreatic ribonuclease in turtle and an angiogenic factor in mammals (human angiogenin). The nonsecretory ribonuclease has five short carbohydrate chains attached via asparagine residues at the surface of the molecule; these chains may have been shortened by exoglycosidase action.(ABSTRACT TRUNCATED AT 250 WORDS) PMID:3166997

  19. Characterization and amino acid sequence of a fatty acid-binding protein from human heart.

    PubMed

    Offner, G D; Brecher, P; Sawlivich, W B; Costello, C E; Troxler, R F

    1988-05-15

    The complete amino acid sequence of a fatty acid-binding protein from human heart was determined by automated Edman degradation of CNBr, BNPS-skatole [3'-bromo-3-methyl-2-(2-nitrobenzenesulphenyl)indolenine], hydroxylamine, Staphylococcus aureus V8 proteinase, tryptic and chymotryptic peptides, and by digestion of the protein with carboxypeptidase A. The sequence of the blocked N-terminal tryptic peptide from citraconylated protein was determined by collisionally induced decomposition mass spectrometry. The protein contains 132 amino acid residues, is enriched with respect to threonine and lysine, lacks cysteine, has an acetylated valine residue at the N-terminus, and has an Mr of 14768 and an isoelectric point of 5.25. This protein contains two short internal repeated sequences from residues 48-54 and from residues 114-119 located within regions of predicted beta-structure and decreasing hydrophobicity. These short repeats are contained within two longer repeated regions from residues 48-60 and residues 114-125, which display 62% sequence similarity. These regions could accommodate the charged and uncharged moieties of long-chain fatty acids and may represent fatty acid-binding domains consistent with the finding that human heart fatty acid-binding protein binds 2 mol of oleate or palmitate/mol of protein. Detailed evidence for the amino acid sequences of the peptides has been deposited as Supplementary Publication SUP 50143 (23 pages) at the British Library Lending Division, Boston Spa, Yorkshire LS23 7BQ, U.K., from whom copies may be obtained as indicated in Biochem. J. (1988) 249, 5. PMID:3421901

  20. Molecular cloning and amino acid sequence of human 5-lipoxygenase

    SciTech Connect

    Matsumoto, T.; Funk, C.D.; Radmark, O.; Hoeoeg, J.O.; Joernvall, H.; Samuelsson, B.

    1988-01-01

    5-Lipoxygenase (EC 1.13.11.34), a Ca²⁺- and ATP-requiring enzyme, catalyzes the first two steps in the biosynthesis of the peptidoleukotrienes and the chemotactic factor leukotriene B₄. A cDNA clone corresponding to 5-lipoxygenase was isolated from a human lung lambda gt11 expression library by immunoscreening with a polyclonal antibody. Additional clones from a human placenta lambda gt11 cDNA library were obtained by plaque hybridization with the ³²P-labeled lung cDNA clone. Sequence data obtained from several overlapping clones indicate that the composite DNAs contain the complete coding region for the enzyme. From the deduced primary structure, 5-lipoxygenase encodes a 673 amino acid protein with a calculated molecular weight of 77,839. Direct analysis of the native protein and its proteolytic fragments confirmed the deduced composition, the amino-terminal amino acid sequence, and the structure of many internal segments. 5-Lipoxygenase has no apparent sequence homology with leukotriene A₄ hydrolase or Ca²⁺-binding proteins. RNA blot analysis indicated substantial amounts of an mRNA species of approximately 2700 nucleotides in leukocytes, lung, and placenta.

  1. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    DOEpatents

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  2. The amino acid sequence of rabbit muscle triose phosphate isomerase.

    PubMed Central

    Corran, P H; Waley, S G

    1975-01-01

    The amino acid sequence of rabbit muscle triose phosphate isomerase was deduced by characterizing peptides that overlap the tryptic peptides. Thiol groups were modified by oxidation, carboxymethylation or aminoethylation. About 50 peptides that provided information about overlaps were isolated; the peptides were mostly characterized by their compositions and N-terminal residues. The peptide chains contain 248 amino acid residues, and no evidence for dissimilarity of the two subunits that comprise the native enzyme was found. The sequence of the rabbit muscle enzyme may be compared with that of the coelacanth enzyme (Kolb et al., 1974): 84% of the residues are in identical positions. Similarly, comparison of the sequence with that inferred for the chicken enzyme (Furth et al., 1974) shows that 87% of the residues are in identical positions. Limited though these comparisons are, they suggest that triose phosphate isomerase has one of the lowest rates of evolutionary change. An extended version of the present paper has been deposited as Supplementary Publication SUP 50040 (42 pages) at the British Library (Lending Division) (formerly the National Lending Library for Science and Technology), Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms given in Biochem. J. (1975) 145, 5. PMID:1171682

  3. The amino acid sequence of chymopapain from Carica papaya.

    PubMed Central

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-01-01

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  4. The amino acid sequence of chymopapain from Carica papaya.

    PubMed

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-02-15

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  5. Sequence and expression analyses of the UL37 and UL38 genes of Aujeszky's disease virus.

    PubMed

    Braun, A; Kaliman, A; Boldogköi, Z; Aszódi, A; Fodor, I

    2000-01-01

    Previously, we sequenced the HSV-1 UL39-UL40 homologue genes of Aujeszky's disease virus (ADV), also designated as pseudorabies virus (Kaliman et al., 1994a, b). Now we report the nucleotide sequence of the adjacent DNA that encodes UL38, the 5'-region (750 bp) of UL37, and the promoter regions between these two divergently arranged genes. The ADV UL38 gene encodes a protein of 368 amino acids. Amino acid sequence comparison of ADV UL38 with that of other herpesviruses revealed significant structural homology. In a transcription study using RNase protection assay and Northern blot hybridization, we found that the UL38 gene had one initiation site, but the UL37 gene was initiated at two transcription sites with two potential initiator AUGs, one of which was dominant. Comparison of ADV UL37, UL38 and ribonucleotide reductase gene expression showed that these genes belong to the same temporal class with early kinetics. Data of structural and transcriptional studies suggest that regulation of the expression of these two ADV genes could differ from that of the HSV-1 virus. PMID:11402671

  6. Phylogenetic Analyses of Novel Squamate Adenovirus Sequences in Wild-Caught Anolis Lizards

    PubMed Central

    Ascher, Jill M.; Geneva, Anthony J.; Ng, Julienne; Wyatt, Jeffrey D.; Glor, Richard E.

    2013-01-01

    Adenovirus infection has emerged as a serious threat to the health of captive snakes and lizards (i.e., squamates), but we know relatively little about this virus' range of possible hosts, pathogenicity, modes of transmission, and sources from nature. We report the first case of adenovirus infection in the Iguanidae, a diverse family of lizards that is widely-studied and popular in captivity. We report adenovirus infections from two closely-related species of Anolis lizards (anoles) that were recently imported from wild populations in the Dominican Republic to a laboratory colony in the United States. We investigate the evolution of adenoviruses in anoles and other squamates using phylogenetic analyses of adenovirus polymerase gene sequences sampled from Anolis and a range of other vertebrate taxa. These phylogenetic analyses reveal that (1) the sequences detected from each species of Anolis are novel, and (2) adenoviruses are not necessarily host-specific and do not always follow a co-speciation model under which host and virus phylogenies are perfectly concordant. Together with the fact that the Anolis adenovirus sequences reported in our study were detected in animals that became ill and subsequently died shortly after importation while exhibiting clinical signs consistent with acute adenovirus infection, our discoveries suggest the need for renewed attention to biosecurity measures intended to prevent the spread of adenovirus both within and among species of snakes and lizards housed in captivity. PMID:23593364

  7. Comparative sequence and genetic analyses of asparagus BACs reveal no microsynteny with onion or rice.

    PubMed

    Jakse, Jernej; Telgmann, Alexa; Jung, Christian; Khar, Anil; Melgar, Sergio; Cheung, Foo; Town, Christopher D; Havey, Michael J

    2006-12-01

    The Poales (includes the grasses) and Asparagales [includes onion (Allium cepa L.) and asparagus (Asparagus officinalis L.)] are the two most economically important monocot orders. The Poales are a member of the commelinoid monocots, a group of orders sister to the Asparagales. Comparative genomic analyses have revealed a high degree of synteny among the grasses; however, it is not known if this synteny extends to other major monocot groups such as the Asparagales. Although we previously reported no evidence for synteny at the recombinational level between onion and rice, microsynteny may exist across shorter genomic regions in the grasses and Asparagales. We sequenced nine asparagus BACs to reveal physically linked genic-like sequences and determined their most similar positions in the onion and rice genomes. Four of the asparagus BACs were selected using molecular markers tightly linked to the sex-determining M locus on chromosome 5 of asparagus. These BACs possessed only two putative coding regions and had long tracts of degenerated retroviral elements and transposons. Five asparagus BACs were selected after hybridization of three onion cDNAs that mapped to three different onion chromosomes. Genic-like sequences that were physically linked on the cDNA-selected BACs or genetically linked on the M-linked BACs showed significant similarities (e < -20) to expressed sequences on different rice chromosomes, revealing no evidence for microsynteny between asparagus and rice across these regions. Genic-like sequences that were linked in asparagus were used to identify highly similar (e < -20) expressed sequence tags (ESTs) of onion. These onion ESTs mapped to different onion chromosomes and no relationship was observed between physical or genetic linkages in asparagus and genetic linkages in onion. 
    These results further indicate that synteny among grass genomes does not extend to a sister order in the monocots and that asparagus may not be an appropriate smaller genome

  8. Amino acid sequence prerequisites for the formation of cn ions.

    PubMed

    Downard, K M; Biemann, K

    1993-11-01

    Amino acid sequence prerequisites are described for the formation of cn ions observed in high-energy collision-induced decomposition spectra of peptides. It is shown that the formation of cn ions is promoted by the nature of the amino acid C-terminal to the cleavage site. A propensity for cn cleavage preceding threonine, and to a lesser extent tryptophan, lysine, and serine, is demonstrated where fragmentation is directed N-terminally at these residues. In addition, the nature of the residue N-terminal to the cleavage site is shown to have little effect on cn ion formation. A mechanism for cn ion formation is proposed and its applicability to the results observed is discussed. PMID:24227531

  9. Ultrasensitive nucleic acid sequence detection by single-molecule electrophoresis

    SciTech Connect

    Castro, A; Shera, E.B.

    1996-09-01

    This is the final report of a one-year laboratory-directed research and development project at Los Alamos National Laboratory. There has been considerable interest in the development of very sensitive clinical diagnostic techniques over the last few years. Many pathogenic agents are often present in extremely small concentrations in clinical samples, especially at the initial stages of infection, making their detection very difficult. This project sought to develop a new technique for the detection and accurate quantification of specific bacterial and viral nucleic acid sequences in clinical samples. The scheme involved the use of novel hybridization probes for the detection of nucleic acids combined with our recently developed technique of single-molecule electrophoresis. This project is directly relevant to the DOE's Defense Programs strategic directions in the area of biological warfare counter-proliferation.

  10. 5S ribosomal ribonucleic acid sequences in Bacteroides and Fusobacterium: evolutionary relationships within these genera and among eubacteria in general

    NASA Technical Reports Server (NTRS)

    Van den Eynde, H.; De Baere, R.; Shah, H. N.; Gharbia, S. E.; Fox, G. E.; Michalik, J.; Van de Peer, Y.; De Wachter, R.

    1989-01-01

    The 5S ribosomal ribonucleic acid (rRNA) sequences were determined for Bacteroides fragilis, Bacteroides thetaiotaomicron, Bacteroides capillosus, Bacteroides veroralis, Porphyromonas gingivalis, Anaerorhabdus furcosus, Fusobacterium nucleatum, Fusobacterium mortiferum, and Fusobacterium varium. A dendrogram constructed by a clustering algorithm from these sequences, which were aligned with all other hitherto known eubacterial 5S rRNA sequences, showed differences as well as similarities with respect to results derived from 16S rRNA analyses. In the 5S rRNA dendrogram, Bacteroides clustered together with Cytophaga and Fusobacterium, as in 16S rRNA analyses. Intraphylum relationships deduced from 5S rRNAs suggested that Bacteroides is specifically related to Cytophaga rather than to Fusobacterium, as was suggested by 16S rRNA analyses. Previous taxonomic considerations concerning the genus Bacteroides, based on biochemical and physiological data, were confirmed by the 5S rRNA sequence analysis.

  11. The amino acid sequence of ribonuclease U2 from Ustilago sphaerogena.

    PubMed Central

    Sato, S; Uchida, T

    1975-01-01

    1. RNAase (ribonuclease) U2, a purine-specific RNAase, was reduced, aminoethylated and hydrolysed with trypsin, chymotrypsin and thermolysin. On the basis of the analyses of the resulting peptides, the complete amino acid sequence of RNAase U2 was determined. 2. When the sequence was compared with the amino acid sequence of RNAase T1 (EC 3.1.4.8), the following regions were found to be similar in the two enzymes: Tyr-Pro-His-Gln-Tyr (38-42) in RNAase U2 and Tyr-Pro-His-Lys-Tyr (38-42) in RNAase T1, Glu-Phe-Pro-Leu-Val (61-65) in RNAase U2 and Glu-Trp-Pro-Ile-Leu (58-62) in RNAase T1, Asp-Arg-Val-Ile-Tyr-Gln (83-88) in RNAase U2 and Asp-Arg-Val-Phe-Asn (76-81) in RNAase T1 and Val-Thr-His-Thr-Gly-Ala (98-103) in RNAase U2 and Ile-Thr-His-Thr-Gly-Ala (90-95) in RNAase T1. All of the amino acid residues, histidine-40, glutamate-58, arginine-77 and histidine-92, which were found to play a crucial role in the biological activity of RNAase T1, were included in the regions cited here. 3. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50041 (33 pages) at the British Library (Lending Division) (formerly the National Lending Library for Science and Technology), Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1975), 145, 5. PMID:1156364
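
    The motif comparison in this abstract can be checked with a short position-by-position count. The sketch below is illustrative only: the function name `motif_identity` is hypothetical, the residue strings are copied from the abstract, and the pair at 83-88 (U2) versus 76-81 (T1) is left out because the two motifs are printed with different numbers of residues and no alignment is given.

```python
# Motif pairs copied from the abstract: RNAase U2 first, RNAase T1 second.
# The 83-88 / 76-81 pair is omitted (unequal printed lengths, no alignment given).
MOTIF_PAIRS = {
    "38-42 / 38-42": ("Tyr-Pro-His-Gln-Tyr", "Tyr-Pro-His-Lys-Tyr"),
    "61-65 / 58-62": ("Glu-Phe-Pro-Leu-Val", "Glu-Trp-Pro-Ile-Leu"),
    "98-103 / 90-95": ("Val-Thr-His-Thr-Gly-Ala", "Ile-Thr-His-Thr-Gly-Ala"),
}


def motif_identity(motif_a: str, motif_b: str) -> tuple[int, int]:
    """Return (identical positions, motif length) for two equal-length motifs.

    Motifs are hyphen-separated three-letter residue codes and are compared
    position by position, without gaps.
    """
    residues_a = motif_a.split("-")
    residues_b = motif_b.split("-")
    if len(residues_a) != len(residues_b):
        raise ValueError("motifs must have the same number of residues")
    matches = sum(a == b for a, b in zip(residues_a, residues_b))
    return matches, len(residues_a)


if __name__ == "__main__":
    for label, (u2, t1) in MOTIF_PAIRS.items():
        matches, length = motif_identity(u2, t1)
        print(f"{label}: {matches}/{length} identical residues")
```

    Counting exact matches treats conservative substitutions (for example Val/Ile) as mismatches, so it gives a lower bound on the similarity the authors describe.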

  12. Complete amino acid sequence of globin chains and biological activity of fragmented crocodile hemoglobin (Crocodylus siamensis).

    PubMed

    Srihongthong, Saowaluck; Pakdeesuwan, Anawat; Daduang, Sakda; Araki, Tomohiro; Dhiravisit, Apisak; Thammasirirak, Sompong

    2012-08-01

    Hemoglobin, α-chain, β-chain and fragmented hemoglobin of Crocodylus siamensis demonstrated both antibacterial and antioxidant activities. Antibacterial and antioxidant properties of the hemoglobin did not depend on the heme structure but could result from the compositions of amino acid residues and structures present in their primary structure. Furthermore, thirteen purified active peptides were obtained by RP-HPLC analyses, corresponding to fragments in the α-globin chain and the β-globin chain which are mostly located at the N-terminal and C-terminal parts. These active peptides operate on the bacterial cell membrane. The globin chains of Crocodylus siamensis showed similar amino acids to the sequences of Crocodylus niloticus. The novel amino acid substitutions of α-chain and β-chain are not associated with the heme binding site or the bicarbonate ion binding site, but could be important through their interactions with membranes of bacteria. PMID:22648692

  13. Molecular Characterization of Five Potyviruses Infecting Korean Sweet Potatoes Based on Analyses of Complete Genome Sequences.

    PubMed

    Kwak, Hae-Ryun; Kim, Jaedeok; Kim, Mi-Kyeong; Seo, Jang-Kyun; Jung, Mi-Nam; Kim, Jeong-Soo; Lee, Sukchan; Choi, Hong-Soo

    2015-12-01

    Sweet potatoes (Ipomoea batatas L.) are grown extensively in tropical and temperate regions, and are important food crops worldwide. In Korea, potyviruses, including Sweet potato feathery mottle virus (SPFMV), Sweet potato virus C (SPVC), Sweet potato virus G (SPVG), Sweet potato virus 2 (SPV2), and Sweet potato latent virus (SPLV), have been detected in sweet potato fields at a high (~95%) incidence. In the present work, complete genome sequences of 18 isolates, representing the five potyviruses mentioned above, were compared with previously reported genome sequences. The complete genomes consisted of 10,081 to 10,830 nucleotides, excluding the poly-A tails. Their genomic organizations were typical of the Potyvirus genus, including one target open reading frame coding for a putative polyprotein. Based on phylogenetic analyses and sequence comparisons, the Korean SPFMV isolates belonged to the strains RC and O with >98% nucleotide sequence identity. Korean SPVC isolates had 99% identity to the Japanese isolate SPVC-Bungo and 70% identity to the SPFMV isolates. The Korean SPVG isolates showed 99% identity to the three previously reported SPVG isolates. Korean SPV2 isolates had 97% identity to the SPV2 GWB-2 isolate from the USA. Korean SPLV isolates had a relatively low (88%) nucleotide sequence identity with the Taiwanese SPLV-TW isolates, and they were phylogenetically distantly related to SPFMV isolates. Recombination analysis revealed that possible recombination events occurred in the P1, HC-Pro and NIa-NIb regions of SPFMV and SPLV isolates and these regions were identified as hotspots for recombination in the sweet potato potyviruses. PMID:26673876

  14. Molecular Characterization of Five Potyviruses Infecting Korean Sweet Potatoes Based on Analyses of Complete Genome Sequences

    PubMed Central

    Kwak, Hae-Ryun; Kim, Jaedeok; Kim, Mi-Kyeong; Seo, Jang-Kyun; Jung, Mi-Nam; Kim, Jeong-Soo; Lee, Sukchan; Choi, Hong-Soo

    2015-01-01

    Sweet potatoes (Ipomoea batatas L.) are grown extensively in tropical and temperate regions, and are important food crops worldwide. In Korea, potyviruses, including Sweet potato feathery mottle virus (SPFMV), Sweet potato virus C (SPVC), Sweet potato virus G (SPVG), Sweet potato virus 2 (SPV2), and Sweet potato latent virus (SPLV), have been detected in sweet potato fields at a high (~95%) incidence. In the present work, complete genome sequences of 18 isolates, representing the five potyviruses mentioned above, were compared with previously reported genome sequences. The complete genomes consisted of 10,081 to 10,830 nucleotides, excluding the poly-A tails. Their genomic organizations were typical of the Potyvirus genus, including one target open reading frame coding for a putative polyprotein. Based on phylogenetic analyses and sequence comparisons, the Korean SPFMV isolates belonged to the strains RC and O with >98% nucleotide sequence identity. Korean SPVC isolates had 99% identity to the Japanese isolate SPVC-Bungo and 70% identity to the SPFMV isolates. The Korean SPVG isolates showed 99% identity to the three previously reported SPVG isolates. Korean SPV2 isolates had 97% identity to the SPV2 GWB-2 isolate from the USA. Korean SPLV isolates had a relatively low (88%) nucleotide sequence identity with the Taiwanese SPLV-TW isolates, and they were phylogenetically distantly related to SPFMV isolates. Recombination analysis revealed that possible recombination events occurred in the P1, HC-Pro and NIa-NIb regions of SPFMV and SPLV isolates and these regions were identified as hotspots for recombination in the sweet potato potyviruses. PMID:26673876

  15. Phylogeny of yeasts and related filamentous fungi within Pucciniomycotina determined from multigene sequence analyses

    PubMed Central

    Wang, Q.-M.; Groenewald, M.; Takashima, M.; Theelen, B.; Han, P.-J.; Liu, X.-Z.; Boekhout, T.; Bai, F.-Y.

    2015-01-01

    In addition to rusts, the subphylum Pucciniomycotina (Basidiomycota) includes a large number of unicellular or dimorphic fungi which are usually studied as yeasts. Ribosomal DNA sequence analyses have shown that the current taxonomic system of the pucciniomycetous yeasts, which is based on phenotypic criteria, is not concordant with the molecular phylogeny and many genera are polyphyletic. Here we inferred the molecular phylogeny of 184 pucciniomycetous yeast species and related filamentous fungi using maximum likelihood, maximum parsimony and Bayesian inference analyses based on the sequences of seven genes, including the small subunit ribosomal DNA (rDNA), the large subunit rDNA D1/D2 domains, the internal transcribed spacer regions (ITS 1 and 2) of rDNA including the 5.8S rDNA gene; the nuclear protein-coding genes of the two subunits of RNA polymerase II (RPB1 and RPB2) and the translation elongation factor 1-α (TEF1); and the mitochondrial gene cytochrome b (CYTB). A total of 33 monophyletic clades and 18 single species lineages were recognised among the pucciniomycetous yeasts employed, which belonged to four major lineages corresponding to Agaricostilbomycetes, Cystobasidiomycetes, Microbotryomycetes and Mixiomycetes. These lineages remained independent from the classes Atractiellomycetes, Classiculomycetes, Pucciniomycetes and Tritirachiomycetes formed by filamentous taxa in Pucciniomycotina. An updated taxonomic system of pucciniomycetous yeasts implementing the ‘One fungus = One name’ principle will be proposed based on the phylogenetic framework presented here. PMID:26955197

  16. Molecular phylogenetic and dating analyses using mitochondrial DNA sequences of eyelid geckos (Squamata: Eublepharidae).

    PubMed

    Jonniaux, Pierre; Kumazawa, Yoshinori

    2008-01-15

    Mitochondrial DNA sequences of approximately 2.3 kbp including the complete NADH dehydrogenase subunit 2 gene and its flanking genes, as well as parts of 12S and 16S rRNA genes were determined from major species of the eyelid gecko family Eublepharidae sensu [Kluge, A.G. 1987. Cladistic relationships in the Gekkonoidea (Squamata, Sauria). Misc. Publ. Mus. Zool. Univ. Michigan 173, 1-54.]. In contrast to previous morphological studies, phylogenetic analyses based on these sequences supported that Eublepharidae and Gekkonidae form a sister group with Pygopodidae, raising the possibility of homoplasious character change in some key features of geckos, such as reduction of movable eyelids and innovation of climbing toe pads. The phylogenetic analyses also provided a well-resolved tree for relationships between the eublepharid species. The Bayesian estimation of divergence times without assuming the molecular clock suggested the Jurassic divergence of Eublepharidae from Gekkonidae and radiations of most eublepharid genera around the Cretaceous. These dating results appeared to be robust against some conditional changes for time estimation, such as gene regions used, taxon representation, and data partitioning. Taken together with geological evidence, these results support the vicariant divergence of Eublepharidae and Gekkonidae by the breakup of Pangea into Laurasia and Gondwanaland, and recent dispersal of two African eublepharid genera from Eurasia to Africa after these landmasses were connected in the Early Miocene. PMID:18029117

  17. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    SciTech Connect

    Myers, G.; Korber, B.; Wain-Hobson, S.; Smith, R.F.; Pavlakis, G.N.

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  18. Identification of food and beverage spoilage yeasts from DNA sequence analyses.

    PubMed

    Kurtzman, Cletus P

    2015-11-20

    Detection, identification and classification of yeasts have undergone major changes in the last decade and a half following application of gene sequence analyses and genome comparisons. Development of a database (barcode) of easily determined DNA sequences from domains 1 and 2 (D1/D2) of the nuclear large subunit rRNA gene and from ITS now permits many laboratories to identify species quickly and accurately, thus replacing the laborious and often inaccurate phenotypic tests previously used. Phylogenetic analysis of gene sequences has resulted in a major revision of yeast systematics resulting in redefinition of nearly all genera. This new understanding of species relationships has prompted a change of rules for naming and classifying yeasts and other fungi, and these new rules are presented in the recently implemented International Code of Nomenclature for algae, fungi, and plants (Melbourne Code). The use of molecular methods for species identification and the impact of Code changes on classification will be discussed, especially in the context of food and beverage spoilage yeasts. PMID:26051959

  19. cDNA sequence and protein bioinformatics analyses of MSTN in African catfish (Clarias gariepinus).

    PubMed

    Kanjanaworakul, Poonmanee; Sawatdichaikul, Orathai; Poompuang, Supawadee

    2016-04-01

    Myostatin, also known as growth differentiation factor 8, has been identified as a potent negative regulator of skeletal muscle growth. The purpose of this study was to characterize and predict function of the myostatin gene of the African catfish (Cg-MSTN). Expression of Cg-MSTN was determined at three growth stages to establish the relationship between the levels of MSTN transcript and skeletal muscle growth. The partial cDNA sequence of Cg-MSTN was cloned by using published information from its congener walking catfish (Cm-MSTN). The Cg-MSTN was 1194 bp in length encoding a protein of 397 amino acids. The deduced MSTN sequence exhibited key functional sites similar to those of other members of the TGF-β superfamily, especially the proteolytic processing site (RXXR motif) and nine conserved cysteines at the C-terminal. Expression of MSTN appeared to be correlated with muscle development and growth of African catfish. Protein bioinformatics revealed that the primary sequence of Cg-MSTN shared 98 % sequence identity with that of walking catfish Cm-MSTN with only two different residues, [Formula: see text] and [Formula: see text]. The proposed model of Cg-MSTN revealed the key point mutation [Formula: see text] causing a 7.35 Å shorter distance between the N- and C-lobes and an approximately 11° narrower angle than those of Cm-MSTN. The substitution of a proline residue near the proteolytic processing site which altered the structure of myostatin may play a critical role in reducing proteolytic activity of this protein in African catfish. PMID:26912268

  20. DNA sequence analyses of blended herbal products including synthetic cannabinoids as designer drugs.

    PubMed

    Ogata, Jun; Uchiyama, Nahoko; Kikura-Hanajiri, Ruri; Goda, Yukihiro

    2013-04-10

    In recent years, various herbal products adulterated with synthetic cannabinoids have been distributed worldwide via the Internet. These herbal products are mostly sold as incense, and advertised as not for human consumption. Although their labels indicate that they contain mixtures of several potentially psychoactive plants, and numerous studies have reported that they contain a variety of synthetic cannabinoids, their exact botanical contents are not always clear. In this study, we investigated the origins of botanical materials in 62 Spice-like herbal products distributed on the illegal drug market in Japan, by DNA sequence analyses and BLAST searches. The nucleotide sequences of four regions were analyzed to identify the origins of each plant species in the herbal mixtures. The sequences of "Damiana" (Turnera diffusa) and Lamiaceae herbs (Melissa, Mentha and Thymus) were frequently detected in a number of products. However, the sequences of other plant species indicated on the packaging labels were not detected. In a few products, DNA fragments of potent psychotropic plants were found, including marijuana (Cannabis sativa), "Diviner's Sage" (Salvia divinorum) and "Kratom" (Mitragyna speciosa). Their active constituents were also confirmed using gas chromatography-mass spectrometry (GC-MS) and liquid chromatography-mass spectrometry (LC-MS), although these plant names were never indicated on the labels. Most plant species identified in the products were different from the plants indicated on the labels. The plant materials would be used mainly as diluents for the psychoactive synthetic compounds, because no reliable psychoactive effects have been reported for most of the identified plants, with the exception of the psychotropic plants named above. PMID:23092848

  1. Fatty acid analyses may provide insight into the progression of starvation among squamate reptiles.

    PubMed

    McCue, Marshall D

    2008-10-01

    Fasting-induced changes in fatty acid composition have been reported to occur within the body lipids of several types of animals; however, little is known about the changes in fatty acid profiles exhibited by reptiles subjected to prolonged fasting. This study characterizes the fatty acid profiles of six reptile species subjected to sublethal periods of fasting lasting 0, 56, 112, and 168 days. Analyses of fatty acid methyl esters (FAMEs) conducted on the total body lipids of rattlesnakes (Crotalus atrox), ratsnakes (Elaphe obsoleta), pythons (Python regius), boas (Boa constrictor), true vipers (Bitis gabonica), and monitor lizards (Varanus exanthematicus) revealed that all of the species exhibited similar characteristic changes in their fatty acid profiles during starvation stress. According to ANOVAs, the four most effective indicators of the onset of starvation were significant increases in the [1] fatty acid unsaturation index as well as ratios of [2] linoleic to palmitoleic acid, [3] oleic to palmitic, and [4] arachidonic to total fatty acid concentrations. The results of this study suggest that FAME analyses might be useful for identifying nutritional stress and/or starvation among squamate reptiles; however, forthcoming studies will be required to validate the generality of these responses. I also review the potential limitations of this approach, and suggest experiments that will be important for future applications of FAME analyses. Ultimately, it is hoped that FAME analyses can be used in conjunction with current practices as an additional tool to characterize the prevalence of starvation experienced by free-living reptiles. PMID:18657629

  2. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... in the sequence. (4) The enumeration of amino acids may start at the first amino acid of the first..., counting backwards starting with the amino acid next to number 1. Otherwise, the enumeration of amino acids... sequence every 5 amino acids. The enumeration method for amino acid sequences that is set forth......

  3. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... in the sequence. (4) The enumeration of amino acids may start at the first amino acid of the first..., counting backwards starting with the amino acid next to number 1. Otherwise, the enumeration of amino acids... sequence every 5 amino acids. The enumeration method for amino acid sequences that is set forth......

  4. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... in the sequence. (4) The enumeration of amino acids may start at the first amino acid of the first..., counting backwards starting with the amino acid next to number 1. Otherwise, the enumeration of amino acids... sequence every 5 amino acids. The enumeration method for amino acid sequences that is set forth......

  5. Predicting protein disorder by analyzing amino acid sequence

    PubMed Central

    Yang, Jack Y; Yang, Mary Qu

    2008-01-01

    Background Many protein regions and some entire proteins have no definite tertiary structure, presenting instead as dynamic, disordered ensembles under different physicochemical circumstances. These proteins and regions are known as Intrinsically Unstructured Proteins (IUP). IUP have been associated with a wide range of protein functions, along with roles in diseases characterized by protein misfolding and aggregation. Results Identifying IUP is an important task in structural and functional genomics. We extract useful features from sequences and develop machine learning algorithms for the above task. We compare our IUP predictor with PONDRs (mainly neural-network-based predictors), disEMBL (also based on neural networks) and Globplot (based on disorder propensity). Conclusion We find that augmenting features derived from physicochemical properties of amino acids (such as hydrophobicity, complexity etc.) and using an ensemble method proved beneficial. The IUP predictor is a viable alternative software tool for identifying IUP protein regions and proteins. PMID:18831799
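
    The abstract above names hydrophobicity and sequence complexity as features for disorder prediction but does not say how they are computed. The following is a minimal sketch of one plausible way to derive both per sliding window. The Kyte-Doolittle hydropathy scale, the Shannon-entropy measure of complexity, the window width of 9, and the function name `window_features` are all assumptions made for illustration, not details taken from the paper.

```python
import math
from collections import Counter

# Kyte-Doolittle hydropathy scale. Assumption: the paper does not state
# which hydrophobicity scale it uses; this is one widely used choice.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def window_features(seq, width=9):
    """Return (mean hydropathy, Shannon entropy in bits) for each window.

    Hypothetical helper: the window width and both measures are
    illustrative choices, not the published feature set.
    """
    feats = []
    for i in range(len(seq) - width + 1):
        win = seq[i:i + width]
        hydro = sum(KD[a] for a in win) / width
        counts = Counter(win)
        entropy = -sum(
            (c / width) * math.log2(c / width) for c in counts.values()
        )
        feats.append((hydro, entropy))
    return feats
```

    Low-entropy (low-complexity), hydrophilic windows are the kind of signal such features are commonly used to capture; feature vectors like these would then be passed to a classifier or an ensemble of classifiers.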

  6. Computer-aided analyses of transport protein sequences: gleaning evidence concerning function, structure, biogenesis, and evolution.

    PubMed Central

    Saier, M H

    1994-01-01

    Three-dimensional structures have been elucidated for very few integral membrane proteins. Computer methods can be used as guides for estimation of solute transport protein structure, function, biogenesis, and evolution. In this paper the application of currently available computer programs to over a dozen distinct families of transport proteins is reviewed. The reliability of sequence-based topological and localization analyses and the importance of sequence and residue conservation to structure and function are evaluated. Evidence concerning the nature and frequency of occurrence of domain shuffling, splicing, fusion, deletion, and duplication during evolution of specific transport protein families is also evaluated. Channel proteins are proposed to be functionally related to carriers. It is argued that energy coupling to transport was a late occurrence, superimposed on preexisting mechanisms of solute facilitation. It is shown that several transport protein families have evolved independently of each other, employing different routes, at different times in evolutionary history, to give topologically similar transmembrane protein complexes. The possible significance of this apparent topological convergence is discussed. PMID:8177172

  7. RAPHIDOPHYCEAE [CHADEFAUD EX SILVA] SYSTEMATICS AND RAPID IDENTIFICATION: SEQUENCE ANALYSES AND REAL-TIME PCR ASSAYS

    PubMed Central

    Bowers, Holly A.; Tomas, Carmelo; Tengs, Torstein; Kempton, Jason W.; Lewitus, Alan J.; Oldach, David W.

    2010-01-01

    Species within the class Raphidophyceae were associated with fish kill events in Japanese, European, Canadian, and U.S. coastal waters. Fish mortality was attributable to gill damage with exposure to reactive oxygen species (peroxide, superoxide, and hydroxide radicals), neurotoxins, physical clogging, and hemolytic substances. Morphological identification of these organisms in environmental water samples is difficult, particularly when fixatives are used. Because of this difficulty and the continued global emergence of these species in coastal estuarine waters, we initiated the development and validation of a suite of real-time polymerase chain reaction (PCR) assays. Sequencing was used to generate complete data sets for nuclear encoded small-subunit ribosomal RNA (SSU rRNA; 18S); internal transcribed spacers 1 and 2, 5.8S; and plastid encoded SSU rRNA (16S) for confirmed raphidophyte cultures from various geographic locations. Sequences for several Chattonella species (C. antiqua, C. marina, C. ovata, C. subsalsa, and C. verruculosa), Heterosigma akashiwo, and Fibrocapsa japonica were generated and used to design rapid and specific PCR assays for several species including C. verruculosa Hara et Chihara, C. subsalsa Biecheler, the complex comprised of C. marina Hara et Chihara, C. antiqua Ono and C. ovata, H. akashiwo Ono, and F. japonica Toriumi et Takano using appropriate loci. With this comprehensive data set, we were also able to perform phylogenetic analyses to determine the relationship between these species. PMID:20411032

  8. Merging Fargesia dracocephala into Fargesia decurvata (Bambusoideae, Poaceae): Implications from Morphological and ITS Sequence Analyses

    PubMed Central

    Wu, A-Li; Ren, Yi

    2014-01-01

    Aims Fargesia decurvata is closely allied with F. dracocephala and differs in 5 major characters (i.e. the culm sheath blade base shape, the width of the culm sheath blade base, the auricle shape, and the lower surface of leaf blade) in Fargesia. It is difficult to distinguish these two species because of the existence of transitional character states. The aims of this paper are to (i) investigate whether the variation of the characters is continuous or not; (ii) reveal whether the publication of F. dracocephala was the result of discontinuous sampling of F. decurvata or not. Methods Ten populations of F. decurvata and F. dracocephala were investigated across their entire distribution (including type localities). The states of 5 major characters were measured from 693 annual and 693 perennial culms of 231 individuals in 10 populations, and analyzed at population, individual and culm levels. UPGMA cluster analysis was carried out based on 29 characters from 10 populations of F. decurvata and F. dracocephala and 2 populations of F. qinlingensis as outgroup. The ITS regions were also sequenced and analyzed. Important Findings Five major characters exhibited great variation not only at population level, but also at individual level within a population, and even at the culm level within an individual and in different parts of the same culm. Cluster analyses showed that 10 populations of F. decurvata and F. dracocephala were not divided into two species, but they were well separated from the outgroup. There was no difference in floral organs between F. decurvata and F. dracocephala. MP and NJ trees based on ITS sequences showed the same results as the cluster analysis on morphological characters. All the facts indicated that the publication of F. dracocephala was the result of discontinuous sampling of F. decurvata, and F. dracocephala should be treated as a synonym of F. decurvata. PMID:24988081

  9. Insight in Genome-Wide Association of Metabolite Quantitative Traits by Exome Sequence Analyses

    PubMed Central

    Verhoeven, Aswin; Dharuri, Harish; Amin, Najaf; van Klinken, Jan Bert; Karssen, Lennart C.; de Vries, Boukje; Meissner, Axel; Göraler, Sibel; van den Maagdenberg, Arn M. J. M.; Deelder, André M.; C ’t Hoen, Peter A.; van Duijn, Cornelia M.; van Dijk, Ko Willems

    2015-01-01

    Metabolite quantitative traits carry great promise for epidemiological studies, and their genetic background has been addressed using Genome-Wide Association Studies (GWAS). Thus far, the role of less common variants has not been exhaustively studied. Here, we set out a GWAS for metabolite quantitative traits in serum, followed by exome sequence analysis to zoom in on putative causal variants in the associated genes. 1H Nuclear Magnetic Resonance (1H-NMR) spectroscopy experiments yielded successful quantification of 42 unique metabolites in 2,482 individuals from The Erasmus Rucphen Family (ERF) study. Heritability of metabolites was estimated by SOLAR. GWAS was performed by linear mixed models, using HapMap imputations. Based on physical vicinity and pathway analyses, candidate genes were screened for coding region variation using exome sequence data. Heritability estimates for metabolites ranged between 10% and 52%. GWAS replicated three known loci in the metabolome wide significance: CPS1 with glycine (P-value = 1.27×10⁻³²), PRODH with proline (P-value = 1.11×10⁻¹⁹), SLC16A9 with carnitine level (P-value = 4.81×10⁻¹⁴) and uncovered a novel association between DMGDH and dimethyl-glycine (P-value = 1.65×10⁻¹⁹) level. In addition, we found three novel, suggestively significant loci: TNP1 with pyruvate (P-value = 1.26×10⁻⁸), KCNJ16 with 3-hydroxybutyrate (P-value = 1.65×10⁻⁸) and 2p12 locus with valine (P-value = 3.49×10⁻⁸). Exome sequence analysis identified potentially causal coding and regulatory variants located in the genes CPS1, KCNJ2 and PRODH, and revealed allelic heterogeneity for CPS1 and PRODH. Combined GWAS and exome analyses of metabolites detected by high-resolution 1H-NMR is a robust approach to uncover metabolite quantitative trait loci (mQTL), and the likely causative variants in these loci. It is anticipated that insight in the genetics of intermediate phenotypes will provide additional

  10. Structural gene and complete amino acid sequence of Pseudomonas aeruginosa IFO 3455 elastase.

    PubMed Central

    Fukushima, J; Yamamoto, S; Morihara, K; Atsumi, Y; Takeuchi, H; Kawamoto, S; Okuda, K

    1989-01-01

    The DNA encoding the elastase of Pseudomonas aeruginosa IFO 3455 was cloned, and its complete nucleotide sequence was determined. When the cloned gene was ligated to pUC18, the Escherichia coli expression vector, bacteria carrying the gene exhibited high levels of both elastase activity and elastase antigens. The amino acid sequence, deduced from the nucleotide sequence, revealed that the mature elastase consisted of 301 amino acids with a relative molecular mass of 32,926 daltons. The amino acid composition predicted from the DNA sequence was quite similar to the chemically determined composition of purified elastase reported previously. We also observed nucleotide sequence encoding a signal peptide and "pro" sequence consisting of 197 amino acids upstream from the mature elastase protein gene. The amino acid sequence analysis revealed that both the N-terminal sequence of the purified elastase and the N-terminal side sequences of the C-terminal tryptic peptide as well as the internal lysyl peptide fragment were completely identical to the deduced amino acid sequences. The pattern of identity of amino acid sequences was quite evident in the regions that include structurally and functionally important residues of Bacillus subtilis thermolysin. PMID:2493453

  11. Searching for Extraterrestrial Amino Acids in a Contaminated Meteorite: Amino Acid Analyses of the Canakkale L6 Chondrite

    NASA Technical Reports Server (NTRS)

    Burton, A. S.; Elsila, J. E.; Glavin, D. P.; Dworkin, J. P.; Ornek, C. Y.; Esenoglu, H. H.; Unsalan, O.; Ozturk, B.

    2016-01-01

    Amino acids can serve as important markers of cosmochemistry, as their abundances and isomeric and isotopic compositions have been found to vary predictably with changes in parent body chemistry and alteration processes. Amino acids are also of astrobiological interest because they are essential for life on Earth. Analyses of a range of meteorites, including all groups of carbonaceous chondrites, along with H, R, and LL chondrites, ureilites, and a martian shergottite, have revealed that amino acids of plausible extraterrestrial origin can be formed in and persist after a wide range of parent body conditions. However, amino acid analyses of L6 chondrites to date have not provided evidence for indigenous amino acids. In the present study, we performed amino acid analysis on larger samples of a different L6 chondrite, Canakkale, to determine whether or not trace levels of indigenous amino acids could be found. The Canakkale meteor was an observed fall in late July, 1964, near Canakkale, Turkey. The meteorite samples (1.36 and 1.09 g) analyzed in this study were allocated by C. Y. Ornek, along with a soil sample (1.5 g) collected near the Canakkale recovery site.

  12. The Mitochondrial Genomes of Aquila fasciata and Buteo lagopus (Aves, Accipitriformes): Sequence, Structure and Phylogenetic Analyses

    PubMed Central

    Jiang, Lan; Chen, Juan; Wang, Ping; Ren, Qiongqiong; Yuan, Jian; Qian, Chaoju; Hua, Xinghong; Guo, Zhichun; Zhang, Lei; Yang, Jianke; Wang, Ying; Zhang, Qin; Ding, Hengwu; Bi, De; Zhang, Zongmeng; Wang, Qingqing; Chen, Dongsheng; Kan, Xianzhao

    2015-01-01

    The family Accipitridae is one of the largest groups of non-passerine birds, including 68 genera and 243 species globally distributed. In the present study, we determined the complete mitochondrial sequences of two species of accipitrid, namely Aquila fasciata and Buteo lagopus, and conducted a comparative mitogenome analysis across the family. The mitogenome lengths of A. fasciata and B. lagopus are 18,513 and 18,559 bp with an A + T content of 54.2% and 55.0%, respectively. In the mtDNAs of both accipitrid birds, obvious positive AT-skew and negative GC-skew biases were detected for all 12 PCGs encoded by the H strand, whereas the reverse was found in MT-ND6 encoded by the L strand. One extra nucleotide 'C' is present at position 174 of the MT-ND3 gene of A. fasciata, which is not observed in that of B. lagopus. Six conserved sequence boxes in the Domain II, named boxes F, E, D, C, CSBa, and CSBb, respectively, were recognized in the CRs of A. fasciata and B. lagopus. Rates and patterns of mitochondrial gene evolution within Accipitridae were also estimated. The highest dN/dS was detected for the MT-ATP8 gene (0.32493) among Accipitridae, and the lowest for the MT-CO1 gene (0.01415). Mitophylogenetic analysis supported the robust monophyly of Accipitriformes, and Cathartidae was basal to the balance of the order. Moreover, we performed phylogenetic analyses using two other data sets (two mitochondrial loci, and combined nuclear and mitochondrial loci). Our results indicate that the subfamily Aquilinae and all currently polytypic genera of this subfamily are monophyletic. These two novel mtDNA data sets will be useful in refining the phylogenetic relationships and evolutionary processes of Accipitriformes. PMID:26295156
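
    The A + T content and the AT- and GC-skew values reported above follow from simple base counts. The sketch below uses the conventional definitions, AT-skew = (A − T)/(A + T) and GC-skew = (G − C)/(G + C), on the assumption that the study applies them in the usual form; the function names are illustrative only.

```python
def skew(seq, x, y):
    """Generic strand skew (x - y) / (x + y) from base counts in seq."""
    seq = seq.upper()
    nx, ny = seq.count(x), seq.count(y)
    if nx + ny == 0:
        raise ValueError(f"sequence contains neither {x} nor {y}")
    return (nx - ny) / (nx + ny)


def at_skew(seq):
    """Positive when A outnumbers T on the strand analyzed."""
    return skew(seq, "A", "T")


def gc_skew(seq):
    """Negative when C outnumbers G on the strand analyzed."""
    return skew(seq, "G", "C")


def at_content(seq):
    """Fraction of A and T among all bases in seq."""
    seq = seq.upper()
    return (seq.count("A") + seq.count("T")) / len(seq)
```

    Applied gene by gene, a positive `at_skew` together with a negative `gc_skew` corresponds to the pattern the abstract describes for the H-strand protein-coding genes.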

  13. Specific catalysis of asparaginyl deamidation by carboxylic acids: kinetic, thermodynamic, and quantitative structure-property relationship analyses.

    PubMed

    Connolly, Brian D; Tran, Benjamin; Moore, Jamie M R; Sharma, Vikas K; Kosky, Andrew

    2014-04-01

    Asparaginyl (Asn) deamidation could lead to altered potency, safety, and/or pharmacokinetics of therapeutic protein drugs. In this study, we investigated the effects of several different carboxylic acids on Asn deamidation rates using an IgG1 monoclonal antibody (mAb1*) and a model hexapeptide (peptide1) with the sequence YGKNGG. Thermodynamic analyses of the kinetics data revealed that higher deamidation rates are associated with predominantly more negative ΔS and, to a lesser extent, more positive ΔH. The observed differences in deamidation rates were attributed to the unique ability of each type of carboxylic acid to stabilize the energetically unfavorable transition-state conformations required for imide formation. Quantitative structure property relationship (QSPR) analysis using kinetic data demonstrated that molecular descriptors encoding for the geometric spatial distribution of atomic properties on various carboxylic acids are effective determinants for the deamidation reaction. Specifically, the number of O-O and O-H atom pairs on carboxyl and hydroxyl groups with interatomic distances of 4-5 Å on a carboxylic acid buffer appears to determine the rate of deamidation. Collectively, the results from structural and thermodynamic analyses indicate that carboxylic acids presumably form multiple hydrogen bonds and charge-charge interactions with the relevant deamidation site and provide alignment between the reactive atoms on the side chain and backbone. We propose that carboxylic acids catalyze deamidation by stabilizing a specific, energetically unfavorable transition-state conformation of l-asparaginyl intermediate II that readily facilitates bond formation between the γ-carbonyl carbon and the deprotonated backbone nitrogen for cyclic imide formation. PMID:24620787

  14. Natural vs. random protein sequences: Discovering combinatorics properties on amino acid words.

    PubMed

    Santoni, Daniele; Felici, Giovanni; Vergni, Davide

    2016-02-21

    Random mutations and natural selection have driven the evolution of the protein amino acid sequences that we observe at present in nature. The question of which is the dominant force in protein evolution still lacks an unambiguous answer. Random mutations tend to randomize protein sequences while, in order to have the correct functionality, one expects that selection mechanisms impose rigid constraints on amino acid sequences. Moreover, one also has to consider that the space of all possible amino acid sequences is so astonishingly large that it could be reasonable to have a well tuned amino acid sequence indistinguishable from a random one. In order to study the possibility of discriminating between random and natural amino acid sequences, we introduce different measures of association between pairs of amino acids in a sequence, and apply them to a dataset of 1047 natural protein sequences and 10,470 random sequences, carefully generated in order to preserve the relative length and amino acid distribution of the natural proteins. We analyze the multidimensional measures with machine learning techniques and show that, to a reasonable extent, natural protein sequences can be differentiated from random ones. PMID:26656109
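
    The abstract says the random sequences preserve the length and amino acid distribution of the natural proteins but does not describe the generator. One simple procedure with that property is to shuffle each natural sequence, sketched below under that assumption. The count of ten copies per protein is inferred only from the ratio 10,470 / 1047 = 10, and `shuffled_copies` is a hypothetical name.

```python
import random


def shuffled_copies(seq, n=10, rng=None):
    """Return n random permutations of seq.

    Each copy has exactly the same length and residue composition as
    seq; only the order of residues is randomized. Assumption: the
    published study may have used a different sampling scheme.
    """
    rng = rng or random.Random()
    copies = []
    for _ in range(n):
        residues = list(seq)
        rng.shuffle(residues)
        copies.append("".join(residues))
    return copies
```

    Passing a seeded `random.Random(seed)` as `rng` makes the random set reproducible. Because composition is held fixed, any classifier that separates natural from shuffled sequences must rely on residue order rather than residue frequency.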

  15. Multilocus Sequence Typing and Phylogenetic Analyses of Pseudomonas aeruginosa Isolates from the Ocean

    PubMed Central

    Khan, Nurul Huda; Ahsan, Mahbuba; Yoshizawa, Susumu; Hosoya, Shoichi; Yokota, Akira; Kogure, Kazuhiro

    2008-01-01

    Recent isolation of Pseudomonas aeruginosa strains from the open ocean and subsequent pulsed-field gel electrophoresis analyses indicate that these strains have a unique genotype (N. H. Khan, Y. Ishii, N. Kimata-Kino, H. Esaki, T. Nishino, M. Nishimura, and K. Kogure, Microb. Ecol. 53:173-186, 2007). We hypothesized that ocean P. aeruginosa strains have a unique phylogenetic position relative to other strains. The objective of this study was to clarify the intraspecies phylogenetic relationship between marine strains and other strains from various geographical locations. Considering the advantages of using databases, multilocus sequence typing (MLST) was chosen for the typing and discrimination of ocean P. aeruginosa strains. Seven housekeeping genes (acsA, aroE, guaA, mutL, nuoD, ppsA, and trpE) were analyzed, and the results were compared with data on the MLST website. These genes were also used for phylogenetic analysis of P. aeruginosa. Rooted and unrooted phylogenetic trees were generated for each gene locus and the concatenated gene fragments. MLST data showed that all the ocean strains were new. Trees constructed for individual and concatenated genes revealed that ocean P. aeruginosa strains have clusters distinct from those of other P. aeruginosa strains. These clusters roughly reflected the geographical locations of the isolates. These data support our previous findings that P. aeruginosa strains are present in the ocean. It can be concluded that the ocean P. aeruginosa strains have diverged from other isolates and form a distinct cluster based on MLST and phylogenetic analyses of seven housekeeping genes. PMID:18757570
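
    In MLST, each isolate is reduced to a profile of allele numbers at the seven loci, and the profile is looked up in a reference database to obtain a sequence type; a profile with no match is reported as new, as was the case for all ocean strains above. The sketch below illustrates that lookup. The locus names come from the abstract, while the allele numbers, sequence-type numbers, and the in-memory `KNOWN_PROFILES` table are invented placeholders, not data from the MLST website.

```python
# The seven housekeeping loci named in the abstract, in a fixed order.
LOCI = ("acsA", "aroE", "guaA", "mutL", "nuoD", "ppsA", "trpE")

# Hypothetical reference table: allelic profile -> sequence type (ST).
# These numbers are made up for illustration only.
KNOWN_PROFILES = {
    (1, 1, 1, 1, 1, 1, 1): 1,
    (2, 1, 3, 1, 4, 2, 1): 2,
}


def sequence_type(alleles):
    """Return the ST for a dict of locus -> allele number, or None.

    None means the allelic profile is absent from the reference table,
    i.e. the isolate would be reported as a new sequence type.
    """
    profile = tuple(alleles[locus] for locus in LOCI)
    return KNOWN_PROFILES.get(profile)
```

    A real analysis would first assign allele numbers by matching each sequenced gene fragment against the database alleles; this sketch starts from allele numbers that are already assigned.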

  16. Phylogeny of tremellomycetous yeasts and related dimorphic and filamentous basidiomycetes reconstructed from multiple gene sequence analyses

    PubMed Central

    Liu, X.-Z.; Wang, Q.-M.; Theelen, B.; Groenewald, M.; Bai, F.-Y.; Boekhout, T.

    2015-01-01

    The Tremellomycetes (Basidiomycota) contains a large number of unicellular and dimorphic fungi with stable free-living unicellular states in their life cycles. These fungi have been conventionally classified as basidiomycetous yeasts based on physiological and biochemical characteristics. Many currently recognised genera of these yeasts are mainly defined based on phenotypical characters and are highly polyphyletic. Here we reconstructed the phylogeny of the majority of described anamorphic and teleomorphic tremellomycetous yeasts using Bayesian inference, maximum likelihood, and neighbour-joining analyses based on the sequences of seven genes, including three rRNA genes, namely the small subunit of the ribosomal DNA (rDNA), D1/D2 domains of the large subunit rDNA, and the internal transcribed spacer regions (ITS 1 and 2) of rDNA including 5.8S rDNA; and four protein-coding genes, namely the two subunits of the RNA polymerase II (RPB1 and RPB2), the translation elongation factor 1-α (TEF1) and the mitochondrial gene cytochrome b (CYTB). With the consideration of morphological, physiological and chemotaxonomic characters and the congruence of phylogenies inferred from analyses using different algorithms based on different data sets consisting of the combined seven genes, the three rRNA genes, and the individual protein-coding genes, five major lineages corresponding to the orders Cystofilobasidiales, Filobasidiales, Holtermanniales, Tremellales, and Trichosporonales were resolved. A total of 45 strongly supported monophyletic clades with multiple species and 23 single species clades were recognised. This phylogenetic framework will be the basis for the proposal of an updated taxonomic system of tremellomycetous yeasts that will be compatible with the current taxonomic system of filamentous basidiomycetes accommodating the ‘one fungus, one name’ principle. PMID:26955196

  17. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza

    PubMed Central

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  18. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza.

    PubMed

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  19. Associations between Homocysteine, Folic Acid, Vitamin B12 and Alzheimer's Disease: Insights from Meta-Analyses.

    PubMed

    Shen, Liang; Ji, Hong-Fang

    2015-01-01

    The associations between homocysteine (Hcy), folic acid, and vitamin B12 and Alzheimer's disease (AD) have gained much interest, while remaining controversial. We aim to perform meta-analyses to evaluate comprehensively: i) Hcy, folic acid, and vitamin B12 levels in AD patients in comparison with controls; and ii) the association between Hcy, folic acid, and vitamin B12 levels and risk of AD. A literature search was performed using Medline and Scopus databases. A total of 68 studies were identified and included in the meta-analyses. Stata 12.0 statistical software was used to perform the meta-analyses. First, AD patients may have higher levels of Hcy, and lower levels of folate and vitamin B12, in plasma than controls. Further age-subgroup analysis showed no age effect for Hcy levels in plasma between AD patients and matched controls, while the differences in folate and vitamin B12 levels widened with increasing age. Second, the data suggest that high Hcy and low folate levels may correlate with increased risk of AD occurrence. The comprehensive meta-analyses not only confirmed higher Hcy, lower folic acid, and lower vitamin B12 levels in AD patients than in controls, but also indicated that high Hcy and low folic acid levels may be risk factors for AD. Further studies are encouraged to elucidate mechanisms linking these conditions. PMID:25854931

  20. The GeneCards Suite: From Gene Data Mining to Disease Genome Sequence Analyses.

    PubMed

    Stelzer, Gil; Rosen, Naomi; Plaschkes, Inbar; Zimmerman, Shahar; Twik, Michal; Fishilevich, Simon; Stein, Tsippi Iny; Nudel, Ron; Lieder, Iris; Mazor, Yaron; Kaplan, Sergey; Dahary, Dvir; Warshawsky, David; Guan-Golan, Yaron; Kohn, Asher; Rappaport, Noa; Safran, Marilyn; Lancet, Doron

    2016-01-01

    GeneCards, the human gene compendium, enables researchers to effectively navigate and inter-relate the wide universe of human genes, diseases, variants, proteins, cells, and biological pathways. Our recently launched Version 4 has a revamped infrastructure facilitating faster data updates, better-targeted data queries, and friendlier user experience. It also provides a stronger foundation for the GeneCards suite of companion databases and analysis tools. Improved data unification includes gene-disease links via MalaCards and merged biological pathways via PathCards, as well as drug information and proteome expression. VarElect, another suite member, is a phenotype prioritizer for next-generation sequencing, leveraging the GeneCards and MalaCards knowledgebase. It automatically infers direct and indirect scored associations between hundreds or even thousands of variant-containing genes and disease phenotype terms. VarElect's capabilities, either independently or within TGex, our comprehensive variant analysis pipeline, help prepare for the challenge of clinical projects that involve thousands of exome/genome NGS analyses. © 2016 by John Wiley & Sons, Inc. PMID:27322403

  1. Sequence and Expression Analyses of Ethylene Response Factors Highly Expressed in Latex Cells from Hevea brasiliensis

    PubMed Central

    Piyatrakul, Piyanuch; Yang, Meng; Putranto, Riza-Arief; Pirrello, Julien; Dessailly, Florence; Hu, Songnian; Summo, Marilyne; Theeravatanasuk, Kannikar; Leclercq, Julie; Kuswanhadi; Montoro, Pascal

    2014-01-01

    The AP2/ERF superfamily encodes transcription factors that play a key role in plant development and responses to abiotic and biotic stress. In Hevea brasiliensis, ERF genes have been identified by RNA sequencing. This study set out to validate the number of HbERF genes, and identify ERF genes involved in the regulation of latex cell metabolism. A comprehensive Hevea transcriptome was improved using additional RNA reads from reproductive tissues. Newly assembled contigs were annotated in the Gene Ontology database and were assigned to 3 main categories. The AP2/ERF superfamily is the third most represented compared with other transcription factor families. A comparison with genomic scaffolds led to an estimation of 114 AP2/ERF genes and 1 soloist in Hevea brasiliensis. Based on a phylogenetic analysis, functions were predicted for 26 HbERF genes. A relative transcript abundance analysis was performed by real-time RT-PCR in various tissues. Transcripts of ERFs from group I and VIII were very abundant in all tissues while those of group VII were highly accumulated in latex cells. Seven of the thirty-five ERF expression marker genes were highly expressed in latex. Subcellular localization and transactivation analyses suggested that HbERF-VII candidate genes encoded functional transcription factors. PMID:24971876

  2. Whole-Genome Analyses of Korean Native and Holstein Cattle Breeds by Massively Parallel Sequencing

    PubMed Central

    Stothard, Paul; Chung, Won-Hyong; Jeon, Heoyn-Jeong; Miller, Stephen P.; Choi, So-Young; Lee, Jeong-Koo; Yang, Bokyoung; Lee, Kyung-Tai; Han, Kwang-Jin; Kim, Hyeong-Cheol; Jeong, Dongkee; Oh, Jae-Don; Kim, Namshin; Kim, Tae-Hun; Lee, Hak-Kyo; Lee, Sung-Jin

    2014-01-01

    A main goal of cattle genomics is to identify DNA differences that account for variations in economically important traits. In this study, we performed whole-genome analyses of three important cattle breeds in Korea—Hanwoo, Jeju Heugu, and Korean Holstein—using the Illumina HiSeq 2000 sequencing platform. We achieved 25.5-, 29.6-, and 29.5-fold coverage of the Hanwoo, Jeju Heugu, and Korean Holstein genomes, respectively, and identified a total of 10.4 million single nucleotide polymorphisms (SNPs), of which 54.12% were found to be novel. We also detected 1,063,267 insertions–deletions (InDels) across the genomes (78.92% novel). Annotations of the datasets identified a total of 31,503 nonsynonymous SNPs and 859 frameshift InDels that could affect phenotypic variations in traits of interest. Furthermore, genome-wide copy number variation regions (CNVRs) were detected by comparing the Hanwoo, Jeju Heugu, and previously published Chikso genomes against that of Korean Holstein. A total of 992, 284, and 1881 CNVRs, respectively, were detected throughout the genome. Moreover, 53, 65, 45, and 82 putative regions of homozygosity (ROH) were identified in Hanwoo, Jeju Heugu, Chikso, and Korean Holstein respectively. The results of this study provide a valuable foundation for further investigations to dissect the molecular mechanisms underlying variation in economically important traits in cattle and to develop genetic markers for use in cattle breeding. PMID:24992012

  3. Genome sequence and origin analyses of the recombinant novel IBV virulent isolate SAIBK2.

    PubMed

    Wu, Xuan; Yang, Xin; Xu, Pengwei; Zhou, Long; Zhang, Zhikun; Wang, Hongning

    2016-08-01

    Recombination between infectious bronchitis viruses (IBVs), together with point mutations, insertions, and deletions, is thought to be responsible for the emergence of new IBV variants. SAIBK2 is a nephropathogenic strain isolated from layer flocks vaccinated with live attenuated H120 vaccine in Sichuan province, China in 2011. SAIBK2 causes severe kidney lesions and results in 50% mortality in 30-day-old specific-pathogen-free chickens (with a dose of 10^5 EID50/0.1 mL SAIBK2 per chicken). The complete genome of SAIBK2 consists of 27,669 nucleotides, excluding the poly-A tail at the 3' end. SAIBK2 has the highest identity to YX10 in terms of complete genome. Phylogenetic analysis of complete sequence showed that SAIBK2 belongs to the most dominant genotype in China. Comparison and recombination analyses with other IBV strains revealed that SAIBK2 may originate from recombination events among a YX10-, a YN-, and a Mass-like strain. Furthermore, whole gene 5 and parts of nsp 3, nsp 4, nsp 16, and N genes are involved in the recombination events, and the uptake of these regions from YN and Mass strains by SAIBK2 may increase its replication efficiency and be responsible for its increased virulence in specific-pathogen-free chickens. PMID:27108998

  4. Genomic Resources for Water Yam (Dioscorea alata L.): Analyses of EST-Sequences, De Novo Sequencing and GBS Libraries.

    PubMed

    Saski, Christopher A; Bhattacharjee, Ranjana; Scheffler, Brian E; Asiedu, Robert

    2015-01-01

    The falling cost of and rapid progress in next-generation sequencing techniques, coupled with high-performance computational approaches, have resulted in large-scale discovery of advanced genomic resources in several model and non-model plant species. Yam (Dioscorea spp.) is a major food and cash crop in many countries, but research efforts to understand its genetics and generate genomic information for the crop have been limited. The availability of a large number of genomic resources, including genome-wide molecular markers, will accelerate breeding efforts and the application of genomic selection in yams. In the present study, several methods, including expressed sequence tag (EST) sequencing, de novo sequencing, and genotyping-by-sequencing (GBS) profiling, were applied to two yam (Dioscorea alata L.) genotypes (TDa 95/00328 and TDa 95-310) to generate genomic resources for use in its improvement programs. These include a comprehensive set of EST-SSRs, genomic SSRs, whole genome SNPs, and reduced representation SNPs. A total of 1,152 EST-SSRs were developed from >40,000 EST-sequences generated from the two genotypes. A set of 388 EST-SSRs was validated as polymorphic, showing a polymorphism rate of 34% when tested on two diverse parents targeted for anthracnose disease. In addition, approximately 40X de novo whole genome sequence coverage was generated for each of the two genotypes, and a total of 18,584 and 15,952 genomic SSRs were identified for TDa 95/00328 and TDa 95-310, respectively. A custom-made pipeline resulted in the selection of 573 genomic SSRs common across the two genotypes, of which only eight failed, with 478 being polymorphic and 62 monomorphic, indicating a polymorphic rate of 83.5%. Additionally, 288,505 high quality SNPs were identified between these two genotypes. Genotyping-by-sequencing reads on these two genotypes also revealed 36,790 overlapping SNP positions that are distributed throughout the genome. Our efforts in using different approaches

  5. Genomic Resources for Water Yam (Dioscorea alata L.): Analyses of EST-Sequences, De Novo Sequencing and GBS Libraries

    PubMed Central

    Saski, Christopher A.; Bhattacharjee, Ranjana; Scheffler, Brian E.; Asiedu, Robert

    2015-01-01

    The falling cost of and rapid progress in next-generation sequencing techniques, coupled with high-performance computational approaches, have resulted in large-scale discovery of advanced genomic resources in several model and non-model plant species. Yam (Dioscorea spp.) is a major food and cash crop in many countries, but research efforts to understand its genetics and generate genomic information for the crop have been limited. The availability of a large number of genomic resources, including genome-wide molecular markers, will accelerate breeding efforts and the application of genomic selection in yams. In the present study, several methods, including expressed sequence tag (EST) sequencing, de novo sequencing, and genotyping-by-sequencing (GBS) profiling, were applied to two yam (Dioscorea alata L.) genotypes (TDa 95/00328 and TDa 95-310) to generate genomic resources for use in its improvement programs. These include a comprehensive set of EST-SSRs, genomic SSRs, whole genome SNPs, and reduced representation SNPs. A total of 1,152 EST-SSRs were developed from >40,000 EST-sequences generated from the two genotypes. A set of 388 EST-SSRs was validated as polymorphic, showing a polymorphism rate of 34% when tested on two diverse parents targeted for anthracnose disease. In addition, approximately 40X de novo whole genome sequence coverage was generated for each of the two genotypes, and a total of 18,584 and 15,952 genomic SSRs were identified for TDa 95/00328 and TDa 95-310, respectively. A custom-made pipeline resulted in the selection of 573 genomic SSRs common across the two genotypes, of which only eight failed, with 478 being polymorphic and 62 monomorphic, indicating a polymorphic rate of 83.5%. Additionally, 288,505 high quality SNPs were identified between these two genotypes. Genotyping-by-sequencing reads on these two genotypes also revealed 36,790 overlapping SNP positions that are distributed throughout the genome. Our efforts in using different approaches

  6. Studies on the high-sulphur proteins of reduced Merino wool. Amino acid sequence of protein SCMKB-IIIB4

    PubMed Central

    Swart, L. S.; Haylett, T.

    1971-01-01

    The complete amino acid sequence of protein SCMKB-IIIB4 is presented. It is closely related to the sequence of protein SCMKB-IIIB3 (Haylett, Swart & Parris, 1971) differing in only four positions. The peptic and thermolysin peptides of protein SCMKB-IIIB4 were analysed by the dansyl–Edman method (Gray, 1967) and by tritium-labelling of C-terminal residues (Matsuo, Fujimoto & Tatsuno, 1966). This protein is the third member of a group of high-sulphur wool proteins with molecular weight of about 11400. It consists of 98 residues and has acetylalanine and carboxymethylcysteine as N- and C-terminal residues respectively. PMID:4942536

  7. Genomic resources for water yam (Dioscorea alata L.): analyses of EST-Sequences, De Novo sequencing and GBS libraries

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The reducing cost and rapid progress in next-generation sequencing techniques coupled with high performance computational approaches have resulted in large-scale discovery of advanced genomic resources such as SSRs, SNPs and InDels in several model and non-model plant species. Yam (Dioscorea spp.) i...

  8. Multiple Amino Acid Sequence Alignment Nitrogenase Component 1: Insights into Phylogenetics and Structure-Function Relationships

    PubMed Central

    Howard, James B.; Kechris, Katerina J.; Rees, Douglas C.; Glazer, Alexander N.

    2013-01-01

    Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf, and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as “core” for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf) yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification provides the bases
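
    The idea of sorting alignment positions by how much they vary can be illustrated with a minimal Python sketch. It is not the authors' procedure: the function name, the toy four-sequence alignment, and the working definition of "single variant" (exactly two distinct residues in a column) are assumptions made for illustration only, and gaps are not handled.

```python
def classify_columns(alignment):
    """Label each column of an alignment by its residue variability.

    Assumed definitions (not taken from the abstract):
      'invariant'      -> one distinct residue in the column
      'single variant' -> exactly two distinct residues
      'variable'       -> three or more distinct residues
    """
    if not alignment:
        raise ValueError("alignment must contain at least one sequence")
    if len({len(seq) for seq in alignment}) != 1:
        raise ValueError("all aligned sequences must have the same length")

    labels = []
    for column in zip(*alignment):  # iterate over alignment positions
        distinct = len(set(column))
        if distinct == 1:
            labels.append("invariant")
        elif distinct == 2:
            labels.append("single variant")
        else:
            labels.append("variable")
    return labels


# Hypothetical toy alignment; real input would be the co-aligned subunit sequences.
toy_alignment = ["MKCA", "MKSA", "MRCA", "MKTA"]
labels = classify_columns(toy_alignment)
```

    For the toy alignment, the first and last columns are invariant, the second contains two residues, and the third contains three.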

  9. Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2000-01-01

    A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.

  10. DNA Sequence Analyses Reveal Abundant Diversity, Endemism and Evidence for Asian Origin of the Porcini Mushrooms

    PubMed Central

    Feng, Bang; Xu, Jianping; Wu, Gang; Zeng, Nian-Kai; Li, Yan-Chun; Tolgor, Bau; Kost, Gerhard W.; Yang, Zhu L.

    2012-01-01

    The wild gourmet mushroom Boletus edulis and its close allies are of significant ecological and economic importance. They are found throughout the Northern Hemisphere, but despite their ubiquity there are still many unresolved issues with regard to the taxonomy, systematics and biogeography of this group of mushrooms. Most phylogenetic studies of Boletus so far have characterized samples from North America and Europe and little information is available on samples from other areas, including the ecologically and geographically diverse regions of China. Here we analyzed DNA sequence variation in three gene markers from samples of these mushrooms from across China and compared our findings with those from other representative regions. Our results revealed fifteen novel phylogenetic species (about one-third of the known species) and a newly identified lineage represented by Boletus sp. HKAS71346 from tropical Asia. The phylogenetic analyses support eastern Asia as the center of diversity for the porcini sensu stricto clade. Within this clade, B. edulis is the only known holarctic species. The majority of the other phylogenetic species are geographically restricted in their distributions. Furthermore, molecular dating and geological evidence suggest that this group of mushrooms originated during the Eocene in eastern Asia, followed by dispersal to and subsequent speciation in other parts of Asia, Europe, and the Americas from the middle Miocene through the early Pliocene. In contrast to the ancient dispersal of porcini in the strict sense in the Northern Hemisphere, the occurrence of B. reticulatus and B. edulis sensu lato in the Southern Hemisphere was probably due to recent human-mediated introductions. PMID:22629418

  11. Comparative metagenomic and metatranscriptomic analyses of microbial communities in acid mine drainage.

    PubMed

    Chen, Lin-xing; Hu, Min; Huang, Li-nan; Hua, Zheng-shuang; Kuang, Jia-liang; Li, Sheng-jin; Shu, Wen-sheng

    2015-07-01

    The microbial communities in acid mine drainage have been extensively studied to reveal their roles in acid generation and adaption to this environment. Lacking, however, are integrated community- and organism-wide comparative gene transcriptional analyses that could reveal the response and adaptation mechanisms of these extraordinary microorganisms to different environmental conditions. In this study, comparative metagenomics and metatranscriptomics were performed on microbial assemblages collected from four geochemically distinct acid mine drainage (AMD) sites. Taxonomic analysis uncovered unexpectedly high microbial biodiversity of these extremely acidophilic communities, and the abundant taxa of Acidithiobacillus, Leptospirillum and Acidiphilium exhibited high transcriptional activities. Community-wide comparative analyses clearly showed that the AMD microorganisms adapted to the different environmental conditions via regulating the expression of genes involved in multiple in situ functional activities, including low-pH adaptation, carbon, nitrogen and phosphate assimilation, energy generation, environmental stress resistance, and other functions. Organism-wide comparative analyses of the active taxa revealed environment-dependent gene transcriptional profiles, especially the distinct strategies used by Acidithiobacillus ferrivorans and Leptospirillum ferrodiazotrophum in nutrients assimilation and energy generation for survival under different conditions. Overall, these findings demonstrate that the gene transcriptional profiles of AMD microorganisms are closely related to the site physiochemical characteristics, providing clues into the microbial response and adaptation mechanisms in the oligotrophic, extremely acidic environments. PMID:25535937

  12. Comparative metagenomic and metatranscriptomic analyses of microbial communities in acid mine drainage

    PubMed Central

    Chen, Lin-xing; Hu, Min; Huang, Li-nan; Hua, Zheng-shuang; Kuang, Jia-liang; Li, Sheng-jin; Shu, Wen-sheng

    2015-01-01

    The microbial communities in acid mine drainage have been extensively studied to reveal their roles in acid generation and adaption to this environment. Lacking, however, are integrated community- and organism-wide comparative gene transcriptional analyses that could reveal the response and adaptation mechanisms of these extraordinary microorganisms to different environmental conditions. In this study, comparative metagenomics and metatranscriptomics were performed on microbial assemblages collected from four geochemically distinct acid mine drainage (AMD) sites. Taxonomic analysis uncovered unexpectedly high microbial biodiversity of these extremely acidophilic communities, and the abundant taxa of Acidithiobacillus, Leptospirillum and Acidiphilium exhibited high transcriptional activities. Community-wide comparative analyses clearly showed that the AMD microorganisms adapted to the different environmental conditions via regulating the expression of genes involved in multiple in situ functional activities, including low-pH adaptation, carbon, nitrogen and phosphate assimilation, energy generation, environmental stress resistance, and other functions. Organism-wide comparative analyses of the active taxa revealed environment-dependent gene transcriptional profiles, especially the distinct strategies used by Acidithiobacillus ferrivorans and Leptospirillum ferrodiazotrophum in nutrients assimilation and energy generation for survival under different conditions. Overall, these findings demonstrate that the gene transcriptional profiles of AMD microorganisms are closely related to the site physiochemical characteristics, providing clues into the microbial response and adaptation mechanisms in the oligotrophic, extremely acidic environments. PMID:25535937

  13. Partial amino acid sequence of human factor D: homology with serine proteases.

    PubMed Central

    Volanakis, J E; Bhown, A; Bennett, J C; Mole, J E

    1980-01-01

    Human factor D purified to homogeneity by a modified procedure was subjected to NH2-terminal amino acid sequence analysis by using a modified automated Beckman sequencer. We identified 48 of the first 57 NH2-terminal amino acids in a single sequencer run, using microgram quantities of factor D. The deduced amino acid sequence represents approximately 25% of the primary structure of factor D. This extended NH2-terminal amino acid sequence of factor D was compared to that of other trypsin-related serine proteases. By visual inspection, strong homologies (33-50% identity) were observed with all the serine proteases included in the comparison. Interestingly, factor D showed a higher degree of homology to serine proteases of pancreatic origin than to those of serum origin. PMID:6987665

  14. Technical note: improved methodology for analyses of acid detergent fiber and acid detergent lignin.

    PubMed

    Raffrenato, E; Van Amburgh, M E

    2011-07-01

    The objective of this study was to evaluate the methodology of the acid detergent lignin (ADL) assay in an effort to evaluate particle loss, improve repeatability, and decrease variation within and among samples. The original ADL method relied on asbestos as a filtering aid, but that was removed in 1989 with the mandate from the Environmental Protection Agency to eliminate asbestos in the environment. Furthermore, recent work on fiber methodology indicated that pore size in the Gooch sintered glass crucible (40-60 μm) was too large to trap all of the small particles associated with neutral detergent fiber (NDF) and acid detergent fiber (ADF). Thus, any loss of ADF could potentially result in a loss of ADL. Sixty forages including conventional and brown midrib corn silages, alfalfa silages and hays, mature grasses, early vegetative grasses, and 9 feces samples, were analyzed sequentially for ADF and ADL as outlined in the 1973 procedure of Van Soest except for the use of the asbestos fiber. A glass microfiber filter with a 1.5-μm pore size was chosen as a filtering aid because it met the criteria required by the assay: glass, heat resistant, acid resistant, chemically inert, and hydrophobic. To compare with the current ADF and ADL assays, the assays were conducted with either no filter or the glass filter inserted into crucibles, rinsed with acetone, and then according to the 1973 procedure of Van Soest. The samples analyzed covered a range from 18.11 to 55.79% ADF and from 0.96 to 9.94% ADL on a dry matter (DM) basis. With the use of the filter, the mean ADF values increased 4.2% and mean ADL values increased 18.9%. Overall, both ADF and ADL values were greater with the use of the glass microfiber filter than without, indicating that as the type of sample analyzed changed, use of the Gooch crucible without the filtering aid results in particle loss. The adoption of the use of a small pore size (1.5 μm) glass microfiber filter to improve filtration and recovery

  15. FIGG: Simulating populations of whole genome sequences for heterogeneous data analyses

    PubMed Central

    2014-01-01

    Background High-throughput sequencing has become one of the primary tools for investigation of the molecular basis of disease. The increasing use of sequencing in investigations that aim to understand both individuals and populations is challenging our ability to develop analysis tools that scale with the data. This issue is of particular concern in studies that exhibit a wide degree of heterogeneity or deviation from the standard reference genome. The advent of population scale sequencing studies requires analysis tools that are developed and tested against matching quantities of heterogeneous data. Results We developed a large-scale whole genome simulation tool, FIGG, which generates large numbers of whole genomes with known sequence characteristics based on direct sampling of experimentally known or theorized variations. For normal variations we used publicly available data to determine the frequency of different mutation classes across the genome. FIGG then uses this information as a background to generate new sequences from a parent sequence with matching frequencies, but different actual mutations. The background can be normal variations, known disease variations, or a theoretical frequency distribution of variations. Conclusion In order to enable the creation of large numbers of genomes, FIGG generates simulated sequences from known genomic variation and iteratively mutates each genome separately. The result is multiple whole genome sequences with unique variations that can primarily be used to provide different reference genomes, model heterogeneous populations, and can offer a standard test environment for new analysis algorithms or bioinformatics tools. PMID:24885193
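
    The general idea of generating many sequences from one parent at a fixed background mutation frequency can be sketched in Python as follows. This is only an illustration and not FIGG's implementation: it models a single mutation class (single-base substitutions), and the function names, the `snv_rate` parameter, and the toy parent sequence are all hypothetical.

```python
import random

BASES = "ACGT"


def mutate(parent: str, snv_rate: float, rng: random.Random) -> str:
    """Return a copy of parent with each base substituted with probability snv_rate."""
    child = []
    for base in parent:
        if rng.random() < snv_rate:
            # Substitute with a different base so every mutation changes the sequence.
            child.append(rng.choice([b for b in BASES if b != base]))
        else:
            child.append(base)
    return "".join(child)


def simulate_population(parent: str, snv_rate: float, n: int, seed: int = 0) -> list:
    """Generate n independently mutated sequences from the same parent."""
    rng = random.Random(seed)  # seeded so a run can be reproduced
    return [mutate(parent, snv_rate, rng) for _ in range(n)]


# Hypothetical toy parent; a real run would start from a reference genome.
parent = "ACGTACGTACGTACGTACGT"
population = simulate_population(parent, snv_rate=0.1, n=5)
```

    Each simulated sequence shares the same expected substitution frequency but carries its own set of mutations, which is the property the abstract describes for the generated genomes.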

  16. Amino acid sequence of Japanese quail (Coturnix japonica) and northern bobwhite (Colinus virginianus) myoglobin.

    PubMed

    Goodson, John; Beckstead, Robert B; Payne, Jason; Singh, Rakesh K; Mohan, Anand

    2015-08-15

    Myoglobin has an important physiological role in vertebrates, and as the primary sarcoplasmic pigment in meat, influences quality perception and consumer acceptability. In this study, the amino acid sequences of Japanese quail and northern bobwhite myoglobin were deduced by cDNA cloning of the coding sequence from mRNA. Japanese quail myoglobin was isolated from quail cardiac muscles, purified using ammonium sulphate precipitation and gel-filtration, and subjected to multiple enzymatic digestions. Mass spectrometry corroborated the deduced protein amino acid sequence at the protein level. Sequence analysis revealed both species' myoglobin structures consist of 153 amino acids, differing at only three positions. When compared with chicken myoglobin, Japanese quail showed 98% sequence identity, and northern bobwhite 97% sequence identity. The myoglobin in both quail species contained eight histidine residues instead of the nine present in chicken and turkey. PMID:25794748
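
    The percent sequence identity quoted above can be computed with a few lines of Python. The sketch below assumes two already aligned, equal-length sequences; the function name and the placeholder sequences are hypothetical and are not the real myoglobin sequences. They only reproduce the situation described in the abstract, 153 residues differing at three positions.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of aligned positions at which two equal-length sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must be non-empty")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


# Placeholder 153-residue sequences that differ at exactly three positions.
seq_one = "A" * 153
seq_two = "A" * 50 + "G" + "A" * 50 + "S" + "A" * 50 + "T"

identity = percent_identity(seq_one, seq_two)
```

    With three mismatches in 153 positions, the identity is 150/153, or roughly 98%.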

  17. Chances and pitfalls of leaf wax biomarker analyses applied to fluvial sediment sequences - the example of a Holocene fluvial sediment-paleosol sequence from the upper Alazani River, eastern Georgia

    NASA Astrophysics Data System (ADS)

    von Suchodoletz, Hans; Bliedtner, Marcel; Zielhofer, Christoph; Faust, Dominik; Zech, Roland

    2016-04-01

    During the last decades, fluvial sediment sequences in many regions have been intensively studied to reconstruct Late Quaternary palaeoenvironmental and palaeohydrological conditions. However, analyses of leaf wax biomarkers, which are increasingly used to reconstruct paleoenvironmental and paleoclimatic conditions (e.g., from lake sediments or loess-paleosol sequences), have not yet been systematically applied to Late Quaternary fluvial sediments. Given the ubiquitous distribution of fluvial sediment sequences on the earth's surface, such investigations could strongly enhance the knowledge about former environmental conditions in many regions. For this conceptual study we analysed, as an example, leaf wax biomarkers (long-chain n-alkanes, n-alkanoic acids) in a fluvial sediment-palaeosol sequence from the upper Alazani River in eastern Georgia to discuss general possibilities and pitfalls. Generally, biomarker records from fluvial archives can be divided into i) a catchment signal recorded in the fluvial sediment layers and ii) a local in-situ signal recorded in the intercalated paleosols. This offers the great chance to reconstruct paleoenvironmental conditions both in the whole catchment and at the sampling site. However, potential pitfalls are, for example, that inherited catchment signals can bias the in-situ signal from paleosols, while intermediate sediment storage in the catchment prior to sediment deposition and postsedimentary processes may alter the original catchment signal in the fluvial sediment layers. Thus, when applying leaf wax biomarker analyses to fluvial sediment sequences one has to be careful: the interpretation of the biomarker record strongly depends on the specific geomorphological and sedimentological conditions of the investigated site and of the catchment area.

  18. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.

  19. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-03-24

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.

  20. tax and rex Sequences of bovine leukaemia virus from globally diverse isolates: rex amino acid sequence more variable than tax.

    PubMed

    McGirr, K M; Buehring, G C

    2005-02-01

    Bovine leukaemia virus (BLV) is an important agricultural problem with high costs to the dairy industry. Here, we examine the variation of the tax and rex genes of BLV. The tax and rex genes share 420 bases and have overlapping reading frames. The tax gene encodes a protein that functions as a transactivator of the BLV promoter, is required for viral replication, acts on cellular promoters, and is responsible for oncogenesis. The Rex protein facilitates the export of viral mRNAs from the nucleus and regulates transcription. We have sequenced five new isolates of the tax/rex gene. We examined the five new and three previously published tax/rex DNA and predicted amino acid sequences of BLV isolates from cattle in representative regions worldwide. The highest variation among nucleic acid sequences for tax and rex was 7% and 5%, respectively; among predicted amino acid sequences for Tax and Rex, 9% and 11%, respectively. Significantly more nucleotide changes resulted in predicted amino acid changes in the rex gene than in the tax gene (P ≤ 0.0006). This variability is higher than previously reported for any region of the viral genome. This research may also have implications for the development of Tax-based vaccines. PMID:15702995

  1. Automation of Molecular-Based Analyses: A Primer on Massively Parallel Sequencing

    PubMed Central

    Nguyen, Lan; Burnett, Leslie

    2014-01-01

    Recent advances in genetics have been enabled by new genetic sequencing techniques called massively parallel sequencing (MPS) or next-generation sequencing. Through the ability to sequence in parallel hundreds of thousands to millions of DNA fragments, the cost and time required for sequencing has dramatically decreased. There are a number of different MPS platforms currently available and being used in Australia. Although they differ in the underlying technology involved, their overall processes are very similar: DNA fragmentation, adaptor ligation, immobilisation, amplification, sequencing reaction and data analysis. MPS is being used in research, translational and increasingly now also in clinical settings. Common applications include sequencing of whole genomes, whole exomes or targeted genes for disease-causing gene discovery, genetic diagnosis and targeted cancer therapy. Even though the revolution that is occurring with MPS is exciting due to its increasing use, improving and emerging technologies and new applications, significant challenges still exist. Particularly challenging issues are the bioinformatics required for data analysis, interpretation of results and the ethical dilemma of ‘incidental findings’. PMID:25336762

  2. The amino acid sequence of protein CM-3 from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J

    1985-01-01

    Protein CM-3 from Dendroaspis polylepis polylepis venom was purified by gel filtration and ion exchange chromatography. It comprises 65 amino acids including eight half-cystines. The complete amino acid sequence of protein CM-3 has been elucidated. The sequence (residues 1-50) resembles that of the N-terminal sequence of the subunits of a synergistic type protein and residues 51-65 that of the C-terminal sequence of an angusticeps type protein. Mixtures of protein CM-3 and angusticeps type proteins showed no apparent synergistic effect, in that their toxicity in combination was no greater than the sum of their individual toxicities. PMID:4029488

  3. Differentiation of Pseudomonas aeruginosa pili based on sequence and B-cell epitope analyses.

    PubMed Central

    Castric, P A; Deal, C D

    1994-01-01

    The nucleotide sequences of three previously undescribed Pseudomonas aeruginosa pilin structural genes are presented. Comparisons of deduced pilin primary structure and flanking DNA sequence allowed placement of these and six previously published sequences into one of two groups. Epitope mapping, using overlapping immobilized peptides representing the pilin primary structure, with antipilin monoclonal antibodies revealed several B-cell determinants grouped near the carboxyl terminus of P. aeruginosa 1244 pilin. One determinant was found to reside near the pilin constant region. These determinants were found associated with the pili of 31 of 95 P. aeruginosa clinical isolates. PMID:7507890

  4. Isotopic and molecular analyses of hydrocarbons and monocarboxylic acids of the Murchison meteorite

    NASA Technical Reports Server (NTRS)

    Krishnamurthy, R. V.; Epstein, S.; Cronin, John R.; Pizzarello, Sandra; Yuen, George U.

    1992-01-01

    The monocarboxylic acids and hydrocarbons of the Murchison meteorite (CM2) were isolated for isotopic analysis. The nonvolatile hydrocarbons were analyzed as crude methanol and benzene-methanol extracts and also after separation by silica gel chromatography into predominantly aliphatic, aromatic, and polar hydrocarbon fractions. The volatile hydrocarbons were obtained after progressive decomposition of the meteorite matrix by freeze-thaw, hot water, and acid treatment. Molecular analyses of the aromatic hydrocarbons showed them to comprise a complex suite of compounds in which pyrene, fluoranthene, phenanthrene, and acenaphthene were the most abundant components, a result similar to earlier analyses. The polar hydrocarbons also comprise a very complex mixture in which aromatic ketones, nitrogen, and sulfur heterocycles were identified. The monocarboxylic acids, aliphatic, aromatic, and polar hydrocarbons, and the indigenous volatile hydrocarbons were found to be D-rich. The deuterium enrichment observed in these compounds is suggestive. In two separate analyses, the delta-D values of the nonvolatile hydrocarbons were observed to increase in the following order: aliphatic-aromatic-polar. This finding is consistent with an early solar system or parent body conversion of aromatic to aliphatic compounds as well as the suggestion of pyrolytic formation of aromatic from aliphatic compounds.

  5. Factorial Moments Analyses Show a Characteristic Length Scale in DNA Sequences

    NASA Astrophysics Data System (ADS)

    Mohanty, A. K.; Narayana Rao, A. V. S. S.

    2000-02-01

    A unique feature of most of the DNA sequences, found through the factorial moments analysis, is the existence of a characteristic length scale around which the density distribution is nearly Poissonian. Above this point, the DNA sequences, irrespective of their intron contents, show long range correlations with a significant deviation from the Gaussian statistics, while, below this point, the DNA statistics are essentially Gaussian. The famous DNA walk representation is also shown to be a special case of the present analysis.
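
    As a rough illustration of the kind of quantity involved, the sketch below computes a normalized factorial moment from base counts in fixed-size windows. The abstract does not give its formula, so the usual definition F_q = <n(n-1)...(n-q+1)> / <n>^q is an assumption here, as are the helper names `window_counts` and `factorial_moment` and the choice of counting a single base. For Poisson-distributed counts this quantity is expected to be close to 1.

```python
from math import prod


def window_counts(seq, base, window):
    """Count occurrences of `base` in consecutive non-overlapping windows.

    Trailing bases that do not fill a whole window are ignored.
    """
    seq = seq.upper()
    n_windows = len(seq) // window
    return [seq[i * window:(i + 1) * window].count(base) for i in range(n_windows)]


def factorial_moment(counts, q):
    """Normalized factorial moment F_q = <n(n-1)...(n-q+1)> / <n>**q (assumed definition)."""
    if not counts:
        raise ValueError("no windows to average over")
    mean = sum(counts) / len(counts)
    if mean == 0:
        raise ValueError("mean count is zero; F_q is undefined")
    # Falling factorial n(n-1)...(n-q+1) for each window count.
    falling = [prod(n - j for j in range(q)) for n in counts]
    return (sum(falling) / len(falling)) / mean ** q


if __name__ == "__main__":
    counts = window_counts("AACCAAAAGGTTAACA", "A", 4)
    print(counts, factorial_moment(counts, 2))
```

    Repeating the calculation over a range of window sizes is one way to look for a scale at which F_q approaches the Poisson value; the window sizes to scan are left to the reader.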

  6. Complete Plastid Genome Sequence of the Basal Asterid Ardisia polysticta Miq. and Comparative Analyses of Asterid Plastid Genomes

    PubMed Central

    Ku, Chuan; Hu, Jer-Ming; Kuo, Chih-Horng

    2013-01-01

    Ardisia is a basal asterid genus well known for its medicinal values and has the potential for development of novel phytopharmaceuticals. In this genus of nearly 500 species, many ornamental species are commonly grown worldwide and some have become invasive species that caused ecological problems. As there is no completed plastid genome (plastome) sequence in related taxa, we sequenced and characterized the plastome of Ardisia polysticta to find plastid markers of potential utility for phylogenetic analyses at low taxonomic levels. The complete A. polysticta plastome is 156,506 bp in length and has gene content and organization typical of most asterids and other angiosperms. We identified seven intergenic regions as potentially informative markers with resolution for interspecific relationships. Additionally, we characterized the diversity of asterid plastomes with respect to GC content, plastome organization, gene content, and repetitive sequences through comparative analyses. The results demonstrated that the genome organizations near the boundaries between inverted repeats (IRs) and single-copy regions (SCs) are polymorphic. The boundary organization found in Ardisia appears to be the most common type among asterids, while six other types are also found in various asterid lineages. In general, the repetitive sequences in genic regions tend to be more conserved, whereas those in noncoding regions are usually lineage-specific. Finally, we inferred the whole-plastome phylogeny with the available asterid sequences. With the improvement in taxon sampling of asterid orders and families, our result highlights the uncertainty of the position of Gentianales within euasterids I. PMID:23638113

  7. The Chinese hamster Alu-equivalent sequence: a conserved highly repetitious, interspersed deoxyribonucleic acid sequence in mammals has a structure suggestive of a transposable element.

    PubMed Central

    Haynes, S R; Toomey, T P; Leinwand, L; Jelinek, W R

    1981-01-01

    A consensus sequence has been determined for a major interspersed deoxyribonucleic acid repeat in the genome of Chinese hamster ovary cells (CHO cells). This sequence is extensively homologous to (i) the human Alu sequence (P. L. Deininger et al., J. Mol. Biol., in press), (ii) the mouse B1 interspersed repetitious sequence (Krayev et al., Nucleic Acids Res. 8:1201-1215, 1980) (iii) an interspersed repetitious sequence from African green monkey deoxyribonucleic acid (Dhruva et al., Proc. Natl. Acad. Sci. U.S.A. 77:4514-4518, 1980) and (iv) the CHO and mouse 4.5S ribonucleic acid (this report; F. Harada and N. Kato, Nucleic Acids Res. 8:1273-1285, 1980). Because the CHO consensus sequence shows significant homology to the human Alu sequence it is termed the CHO Alu-equivalent sequence. A conserved structure surrounding CHO Alu-equivalent family members can be recognized. It is similar to that surrounding the human Alu and the mouse B1 sequences, and is represented as follows: direct repeat-CHO-Alu-A-rich sequence-direct repeat. A composite interspersed repetitious sequence has been identified. Its structure is represented as follows: direct repeat-residue 47 to 107 of CHO-Alu-non-Alu repetitious sequence-A-rich sequence-direct repeat. Because the Alu flanking sequences resemble those that flank known transposable elements, we think it likely that the Alu sequence dispersed throughout the mammalian genome by transposition. PMID:9279371

  8. MannDB – A microbial database of automated protein sequence analyses and evidence integration for protein characterization

    PubMed Central

    Zhou, Carol L Ecale; Lam, Marisa W; Smith, Jason R; Zemla, Adam T; Dyer, Matthew D; Kuczmarski, Thomas A; Vitalis, Elizabeth A; Slezak, Thomas R

    2006-01-01

    Background MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data. Description MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-source tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO. Conclusion MannDB comprises a large number of genomes and comprehensive protein sequence analyses

  9. Computer Simulation of the Determination of Amino Acid Sequences in Polypeptides

    ERIC Educational Resources Information Center

    Daubert, Stephen D.; Sontum, Stephen F.

    1977-01-01

    Describes a computer program that generates a random string of amino acids and guides the student in determining the correct sequence of a given protein by using experimental analytic data for that protein. (MLH)
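
    A minimal sketch of what such a teaching simulation might look like is given below. It is not the program described in the record: the function names, the uniform choice among the 20 standard residues, and the use of a simulated tryptic digest as the "experimental" data are all assumptions made for illustration. The cleavage rule used is the commonly cited one (cut after Lys or Arg unless the next residue is Pro).

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard residues


def random_peptide(length, seed=None):
    """Return a random peptide; passing a seed makes the exercise reproducible."""
    rng = random.Random(seed)
    return "".join(rng.choice(AMINO_ACIDS) for _ in range(length))


def tryptic_fragments(peptide):
    """Split after K or R, except when the next residue is P (simplified trypsin rule)."""
    if not peptide:
        return []
    fragments, start = [], 0
    for i, residue in enumerate(peptide[:-1]):  # never cut after the last residue
        if residue in "KR" and peptide[i + 1] != "P":
            fragments.append(peptide[start:i + 1])
            start = i + 1
    fragments.append(peptide[start:])
    return fragments


if __name__ == "__main__":
    unknown = random_peptide(30, seed=7)
    # A student would see only the shuffled fragments, not `unknown` itself.
    fragments = tryptic_fragments(unknown)
    random.Random(0).shuffle(fragments)
    print(fragments)
```

    A second digest with a different (hypothetical) cleavage rule would give overlapping fragments from which the original order can be deduced.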

  10. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses

    PubMed Central

    Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying

    2016-01-01

    Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. Here we sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology and a combination of de novo and reference-guided assembly. Combined with the previously reported cp genome sequence of E. koreanum, this is the first comprehensive cp genome analysis of Epimedium. The five Epimedium cp genomes exhibited a typical quadripartite, circular structure that was rather conserved in genomic structure and synteny of gene order. However, these cp genomes showed obvious variation at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes but was not found in the other basal eudicotyledons. Rapidly evolving cp genome regions were detected among the five cp genomes, and differences in simple sequence repeats (SSRs) and repeat sequences were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes were, on the whole, in accordance with the updated system of the genus, but indicated that the evolutionary relationships and the divisions of the genus need further investigation with more evidence. The availability of these cp genomes provides valuable genetic information for accurate species identification, taxonomy, phylogenetic resolution and the study of Epimedium evolution, and will assist in the exploration and utilization of Epimedium plants. PMID:27014326

  11. Choice of Reference Sequence and Assembler for Alignment of Listeria monocytogenes Short-Read Sequence Data Greatly Influences Rates of Error in SNP Analyses

    PubMed Central

    Pightling, Arthur W.; Petronella, Nicholas; Pagotto, Franco

    2014-01-01

    The wide availability of whole-genome sequencing (WGS) and an abundance of open-source software have made detection of single-nucleotide polymorphisms (SNPs) in bacterial genomes an increasingly accessible and effective tool for comparative analyses. Thus, ensuring that real nucleotide differences between genomes (i.e., true SNPs) are detected at high rates and that the influences of errors (such as false positive SNPs, ambiguously called sites, and gaps) are mitigated is of utmost importance. The choices researchers make regarding the generation and analysis of WGS data can greatly influence the accuracy of short-read sequence alignments and, therefore, the efficacy of such experiments. We studied the effects of some of these choices, including: i) depth of sequencing coverage, ii) choice of reference-guided short-read sequence assembler, iii) choice of reference genome, and iv) whether to perform read-quality filtering and trimming, on our ability to detect true SNPs and on the frequencies of errors. We performed benchmarking experiments, during which we assembled simulated and real Listeria monocytogenes strain 08-5578 short-read sequence datasets of varying quality with four commonly used assemblers (BWA, MOSAIK, Novoalign, and SMALT), using reference genomes of varying genetic distances, and with or without read pre-processing (i.e., quality filtering and trimming). We found that assemblies of at least 50-fold coverage provided the most accurate results. In addition, MOSAIK yielded the fewest errors when reads were aligned to a nearly identical reference genome, while using SMALT to align reads against a reference sequence that is ∼0.82% distant from 08-5578 at the nucleotide level resulted in the detection of the greatest numbers of true SNPs and the fewest errors. Finally, we show that whether read pre-processing improves SNP detection depends upon the choice of reference sequence and assembler. In total, this study demonstrates that researchers should

  12. Choice of reference sequence and assembler for alignment of Listeria monocytogenes short-read sequence data greatly influences rates of error in SNP analyses.

    PubMed

    Pightling, Arthur W; Petronella, Nicholas; Pagotto, Franco

    2014-01-01

    The wide availability of whole-genome sequencing (WGS) and an abundance of open-source software have made detection of single-nucleotide polymorphisms (SNPs) in bacterial genomes an increasingly accessible and effective tool for comparative analyses. Thus, ensuring that real nucleotide differences between genomes (i.e., true SNPs) are detected at high rates and that the influences of errors (such as false positive SNPs, ambiguously called sites, and gaps) are mitigated is of utmost importance. The choices researchers make regarding the generation and analysis of WGS data can greatly influence the accuracy of short-read sequence alignments and, therefore, the efficacy of such experiments. We studied the effects of some of these choices, including: i) depth of sequencing coverage, ii) choice of reference-guided short-read sequence assembler, iii) choice of reference genome, and iv) whether to perform read-quality filtering and trimming, on our ability to detect true SNPs and on the frequencies of errors. We performed benchmarking experiments, during which we assembled simulated and real Listeria monocytogenes strain 08-5578 short-read sequence datasets of varying quality with four commonly used assemblers (BWA, MOSAIK, Novoalign, and SMALT), using reference genomes of varying genetic distances, and with or without read pre-processing (i.e., quality filtering and trimming). We found that assemblies of at least 50-fold coverage provided the most accurate results. In addition, MOSAIK yielded the fewest errors when reads were aligned to a nearly identical reference genome, while using SMALT to align reads against a reference sequence that is ∼0.82% distant from 08-5578 at the nucleotide level resulted in the detection of the greatest numbers of true SNPs and the fewest errors. Finally, we show that whether read pre-processing improves SNP detection depends upon the choice of reference sequence and assembler. In total, this study demonstrates that researchers should

  13. Analyses of DNA Base Sequences for Eukaryotes in Terms of Power Spectrum Method

    NASA Astrophysics Data System (ADS)

    Isohata, Yasuhiko; Hayashi, Masaki

    2005-02-01

    By adopting a power spectrum method we have analyzed long-range correlations in the gene base sequences, exons and introns for five or six eukaryote species. As a measure of the long-range correlations, we have used an exponent α in 1/f^α, which is an approximation of a power spectrum in a low-frequency region. We have analyzed frequency distributions of α and the dependence of its average values <α> on the sequence length for the five or six species, paying particular attention to the species dependence. We have shown that long-range correlations have been formed mainly due to the intron's elongation as well as by the sequence structures of introns acquired over the course of evolution.
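
    The general idea of estimating such an exponent can be sketched as follows. This is not the authors' procedure: the purine/pyrimidine encoding, the direct (slow) discrete Fourier transform, the log-log least-squares fit, and the names `to_signal`, `power_spectrum` and `fit_alpha` are all assumptions chosen to keep the example short and dependency-free.

```python
import cmath
import math

# Assumed numeric encoding: purines +1, pyrimidines -1 (other symbols raise KeyError).
ENCODING = {"A": 1.0, "G": 1.0, "C": -1.0, "T": -1.0}


def to_signal(seq):
    """Encode a DNA string as +1/-1 values and subtract the mean."""
    values = [ENCODING[b] for b in seq.upper()]
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def power_spectrum(signal):
    """Return (frequency, power) pairs for k = 1 .. N//2 using a direct O(N^2) DFT."""
    n = len(signal)
    spectrum = []
    for k in range(1, n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(signal))
        spectrum.append((k / n, abs(coeff) ** 2 / n))
    return spectrum


def fit_alpha(spectrum, max_freq):
    """Fit log(power) = c - alpha * log(f) by least squares for 0 < f <= max_freq."""
    points = [(math.log(f), math.log(p)) for f, p in spectrum if f <= max_freq and p > 0]
    if len(points) < 2:
        raise ValueError("need at least two positive spectrum points to fit a slope")
    mx = sum(x for x, _ in points) / len(points)
    my = sum(y for _, y in points) / len(points)
    sxx = sum((x - mx) ** 2 for x, _ in points)
    if sxx == 0:
        raise ValueError("all fitted frequencies are identical; slope is undefined")
    slope = sum((x - mx) * (y - my) for x, y in points) / sxx
    return -slope


if __name__ == "__main__":
    spectrum = power_spectrum(to_signal("ATGGCGTACGTTAGCCGATAGGCTAACG" * 4))
    print(fit_alpha(spectrum, max_freq=0.1))
```

    For sequences of realistic length an FFT routine would replace the direct transform, and the cut-off `max_freq` that defines the low-frequency region is a free choice that affects the fitted value.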

  14. Accuracy of sequence alignment and fold assessment using reduced amino acid alphabets.

    PubMed

    Melo, Francisco; Marti-Renom, Marc A

    2006-06-01

    Reduced or simplified amino acid alphabets group the 20 naturally occurring amino acids into a smaller number of representative protein residues. To date, several reduced amino acid alphabets have been proposed, which have been derived and optimized by a variety of methods. The resulting reduced amino acid alphabets have been applied to pattern recognition, generation of consensus sequences from multiple alignments, protein folding, and protein structure prediction. In this work, amino acid substitution matrices and statistical potentials were derived based on several reduced amino acid alphabets and their performance assessed in a large benchmark for the tasks of sequence alignment and fold assessment of protein structure models, using as a reference frame the standard alphabet of 20 amino acids. The results showed that a large reduction in the total number of residue types does not necessarily translate into a significant loss of discriminative power for sequence alignment and fold assessment. Therefore, some definitions of a few residue types are able to encode most of the relevant sequence/structure information that is present in the 20 standard amino acids. Based on these results, we suggest that the use of reduced amino acid alphabets may allow increasing the accuracy of current substitution matrices and statistical potentials for the prediction of protein structure of remote homologs. PMID:16506243
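
    To make the notion of a reduced alphabet concrete, the sketch below maps the 20 standard residues onto six classes and compares sequence identity before and after reduction. The six-class grouping is purely illustrative and is not one of the alphabets evaluated in the study; the names `GROUPS`, `reduce_sequence` and `identity` are likewise hypothetical.

```python
# Illustrative six-class grouping (an assumption, not taken from the record above).
GROUPS = {
    "a": "AVLIMC",  # aliphatic / sulfur-containing
    "r": "FWYH",    # aromatic
    "p": "STNQ",    # polar uncharged
    "k": "KR",      # positively charged
    "d": "DE",      # negatively charged
    "g": "GP",      # glycine and proline
}
REDUCED = {aa: label for label, members in GROUPS.items() for aa in members}


def reduce_sequence(seq):
    """Rewrite a protein sequence in the reduced alphabet (KeyError on unknown residues)."""
    return "".join(REDUCED[aa] for aa in seq.upper())


def identity(a, b):
    """Fraction of identical positions between two equal-length, pre-aligned strings."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


if __name__ == "__main__":
    s1, s2 = "ILKDGS", "VMREGT"
    print(identity(s1, s2), identity(reduce_sequence(s1), reduce_sequence(s2)))
```

    Two sequences that differ only by substitutions within a class become identical after reduction, which is the effect a reduced-alphabet substitution matrix builds on.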

  15. The amino acid sequence of monal pheasant lysozyme and its activity.

    PubMed

    Araki, T; Matsumoto, T; Torikata, T

    1998-10-01

    The amino acid sequence of monal pheasant lysozyme and its activity were analyzed. Carboxymethylated lysozyme was digested with trypsin and the resulting peptides were sequenced. The established amino acid sequence had one amino acid substitution at position 102 (Arg to Gly) compared with Indian peafowl lysozyme and four amino acid substitutions at positions 3 (Phe to Tyr), 15 (His to Leu), 41 (Gln to His), and 121 (Gln to His) compared with chicken lysozyme. Analysis of the time-courses of reaction using N-acetylglucosamine pentamer as a substrate showed a difference of binding free energy change (-0.4 kcal/mol) at subsites A between monal pheasant and Indian peafowl lysozyme. This was assumed to be caused by the amino acid substitution at subsite A with loss of a positive charge at position 102 (Arg102 to Gly). PMID:9836434
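
    Listing substitutions by position, as done in the abstract above, amounts to a position-wise comparison of two aligned sequences. The sketch below shows one way to do this; the function name `substitutions`, the 1-based numbering default, and the toy strings in the usage example are assumptions, and the strings are not the real lysozyme sequences.

```python
def substitutions(reference, query, offset=1):
    """Return (position, reference_residue, query_residue) for each mismatch.

    Both inputs must already be aligned to the same length; columns containing
    a gap character '-' are skipped. Positions start at `offset`.
    """
    if len(reference) != len(query):
        raise ValueError("aligned sequences must have equal length")
    return [
        (pos, r, q)
        for pos, (r, q) in enumerate(zip(reference, query), start=offset)
        if r != q and "-" not in (r, q)
    ]


if __name__ == "__main__":
    # Toy five-residue strings, used only to show the output format.
    for pos, ref, alt in substitutions("KVFGR", "KVYGR"):
        print(f"{ref}{pos}{alt}")
```

    Note that positions count alignment columns, so numbering matches residue numbering only when the reference contains no gaps.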

  16. Studies on monotreme proteins. VII. Amino acid sequence of myoglobin from the platypus, Ornithorhynchus anatinus.

    PubMed

    Fisher, W K; Thompson, E O

    1976-03-01

    Myoglobin isolated from skeletal muscle of the platypus contains 153 amino acid residues. The complete amino acid sequence has been determined following cleavage with cyanogen bromide and further digestion of the four fragments with trypsin, chymotrypsin, pepsin and thermolysin. Sequences of the purified peptides were determined by the dansyl-Edman procedure. The amino acid sequence showed 25 differences from human myoglobin and 24 from kangaroo myoglobin. Amino acid sequences in myoglobins are more conserved than sequences in the alpha- and beta-globin chains, and platypus myoglobin shows a similar number of variations in sequence to kangaroo myoglobin when compared with myoglobin of other species. The date of divergence of the platypus from other mammals was estimated at 102 +/- 31 million years, based on the number of amino acid differences between species and allowing for mutations during the evolutionary period. This estimate differs widely from the estimate given by similar treatment of the alpha- and beta-chain sequences and a constant rate of mutation of globin chains is not supported. PMID:962722

  17. cDNA-derived amino acid sequences of myoglobins from nine species of whales and dolphins.

    PubMed

    Iwanami, Kentaro; Mita, Hajime; Yamamoto, Yasuhiko; Fujise, Yoshihiro; Yamada, Tadasu; Suzuki, Tomohiko

    2006-10-01

    We determined the myoglobin (Mb) cDNA sequences of nine cetaceans, of which six are the first reports of Mb sequences: sei whale (Balaenoptera borealis), Bryde's whale (Balaenoptera edeni), pygmy sperm whale (Kogia breviceps), Stejneger's beaked whale (Mesoplodon stejnegeri), Longman's beaked whale (Indopacetus pacificus), and melon-headed whale (Peponocephala electra), and three confirm the previously determined chemical amino acid sequences: sperm whale (Physeter macrocephalus), common minke whale (Balaenoptera acutorostrata) and pantropical spotted dolphin (Stenella attenuata). We found two types of Mb in the skeletal muscle of pantropical spotted dolphin: Mb I with the same amino acid sequence as that deposited in the protein database, and Mb II, which differs at two amino acid residues compared with Mb I. Using an alignment of the amino acid or cDNA sequences of cetacean Mb, we constructed a phylogenetic tree by the NJ method. Clustering of cetacean Mb amino acid and cDNA sequences essentially follows the classical taxonomy of cetaceans, suggesting that Mb sequence data is valid for classification of cetaceans at least to the family level. PMID:16962803

  18. Molecular Characterization and Variation of the Broad bean wilt virus 2 Isolates Based on Analyses of Complete Genome Sequences.

    PubMed

    Kwak, Hae-Ryun; Kim, Mi-Kyeong; Lee, Ye-Ji; Seo, Jang-Kyun; Kim, Jeong-Soo; Kim, Kook-Hyung; Cha, Byeongjin; Choi, Hong-Soo

    2013-12-01

    The full-genome sequences of fourteen isolates of Broad bean wilt virus 2 (BBWV2), collected from broad bean, pea, spinach, bell pepper and paprika plants in Korea during the years 2006-2012, were determined and analyzed comparatively along with fifteen previously reported BBWV2 genome sequences. Sequence analyses showed that RNA-1 and RNA-2 sequences of BBWV2 Korean isolates consisted of 5950-5956 and 3568-3604 nucleotides, respectively. Full-length genome sequence-based phylogenetic analyses revealed that the BBWV2 Korean isolates could be divided into three major groups comprising GS-I (isolates BB2 and RP7) along with isolate IP, GS-II (isolates BB5, P2, P3 and RP3) along with isolate B935, and GS-III including 16 BBWV2 Korean isolates. Interestingly, GS-III appears to be newly emerged and predominant in Korea. Recombination analyses identified two recombination events in the analyzed BBWV2 population: one in the RNA-1 of isolate K and another one in the RNA-2 of isolate XJ14-3. However, no recombination events were detected in the other 21 Korean isolates. On the other hand, out of 29 BBWV2 isolates, 16 isolates were found to be reassortants, in which each RNA segment (i.e. RNA1 and RNA2) originated from a different parental isolate. Our findings suggested that reassortment rather than recombination is a major evolutionary force in the genetic diversification of BBWV population in Korea. PMID:25288968

  19. Molecular Characterization and Variation of the Broad bean wilt virus 2 Isolates Based on Analyses of Complete Genome Sequences

    PubMed Central

    Kwak, Hae-Ryun; Kim, Mi-Kyeong; Lee, Ye-Ji; Seo, Jang-Kyun; Kim, Jeong-Soo; Kim, Kook-Hyung; Cha, Byeongjin; Choi, Hong-Soo

    2013-01-01

    The full-genome sequences of fourteen isolates of Broad bean wilt virus 2 (BBWV2), collected from broad bean, pea, spinach, bell pepper and paprika plants in Korea during the years 2006–2012, were determined and analyzed comparatively along with fifteen previously reported BBWV2 genome sequences. Sequence analyses showed that RNA-1 and RNA-2 sequences of BBWV2 Korean isolates consisted of 5950–5956 and 3568–3604 nucleotides, respectively. Full-length genome sequence-based phylogenetic analyses revealed that the BBWV2 Korean isolates could be divided into three major groups comprising GS-I (isolates BB2 and RP7) along with isolate IP, GS-II (isolates BB5, P2, P3 and RP3) along with isolate B935, and GS-III including 16 BBWV2 Korean isolates. Interestingly, GS-III appears to be newly emerged and predominant in Korea. Recombination analyses identified two recombination events in the analyzed BBWV2 population: one in the RNA-1 of isolate K and another one in the RNA-2 of isolate XJ14-3. However, no recombination events were detected in the other 21 Korean isolates. On the other hand, out of 29 BBWV2 isolates, 16 isolates were found to be reassortants, in which each RNA segment (i.e. RNA1 and RNA2) originated from a different parental isolate. Our findings suggested that reassortment rather than recombination is a major evolutionary force in the genetic diversification of BBWV population in Korea. PMID:25288968

  20. Sequence analyses and evolutionary relationships among the energy-coupling proteins Enzyme I and HPr of the bacterial phosphoenolpyruvate: sugar phosphotransferase system.

    PubMed Central

    Reizer, J.; Hoischen, C.; Reizer, A.; Pham, T. N.; Saier, M. H.

    1993-01-01

    We have previously reported the overexpression, purification, and biochemical properties of the Bacillus subtilis Enzyme I of the phosphoenolpyruvate: sugar phosphotransferase system (PTS) (Reizer, J., et al., 1992, J. Biol. Chem. 267, 9158-9169). We now report the sequencing of the ptsI gene of B. subtilis encoding Enzyme I (570 amino acids and 63,076 Da). Putative transcriptional regulatory signals are identified, and the pts operon is shown to be subject to carbon source-dependent regulation. Multiple alignments of the B. subtilis Enzyme I with (1) six other sequenced Enzymes I of the PTS from various bacterial species, (2) phosphoenolpyruvate synthase of Escherichia coli, and (3) bacterial and plant pyruvate: phosphate dikinases (PPDKs) revealed regions of sequence similarity as well as divergence. Statistical analyses revealed that these three types of proteins comprise a homologous family, and the phylogenetic tree of the 11 sequenced protein members of this family was constructed. This tree was compared with that of the 12 sequenced HPr proteins or protein domains. Antibodies raised against the B. subtilis and E. coli Enzymes I exhibited immunological cross-reactivity with each other as well as with PPDK of Bacteroides symbiosus, providing support for the evolutionary relationships of these proteins suggested from the sequence comparisons. Putative flexible linkers tethering the N-terminal and the C-terminal domains of protein members of the Enzyme I family were identified, and their potential significance with regard to Enzyme I function is discussed. The codon choice pattern of the B. subtilis and E. coli ptsI and ptsH genes was found to exhibit a bias toward optimal codons in these organisms. (ABSTRACT TRUNCATED AT 250 WORDS) PMID:7686067

  1. Whole genome sequence analyses of Xylella fastidiosa PD strains from different geographical regions

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome sequences were determined for two Pierce’s disease (PD) causing Xylella fastidiosa (Xf) strains, one from Florida and one from Taiwan. The Florida strain was ATCC 35879, the type of strain used as a standard reference for related taxonomy research. By contrast, the Taiwan strain used was only...

  2. spads 1.0: a toolbox to perform spatial analyses on DNA sequence data sets.

    PubMed

    Dellicour, Simon; Mardulyn, Patrick

    2014-05-01

    SPADS 1.0 (for 'Spatial and Population Analysis of DNA Sequences') is a population genetic toolbox for characterizing genetic variability within and among populations from DNA sequences. In view of the drastic increase in genetic information available through sequencing methods, spads was specifically designed to deal with multilocus data sets of DNA sequences. It computes several summary statistics from populations or groups of populations, performs input file conversions for other population genetic programs and implements locus-by-locus and multilocus versions of two clustering algorithms to study the genetic structure of populations. The toolbox also includes two MATLAB and R functions, GDISPAL and GDIVPAL, to display differentiation and diversity patterns across landscapes. These functions aim to generate interpolating surfaces based on multilocus distance and diversity indices. In the case of multiple loci, such surfaces can represent a useful alternative to the multiple pie-chart maps traditionally used in phylogeography to represent the spatial distribution of genetic diversity. These coloured surfaces can also be used to compare different data sets or different diversity and/or distance measures estimated on the same data set. PMID:24215429

  3. DNA Sequence and Expression Variation of Hop (Humulus lupulus) Valerophenone Synthase (VPS), a Key Gene in Bitter Acid Biosynthesis

    PubMed Central

    Castro, Consuelo B.; Whittock, Lucy D.; Whittock, Simon P.; Leggett, Grey; Koutoulis, Anthony

    2008-01-01

    Background The hop plant (Humulus lupulus) is a source of many secondary metabolites, with bitter acids essential in the beer brewing industry and others having potential applications for human health. This study investigated variation in DNA sequence and gene expression of valerophenone synthase (VPS), a key gene in the bitter acid biosynthesis pathway of hop. Methods Sequence variation was studied in 12 varieties, and expression was analysed in four of the 12 varieties in a series across the development of the hop cone. Results Nine single nucleotide polymorphisms (SNPs) were detected in VPS, seven of which were synonymous. The two non-synonymous polymorphisms did not appear to be related to typical bitter acid profiles of the varieties studied. However, real-time quantitative reverse-transcription polymerase chain reaction (qRT-PCR) analysis of VPS expression during hop cone development showed a clear link with the bitter acid content. The highest levels of VPS expression were observed in two triploid varieties, ‘Symphony’ and ‘Ember’, which typically have high bitter acid levels. Conclusions In all hop varieties studied, VPS expression was lowest in the leaves and an increase in expression was consistently observed during the early stages of cone development. PMID:18519445

  4. ChIPseq in Yeast Species: From Chromatin Immunoprecipitation to High-Throughput Sequencing and Bioinformatics Data Analyses.

    PubMed

    Lelandais, Gaëlle; Blugeon, Corinne; Merhej, Jawad

    2016-01-01

    Chromatin immunoprecipitation (ChIP) followed by high-throughput sequencing (ChIPseq) is a powerful technique for the genome-wide location of protein DNA-binding sites. The ChIP experiment consists of treating living cells with a cross-linking agent to bind proteins to their DNA substrates. After fragmentation of DNA, specific fractions associated with a particular protein of interest are purified by immunoaffinity. They are next sequenced and identified on the reference genome using dedicated bioinformatics programs. Several technical aspects are important to obtain high-quality ChIPseq results. These include the quality of antibodies, the sequencing protocols, the use of accurate controls and the careful choice of bioinformatics tools. We present here a general protocol to perform ChIPseq analyses in yeast species. This protocol has been optimized to identify target genes of specific transcription factors but can be used for any other DNA-binding proteins. PMID:26483023

  5. Phylogeny of the Sphaerotilus-Leptothrix group inferred from morphological comparisons, genomic fingerprinting, and 16S ribosomal DNA sequence analyses.

    PubMed

    Siering, P L; Ghiorse, W C

    1996-01-01

    Phase-contrast light microscopy revealed that only one of eight cultivated strains belonging to the Sphaerotilus-Leptothrix group of sheathed bacteria actually produced a sheath in standard growth media. Two Sphaerotilus natans strains produced branched cells, but other morphological characteristics that were used to identify these bacteria were consistent with previously published descriptions. Genomic fingerprints, which were obtained by performing PCR amplification with primers corresponding to enterobacterial repetitive intergenic consensus sequences, were useful for distinguishing between the genera Sphaerotilus and Leptothrix, as well as among individual strains. The complete 16S ribosomal DNA (rDNA) sequences of two strains of "Leptothrix discophora" (strains SP-6 and SS-1) were determined. In addition, partial sequences (approximately 300 nucleotides) of one strain of Leptothrix cholodnii (strain LMG 7171), an unidentified Leptothrix strain (strain NC-1), and four strains of Sphaerotilus natans (strains ATCC 13338T [T = type strain], ATCC 15291, ATCC 29329, and ATCC 29330) were determined. We found that two of the S. natans strains (ATCC 15291 and ATCC 13338T), which differed in morphology and in their genomic fingerprints, had identical sequences in the 300-nucleotide region sequenced. Both parsimony and distance matrix methods were used to infer the evolutionary relationships of the eight strains in a comparison of the 16S rDNA sequences of these organisms with 16S rDNA sequences obtained from ribosomal sequence databases. All of the strains clustered in the Rubrivivax subdivision of the beta subclass of the Proteobacteria, which confirmed previously published conclusions concerning selected individual strains. Additional analyses revealed that all of the S. natans strains clustered in one closely related group, while the Leptothrix strains clustered in two separate lineages that were approximately equidistant from the S. natans cluster. This finding

  6. Genetic Analyses of the Internal Transcribed Spacer Sequences Suggest Introgression and Duplication in the Medicinal Mushroom Agaricus subrufescens.

    PubMed

    Chen, Jie; Moinard, Magalie; Xu, Jianping; Wang, Shouxian; Foulongne-Oriol, Marie; Zhao, Ruilin; Hyde, Kevin D; Callac, Philippe

    2016-01-01

    The internal transcribed spacer (ITS) region of the nuclear ribosomal RNA gene cluster is widely used in fungal taxonomy and phylogeographic studies. The medicinal and edible mushroom Agaricus subrufescens has a worldwide distribution with a high level of polymorphism in the ITS region. A previous analysis suggested notable ITS sequence heterogeneity within the wild French isolate CA487. The objective of this study was to investigate the pattern and potential mechanism of ITS sequence heterogeneity within this strain. Using PCR, cloning, and sequencing, we identified three types of ITS sequences, A, B, and C with a balanced distribution, which differed from each other at 13 polymorphic positions. The phylogenetic comparisons with samples from different continents revealed that the type C sequence was similar to those found in Oceanian and Asian specimens of A. subrufescens while types A and B sequences were close to those found in the Americas or in Europe. We further investigated the inheritance of these three ITS sequence types by analyzing their distribution among single-spore isolates from CA487. In this analysis, three co-dominant markers were used firstly to distinguish the homokaryotic offspring from the heterokaryotic offspring. The homokaryotic offspring were then analyzed for their ITS types. Our genetic analyses revealed that types A and B were two alleles segregating at one locus ITSI, while type C was not allelic with types A and B but was located at another unlinked locus ITSII. Furthermore, type C was present in only one of the two constitutive haploid nuclei (n) of the heterokaryotic (n+n) parent CA487. These data suggest that there was a relatively recent introduction of the type C sequence and a duplication of the ITS locus in this strain. Whether other genes were also transferred and duplicated and their impacts on genome structure and stability remain to be investigated. PMID:27228131

  7. Genetic Analyses of the Internal Transcribed Spacer Sequences Suggest Introgression and Duplication in the Medicinal Mushroom Agaricus subrufescens

    PubMed Central

    Chen, Jie; Moinard, Magalie; Xu, Jianping; Wang, Shouxian; Foulongne-Oriol, Marie; Zhao, Ruilin; Hyde, Kevin D.; Callac, Philippe

    2016-01-01

    The internal transcribed spacer (ITS) region of the nuclear ribosomal RNA gene cluster is widely used in fungal taxonomy and phylogeographic studies. The medicinal and edible mushroom Agaricus subrufescens has a worldwide distribution with a high level of polymorphism in the ITS region. A previous analysis suggested notable ITS sequence heterogeneity within the wild French isolate CA487. The objective of this study was to investigate the pattern and potential mechanism of ITS sequence heterogeneity within this strain. Using PCR, cloning, and sequencing, we identified three types of ITS sequences, A, B, and C with a balanced distribution, which differed from each other at 13 polymorphic positions. The phylogenetic comparisons with samples from different continents revealed that the type C sequence was similar to those found in Oceanian and Asian specimens of A. subrufescens while types A and B sequences were close to those found in the Americas or in Europe. We further investigated the inheritance of these three ITS sequence types by analyzing their distribution among single-spore isolates from CA487. In this analysis, three co-dominant markers were used firstly to distinguish the homokaryotic offspring from the heterokaryotic offspring. The homokaryotic offspring were then analyzed for their ITS types. Our genetic analyses revealed that types A and B were two alleles segregating at one locus ITSI, while type C was not allelic with types A and B but was located at another unlinked locus ITSII. Furthermore, type C was present in only one of the two constitutive haploid nuclei (n) of the heterokaryotic (n+n) parent CA487. These data suggest that there was a relatively recent introduction of the type C sequence and a duplication of the ITS locus in this strain. Whether other genes were also transferred and duplicated and their impacts on genome structure and stability remain to be investigated. PMID:27228131

  8. Comparative sequence analyses of genome and transcriptome reveal novel transcripts and variants in the Asian elephant Elephas maximus.

    PubMed

    Reddy, Puli Chandramouli; Sinha, Ishani; Kelkar, Ashwin; Habib, Farhat; Pradhan, Saurabh J; Sukumar, Raman; Galande, Sanjeev

    2015-12-01

    The Asian elephant Elephas maximus and the African elephant Loxodonta africana that diverged 5-7 million years ago exhibit differences in their physiology, behaviour and morphology. A comparative genomics approach would be useful and necessary for evolutionary and functional genetic studies of elephants. We performed sequencing of E. maximus and mapped the reads to L. africana at ~15X coverage. Through comparative sequence analyses, we have identified Asian elephant specific homozygous, non-synonymous single nucleotide variants (SNVs) that map to 1514 protein coding genes, many of which are involved in olfaction. We also present the first report of a high-coverage transcriptome sequence in E. maximus from peripheral blood lymphocytes. We have identified 103 novel protein coding transcripts and 66 long non-coding (lnc)RNAs. We also report the presence of 181 protein domains unique to elephants when compared to other Afrotheria species. Each of these findings can be further investigated to gain a better understanding of functional differences unique to elephant species, as well as those unique to elephantids in comparison with other mammals. This work therefore provides a valuable resource to explore the immense research potential of comparative analyses of transcriptome and genome sequences in the Asian elephant. PMID:26648035

  9. [Whole-sequence Analyses for 12 HBV C/D Recombinants from a Population in Tibet (China)].

    PubMed

    Liu, Tiezhu; Shen, Liping; Yin, Wenjiao; Wang, Feng; Wang, Fuzhen; Zhang, Guomin; Zheng, Hui; Dunzhu, Duoji; Bi, Shengli; Cui, Fuqiang

    2016-03-01

    We wished to undertake molecular genetic typing and evaluate recombinants of the hepatitis-B virus (HBV) in Tibet (China). Multistage random sampling was used to collect HBsAg-positive samples. Nested polymerase chain reactions were used to amplify the whole sequence of the HBV. DNAstar, MEGA6 and SimPlot were used to assemble sequences, create phylogenetic trees, and undertake recombination analyses. Twelve whole sequences of the HBV of a Tibetan population were collected using these methods. Results showed that all 12 strains were C/D recombinants. Nine of the recombinations were at nt750, and the other three at nt1526. Therefore, the 12 strains could be divided into two types of recombinants: C/Da and C/Db. Analyses of the sequence of the whole genome revealed that the 12 strains belonged to genotype C, and that the nucleotide distance was > 4% between the 12 strains and sub-genotypes C1 to C15 in Genbank. The most likely sub-genotype was C1. Individuals with C/Da were from central and northern Tibet (e.g., Lasa, Linzhi, Ali) and those with C/Db recombinants were from Shannan in southern Tibet. These data suggest that the two types of recombinants had a good distribution in Tibet. Also, they can provide important information for studies on HBV recombination, gene features, virus evolution, as well as the control and prevention of HBV infection in Tibet. PMID:27396158

  10. Complete mitochondrial genome DNA sequence for two ophiuroids and a holothuroid: the utility of protein gene sequence and gene maps in the analyses of deep deuterostome phylogeny.

    PubMed

    Scouras, Andrea; Beckenbach, Karen; Arndt, Allan; Smith, Michael J

    2004-04-01

    The complete mitochondrial genome sequences have been determined for the holothuroid Cucumaria miniata and two ophiuroid species Ophiopholis aculeata and Ophiura lütkeni. In addition, the nucleotide sequence of the mitochondrial protein-coding genes for the asteroid Pisaster ochraceus has been completed. Maximum-likelihood and LogDet distance analyses of concatenated protein-coding sequences produced a series of trees that did not conclusively support generally accepted models of echinoderm phylogeny. The ophiuroid data consistently demonstrated accelerated nucleotide divergence rates and lack of stationarity. This confounds the phylogenetic analyses. Molecular investigations using individual protein-coding gene alignments demonstrated that the cytochrome b gene exhibits the least deviation in rate and stationarity and generated some trees consistent with proposed echinoderm phylogenies. Phylogenies based on echinoderm mitochondrial gene rearrangements also proved problematic because of extensive variation in gene order between and within classes. A comparison of the two distinctive ophiuroid mitochondrial gene orders supports the hypothesis that O. lütkeni has a more derived mitochondrial gene order versus O. aculeata. The variation in the echinoderm mitochondrial gene maps reinforces the limitations of the application of mitochondrial gene rearrangements as a global phylogenetic tool. PMID:15019608

  11. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome.

    PubMed

    Pinto, Ameet J; Sharp, Jonathan O; Yoder, Michael J; Almstrand, Robert

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome. PMID:26769942

  12. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome

    PubMed Central

    Pinto, Ameet J.; Sharp, Jonathan O.; Yoder, Michael J.

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome. PMID:26769942

  13. A computer program for the estimation of protein and nucleic acid sequence diversity in random point mutagenesis libraries

    PubMed Central

    Volles, Michael J.; Lansbury, Peter T.

    2005-01-01

    A computer program for the generation and analysis of in silico random point mutagenesis libraries is described. The program operates by mutagenizing an input nucleic acid sequence according to mutation parameters specified by the user for each sequence position and type of point mutation. The program can mimic almost any type of random mutagenesis library, including those produced via error-prone PCR (ep-PCR), mutator Escherichia coli strains, chemical mutagenesis, and doped or random oligonucleotide synthesis. The program analyzes the generated nucleic acid sequences and/or the associated protein library to produce several estimates of library diversity (number of unique sequences, point mutations, and single point mutants) and the rate of saturation of these diversities during experimental screening or selection of clones. This information allows one to select the optimal screen size for a given mutagenesis library, necessary to efficiently obtain a certain coverage of the sequence-space. The program also reports the abundance of each specific protein mutation at each sequence position, which is useful as a measure of the level and type of mutation bias in the library. Alternatively, one can use the program to evaluate the relative merits of preexisting libraries, or to examine various hypothetical mutation schemes to determine the optimal method for creating a library that serves the screen/selection of interest. Simulated libraries of at least 10^9 sequences are accessible by the numerical algorithm with currently available personal computers; an analytical algorithm is also available which can rapidly calculate a subset of the numerical statistics in libraries of arbitrarily large size. A multi-type double-strand stochastic model of ep-PCR is developed in an appendix to demonstrate the applicability of the algorithm to amplifying mutagenesis procedures. Estimators of DNA polymerase mutation-type-specific error rates are derived using the model. Analyses of an
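
    The kind of simulation this abstract describes can be illustrated with a minimal Python sketch. It is not the authors' program: it assumes a single uniform per-position mutation rate (the described program accepts parameters per position and per mutation type), and the names `mutagenize` and `library_diversity` are hypothetical.

```python
import random

BASES = "ACGT"

def mutagenize(seq, rate, rng):
    """Return a copy of seq in which each base mutates with probability `rate`.

    Assumption: a mutated base is replaced by one of the other three bases
    with equal probability (no position- or type-specific bias).
    """
    out = []
    for base in seq:
        if rng.random() < rate:
            out.append(rng.choice([b for b in BASES if b != base]))
        else:
            out.append(base)
    return "".join(out)

def library_diversity(seq, rate, size, seed=0):
    """Simulate a library of `size` clones and report simple diversity counts."""
    rng = random.Random(seed)
    library = [mutagenize(seq, rate, rng) for _ in range(size)]
    unique = set(library)
    # Single point mutants differ from the parent at exactly one position.
    single_mutants = {
        s for s in unique if sum(a != b for a, b in zip(s, seq)) == 1
    }
    return {
        "size": size,
        "unique": len(unique),
        "single_point_mutants": len(single_mutants),
    }
```

    Running `library_diversity` for increasing `size` values gives a rough picture of how quickly the number of unique sequences saturates, which is the quantity the abstract relates to choosing a screen size.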

  14. Complete nuclear ribosomal DNA sequence amplification and molecular analyses of Bangia (Bangiales, Rhodophyta) from China

    NASA Astrophysics Data System (ADS)

    Xu, Jiajie; Jiang, Bo; Chai, Sanming; He, Yuan; Zhu, Jianyi; Shen, Zonggen; Shen, Songdong

    2016-01-01

    Filamentous Bangia, which are distributed extensively throughout the world, have simple and similar morphological characteristics. Scientists can classify these organisms using molecular markers in combination with morphology. We successfully sequenced the complete nuclear ribosomal DNA, approximately 13 kb in length, from a marine Bangia population. We further analyzed the small subunit ribosomal DNA gene (nrSSU) and the internal transcribed spacer (ITS) sequence regions along with nine other marine, and two freshwater Bangia samples from China. Pairwise distances of the nrSSU and 5.8S ribosomal DNA gene sequences show the marine samples grouping together with low divergences (0-0.003; 0-0.006, respectively) from each other, but high divergences (0.123-0.126; 0.198, respectively) from freshwater samples. An exception is the marine sample collected from Weihai, which shows high divergence from both other marine samples (0.063-0.065; 0.129, respectively) and the freshwater samples (0.097; 0.120, respectively). A maximum likelihood phylogenetic tree based on a combined SSU-ITS dataset shows the samples divided into three clades: two marine clades containing Bangia spp. from North America, Europe, Asia, and Australia, and one freshwater clade containing Bangia atropurpurea from North America and China.
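
    The abstract does not say which distance measure was used. As an illustration only, the sketch below computes the uncorrected p-distance (the proportion of differing sites) between two aligned sequences; the function name `p_distance` and the choice to skip gapped columns are assumptions.

```python
def p_distance(a, b, gap="-"):
    """Uncorrected pairwise distance between two aligned sequences.

    Columns where either sequence has a gap are ignored (an assumption;
    other tools treat gaps differently).
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    diffs = 0
    for x, y in zip(a, b):
        if x == gap or y == gap:
            continue
        compared += 1
        if x != y:
            diffs += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return diffs / compared
```

    On this scale, a value such as 0.123 means that roughly 12 of every 100 compared sites differ.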

  15. Rtools: a web server for various secondary structural analyses on single RNA sequences.

    PubMed

    Hamada, Michiaki; Ono, Yukiteru; Kiryu, Hisanori; Sato, Kengo; Kato, Yuki; Fukunaga, Tsukasa; Mori, Ryota; Asai, Kiyoshi

    2016-07-01

    The secondary structures, as well as the nucleotide sequences, are important features of RNA molecules for characterizing their functions. According to the thermodynamic model, however, the probability of any secondary structure is very small. As a consequence, any tool to predict the secondary structures of RNAs has limited accuracy. On the other hand, there are a few tools to compensate for the imperfect predictions by calculating and visualizing the secondary structural information from RNA sequences. It is desirable to obtain the rich information from those tools through a friendly interface. We implemented a web server of the tools to predict secondary structures and to calculate various structural features based on the energy models of secondary structures. By just giving an RNA sequence to the web server, the user can get the different types of solutions of the secondary structures, the marginal probabilities such as base-pairing probabilities, loop probabilities and accessibilities of the local bases, the energy changes by arbitrary base mutations as well as the measures for validations of the predicted secondary structures. The web server is available at http://rtools.cbrc.jp, which integrates software tools, CentroidFold, CentroidHomfold, IPKnot, CapR, Raccess, Rchange and RintD. PMID:27131356

  16. Complete nuclear ribosomal DNA sequence amplification and molecular analyses of Bangia (Bangiales, Rhodophyta) from China

    NASA Astrophysics Data System (ADS)

    Xu, Jiajie; Jiang, Bo; Chai, Sanming; He, Yuan; Zhu, Jianyi; Shen, Zonggen; Shen, Songdong

    2016-09-01

    Filamentous Bangia, which are distributed extensively throughout the world, have simple and similar morphological characteristics. Scientists can classify these organisms using molecular markers in combination with morphology. We successfully sequenced the complete nuclear ribosomal DNA, approximately 13 kb in length, from a marine Bangia population. We further analyzed the small subunit ribosomal DNA gene (nrSSU) and the internal transcribed spacer (ITS) sequence regions along with nine other marine, and two freshwater Bangia samples from China. Pairwise distances of the nrSSU and 5.8S ribosomal DNA gene sequences show the marine samples grouping together with low divergences (0-0.003; 0-0.006, respectively) from each other, but high divergences (0.123-0.126; 0.198, respectively) from freshwater samples. An exception is the marine sample collected from Weihai, which shows high divergence from both other marine samples (0.063-0.065; 0.129, respectively) and the freshwater samples (0.097; 0.120, respectively). A maximum likelihood phylogenetic tree based on a combined SSU-ITS dataset shows the samples divided into three clades: two marine clades containing Bangia spp. from North America, Europe, Asia, and Australia, and one freshwater clade containing Bangia atropurpurea from North America and China.

  17. Two distinct ferredoxins from Rhodobacter capsulatus: complete amino acid sequences and molecular evolution.

    PubMed

    Saeki, K; Suetsugu, Y; Yao, Y; Horio, T; Marrs, B L; Matsubara, H

    1990-09-01

    Two distinct ferredoxins were purified from Rhodobacter capsulatus SB1003. Their complete amino acid sequences were determined by a combination of protease digestion, BrCN cleavage and Edman degradation. Ferredoxins I and II were composed of 64 and 111 amino acids, respectively, with molecular weights of 6,728 and 12,549 excluding iron and sulfur atoms. Both contained two Cys clusters in their amino acid sequences. The first cluster of ferredoxin I and the second cluster of ferredoxin II had a sequence, CxxCxxCxxxCP, in common with the ferredoxins found in Clostridia. The second cluster of ferredoxin I had a sequence, CxxCxxxxxxxxCxxxCM, with extra amino acids between the second and third Cys, which has been reported for other photosynthetic bacterial ferredoxins and putative ferredoxins (nif-gene products) from nitrogen-fixing bacteria, and with a unique occurrence of Met. The first cluster of ferredoxin II had a CxxCxxxxCxxxCP sequence, with two additional amino acids between the second and third Cys, a characteristic feature of Azotobacter-[3Fe-4S] [4Fe-4S]-ferredoxin. Ferredoxin II was also similar to Azotobacter-type ferredoxins with an extended carboxyl (C-) terminal sequence compared to the common Clostridium-type. The evolutionary relationship of the two together with a putative one recently found to be encoded in nifENXQ region in this bacterium [Moreno-Vivian et al. (1989) J. Bacteriol. 171, 2591-2598] is discussed. PMID:2277040
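
    Cys-cluster motifs written in this notation (where "x" stands for any residue) can be searched for in a protein sequence with a regular expression. The Python sketch below is illustrative only; `motif_to_regex` and `find_motif` are hypothetical helper names, and the test sequence in the usage note is made up rather than taken from either ferredoxin.

```python
import re

def motif_to_regex(motif):
    """Translate a motif such as 'CxxCxxCxxxCP' into a regex string.

    Lowercase 'x' matches any single residue; every other character
    must match literally.
    """
    return "".join("." if ch == "x" else re.escape(ch) for ch in motif)

def find_motif(motif, protein):
    """Return 0-based start positions of all (possibly overlapping) matches."""
    pattern = re.compile("(?=(" + motif_to_regex(motif) + "))")
    return [m.start() for m in pattern.finditer(protein)]
```

    For example, `find_motif("CxxCxxCxxxCP", "MACAACGGCTTTCPK")` returns `[2]` for this invented sequence, because the motif begins at the third residue.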

  18. Transcriptome analyses of Sclerotinia sclerotiorum infecting chickpea and lentil using RNA sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Sclerotinia sclerotiorum causes white mold of many important crops. To elucidate its pathogenic mechanisms, transcriptome analyses were used to study its interactions with chickpea and lentil. Five mRNA libraries were constructed from S. sclerotiorum (strain WM-A1), healthy chickpea (cv. Spanish Whit...

  19. Amino Acid Sequence of Anionic Peroxidase from the Windmill Palm Tree Trachycarpus fortunei

    PubMed Central

    2015-01-01

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications. PMID:25383699

  20. Isotopic analyses of nitrogenous compounds from the Murchison meteorite: ammonia, amines, amino acids, and polar hydrocarbons

    NASA Technical Reports Server (NTRS)

    Pizzarello, S.; Feng, X.; Epstein, S.; Cronin, J. R.

    1994-01-01

    The combined volatile bases (ammonia, aliphatic amines, and possibly other bases), ammonia, amino acids, and polar hydrocarbons were prepared from the Murchison meteorite for isotopic analyses. The volatile bases were obtained by cryogenic transfer after acid-hydrolysis of a hot-water extract and analyzed by combined gas chromatography-mass spectrometry of pentafluoropropionyl derivatives. The aliphatic amines present in this preparation comprise a mixture that includes both primary and secondary isomers through C5 at a total concentration of > or = 100 nmoles g-1. As commonly observed for meteoritic organic compounds, almost all isomers through C5 are present, and the concentrations within homologous series decrease with increasing chain length. Ammonia was chromatographically separated from the other volatile bases and found at a concentration of 1.1-1.3 micromoles g-1 meteorite. The ammonia analyzed includes contributions from ammonium salts and the hydrolysis of extractable organic compounds, e.g., carboxamides. Stable isotope analyses showed the volatile bases to be substantially enriched in the heavier isotopes, relative to comparable terrestrial compounds (delta D < or = +1221‰; delta 13C = +22‰; delta 15N = +93‰). Ammonia, per se, was found to have a somewhat lower delta 15N value (+69‰) than the total volatile bases; consequently, a higher delta 15N (>93‰) can be inferred for the other bases, which include the amines. Solvent-extractable polar hydrocarbons obtained separately were found to be enriched in 15N (delta 15N = +104‰). Total amino acids, prepared from a hydrolyzed hot-water extract by cation exchange chromatography, gave a delta 15N of +94‰, a value in good agreement with that obtained previously. Nitrogen isotopic data are also given for amino acid fractions separated chromatographically. The delta 15N values of the Murchison soluble organic compounds analyzed to date fall within a rather narrow range (delta 15N = +94 +/- 8‰), an observation

  1. Protein chemotaxonomy. XIII. Amino acid sequence of ferredoxin from Panax ginseng.

    PubMed

    Mino, Yoshiki

    2006-08-01

    The complete amino acid sequence of [2Fe-2S] ferredoxin from Panax ginseng (Araliaceae) has been determined by automated Edman degradation of the entire S-carboxymethylcysteinyl protein and of the peptides obtained by enzymatic digestion. This ferredoxin has a unique amino acid sequence, which includes an insertion of Tyr at the 3rd position from the amino-terminus and a deletion of two amino acid residues at the carboxyl terminus. This ferredoxin had 18 differences in its amino acid sequence compared to that of Petroselinum sativum (Umbelliferae). In contrast, 23-33 differences were observed compared to other dicotyledonous plants. This suggests that Panax ginseng is related taxonomically to umbelliferous plants. PMID:16880642

  2. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    PubMed

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor. PMID:2708331

  3. Complete cDNA and derived amino acid sequence of human factor V

    SciTech Connect

    Jenny, R.J.; Pittman, D.D.; Toole, J.J.; Kriz, R.W.; Aldape, R.A.; Hewick, R.M.; Kaufman, R.J.; Mann, K.G.

    1987-07-01

    cDNA clones encoding human factor V have been isolated from an oligo(dT)-primed human fetal liver cDNA library prepared with vector Charon 21A. The cDNA sequence of factor V from three overlapping clones includes a 6672-base-pair (bp) coding region, a 90-bp 5' untranslated region, and a 163-bp 3' untranslated region within which is a poly(A) tail. The deduced amino acid sequence consists of 2224 amino acids inclusive of a 28-amino acid leader peptide. Direct comparison with human factor VIII reveals considerable homology between proteins in amino acid sequence and domain structure: a triplicated A domain and duplicated C domain show approx. 40% identity with the corresponding domains in factor VIII. As in factor VIII, the A domains of factor V share approx. 40% amino acid-sequence homology with the three highly conserved domains in ceruloplasmin. The B domain of factor V contains 35 tandem and approx. 9 additional semiconserved repeats of nine amino acids of the form Asp-Leu-Ser-Gln-Thr-Thr/Asn-Leu-Ser-Pro and 2 additional semiconserved repeats of 17 amino acids. Factor V contains 37 potential N-linked glycosylation sites, 25 of which are in the B domain, and a total of 19 cysteine residues.
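
    As a quick consistency check on the figures quoted in the record above, the following minimal sketch divides the coding-region length by three and subtracts the leader peptide. The constant and function names are illustrative only and do not come from the source record; the numbers are the ones stated in the abstract.

```python
# Figures quoted in the abstract above.
CODING_REGION_BP = 6672      # length of the coding region in base pairs
REPORTED_AA = 2224           # deduced amino acids, leader peptide included
LEADER_PEPTIDE_AA = 28       # length of the leader peptide


def codon_count(coding_bp: int) -> int:
    """Return the number of complete codons in a coding region."""
    if coding_bp % 3 != 0:
        raise ValueError("coding region length is not a multiple of 3")
    return coding_bp // 3


codons = codon_count(CODING_REGION_BP)
# Length of the protein once the leader peptide is removed.
mature_length = REPORTED_AA - LEADER_PEPTIDE_AA

print(f"codons in coding region: {codons}")
print(f"residues after leader removal: {mature_length}")
```

    The codon count equals the reported number of amino acids, which suggests that the quoted coding-region length does not include a stop codon.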

  4. N-terminal sequence of amino acids and some properties of an acid-stable alpha-amylase from citric acid-koji (Aspergillus usamii var.).

    PubMed

    Suganuma, T; Tahara, N; Kitahara, K; Nagahama, T; Inuzuka, K

    1996-01-01

    An acid-stable alpha-amylase (AA) was purified from an acidic extract of citric acid-koji (A. usamii var.). The N-terminal sequence of the first 20 amino acids of the enzyme was identical with that of AA from A. niger, but the two enzymes differed in molecular weight. HPLC analysis for identifying the anomers of products indicated that the AA hydrolyzed maltopentaose (G5) at the third glycoside bond predominantly, which differed from Taka-amylase A and the neutral alpha-amylase (NA) from the citric acid-koji. PMID:8824843

  5. Multilocus Sequence Typing for Analyses of Clonality of Candida albicans Strains in Taiwan

    PubMed Central

    Chen, Kuo-Wei; Chen, Yee-Chun; Lo, Hsiu-Jung; Odds, Frank C.; Wang, Tzu-Hui; Lin, Chi-Yang; Li, Shu-Ying

    2006-01-01

    Multilocus sequence typing (MLST) was used to characterize the genetic profiles of 51 Candida albicans isolates collected from 12 hospitals in Taiwan. Among the 51 isolates, 16 were epidemiologically unrelated, 28 were isolates from 11 critically ill, human immunodeficiency virus (HIV)-negative patients, and 7 were long-term serial isolates from 3 HIV-positive patients. Internal regions of seven housekeeping genes were sequenced. A total of 83 polymorphic nucleotide sites were identified. Ten to 20 different genotypes were observed at the different loci, resulting, when combined, in 45 unique genotype combinations or diploid sequence types (DSTs). Thirty (36.1%) of the 83 individual changes were synonymous and 53 (63.9%) were nonsynonymous. Due to the diploid nature of C. albicans, MLST was more discriminatory than the pulsed-field gel electrophoresis-BssHII-restricted fragment method in discriminating epidemiologically related strains. MLST is able to trace the microevolution over time of C. albicans isolates in the same patient. All but one of the DSTs of our Taiwanese strain collections were novel to the internet C. albicans DST database (http://test1.mlst.net/). The DSTs of C. albicans in Taiwan were analyzed together with those of the reference strains and of the strains from the United Kingdom and United States by unweighted-pair group method using average linkages and minimum spanning tree. Our result showed that the DNA type of each isolate was patient specific and associated with ABC type and decade of isolation but not associated with mating type, anatomical source of isolation, hospital origin, or fluconazole resistance patterns. PMID:16757617
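
    The way per-locus genotypes combine into diploid sequence types can be illustrated with a minimal sketch. The isolate names and genotype numbers below are hypothetical, and the DST labels are local counters assigned in order of first appearance, not identifiers from the MLST database mentioned in the record. The last lines recompute the synonymous and nonsynonymous percentages from the counts given in the abstract.

```python
# Hypothetical isolates: each tuple holds the genotype number observed
# at each of seven sequenced loci (values are made up for illustration).
isolates = {
    "iso1": (1, 4, 2, 7, 3, 3, 5),
    "iso2": (1, 4, 2, 7, 3, 3, 5),  # identical profile to iso1
    "iso3": (1, 4, 2, 8, 3, 3, 5),  # differs from iso1 at one locus
    "iso4": (2, 1, 2, 7, 6, 3, 5),
}


def assign_dsts(profiles):
    """Label each distinct seven-locus profile with its own DST number."""
    dst_of_profile = {}
    assignment = {}
    for name, profile in profiles.items():
        if profile not in dst_of_profile:
            dst_of_profile[profile] = len(dst_of_profile) + 1
        assignment[name] = dst_of_profile[profile]
    return assignment


dsts = assign_dsts(isolates)
n_unique = len(set(dsts.values()))
print(dsts, n_unique)

# Counts from the abstract: 83 changes, 30 synonymous, 53 nonsynonymous.
synonymous_pct = round(100 * 30 / 83, 1)
nonsynonymous_pct = round(100 * 53 / 83, 1)
print(synonymous_pct, nonsynonymous_pct)
```

    Two isolates share a DST only when they match at every locus, which is why a single differing locus is enough to separate iso3 from iso1 in this toy data set.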

  6. Analyses of MYMIV-induced transcriptome in Vigna mungo as revealed by next generation sequencing

    PubMed Central

    Ganguli, Sayak; Dey, Avishek; Banik, Rahul; Kundu, Anirban; Pal, Amita

    2016-01-01

    Mungbean Yellow Mosaic Virus (MYMIV) is the viral pathogen that causes yellow mosaic disease in a number of legumes, including Vigna mungo. VM84 is a recombinant inbred line resistant to MYMIV, developed in our laboratory through introgression of the resistance trait from V. mungo line VM-1. Here we present the quality-control-passed transcriptome data of mock-inoculated (control) and MYMIV-infected VM84, which have already been submitted to the Sequence Read Archive (SRX1032950, SRX1082731) of NCBI. QC reports of the FASTQ files were generated by the ‘SeqQC V2.2’ bioinformatics tool. PMID:26981413

  7. Mitochondrial DNA sequence analyses in Bornean sucker fishes (Balitoridae: Teleostei: Gastromyzontinae).

    PubMed

    Sulaiman, Zohrah Haji; Hui, Tan Heok; Lim, Kelvin K P; Ng, Peter K L

    2006-03-01

    Phylogenetic relationships among Bornean sucker fishes (Teleostei: Balitoridae: Gastromyzontinae) were investigated by comparing cytochrome b gene sequences from eight species. The results were in general agreement with previous morphology-based studies. It was found that the genera Gastromyzon and Neogastromyzon are both monophyletic and that the Chinese homalopterid Crossostoma lacustre (Homalopterinae) is not related to the Bornean species. This molecular-level study of cytochrome b gene variation in Bornean gastromyzontins will undoubtedly help to shed light on the molecular systematics of this unique freshwater fish. PMID:21395984

  8. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

    PubMed Central

    Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.; Ryken, Scott A.; Yost, Will; Jha, Ramesh K.; Patterson, Johnathan; Monnat, Raymond J.; Barlow, Steven B.; Starkenburg, Shawn R.; Cattolico, Rose Ann

    2015-01-01

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. The Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes. PMID:26397803

  9. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae).

    PubMed

    Hovde, Blake T; Deodato, Chloe R; Hunsperger, Heather M; Ryken, Scott A; Yost, Will; Jha, Ramesh K; Patterson, Johnathan; Monnat, Raymond J; Barlow, Steven B; Starkenburg, Shawn R; Cattolico, Rose Ann

    2015-01-01

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼ 40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two "red" RuBisCO activases that are shared across many algal lineages. The Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes. PMID:26397803

  10. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

    SciTech Connect

    Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.; Ryken, Scott A.; Yost, Will; Jha, Ramesh K.; Patterson, Johnathan; Monnat, Raymond J.; Barlow, Steven B.; Starkenburg, Shawn R.; Cattolico, Rose Ann; Richardson, Paul M.

    2015-09-23

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. In conclusion, the Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.

  11. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

    DOE PAGES

    Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.; Ryken, Scott A.; Yost, Will; Jha, Ramesh K.; Patterson, Johnathan; Monnat, Raymond J.; Barlow, Steven B.; Starkenburg, Shawn R.; et al

    2015-09-23

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. In conclusion, the Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.

  12. Drug-Resistant Genotypes and Multi-Clonality in Plasmodium falciparum Analysed by Direct Genome Sequencing from Peripheral Blood of Malaria Patients

    PubMed Central

    Auburn, Sarah; Assefa, Samuel A.; Polley, Spencer D.; Manske, Magnus; MacInnis, Bronwyn; Rockett, Kirk A.; Maslen, Gareth L.; Sanders, Mandy; Quail, Michael A.; Chiodini, Peter L.; Kwiatkowski, Dominic P.; Clark, Taane G.; Sutherland, Colin J.

    2011-01-01

    Naturally acquired blood-stage infections of the malaria parasite Plasmodium falciparum typically harbour multiple haploid clones. The apparent number of clones observed in any single infection depends on the diversity of the polymorphic markers used for the analysis, and the relative abundance of rare clones, which frequently fail to be detected among PCR products derived from numerically dominant clones. However, minority clones are of clinical interest as they may harbour genes conferring drug resistance, leading to enhanced survival after treatment and the possibility of subsequent therapeutic failure. We deployed new generation sequencing to derive genome data for five non-propagated parasite isolates taken directly from 4 different patients treated for clinical malaria in a UK hospital. Analysis of depth of coverage and length of sequence intervals between paired reads identified both previously described and novel gene deletions and amplifications. Full-length sequence data was extracted for 6 loci considered to be under selection by antimalarial drugs, and both known and previously unknown amino acid substitutions were identified. Full mitochondrial genomes were extracted from the sequencing data for each isolate, and these are compared against a panel of polymorphic sites derived from published or unpublished but publicly available data. Finally, genome-wide analysis of clone multiplicity was performed, and the number of infecting parasite clones estimated for each isolate. Each patient harboured at least 3 clones of P. falciparum by this analysis, consistent with results obtained with conventional PCR analysis of polymorphic merozoite antigen loci. We conclude that genome sequencing of peripheral blood P. falciparum taken directly from malaria patients provides high quality data useful for drug resistance studies, genomic structural analyses and population genetics, and also robustly represents clonal multiplicity. PMID:21853089

  13. Discrimination of prey species of juvenile swordfish Xiphias gladius (Linnaeus, 1758) using signature fatty acid analyses

    NASA Astrophysics Data System (ADS)

    Young, Jock W.; Guest, Michaela A.; Lansdell, Matt; Phleger, Charles F.; Nichols, Peter D.

    2010-07-01

    Signature lipid and fatty acid analyses were used to discriminate the diet of swordfish (Xiphias gladius, orbital fork length: 60-203 cm) from waters off eastern Australia. The fatty acid (FA) composition of a range of known prey (squid, myctophids, and other fishes) of swordfish, taken from stomach samples and from net tows, was compared with that of the white muscle tissue (WMT) of swordfish from the same region. Swordfish muscle was lipid rich (average 24-42% dry weight), as was the skeleton (28-41%). The robustness of the approach was also tested by comparison against a key squid prey species that was collected and stored using different protocols: (i) fresh frozen, (ii) fresh frozen, then thawed, and (iii) stomach content collection. The FA profiles were generally similar, with the ratio of docosahexaenoic acid (DHA) and palmitic acid (16:0) in particular showing no significant difference. Major fatty acids in swordfish WMT were 18:1ω9c, 16:0, 22:6ω3, and 18:0. Multidimensional scaling showed that the swordfish WMT grouped closely with small fish prey including myctophids, and not with squid. Squid contained markedly higher 22:6ω3 than swordfish. Individual prey species of the Myctophidae could also be separated by the same technique. These results were supported by traditional stomach content analyses (SCA) that showed fish were the dominant prey for small swordfish sampled from southern waters whereas squid were the main prey in more northern waters, matching the FA patterns we found for the two regions. We propose that where general diet patterns are established, signature FA analysis has good potential to complement or, in some cases, replace temporal and spatial monitoring of trophic pathways for swordfish and other marine species.

  14. Fad7 gene identification and fatty acids phenotypic variation in an olive collection by EcoTILLING and sequencing approaches.

    PubMed

    Sabetta, Wilma; Blanco, Antonio; Zelasco, Samanta; Lombardo, Luca; Perri, Enzo; Mangini, Giacomo; Montemurro, Cinzia

    2013-08-01

    The ω-3 fatty acid desaturases (FADs) are enzymes responsible for catalyzing the conversion of linoleic acid to α-linolenic acid localized in the plastid or in the endoplasmic reticulum. In this research we report the genotypic and phenotypic variation of Italian Olea europaea L. germplasm for the fatty acid composition. The phenotypic oil characterization was followed by the molecular analysis of the plastidial-type ω-3 FAD gene (fad7) (EC 1.14.19), whose full-length sequence has been identified here in cultivar Leccino. The gene consisted of 2635 bp with 8 exons and 5'- and 3'-UTRs of 336 and 282 bp respectively, and showed a high level of heterozygosity (1/110 bp). The natural allelic variation was investigated both by a LiCOR EcoTILLING assay and by direct sequencing of the PCR product. Only three haplotypes were identified among the 96 analysed cultivars, highlighting the strong degree of conservation of this gene. PMID:23685785

  15. Metabolomic Analyses of Leishmania Reveal Multiple Species Differences and Large Differences in Amino Acid Metabolism

    PubMed Central

    Wang, Lijie; Zhang, Tong; Watson, David G.; Silva, Ana Marta; Coombs, Graham H.

    2015-01-01

    Comparative genomic analyses of Leishmania species have revealed relatively minor heterogeneity amongst recognised housekeeping genes and yet the species cause distinct infections and pathogenesis in their mammalian hosts. To gain greater information on the biochemical variation between species, and insights into possible metabolic mechanisms underpinning visceral and cutaneous leishmaniasis, we have undertaken in this study a comparative analysis of the metabolomes of promastigotes of L. donovani, L. major and L. mexicana. The analysis revealed 64 metabolites with confirmed identity differing 3-fold or more between the cell extracts of species, with 161 putatively identified metabolites differing similarly. Analysis of the media from cultures revealed an at least 3-fold difference in use or excretion of 43 metabolites of confirmed identity and 87 putatively identified metabolites that differed to a similar extent. Strikingly large differences were detected in their extent of amino acid use and metabolism, especially for tryptophan, aspartate, arginine and proline. Major pathways of tryptophan and arginine catabolism were shown to be to indole-3-lactate and arginic acid, respectively, which were excreted. The data presented provide clear evidence on the value of global metabolomic analyses in detecting species-specific metabolic features, thus application of this technology should be a major contributor to gaining greater understanding of how pathogens are adapted to infecting their hosts. PMID:26368322

  16. Does more sequence data improve estimates of galliform phylogeny? Analyses of a rapid radiation using a complete data matrix

    PubMed Central

    Braun, Edward L.

    2014-01-01

    The resolution of rapid evolutionary radiations or “bushes” in the tree of life has been one of the most difficult and interesting problems in phylogenetics. The avian order Galliformes appears to have undergone several rapid radiations that have limited the resolution of prior studies and obscured the position of taxa important both agriculturally and as model systems (chicken, turkey, Japanese quail). Here we present analyses of a multi-locus data matrix comprising over 15,000 sites, primarily from nuclear introns but also including three mitochondrial regions, from 46 galliform taxa with all gene regions sampled for all taxa. The increased sampling of unlinked nuclear genes provided strong bootstrap support for all but a small number of relationships. Coalescent-based methods to combine individual gene trees and analyses of datasets that are independent of published data indicated that this well-supported topology is likely to reflect the galliform species tree. The inclusion or exclusion of mitochondrial data had a limited impact upon analyses using either concatenated data or multispecies coalescent methods. Some of the key phylogenetic findings include support for a second major clade within the core phasianids that includes the chicken and Japanese quail and clarification of the phylogenetic relationships of turkey. Jackknifed datasets suggested that there is an advantage to sampling many independent regions across the genome rather than obtaining long sequences for a small number of loci, possibly reflecting the differences among gene trees that differ due to incomplete lineage sorting. Despite the novel insights we obtained using this increased sampling of gene regions, some nodes remain unresolved, likely due to periods of rapid diversification. Resolving these remaining groups will likely require sequencing a very large number of gene regions, but our analyses now appear to support a robust backbone for this order. PMID:24795852

  17. GIbPSs: a toolkit for fast and accurate analyses of genotyping-by-sequencing data without a reference genome.

    PubMed

    Hapke, A; Thiele, D

    2016-07-01

    Genotyping-by-sequencing (GBS) and related methods are increasingly used for studies of non-model organisms from population genetic to phylogenetic scales. We present GIbPSs, a new genotyping toolkit for the analysis of data from various protocols such as RAD, double-digest RAD, GBS, and two-enzyme GBS without a reference genome. GIbPSs can handle paired-end GBS data and is able to assign reads from both strands of a restriction fragment to the same locus. GIbPSs is most suitable for population genetic and phylogeographic analyses. It avoids genotyping errors due to indel variation by identifying and discarding affected loci. GIbPSs creates a genotype database that offers rich functionality for data filtering and export in numerous formats. We performed comparative analyses of simulated and real GBS data with GIbPSs and another program, pyRAD. This program accounts for indel variation by aligning homologous sequences. GIbPSs performed better than pyRAD in several aspects. It required much less computation time and displayed higher genotyping accuracy. GIbPSs retained smaller numbers of loci overall in analyses of real GBS data. It nevertheless delivered more complete genotype matrices with greater locus overlap between individuals and greater numbers of loci sampled in all individuals. PMID:26858004

  18. Reprint of "Sequence and phylogenetic analyses of novel totivirus-like double-stranded RNAs from field-collected powdery mildew fungi".

    PubMed

    Kondo, Hideki; Hisano, Sakae; Chiba, Sotaro; Maruyama, Kazuyuki; Andika, Ida Bagus; Toyoda, Kazuhiro; Fujimori, Fumihiro; Suzuki, Nobuhiro

    2016-07-01

    The identification of mycoviruses contributes greatly to understanding of the diversity and evolutionary aspects of viruses. Powdery mildew fungi are important and widely studied obligate phytopathogenic agents, but there has been no report on mycoviruses infecting these fungi. In this study, we used a deep sequencing approach to analyze the double-stranded RNA (dsRNA) segments isolated from field-collected samples of powdery mildew fungus-infected red clover plants in Japan. Database searches identified the presence of at least ten totivirus (genus Totivirus)-like sequences, termed red clover powdery mildew-associated totiviruses (RPaTVs). The majority of these sequences shared moderate amino acid sequence identity with each other (<44%) and with other known totiviruses (<59%). Nine of these identified sequences (RPaTV1a, 1b and 2-8) resembled the genome of the prototype totivirus, Saccharomyces cerevisiae virus-L-A (ScV-L-A) in that they contained two overlapping open reading frames (ORFs) encoding a putative coat protein (CP) and an RNA dependent RNA polymerase (RdRp), while one sequence (RPaTV9) showed similarity to another totivirus, Ustilago maydis virus H1 (UmV-H1) that encodes a single polyprotein (CP-RdRp fusion). Similar to yeast totiviruses, each ScV-L-A-like RPaTV contains a -1 ribosomal frameshift site downstream of a predicted pseudoknot structure in the overlapping region of these ORFs, suggesting that the RdRp is translated as a CP-RdRp fusion. Moreover, several ScV-L-A-like sequences were also found by searches of the transcriptome shotgun assembly (TSA) libraries from rust fungi, plants and insects. Phylogenetic analyses show that nine ScV-L-A-like RPaTVs along with ScV-L-A-like sequences derived from TSA libraries are clustered with most established members of the genus Totivirus, while one RPaTV forms a new distinct clade with UmV-H1, possibly establishing an additional genus in the family. Taken together, our results indicate the presence of

  19. Genomic sequencing and analyses of HearMNPV—a new Multinucleocapsid nucleopolyhedrovirus isolated from Helicoverpa armigera

    PubMed Central

    2012-01-01

    Background HearMNPV, a nucleopolyhedrovirus (NPV) that infects the cotton bollworm, Helicoverpa armigera, comprises multiple rod-shaped nucleocapsids in the virion (as detected by electron microscopy). HearMNPV shows a different host range compared with H. armigera single-nucleocapsid NPV (HearSNPV). To better understand HearMNPV, the HearMNPV genome was sequenced and analyzed. Methods The morphology of HearMNPV was observed by electron microscopy. qPCR was used to determine the replication kinetics of HearMNPV in H. armigera in vivo. A random genomic library of HearMNPV was constructed according to the “partial filling-in” method, and the sequence and organization of the HearMNPV genome were analyzed and compared with sequence data from other baculoviruses. Results Real time qPCR showed that HearMNPV DNA replication included a decreasing phase, a latent phase, an exponential phase, and a stationary phase during infection of H. armigera. The HearMNPV genome consists of 154,196 base pairs, with a G + C content of 40.07%. A total of 162 putative ORFs were detected in the HearMNPV genome, which represented 90.16% of the genome. The remaining 9.84% constitutes four homologous regions and other non-coding regions. The gene content and gene arrangement in HearMNPV were most similar to those of Mamestra configurata NPV-B (MacoNPV-B), but were different from those of HearSNPV. Comparison of the genomes of HearMNPV and MacoNPV-B suggested that HearMNPV has a deletion of a 5.4-kb fragment containing five ORFs. In addition, HearMNPV orf66, bro genes, and hrs are different from the corresponding parts of the MacoNPV-B genome. Conclusions HearMNPV can replicate in vivo in H. armigera and in vitro, and is a new NPV isolate distinguished from HearSNPV. HearMNPV is most closely related to MacoNPV-B, but has a distinct genomic structure, content, and organization. PMID:22913743

  20. Novel evolutionary lineages in Labeobarbus (Cypriniformes; Cyprinidae) based on phylogenetic analyses of mtDNA sequences.

    PubMed

    Beshera, Kebede A; Harris, Phillip M; Mayden, Richard L

    2016-01-01

    Phylogenetic relationships within Labeobarbus, the large-sized hexaploid cyprinids, were examined using cytochrome b gene sequences from a broad range of geographic localities and multiple taxa. Maximum likelihood and Bayesian methods revealed novel lineages from previously unsampled drainages in central (Congo River), eastern (Genale River) and southeastern (Revue and Mussapa Grande rivers) Africa. Relationships of some species of Varicorhinus in Africa (excluding 'V.' maroccanus) render Labeobarbus paraphyletic. 'Varicorhinus' beso, 'V.' jubae, 'V.' mariae, 'V.' nelspruitensis, and 'V.' steindachneri are transferred to Labeobarbus. Bayesian estimation of time to most recent common ancestor suggested that Labeobarbus originated in the Late Miocene while lineage diversification began during the Late Miocene-Early Pliocene and continued to the late Pleistocene. The relationships presented herein provide phylogenetic resolution within Labeobarbus, advance our knowledge of genetic diversity within the lineage, and provide some interesting insight into the hydrographic and geologic history of Africa. PMID:27394501

  1. Cytogenetic and Sequence Analyses of Mitochondrial DNA Insertions in Nuclear Chromosomes of Maize

    PubMed Central

    Lough, Ashley N.; Faries, Kaitlyn M.; Koo, Dal-Hoe; Hussain, Abid; Roark, Leah M.; Langewisch, Tiffany L.; Backes, Teresa; Kremling, Karl A. G.; Jiang, Jiming; Birchler, James A.; Newton, Kathleen J.

    2015-01-01

    The transfer of mitochondrial DNA (mtDNA) into nuclear genomes is a regularly occurring process that has been observed in many species. Few studies, however, have focused on the variation of nuclear-mtDNA sequences (NUMTs) within a species. This study examined mtDNA insertions within chromosomes of a diverse set of Zea mays ssp. mays (maize) inbred lines by the use of fluorescence in situ hybridization. A relatively large NUMT on the long arm of chromosome 9 (9L) was identified at approximately the same position in four inbred lines (B73, M825, HP301, and Oh7B). Further examination of the similarly positioned 9L NUMT in two lines, B73 and M825, indicated that the large size of these sites is due to the presence of a majority of the mitochondrial genome; however, only portions of this NUMT (∼252 kb total) were found in the publicly available B73 nuclear sequence for chromosome 9. Fiber-fluorescence in situ hybridization analysis estimated the size of the B73 9L NUMT to be ∼1.8 Mb and revealed that the NUMT is methylated. Two regions of mtDNA (2.4 kb and 3.3 kb) within the 9L NUMT are not present in the B73 mitochondrial NB genome; however, these 2.4-kb and 3.3-kb segments are present in other Zea mitochondrial genomes, including that of Zea mays ssp. parviglumis, a progenitor of domesticated maize. PMID:26333837

  2. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1997-01-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.

  3. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1997-04-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.

  4. Genome-Wide Linkage, Exome Sequencing and Functional Analyses Identify ABCB6 as the Pathogenic Gene of Dyschromatosis Universalis Hereditaria

    PubMed Central

    Wang, Na; Wang, Chuan; Chen, Xuechao; Sheng, Donglai; Fu, Xi’an; See, Kelvin; Foo, Jia Nee; Low, Huiqi; Liany, Herty; Irwan, Ishak Darryl; Liu, Jian; Yang, Baoqi; Chen, Mingfei; Yu, Yongxiang; Yu, Gongqi; Niu, Guiye; You, Jiabao; Zhou, Yan; Ma, Shanshan; Wang, Ting; Yan, Xiaoxiao; Goh, Boon Kee; Common, John E. A.; Lane, Birgitte E.; Sun, Yonghu; Zhou, Guizhi; Lu, Xianmei; Wang, Zhenhua; Tian, Hongqing; Cao, Yuanhua; Chen, Shumin; Liu, Qiji; Liu, Jianjun; Zhang, Furen

    2014-01-01

    Background As a genetic disorder of abnormal pigmentation, the molecular basis of dyschromatosis universalis hereditaria (DUH) had remained unclear until recently when ABCB6 was reported as a causative gene of DUH. Methodology We performed a genome-wide linkage scan using Illumina Human 660W-Quad BeadChip and exome sequencing analyses using Agilent SureSelect Human All Exon Kits in a multiplex Chinese DUH family to identify the pathogenic mutations and verified the candidate mutations using Sanger sequencing. Quantitative RT-PCR and immunohistochemistry were performed to verify the expression of the pathogenic gene, and zebrafish were used to confirm the functional role of ABCB6 in melanocytes and pigmentation. Results Genome-wide linkage (assuming autosomal dominant inheritance mode) and exome sequencing analyses identified ABCB6 as the disease candidate gene by discovering a coding mutation (c.1358C>T; p.Ala453Val) that co-segregates with the disease phenotype. Further mutation analysis of ABCB6 in four other DUH families and two sporadic cases by Sanger sequencing confirmed the mutation (c.1358C>T; p.Ala453Val) and discovered a second, co-segregating coding mutation (c.964A>C; p.Ser322Lys) in one of the four families. Both mutations were heterozygous in DUH patients and were not present in the 1000 Genomes Project and dbSNP databases or in 1,516 unrelated healthy Chinese controls. Expression analysis in human skin and mutagenesis interrogation in zebrafish confirmed the functional role of ABCB6 in melanocytes and pigmentation. Given the involvement of ABCB6 mutations in coloboma, we performed ophthalmological examination of the DUH carriers of ABCB6 mutations and found ocular abnormalities in them. Conclusion Our study has advanced our understanding of DUH pathogenesis and revealed the shared pathological mechanism between pigmentary DUH and ocular coloboma. PMID:24498303

  5. [Sequencing Analyses of the Hypervariable Region within the VP2 Gene of a Strain of the Aleutian Mink Disease Virus].

    PubMed

    Zhang, Lei; Hu, Bo; Bai, Xue; Zhang, Hailing; Zhao, Jianjun; Wang, Zhenjun; Ma, Fanshu; Yan, Xijun; Wu, Wei; Xu, Shujuan

    2015-05-01

    To analyze the molecular mechanisms of cross-host transmission of the Aleutian mink disease virus (ADV), the hypervariable region fragment of the VP2 gene of the ADV in Jilin Province (China) was amplified. Sequencing analyses showed diversity at residue 174 by comparison with other VP2 genes in GenBank. The phylogenetic tree indicated that the ADV-JL strain had a close relationship with the highly pathogenic strain from Denmark: ADV-K. Results implied that residue 174 may be associated with ADV infectivity. PMID:26470526

  6. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences]

    NASA Technical Reports Server (NTRS)

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.
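
    The two randomness measures named in this record (unequal residue frequencies and unequal neighbouring-pair frequencies) can be illustrated with a short Python sketch. The formulas below use the generic information-theoretic forms, D1 = log2(20) - H1 and D2 = H1 - H(next | previous), which is an assumption about how such measures are usually written rather than a reproduction of the report's calculations. The function names and the toy input are hypothetical.

```python
import math
from collections import Counter

ALPHABET_SIZE = 20  # the 20 standard amino acids


def entropy(counts):
    """Shannon entropy in bits of the distribution given by a Counter."""
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def redundancy_measures(seq):
    """Return D1, D2 and the overall redundancy R for one protein sequence.

    D1: divergence from equal residue frequencies.
    D2: divergence from independence between neighbouring residues.
    R : (D1 + D2) / log2(20), between 0 (random) and 1 (fully determined).
    """
    h_max = math.log2(ALPHABET_SIZE)
    h1 = entropy(Counter(seq))
    # Conditional entropy H(next | previous) = H(pair) - H(first of pair).
    h_pair = entropy(Counter(zip(seq, seq[1:])))
    h_first = entropy(Counter(seq[:-1]))
    h_cond = h_pair - h_first
    d1 = h_max - h1
    d2 = h1 - h_cond
    return {"D1": d1, "D2": d2, "R": (d1 + d2) / h_max}


# A chain that uses every residue exactly once has D1 close to 0, yet its
# pair statistics look fully determined: a short-chain artefact of the kind
# the abstract attributes to limited chain length.
short_chain = redundancy_measures("ACDEFGHIKLMNPQRSTVWY")
```

    Comparing such values against those of shuffled sequences of the same length, in the spirit of the Monte Carlo baseline described in the record, is one way to separate genuine redundancy from length effects.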

  7. RAPD and Internal Transcribed Spacer Sequence Analyses Reveal Zea nicaraguensis as a Section Luxuriantes Species Close to Zea luxurians

    PubMed Central

    Wang, Pei; Lu, Yanli; Zheng, Mingmin; Rong, Tingzhao; Tang, Qilin

    2011-01-01

    Genetic relationship of a newly discovered teosinte from Nicaragua, Zea nicaraguensis with waterlogging tolerance, was determined based on randomly amplified polymorphic DNA (RAPD) markers and the internal transcribed spacer (ITS) sequences of nuclear ribosomal DNA using 14 accessions from Zea species. RAPD analysis showed that a total of 5,303 fragments were produced by 136 random decamer primers, of which 84.86% bands were polymorphic. RAPD-based UPGMA analysis demonstrated that the genus Zea can be divided into section Luxuriantes including Zea diploperennis, Zea luxurians, Zea perennis and Zea nicaraguensis, and section Zea including Zea mays ssp. mexicana, Zea mays ssp. parviglumis, Zea mays ssp. huehuetenangensis and Zea mays ssp. mays. ITS sequence analysis showed the lengths of the entire ITS region of the 14 taxa in Zea varied from 597 to 605 bp. The average GC content was 67.8%. In addition to the insertion/deletions, 78 variable sites were recorded in the total ITS region with 47 in ITS1, 5 in 5.8S, and 26 in ITS2. Sequences of these taxa were analyzed with neighbor-joining (NJ) and maximum parsimony (MP) methods to construct the phylogenetic trees, selecting Tripsacum dactyloides L. as the outgroup. The phylogenetic relationships of Zea species inferred from the ITS sequences are highly concordant with the RAPD evidence that resolved two major subgenus clades. Both RAPD and ITS sequence analyses indicate that Zea nicaraguensis is more closely related to Zea luxurians than the other teosintes and cultivated maize, which should be regarded as a section Luxuriantes species. PMID:21525982
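
    Two of the summary statistics reported in this record, GC content and the number of variable sites in an alignment, are simple to compute. The sketch below is illustrative only: the aligned sequences are invented toy data, not the ITS sequences from the study, and the helper names are hypothetical. Columns containing gaps are skipped, on the assumption that insertions/deletions are tallied separately, as the abstract suggests.

```python
def gc_content(seq):
    """Percentage of G and C among the unambiguous bases of a sequence."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


def variable_sites(alignment):
    """Zero-based indices of gap-free alignment columns with more than one base."""
    sites = []
    for i, column in enumerate(zip(*alignment)):
        if "-" in column:
            continue  # indel columns are counted separately (assumption)
        if len(set(column)) > 1:
            sites.append(i)
    return sites


# Toy alignment of three equal-length sequences (hypothetical data).
toy = ["GCGCATGC", "GCGTATGC", "GC-CATGG"]
toy_gc = gc_content(toy[0])      # 75.0
toy_sites = variable_sites(toy)  # [3, 7]
```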

  8. Conversion of amino-acid sequence in proteins to classical music: search for auditory patterns

    PubMed Central

    2007-01-01

    We have converted genome-encoded protein sequences into musical notes to reveal auditory patterns without compromising musicality. We derived a reduced range of 13 base notes by pairing similar amino acids and distinguishing them using variations of three-note chords and codon distribution to dictate rhythm. The conversion will help make genomic coding sequences more approachable for the general public, young children, and vision-impaired scientists. PMID:17477882

  9. Protein location prediction using atomic composition and global features of the amino acid sequence

    SciTech Connect

    Cherian, Betsy Sheena; Nair, Achuthsankar S.

    2010-01-22

    The subcellular location of a protein is useful information for determining its function, screening for drug candidates, designing vaccines, annotating gene products and selecting relevant proteins for further study. Computational prediction of subcellular localization deals with predicting the location of a protein from its amino acid sequence. For a computational localization prediction method to be more accurate, it should exploit all relevant biological features that contribute to subcellular localization. In this work, we extracted the biological features from the full-length protein sequence to incorporate more biological information. A new biological feature, the distribution of atomic composition, is used together with multiple physicochemical properties, amino acid composition, three-part amino acid composition, and sequence similarity to predict the subcellular location of the protein. Support Vector Machines are designed for four modules and the prediction is made by a weighted voting system. Our system predicts with accuracies of 100%, 82.47% and 88.81% for the self-consistency test, jackknife test and independent data test, respectively. Our results provide evidence that prediction based on biological features derived from the full-length amino acid sequence gives better accuracy than prediction based on features derived from the N-terminal region alone. Treating the features as a distribution over the entire sequence brings out the underlying property distribution in greater detail and so enhances prediction accuracy.
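
    The feature types and the voting step mentioned in this record can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: splitting the sequence into equal thirds for the three-part composition, the module names, and the vote weights are all assumptions made for the example, and the per-module predictions stand in for the outputs of trained classifiers.

```python
from collections import Counter, defaultdict

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(seq):
    """20-dimensional vector of amino acid frequencies."""
    counts = Counter(seq)
    n = len(seq) or 1  # avoid division by zero for an empty segment
    return [counts[a] / n for a in AMINO_ACIDS]


def three_part_composition(seq):
    """Composition of three consecutive segments (equal thirds assumed)."""
    k = len(seq) // 3
    parts = [seq[:k], seq[k:2 * k], seq[2 * k:]]
    return [x for part in parts for x in composition(part)]


def weighted_vote(predictions, weights):
    """Pick the label with the largest summed module weight."""
    scores = defaultdict(float)
    for module, label in predictions.items():
        scores[label] += weights[module]
    return max(scores, key=scores.get)


# Hypothetical outputs of four modules and hypothetical weights.
predictions = {"atomic": "nucleus", "composition": "nucleus",
               "physicochemical": "cytoplasm", "similarity": "cytoplasm"}
weights = {"atomic": 0.1, "composition": 0.2,
           "physicochemical": 0.3, "similarity": 0.4}
final_label = weighted_vote(predictions, weights)
```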

  10. Comprehensive analyses of prostate gene expression: convergence of expressed sequence tag databases, transcript profiling and proteomics.

    PubMed

    Nelson, P S; Han, D; Rochon, Y; Corthals, G L; Lin, B; Monson, A; Nguyen, V; Franza, B R; Plymate, S R; Aebersold, R; Hood, L

    2000-05-01

    Several methods have been developed for the comprehensive analysis of gene expression in complex biological systems. Generally these procedures assess either a portion of the cellular transcriptome or a portion of the cellular proteome. Each approach has distinct conceptual and methodological advantages and disadvantages. We have investigated the application of both methods to characterize the gene expression pathway mediated by androgens and the androgen receptor in prostate cancer cells. This pathway is of critical importance for the development and progression of prostate cancer. Of clinical importance, modulation of androgens remains the mainstay of treatment for patients with advanced disease. To facilitate global gene expression studies we have first sought to define the prostate transcriptome by assembling and annotating prostate-derived expressed sequence tags (ESTs). A total of 55000 prostate ESTs were assembled into a set of 15953 clusters putatively representing 15953 distinct transcripts. These clusters were used to construct cDNA microarrays suitable for examining the androgen-response pathway at the level of transcription. The expression of 20 genes was found to be induced by androgens. This cohort included known androgen-regulated genes such as prostate-specific antigen (PSA) and several novel complementary DNAs (cDNAs). Protein expression profiles of androgen-stimulated prostate cancer cells were generated by two-dimensional electrophoresis (2-DE). Mass spectrometric analysis of androgen-regulated proteins in these cells identified the metastasis-suppressor gene NDKA/nm23, a finding that may explain a marked reduction in metastatic potential when these cells express a functional androgen receptor pathway. PMID:10870968

  11. Evolution and biogeography of Centaurea section Acrocentron inferred from nuclear and plastid DNA sequence analyses

    PubMed Central

    Font, Mònica; Garcia-Jacas, Núria; Vilatersana, Roser; Roquet, Cristina; Susanna, Alfonso

    2009-01-01

    Background and Aims Section Acrocentron of the genus Centaurea is one of the largest sections of Centaurea with approx. 100 species. The geographic distribution, centred in the Mediterranean, makes it an excellent example for studies of the biogeographic history of this biodiversity-rich region. Methods Plastid (trnH-psbA) and nuclear (ITS and ETS) DNA sequence analysis was used for phylogenetic reconstruction. Ancestral biogeographic patterns were inferred by dispersal-vicariance analysis (DIVA). Key Results The resulting phylogeny has implications for the sectional classification of Acrocentron and confirms merging sect. Chamaecyanus into Acrocentron as a subsection. Previous suggestions of an eastern Mediterranean origin of the group are confirmed. The main centres of diversification established in previous studies are now strongly supported. Expansion of the group in two different radiations that followed patently diverse paths is inferred. Conclusions Radiation followed two waves, widely separated in time scale. The oldest one, from Turkey to Greece and the northern Balkans and then to North Africa and Iberia, should be dated at the end of the Miocene in the Messinian period. It reached the Iberian Peninsula from the south, following a route that is landmarked by several relictic taxa in Sicily and North Africa. A later radiation during the Holocene interglacial periods followed, involving species from the north of the Balkan Peninsula, along a Eurasian pathway running from Central Iberia to the steppes of Kazakhstan. A generalized pattern of reticulation is also evident from the results, indicating past contacts between presently separated species. Molecular data also confirmed the extent of hybridization within Acrocentron and were successful in reconstructing the paleogeography of the section. PMID:19228702

  12. Phylogeography and evolution in matsutake and close allies inferred by analyses of ITS sequences and AFLPs.

    PubMed

    Chapela, Ignacio H; Garbelotto, Matteo

    2004-01-01

    Matsutake are commercially important ectomycorrhizal basidiomycetes in the genus Tricholoma. Despite their importance, the systematics of this species complex have remained elusive and little is known about their origin and biogeography. Using DNA analyses on a worldwide sample of matsutake, we present here the first comprehensive definition of natural groupings in this species complex. We infer patterns of migration and propose Eocene origins for the group in western North America by a transfer from an angiosperm-associated ancestor to an increasingly specialized conifer symbiont. From these origins, matsutake appear to have followed migratory routes parallel to those of coniferous hosts. Patterns of vicariance between eastern North America and eastern Asia are resolved and their origins are suggested to stem from migration through Beringia. Using an analysis of genetic dissimilarity and geographical distance, we reject both the possibility that migration into Europe and Asia occurred through Atlantic bridges and the connection between matsutake populations in the Mahgrebi Mountains and those from Europe. Instead, African and European matsutake appear to be the most recent ends of a westward expansion of the domain of these fungi from North America. PMID:21148894

  13. Ab initio detection of fuzzy amino acid tandem repeats in protein sequences

    PubMed Central

    2012-01-01

    Background Tandem repetitions within protein amino acid sequences often correspond to regular secondary structures and form multi-repeat 3D assemblies of varied size and function. Developing internal repetitions is one of the evolutionary mechanisms that proteins employ to adapt their structure and function under evolutionary pressure. While there is keen interest in understanding such phenomena, detection of repeating structures based only on sequence analysis is considered an arduous task, since structure and function are often preserved even under considerable sequence divergence (fuzzy tandem repeats). Results In this paper we present PTRStalker, a new algorithm for ab initio detection of fuzzy tandem repeats in protein amino acid sequences. In the reported results we show that by feeding PTRStalker with amino acid sequences from the UniProtKB/Swiss-Prot database we detect novel tandemly repeated structures not captured by other state-of-the-art tools. Experiments with membrane proteins indicate that PTRStalker can detect global symmetries in the primary structure which are then reflected in the tertiary structure. Conclusions PTRStalker is able to detect fuzzy tandem repeating structures in protein sequences, with performance beyond the current state of the art. Such a tool may be a valuable support to investigating protein structural properties when tertiary X-ray data are not available. PMID:22536906
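
    To make the notion of a fuzzy tandem repeat concrete, the sketch below scans a sequence for adjacent windows of a fixed period that differ by at most a given number of substitutions. It is a deliberately naive illustration and is not the PTRStalker algorithm, whose details are not given in this record; the function names, parameters and toy sequence are hypothetical, and insertions or deletions between copies are not handled.

```python
def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def fuzzy_tandem_repeats(seq, period, max_mismatch, min_copies=2):
    """Find runs of consecutive `period`-length copies.

    Each copy may differ from the previous one by up to `max_mismatch`
    substitutions. Returns a list of (start, period, copies) tuples.
    """
    hits = []
    i = 0
    while i + 2 * period <= len(seq):
        copies, j = 1, i
        # Extend the run while the next window still resembles the current one.
        while (j + 2 * period <= len(seq)
               and hamming(seq[j:j + period],
                           seq[j + period:j + 2 * period]) <= max_mismatch):
            copies += 1
            j += period
        if copies >= min_copies:
            hits.append((i, period, copies))
            i = j + period  # continue after the reported run
        else:
            i += 1
    return hits


# Three diverged copies of a five-residue unit followed by unrelated residues.
toy = "GAVLT" + "GAVLS" + "GTVLS" + "WWWWW"
toy_hits = fuzzy_tandem_repeats(toy, period=5, max_mismatch=1)
```

    Because each copy is compared only with its neighbour, the first and last copies of a run may differ by more than `max_mismatch`, and a scan of this kind can also report a phase-shifted start when flanking residues happen to fit the tolerance.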

  14. The amino-acid sequence of leghemoglobin component a from Phaseolus vulgaris (kidney bean).

    PubMed

    Lehtovaara, P; Ellfolk, N

    1975-06-01

    1. Leghemoglobin component a from Phaseolus vulgaris (kidney bean) was digested with trypsin; 15 tryptic peptides and free lysine were purified and the amino acid sequences of the peptides determined. 2. The internal order of the tryptic peptides was determined by the bridge peptides obtained from the thermolytic digest and the dilute acid hydrolyzate of kidney bean leghemoglobin a; 12 thermolytic peptides and two acid hydrolysis peptides were purified and the sequences were partially or completely determined. 3. The complete amino acid sequence of kidney bean leghemoglobin a is compared to that of leghemoglobin a from soybean (Glycine max) and to some animal globins. As regards sequence, the kidney bean globin has 79% identity with the soybean globin and 21% identity with human hemoglobin gamma-chain. Seven of the 14 amino acid residues common to most globins are found in the kidney bean globin. Trp-15 and Tyr-145 are evolutionarily conserved in this globin, which confirms the concept of a common origin of animal and plant globins. PMID:809270
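
    Identity figures such as those quoted in this record are obtained by comparing aligned sequences position by position. The sketch below shows one common convention (identical positions divided by the number of gap-free aligned positions); the abstract does not state which denominator was used, so this is an assumption, and the function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over gap-free columns of two aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep only columns where neither sequence has a gap.
    compared = [(x, y) for x, y in zip(aligned_a, aligned_b)
                if x != "-" and y != "-"]
    identical = sum(x == y for x, y in compared)
    return 100.0 * identical / len(compared)
```

    For example, `percent_identity("GALTE-SQ", "GALSEKSQ")` compares seven gap-free columns, six of which are identical.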

  15. Organic Analysis in the Miller Range 090657 CR2 Chondrite: Part 2 Amino Acid Analyses

    NASA Technical Reports Server (NTRS)

    Burton, A. S.; Cao, T.; Nakamura-Messenger, K.; Berger, E. L.; Messenger, S.; Clemett, S. J.; Aponte, J. C.; Elsila, J. E.

    2016-01-01

    Primitive carbonaceous chondrites contain a wide variety of organic material, ranging from soluble discrete molecules to insoluble, unstructured kerogen-like components, as well as structured nano-globules of macromolecular carbon. The relationships between the soluble organic molecules, macromolecular organic material, and host minerals are poorly understood. Due to the differences in extractability of soluble and insoluble organic materials, the analysis methods for each differ and are often performed independently. The combination of soluble and insoluble analyses, when performed concurrently, can provide a wider understanding of the spatial distribution and the elemental, structural and isotopic composition of organic material in primitive meteorites. Using macroscale extraction and analysis techniques in combination with in situ microscale observation, we have been studying both insoluble and soluble organic material in the primitive CR2 chondrite Miller Range (MIL) 090657. In accompanying abstracts (Cao et al. and Messenger et al.) we discuss insoluble organic material in the samples. By performing the consortium studies, we aim to improve our understanding of the relationship between the meteorite minerals and the soluble and insoluble organic phases and to delineate which species formed within the meteorite and which formed in nebular or presolar environments. In this abstract, we present the results of amino acid analyses of MIL 090657 by ultra performance liquid chromatography with fluorescence detection and quadrupole-time of flight mass spectrometry. Amino acids are of interest because they are essential to life on Earth, and because they are present in sufficient structural, enantiomeric and isotopic diversity to allow insights into early solar system chemical processes. Furthermore, these are among the most isotopically anomalous species, yet at least some fraction are thought to have formed by aqueously-mediated processes during parent body alteration.

  16. NCI-60 Whole Exome Sequencing and Pharmacological CellMiner Analyses

    PubMed Central

    Reinhold, William C.; Varma, Sudhir; Sousa, Fabricio; Sunshine, Margot; Abaan, Ogan D.; Davis, Sean R.; Reinhold, Spencer W.; Kohn, Kurt W.; Morris, Joel; Meltzer, Paul S.; Doroshow, James H.; Pommier, Yves

    2014-01-01

    Exome sequencing provides unprecedented insights into cancer biology and pharmacological response. Here we assess these two parameters for the NCI-60, which is among the richest genomic and pharmacological publicly available cancer cell line databases. Homozygous genetic variants that putatively affect protein function were identified in 1,199 genes (approximately 6% of all genes). Variants that are either enriched or depleted compared to non-cancerous genomes, and thus may be influential in cancer progression and differential drug response were identified for 2,546 genes. Potential gene knockouts are made available. Assessment of cell line response to 19,940 compounds, including 110 FDA-approved drugs, reveals ≈80-fold range in resistance versus sensitivity response across cell lines. 103,422 gene variants were significantly correlated with at least one compound (at p<0.0002). These include genes of known pharmacological importance such as IGF1R, BRAF, RAD52, MTOR, STAT2 and TSC2 as well as a large number of candidate genes such as NOM1, TLL2, and XDH. We introduce two new web-based CellMiner applications that enable exploration of variant-to-compound relationships for a broad range of researchers, especially those without bioinformatics support. The first tool, “Genetic variant versus drug visualization”, provides a visualization of significant correlations between drug activity-gene variant combinations. Examples are given for the known vemurafenib-BRAF, and novel ifosfamide-RAD52 pairings. The second, “Genetic variant summation” allows an assessment of cumulative genetic variations for up to 150 combined genes together; and is designed to identify the variant burden for molecular pathways or functional grouping of genes. An example of its use is provided for the EGFR-ERBB2 pathway gene variant data and the identification of correlated EGFR, ERBB2, MTOR, BRAF, MEK and ERK inhibitors. The new tools are implemented as an updated web-based Cell

  17. Draft genome sequence of the docosahexaenoic acid producing thraustochytrid Aurantiochytrium sp. T66.

    PubMed

    Liu, Bin; Ertesvåg, Helga; Aasen, Inga Marie; Vadstein, Olav; Brautaset, Trygve; Heggeset, Tonje Marita Bjerkan

    2016-06-01

    Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA). Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276), with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data have been deposited at DDBJ/EMBL/GenBank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids. PMID:27222814

  18. A classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B

    1991-01-01

    The amino acid sequences of 301 glycosyl hydrolases and related enzymes have been compared. A total of 291 sequences corresponding to 39 EC entries could be classified into 35 families. Only ten sequences (less than 5% of the sample) could not be assigned to any family. With the sequences available for this analysis, 18 families were found to be monospecific (containing only one EC number) and 17 were found to be polyspecific (containing at least two EC numbers). Implications on the folding characteristics and mechanism of action of these enzymes and on the evolution of carbohydrate metabolism are discussed. With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised. PMID:1747104

  19. New families in the classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B; Bairoch, A

    1993-01-01

    301 glycosyl hydrolases and related enzymes corresponding to 39 EC entries of the I.U.B. classification system have been classified into 35 families on the basis of amino-acid-sequence similarities [Henrissat (1991) Biochem. J. 280, 309-316]. Approximately half of the families were found to be monospecific (containing only one EC number), whereas the other half were found to be polyspecific (containing at least two EC numbers). A > 60% increase in sequence data for glycosyl hydrolases (181 additional enzymes or enzyme domains sequences have since become available) allowed us to update the classification not only by the addition of more members to already identified families, but also by the finding of ten new families. On the basis of a comparison of 482 sequences corresponding to 52 EC entries, 45 families, out of which 22 are polyspecific, can now be defined. This classification has been implemented in the SWISS-PROT protein sequence data bank. PMID:8352747

  20. Sequence-specific purification of nucleic acids by PNA-controlled hybrid selection.

    PubMed

    Orum, H; Nielsen, P E; Jørgensen, M; Larsson, C; Stanley, C; Koch, T

    1995-09-01

    Using an oligohistidine peptide nucleic acid (oligohistidine-PNA) chimera, we have developed a rapid hybrid selection method that allows efficient, sequence-specific purification of a target nucleic acid. The method exploits two fundamental features of PNA: first, PNA binds with high affinity and specificity to its complementary nucleic acid; second, amino acids are easily attached to the PNA oligomer during synthesis. We show that a (His)6-PNA chimera exhibits strong binding to chelated Ni2+ ions without compromising its native PNA hybridization properties. We further show that these characteristics allow the (His)6-PNA/DNA complex to be purified by the well-established method of metal ion affinity chromatography using a Ni2+-NTA (nitrilotriacetic acid) resin. Specificity and efficiency are the touchstones of any nucleic acid purification scheme. We show that the specificity of the (His)6-PNA selection approach is such that oligonucleotides differing by only a single nucleotide can be selectively purified. We also show that large RNAs (2224 nucleotides) can be captured with high efficiency by using multiple (His)6-PNA probes. PNA can hybridize to nucleic acids in low-salt concentrations that destabilize native nucleic acid structures. We demonstrate that this property of PNA can be utilized to purify an oligonucleotide in which the target sequence forms part of an intramolecular stem/loop structure. PMID:7495562

  1. HPLC and ELISA analyses of larval bile acids from Pacific and western brook lampreys

    USGS Publications Warehouse

    Yun, S.-S.; Scott, A.P.; Bayer, J.M.; Seelye, J.G.; Close, D.A.; Li, W.

    2003-01-01

    Comparative studies were performed on two native lamprey species from the Pacific coast, Pacific lamprey (Lampetra tridentata) and western brook lamprey (Lampetra richardsoni), along with sea lamprey (Petromyzon marinus) from the Great Lakes, to investigate their bile acid production and release. HPLC and ELISA analyses of the gall bladders and liver extract revealed that the major bile acid compound from Pacific and western brook larval lampreys was petromyzonol sulfate (PZS), previously identified as a migratory pheromone in larval sea lamprey. An ELISA for PZS has been developed with a working range of 20 pg to 10 ng per well. The tissue concentrations of PZS in gall bladder were 127.40, 145.86, and 276.96 μg/g body mass in sea lamprey, Pacific lamprey, and western brook lamprey, respectively. Release rates of PZS in the three species, measured by ELISA, showed that western brook and sea lampreys released PZS at rates 20 times higher than Pacific lamprey did. Further studies are required to determine whether PZS is a chemical cue in Pacific and western brook lampreys. © 2003 Elsevier Inc. All rights reserved.

  2. Stable Isotope and Signature Fatty Acid Analyses Suggest Reef Manta Rays Feed on Demersal Zooplankton

    PubMed Central

    Couturier, Lydie I. E.; Rohner, Christoph A.; Richardson, Anthony J.; Marshall, Andrea D.; Jaine, Fabrice R. A.; Bennett, Michael B.; Townsend, Kathy A.; Weeks, Scarla J.; Nichols, Peter D.

    2013-01-01

    Assessing the trophic role and interaction of an animal is key to understanding its general ecology and dynamics. Conventional techniques used to elucidate diet, such as stomach content analysis, are not suitable for large threatened marine species. Non-lethal sampling combined with biochemical methods provides a practical alternative for investigating the feeding ecology of these species. Stable isotope and signature fatty acid analyses of muscle tissue were used for the first time to examine assimilated diet of the reef manta ray Manta alfredi, and were compared with different zooplankton functional groups (i.e. near-surface zooplankton collected during manta ray feeding events and non-feeding periods, epipelagic zooplankton, demersal zooplankton and several different zooplankton taxa). Stable isotope δ15N values confirmed that the reef manta ray is a secondary consumer. This species had relatively high levels of docosahexaenoic acid (DHA) indicating a flagellate-based food source in the diet, which likely reflects feeding on DHA-rich near-surface and epipelagic zooplankton. However, high levels of ω6 polyunsaturated fatty acids and slightly enriched δ13C values in reef manta ray tissue suggest that they do not feed solely on pelagic zooplankton, but rather obtain part of their diet from another origin. The closest match was with demersal zooplankton, suggesting it is an important component of the reef manta ray diet. The ability to feed on demersal zooplankton is likely linked to the horizontal and vertical movement patterns of this giant planktivore. These new insights into the habitat use and feeding ecology of the reef manta ray will assist in the effective evaluation of its conservation needs. PMID:24167562

  3. Antibody-specific model of amino acid substitution for immunological inferences from alignments of antibody sequences.

    PubMed

    Mirsky, Alexander; Kazandjian, Linda; Anisimova, Maria

    2015-03-01

    Antibodies are glycoproteins produced by the immune system as a dynamically adaptive line of defense against invading pathogens. Very elegant and specific mutational mechanisms allow B lymphocytes to produce a large and diversified repertoire of antibodies, which is modified and enhanced throughout adulthood. One of these mechanisms is somatic hypermutation, which stochastically mutates nucleotides in the antibody genes, forming new sequences with different properties and, eventually, higher affinity and selectivity to the pathogenic target. As somatic hypermutation involves fast mutation of antibody sequences, this process can be described using a Markov substitution model of molecular evolution. Here, using large sets of antibody sequences from mice and humans, we infer an empirical amino acid substitution model AB, which is specific to antibody sequences. Compared with existing general amino acid models, we show that the AB model provides a significantly better description of the somatic evolution of mouse and human antibody sequences, as demonstrated on large next generation sequencing (NGS) antibody data. General amino acid models are reflective of conservation at the protein level due to functional constraints, with the most frequent amino acid exchanges taking place between residues with the same or similar physicochemical properties. In contrast, within the variable part of antibody sequences we observed an elevated frequency of exchanges between amino acids with distinct physicochemical properties. This is indicative of a sui generis mutational mechanism, specific to antibody somatic hypermutation. We illustrate this property of antibody sequences by a comparative analysis of the network modularity implied by the AB model and general amino acid substitution models. We recommend using the new model for computational studies of antibody sequence maturation, including inference of alignments and phylogenetic trees describing antibody somatic hypermutation in

  4. Comparative molecular cytogenetic analyses of a major tandemly repeated DNA family and retrotransposon sequences in cultivated jute Corchorus species (Malvaceae)

    PubMed Central

    Begum, Rabeya; Zakrzewski, Falk; Menzel, Gerhard; Weber, Beatrice; Alam, Sheikh Shamimul; Schmidt, Thomas

    2013-01-01

    Background and Aims The cultivated jute species Corchorus olitorius and Corchorus capsularis are important fibre crops. The analysis of repetitive DNA sequences, comprising a major part of plant genomes, has not been carried out in jute but is useful to investigate the long-range organization of chromosomes. The aim of this study was the identification of repetitive DNA sequences to facilitate comparative molecular and cytogenetic studies of two jute cultivars and to develop a fluorescent in situ hybridization (FISH) karyotype for chromosome identification. Methods A plasmid library was generated from C. olitorius and C. capsularis with genomic restriction fragments of 100–500 bp, which was complemented by targeted cloning of satellite DNA by PCR. The diversity of the repetitive DNA families was analysed comparatively. The genomic abundance and chromosomal localization of different repeat classes were investigated by Southern analysis and FISH, respectively. The cytosine methylation of satellite arrays was studied by immunolabelling. Key Results Major satellite repeats and retrotransposons have been identified from C. olitorius and C. capsularis. The satellite family CoSat I forms two undermethylated species-specific subfamilies, while the long terminal repeat (LTR) retrotransposons CoRetro I and CoRetro II show similarity to the Metaviridae of plant retroelements. FISH karyotypes were developed by multicolour FISH using these repetitive DNA sequences in combination with 5S and 18S–5·8S–25S rRNA genes, which enable unequivocal chromosome discrimination in both jute species. Conclusions The analysis of the structure and diversity of the repeated DNA is crucial for genome sequence annotation. The reference karyotypes will be useful for breeding of jute and provide the basis for karyotyping homeologous chromosomes of wild jute species to reveal the genetic and evolutionary relationship between cultivated and wild Corchorus species. PMID:23666888

  5. Analyses of transcriptome sequences reveal multiple ancient large-scale duplication events in the ancestor of Sphagnopsida (Bryophyta).

    PubMed

    Devos, Nicolas; Szövényi, Péter; Weston, David J; Rothfels, Carl J; Johnson, Matthew G; Shaw, A Jonathan

    2016-07-01

    The goal of this research was to investigate whether there has been a whole-genome duplication (WGD) in the ancestry of Sphagnum (peatmoss) or the class Sphagnopsida, and to determine if the timing of any such duplication(s) and patterns of paralog retention could help explain the rapid radiation and current ecological dominance of peatmosses. RNA sequencing (RNA-seq) data were generated for nine taxa in Sphagnopsida (Bryophyta). Analyses of frequency plots for synonymous substitutions per synonymous site (Ks) between paralogous gene pairs and reconciliation of 578 gene trees were conducted to assess evidence of large-scale or genome-wide duplication events in each transcriptome. Both Ks frequency plots and gene tree-based analyses indicate multiple duplication events in the history of the Sphagnopsida. The most recent WGD event predates divergence of Sphagnum from the two other genera of Sphagnopsida. Duplicate retention is highly variable across species, which might be best explained by local adaptation. Our analyses indicate that the last WGD could have been an important factor underlying the diversification of peatmosses and facilitated their rise to ecological dominance in peatlands. The timing of the duplication events and their significance in the evolutionary history of peat mosses are discussed. PMID:26900928

  6. Amino acid sequence of a vitamin K-dependent Ca2+-binding peptide from bovine prothrombin.

    PubMed

    Howard, J B; Fausch, M D

    1975-08-10

    The amino acid sequence of a 31-residue peptide from bovine prothrombin has been determined. This peptide has been shown to contain the vitamin K-dependent modification required for Ca2+ binding (Nelsestuen, G. L., and Suttie, J. W. (1973) Proc. Natl. Acad. Sci. U. S. A. 70, 3366-3370) and the modified amino acid, gamma-carboxyglutamic acid (Nelsestuen, G. L., Zytkovicz, T., and Howard, J. B. (1974) J. Biol. Chem. 249, 6347-6350). The peptide was shown to correspond to residues 12 to 42 of prothrombin. PMID:807581

  7. Complete amino acid sequence of the Mu heavy chain of a human IgM immunoglobulin.

    PubMed

    Putnam, F W; Florent, G; Paul, C; Shinoda, T; Shimizu, A

    1973-10-19

    The amino acid sequence of the mu chain of a human IgM immunoglobulin, including the location of all disulfide bridges and oligosaccharides, has been determined. The homology of the constant regions of immunoglobulin mu, gamma, alpha, and epsilon heavy chains reveals evolutionary relationships and suggests that two genes code for each heavy chain. PMID:4742735

  8. Draft Genome Sequence of the Butyric Acid Producer Clostridium tyrobutyricum Strain CIP I-776 (IFP923)

    PubMed Central

    Clément, Benjamin; Lopes Ferreira, Nicolas

    2016-01-01

    Here, we report the draft genome sequence of Clostridium tyrobutyricum CIP I-776 (IFP923), an efficient producer of butyric acid. The genome consists of a single chromosome of 3.19 Mb and provides useful data concerning the metabolic capacities of the strain. PMID:26941139

  9. Draft Genome Sequence of Perfluorooctane Acid-Degrading Bacterium Pseudomonas parafulva YAB-1

    PubMed Central

    Tang, Chongjian; Peng, Qingjing; Peng, Qingzhong

    2015-01-01

    Pseudomonas parafulva YAB-1, isolated from perfluorinated compound-contaminated soil, can degrade perfluorooctane acid (PFOA). Here, we report the draft genome sequence and annotation of the PFOA-degrading bacterium P. parafulva YAB-1. The data provide the basis to investigate the molecular mechanism of PFOA metabolism. PMID:26337877

  10. The sequence diversity and expression among genes of the folic acid biosynthesis pathway in industrial Saccharomyces strains.

    PubMed

    Goncerzewicz, Anna; Misiewicz, Anna

    2015-01-01

    Folic acid is an important vitamin in human nutrition and its deficiency in pregnant women's diets results in neural tube defects and other neurological damage to the fetus. In adults, deficiency additionally inhibits DNA synthesis, cell division and intestinal absorption. Since this discovery, governments and health organizations worldwide have made recommendations concerning folic acid supplementation of food for women planning to become pregnant. In many countries this has led to the introduction of fortification, where synthetic folic acid is added to flour. It is known that Saccharomyces strains (brewing and bakers' yeast) are among the main producers of folic acid and they can be used as a natural source of this vitamin. Proper selection of the most efficient strains may enhance the folate content in bread, fermented vegetables, dairy products and beer by 100% and may be used in the food industry. The objective of this study was to select the optimal producing yeast strain by determining the differences in nucleotide sequences in the FOL2, FOL3 and DFR1 genes of the folic acid biosynthesis pathway. The Multitemperature Single Strand Conformation Polymorphism (MSSCP) method and further nucleotide sequencing for selected strains were applied to indicate SNPs in selected gene fragments. The RT qPCR technique was also applied to examine relative expression of the FOL3 gene. Furthermore, this is the first time that industrial yeast strains have been analysed with regard to genes of the folic acid biosynthesis pathway. It was observed that a correlation exists between the folic acid amount produced by industrial yeast strains and changes in the nucleotide sequence of the corresponding genes. The most significant changes occur in the DFR1 gene, mostly in the first part, which causes major protein structure modifications in KKP 232, KKP 222 and KKP 277 strains. Our study shows that the large number of SNPs contributes to impairment of the selected enzymes and S. cerevisiae and S

  11. The amino acid sequence of cytochrome c-555 from the methane-oxidizing bacterium Methylococcus capsulatus.

    PubMed Central

    Ambler, R P; Dalton, H; Meyer, T E; Bartsch, R G; Kamen, M D

    1986-01-01

    The amino acid sequence of the cytochrome c-555 from the obligate methanotroph Methylococcus capsulatus strain Bath (N.C.I.B. 11132) was determined. It is a single polypeptide chain of 96 residues, binding a haem group through the cysteine residues at positions 19 and 22, and the only methionine residue is at position 59. The sequence does not closely resemble that of any other cytochrome c that has yet been characterized. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50131 (12 pages) at the British Library Lending Division, Boston Spa, West Yorkshire LS23 7BQ, U.K., from whom copies are available on prepayment. PMID:3006666

  12. Folic acid alone or multivitamin containing folic acid intake during pregnancy and the risk of gestational hypertension and preeclampsia through meta-analyses

    PubMed Central

    Shim, Sang-Min; Yun, Yeo-Ul

    2016-01-01

    Objective The objective of this study was to assess the effect of folic acid and multivitamin use during pregnancy on the risk of developing hypertensive disorders of pregnancy. Methods Two reviewers independently identified all prospective cohort studies, retrospective cohort studies, large population-based cohort studies, retrospective secondary analyses, and double-blinded, placebo-controlled, randomized clinical trials, found using the PubMed Medline database, KERIS (Korea Education and Research Information Service), Scopus, and the Cochrane Central Register of Controlled Trials, that compared oral intake of a multivitamin containing folic acid, or of folic acid alone, from before conception throughout pregnancy. Pooled odds ratios and 95% confidence intervals (CIs) were estimated using random-effects analysis according to the heterogeneity of the studies. Results Six effect sizes from six studies involving 201,661 patients were included. These meta-analyses showed that a multivitamin containing folic acid, or folic acid alone, was not significantly more effective than placebo in reducing the incidence of gestational hypertension or preeclampsia (odds ratio, 0.91; 95% CI, 0.81 to 1.03). The differences in effect sizes for preeclampsia and gestational hypertension according to the two dependent variables, multivitamin and folic acid, were not significant, respectively (point estimate, 0.66; 95% CI, 0.46 to 0.96). Conclusion These meta-analyses demonstrate that a multivitamin containing folic acid, or folic acid alone, was not significantly effective in reducing the incidence of gestational hypertension or preeclampsia. PMID:27004201
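
    The pooled estimate in this record is reported as an odds ratio with a 95% CI. As a rough illustration of how such an interval is usually read, the sketch below back-calculates the log-scale standard error from the reported bounds and checks whether the interval spans 1 (no effect). The function name and the normal approximation (z = 1.96) are assumptions made for illustration, not the authors' analysis.

```python
import math

def summarize_odds_ratio(or_point, ci_low, ci_high, z=1.96):
    """Back-calculate the log-scale SE from a reported 95% CI (normal approximation)."""
    # On the log scale the CI is symmetric: width = 2 * z * SE.
    se_log = (math.log(ci_high) - math.log(ci_low)) / (2 * z)
    # Wald-type z statistic for the null hypothesis OR = 1.
    z_stat = math.log(or_point) / se_log
    return {
        "se_log_or": se_log,
        "z": z_stat,
        "ci_includes_one": ci_low <= 1.0 <= ci_high,
    }

# Values quoted in the abstract: odds ratio 0.91, 95% CI 0.81 to 1.03.
pooled = summarize_odds_ratio(0.91, 0.81, 1.03)
print(pooled)
```

    An interval that includes 1, as here, is consistent with the abstract's statement that the pooled effect was not significant.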

  13. Allelic polymorphism in arabian camel ribonuclease and the amino acid sequence of bactrian camel ribonuclease.

    PubMed

    Welling, G W; Mulder, H; Beintema, J J

    1976-04-01

    Pancreatic ribonucleases from several species (whitetail deer, roe deer, guinea pig, and arabian camel) exhibit more than one amino acid at particular positions in their amino acid sequences. Since these enzymes were isolated from pooled pancreas, the origin of this heterogeneity is not clear. The pancreatic ribonucleases from 11 individual arabian camels (Camelus dromedarius) have been investigated with respect to the lysine-glutamine heterogeneity at position 103 (Welling et al., 1975). Six ribonucleases showed only one basic band and five showed two bands after polyacrylamide gel electrophoresis, suggesting a gene frequency of about 0.75 for the Lys gene and about 0.25 for the Gln gene. The amino acid sequence of bactrian camel (Camelus bactrianus) ribonuclease isolated from individual pancreatic tissue was determined and compared with that of arabian camel ribonuclease. The only difference was observed at position 103. In the ribonucleases from two unrelated bactrian camels, only glutamine was observed at that position. PMID:962846
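
    The gene frequencies quoted in this record can be reproduced by simple allele counting. The sketch below assumes that the six single-band animals are Lys/Lys homozygotes and the five two-band animals are Lys/Gln heterozygotes; that reading is an interpretation of the abstract, and the function name is hypothetical.

```python
def allele_frequencies(n_hom_a, n_het, n_hom_b=0):
    """Return (freq_a, freq_b) for a two-allele locus from genotype counts."""
    total_alleles = 2 * (n_hom_a + n_het + n_hom_b)
    # Each homozygote carries two copies of allele A, each heterozygote one.
    freq_a = (2 * n_hom_a + n_het) / total_alleles
    return freq_a, 1 - freq_a

# Assumed genotypes: 6 Lys/Lys (one band), 5 Lys/Gln (two bands), 0 Gln/Gln.
lys_freq, gln_freq = allele_frequencies(n_hom_a=6, n_het=5)
print(f"Lys: {lys_freq:.3f}, Gln: {gln_freq:.3f}")
```

    Under these assumptions the Lys allele frequency is 17/22 and the Gln allele frequency is 5/22, in line with the "about 0.75" and "about 0.25" given in the abstract.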

  14. Use of a structural alphabet to find compatible folds for amino acid sequences

    PubMed Central

    Mahajan, Swapnil; de Brevern, Alexandre G; Sanejouand, Yves-Henri; Srinivasan, Narayanaswamy; Offmann, Bernard

    2015-01-01

    The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence-search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino-acid sequence of a protein to fit to known protein 3D structures using a structural alphabet, known as “Protein Blocks” (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. It is used to encode 3D protein structures into 1D PB sequences and to capture sequence to structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence-search methods or explicit incorporation of hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z-score cutoff that achieved 95% specificity (i.e., less than 5% false positives), global and local threading showed sensitivity of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in PDB: 38 compatible templates were identified by our approach and 66% of these hits yielded correctly predicted structures. This method scales up well and offers promising perspectives for structural annotations at genomic level. It has been implemented in the form of a web-server that is freely available at http://www.bo-protscience.fr/forsa. PMID:25297700
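
    The scoring idea in this record (one amino acid occurrence matrix per PB, summed along a fold encoded as a PB sequence) can be sketched in a few lines. Everything below is a toy illustration under stated assumptions: the score values, the two PB letters used, the gapless equal-length alignment, and the Z-score taken over a handful of candidate queries are all invented for the example and are not the published matrices or procedure.

```python
from statistics import mean, pstdev

# Hypothetical log-odds-style scores, TOY_SCORES[pb][amino_acid].
# Real PB alphabets have 16 letters (a to p); only two are filled in here.
TOY_SCORES = {
    "m": {"A": 0.8, "L": 0.6, "E": 0.5, "G": -0.9, "P": -1.2},
    "d": {"V": 0.7, "I": 0.6, "T": 0.3, "G": -0.4, "P": -0.8},
}

def threading_score(query, pb_seq, scores=TOY_SCORES):
    """Sum per-position scores for a gapless, equal-length query/fold pairing."""
    if len(query) != len(pb_seq):
        raise ValueError("query and PB sequence must have the same length")
    # Unknown (PB, amino acid) pairs contribute a neutral score of 0.0.
    return sum(scores.get(pb, {}).get(aa, 0.0) for aa, pb in zip(query, pb_seq))

def z_score(value, background):
    """Standardize a score against a background list of scores."""
    sd = pstdev(background)
    return (value - mean(background)) / sd if sd else 0.0

fold = "mmmddd"  # a fold encoded as a PB sequence (toy example)
candidates = ["ALEVIT", "GPGPGP", "EEAGVT"]
raw = {q: threading_score(q, fold) for q in candidates}
z = {q: z_score(s, list(raw.values())) for q, s in raw.items()}
print(raw)
print(z)
```

    In this toy setting a query whose residues match the preferences of each PB position scores highest; a real implementation would also need gap handling, local (windowed) threading, and a properly calibrated background distribution.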

  15. Use of a structural alphabet to find compatible folds for amino acid sequences.

    PubMed

    Mahajan, Swapnil; de Brevern, Alexandre G; Sanejouand, Yves-Henri; Srinivasan, Narayanaswamy; Offmann, Bernard

    2015-01-01

    The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence-search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino-acid sequence of a protein to fit to known protein 3D structures using a structural alphabet, known as "Protein Blocks" (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. It is used to encode 3D protein structures into 1D PB sequences and to capture sequence to structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence-search methods or explicit incorporation of hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z-score cutoff that achieved 95% specificity (i.e., less than 5% false positives), global and local threading showed sensitivity of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in PDB: 38 compatible templates were identified by our approach and 66% of these hits yielded correctly predicted structures. This method scales up well and offers promising perspectives for structural annotations at genomic level. It has been implemented in the form of a web-server that is freely available at http://www.bo-protscience.fr/forsa. PMID:25297700

  16. From Amino Acid to Glucosinolate Biosynthesis: Protein Sequence Changes in the Evolution of Methylthioalkylmalate Synthase in Arabidopsis

    PubMed Central

    de Kraker, Jan-Willem; Gershenzon, Jonathan

    2011-01-01

    Methylthioalkylmalate synthase (MAM) catalyzes the committed step in the side chain elongation of Met, yielding important precursors for glucosinolate biosynthesis in Arabidopsis thaliana and other Brassicaceae species. MAM is believed to have evolved from isopropylmalate synthase (IPMS), an enzyme involved in Leu biosynthesis, based on phylogenetic analyses and an overlap of catalytic abilities. Here, we investigated the changes in protein structure that have occurred during the recruitment of IPMS from amino acid to glucosinolate metabolism. The major sequence difference between IPMS and MAM is the absence of 120 amino acids at the C-terminal end of MAM that constitute a regulatory domain for Leu-mediated feedback inhibition. Truncation of this domain in Arabidopsis IPMS2 results in loss of Leu feedback inhibition and quaternary structure, two features common to MAM enzymes, plus an 8.4-fold increase in the kcat/Km for a MAM substrate. Additional exchange of two amino acids in the active site resulted in a MAM-like enzyme that had little residual IPMS activity. Hence, combination of the loss of the regulatory domain and a few additional amino acid exchanges can explain the evolution of MAM from IPMS during its recruitment from primary to secondary metabolism. PMID:21205930

  17. Live births after simultaneous avoidance of monogenic diseases and chromosome abnormality by next-generation sequencing with linkage analyses.

    PubMed

    Yan, Liying; Huang, Lei; Xu, Liya; Huang, Jin; Ma, Fei; Zhu, Xiaohui; Tang, Yaqiong; Liu, Mingshan; Lian, Ying; Liu, Ping; Li, Rong; Lu, Sijia; Tang, Fuchou; Qiao, Jie; Xie, X Sunney

    2015-12-29

    In vitro fertilization (IVF), preimplantation genetic diagnosis (PGD), and preimplantation genetic screening (PGS) help patients to select embryos free of monogenic diseases and aneuploidy (chromosome abnormality). Next-generation sequencing (NGS) methods, while experiencing a rapid cost reduction, have improved the precision of PGD/PGS. However, the precision of PGD has been limited by the false-positive and false-negative single-nucleotide variations (SNVs), which are not acceptable in IVF and can be circumvented by linkage analyses, such as short tandem repeats or karyomapping. It is noteworthy that existing methods of detecting SNV/copy number variation (CNV) and linkage analysis often require separate procedures for the same embryo. Here we report an NGS-based PGD/PGS procedure that can simultaneously detect a single-gene disorder and aneuploidy and is capable of linkage analysis in a cost-effective way. This method, called "mutated allele revealed by sequencing with aneuploidy and linkage analyses" (MARSALA), involves multiple annealing and looping-based amplification cycles (MALBAC) for single-cell whole-genome amplification. Aneuploidy is determined by CNVs, whereas SNVs associated with the monogenic diseases are detected by PCR amplification of the MALBAC product. The false-positive and -negative SNVs are avoided by an NGS-based linkage analysis. Two healthy babies, free of the monogenic diseases of their parents, were born after such embryo selection. The monogenic diseases originated from a single base mutation on the autosome and the X-chromosome of the disease-carrying father and mother, respectively. PMID:26712022

  18. Software scripts for quality checking of high-throughput nucleic acid sequencers.

    PubMed

    Lazo, G R; Tong, J; Miller, R; Hsia, C; Rausch, C; Kang, Y; Anderson, O D

    2001-06-01

    We have developed a graphical interface to allow the researcher to view and assess the quality of sequencing results using a series of program scripts developed to process data generated by automated sequencers. The scripts are written in the Perl programming language and are executable under the cgi-bin directory of a Web server environment. The scripts direct nucleic acid sequencing trace file data output from automated sequencers to be analyzed by the phred molecular biology program, and the results are displayed as graphical hypertext mark-up language (HTML) pages. The scripts are mainly designed to handle 96-well microtiter dish samples, but they are also able to read data from 384-well microtiter dishes 96 samples at a time. The scripts may be customized for different laboratory environments and computer configurations. Web links to the sources and discussion page are provided. PMID:11414222
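
    The kind of per-plate quality check described in this record can be sketched briefly. The conversion between a phred quality value Q and the base-call error probability P, Q = -10 log10(P), is the standard definition; everything else here (Python rather than the authors' Perl, the Q >= 20 threshold, the well-naming scheme, and the way a 384-well plate is split into batches of 96) is a hypothetical illustration, not the published scripts.

```python
def phred_to_error_prob(q):
    """Convert a phred quality value to a base-call error probability."""
    return 10 ** (-q / 10)

def count_high_quality(quals, threshold=20):
    """Count bases at or above a quality threshold (Q20 is an assumed cutoff)."""
    return sum(1 for q in quals if q >= threshold)

def plate_wells(rows="ABCDEFGH", n_cols=12):
    """List well names row by row, e.g. A01..H12 for a 96-well plate."""
    return [f"{r}{c:02d}" for r in rows for c in range(1, n_cols + 1)]

def batches(items, size=96):
    """Split a list of wells into consecutive batches of at most `size`."""
    return [items[i:i + size] for i in range(0, len(items), size)]

wells_96 = plate_wells()
wells_384 = plate_wells(rows="ABCDEFGHIJKLMNOP", n_cols=24)
print(len(wells_96), len(batches(wells_384)))
print(phred_to_error_prob(20), count_high_quality([10, 20, 30, 15]))
```

    A 384-well plate processed this way yields four batches of 96 wells, matching the "96 samples at a time" behaviour mentioned in the abstract, although the actual scripts may group wells differently.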

  19. Comparative genomic analyses identify the Vibrio harveyi genome sequenced strains BAA-1116 and HY01 as Vibrio campbellii

    PubMed Central

    Lin, Baochuan; Wang, Zheng; Malanoski, Anthony P; O'Grady, Elizabeth A; Wimpee, Charles F; Vuddhakul, Varaporn; Alves Jr, Nelson; Thompson, Fabiano L; Gomez-Gil, Bruno; Vora, Gary J

    2010-01-01

    Three notable members of the Harveyi clade, Vibrio harveyi, Vibrio alginolyticus and Vibrio parahaemolyticus, are best known as marine pathogens of commercial and medical import. In spite of this fact, the discrimination of Harveyi clade members remains difficult due to genetic and phenotypic similarities, and this has led to misidentifications and inaccurate estimations of a species' involvement in certain environments. To begin to understand the underlying genetics that complicate species level discrimination, we compared the genomes of Harveyi clade members isolated from different environments (seawater, shrimp, corals, oysters, finfish, humans) using microarray-based comparative genomic hybridization (CGH) and multilocus sequence analyses (MLSA). Surprisingly, we found that the only two V. harveyi strains that have had their genomes sequenced (strains BAA-1116 and HY01) have themselves been misidentified. Instead of belonging to the species harveyi, they are actually members of the species campbellii. In total, 28% of the strains tested were found to be misidentified and 42% of these appear to comprise a novel species. Taken together, our findings correct a number of species misidentifications while validating the ability of both CGH and MLSA to distinguish closely related members of the Harveyi clade. PMID:20686623

  20. Deep sequencing and in silico analyses identify MYB-regulated gene networks and signaling pathways in pancreatic cancer

    PubMed Central

    Azim, Shafquat; Zubair, Haseeb; Srivastava, Sanjeev K.; Bhardwaj, Arun; Zubair, Asif; Ahmad, Aamir; Singh, Seema; Khushman, Moh’d.; Singh, Ajay P.

    2016-01-01

    We have recently demonstrated that the transcription factor MYB can modulate several cancer-associated phenotypes in pancreatic cancer. In order to understand the molecular basis of these MYB-associated changes, we conducted deep-sequencing of transcriptome of MYB-overexpressing and -silenced pancreatic cancer cells, followed by in silico pathway analysis. We identified significant modulation of 774 genes upon MYB-silencing (p < 0.05) that were assigned to 25 gene networks by in silico analysis. Further analyses placed genes in our RNA sequencing-generated dataset to several canonical signalling pathways, such as cell-cycle control, DNA-damage and -repair responses, p53 and HIF1α. Importantly, we observed downregulation of the pancreatic adenocarcinoma signaling pathway in MYB-silenced pancreatic cancer cells exhibiting suppression of EGFR and NF-κB. Decreased expression of EGFR and RELA was validated by both qPCR and immunoblotting and they were both shown to be under direct transcriptional control of MYB. These observations were further confirmed in a converse approach wherein MYB was overexpressed ectopically in a MYB-null pancreatic cancer cell line. Our findings thus suggest that MYB potentially regulates growth and genomic stability of pancreatic cancer cells via targeting complex gene networks and signaling pathways. Further in-depth functional studies are warranted to fully understand MYB signaling in pancreatic cancer. PMID:27354262

  1. Deep sequencing and in silico analyses identify MYB-regulated gene networks and signaling pathways in pancreatic cancer.

    PubMed

    Azim, Shafquat; Zubair, Haseeb; Srivastava, Sanjeev K; Bhardwaj, Arun; Zubair, Asif; Ahmad, Aamir; Singh, Seema; Khushman, Moh'd; Singh, Ajay P

    2016-01-01

    We have recently demonstrated that the transcription factor MYB can modulate several cancer-associated phenotypes in pancreatic cancer. In order to understand the molecular basis of these MYB-associated changes, we conducted deep-sequencing of transcriptome of MYB-overexpressing and -silenced pancreatic cancer cells, followed by in silico pathway analysis. We identified significant modulation of 774 genes upon MYB-silencing (p < 0.05) that were assigned to 25 gene networks by in silico analysis. Further analyses placed genes in our RNA sequencing-generated dataset to several canonical signalling pathways, such as cell-cycle control, DNA-damage and -repair responses, p53 and HIF1α. Importantly, we observed downregulation of the pancreatic adenocarcinoma signaling pathway in MYB-silenced pancreatic cancer cells exhibiting suppression of EGFR and NF-κB. Decreased expression of EGFR and RELA was validated by both qPCR and immunoblotting and they were both shown to be under direct transcriptional control of MYB. These observations were further confirmed in a converse approach wherein MYB was overexpressed ectopically in a MYB-null pancreatic cancer cell line. Our findings thus suggest that MYB potentially regulates growth and genomic stability of pancreatic cancer cells via targeting complex gene networks and signaling pathways. Further in-depth functional studies are warranted to fully understand MYB signaling in pancreatic cancer. PMID:27354262

  2. Nucleotide and predicted amino acid sequences of cloned human and mouse preprocathepsin B cDNAs.

    PubMed Central

    Chan, S J; San Segundo, B; McCormick, M B; Steiner, D F

    1986-01-01

    Cathepsin B is a lysosomal thiol proteinase that may have additional extralysosomal functions. To further our investigations on the structure, mode of biosynthesis, and intracellular sorting of this enzyme, we have determined the complete coding sequences for human and mouse preprocathepsin B by using cDNA clones isolated from human hepatoma and kidney phage libraries. The nucleotide sequences predict that the primary structure of preprocathepsin B contains 339 amino acids organized as follows: a 17-residue NH2-terminal prepeptide sequence followed by a 62-residue propeptide region, 254 residues in mature (single chain) cathepsin B, and a 6-residue extension at the COOH terminus. A comparison of procathepsin B sequences from three species (human, mouse, and rat) reveals that the homology between the propeptides is relatively conserved with a minimum of 68% sequence identity. In particular, two conserved sequences in the propeptide that may be functionally significant include a potential glycosylation site and the presence of a single cysteine at position 59. Comparative analysis of the three sequences also suggests that processing of procathepsin B is a multistep process, during which enzymatically active intermediate forms may be generated. The availability of the cDNA clones will facilitate the identification of possible active or inactive intermediate processive forms as well as studies on the transcriptional regulation of the cathepsin B gene. PMID:3463996
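
    The domain organization quoted in this record can be checked with a short calculation. The sketch below assumes the four segments are contiguous, in the order listed, and numbered from residue 1 of the preproenzyme; the variable and function names are hypothetical.

```python
# Segment lengths as quoted in the abstract, in N- to C-terminal order.
SEGMENTS = [
    ("prepeptide", 17),
    ("propeptide", 62),
    ("mature cathepsin B", 254),
    ("C-terminal extension", 6),
]

def residue_ranges(segments):
    """Return (name, first, last) tuples with 1-based inclusive residue numbers."""
    ranges, start = [], 1
    for name, length in segments:
        end = start + length - 1
        ranges.append((name, start, end))
        start = end + 1
    return ranges

ranges = residue_ranges(SEGMENTS)
total_length = ranges[-1][2]
for name, first, last in ranges:
    print(f"{name}: {first}-{last}")
print("total residues:", total_length)
```

    Under these assumptions the segment lengths sum to the 339 amino acids stated for preprocathepsin B.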

  3. Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization.

    PubMed

    Anahtar, Melis N; Bowman, Brittany A; Kwon, Douglas S

    2016-01-01

    There is a growing appreciation for the role of microbial communities as critical modulators of human health and disease. High throughput sequencing technologies have allowed for the rapid and efficient characterization of bacterial communities using 16S rRNA gene sequencing from a variety of sources. Although readily available tools for 16S rRNA sequence analysis have standardized computational workflows, sample processing for DNA extraction remains a continued source of variability across studies. Here we describe an efficient, robust, and cost-effective method for extracting nucleic acid from swabs. We also delineate downstream methods for 16S rRNA gene sequencing, including generation of sequencing libraries, data quality control, and sequence analysis. The workflow can accommodate multiple sample types, including stool and swabs collected from a variety of anatomical locations and host species. Additionally, recovered DNA and RNA can be separated and used for other applications, including whole genome sequencing or RNA-seq. The method described allows for a common processing approach for multiple sample types and accommodates downstream analysis of genomic, metagenomic and transcriptional information. PMID:27168460

  4. Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization

    PubMed Central

    Anahtar, Melis N.; Bowman, Brittany A.; Kwon, Douglas S.

    2016-01-01

    There is a growing appreciation for the role of microbial communities as critical modulators of human health and disease. High throughput sequencing technologies have allowed for the rapid and efficient characterization of bacterial communities using 16S rRNA gene sequencing from a variety of sources. Although readily available tools for 16S rRNA sequence analysis have standardized computational workflows, sample processing for DNA extraction remains a continued source of variability across studies. Here we describe an efficient, robust, and cost-effective method for extracting nucleic acid from swabs. We also delineate downstream methods for 16S rRNA gene sequencing, including generation of sequencing libraries, data quality control, and sequence analysis. The workflow can accommodate multiple sample types, including stool and swabs collected from a variety of anatomical locations and host species. Additionally, recovered DNA and RNA can be separated and used for other applications, including whole genome sequencing or RNA-seq. The method described allows for a common processing approach for multiple sample types and accommodates downstream analysis of genomic, metagenomic and transcriptional information. PMID:27168460

  5. Neuroinformatics analyses reveal GABAt and SSADH as major proteins involved in anticonvulsant activity of valproic acid.

    PubMed

    Piplani, Sakshi; Verma, Prabhakar Kumar; Kumar, Ajit

    2016-07-01

    The lack of an unequivocal hypothesis for the anticonvulsant activity of valproic acid (VPA) has long been a basic hurdle in designing next-generation neurotherapeutics, particularly anti-epileptic drugs. The present study reports a comprehensive in-silico investigation into the qualitative and quantitative binding of VPA and the corresponding natural ligands to four major enzymes involved in neurotransmission, namely GABA transaminase (GABAt), α-ketoglutarate dehydrogenase (α-KGDH), succinate semialdehyde dehydrogenase (SSADH) and glutamate decarboxylase (GAD). The molecular docking analyses revealed that VPA inhibits GABAt and α-KGDH through an allosteric mode of binding and SSADH through a competitive mode. Binding of glutamate to GAD was elevated in the presence of VPA. The docking inhibition constants (Ki) of VPA for all the studied enzymes were well below the therapeutic concentration of VPA in blood, except for α-KGDH, thus favouring a GABAergic over a glutamatergic mode of anticonvulsant activity of VPA. The report is probably the first comprehensive in-silico molecular study of VPA action. PMID:27261619

  6. Single-cell analyses of transcriptional heterogeneity during drug tolerance transition in cancer cells by RNA sequencing

    PubMed Central

    Lee, Mei-Chong Wendy; Lopez-Diaz, Fernando J.; Khan, Shahid Yar; Tariq, Muhammad Akram; Dayn, Yelena; Vaske, Charles Joseph; Radenbaugh, Amie J.; Kim, Hyunsung John; Emerson, Beverly M.; Pourmand, Nader

    2014-01-01

    The acute cellular response to stress generates a subpopulation of reversibly stress-tolerant cells under conditions that are lethal to the majority of the population. Stress tolerance is attributed to heterogeneity of gene expression within the population to ensure survival of a minority. We performed whole transcriptome sequencing analyses of metastatic human breast cancer cells subjected to the chemotherapeutic agent paclitaxel at the single-cell and population levels. Here we show that specific transcriptional programs are enacted within untreated, stressed, and drug-tolerant cell groups while generating high heterogeneity between single cells within and between groups. We further demonstrate that drug-tolerant cells contain specific RNA variants residing in genes involved in microtubule organization and stabilization, as well as cell adhesion and cell surface signaling. In addition, the gene expression profile of drug-tolerant cells is similar to that of untreated cells within a few doublings. Thus, single-cell analyses reveal the dynamics of the stress response in terms of cell-specific RNA variants driving heterogeneity, the survival of a minority population through generation of specific RNA variants, and the efficient reconversion of stress-tolerant cells back to normalcy. PMID:25339441

  7. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Patel, Kamlesh D [Ken]; SNL,

    2013-01-25

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology" at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June 2012 in Santa Fe, NM.

  8. Deduced amino acid sequence of human pulmonary surfactant proteolipid: SPL(pVal)

    SciTech Connect

    Whitsett, J.A.; Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.; Clark, J.; Pilot-Matias, T.; Meuth, J.; Fox, J.L.

    1987-05-01

    A hydrophobic, proteolipid-like protein of Mr 6500 was isolated from ether/ethanol extracts of human, canine and bovine pulmonary surfactant. Amino acid composition of the protein demonstrated a remarkable abundance of hydrophobic residues, particularly valine and leucine. The N-terminal amino acid sequence of the human protein was determined: N-Leu-Ile-Pro-Cys-Cys-Pro-Val-Asn-Leu-Lys-Arg-Leu-Leu-Ile-Val4... An oligonucleotide probe was used to screen an adult human lung cDNA library and resulted in detection of cDNA clones with a predicted amino acid sequence closely matching the N-terminal amino acid sequence of the human peptide. SPL(pVal) was found within the reading frame of a larger peptide. SPL(pVal) results from proteolytic processing of a larger preprotein. Northern blot analysis detected a single 1.0 kilobase SPL(pVal) RNA which was less abundant in fetal than in adult lung. Mixtures of purified canine and bovine SPL(pVal) and synthetic phospholipids display properties of rapid adsorption and surface tension lowering activity characteristic of surfactant. Human SPL(pVal) is a pulmonary surfactant proteolipid which may therefore be useful in combination with phospholipids and/or other surfactant proteins for the treatment of surfactant deficiency such as hyaline membrane disease in newborn infants.

  9. Complete nucleic acid sequence of Penaeus stylirostris densovirus (PstDNV) from India.

    PubMed

    Rai, Praveen; Safeena, Muhammed P; Karunasagar, Iddya; Karunasagar, Indrani

    2011-06-01

    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) of shrimp has recently been classified as Penaeus stylirostris densovirus (PstDNV). The complete nucleic acid sequence of PstDNV from India was obtained by cloning and sequencing different DNA fragments of the virus. The genome organisation of PstDNV revealed three major coding domains: a left ORF (NS1) of 2001 bp, a mid ORF (NS2) of 1092 bp and a right ORF (VP) of 990 bp. The complete genome and the amino acid sequences of the three proteins, NS1, NS2 and VP, were compared with the genomes of the virus reported from Hawaii, China and Mexico and with partial sequences available from isolates from different regions. The phylogenetic analysis of shrimp, insect and vertebrate parvovirus sequences showed that the Indian PstDNV isolate is phylogenetically more closely related to one of the three isolates from Taiwan (AY355307) and to two isolates (AY362547 and AY102034) from Thailand. PMID:21402111

  10. Human liver type pyruvate kinase: complete amino acid sequence and the expression in mammalian cells.

    PubMed Central

    Tani, K; Fujii, H; Nagata, S; Miwa, S

    1988-01-01

    Pyruvate kinase (PK) has four isozymes (L, R, M1, M2) that are encoded by two different genes. Among these isozymes, abnormalities of liver (L)-type PK are considered to be associated with hereditary nonspherocytic hemolytic anemia in humans. We isolated and determined the full-length sequence of human L-type PK cDNA. The cDNA contains 1629 base pairs encoding 543 amino acids, 68 base pairs of 5'-noncoding sequence, and 734 base pairs of 3'-noncoding sequence. The similarity between human and rat L-type PK was 86.9% at the nucleotide sequence level and 92.4% at the amino acid sequence level. The full-length L-type PK cDNA was placed under the promoter of simian virus 40 and introduced into monkey COS cells. Human L-type PK activity was detected in the extract of COS cells by the classical PK electrophoresis method. PMID:3126495
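    The base-pair figures in this abstract can be cross-checked with simple codon arithmetic. The sketch below is illustrative only: the function name `codons_in` and the variable names are not from the paper, and it assumes the 1629 bp coding figure counts sense codons only (the abstract does not say whether the stop codon is included).

```python
def codons_in(coding_bp: int) -> int:
    """Return the number of codons in a coding region of the given length."""
    if coding_bp % 3 != 0:
        raise ValueError("coding length is not a multiple of 3")
    return coding_bp // 3


# Figures quoted in the abstract (base pairs).
five_prime_bp = 68
coding_bp = 1629
three_prime_bp = 734

# Assumption: every codon in the 1629 bp region encodes one amino acid.
amino_acids = codons_in(coding_bp)

# Total length implied by the three quoted segments.
total_cdna_bp = five_prime_bp + coding_bp + three_prime_bp

print(f"{coding_bp} bp coding region -> {amino_acids} codons")
print(f"implied cDNA length: {total_cdna_bp} bp")
```

    Under that assumption the coding region yields exactly the 543 residues reported; the summed length is only the total implied by the three segments quoted above.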

  11. Human liver type pyruvate kinase: Complete amino acid sequence and the expression in mammalian cells

    SciTech Connect

    Tani, Kenzaburo; Nagata, Shigekazu; Fujii, Hisaichi; Miwa, Shiro

    1988-03-01

    Pyruvate kinase (PK) has four isozymes (L, R, M1, M2) that are encoded by two different genes. Among these isozymes, abnormalities of liver (L)-type PK are considered to be associated with hereditary nonspherocytic hemolytic anemia in humans. The authors isolated and determined the full-length sequence of human L-type PK cDNA. The cDNA contains 1,629 base pairs encoding 543 amino acids, 68 base pairs of 5'-noncoding sequence, and 734 base pairs of 3'-noncoding sequence. The similarity between human and rat L-type PK was 86.9% at the nucleotide sequence level and 92.4% at the amino acid sequence level. The full-length L-type PK cDNA was placed under the promoter of simian virus 40 and introduced into monkey COS cells. Human L-type PK activity was detected in the extract of COS cells by the classical PK electrophoresis method.

  12. Molecular cytogenetics by polymerase catalyzed amplification or in situ labelling of specific nucleic acid sequences

    SciTech Connect

    Bolund, L.; Brandt, C.; Hindkjaer, J.; Koch, J.; Koelvraa, S.; Pedersen, S.

    1993-01-01

    The polymerase chain reaction (PCR) can be performed on isolated cells or chromosomes, and the product can be analyzed by DNA technology or by FISH to test metaphases. The authors report good experience analyzing aberrant chromosomes by FACS sorting, PCR with degenerate primers, and painting of test metaphases with the PCR product. They also use polymerases for PRimed IN Situ labelling (PRINS) of specific nucleic acid sequences. In PRINS, oligonucleotides are hybridized to their target sequences, and labeled nucleotides are incorporated at the site of hybridization with the oligonucleotide as primer. PRINS may eventually allow the study of individual genes, gene expression and even somatic mutations (in mRNA) in single cells.

  13. DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

    NASA Astrophysics Data System (ADS)

    Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

    1984-08-01

    A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β-lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.
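    The repeat structure described in this abstract can be written out explicitly. This is a minimal sketch, not the authors' analysis: it builds the repeat region from the four residues named above using standard one-letter codes, and the helper `count_leading_repeats` is a hypothetical name introduced here for illustration.

```python
# Standard one-letter codes for the three residues named in the abstract.
ONE_LETTER = {"proline": "P", "asparagine": "N", "alanine": "A"}

REPEAT_NAMES = ["proline", "asparagine", "alanine", "asparagine"]
REPEAT_COUNT = 23

# The four-residue repeat unit and the full tandem-repeat region.
unit = "".join(ONE_LETTER[name] for name in REPEAT_NAMES)
repeat_region = unit * REPEAT_COUNT


def count_leading_repeats(seq: str, unit: str) -> int:
    """Count how many back-to-back copies of `unit` start the sequence."""
    if not unit:
        raise ValueError("unit must be non-empty")
    n = 0
    while seq.startswith(unit, n * len(unit)):
        n += 1
    return n


residues = len(repeat_region)
min_coding_bp = residues * 3  # three nucleotides per codon

print(unit, residues, min_coding_bp, count_leading_repeats(repeat_region, unit))
```

    The residue and nucleotide totals here follow only from the repeat unit and copy number quoted in the abstract; they say nothing about the flanking regions of the CS protein.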

  14. Comparative Analyses of the Lipooligosaccharides from Nontypeable Haemophilus influenzae and Haemophilus haemolyticus Show Differences in Sialic Acid and Phosphorylcholine Modifications

    PubMed Central

    Post, Deborah M. B.; Ketterer, Margaret R.; Coffin, Jeremy E.; Reinders, Lorri M.; Munson, Robert S.; Bair, Thomas; Murphy, Timothy F.; Foster, Eric D.; Gibson, Bradford W.

    2016-01-01

    Haemophilus haemolyticus and nontypeable Haemophilus influenzae (NTHi) are closely related upper airway commensal bacteria that are difficult to distinguish phenotypically. NTHi causes upper and lower airway tract infections in individuals with compromised airways, while H. haemolyticus rarely causes such infections. The lipooligosaccharide (LOS) is an outer membrane component of both species and plays a role in NTHi pathogenesis. In this study, comparative analyses of the LOS structures and corresponding biosynthesis genes were performed. Mass spectrometric and immunochemical analyses showed that NTHi LOS contained terminal sialic acid more frequently and to a higher extent than H. haemolyticus LOS did. Genomic analyses of 10 strains demonstrated that H. haemolyticus lacked the sialyltransferase genes lic3A and lic3B (9/10) and siaA (10/10), but all strains contained the sialic acid uptake genes siaP and siaT (10/10). However, isothermal titration calorimetry analyses of SiaP from two H. haemolyticus strains showed a 3.4- to 7.3-fold lower affinity for sialic acid compared to that of NTHi SiaP. Additionally, mass spectrometric and immunochemical analyses showed that the LOS from H. haemolyticus contained phosphorylcholine (ChoP) less frequently than the LOS from NTHi strains. These differences observed in the levels of sialic acid and ChoP incorporation in the LOS structures from H. haemolyticus and NTHi may explain some of the differences in their propensities to cause disease. PMID:26729761

  15. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.
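    To put the library sizes quoted above in context, they can be compared with the number of all possible primers of each length. The sketch below assumes a four-letter nucleotide alphabet and treats every sequence as distinct (no collapsing of reverse complements); the function names are illustrative and do not come from the patent.

```python
def possible_kmers(k: int) -> int:
    """Number of distinct oligonucleotides of length k over A, C, G, T."""
    return 4 ** k


def library_fraction(k: int, library_size: int) -> float:
    """Fraction of all possible k-mers that a library of this size covers."""
    return library_size / possible_kmers(k)


# Library sizes quoted in the abstract, keyed by primer length.
LIBRARY_SIZES = {8: 10_000, 9: 14_200, 10: 44_000}

for k, size in LIBRARY_SIZES.items():
    total = possible_kmers(k)
    print(f"{k}-mers: {size:,} of {total:,} possible ({library_fraction(k, size):.1%})")
```

    On these assumptions, each quoted library is a small subset of the full sequence space for its primer length, which is consistent with the abstract's point that a fixed, modest library can be reused across many templates.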

  16. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

  17. Partial amino acid sequence of apolipoprotein(a) shows that it is homologous to plasminogen

    SciTech Connect

    Eaton, D.L.; Fless, G.M.; Kohr, W.J.; McLean, J.W.; Xu, Q.T.; Miller, C.G.; Lawn, R.M.; Scanu, A.M.

    1987-05-01

    Apolipoprotein(a) (apo(a)) is a glycoprotein with Mr approx. 280,000 that is disulfide linked to apolipoprotein B in lipoprotein(a) particles. Elevated plasma levels of lipoprotein(a) are correlated with atherosclerosis. Partial amino acid sequence of apo(a) shows that it has striking homology to plasminogen. Plasminogen is a plasma serine protease zymogen that consists of five homologous and tandemly repeated domains called kringles and a trypsin-like protease domain. The amino-terminal sequence obtained for apo(a) is homologous to the beginning of kringle 4 but not the amino terminus of plasminogen. Apo(a) was subjected to limited proteolysis by trypsin or V8 protease, and fragments generated were isolated and sequenced. Sequences obtained from several of these fragments are highly (77-100%) homologous to plasminogen residues 391-421, which reside within kringle 4. Analysis of these internal apo(a) sequences revealed that apo(a) may contain at least two kringle 4-like domains. A sequence obtained from another tryptic fragment also shows homology to the end of kringle 4 and the beginning of kringle 5. Sequence data obtained from the two tryptic fragments show homology with the protease domain of plasminogen. One of these sequences is homologous to the sequences surrounding the activation site of plasminogen. Plasminogen is activated by the cleavage of a specific arginine residue by urokinase and tissue plasminogen activator; however, the corresponding site in apo(a) is a serine that would not be cleaved by tissue plasminogen activator or urokinase. Using a plasmin-specific assay, no proteolytic activity could be demonstrated for lipoprotein(a) particles. These results suggest that apo(a) contains kringle-like domains and an inactive protease domain.

  18. Stability Test and Quantitative and Qualitative Analyses of the Amino Acids in Pharmacopuncture Extracted from Scolopendra subspinipes mutilans

    PubMed Central

    Cho, GyeYoon; Han, KyuChul; Yoon, JinYoung

    2015-01-01

    Objectives: Scolopendra subspinipes mutilans (S. subspinipes mutilans) is known as a traditional medicine and includes various amino acids, peptides and proteins. The amino acids in the pharmacopuncture extracted from S. subspinipes mutilans were analyzed quantitatively and qualitatively, after derivatization, by high performance liquid chromatography (HPLC) over a 12 month period to confirm its stability. Methods: Amino acids of pharmacopuncture extracted from S. subspinipes mutilans were derivatized using O-phthaldialdehyde (OPA) and 9-fluorenyl methoxy carbonyl chloride (FMOC) reagents and were analyzed using HPLC. The amino acids were detected using a diode array detector (DAD) and a fluorescence detector (FLD) to compare a mixed amino acid standard (STD) to the pharmacopuncture from centipedes. The stability tests on the pharmacopuncture from centipedes were done using HPLC for three conditions: a room temperature test chamber, an acceleration test chamber, and a cold test chamber. Results: The pharmacopuncture from centipedes was prepared using the method of the Korean Pharmacopuncture Institute (KPI) and through quantitative analyses was shown to contain 9 amino acids of the 16 amino acids in the mixed amino acid STD. The amounts of the amino acids in the pharmacopuncture from centipedes were 34.37 ppm of aspartate, 123.72 ppm of arginine, 170.63 ppm of alanine, 59.55 ppm of leucine and 57 ppm of lysine. The relative standard deviation (RSD %) results for the pharmacopuncture from centipedes had a maximum value of 14.95% and a minimum value of 1.795% on the room temperature test chamber, the acceleration test chamber and the cold test chamber stability tests. Conclusion: Stability tests on and quantitative and qualitative analyses of the amino acids in the pharmacopuncture extracted from centipedes were performed using derivatization methods and HPLC. Through research, we hope to determine the relationship between time and the

  19. The Complete Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis ssp. lactis IL1403

    PubMed Central

    Bolotin, Alexander; Wincker, Patrick; Mauger, Stéphane; Jaillon, Olivier; Malarme, Karine; Weissenbach, Jean; Ehrlich, S. Dusko; Sorokin, Alexei

    2001-01-01

    Lactococcus lactis is a nonpathogenic AT-rich gram-positive bacterium closely related to the genus Streptococcus and is the most commonly used cheese starter. It is also the best-characterized lactic acid bacterium. We sequenced the genome of the laboratory strain IL1403, using a novel two-step strategy that comprises diagnostic sequencing of the entire genome and a shotgun polishing step. The genome contains 2,365,589 base pairs and encodes 2310 proteins, including 293 protein-coding genes belonging to six prophages and 43 insertion sequence (IS) elements. Nonrandom distribution of IS elements indicates that the chromosome of the sequenced strain may be a product of recent recombination between two closely related genomes. A complete set of late competence genes is present, indicating the ability of L. lactis to undergo DNA transformation. Genomic sequence revealed new possibilities for fermentation pathways and for aerobic respiration. It also indicated a horizontal transfer of genetic information from Lactococcus to gram-negative enteric bacteria of Salmonella-Escherichia group. [The sequence data described in this paper has been submitted to the GenBank data library under accession no. AE005176.] PMID:11337471
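    The genome size and protein count quoted above give a rough coding density. The following sketch simply divides one figure by the other; the constant and variable names are illustrative, and the result is an average spacing per protein-coding gene, not an average gene length.

```python
# Figures quoted in the abstract.
GENOME_BP = 2_365_589
PROTEINS = 2310

# Average genome span per encoded protein, in base pairs.
bp_per_protein = GENOME_BP / PROTEINS

# Equivalent density expressed as proteins per kilobase of genome.
proteins_per_kb = PROTEINS / (GENOME_BP / 1000)

print(f"about {bp_per_protein:.0f} bp of genome per encoded protein")
print(f"about {proteins_per_kb:.2f} proteins per kb")
```

    This works out to roughly one protein-coding gene per kilobase, a back-of-envelope figure derived only from the two numbers in the abstract.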

  20. On human disease-causing amino acid variants: statistical study of sequence and structural patterns

    PubMed Central

    Alexov, Emil

    2015-01-01

    Statistical analysis was carried out on large set of naturally occurring human amino acid variations and it was demonstrated that there is a preference for some amino acid substitutions to be associated with diseases. At an amino acid sequence level, it was shown that the disease-causing variants frequently involve drastic changes of amino acid physico-chemical properties of proteins such as charge, hydrophobicity and geometry. Structural analysis of variants involved in diseases and being frequently observed in human population showed similar trends: disease-causing variants tend to cause more changes of hydrogen bond network and salt bridges as compared with harmless amino acid mutations. Analysis of thermodynamics data reported in literature, both experimental and computational, indicated that disease-causing variants tend to destabilize proteins and their interactions, which prompted us to investigate the effects of amino acid mutations on large databases of experimentally measured energy changes in unrelated proteins. Although the experimental datasets were linked neither to diseases nor exclusory to human proteins, the observed trends were the same: amino acid mutations tend to destabilize proteins and their interactions. Having in mind that structural and thermodynamics properties are interrelated, it is pointed out that any large change of any of them is anticipated to cause a disease. PMID:25689729

  1. Self-sequencing of amino acids and origins of polyfunctional protocells.

    PubMed

    Fox, S W

    1984-01-01

    The primal role of the origins of proteins in molecular evolution is discussed. On the basis of this premise, the significance of the experimentally established self-sequencing of amino acids under simulated geological conditions is explained as due to the fact that the products are highly nonrandom and accordingly contain many kinds of information. When such thermal proteins are aggregated into laboratory protocells, an action that occurs readily, the resultant protocells also contain many kinds of information. Residue-by-residue order, enzymic activities, and lipid quality accordingly occur within each preparation of proteinoid (thermal protein). In this paper are reviewed briefly the phenomenon of self-sequencing of amino acids, its relationship to evolutionary processes, other significance of such self-ordering, and the experimental evidence for original polyfunctional protocells. PMID:6462684

  2. Self-Sequencing of Amino Acids and Origins of Polyfunctional Protocells

    NASA Astrophysics Data System (ADS)

    Fox, Sidney W.

    1984-12-01

    The primal role of the origins of proteins in molecular evolution is discussed. On the basis of this premise, the significance of the experimentally established self-sequencing of amino acids under simulated geological conditions is explained as due to the fact that the products are highly nonrandom and accordingly contain many kinds of information. When such thermal proteins are aggregated into laboratory protocells, an action that occurs readily, the resultant protocells also contain many kinds of information. Residue-by-residue order, enzymic activities, and lipid quality accordingly occur within each preparation of proteinoid (thermal protein). In this paper are reviewed briefly the phenomenon of self-sequencing of amino acids, its relationship to evolutionary processes, other significance of such self-ordering, and the experimental evidence for original polyfunctional protocells.

  3. Comparative sequence analyses of rhodopsin and RPE65 reveal patterns of selective constraint across hereditary retinal disease mutations.

    PubMed

    Hauser, Frances E; Schott, Ryan K; Castiglione, Gianni M; Van Nynatten, Alexander; Kosyakov, Alexander; Tang, Portia L; Gow, Daniel A; Chang, Belinda S W

    2016-01-01

    Retinitis pigmentosa (RP) comprises several heritable diseases that involve photoreceptor, and ultimately retinal, degeneration. Currently, mutations in over 50 genes have known links to RP. Despite advances in clinical characterization, molecular characterization of RP remains challenging due to the heterogeneous nature of causal genes, mutations, and clinical phenotypes. In this study, we compiled large datasets of two important visual genes associated with RP: rhodopsin, which initiates the phototransduction cascade, and the retinoid isomerase RPE65, which regenerates the visual cycle. We used a comparative evolutionary approach to investigate the relationship between interspecific sequence variation and pathogenic mutations that lead to degenerative retinal disease. Using codon-based likelihood methods, we estimated evolutionary rates (dN/dS) across both genes in a phylogenetic context to investigate differences between pathogenic and nonpathogenic amino acid sites. In both genes, disease-associated sites showed significantly lower evolutionary rates compared to nondisease sites, and were more likely to occur in functionally critical areas of the proteins. The nature of the dataset (e.g., vertebrate or mammalian sequences), as well as selection of pathogenic sites, affected the differences observed between pathogenic and nonpathogenic sites. Our results illustrate that these methods can serve as an intermediate step in understanding protein structure and function in a clinical context, particularly in predicting the relative pathogenicity (i.e., functional impact) of point mutations and their downstream phenotypic effects. Extensions of this approach may also contribute to current methods for predicting the deleterious effects of candidate mutations and to the identification of protein regions under strong constraint where we expect pathogenic mutations to occur. PMID:26750628

  4. Minding the gap: Frequency of indels in mtDNA control region sequence data and influence on population genetic analyses

    USGS Publications Warehouse

    Pearce, J.M.

    2006-01-01

    Insertions and deletions (indels) result in sequences of various lengths when homologous gene regions are compared among individuals or species. Although indels are typically phylogenetically informative, occurrence and incorporation of these characters as gaps in intraspecific population genetic data sets are rarely discussed. Moreover, the impact of gaps on estimates of fixation indices, such as FST, has not been reviewed. Here, I summarize the occurrence and population genetic signal of indels among 60 published studies that involved alignments of multiple sequences from the mitochondrial DNA (mtDNA) control region of vertebrate taxa. Among 30 studies observing indels, an average of 12% of both variable and parsimony-informative sites were composed of these sites. There was no consistent trend between levels of population differentiation and the number of gap characters in a data block. Across all studies, the average influence on estimates of ΦST was small, explaining only an additional 1.8% of among population variance (range 0.0-8.0%). Studies most likely to observe an increase in ΦST with the inclusion of gap characters were those with < 20 variable sites, but a near equal number of studies with few variable sites did not show an increase. In contrast to studies at interspecific levels, the influence of indels for intraspecific population genetic analyses of control region DNA appears small, dependent upon total number of variable sites in the data block, and related to species-specific characteristics and the spatial distribution of mtDNA lineages that contain indels. © 2006 Blackwell Publishing Ltd.

  5. Comparative Analyses of Plastid Sequences between Native and Introduced Populations of Aquatic Weeds Elodea canadensis and E. nuttallii

    PubMed Central

    Huotari, Tea; Korpelainen, Helena

    2013-01-01

    Non-indigenous species (NIS) are species living outside their historic or native range. Invasive NIS often cause severe environmental impacts, and may have large economic and social consequences. Elodea (Hydrocharitaceae) is a New World genus with at least five submerged aquatic angiosperm species living in fresh water environments. Our aim was to survey the geographical distribution of cpDNA haplotypes within the native and introduced ranges of invasive aquatic weeds Elodea canadensis and E. nuttallii and to reconstruct the spreading histories of these invasive species. In order to reveal informative chloroplast (cp) genome regions for phylogeographic analyses, we compared the plastid sequences of native and introduced individuals of E. canadensis. In total, we found 235 variable sites (186 SNPs, 47 indels and two inversions) between the two plastid sequences consisting of 112,193 bp and developed primers flanking the most variable genomic areas. These 29 primer pairs were used to compare the level and pattern of intraspecific variation within E. canadensis to interspecific variation between E. canadensis and E. nuttallii. Nine potentially informative primer pairs were used to analyze the phylogeographic structure of both Elodea species, based on 70 E. canadensis and 25 E. nuttallii individuals covering native and introduced distributions. On the whole, the level of variation between the two Elodea species was 53% higher than that within E. canadensis. In our phylogeographic analysis, only a single haplotype was found in the introduced range in both species. These haplotypes H1 (E. canadensis) and A (E. nuttallii) were also widespread in the native range, covering the majority of native populations analyzed. Therefore, we were able neither to identify the geographic origin of the introduced populations nor to test the hypothesis of single versus multiple introductions. The divergence between E. canadensis haplotypes was surprisingly high, and future research may

  6. Sequence of morphological transitions in two-dimensional pattern growth from aqueous ascorbic Acid solutions.

    PubMed

    Paranjpe, A S

    2002-08-12

    A sequence of morphological transitions in two-dimensional dehydration patterns of aqueous solutions of ascorbic acid is observed with humidity as a control parameter. Change in morphology occurs due to humidity induced variation in the concentration of the metastable supersaturated solution phase formed after initial solvent evaporation. As percent humidity is varied from 40 to 80, patterns change from compact circular --> radial --> density modulated radial (a new morphology) --> density modulated circular --> density modulated dendritic (a new morphology) --> dense branching. PMID:12190528

  7. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  8. Snake venom. The amino acid sequence of protein A from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1980-12-01

    Protein A from Dendroaspis polylepis polylepis venom comprises 81 amino acids, including ten half-cystine residues. The complete primary structures of protein A and its variant A' were elucidated. The sequences of proteins A and A', which differ in a single position, show no homology with various neurotoxins and non-neurotoxic proteins and represent a new type of elapid venom protein. PMID:7461607

  9. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    PubMed

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment. PMID:23485423

  10. Transcriptome Sequencing and Genome-wide Association Analyses Reveal Lysosomal Function and Actin Cytoskeleton Remodeling in Schizophrenia and Bipolar Disorder

    PubMed Central

    Kim, Sanghyeon; Reimers, Mark; Bacanu, Silviu-Alin; Yu, Hui; Liu, Chunyu; Sun, Jingchun; Wang, Quan; Jia, Peilin; Xu, Fengping; Zhang, Yong; Kendler, Kenneth S.; Peng, Zhiyu; Chen, Xiangning

    2014-01-01

    Schizophrenia (SCZ) and bipolar disorder (BPD) are severe mental disorders with high heritability. Clinicians have long noticed the similarities of clinic symptoms between these disorders. In recent years, accumulating evidence indicates some shared genetic liabilities. However, what is shared remains elusive. In this study, we conducted whole transcriptome analysis of postmortem brain tissues (cingulate cortex) from SCZ, BPD and control subjects, and identified differentially expressed genes in these disorders. We found 105 and 153 genes differentially expressed in SCZ and BPD, respectively. By comparing the t-test scores, we found that many of the genes differentially expressed in SCZ and BPD are concordant in their expression level (q ≤ 0.01, 53 genes; q ≤ 0.05, 213 genes; q ≤ 0.1, 885 genes). Using genome-wide association data from the Psychiatric Genomics Consortium, we found that these differentially and concordantly expressed genes were enriched in association signals for both SCZ (p < 10^-7) and BPD (p = 0.029). To our knowledge, this is the first time that a substantially large number of genes shows concordant expression and association for both SCZ and BPD. Pathway analyses of these genes indicated that they are involved in the lysosome, Fc gamma receptor mediated phagocytosis, regulation of actin skeleton pathways, along with several cancer pathways. Functional analyses of these genes revealed an interconnected pathway network centered on lysosomal function and the regulation of actin cytoskeleton. These pathways and their interacting network were principally confirmed by an independent transcriptome sequencing dataset of hippocampus. Dysregulation of lysosomal function and cytoskeleton remodeling has direct impacts on endocytosis, phagocytosis, exocytosis, vesicle trafficking, neuronal maturation and migration, neurite outgrowth, and synaptic density and plasticity, and different aspects of these processes have been implicated in SCZ and BPD

  11. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... approved by the Director of the Federal Register in accordance with 5 U.S.C. 552(a) and 1 CFR part 51... base or modified or unusual amino acid may be presented in a given sequence as the corresponding unmodified base or amino acid if the modified base or modified or unusual amino acid is one of those...

  12. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... approved by the Director of the Federal Register in accordance with 5 U.S.C. 552(a) and 1 CFR part 51... base or modified or unusual amino acid may be presented in a given sequence as the corresponding unmodified base or amino acid if the modified base or modified or unusual amino acid is one of those...

  13. Nanopore Analysis of Nucleic Acids: Single-Molecule Studies of Molecular Dynamics, Structure, and Base Sequence

    NASA Astrophysics Data System (ADS)

    Olasagasti, Felix; Deamer, David W.

    Nucleic acids are linear polynucleotides in which each base is covalently linked to a pentose sugar and a phosphate group carrying a negative charge. If a pore having roughly the cross-sectional diameter of a single-stranded nucleic acid is embedded in a thin membrane and a voltage of 100 mV or more is applied, individual nucleic acids in solution can be captured by the electrical field in the pore and translocated through by single-molecule electrophoresis. The dimensions of the pore cannot accommodate anything larger than a single strand, so each base in the molecule passes through the pore in strict linear sequence. The nucleic acid strand occupies a large fraction of the pore's volume during translocation and therefore produces a transient blockade of the ionic current created by the applied voltage. If it could be demonstrated that each nucleotide in the polymer produced a characteristic modulation of the ionic current during its passage through the nanopore, the sequence of current modulations would reflect the sequence of bases in the polymer. According to this basic concept, nanopores are analogous to a Coulter counter that detects nanoscopic molecules rather than microscopic ones [1,2]. However, the advantage of nanopores is that individual macromolecules can be characterized because different chemical and physical properties affect their passage through the pore. Because macromolecules can be captured in the pore as well as translocated, the nanopore can be used to detect individual functional complexes that form between a nucleic acid and an enzyme. No other technique has this capability.

  14. Complete amino acid sequence of a histidine-rich proteolytic fragment of human ceruloplasmin.

    PubMed

    Kingston, I B; Kingston, B L; Putnam, F W

    1979-04-01

    The complete amino acid sequence has been determined for a fragment of human ceruloplasmin [ferroxidase; iron(II):oxygen oxidoreductase, EC 1.16.3.1]. The fragment (designated Cp F5) contains 159 amino acid residues and has a molecular weight of 18,650; it lacks carbohydrate, is rich in histidine, and contains one free cysteine that may be part of a copper-binding site. This fragment is present in most commercial preparations of ceruloplasmin, probably owing to proteolytic degradation, but can also be obtained by limited cleavage of single-chain ceruloplasmin with plasmin. Cp F5 probably is an intact domain attached to the COOH-terminal end of single-chain ceruloplasmin via a labile interdomain peptide bond. A model of the secondary structure predicted by empirical methods suggests that almost one-third of the amino acid residues are distributed in alpha helices, about a third in beta-sheet structure, and the remainder in beta turns and unidentified structures. Computer analysis of the amino acid sequence has not demonstrated a statistically significant relationship between this ceruloplasmin fragment and any other protein, but there is some evidence for an internal duplication. PMID:287005

  15. Leaf waxes, compound-specific D/H and 14C analyses in the Loess Paleosol Sequence Möhlin, Switzerland

    NASA Astrophysics Data System (ADS)

    Wüthrich, Lorenz; Bliedtner, Marcel; Kathrin Schäfer, Imke; Zech, Jana; Gaar, Dorian; Preusser, Frank; Zech, Roland

    2016-04-01

    Leaf waxes, such as long-chain n-alkanes and n-alkanoic acids, and their D/H isotopic composition, are increasingly used for paleoenvironmental and paleoclimate reconstructions. Recent technological innovations now also allow radiocarbon analyses to be performed on leaf waxes. For this study, we analyzed leaf waxes and their δD and 14C composition in the 7 m Loess Paleosol Sequence Möhlin, Switzerland. The chain length patterns in the upper part of the profile indicate n-alkane contribution from deciduous trees, while the underlying loess is dominated by inputs from grasses and herbs. Our δD record does not show depleted, glacial values compared to the Holocene, as we had expected in analogy to the Greenland ice core records. Values are most enriched at 1 m depth, i.e. well below the topsoil. Further research is needed to disentangle source effects and evapotranspirative enrichment, before the δD record can be interpreted robustly. Our radiocarbon ages for the leaf waxes are in very good agreement with independent age control based on luminescence ages, corroborating that massive loess accumulation occurred already at 35 ka. Only the uppermost 3 m were deposited during the last glacial maximum.

  16. Gene Mutation Profiles in Primary Diffuse Large B Cell Lymphoma of Central Nervous System: Next Generation Sequencing Analyses

    PubMed Central

    Todorovic Balint, Milena; Jelicic, Jelena; Mihaljevic, Biljana; Kostic, Jelena; Stanic, Bojana; Balint, Bela; Pejanovic, Nadja; Lucic, Bojana; Tosic, Natasa; Marjanovic, Irena; Stojiljkovic, Maja; Karan-Djurasevic, Teodora; Perisic, Ognjen; Rakocevic, Goran; Popovic, Milos; Raicevic, Sava; Bila, Jelena; Antic, Darko; Andjelic, Bosko; Pavlovic, Sonja

    2016-01-01

    The existence of a potential primary central nervous system lymphoma-specific genomic signature that differs from the systemic form of diffuse large B cell lymphoma (DLBCL) has been suggested, but is still controversial. We investigated 19 patients with primary DLBCL of central nervous system (DLBCL CNS) using the TruSeq Amplicon Cancer Panel (TSACP) for 48 cancer-related genes. Next generation sequencing (NGS) analyses have revealed that over 80% of potentially protein-changing mutations were located in eight genes (CTNNB1, PIK3CA, PTEN, ATM, KRAS, PTPN11, TP53 and JAK3), pointing to the potential role of these genes in lymphomagenesis. TP53 was the only gene harboring mutations in all 19 patients. In addition, the presence of mutated TP53 and ATM genes correlated with a higher total number of mutations in other analyzed genes. Furthermore, the presence of mutated ATM correlated with poorer event-free survival (EFS) (p = 0.036). The presence of the mutated SMO gene correlated with earlier disease relapse (p = 0.023), inferior event-free survival (p = 0.011) and overall survival (OS) (p = 0.017), while mutations in the PTEN gene were associated with inferior OS (p = 0.048). Our findings suggest that the TP53 and ATM genes could be involved in the molecular pathophysiology of primary DLBCL CNS, whereas mutations in the PTEN and SMO genes could affect survival regardless of the initial treatment approach. PMID:27164089

  17. Analyses of nuclear ldhA gene and mtDNA control region sequences of Atlantic northern bluefin tuna populations.

    PubMed

    Ely, B; Stoner, D S; Bremer, Alvarado J R; Dean, J M; Addis, P; Cau, A; Thelen, E J; Jones, W J; Black, D E; Smith, L; Scott, K; Naseri, I; Quattro, J M

    2002-12-01

    There has been considerable debate about whether the Atlantic northern bluefin tuna exist as a single panmictic unit. We have addressed this issue by examining both mitochondrial DNA control region nucleotide sequences and nuclear gene ldhA allele frequencies in replicate size or year class samples of northern bluefin tuna from the Mediterranean Sea and the northwestern Atlantic Ocean. Pairwise comparisons of multiple year class samples from the 2 regions provided no evidence for population subdivision. Similarly, analyses of molecular variance of both mitochondrial and ldhA data revealed no significant differences among or between samples from the 2 regions. These results demonstrate the importance of analyzing multiple year classes and large sample sizes to obtain accurate estimates when using allele frequencies to characterize a population. It is important to note that the absence of genetic evidence for population substructure does not unilaterally constitute evidence of a single panmictic population, as genetic differentiation can be prevented by large population sizes and by migration. PMID:14961233

  18. Complete genome sequence and transcriptomics analyses reveal pigment biosynthesis and regulatory mechanisms in an industrial strain, Monascus purpureus YY-1

    PubMed Central

    Yang, Yue; Liu, Bin; Du, Xinjun; Li, Ping; Liang, Bin; Cheng, Xiaozhen; Du, Liangcheng; Huang, Di; Wang, Lei; Wang, Shuo

    2015-01-01

    Monascus has been used to produce natural colorants and food supplements for more than one thousand years, and approximately more than one billion people eat Monascus-fermented products during their daily life. In this study, using next-generation sequencing and optical mapping approaches, a 24.1-Mb complete genome of an industrial strain, Monascus purpureus YY-1, was obtained. This genome consists of eight chromosomes and 7,491 genes. Phylogenetic analysis at the genome level provides convincing evidence for the evolutionary position of M. purpureus. We provide the first comprehensive prediction of the biosynthetic pathway for Monascus pigment. Comparative genomic analyses show that the genome of M. purpureus is 13.6–40% smaller than those of closely related filamentous fungi and has undergone significant gene losses, most of which likely occurred during its specialized adaptation to starch-based foods. Comparative transcriptome analysis reveals that carbon starvation stress, resulting from the use of relatively low-quality carbon sources, contributes to the high yield of pigments by repressing central carbon metabolism and augmenting the acetyl-CoA pool. Our work provides important insights into the evolution of this economically important fungus and lays a foundation for future genetic manipulation and engineering of this strain. PMID:25660389

  19. Gene Mutation Profiles in Primary Diffuse Large B Cell Lymphoma of Central Nervous System: Next Generation Sequencing Analyses.

    PubMed

    Todorovic Balint, Milena; Jelicic, Jelena; Mihaljevic, Biljana; Kostic, Jelena; Stanic, Bojana; Balint, Bela; Pejanovic, Nadja; Lucic, Bojana; Tosic, Natasa; Marjanovic, Irena; Stojiljkovic, Maja; Karan-Djurasevic, Teodora; Perisic, Ognjen; Rakocevic, Goran; Popovic, Milos; Raicevic, Sava; Bila, Jelena; Antic, Darko; Andjelic, Bosko; Pavlovic, Sonja

    2016-01-01

    The existence of a potential primary central nervous system lymphoma-specific genomic signature that differs from the systemic form of diffuse large B cell lymphoma (DLBCL) has been suggested, but is still controversial. We investigated 19 patients with primary DLBCL of central nervous system (DLBCL CNS) using the TruSeq Amplicon Cancer Panel (TSACP) for 48 cancer-related genes. Next generation sequencing (NGS) analyses have revealed that over 80% of potentially protein-changing mutations were located in eight genes (CTNNB1, PIK3CA, PTEN, ATM, KRAS, PTPN11, TP53 and JAK3), pointing to the potential role of these genes in lymphomagenesis. TP53 was the only gene harboring mutations in all 19 patients. In addition, the presence of mutated TP53 and ATM genes correlated with a higher total number of mutations in other analyzed genes. Furthermore, the presence of mutated ATM correlated with poorer event-free survival (EFS) (p = 0.036). The presence of the mutated SMO gene correlated with earlier disease relapse (p = 0.023), inferior event-free survival (p = 0.011) and overall survival (OS) (p = 0.017), while mutations in the PTEN gene were associated with inferior OS (p = 0.048). Our findings suggest that the TP53 and ATM genes could be involved in the molecular pathophysiology of primary DLBCL CNS, whereas mutations in the PTEN and SMO genes could affect survival regardless of the initial treatment approach. PMID:27164089

  20. Complete genome sequence and transcriptomics analyses reveal pigment biosynthesis and regulatory mechanisms in an industrial strain, Monascus purpureus YY-1.

    PubMed

    Yang, Yue; Liu, Bin; Du, Xinjun; Li, Ping; Liang, Bin; Cheng, Xiaozhen; Du, Liangcheng; Huang, Di; Wang, Lei; Wang, Shuo

    2015-01-01

    Monascus has been used to produce natural colorants and food supplements for more than one thousand years, and approximately more than one billion people eat Monascus-fermented products during their daily life. In this study, using next-generation sequencing and optical mapping approaches, a 24.1-Mb complete genome of an industrial strain, Monascus purpureus YY-1, was obtained. This genome consists of eight chromosomes and 7,491 genes. Phylogenetic analysis at the genome level provides convincing evidence for the evolutionary position of M. purpureus. We provide the first comprehensive prediction of the biosynthetic pathway for Monascus pigment. Comparative genomic analyses show that the genome of M. purpureus is 13.6-40% smaller than those of closely related filamentous fungi and has undergone significant gene losses, most of which likely occurred during its specialized adaptation to starch-based foods. Comparative transcriptome analysis reveals that carbon starvation stress, resulting from the use of relatively low-quality carbon sources, contributes to the high yield of pigments by repressing central carbon metabolism and augmenting the acetyl-CoA pool. Our work provides important insights into the evolution of this economically important fungus and lays a foundation for future genetic manipulation and engineering of this strain. PMID:25660389

  1. Allele frequency-based analyses robustly map sequence sites under balancing selection in a malaria vaccine candidate antigen.

    PubMed Central

    Polley, Spencer D; Chokejindachai, Watcharee; Conway, David J

    2003-01-01

    The Plasmodium falciparum apical membrane antigen 1 (AMA1) is a leading candidate for a malaria vaccine. Here, within-population analyses of alleles from 50 Thai P. falciparum isolates yield significant evidence for balancing selection on polymorphisms within the disulfide-bonded domains I and III of the surface accessible ectodomain of AMA1, a result very similar to that seen previously in a Nigerian population. Studying the frequency of nucleotide polymorphisms in both populations shows that the between-population component of variance (F(ST)) is significantly lower in domains I and III compared to the intervening domain II and compared to 11 unlinked microsatellite loci. A nucleotide site-by-site analysis shows that sites with exceptionally high or low F(ST) values cluster significantly into serial runs, with four runs of low values in domain I and one in domain III. These runs may map the sequences that are consistently under the strongest balancing selection from naturally acquired immune responses. PMID:14573469
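    The between-population component of variance mentioned above can be illustrated with a small sketch. It assumes a single biallelic site, two equally weighted populations, and the simple heterozygosity-based form F_ST = (H_T - H_S) / H_T; the abstract does not say which estimator the authors used, and the function names and allele frequencies here are hypothetical.

```python
def heterozygosity(p: float) -> float:
    """Expected heterozygosity 2p(1-p) at a biallelic site with allele frequency p."""
    return 2.0 * p * (1.0 - p)


def fst_biallelic(p1: float, p2: float) -> float:
    """F_ST = (H_T - H_S) / H_T for two equally weighted populations.

    p1, p2 are the frequencies of the same allele in each population
    (illustrative inputs, not data from the study).
    """
    p_bar = (p1 + p2) / 2.0
    h_total = heterozygosity(p_bar)                              # pooled population
    h_within = (heterozygosity(p1) + heterozygosity(p2)) / 2.0   # mean within-population
    if h_total == 0.0:
        return 0.0  # site is monomorphic overall, no variance to partition
    return (h_total - h_within) / h_total


if __name__ == "__main__":
    # Similar frequencies in both populations give a low F_ST,
    # which is the pattern expected under balancing selection.
    print(fst_biallelic(0.45, 0.55))
    # Strongly diverged frequencies give a high F_ST.
    print(fst_biallelic(0.2, 0.8))
```

    Sites whose F_ST is unusually low relative to neutral markers are the ones such an analysis would flag as candidates for balancing selection.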

  2. The amino acid sequence of Lady Amherst's pheasant (Chrysolophus amherstiae) and golden pheasant (Chrysolophus pictus) egg-white lysozymes.

    PubMed

    Araki, T; Kuramoto, M; Torikata, T

    1990-09-01

    The amino acids of Lady Amherst's pheasant and golden pheasant egg-white lysozymes have been sequenced. The carboxymethylated lysozymes were digested with trypsin followed by sequencing of the tryptic peptides. Lady Amherst's pheasant lysozyme proved to consist of 129 amino acid residues, and a relative molecular mass of 14,423 Da was calculated. This lysozyme had 6 amino acid substitutions when compared with hen egg-white lysozyme: Phe3 to Tyr, His15 to Leu, Gln41 to His, Asn77 to His, Gln121 to Asn, and a newly found substitution of Ile124 to Thr. The amino acid sequence of golden pheasant lysozyme was identical to that of Lady Amherst's pheasant lysozyme. The phylogenetic tree constructed by comparing the amino acid sequences of phasianoid bird lysozymes revealed a minimum genetic distance between these pheasants and the turkey-peafowl group. PMID:1368578

  3. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    PubMed Central

    Rhee, Mun Su; Moritz, Brélan E.; Xie, Gary; Glavina del Rio, T.; Dalin, E.; Tice, H.; Bruce, D.; Goodwin, L.; Chertkov, O.; Brettin, T.; Han, C.; Detter, C.; Pitluck, S.; Land, Miriam L.; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, K. T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed. PMID:22675583

  4. [Partial sequence homology of FtsZ in phylogenetics analysis of lactic acid bacteria].

    PubMed

    Zhang, Bin; Dong, Xiu-zhu

    2005-10-01

    FtsZ is a structurally conserved protein, which is universal among the prokaryotes. It plays a key role in prokaryote cell division. A partial fragment of the ftsZ gene about 800 bp in length was amplified and sequenced and a partial FtsZ protein phylogenetic tree for the lactic acid bacteria was constructed. By comparing the FtsZ phylogenetic tree with the 16S rDNA tree, it was shown that the two trees were similar in topology. Both trees revealed that Pediococcus spp. were closely related to the L. casei group of Lactobacillus spp., but less related to other lactic acid cocci such as Enterococcus and Streptococcus. The results also showed that the discriminative power of FtsZ was higher than that of 16S rDNA at both the inter-species and inter-genus levels and that FtsZ could be a very useful tool in species identification of lactic acid bacteria. PMID:16342751

  5. Comparative characterization of random-sequence proteins consisting of 5, 12, and 20 kinds of amino acids.

    PubMed

    Tanaka, Junko; Doi, Nobuhide; Takashima, Hideaki; Yanagawa, Hiroshi

    2010-04-01

    Screening of functional proteins from a random-sequence library has been used to evolve novel proteins in the field of evolutionary protein engineering. However, random-sequence proteins consisting of the 20 natural amino acids tend to aggregate, and the occurrence rate of functional proteins in a random-sequence library is low. From the viewpoint of the origin of life, it has been proposed that primordial proteins consisted of a limited set of amino acids that could have been abundantly formed early during chemical evolution. We have previously found that members of a random-sequence protein library constructed with five primitive amino acids show high solubility (Doi et al., Protein Eng Des Sel 2005;18:279-284). Although such a library is expected to be appropriate for finding functional proteins, the functionality may be limited, because they have no positively charged amino acid. Here, we constructed three libraries of 120-amino acid, random-sequence proteins using alphabets of 5, 12, and 20 amino acids by preselection using mRNA display (to eliminate sequences containing stop codons and frameshifts) and characterized and compared the structural properties of random-sequence proteins arbitrarily chosen from these libraries. We found that random-sequence proteins constructed with the 12-member alphabet (including five primitive amino acids and positively charged amino acids) have higher solubility than those constructed with the 20-member alphabet, though other biophysical properties are very similar in the two libraries. Thus, a library of moderate complexity constructed from 12 amino acids may be a more appropriate resource for functional screening than one constructed from 20 amino acids. PMID:20162614

  6. N-Terminal Amino Acid Sequence Determination of Proteins by N-Terminal Dimethyl Labeling: Pitfalls and Advantages When Compared with Edman Degradation Sequence Analysis.

    PubMed

    Chang, Elizabeth; Pourmal, Sergei; Zhou, Chun; Kumar, Rupesh; Teplova, Marianna; Pavletich, Nikola P; Marians, Kenneth J; Erdjument-Bromage, Hediye

    2016-07-01

    In recent history, alternative approaches to Edman sequencing have been investigated, and to this end, the Association of Biomolecular Resource Facilities (ABRF) Protein Sequencing Research Group (PSRG) initiated studies in 2014 and 2015, looking into bottom-up and top-down N-terminal (Nt) dimethyl derivatization of standard quantities of intact proteins with the aim to determine Nt sequence information. We have expanded this initiative and used low picomole amounts of myoglobin to determine the efficiency of Nt-dimethylation. Application of this approach on protein domains, generated by limited proteolysis of overexpressed proteins, confirms that it is a universal labeling technique and is very sensitive when compared with Edman sequencing. Finally, we compared Edman sequencing and Nt-dimethylation of the same polypeptide fragments; results confirm that there is agreement in the identity of the Nt amino acid sequence between these 2 methods. PMID:27006647

  7. N-Terminal Amino Acid Sequence Determination of Proteins by N-Terminal Dimethyl Labeling: Pitfalls and Advantages When Compared with Edman Degradation Sequence Analysis

    PubMed Central

    Chang, Elizabeth; Pourmal, Sergei; Zhou, Chun; Kumar, Rupesh; Teplova, Marianna; Pavletich, Nikola P.; Marians, Kenneth J.

    2016-01-01

    In recent history, alternative approaches to Edman sequencing have been investigated, and to this end, the Association of Biomolecular Resource Facilities (ABRF) Protein Sequencing Research Group (PSRG) initiated studies in 2014 and 2015, looking into bottom-up and top-down N-terminal (Nt) dimethyl derivatization of standard quantities of intact proteins with the aim to determine Nt sequence information. We have expanded this initiative and used low picomole amounts of myoglobin to determine the efficiency of Nt-dimethylation. Application of this approach on protein domains, generated by limited proteolysis of overexpressed proteins, confirms that it is a universal labeling technique and is very sensitive when compared with Edman sequencing. Finally, we compared Edman sequencing and Nt-dimethylation of the same polypeptide fragments; results confirm that there is agreement in the identity of the Nt amino acid sequence between these 2 methods. PMID:27006647

  8. Partial amino acid sequence of fructose-1,6-bisphosphatase from the blue-green algae Synechococcus leopoliensis.

    PubMed

    Marcus, F; Latshaw, S P; Steup, M; Gerbling, K P

    1989-08-01

    Purified fructose-1,6-bisphosphatase from the cyanobacterium Synechococcus leopoliensis was S-carboxymethylated and cleaved with trypsin. The resulting peptides were purified by reversed-phase high performance liquid chromatography and the amino acid sequence of six of the purified peptides was determined by gas-phase microsequencing. The results revealed sequence homology with other fructose-1,6-bisphosphatases. The obtained sequence data provides information required for the design of oligonucleotide hybridization probes to screen existing libraries of cyanobacterial DNA. The determination of the amino acid sequence of cyanobacterial proteins may yield important information with respect to the endosymbiotic theory of evolution. PMID:2550924

  9. Protein sequence analysis by incorporating modified chaos game and physicochemical properties into Chou's general pseudo amino acid composition.

    PubMed

    Xu, Chunrui; Sun, Dandan; Liu, Shenghui; Zhang, Yusen

    2016-10-01

    In this contribution we introduced a novel graphical method to compare protein sequences. By mapping a protein sequence into 3D space based on codons and physicochemical properties of 20 amino acids, we are able to get a unique P-vector from the 3D curve. This approach is consistent with wobble theory of amino acids. We compute the distance between sequences by their P-vectors to measure similarities/dissimilarities among protein sequences. Finally, we use our method to analyze four datasets and get better results compared with previous approaches. PMID:27375218

  10. Nucleotide sequence of the phosphoglycerate kinase gene from the extreme thermophile Thermus thermophilus. Comparison of the deduced amino acid sequence with that of the mesophilic yeast phosphoglycerate kinase.

    PubMed Central

    Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L

    1988-01-01

    Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. PMID:3052437
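    The third-position G or C bias quoted above is a simple count over codons. The sketch below shows one way to compute it, assuming an in-frame coding sequence given as a plain string; the function name and the toy sequence are hypothetical and are not taken from the T. thermophilus gene.

```python
def gc3_fraction(cds: str) -> float:
    """Fraction of codons whose third base is G or C.

    `cds` must be an in-frame coding sequence whose length is a
    multiple of three (illustrative helper, not from the paper).
    """
    seq = cds.upper()
    if len(seq) == 0 or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a positive multiple of 3")
    third_bases = seq[2::3]  # every third base, starting at codon position 3
    gc_count = sum(base in "GC" for base in third_bases)
    return gc_count / len(third_bases)


if __name__ == "__main__":
    # Toy sequence: codons ATG GCC GGG CTC TAA, third bases G, C, G, C, A.
    toy_cds = "ATGGCCGGGCTCTAA"
    print(f"GC3 = {gc3_fraction(toy_cds):.1%}")
```

    Applied to the full open reading frame, a value near 93% would correspond to the bias the abstract reports.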

  11. Bacteria obtained from a sequencing batch reactor that are capable of growth on dehydroabietic acid.

    PubMed Central

    Mohn, W W

    1995-01-01

    Eleven isolates capable of growth on the resin acid dehydroabietic acid (DhA) were obtained from a sequencing batch reactor designed to treat a high-strength process stream from a paper mill. The isolates belonged to two groups, represented by strains DhA-33 and DhA-35, which were characterized. In the bioreactor, bacteria like DhA-35 were more abundant than those like DhA-33. The population in the bioreactor of organisms capable of growth on DhA was estimated to be 1.1 × 10^6 propagules per ml, based on a most-probable-number determination. Analysis of small-subunit rRNA partial sequences indicated that DhA-33 was most closely related to Sphingomonas yanoikuyae (Sab = 0.875) and that DhA-35 was most closely related to Zoogloea ramigera (Sab = 0.849). Both isolates additionally grew on other abietanes, i.e., abietic and palustric acids, but not on the pimaranes, pimaric and isopimaric acids. For DhA-33 and DhA-35 with DhA as the sole organic substrate, doubling times were 2.7 and 2.2 h, respectively, and growth yields were 0.30 and 0.25 g of protein per g of DhA, respectively. Glucose as a cosubstrate stimulated growth of DhA-33 on DhA and stimulated DhA degradation by the culture. Pyruvate as a cosubstrate did not stimulate growth of DhA-35 on DhA and reduced the specific rate of DhA degradation of the culture. DhA induced DhA and abietic acid degradation activities in both strains, and these activities were heat labile. Cell suspensions of both strains consumed DhA at a rate of 6 μmol mg of protein^-1 h^-1. (ABSTRACT TRUNCATED AT 250 WORDS) PMID:7793937
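    The doubling times reported above can be converted to specific growth rates with the standard exponential-growth relation mu = ln(2) / t_d. The sketch below assumes balanced exponential growth, which the abstract does not state explicitly; the function name is hypothetical.

```python
import math


def specific_growth_rate(doubling_time_h: float) -> float:
    """Specific growth rate (per hour) from a doubling time in hours.

    Uses mu = ln(2) / t_d, valid for exponential growth.
    """
    if doubling_time_h <= 0:
        raise ValueError("doubling time must be positive")
    return math.log(2) / doubling_time_h


if __name__ == "__main__":
    # Doubling times taken from the abstract (hours).
    for strain, t_d in (("DhA-33", 2.7), ("DhA-35", 2.2)):
        print(f"{strain}: mu = {specific_growth_rate(t_d):.3f} per h")
```

    Under this assumption the shorter doubling time of DhA-35 corresponds to the higher specific growth rate.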

  12. A review of the occurrence, analyses, toxicity, and biodegradation of naphthenic acids.

    PubMed

    Clemente, Joyce S; Fedorak, Phillip M

    2005-07-01

    Naphthenic acids occur naturally in crude oils and in oil sands bitumens. They are toxic components in refinery wastewaters and in oil sands extraction waters. In addition, there are many industrial uses for naphthenic acids, so there is a potential for their release to the environment from a variety of activities. Studies have shown that naphthenic acids are susceptible to biodegradation, which decreases their concentration and reduces toxicity. This is a complex group of carboxylic acids with the general formula CnH(2n+Z)O2, where n indicates the carbon number and Z specifies the hydrogen deficiency resulting from ring formation. Measuring the concentrations of naphthenic acids in environmental samples and determining the chemical composition of a naphthenic acids mixture are huge analytical challenges. However, new analytical methods are being applied to these problems and progress is being made to better understand this mixture of chemically similar compounds. This paper reviews a variety of analytical methods and their application to assessing biodegradation of naphthenic acids. PMID:15963797
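
    The general formula CnH(2n+Z)O2 given in this abstract can be turned into a small calculator. The sketch below assumes that Z is zero or a negative even integer (the usual convention for hydrogen deficiency) and uses rounded average atomic masses; the function names are hypothetical.

```python
# Rounded average atomic masses in g/mol.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999}


def naphthenic_acid_formula(n: int, z: int) -> str:
    """Return the molecular formula CnH(2n+Z)O2 as a string."""
    if n < 1:
        raise ValueError("n must be a positive carbon number")
    if z > 0 or z % 2 != 0:
        # Assumption: Z is zero or a negative even integer.
        raise ValueError("Z is assumed to be zero or a negative even integer")
    hydrogens = 2 * n + z
    if hydrogens < 1:
        raise ValueError("n and Z leave no hydrogen atoms")
    return f"C{n}H{hydrogens}O2"


def molar_mass(n: int, z: int) -> float:
    """Return the average molar mass (g/mol) of CnH(2n+Z)O2."""
    naphthenic_acid_formula(n, z)  # reuse the validation above
    return (n * ATOMIC_MASS["C"]
            + (2 * n + z) * ATOMIC_MASS["H"]
            + 2 * ATOMIC_MASS["O"])


# Cyclohexanecarboxylic acid: seven carbons, one ring (n = 7, Z = -2).
print(naphthenic_acid_formula(7, -2), round(molar_mass(7, -2), 2))
```

    Each ring removes two hydrogens relative to the acyclic acid, which is why a one-ring acid such as cyclohexanecarboxylic acid corresponds to Z = -2.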

  13. Cloning and Nucleotide Sequence Analyses of 11 Genome Segments of Two American and One British Equine Rotavirus Strains

    PubMed Central

    Ma, Yongping; Wen, Xiaobo; Hoshino, Yasutaka; Yuan, L

    2015-01-01

    Group A equine rotavirus (ERV) is the main cause of diarrhea in foals and causes severe economic loss to stud farming worldwide through morbidity and mortality. The molecular evolution of equine rotaviruses remains understudied. In this study, whole-genome analysis of three group A ERV strains, the American strains FI-14 (G3P[12]) and H-2 (G3P[12]) and the British strain FI23 (G14P[12]), was carried out, and the genotype constellations were determined as G3-P[12]-I6-R2-C2-M3-A10-N2-T3-E2-H7 for FI-14; G14-P[12]-I2-R2-C2-M3-A10-N2-T3-E2-H7 for FI23; and G3-P[12]-I6-R2-C2-M3-A10-N2-T3-E2-H7 for H-2. With the exception of the VP7 and VP6 genes, the two G3P[12] strains (FI-14 and H-2) and the G14P[12] strain (FI23) were highly related genetically. Of note, the VP6 genotype of strain H-2 was previously reported to be I2; however, sequence and phylogenetic analyses demonstrated that it was I6. The G3P[12] and G14P[12] ERV strains therefore bore distinct VP6 genotypes: I6 for G3P[12] strains and I2 for G14P[12] strains. Moreover, the T-cell epitope residues 299P-300P/Q (PP/Q) of VP6 may be considered a typical molecular marker of I2 ERV, which facilitates the analysis of the molecular evolution of equine rotaviruses. PMID:25631250

  14. Predictive functional profiling using marker gene sequences and community diversity analyses of microbes in full-scale anaerobic sludge digesters.

    PubMed

    Gao, Jing; Liu, Guoji; Li, Hongping; Xu, Li; Du, Lili; Yang, Bo

    2016-07-01

    Anaerobic digestion (AD) is widely used in treating the sewage sludge, as it can reduce the amount of sludge, eliminate pathogens and produce biofuel. To enhance the operational performance and stability of anaerobic bioreactors, operational and conventional chemical data from full-scale sludge anaerobic digesters were collected over a 2-year period and summarized, and the microbial community diversity of the sludge sample was investigated at various stages of the AD process. For the purpose of distinguishing between the functional and community diversity of the microbes, Phylogenetic Investigation of Communities by Reconstruction of Unobserved States (PICRUSt) software was used to impute the prevalence of 16S rDNA marker gene sequences in the difference in various sludge samples. Meanwhile, a taxa analysis was also carried out to investigate the different sludge samples. The microbial community diversity analysis of one AD sludge sample showed that the most dominant bacterial genera were Saccharicrinis, Syntrophus, Anaerotruncus and Thermanaerothrix. Among archaea, acetoclastic Methanosaeta represented 56.0 %, and hydrogenotrophic Methanospirillum, Methanoculleus, Methanothermus and Methanolinea accounted for 41.3 % of all methanogens. The taxa, genetic and functional prediction analyses of the feedstock and AD sludge samples suggested great community diversity differences between them. The taxa of bacteria in two AD sludge samples were considerably different, but the abundances of the functional KEGG pathways took on similar levels. The numbers of identified pathogens were significantly lower in the digested sludge than in the feedstock, but the PICRUSt results showed the difference in "human diseases" abundances in the level-1 pathway between the two sludge samples was small. PMID:27016946

  15. Cloning and nucleotide sequence analyses of 11 genome segments of two American and one British equine rotavirus strains.

    PubMed

    Ma, Yongping; Wen, Xiaobo; Hoshino, Yasutaka; Yuan, L

    2015-03-23

    Group A equine rotavirus (ERV) is the main cause of diarrhea in foals and causes severe economic loss to stud farming worldwide through morbidity and mortality. The molecular evolution of equine rotaviruses remains understudied. In this study, whole-genome analysis of three group A ERV strains, the American strains FI-14 (G3P[12]) and H-2 (G3P[12]) and the British strain FI23 (G14P[12]), was carried out, and the genotype constellations were determined as G3-P[12]-I6-R2-C2-M3-A10-N2-T3-E2-H7 for FI-14; G14-P[12]-I2-R2-C2-M3-A10-N2-T3-E2-H7 for FI23; and G3-P[12]-I6-R2-C2-M3-A10-N2-T3-E2-H7 for H-2. With the exception of the VP7 and VP6 genes, the two G3P[12] strains (FI-14 and H-2) and the G14P[12] strain (FI23) were highly related genetically. Of note, the VP6 genotype of strain H-2 was previously reported to be I2; however, sequence and phylogenetic analyses demonstrated that it was I6. The G3P[12] and G14P[12] ERV strains therefore bore distinct VP6 genotypes: I6 for G3P[12] strains and I2 for G14P[12] strains. Moreover, the T-cell epitope residues 299P-300P/Q (PP/Q) of VP6 may be considered a typical molecular marker of I2 ERV, which facilitates the analysis of the molecular evolution of equine rotaviruses. PMID:25631250

  16. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (IPP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  17. Novel method for PIK3CA mutation analysis: locked nucleic acid--PCR sequencing.

    PubMed

    Ang, Daphne; O'Gara, Rebecca; Schilling, Amy; Beadling, Carol; Warrick, Andrea; Troxell, Megan L; Corless, Christopher L

    2013-05-01

    Somatic mutations in PIK3CA are commonly seen in invasive breast cancer and several other carcinomas, occurring in three hotspots: codons 542 and 545 of exon 9 and in codon 1047 of exon 20. We designed a locked nucleic acid (LNA)-PCR sequencing assay to detect low levels of mutant PIK3CA DNA with attention to avoiding amplification of a pseudogene on chromosome 22 that has >95% homology to exon 9 of PIK3CA. We tested 60 FFPE breast DNA samples with known PIK3CA mutation status (48 cases had one or more PIK3CA mutations, and 12 were wild type) as identified by PCR-mass spectrometry. PIK3CA exons 9 and 20 were amplified in the presence or absence of LNA-oligonucleotides designed to bind to the wild-type sequences for codons 542, 545, and 1047, and partially suppress their amplification. LNA-PCR sequencing confirmed all 51 PIK3CA mutations; however, the mutation detection rate by standard Sanger sequencing was only 69% (35 of 51). Of the 12 PIK3CA wild-type cases, LNA-PCR sequencing detected three additional H1047R mutations in "normal" breast tissue and one E545K in usual ductal hyperplasia. Histopathological review of these three normal breast specimens showed columnar cell change in two (both with known H1047R mutations) and apocrine metaplasia in one. The novel LNA-PCR shows higher sensitivity than standard Sanger sequencing and did not amplify the known pseudogene. PMID:23541593

  18. THERMAL AND SPECTROSCOPIC ANALYSES OF CAUSTIC SIDE SOLVENT EXTRACTION SOLVENT CONTACTED WITH 1 MOLAR AND 3 MOLAR NITRIC ACID

    SciTech Connect

    Fondeur, F; David Hobbs, D; Samuel Fink, S

    2007-07-23

    Thermal and spectroscopic analyses were performed on multiple layers formed from contacting Caustic Side Solvent Extraction (CSSX) solvent with 1 M or 3 M nitric acid. A slow chemical reaction occurs (i.e., over several weeks) between the solvent and 1 M or 3 M nitric acid as evidenced by color changes and the detection of nitro groups in the infrared spectrum of the aged samples. Thermal analysis revealed that decomposition of the resulting mixture does not meet the definition of explosive or deflagrating material.

  19. THERMAL AND SPECTROSCOPIC ANALYSES OF CAUSTIC SIDE SOLVENT EXTRACTION SOLVENT CONTACTED WITH 16 MOLAR AND 8 MOLAR NITRIC ACID

    SciTech Connect

    Fondeur, F; David Hobbs, D; Samuel Fink, S

    2007-07-12

    Thermal and spectroscopic analyses were performed on multiple layers formed from contacting Caustic Side Solvent Extraction (CSSX) solvent with 1 M or 3 M nitric acid. A slow chemical reaction occurs (i.e., over several weeks) between the solvent and 1 M or 3 M nitric acid as evidenced by color changes and the detection of nitro groups in the infrared spectrum of the aged samples. Thermal analysis revealed that decomposition of the resulting mixture does not meet the definition of explosive or deflagrating material.

  20. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    PubMed

    Wang, Xiaoyu; Chen, Meili; Xiao, Jingfa; Hao, Lirui; Crowley, David E; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  1. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3

    PubMed Central

    Wang, Xiaoyu; Chen, Meili; Xiao, Jingfa; Hao, Lirui; Crowley, David E.; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  2. Bile acid sulfotransferase I from rat liver sulfates bile acids and 3-hydroxy steroids: purification, N-terminal amino acid sequence, and kinetic properties.

    PubMed

    Barnes, S; Buchina, E S; King, R J; McBurnett, T; Taylor, K B

    1989-04-01

    A bile acid:3'phosphoadenosine-5'phosphosulfate:sulfotransferase (BAST I) from adult female rat liver cytosol has been purified 157-fold by a two-step isolation procedure. The N-terminal amino acid sequence of the 30,000 subunit has been determined for the first 35 residues. The Vmax of purified BAST I is 18.7 nmol/min per mg protein with N-(3-hydroxy-5 beta-cholanoyl)glycine (glycolithocholic acid) as substrate, comparable to that of the corresponding purified human BAST (Chen, L-J., and I. H. Segel, 1985. Arch. Biochem. Biophys. 241: 371-379). BAST I activity has a broad pH optimum from 5.5-7.5. Although maximum activity occurs with 5 mM MgCl2, Mg2+ is not essential for BAST I activity. The greatest sulfotransferase activity and the highest substrate affinity are observed with bile acids or steroids that have a steroid nucleus containing a 3 beta-hydroxy group and a 5-6 double bond or a trans A-B ring junction. These substrates have normal hyperbolic initial velocity curves with substrate inhibition occurring above 5 microM. Of the saturated 5 beta-bile acids, those with a single 3-hydroxy group are the most active. The addition of a second hydroxy group at the 6- or 7-position eliminates more than 99% of the activity. In contrast, 3 alpha,12 alpha-dihydroxy-5 beta-cholan-24-oic acid (deoxycholic acid) is an excellent substrate. The initial velocity curves for glycolithocholic and deoxycholic acid conjugates are sigmoidal rather than hyperbolic, suggestive of an allosteric effect. Maximum activity is observed at 80 microM for glycolithocholic acid. All substrates, bile acids and steroids, are inhibited by the 5 beta-bile acid, 3-keto-5 beta-cholanoic acid. The data suggest that BAST I is the same protein as hydroxysteroid sulfotransferase 2 (Marcus, C. J., et al. 1980. Anal. Biochem. 107: 296-304). PMID:2754334

  3. Fatty acid composition analyses of the DCMU resistant mutants of Nannochloropsis oculata (eustigmatophyceae)

    NASA Astrophysics Data System (ADS)

    Jimin, Zhang; Shuang, Liu; Xue, Sun; Guanpin, Yang; Xuecheng, Zhang; Zhenhui, Gao

    2003-04-01

    Ultraviolet mutagenesis was applied to Nannochloropsis oculata and three mutants resistant to 3-(3, 4-dichlorophenyl)-1,1-dimethylurea (DCMU) were isolated. The cellular chlorophyll a and total lipid content of the wild are higher in the medium supplemented with DCMU than in the control without DCMU. Without DCMU, the growth rates and chlorophyll a contents of the mutants are similar to those of the wild. Significant changes of fatty acid content and composition have occurred in DCMU-resistant mutants growing in the medium supplemented with DCMU. The total lipid, palmitic acid (16:0), palmitoleic acid (16:1ω9) and oleic (18:1ω9) contents decrease significantly, while the vaccenic acid (18:1ω11) increases significantly and the EPA content of dried powder increases slightly in the mutants. The study may provide a basis to improve EPA content in Nannochloropsis oculata in the future.

  4. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    NASA Astrophysics Data System (ADS)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  5. Unconventional amino acid sequence of the sun anemone (Stoichactis helianthus) polypeptide neurotoxin

    SciTech Connect

    Kem, W.; Dunn, B.; Parten, B.; Pennington, M.; Price, D.

    1986-05-01

    A 5000 dalton polypeptide neurotoxin (Sh-NI) purified by G50 Sephadex, P-cellulose, and SP-Sephadex chromatography was homogeneous by isoelectric focusing. Sh-NI was highly toxic to crayfish (LD50 0.6 µg/kg) but without effect upon mice at 15,000 µg/kg (i.p. injection). The reduced, 3H-carboxymethylated toxin and its fragments were subjected to automatic Edman degradation and the resulting PTH-amino acids were identified by HPLC, back hydrolysis, and scintillation counting. Peptides resulting from proteolytic (clostripain, staphylococcal protease) and chemical (tryptophan) cleavage were sequenced. The sequence is: AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK. This sequence differs considerably from the homologous Anemonia and Anthopleura toxins; many of the identical residues (6 half-cystines, G9, P10, R13, G19, G29, W30) are probably critical for folding rather than receptor recognition. However, the Sh-NI sequence closely resembles Radioanthus macrodactylus neurotoxin III and R. paumotensis II. The authors propose that Sh-NI and related Radioanthus toxins act upon a different site on the sodium channel.
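
    The residue labels quoted in this abstract (for example G9 or W30) can be checked directly against the printed sequence. The sketch below does so with 1-based positions, using the sequence string exactly as it appears above; the helper names `matches_label` and `check_labels` are hypothetical.

```python
# Sh-NI sequence as printed in the abstract (one-letter amino acid codes).
SH_NI = "AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK"

# Residues the abstract lists as identical to the homologous toxins.
REPORTED_CONSERVED = ["G9", "P10", "R13", "G19", "G29", "W30"]


def matches_label(seq: str, label: str) -> bool:
    """Check a label such as 'G9': residue letter plus 1-based position."""
    residue, position = label[0], int(label[1:])
    return 1 <= position <= len(seq) and seq[position - 1] == residue


def check_labels(seq: str, labels: list[str]) -> dict[str, bool]:
    """Map each label to whether the sequence carries that residue there."""
    return {label: matches_label(seq, label) for label in labels}


print("length:", len(SH_NI))
print("half-cystines:", SH_NI.count("C"))
print(check_labels(SH_NI, REPORTED_CONSERVED))
```

    A check like this is a quick way to confirm that a sequence string survived transcription intact before it is used in an alignment.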

  6. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, H.U.G.; Gray, J.W.

    1995-06-27

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.

  7. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, Heinz-Ulrich G.; Gray, Joe W.

    1995-01-01

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.

  8. Conservation Weighting Functions Enable Covariance Analyses to Detect Functionally Important Amino Acids

    PubMed Central

    Colwell, Lucy J.; Brenner, Michael P.; Murray, Andrew W.

    2014-01-01

    The explosive growth in the number of protein sequences gives rise to the possibility of using the natural variation in sequences of homologous proteins to find residues that control different protein phenotypes. Because in many cases different phenotypes are each controlled by a group of residues, the mutations that separate one version of a phenotype from another will be correlated. Here we incorporate biological knowledge about protein phenotypes and their variability in the sequence alignment of interest into algorithms that detect correlated mutations, improving their ability to detect the residues that control those phenotypes. We demonstrate the power of this approach using simulations and recent experimental data. Applying these principles to the protein families encoded by Dscam and Protocadherin allows us to make testable predictions about the residues that dictate the specificity of molecular interactions. PMID:25379728
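
    Correlated mutations between alignment columns are often scored with mutual information. The sketch below shows only that plain, unweighted score on a hypothetical toy alignment; it is not the conservation-weighted algorithm described in this abstract, whose details are not given here.

```python
from collections import Counter
from math import log2


def column(alignment: list[str], index: int) -> list[str]:
    """Extract one column (0-based) from equal-length aligned sequences."""
    return [seq[index] for seq in alignment]


def mutual_information(col_a: list[str], col_b: list[str]) -> float:
    """Mutual information in bits between two alignment columns."""
    if len(col_a) != len(col_b) or not col_a:
        raise ValueError("columns must be non-empty and of equal length")
    n = len(col_a)
    freq_a = Counter(col_a)
    freq_b = Counter(col_b)
    freq_ab = Counter(zip(col_a, col_b))
    total = 0.0
    for (a, b), count in freq_ab.items():
        p_ab = count / n
        total += p_ab * log2(p_ab / ((freq_a[a] / n) * (freq_b[b] / n)))
    return total


# Hypothetical toy alignment: columns 0 and 1 co-vary, column 2 is invariant.
toy_alignment = ["AKL", "AKL", "GEL", "GEL"]
print(mutual_information(column(toy_alignment, 0), column(toy_alignment, 1)))
print(mutual_information(column(toy_alignment, 0), column(toy_alignment, 2)))
```

    A fully conserved column carries no mutual information with any other column, which is one reason conservation has to be handled explicitly in covariance analyses.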

  9. Detection of Nucleic Acids with Graphene Nanopores: Ab Initio Characterization of a Novel Sequencing Device

    NASA Astrophysics Data System (ADS)

    Nelson, Tammie; Zhang, Bo; Prezhdo, Oleg

    2010-03-01

    We report an ab initio study of the interaction of two nucleobases, cytosine and adenine, with a novel graphene nanopore device for detecting the base sequence of a single-stranded nucleic acid (ssDNA or RNA). The nucleobases were inserted into a pore in a graphene nanoribbon, and the electrical current and conductance spectra were calculated as functions of voltage applied across the nanoribbon. The conductance spectra and charge densities were analyzed in the presence of each nucleobase in the graphene nanopore. The results indicate that, due to significant differences in the conductance spectra, the proposed device has adequate sensitivity to discriminate between different nucleotides. Moreover, we show that the conductance spectrum of a nucleotide is not affected by its orientation inside the graphene nanopore. The proposed technique may be extremely useful for real applications in developing ultrafast, low cost DNA sequencing methods.

  10. Morphological transformation of calcite crystal growth by prismatic "acidic" polypeptide sequences.

    SciTech Connect

    Kim, I; Giocondi, J L; Orme, C A; Collino, J; Evans, J S

    2007-02-13

    Many of the interesting mechanical and materials properties of the mollusk shell are thought to stem from the prismatic calcite crystal assemblies within this composite structure. It is now evident that proteins play a major role in the formation of these assemblies. Recently, a superfamily of 7 conserved prismatic layer-specific mollusk shell proteins, Asprich, were sequenced, and the 42 AA C-terminal sequence region of this protein superfamily was found to introduce surface voids or porosities on calcite crystals in vitro. Using AFM imaging techniques, we further investigate the effect that this 42 AA domain (Fragment-2) and its constituent subdomains, DEAD-17 and Acidic-2, have on the morphology and growth kinetics of calcite dislocation hillocks. We find that Fragment-2 adsorbs on terrace surfaces and pins acute steps, accelerates then decelerates the growth of obtuse steps, forms clusters and voids on terrace surfaces, and transforms calcite hillock morphology from a rhombohedral form to a rounded one. These results mirror yet are distinct from some of the earlier findings obtained for nacreous polypeptides. The subdomains Acidic-2 and DEAD-17 were found to accelerate then decelerate obtuse steps and induce oval rather than rounded hillock morphologies. Unlike DEAD-17, Acidic-2 does form clusters on terrace surfaces and exhibits stronger obtuse velocity inhibition effects than either DEAD-17 or Fragment-2. Interestingly, a 1:1 mixture of both subdomains induces an irregular polygonal morphology to hillocks, and exhibits the highest degree of acute step pinning and obtuse step velocity inhibition. This suggests that there is some interplay between subdomains within an intra (Fragment-2) or intermolecular (1:1 mixture) context, and sequence interplay phenomena may be employed by biomineralization proteins to exert net effects on crystal growth and morphology.

  11. Evolution of alpha-lactalbumins. The complete amino acid sequence of the alpha-lactalbumin from a marsupial (Macropus rufogriseus) and corrections to regions of sequence in bovine and goat alpha-lactalbumins.

    PubMed

    Shewale, J G; Sinha, S K; Brew, K

    1984-04-25

    alpha-Lactalbumin was purified from a whey protein fraction of the milk of the red-necked wallaby (Macropus rufogriseus). The complete amino acid sequence was determined from the results of automatic sequenator analyses of the intact protein, the three cyanogen bromide fragments, and of peptides generated from the larger, COOH-terminal CNBr fragment by digestion with trypsin or staphylococcal protease. This is the first sequence to be determined of an alpha-lactalbumin from a marsupial and differs from known eutherian alpha-lactalbumins in size and locations of deletions in alignments with the homologous type c lysozymes, as well as in having amino acid substitutions at 8 sites that are invariant in known eutherian proteins. Some corrections are also reported for two regions of sequence in both bovine and goat alpha-lactalbumins. The new and previously published information on alpha-lactalbumin sequences is analyzed in relation to the evolutionary history of the alpha-lactalbumin line as well as the relationship of structure to function in these proteins. PMID:6715332

  12. Spectroscopic analyses and studies on respective interaction of cyanuric acid and uric acid with bovine serum albumin and melamine

    NASA Astrophysics Data System (ADS)

    Chen, Dandan; Wu, Qiong; Wang, Jun; Wang, Qi; Qiao, Heng

    2015-01-01

    In this work, fluorescence quenching was used to study the interaction of cyanuric acid (CYA) and uric acid (UA) with bovine serum albumin (BSA) at two temperatures (283 K and 310 K). The bimolecular quenching constant (Kq), apparent quenching constant (Ksv), effective binding constant (KA) and corresponding dissociation constant (KD), number of binding sites (n) and binding distance (r) were calculated using the Stern-Volmer, Lineweaver-Burk, double-logarithm and overlap-integral equations. The results show that both CYA and UA bind to BSA, with CYA binding more weakly than UA (BSA + CYA < BSA + UA). The interactions of CYA and UA with melamine (MEL) were then studied under the same conditions using similar methods. The results indicate that both CYA and UA bind tightly to MEL. These results may facilitate understanding of the formation of kidney stones and gout in the body after ingestion of excess MEL.
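
    For orientation, the Stern-Volmer and double-logarithm relations named in this abstract are usually written in the textbook forms below. This is a sketch of the standard equations, not necessarily the exact forms used by the authors; the symbols F_0 and F (fluorescence intensity without and with quencher), [Q] (quencher concentration) and \tau_0 (fluorophore lifetime without quencher) are assumptions introduced here.

```latex
% Stern-Volmer relation: quenching constants Ksv and Kq
\frac{F_0}{F} = 1 + K_{sv}[Q] = 1 + K_q \tau_0 [Q]

% Double-logarithm relation: binding constant K_A and number of binding sites n
\log\frac{F_0 - F}{F} = \log K_A + n \log [Q]

% Dissociation constant as the reciprocal of the binding constant
K_D = \frac{1}{K_A}
```

    In this form, Ksv is the slope of F_0/F plotted against [Q], and n and log K_A are the slope and intercept of the double-logarithm plot.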

  13. A case study on discovery of novel Citrus leprosis virus cytoplasmic type 2 utilizing small RNA libraries by next generation sequencing and bioinformatic analyses

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The identification of novel plant viruses is a tricky matter. Most plant virus diagnostics are based on immunological or nucleic acid based assays, where prior characterization of the virus (either antibodies or genetic sequence) is required for reagent production. There are no universal nucleic a...

  14. Fast computational methods for predicting protein structure from primary amino acid sequence

    DOEpatents

    Agarwal, Pratul Kumar

    2011-07-19

    The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.

  15. Amino-terminal amino acid sequence of the major structural polypeptides of avian retroviruses: sequence homology between reticuloendotheliosis virus p30 and p30s of mammalian retroviruses.

    PubMed Central

    Hunter, E; Bhown, A S; Bennett, J C

    1978-01-01

    The major structural polypeptides, p30 of reticuloendotheliosis virus (REV) (strain T) and p27 of avian sarcoma virus B77, have been compared with regard to amino acid composition, NH2-terminal amino acid sequence, and immunological crossreactions. The amino acid composition of the two polypeptides is distinct, and a comparison of the first 30 NH2-terminal amino acids of REV p30 with that for the first 25 of B77 p27 yields only three homologous residues. In competition radioimmunoassays the polypeptides show no crossreactivity. A comparison of the amino acid composition and NH2-terminal amino acid sequence of REV p30 with those reported for several mammalian retrovirus p30s shows remarkable similarities. Both REV and mammalian p30s contain a large number of polar residues in their amino acid composition and show approximately 40% homology in the first 30 NH2-terminal amino acids. No crossreactivity could be observed, however, in competition radioimmunoassays between Rauscher murine leukemia virus p30 and that of REV. The observations reported here suggest a close evolutionary relationship between REV and the mammalian retroviruses. PMID:208072

  16. Purification and amino acid sequence of aminopeptidase P from pig kidney.

    PubMed

    Vergas Romero, C; Neudorfer, I; Mann, K; Schäfer, W

    1995-04-01

    Aminopeptidase P from kidney cortex was purified in high yield (recovery greater than or equal to 20%) by a series of column chromatographic steps after solubilization of the membrane-bound glycoprotein with n-butanol. A coupled enzymic assay, using Gly-Pro-Pro-NH-Nap as substrate and dipeptidyl-peptidase IV as auxiliary enzyme, was used to monitor the purification. The purification procedure yielded two forms of aminopeptidase P differing in their carbohydrate composition (glycoforms). Both enzyme preparations were homogeneous as assessed by SDS/PAGE, silver staining, and isoelectric focusing. Both forms possessed the same substrate specificity, catalysed the same reaction, and consisted of identical protein chains. The amino acid sequence determined by Edman degradation and mass spectrometry consisted of 623 amino acids. Six N-glycosylation sites, all contained in the N-terminal half of the protein, were characterized. PMID:7744038

  17. Draft Genome Sequence of Cupriavidus sp. Strain SK-3, a 4-Chlorobiphenyl- and 4-Clorobenzoic Acid-Degrading Bacterium

    PubMed Central

    Vilo, Claudia; Benedik, Michael J.; Ilori, Matthew

    2014-01-01

    We report the draft genome sequence of Cupriavidus sp. strain SK-3, which can use 4-chlorobiphenyl and 4-chlorobenzoic acid as the sole carbon source for growth. The draft genome sequence allowed the study of the polychlorinated biphenyl degradation mechanism and the recharacterization of the strain SK-3 as a Cupriavidus species. PMID:24994805

  18. Draft Genome Sequence of Bacillus subtilis subsp. natto Strain CGMCC 2108, a High Producer of Poly-γ-Glutamic Acid

    PubMed Central

    Tan, Siyuan; Su, Anping; Zhang, Chen; Ren, Yuanyuan

    2016-01-01

    Here, we report the 4.1-Mb draft genome sequence of Bacillus subtilis subsp. natto strain CGMCC 2108, a high producer of poly-γ-glutamic acid (γ-PGA). This sequence will provide further help for the biosynthesis of γ-PGA and will greatly facilitate research efforts in metabolic engineering of B. subtilis subsp. natto strain CGMCC 2108. PMID:27231363

  19. New monoclonal antibodies to the Ebola virus glycoprotein: Identification and analysis of the amino acid sequence of the variable domains.

    PubMed

    Panina, A A; Aliev, T K; Shemchukova, O B; Dement'yeva, I G; Varlamov, N E; Pozdnyakova, L P; Bokov, M N; Dolgikh, D A; Sveshnikov, P G; Kirpichnikov, M P

    2016-03-01

    We determined the nucleotide and amino acid sequences of variable domains of three new monoclonal antibodies to the glycoprotein of Ebola virus capsid. The framework and hypervariable regions of immunoglobulin heavy and light chains were identified. The primary structures were confirmed using mass spectrometry analysis. Immunoglobulin database search showed the uniqueness of the sequences obtained. PMID:27193713

  20. Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis subsp. lactis TOMSC161, Isolated from a Nonscalded Curd Pressed Cheese

    PubMed Central

    Velly, H.; Abraham, A.-L.; Loux, V.; Delacroix-Buchet, A.; Fonseca, F.; Bouix, M.

    2014-01-01

    Lactococcus lactis is a lactic acid bacterium used in the production of many fermented foods, such as dairy products. Here, we report the genome sequence of L. lactis subsp. lactis TOMSC161, isolated from nonscalded curd pressed cheese. This genome sequence provides information in relation to dairy environment adaptation. PMID:25377704

  1. Draft Genome Sequence of Bacillus subtilis subsp. natto Strain CGMCC 2108, a High Producer of Poly-γ-Glutamic Acid.

    PubMed

    Tan, Siyuan; Meng, Yonghong; Su, Anping; Zhang, Chen; Ren, Yuanyuan

    2016-01-01

    Here, we report the 4.1-Mb draft genome sequence of Bacillus subtilis subsp. natto strain CGMCC 2108, a high producer of poly-γ-glutamic acid (γ-PGA). This sequence will provide further help for the biosynthesis of γ-PGA and will greatly facilitate research efforts in metabolic engineering of B. subtilis subsp. natto strain CGMCC 2108. PMID:27231363

  2. Community Genomic and Proteomic Analyses of Chemoautotrophic Iron-Oxidizing "Leptospirillum rubarum" (Group II) and "Leptospirillum ferrodiazotrophum" (Group III) Bacteria in Acid Mine Drainage Biofilms

    SciTech Connect

    Goltsman, Daniela; Denef, Vincent; Singer, Steven; Verberkmoes, Nathan C; Lefsrud, Mark G; Mueller, Ryan; Dick, Gregory J.; Sun, Christine; Wheeler, Korin; Zelma, Adam; Baker, Brett J.; Hauser, Loren John; Land, Miriam L; Shah, Manesh B; Thelen, Michael P.; Hettich, Robert (Bob) L; Banfield, Jillian F.

    2009-01-01

    We analyzed near-complete population (composite) genomic sequences for coexisting acidophilic iron-oxidizing Leptospirillum group II and III bacteria (phylum Nitrospirae) and an extrachromosomal plasmid from a Richmond Mine, Iron Mountain, CA, acid mine drainage biofilm. Community proteomic analysis of the genomically characterized sample and two other biofilms identified 64.6% and 44.9% of the predicted proteins of Leptospirillum groups II and III, respectively, and 20% of the predicted plasmid proteins. The bacteria share 92% 16S rRNA gene sequence identity and >60% of their genes, including integrated plasmid-like regions. The extrachromosomal plasmid carries conjugation genes with detectable sequence similarity to genes in the integrated conjugative plasmid, but only those on the extrachromosomal element were identified by proteomics. Both bacterial groups have genes for community-essential functions, including carbon fixation and biosynthesis of vitamins, fatty acids, and biopolymers (including cellulose); proteomic analyses reveal these activities. Both Leptospirillum types have multiple pathways for osmotic protection. Although both are motile, signal transduction and methyl-accepting chemotaxis proteins are more abundant in Leptospirillum group III, consistent with its distribution in gradients within biofilms. Interestingly, Leptospirillum group II uses a methyl-dependent and Leptospirillum group III a methyl-independent response pathway. Although only Leptospirillum group III can fix nitrogen, these proteins were not identified by proteomics. The abundances of core proteins are similar in all communities, but the abundance levels of unique and shared proteins of unknown function vary. Some proteins unique to one organism were highly expressed and may be key to the functional and ecological differentiation of Leptospirillum groups II and III.

  3. ANTICALIgN: visualizing, editing and analyzing combined nucleotide and amino acid sequence alignments for combinatorial protein engineering.

    PubMed

    Jarasch, Alexander; Kopp, Melanie; Eggenstein, Evelyn; Richter, Antonia; Gebauer, Michaela; Skerra, Arne

    2016-07-01

    ANTICALIgN is an interactive software developed to simultaneously visualize, analyze and modify alignments of DNA and/or protein sequences that arise during combinatorial protein engineering, design and selection. ANTICALIgN combines powerful functions known from currently available sequence analysis tools with unique features for protein engineering, in particular the possibility to display and manipulate nucleotide sequences and their translated amino acid sequences at the same time. ANTICALIgN offers both template-based multiple sequence alignment (MSA), using the unmutated protein as reference, and conventional global alignment, to compare sequences that share an evolutionary relationship. The application of similarity-based clustering algorithms facilitates the identification of duplicates or of conserved sequence features among a set of selected clones. Imported nucleotide sequences from DNA sequence analysis are automatically translated into the corresponding amino acid sequences and displayed, offering numerous options for selecting reading frames, highlighting of sequence features and graphical layout of the MSA. The MSA complexity can be reduced by hiding the conserved nucleotide and/or amino acid residues, thus putting emphasis on the relevant mutated positions. ANTICALIgN is also able to handle suppressed stop codons or even to incorporate non-natural amino acids into a coding sequence. We demonstrate crucial functions of ANTICALIgN in an example of Anticalins selected from a lipocalin random library against the fibronectin extradomain B (ED-B), an established marker of tumor vasculature. Apart from engineered protein scaffolds, ANTICALIgN provides a powerful tool in the area of antibody engineering and for directed enzyme evolution. PMID:27261456

  4. Formation Sequences of Iron Minerals in the Acidic Alteration Products and Variation of Hydrothermal Fluid Conditions

    NASA Astrophysics Data System (ADS)

    Isobe, H.; Yoshizawa, M.

    2008-12-01

    Iron minerals have an important role in environmental issues not only on the Earth but also on other terrestrial planets. Iron mineral species related to alteration products of primary minerals with surface or subsurface fluids are characterized by temperature, acidity and redox conditions of the fluids. Various iron-bearing alteration products can be seen around fumaroles in geothermal/volcanic areas. In this study, zonal structures of iron minerals in alteration products of the geothermal area are observed to elucidate temporal and spatial variation of hydrothermal fluids. The pyroxene-amphibole andesite of Garan-dake volcano, Oita, Japan, is altered by acidic hydrothermal fluid, which leaches out elements other than Si to form cristobalite. Hand specimens with an unaltered or weakly altered core and a cristobalite crust show various sequences of layers. XRD analysis revealed that the alteration degree is represented by the abundance of cristobalite. Intermediately altered layers are characterized by the occurrence of minerals including alunite, pyrite, kaolinite, goethite and hematite. A specimen with a reddish brown core surrounded by a cristobalite-rich white crust has brown colored layers at the boundary of the core and the crust. The reddish core is characterized by the occurrence of crystalline hematite by XRD. Another hand specimen has a light gray core, which represents reduced conditions, and a white cristobalite crust with light brown and reddish brown layers of ferric iron minerals between the core and the crust. On the other hand, hornblende crystals, a typical ferrous iron-bearing mineral of the host rock, are well preserved in some samples with strongly decolorized cristobalite-rich groundmass. Hydrothermal alteration experiments on iron-rich basaltic material show that iron mineral species depend on the acidity and temperature of the fluid. Oxidation states of the iron-bearing mineral species are strongly influenced by the acidity and redox conditions.
Variations of alteration

  5. Data on the evolutionary history of the V(D)J recombination-activating protein 1 - RAG1 coupled with sequence and variant analyses.

    PubMed

    Kumar, Abhishek; Bhandari, Anita; Sarde, Sandeep J; Muppavarapu, Sekhar; Tandon, Ravi

    2016-09-01

    RAG1 protein is one of the key components of the RAG complex regulating V(D)J recombination. There are only a few studies of RAG1 concerning its evolutionary history, detailed sequence and mutational hotspots. Herein, we present our datasets used for the recent comprehensive study of RAG1 based on sequence, phylogenetic and genetic variant analyses (Kumar et al., 2015) [1]. Protein sequence alignment helped in characterizing the conserved domains and regions of RAG1. It also aided in unraveling ancestral RAG1 in the sea urchin. Human genetic variant analyses revealed 751 mutational hotspots, located both in the coding and the non-coding regions. For further analysis and discussion, see (Kumar et al., 2015) [1]. PMID:27284568

  6. Application of molecular methods for analysing the distribution and diversity of acetic acid bacteria in Chilean vineyards.

    PubMed

    Prieto, Carmen; Jara, Carla; Mas, Albert; Romero, Jaime

    2007-04-20

    The presence of acetic acid bacteria populations on grape surfaces from several Chilean valleys is reported. The bacteria were analysed at both the species and the strain level by molecular methods such as RFLP-PCR 16S rRNA gene, RFLP-PCR ITS 16S-23S rRNA gene regions and Arbitrary Primed (AP) PCR. Our results show that there are limited numbers of species of acetic acid bacteria in the grapes and that there is a need for an enrichment medium before plating to recover the individual colonies. In the northernmost region analysed, the major species recovered was a non-acetic acid bacterium, Stenotrophomonas maltophilia. Following the North-South axis of Chilean valleys, the observed distribution of acetic acid bacteria was zoned: Acetobacter cerevisiae was only present in the North and Gluconobacter oxydans in the South. Both species were recovered together in only one location. The influence of the grape cultivar was negligible. Variability in strains was found to be high (more than 40%) for both Acetobacteraceae species. PMID:17289199

  7. Draft Genome Sequences of Gluconobacter cerinus CECT 9110 and Gluconobacter japonicus CECT 8443, Acetic Acid Bacteria Isolated from Grape Must

    PubMed Central

    Sainz, Florencia

    2016-01-01

    We report here the draft genome sequences of Gluconobacter cerinus strain CECT9110 and Gluconobacter japonicus CECT8443, acetic acid bacteria isolated from grape must. Gluconobacter species are well known for their ability to oxidize sugar alcohols into the corresponding acids. Our objective was to select strains that oxidize d-glucose effectively. PMID:27365351

  8. DEVELOPMENT OF THE INDUSTRIAL COMBUSTION EMISSIONS MODEL FOR ACID RAIN ANALYSES

    EPA Science Inventory

    The paper discusses forecasts of industrial combustion emissions being developed by the U.S. EPA as part of the National Acid Precipitation Assessment Program (NAPAP). The Industrial Combustion Emissions (ICE) Model will estimate sulfur dioxide (SO2), nitrogen oxides (NOx), and p...

  9. Functional analyses of carnivorous plant-specific amino acid residues in S-like ribonucleases.

    PubMed

    Arai, Naoki; Nishimura, Emi; Kikuchi, Yo; Ohyama, Takashi

    2015-09-11

    Unlike plants with no carnivory, carnivorous plants seem to use S-like ribonucleases (RNases) as an enzyme for carnivory. Carnivorous plant-specific conserved amino acid residues are present at four positions around the conserved active site (CAS). The roles of these conserved amino acid residues in the enzymatic function were explored in the current study by preparing five recombinant variants of DA-I, the S-like RNase of Drosera adelae. The kcat and kcat/Km values of the enzymes revealed that among the four variants with a single mutation, the serine to glycine mutation at position 111 most negatively influenced the enzymatic activity. The change in the bulkiness of the amino acid residue side-chain seemed to be the major cause of the above effect. Modeling of the three dimensional (3D) structures strongly suggested that the S to G mutation at 111 greatly altered the overall enzyme conformation. The conserved four amino acid residues are likely to function in keeping the two histidine residues, which are essential for the cleavage of RNA strands, and the CAS in the most functional enzymatic conformation. PMID:26235877

  10. Lysosomal acid lipase deficiency in rats: Lipid analyses and lipase activities in liver and spleen

    SciTech Connect

    Kuriyama, M.; Yoshida, H.; Suzuki, M.; Fujiyama, J.; Igata, A.

    1990-09-01

    We report the biological characterization of an animal model of a genetic lipid storage disease analogous to human Wolman's disease. Affected rats accumulated cholesteryl esters (13.3-fold), free cholesterol (2.8-fold), and triglycerides (5.4-fold) in the liver, as well as cholesteryl esters (2.5-fold) and free cholesterol (1.33-fold) in the spleen. Triglycerides did not accumulate, and the levels actually decreased in the spleen. Analysis of the fatty acid composition of the cholesteryl esters and triglycerides showed high percentages of linoleic acid (18:2) and arachidonic acid (20:4) in both organs, especially in the liver. No accumulation of phospholipids, neutral glycosphingolipids, or gangliosides was found in the affected rats. Acid lipase activity for (14C)triolein, (14C)cholesteryl oleate, and 4-methyl-umbelliferyl oleate was deficient in both the liver and spleen of affected rats. Lipase activity at neutral pH was normal in both liver and spleen. Heterozygous rats showed intermediate utilization of these substrates in both organs at levels between those for affected rats and those for normal controls, although they did not accumulate any lipids. These data suggest that these rats represent an animal counterpart of Wolman's disease in humans.

  11. Quantitative analyses of tartaric acid based on terahertz time domain spectroscopy

    NASA Astrophysics Data System (ADS)

    Cao, Binghua; Fan, Mengbao

    2010-10-01

    Terahertz waves occupy the part of the electromagnetic spectrum between microwaves and infrared waves. Quantitative analysis based on terahertz spectroscopy is very important for the application of terahertz techniques, but how to realize it is still under study. L-tartaric acid is widely used as an acidulant in beverages and other foods, such as soft drinks, wine, candy, bread and some colloidal sweetmeats. In this paper, terahertz time-domain spectroscopy is applied to quantify tartaric acid. Two methods are employed to process the terahertz spectra of samples with different tartaric acid contents. The first is linear regression combined with correlation analysis. The second is partial least squares (PLS), in which the absorption spectra in the 0.8-1.4 THz region are used to quantify the tartaric acid. To compare the performance of the two methods, their relative errors are analyzed. For this experiment, the first method does better than the second. However, the first method is suitable for the quantitative analysis of materials that have obvious terahertz absorption peaks, while for materials without obvious terahertz absorption peaks, the second is more appropriate.
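
    The first method described above (a linear calibration on an absorption peak, scored by relative error) can be sketched roughly as follows. This is a minimal illustration, not the authors' procedure: the function names are hypothetical, the calibration values in the usage note are invented purely for demonstration, and the correlation-analysis and PLS steps are omitted.

```python
import numpy as np


def fit_calibration(content, peak_absorbance):
    """Least-squares line: peak_absorbance = slope * content + intercept."""
    slope, intercept = np.polyfit(content, peak_absorbance, 1)
    return float(slope), float(intercept)


def predict_content(absorbance, slope, intercept):
    """Invert the calibration line to estimate content from a measured peak."""
    return (absorbance - intercept) / slope


def relative_error(predicted, actual):
    """Relative error used to compare quantification methods."""
    return abs(predicted - actual) / abs(actual)
```

    For example, with made-up calibration points `content = [0.1, 0.2, 0.3]` and `peak_absorbance = [0.2, 0.4, 0.6]`, `fit_calibration` returns a slope close to 2 and an intercept close to 0, and `predict_content(0.5, slope, intercept)` gives roughly 0.25.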

  12. Swfoldrate: predicting protein folding rates from amino acid sequence with sliding window method.

    PubMed

    Cheng, Xiang; Xiao, Xuan; Wu, Zhi-cheng; Wang, Pu; Lin, Wei-zhong

    2013-01-01

    Protein folding is the process by which a protein progresses from its denatured state to its specific biologically active conformation. Understanding the relationship between sequences and the folding rates of proteins remains an important challenge. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. In this study, long-range and short-range contacts in proteins were used to derive an extended version of the pseudo amino acid composition based on a sliding window method. This method is capable of predicting protein folding rates from the amino acid sequence alone, without the aid of any structural class information. We systematically studied the contributions of individual features to folding rate prediction. The optimal feature selection procedure combines forward feature selection and sequential backward selection. The method was demonstrated on a large dataset using the jackknife cross-validation test. The predictor was built on multitudinous physicochemical and statistical features of proteins using a nonlinear support vector machine (SVM) regression model, and it obtained excellent agreement between predicted and experimentally observed folding rates of proteins. The correlation coefficient is 0.9313 and the standard error is 2.2692. The prediction server is freely available at http://www.jci-bioinfo.cn/swfrate/input.jsp. PMID:22933332
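
    To make the idea of a sliding-window sequence feature concrete, here is a deliberately simplified sketch. It is not the Swfoldrate feature set (which the abstract says extends the pseudo amino acid composition with contact information and physicochemical features); it only averages plain amino acid composition over overlapping windows, and the function names and default window size are assumptions chosen for illustration.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def window_composition(window):
    """Fraction of each standard amino acid within one window."""
    counts = Counter(window)
    return [counts.get(aa, 0) / len(window) for aa in AMINO_ACIDS]


def sliding_window_features(sequence, window_size=5):
    """Average the per-window composition over all overlapping windows."""
    if window_size <= 0 or window_size > len(sequence):
        raise ValueError("window_size must be between 1 and len(sequence)")
    windows = [
        sequence[start:start + window_size]
        for start in range(len(sequence) - window_size + 1)
    ]
    compositions = [window_composition(w) for w in windows]
    return [
        sum(comp[j] for comp in compositions) / len(compositions)
        for j in range(len(AMINO_ACIDS))
    ]
```

    The result is a fixed-length vector of 20 values regardless of sequence length, which is the property that lets such features feed a regression model such as an SVM.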

  13. From amino acid sequence to bioactivity: The biomedical potential of antitumor peptides.

    PubMed

    Blanco-Míguez, Aitor; Gutiérrez-Jácome, Alberto; Pérez-Pérez, Martín; Pérez-Rodríguez, Gael; Catalán-García, Sandra; Fdez-Riverola, Florentino; Lourenço, Anália; Sánchez, Borja

    2016-06-01

    Chemoprevention is the use of natural and/or synthetic substances to block, reverse, or retard the process of carcinogenesis. In this field, the use of antitumor peptides is of interest because (i) these molecules are small in size, (ii) they show good cell diffusion and permeability, (iii) they affect one or more specific molecular pathways involved in carcinogenesis, and (iv) they are not usually genotoxic. We searched the Web of Science Database (23/11/2015) to collect papers reporting on bioactive peptides (1691 records), which were further filtered by searching for terms such as "antiproliferative," "antitumoral," or "apoptosis" among others. Works reporting the amino acid sequence of an antiproliferative peptide were kept (60 records), and this was complemented with the peptides included in CancerPPD, an extensive resource for antiproliferative peptides and proteins. Peptides were grouped according to one of the following mechanisms of action: inhibition of cell migration, inhibition of tumor angiogenesis, antioxidative mechanisms, inhibition of gene transcription/cell proliferation, induction of apoptosis, disorganization of tubulin structure, cytotoxicity, or unknown mechanisms. The main mechanisms of action of those antiproliferative peptides with known amino acid sequences are presented and, finally, their potential clinical usefulness and future challenges in their application are discussed. PMID:27010507

  14. The amino acid sequences and activities of synergistic hemolysins from Staphylococcus cohnii.

    PubMed

    Mak, Pawel; Maszewska, Agnieszka; Rozalska, Malgorzata

    2008-10-01

    Staphylococcus cohnii ssp. cohnii and S. cohnii ssp. urealyticus are coagulase-negative staphylococci long considered unable to cause infections. This situation changed recently, and pathogenic strains of these bacteria were isolated from hospital environments, patients and medical staff. Most of the isolated strains were resistant to many antibiotics. The present work describes the isolation and characterization of several synergistic peptide hemolysins produced by these bacteria and acting as virulence factors responsible for hemolytic and cytotoxic activities. Amino acid sequences of the respective hemolysins from S. cohnii ssp. cohnii (named H1C, H2C and H3C) and S. cohnii ssp. urealyticus (H1U, H2U and H3U) were identical. Peptides H1 and H3 possessed significant amino acid homology to three synergistic hemolysins secreted by Staphylococcus lugdunensis and to a putative antibacterial peptide produced by Staphylococcus saprophyticus ssp. saprophyticus. On the other hand, hemolysin H2 had a unique sequence. All isolated peptides lysed red cells from different mammalian species and exerted a cytotoxic effect on human fibroblasts. PMID:18752624

  15. Clostridium sticklandii, a specialist in amino acid degradation: revisiting its metabolism through its genome sequence

    PubMed Central

    2010-01-01

    Background Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabolic aspects such as the Stickland reaction, coenzyme-B12- and selenium-dependent reactions of amino acids. With the goal of revisiting its carbon, nitrogen, and energy metabolism, and comparing studies with other clostridia, its genome has been sequenced and analyzed. Results C. sticklandii is one of the best biochemically studied proteolytic clostridial species. Useful additional information has been obtained from the sequencing and annotation of its genome, which is presented in this paper. Besides, experimental procedures reveal that C. sticklandii degrades amino acids in a preferential and sequential way. The organism prefers threonine, arginine, serine, cysteine, proline, and glycine, whereas glutamate, aspartate and alanine are excreted. Energy conservation is primarily obtained by substrate-level phosphorylation in fermentative pathways. The reactions catalyzed by different ferredoxin oxidoreductases and the exergonic NADH-dependent reduction of crotonyl-CoA point to a possible chemiosmotic energy conservation via the Rnf complex. C. sticklandii possesses both the F-type and V-type ATPases. The discovery of an as yet unrecognized selenoprotein in the D-proline reductase operon suggests a more detailed mechanism for NADH-dependent D-proline reduction. A rather unusual metabolic feature is the presence of genes for all the enzymes involved in two different CO2-fixation pathways: C. sticklandii harbours both the glycine synthase/glycine reductase and the Wood-Ljungdahl pathways. This unusual pathway combination has retrospectively been observed in only four other sequenced microorganisms. Conclusions Analysis of the C. sticklandii genome and

  16. Complete amino acid sequence of the myoglobin from the Pacific spotted dolphin, Stenella attenuata graffmani.

    PubMed

    Jones, B N; Wang, C C; Dwulet, F E; Lehman, L D; Meuth, J L; Bogardt, R A; Gurd, F R

    1979-04-25

    The complete amino acid sequence of the major component myoglobin from the Pacific spotted dolphin, Stenella attenuata graffmani, was determined by the automated Edman degradation of several large peptides obtained by specific cleavage of the protein. The acetimidated apomyoglobin was selectively cleaved at its two methionyl residues with cyanogen bromide and at its three arginyl residues by trypsin. By subjecting four of these peptides and the apomyoglobin to automated Edman degradation, over 80% of the primary structure of the protein was obtained. The remainder of the covalent structure was determined by the sequence analysis of peptides that resulted from further digestion of the central cyanogen bromide fragment. This fragment was cleaved at its glutamyl residues with staphylococcal protease and its lysyl residues with trypsin. The action of trypsin was restricted to the lysyl residues by chemical modification of the single arginyl residue of the fragment with 1,2-cyclohexanedione. The primary structure of this myoglobin proved to be identical with that from the Atlantic bottlenosed dolphin and Pacific common dolphin but differs from the myoglobins of the killer whale and pilot whale at two positions. The above sequence identities and differences reflect the close taxonomic relationship of these five species of Cetacea. PMID:454657

  17. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon.

    PubMed Central

    Yu, J H; Eng, J; Yalow, R S

    1990-01-01

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled pork insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report we describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. We demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in our immunoassay system is only a few percent of that of human insulin. Squirrel monkey glucagon is identical with the usual glucagon found in Old World mammals, which predicts that the glucagons of other New World monkeys would not differ from the usual Old World mammalian glucagon. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species. PMID:2263627

  18. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon

    SciTech Connect

    Yu, Jinghua; Eng, J.; Yalow, R.S. (City Univ. of New York, NY)

    1990-12-01

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled pork insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report the authors describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. They demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in their immunoassay system is only a few percent of that of human insulin. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species.

  19. Binding site discovery from nucleic acid sequences by discriminative learning of hidden Markov models

    PubMed Central

    Maaskola, Jonas; Rajewsky, Nikolaus

    2014-01-01

    We present a discriminative learning method for pattern discovery of binding sites in nucleic acid sequences based on hidden Markov models. Sets of positive and negative example sequences are mined for sequence motifs whose occurrence frequency varies between the sets. The method offers several objective functions, but we concentrate on mutual information of condition and motif occurrence. We perform a systematic comparison of our method and numerous published motif-finding tools. Our method achieves the highest motif discovery performance, while being faster than most published methods. We present case studies of data from various technologies, including ChIP-Seq, RIP-Chip and PAR-CLIP, of embryonic stem cell transcription factors and of RNA-binding proteins, demonstrating practicality and utility of the method. For the alternative splicing factor RBM10, our analysis finds motifs known to be splicing-relevant. The motif discovery method is implemented in the free software package Discrover. It is applicable to genome- and transcriptome-scale data, makes use of available repeat experiments and aside from binary contrasts also more complex data configurations can be utilized. PMID:25389269
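
    The objective the abstract emphasizes, mutual information between condition (positive versus negative sequence set) and motif occurrence, can be illustrated on a simple 2x2 count table. The sketch below is an assumption-laden simplification: it treats motif occurrence as a yes/no property of each sequence and uses a hypothetical function name; it is not the Discrover implementation, which learns hidden Markov models rather than scoring fixed counts.

```python
import math


def mutual_information(pos_with, pos_total, neg_with, neg_total):
    """Mutual information (bits) between set membership and motif presence.

    pos_with / neg_with: sequences containing the motif in each set.
    pos_total / neg_total: total sequences in each set.
    """
    n = pos_total + neg_total
    table = [
        [pos_with, pos_total - pos_with],  # positive set: with, without
        [neg_with, neg_total - neg_with],  # negative set: with, without
    ]
    row_sums = [sum(row) for row in table]
    col_sums = [table[0][j] + table[1][j] for j in range(2)]
    mi = 0.0
    for i in range(2):
        for j in range(2):
            count = table[i][j]
            if count > 0:  # empty cells contribute nothing
                joint = count / n
                # joint / (marginal_i * marginal_j) = count * n / (row * col)
                mi += joint * math.log2(count * n / (row_sums[i] * col_sums[j]))
    return mi
```

    A motif found equally often in both sets scores zero, while a motif present in every positive sequence and no negative sequence reaches the maximum for equally sized sets, which is why such a score can rank candidate motifs by how well they discriminate the two sets.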

  20. Robust gene expression and mutation analyses of RNA-sequencing of formalin-fixed diagnostic tumor samples.

    PubMed

    Graw, Stefan; Meier, Richard; Minn, Kay; Bloomer, Clark; Godwin, Andrew K; Fridley, Brooke; Vlad, Anda; Beyerlein, Peter; Chien, Jeremy

    2015-01-01

    Current genomic studies are limited by the availability of fresh tissue samples. Here, we show that Illumina RNA sequencing of formalin-fixed diagnostic tumor samples produces gene expression that is strongly correlated with matched frozen tumor samples (r > 0.89). In addition, sequence variations identified from FFPE RNA show 99.67% concordance with that from exome sequencing of matched frozen tumor samples. Because FFPE is a routine diagnostic sample preparation, the feasibility results reported here will facilitate the setup of large-scale research and clinical studies in medical genomics that are currently limited by the availability of fresh frozen samples. PMID:26202458

  1. Sequence and Transcriptional Analyses of the Fish Retroviruses Walleye Epidermal Hyperplasia Virus Types 1 and 2: Evidence for a Gene Duplication

    PubMed Central

    LaPierre, Lorie A.; Holzschu, Donald L.; Bowser, Paul R.; Casey, James W.

    1999-01-01

    Walleye epidermal hyperplasia virus types 1 and 2 (WEHV1 and WEHV2, respectively) are associated with a hyperproliferative skin lesion on walleyes that appears and regresses seasonally. We have determined the complete nucleotide sequences and transcriptional profiles of these viruses. WEHV1 and WEHV2 are large, complex retroviruses of 12,999 and 13,125 bp in length, respectively, that are closely related to one another and to walleye dermal sarcoma virus (WDSV). These walleye retroviruses contain three open reading frames, orfA, orfB, and orfC, in addition to gag, pol, and env. orfA and orfB are adjacent to one another and located downstream of env. The OrfA proteins were previously identified as cyclin D homologs that may contribute to the induction of cell proliferation leading to epidermal hyperplasia and dermal sarcoma. The sequence analysis of WEHV1 and WEHV2 revealed that the OrfB proteins are distantly related to the OrfA proteins, suggesting that orfB arose by gene duplication. Presuming that the precursor of orfA and orfB was derived from a cellular cyclin, these genes are the first accessory genes of complex retroviruses that can be traced to a cellular origin. WEHV1, WEHV2, and WDSV are the only retroviruses that have an open reading frame, orfC, of considerable size (ca. 130 amino acids) in the leader region preceding gag. While we were unable to predict a function for the OrfC proteins, they are more conserved than OrfA and OrfB, suggesting that they may be biologically important to the viruses. The transcriptional profiles of WEHV1 and WEHV2 were also similar to that of WDSV; Northern blot analyses detected only low levels of the orfA transcripts in developing lesions, whereas abundant levels of genomic, env, orfA, and orfB transcripts were detected in regressing lesions. The splice donors and acceptors of individual transcripts were identified by reverse transcriptase PCR. The similarities of WEHV1, WEHV2, and WDSV suggest that these viruses use

  2. Nucleotide and derived amino acid sequences of the major porin of Comamonas acidovorans and comparison of porin primary structures.

    PubMed Central

    Gerbl-Rieger, S; Peters, J; Kellermann, J; Lottspeich, F; Baumeister, W

    1991-01-01

    The DNA sequence of the gene which codes for the major outer membrane porin (Omp32) of Comamonas acidovorans has been determined. The structural gene encodes a precursor consisting of 351 amino acid residues with a signal peptide of 19 amino acid residues. Comparisons with amino acid sequences of outer membrane proteins and porins from several other members of the class Proteobacteria and of the Chlamydia trachomatis porin and the Neurospora crassa mitochondrial porin revealed a motif of eight regions of local homology. The results of this analysis are discussed with regard to common structural features of porins. PMID:1848840

  3. Nucleotide sequence analysis with polynucleotide kinase and nucleotide 'mapping' methods. 5′-Terminal sequence of deoxyribonucleic acid from bacteriophages λ and 424

    PubMed Central

    Murray, Kenneth

    1973-01-01

    The polynucleotide kinase reaction was used in analyses of complex mixtures of oligodeoxynucleotides which were fractionated by various two-dimensional nucleotide 'mapping' procedures. Parallel ionophoretic analyses on DEAE-cellulose paper, pH 2, and AE-cellulose paper, pH 3.5, of venom phosphodiesterase partial digests of 5′-terminally labelled oligonucleotides enabled the sequence of the nucleotides to be deduced uniquely. A 'diagonal ionophoresis' method has been used with mixtures of nucleotides. Application of these methods to 5′-terminally labelled DNA from bacteriophage λ gave the terminal sequences pA-G-G-T-C-G and pG-G-G-C-G. Identical 5′-terminal sequences were found with DNA from bacteriophage 424. PMID:4352720

  4. Molecular cloning of the α-subunit of human prolyl 4-hydroxylase: The complete cDNA-derived amino acid sequence and evidence for alternative splicing of RNA transcripts

    SciTech Connect

    Helaakoski, T.; Vuori, K.; Myllylae, R.; Kivirikko, K.I.; Pihlajaniemi, T.

    1989-06-01

    Prolyl 4-hydroxylase, an α₂β₂ tetramer, catalyzes the formation of 4-hydroxyproline in collagens by the hydroxylation of proline residues in peptide linkages. The authors report here on the isolation of cDNA clones encoding the α-subunit of the enzyme from human tumor HT-1080, placenta, and fibroblast cDNA libraries. Eight overlapping clones covering almost all of the corresponding 3,000-nucleotide mRNA, including all the coding sequences, were characterized. These clones encode a polypeptide of 517 amino acid residues and a signal peptide of 17 amino acids. Previous characterization of cDNA clones for the β-subunit of prolyl 4-hydroxylase has indicated that its C terminus has the amino acid sequence Lys-Asp-Gly-Leu, which, it has been suggested, is necessary for the retention of a polypeptide within the lumen of the endoplasmic reticulum. The α-subunit does not have this C-terminal sequence, and thus one function of the β-subunit in the prolyl 4-hydroxylase tetramer appears to be to retain the enzyme within this cell organelle. Southern blot analyses of human genomic DNA with a cDNA probe for the α-subunit suggested the presence of only one gene encoding the two types of mRNA, which appear to result from mutually exclusive alternative splicing of primary transcripts of one gene.

  5. Amino acid sequence analysis and characterization of a ribonuclease from starfish Asterias amurensis.

    PubMed

    Motoyoshi, Naomi; Kobayashi, Hiroko; Itagaki, Tadashi; Inokuchi, Norio

    2016-09-01

    The aim of this study was to phylogenetically characterize the location of the RNase T2 enzyme in the starfish (Asterias amurensis). We isolated an RNase T2 ribonuclease (RNase Aa) from the ovaries of starfish and determined its amino acid sequence by protein chemistry and cloning cDNA encoding RNase Aa. The isolated protein had 231 amino acid residues, a predicted molecular mass of 25,906 Da, and an optimal pH of 5.0. RNase Aa preferentially released guanylic acid from the RNA. The catalytic sites of the RNase T2 family are conserved in RNase Aa; furthermore, the distribution of the cysteine residues in RNase Aa is similar to that in other animal and plant T2 RNases. RNase Aa is cleaved at two points: 21 residues from the N-terminus and 29 residues from the C-terminus; however, both fragments may remain attached to the protein via disulfide bridges, leading to the maintenance of its conformation, as suggested by circular dichroism spectrum analysis. The phylogenetic analysis revealed that starfish RNase Aa is evolutionarily an intermediate between protozoan and oyster RNases. PMID:26920046

  6. An intronic peroxisome proliferator-activated receptor-binding sequence mediates fatty acid induction of the human carnitine palmitoyltransferase 1A.

    PubMed

    Napal, Laura; Marrero, Pedro F; Haro, Diego

    2005-12-01

    The liver plays a central role in the response to fasting. The hormonal profile in this condition, low insulin, and high concentrations of glucagon in plasma, induce the release of large amounts of fatty acids from adipose tissue. Prolonged starvation can therefore induce a dramatic change in the fatty acid oxidative capacity of liver metabolism. Modulation of gene expression by PPARalpha plays a crucial role in this response. While a major role for PPARalpha in the liver is to produce ketone bodies as fuel through beta-oxidation for peripheral tissues during fast, its participation in the control of CPT1A, the rate-limiting step of the pathway, remains controversial. Using Web-based software (VISTA) combining transcription factor binding site database searches with comparative sequence analyses, we have localized a conserved functional PPAR responsive element downstream of the transcriptional start site of the human CPT1A gene. We have shown that this sequence is fundamental for fatty acids or PGC1-induced transcriptional activation of the CPT1A gene. These results corroborate the hypothesis that PPARalpha regulates the limiting step in the oxidation of fatty acids in liver mitochondria. PMID:16271724

  7. A Bacterial Analysis Platform: An Integrated System for Analysing Bacterial Whole Genome Sequencing Data for Clinical Diagnostics and Surveillance

    PubMed Central

    Ahrenfeldt, Johanne; Cisneros, Jose Luis Bellod; Jurtz, Vanessa; Larsen, Mette Voldby; Hasman, Henrik; Aarestrup, Frank Møller; Lund, Ole

    2016-01-01

    Recent advances in whole genome sequencing have made the technology available for routine use in microbiological laboratories. However, a major obstacle for using this technology is the availability of simple and automatic bioinformatics tools. Based on previously published and already available web-based tools we developed a single pipeline for batch uploading of whole genome sequencing data from multiple bacterial isolates. The pipeline will automatically identify the bacterial species and, if applicable, assemble the genome, identify the multilocus sequence type, plasmids, virulence genes and antimicrobial resistance genes. A short printable report for each sample will be provided and an Excel spreadsheet containing all the metadata and a summary of the results for all submitted samples can be downloaded. The pipeline was benchmarked using datasets previously used to test the individual services. The reported results enable a rapid overview of the major results, and comparing that to the previously found results showed that the platform is reliable and able to correctly predict the species and find most of the expected genes automatically. In conclusion, a combined bioinformatics platform was developed and made publicly available, providing easy-to-use automated analysis of bacterial whole genome sequencing data. The platform may be of immediate relevance as a guide for investigators using whole genome sequencing for clinical diagnostics and surveillance. The platform is freely available at: https://cge.cbs.dtu.dk/services/CGEpipeline-1.1 and it is the intention that it will continue to be expanded with new features as these become available. PMID:27327771

  8. Phylogenetic analyses of Chlamydia psittaci strains from birds based on 16S rRNA gene sequence.

    PubMed Central

    Takahashi, T; Masuda, M; Tsuruno, T; Mori, Y; Takashima, I; Hiramune, T; Kikuchi, N

    1997-01-01

    The nucleotide sequences of 16S ribosomal DNA (rDNA) were determined for 39 strains of Chlamydia psittaci (34 from birds and 5 from mammals) and for 4 Chlamydia pecorum strains. The sequences were compared phylogenetically with the gene sequences of nine Chlamydia strains (covering four species of the genus) retrieved from nucleotide databases. In the neighbor-joining tree, C. psittaci strains were more closely related to each other than to the other Chlamydia species, although a feline pneumonitis strain was distinct (98.3 to 98.6% similarity to other strains) and appeared to form the deepest subline within the species of C. psittaci (bootstrap value, 99%). The other strains of C. psittaci exhibiting similarity values of more than 99% were branched into several subgroups. Two pigeon strains and one turkey strain formed a distinct clade recovered in 97% of the bootstrapped trees. The other pigeon strains seemed to be distinct from the strains from psittacine birds, with 88% of bootstrap value. In the cluster of psittacine strains, three parakeet strains and an ovine abortion strain exhibited a specific association (level of sequence similarity, 99.9% or more; bootstrap value, 95%). These suggest that at least four groups of strains exist within the species C. psittaci. The 16S rDNA sequence is a valuable phylogenetic marker for the taxonomy of chlamydiae, and its analysis is a reliable tool for identification of the organisms. PMID:9350757
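
    The similarity values quoted in this abstract are pairwise percentages between aligned 16S rDNA sequences. The abstract does not say how gaps or ambiguous bases were treated, so the sketch below makes one simple assumption: two sequences from the same alignment are compared column by column, and columns with a gap in either sequence are skipped. The function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity between two rows of the same alignment.

    Assumption for illustration: columns where either sequence has a gap
    ('-') are excluded from both the numerator and the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Keep only columns where both sequences have a residue
    pairs = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x != "-" and y != "-"
    ]
    if not pairs:
        raise ValueError("no comparable columns")
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)
```

    Other conventions (for example, counting gaps as mismatches) give different percentages, so values are only comparable when computed the same way.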

  9. Global Transcriptome and Mutagenic Analyses of the Acid Tolerance Response of Salmonella enterica Serovar Typhimurium

    PubMed Central

    Ryan, Daniel; Pati, Niladri Bhusan; Ojha, Urmesh K.; Padhi, Chandrashekhar; Ray, Shilpa; Jaiswal, Sangeeta; Singh, Gajinder P.; Mannala, Gopala K.; Schultze, Tilman; Chakraborty, Trinad

    2015-01-01

    Salmonella enterica serovar Typhimurium (S. Typhimurium) is one of the leading causative agents of food-borne bacterial gastroenteritis. Swift invasion through the intestinal tract and successful establishment in systemic organs are associated with the adaptability of S. Typhimurium to different stress environments. Low-pH stress serves as one of the first lines of defense in mammalian hosts, which S. Typhimurium must efficiently overcome to establish an infection. Therefore, a better understanding of the molecular mechanisms underlying the adaptability of S. Typhimurium to acid stress is highly relevant. In this study, we have performed a transcriptome analysis of S. Typhimurium under the acid tolerance response (ATR) and found a large number of genes (∼47%) to be differentially expressed (more than 1.5-fold or less than −1.5-fold; P < 0.01). Functional annotation revealed differentially expressed genes to be associated with regulation, metabolism, transport and binding, pathogenesis, and motility. Additionally, our knockout analysis of a subset of differentially regulated genes facilitated the identification of proteins that contribute to S. Typhimurium ATR and virulence. Mutants lacking genes encoding the K+ binding and transport protein KdpA, hypothetical protein YciG, the flagellar hook cap protein FlgD, and the nitrate reductase subunit NarZ were significantly deficient in their ATRs and displayed varied in vitro virulence characteristics. This study offers greater insight into the transcriptome changes of S. Typhimurium under the ATR and provides a framework for further research on the subject. PMID:26386064
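
    The selection rule stated in this abstract (more than 1.5-fold or less than −1.5-fold, with P < 0.01) can be written as a small filter. This is a sketch under the assumption that fold change is reported as a signed value, positive for up-regulation and negative for down-regulation; the gene names and numbers are invented placeholders, not data from the study.

```python
from typing import NamedTuple


class GeneResult(NamedTuple):
    gene: str
    fold_change: float  # signed: +2.0 = twofold up, -2.0 = twofold down
    p_value: float


def is_differentially_expressed(result, fc_cutoff=1.5, p_cutoff=0.01):
    """Apply the thresholds from the abstract to one gene."""
    return abs(result.fold_change) > fc_cutoff and result.p_value < p_cutoff


# Hypothetical example rows (not from the paper)
results = [
    GeneResult("gene_a", 2.3, 0.001),    # up, significant
    GeneResult("gene_b", 1.2, 0.0005),   # change too small
    GeneResult("gene_c", -1.8, 0.004),   # down, significant
    GeneResult("gene_d", -3.0, 0.2),     # not significant
]

hits = [r.gene for r in results if is_differentially_expressed(r)]
```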

  10. Full Genome Virus Detection in Fecal Samples Using Sensitive Nucleic Acid Preparation, Deep Sequencing, and a Novel Iterative Sequence Classification Algorithm

    PubMed Central

    Cotten, Matthew; Oude Munnink, Bas; Canuti, Marta; Deijs, Martin; Watson, Simon J.; Kellam, Paul; van der Hoek, Lia

    2014-01-01

    We have developed a full genome virus detection process that combines sensitive nucleic acid preparation optimised for virus identification in fecal material with Illumina MiSeq sequencing and a novel post-sequencing virus identification algorithm. Enriched viral nucleic acid was converted to double-stranded DNA and subjected to Illumina MiSeq sequencing. The resulting short reads were processed with a novel iterative Python algorithm SLIM for the identification of sequences with homology to known viruses. De novo assembly was then used to generate full viral genomes. The sensitivity of this process was demonstrated with a set of fecal samples from HIV-1 infected patients. A quantitative assessment of the mammalian, plant, and bacterial virus content of this compartment was generated and the deep sequencing data were sufficient to assemble 12 complete viral genomes from 6 virus families. The method detected high levels of enteropathic viruses that are normally controlled in healthy adults, but may be involved in the pathogenesis of HIV-1 infection and will provide a powerful tool for virus detection and for analyzing changes in the fecal virome associated with HIV-1 progression and pathogenesis. PMID:24695106

  11. Molar ratio iron: zinc and folic acid in Brazilian biscuits and snacks and test for classification using principal component analyses.

    PubMed

    Godoy, Adriana Teixeira; Rebelatto, Ana Paula; Borin-Nogueira, Alessandra; Lima-Pallone, Juliana Azevedo

    2014-06-01

    The aim of the present work was to evaluate molar ratio iron:zinc and the levels of folic acid in biscuits and snacks commercialized in Brazil, prepared with folic acid and iron fortified flours. These nutrients are important for human nutrition; however, iron can have a negative effect on zinc absorption. Molar ratio iron:zinc can indicate if there will be any problems for absorption of these nutrients. The folic acid content varied from 58 to 433 μg/100 g and iron and zinc levels varied from 2.9 to 9.4 mg/100 g and from 0.2 to 1.3 mg/100 g, respectively, for 75 analyzed samples. The average iron contents observed in the products and molar ratio iron:zinc (in average 8:1 for biscuits and 12.8:1 for snacks) could result in problems with the zinc absorption. Moreover, principal component analyses (PCA) indicated low uniformity in the distribution of minerals and vitamin in the majority of the samples, mainly among brands. The results indicated that for the majority of the samples tested folic acid and iron content was higher than expected for flours and could be useful to governmental authorities in their evaluation program of flour fortification. PMID:25799687
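
    A molar ratio differs from a mass ratio because iron and zinc have different atomic masses. The sketch below shows the conversion from mg/100 g to a molar iron:zinc ratio, using standard atomic masses (Fe ≈ 55.845 g/mol, Zn ≈ 65.38 g/mol). The function name is hypothetical, and the input pair in the example is only an illustrative combination of the range limits quoted above, not a sample reported in the study.

```python
FE_ATOMIC_MASS = 55.845  # g/mol, standard atomic mass of iron
ZN_ATOMIC_MASS = 65.38   # g/mol, standard atomic mass of zinc


def molar_ratio_fe_zn(fe_mg_per_100g, zn_mg_per_100g):
    """Molar iron:zinc ratio from mass contents given in the same units."""
    if zn_mg_per_100g <= 0:
        raise ValueError("zinc content must be positive")
    fe_mmol = fe_mg_per_100g / FE_ATOMIC_MASS  # mg / (g/mol) = mmol
    zn_mmol = zn_mg_per_100g / ZN_ATOMIC_MASS
    return fe_mmol / zn_mmol


# Illustrative pairing of the upper range limits (hypothetical sample)
example_ratio = molar_ratio_fe_zn(9.4, 1.3)
```

    Because zinc is the heavier atom, the molar ratio is always about 17% larger than the corresponding mass ratio.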

  12. Global trophic position comparison of two dominant mesopelagic fish families (Myctophidae, Stomiidae) using amino acid nitrogen isotopic analyses

    USGS Publications Warehouse

    Choy, C. Anela; Davison, Peter C.; Drazen, Jeffrey C.; Flynn, Adrian; Gier, Elizabeth J.; Hoffman, Joel C.; McClain-Counts, Jennifer P.; Miller, Todd W.; Popp, Brian N.; Ross, Steve W.; Sutton, Tracey T.

    2012-01-01

    The δ15N values of organisms are commonly used across diverse ecosystems to estimate trophic position and infer trophic connectivity. We undertook a novel cross-basin comparison of trophic position in two ecologically well-characterized and different groups of dominant mid-water fish consumers using amino acid nitrogen isotope compositions. We found that trophic positions estimated from the δ15N values of individual amino acids are nearly uniform within both families of these fishes across five global regions despite great variability in bulk tissue δ15N values. Regional differences in the δ15N values of phenylalanine confirmed that bulk tissue δ15N values reflect region-specific water mass biogeochemistry controlling δ15N values at the base of the food web. Trophic positions calculated from amino acid isotopic analyses (AA-TP) for lanternfishes (family Myctophidae) (AA-TP ~2.9) largely align with expectations from stomach content studies (TP ~3.2), while AA-TPs for dragonfishes (family Stomiidae) (AA-TP ~3.2) were lower than TPs derived from stomach content studies (TP~4.1). We demonstrate that amino acid nitrogen isotope analysis can overcome shortcomings of bulk tissue isotope analysis across biogeochemically distinct systems to provide globally comparative information regarding marine food web structure.

  13. Global Trophic Position Comparison of Two Dominant Mesopelagic Fish Families (Myctophidae, Stomiidae) Using Amino Acid Nitrogen Isotopic Analyses

    PubMed Central

    Choy, C. Anela; Davison, Peter C.; Drazen, Jeffrey C.; Flynn, Adrian; Gier, Elizabeth J.; Hoffman, Joel C.; McClain-Counts, Jennifer P.; Miller, Todd W.; Popp, Brian N.; Ross, Steve W.; Sutton, Tracey T.

    2012-01-01

    The δ15N values of organisms are commonly used across diverse ecosystems to estimate trophic position and infer trophic connectivity. We undertook a novel cross-basin comparison of trophic position in two ecologically well-characterized and different groups of dominant mid-water fish consumers using amino acid nitrogen isotope compositions. We found that trophic positions estimated from the δ15N values of individual amino acids are nearly uniform within both families of these fishes across five global regions despite great variability in bulk tissue δ15N values. Regional differences in the δ15N values of phenylalanine confirmed that bulk tissue δ15N values reflect region-specific water mass biogeochemistry controlling δ15N values at the base of the food web. Trophic positions calculated from amino acid isotopic analyses (AA-TP) for lanternfishes (family Myctophidae) (AA-TP ∼2.9) largely align with expectations from stomach content studies (TP ∼3.2), while AA-TPs for dragonfishes (family Stomiidae) (AA-TP ∼3.2) were lower than TPs derived from stomach content studies (TP∼4.1). We demonstrate that amino acid nitrogen isotope analysis can overcome shortcomings of bulk tissue isotope analysis across biogeochemically distinct systems to provide globally comparative information regarding marine food web structure. PMID:23209656

  14. Evolutionary connections of biological kingdoms based on protein and nucleic acid sequence evidence

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.

    1983-01-01

    Prokaryotic and eukaryotic evolutionary trees are developed from protein and nucleic-acid sequences by the methods of numerical taxonomy. Trees are presented for bacterial ferredoxins, 5S ribosomal RNA, c-type cytochromes, cytochromes c2 and c', and 5.8S ribosomal RNA; the implications for early evolution are discussed; and a composite tree showing the branching of the anaerobes, aerobes, archaebacteria, and eukaryotes is shown. Single lines are found for all oxygen-evolving photosynthetic forms and for the salt-loving and high-temperature forms of archaebacteria. It is argued that the eukaryote mitochondria, chloroplasts, and cytoplasmic host material are descended from free-living prokaryotes that formed symbiotic associations, with more than one symbiotic event involved in the evolution of each organelle.

  15. The amino acid alphabet and the architecture of the protein sequence-structure map. I. Binary alphabets.

    PubMed

    Ferrada, Evandro

    2014-12-01

    The correspondence between protein sequences and structures, or sequence-structure map, relates to fundamental aspects of structural, evolutionary and synthetic biology. The specifics of the mapping, such as the fraction of accessible sequences and structures, or the sequences' ability to fold fast, are dictated by the type of interactions between the monomers that compose the sequences. The set of possible interactions between monomers is encapsulated by the potential energy function. In this study, I explore the impact of the relative forces of the potential on the architecture of the sequence-structure map. My observations rely on simple exact models of proteins and random samples of the space of potential energy functions of binary alphabets. I adopt a graph perspective and study the distribution of viable sequences and the structures they produce, as networks of sequences connected by point mutations. I observe that the relative proportion of attractive, neutral and repulsive forces defines types of potentials, that induce sequence-structure maps of vastly different architectures. I characterize the properties underlying these differences and relate them to the structure of the potential. Among these properties are the expected number and relative distribution of sequences associated to specific structures and the diversity of structures as a function of sequence divergence. I study the types of binary potentials observed in natural amino acids and show that there is a strong bias towards only some types of potentials, a bias that seems to characterize the folding code of natural proteins. I discuss implications of these observations for the architecture of the sequence-structure map of natural proteins, the construction of random libraries of peptides, and the early evolution of the natural amino acid alphabet. PMID:25473967

  16. The Amino Acid Alphabet and the Architecture of the Protein Sequence-Structure Map. I. Binary Alphabets

    PubMed Central

    Ferrada, Evandro

    2014-01-01

    The correspondence between protein sequences and structures, or sequence-structure map, relates to fundamental aspects of structural, evolutionary and synthetic biology. The specifics of the mapping, such as the fraction of accessible sequences and structures, or the sequences' ability to fold fast, are dictated by the type of interactions between the monomers that compose the sequences. The set of possible interactions between monomers is encapsulated by the potential energy function. In this study, I explore the impact of the relative forces of the potential on the architecture of the sequence-structure map. My observations rely on simple exact models of proteins and random samples of the space of potential energy functions of binary alphabets. I adopt a graph perspective and study the distribution of viable sequences and the structures they produce, as networks of sequences connected by point mutations. I observe that the relative proportion of attractive, neutral and repulsive forces defines types of potentials, that induce sequence-structure maps of vastly different architectures. I characterize the properties underlying these differences and relate them to the structure of the potential. Among these properties are the expected number and relative distribution of sequences associated to specific structures and the diversity of structures as a function of sequence divergence. I study the types of binary potentials observed in natural amino acids and show that there is a strong bias towards only some types of potentials, a bias that seems to characterize the folding code of natural proteins. I discuss implications of these observations for the architecture of the sequence-structure map of natural proteins, the construction of random libraries of peptides, and the early evolution of the natural amino acid alphabet. PMID:25473967

  17. Trypsin inhibitors from ridged gourd (Luffa acutangula Linn.) seeds: purification, properties, and amino acid sequences.

    PubMed

    Haldar, U C; Saha, S K; Beavis, R C; Sinha, N K

    1996-02-01

    Two trypsin inhibitors, LA-1 and LA-2, have been isolated from ridged gourd (Luffa acutangula Linn.) seeds and purified to homogeneity by gel filtration followed by ion-exchange chromatography. The isoelectric point is at pH 4.55 for LA-1 and at pH 5.85 for LA-2. The Stokes radius of each inhibitor is 11.4 Å. The fluorescence emission spectrum of each inhibitor is similar to that of the free tyrosine. The bimolecular rate constant of acrylamide quenching is 1.0 × 10⁹ M⁻¹ sec⁻¹ for LA-1 and 0.8 × 10⁹ M⁻¹ sec⁻¹ for LA-2 and that of K2HPO4 quenching is 1.6 × 10¹¹ M⁻¹ sec⁻¹ for LA-1 and 1.2 × 10¹¹ M⁻¹ sec⁻¹ for LA-2. Analysis of the circular dichroic spectra yields 40% alpha-helix and 60% beta-turn for LA-1 and 45% alpha-helix and 55% beta-turn for LA-2. Inhibitors LA-1 and LA-2 consist of 28 and 29 amino acid residues, respectively. They lack threonine, alanine, valine, and tryptophan. Both inhibitors strongly inhibit trypsin by forming enzyme-inhibitor complexes at a molar ratio of unity. A chemical modification study suggests the involvement of arginine of LA-1 and lysine of LA-2 in their reactive sites. The inhibitors are very similar in their amino acid sequences, and show sequence homology with other squash family inhibitors. PMID:8924202

  18. Microfluidic platform for isolating nucleic acid targets using sequence specific hybridization

    PubMed Central

    Wang, Jingjing; Morabito, Kenneth; Tang, Jay X.; Tripathi, Anubhav

    2013-01-01

    The separation of target nucleic acid sequences from biological samples has emerged as a significant process in today's diagnostics and detection strategies. In addition to the possible clinical applications, the fundamental understanding of target and sequence specific hybridization on surface modified magnetic beads is of high value. In this paper, we describe a novel microfluidic platform that utilizes a mobile magnetic field in static microfluidic channels, where single stranded DNA (ssDNA) molecules are isolated via nucleic acid hybridization. We first established efficient isolation of biotinylated capture probe (BP) using streptavidin-coated magnetic beads. Subsequently, we investigated the hybridization of target ssDNA with BP bound to beads and explained these hybridization kinetics using a dual-species kinetic model. The number of hybridized target ssDNA molecules was determined to be about 6.5 times less than that of BP on the bead surface, due to steric hindrance effects. The hybridization of target ssDNA with non-complementary BP bound to beads was also examined, and non-specific hybridization was found to be insignificant. Finally, we demonstrated highly efficient capture and isolation of target ssDNA in the presence of non-target ssDNA, in which as little as 1% target ssDNA can be detected in a mixture. The microfluidic method described in this paper is broadly applicable, especially towards point-of-care biological diagnostic platforms that require binding and separation of known target biomolecules, such as RNA, ssDNA, or protein. PMID:24404041

  19. The quest for the best: The impact of different EPI sequences on the sensitivity of random effect fMRI group analyses.

    PubMed

    Kirilina, Evgeniya; Lutti, Antoine; Poser, Benedikt A; Blankenburg, Felix; Weiskopf, Nikolaus

    2016-02-01

    We compared the sensitivity of standard single-shot 2D echo planar imaging (EPI) to three advanced EPI sequences, i.e., 2D multi-echo EPI, 3D high resolution EPI and 3D dual-echo fast EPI in fixed effect and random effects group level fMRI analyses at 3 T. The study focused on how well the variance reduction in fixed effect analyses achieved by advanced EPI sequences translates into increased sensitivity in the random effects group level analysis. The sensitivity was estimated in a functional MRI experiment comprising an emotional learning task and a reward-based learning task in a group of 24 volunteers. Each experiment was acquired with the four different sequences. The task-related response amplitude, contrast level and respective t-value were proxies for the functional sensitivity across the brain. All three advanced EPI methods increased the sensitivity in the fixed effects analyses, but standard single-shot 2D EPI provided a comparable performance in random effects group analysis when whole brain coverage and moderate resolution are required. In this experiment inter-subject variability determined the sensitivity of the random effects analysis for most brain regions, making the impact of EPI pulse sequence improvements less relevant or even negligible for random effects analyses. An exception concerns the optimization of EPI reducing susceptibility-related signal loss that translates into an enhanced sensitivity e.g. in the orbitofrontal cortex for multi-echo EPI. Thus, future optimization strategies may best aim at reducing inter-subject variability for higher sensitivity in standard fMRI group studies at moderate spatial resolution. PMID:26515905

  20. The quest for the best: The impact of different EPI sequences on the sensitivity of random effect fMRI group analyses

    PubMed Central

    Kirilina, Evgeniya; Lutti, Antoine; Poser, Benedikt A.; Blankenburg, Felix; Weiskopf, Nikolaus

    2016-01-01

    We compared the sensitivity of standard single-shot 2D echo planar imaging (EPI) to three advanced EPI sequences, i.e., 2D multi-echo EPI, 3D high resolution EPI and 3D dual-echo fast EPI in fixed effect and random effects group level fMRI analyses at 3 T. The study focused on how well the variance reduction in fixed effect analyses achieved by advanced EPI sequences translates into increased sensitivity in the random effects group level analysis. The sensitivity was estimated in a functional MRI experiment comprising an emotional learning task and a reward-based learning task in a group of 24 volunteers. Each experiment was acquired with the four different sequences. The task-related response amplitude, contrast level and respective t-value were proxies for the functional sensitivity across the brain. All three advanced EPI methods increased the sensitivity in the fixed effects analyses, but standard single-shot 2D EPI provided a comparable performance in random effects group analysis when whole brain coverage and moderate resolution are required. In this experiment inter-subject variability determined the sensitivity of the random effects analysis for most brain regions, making the impact of EPI pulse sequence improvements less relevant or even negligible for random effects analyses. An exception concerns the optimization of EPI reducing susceptibility-related signal loss that translates into an enhanced sensitivity e.g. in the orbitofrontal cortex for multi-echo EPI. Thus, future optimization strategies may best aim at reducing inter-subject variability for higher sensitivity in standard fMRI group studies at moderate spatial resolution. PMID:26515905

  1. Comparative sequence analyses indicate that Coffea (Asterids) and Vitis (Rosids) derive from the same paleo-hexaploid ancestral genome.

    PubMed

    Cenci, Alberto; Combes, Marie-Christine; Lashermes, Philippe

    2010-05-01

    The complete sequence of Vitis vinifera revealed that the rosid clade derives from a hexaploid ancestor. At present, no analysis of complete genome sequence is available for an asterid, the other large eudicot clade, which includes the economically important species potato, tomato and coffee. To elucidate the genomic history of asterids, we compared the sequence of an 800 kb region of diploid Coffea genome to the orthologous regions of V. vinifera, Populus trichocarpa and Arabidopsis thaliana. We found a very high level of collinearity between around 80 genes of the three rosid species and Coffea. Collinearity comparisons between orthologous and paralogous regions indicate that (1) Coffea (and consequently all asterids) and rosids share the same hexaploid ancestor; (2) the diploidization process (loss of duplicated and redundant copies from the whole genome duplication) was very advanced in the most recent common ancestor of rosids and asterids. Finally, no additional polyploidization events were detected in the Coffea lineage. Differences in gene loss rates were detected among the three rosid species and linked to the divergence in protein sequences. PMID:20361338

  2. Using Synthetic Nanopores for Single-Molecule Analyses: Detecting SNPs, Trapping DNA Molecules, and the Prospects for Sequencing DNA

    ERIC Educational Resources Information Center

    Dimitrov, Valentin V.

    2009-01-01

    This work focuses on studying properties of DNA molecules and DNA-protein interactions using synthetic nanopores, and it examines the prospects of sequencing DNA using synthetic nanopores. We have developed a method for discriminating between alleles that uses a synthetic nanopore to measure the binding of a restriction enzyme to DNA. There exists…

  3. Learning Hypotheses and an Associated Tool to Design and to Analyse Teaching-Learning Sequences. Special Issue

    ERIC Educational Resources Information Center

    Buty, Christian; Tiberghien, Andree; Le Marechal, Jean-Francois

    2004-01-01

    This contribution presents a tool elaborated from a theoretical framework linking epistemological, learning and didactical hypotheses. This framework led to the design of teaching sequences from a socio-constructivist perspective, and is based on the role of models in physics or chemistry, and on the role of students' initial knowledge in learning…

  4. The amino acid sequences of two alpha chains of hemoglobins from Komodo dragon Varanus komodoensis and phylogenetic relationships of amniotes.

    PubMed

    Fushitani, K; Higashiyama, K; Moriyama, E N; Imai, K; Hosokawa, K

    1996-09-01

    To elucidate phylogenetic relationships among amniotes and the evolution of alpha globins, hemoglobins were analyzed from the Komodo dragon (Komodo monitor lizard) Varanus komodoensis, the world's largest extant lizard, inhabiting Komodo Islands, Indonesia. Four unique globin chains (alpha A, alpha D, beta B, and beta C) were isolated in an equal molar ratio by high performance liquid chromatography from the hemolysate. The amino acid sequences of two alpha chains were determined. The alpha D chain has a glutamine at E7 as does an alpha chain of a snake, Liophis miliaris, but the alpha A chain has a histidine at E7 like the majority of hemoglobins. Phylogenetic analyses of 19 globins including two alpha chains of Komodo dragon and ones from representative amniotes showed the following results: (1) the alpha chains of squamates (snakes and lizards), which have a glutamine at E7, are clustered with the embryonic alpha globin family, which typically includes the alpha D chain from birds; (2) birds form a sister group with other reptiles but not with mammals; (3) the genes for embryonic and adult types of alpha globins were possibly produced by duplication of the ancestral alpha gene before ancestral amniotes diverged, indicating that each of the present amniotes might carry descendants of the two types of alpha globin genes; (4) squamates first split off from the ancestor of other reptiles and birds. PMID:8752011

  5. Characterization of N-glycosylation and amino acid sequence features of immunoglobulins from swine.

    PubMed

    Lopez, Paul G; Girard, Lauren; Buist, Marjorie; de Oliveira, Andrey Giovanni Gomes; Bodnar, Edward; Salama, Apolline; Soulillou, Jean-Paul; Perreault, Hélène

    2016-02-01

    The primary goal of this study was to develop a method to study the N-glycosylation of IgG from swine in order to detect epitopes containing N-glycolylneuraminic acid (Neu5Gc) and/or terminal galactose residues linked in α1-3 susceptible to cause xenograft-related problems. Samples of immunoglobulin were isolated from porcine serum using protein-A affinity chromatography. The eluate was then separated on electrophoretic gel, and bands corresponding to the N-glycosylated heavy chains were cut off the gel and subjected to tryptic digestion. Peptides and glycopeptides were separated by reversed phase liquid chromatography and fractions were collected for matrix-assisted laser desorption/ionization time-of-flight mass spectrometric (MALDI-TOF-MS) analysis. Overall no α1-3 galactose was detected, as demonstrated by complete susceptibility of terminal galactose residues to β-galactosidase digestion. Neu5Gc was detected on singly sialylated structures. Two major N-glycopeptides were found, EEQFNSTYR and EAQFNSTYR as determined by tandem MS (MS/MS), as previously reported by Butler et al. (Immunogenetics, 61, 2009, 209-230), who found 11 subclasses for porcine IgG. Out of the 11, ten include the sequence corresponding to EEQFNSTYR, and only one codes for EAQFNSTYR. In this study, glycosylation patterns associated with both chains were slightly different, in that EEQFNSTYR had a higher content of galactose. The last step of this study consisted of peptide-mapping the 11 reported porcine IgG sequences. Although there was considerable overlap, at least one unique tryptic peptide was found per IgG sequence. The workflow presented in this manuscript constitutes the first study to use MALDI-TOF-MS in the investigation of porcine IgG structural features. PMID:26586247

  6. Lactic acid production from potato peel waste by anaerobic sequencing batch fermentation using undefined mixed culture.

    PubMed

    Liang, Shaobo; McDonald, Armando G; Coats, Erik R

    2015-11-01

    Lactic acid (LA) is a necessary industrial feedstock for producing the bioplastic, polylactic acid (PLA), which is currently produced by pure culture fermentation of food carbohydrates. This work presents an alternative to produce LA from potato peel waste (PPW) by anaerobic fermentation in a sequencing batch reactor (SBR) inoculated with undefined mixed culture from a municipal wastewater treatment plant. A statistical design of experiments approach was employed using a set of 0.8 L SBRs with gelatinized PPW at solids contents from 30 to 50 g L(-1) and solids retention times (SRT) of 2-4 days to optimize yield and productivity. The maximum LA production yield of 0.25 g g(-1) PPW and highest productivity of 125 mg g(-1) d(-1) were achieved. A scale-up SBR trial using neat gelatinized PPW (at 80 g L(-1) solids content) at the 3 L scale was employed and the highest LA yield of 0.14 g g(-1) PPW and a productivity of 138 mg g(-1) d(-1) were achieved with a 1 d SRT. PMID:25708409

  7. Bacterial community compositions in sediment polluted by perfluoroalkyl acids (PFAAs) using Illumina high-throughput sequencing.

    PubMed

    Sun, Yajun; Wang, Tieyu; Peng, Xiawei; Wang, Pei; Lu, Yonglong

    2016-06-01

    The characterization of bacterial community compositions and the change in perfluoroalkyl acids (PFAAs) along a natural river distribution system were explored in the present study. Illumina high-throughput sequencing was used to explore bacterial community diversity and structure in sediment polluted by PFAAs from the Xiaoqing River, the area with concentrated fluorochemical facilities in China. The concentration of PFAAs was in the range of 8.44-465.60 ng/g dry weight (dw) in sediment. Perfluorooctanoic acid (PFOA) was the dominant PFAA in all samples, which accounted for 94.2 % of total PFAAs. High-level PFOA could lead to an obvious increase in relative abundance of Proteobacteria, ε-Proteobacteria, Thiobacillus, and Sulfurimonas and the decrease in relative abundance of other bacteria. Redundancy analysis revealed that PFOA played an important role in the formation of bacterial community, and PFOA at higher concentration could reduce the diversity of bacterial community. When the concentration of PFOA was below 100 ng/g dw in sediment, no significant effect on microbial community structure was observed. Thiobacillus and Sulfurimonas were positively correlated with the concentration of PFOA, suggesting that both genera were resistant to PFOA contamination. PMID:26780047

  8. Mass spectrometric detection of the amino acid sequence polymorphism of the hepatitis C virus antigen.

    PubMed

    Kaysheva, A L; Ivanov, Yu D; Frantsuzov, P A; Krohin, N V; Pavlova, T I; Uchaikin, V F; Konev, V А; Kovalev, O B; Ziborov, V S; Archakov, A I

    2016-03-01

    A method for detection and identification of the hepatitis C virus antigen (HCVcoreAg) in human serum with consideration for possible amino acid substitutions is proposed. The method is based on a combination of biospecific capturing and concentrating of the target protein on the surface of the chip for atomic force microscope (AFM chip) with subsequent protein identification by tandem mass spectrometric (MS/MS) analysis. Biospecific AFM-capturing of viral particles containing HCVcoreAg from serum samples was performed by use of AFM chips with monoclonal antibodies (anti-HCVcore) covalently immobilized on the surface. Biospecific complexes were registered and counted by AFM. Further MS/MS analysis allowed reliable identification of the HCVcoreAg in the complexes formed on the AFM chip surface. Analysis of MS/MS spectra, taking into account the possible polymorphisms in the amino acid sequence of the HCVcoreAg, enabled us to increase the number of identified peptides. PMID:26773170

  9. Peptide sequencing by using a combination of partial acid hydrolysis and fast-atom-bombardment mass spectrometry.

    PubMed Central

    De Angelis, F; Botta, M; Ceccarelli, S; Nicoletti, R

    1986-01-01

    To overcome the limit of the intensity of ions carrying sequence information in structural determinations of peptides by fast-atom-bombardment m.s., we have developed a method that consists in taking spectra of the peptide acid hydrolysates at different hydrolysis times. Peaks correspond to the oligomers arising from the peptide partial hydrolysis. The sequence can then be identified from the structurally overlapping fragments. PMID:2428356
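
    The final step described above, reading a sequence off structurally overlapping fragments, can be illustrated with a minimal sketch. It is not the authors' procedure: it assumes the fragments have already been identified as one-letter residue strings, uses a simple greedy suffix/prefix merge, and the peptide in the usage note is hypothetical. Greedy merging can fail or be ambiguous for peptides with repeated motifs.

```python
def overlap(a: str, b: str) -> int:
    """Length of the longest suffix of `a` that is also a prefix of `b`."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def assemble(fragments: list[str]) -> str:
    """Greedily merge overlapping fragments into one sequence (illustrative only)."""
    # Drop duplicates and fragments fully contained in a longer fragment.
    frags = sorted(
        f for f in set(fragments)
        if not any(f != g and f in g for g in fragments)
    )
    while len(frags) > 1:
        best_k, best_i, best_j = 0, -1, -1
        for i, a in enumerate(frags):
            for j, b in enumerate(frags):
                if i != j:
                    k = overlap(a, b)
                    if k > best_k:
                        best_k, best_i, best_j = k, i, j
        if best_k == 0:
            raise ValueError("fragments do not overlap; sequence cannot be joined")
        merged = frags[best_i] + frags[best_j][best_k:]
        frags = [f for idx, f in enumerate(frags) if idx not in (best_i, best_j)]
        frags.append(merged)
    return frags[0]
```

    For example, the hypothetical fragments GAVL, VLKD and KDE share the overlaps VL and KD, so `assemble(["GAVL", "VLKD", "KDE"])` joins them into GAVLKDE.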

  10. Canine preprorelaxin: nucleic acid sequence and localization within the canine placenta.

    PubMed

    Klonisch, T; Hombach-Klonisch, S; Froehlich, C; Kauffold, J; Steger, K; Steinetz, B G; Fischer, B

    1999-03-01

    Employing uteroplacental tissue at Day 35 of gestation, we determined the nucleic acid sequence of canine preprorelaxin using reverse transcription- and rapid amplification of cDNA ends-polymerase chain reaction. Canine preprorelaxin cDNA consisted of 534 base pairs encoding a protein of 177 amino acids with a signal peptide of 25 amino acids (aa), a B domain of 35 aa, a C domain of 93 aa, and an A domain of 24 aa. The putative receptor binding region in the N'-terminal part of the canine relaxin B domain GRDYVR contained two substitutions from the classical motif (E-->D and L-->Y). Canine preprorelaxin shared highest homology with porcine and equine preprorelaxin. Northern analysis revealed a 1-kilobase transcript present in total RNA of canine uteroplacental tissue but not of kidney tissue. Uteroplacental tissue from two bitches each at Days 30 and 35 of gestation were studied by in situ hybridization to localize relaxin mRNA. Immunohistochemistry for relaxin, cytokeratin, vimentin, and von Willebrand factor was performed on uteroplacental tissue at Day 30 of gestation. The basal cell layer at the core of the chorionic villi was devoid of relaxin mRNA and immunoreactive relaxin or vimentin but was immunopositive for cytokeratin and identified as cytotrophoblast cells. The cell layer surrounding the chorionic villi displayed specific hybridization signals for relaxin mRNA and immunoreactivity for relaxin and cytokeratin but not for vimentin, and was identified as syncytiotrophoblast. Those areas of the chorioallantoic tissue with most intense relaxin immunoreactivity were highly vascularized as demonstrated by immunoreactive von Willebrand factor expressed on vascular endothelium. The uterine glands and nonplacental uterine areas of the canine zonary girdle placenta were devoid of relaxin mRNA and relaxin. We conclude that the syncytiotrophoblast is the source of relaxin in the canine placenta. PMID:10026098
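
    The lengths quoted in this abstract can be cross-checked with simple arithmetic. The sketch below sums the four domain lengths and converts the protein length to coding bases; that the 534 base pairs include one stop codon is an assumption made here because it makes the numbers agree, not something the abstract states.

```python
# Domain lengths (amino acids) as quoted in the abstract.
domains = {"signal peptide": 25, "B domain": 35, "C domain": 93, "A domain": 24}

protein_length = sum(domains.values())

# Assumption: the reported cDNA length covers the coding sequence plus one stop codon.
STOP_CODON_BASES = 3
coding_bases = 3 * protein_length + STOP_CODON_BASES

print(protein_length, coding_bases)
```

    Under that assumption the domains add up to the reported 177 amino acids, and 177 codons plus a stop codon give the reported 534 base pairs.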

  11. Purification and partial amino acid sequence of the chloroplast cytochrome b-559.

    PubMed

    Widger, W R; Cramer, W A; Hermodson, M; Meyer, D; Gullifor, M

    1984-03-25

    The hydrophobic cytochrome b-559, purified from unstacked, ethanol-washed spinach thylakoid membranes, using extraction with 2% Triton X-100 in 4 M urea and three chromatographic steps in the presence of protease inhibitors, has a dominant band on sodium dodecyl sulfate-urea gels corresponding to Mr = 10,000. The yield of this preparation is 30-50% (5-10 mg) starting with 600 mg of chlorophyll. The heme content yields a calculated molecular weight of no more than 17,500/heme, and perhaps somewhat smaller after correction for impurities. The Mr = 10,000 band is stained by the tetramethylbenzidine-H2O2 heme reagent on lithium dodecyl sulfate gels run at 0 degrees C. The Mr = 10,000 protein, further separated by high performance liquid chromatography, contains a unique NH2 terminus that is not blocked, and the amino acid sequence for the first 27 residues is NH2-Ser-Gly-Ser-Thr-Gly-Glu-Arg-Ser-Phe-Ala-Asp-Ile-Ile-Thr-Ser-Ile-Arg-Tyr-Trp-Val-Ile-X-Ser-Ile-Thr-Ile-Pro...COOH. Approximately 55% of the amino acids are hydrophobic, based on amino acid analysis of the Mr = 10,000 peptide, which also indicated the presence of at least one histidine. Only one cytochrome b-559 component could be identified, whose yield indicated that it arises from a single b-559 protein in chloroplasts corresponding to the in situ high potential cytochrome of the chloroplast photosystem II. PMID:6706983

  12. Sequence-Specific Electrical Purification of Nucleic Acids with Nanoporous Gold Electrodes.

    PubMed

    Daggumati, Pallavi; Appelt, Sandra; Matharu, Zimple; Marco, Maria L; Seker, Erkin

    2016-06-22

    Nucleic-acid-based biosensors have enabled rapid and sensitive detection of pathogenic targets; however, these devices often require purified nucleic acids for analysis since the constituents of complex biological fluids adversely affect sensor performance. This purification step is typically performed outside the device, thereby increasing sample-to-answer time and introducing contaminants. We report a novel approach using a multifunctional matrix, nanoporous gold (np-Au), which enables both detection of specific target sequences in a complex biological sample and their subsequent purification. The np-Au electrodes modified with 26-mer DNA probes (via thiol-gold chemistry) enabled sensitive detection and capture of complementary DNA targets in the presence of complex media (fetal bovine serum) and other interfering DNA fragments in the range of 50-1500 base pairs. Upon capture, the noncomplementary DNA fragments and serum constituents of varying sizes were washed away. Finally, the surface-bound DNA-DNA hybrids were released by electrochemically cleaving the thiol-gold linkage, and the hybrids were iontophoretically eluted from the nanoporous matrix. The optical and electrophoretic characterization of the analytes before and after the detection-purification process revealed that low target DNA concentrations (80 pg/μL) can be successfully detected in complex biological fluids and subsequently released to yield pure hybrids free of polydisperse digested DNA fragments and serum biomolecules. Taken together, this multifunctional platform is expected to enable seamless integration of detection and purification of nucleic acid biomarkers of pathogens and diseases in miniaturized diagnostic devices. PMID:27244455

  13. Comparative sequence analyses of the genes coding for 16S rRNA of Lactobacillus casei-related taxa.

    PubMed

    Mori, K; Yamazaki, K; Ishiyama, T; Katsumata, M; Kobayashi, K; Kawai, Y; Inoue, N; Shinano, H

    1997-01-01

    The primary structures of the 16S rRNA genes of the type strains of Lactobacillus casei and related taxa were determined by PCR DNA-sequencing methods. The sequences of Lactobacillus casei, Lactobacillus zeae, Lactobacillus paracasei, and Lactobacillus rhamnosus were different. The Knuc values ranged from 0.0040 to 0.0126. On the basis of the Knuc values and the levels of DNA-DNA relatedness among the strains of these species, the L. casei-related taxa should be classified in the following three species: L. zeae, which includes the type strains of L. zeae and L. casei; a species that includes the strains of L. paracasei and L. casei ATCC 334; and L. rhamnosus. PMID:8995801

  14. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    NASA Astrophysics Data System (ADS)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonaphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  15. Phylogenetic analyses of Vitis (Vitaceae) based on complete chloroplast genome sequences: effects of taxon sampling and phylogenetic methods on resolving relationships among rosids

    PubMed Central

    Jansen, Robert K; Kaittanis, Charalambos; Saski, Christopher; Lee, Seung-Bum; Tomkins, Jeffrey; Alverson, Andrew J; Daniell, Henry

    2006-01-01

    Background The Vitaceae (grape) is an economically important family of angiosperms whose phylogenetic placement is currently unresolved. Recent phylogenetic analyses based on one to several genes have suggested several alternative placements of this family, including sister to Caryophyllales, asterids, Saxifragales, Dilleniaceae or to rest of rosids, though support for these different results has been weak. There has been a recent interest in using complete chloroplast genome sequences for resolving phylogenetic relationships among angiosperms. These studies have clarified relationships among several major lineages but they have also emphasized the importance of taxon sampling and the effects of different phylogenetic methods for obtaining accurate phylogenies. We sequenced the complete chloroplast genome of Vitis vinifera and used these data to assess relationships among 27 angiosperms, including nine taxa of rosids. Results The Vitis vinifera chloroplast genome is 160,928 bp in length, including a pair of inverted repeats of 26,358 bp that are separated by small and large single copy regions of 19,065 bp and 89,147 bp, respectively. The gene content and order of Vitis is identical to many other unrearranged angiosperm chloroplast genomes, including tobacco. Phylogenetic analyses using maximum parsimony and maximum likelihood were performed on DNA sequences of 61 protein-coding genes for two datasets with 28 or 29 taxa, including eight or nine taxa from four of the seven currently recognized major clades of rosids. Parsimony and likelihood phylogenies of both data sets provide strong support for the placement of Vitaceae as sister to the remaining rosids. However, the position of the Myrtales and support for the monophyly of the eurosid I clade differs between the two data sets and the two methods of analysis. In parsimony analyses, the inclusion of Gossypium is necessary to obtain trees that support the monophyly of the eurosid I clade. However, maximum

  16. Authentication of Cordyceps sinensis by DNA Analyses: Comparison of ITS Sequence Analysis and RAPD-Derived Molecular Markers.

    PubMed

    Lam, Kelly Y C; Chan, Gallant K L; Xin, Gui-Zhong; Xu, Hong; Ku, Chuen-Fai; Chen, Jian-Ping; Yao, Ping; Lin, Huang-Quan; Dong, Tina T X; Tsim, Karl W K

    2015-01-01

    Cordyceps sinensis is an endoparasitic fungus widely used as a tonic and medicinal food in the practice of traditional Chinese medicine (TCM). In historical usage, Cordyceps refers specifically to the species C. sinensis. However, a number of closely related species are also named Cordyceps, and they are commonly sold as C. sinensis. Substitutes and adulterants of C. sinensis are often introduced either intentionally or accidentally in the herbal market, which seriously affects the therapeutic effects or even leads to life-threatening poisoning. Here, we aim to identify Cordyceps by DNA sequencing technology. Two different DNA-based approaches were compared. The internal transcribed spacer (ITS) sequences and the random amplified polymorphic DNA (RAPD)-sequence characterized amplified region (SCAR) were developed here to authenticate different species of Cordyceps. Both approaches generally enabled discrimination of C. sinensis from others. The application of the two methods, supporting each other, increases the security of identification. For better reproducibility and faster analysis, the SCAR markers derived from the RAPD results provide a new method for quick authentication of Cordyceps. PMID:26694332

  17. Sequence analyses of two neuropeptides of the AKH/RPCH-family from the lubber grasshopper, Romalea microptera.

    PubMed

    Gäde, G; Hilbich, C; Beyreuther, K; Rinehart, K L

    1988-01-01

    Two neuropeptides with adipokinetic activity in Locusta migratoria and hypertrehalosaemic activity in Periplaneta americana were purified by high-performance liquid chromatography from the corpus cardiacum of the lubber grasshopper, Romalea microptera. The sequences of both peptides, designated Ro I and Ro II, were determined by gas-phase sequencing employing Edman degradation after the N-terminal pyroglutamate residue was enzymatically deblocked, as well as by fast atom bombardment mass spectrometry. Ro I was found to be a decapeptide with the primary structure: pGlu-Val-Asn-Phe-Thr-Pro-Asn-Trp-Gly-Thr-NH2, whereas Ro II is an octapeptide with the structure: pGlu-Val-Asn-Phe-Ser-Thr-Gly-Trp-NH2. Ro II is identical with AKH-G isolated from the cricket Gryllus bimaculatus. Synthetic materials having the assigned structures were found to be chromatographically, mass spectrometrically, and biologically indistinguishable from the natural peptides, confirming the sequences and establishing the Romalea peptides as members of the AKH/RPCH-family of peptides. PMID:3226948

  18. Mitochondrial DNA sequence analyses and phylogenetic relationships among two Nigerian goat breeds and the South African Kalahari Red.

    PubMed

    Awotunde, Esther O; Bemji, Martha N; Olowofeso, Olajide; James, Ikechukwu J; Ajayi, O O; Adebambo, Ayotunde O

    2015-01-01

    The first hypervariable (HV1) region of mitochondrial DNA (mtDNA) of two popular Nigerian goat breeds, West African Dwarf (WAD) (n=35) and Red Sokoto (RS) (n=37), and one exotic breed, Kalahari Red (KR) (n=38) imported from South Africa, was sequenced to investigate sequence diversity, genetic structure, origin, and demographic history of the populations. A total of 68 polymorphic sites were found in 110 sequences that grouped into 68 haplotypes. Average haplotype and nucleotide diversities for all breeds were 0.982±0.005 and 0.02350±0.00213, respectively. Phylogenetic analysis revealed two mtDNA lineages (A and B). Lineage A was predominant and included all haplotypes from WAD and RS and 5 out of 11 haplotypes of KR goats. The remaining haplotypes (6) of KR belong to lineage B. The analysis of molecular variance revealed a high within-breed genetic variance of 82.4% and a low between-breed genetic variance of 17.6%. The three breeds clustered with Capra aegagrus as their wild ancestor. Mismatch distribution analysis showed that WAD, RS and haplogroup A have experienced population expansion events. The study has revealed very high diversity within the three breeds, which are not strongly separated from each other based on mtDNA analysis. The information obtained on the genetic structure of the breeds will be useful in planning improvement and conservation programs for the local populations. PMID:25695640
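
    For readers unfamiliar with the haplotype diversity statistic quoted above, the following minimal sketch computes it with the standard estimator h = n/(n-1) * (1 - sum of squared haplotype frequencies). The abstract does not say which estimator or software the authors used, and the counts in the usage note are hypothetical, since the per-haplotype frequencies are not given in the abstract.

```python
def haplotype_diversity(counts: list[int]) -> float:
    """Haplotype (gene) diversity from per-haplotype sample counts.

    Uses h = n/(n-1) * (1 - sum(p_i^2)), where p_i is the sample
    frequency of haplotype i and n is the number of sequences.
    """
    n = sum(counts)
    if n < 2:
        raise ValueError("at least two sequences are required")
    sum_sq = sum((c / n) ** 2 for c in counts)
    return n / (n - 1) * (1.0 - sum_sq)
```

    With hypothetical counts, `haplotype_diversity([2, 1, 1])` is about 0.833, while a sample in which every sequence is a distinct haplotype gives 1.0 and a sample with a single haplotype gives 0.0. Values near 1, such as the 0.982 reported here, indicate that most sampled sequences carry different haplotypes.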

  19. De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries.

    PubMed

    Cho, Namjin; Hwang, Byungjin; Yoon, Jung-ki; Park, Sangun; Lee, Joongoo; Seo, Han Na; Lee, Jeewon; Huh, Sunghoon; Chung, Jinsoo; Bang, Duhee

    2015-01-01

    Interpreting epistatic interactions is crucial for understanding evolutionary dynamics of complex genetic systems and unveiling structure and function of genetic pathways. Although high resolution mapping of en masse variant libraries enables molecular biologists to address genotype-phenotype relationships, long-read sequencing technology remains indispensable to assess functional relationships between mutations that lie far apart. Here, we introduce JigsawSeq for multiplexed sequence identification of pooled gene variant libraries by combining a codon-based molecular barcoding strategy and de novo assembly of short-read data. We first validated JigsawSeq on small sub-pools and observed high precision and recall at various experimental settings. With extensive simulations, we then applied JigsawSeq to large-scale gene variant libraries to show that our method can be reliably scaled using next-generation sequencing. JigsawSeq may serve as a rapid screening tool for functional genomics and offer the opportunity to explore evolutionary trajectories of protein variants. PMID:26387459

  20. De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries

    PubMed Central

    Cho, Namjin; Hwang, Byungjin; Yoon, Jung-ki; Park, Sangun; Lee, Joongoo; Seo, Han Na; Lee, Jeewon; Huh, Sunghoon; Chung, Jinsoo; Bang, Duhee

    2015-01-01

    Interpreting epistatic interactions is crucial for understanding evolutionary dynamics of complex genetic systems and unveiling structure and function of genetic pathways. Although high resolution mapping of en masse variant libraries enables molecular biologists to address genotype-phenotype relationships, long-read sequencing technology remains indispensable to assess functional relationships between mutations that lie far apart. Here, we introduce JigsawSeq for multiplexed sequence identification of pooled gene variant libraries by combining a codon-based molecular barcoding strategy and de novo assembly of short-read data. We first validated JigsawSeq on small sub-pools and observed high precision and recall at various experimental settings. With extensive simulations, we then applied JigsawSeq to large-scale gene variant libraries to show that our method can be reliably scaled using next-generation sequencing. JigsawSeq may serve as a rapid screening tool for functional genomics and offer the opportunity to explore evolutionary trajectories of protein variants. PMID:26387459

  1. The genetic diversity of genus Bacillus and the related genera revealed by 16S rRNA gene sequences and ARDRA analyses isolated from geothermal regions of Turkey

    PubMed Central

    Cihan, Arzu Coleri; Tekin, Nilgun; Ozcan, Birgul; Cokmus, Cumhur

    2012-01-01

    A total of 115 previously isolated endospore-forming bacilli were grouped according to their temperature requirements for growth: the thermophiles (74%), the facultative thermophiles (14%) and the mesophiles (12%). These isolates were subjected to 16S rRNA gene sequence analyses, and they clustered among 7 genera: Anoxybacillus, Aeribacillus, Bacillus, Brevibacillus, Geobacillus, Paenibacillus, and Thermoactinomycetes. Of these bacilli, only the thirty-two isolates belonging to the genera Bacillus (16), Brevibacillus (13), Paenibacillus (1) and Thermoactinomycetes (2) were selected and presented in this paper. The comparative sequence analyses revealed that the similarity values between the isolates and the related type strains from these four genera ranged over 91.4–100%, 91.8–99.2%, 92.6–99.8% and 90.7–99.8%, respectively. Twenty-nine of them were found to be related to the validly published type strains. The most abundant species was B. thermoruber with 9 isolates, followed by B. pumilus (6), B. licheniformis (3), B. subtilis (3), B. agri (3), B. smithii (2), T. vulgaris (2) and finally P. barengoltzii (1). In addition, isolates A391a, B51a and D295 were proposed as novel species, as their 16S rRNA gene sequences displayed similarities ≤ 97% to their closely related type strains. The AluI-, HaeIII- and TaqI-ARDRA results were in congruence with the 16S rRNA gene sequence analyses. The ARDRA results allowed us to differentiate these isolates, and their discriminative restriction fragments could be determined. Some of their phenotypic characters and their amylase, chitinase and protease production were also studied, and isolates producing biotechnologically valuable enzymes were introduced for use in further studies. PMID:24031834
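
    ARDRA compares the restriction fragment patterns of amplified 16S rRNA genes. As a rough illustration of how such a pattern arises, the sketch below performs an in silico digest with the three enzymes named in the abstract, using their standard recognition sites (AluI AG^CT, HaeIII GG^CC, TaqI T^CGA). It assumes a linear amplicon and complete digestion; the function name and the short test sequence in the usage note are hypothetical, and this is not the authors' workflow.

```python
# Recognition site and cut offset (bases from the start of the site) per enzyme.
# All three sites are palindromic, so scanning one strand is sufficient.
ENZYMES = {
    "AluI": ("AGCT", 2),    # AG^CT, blunt
    "HaeIII": ("GGCC", 2),  # GG^CC, blunt
    "TaqI": ("TCGA", 1),    # T^CGA, 5' overhang
}


def fragment_lengths(seq: str, enzyme: str) -> list[int]:
    """Fragment lengths (5' to 3') after complete digestion of a linear sequence."""
    site, offset = ENZYMES[enzyme]
    seq = seq.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]
```

    For the hypothetical 21-base sequence AAAGCTTTTGGCCAATCGAGG, the predicted fragments are 4 and 17 bases for AluI, 11 and 10 for HaeIII, and 16 and 5 for TaqI. Isolates whose amplicons differ at any of these sites would show different banding patterns on a gel.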

  2. Unifying bacteria from decaying wood with various ubiquitous Gibbsiella species as G. acetica sp. nov. based on nucleotide sequence similarities and their acetic acid secretion.

    PubMed

    Geider, Klaus; Gernold, Marina; Jock, Susanne; Wensing, Annette; Völksch, Beate; Gross, Jürgen; Spiteller, Dieter

    2015-12-01

    Bacteria were isolated from necrotic apple and pear tree tissue and from dead wood in Germany and Austria as well as from pear tree exudate in China. They were selected for growth at 37 °C, screened for levan production and then characterized as Gram-negative, facultatively anaerobic rods. Nucleotide sequences from 16S rRNA genes, the housekeeping genes dnaJ, gyrB, recA and rpoB alignments, BLAST searches and phenotypic data confirmed by MALDI-TOF analysis showed that these bacteria belong to the genus Gibbsiella and resembled strains isolated from diseased oaks in Britain and Spain. Gibbsiella-specific PCR primers were designed from the proline isomerase and the levansucrase genes. Acid secretion was investigated by screening for halo formation on calcium carbonate agar, and the compound was identified by NMR as acetic acid. Its production by Gibbsiella spp. strains was also determined in culture supernatants by GC/MS analysis after derivatization with pentafluorobenzyl bromide. Some strains were differentiated by the PFGE patterns of SpeI digests and by sequence analyses of the lsc and the ppiD genes, and the Chinese Gibbsiella strain was most divergent. The newly investigated bacteria as well as Gibbsiella quercinecans, Gibbsiella dentisursi and Gibbsiella papilionis, isolated in Britain, Spain, Korea and Japan, are taxonomically related Enterobacteriaceae that tolerate and secrete acetic acid. We therefore propose to unify them in the species Gibbsiella acetica sp. nov. PMID:26071988

  3. Molecular systematics of Gagea and Lloydia (Liliaceae; Liliales): implications of analyses of nuclear ribosomal and plastid DNA sequences for infrageneric classification

    PubMed Central

    Zarrei, M.; Wilkin, P.; Fay, M. F.; Ingrouille, M. J.; Zarre, S.; Chase, M. W.

    2009-01-01

    Background and Aims Gagea is a Eurasian genus of petaloid monocots, with a few species in North Africa, comprising between 70 and approximately 275 species depending on the author. Lloydia (thought to be the closest relative of Gagea) consists of 12–20 species that have a mostly eastern Asian distribution. Delimitation of these genera and their subdivisions are unresolved questions in Liliaceae taxonomy. The objective of this study is to evaluate generic and infrageneric circumscription of Gagea and Lloydia using DNA sequence data. Methods A phylogenetic study of Gagea and Lloydia (Liliaceae) was conducted using sequences of nuclear ribosomal internal transcribed spacer (ITS) and plastid (rpl16 intron, trnL intron, trnL-F spacer, matK and the psbA-trnH spacer) DNA regions. This included 149 accessions (seven as outgroups), with multiple accessions of some taxa; 552 sequences were included, of which 393 were generated as part of this research. Key Results A close relationship of Gagea and Lloydia was confirmed in analyses using different datasets, but neither Gagea nor Lloydia forms a monophyletic group as currently circumscribed; however, the ITS and plastid analyses did not produce congruent results for the placement of Lloydia relative to the major groups within Gagea. Gagea accessions formed five moderately to strongly supported clades in all trees, with most Lloydia taxa positioned at the basal nodes; in the strict consensus trees from the combined data a basal polytomy occurs. There is limited congruence between the classical, morphology-derived infrageneric taxonomy in Gagea (including Lloydia) and clades in the present phylogenetic analyses. Conclusions The analyses support monophyly of Gagea/Lloydia collectively, and they clearly comprise a single lineage, as some previous authors have hypothesized. The results provide the basis for a new classification of Gagea that has support from some morphological features. Incongruence between plastid and nuclear

  4. Relationships in the Caryophyllales as suggested by phylogenetic analyses of partial chloroplast DNA ORF2280 homolog sequences.

    PubMed

    Downie, S; Katz-Downie, D; Cho, K

    1997-02-01

    Phylogenetic relationships within the angiosperm order Caryophyllales were investigated by comparative sequencing of two portions of the highly conserved inverted repeat (totaling some 1100 base pairs) coinciding with the region occupied by ORF2280 in Nicotiana, the largest gene in the plastid genomes of most land plants. Data were obtained for 33 species in 11 families within the order and for one species each of Plumbaginaceae, Polygonaceae, and Nepenthaceae. These data, when analyzed along with previously published ORF (open reading frame) sequences from Nicotiana, Spinacia, Epifagus, and Pelargonium using parsimony, neighbor-joining, and maximum likelihood methods, reveal that: (1) Amaranthus, Celosia, and Froelichia (all Amaranthaceae) do not comprise a monophyletic group; (2) Amaranthus may be nested within a paraphyletic Chenopodiaceae; (3) Sarcobatus (Chenopodiaceae) is allied with Nyctaginaceae + Phytolaccaceae (the latter family excluding Stegnosperma but including Petiveria); and (4) Caryophyllaceae (with Corrigiola basal within the clade) are sister group to Chenopodiaceae + Amaranthaceae. Basal relations within the order remain obscure. Sequence divergence values in pairwise comparisons across all Caryophyllales taxa ranged from 0.1 to 5% of nucleotides. However, despite these low values, 23 insertion and deletion events were apparent, of which five were informative phylogenetically and bolstered several of the relationships listed above. A polymerase chain reaction (PCR) survey for ORF homolog length variants in representatives from 70 additional angiosperm families revealed major deletions, of 100 to 1400 base pairs, in 19 of these families. Although the ORF is located within the mutationally retarded inverted repeat region of most angiosperm chloroplast DNAs, this gene appears particularly prone to length mutation. PMID:21712205

  5. Genome-wide association analyses based on whole-genome sequencing in Sardinia provide insights into regulation of hemoglobin levels

    PubMed Central

    Danjou, Fabrice; Zoledziewska, Magdalena; Sidore, Carlo; Steri, Maristella; Busonero, Fabio; Maschio, Andrea; Mulas, Antonella; Perseu, Lucia; Barella, Susanna; Porcu, Eleonora; Pistis, Giorgio; Pitzalis, Maristella; Pala, Mauro; Menzel, Stephan; Metrustry, Sarah; Spector, Timothy D.; Leoni, Lidia; Angius, Andrea; Uda, Manuela; Moi, Paolo; Thein, Swee Lay; Galanello, Renzo; Abecasis, Gonçalo R.; Schlessinger, David; Sanna, Serena; Cucca, Francesco

    2015-01-01

    We report GWAS results for the levels of A1, A2 and fetal hemoglobins, analyzed for the first time concurrently. Integrating high-density array genotyping and whole-genome sequencing in a large general population cohort from Sardinia, we detected 23 associations at 10 loci. Five are due to variants at previously undetected loci: MPHOSPH9, PLTP-PCIF1, FOG1, NFIX, and CCND3. Among those at known loci, 10 are new lead variants and 4 are novel independent signals. Half of all variants also showed pleiotropic associations with different hemoglobins, which further corroborated some of the detected associations and revealed features of coordinated hemoglobin species production. PMID:26366553

  6. Complete amino acid sequence of the medium-chain S-acyl fatty acid synthetase thio ester hydrolase from rat mammary gland

    SciTech Connect

    Randhawa, Z.I.; Smith, S.

    1987-03-10

    The complete amino acid sequence of the medium-chain S-acyl fatty acid synthetase thio ester hydrolase (thioesterase II) from rat mammary gland is presented. Most of the sequence was derived by analysis of (¹⁴C)-labelled peptide fragments produced by cleavage at methionyl, glutamyl, lysyl, arginyl, and tryptophanyl residues. A small section of the sequence was deduced from a previously analyzed cDNA clone. The protein consists of 260 residues and has a blocked amino-terminal methionine and calculated M_r of 29,212. The carboxy-terminal sequence, verified by Edman degradation of the carboxy-terminal cyanogen bromide fragment and carboxypeptidase Y digestion of the intact thioesterase II, terminates with a serine residue and lacks three additional residues predicted by the cDNA sequence. The native enzyme contains three cysteine residues but no disulfide bridges. The active site serine residue is located at position 101. The rat mammary gland thioesterase II exhibits approximately 40% homology with a thioesterase from mallard uropygial gland, the sequence of which was recently determined by cDNA analysis. Thus the two enzymes may share similar structural features and a common evolutionary origin. The location of the active site in these thioesterases differs from that of other serine active site esterases; indeed, the enzymes do not exhibit any significant homology with other serine esterases, suggesting that they may constitute a separate new family of serine active site enzymes.

  7. The complete amino acid sequence of the A-chain of human plasma alpha 2HS-glycoprotein.

    PubMed

    Yoshioka, Y; Gejyo, F; Marti, T; Rickli, E E; Bürgi, W; Offner, G D; Troxler, R F; Schmid, K

    1986-02-01

    Normal human plasma alpha 2HS-glycoprotein has earlier been shown to be comprised of two polypeptide chains. Recently, the amino acid and carbohydrate sequences of the short chain were elucidated (Gejyo, F., Chang, J.-L., Bürgi, W., Schmid, K., Offner, G. D., Troxler, R.F., van Halbeck, H., Dorland, L., Gerwig, G. J., and Vliegenthart, J.F.G. (1983) J. Biol. Chem. 258, 4966-4971). In the present study, the amino acid sequence of the long chain of this protein, designated A-chain, was determined and found to consist of 282 amino acid residues. Twenty-four amino acid doublets were found; the most abundant of these are Pro-Pro and Ala-Ala which each occur five times. Of particular interest is the presence of three Gly-X-Pro and one Gly-Pro-X sequences that are characteristic of the repeating sequences of collagens. Chou-Fasman evaluation of the secondary structure suggested that the A-chain contains 29% alpha-helix, 24% beta-pleated sheet, and 26% reverse turns and, thus, approximately 80% of the polypeptide chain may display ordered structure. Four glycosylation sites were identified. The two N-glycosidic oligosaccharides were found in the center region (residues 138 and 158), whereas the two O-glycosidic heterosaccharides, both linked to threonine (residues 238 and 252), occur within the carboxyl-terminal region. The N-glycans are linked to Asn residues in beta-turns, while the O-glycans are located in short random segments. Comparison of the sequence of the amino- and carboxyl-terminal 30 residues with protein sequences in a data bank demonstrated that the A-chain is not significantly related to any known proteins. However, the proline-rich carboxyl-terminal region of the A-chain displays some sequence similarity to collagens and the collagen-like domains of complement subcomponent C1q. PMID:3944104
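
The doublet and collagen-like motif counts described in this abstract can be reproduced in spirit with a short sketch. It assumes "doublet" means two identical adjacent residues (as in Pro-Pro and Ala-Ala) and uses one-letter amino acid codes; the function names and the toy sequence in the usage note are hypothetical and are not taken from the A-chain itself.

```python
import re
from collections import Counter


def count_doublets(seq: str) -> Counter:
    """Count adjacent identical-residue pairs (e.g. 'PP', 'AA'); overlaps are counted."""
    seq = seq.upper()
    return Counter(a + b for a, b in zip(seq, seq[1:]) if a == b)


def find_motif_positions(seq: str, pattern: str) -> list:
    """Return 1-based start positions of all (possibly overlapping) regex matches."""
    # A zero-width lookahead lets overlapping occurrences be reported.
    return [m.start() + 1 for m in re.finditer(f"(?=({pattern}))", seq.upper())]
```

For a toy peptide such as `GAPPPAAGPK`, `count_doublets` reports two `PP` and one `AA`, while `find_motif_positions(seq, "G.P")` and `find_motif_positions(seq, "GP.")` locate Gly-X-Pro and Gly-Pro-X triplets respectively.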

  8. Comparative sequence analyses on the 16S rRNA (rDNA) of Bacillus acidocaldarius, Bacillus acidoterrestris, and Bacillus cycloheptanicus and proposal for creation of a new genus, Alicyclobacillus gen. nov

    NASA Technical Reports Server (NTRS)

    Wisotzkey, J. D.; Jurtshuk, P. Jr; Fox, G. E.; Deinhard, G.; Poralla, K.

    1992-01-01

    Comparative 16S rRNA (rDNA) sequence analyses performed on the thermophilic Bacillus species Bacillus acidocaldarius, Bacillus acidoterrestris, and Bacillus cycloheptanicus revealed that these organisms are sufficiently different from the traditional Bacillus species to warrant reclassification in a new genus, Alicyclobacillus gen. nov. An analysis of 16S rRNA sequences established that these three thermoacidophiles cluster in a group that differs markedly from both the obligately thermophilic organism Bacillus stearothermophilus and the facultatively thermophilic organism Bacillus coagulans, as well as many other common mesophilic and thermophilic Bacillus species. The thermoacidophilic Bacillus species B. acidocaldarius, B. acidoterrestris, and B. cycloheptanicus also are unique in that they possess omega-alicyclic fatty acid as the major natural membranous lipid component, which is a rare phenotype that has not been found in any other Bacillus species characterized to date. This phenotype, along with the 16S rRNA sequence data, suggests that these thermoacidophiles are biochemically and genetically unique and supports the proposal that they should be reclassified in the new genus Alicyclobacillus.

  9. Analysis of the functional domains of biosynthetic threonine deaminase by comparison of the amino acid sequences of three wild-type alleles to the amino acid sequence of biodegradative threonine deaminase.

    PubMed

    Taillon, B E; Little, R; Lawther, R P

    1988-03-31

    The nucleotide sequence of the gene, ilvA, for biosynthetic threonine deaminase (Tda) from Salmonella typhimurium was determined. The deduced amino acid sequence was compared with the deduced amino acid sequences of the biosynthetic Tda from Escherichia coli K-12 (ilvA) and Saccharomyces cerevisiae (ILV1) and the biodegradative Tda from E. coli K-12 (tdc). The comparison indicated the presence of two types of blocks of homologous amino acids. The first type of homology is in the N-terminal portion of all four isozymes of Tda and probably indicates amino acids involved in catalysis. The second type of homology is found in the C-terminal portion of the three biosynthetic isozymes and presumably is involved in either (i) the binding or interaction of the allosteric effector isoleucine with the enzyme, or (ii) subunit interactions. The sites of amino acid changes of two E. coli K-12 ilvA alleles with altered response to isoleucine are consistent with the conclusion that the C-terminal portion of biosynthetic Tda is involved in allosteric regulation. PMID:3290055

  10. Integrated microRNA-mRNA analyses reveal OPLL specific microRNA regulatory network using high-throughput sequencing.

    PubMed

    Xu, Chen; Chen, Yu; Zhang, Hao; Chen, Yuanyuan; Shen, Xiaolong; Shi, Changgui; Liu, Yang; Yuan, Wen

    2016-01-01

    Ossification of the posterior longitudinal ligament (OPLL) is a genetic disorder which involves pathological heterotopic ossification of the spinal ligaments. Although studies have identified several genes that correlate with OPLL, the underlying regulation network is far from clear. Through small RNA sequencing, we compared the microRNA expression of primary posterior longitudinal ligament cells from OPLL patients with that of cells from normal patients (PLL) and identified 218 dysregulated miRNAs (FDR < 0.01). Furthermore, assessing the miRNA profiling data of multiple cell types, we found these dysregulated miRNAs were mostly OPLL specific. In order to decipher the regulation network of these OPLL specific miRNAs, we integrated mRNA expression profiling data with miRNA sequencing data. Through computational approaches, we showed the pivotal roles of these OPLL specific miRNAs in heterotopic ossification of the longitudinal ligament by discovering highly correlated miRNA/mRNA pairs that are associated with skeletal system development, collagen fibril organization, and extracellular matrix organization. These results provide strong evidence that the miRNA regulatory networks we established may indeed play vital roles in OPLL onset and progression. To date, this is the first systematic analysis of the micronome in OPLL, and thus may provide valuable resources in finding novel treatment and diagnostic targets of OPLL. PMID:26868491
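
One common way to look for "highly correlated miRNA/mRNA pairs" is to compute a Pearson correlation across matched samples and keep pairs above a cutoff. The sketch below illustrates that idea only; the abstract does not say which statistic or threshold the authors used, so `pearson_r`, `correlated_pairs`, and the default `min_abs_r=0.9` are assumptions made for illustration.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return sxy / math.sqrt(sxx * syy)


def correlated_pairs(mirna_expr, mrna_expr, min_abs_r=0.9):
    """Return (miRNA, mRNA, r) for every pair with |r| >= min_abs_r.

    Both arguments map a name to expression values measured in the same
    sample order. The cutoff is a hypothetical default, not from the paper.
    """
    pairs = []
    for mirna, x in mirna_expr.items():
        for mrna, y in mrna_expr.items():
            r = pearson_r(x, y)
            if abs(r) >= min_abs_r:
                pairs.append((mirna, mrna, r))
    return pairs
```

Because miRNAs typically repress their targets, analyses of this kind often focus on pairs with strongly negative `r`; filtering on the sign is a one-line change.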

  11. The developmental transcriptome landscape of bovine skeletal muscle defined by Ribo-Zero ribonucleic acid sequencing.

    PubMed

    Sun, X; Li, M; Sun, Y; Cai, H; Li, R; Wei, X; Lan, X; Huang, Y; Lei, C; Chen, H

    2015-12-01

    Ribonucleic acid sequencing (RNA-Seq) libraries are normally prepared with oligo(dT) selection of poly(A)+ mRNA, but it depends on intact total RNA samples. Recent studies have described Ribo-Zero technology, a novel method that can capture both poly(A)+ and poly(A)- transcripts from intact or fragmented RNA samples. We report here the first application of Ribo-Zero RNA-Seq for the analysis of the bovine embryonic, neonatal, and adult skeletal muscle whole transcriptome at an unprecedented depth. Overall, 19,893 genes were found to be expressed, with a high correlation of expression levels between the calf and the adult. Hundreds of genes were found to be highly expressed in the embryo and decreased at least 10-fold after birth, indicating their potential roles in embryonic muscle development. In addition, we present for the first time the analysis of global transcript isoform discovery in bovine skeletal muscle and identified 36,694 transcript isoforms. Transcriptomic data were also analyzed to unravel sequence variations; 185,036 putative SNP and 12,428 putative short insertions-deletions (InDel) were detected. Specifically, many stop-gain, stop-loss, and frameshift mutations were identified that probably change the relative protein production and sequentially affect the gene function. Notably, the numbers of stage-specific transcripts, alternative splicing events, SNP, and InDel were greater in the embryo than in the calf and the adult, suggesting that gene expression is most active in the embryo. The resulting view of the transcriptome at a single-base resolution greatly enhances the comprehensive transcript catalog and uncovers the global trends in gene expression during bovine skeletal muscle development. PMID:26641174

  12. Method for the detection of specific nucleic acid sequences by polymerase nucleotide incorporation

    DOEpatents

    Castro, Alonso

    2004-06-01

    A method for rapid and efficient detection of a target DNA or RNA sequence is provided. A primer having a 3'-hydroxyl group at one end and having a sequence of nucleotides sufficiently homologous with an identifying sequence of nucleotides in the target DNA is selected. The primer is hybridized to the identifying sequence of nucleotides on the DNA or RNA sequence and a reporter molecule is synthesized on the target sequence by progressively binding complementary nucleotides to the primer, where the complementary nucleotides include nucleotides labeled with a fluorophore. Fluorescence emitted by fluorophores on single reporter molecules is detected to identify the target DNA or RNA sequence.

  13. Characterization and cDNA sequence of Bothriechis schlegelii l-amino acid oxidase with antibacterial activity.

    PubMed

    Vargas Muñoz, Leidy Johana; Estrada-Gomez, Sebastian; Núñez, Vitelbina; Sanz, Libia; Calvete, Juan J

    2014-08-01

    Snake venoms are complex mixtures of proteins including l-amino acid oxidase (lAAO). An lAAO (named BslAAO), with a mass of 56 kDa and a theoretical pI of 5.79, was purified from Bothriechis schlegelii venom through size-exclusion, ion exchange and affinity chromatography. The entire protein sequence of 498 amino acids was determined from cDNA using reverse-transcribed mRNA isolated from venom gland. The enzyme showed dose-dependent inhibition of bacterial growth. BslAAO showed an inhibitory effect against S. aureus with a MIC of 4 μg/mL and an MBC of 8 μg/mL. Against Acinetobacter baumannii, it showed a MIC of 2 μg/mL and an MBC of 4 μg/mL. No effect was observed against Escherichia coli. This antibacterial activity was inhibited by catalase, indicating that antimicrobial activity was due to H2O2 production. BslAAO did not show any cytotoxic activity toward the mouse myoblast cell line C2C12 or peripheral blood mononuclear cells. The enzyme oxidized l-Leu, with a Km of 16.37 μM and a Vmax of 0.39 μM/min. Snake venom lAAOs are potential frameworks for different therapeutic molecules, since these enzymes exhibit low MICs and MBCs and appear to be harmless to human cells, because microorganisms are generally several-fold more sensitive to reactive oxygen species than human tissues. PMID:24875315
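
The Km and Vmax reported for l-Leu can be read through the standard Michaelis–Menten rate law, v = Vmax·[S] / (Km + [S]). The sketch below assumes the enzyme follows that simple model (the abstract reports only the two constants); the function name `mm_rate` is illustrative.

```python
KM_UM = 16.37           # Km for l-Leu in micromolar, as quoted in the abstract
VMAX_UM_PER_MIN = 0.39  # Vmax in micromolar per minute, as quoted in the abstract


def mm_rate(substrate_um, km=KM_UM, vmax=VMAX_UM_PER_MIN):
    """Initial rate (uM/min) under the Michaelis-Menten model (an assumption here)."""
    if substrate_um < 0:
        raise ValueError("substrate concentration must be non-negative")
    return vmax * substrate_um / (km + substrate_um)
```

By construction, the rate at a substrate concentration equal to Km is half of Vmax, and the rate approaches Vmax as the substrate concentration grows.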

  14. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing

    PubMed Central

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G.

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  15. Genome Sequence of Schizochytrium sp. CCTCC M209059, an Effective Producer of Docosahexaenoic Acid-Rich Lipids

    PubMed Central

    Ji, Xiao-Jun; Mo, Kai-Qiang; Ren, Lu-Jing; Li, Gan-Lu; Huang, Jian-Zhong

    2015-01-01

    Schizochytrium is an effective species for producing omega-3 docosahexaenoic acid (DHA). Here, we report a genome sequence of Schizochytrium sp. CCTCC M209059, which has a genome size of 39.09 Mb. It will provide the genomic basis for further insights into the metabolic and regulatory mechanisms underlying the DHA formation. PMID:26251485

  16. Evolutionary Distance of Amino Acid Sequence Orthologs across Macaque Subspecies: Identifying Candidate Genes for SIV Resistance in Chinese Rhesus Macaques

    PubMed Central

    Ross, Cody T.; Roodgar, Morteza; Smith, David Glenn

    2015-01-01

    We use the Reciprocal Smallest Distance (RSD) algorithm to identify amino acid sequence orthologs in the Chinese and Indian rhesus macaque draft sequences and estimate the evolutionary distance between such orthologs. We then use GOanna to map gene function annotations and human gene identifiers to the rhesus macaque amino acid sequences. We conclude methodologically by cross-tabulating a list of amino acid orthologs with large divergence scores with a list of genes known to be involved in SIV or HIV pathogenesis. We find that many of the amino acid sequences with large evolutionary divergence scores, as calculated by the RSD algorithm, have been shown to be related to HIV pathogenesis in previous laboratory studies. Four of the strongest candidate genes for SIVmac resistance in Chinese rhesus macaques identified in this study are CDK9, CXCL12, TRIM21, and TRIM32. Additionally, ANKRD30A, CTSZ, GORASP2, GTF2H1, IL13RA1, MUC16, NMDAR1, Notch1, NT5M, PDCD5, RAD50, and TM9SF2 were identified as possible candidates, among others. We failed to find many laboratory experiments contrasting the effects of Indian and Chinese orthologs at these sites on SIVmac pathogenesis, but future comparative studies might hold fertile ground for research into the biological mechanisms underlying innate resistance to SIVmac in Chinese rhesus macaques. PMID:25884674
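
The "reciprocal smallest" idea behind RSD can be sketched as follows: a pair (a, b) is kept only when b is the closest sequence to a and a is also the closest sequence to b. This simplified illustration takes precomputed distances as input; it does not reproduce the similarity search, alignment, or evolutionary distance estimation of the actual RSD pipeline, and the function name and input format are assumptions.

```python
def reciprocal_smallest_pairs(dist):
    """Return sorted (a, b, d) triples that are mutual nearest neighbours.

    dist maps (a, b) -> distance, where a is a sequence from genome A and
    b is a sequence from genome B. Ties are resolved by first occurrence.
    """
    best_for_a = {}  # a -> (closest b, distance)
    best_for_b = {}  # b -> (closest a, distance)
    for (a, b), d in dist.items():
        if a not in best_for_a or d < best_for_a[a][1]:
            best_for_a[a] = (b, d)
        if b not in best_for_b or d < best_for_b[b][1]:
            best_for_b[b] = (a, d)
    return sorted(
        (a, b, d)
        for a, (b, d) in best_for_a.items()
        if best_for_b[b][0] == a
    )
```

The distance attached to each retained pair plays the role of the divergence score that the abstract describes cross-tabulating against genes known to be involved in SIV or HIV pathogenesis.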

  17. Evolutionary distance of amino acid sequence orthologs across macaque subspecies: identifying candidate genes for SIV resistance in Chinese rhesus macaques.

    PubMed

    Ross, Cody T; Roodgar, Morteza; Smith, David Glenn

    2015-01-01

    We use the Reciprocal Smallest Distance (RSD) algorithm to identify amino acid sequence orthologs in the Chinese and Indian rhesus macaque draft sequences and estimate the evolutionary distance between such orthologs. We then use GOanna to map gene function annotations and human gene identifiers to the rhesus macaque amino acid sequences. We conclude methodologically by cross-tabulating a list of amino acid orthologs with large divergence scores with a list of genes known to be involved in SIV or HIV pathogenesis. We find that many of the amino acid sequences with large evolutionary divergence scores, as calculated by the RSD algorithm, have been shown to be related to HIV pathogenesis in previous laboratory studies. Four of the strongest candidate genes for SIVmac resistance in Chinese rhesus macaques identified in this study are CDK9, CXCL12, TRIM21, and TRIM32. Additionally, ANKRD30A, CTSZ, GORASP2, GTF2H1, IL13RA1, MUC16, NMDAR1, Notch1, NT5M, PDCD5, RAD50, and TM9SF2 were identified as possible candidates, among others. We failed to find many laboratory experiments contrasting the effects of Indian and Chinese orthologs at these sites on SIVmac pathogenesis, but future comparative studies might hold fertile ground for research into the biological mechanisms underlying innate resistance to SIVmac in Chinese rhesus macaques. PMID:25884674

  18. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

    PubMed

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

    2016-01-01

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. PMID:26941141

  19. Draft Genome Sequence of Cutaneotrichosporon curvatus DSM 101032 (Formerly Cryptococcus curvatus), an Oleaginous Yeast Producing Polyunsaturated Fatty Acids.

    PubMed

    Hofmeyer, Thomas; Hackenschmidt, Silke; Nadler, Florian; Thürmer, Andrea; Daniel, Rolf; Kabisch, Johannes

    2016-01-01

    Cutaneotrichosporon curvatus DSM 101032 is an oleaginous yeast that can be isolated from various habitats and is capable of producing substantial amounts of polyunsaturated fatty acids. Here, we present the first draft genome sequence of any C. curvatus species. PMID:27174275

  20. Complete genome sequence of Lactobacillus plantarum ZS2058, a probiotic strain with high conjugated linoleic acid production ability.

    PubMed

    Yang, Bo; Chen, Haiqin; Tian, Fengwei; Zhao, Jianxin; Gu, Zhennan; Zhang, Hao; Chen, Yong Q; Chen, Wei

    2015-11-20

    Lactobacillus plantarum ZS2058 was isolated from sauerkraut and identified to synthesize the beneficial metabolite conjugated linoleic acid. The genome contains a 3,197,363-bp chromosome and three plasmids. The sequence will facilitate identification and characterization of the genetic determinants for its putative biological benefits. PMID:26439428

  1. Draft Genome Sequence of Burkholderia stabilis LA20W, a Trehalose Producer That Uses Levulinic Acid as a Substrate

    PubMed Central

    Sato, Yuya; Koike, Hideaki; Kondo, Susumu; Hori, Tomoyuki; Kanno, Manabu; Kimura, Nobutada; Morita, Tomotake; Kirimura, Kohtaro

    2016-01-01

    Burkholderia stabilis LA20W produces trehalose using levulinic acid (LA) as a substrate. Here, we report the 7.97-Mb draft genome sequence of B. stabilis LA20W, which will be useful in investigations of the enzymes involved in LA metabolism and the mechanism of LA-induced trehalose production. PMID:27491978

  2. Draft Genome Sequence of Acetobacter tropicalis Type Strain NBRC16470, a Producer of Optically Pure d-Glyceric Acid.

    PubMed

    Koike, Hideaki; Sato, Shun; Morita, Tomotake; Fukuoka, Tokuma; Habe, Hiroshi

    2014-01-01

    Here we report the 3.7-Mb draft genome sequence of Acetobacter tropicalis NBRC16470(T), which can produce optically pure d-glyceric acid (d-GA; 99% enantiomeric excess) from raw glycerol feedstock derived from biodiesel fuel production processes. PMID:25523780

  3. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing.

    PubMed

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G; Baylis, Sally A

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  4. Draft Genome Sequence of Burkholderia stabilis LA20W, a Trehalose Producer That Uses Levulinic Acid as a Substrate.

    PubMed

    Sato, Yuya; Koike, Hideaki; Kondo, Susumu; Hori, Tomoyuki; Kanno, Manabu; Kimura, Nobutada; Morita, Tomotake; Kirimura, Kohtaro; Habe, Hiroshi

    2016-01-01

    Burkholderia stabilis LA20W produces trehalose using levulinic acid (LA) as a substrate. Here, we report the 7.97-Mb draft genome sequence of B. stabilis LA20W, which will be useful in investigations of the enzymes involved in LA metabolism and the mechanism of LA-induced trehalose production. PMID:27491978

  5. Draft Genome Sequence of Cutaneotrichosporon curvatus DSM 101032 (Formerly Cryptococcus curvatus), an Oleaginous Yeast Producing Polyunsaturated Fatty Acids

    PubMed Central

    Hofmeyer, Thomas; Hackenschmidt, Silke; Nadler, Florian; Thürmer, Andrea; Daniel, Rolf

    2016-01-01

    Cutaneotrichosporon curvatus DSM 101032 is an oleaginous yeast that can be isolated from various habitats and is capable of producing substantial amounts of polyunsaturated fatty acids. Here, we present the first draft genome sequence of any C. curvatus species. PMID:27174275

  6. Ultra high-throughput nucleic acid sequencing as a tool for virus discovery in the turkey gut.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recently, the use of the next generation of nucleic acid sequencing technology (i.e., 454 pyrosequencing, as developed by Roche/454 Life Sciences) has allowed an in-depth look at the uncultivated microorganisms present in complex environmental samples, including samples with agricultural importance....

  7. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk

    PubMed Central

    Meneghel, Julie; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine

    2016-01-01

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. PMID:26941141

  8. Genome-wide association analyses based on whole-genome sequencing in Sardinia provide insights into regulation of hemoglobin levels.

    PubMed

    Danjou, Fabrice; Zoledziewska, Magdalena; Sidore, Carlo; Steri, Maristella; Busonero, Fabio; Maschio, Andrea; Mulas, Antonella; Perseu, Lucia; Barella, Susanna; Porcu, Eleonora; Pistis, Giorgio; Pitzalis, Maristella; Pala, Mauro; Menzel, Stephan; Metrustry, Sarah; Spector, Timothy D; Leoni, Lidia; Angius, Andrea; Uda, Manuela; Moi, Paolo; Thein, Swee Lay; Galanello, Renzo; Abecasis, Gonçalo R; Schlessinger, David; Sanna, Serena; Cucca, Francesco

    2015-11-01

    We report genome-wide association study results for the levels of A1, A2 and fetal hemoglobins, analyzed for the first time concurrently. Integrating high-density array genotyping and whole-genome sequencing in a large general population cohort from Sardinia, we detected 23 associations at 10 loci. Five signals are due to variants at previously undetected loci: MPHOSPH9, PLTP-PCIF1, ZFPM1 (FOG1), NFIX and CCND3. Among the signals at known loci, ten are new lead variants and four are new independent signals. Half of all variants also showed pleiotropic associations with different hemoglobins, which further corroborated some of the detected associations and identified features of coordinated hemoglobin species production. PMID:26366553

  9. Identification of medicinal Dendrobium species by phylogenetic analyses using matK and rbcL sequences.

    PubMed

    Asahina, Haruka; Shinozaki, Junichi; Masuda, Kazuo; Morimitsu, Yasujiro; Satake, Motoyoshi

    2010-04-01

    Species identification of five Dendrobium plants was conducted using phylogenetic analysis and the validity of the method was verified. Some Dendrobium plants (Orchidaceae) have been used as herbal medicines, but the difficulty in identifying their botanical origin by traditional methods has prevented their full modern utilization. Based on the emerging field of molecular systematics as a powerful classification tool, a phylogenetic analysis was conducted using sequences of two plastid genes, the maturase-coding gene (matK) and the large subunit of ribulose 1,5-bisphosphate carboxylase-coding gene (rbcL), as DNA barcodes for species identification of Dendrobium plants. We investigated five medicinal Dendrobium species, Dendrobium fimbriatum, D. moniliforme, D. nobile, D. pulchellum, and D. tosaense. The phylogenetic trees constructed from matK data successfully distinguished the species from one another. On the other hand, rbcL, as a single-locus barcode, offered less species-discriminating power than matK, possibly because it shows little variation. When matK sequences of D. officinale deposited in the DNA database were included, D. officinale and D. tosaense showed a close genetic relationship, which brought us closer to resolving the question of their taxonomic identity. Identification of the plant source as well as the uniformity of the chemical components is critical for the quality control of herbal medicines and it is important that the processed materials be validated. The methods presented here could be applied to the analysis of processed Dendrobium plants and be a promising tool for the identification of botanical origins of crude drugs. PMID:20140532

  10. Sequence-Specific Recognition of MicroRNAs and Other Short Nucleic Acids with Solid-State Nanopores.

    PubMed

    Zahid, Osama K; Wang, Fanny; Ruzicka, Jan A; Taylor, Ethan W; Hall, Adam R

    2016-03-01

    The detection and quantification of short nucleic acid sequences has many potential applications in studying biological processes, monitoring disease initiation and progression, and evaluating environmental systems, but is challenging by nature. We present here an assay based on the solid-state nanopore platform for the identification of specific sequences in solution. We demonstrate that hybridization of a target nucleic acid with a synthetic probe molecule enables discrimination between duplex and single-stranded molecules with high efficacy. Our approach requires limited preparation of samples and yields an unambiguous translocation event rate enhancement that can be used to determine the presence and abundance of a single sequence within a background of nontarget oligonucleotides. PMID:26824296

  11. Amino acid sequence of rabbit kidney neutral endopeptidase 24.11 (enkephalinase) deduced from a complementary DNA.

    PubMed Central

    Devault, A; Lazure, C; Nault, C; Le Moual, H; Seidah, N G; Chrétien, M; Kahn, P; Powell, J; Mallet, J; Beaumont, A

    1987-01-01

    Neutral endopeptidase (EC 3.4.24.11) is a major constituent of kidney brush border membranes. It is also present in the brain where it has been shown to be involved in the inactivation of opioid peptides, methionine- and leucine-enkephalins. For this reason this enzyme is often called 'enkephalinase'. In order to characterize the primary structure of the enzyme, oligonucleotide probes were designed from partial amino acid sequences and used to isolate clones from kidney cDNA libraries. Sequencing of the cDNA inserts revealed the complete primary structure of the enzyme. Neutral endopeptidase consists of 750 amino acids. It contains a short N-terminal cytoplasmic domain (27 amino acids), a single membrane-spanning segment (23 amino acids) and an extracellular domain that comprises most of the protein mass. The comparison of the primary structure of neutral endopeptidase with that of thermolysin, a bacterial Zn-metallopeptidase, indicates that most of the amino acid residues involved in Zn coordination and catalytic activity in thermolysin are found within highly homologous sequences in neutral endopeptidase. PMID:2440677

  12. Human parainfluenza type 3 virus hemagglutinin-neuraminidase glycoprotein: nucleotide sequence of mRNA and limited amino acid sequence of the purified protein.

    PubMed Central

    Elango, N; Coligan, J E; Jambou, R C; Venkatesan, S

    1986-01-01

    The nucleotide sequence of mRNA for the hemagglutinin-neuraminidase (HN) protein of human parainfluenza type 3 virus obtained from the corresponding cDNA clone had a single long open reading frame encoding a putative protein of 64,254 daltons consisting of 572 amino acids. The deduced protein sequence was confirmed by limited N-terminal amino acid microsequencing of CNBr cleavage fragments of native HN that was purified by immunoprecipitation. The HN protein is moderately hydrophobic and has four potential sites (Asn-X-Ser/Thr) of N-glycosylation in the C-terminal half of the molecule. It is devoid of both the N-terminal signal sequence and the C-terminal membrane anchorage domain characteristic of the hemagglutinin of influenza virus and the fusion (F0) protein of the paramyxoviruses. Instead, it has a single prominent hydrophobic region capable of membrane insertion beginning at 32 residues from the N terminus. This N-terminal membrane insertion is similar to that of influenza virus neuraminidase and the recently reported structures of HN proteins of Sendai virus and simian virus 5. PMID:3003381

  13. Sequence of cDNA for rat cystathionine gamma-lyase and comparison of deduced amino acid sequence with related Escherichia coli enzymes.

    PubMed Central

    Erickson, P F; Maxwell, I H; Su, L J; Baumann, M; Glode, L M

    1990-01-01

    A cDNA clone for cystathionine gamma-lyase was isolated from a rat cDNA library in lambda gt11 by screening with a monospecific antiserum. The identity of this clone, containing 600 bp proximal to the 3'-end of the gene, was confirmed by positive hybridization selection. Northern-blot hybridization showed the expected higher abundance of the corresponding mRNA in liver than in brain. Two further cDNA clones from a plasmid pcD library were isolated by colony hybridization with the first clone and were found to contain inserts of 1600 and 1850 bp. One of these was confirmed as encoding cystathionine gamma-lyase by hybridization with two independent pools of oligodeoxynucleotides corresponding to partial amino acid sequence information for cystathionine gamma-lyase. The other clone (estimated to represent all but 8% of the 5'-end of the mRNA) was sequenced and its deduced amino acid sequence showed similarity to those of the Escherichia coli enzymes cystathionine beta-lyase and cystathionine gamma-synthase throughout its length, especially to that of the latter. PMID:2201285

  14. Sequence dependent N-terminal rearrangement and degradation of peptide nucleic acid (PNA) in aqueous solution

    NASA Technical Reports Server (NTRS)

    Eriksson, M.; Christensen, L.; Schmidt, J.; Haaima, G.; Orgel, L.; Nielsen, P. E.

    1998-01-01

    The stability of the PNA (peptide nucleic acid) thymine monomer (N-[2-(thymin-1-ylacetyl)]-N-(2-aminoaminoethyl)glycine) and those of various PNA oligomers (5-8-mers) have been measured at room temperature (20 degrees C) as a function of pH. The thymine monomer undergoes N-acyl transfer rearrangement with a half-life of 34 days at pH 11 as analyzed by 1H NMR; and two reactions, the N-acyl transfer and a sequential degradation, are found by HPLC analysis to occur at measurable rates for the oligomers at pH 9 or above. Dependent on the amino-terminal sequence, half-lives of 350 h to 163 days were found at pH 9. At pH 12 the half-lives ranged from 1.5 h to 21 days. The results are discussed in terms of PNA as a gene therapeutic drug as well as a possible prebiotic genetic material.
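
    To compare the half-lives quoted above on a common scale, they can be converted to rate constants. This sketch assumes simple first-order decay, which the abstract does not state explicitly; the function names are illustrative.

```python
import math


def rate_constant_per_hour(half_life_hours: float) -> float:
    """First-order rate constant k = ln(2) / t_half (assumed kinetics)."""
    return math.log(2) / half_life_hours


def fraction_remaining(half_life_hours: float, elapsed_hours: float) -> float:
    """Fraction of intact material left after elapsed_hours."""
    return 0.5 ** (elapsed_hours / half_life_hours)


if __name__ == "__main__":
    # Half-lives as reported in the abstract, converted to hours.
    reported = {
        "monomer, pH 11 (34 days)": 34 * 24,
        "oligomers, pH 9, shortest (350 h)": 350,
        "oligomers, pH 9, longest (163 days)": 163 * 24,
        "oligomers, pH 12, shortest (1.5 h)": 1.5,
        "oligomers, pH 12, longest (21 days)": 21 * 24,
    }
    for label, t_half in reported.items():
        k = rate_constant_per_hour(t_half)
        print(f"{label}: k = {k:.2e} per hour")
```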

  15. Structural analysis of complementary DNA and amino acid sequences of human and rat androgen receptors

    SciTech Connect

    Chang, C.; Kokontis, J.; Liao, S.

    1988-10-01

    Structural analysis of cDNAs for human and rat androgen receptors (ARs) indicates that the amino-terminal regions of ARs are rich in oligo- and poly(amino acid) motifs as in some homeotic genes. The human AR has a long stretch of repeated glycines, whereas rat AR has a long stretch of glutamines. There is a considerable sequence similarity among ARs and the receptors for glucocorticoids, progestins, and mineralocorticoids within the steroid-binding domains. The cysteine-rich DNA-binding domains are well conserved. Translation of mRNA transcribed from AR cDNAs yielded 94- and 76-kDa proteins and smaller forms that bind to DNA and have high affinity toward androgens. These rat or human ARs were recognized by human autoantibodies to natural ARs. Molecular hybridization studies, using AR cDNAs as probes, indicated that the ventral prostate and other male accessory organs are rich in AR mRNA and that the production of AR mRNA in the target organs may be autoregulated by androgens.

  16. Rapid and Sensitive Isothermal Detection of Nucleic-acid Sequence by Multiple Cross Displacement Amplification

    PubMed Central

    Wang, Yi; Wang, Yan; Ma, Ai-Jing; Li, Dong-Xun; Luo, Li-Juan; Liu, Dong-Xin; Jin, Dong; Liu, Kai; Ye, Chang-Yun

    2015-01-01

    We have devised a novel amplification strategy based on an isothermal strand-displacement polymerization reaction, termed multiple cross displacement amplification (MCDA). The approach employed a set of ten specially designed primers spanning ten distinct regions of the target sequence and proceeded at a constant temperature (61–65 °C). At the assay temperature, the double-stranded DNAs were in a dynamic reaction environment of primer-template hybrids, so the high concentration of primers annealed to the template strands without a denaturing step to initiate the synthesis. For the subsequent isothermal amplification step, a series of primer binding and extension events yielded several single-stranded DNAs and single-stranded single stem-loop DNA structures. Then, these DNA products enabled the strand-displacement reaction to enter into the exponential amplification. Three mainstream methods, including colorimetric indicators, agarose gel electrophoresis and real-time turbidity, were selected for monitoring the MCDA reaction. Moreover, the practical application of the MCDA assay was successfully evaluated by detecting the target pathogen nucleic acid in pork samples, which offered the advantages of quick results, modest equipment requirements, ease of operation, and high specificity and sensitivity. Here we expound the basic MCDA mechanism and also provide details on an alternative (Single-MCDA assay, S-MCDA) to the MCDA technique. PMID:26154567

  17. Snake venoms. The amino acid sequences of two proteinase inhibitor homologues from Dendroaspis angusticeps venom.

    PubMed

    Joubert, F J; Taljaard, N

    1980-05-01

    Toxins C13S1C3 and C13S2C3 from D. angusticeps venom were purified by gel filtration and ion exchange chromatography. C13S1C3 contains 57 amino acids and C13S2C3 contains 59, but each includes six half-cystine residues. The complete primary structures of these low-toxicity proteins have been elucidated. The sequences and the invariant residues of toxins C13S1C3 and C13S2C3 from D. angusticeps venom resemble, respectively, those of the proteinase inhibitor homologues K and I from D. polylepis polylepis venom and they are also homologous to the active proteinase inhibitors from various sources. In C13S1C3 and K the active site lysyl residue of active bovine pancreatic proteinase inhibitor is conserved but the site residue alanine is replaced by lysine. In C13S2C3 and I the active site residue is replaced by tyrosine. PMID:7429422

  18. Advanced nuclear magnetic resonance lanthanide probe analyses of short-range conformational interrelations controlling ribonucleic acid structures.

    PubMed

    Yokoyama, S; Inagaki, F; Miyazawa, T

    1981-05-12

    An advanced method was developed for lanthanide-probe analyses of the conformations of flexible biomolecules such as nucleotides. The new method is to determine structure parameters (such as internal-rotation angles) and population parameters for local conformational equilibria of flexible sites, together with standard deviations of these parameters. As the prominent advantage of this method, the interrelations among local conformations of flexible sites may be quantitatively elucidated from the experimental data of lanthanide-induced shifts and relaxations and vicinal coupling constants. As a structural unit of ribonucleic acids, the molecular conformations and conformational equilibria of uridine 3'-monophosphate in aqueous solution were analyzed. The stable local conformers about the C3'-O3' bond are the G+ (phi' = 281 +/- 11 degrees) and G- (phi' = 211 +/- 8 degrees) forms. The internal rotation about the C3'-O3' bond and the ribose-ring puckering are interrelated; 97 +/- 5% of the C3'-endo ribose ring is associated with the G- form while 70 +/- 22% of the C2'-endo ribose ring is associated with the G+ form. An interdependency also exists between the internal rotation about the C4'-C5' bond and the ribose-ring puckering. These short-range conformational interrelations are probably important in controlling the dynamic aspects of ribonucleic acid structures. PMID:6166319

  19. Dynamics of a microbial community associated with manure hot spots as revealed by phospholipid fatty acid analyses.

    PubMed Central

    Frostegård, A; Petersen, S O; Bååth, E; Nielsen, T H

    1997-01-01

    Microbial community dynamics associated with manure hot spots were studied by using a model system consisting of a gel-stabilized mixture of soil and manure, placed between layers of soil, during a 3-week incubation period. The microbial biomass, measured as the total amount of phospholipid fatty acids (PLFA), had doubled within a 2-mm distance from the soil-manure interface after 3 days. Principal-component analyses demonstrated that this increase was accompanied by reproducible changes in the composition of PLFA, indicating changes in the microbial community structure. The effect of the manure was strongest in the 2-mm-thick soil layer closest to the interface, in which the PLFA composition was statistically significantly different (P < 0.05) from that of the unaffected soil layers throughout the incubation period. An effect was also observed in the soil layer 2 to 4 mm from the interface. The changes in microbial biomass and community structure were mainly attributed to the diffusion of dissolved organic carbon from the manure. During the initial period of microbial growth, PLFA, which were already more abundant in the manure than in the soil, increased in the manure core and in the 2-mm soil layer closest to the interface. After day 3, the PLFA composition of these layers gradually became more similar to that of the soil. The dynamics of individual PLFA suggested that both taxonomic and physiological changes occurred during growth. Examples of the latter were decreases in the ratios of 16:1 omega 7t to 16:1 omega 7c and of cyclopropyl fatty acids to their respective precursors, indicating a more active bacterial community. An inverse relationship between bacterial PLFA and the eucaryotic 20:4 PLFA (arachidonic acid) suggested that grazing was important. PMID:9172342

  20. Effects of simple acid leaching of crushed and powdered geological materials on high-precision Pb isotope analyses

    NASA Astrophysics Data System (ADS)

    Todd, Erin; Stracke, Andreas; Scherer, Erik E.

    2015-07-01

    We present new results of simple acid leaching experiments on the Pb isotope composition of USGS standard reference material powders and on ocean island basalt whole rock splits and powders. Rock samples were leached with cold 6 N HCl in an ultrasonic bath, then on a hot plate, and washed with ultrapure H2O before sample digestion in HF-HNO3 and chromatographic purification of Pb. Lead isotope analyses were measured by Tl-doped MC-ICPMS. Intrasession and intersession analytical reproducibilities of repeated analyses of both synthetic Pb solutions and Pb from single digests of chemically processed natural samples were generally better than 100 ppm (2 SD). The comparison of leached and unleached samples shows that leaching consistently removes variable amounts of contaminants that differ in Pb isotopic composition for different starting materials. For repeated digests of a single sample, analyses of leached samples reproduce better than those of unleached ones, confirming that leaching effectively removes most of the heterogeneously distributed extraneous Pb. Nevertheless, the external reproducibility of leached samples is still up to an order of magnitude worse than that of Pb solution standards (˜100 ppm). More complex leaching methods employed by earlier studies yield Pb isotope ratios within error of those produced by our method and at similar levels of reproducibility, demonstrating that our simple leaching method is as effective as more complex leaching techniques. Therefore, any Pb isotope heterogeneity among multiple leached digests of samples in excess of the external reproducibility is attributed to inherent isotopic heterogeneity of the sample. The external precision of ˜100 ppm (2 SD) achieved for Pb isotope ratio determination by Tl-doped MC-ICPMS is thus sufficient for most rocks. The full advantage of the most precise Pb isotope analytical methods is only realized in cases where the natural isotopic heterogeneity among samples in a studied suite is

  1. Phylogenetic Analysis of Bolivian Bat Trypanosomes of the Subgenus Schizotrypanum Based on Cytochrome b Sequence and Minicircle Analyses

    PubMed Central

    García, Lineth; Ortiz, Sylvia; Osorio, Gonzalo; Torrico, Mary Cruz; Torrico, Faustino; Solari, Aldo

    2012-01-01

    The aim of this study was to establish the phylogenetic relationships of trypanosomes present in blood samples of Bolivian Carollia bats. Eighteen cloned stocks were isolated from 115 bats belonging to Carollia perspicillata (Phyllostomidae) from three Amazonian areas of the Chapare Province of Bolivia and studied by xenodiagnosis using the vectors Rhodnius robustus and Triatoma infestans (Trypanosoma cruzi marinkellei) or haemoculture (Trypanosoma dionisii). The PCR-amplified DNA was analyzed by nucleotide sequences of maxicircles encoding cytochrome b and by means of the molecular size of hypervariable regions of minicircles. Ten samples were classified as Trypanosoma cruzi marinkellei and 8 samples as Trypanosoma dionisii. The two species have a different molecular size profile with respect to the amplified regions of minicircles and also with respect to Trypanosoma cruzi and Trypanosoma rangeli used for comparative purposes. We conclude that two species of bat trypanosomes are present in these samples and that they can clearly be identified by the methods used in this study. The presence of these trypanosomes in Amazonian bats is discussed. PMID:22590570

  2. Ancient DNA analyses reveal high mitochondrial DNA sequence diversity and parallel morphological evolution of late pleistocene cave bears.

    PubMed

    Hofreiter, Michael; Capelli, Cristian; Krings, Matthias; Waits, Lisette; Conard, Nicholas; Münzel, Susanne; Rabeder, Gernot; Nagel, Doris; Paunovic, Maja; Jambrešić, Gordana; Meyer, Sonja; Weiss, Gunter; Pääbo, Svante

    2002-08-01

    Cave bears (Ursus spelaeus) existed in Europe and western Asia until the end of the last glaciation some 10,000 years ago. To investigate the genetic diversity, population history, and relationship among different cave bear populations, we have determined mitochondrial DNA sequences from 12 cave bears that range in age from about 26,500 to at least 49,000 years and originate from nine caves. The samples include one individual from the type specimen population, as well as two small-sized high-Alpine bears. The results show that about 49,000 years ago, the mtDNA diversity among cave bears was about 1.8-fold lower than the current species-wide diversity of brown bears (Ursus arctos). However, the current brown bear mtDNA gene pool consists of three clades, and cave bear mtDNA diversity is similar to the diversity observed within each of these clades. The results also show that geographically separated populations of the high-Alpine cave bear form were polyphyletic with respect to their mtDNA. This suggests that small size may have been an ancestral trait in cave bears and that large size evolved at least twice independently. PMID:12140236

  3. Phylogenetic analysis of Bolivian bat trypanosomes of the subgenus schizotrypanum based on cytochrome B sequence and minicircle analyses.

    PubMed

    García, Lineth; Ortiz, Sylvia; Osorio, Gonzalo; Torrico, Mary Cruz; Torrico, Faustino; Solari, Aldo

    2012-01-01

    The aim of this study was to establish the phylogenetic relationships of trypanosomes present in blood samples of Bolivian Carollia bats. Eighteen cloned stocks were isolated from 115 bats belonging to Carollia perspicillata (Phyllostomidae) from three Amazonian areas of the Chapare Province of Bolivia and studied by xenodiagnosis using the vectors Rhodnius robustus and Triatoma infestans (Trypanosoma cruzi marinkellei) or haemoculture (Trypanosoma dionisii). The PCR-amplified DNA was analyzed by nucleotide sequences of maxicircles encoding cytochrome b and by means of the molecular size of hypervariable regions of minicircles. Ten samples were classified as Trypanosoma cruzi marinkellei and 8 samples as Trypanosoma dionisii. The two species have a different molecular size profile with respect to the amplified regions of minicircles and also with respect to Trypanosoma cruzi and Trypanosoma rangeli used for comparative purposes. We conclude that two species of bat trypanosomes are present in these samples and that they can clearly be identified by the methods used in this study. The presence of these trypanosomes in Amazonian bats is discussed. PMID:22590570

  4. Transcriptome Sequencing Analyses between the Cytoplasmic Male Sterile Line and Its Maintainer Line in Welsh Onion (Allium fistulosum L.)

    PubMed Central

    Liu, Qianchun; Lan, Yanping; Wen, Changlong; Zhao, Hong; Wang, Jian; Wang, Yongqin

    2016-01-01

    Cytoplasmic male sterility (CMS) is important for exploiting heterosis in crop plants and also serves as a model for investigating nuclear–cytoplasmic interaction. The molecular mechanism of cytoplasmic male sterility and fertility restoration was investigated in several important economic crops but remains poorly understood in the Welsh onion. Therefore, we compared the differences between the CMS line 64-2 and its maintainer line 64-1 using transcriptome sequencing with the aim of determining critical genes and pathways associated with male sterility. This study combined two years of RNA-seq data; there were 1504 unigenes (in May 2013) and 2928 unigenes (in May 2014) that were differentially expressed between the CMS and cytoplasmic male maintainer Welsh onion varieties. Known CMS-related genes were found in the set of differentially expressed genes and checked by qPCR. These genes included F-type ATPase, NADH dehydrogenase, cytochrome c oxidase, etc. Overall, this study demonstrated that the CMS regulatory genes and pathways may be associated with the mitochondria and nucleus in the Welsh onion. We believe that this transcriptome dataset will accelerate the research on CMS gene clones and other functional genomics research on A. fistulosum L. PMID:27376286

  5. Analyses of the population structure in a global collection of Phytophthora nicotianae isolates inferred from mitochondrial and nuclear DNA sequences.

    PubMed

    Mammella, Marco A; Martin, Frank N; Cacciola, Santa O; Coffey, Michael D; Faedda, Roberto; Schena, Leonardo

    2013-06-01

    Genetic variation within the heterothallic cosmopolitan plant pathogen Phytophthora nicotianae was determined in 96 isolates from a wide range of hosts and geographic locations by characterizing four mitochondrial (10% of the genome) and three nuclear loci. In all, 52 single-nucleotide polymorphisms (SNPs) (an average of 1 every 58 bp) and 313 sites with gaps representing 5,450 bases enabled the identification of 50 different multilocus mitochondrial haplotypes. Similarly, 24 SNPs (an average of 1 every 69 bp), with heterozygosity observed at each locus, were observed in three nuclear regions (hyp, scp, and β-tub) differentiating 40 multilocus nuclear genotypes. Both mitochondrial and nuclear markers revealed a high level of dispersal of isolates and an inconsistent geographic structuring of populations. However, a specific association was observed for host of origin and genetic grouping with both nuclear and mitochondrial sequences. In particular, the majority of citrus isolates from Italy, California, Florida, Syria, Albania, and the Philippines clustered in the same mitochondrial group and shared at least one nuclear allele. A similar association was also observed for isolates recovered from Nicotiana and Solanum spp. The present study suggests an important role of nursery populations in increasing genetic recombination within the species and the existence of extensive phenomena of migration of isolates that have been likely spread worldwide with infected plant material. PMID:23384862

  6. Transcriptome Sequencing Analyses between the Cytoplasmic Male Sterile Line and Its Maintainer Line in Welsh Onion (Allium fistulosum L.).

    PubMed

    Liu, Qianchun; Lan, Yanping; Wen, Changlong; Zhao, Hong; Wang, Jian; Wang, Yongqin

    2016-01-01

    Cytoplasmic male sterility (CMS) is important for exploiting heterosis in crop plants and also serves as a model for investigating nuclear-cytoplasmic interaction. The molecular mechanism of cytoplasmic male sterility and fertility restoration was investigated in several important economic crops but remains poorly understood in the Welsh onion. Therefore, we compared the differences between the CMS line 64-2 and its maintainer line 64-1 using transcriptome sequencing with the aim of determining critical genes and pathways associated with male sterility. This study combined two years of RNA-seq data; there were 1504 unigenes (in May 2013) and 2928 unigenes (in May 2014) that were differentially expressed between the CMS and cytoplasmic male maintainer Welsh onion varieties. Known CMS-related genes were found in the set of differentially expressed genes and checked by qPCR. These genes included F-type ATPase, NADH dehydrogenase, cytochrome c oxidase, etc. Overall, this study demonstrated that the CMS regulatory genes and pathways may be associated with the mitochondria and nucleus in the Welsh onion. We believe that this transcriptome dataset will accelerate the research on CMS gene clones and other functional genomics research on A. fistulosum L. PMID:27376286

  7. Compact variant-rich customized sequence database and a fast and sensitive database search for efficient proteogenomic analyses.

    PubMed

    Park, Heejin; Bae, Junwoo; Kim, Hyunwoo; Kim, Sangok; Kim, Hokeun; Mun, Dong-Gi; Joh, Yoonsung; Lee, Wonyeop; Chae, Sehyun; Lee, Sanghyuk; Kim, Hark Kyun; Hwang, Daehee; Lee, Sang-Won; Paek, Eunok

    2014-12-01

    In proteogenomic analysis, construction of a compact, customized database from mRNA-seq data and a sensitive search of both reference and customized databases are essential to accurately determine protein abundances and structural variations at the protein level. However, these tasks have not been systematically explored, but rather performed in an ad-hoc fashion. Here, we present an effective method for constructing a compact database containing comprehensive sequences of sample-specific variants--single nucleotide variants, insertions/deletions, and stop-codon mutations derived from Exome-seq and RNA-seq data. It, however, occupies less space by storing variant peptides, not variant proteins. We also present an efficient search method for both customized and reference databases. The separate searches of the two databases increase the search time, and a unified search is less sensitive to identify variant peptides due to the smaller size of the customized database, compared to the reference database, in the target-decoy setting. Our method searches the unified database once, but performs target-decoy validations separately. Experimental results show that our approach is as fast as the unified search and as sensitive as the separate searches. Our customized database includes mutation information in the headers of variant peptides, thereby facilitating the inspection of peptide-spectrum matches. PMID:25316439
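
    The idea of searching a unified database once but validating reference and variant matches separately can be illustrated with a toy target-decoy filter. This is a rough sketch under stated assumptions, not the authors' implementation: the `PSM` record, the higher-is-better score, and the simple decoys/targets FDR estimate are all illustrative choices.

```python
from dataclasses import dataclass


@dataclass
class PSM:
    """Hypothetical peptide-spectrum match from one unified search."""
    score: float       # assumed: higher means a better match
    is_decoy: bool     # matched a decoy sequence
    is_variant: bool   # matched an entry from the customized variant database


def accepted_at_fdr(psms: list[PSM], max_fdr: float) -> list[PSM]:
    """Keep target PSMs down to the deepest rank where decoys/targets <= max_fdr."""
    ranked = sorted(psms, key=lambda p: p.score, reverse=True)
    targets = decoys = 0
    cut = 0  # number of top-ranked PSMs to keep
    for rank, psm in enumerate(ranked, start=1):
        if psm.is_decoy:
            decoys += 1
        else:
            targets += 1
        if targets and decoys / targets <= max_fdr:
            cut = rank
    return [p for p in ranked[:cut] if not p.is_decoy]


def separate_validation(psms: list[PSM], max_fdr: float = 0.01):
    """Split one search result by database of origin and filter each part alone."""
    reference = [p for p in psms if not p.is_variant]
    variant = [p for p in psms if p.is_variant]
    return accepted_at_fdr(reference, max_fdr), accepted_at_fdr(variant, max_fdr)
```

    Because each class is thresholded against its own decoys, a small set of variant matches is not judged by the score distribution of the much larger reference set, which is the motivation the abstract gives for separate validation.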

  8. Nucleotide and predicted amino acid sequence of a cDNA clone encoding part of human transketolase.

    PubMed

    Abedinia, M; Layfield, R; Jones, S M; Nixon, P F; Mattick, J S

    1992-03-31

    Transketolase is a key enzyme in the pentose-phosphate pathway which has been implicated in the latent human genetic disease, Wernicke-Korsakoff syndrome. Here we report the cloning and partial characterisation of the coding sequences encoding human transketolase from a human brain cDNA library. The library was screened with oligonucleotide probes based on the amino acid sequence of proteolytic fragments of the purified protein. Northern blots showed that the transketolase mRNA is approximately 2.2 kb, close to the minimum expected, of which approximately 60% was represented in the largest cDNA clone. Sequence analysis of the transketolase coding sequences reveals a number of homologies with related enzymes from other species. PMID:1567394

  9. Monitoring of Chlamydia trachomatis infections after antibiotic treatment using RNA detection by nucleic acid sequence based amplification.

    PubMed Central

    Morré, S A; Sillekens, P T; Jacobs, M V; de Blok, S; Ossewaarde, J M; van Aarle, P; van Gemen, B; Walboomers, J M; Meijer, C J; van den Brule, A J

    1998-01-01

    AIM: To investigate the value of RNA detection by nucleic acid sequence based amplification (NASBA) for the monitoring of Chlamydia trachomatis infections after antibiotic treatment. METHODS: Cervical smears (n = 97) and urine specimens (n = 61) from 25 C trachomatis positive female patients were analysed for the presence of C trachomatis 16S ribosomal RNA (rRNA) by NASBA and C trachomatis plasmid DNA by the polymerase chain reaction (PCR) before and up to five weeks after antibiotic treatment. RESULTS: Chlamydia trachomatis RNA was found in all cervical smears taken before antibiotic treatment (n = 24) and in two smears taken one week after antibiotic treatment; no C trachomatis RNA was detected after two weeks or more. In contrast, C trachomatis DNA was found in all such specimens before treatment, and 21 of 25, six of 21, and five of 20 smears were found to be positive at one, two, and three weeks after treatment, respectively. After four weeks, only one of six smears was positive, and this smear had been negative in the two preceding weeks. Of the 61 urine samples investigated, C trachomatis DNA and C trachomatis RNA were found in all before treatment (n = 15), whereas one week after treatment four of 15 were C trachomatis DNA positive and C trachomatis RNA was detected in one sample only. CONCLUSIONS: These data show that RNA detection by NASBA can be used successfully to monitor C trachomatis infections after antibiotic treatment. Furthermore, it might be possible to use urine specimens as a test of cure because neither C trachomatis DNA nor RNA could be detected two weeks or more after treatment. PMID:9850338
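
    The cervical-smear DNA counts reported above are easier to compare as percentages. The short sketch below only restates the counts from the abstract; the dictionary layout and function name are illustrative.

```python
# Cervical smears positive for C trachomatis plasmid DNA by PCR,
# keyed by weeks after treatment: (positive, tested), as given in the abstract.
DNA_POSITIVE = {1: (21, 25), 2: (6, 21), 3: (5, 20), 4: (1, 6)}


def percent_positive(positive: int, tested: int) -> float:
    """Percentage of tested specimens that were positive, to one decimal."""
    return round(100 * positive / tested, 1)


if __name__ == "__main__":
    for week, (positive, tested) in DNA_POSITIVE.items():
        pct = percent_positive(positive, tested)
        print(f"week {week}: {positive}/{tested} DNA positive ({pct}%)")
```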

  10. Sample Prep, Workflow Automation and Nucleic Acid Fractionation for Next Generation Sequencing

    SciTech Connect

    Roskey, Mark

    2010-06-03

    Mark Roskey of Caliper LifeSciences discusses how the company's technologies fit into the next generation sequencing workflow on June 3, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM

  11. Evolution of vertebrate IgM: complete amino acid sequence of the constant region of Ambystoma mexicanum mu chain deduced from cDNA sequence.

    PubMed

    Fellah, J S; Wiles, M V; Charlemagne, J; Schwager, J

    1992-10-01

    cDNA clones coding for the constant region of the Mexican axolotl (Ambystoma mexicanum) mu heavy immunoglobulin chain were selected from total spleen RNA, using a cDNA polymerase chain reaction technique. The specific 5'-end primer was an oligonucleotide homologous to the JH segment of Xenopus laevis mu chain. One of the clones, JHA/3, corresponded to the complete constant region of the axolotl mu chain, consisting of a 1362-nucleotide sequence coding for a polypeptide of 454 amino acids followed in 3' direction by a 179-nucleotide untranslated region and a polyA+ tail. The axolotl C mu is divided into four typical domains (C mu 1-C mu 4) and can be aligned with the Xenopus C mu with an overall identity of 56% at the nucleotide level. Percent identities were particularly high between C mu 1 (59%) and C mu 4 (71%). The C-terminal 20-amino acid segment which constitutes the secretory part of the mu chain is strongly homologous to the equivalent sequences of chondrichthyans and of other tetrapods, including a conserved N-linked oligosaccharide, the penultimate cysteine and the C-terminal lysine. The four C mu domains of 13 vertebrate species ranging from chondrichthyans to mammals were aligned and compared at the amino acid level. The significant number of mu-specific residues which are conserved into each of the four C mu domains argues for a continuous line of evolution of the vertebrate mu chain. This notion was confirmed by the ability to reconstitute a consistent vertebrate evolution tree based on the phylogenic parsimony analysis of the C mu 4 sequences. PMID:1382992
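
    The relation between the 1362-nucleotide coding sequence and the 454-residue polypeptide reported above follows from the three-nucleotide codon. A minimal sketch of that check is given below; it assumes the quoted nucleotide count excludes the stop codon (as the two figures imply), and the function name is hypothetical.

```python
CODON_LENGTH = 3  # nucleotides per codon


def residues_encoded(coding_nucleotides, includes_stop_codon=False):
    """Number of amino acid residues encoded by a coding sequence length."""
    if coding_nucleotides % CODON_LENGTH != 0:
        raise ValueError("coding length is not a whole number of codons")
    codons = coding_nucleotides // CODON_LENGTH
    # A stop codon occupies three nucleotides but encodes no residue.
    return codons - 1 if includes_stop_codon else codons


# Figure from the abstract: 1362 nucleotides for the axolotl C mu region.
axolotl_c_mu_residues = residues_encoded(1362)
print(axolotl_c_mu_residues)
```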

  12. Full Vector Analyses of the Short-Term Behavior Recorded in Long Volcanic Sequences in Hawaii, USA.

    NASA Astrophysics Data System (ADS)

    Herrero-Bervera, E.

    2007-05-01

    The Hawaiian volcanoes, in principle, offer the opportunity of observing the geomagnetic field behavior from present back to 5.72 Ma (from the Big Island of Hawaii to the island of Kauai). Thus, new paleomagnetic measurements coupled with radioisotopic dating are revolutionizing our understanding of the geodynamo by providing terrestrial lava records of the short-term behavior of the paleomagnetic field. As part of our investigations of some of these Hawaiian volcanoes, we have sampled long volcanic sequences of the Waianae, Koolau (island of O'ahu) and Mauna Loa (Big Island of Hawaii) volcanoes. These volcanic edifices have collapsed in the past leaving highly dissected lava sequences ideal for paleomagnetic sampling. We have sampled and studied in detail the directional characteristics and their respective absolute paleointensities of three successive reversals, namely, Gilbert-Gauss, Lower and Upper Mammoth polarity transitions recorded on Waianae lavas, Cryptochron C2r.2r-1 (ca. 2.514 +/- 0.030 Ma) as well as the Kaena Subchron recorded in the Koolau Volcano and the Laschamp and Pringle Falls excursions recorded on lavas from the Mauna Loa volcano (Big Island of Hawaii). The records of the three successive Gilbert-Gauss, Lower and Upper Mammoth reversals confirm that large oscillations of directions precede or follow the reversals, reminiscent of waveforms typical of paleosecular variation with their amplitude being considerably amplified by the decrease of the dipole. Determinations of absolute paleointensity were attempted on more than 750 samples. Special care was taken in selecting data obtained from segments covering more than 50% of the remanent magnetization of the samples. This procedure limited the success rate to 13% for the Waianae lavas, 70% for the Cryptochron flows and 25% for the Kaena samples but provided consistent and reliable paleointensities. In addition to other time intervals, the results document the field variations surrounding the five

  13. Statistical analyses of soil properties on a Quaternary terrace sequence in the upper Sava River Valley, Slovenia, Yugoslavia

    USGS Publications Warehouse

    Vidic, N.; Pavich, M.; Lobnik, F.

    1991-01-01

    Alpine glaciations, climatic changes and tectonic movements have created a Quaternary sequence of gravelly carbonate sediments in the upper Sava River Valley, Slovenia, Yugoslavia. The names for terraces, assigned in this model, Günz, Mindel, Riss and Würm in order of decreasing age, are used as morphostratigraphic terms. The soil chronosequence on the terraces was examined to evaluate which soil properties are time dependent and can be used to help constrain the ages of glaciofluvial sedimentation. Soil thickness, thickness of Bt horizons, amount and continuity of clay coatings and amount of Fe and Mn concretions increase with soil age. The main source of variability consists of solution of carbonate, leaching of basic cations and acidification of soils, which are time dependent and increase with the age of soils. The second source of variability is the content of organic matter, which is less time dependent, but varies more within soil profiles. Textural changes are significant, represented by solution of carbonate pebbles and sand, and formation of a silt loam matrix, which with age becomes finer, with clay loam or clayey texture. The oldest, Günz, terrace shows slight deviation from general progressive trends of changes of soil properties with time. The hypothesis of single versus multiple periods of deposition was tested with one-way analysis of variance (ANOVA) on a staggered, nested hierarchical sampling design on a terrace of largest extent and greatest gravel volume, the Würm terrace. The variability of soil properties is generally higher within subareas than between areas of the terrace, except for the soil thickness. Observed differences in soil thickness between the areas of the terrace could be due to multiple periods of gravel deposition, or to the initial differences of texture of the deposits.

  14. Biochemical and phylogenetic analyses of a cold-active β-galactosidase from the lactic acid bacterium Carnobacterium piscicola BA

    SciTech Connect

    Coombs, J.M.; Brenchley, J.E.

    1999-12-01

    The authors are investigating glycosyl hydrolases from new psychrophilic isolates to examine the adaptations of enzymes to low temperatures. A β-galactosidase from isolate BA, which they have classified as a strain of the lactic acid bacterium Carnobacterium piscicola, was capable of hydrolyzing the chromogen 5-bromo-4-chloro-3-indolyl β-D-galactopyranoside (X-Gal) at 4 °C and possessed higher activity in crude cell lysates at 25 than at 37 °C. Sequence analysis of a cloned DNA fragment encoding this activity revealed a gene cluster containing three glycosyl hydrolases with homology to an α-galactosidase and two β-galactosidases. The larger of the two β-galactosidase genes, bgaB, encoded the 76.9-kDa cold-active enzyme. This gene was homologous to family 42 glycosyl hydrolases, a group which contains several thermophilic enzymes but none from lactic acid bacteria. The bgaB gene from isolate BA was subcloned in Escherichia coli, and its enzyme, BgaB, was purified. The purified enzyme was highly unstable and required 10% glycerol to maintain activity. Its optimal temperature for activity was 30 °C, and it was inactivated at 40 °C in 10 min. The K_m of freshly purified enzyme at 30 °C was 1.7 mM, and the V_max was 450 µmol·min⁻¹·mg⁻¹ with o-nitrophenyl β-D-galactopyranoside. This cold-active enzyme is interesting because it is homologous to a thermophilic enzyme from Bacillus stearothermophilus, and comparisons could provide information about structural features important for activity at low temperatures.
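
    The kinetic constants reported above (1.7 mM and 450 µmol per min per mg with o-nitrophenyl β-D-galactopyranoside at 30 °C) can be used to estimate reaction rates at other substrate concentrations. The sketch below assumes the enzyme follows simple Michaelis-Menten kinetics, which the abstract does not state explicitly; the function name and the substrate concentrations are hypothetical and chosen only for illustration.

```python
def michaelis_menten_rate(substrate_mM, km_mM=1.7, vmax=450.0):
    """Initial rate (umol per min per mg) under simple Michaelis-Menten kinetics.

    Defaults are the constants quoted in the abstract; the kinetic model
    itself is an assumption made for this illustration.
    """
    if substrate_mM < 0:
        raise ValueError("substrate concentration must be non-negative")
    return vmax * substrate_mM / (km_mM + substrate_mM)


# Hypothetical substrate concentrations in mM (not taken from the study).
for s in (0.5, 1.7, 17.0):
    print(f"[S] = {s:5.1f} mM -> v = {michaelis_menten_rate(s):6.1f}")
```

    By construction, the rate at a substrate concentration equal to the Michaelis constant is half of the maximal rate, i.e. 225 µmol per min per mg under these assumptions.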

  15. Low levels of haptoglobin and putative amino acid sequence in Taiwanese Lanyu miniature pigs.

    PubMed

    Yueh, Sunny C H; Wang, Yao Horng; Lin, Kuan Yu; Tseng, Chi Feng; Chu, Hsien Pin; Chen, Kuen Jaw; Wang, Shih Sheng; Lai, I Hsiang; Mao, Simon J T

    2008-04-01

    Porcine haptoglobin (Hp) is an acute phase protein. Its plasma level increases significantly during inflammation and infection. One of the main functions of Hp is to bind free hemoglobin (Hb) and inhibit its oxidative activity. In the present report, we studied the Hp phenotype of Taiwanese Lanyu miniature pigs (TLY minipigs; n=43) and found their Hp structure to be a homodimer (beta-alpha-alpha-beta) similar to human Hp 1-1. Interestingly, Western blot and high performance liquid chromatographic (HPLC) analysis showed that 25% of the TLY minipigs possessed low or no plasma Hp level (<0.05 mg/ml). The Hp cDNA of these TLY minipigs was then cloned, and the translated amino acid sequence was analyzed. No sequences were found to be deficient; they showed a 99.7% identity with domestic pigs (NP_999165). The mean overall Hp level of the TLY minipigs (0.21 +/- 0.25 mg/ml; n=43) determined by enzyme-linked immunosorbent assay (ELISA) was markedly lower than that of domestic pigs (0.78 +/- 0.45 mg/ml; p<0.001), while 25% of the TLY minipigs had an Hp level that was extremely low (<0.05 mg/ml). In addition, the initial recovery rate (first 40 min) in the circulation of infused fluorescein isothiocyanate (FITC)-Hb was significantly higher in the TLY minipigs with extremely low Hp levels than those with high levels. This data suggests that the low concentration of Hp-Hb complex is responsible for the higher recovery rate of Hb in the circulation. TLY minipigs have been used as an experimental model for cardiovascular diseases; whether they can be used as a model for inflammatory diseases, with Hp as a marker, remains a topic of interest. However, since the Hp level varies significantly among individual TLY minipigs, it is necessary to prescreen the Hp levels of the animals to minimize variation in the experimental baseline. The present study may provide a reference value for future use of the TLY minipig as an animal model for inflammation-associated diseases. 
    PMID:18460833

  16. Sequence Comparison and Phylogeny of Nucleotide Sequence of Coat Protein and Nucleic Acid Binding Protein of a Distinct Isolate of Shallot virus X from India.

    PubMed

    Majumder, S; Baranwal, V K

    2011-06-01

    Shallot virus X (ShVX), the type species of the genus Allexivirus in the family Alphaflexiviridae, has been associated with shallot plants in India and other shallot-growing countries such as Russia, Germany, the Netherlands, and New Zealand. The coat protein (CP) and nucleic acid binding protein (NB) regions of the virus were obtained by reverse transcriptase polymerase chain reaction from scale leaves of shallot bulbs. The partial cDNA contained two open reading frames encoding proteins of molecular weights of 28.66 and 14.18 kDa belonging to the Flexi_CP super-family and the viral NB super-family, respectively. The percent identity and phylogenetic analysis of amino acid sequences of the CP and NB regions of the virus associated with shallot indicated that it was a distinct isolate of ShVX. PMID:23637504

  17. The phylogenetic position of the Loimoidae Price, 1936 (Monogenoidea: Monocotylidea) based on analyses of partial rDNA sequences and morphological data.

    PubMed

    Boeger, W A; Kritsky, D C; Domingues, M V; Bueno-Silva, M

    2014-06-01

    Phylogenetic analyses of partial sequences of 18S and 28S rDNA of some monogenoids, including monocotylids and a specimen of Loimosina sp. collected from a hammerhead shark off Brazil, indicated that the Loimoidae (as represented by the specimen of Loimosina sp.) represents an in-group taxon of the Monocotylidae. In all analyses, the Loimoidae fell within a major monocotylid clade including species of the Heterocotylinae, Decacotylinae, and Monocotylinae. The Loimoidae formed a terminal clade with two heterocotyline species, Troglocephalus rhinobatidis and Neoheterocotyle rhinobatis, for which it represented the sister taxon. The following morphological characters supported the clade comprising the Loimoidae, Heterocotylinae, Decacotylinae and Monocotylinae: single vagina present, presence of a narrow deep anchor root, and presence of a marginal haptoral membrane. The presence of cephalic pits was identified as a putative synapomorphy for the clade (Loimoidae (T. rhinobatidis, N. rhinobatis)). Although rDNA sequence data support the rejection of the Loimoidae and incorporating its species into the Monocotylidae, this action was not recommended pending a full phylogenetic analysis of morphological data. PMID:24491371

  18. Amino acid sequence of mouse nidogen, a multidomain basement membrane protein with binding activity for laminin, collagen IV and cells.

    PubMed Central

    Mann, K; Deutzmann, R; Aumailley, M; Timpl, R; Raimondi, L; Yamada, Y; Pan, T C; Conway, D; Chu, M L

    1989-01-01

    The whole amino acid sequence of nidogen was deduced from cDNA clones isolated from expression libraries and confirmed to approximately 50% by Edman degradation of peptides. The protein consists of some 1217 amino acid residues and a 28-residue signal peptide. The data support a previously proposed dumb-bell model of nidogen by demonstrating a large N-terminal globular domain (641 residues), five EGF-like repeats constituting the rod-like domain (248 residues) and a smaller C-terminal globule (328 residues). Two more EGF-like repeats interrupt the N-terminal and terminate the C-terminal sequences. Weak sequence homologies (25%) were detected between some regions of nidogen, the LDL receptor, thyroglobulin and the EGF precursor. Nidogen contains two consensus sequences for tyrosine sulfation and for asparagine beta-hydroxylation, two N-linked carbohydrate acceptor sites and, within one of the EGF-like repeats an Arg-Gly-Asp sequence. The latter was shown to be functional in cell attachment to nidogen. Binding sites for laminin and collagen IV are present on the C-terminal globule but not yet precisely localized. PMID:2496973
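
    The domain sizes quoted above can be checked against the stated total of roughly 1217 residues. The following sketch only restates the figures from the abstract; the dictionary keys and variable names are hypothetical labels introduced for illustration.

```python
# Domain sizes (in residues) as quoted in the abstract above.
nidogen_domains = {
    "N-terminal globular domain": 641,
    "rod-like domain (five EGF-like repeats)": 248,
    "C-terminal globule": 328,
}

# Total for the mature protein, which the abstract gives as "some 1217".
mature_protein_residues = sum(nidogen_domains.values())

for name, size in nidogen_domains.items():
    share = 100 * size / mature_protein_residues
    print(f"{name}: {size} residues ({share:.1f}% of the mature protein)")
print(f"total: {mature_protein_residues} residues")
```

    The three domains sum to 1217 residues, so the dumb-bell model described in the abstract accounts for the whole mature sequence under these figures.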

  19. Jack bean α-mannosidase: amino acid sequencing and N-glycosylation analysis of a valuable glycomics tool.

    PubMed

    Gnanesh Kumar, B S; Pohlentz, Gottfried; Schulte, Mona; Mormann, Michael; Siva Kumar, Nadimpalli

    2014-03-01

    Jack bean (Canavalia ensiformis) seeds contain several biologically important proteins among which α-mannosidase (EC 3.2.1.24) has been purified, its biochemical properties studied and widely used in glycan analysis. In the present study, we have used the purified enzyme and derived its amino acid sequence covering both the known subunits (molecular mass of ∼66,000 and ∼44,000 Da) hitherto not known in its entirety. Peptide de novo sequencing and structural elucidation of N-glycopeptides obtained either directly from proteolytic digestion or after zwitterionic hydrophilic interaction liquid chromatography solid phase extraction-based separation were performed by use of nanoelectrospray ionization quadrupole time-of-flight mass spectrometry and low-energy collision-induced dissociation experiments. De novo sequencing provided new insights into the disulfide linkage organization, intersection of subunits and complete N-glycan structures along with site specificities. The primary sequence suggests that the enzyme belongs to glycosyl hydrolase family 38 and the N-glycan sequence analysis revealed high-mannose oligosaccharides, which were found to be heterogeneous with varying number of hexoses viz, Man8-9GlcNAc2 and Glc1Man9GlcNAc2 in an evolutionarily conserved N-glycosylation site. This site with two proximal cysteines is present in all the acidic α-mannosidases reported so far in eukaryotes. Further, a truncated paucimannose type was identified to be lacking terminal two mannose, Man1(Xyl)GlcNAc2 (Fuc). PMID:24295789

  20. Complete Genome Sequence of Enterococcus mundtii QU 25, an Efficient l-(+)-Lactic Acid-Producing Bacterium

    PubMed Central

    Shiwa, Yuh; Yanase, Hiroaki; Hirose, Yuu; Satomi, Shohei; Araya-Kojima, Tomoko; Watanabe, Satoru; Zendo, Takeshi; Chibazakura, Taku; Shimizu-Kadota, Mariko; Yoshikawa, Hirofumi; Sonomoto, Kenji

    2014-01-01

    Enterococcus mundtii QU 25, a non-dairy bacterial strain of ovine faecal origin, can ferment both cellobiose and xylose to produce l-lactic acid. The use of this strain is highly desirable for economical l-lactate production from renewable biomass substrates. Genome sequence determination is necessary for the genetic improvement of this strain. We report the complete genome sequence of strain QU 25, primarily determined using Pacific Biosciences sequencing technology. The E. mundtii QU 25 genome comprises a 3 022 186-bp single circular chromosome (GC content, 38.6%) and five circular plasmids: pQY182, pQY082, pQY039, pQY024, and pQY003. In all, 2900 protein-coding sequences, 63 tRNA genes, and 6 rRNA operons were predicted in the QU 25 chromosome. Plasmid pQY024 harbours genes for mundticin production. We found that strain QU 25 produces a bacteriocin, suggesting that mundticin-encoded genes on plasmid pQY024 were functional. For lactic acid fermentation, two gene clusters were identified—one involved in the initial metabolism of xylose and uptake of pentose and the second containing genes for the pentose phosphate pathway and uptake of related sugars. This is the first complete genome sequence of an E. mundtii strain. The data provide insights into lactate production in this bacterium and its evolution among enterococci. PMID:24568933

  1. Depositional sequences of offshore Canterbury, New Zealand, and preliminary results of stable isotope analyses of the samples from IODP Expedition 317

    NASA Astrophysics Data System (ADS)

    Hoyanagi, K.; Koto, S.; Kawagata, S.; Fulthorpe, C.; Blum, P.; Shipboard Scientific Party, E.

    2010-12-01

    INTRODUCTION Integrated Ocean Drilling Program Expedition 317 was devoted to understanding the relative importance of global sea level (eustasy) versus local tectonic and sedimentary processes in controlling continental-margin sedimentary cycles. In order to achieve these objectives, upper Miocene to Recent sedimentary sequences were cored in a transect of three sites on the continental shelf (landward to basinward, Sites U1353, U1354, U1351). Highest recovery was achieved in cores of upper Pliocene (3.5 Ma) to Recent sediments. We also drilled one site (Site U1352) on the continental slope, reaching a depth of 1927.5 m below sea floor and obtaining Eocene samples. CORRELATION OF SEISMIC SEQUENCE BOUNDARIES AND DISCONTINUITIES IN THE CORES Nineteen regional seismic sequence boundaries (U1-U19, in ascending order) were identified in the middle Miocene to Recent shelf-slope sediment prism of the offshore Canterbury Basin (Lu and Fulthorpe, 2004). Discontinuities identified in cores may correlate to U19-U8 at Site U1353, and to U19-U10 at Sites U1354 and U1351. We estimate the ages of the discontinuities, based on shipboard analyses, to correspond to both Marine Isotope Stages (Lisiecki and Raymo, 2005) and global sequence boundaries (Haq et al., 1987). STABLE ISOTOPE MEASUREMENTS OF THE ORGANIC MATTER AND FORAMINIFERA TESTS We are analyzing carbon isotopic ratios of organic matter in the sediments and oxygen isotopic ratios of foraminifer tests. The carbon isotopic ratio indicates whether the origin of the organic matter is terrestrial or marine. Samples for stable isotope analysis of organic carbon are treated with HCl to dissolve calcium carbonate. Analyses are carried out at the Faculty of Science, Shinshu University, using an elemental analyzer (FlashEA1122, ThermoQuest Ltd.) and a mass spectrometer (Delta V, ThermoQuest Ltd.). We are picking foraminifera tests from core samples from slope Site U1352 and measuring oxygen isotope ratios of the calcium carbonate to

  2. Gastropod arginine kinases from Cellana grata and Aplysia kurodai. Isolation and cDNA-derived amino acid sequences.

    PubMed

    Suzuki, T; Inoue, N; Higashi, T; Mizobuchi, R; Sugimura, N; Yokouchi, K; Furukohri, T

    2000-12-01

    Arginine kinase (AK) was isolated from the radular muscle of the gastropod molluscs Cellana grata (subclass Prosobranchia) and Aplysia kurodai (subclass Opisthobranchia) by ammonium sulfate fractionation, Sephadex G-75 gel filtration and DEAE-ion exchange chromatography. The denatured relative molecular mass values were estimated to be 40 kDa by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The isolated enzyme from Aplysia gave a Km value of 0.6 mM for arginine and a Vmax value of 13 µmol Pi min⁻¹ mg protein⁻¹ for the forward reaction. These values are comparable to other molluscan AKs. The cDNAs encoding Cellana and Aplysia AKs were amplified by polymerase chain reaction, and the nucleotide sequences of 1,608 and 1,239 bp, respectively, were determined. The open reading frame for Cellana AK is 1044 nucleotides in length and encodes a protein with 347 amino acid residues, and that for A. kurodai is 1077 nucleotides and 354 residues. The cDNA-derived amino acid sequences were validated by chemical sequencing of internal lysyl endopeptidase peptides. The amino acid sequences of Cellana and Aplysia AKs showed the highest percent identity (66-73%) with those of the abalone Nordotis and turban shell Batillus belonging to the same class Gastropoda. These AK sequences still have a strong homology (63-71%) with that of the chiton Liolophura (class Polyplacophora), which is believed to be one of the most primitive molluscs. On the other hand, these AK sequences are less homologous (55-57%) with that of the clam Pseudocardium (class Bivalvia), suggesting that the biological position of the class Polyplacophora should be reconsidered. PMID:11281267

  3. Analyses of Methylomes Derived from Meso-American Common Bean (Phaseolus vulgaris L.) Using MeDIP-Seq and Whole Genome Sodium Bisulfite-Sequencing

    PubMed Central

    Crampton, Mollee; Sripathi, Venkateswara R.; Hossain, Khwaja; Kalavacharla, Venu

    2016-01-01

    Common bean (Phaseolus vulgaris L.) is economically important for its high protein, fiber, and micronutrient contents, with a relatively small genome size of ∼587 Mb. Common bean is genetically diverse with two major gene pools, Meso-American and Andean. The phenotypic variability within common bean is partly attributed to the genetic diversity and epigenetic changes that are largely influenced by environmental factors. It is well established that an important epigenetic regulator of gene expression is DNA methylation. Here, we present results generated from two high-throughput sequencing technologies, methylated DNA immunoprecipitation-sequencing (MeDIP-seq) and whole genome bisulfite-sequencing (BS-Seq). Our analyses revealed that this Meso-American common bean displays methylation patterns similar to those of other previously published plant methylomes, with CG ∼50%, CHG ∼30%, and CHH ∼2.7% methylation; however, these differ from the common bean reference methylome of Andean origin. We identified higher CG methylation levels in both promoter and genic regions than CHG and CHH contexts. Moreover, we found relatively higher CG methylation levels in genes than in promoters. Conversely, the CHG and CHH methylation levels were higher in promoters than in genes. This is the first genome-wide DNA methylation profiling study in a Meso-American common bean cultivar (“Sierra”) using NGS approaches. Our long-term goal is to generate genome-wide epigenomic maps in common bean focusing on chromatin accessibility, histone modifications, and DNA methylation. PMID:27199997

  4. Analyses of Methylomes Derived from Meso-American Common Bean (Phaseolus vulgaris L.) Using MeDIP-Seq and Whole Genome Sodium Bisulfite-Sequencing.

    PubMed

    Crampton, Mollee; Sripathi, Venkateswara R; Hossain, Khwaja; Kalavacharla, Venu

    2016-01-01

    Common bean (Phaseolus vulgaris L.) is economically important for its high protein, fiber, and micronutrient contents, with a relatively small genome size of ∼587 Mb. Common bean is genetically diverse with two major gene pools, Meso-American and Andean. The phenotypic variability within common bean is partly attributed to the genetic diversity and epigenetic changes that are largely influenced by environmental factors. It is well established that an important epigenetic regulator of gene expression is DNA methylation. Here, we present results generated from two high-throughput sequencing technologies, methylated DNA immunoprecipitation-sequencing (MeDIP-seq) and whole genome bisulfite-sequencing (BS-Seq). Our analyses revealed that this Meso-American common bean displays methylation patterns similar to those of other previously published plant methylomes, with CG ∼50%, CHG ∼30%, and CHH ∼2.7% methylation; however, these differ from the common bean reference methylome of Andean origin. We identified higher CG methylation levels in both promoter and genic regions than CHG and CHH contexts. Moreover, we found relatively higher CG methylation levels in genes than in promoters. Conversely, the CHG and CHH methylation levels were higher in promoters than in genes. This is the first genome-wide DNA methylation profiling study in a Meso-American common bean cultivar ("Sierra") using NGS approaches. Our long-term goal is to generate genome-wide epigenomic maps in common bean focusing on chromatin accessibility, histone modifications, and DNA methylation. PMID:27199997

  5. Identification and de novo sequencing of housekeeping genes appropriate for gene expression analyses in farmed maraena whitefish (Coregonus maraena) during crowding stress.

    PubMed

    Altmann, Simone; Rebl, Alexander; Kühn, Carsten; Goldammer, Tom

    2015-04-01

    Maraena whitefish (Coregonus maraena; synonym Coregonus lavaretus f. balticus) is a high-quality food fish in the Southern Baltic Sea belonging to the group of salmonid fishes. Coregonus sp. is successfully kept in aquaculture throughout northern Europe (e.g. in Finland, Germany, Russia) and North America. In this regard, the molecular and immunological characterisation of stress response in maraena whitefish contributes to the development of robust and fast-growing maraena whitefish breeding strains for aquaculture. Thus, in the present study, the potential housekeeping genes beta actin (ACTB), elongation factor 1 alpha (EEF1A1), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), ribosomal protein L9 (RPL9), ribosomal protein L32 (RPL32) and ribosomal protein S20 (RPS20) were de novo sequenced and tested concerning their applicability as reference genes in quantitative real-time PCR (qPCR) in maraena whitefish under different stocking densities. For this purpose, tissue samples of liver, kidney, gills, head kidney, skin, adipose tissue, heart and dorsal fin were investigated. qPCR data were analysed with the Normfinder tool to determine gene expression stability. DNA sequencing exposed transcribed paralogous EEF1A1A and EEF1A1B genes differing in their putative protein structure. Normfinder analysis revealed RPL9 and RPL32 as the most stable, and GAPDH and ACTB as the least stable, genes for qPCR analyses. This is the first study that provides a subset of seven de novo sequenced housekeeping genes usable as reference genes in studies of stress response in maraena whitefish. PMID:25249196

  6. Powerful Sequence Similarity Search Methods and In-Depth Manual Analyses Can Identify Remote Homologs in Many Apparently “Orphan” Viral Proteins

    PubMed Central

    Kuchibhatla, Durga B.; Chung, Betty Y. W.; Cook, Shelley; Schneider, Georg; Eisenhaber, Birgit

    2014-01-01

    The genome sequences of new viruses often contain many “orphan” or “taxon-specific” proteins apparently lacking homologs. However, because viral proteins evolve very fast, commonly used sequence similarity detection methods such as BLAST may overlook homologs. We analyzed a data set of proteins from RNA viruses characterized as “genus specific” by BLAST. More powerful methods developed recently, such as HHblits or HHpred (available through web-based, user-friendly interfaces), could detect distant homologs of a quarter of these proteins, suggesting that these methods should be used to annotate viral genomes. In-depth manual analyses of a subset of the remaining sequences, guided by contextual information such as taxonomy, gene order, or domain cooccurrence, identified distant homologs of another third. Thus, a combination of powerful automated methods and manual analyses can uncover distant homologs of many proteins thought to be orphans. We expect these methodological results to be also applicable to cellular organisms, since they generally evolve much more slowly than RNA viruses. As an application, we reanalyzed the genome of a bee pathogen, Chronic bee paralysis virus (CBPV). We could identify homologs of most of its proteins thought to be orphans; in each case, identifying homologs provided functional clues. We discovered that CBPV encodes a domain homologous to the Alphavirus methyltransferase-guanylyltransferase; a putative membrane protein, SP24, with homologs in unrelated insect viruses and insect-transmitted plant viruses having different morphologies (cileviruses, higreviruses, blunerviruses, negeviruses); and a putative virion glycoprotein, ORF2, also found in negeviruses. SP24 and ORF2 are probably major structural components of the virions. PMID:24155369

  7. Sequence-specific DNA binding by long hairpin pyrrole-imidazole polyamides containing an 8-amino-3,6-dioxaoctanoic acid unit.

    PubMed

    Sawatani, Yoshito; Kashiwazaki, Gengo; Chandran, Anandhakumar; Asamitsu, Sefan; Guo, Chuanxin; Sato, Shinsuke; Hashiya, Kaori; Bando, Toshikazu; Sugiyama, Hiroshi

    2016-08-15

    With the aim of improving aqueous solubility, we designed and synthesized five N-methylpyrrole (Py)-N-methylimidazole (Im) polyamides capable of recognizing 9-bp sequences. Their DNA-binding affinities and sequence specificities were evaluated by SPR and Bind-n-Seq analyses. The design of polyamide 1 was based on a conventional model, with three consecutive Py or Im rings separated by a β-alanine to match the curvature and twist of long DNA helices. Polyamides 2 and 3 contained an 8-amino-3,6-dioxaoctanoic acid (AO) unit, which has previously only been used as a linker within linear Py-Im polyamides or between Py-Im hairpin motifs for tandem hairpins. It is demonstrated herein that AO also functions as a linker element that can extend to 2 bp in hairpin motifs. Notably, although the AO-containing unit can fail to bind the expected sequence, polyamide 4, which has two AO units facing each other in a hairpin form, successfully showed the expected motif and a KD value of 16 nM was recorded. Polyamide 5, containing a β-alanine-β-alanine unit instead of the AO of polyamide 2, was synthesized for comparison. The aqueous solubilities and nuclear localization of three of the polyamides were also examined. The results suggest the possibility of applying the AO unit in the core of Py-Im polyamide compounds. PMID:27301681

  8. Transcriptome de novo assembly from next-generation sequencing and comparative analyses in the hexaploid salt marsh species Spartina maritima and Spartina alterniflora (Poaceae)

    PubMed Central

    Ferreira de Carvalho, J; Poulain, J; Da Silva, C; Wincker, P; Michon-Coudouel, S; Dheilly, A; Naquin, D; Boutte, J; Salmon, A; Ainouche, M

    2013-01-01

    Spartina species have a critical ecological role in salt marshes and represent an excellent system to investigate recurrent polyploid speciation. Using the 454 GS-FLX pyrosequencer, we assembled and annotated the first reference transcriptome (from roots and leaves) for two related hexaploid Spartina species that hybridize in Western Europe, the East American invasive Spartina alterniflora and the Euro-African S. maritima. The de novo read assembly generated 38 478 consensus sequences and 99% found an annotation using Poaceae databases, representing a total of 16 753 non-redundant genes. Spartina expressed sequence tags were mapped onto the Sorghum bicolor genome, where they were distributed among the subtelomeric arms of the 10 S. bicolor chromosomes, with high gene density correlation. Normalization of the complementary DNA library improved the number of annotated genes. Ecologically relevant genes were identified among GO biological function categories in salt and heavy metal stress response, C4 photosynthesis and in lignin and cellulose metabolism. Expression of some of these genes had been found to be altered by hybridization and genome duplication in a previous microarray-based study in Spartina. As these species are hexaploid, up to three duplicated homoeologs may be expected per locus. When analyzing sequence polymorphism at four different loci in S. maritima and S. alterniflora, we found up to four haplotypes per locus, suggesting the presence of two expressed homoeologous sequences with one or two allelic variants each. This reference transcriptome will allow analysis of specific Spartina genes of ecological or evolutionary interest, estimation of homoeologous gene expression variation using RNA-seq and further gene expression evolution analyses in natural populations. PMID:23149455

  9. The amino acid sequence of protein SCMK-B2A from the high-sulphur fraction of wool keratin

    PubMed Central

    Elleman, T. C.

    1972-01-01

    1. The amino acid sequence of protein SCMK-B2A, a reduced and S-carboxymethylated protein from the high-sulphur fraction of wool, has been determined. 2. This protein of 171 amino acid residues displays both a high degree of internal homology and extensive external homology with other members of the SCMK-B2 group of proteins. 3. Evidence is presented which suggests that the SCMK-B2 group of proteins are produced by separate non-allelic genes. PMID:4679226

  10. Indications of human activity from amino acid and amino sugar analyses on Holocene sediments from lake Lonar, central India

    NASA Astrophysics Data System (ADS)

    Menzel, P.; Gaye, B.; Wiesner, M.; Prasad, S.; Basavaiah, N.; Stebich, M.; Anoop, A.; Riedel, N.; Brauer, A.

    2012-04-01

    The DFG funded HIMPAC (Himalaya: Modern and Past Climates) programme aims to reconstruct Holocene Indian Monsoon climate using a multi-proxy and multi-archive approach. First investigations made on sediments from a ca. 10 m long core covering the whole Holocene taken from the lake Lonar in central India's state Maharashtra, Buldhana District, serve to identify changes in sedimentation, lake chemistry, local vegetation and regional to supra-regional climate patterns. Lake Lonar occupies the floor of an impact crater that formed on the ~ 65 Ma old basalt flows of the Deccan Traps. It covers an area of ca. 1 km2 and is situated in India's core monsoon area. The modern lake has a maximum depth of about 5 m, is highly alkaline, and hyposaline, grouped in the Na-Cl-CO3 subtype of saline lakes. No out-flowing stream is present and only three small streams feed the lake, resulting in a lake level highly sensitive to precipitation and evaporation. The lake is eutrophic and stratified throughout most of the year with sub- to anoxic waters below 2 m depth. In this study the core sediments were analysed for their total amino acid (AA) and amino sugar (AS) content, the amino acid bound C and N percentage of organic C and total N in the sediment and the distribution of individual amino acids. The results roughly show three zones within the core separated by distinct changes in their AA content and distribution. (i) The bottom part of the core from ca. 12000 cal a BP to 11400 cal a BP with very low AA and AS percentage indicating high lithogenic contribution, most probably related to dry conditions. (ii) From 11400 cal a BP to 1200 cal a BP the sediments show moderate AA and AS percentages and low values for the ratios of proteinogenic AAs to their non-proteinogenic degradation products (e.g. ASP/β-ALA; GLU/γ-ABA). (iii) The top part of the core (< 1200 cal a BP) is characterised by an intense increase in total AA and AS, AA-C/Corg and AA-N/Ntot as well as in the ratio of

  11. Unique honey bee (Apis mellifera) hive component-based communities as detected by a hybrid of phospholipid fatty-acid and fatty-acid methyl ester analyses.

    PubMed

    Grubbs, Kirk J; Scott, Jarrod J; Budsberg, Kevin J; Read, Harry; Balser, Teri C; Currie, Cameron R

    2015-01-01

    Microbial communities (microbiomes) are associated with almost all metazoans, including the honey bee Apis mellifera. Honey bees are social insects, maintaining complex hive systems composed of a variety of integral components including bees, comb, propolis, honey, and stored pollen. Given that the different components within hives can be physically separated and are nutritionally variable, we hypothesize that unique microbial communities may occur within the different microenvironments of honey bee colonies. To explore this hypothesis and to provide further insights into the microbiome of honey bees, we use a hybrid of fatty acid methyl ester (FAME) and phospholipid-derived fatty acid (PLFA) analysis to produce broad, lipid-based microbial community profiles of stored pollen, adults, pupae, honey, empty comb, and propolis for 11 honey bee hives. Averaging component lipid profiles by hive, we show that, in decreasing order, lipid markers representing fungi, Gram-negative bacteria, and Gram-positive bacteria have the highest relative abundances within honey bee colonies. Our lipid profiles reveal the presence of viable microbial communities in each of the six hive components sampled, with overall microbial community richness varying from lowest to highest in honey, comb, pupae, pollen, adults and propolis, respectively. Finally, microbial community lipid profiles were more similar when compared by component than by hive, location, or sampling year. Specifically, we found that individual hive components typically exhibited several dominant lipids and that these dominant lipids differ between components. Principal component and two-way clustering analyses both support significant grouping of lipids by hive component. Our findings indicate that in addition to the microbial communities present in individual workers, honey bee hives have resident microbial communities associated with different colony components. PMID:25849080

  12. Unique Honey Bee (Apis mellifera) Hive Component-Based Communities as Detected by a Hybrid of Phospholipid Fatty-Acid and Fatty-Acid Methyl Ester Analyses

    PubMed Central

    2015-01-01

    Microbial communities (microbiomes) are associated with almost all metazoans, including the honey bee Apis mellifera. Honey bees are social insects, maintaining complex hive systems composed of a variety of integral components including bees, comb, propolis, honey, and stored pollen. Given that the different components within hives can be physically separated and are nutritionally variable, we hypothesize that unique microbial communities may occur within the different microenvironments of honey bee colonies. To explore this hypothesis and to provide further insights into the microbiome of honey bees, we use a hybrid of fatty acid methyl ester (FAME) and phospholipid-derived fatty acid (PLFA) analysis to produce broad, lipid-based microbial community profiles of stored pollen, adults, pupae, honey, empty comb, and propolis for 11 honey bee hives. Averaging component lipid profiles by hive, we show that, in decreasing order, lipid markers representing fungi, Gram-negative bacteria, and Gram-positive bacteria have the highest relative abundances within honey bee colonies. Our lipid profiles reveal the presence of viable microbial communities in each of the six hive components sampled, with overall microbial community richness varying from lowest to highest in honey, comb, pupae, pollen, adults and propolis, respectively. Finally, microbial community lipid profiles were more similar when compared by component than by hive, location, or sampling year. Specifically, we found that individual hive components typically exhibited several dominant lipids and that these dominant lipids differ between components. Principal component and two-way clustering analyses both support significant grouping of lipids by hive component. Our findings indicate that in addition to the microbial communities present in individual workers, honey bee hives have resident microbial communities associated with different colony components. PMID:25849080

  13. High-affinity homologous peptide nucleic acid probes for targeting a quadruplex-forming sequence from a MYC promoter element.

    PubMed

    Roy, Subhadeep; Tanious, Farial A; Wilson, W David; Ly, Danith H; Armitage, Bruce A

    2007-09-18

    Guanine-rich DNA and RNA sequences are known to fold into secondary structures known as G-quadruplexes. Recent biochemical evidence along with the discovery of an increasing number of sequences in functionally important regions of the genome capable of forming G-quadruplexes strongly indicates important biological roles for these structures. Thus, molecular probes that can selectively target quadruplex-forming sequences (QFSs) are envisioned as tools to delineate biological functions of quadruplexes as well as potential therapeutic agents. Guanine-rich peptide nucleic acids have been previously shown to hybridize to homologous DNA or RNA sequences forming PNA-DNA (or RNA) quadruplexes. For this paper we studied the hybridization of an eight-mer G-rich PNA to a quadruplex-forming sequence derived from the promoter region of the MYC proto-oncogene. UV melting analysis, fluorescence assays, and surface plasmon resonance experiments reveal that this PNA binds to the MYC QFS in a 2:1 stoichiometry and with an average binding constant Ka = (2.0 ± 0.2) × 10^8 M^-1 or Kd = 5.0 nM. In addition, experiments carried out with short DNA targets revealed a dependence of the affinity on the sequence of bases in the loop region of the DNA. A structural model for the hybrid quadruplex is proposed, and implications for gene targeting by G-rich PNAs are discussed. PMID:17718513
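
    The two constants quoted in this abstract are reciprocals of each other. The short Python sketch below reproduces that conversion, assuming a simple per-site equilibrium (Kd = 1/Ka) and first-order error propagation; the function name is illustrative and not taken from the paper.

```python
# Convert an association constant Ka (in M^-1) to a dissociation constant Kd.
# Assumption: simple per-site binding equilibrium, so Kd = 1 / Ka.
# The numbers come from the abstract; the function name is hypothetical.

def kd_from_ka(ka_per_molar: float) -> float:
    """Return Kd in mol/L for a given Ka in L/mol."""
    if ka_per_molar <= 0:
        raise ValueError("Ka must be positive")
    return 1.0 / ka_per_molar

ka = 2.0e8        # reported average binding constant, M^-1
ka_err = 0.2e8    # reported uncertainty on Ka, M^-1

kd_nM = kd_from_ka(ka) * 1e9          # mol/L -> nmol/L
# To first order, the relative uncertainty of Kd equals that of Ka.
kd_err_nM = kd_nM * (ka_err / ka)

print(f"Kd = {kd_nM:.1f} nM (+/- {kd_err_nM:.1f} nM)")
```

    The propagated uncertainty of roughly 0.5 nM is an estimate derived here from the quoted Ka error, not a value reported in the abstract.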

  14. A knowledge engineering approach to recognizing and extracting sequences of nucleic acids from scientific literature.

    PubMed

    García-Remesal, Miguel; Maojo, Victor; Crespo, José

    2010-01-01

    In this paper we present a knowledge engineering approach to automatically recognize and extract genetic sequences from scientific articles. To carry out this task, we use a preliminary recognizer based on a finite state machine to extract all candidate DNA/RNA sequences. The latter are then fed into a knowledge-based system that automatically discards false positives and refines noisy and incorrectly merged sequences. We created the knowledge base by manually analyzing different manuscripts containing genetic sequences. Our approach was evaluated using a test set of 211 full-text articles in PDF format containing 3134 genetic sequences. On this set, we achieved 87.76% precision and 97.70% recall. This method can facilitate different research tasks, including text mining, information extraction, and information retrieval research dealing with large collections of documents containing genetic sequences. PMID:21096556
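
    The first stage described above, a finite state machine that over-collects candidate DNA/RNA sequences for later filtering, might look roughly like the Python sketch below. It is not the authors' recognizer: the alphabet, the tolerated separators, the minimum length, and all names are assumptions made for illustration.

```python
# Hypothetical two-state scanner (OUTSIDE / IN_SEQUENCE) that collects
# candidate nucleotide sequences from free text. All constants are assumptions.

NUCLEOTIDES = set("ACGTUacgtu")
SEPARATORS = set(" -\n")   # tolerated inside a run (grouping spaces, line wraps)
MIN_LENGTH = 10            # shorter runs are dropped as likely ordinary words

def find_candidates(text, min_length=MIN_LENGTH):
    """Return candidate sequences, uppercased and with separators removed."""
    candidates = []
    current = []                    # nucleotides of the run being collected
    for ch in text + "\0":          # sentinel character forces a final flush
        if ch in NUCLEOTIDES:
            current.append(ch.upper())      # enter or stay in IN_SEQUENCE
        elif ch in SEPARATORS and current:
            continue                        # separator inside a run: keep going
        else:
            if len(current) >= min_length:  # leave IN_SEQUENCE: emit if long enough
                candidates.append("".join(current))
            current = []
    return candidates

print(find_candidates("The primer 5'-ACGTACGTACGT-3' was used."))
```

    A scanner this permissive will also pick up runs of ordinary words built from the letters a, c, g, t, and u, which is why a second, knowledge-based stage for discarding false positives is needed.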

  15. Ferredoxin:NADP oxidoreductase of Cyanophora paradoxa: purification, partial characterization, and N-terminal amino acid sequence.

    PubMed

    Gebhart, U B; Maier, T L; Stevanović, S; Bayer, M G; Schenk, H E

    1992-06-01

    The ferredoxin:NADP+ oxidoreductase of the protist Cyanophora paradoxa, a descendant of a former symbiotic consortium and an important model organism in view of the Endosymbiosis Theory, is the first enzyme purified from a former endocytobiont (cyanelle) that is found to be encoded in the nucleus of the host. This cyanoplast enzyme was isolated by FPLC (19% yield) and characterized with respect to the UV-vis spectrum, pH optimum (pH 9), molecular mass (34 kDa), and N-terminal amino acid sequence (24 residues). As known from other organisms, the enzyme shows molecular heterogeneity. The N-terminus of a further ferredoxin:NADP+ oxidoreductase polypeptide represents a shorter sequence missing the first four amino acids of the mature enzyme. PMID:1392619

  16. Purification, characterization, and amino acid sequencing of a Δ⁵-3-oxosteroid isomerase from Pseudomonas putida biotype B

    SciTech Connect

    Linden, K.G.

    1986-01-01

    Studies were performed on the Δ⁵-3-oxosteroid isomerase from Pseudomonas putida biotype B. The studies have involved three broad areas: improvement in the purification of the enzyme, further characterization of the purified enzyme, and completion of the amino acid sequence of the enzyme. For the purification of the enzyme, techniques for removing the isomerase from whole cells were studied, the effects of ionic strength on the binding of the isomerase to steroidal affinity resins were explored, and a new affinity resin was developed. Absorption spectra and the proton NMR spectra of the isomerase were obtained. Amino acid sequencing of the oxosteroid isomerase indicates that the enzyme is a dimeric protein consisting of two identical subunits, each consisting of a polypeptide chain of 131 residues with Mr = 14,536.

  17. Identification of novel rice low phytic acid mutations via TILLING by sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Phytic acid (myo-inositol-1,2,3,4,5,6-hexakisphosphate or InsP6) accounts for 75-85% of the total phosphorus in seeds. Low phytic acid (lpa) mutants exhibit decreases in seed InsP6 with corresponding increases in inorganic P which, unlike phytic acid P, is readily utilized by humans and monogastric ...

  18. A combined study based on experimental analyses and theoretical calculations on properties of poly (lactic acid) under annealing treatment

    NASA Astrophysics Data System (ADS)

    Loued, W.; Wéry, J.; Dorlando, A.; Alimi, K.

    2015-02-01

    In this paper, the significance of annealing, in two different atmospheres (air and vacuum), on the surface characteristics of poly (lactic acid) (PLA) films was investigated. X-ray diffraction (XRD) measurements correlated to atomic force microscopy (AFM) observations of the cast PLA films show that thermal treatment under air atmosphere is responsible for a significant increase of crystallinity with the increase of temperature. However, band gap energy of the title compound is slightly affected by annealing at different temperatures. As for the untreated PLA, the molecular geometry was optimized using the density functional theory (DFT/B3LYP) method with the 6-31G(d) basis set in the ground state. From the optimized geometry, HOMO and LUMO energies and quantum chemical parameters were calculated at the B3LYP/6-31G(d) level. The theoretical results, applied to simulated optical spectra of the compound, were compared to the observed ones. On the basis of theoretical vibrational analyses, the thermodynamic properties were calculated at different temperatures, revealing the correlation between internal energy (U), enthalpy (H), entropy (S), free energy (G) and temperature.

  19. Snake venoms. The amino-acid sequence of trypsin inhibitor E of Dendroaspis polylepis polylepis (Black Mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1978-06-01

    Trypsin inhibitor E from black mamba venom comprises 59 amino acid residues in a single polypeptide chain, cross-linked by three intrachain disulphide bridges. The complete primary structure of inhibitor E was elucidated. The sequence is homologous with trypsin inhibitors from different sources. Unique among this homologous series of proteinase inhibitors, inhibitor E has an affinity for transition metal ions, exemplified here by Cu2+ and Co2+. PMID:668688

  20. Draft Genome Sequence of Escherichia coli Strain VKPM B-10182, Producing the Enzyme for Synthesis of Cephalosporin Acids

    PubMed Central

    Mardanov, Andrey V.; Eldarov, Mikhail A.; Sklyarenko, Anna V.; Dumina, Maria V.; Beletsky, Alexey V.; Yarotsky, Sergey V.

    2014-01-01

    Escherichia coli strain VKPM B-10182, obtained by chemical mutagenesis from E. coli strain ATCC 9637, produces cephalosporin acid synthetase employed in the synthesis of β-lactam antibiotics, such as cefazolin. The draft genome sequence of strain VKPM B-10182 revealed 32 indels and 1,780 point mutations that might account for the improvement in antibiotic synthesis that we observed. PMID:25414512

  1. First draft genome sequencing of indole acetic acid producing and plant growth promoting fungus Preussia sp. BSL10.

    PubMed

    Khan, Abdul Latif; Asaf, Sajjad; Khan, Abdur Rahim; Al-Harrasi, Ahmed; Al-Rawahi, Ahmed; Lee, In-Jung

    2016-05-10

    Preussia sp. BSL10, family Sporormiaceae, was actively producing phytohormone (indole-3-acetic acid) and extra-cellular enzymes (phosphatases and glucosidases). The fungus was also promoting the growth of the arid-land tree Boswellia sacra. Looking at such prospects of this fungus, we sequenced its draft genome for the first time. The Illumina-based sequence analysis reveals an approximate genome size of 31.4 Mbp for Preussia sp. BSL10. Based on ab initio gene prediction, a total of 32,312 coding sequences were annotated, consisting of 11,967 coding genes, pseudogenes, and 221 tRNA genes. Furthermore, 321 carbohydrate-active enzymes were predicted and classified into many functional families. PMID:26995610

  2. A simple ligation-based method to increase the information density in sequencing reactions used to deconvolute nucleic acid selections

    PubMed Central

    Childs-Disney, Jessica L.; Disney, Matthew D.

    2008-01-01

    Herein, a method is described to increase the information density of sequencing experiments used to deconvolute nucleic acid selections. The method is facile and should be applicable to any selection experiment. A critical feature of this method is the use of biotinylated primers to amplify and encode a BamHI restriction site on both ends of a PCR product. After amplification, the PCR reaction is captured onto streptavidin resin, washed, and digested directly on the resin. Resin-based digestion affords clean product that is devoid of partially digested products and unincorporated PCR primers. The product's complementary ends are annealed and ligated together with T4 DNA ligase. Analysis of ligation products shows formation of concatemers of different length and little detectable monomer. Sequencing results produced data that routinely contained three to four copies of the library. This method allows for more efficient formulation of structure-activity relationships since multiple active sequences are identified from a single clone. PMID:18065718

  3. A novel T-cell-defined HLA-DR polymorphism not predicted from the linear amino acid sequence.

    PubMed

    Termijtelen, A; van den Elsen, P; Koning, F; de Koster, S; Schroeijers, W; Vanderkerckhove, B

    1989-09-01

    Recent investigations have shown that alloreactive T cells are capable of responding to structures defined by specific linear amino acid sequences on class II molecules. In the present study we show that also a polymorphism can be recognized that is not defined by such linear amino acid sequences. Two human T-cell clones, sensitized to DRw13 haplotypes, are described. The description of clone c50 serves to exemplify the first model. This DRB1-specific clone responds to stimulator cells that carry DR molecules, different in their DRB1 first and second hypervariable regions (HV1 and HV2) but identical in their HV3 regions (i.e., DRw13,Dw18; DRw13,Dw19; DR4,Dw10; and DRw11,LDVII). The second clone, c1443, behaves nonconventionally. It responds to DRw13,Dw18; DRw13,Dw19; and DR4,Dw4 stimulator cells, although no specific amino acid sequence is shared between these specificities. The latter pattern of reactivity suggests the existence of a novel polymorphism recognized by alloreactive T cells. This particular polymorphism may also be biologically significant. PMID:2476425

  4. cDNA-derived amino-acid sequence of a land turtle (Geochelone carbonaria) beta-chain hemoglobin.

    PubMed

    Bordin, S; Meza, A N; Saad, S T; Ogo, S H; Costa, F F

    1997-06-01

    The cDNA sequence encoding the turtle Geochelone carbonaria beta-chain was determined. The isolation of hemoglobin mRNA was based on degenerate primers' PCR in combination with 5'- and 3'-RACE protocol. The full length cDNA is 615 bp with the ATG start codon at position 53 and TGA stop codon at position 495; the AATAAA polyadenylation signal is found at position 599. The deduced polypeptide contains 146 amino-acid residues. The predicted amino acid sequence shares 83% identity with the beta-globin of a related species, the aquatic turtle C. p. belli. Otherwise, identity is higher when compared with chicken beta-Hb (80%) than with other reptilian orders (Squamata, 69%, and Crocodilia, 61%). Compared with human HbA, there is 67% identity, and at least three amino acid substitutions could be of some functional significance (Glu43 beta-->Ser, His116 beta-->Thr and His143 beta-->Leu). To our knowledge this represents the first cDNA sequence of a reptile globin gene described. PMID:9238523

  5. Amino acid sequence of the serine-repeat antigen (SERA) of Plasmodium falciparum determined from cloned cDNA.

    PubMed

    Bzik, D J; Li, W B; Horii, T; Inselburg, J

    1988-09-01

    We report the isolation of cDNA clones for a Plasmodium falciparum gene that encodes the complete amino acid sequence of a previously identified exported blood stage antigen. The Mr of this antigen protein had been determined by sodium dodecylsulphate-polyacrylamide gel electrophoresis analysis, by different workers, to be 113,000, 126,000, and 140,000. We show, by cDNA nucleotide sequence analysis, that this antigen gene encodes a 989 amino acid protein (111 kDa) that contains a potential signal peptide, but not a membrane anchor domain. In the FCR3 strain the serine content of the protein was 11%, of which 57% of the serine residues were localized within a 201 amino acid sequence that included 35 consecutive serine residues. The protein also contained three possible N-linked glycosylation sites and numerous possible O-linked glycosylation sites. The mRNA was abundant during late trophozoite-schizont parasite stages. We propose to identify this antigen, which had been called p126, by the acronym SERA, serine-repeat antigen, based on its complete structure. The usefulness of the cloned cDNA as a source of a possible malaria vaccine is considered in view of the previously demonstrated ability of the antigen to induce parasite-inhibitory antibodies and a protective immune response in Saimiri monkeys. PMID:2847041

  6. Amino acid sequences of lysozymes newly purified from invertebrates imply wide distribution of a novel class in the lysozyme family.

    PubMed

    Ito, Y; Yoshikawa, A; Hotani, T; Fukuda, S; Sugimura, K; Imoto, T

    1999-01-01

    Lysozymes were purified from three invertebrates: a marine bivalve, a marine conch, and an earthworm. The purified lysozymes all showed a similar molecular weight of 13 kDa on SDS/PAGE. Their N-terminal sequences up to the 33rd residue determined here were apparently homologous among them; in addition, they had a homology with a partial sequence of a starfish lysozyme which had been reported before. The complete sequence of the bivalve lysozyme was determined by peptide mapping and subsequent sequence analysis. This was composed of 123 amino acids including as many as 14 cysteine residues and did not show a clear homology with the known types of lysozymes. However, the homology search of this protein on the protein or nucleic acid database revealed two homologous proteins. One of them was a gene product, CELF22 A3.6 of C. elegans, which was a functionally unknown protein. The other was an isopeptidase of a medicinal leech, named destabilase. Thus, a new type of lysozyme found in at least four species across the three classes of the invertebrates demonstrates a novel class of protein/lysozyme family in invertebrates. The bivalve lysozyme, first characterized here, showed extremely high protein stability and hen lysozyme-like enzymatic features. PMID:9914527

  7. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance

    PubMed Central

    Baranzoni, Gian Marco; Reichenberger, Erin R.; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-01-01

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here. PMID:27469964

  8. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance.

    PubMed

    Baranzoni, Gian Marco; Fratamico, Pina M; Reichenberger, Erin R; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-01-01

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here. PMID:27469964

  9. Complete genome sequences of Escherichia coli O157:H7 strains SRCC 1675 and 28RC that vary in acid resistance

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented....

  10. Cloning and genetic and sequence analyses of the bacteriocin 21 determinant encoded on the Enterococcus faecalis pheromone-responsive conjugative plasmid pPD1.

    PubMed Central

    Tomita, H; Fujimoto, S; Tanimoto, K; Ike, Y

    1997-01-01

    The pheromone-responsive conjugative plasmid pPD1 (59 kb) of Enterococcus faecalis encodes the bacteriocin 21 (bac21) determinant. Cloning, transposon insertion mutagenesis and sequence analysis of the bac21 determinant showed that an 8.5-kb fragment lying between kb 27.1 and 35.6 of the pPD1 map is required for complete expression of the bacteriocin. The 8.5-kb fragment contained nine open reading frames (ORFs), bacA to bacI, which were oriented in the same (upstream-to-downstream) direction. Transposon insertions into the bacA to bacE ORFs, which are located in the proximal half of bac21, resulted in defective bacteriocin expression. Insertions into the bacF to bacI ORFs, which are located in the distal half of bac21, resulted in reduced bacteriocin expression. Deletion mutant analysis of the cloned 8.5-kb fragment revealed that the deletion of segments between kb 31.6 and 35.6 of the pPD1 map, which contained the distal region of the determinant encoding bacF to bacI, resulted in reduced bacteriocin expression. The smallest fragment (4.5 kb) retaining some degree of bacteriocin expression contained the bacA to bacE sequences located in the proximal half of the determinant. The cloned fragment encoding the 4.5-kb proximal region and a Tn916 insertion mutant into pPD1 bacB trans-complemented intracellularly to give complete expression of the bacteriocin. bacA encoded a 105-residue sequence with a molecular mass of 11.1 kDa. The deduced BacA protein showed 100% homology to the broad-spectrum antibiotic peptide AS-48, which is encoded on the E. faecalis conjugative plasmid pMB2 (58 kb). bacH encoded a 195-residue sequence with a molecular mass of 21.9 kDa. The deduced amino acid sequence showed significant homology to the C-terminal region of HlyB (31.1% identical residues), a protein located in the Escherichia coli alpha-hemolysin operon that is a representative bacterial ATP-binding cassette export protein. PMID:9401046

  11. Deep Sequencing and Circos Analyses of Antibody Libraries Reveal Antigen-driven Selection of Ig VH Genes during HIV-1 Infection

    PubMed Central

    Xiao, Madelyne; Ponraj, Prabakaran; Chen, Weizao; Kessing, Bailey; Dimitrov, Dimiter S.

    2013-01-01

    The vast diversity of antibody repertoires is largely attributed to heavy chain (VH) recombination of variable (V), diversity (D) and joining (J) gene segments. We used 454 sequencing information of the variable domains of the antibody heavy chain repertoires from neonates, normal adults and an HIV-1-infected individual, to analyze, with Circos software, the VDJ pairing patterns at birth, adulthood and a time-dependent response to HIV-1 infection. Our comparative analyses of the Ig VDJ repertoires from these libraries indicated that, from birth to adulthood, VDJ recombination patterns remain the same with some slight changes, whereas some VH families are selected and preferentially expressed after long-term infection with HIV-1. We also demonstrated that the immune system responds to HIV-1 chronic infection by selectively expanding certain VH families in an attempt to combat infection. Our findings may have implications for understanding immune responses in pathology as well as for development of new therapeutics and vaccines. PMID:24158018

  12. Alloantibody Responses After Renal Transplant Failure Can Be Better Predicted by Donor-Recipient HLA Amino Acid Sequence