Science.gov

Sample records for acid sequence analyses

  1. Analyses of mitochondrial amino acid sequence datasets support the proposal that specimens of Hypodontus macropi from three species of macropodid hosts represent distinct species

    PubMed Central

    2013-01-01

    Background Hypodontus macropi is a common intestinal nematode of a range of kangaroos and wallabies (macropodid marsupials). Based on previous multilocus enzyme electrophoresis (MEE) and nuclear ribosomal DNA sequence data sets, H. macropi has been proposed to be a complex of species. To test this proposal using independent molecular data, we sequenced the whole mitochondrial (mt) genomes of individuals of H. macropi from three different species of hosts (Macropus robustus robustus, Thylogale billardierii and Macropus [Wallabia] bicolor) as well as that of Macropicola ocydromi (a related nematode), and undertook a comparative analysis of the amino acid sequence datasets derived from these genomes. Results The mt genomes sequenced by next-generation (454) technology from H. macropi from the three host species varied from 13,634 bp to 13,699 bp in size. Pairwise comparisons of the amino acid sequences predicted from these three mt genomes revealed differences of 5.8% to 18%. Phylogenetic analysis of the amino acid sequence data sets using Bayesian Inference (BI) showed that H. macropi from the three different host species formed distinct, well-supported clades. In addition, sliding window analysis of the mt genomes defined variable regions for future population genetic studies of H. macropi in different macropodid hosts and geographical regions around Australia. Conclusions The present analyses of inferred mt protein sequence datasets clearly supported the hypothesis that H. macropi from M. robustus robustus, M. bicolor and T. billardierii represent distinct species. PMID:24261823
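The pairwise comparison reported in the record above can be illustrated with a minimal sketch. It assumes the two protein sequences are already aligned to equal length and that positions containing a gap are skipped; the function name, the gap convention, and the toy sequences are hypothetical and are not taken from the study.

```python
def percent_difference(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percentage of aligned positions at which two sequences differ.

    Assumption: positions where either sequence has a gap are ignored.
    The abstract does not say how gaps were treated.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # skip gapped columns
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * differing / compared


# Toy aligned fragments (hypothetical): 2 of 10 positions differ.
print(percent_difference("MKTLLVAGAV", "MKSLLVAGTV"))  # 20.0
```

Applied to every pair of predicted protein sequences, a function like this would yield a table of percentage differences comparable in form to the 5.8% to 18% range quoted in the abstract.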

  2. Assignment of fatty acid-beta-oxidizing syntrophic bacteria to Syntrophomonadaceae fam. nov. on the basis of 16S rRNA sequence analyses

    NASA Technical Reports Server (NTRS)

    Zhao, H.; Yang, D.; Woese, C. R.; Bryant, M. P.

    1993-01-01

    After enrichment from Chinese rural anaerobic digestor sludge, anaerobic, sporing and nonsporing, saturated fatty acid-beta-oxidizing syntrophic bacteria were isolated as cocultures with H2- and formate-utilizing Methanospirillum hungatei or Desulfovibrio sp. strain G-11. The syntrophs degraded C4 to C8 saturated fatty acids, including isobutyrate and 2-methylbutyrate. They were adapted to grow on crotonate and were isolated as pure cultures. The crotonate-grown pure cultures alone did not grow on butyrate in either the presence or the absence of some common electron acceptors. However, when they were reconstituted with M. hungatei, growth on butyrate again occurred. In contrast, crotonate-grown Clostridium kluyveri and Clostridium sticklandii, as well as Clostridium sporogenes, failed to grow on butyrate when these organisms were cocultured with M. hungatei. The crotonate-grown pure subcultures of the syntrophs described above were subjected to 16S rRNA sequence analysis. Several previously documented fatty acid-beta-oxidizing syntrophs grown in pure cultures with crotonate were also subjected to comparative sequence analyses. The sequence analyses revealed that the new sporing and nonsporing isolates and other syntrophs that we sequenced, which had either gram-negative or gram-positive cell wall ultrastructure, all belonged to the phylogenetically gram-positive phylum. They were not closely related to any of the previously known subdivisions in the gram-positive phylum with which they were compared, but were closely related to each other, forming a new subdivision in the phylum. We recommend that this group be designated Syntrophomonadaceae fam. nov.; a description is given.

  3. Plant DNA sequencing for phylogenetic analyses: from plants to sequences.

    PubMed

    Neves, Susana S; Forrest, Laura L

    2011-01-01

    DNA sequences are important sources of data for phylogenetic analysis. Nowadays, DNA sequencing is a routine technique in molecular biology laboratories. However, there are specific questions associated with project design and sequencing of plant samples for phylogenetic analysis, which may not be familiar to researchers starting in the field. This chapter gives an overview of methods and protocols involved in the sequencing of plant samples, including general recommendations on the selection of species/taxa and DNA regions to be sequenced, and field collection of plant samples. Protocols of plant sample preparation, DNA extraction, PCR and cloning, which are critical to the success of molecular phylogenetic projects, are described in detail. Common problems of sequencing (using the Sanger method) are also addressed. Possible applications of second-generation sequencing techniques in plant phylogenetics are briefly discussed. Finally, orientation on the preparation of sequence data for phylogenetic analyses and submission to public databases is also given.

  4. Comparative sequence analyses of sixteen reptilian paramyxoviruses

    USGS Publications Warehouse

    Ahne, W.; Batts, W.N.; Kurath, G.; Winton, J.R.

    1999-01-01

    Viral genomic RNA of Fer-de-Lance virus (FDLV), a paramyxovirus highly pathogenic for reptiles, was reverse transcribed and cloned. Plasmids with significant sequence similarities to the hemagglutinin-neuraminidase (HN) and polymerase (L) genes of mammalian paramyxoviruses were identified by BLAST search. Partial sequences of the FDLV genes were used to design primers for amplification by nested polymerase chain reaction (PCR) and sequencing of 518-bp L gene and 352-bp HN gene fragments from a collection of 15 previously uncharacterized reptilian paramyxoviruses. Phylogenetic analyses of the partial L and HN sequences produced similar trees in which there were two distinct subgroups of isolates that were supported with maximum bootstrap values, and several intermediate isolates. Within each subgroup the nucleotide divergence values were less than 2.5%, while the divergence between the two subgroups was 20-22%. This indicated that the two subgroups represent distinct virus species containing multiple virus strains. The five intermediate isolates had nucleotide divergence values of 11-20% and may represent additional distinct species. In addition to establishing diversity among reptilian paramyxoviruses, the phylogenetic groupings showed some correlation with geographic location, and clearly demonstrated a low level of host species-specificity within these viruses. Copyright (C) 1999 Elsevier Science B.V.

  5. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  6. Amino acid analyses of Apollo 14 samples.

    NASA Technical Reports Server (NTRS)

    Gehrke, C. W.; Zumwalt, R. W.; Kuo, K.; Aue, W. A.; Stalling, D. L.; Kvenvolden, K. A.; Ponnamperuma, C.

    1972-01-01

    Detection limits were between 300 pg and 1 ng for different amino acids, in an analysis by gas-liquid chromatography of water extracts from Apollo 14 lunar fines in which amino acids were converted to their N-trifluoro-acetyl-n-butyl esters. Initial analyses of water and HCl extracts of samples 14240 and 14298 showed no amino acids above background levels.

  7. High speed nucleic acid sequencing

    SciTech Connect

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.

  8. A close relationship between Cercozoa and Foraminifera supported by phylogenetic analyses based on combined amino acid sequences of three cytoskeletal proteins (actin, alpha-tubulin, and beta-tubulin).

    PubMed

    Takishita, Kiyotaka; Inagaki, Yuji; Tsuchiya, Masashi; Sakaguchi, Miako; Maruyama, Tadashi

    2005-12-05

    Recently, there has been increasing molecular evidence of phylogenetic affinity between Cercozoa and Foraminifera in the eukaryotic lineage. We performed phylogenetic analyses based on the combined (concatenated) amino acid sequence data of actin, alpha-tubulin, and beta-tubulin from a wide variety of eukaryotes, including the foraminifers Planoglabratella opercularis and Reticulomyxa filosa, as well as cercomonad and chlorarachniophyte members of Cercozoa. A monophyletic lineage composed of two foraminiferan species branched with the centroheliozoan species Raphidiophrys contractilis was reconstructed in both Bayesian and maximum-likelihood (ML) analyses under 'linked' models, enforcing a single set of the parameters (the parameter for among-site rate variation and branch lengths) on the entire combined alignment. Considering the extremely divergent nature of Foraminifera and Raphidiophrys tubulins, the union of these lineages recovered is most probably a long-branch attraction artifact due to ignoring gene-specific evolutionary processes. On the other hand, the foraminiferan lineage was within the radiation of Cercozoa in Bayesian analyses under 'unlinked' model conditions, accommodating differences in evolutionary processes across the three genes in the combined alignment. The Foraminifera+Cercozoa affinity recovered in the latter multi-gene analyses is most likely genuine, and thus our data presented here provide further support for the close relationship between these two protist lineages.

  9. Chip-based sequencing nucleic acids

    DOEpatents

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  10. Distinguishing Proteins From Arbitrary Amino Acid Sequences

    PubMed Central

    Yau, Stephen S.-T.; Mao, Wei-Guang; Benson, Max; He, Rong Lucy

    2015-01-01

    What kinds of amino acid sequences could possibly be protein sequences? From all existing databases that we can find, known proteins are only a small fraction of all possible combinations of amino acids. Beginning with Sanger's first detailed determination of a protein sequence in 1952, previous studies have focused on describing the structure of existing protein sequences in order to construct the protein universe. No one, however, has developed a criterion for determining whether an arbitrary amino acid sequence can be a protein. Here we show that when the collection of arbitrary amino acid sequences is viewed in an appropriate geometric context, the protein sequences cluster together. This leads to a new computational test, described here, that has proved to be remarkably accurate at determining whether an arbitrary amino acid sequence can be a protein. Even more, if the results of this test indicate that the sequence can be a protein, and it is indeed a protein sequence, then its identity as a protein sequence is uniquely defined. We anticipate our computational test will be useful for those who are attempting to complete the job of discovering all proteins, or constructing the protein universe. PMID:25609314

  11. The complete amino acid sequence of prochymosin.

    PubMed Central

    Foltmann, B; Pedersen, V B; Jacobsen, H; Kauffman, D; Wybrandt, G

    1977-01-01

    The total sequence of 365 amino acid residues in bovine prochymosin is presented. Alignment with the amino acid sequence of porcine pepsinogen shows that 204 amino acid residues are common to the two zymogens. Further comparison and alignment with the amino acid sequence of penicillopepsin shows that 66 residues are located at identical positions in all three proteases. The three enzymes belong to a large group of proteases with two aspartate residues in the active center. This group forms a family derived from one common ancestor. PMID:329280

  12. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  13. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  14. p53-Regulated Networks of Protein, mRNA, miRNA, and lncRNA Expression Revealed by Integrated Pulsed Stable Isotope Labeling With Amino Acids in Cell Culture (pSILAC) and Next Generation Sequencing (NGS) Analyses*

    PubMed Central

    Hünten, Sabine; Kaller, Markus; Drepper, Friedel; Oeljeklaus, Silke; Bonfert, Thomas; Erhard, Florian; Dueck, Anne; Eichner, Norbert; Friedel, Caroline C.; Meister, Gunter; Zimmer, Ralf; Warscheid, Bettina; Hermeking, Heiko

    2015-01-01

    We determined the effect of p53 activation on de novo protein synthesis using quantitative proteomics (pulsed stable isotope labeling with amino acids in cell culture/pSILAC) in the colorectal cancer cell line SW480. This was combined with mRNA and noncoding RNA expression analyses by next generation sequencing (RNA-, miR-Seq). Furthermore, genome-wide DNA binding of p53 was analyzed by chromatin-immunoprecipitation (ChIP-Seq). Thereby, we identified differentially regulated proteins (542 up, 569 down), mRNAs (1258 up, 415 down), miRNAs (111 up, 95 down) and lncRNAs (270 up, 123 down). Changes in protein and mRNA expression levels showed a positive correlation (r = 0.50, p < 0.0001). In total, we detected 133 direct p53 target genes that were differentially expressed and displayed p53 occupancy in the vicinity of their promoter. More transcriptionally induced genes displayed occupied p53 binding sites (4.3% mRNAs, 7.2% miRNAs, 6.3% lncRNAs, 5.9% proteins) than repressed genes (2.4% mRNAs, 3.2% miRNAs, 0.8% lncRNAs, 1.9% proteins), suggesting indirect mechanisms of repression. Around 50% of the down-regulated proteins displayed seed-matching sequences of p53-induced miRNAs in the corresponding 3′-UTRs. Moreover, proteins repressed by p53 significantly overlapped with those previously shown to be repressed by miR-34a. We confirmed up-regulation of the novel direct p53 target genes LINC01021, MDFI, ST14 and miR-486 and showed that ectopic LINC01021 expression inhibits proliferation in SW480 cells. Furthermore, KLF12, HMGB1 and CIT mRNAs were confirmed as direct targets of the p53-induced miR-34a, miR-205 and miR-486–5p, respectively. In line with the loss of p53 function during tumor progression, elevated expression of KLF12, HMGB1 and CIT was detected in advanced stages of cancer. In conclusion, the integration of multiple omics methods allowed the comprehensive identification of direct and indirect effectors of p53 that provide new insights and leads into the

  15. Sequence and Structural Analyses for Functional Non-coding RNAs

    NASA Astrophysics Data System (ADS)

    Sakakibara, Yasubumi; Sato, Kengo

    Analysis and detection of functional RNAs are currently important topics in both molecular biology and bioinformatics research. Several computational methods based on stochastic context-free grammars (SCFGs) have been developed for modeling and analysing functional RNA sequences. These grammatical methods have succeeded in modeling typical secondary structures of RNAs and are used for structural alignments of RNA sequences. Such stochastic models, however, are not sufficient to discriminate member sequences of an RNA family from non-members, and hence to detect non-coding RNA regions from genome sequences. Recently, the support vector machine (SVM) and kernel function techniques have been actively studied and proposed as a solution to various problems in bioinformatics. SVMs are trained from positive and negative samples and have strong, accurate discrimination abilities, and hence are more appropriate for the discrimination tasks. A few kernel functions that extend the string kernel to measure the similarity of two RNA sequences from the viewpoint of secondary structures have been proposed. In this article, we give an overview of recent progress in SCFG-based methods for RNA sequence analysis and novel kernel functions tailored to measure the similarity of two RNA sequences and developed for use with support vector machines (SVM) in discriminating members of an RNA family from non-members.
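To make the kernel idea in the record above concrete, here is a minimal sketch of the plain k-mer spectrum kernel, one of the simplest string kernels. It is only an illustration: it ignores secondary structure entirely, whereas the kernels surveyed in the article extend string kernels to take structure into account. The function names and the choice k = 3 are assumptions.

```python
from collections import Counter
from math import sqrt


def kmer_counts(seq: str, k: int) -> Counter:
    """Count every overlapping substring of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(seq_a: str, seq_b: str, k: int = 3) -> float:
    """Inner product of the two k-mer count vectors."""
    counts_a = kmer_counts(seq_a, k)
    counts_b = kmer_counts(seq_b, k)
    # Counter returns 0 for k-mers that are absent from seq_b.
    return float(sum(n * counts_b[kmer] for kmer, n in counts_a.items()))


def normalized_kernel(seq_a: str, seq_b: str, k: int = 3) -> float:
    """Cosine-normalized spectrum kernel, in the range [0, 1]."""
    k_aa = spectrum_kernel(seq_a, seq_a, k)
    k_bb = spectrum_kernel(seq_b, seq_b, k)
    if k_aa == 0 or k_bb == 0:
        return 0.0  # a sequence shorter than k has no k-mers
    return spectrum_kernel(seq_a, seq_b, k) / sqrt(k_aa * k_bb)


# Hypothetical toy RNA fragments.
print(normalized_kernel("GGGAAACCC", "GGGAAACCU"))
```

A matrix of such kernel values, computed over positive and negative training sequences, is the kind of input a support vector machine that accepts a precomputed kernel could be trained on.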

  16. Sequencing and comparative analyses of the genomes of zoysiagrasses

    PubMed Central

    Tanaka, Hidenori; Hirakawa, Hideki; Kosugi, Shunichi; Nakayama, Shinobu; Ono, Akiko; Watanabe, Akiko; Hashiguchi, Masatsugu; Gondo, Takahiro; Ishigaki, Genki; Muguerza, Melody; Shimizu, Katsuya; Sawamura, Noriko; Inoue, Takayasu; Shigeki, Yuichi; Ohno, Naoki; Tabata, Satoshi; Akashi, Ryo; Sato, Shusei

    2016-01-01

    Zoysia is a warm-season turfgrass, which comprises 11 allotetraploid species (2n = 4x = 40), each possessing different morphological and physiological traits. To characterize the genetic systems of Zoysia plants and to analyse their structural and functional differences in individual species and accessions, we sequenced the genomes of Zoysia species using HiSeq and MiSeq platforms. As a reference sequence of Zoysia species, we generated a high-quality draft sequence of the genome of Z. japonica accession ‘Nagirizaki’ (334 Mb) in which 59,271 protein-coding genes were predicted. In parallel, draft genome sequences of Z. matrella ‘Wakaba’ and Z. pacifica ‘Zanpa’ were also generated for comparative analyses. To investigate the genetic diversity among the Zoysia species, genome sequence reads of three additional accessions, Z. japonica ‘Kyoto’, Z. japonica ‘Miyagi’ and Z. matrella ‘Chiba Fair Green’, were accumulated, and aligned against the reference genome of ‘Nagirizaki’ along with those from ‘Wakaba’ and ‘Zanpa’. As a result, we detected 7,424,163 single-nucleotide polymorphisms and 852,488 short indels among these species. The information obtained in this study will be valuable for basic studies on zoysiagrass evolution and genetics as well as for the breeding of zoysiagrasses, and is made available in the ‘Zoysia Genome Database’ at http://zoysia.kazusa.or.jp. PMID:26975196

  17. Mouse Vk gene classification by nucleic acid sequence similarity.

    PubMed

    Strohal, R; Helmberg, A; Kroemer, G; Kofler, R

    1989-01-01

    Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.
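The family definition in the record above (members sharing greater than 80% overall nucleic acid sequence similarity) can be sketched as a simple grouping procedure. The abstract does not state how families were assembled, so this sketch assumes single-linkage grouping over pre-aligned, equal-length sequences, with similarity measured as the fraction of identical positions; all names and the toy data are hypothetical.

```python
from itertools import combinations


def identity(a: str, b: str) -> float:
    """Fraction of identical positions in two pre-aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def group_families(seqs: dict[str, str], threshold: float = 0.80) -> list[set[str]]:
    """Single-linkage grouping: two genes share a family if a chain of
    pairwise similarities above the threshold connects them (assumption)."""
    parent = {name: name for name in seqs}

    def find(x: str) -> str:
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(seqs, 2):
        if identity(seqs[a], seqs[b]) > threshold:
            parent[find(a)] = find(b)

    families: dict[str, set[str]] = {}
    for name in seqs:
        families.setdefault(find(name), set()).add(name)
    return list(families.values())


# Hypothetical toy sequences: v1 and v2 are 90% identical, v3 is unrelated.
toy = {"v1": "ACGTACGTAC", "v2": "ACGTACGTAA", "v3": "TTTTTTTTTT"}
print(group_families(toy))
```

Single linkage is a deliberate simplification. As the abstract notes, members of different Vk families can be very similar over large portions of their sequence, which is exactly the situation in which a chain-based rule may merge groups that a stricter criterion would keep apart.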

  18. Amino acid sequence of mouse submaxillary gland renin.

    PubMed Central

    Misono, K S; Chang, J J; Inagami, T

    1982-01-01

    The complete amino acid sequences of the heavy chain and light chain of mouse submaxillary gland renin have been determined. The heavy chain consists of 288 amino acid residues having a Mr of 31,036 calculated from the sequence. The light chain contains 48 amino acid residues with a Mr of 5,458. The sequence of the heavy chain was determined by automated Edman degradations of the cyanogen bromide peptides and tryptic peptides generated after citraconylation, as well as other peptides generated therefrom. The sequence of the light chain was derived from sequence analyses of the peptides generated by cyanogen bromide cleavage or by digestion with Staphylococcus aureus protease. The sequences in the active site regions in renin containing two catalytically essential aspartyl residues 32 and 215 were found identical with those in pepsin, chymosin, and penicillopepsin. Comparison of the amino acid sequence of renin with that of porcine pepsin indicated a 42% sequence identity of the heavy chain with the amino-terminal and middle regions and a 46% identity of the light chain with the carboxyl-terminal region of the porcine pepsin sequence. Residues identical in renin and pepsin are distributed throughout the length of the molecules, suggesting a similarity in their overall structures. PMID:6812055

  19. Amino Acid Analyses of Acid Hydrolysates in Desert Varnish

    NASA Technical Reports Server (NTRS)

    Perry, Randall S.; Staley, James T.; Dworkin, Jason P.; Engel, Mike

    2001-01-01

    There has long been a debate as to whether rock varnish deposits are microbially mediated or are deposited by inorganic processes. Varnished rocks are found throughout the world primarily in arid and semi-arid regions. The varnish coats are typically up to 200 microns thick and are composed of clays and alternating layers enriched in manganese and iron oxides. The individual layers range in thickness from 1 micron to greater than 10 microns and may continue laterally for more than 100 microns. Overlapping botryoidal structures are visible in thin section and scanning electron micrographs. The coatings also include small amounts of organic matter and detrital grains. Amino-acid hydrolysates offer a means of assessing the organic composition of rock varnish collected from the Sonoran Desert, near Phoenix, AZ. Chromatographic analyses of hydrolysates from powdered samples of rock varnish suggest that the interior of rock varnish is relatively enriched in amino acids and specifically in d-alanine and glutamic acid. Peptidoglycan (murein) is the main structural component of gram-positive bacterial cell walls. The d-enantiomers of alanine and glutamic acid are specific to peptidoglycan and are consequently an indicator for the presence of bacteria. D-alanine is also found in teichoic acid, which is only found in gram-positive bacteria. Several researchers have cultured bacteria from the surface of rock varnish and most have been gram-positive, suggesting that gram-positive bacteria are intimately associated with varnish coatings and may play a role in the formation of varnish coatings.

  20. Whale song analyses using bioinformatics sequence analysis approaches

    NASA Astrophysics Data System (ADS)

    Chen, Yian A.; Almeida, Jonas S.; Chou, Lien-Siang

    2005-04-01

    Animal songs are frequently analyzed using discrete hierarchical units, such as units, themes and songs. Because animal songs and bio-sequences may be understood as analogous, bioinformatics analysis tools, namely DNA/protein sequence alignment and alignment-free methods, are proposed to quantify the theme similarities of the songs of false killer whales recorded off northeast Taiwan. The eighteen themes with discrete units that were identified in an earlier study [Y. A. Chen, master's thesis, University of Charleston, 2001] were compared quantitatively using several distance metrics. These metrics included the scores calculated using the Smith-Waterman algorithm with the repeated procedure, the standardized Euclidean distance, and the angle metrics based on word frequencies. The theme classifications based on different metrics were summarized and compared in dendrograms using cluster analyses. The results agree qualitatively with earlier classifications derived by human observation. These methods further quantify the similarities among themes. These methods could be applied to the analyses of other animal songs on a larger scale. For instance, these techniques could be used to investigate song evolution and cultural transmission by quantifying the dissimilarities of humpback whale songs across different seasons, years, populations, and geographic regions. [Work supported by SC Sea Grant, and Ilan County Government, Taiwan.]
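Two of the metrics named in the record above can be sketched briefly: a Smith-Waterman local alignment score and an angle between word-frequency vectors. The sketch assumes each theme is encoded as a string in which one character stands for one song unit. The scoring values (match 2, mismatch -1, linear gap -1) and the word length k = 2 are illustrative assumptions, and the "repeated procedure" mentioned in the abstract is not implemented.

```python
from collections import Counter
from math import acos, sqrt


def smith_waterman_score(a: str, b: str, match: int = 2,
                         mismatch: int = -1, gap: int = -1) -> int:
    """Best local alignment score (linear gap penalty), kept row by row."""
    prev = [0] * (len(b) + 1)
    best = 0
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            diag = prev[j - 1] + (match if x == y else mismatch)
            # Local alignment: a cell never drops below zero.
            score = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(score)
            best = max(best, score)
        prev = curr
    return best


def word_angle(a: str, b: str, k: int = 2) -> float:
    """Angle in radians between the k-word frequency vectors of a and b."""
    ca = Counter(a[i:i + k] for i in range(len(a) - k + 1))
    cb = Counter(b[i:i + k] for i in range(len(b) - k + 1))
    dot = sum(n * cb[w] for w, n in ca.items())
    norm_a = sqrt(sum(n * n for n in ca.values()))
    norm_b = sqrt(sum(n * n for n in cb.values()))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("each sequence must contain at least one k-word")
    cosine = max(-1.0, min(1.0, dot / (norm_a * norm_b)))  # guard rounding
    return acos(cosine)


# Hypothetical themes: each letter stands for one song unit.
print(smith_waterman_score("XXABCYY", "ZZABCZZ"))  # 6
print(word_angle("ABABAB", "ABABCC"))
```

The alignment score is a similarity (higher means more alike) while the angle is a distance (smaller means more alike), so one of them would need to be converted before both could feed the same cluster analysis.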

  1. Phenolic acid esterases, coding sequences and methods

    DOEpatents

    Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.

    2002-01-01

    Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.

  2. Structural gene and complete amino acid sequence of Vibrio alginolyticus collagenase.

    PubMed Central

    Takeuchi, H; Shibano, Y; Morihara, K; Fukushima, J; Inami, S; Keil, B; Gilles, A M; Kawamoto, S; Okuda, K

    1992-01-01

    The DNA encoding the collagenase of Vibrio alginolyticus was cloned, and its complete nucleotide sequence was determined. When the cloned gene was ligated to pUC18, the Escherichia coli expression vector, bacteria carrying the gene exhibited both collagenase antigen and collagenase activity. The open reading frame from the ATG initiation codon was 2442 bp in length for the collagenase structural gene. The amino acid sequence, deduced from the nucleotide sequence, revealed that the mature collagenase consists of 739 amino acids with an Mr of 81875. The amino acid sequences of 20 polypeptide fragments were completely identical with the deduced amino acid sequences of the collagenase gene. The amino acid composition predicted from the DNA sequence was similar to the chemically determined composition of purified collagenase reported previously. The analyses of both the DNA and amino acid sequences of the collagenase gene were rigorously performed, but we could not detect any significant sequence similarity to other collagenases. PMID:1311172
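The figures quoted in this abstract can be cross-checked with simple arithmetic. The sketch below uses hypothetical helper names; whether the reported 2442 bp includes the stop codon is not stated in the abstract, so the code only counts codons and makes no claim about the precursor length.

```python
def codons_in_orf(orf_bp):
    """Number of codons in an open reading frame of the given length in bp."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return orf_bp // 3


def mean_residue_mass(mr, n_residues):
    """Average mass per residue implied by a molecular mass and a chain length."""
    return mr / n_residues


codons = codons_in_orf(2442)                  # 814 codons in the 2442-bp ORF
surplus = codons - 739                        # codons beyond the 739-residue mature enzyme
per_residue = mean_residue_mass(81875, 739)   # roughly 110.8 per residue
```

The surplus of 75 codons over the mature chain suggests that part of the translated product is absent from the mature enzyme, and the per-residue value is close to the commonly used average of about 110 Da per amino acid residue.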

  3. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-07-21

    A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

  4. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

  5. Next generation sequencing and comparative analyses of Xenopus mitogenomes

    PubMed Central

    2012-01-01

    Background Mitochondrial genomes comprise a small but critical component of the total DNA in eukaryotic organisms. They encode several key proteins for the cell’s major energy producing apparatus, the mitochondrial respiratory chain. Additionally, their nucleotide and amino acid sequences are of great utility as markers for systematics, molecular ecology and forensics. Their characterization through nucleotide sequencing is a fundamental starting point in mitogenomics. Methods to amplify complete mitochondrial genomes rapidly and efficiently from microgram quantities of tissue of single individuals are, however, not always available. Here we validate two approaches, which combine long-PCR with Roche 454 pyrosequencing technology, to obtain two complete mitochondrial genomes from individual amphibian species. Results We obtained complete mitochondrial genome sequences of two Xenopus frogs (Xenopus borealis and X. victorianus) by means of long-PCR followed by 454 sequencing of individual genomes (approach 1) or of multiple pooled genomes (approach 2); the mean depth of coverage per nucleotide was 9823 and 186, respectively. We also characterised and compared the new mitogenomes against their sister taxa, X. laevis and Silurana tropicalis, two of the most intensely studied amphibians. Our results demonstrate how our approaches can be used to obtain complete amphibian mitogenomes with depths of coverage that far surpass traditional primer-walking strategies, at either the same cost or less. Our results also demonstrate that the size, gene content and order are the same among Xenopus mitogenomes and that S. tropicalis forms a separate clade to the other Xenopus, among which X. laevis and X. victorianus were most closely related. Nucleotide and amino acid diversity was found to vary across the Xenopus mitogenomes, with the greatest diversity observed in the Complex 1 gene nad4l and the least diversity observed in Complex 4 genes (cox1-3). All protein-coding genes were shown to be

  6. Methods for analyzing nucleic acid sequences

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support. A fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  7. Note on the chromatographic analyses of marine polyunsaturated fatty acids

    USGS Publications Warehouse

    Schultz, D.M.; Quinn, J.G.

    1977-01-01

    Gas-liquid chromatography was used to study the effects of saponification/methylation and thin-layer chromatographic isolation on the analyses of polyunsaturated fatty acids. Using selected procedures, the qualitative and quantitative distribution of these acids in marine organisms can be determined with a high degree of accuracy. © 1977 Springer-Verlag.

  8. Complete chloroplast genome sequences of Solanum bulbocastanum, Solanum lycopersicum and comparative analyses with other Solanaceae genomes.

    PubMed

    Daniell, Henry; Lee, Seung-Bum; Grevich, Justin; Saski, Christopher; Quesada-Vargas, Tania; Guda, Chittibabu; Tomkins, Jeffrey; Jansen, Robert K

    2006-05-01

    Despite the agricultural importance of both potato and tomato, very little is known about their chloroplast genomes. Analysis of the complete sequences of tomato, potato, tobacco, and Atropa chloroplast genomes reveals significant insertions and deletions within certain coding regions or regulatory sequences (e.g., deletion of repeated sequences within 16S rRNA, ycf2 or ribosomal binding sites in ycf2). RNA, photosynthesis, and atp synthase genes are the least divergent and the most divergent genes are clpP, cemA, ccsA, and matK. Repeat analyses identified 33-45 direct and inverted repeats ≥30 bp with a sequence identity of at least 90%; all but five of the repeats shared by all four Solanaceae genomes are located in the same genes or intergenic regions, suggesting a functional role. A comprehensive genome-wide analysis of all coding sequences and intergenic spacer regions was done for the first time in chloroplast genomes. Only four spacer regions are fully conserved (100% sequence identity) among all genomes; deletions or insertions within some intergenic spacer regions result in less than 25% sequence identity, underscoring the importance of choosing appropriate intergenic spacers for plastid transformation and providing valuable new information for phylogenetic utility of the chloroplast intergenic spacer regions. Comparison of coding sequences with expressed sequence tags showed considerable amount of variation, resulting in amino acid changes; none of the C-to-U conversions observed in potato and tomato were conserved in tobacco and Atropa. It is possible that there has been a loss of conserved editing sites in potato and tomato.

  9. Comparative analyses of potato expressed sequence tag libraries.

    PubMed

    Ronning, Catherine M; Stegalkina, Svetlana S; Ascenzi, Robert A; Bougri, Oleg; Hart, Amy L; Utterbach, Teresa R; Vanaken, Susan E; Riedmuller, Steve B; White, Joseph A; Cho, Jennifer; Pertea, Geo M; Lee, Yuandan; Karamycheva, Svetlana; Sultana, Razvan; Tsai, Jennifer; Quackenbush, John; Griffiths, Helen M; Restrepo, Silvia; Smart, Christine D; Fry, William E; Van Der Hoeven, Rutger; Tanksley, Steve; Zhang, Peifen; Jin, Hailing; Yamamoto, Miki L; Baker, Barbara J; Buell, C Robin

    2003-02-01

    The cultivated potato (Solanum tuberosum) shares similar biology with other members of the Solanaceae, yet has features unique within the family, such as modified stems (stolons) that develop into edible tubers. To better understand potato biology, we have undertaken a survey of the potato transcriptome using expressed sequence tags (ESTs) from diverse tissues. A total of 61,940 ESTs were generated from aerial tissues, below-ground tissues, and tissues challenged with the late-blight pathogen (Phytophthora infestans). Clustering and assembly of these ESTs resulted in a total of 19,892 unique sequences with 8,741 tentative consensus sequences and 11,151 singleton ESTs. We were able to identify a putative function for 43.7% of these sequences. A number of sequences (48) were expressed throughout the libraries sampled, representing constitutively expressed sequences. Other sequences (13,068, 21%) were uniquely expressed and were detected only in a single library. Using hierarchical and k-means clustering of the EST sequences, we were able to correlate changes in gene expression with major physiological events in potato biology. Using pair-wise comparisons of tuber-related tissues, we were able to associate genes with tuber initiation, dormancy, and sprouting. We also were able to identify a number of characterized as well as novel sequences that were unique to the incompatible interaction of late-blight pathogen, thereby providing a foundation for further understanding the mechanism of resistance.

  10. Genome sequence determinations and analyses of novel circoviruses from goose and pigeon.

    PubMed

    Todd, D; Weston, J H; Soike, D; Smyth, J A

    2001-08-01

    The genomes of novel circoviruses from goose and pigeon, which were isolated using degenerate primer and inverse primer PCR methods, were cloned and sequenced. Comparative nucleotide (nt) sequence analyses showed that the goose circovirus (GCV) and pigeon circovirus (PiCV) possessed genomes which were 1821 and 2037 or 2036 nt, respectively, and which had features in common with the genomes of porcine circoviruses types 1 and 2 (PCV1, PCV2) and psittacine beak and feather disease virus (BFDV), such that they can now be assigned to the genus Circovirus of the family Circoviridae. Common features include the possession of (i) a potential stem-loop/nonanucleotide motif with which the initiation of rolling circle replication of the virus DNA is associated; (ii) two major ORFs, located on the virus (V1 ORF) and complementary (C1 ORF) strands, which encode the replication-associated protein (Rep) and capsid protein, respectively; (iii) high levels of amino acid identity (41.2-58.2%) shared with other circovirus Rep proteins; and (iv) direct/inverted repeat sequences within the putative intergenic region. On the basis of nt and amino acid sequence identities, GCV is substantially less closely related to BFDV than PiCV is to BFDV.
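Percent identity values such as those quoted for the Rep proteins are computed from pairwise alignments. The sketch below is illustrative only: the function name is hypothetical, and the choice to skip every column containing a gap is an assumed convention, since the abstract does not say which denominator the authors used.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over the ungapped columns of a pairwise alignment.

    Assumes both strings come from the same alignment and have equal length.
    Columns with a gap in either sequence are skipped (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared
```

For example, `percent_identity("MKT-LV", "MRTALV")` compares five ungapped columns, four of which match.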

  11. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid...

  12. Analyses of Response-Stimulus Sequences in Descriptive Observations

    ERIC Educational Resources Information Center

    Samaha, Andrew L.; Vollmer, Timothy R.; Borrero, Carrie; Sloman, Kimberly; Pipkin, Claire St. Peter; Bourret, Jason

    2009-01-01

    Descriptive observations were conducted to record problem behavior displayed by participants and to record antecedents and consequences delivered by caregivers. Next, functional analyses were conducted to identify reinforcers for problem behavior. Then, using data from the descriptive observations, lag-sequential analyses were conducted to examine…

  13. Analyses of Expressed Sequence Tags from Apple

    PubMed Central

    Newcomb, Richard D.; Crowhurst, Ross N.; Gleave, Andrew P.; Rikkerink, Erik H.A.; Allan, Andrew C.; Beuning, Lesley L.; Bowen, Judith H.; Gera, Emma; Jamieson, Kim R.; Janssen, Bart J.; Laing, William A.; McArtney, Steve; Nain, Bhawana; Ross, Gavin S.; Snowden, Kimberley C.; Souleyre, Edwige J.F.; Walton, Eric F.; Yauk, Yar-Khing

    2006-01-01

    The domestic apple (Malus domestica; also known as Malus pumila Mill.) has become a model fruit crop in which to study commercial traits such as disease and pest resistance, grafting, and flavor and health compound biosynthesis. To speed the discovery of genes involved in these traits, develop markers to map genes, and breed new cultivars, we have produced a substantial expressed sequence tag collection from various tissues of apple, focusing on fruit tissues of the cultivar Royal Gala. Over 150,000 expressed sequence tags have been collected from 43 different cDNA libraries representing 34 different tissues and treatments. Clustering of these sequences results in a set of 42,938 nonredundant sequences comprising 17,460 tentative contigs and 25,478 singletons, together representing what we predict are approximately one-half the expressed genes from apple. Many potential molecular markers are abundant in the apple transcripts. Dinucleotide repeats are found in 4,018 nonredundant sequences, mainly in the 5′-untranslated region of the gene, with a bias toward one repeat type (containing AG, 88%) and against another (repeats containing CG, 0.1%). Trinucleotide repeats are most common in the predicted coding regions and do not show a similar degree of sequence bias in their representation. Bi-allelic single-nucleotide polymorphisms are highly abundant with one found, on average, every 706 bp of transcribed DNA. Predictions of the numbers of representatives from protein families indicate the presence of many genes involved in disease resistance and the biosynthesis of flavor and health-associated compounds. Comparisons of some of these gene families with Arabidopsis (Arabidopsis thaliana) suggest instances where there have been duplications in the lineages leading to apple of biosynthetic and regulatory genes that are expressed in fruit. This resource paves the way for a concerted functional genomics effort in this important temperate fruit crop. PMID:16531485
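A search for dinucleotide repeats of the kind described here can be sketched with a regular expression. This is not the authors' pipeline: the function name and the `min_repeats` threshold are assumptions, and the two phases of a repeat (for example AG and GA) are merged into one label purely for illustration.

```python
import re


def find_dinucleotide_repeats(seq, min_repeats=6):
    """Return (start, unit, repeat_count) for each dinucleotide repeat in seq.

    min_repeats is an illustrative threshold, not one taken from the study.
    Homopolymer runs such as AAAA are ignored, and the two phases of a repeat
    (AG and GA) are reported under the alphabetically smaller label.
    """
    pattern = re.compile(r"([ACGT]{2})\1{%d,}" % (min_repeats - 1))
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        if unit[0] == unit[1]:
            continue  # homopolymer, not a true dinucleotide repeat
        label = min(unit, unit[::-1])
        hits.append((m.start(), label, len(m.group(0)) // 2))
    return hits
```

Tallying the labels over a set of nonredundant sequences would give a repeat-type distribution comparable in spirit to the AG and CG percentages quoted in the abstract.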

  14. Detection of nucleic acid sequences by invader-directed cleavage

    DOEpatents

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge.

  15. [Pathological Diagnoses and Whole-genome Sequence Analyses of the Jaagsiekte Sheep Retrovirus in Xinjiang, China].

    PubMed

    Yang, Sufang; Liang, Tian; Zhao, Qingliang; Zhang, Dianqing; Si, Junqiang; Zhang, Jing; Yang, Xia; Sheng, Jinliang

    2015-05-01

    To carry out pathologic diagnoses and whole-genome sequence analyses of the Jaagsiekte sheep retrovirus (JSRV) in Xinjiang, China, we first observed sheep suspected to have the JSRV. Then, the extracted virus suspension was observed by transmission electron microscopy (TEM). Total RNAs from lungs of JSRV-infected sheep were extracted and reverse-transcribed using a cDNA synthesis kit. Six pairs of primers were designed according to the exogenous reference virus strain (AF105220). Reverse transcription-polymerase chain reaction was carried out from JSRV-infected tissue, and the whole genome of the JSRV was sequenced. Our results showed: flow of nasal fluid ("wheelbarrow test"); different sizes of adenoma lesions in the lungs; papillary hyperplasia of alveolar epithelial cells; alveolar cavity filled with macrophages; dissolute nuclei in central lesions. TEM revealed JSRV particles with a diameter of 88 nm to 125.4 nm. The full length of the viral genome sequence was 7456 bp. BLAST analyses showed nucleotide homology of 96% and 95% compared with that of the representative strain from the USA (AF105220) and UK (AF357971). Nucleotide homology was 89.8% and 89.9% compared with the endogenous Jaagsiekte sheep retrovirus, Inner Mongolia strain (DQ838493) and USA strain (EF680300). The specific pathogenic amino-acid sequence "YXXM" was found in the TM district, similar to the exogenous JSRV: this gene has been reported to be oncogenic. This is the first report of the complete genomic sequence of the exogenous JSRV from Xinjiang, and could lay the foundation for study of the biological characteristics and pathogenic mechanisms of the pulmonary adenomatosis virus in sheep.

  16. Los Alamos sequence analysis package for nucleic acids and proteins.

    PubMed Central

    Kanehisa, M I

    1982-01-01

    An interactive system for computer analysis of nucleic acid and protein sequences has been developed for the Los Alamos DNA Sequence Database. It provides a convenient way to search or verify various sequence features, e.g., restriction enzyme sites, protein coding frames, and properties of coded proteins. Further, the comprehensive analysis package on a large-scale database can be used for comparative studies on sequence and structural homologies in order to find unnoted information stored in nucleic acid sequences. PMID:6174934
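Two of the feature searches mentioned in this abstract, restriction enzyme sites and protein coding frames, can be sketched in a few lines of Python. The sketch is not the Los Alamos package: the function names and the `min_codons` cutoff are hypothetical, and only the forward strand is scanned.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_sites(seq, site):
    """Return the 0-based start positions of a recognition site (overlaps allowed)."""
    seq, site = seq.upper(), site.upper()
    positions = []
    i = seq.find(site)
    while i != -1:
        positions.append(i)
        i = seq.find(site, i + 1)
    return positions


def open_reading_frames(seq, min_codons=2):
    """Return (start, end) spans running from an ATG to the first in-frame stop.

    Scans the three forward frames only; min_codons (counted from ATG, excluding
    the stop codon) is an illustrative cutoff. The end index is exclusive and
    includes the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None
    return sorted(orfs)
```

For instance, `find_sites(dna, "GAATTC")` lists the positions of the EcoRI recognition sequence in `dna`; a fuller tool would also scan the reverse complement.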

  17. Hybridization and sequencing of nucleic acids using base pair mismatches

    DOEpatents

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  18. Isotopic analyses of amino acids from the Murchison meteorite

    NASA Technical Reports Server (NTRS)

    Pizzarello, S.; Cronin, J. R.; Krishnamurthy, R. V.; Epstein, S.

    1991-01-01

    An account is given of the results of H-2, C-13 isotopic analyses of the Murchison meteorite incorporating an ultrafiltration step to exclude the possibility of fine particulate contaminants. The meteorite's amino acids were chromatographically separated in order to preclude isotopic enrichment by basic compounds other than the amino acids. The results indicate that the Murchison amino acids are isotopically highly unusual; delta-C-13 is elevated by about 40 percent, and delta-D by fully 2500 percent. This high D content of the meteorite's alpha-amino acids may be due to the synthesis of their molecular precursors by low-temperature ion-molecule reactions in an interstellar cloud.

  19. Evidence for Balancing Selection from Nucleotide Sequence Analyses of Human G6PD

    PubMed Central

    Verrelli, Brian C.; McDonald, John H.; Argyropoulos, George; Destro-Bisol, Giovanni; Froment, Alain; Drousiotou, Anthi; Lefranc, Gerard; Helal, Ahmed N.; Loiselet, Jacques; Tishkoff, Sarah A.

    2002-01-01

    Glucose-6-phosphate dehydrogenase (G6PD) mutations that result in reduced enzyme activity have been implicated in malarial resistance and constitute one of the best examples of selection in the human genome. In the present study, we characterize the nucleotide diversity across a 5.2-kb region of G6PD in a sample of 160 Africans and 56 non-Africans, to determine how selection has shaped patterns of DNA variation at this gene. Our global sample of enzymatically normal B alleles and A, A−, and Med alleles with reduced enzyme activities reveals many previously uncharacterized silent-site polymorphisms. In comparison with the absence of amino acid divergence between human and chimpanzee G6PD sequences, we find that the number of G6PD amino acid polymorphisms in human populations is significantly high. Unlike many other G6PD-activity alleles with reduced activity, we find that the age of the A variant, which is common in Africa, may not be consistent with the recent emergence of severe malaria and therefore may have originally had a historically different adaptive function. Overall, our observations strongly support previous genotype-phenotype association studies that proposed that balancing selection maintains G6PD deficiencies within human populations. The present study demonstrates that nucleotide sequence analyses can reveal signatures of both historical and recent selection in the genome and may elucidate the impact that infectious disease has had during human evolution. PMID:12378426

  20. Analysing the performance of personal computers based on Intel microprocessors for sequence aligning bioinformatics applications.

    PubMed

    Nair, Pradeep S; John, Eugene B

    2007-01-01

    Aligning specific sequences against a very large number of other sequences is a central aspect of bioinformatics. With the widespread availability of personal computers in biology laboratories, sequence alignment is now often performed locally. This makes it necessary to analyse the performance of personal computers for sequence aligning bioinformatics benchmarks. In this paper, we analyse the performance of a personal computer for the popular BLAST and FASTA sequence alignment suites. Results indicate that these benchmarks have a large number of recurring operations and use memory operations extensively. It seems that the performance can be improved with a bigger L1-cache.

  1. Amino acid sequence of a mouse immunoglobulin mu chain.

    PubMed Central

    Kehry, M; Sibley, C; Fuhrman, J; Schilling, J; Hood, L E

    1979-01-01

    The complete amino acid sequence of the mouse mu chain from the BALB/c myeloma tumor MOPC 104E is reported. The C mu region contains four consecutive homology regions of approximately 110 residues and a COOH-terminal region of 19 residues. A comparison of this mu chain from mouse with a complete mu sequence from human (Ou) and a partial mu chain sequence from dog (Moo) reveals a striking gradient of increasing homology from the NH2-terminal to the COOH-terminal portion of these mu chains, with the former being the least and the latter the most highly conserved. Four of the five sites of carbohydrate attachment appear to be at identical residue positions when the constant regions of the mouse and human mu chains are compared. The mu chain of MOPC 104E has a carbohydrate moiety attached in the second hypervariable region. This is particularly interesting in view of the fact that MOPC 104E binds alpha-(1→3)-dextran, a simple carbohydrate. The structural and functional constraints imposed by these comparative sequence analyses are discussed. PMID:111247

  2. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2006-07-04

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  3. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2002-01-01

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  4. Kit for detecting nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2001-01-01

    A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the

  5. Analysis of cloned cDNA and genomic sequences for phytochrome: complete amino acid sequences for two gene products expressed in etiolated Avena.

    PubMed Central

    Hershey, H P; Barker, R F; Idler, K B; Lissemore, J L; Quail, P H

    1985-01-01

    Cloned cDNA and genomic sequences have been analyzed to deduce the amino acid sequence of phytochrome from etiolated Avena. Restriction endonuclease site polymorphism between clones indicates that at least four phytochrome genes are expressed in this tissue. Sequence analysis of two complete and one partial coding region shows approximately 98% homology at both the nucleotide and amino acid levels, with the majority of amino acid changes being conservative. High sequence homology is also found in the 5'-untranslated region but significant divergence occurs in the 3'-untranslated region. The phytochrome polypeptides are 1128 amino acid residues long corresponding to a molecular mass of 125 kdaltons. The known protein sequence at the chromophore attachment site occurs only once in the polypeptide, establishing that phytochrome has a single chromophore per monomer covalently linked to Cys-321. Computer analyses of the amino acid sequences have provided predictions regarding a number of structural features of the phytochrome molecule. PMID:3001642

  6. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    States, David J.

    2004-07-28

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  7. Optimizing selection of microsatellite loci from 454 pyrosequencing via post-sequencing bioinformatic analyses.

    PubMed

    Fernandez-Silva, Iria; Toonen, Robert J

    2013-01-01

    The comparatively low cost of massive parallel sequencing technology, also known as next-generation sequencing (NGS), has transformed the isolation of microsatellite loci. The most common NGS approach consists of obtaining large amounts of sequence data from genomic DNA or enriched microsatellite libraries, which is then mined for the discovery of microsatellite repeats using bioinformatics analyses. Here, we describe a bioinformatics approach to isolate microsatellite loci, starting from the raw sequence data through a subset of microsatellite primer pairs. The primary difference to previously published approaches includes analyses to select the most accurate sequence data and to eliminate repetitive elements prior to the design of primers. These analyses aim to minimize the testing of primer pairs by identifying the most promising microsatellite loci.

  8. Human retroviruses and AIDS, 1992. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Korber, B.; Berzofsky, J.A.; Pavlakis, G.N.; Smith, R.F.

    1992-10-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions below of the parts of the compendium, the user should read the individual introductions for each part.

  9. Solid phase sequencing of double-stranded nucleic acids

    DOEpatents

    Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

    2002-01-01

    This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.

  10. Deciphering Clostridium tyrobutyricum Metabolism Based on the Whole-Genome Sequence and Proteome Analyses

    PubMed Central

    Lee, Joungmin; Jang, Yu-Sin; Han, Mee-Jung; Kim, Jin Young

    2016-01-01

    ABSTRACT Clostridium tyrobutyricum is a Gram-positive anaerobic bacterium that efficiently produces butyric acid and is considered a promising host for anaerobic production of bulk chemicals. Due to limited knowledge on the genetic and metabolic characteristics of this strain, however, little progress has been made in metabolic engineering of this strain. Here we report the complete genome sequence of C. tyrobutyricum KCTC 5387 (ATCC 25755), which consists of a 3.07-Mbp chromosome and a 63-kbp plasmid. The results of genomic analyses suggested that C. tyrobutyricum produces butyrate from butyryl-coenzyme A (butyryl-CoA) through acetate reassimilation by CoA transferase, differently from Clostridium acetobutylicum, which uses the phosphotransbutyrylase-butyrate kinase pathway; this was validated by reverse transcription-PCR (RT-PCR) of related genes, protein expression levels, in vitro CoA transferase assay, and fed-batch fermentation. In addition, the changes in protein expression levels during the course of batch fermentations on glucose were examined by shotgun proteomics. Unlike C. acetobutylicum, the expression levels of proteins involved in glycolytic and fermentative pathways in C. tyrobutyricum did not decrease even at the stationary phase. Proteins related to energy conservation mechanisms, including Rnf complex, NfnAB, and pyruvate-phosphate dikinase that are absent in C. acetobutylicum, were identified. Such features explain why this organism can produce butyric acid to a much higher titer and better tolerate toxic metabolites. This study presenting the complete genome sequence, global protein expression profiles, and genome-based metabolic characteristics during the batch fermentation of C. tyrobutyricum will be valuable in designing strategies for metabolic engineering of this strain. PMID:27302759

  11. Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC

    NASA Astrophysics Data System (ADS)

    Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.

    2000-02-01

    Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.

  12. Characterization of bud emergence 46 (BEM46) protein: Sequence, structural, phylogenetic and subcellular localization analyses

    SciTech Connect

    Kumar, Abhishek; Kollath-Leiß, Krisztina; Kempken, Frank

    2013-08-30

    Highlights: •All eukaryotes have at least a single copy of a bem46 ortholog. •The catalytic triad of BEM46 is illustrated using sequence and structural analysis. •We identified indels in the conserved domain of BEM46 protein. •Localization studies of BEM46 protein were carried out using GFP-fusion tagging. -- Abstract: The bud emergence 46 (BEM46) protein from Neurospora crassa belongs to the α/β-hydrolase superfamily. Recently, we have reported that the BEM46 protein is localized in the perinuclear ER and also forms spots close by the plasma membrane. The protein appears to be required for cell type-specific polarity formation in N. crassa. Furthermore, initial studies suggested that the BEM46 amino acid sequence is conserved in eukaryotes and is considered to be one of the widespread conserved “known unknown” eukaryotic genes. This warrants a comprehensive phylogenetic analysis of this superfamily to unravel the origin and molecular evolution of these genes in different eukaryotes. Herein, we observe that all eukaryotes have at least a single copy of a bem46 ortholog. Upon scanning of these proteins in various genomes, we find that there are expansions leading into several paralogs in vertebrates. Using comparative genomic analyses, we identified insertion/deletions (indels) in the conserved domain of BEM46 protein, which allow fungal classes such as ascomycetes to be differentiated from basidiomycetes. We also find that exonic indels are able to differentiate BEM46 homologs of different eukaryotic lineages. Furthermore, we show that BEM46 protein from N. crassa possesses a novel endoplasmic-retention signal (PEKK) using GFP-fusion tagging experiments. We propose that three residues, namely a serine 188S, a histidine 292H and an aspartic acid 262D, are the most critical residues, forming a catalytic triad in BEM46 protein from N. crassa. We carried out a comprehensive study on bem46 genes from a molecular evolution perspective with combination of functional

  13. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Foley, B.; Korber, B.; Mellors, J.W.; Jeang, K.T.; Wain-Hobson, S.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nucleic Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  14. Amino Acid Sequence of Human Cholinesterase

    DTIC Science & Technology

    1985-10-01

    liquid chromatography (HPLC). Activity testing of the aged, DFP-labeled cholinesterase showed that 99.8% of the active sites had been labeled, since...acids were quantitated by ninhydrin at the AAA Labs, or by derivatization with phenylisothiocyanate at the University of Michigan. The latter method

  15. Comparative Sequence Analyses of La Crosse Virus Strain Isolated from Patient with Fatal Encephalitis, Tennessee, USA

    PubMed Central

    Fryxell, Rebecca Trout; Freyman, Kimberly; Ulloa, Armando; Velez, Jason O.; Paulsen, Dave; Lanciotti, Robert S.; Moncayo, Abelardo

    2015-01-01

    We characterized a La Crosse virus (LACV) isolate from the brain of a child who died of encephalitis-associated complications in eastern Tennessee, USA, during summer 2012. We compared the isolate with LACV sequences from mosquitoes collected near the child’s home just after his postmortem diagnosis. In addition, we conducted phylogenetic analyses of these and other sequences derived from LACV strains representing varied temporal, geographic, and ecologic origins. Consistent with historical findings, results of these analyses indicate that a limited range of LACV lineage I genotypes is associated with severe clinical outcomes. PMID:25898269

  16. Cystatin. Amino acid sequence and possible secondary structure.

    PubMed Central

    Schwabe, C; Anastasi, A; Crow, H; McDonald, J K; Barrett, A J

    1984-01-01

    The amino acid sequence of cystatin, the protein from chicken egg-white that is a tight-binding inhibitor of many cysteine proteinases, is reported. Cystatin is composed of 116 amino acid residues, and the Mr is calculated to be 13 143. No striking similarity to any other known sequence has been detected. The results of computer analysis of the sequence and c.d. spectrometry indicate that the secondary structure includes relatively little alpha-helix (about 20%) and that the remainder is mainly beta-structure. PMID:6712597

  17. Arachnid relationships based on mitochondrial genomes: asymmetric nucleotide and amino acid bias affects phylogenetic analyses.

    PubMed

    Masta, Susan E; Longhorn, Stuart J; Boore, Jeffrey L

    2009-01-01

    Phylogenetic analyses based on mitochondrial DNA have yielded widely differing relationships among members of the arthropod lineage Arachnida, depending on the nucleotide coding schemes and models of evolution used. We enhanced taxonomic coverage within the Arachnida greatly by sequencing seven new arachnid mitochondrial genomes from five orders. We then used all 13 mitochondrial protein-coding genes from these genomes to evaluate patterns of nucleotide and amino acid biases. Our data show that two of the six orders of arachnids (spiders and scorpions) have experienced shifts in both nucleotide and amino acid usage in all their protein-coding genes, and that these biases mislead phylogeny reconstruction. These biases are most striking for the hydrophobic amino acids isoleucine and valine, which appear to have evolved asymmetrical exchanges in response to shifts in nucleotide composition. To improve phylogenetic accuracy based on amino acid differences, we tested two recoding methods: (1) removing all isoleucine and valine sites and (2) recoding amino acids based on their physiochemical properties. We find that these methods yield phylogenetic trees that are consistent in their support of ancient intraordinal divergences within the major arachnid lineages. Further refinement of amino acid recoding methods may help us better delineate interordinal relationships among these diverse organisms.
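
    The second recoding method mentioned above, grouping amino acids by physicochemical properties, can be illustrated with a short sketch. The abstract does not say which grouping the authors used; the six Dayhoff classes below are an assumed, commonly used scheme, and the names `DAYHOFF_GROUPS` and `recode` are hypothetical.

```python
# Six Dayhoff classes: an assumed scheme, since the abstract names none.
DAYHOFF_GROUPS = [
    "AGPST",  # 0: small
    "DENQ",   # 1: acidic residues and their amides
    "HKR",    # 2: basic
    "ILMV",   # 3: aliphatic hydrophobic
    "FWY",    # 4: aromatic
    "C",      # 5: cysteine
]

# Map each one-letter amino acid code to the digit of its class.
RECODE = {
    aa: str(index)
    for index, members in enumerate(DAYHOFF_GROUPS)
    for aa in members
}

def recode(protein, unknown="-"):
    """Replace each residue by its class digit; unknown symbols become '-'."""
    return "".join(RECODE.get(aa, unknown) for aa in protein.upper())

print(recode("MKVLIT"))
```

    Under this scheme isoleucine and valine fall into the same class, so exchanges between them no longer register as differences between sequences.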

  18. Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification

    PubMed Central

    Sinclair, Robert M.; Ravantti, Janne J.

    2017-01-01

    ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allows them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids

  19. Phylogenetic study on Shiraia bambusicola by rDNA sequence analyses.

    PubMed

    Cheng, Tian-Fan; Jia, Xiao-Ming; Ma, Xiao-Hang; Lin, Hai-Ping; Zhao, Yu-Hua

    2004-01-01

    In this study, 18S rDNA and ITS-5.8S rDNA regions of four Shiraia bambusicola isolates collected from different species of bamboos were amplified by PCR with universal primer pairs NS1/NS8 and ITS5/ITS4, respectively, and sequenced. Phylogenetic analyses were conducted on three selected datasets of rDNA sequences. Maximum parsimony, distance and maximum likelihood criteria were used to infer trees. Morphological characteristics were also observed. The positioning of Shiraia in the order Pleosporales was well supported by bootstrap, which agreed with the placement by Amano (1980) according to their morphology. We did not find significant inter-hostal differences among these four isolates from different species of bamboos. From the results of analyses and comparison of their rDNA sequences, we conclude that Shiraia should be classified into Pleosporales as Amano (1980) proposed and suggest that it might be positioned in the family Phaeosphaeriaceae.

  20. Accumulated analyses of amino acid precursors in returned lunar samples

    NASA Technical Reports Server (NTRS)

    Fox, S. W.; Harada, K.; Hare, P. E.

    1973-01-01

    Six amino acids (glycine, alanine, aspartic acid, glutamic acid, serine, and threonine) obtained by hydrolysis of extracts have been quantitatively determined in ten collections of fines from five Apollo missions. Although the amounts found, 7-45 ng/g, are small, the lunar amino acid/carbon ratios are comparable to those of the carbonaceous chondrites, Murchison and Murray, as analyzed by the same procedures. Since both the ratios of amino acid to carbon, and the four or five most common types of proteinous amino acid found, are comparable for the two extraterrestrial sources despite different cosmophysical histories of the moon and meteorites, common cosmochemical processes are suggested.

  1. Identification and sequence analyses of the granulin gene of Choristoneura fumiferana granulovirus.

    PubMed

    Bah, A; Bergeron, J; Arella, M; Lucarotti, C J; Guertin, C

    1997-01-01

    The nucleotide sequence of the granulin gene of Choristoneura fumiferana granulovirus (CfGV) was determined. The gene encodes a protein of 248 amino acids with a predicted Mr of 29.299 kDa. The granulin genes of Trichoplusia ni, Pieris brassicae and Cryptophlebia leucotreta granuloviruses showed homologies ranging from 76.7-80.5% for nucleotide sequences and 84.2-88.3% for amino acid sequences when compared to CfGV. The secondary structure of CfGV granulin protein, including the hydrophilic (polar) and hydrophobic (basic) regions, was predicted and found to be similar to other granulins. A very late baculovirus promoter motif, ATAAG, was found within the putative promoter region of the CfGV granulin gene.

  2. Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses

    PubMed Central

    Liu, Bo; Madduri, Ravi K; Sotomayor, Borja; Chard, Kyle; Lacinski, Lukasz; Dave, Utpal J; Li, Jianqiang; Liu, Chunchen; Foster, Ian T

    2014-01-01

    Due to the upcoming data deluge of genome data, the need for storing and processing large-scale genome data, easy access to biomedical analyses tools, efficient data sharing and retrieval has presented significant challenges. The variability in data volume results in variable computing and storage requirements, therefore biomedical researchers are pursuing more reliable, dynamic and convenient methods for conducting sequencing analyses. This paper proposes a Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses, which enables reliable and highly scalable execution of sequencing analyses workflows in a fully automated manner. Our platform extends the existing Galaxy workflow system by adding data management capabilities for transferring large quantities of data efficiently and reliably (via Globus Transfer), domain-specific analyses tools preconfigured for immediate use by researchers (via user-specific tools integration), automatic deployment on Cloud for on-demand resource allocation and pay-as-you-go pricing (via Globus Provision), a Cloud provisioning tool for auto-scaling (via HTCondor scheduler), and the support for validating the correctness of workflows (via semantic verification tools). Two bioinformatics workflow use cases as well as performance evaluation are presented to validate the feasibility of the proposed approach. PMID:24462600

  3. Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses.

    PubMed

    Liu, Bo; Madduri, Ravi K; Sotomayor, Borja; Chard, Kyle; Lacinski, Lukasz; Dave, Utpal J; Li, Jianqiang; Liu, Chunchen; Foster, Ian T

    2014-06-01

    Due to the upcoming data deluge of genome data, the need for storing and processing large-scale genome data, easy access to biomedical analyses tools, efficient data sharing and retrieval has presented significant challenges. The variability in data volume results in variable computing and storage requirements, therefore biomedical researchers are pursuing more reliable, dynamic and convenient methods for conducting sequencing analyses. This paper proposes a Cloud-based bioinformatics workflow platform for large-scale next-generation sequencing analyses, which enables reliable and highly scalable execution of sequencing analyses workflows in a fully automated manner. Our platform extends the existing Galaxy workflow system by adding data management capabilities for transferring large quantities of data efficiently and reliably (via Globus Transfer), domain-specific analyses tools preconfigured for immediate use by researchers (via user-specific tools integration), automatic deployment on Cloud for on-demand resource allocation and pay-as-you-go pricing (via Globus Provision), a Cloud provisioning tool for auto-scaling (via HTCondor scheduler), and the support for validating the correctness of workflows (via semantic verification tools). Two bioinformatics workflow use cases as well as performance evaluation are presented to validate the feasibility of the proposed approach.

  4. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences

    PubMed Central

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D.; Adir, Noam

    2016-01-01

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel. PMID:27307442
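
    A comparison of observed and expected triplet frequencies, as described above, might be sketched as follows. The abstract does not state how the expected frequencies were computed; this example assumes the simplest null model, in which residues occur independently at their overall proteome frequencies. The function name `triplet_ratios` is hypothetical.

```python
from collections import Counter

def triplet_ratios(proteins):
    """Observed/expected count ratio for each amino acid triplet seen.

    Expected counts assume independent residues (an assumption made for
    this sketch, not necessarily the model used in the study).
    """
    singles = Counter()
    triplets = Counter()
    for protein in proteins:
        singles.update(protein)
        # Overlapping windows of three consecutive residues.
        triplets.update(protein[i:i + 3] for i in range(len(protein) - 2))

    n_residues = sum(singles.values())
    n_triplets = sum(triplets.values())
    ratios = {}
    for triplet, observed in triplets.items():
        expected = n_triplets
        for aa in triplet:
            expected *= singles[aa] / n_residues
        ratios[triplet] = observed / expected
    return ratios

print(triplet_ratios(["ACAC"]))
```

    Triplets that never occur are absent from the returned dictionary; a search for underrepresented sequences would also have to enumerate all 8,000 possible triplets and treat the missing ones as having a ratio of zero.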

  5. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

    PubMed

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D; Adir, Noam

    2016-06-28

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel.

  6. Amino acid analyses of R and CK chondrites

    NASA Astrophysics Data System (ADS)

    Burton, Aaron S.; McLain, Hannah; Glavin, Daniel P.; Elsila, Jamie E.; Davidson, Jemma; Miller, Kelly E.; Andronikov, Alexander V.; Lauretta, Dante; Dworkin, Jason P.

    2015-03-01

    Exogenous delivery of amino acids and other organic molecules to planetary surfaces may have played an important role in the origins of life on Earth and other solar system bodies. Previous studies have revealed the presence of indigenous amino acids in a wide range of carbon-rich meteorites, with the abundances and structural distributions differing significantly depending on parent body mineralogy and alteration conditions. Here we report on the amino acid abundances of seven type 3-6 CK chondrites and two Rumuruti (R) chondrites. Amino acid measurements were made on hot water extracts from these meteorites by ultrahigh-performance liquid chromatography with fluorescence detection and time-of-flight mass spectrometry. Of the nine meteorites analyzed, four were depleted in amino acids, and one had experienced significant amino acid contamination by terrestrial biology. The remaining four, comprised of two R and two CK chondrites, contained low levels of amino acids that were predominantly the straight chain, amino-terminal (n-ω-amino) acids β-alanine, and γ-amino-n-butyric acid. This amino acid distribution is similar to what we reported previously for thermally altered ureilites and CV and CO chondrites, and these n-ω-amino acids appear to be indigenous to the meteorites and not the result of terrestrial contamination. The amino acids may have been formed by Fischer-Tropsch-type reactions, although this hypothesis needs further testing.

  7. Amino acid sequences of proteins from Leptospira serovar pomona.

    PubMed

    Alves, S F; Lefebvre, R B; Probert, W

    2000-01-01

    This report describes partial amino acid sequences from three putative outer envelope proteins from Leptospira serovar pomona. In order to obtain internal fragments for protein sequencing, enzymatic and chemical digestion was performed. The enzyme clostripain was used to digest the 32 and 45 kDa proteins. In situ digestion of the 40 kDa protein was accomplished using cyanogen bromide. The 32 kDa protein generated two fragments, one of 21 kDa and another of 10 kDa that yielded five residues. A fragment of 24 kDa that yielded nineteen amino acid residues was obtained from the 45 kDa protein. A fragment with a molecular weight of 20 kDa, yielding a sequence of twenty amino acids, was obtained from the 40 kDa protein.

  8. A weighted U-statistic for genetic association analyses of sequencing data.

    PubMed

    Wei, Changshuai; Li, Ming; He, Zihuai; Vsevolozhskaya, Olga; Schaid, Daniel J; Lu, Qing

    2014-12-01

    With advancements in next-generation sequencing technology, a massive amount of sequencing data is generated, which offers a great opportunity to comprehensively investigate the role of rare variants in the genetic etiology of complex diseases. Nevertheless, the high-dimensional sequencing data poses a great challenge for statistical analysis. The association analyses based on traditional statistical methods suffer substantial power loss because of the low frequency of genetic variants and the extremely high dimensionality of the data. We developed a Weighted U Sequencing test, referred to as WU-SEQ, for the high-dimensional association analysis of sequencing data. Based on a nonparametric U-statistic, WU-SEQ makes no assumption of the underlying disease model and phenotype distribution, and can be applied to a variety of phenotypes. Through simulation studies and an empirical study, we showed that WU-SEQ outperformed a commonly used sequence kernel association test (SKAT) method when the underlying assumptions were violated (e.g., the phenotype followed a heavy-tailed distribution). Even when the assumptions were satisfied, WU-SEQ still attained comparable performance to SKAT. Finally, we applied WU-SEQ to sequencing data from the Dallas Heart Study (DHS), and detected an association between ANGPTL 4 and very low density lipoprotein cholesterol.

  9. Extensive amino acid sequence homologies between animal lectins

    SciTech Connect

    Paroutaud, P.; Levi, G.; Teichberg, V.I.; Strosberg, A.D.

    1987-09-01

    The authors have established the amino acid sequence of the β-D-galactoside binding lectin from the electric eel and the sequences of several peptides from a similar lectin isolated from human placenta. These sequences were compared with the published sequences of peptides derived from the β-D-galactoside binding lectin from human lung and with sequences deduced from cDNAs assigned to the β-D-galactoside binding lectins from chicken embryo skin and human hepatomas. Significant homologies were observed. One of the highly conserved regions that contains a tryptophan residue and two glutamic acid residues is probably part of the β-D-galactoside binding site, which, on the basis of spectroscopic studies of the electric eel lectin, is expected to contain such residues. The similarity of the hydropathy profiles and the predicted secondary structure of the lectins from chicken skin and electric eel, in spite of differences in their amino acid sequences, strongly suggests that these proteins have maintained structural homologies during evolution and together with the other β-D-galactoside binding lectins were derived from a common ancestor gene.

  10. Amino acid sequence of porcine spleen cathepsin D.

    PubMed Central

    Shewale, J G; Tang, J

    1984-01-01

    The amino acid sequence of porcine spleen cathepsin D heavy chain has been determined and, hence, the complete structure of this enzyme is now known. The sequence of heavy chain was constructed by aligning the structures of peptides generated by cyanogen bromide, trypsin, and endo-proteinase Lys C cleavages. The structure of the light chain has been published previously. The cathepsin D molecule contains 339 amino acid residues in two polypeptide chains: a 97-residue light chain and a 242-residue heavy chain, with a combined Mr of 36,779 (without carbohydrate). There are two carbohydrate units linked to asparagine residues 70 and 192. The disulfide bond arrangement in cathepsin D is probably similar to that of pepsin, because the positions of six half-cystine residues are conserved. The active site aspartyl residues, corresponding to aspartic acid-32 and -215 of pepsin, are located at residues 33 and 224 in the cathepsin D molecule. The amino acid sequence around these aspartyl residues is strongly conserved. Cathepsin D shows a strong homology with other acid proteases. When the sequence of cathepsin D, renin, and pepsin are aligned, 32.7% of the residues are identical. The homology is observed throughout the length of the molecules, indicating that three-dimensional structures of all three molecules are similar. PMID:6587385

  11. Genome sequencing elucidates Sardinian genetic architecture and augments association analyses for lipid and blood inflammatory markers

    PubMed Central

    Zoledziewska, Magdalena; Mulas, Antonella; Pistis, Giorgio; Steri, Maristella; Danjou, Fabrice; Kwong, Alan; Ortega del Vecchyo, Vicente Diego; Chiang, Charleston W. K.; Bragg-Gresham, Jennifer; Pitzalis, Maristella; Nagaraja, Ramaiah; Tarrier, Brendan; Brennan, Christine; Uzzau, Sergio; Fuchsberger, Christian; Atzeni, Rossano; Reinier, Frederic; Berutti, Riccardo; Huang, Jie; Timpson, Nicholas J; Toniolo, Daniela; Gasparini, Paolo; Malerba, Giovanni; Dedoussis, George; Zeggini, Eleftheria; Soranzo, Nicole; Jones, Chris; Lyons, Robert; Angius, Andrea; Kang, Hyun M.; Novembre, John; Sanna, Serena; Schlessinger, David; Cucca, Francesco; Abecasis, Gonçalo R

    2015-01-01

    We report ~17.6M genetic variants from whole-genome sequencing of 2,120 Sardinians; 22% are absent from prior sequencing-based compilations and enriched for predicted functional consequence. Furthermore, ~76K variants common in our sample (frequency >5%) are rare elsewhere (<0.5% in the 1000 Genomes Project). We assessed the impact of these variants on circulating lipid levels and five inflammatory biomarkers. Fourteen signals, including two major new loci, were observed for lipid levels, and 19, including two novel loci, for inflammatory markers. New associations would be missed in analyses based on 1000 Genomes data, underlining the advantages of large-scale sequencing in this founder population. PMID:26366554

  12. Improved procedures for automated liquid phase sequence analyses of protein and peptide.

    PubMed

    Hayashi, H; Ohe, Y; Hayashi, T; Iwai, K

    1985-02-01

    For the sequence analysis of histones rich in lysine, we modified the subprograms for two reagents of a JEOL JAS-47KS protein sequence analyzer. Together with this modification, the use of a synthetic carrier, Polybrene, the minimization of aldehyde contamination in Quadrol buffer, and the introduction of hydrophilic groups into epsilon-N-amino groups of lysine residues, markedly increased the repetitive yield of PTH-amino acids. Tetrahymena histones H3 and H4 were thus sequenced up to residues 104 and 92, respectively, in each consecutive analysis (Hayashi, T., Hayashi, H., Fusauchi, Y., & Iwai, K. (1984) J. Biochem. 95, 1741-1749; Hayashi, H., Nomoto, M., & Iwai, K. (1984) J. Biochem. 96, 1449-1456). The details for these improved procedures and results are described here.

  13. Power Spectrum and Mutual Information Analyses of DNA Base (Nucleotide) Sequences

    NASA Astrophysics Data System (ADS)

    Isohata, Yasuhiko; Hayashi, Masaki

    2003-03-01

    On the basis of the power spectrum analyses for the base (nucleotide) sequences of various genes, we have studied long-range correlations in total base sequences which are expressed as 1/f^α, behaviour of the exponent α for the accumulated base sequences as well as periodicities at short range. In particular from the analysis of content rate distributions of α we have obtained the average value ᾱ = 0.40 ± 0.01 and ᾱ = 0.20 ± 0.01 for the human genes and S. cerevisiae genes, respectively. We have also performed the analyses using the mutual information function. We show that there exists a clear difference between the content rate distributions of correlation lengths for the sample human genes and the S. cerevisiae genes. We are led to a conjecture that the elongation of the correlation length in the base sequences of genes from the early eukaryote (S. cerevisiae) to the late eukaryote (human) should be the definite reflection of the evolutionary process.
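
    The two quantities named in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names are hypothetical, the power spectrum is computed with a naive discrete Fourier transform of a single-base indicator sequence, and the mutual information is estimated from plain pair counts.

```python
import cmath
import math
from collections import Counter

def power_spectrum(seq, base):
    """Power spectrum of the 0/1 indicator sequence for one base (naive DFT)."""
    x = [1.0 if b == base else 0.0 for b in seq]
    n = len(x)
    mean = sum(x) / n
    x = [v - mean for v in x]            # remove the zero-frequency offset
    spectrum = []
    for k in range(1, n // 2 + 1):       # frequencies f = k / n
        s = sum(v * cmath.exp(-2j * math.pi * k * t / n) for t, v in enumerate(x))
        spectrum.append(abs(s) ** 2 / n)
    return spectrum

def mutual_information(seq, k):
    """Mutual information (in bits) between bases separated by k positions."""
    pairs = [(seq[i], seq[i + k]) for i in range(len(seq) - k)]
    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)
    right = Counter(b for _, b in pairs)
    mi = 0.0
    for (a, b), c in joint.items():
        p_ab = c / n
        # p_ab / (p_a * p_b) written with raw counts
        mi += p_ab * math.log2(c * n / (left[a] * right[b]))
    return mi
```

    An exponent α could then be estimated by fitting a straight line to log power against log frequency. A correlation length could be read from the decay of the mutual information with k. Both steps are left out of this sketch.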

  14. Active site amino acid sequence of human factor D.

    PubMed

    Davis, A E

    1980-08-01

    Factor D was isolated from human plasma by chromatography on CM-Sephadex C50, Sephadex G-75, and hydroxylapatite. Digestion of reduced, S-carboxymethylated factor D with cyanogen bromide resulted in three peptides which were isolated by chromatography on Sephadex G-75 (superfine) equilibrated in 20% formic acid. NH2-Terminal sequences were determined by automated Edman degradation with a Beckman 890C sequencer using a 0.1 M Quadrol program. The smallest peptide (CNBr III) consisted of the NH2-terminal 14 amino acids. The other two peptides had molecular weights of 17,000 (CNBr I) and 7000 (CNBr II). Overlap of the NH2-terminal sequence of factor D with the NH2-terminal sequence of CNBr I established the order of the peptides. The NH2-terminal 53 residues of factor D are somewhat more homologous with the group-specific protease of rat intestine than with other serine proteases. The NH2-terminal sequence of CNBr II revealed the active site serine of factor D. The typical serine protease active site sequence (Gly-Asp-Ser-Gly-Gly-Pro) was found at residues 12-17. The region surrounding the active site serine does not appear to be more highly homologous with any one of the other serine proteases. The structural data obtained point out the similarities between factor D and the other proteases. However, complete definition of the degree of relationship between factor D and other proteases will require determination of the remainder of the primary structure.

  15. The amino acid sequence of iguana (Iguana iguana) pancreatic ribonuclease.

    PubMed

    Zhao, W; Beintema, J J; Hofsteenge, J

    1994-01-15

    The pyrimidine-specific ribonuclease superfamily constitutes a group of homologous proteins so far found only in higher vertebrates. Four separate families are found in mammals, which have resulted from gene duplications in mammalian ancestors. To learn more about the evolutionary history of this superfamily, the primary structure and other characteristics of the pancreatic enzyme from iguana (Iguana iguana), a herbivorous lizard species belonging to the reptiles, have been determined. The polypeptide chain consists of 119 amino acid residues. The positions of insertions and deletions in the sequence are identical to those in the enzyme from snapping turtle. However, the two enzymes differ at 54% of the amino acid positions. Iguana ribonuclease contains no carbohydrate, although the enzyme possesses three recognition sites for carbohydrate attachment, and has a high number of acidic residues in a localized part of the sequence.

  16. Novel Primer Sets for Next Generation Sequencing-Based Analyses of Water Quality

    PubMed Central

    Lee, Elvina; Khurana, Maninder S.; Whiteley, Andrew S.; Monis, Paul T.; Bath, Andrew; Gordon, Cameron; Ryan, Una M.; Paparini, Andrea

    2017-01-01

    Next generation sequencing (NGS) has rapidly become an invaluable tool for the detection, identification and relative quantification of environmental microorganisms. Here, we demonstrate two new 16S rDNA primer sets, which are compatible with NGS approaches and are primarily for use in water quality studies. Compared to 16S rRNA gene based universal primers, in silico and experimental analyses demonstrated that the new primers showed increased specificity for the Cyanobacteria and Proteobacteria phyla, allowing increased sensitivity for the detection, identification and relative quantification of toxic bloom-forming microalgae, microbial water quality bioindicators and common pathogens. Significantly, Cyanobacterial and Proteobacterial sequences accounted for ca. 95% of all sequences obtained within NGS runs (when compared to ca. 50% with standard universal NGS primers), providing higher sensitivity and greater phylogenetic resolution of key water quality microbial groups. The increased selectivity of the new primers allows the parallel sequencing of more samples through reduced sequence retrieval levels required to detect target groups, potentially reducing NGS costs by 50% but still guaranteeing optimal coverage and species discrimination. PMID:28118368
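
    The link between primer selectivity and sequencing effort can be made concrete with a short worked calculation. This is a simplified expected-value sketch, not the authors' cost model: the function name and the figure of 10,000 target reads per sample are hypothetical, and only the ca. 95% and ca. 50% target fractions come from the abstract.

```python
import math

def reads_needed(target_reads, target_fraction):
    """Total reads to sequence so that the expected number of
    target-group reads reaches target_reads."""
    return math.ceil(target_reads / target_fraction)

# Hypothetical requirement: 10,000 target-group reads per sample.
standard = reads_needed(10_000, 0.50)   # ca. 50% target reads, universal primers
new = reads_needed(10_000, 0.95)        # ca. 95% target reads, new primer sets
saving = 1 - new / standard             # fraction of reads no longer required

print(standard, new, round(saving, 3))  # 20000 10527 0.474
```

    Under these assumptions the new primers need about 47% fewer reads per sample, which is of the same order as the potential 50% cost reduction stated in the abstract.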

  17. Amino acid sequence of myoglobin from white-tailed deer (Odocoileus virginianus).

    PubMed

    Joseph, Poulson; Suman, Surendranath P; Li, Shuting; Fontaine, Michele; Steinke, Laurey

    2012-10-01

    Our objective was to determine the primary structure of white-tailed deer myoglobin (Mb). White-tailed deer Mb was isolated from cardiac muscles employing ammonium sulfate precipitation and gel-filtration chromatography. The amino acid sequence was determined by Edman degradation. Sequence analyses of intact Mb as well as tryptic- and cyanogen bromide-peptides yielded the complete primary structure of white-tailed deer Mb, which shared 100% similarity with red deer Mb. White-tailed deer Mb consists of 153 amino acid residues and shares more than 96% sequence similarity with myoglobins from meat-producing ruminants, such as cattle, buffalo, sheep, and goat. Similar to sheep and goat myoglobins, white-tailed deer Mb contains 12 histidine residues. Proximal (position 93) and distal (position 64) histidine residues responsible for maintaining the stability of heme are conserved in white-tailed deer Mb.

  18. Identification of a Herbal Powder by Deoxyribonucleic Acid Barcoding and Structural Analyses

    PubMed Central

    Sheth, Bhavisha P.; Thaker, Vrinda S.

    2015-01-01

    Background: Authentic identification of plants is essential for exploiting their medicinal properties as well as to stop the adulteration and malpractices with the trade of the same. Objective: To identify a herbal powder obtained from a herbalist in the local vicinity of Rajkot, Gujarat, using deoxyribonucleic acid (DNA) barcoding and molecular tools. Materials and Methods: The DNA was extracted from a herbal powder and selected Cassia species, followed by the polymerase chain reaction (PCR) and sequencing of the rbcL barcode locus. Thereafter the sequences were subjected to National Center for Biotechnology Information (NCBI) basic local alignment search tool (BLAST) analysis, followed by the protein three-dimension structure determination of the rbcL protein from the herbal powder and Cassia species namely Cassia fistula, Cassia tora and Cassia javanica (sequences obtained in the present study), Cassia roxburghii, and Cassia abbreviata (sequences retrieved from GenBank). Further, the multiple and pairwise structural alignment were carried out in order to identify the herbal powder. Results: The nucleotide sequences obtained from the selected species of Cassia were submitted to GenBank (Accession No. JX141397, JX141405, JX141420). The NCBI BLAST analysis of the rbcL protein from the herbal powder showed an equal sequence similarity (with reference to different parameters like E value, maximum identity, total score, query coverage) to C. javanica and C. roxburghii. In order to solve the ambiguities of the BLAST result, a protein structural approach was implemented. The protein homology models obtained in the present study were submitted to the protein model database (PM0079748-PM0079753). The pairwise structural alignment of the herbal powder (as template) and C. javanica and C. roxburghii (as targets individually) revealed a close similarity of the herbal powder with C. javanica. Conclusion: A strategy as used here, incorporating the integrated use of DNA

  19. Amino acid sequence of bovine heart coupling factor 6.

    PubMed Central

    Fang, J K; Jacobs, J W; Kanner, B I; Racker, E; Bradshaw, R A

    1984-01-01

    The amino acid sequence of bovine heart mitochondrial coupling factor 6 (F6) has been determined by automated Edman degradation of the whole protein and derived peptides. Preparations based on heat precipitation and ethanol extraction showed allotypic variation at three positions while material further purified by HPLC yielded only one sequence that also differed by a Phe-Thr replacement at residue 62. The mature protein contains 76 amino acids with a calculated molecular weight of 9006 and a pI of approximately 5, in good agreement with experimentally measured values. The charged amino acids are mainly clustered at the termini and in one section in the middle; these three polar segments are separated by two segments relatively rich in nonpolar residues. Chou-Fasman analysis suggests three stretches of alpha-helix coinciding (or within) the high-charge-density sequences with a single beta-turn at the first polar-nonpolar junction. Comparison of the F6 sequence with those of other proteins did not reveal any homologous structures. PMID:6149548

  20. Amino acid sequence and comparative antigenicity of chicken metallothionein.

    PubMed Central

    McCormick, C C; Fullmer, C S; Garvey, J S

    1988-01-01

    The complete amino acid sequence of metallothionein (MT) from chicken liver is reported. The primary structure was determined by automated sequence analysis of peptides produced by limited acid hydrolysis and by trypsin digestion. The comparative antigenicity of chicken MT was determined by radioimmunoassay using rabbit anti-rat MT polyclonal antibody. Chicken MT consists of 63 amino acids as compared to 61 found in MTs from mammals. One insertion (and two substitutions) occurs in the amino-terminal region, a region considered invariant among mammalian MTs. Eighteen of the 20 cysteines in chicken MT were aligned with cysteines from other mammalian sequences. Two cysteines near the carboxyl terminus are shifted by one residue due to the insertion of proline in that region. Overall, the chicken protein showed approximately 68% sequence identity in a comparison with various mammalian MTs. The affinity of the polyclonal antibody for chicken MT was decreased by 2 orders of magnitude in comparison to that of a mammalian MT (rat MT isoforms). This reduced affinity is attributed to major substitutions in chicken MT in the regions of the principal determinants of mammalian MTs. Theoretical analysis of the primary structure predicted the secondary structure to consist of reverse turns and random coils with no stable beta or helix conformations. There is no evidence that chicken MT differs functionally from mammalian MTs. PMID:2448773

  1. In silico comparative analysis of DNA and amino acid sequences for prion protein gene.

    PubMed

    Kim, Y; Lee, J; Lee, C

    2008-01-01

    Genetic variability might contribute to species specificity of prion diseases in various organisms. In this study, structures of the prion protein gene (PRNP) and its amino acids were compared among species for which sequence data were available. Comparisons of PRNP DNA sequences among 12 species including human, chimpanzee, monkey, bovine, ovine, dog, mouse, rat, wallaby, opossum, chicken and zebrafish allowed us to identify candidate regulatory regions in intron 1 and 3'-untranslated region (UTR) in addition to the coding region. Highly conserved putative binding sites for transcription factors, such as heat shock factor 2 (HSF2) and myocyte enhancer factor 2 (MEF2), were discovered in the intron 1. In 3'-UTR, the functional sequence (ATTAAA) for nucleus-specific polyadenylation was found in all the analysed species. The functional sequence (TTTTTAT) for maturation-specific polyadenylation was identically observed only in ovine, and one or two nucleotide mismatches in the other species. A comparison of the amino acid sequences in 53 species revealed a large sequence identity. Especially the octapeptide repeat region was observed in all the species but frog and zebrafish. Functional changes and susceptibility to prion diseases with various isoforms of prion protein could be caused by numeric variability and conformational changes discovered in the repeat sequences.
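
    The search for the two polyadenylation signals, with exact matches for ATTAAA and one or two mismatches tolerated for TTTTTAT, can be sketched in Python as follows. This is an illustration rather than the tool used in the study: the function name is hypothetical and the 3'-UTR fragment is invented, not a real PRNP sequence.

```python
def find_motif(seq, motif, max_mismatches=0):
    """Return (position, mismatches) for every window of seq that differs
    from motif at no more than max_mismatches positions (0-based positions)."""
    hits = []
    m = len(motif)
    for i in range(len(seq) - m + 1):
        window = seq[i:i + m]
        mismatches = sum(1 for a, b in zip(window, motif) if a != b)
        if mismatches <= max_mismatches:
            hits.append((i, mismatches))
    return hits

# Hypothetical 3'-UTR fragment, invented for illustration only.
utr = "GGCATTAAAGCTTTTCATCC"

print(find_motif(utr, "ATTAAA"))                      # [(3, 0)]
print(find_motif(utr, "TTTTTAT"))                     # []
print(find_motif(utr, "TTTTTAT", max_mismatches=1))   # [(11, 1)]
```

    Allowing mismatches raises the chance of spurious hits in long sequences, so matches found this way are best treated as candidates.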

  2. Phylogenetic analyses of novel squamate adenovirus sequences in wild-caught Anolis lizards.

    PubMed

    Ascher, Jill M; Geneva, Anthony J; Ng, Julienne; Wyatt, Jeffrey D; Glor, Richard E

    2013-01-01

    Adenovirus infection has emerged as a serious threat to the health of captive snakes and lizards (i.e., squamates), but we know relatively little about this virus' range of possible hosts, pathogenicity, modes of transmission, and sources from nature. We report the first case of adenovirus infection in the Iguanidae, a diverse family of lizards that is widely-studied and popular in captivity. We report adenovirus infections from two closely-related species of Anolis lizards (anoles) that were recently imported from wild populations in the Dominican Republic to a laboratory colony in the United States. We investigate the evolution of adenoviruses in anoles and other squamates using phylogenetic analyses of adenovirus polymerase gene sequences sampled from Anolis and a range of other vertebrate taxa. These phylogenetic analyses reveal that (1) the sequences detected from each species of Anolis are novel, and (2) adenoviruses are not necessarily host-specific and do not always follow a co-speciation model under which host and virus phylogenies are perfectly concordant. Together with the fact that the Anolis adenovirus sequences reported in our study were detected in animals that became ill and subsequently died shortly after importation while exhibiting clinical signs consistent with acute adenovirus infection, our discoveries suggest the need for renewed attention to biosecurity measures intended to prevent the spread of adenovirus both within and among species of snakes and lizards housed in captivity.

  3. Phylogenetic analyses of complete mitochondrial genome sequences suggest a basal divergence of the enigmatic rodent Anomalurus

    PubMed Central

    Horner, David S; Lefkimmiatis, Konstantinos; Reyes, Aurelio; Gissi, Carmela; Saccone, Cecilia; Pesole, Graziano

    2007-01-01

    Background Phylogenetic relationships between Lagomorpha, Rodentia and Primates and their allies (Euarchontoglires) have long been debated. While it is now generally agreed that Rodentia constitutes a monophyletic sister-group of Lagomorpha and that this clade (Glires) is sister to Primates and Dermoptera, higher-level relationships within Rodentia remain contentious. Results We have sequenced and performed extensive evolutionary analyses on the mitochondrial genome of the scaly-tailed flying squirrel Anomalurus sp., an enigmatic rodent whose phylogenetic affinities have been obscure and extensively debated. Our phylogenetic analyses of the coding regions of available complete mitochondrial genome sequences from Euarchontoglires suggest that Anomalurus is a sister taxon to the Hystricognathi, and that this clade represents the most basal divergence among sampled Rodentia. Bayesian dating methods incorporating a relaxed molecular clock provide divergence-time estimates which are consistently in agreement with the fossil record and which indicate a rapid radiation within Glires around 60 million years ago. Conclusion Taken together, the data presented provide a working hypothesis as to the phylogenetic placement of Anomalurus, underline the utility of mitochondrial sequences in the resolution of even relatively deep divergences and go some way to explaining the difficulty of conclusively resolving higher-level relationships within Glires with available data and methodologies. PMID:17288612

  4. Phylogenetic Analyses of Novel Squamate Adenovirus Sequences in Wild-Caught Anolis Lizards

    PubMed Central

    Ascher, Jill M.; Geneva, Anthony J.; Ng, Julienne; Wyatt, Jeffrey D.; Glor, Richard E.

    2013-01-01

    Adenovirus infection has emerged as a serious threat to the health of captive snakes and lizards (i.e., squamates), but we know relatively little about this virus' range of possible hosts, pathogenicity, modes of transmission, and sources from nature. We report the first case of adenovirus infection in the Iguanidae, a diverse family of lizards that is widely-studied and popular in captivity. We report adenovirus infections from two closely-related species of Anolis lizards (anoles) that were recently imported from wild populations in the Dominican Republic to a laboratory colony in the United States. We investigate the evolution of adenoviruses in anoles and other squamates using phylogenetic analyses of adenovirus polymerase gene sequences sampled from Anolis and a range of other vertebrate taxa. These phylogenetic analyses reveal that (1) the sequences detected from each species of Anolis are novel, and (2) adenoviruses are not necessarily host-specific and do not always follow a co-speciation model under which host and virus phylogenies are perfectly concordant. Together with the fact that the Anolis adenovirus sequences reported in our study were detected in animals that became ill and subsequently died shortly after importation while exhibiting clinical signs consistent with acute adenovirus infection, our discoveries suggest the need for renewed attention to biosecurity measures intended to prevent the spread of adenovirus both within and among species of snakes and lizards housed in captivity. PMID:23593364

  5. Heteroduplex Mobility and Sequence Analyses for Assessment of Variability of Zucchini yellow mosaic virus.

    PubMed

    Lin, S S; Hou, R F; Yeh, S D

    2000-03-01

    A heteroduplex mobility assay (HMA) was used to analyze the variability among five isolates of Zucchini yellow mosaic virus (ZYMV; TW-TC1, TW-CY2, TW-TN3, TW-TNML1, and TW-NT1) collected from cucurbit fields in different areas of Taiwan. A cDNA fragment of 760 bp covering the variable region of the N terminal half of the coat protein (CP) gene was amplified by reverse transcription-polymerase chain reaction (RT-PCR) and subsequently subjected to HMA analysis for sequence variation. When TW-NT1 combined with any of the other Taiwan isolates, the heteroduplexes obtained migrated much more slowly than did the heteroduplexes obtained in combinations among the other four Taiwan isolates, indicating that TW-TC1, TW-CY2, TW-TN3, and TW-TNML1 share a high degree of sequence homology, while the TW-NT1 isolate is more distinct. The complete nucleotide sequences of the CP genes and the 3' noncoding regions of the five isolates were determined from RT-PCR-derived cDNA clones. A phylogenetic tree derived from the actual sequences of the 760-bp fragments of the five Taiwan and another six ZYMV isolates from different geographic areas revealed four genotypes. TW-TNML1, TW-TC1, TW-CY2, and TW-TN3 were in genotype I, while TW-NT1 and U.S. isolates were in genotype II. The Singapore and Reunion Island isolates were separated into genotypes III and IV, respectively. Comparison of the CP genes of the five Taiwan isolates indicated that they share 92.8 to 98.7% nucleotide identities and 96.4 to 99.3% amino acid identities. The amino acid positions 73, 102, 109, and 149 of the CP gene, where lysine, serine, arginine, and aspartic acid reside, respectively, were uniquely conserved for genotype I Taiwan isolates. Thus, results of HMA agreed well with those of phylogenetic analysis based on the sequence data of the five Taiwan ZYMV isolates. These five ZYMV isolates of known sequence can be used as reference strains for HMA to analyze the variability of ZYMV in Taiwan.
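
    Percent identities such as those reported for the CP genes are computed from aligned sequences. The Python sketch below shows one common convention, assumed here and not necessarily the one used in the study: columns containing a gap in either sequence are excluded from the comparison. The function name and example sequences are hypothetical.

```python
def percent_identity(a, b):
    """Percent identity between two aligned sequences of equal length.
    Columns with a gap ('-') in either sequence are skipped (an assumed
    convention; other definitions use a different denominator)."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        raise ValueError("no ungapped columns to compare")
    matches = sum(1 for x, y in compared if x == y)
    return 100.0 * matches / len(compared)

# Invented toy alignments, for illustration only.
print(percent_identity("ATGCATGC", "ATGCTTGC"))  # 87.5
print(percent_identity("ATG-CA", "ATGGCT"))      # 80.0
```

    The same function works for nucleotide and amino acid alignments, since it only compares characters column by column.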

  6. Sequences Of Amino Acids For Human Serum Albumin

    NASA Technical Reports Server (NTRS)

    Carter, Daniel C.

    1992-01-01

    Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.

  7. Comparative analyses of the complete genome sequences of Pierce's disease and citrus variegated chlorosis strains of Xylella fastidiosa.

    PubMed

    Van Sluys, M A; de Oliveira, M C; Monteiro-Vitorello, C B; Miyaki, C Y; Furlan, L R; Camargo, L E A; da Silva, A C R; Moon, D H; Takita, M A; Lemos, E G M; Machado, M A; Ferro, M I T; da Silva, F R; Goldman, M H S; Goldman, G H; Lemos, M V F; El-Dorry, H; Tsai, S M; Carrer, H; Carraro, D M; de Oliveira, R C; Nunes, L R; Siqueira, W J; Coutinho, L L; Kimura, E T; Ferro, E S; Harakava, R; Kuramae, E E; Marino, C L; Giglioti, E; Abreu, I L; Alves, L M C; do Amaral, A M; Baia, G S; Blanco, S R; Brito, M S; Cannavan, F S; Celestino, A V; da Cunha, A F; Fenille, R C; Ferro, J A; Formighieri, E F; Kishi, L T; Leoni, S G; Oliveira, A R; Rosa, V E; Sassaki, F T; Sena, J A D; de Souza, A A; Truffi, D; Tsukumo, F; Yanai, G M; Zaros, L G; Civerolo, E L; Simpson, A J G; Almeida, N F; Setubal, J C; Kitajima, J P

    2003-02-01

    Xylella fastidiosa is a xylem-dwelling, insect-transmitted, gamma-proteobacterium that causes diseases in many plants, including grapevine, citrus, periwinkle, almond, oleander, and coffee. X. fastidiosa has an unusually broad host range, has an extensive geographical distribution throughout the American continent, and induces diverse disease phenotypes. Previous molecular analyses indicated three distinct groups of X. fastidiosa isolates that were expected to be genetically divergent. Here we report the genome sequence of X. fastidiosa (Temecula strain), isolated from a naturally infected grapevine with Pierce's disease (PD) in a wine-grape-growing region of California. Comparative analyses with a previously sequenced X. fastidiosa strain responsible for citrus variegated chlorosis (CVC) revealed that 98% of the PD X. fastidiosa Temecula genes are shared with the CVC X. fastidiosa strain 9a5c genes. Furthermore, the average amino acid identity of the open reading frames in the strains is 95.7%. Genomic differences are limited to phage-associated chromosomal rearrangements and deletions that also account for the strain-specific genes present in each genome. Genomic islands, one in each genome, were identified, and their presence in other X. fastidiosa strains was analyzed. We conclude that these two organisms have identical metabolic functions and are likely to use a common set of genes in plant colonization and pathogenesis, permitting convergence of functional genomic strategies.

  8. Comparative sequence and genetic analyses of asparagus BACs reveal no microsynteny with onion or rice.

    PubMed

    Jakse, Jernej; Telgmann, Alexa; Jung, Christian; Khar, Anil; Melgar, Sergio; Cheung, Foo; Town, Christopher D; Havey, Michael J

    2006-12-01

    The Poales (includes the grasses) and Asparagales [includes onion (Allium cepa L.) and asparagus (Asparagus officinalis L.)] are the two most economically important monocot orders. The Poales are a member of the commelinoid monocots, a group of orders sister to the Asparagales. Comparative genomic analyses have revealed a high degree of synteny among the grasses; however, it is not known if this synteny extends to other major monocot groups such as the Asparagales. Although we previously reported no evidence for synteny at the recombinational level between onion and rice, microsynteny may exist across shorter genomic regions in the grasses and Asparagales. We sequenced nine asparagus BACs to reveal physically linked genic-like sequences and determined their most similar positions in the onion and rice genomes. Four of the asparagus BACs were selected using molecular markers tightly linked to the sex-determining M locus on chromosome 5 of asparagus. These BACs possessed only two putative coding regions and had long tracts of degenerated retroviral elements and transposons. Five asparagus BACs were selected after hybridization of three onion cDNAs that mapped to three different onion chromosomes. Genic-like sequences that were physically linked on the cDNA-selected BACs or genetically linked on the M-linked BACs showed significant similarities (e < -20) to expressed sequences on different rice chromosomes, revealing no evidence for microsynteny between asparagus and rice across these regions. Genic-like sequences that were linked in asparagus were used to identify highly similar (e < -20) expressed sequence tags (ESTs) of onion. These onion ESTs mapped to different onion chromosomes and no relationship was observed between physical or genetic linkages in asparagus and genetic linkages in onion. These results further indicate that synteny among grass genomes does not extend to a sister order in the monocots and that asparagus may not be an appropriate smaller genome

  9. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  10. Sequence and phylogenetic analyses of novel totivirus-like double-stranded RNAs from field-collected powdery mildew fungi.

    PubMed

    Kondo, Hideki; Hisano, Sakae; Chiba, Sotaro; Maruyama, Kazuyuki; Andika, Ida Bagus; Toyoda, Kazuhiro; Fujimori, Fumihiro; Suzuki, Nobuhiro

    2016-02-02

    The identification of mycoviruses contributes greatly to understanding of the diversity and evolutionary aspects of viruses. Powdery mildew fungi are important and widely studied obligate phytopathogenic agents, but there has been no report on mycoviruses infecting these fungi. In this study, we used a deep sequencing approach to analyze the double-stranded RNA (dsRNA) segments isolated from field-collected samples of powdery mildew fungus-infected red clover plants in Japan. Database searches identified the presence of at least ten totivirus (genus Totivirus)-like sequences, termed red clover powdery mildew-associated totiviruses (RPaTVs). The majority of these sequences shared moderate amino acid sequence identity with each other (<44%) and with other known totiviruses (<59%). Nine of these identified sequences (RPaTV1a, 1b and 2-8) resembled the genome of the prototype totivirus, Saccharomyces cerevisiae virus-L-A (ScV-L-A) in that they contained two overlapping open reading frames (ORFs) encoding a putative coat protein (CP) and an RNA dependent RNA polymerase (RdRp), while one sequence (RPaTV9) showed similarity to another totivirus, Ustilago maydis virus H1 (UmV-H1) that encodes a single polyprotein (CP-RdRp fusion). Similar to yeast totiviruses, each ScV-L-A-like RPaTV contains a -1 ribosomal frameshift site downstream of a predicted pseudoknot structure in the overlapping region of these ORFs, suggesting that the RdRp is translated as a CP-RdRp fusion. Moreover, several ScV-L-A-like sequences were also found by searches of the transcriptome shotgun assembly (TSA) libraries from rust fungi, plants and insects. Phylogenetic analyses show that nine ScV-L-A-like RPaTVs along with ScV-L-A-like sequences derived from TSA libraries are clustered with most established members of the genus Totivirus, while one RPaTV forms a new distinct clade with UmV-H1, possibly establishing an additional genus in the family. Taken together, our results indicate the presence of

  11. Molecular Characterization of Five Potyviruses Infecting Korean Sweet Potatoes Based on Analyses of Complete Genome Sequences.

    PubMed

    Kwak, Hae-Ryun; Kim, Jaedeok; Kim, Mi-Kyeong; Seo, Jang-Kyun; Jung, Mi-Nam; Kim, Jeong-Soo; Lee, Sukchan; Choi, Hong-Soo

    2015-12-01

    Sweet potatoes (Ipomoea batatas L.) are grown extensively in tropical and temperate regions, and are important food crops worldwide. In Korea, potyviruses, including Sweet potato feathery mottle virus (SPFMV), Sweet potato virus C (SPVC), Sweet potato virus G (SPVG), Sweet potato virus 2 (SPV2), and Sweet potato latent virus (SPLV), have been detected in sweet potato fields at a high (~95%) incidence. In the present work, complete genome sequences of 18 isolates, representing the five potyviruses mentioned above, were compared with previously reported genome sequences. The complete genomes consisted of 10,081 to 10,830 nucleotides, excluding the poly-A tails. Their genomic organizations were typical of the Potyvirus genus, including one target open reading frame coding for a putative polyprotein. Based on phylogenetic analyses and sequence comparisons, the Korean SPFMV isolates belonged to the strains RC and O with >98% nucleotide sequence identity. Korean SPVC isolates had 99% identity to the Japanese isolate SPVC-Bungo and 70% identity to the SPFMV isolates. The Korean SPVG isolates showed 99% identity to the three previously reported SPVG isolates. Korean SPV2 isolates had 97% identity to the SPV2 GWB-2 isolate from the USA. Korean SPLV isolates had a relatively low (88%) nucleotide sequence identity with the Taiwanese SPLV-TW isolates, and they were phylogenetically distantly related to SPFMV isolates. Recombination analysis revealed that possible recombination events occurred in the P1, HC-Pro and NIa-NIb regions of SPFMV and SPLV isolates and these regions were identified as hotspots for recombination in the sweet potato potyviruses.

  12. Diversity and distribution of unicellular opisthokonts along the European coast analysed using high-throughput sequencing.

    PubMed

    Del Campo, Javier; Mallo, Diego; Massana, Ramon; de Vargas, Colomban; Richards, Thomas A; Ruiz-Trillo, Iñaki

    2015-09-01

    The opisthokonts are one of the major supergroups of eukaryotes. They comprise two major clades: (i) the Metazoa and their unicellular relatives and (ii) the Fungi and their unicellular relatives. There is, however, little knowledge of the role of opisthokont microbes in many natural environments, especially among non-metazoan and non-fungal opisthokonts. Here, we begin to address this gap by analysing high-throughput 18S rDNA and 18S rRNA sequencing data from different European coastal sites, sampled at different size fractions and depths. In particular, we analyse the diversity and abundance of choanoflagellates, filastereans, ichthyosporeans, nucleariids, corallochytreans and their related lineages. Our results show the great diversity of choanoflagellates in coastal waters as well as a relevant representation of the ichthyosporeans and the uncultured marine opisthokonts (MAOP). Furthermore, we describe a new lineage of marine fonticulids (MAFO) that appears to be abundant in sediments. Taken together, our work points to a greater potential ecological role for unicellular opisthokonts than previously appreciated in marine environments, both in water column and sediments, and also provides evidence of novel opisthokont phylogenetic lineages. This study highlights the importance of high-throughput sequencing approaches to unravel the diversity and distribution of both known and novel eukaryotic lineages.

  13. Phylogeny of yeasts and related filamentous fungi within Pucciniomycotina determined from multigene sequence analyses

    PubMed Central

    Wang, Q.-M.; Groenewald, M.; Takashima, M.; Theelen, B.; Han, P.-J.; Liu, X.-Z.; Boekhout, T.; Bai, F.-Y.

    2015-01-01

    In addition to rusts, the subphylum Pucciniomycotina (Basidiomycota) includes a large number of unicellular or dimorphic fungi which are usually studied as yeasts. Ribosomal DNA sequence analyses have shown that the current taxonomic system of the pucciniomycetous yeasts, which is based on phenotypic criteria, is not concordant with the molecular phylogeny and that many genera are polyphyletic. Here we inferred the molecular phylogeny of 184 pucciniomycetous yeast species and related filamentous fungi using maximum likelihood, maximum parsimony and Bayesian inference analyses based on the sequences of seven genes, including the small subunit ribosomal DNA (rDNA), the large subunit rDNA D1/D2 domains, the internal transcribed spacer regions (ITS 1 and 2) of rDNA including the 5.8S rDNA gene; the nuclear protein-coding genes of the two subunits of RNA polymerase II (RPB1 and RPB2) and the translation elongation factor 1-α (TEF1); and the mitochondrial gene cytochrome b (CYTB). A total of 33 monophyletic clades and 18 single species lineages were recognised among the pucciniomycetous yeasts employed, which belonged to four major lineages corresponding to Agaricostilbomycetes, Cystobasidiomycetes, Microbotryomycetes and Mixiomycetes. These lineages remained independent from the classes Atractiellomycetes, Classiculomycetes, Pucciniomycetes and Tritirachiomycetes formed by filamentous taxa in Pucciniomycotina. An updated taxonomic system of pucciniomycetous yeasts implementing the ‘One fungus = One name’ principle will be proposed based on the phylogenetic framework presented here. PMID:26955197

  14. Molecular phylogenetic and dating analyses using mitochondrial DNA sequences of eyelid geckos (Squamata: Eublepharidae).

    PubMed

    Jonniaux, Pierre; Kumazawa, Yoshinori

    2008-01-15

    Mitochondrial DNA sequences of approximately 2.3 kbp including the complete NADH dehydrogenase subunit 2 gene and its flanking genes, as well as parts of 12S and 16S rRNA genes were determined from major species of the eyelid gecko family Eublepharidae sensu [Kluge, A.G. 1987. Cladistic relationships in the Gekkonoidea (Squamata, Sauria). Misc. Publ. Mus. Zool. Univ. Michigan 173, 1-54.]. In contrast to previous morphological studies, phylogenetic analyses based on these sequences supported that Eublepharidae and Gekkonidae form a sister group with Pygopodidae, raising the possibility of homoplasious character change in some key features of geckos, such as reduction of movable eyelids and innovation of climbing toe pads. The phylogenetic analyses also provided a well-resolved tree for relationships between the eublepharid species. The Bayesian estimation of divergence times without assuming the molecular clock suggested the Jurassic divergence of Eublepharidae from Gekkonidae and radiations of most eublepharid genera around the Cretaceous. These dating results appeared to be robust against some conditional changes for time estimation, such as gene regions used, taxon representation, and data partitioning. Taken together with geological evidence, these results support the vicariant divergence of Eublepharidae and Gekkonidae by the breakup of Pangea into Laurasia and Gondwanaland, and recent dispersal of two African eublepharid genera from Eurasia to Africa after these landmasses were connected in the Early Miocene.

  15. The complementary deoxyribonucleic acid sequence of guinea pig endometrial prorelaxin.

    PubMed

    Lee, Y A; Bryant-Greenwood, G D; Mandel, M; Greenwood, F C

    1992-03-01

    The nucleotide sequence of the relaxin gene transcript in the endometrium of the late pregnant guinea pig has been determined. The strategy used was a combination of polymerase chain reaction (PCR) with primers designed from the mRNA sequence of porcine preprorelaxin, rapid amplification of cDNA ends-PCR, and blunt end cloning in M13 mp18. With heterologous primers, a 226-basepair (bp) segment of the guinea pig relaxin gene sequence was obtained and was used to design a guinea pig-specific primer for use with the rapid amplification of cDNA ends-PCR method. The latter allowed completion of the sequence of 336 bp, with a 96-bp overlap. The sequence obtained shows greater homology at both the nucleotide and amino acid levels with porcine and human relaxins H1 and H2 than with rat relaxin, supporting the thesis that the guinea pig is not a rodent. The transcription of the guinea pig endometrial relaxin gene during pregnancy was confirmed by Northern analysis of guinea pig endometrial tissues with a species-specific cDNA probe. The endometrial relaxin gene is transcribed during pregnancy, but not in lactation, consistent with the observed immunostaining for relaxin.

  16. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy have been used extensively in the physical surface sciences to study quantum tunneling, to measure the electronic local density of states of nanomaterials, and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecules of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique "electronic fingerprints" for all nucleotides on DNA and RNA using Q-Seq, along with their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition, we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  17. Circumscription and phylogeny of the Orthotrichales (Bryopsida) inferred from rbcL sequence analyses.

    PubMed

    Goffinet, B; Bayer, R J; Vitt, D H

    1998-09-01

    The affinities as well as the circumscription of the Orthotrichaceae (Bryopsida), one of the most diverse families of mosses, have been the focus of a controversy for much of the last century. We obtained rbcL sequences for 37 arthrodontous mosses, including 27 taxa of the Orthotrichales. The sequences were analyzed using maximum parsimony and neighbor joining in order to (1) test the monophyly of the Orthotrichales and the Orthotrichaceae; (2) determine their phylogenetic relationships; and (3) test the current subfamilial classification within the Orthotrichaceae. Both analyses suggest that the Orthotrichales are polyphyletic. The Erpodiaceae and the Rhachitheciaceae as well as Amphidium and Drummondia, two genera of the Orthotrichaceae, are shown to be of haplolepideous affinity. The Splachnales, the Bryales sensu lato, and the Orthotrichales form a monophyletic clade sister to the Haplolepideae. Both neighbor joining and maximum parsimony also suggest that the Orthotrichaceae are composed of two major lineages dominated either by acrocarpous or cladocarpous taxa. The monophyly of the family is, however, only well supported by Tamura's distances. The genera Macrocoma, Macromitrium, Orthotrichum, Ulota, and Zygodon all appear to be artificial assemblages. This study illustrates the contribution of rbcL sequence data to bryophyte systematics and, particularly, in determining the affinities of taxa lacking a peristome, whose characters are central to the classification of mosses.

  18. Identification of food and beverage spoilage yeasts from DNA sequence analyses.

    PubMed

    Kurtzman, Cletus P

    2015-11-20

    Detection, identification and classification of yeasts have undergone major changes in the last decade and a half following application of gene sequence analyses and genome comparisons. Development of a database (barcode) of easily determined DNA sequences from domains 1 and 2 (D1/D2) of the nuclear large subunit rRNA gene and from ITS now permits many laboratories to identify species quickly and accurately, thus replacing the laborious and often inaccurate phenotypic tests previously used. Phylogenetic analysis of gene sequences has resulted in a major revision of yeast systematics resulting in redefinition of nearly all genera. This new understanding of species relationships has prompted a change of rules for naming and classifying yeasts and other fungi, and these new rules are presented in the recently implemented International Code of Nomenclature for algae, fungi, and plants (Melbourne Code). The use of molecular methods for species identification and the impact of Code changes on classification will be discussed, especially in the context of food and beverage spoilage yeasts.

  19. DNA sequence analyses of blended herbal products including synthetic cannabinoids as designer drugs.

    PubMed

    Ogata, Jun; Uchiyama, Nahoko; Kikura-Hanajiri, Ruri; Goda, Yukihiro

    2013-04-10

    In recent years, various herbal products adulterated with synthetic cannabinoids have been distributed worldwide via the Internet. These herbal products are mostly sold as incense, and advertised as not for human consumption. Although their labels indicate that they contain mixtures of several potentially psychoactive plants, and numerous studies have reported that they contain a variety of synthetic cannabinoids, their exact botanical contents are not always clear. In this study, we investigated the origins of botanical materials in 62 Spice-like herbal products distributed on the illegal drug market in Japan, by DNA sequence analyses and BLAST searches. The nucleotide sequences of four regions were analyzed to identify the origins of each plant species in the herbal mixtures. The sequences of "Damiana" (Turnera diffusa) and Lamiaceae herbs (Melissa, Mentha and Thymus) were frequently detected in a number of products. However, the sequences of other plant species indicated on the packaging labels were not detected. In a few products, DNA fragments of potent psychotropic plants were found, including marijuana (Cannabis sativa), "Diviner's Sage" (Salvia divinorum) and "Kratom" (Mitragyna speciosa). Their active constituents were also confirmed using gas chromatography-mass spectrometry (GC-MS) and liquid chromatography-mass spectrometry (LC-MS), although these plant names were never indicated on the labels. Most plant species identified in the products were different from the plants indicated on the labels. The plant materials would be used mainly as diluents for the psychoactive synthetic compounds, because no reliable psychoactive effects have been reported for most of the identified plants, with the exception of the psychotropic plants named above.

  20. cDNA sequence and protein bioinformatics analyses of MSTN in African catfish (Clarias gariepinus).

    PubMed

    Kanjanaworakul, Poonmanee; Sawatdichaikul, Orathai; Poompuang, Supawadee

    2016-04-01

    Myostatin, also known as growth differentiation factor 8, has been identified as a potent negative regulator of skeletal muscle growth. The purpose of this study was to characterize and predict the function of the myostatin gene of the African catfish (Cg-MSTN). Expression of Cg-MSTN was determined at three growth stages to establish the relationship between the levels of MSTN transcript and skeletal muscle growth. The partial cDNA sequence of Cg-MSTN was cloned by using published information from its congener, the walking catfish (Cm-MSTN). The Cg-MSTN was 1194 bp in length, encoding a protein of 397 amino acids. The deduced MSTN sequence exhibited key functional sites similar to those of other members of the TGF-β superfamily, in particular the proteolytic processing site (RXXR motif) and nine conserved cysteines at the C-terminus. Expression of MSTN appeared to be correlated with muscle development and growth of African catfish. Protein bioinformatics revealed that the primary sequence of Cg-MSTN shared 98% sequence identity with that of walking catfish Cm-MSTN, with only two different residues, [Formula: see text] and [Formula: see text]. The proposed model of Cg-MSTN revealed the key point mutation [Formula: see text], causing a 7.35 Å shorter distance between the N- and C-lobes and an approximately 11° narrower angle than those of Cm-MSTN. The substitution of a proline residue near the proteolytic processing site, which altered the structure of myostatin, may play a critical role in reducing proteolytic activity of this protein in African catfish.
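
    The two lengths given in this abstract are consistent with each other under one assumption: that the 1194 bp figure refers to the open reading frame including its stop codon. The sketch below shows that arithmetic. The function name and the `includes_stop` parameter are hypothetical, introduced only for illustration.

```python
def protein_length(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an open reading frame.

    orf_bp        -- ORF length in base pairs; must be a multiple of 3
    includes_stop -- True if the final codon is a stop codon, which is
                     counted in orf_bp but encodes no amino acid
    """
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# 1194 bp / 3 = 398 codons; one stop codon leaves 397 amino acids.
print(protein_length(1194))  # 397
```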

  1. Molecular cloning and amino acid sequence of human 5-lipoxygenase

    SciTech Connect

    Matsumoto, T.; Funk, C.D.; Radmark, O.; Hoeoeg, J.O.; Joernvall, H.; Samuelsson, B.

    1988-01-01

    5-Lipoxygenase (EC 1.13.11.34), a Ca2+- and ATP-requiring enzyme, catalyzes the first two steps in the biosynthesis of the peptidoleukotrienes and the chemotactic factor leukotriene B4. A cDNA clone corresponding to 5-lipoxygenase was isolated from a human lung lambda gt11 expression library by immunoscreening with a polyclonal antibody. Additional clones from a human placenta lambda gt11 cDNA library were obtained by plaque hybridization with the 32P-labeled lung cDNA clone. Sequence data obtained from several overlapping clones indicate that the composite DNAs contain the complete coding region for the enzyme. From the deduced primary structure, 5-lipoxygenase is a 673 amino acid protein with a calculated molecular weight of 77,839. Direct analysis of the native protein and its proteolytic fragments confirmed the deduced composition, the amino-terminal amino acid sequence, and the structure of many internal segments. 5-Lipoxygenase has no apparent sequence homology with leukotriene A4 hydrolase or Ca2+-binding proteins. RNA blot analysis indicated substantial amounts of an mRNA species of approximately 2700 nucleotides in leukocytes, lung, and placenta.
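
    A quick sanity check on a reported chain length and molecular weight is the mean mass per residue, which for typical proteins falls somewhere near 110 Da (a widely used rule of thumb, not a figure from this record). The sketch below applies that check to the numbers in the abstract; the function name is hypothetical.

```python
def mean_residue_mass(molecular_weight: float, residues: int) -> float:
    """Average mass per amino acid residue, in the units of molecular_weight."""
    if residues <= 0:
        raise ValueError("residue count must be positive")
    return molecular_weight / residues


# 5-lipoxygenase as reported: 673 residues, calculated molecular weight 77,839.
print(round(mean_residue_mass(77839, 673), 2))  # 115.66
```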

  2. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    DOEpatents

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  3. The amino acid sequence of chymopapain from Carica papaya.

    PubMed Central

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-01-01

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  4. The amino acid sequence of rabbit cardiac troponin I.

    PubMed Central

    Grand, R J; Wilkinson, J M

    1976-01-01

    The complete amino acid sequence of troponin I from rabbit cardiac muscle was determined by the isolation of four unique CNBr fragments, together with overlapping tryptic peptides containing radioactive methionine residues. Overlap data for residues 35-36, 93-94 and 140-145 are incomplete, the sequence at these positions being based on homology with the sequence of the fast-skeletal-muscle protein. Cardiac troponin I is a single polypeptide chain of 206 residues with mol.wt. 23550 and an extinction coefficient, E(1%, 1 cm) at 280 nm, of 4.37. The protein has a net positive charge of 14 and is thus somewhat more basic than troponin I from fast-skeletal muscle. Comparison of the sequences of troponin I from cardiac and fast skeletal muscle shows that the cardiac protein has 26 extra residues at the N-terminus, which account for the larger size of the protein. In the remainder of the sequence there is a considerable degree of homology, this being greater in the C-terminal two-thirds of the molecule. The region in the cardiac protein corresponding to the peptide with inhibitory activity from the fast-skeletal-muscle protein is very similar, and it seems unlikely that this is the cause of the difference in inhibitory activity between the two proteins. The region responsible for binding troponin C, however, possesses a lower degree of homology. Detailed evidence on which the sequence is based has been deposited as Supplementary Publication SUP 50072 (20 pages), at the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7QB, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1976) 153, 5. PMID:1008822

  5. Evolutionary dynamics of influenza A nucleoprotein (NP) lineages revealed by large-scale sequence analyses.

    PubMed

    Xu, Jianpeng; Christman, Mary C; Donis, Ruben O; Lu, Guoqing

    2011-12-01

    Influenza A viral nucleoprotein (NP) plays a critical role in virus replication and host adaptation; however, the underlying molecular evolutionary dynamics of NP lineages are less well understood. In this study, large-scale analyses of 5094 NP nucleotide sequences revealed eight distinct evolutionary lineages, including three host-specific lineages (human, classical swine and equine), two cross-host lineages (Eurasian avian-like swine and swine-origin human pandemic H1N1 2009) and three geographically isolated avian lineages (Eurasian, North American and Oceanian). The average nucleotide substitution rate of the NP lineages was estimated to be 2.4 × 10^-3 substitutions per site per year, with the highest value observed in pandemic H1N1 2009 (3.4 × 10^-3) and the lowest in equine (0.9 × 10^-3). The estimated time of most recent common ancestor (TMRCA) for each lineage demonstrated that the earliest human lineage was derived around 1906, and the latest pandemic H1N1 2009 lineage dated back to December 17, 2008. A marked time gap was found between the times when the viruses emerged and were first sampled, suggesting a crucial role for long-term surveillance of newly emerging viruses. The selection analyses showed that the human lineage had six positive selection sites, whereas pandemic H1N1 2009, classical swine, Eurasian avian and Eurasian swine had only one or two sites. Protein structure analyses revealed several positive selection sites located in epitope regions or host adaptation regions, indicating strong adaptation to host immune system pressures in influenza viruses. Along with previous studies, this study provides new insights into the evolutionary dynamics of influenza A NP lineages. Further lineage analyses of other gene segments will allow better understanding of influenza A virus evolution and assist in the improvement of global influenza surveillance.
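
    A substitution rate expressed per site per year can be turned into an expected number of changes by multiplying by elapsed time and, optionally, by the number of sites considered. The sketch below does only that multiplication, assuming a constant rate along a single lineage, which is a simplification of the models used in such studies. The function name, the 10-year interval and the 1000-site example are illustrative assumptions, not values from the abstract.

```python
def expected_substitutions(rate_per_site_per_year: float,
                           years: float,
                           sites: int = 1) -> float:
    """Expected substitutions along one lineage under a constant rate.

    rate_per_site_per_year -- e.g. 2.4e-3, the mean NP rate quoted above
    years                  -- elapsed time along the lineage
    sites                  -- number of nucleotide sites (1 gives a per-site value)
    """
    if rate_per_site_per_year < 0 or years < 0 or sites < 0:
        raise ValueError("rate, years and sites must be non-negative")
    return rate_per_site_per_year * years * sites


mean_rate = 2.4e-3  # substitutions per site per year, as reported

# Per-site expectation over a hypothetical 10-year interval.
per_site = expected_substitutions(mean_rate, years=10)

# Expectation across a hypothetical stretch of 1000 sites in one year.
per_kb_year = expected_substitutions(mean_rate, years=1, sites=1000)

print(per_site, per_kb_year)
```

    The same function can be used to compare lineages, for example by passing the reported pandemic H1N1 2009 rate (3.4e-3) and the equine rate (0.9e-3) with the same interval.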

  6. A comparative study of cold- and warm-adapted Endonucleases A using sequence analyses and molecular dynamics simulations.

    PubMed

    Michetti, Davide; Brandsdal, Bjørn Olav; Bon, Davide; Isaksen, Geir Villy; Tiberti, Matteo; Papaleo, Elena

    2017-01-01

    The psychrophilic and mesophilic endonucleases A (EndA) from Aliivibrio salmonicida (VsEndA) and Vibrio cholerae (VcEndA) have been studied experimentally in terms of the biophysical properties related to thermal adaptation. The analyses of their static X-ray structures were not sufficient to rationalize the determinants of their adaptive traits at the molecular level. Thus, we used Molecular Dynamics (MD) simulations to compare the two proteins and unveil their structural and dynamical differences. Our simulations did not show a substantial increase in flexibility in the cold-adapted variant on the nanosecond time scale. The only exception is a more rigid C-terminal region in VcEndA, which is ascribable to a cluster of electrostatic interactions and hydrogen bonds, as also supported by MD simulations of the VsEndA mutant variant where the cluster of interactions was introduced. Moreover, we identified three additional amino acid substitutions through multiple sequence alignment and the analyses of MD-based protein structure networks. In particular, T120V occurs in the proximity of the catalytic residue H80 and alters the interaction with the residue Y43, which belongs to the second coordination sphere of the Mg2+ ion. This makes T120V an amenable candidate for future experimental mutagenesis.

  7. A comparative study of cold- and warm-adapted Endonucleases A using sequence analyses and molecular dynamics simulations

    PubMed Central

    Michetti, Davide; Brandsdal, Bjørn Olav; Bon, Davide; Isaksen, Geir Villy; Tiberti, Matteo; Papaleo, Elena

    2017-01-01

    The psychrophilic and mesophilic endonucleases A (EndA) from Aliivibrio salmonicida (VsEndA) and Vibrio cholerae (VcEndA) have been studied experimentally in terms of the biophysical properties related to thermal adaptation. The analyses of their static X-ray structures were not sufficient to rationalize the determinants of their adaptive traits at the molecular level. Thus, we used Molecular Dynamics (MD) simulations to compare the two proteins and unveil their structural and dynamical differences. Our simulations did not show a substantial increase in flexibility in the cold-adapted variant on the nanosecond time scale. The only exception is a more rigid C-terminal region in VcEndA, which is ascribable to a cluster of electrostatic interactions and hydrogen bonds, as also supported by MD simulations of the VsEndA mutant variant where the cluster of interactions was introduced. Moreover, we identified three additional amino acid substitutions through multiple sequence alignment and the analyses of MD-based protein structure networks. In particular, T120V occurs in the proximity of the catalytic residue H80 and alters the interaction with the residue Y43, which belongs to the second coordination sphere of the Mg2+ ion. This makes T120V an amenable candidate for future experimental mutagenesis. PMID:28192428

  8. Ultrasensitive nucleic acid sequence detection by single-molecule electrophoresis

    SciTech Connect

    Castro, A; Shera, E.B.

    1996-09-01

    This is the final report of a one-year laboratory-directed research and development project at Los Alamos National Laboratory. There has been considerable interest in the development of very sensitive clinical diagnostic techniques over the last few years. Many pathogenic agents are often present in extremely small concentrations in clinical samples, especially at the initial stages of infection, making their detection very difficult. This project sought to develop a new technique for the detection and accurate quantification of specific bacterial and viral nucleic acid sequences in clinical samples. The scheme involved the use of novel hybridization probes for the detection of nucleic acids combined with our recently developed technique of single-molecule electrophoresis. This project is directly relevant to the DOE's Defense Programs strategic directions in the area of biological warfare counter-proliferation.

  9. 5S ribosomal ribonucleic acid sequences in Bacteroides and Fusobacterium: evolutionary relationships within these genera and among eubacteria in general

    NASA Technical Reports Server (NTRS)

    Van den Eynde, H.; De Baere, R.; Shah, H. N.; Gharbia, S. E.; Fox, G. E.; Michalik, J.; Van de Peer, Y.; De Wachter, R.

    1989-01-01

    The 5S ribosomal ribonucleic acid (rRNA) sequences were determined for Bacteroides fragilis, Bacteroides thetaiotaomicron, Bacteroides capillosus, Bacteroides veroralis, Porphyromonas gingivalis, Anaerorhabdus furcosus, Fusobacterium nucleatum, Fusobacterium mortiferum, and Fusobacterium varium. A dendrogram constructed by a clustering algorithm from these sequences, which were aligned with all other hitherto known eubacterial 5S rRNA sequences, showed differences as well as similarities with respect to results derived from 16S rRNA analyses. In the 5S rRNA dendrogram, Bacteroides clustered together with Cytophaga and Fusobacterium, as in 16S rRNA analyses. Intraphylum relationships deduced from 5S rRNAs suggested that Bacteroides is specifically related to Cytophaga rather than to Fusobacterium, as was suggested by 16S rRNA analyses. Previous taxonomic considerations concerning the genus Bacteroides, based on biochemical and physiological data, were confirmed by the 5S rRNA sequence analysis.

  10. RAPHIDOPHYCEAE [CHADEFAUD EX SILVA] SYSTEMATICS AND RAPID IDENTIFICATION: SEQUENCE ANALYSES AND REAL-TIME PCR ASSAYS

    PubMed Central

    Bowers, Holly A.; Tomas, Carmelo; Tengs, Torstein; Kempton, Jason W.; Lewitus, Alan J.; Oldach, David W.

    2010-01-01

    Species within the class Raphidophyceae were associated with fish kill events in Japanese, European, Canadian, and U.S. coastal waters. Fish mortality was attributable to gill damage with exposure to reactive oxygen species (peroxide, superoxide, and hydroxide radicals), neurotoxins, physical clogging, and hemolytic substances. Morphological identification of these organisms in environmental water samples is difficult, particularly when fixatives are used. Because of this difficulty and the continued global emergence of these species in coastal estuarine waters, we initiated the development and validation of a suite of real-time polymerase chain reaction (PCR) assays. Sequencing was used to generate complete data sets for nuclear encoded small-subunit ribosomal RNA (SSU rRNA; 18S); internal transcribed spacers 1 and 2, 5.8S; and plastid encoded SSU rRNA (16S) for confirmed raphidophyte cultures from various geographic locations. Sequences for several Chattonella species (C. antiqua, C. marina, C. ovata, C. subsalsa, and C. verruculosa), Heterosigma akashiwo, and Fibrocapsa japonica were generated and used to design rapid and specific PCR assays for several species including C. verruculosa Hara et Chihara, C. subsalsa Biecheler, the complex comprised of C. marina Hara et Chihara, C. antiqua Ono and C. ovata, H. akashiwo Ono, and F. japonica Toriumi et Takano using appropriate loci. With this comprehensive data set, we were also able to perform phylogenetic analyses to determine the relationship between these species. PMID:20411032

  11. Computer-aided analyses of transport protein sequences: gleaning evidence concerning function, structure, biogenesis, and evolution.

    PubMed Central

    Saier, M H

    1994-01-01

    Three-dimensional structures have been elucidated for very few integral membrane proteins. Computer methods can be used as guides for estimation of solute transport protein structure, function, biogenesis, and evolution. In this paper the application of currently available computer programs to over a dozen distinct families of transport proteins is reviewed. The reliability of sequence-based topological and localization analyses and the importance of sequence and residue conservation to structure and function are evaluated. Evidence concerning the nature and frequency of occurrence of domain shuffling, splicing, fusion, deletion, and duplication during evolution of specific transport protein families is also evaluated. Channel proteins are proposed to be functionally related to carriers. It is argued that energy coupling to transport was a late occurrence, superimposed on preexisting mechanisms of solute facilitation. It is shown that several transport protein families have evolved independently of each other, employing different routes, at different times in evolutionary history, to give topologically similar transmembrane protein complexes. The possible significance of this apparent topological convergence is discussed. PMID:8177172

  12. Insight in genome-wide association of metabolite quantitative traits by exome sequence analyses.

    PubMed

    Demirkan, Ayşe; Henneman, Peter; Verhoeven, Aswin; Dharuri, Harish; Amin, Najaf; van Klinken, Jan Bert; Karssen, Lennart C; de Vries, Boukje; Meissner, Axel; Göraler, Sibel; van den Maagdenberg, Arn M J M; Deelder, André M; C 't Hoen, Peter A; van Duijn, Cornelia M; van Dijk, Ko Willems

    2015-01-01

    Metabolite quantitative traits carry great promise for epidemiological studies, and their genetic background has been addressed using Genome-Wide Association Studies (GWAS). Thus far, the role of less common variants has not been exhaustively studied. Here, we set out a GWAS for metabolite quantitative traits in serum, followed by exome sequence analysis to zoom in on putative causal variants in the associated genes. 1H Nuclear Magnetic Resonance (1H-NMR) spectroscopy experiments yielded successful quantification of 42 unique metabolites in 2,482 individuals from The Erasmus Rucphen Family (ERF) study. Heritability of metabolites was estimated by SOLAR. GWAS was performed by linear mixed models, using HapMap imputations. Based on physical vicinity and pathway analyses, candidate genes were screened for coding region variation using exome sequence data. Heritability estimates for metabolites ranged between 10% and 52%. GWAS replicated three known loci at metabolome-wide significance: CPS1 with glycine (P-value = 1.27×10^-32), PRODH with proline (P-value = 1.11×10^-19), SLC16A9 with carnitine level (P-value = 4.81×10^-14) and uncovered a novel association between DMGDH and dimethyl-glycine (P-value = 1.65×10^-19) level. In addition, we found three novel, suggestively significant loci: TNP1 with pyruvate (P-value = 1.26×10^-8), KCNJ16 with 3-hydroxybutyrate (P-value = 1.65×10^-8) and 2p12 locus with valine (P-value = 3.49×10^-8). Exome sequence analysis identified potentially causal coding and regulatory variants located in the genes CPS1, KCNJ2 and PRODH, and revealed allelic heterogeneity for CPS1 and PRODH. Combined GWAS and exome analyses of metabolites detected by high-resolution 1H-NMR is a robust approach to uncover metabolite quantitative trait loci (mQTL), and the likely causative variants in these loci. It is anticipated that insight in the genetics of intermediate phenotypes will provide additional insight into the

  13. Insight in Genome-Wide Association of Metabolite Quantitative Traits by Exome Sequence Analyses

    PubMed Central

    Verhoeven, Aswin; Dharuri, Harish; Amin, Najaf; van Klinken, Jan Bert; Karssen, Lennart C.; de Vries, Boukje; Meissner, Axel; Göraler, Sibel; van den Maagdenberg, Arn M. J. M.; Deelder, André M.; C ’t Hoen, Peter A.; van Duijn, Cornelia M.; van Dijk, Ko Willems

    2015-01-01

    Metabolite quantitative traits carry great promise for epidemiological studies, and their genetic background has been addressed using Genome-Wide Association Studies (GWAS). Thus far, the role of less common variants has not been exhaustively studied. Here, we set out a GWAS for metabolite quantitative traits in serum, followed by exome sequence analysis to zoom in on putative causal variants in the associated genes. 1H Nuclear Magnetic Resonance (1H-NMR) spectroscopy experiments yielded successful quantification of 42 unique metabolites in 2,482 individuals from The Erasmus Rucphen Family (ERF) study. Heritability of metabolites was estimated by SOLAR. GWAS was performed by linear mixed models, using HapMap imputations. Based on physical vicinity and pathway analyses, candidate genes were screened for coding region variation using exome sequence data. Heritability estimates for metabolites ranged between 10% and 52%. GWAS replicated three known loci at metabolome-wide significance: CPS1 with glycine (P-value = 1.27×10⁻³²), PRODH with proline (P-value = 1.11×10⁻¹⁹), SLC16A9 with carnitine level (P-value = 4.81×10⁻¹⁴) and uncovered a novel association between DMGDH and dimethyl-glycine (P-value = 1.65×10⁻¹⁹) level. In addition, we found three novel, suggestively significant loci: TNP1 with pyruvate (P-value = 1.26×10⁻⁸), KCNJ16 with 3-hydroxybutyrate (P-value = 1.65×10⁻⁸) and 2p12 locus with valine (P-value = 3.49×10⁻⁸). Exome sequence analysis identified potentially causal coding and regulatory variants located in the genes CPS1, KCNJ2 and PRODH, and revealed allelic heterogeneity for CPS1 and PRODH. Combined GWAS and exome analyses of metabolites detected by high-resolution 1H-NMR is a robust approach to uncover metabolite quantitative trait loci (mQTL), and the likely causative variants in these loci. It is anticipated that insight into the genetics of intermediate phenotypes will provide additional

  14. Nucleic acid (cDNA) and amino acid sequences of alpha-type gliadins from wheat (Triticum aestivum).

    PubMed Central

    Kasarda, D D; Okita, T W; Bernardin, J E; Baecker, P A; Nimmo, C C; Lew, E J; Dietler, M D; Greene, F C

    1984-01-01

    The complete amino acid sequence for an alpha-type gliadin protein of wheat (Triticum aestivum Linnaeus) endosperm has been derived from a cloned cDNA sequence. An additional cDNA clone that corresponds to about 75% of a similar alpha-type gliadin has been sequenced and shows some important differences. About 97% of the composite sequence of A-gliadin (an alpha-type gliadin fraction) has also been obtained by direct amino acid sequencing. This sequence shows a high degree of similarity with amino acid sequences derived from both cDNA clones and is virtually identical to one of them. On the basis of sequence information, after loss of the signal sequence, the mature alpha-type gliadins may be divided into five different domains, two of which may have evolved from an ancestral gliadin gene, whereas the remaining three contain repeating sequences that may have developed independently. PMID:6589619

  15. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    SciTech Connect

    Myers, G.; Korber, B.; Wain-Hobson, S.; Smith, R.F.; Pavlakis, G.N.

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  16. The Mitochondrial Genomes of Aquila fasciata and Buteo lagopus (Aves, Accipitriformes): Sequence, Structure and Phylogenetic Analyses

    PubMed Central

    Jiang, Lan; Chen, Juan; Wang, Ping; Ren, Qiongqiong; Yuan, Jian; Qian, Chaoju; Hua, Xinghong; Guo, Zhichun; Zhang, Lei; Yang, Jianke; Wang, Ying; Zhang, Qin; Ding, Hengwu; Bi, De; Zhang, Zongmeng; Wang, Qingqing; Chen, Dongsheng; Kan, Xianzhao

    2015-01-01

    The family Accipitridae is one of the largest groups of non-passerine birds, including 68 genera and 243 species globally distributed. In the present study, we determined the complete mitochondrial sequences of two species of accipitrid, namely Aquila fasciata and Buteo lagopus, and conducted a comparative mitogenome analysis across the family. The mitogenome lengths of A. fasciata and B. lagopus are 18,513 and 18,559 bp, with A + T contents of 54.2% and 55.0%, respectively. In the mtDNAs of both accipitrid birds, obvious positive AT-skew and negative GC-skew biases were detected for all 12 PCGs encoded by the H strand, whereas the reverse was found in MT-ND6 encoded by the L strand. One extra nucleotide 'C' is present at position 174 of the MT-ND3 gene of A. fasciata but is not observed in that of B. lagopus. Six conserved sequence boxes in Domain II, named boxes F, E, D, C, CSBa, and CSBb, respectively, were recognized in the CRs of A. fasciata and B. lagopus. Rates and patterns of mitochondrial gene evolution within Accipitridae were also estimated. The highest dN/dS was detected for the MT-ATP8 gene (0.32493) among Accipitridae, and the lowest for the MT-CO1 gene (0.01415). Mitophylogenetic analysis supported the robust monophyly of Accipitriformes, and Cathartidae was basal to the balance of the order. Moreover, we performed phylogenetic analyses using two other data sets (two mitochondrial loci, and combined nuclear and mitochondrial loci). Our results indicate that the subfamily Aquilinae and all currently polytypic genera of this subfamily are monophyletic. These two novel mtDNA data sets will be useful in refining the phylogenetic relationships and evolutionary processes of Accipitriformes. PMID:26295156
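
The A + T content and the AT-/GC-skew values quoted in this record are simple base-composition statistics. The sketch below is not taken from the paper; it assumes the commonly used definitions AT-skew = (A - T)/(A + T) and GC-skew = (G - C)/(G + C), computed on a single strand. The function names and the toy sequence are hypothetical.

```python
from collections import Counter


def at_content(seq: str) -> float:
    """Fraction of A and T among the unambiguous bases (A, C, G, T)."""
    counts = Counter(seq.upper())
    total = sum(counts[base] for base in "ACGT")
    return (counts["A"] + counts["T"]) / total if total else 0.0


def skews(seq: str) -> tuple[float, float]:
    """Return (AT-skew, GC-skew) for one strand of a nucleotide sequence.

    Assumed definitions: AT-skew = (A - T) / (A + T),
    GC-skew = (G - C) / (G + C). Ambiguous bases are ignored.
    """
    counts = Counter(seq.upper())
    a, t, g, c = (counts[base] for base in "ATGC")
    at_skew = (a - t) / (a + t) if (a + t) else 0.0
    gc_skew = (g - c) / (g + c) if (g + c) else 0.0
    return at_skew, gc_skew


if __name__ == "__main__":
    toy = "AAATGCCC"  # hypothetical sequence, not real mitogenome data
    print(at_content(toy), skews(toy))
```

A positive AT-skew means more A than T on the analysed strand, and a negative GC-skew means more C than G, which is the pattern the abstract reports for the H-strand-encoded genes.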

  17. Phylogeny of tremellomycetous yeasts and related dimorphic and filamentous basidiomycetes reconstructed from multiple gene sequence analyses

    PubMed Central

    Liu, X.-Z.; Wang, Q.-M.; Theelen, B.; Groenewald, M.; Bai, F.-Y.; Boekhout, T.

    2015-01-01

    The Tremellomycetes (Basidiomycota) contains a large number of unicellular and dimorphic fungi with stable free-living unicellular states in their life cycles. These fungi have been conventionally classified as basidiomycetous yeasts based on physiological and biochemical characteristics. Many currently recognised genera of these yeasts are mainly defined based on phenotypical characters and are highly polyphyletic. Here we reconstructed the phylogeny of the majority of described anamorphic and teleomorphic tremellomycetous yeasts using Bayesian inference, maximum likelihood, and neighbour-joining analyses based on the sequences of seven genes, including three rRNA genes, namely the small subunit of the ribosomal DNA (rDNA), D1/D2 domains of the large subunit rDNA, and the internal transcribed spacer regions (ITS 1 and 2) of rDNA including 5.8S rDNA; and four protein-coding genes, namely the two subunits of the RNA polymerase II (RPB1 and RPB2), the translation elongation factor 1-α (TEF1) and the mitochondrial gene cytochrome b (CYTB). With the consideration of morphological, physiological and chemotaxonomic characters and the congruence of phylogenies inferred from analyses using different algorithms based on different data sets consisting of the combined seven genes, the three rRNA genes, and the individual protein-coding genes, five major lineages corresponding to the orders Cystofilobasidiales, Filobasidiales, Holtermanniales, Tremellales, and Trichosporonales were resolved. A total of 45 strongly supported monophyletic clades with multiple species and 23 single species clades were recognised. This phylogenetic framework will be the basis for the proposal of an updated taxonomic system of tremellomycetous yeasts that will be compatible with the current taxonomic system of filamentous basidiomycetes accommodating the ‘one fungus, one name’ principle. PMID:26955196

  18. Non-canonical integration events in Pichia pastoris encountered during standard transformation analysed with genome sequencing

    PubMed Central

    Schwarzhans, Jan-Philipp; Wibberg, Daniel; Winkler, Anika; Luttermann, Tobias; Kalinowski, Jörn; Friehs, Karl

    2016-01-01

    The non-conventional yeast Pichia pastoris is a popular host for recombinant protein production in scientific research and industry. Typically, the expression cassette is integrated into the genome via homologous recombination. Due to unknown integration events, a large clonal variability is often encountered consisting of clones with different productivities as well as aberrant morphological or growth characteristics. In this study, we analysed several clones with abnormal colony morphology and discovered unpredicted integration events via whole genome sequencing. These include (i) the relocation of the locus targeted for replacement to another chromosome (ii) co-integration of DNA from the E. coli plasmid host and (iii) the disruption of untargeted genes affecting colony morphology. Most of these events have not been reported so far in literature and present challenges for genetic engineering approaches in this yeast. Especially, the presence and independent activity of E. coli DNA elements in P. pastoris is of concern. In our study, we provide a deeper insight into these events and their potential origins. Steps preventing or reducing the risk for these phenomena are proposed and will help scientists working on genetic engineering of P. pastoris or similar non-conventional yeast to better understand and control clonal variability. PMID:27958335

  19. The GeneCards Suite: From Gene Data Mining to Disease Genome Sequence Analyses.

    PubMed

    Stelzer, Gil; Rosen, Naomi; Plaschkes, Inbar; Zimmerman, Shahar; Twik, Michal; Fishilevich, Simon; Stein, Tsippi Iny; Nudel, Ron; Lieder, Iris; Mazor, Yaron; Kaplan, Sergey; Dahary, Dvir; Warshawsky, David; Guan-Golan, Yaron; Kohn, Asher; Rappaport, Noa; Safran, Marilyn; Lancet, Doron

    2016-06-20

    GeneCards, the human gene compendium, enables researchers to effectively navigate and inter-relate the wide universe of human genes, diseases, variants, proteins, cells, and biological pathways. Our recently launched Version 4 has a revamped infrastructure facilitating faster data updates, better-targeted data queries, and friendlier user experience. It also provides a stronger foundation for the GeneCards suite of companion databases and analysis tools. Improved data unification includes gene-disease links via MalaCards and merged biological pathways via PathCards, as well as drug information and proteome expression. VarElect, another suite member, is a phenotype prioritizer for next-generation sequencing, leveraging the GeneCards and MalaCards knowledgebase. It automatically infers direct and indirect scored associations between hundreds or even thousands of variant-containing genes and disease phenotype terms. VarElect's capabilities, either independently or within TGex, our comprehensive variant analysis pipeline, help prepare for the challenge of clinical projects that involve thousands of exome/genome NGS analyses. © 2016 by John Wiley & Sons, Inc.

  20. Palaeoenvironmental and sequence stratigraphic analyses of the Jurassic Datta Formation, Salt Range, Pakistan

    NASA Astrophysics Data System (ADS)

    Iqbal, Shahid; Jan, Irfan U.; Akhter, M. Gulraiz; Bibi, Mehwish

    2015-06-01

    The Lower Jurassic Datta Formation, western Salt Range, Pakistan, comprises three facies associations: (1) channel belt facies association (CBFA), (2) channel margin and overbank facies association (CMOFA), and (3) lagoonal facies association (LFA). A cyclic fining-upward trend in the succession is represented by basal quartzose conglomerate/pebbly sandstone, through coarse to fine quartzose sandstone to siltstone and shales/claystone, which contains some carbonate accumulation. Two prominent depositional sequences are recognized in the Datta Formation, a lower high-magnitude cycle and an upper low-magnitude cycle. The Datta Formation thus represents a thick sedimentary succession and in the study area, i.e., western Salt Range, mainly channel belt, flood plain and/or delta top facies are exposed. The palaeocurrent analysis shows that the source area, with acidic plutonic rocks, lay to the S-SE in the Indian Shield, the Aravallis, or older sedimentary rocks of the Indus Basin (i.e., Khewra, Tobra and Warchha formations). A tentative stratigraphic correlation of the Datta Formation with the Lower Jurassic Lathi Formation, India, invites further work in parts of India, which will elaborate the extent of the Datta Formation in the Greater Indian peninsula and develop the palaeogeographic setting for this Lower Jurassic deltaic rock unit.

  1. Nucleic acid (cDNA) and amino acid sequences of the maize endosperm protein glutelin-2.

    PubMed Central

    Prat, S; Cortadas, J; Puigdomènech, P; Palau, J

    1985-01-01

    The cDNA coding for a glutelin-2 protein from maize endosperm has been cloned and the complete amino acid sequence of the protein derived for the first time. An immature maize endosperm cDNA bank was screened for the expression of a beta-lactamase:glutelin-2 (G2) fusion polypeptide by using antibodies against the purified 28 kd G2 protein. A clone corresponding to the 28 kd G2 protein was sequenced and the primary structure of this protein was derived. Five regions can be defined in the protein sequence: an 11 residue N-terminal part, a repeated region formed by eight units of the sequence Pro-Pro-Pro-Val-His-Leu, an alternating Pro-X stretch 21 residues long, a Cys rich domain and a C-terminal part rich in Gln. The protein sequence is preceded by 19 residues which have the characteristics of the signal peptide found in secreted proteins. Unlike zeins, the main maize storage proteins, 28 kd glutelin-2 has several homologous sequences in common with other cereal storage proteins. PMID:3839076

  2. Genomic Resources for Water Yam (Dioscorea alata L.): Analyses of EST-Sequences, De Novo Sequencing and GBS Libraries.

    PubMed

    Saski, Christopher A; Bhattacharjee, Ranjana; Scheffler, Brian E; Asiedu, Robert

    2015-01-01

    The reducing cost and rapid progress in next-generation sequencing techniques coupled with high performance computational approaches have resulted in large-scale discovery of advanced genomic resources in several model and non-model plant species. Yam (Dioscorea spp.) is a major food and cash crop in many countries but research efforts have been limited to understand the genetics and generate genomic information for the crop. The availability of a large number of genomic resources including genome-wide molecular markers will accelerate the breeding efforts and application of genomic selection in yams. In the present study, several methods including expressed sequence tags (EST)-sequencing, de novo sequencing, and genotyping-by-sequencing (GBS) profiles on two yam (Dioscorea alata L.) genotypes (TDa 95/00328 and TDa 95-310) were performed to generate genomic resources for use in its improvement programs. This includes a comprehensive set of EST-SSRs, genomic SSRs, whole genome SNPs, and reduced representation SNPs. A total of 1,152 EST-SSRs were developed from >40,000 EST-sequences generated from the two genotypes. A set of 388 EST-SSRs were validated as polymorphic showing a polymorphism rate of 34% when tested on two diverse parents targeted for anthracnose disease. In addition, approximately 40X de novo whole genome sequence coverage was generated for each of the two genotypes, and a total of 18,584 and 15,952 genomic SSRs were identified for TDa 95/00328 and TDa 95-310, respectively. A custom-made pipeline resulted in the selection of 573 genomic SSRs common across the two genotypes, of which only eight failed, 478 being polymorphic and 62 monomorphic indicating a polymorphic rate of 83.5%. Additionally, 288,505 high quality SNPs were also identified between these two genotypes. Genotyping by sequencing reads on these two genotypes also revealed 36,790 overlapping SNP positions that are distributed throughout the genome. Our efforts in using different approaches

  3. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  4. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2012-07-01 2012-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  5. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2014-07-01 2014-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  6. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  7. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2013-07-01 2013-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  8. Searching for Extraterrestrial Amino Acids in a Contaminated Meteorite: Amino Acid Analyses of the Canakkale L6 Chondrite

    NASA Technical Reports Server (NTRS)

    Burton, A. S.; Elsila, J. E.; Glavin, D. P.; Dworkin, J. P.; Ornek, C. Y.; Esenoglu, H. H.; Unsalan, O.; Ozturk, B.

    2016-01-01

    Amino acids can serve as important markers of cosmochemistry, as their abundances and isomeric and isotopic compositions have been found to vary predictably with changes in parent body chemistry and alteration processes. Amino acids are also of astrobiological interest because they are essential for life on Earth. Analyses of a range of meteorites, including all groups of carbonaceous chondrites, along with H, R, and LL chondrites, ureilites, and a martian shergottite, have revealed that amino acids of plausible extraterrestrial origin can be formed in and persist after a wide range of parent body conditions. However, amino acid analyses of L6 chondrites to date have not provided evidence for indigenous amino acids. In the present study, we performed amino acid analysis on larger samples of a different L6 chondrite, Canakkale, to determine whether or not trace levels of indigenous amino acids could be found. The Canakkale meteor was an observed fall in late July, 1964, near Canakkale, Turkey. The meteorite samples (1.36 and 1.09 g) analyzed in this study were allocated by C. Y. Ornek, along with a soil sample (1.5 g) collected near the Canakkale recovery site.

  9. Genomic resources for water yam (Dioscorea alata L.): analyses of EST-Sequences, De Novo sequencing and GBS libraries

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The reducing cost and rapid progress in next-generation sequencing techniques coupled with high performance computational approaches have resulted in large-scale discovery of advanced genomic resources such as SSRs, SNPs and InDels in several model and non-model plant species. Yam (Dioscorea spp.) i...

  10. DNA Sequence Analyses Reveal Abundant Diversity, Endemism and Evidence for Asian Origin of the Porcini Mushrooms

    PubMed Central

    Feng, Bang; Xu, Jianping; Wu, Gang; Zeng, Nian-Kai; Li, Yan-Chun; Tolgor, Bau; Kost, Gerhard W.; Yang, Zhu L.

    2012-01-01

    The wild gourmet mushroom Boletus edulis and its close allies are of significant ecological and economic importance. They are found throughout the Northern Hemisphere, but despite their ubiquity there are still many unresolved issues with regard to the taxonomy, systematics and biogeography of this group of mushrooms. Most phylogenetic studies of Boletus so far have characterized samples from North America and Europe and little information is available on samples from other areas, including the ecologically and geographically diverse regions of China. Here we analyzed DNA sequence variation in three gene markers from samples of these mushrooms from across China and compared our findings with those from other representative regions. Our results revealed fifteen novel phylogenetic species (about one-third of the known species) and a newly identified lineage represented by Boletus sp. HKAS71346 from tropical Asia. The phylogenetic analyses support eastern Asia as the center of diversity for the porcini sensu stricto clade. Within this clade, B. edulis is the only known holarctic species. The majority of the other phylogenetic species are geographically restricted in their distributions. Furthermore, molecular dating and geological evidence suggest that this group of mushrooms originated during the Eocene in eastern Asia, followed by dispersal to and subsequent speciation in other parts of Asia, Europe, and the Americas from the middle Miocene through the early Pliocene. In contrast to the ancient dispersal of porcini in the strict sense in the Northern Hemisphere, the occurrence of B. reticulatus and B. edulis sensu lato in the Southern Hemisphere was probably due to recent human-mediated introductions. PMID:22629418

  11. Human liver apolipoprotein B-100 cDNA: complete nucleic acid and derived amino acid sequence.

    PubMed Central

    Law, S W; Grant, S M; Higuchi, K; Hospattankar, A; Lackner, K; Lee, N; Brewer, H B

    1986-01-01

    Human apolipoprotein B-100 (apoB-100), the ligand on low density lipoproteins that interacts with the low density lipoprotein receptor and initiates receptor-mediated endocytosis and low density lipoprotein catabolism, has been cloned, and the complete nucleic acid and derived amino acid sequences have been determined. ApoB-100 cDNAs were isolated from normal human liver cDNA libraries utilizing immunoscreening as well as filter hybridization with radiolabeled apoB-100 oligodeoxynucleotides. The apoB-100 mRNA is 14.1 kilobases long encoding a mature apoB-100 protein of 4536 amino acids with a calculated amino acid molecular weight of 512,723. ApoB-100 contains 20 potential glycosylation sites, and 12 of a total of 25 cysteine residues are located in the amino-terminal region of the apolipoprotein providing a potential globular structure of the amino terminus of the protein. ApoB-100 contains relatively few regions of amphipathic helices, but compared to other human apolipoproteins it is enriched in beta-structure. The delineation of the entire human apoB-100 sequence will now permit a detailed analysis of the conformation of the protein, the low density lipoprotein receptor binding domain(s), and the structural relationship between apoB-100 and apoB-48 and will provide the basis for the study of genetic defects in apoB-100 in patients with dyslipoproteinemias. PMID:3464946

  12. Computer selection of oligonucleotide probes from amino acid sequences for use in gene library screening.

    PubMed

    Yang, J H; Ye, J H; Wallace, D C

    1984-01-11

    We present a computer program, FINPROBE, which utilizes known amino acid sequence data to deduce minimum redundancy oligonucleotide probes for use in screening cDNA or genomic libraries or in primer extension. The user enters the amino acid sequence of interest, the desired probe length, the number of probes sought, and the constraints on oligonucleotide synthesis. The computer generates a table of possible probes listed in increasing order of redundancy and provides the location of each probe in the protein and mRNA coding sequence. Activation of a next function provides the amino acid and mRNA sequences of each probe of interest as well as the complementary sequence and the minimum dissociation temperature of the probe. A final routine prints out the amino acid sequence of the protein in parallel with the mRNA sequence listing all possible codons for each amino acid.
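
The idea of ranking candidate probes by redundancy can be illustrated with a short sketch. This is not the FINPROBE algorithm, whose internals the abstract does not give; it is a simplified illustration that assumes the standard genetic code, whole-codon probes (probe length = 3 × number of residues), and redundancy defined as the product of the number of codons for each residue in the window. The function name and the example peptide are hypothetical.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code (sums to 61).
CODON_COUNT = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def rank_probe_windows(protein: str, n_residues: int, n_probes: int):
    """Return the n_probes least redundant windows of n_residues residues.

    Each result is (redundancy, start_index, peptide), where redundancy is
    the number of distinct oligonucleotides that could encode the peptide
    and start_index is 0-based. Raises KeyError for non-standard residues.
    """
    protein = protein.upper()
    windows = []
    for start in range(len(protein) - n_residues + 1):
        peptide = protein[start:start + n_residues]
        redundancy = prod(CODON_COUNT[aa] for aa in peptide)
        windows.append((redundancy, start, peptide))
    windows.sort()  # increasing redundancy, ties broken by position
    return windows[:n_probes]


if __name__ == "__main__":
    # Hypothetical peptide; windows rich in Met and Trp rank first.
    for redundancy, start, peptide in rank_probe_windows("MWKLS", 2, 2):
        print(start, peptide, redundancy)
```

Windows containing Met or Trp (one codon each) rank first, while Leu, Ser and Arg (six codons each) push a window down the table, which is why low-redundancy probes tend to cluster around the rarer residues.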

  13. Specific catalysis of asparaginyl deamidation by carboxylic acids: kinetic, thermodynamic, and quantitative structure-property relationship analyses.

    PubMed

    Connolly, Brian D; Tran, Benjamin; Moore, Jamie M R; Sharma, Vikas K; Kosky, Andrew

    2014-04-07

    Asparaginyl (Asn) deamidation could lead to altered potency, safety, and/or pharmacokinetics of therapeutic protein drugs. In this study, we investigated the effects of several different carboxylic acids on Asn deamidation rates using an IgG1 monoclonal antibody (mAb1*) and a model hexapeptide (peptide1) with the sequence YGKNGG. Thermodynamic analyses of the kinetics data revealed that higher deamidation rates are associated with predominantly more negative ΔS and, to a lesser extent, more positive ΔH. The observed differences in deamidation rates were attributed to the unique ability of each type of carboxylic acid to stabilize the energetically unfavorable transition-state conformations required for imide formation. Quantitative structure property relationship (QSPR) analysis using kinetic data demonstrated that molecular descriptors encoding for the geometric spatial distribution of atomic properties on various carboxylic acids are effective determinants for the deamidation reaction. Specifically, the number of O-O and O-H atom pairs on carboxyl and hydroxyl groups with interatomic distances of 4-5 Å on a carboxylic acid buffer appears to determine the rate of deamidation. Collectively, the results from structural and thermodynamic analyses indicate that carboxylic acids presumably form multiple hydrogen bonds and charge-charge interactions with the relevant deamidation site and provide alignment between the reactive atoms on the side chain and backbone. We propose that carboxylic acids catalyze deamidation by stabilizing a specific, energetically unfavorable transition-state conformation of l-asparaginyl intermediate II that readily facilitates bond formation between the γ-carbonyl carbon and the deprotonated backbone nitrogen for cyclic imide formation.

  14. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  15. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  16. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  17. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  18. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  19. Improved analyses for soil carbohydrates, amino acids, and phenols: Tools for understanding soil processes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A process-level understanding of soil carbon (C) and nitrogen (N) cycling will be facilitated by precise measurement of biochemical compounds in soil organic matter. This review summarizes some recent developments in analyses for soil carbohydrates, amino compounds (amino acids and amino sugars), and...

  20. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza

    PubMed Central

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  1. An on-line potentiometric sequential injection titration process analyser for the determination of acetic acid.

    PubMed

    van Staden, J F; Mashamba, Mulalo G; Stefan, Raluca I

    2002-09-01

    An on-line potentiometric sequential injection titration process analyser for the determination of acetic acid is proposed. A solution of 0.1 mol L(-1) sodium chloride is used as carrier. Titration is achieved by aspirating acetic acid samples between two strong base-zone volumes into a holding coil and by channelling the stack of well-defined zones with flow reversal through a reaction coil to a potentiometric sensor where the peak widths were measured. A linear relationship between peak width and logarithm of the acid concentration was obtained in the range 1-9 g/100 mL. Vinegar samples were analysed without any sample pre-treatment. The method has a relative standard deviation of 0.4% with a sample frequency of 28 samples per hour. The results revealed good agreement between the proposed sequential injection and an automated batch titration method.
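    The linear relationship between peak width and the logarithm of acid concentration can be illustrated with a small calibration sketch. The slope, intercept and standards below are hypothetical values chosen only to show the arithmetic; they are not taken from the record, and the function names are illustrative.

```python
import math

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def concentration_from_width(width, slope, intercept):
    """Invert width = slope * log10(conc) + intercept."""
    return 10 ** ((width - intercept) / slope)

# Hypothetical standards (g/100 mL) inside the reported 1-9 g/100 mL range.
standards = [1.0, 3.0, 5.0, 7.0, 9.0]

# Made-up calibration parameters, used only to synthesise noise-free widths.
ASSUMED_SLOPE, ASSUMED_INTERCEPT = 12.0, 20.0
widths = [ASSUMED_SLOPE * math.log10(c) + ASSUMED_INTERCEPT for c in standards]

slope, intercept = fit_line([math.log10(c) for c in standards], widths)

# Read an "unknown" sample back from its (synthetic) peak width.
unknown_width = ASSUMED_SLOPE * math.log10(4.5) + ASSUMED_INTERCEPT
estimate = concentration_from_width(unknown_width, slope, intercept)
```

    With real data the widths would come from the potentiometric sensor, and the fit would only be trusted inside the calibrated range.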

  2. Cloning and sequencing of the Bet v 1-homologous allergen Fra a 1 in strawberry (Fragaria ananassa) shows the presence of an intron and little variability in amino acid sequence.

    PubMed

    Musidlowska-Persson, Anna; Alm, Rikard; Emanuelsson, Cecilia

    2007-02-01

    The Fra a 1 allergen in strawberry (Fragaria ananassa) is homologous to the major birch pollen allergen Bet v 1, which has numerous isoforms differing in terms of amino acid sequence and immunological impact. To map the extent of sequence differences in the Fra a 1 allergen, PCR cloning and sequencing was applied. Several genomic sequences of Fra a 1, with a length of either 584, 591 or 594 nucleotides, were obtained from three different strawberry varieties. All contained one intron, with the length of either 101 or 110 nucleotides. By sequencing 30 different clones, eight different DNA sequences were obtained, giving in total five potential Fra a 1 protein isoforms, with high sequence similarity (>97% sequence identity) and only seven positions of amino acid variability, which were largely confirmed by mass spectrometry of expressed proteins. We conclude that the sequence variability in the strawberry allergen Fra a 1 is small, within and between strawberry varieties, and that multiple spots, previously detected in 2DE, are presumably due to differences in post-translational modification rather than differences in amino acid sequence. The most abundant Fra a 1 isoform sequence, recombinantly expressed in Escherichia coli after removal of the intron, was recognized by IgE from strawberry allergic patients. It cross-reacted with antibodies to Bet v 1 and the homologous apple allergen Mal d 1 (61 and 78% sequence identity, respectively), and will be used in further analyses of variation in Fra a 1-expression.

  3. Automation of Molecular-Based Analyses: A Primer on Massively Parallel Sequencing

    PubMed Central

    Nguyen, Lan; Burnett, Leslie

    2014-01-01

    Recent advances in genetics have been enabled by new genetic sequencing techniques called massively parallel sequencing (MPS) or next-generation sequencing. Through the ability to sequence in parallel hundreds of thousands to millions of DNA fragments, the cost and time required for sequencing have dramatically decreased. There are a number of different MPS platforms currently available and being used in Australia. Although they differ in the underlying technology involved, their overall processes are very similar: DNA fragmentation, adaptor ligation, immobilisation, amplification, sequencing reaction and data analysis. MPS is being used in research, translational and, increasingly, clinical settings. Common applications include sequencing of whole genomes, whole exomes or targeted genes for disease-causing gene discovery, genetic diagnosis and targeted cancer therapy. Even though the revolution that is occurring with MPS is exciting due to its increasing use, improving and emerging technologies and new applications, significant challenges still exist. Particularly challenging issues are the bioinformatics required for data analysis, interpretation of results and the ethical dilemma of ‘incidental findings’. PMID:25336762

  4. Nonradioactive sequence-tagged microsatellite site analyses: a method transferable to the tropics.

    PubMed

    Lagoda, P J; Dambier, D; Grapin, A; Baurens, F C; Lanaud, C; Noyer, J L

    1998-02-01

    Utilization of existing isozyme analysis facilities to detect sequence-tagged microsatellite site (STMS) polymorphism or any simple sequence repeat (SSR) variation is described. Different parameters concerning the difficulties in transferring molecular techniques to less sophisticated laboratory infrastructures (i.e. tropical outstations) are discussed (e.g. reproducibility, efficacy, precision). Nonradioactive STMS analysis is bound to foster collaborative research between "biodiversity" and "biotechnology" centers.

  5. Factorial Moments Analyses Show a Characteristic Length Scale in DNA Sequences

    NASA Astrophysics Data System (ADS)

    Mohanty, A. K.; Narayana Rao, A. V. S. S.

    2000-02-01

    A unique feature of most of the DNA sequences, found through the factorial moments analysis, is the existence of a characteristic length scale around which the density distribution is nearly Poissonian. Above this point, the DNA sequences, irrespective of their intron contents, show long range correlations with a significant deviation from the Gaussian statistics, while, below this point, the DNA statistics are essentially Gaussian. The famous DNA walk representation is also shown to be a special case of the present analysis.

  6. Completion of the amino acid sequence of the alpha 1 chain from type I calf skin collagen. Amino acid sequence of alpha 1(I)B8.

    PubMed Central

    Glanville, R W; Breitkreutz, D; Meitinger, M; Fietzek, P P

    1983-01-01

    The complete amino acid sequence of the 279-residue CNBr peptide CB8 from the alpha 1 chain of type I calf skin collagen is presented. It was determined by sequencing overlapping fragments of CB8 produced by Staphylococcus aureus V8 proteinase, trypsin, Endoproteinase Arg-C and hydroxylamine. Tryptic cleavages were also made specific for lysine by blocking arginine residues with cyclohexane-1,2-dione. This completes the amino acid sequence analysis of the 1054-residue-long alpha 1(I) chain of calf skin collagen. PMID:6354180

  7. Chances and pitfalls of leaf wax biomarker analyses applied to fluvial sediment sequences - the example of a Holocene fluvial sediment-paleosol sequence from the upper Alazani River, eastern Georgia

    NASA Astrophysics Data System (ADS)

    von Suchodoletz, Hans; Bliedtner, Marcel; Zielhofer, Christoph; Faust, Dominik; Zech, Roland

    2016-04-01

    During the last decades, fluvial sediment sequences in many regions have been studied intensively to reconstruct Late Quaternary paleoenvironmental and paleohydrological conditions. However, analyses of leaf wax biomarkers, which are increasingly used to reconstruct paleoenvironmental and paleoclimatic conditions (e.g. from lake sediments or loess-paleosol sequences), have not yet been applied systematically to Late Quaternary fluvial sediments. Given the ubiquitous distribution of fluvial sediment sequences on the earth's surface, such investigations could strongly enhance the knowledge about former environmental conditions in many regions. For this conceptual study we analysed leaf wax biomarkers (long-chain n-alkanes, n-alkanoic acids) in a fluvial sediment-paleosol sequence from the upper Alazani River in eastern Georgia as an example to discuss general possibilities and pitfalls. Generally, biomarker records from fluvial archives can be divided into i) a catchment signal recorded in the fluvial sediment layers and ii) a local in-situ signal recorded in the intercalated paleosols. This offers the chance to reconstruct paleoenvironmental conditions both in the whole catchment and at the sampling site. However, potential pitfalls are, for example, that inherited catchment signals can bias the in-situ signal from paleosols, while intermediate sediment storage in the catchment prior to sediment deposition and postsedimentary processes may alter the original catchment signal in the fluvial sediment layers. Thus, leaf wax biomarker analyses of fluvial sediment sequences require care: the interpretation of the biomarker record strongly depends on the specific geomorphological and sedimentological conditions of the investigated site and of the catchment area.

  8. An Integrated Sequence-Structure Database incorporating matching mRNA sequence, amino acid sequence and protein three-dimensional structure data.

    PubMed Central

    Adzhubei, I A; Adzhubei, A A; Neidle, S

    1998-01-01

    We have constructed a non-homologous database, termed the Integrated Sequence-Structure Database (ISSD) which comprises the coding sequences of genes, amino acid sequences of the corresponding proteins, their secondary structure and straight phi,psi angles assignments, and polypeptide backbone coordinates. Each protein entry in the database holds the alignment of nucleotide sequence, amino acid sequence and the PDB three-dimensional structure data. The nucleotide and amino acid sequences for each entry are selected on the basis of exact matches of the source organism and cell environment. The current version 1.0 of ISSD is available on the WWW at http://www.protein.bio.msu.su/issd/ and includes 107 non-homologous mammalian proteins, of which 80 are human proteins. The database has been used by us for the analysis of synonymous codon usage patterns in mRNA sequences showing their correlation with the three-dimensional structure features in the encoded proteins. Possible ISSD applications include optimisation of protein expression, improvement of the protein structure prediction accuracy, and analysis of evolutionary aspects of the nucleotide sequence-protein structure relationship. PMID:9399866

  9. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses

    PubMed Central

    Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying

    2016-01-01

    Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants. PMID:27014326

  10. Genomic distribution and functional analyses of potential G-quadruplex-forming sequences in Saccharomyces cerevisiae

    PubMed Central

    Hershman, Steve G.; Chen, Qijun; Lee, Julia Y.; Kozak, Marina L.; Yue, Peng; Wang, Li-San; Johnson, F. Brad

    2008-01-01

    Although well studied in vitro, the in vivo functions of G-quadruplexes (G4-DNA and G4-RNA) are only beginning to be defined. Recent studies have demonstrated enrichment for sequences with intramolecular G-quadruplex forming potential (QFP) in transcriptional promoters of humans, chickens and bacteria. Here we survey the yeast genome for QFP sequences and similarly find strong enrichment for these sequences in upstream promoter regions, as well as weaker but significant enrichment in open reading frames (ORFs). Further, four findings are consistent with roles for QFP sequences in transcriptional regulation. First, QFP is correlated with upstream promoter regions with low histone occupancy. Second, treatment of cells with N-methyl mesoporphyrin IX (NMM), which binds G-quadruplexes selectively in vitro, causes significant upregulation of loci with QFP-possessing promoters or ORFs. NMM also causes downregulation of loci connected with the function of the ribosomal DNA (rDNA), which itself has high QFP. Third, ORFs with QFP are selectively downregulated in sgs1 mutants that lack the G4-DNA-unwinding helicase Sgs1p. Fourth, a screen for yeast mutants that enhance or suppress growth inhibition by NMM revealed enrichment for chromatin and transcriptional regulators, as well as telomere maintenance factors. These findings raise the possibility that QFP sequences form bona fide G-quadruplexes in vivo and thus regulate transcription. PMID:17999996
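    A genome survey of this kind can be sketched as a regular-expression scan. The record does not state the motif definition the authors used; the pattern below (four runs of at least three guanines separated by loops of 1-7 nucleotides) is a commonly used convention and is an assumption here, as are the function name and the example sequence.

```python
import re

# Assumed motif: G3+ N1-7 G3+ N1-7 G3+ N1-7 G3+ (not taken from the record).
QFP_PATTERN = re.compile(r"G{3,}(?:[ACGT]{1,7}G{3,}){3}")

def find_qfp(sequence):
    """Return (start, end) spans of non-overlapping candidate motifs.

    Only the strand supplied is scanned; a fuller survey would also scan
    the reverse complement to catch motifs on the opposite strand.
    Coordinates are 0-based with an exclusive end.
    """
    seq = sequence.upper()
    return [(m.start(), m.end()) for m in QFP_PATTERN.finditer(seq)]

# Hypothetical test sequence containing one telomere-like G-rich stretch.
example = "ATGGGTTAGGGTTAGGGTTAGGGCA"
hits = find_qfp(example)          # -> [(2, 23)]
no_hits = find_qfp("ATATATATAT")  # -> []
```

    Enrichment in promoters or ORFs would then be assessed by comparing hit counts in those regions against a suitable background, which this sketch does not attempt.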

  11. Technical note: improved methodology for analyses of acid detergent fiber and acid detergent lignin.

    PubMed

    Raffrenato, E; Van Amburgh, M E

    2011-07-01

    The objective of this study was to evaluate the methodology of the acid detergent lignin (ADL) assay in an effort to evaluate particle loss, improve repeatability, and decrease variation within and among samples. The original ADL method relied on asbestos as a filtering aid, but that was removed in 1989 with the mandate from the Environmental Protection Agency to eliminate asbestos in the environment. Furthermore, recent work on fiber methodology indicated that pore size in the Gooch sintered glass crucible (40-60 μm) was too large to trap all of the small particles associated with neutral detergent fiber (NDF) and acid detergent fiber (ADF). Thus, any loss of ADF could potentially result in a loss of ADL. Sixty forages including conventional and brown midrib corn silages, alfalfa silages and hays, mature grasses, early vegetative grasses, and 9 feces samples, were analyzed sequentially for ADF and ADL as outlined in the 1973 procedure of Van Soest except for the use of the asbestos fiber. A glass microfiber filter with a 1.5-μm pore size was chosen as a filtering aid because it met the criteria required by the assay: glass, heat resistant, acid resistant, chemically inert, and hydrophobic. To compare with the current ADF and ADL assays, the assays were conducted with either no filter or the glass filter inserted into crucibles, rinsed with acetone, and then according to the 1973 procedure of Van Soest. The samples analyzed covered a range from 18.11 to 55.79% ADF and from 0.96 to 9.94% ADL on a dry matter (DM) basis. With the use of the filter, the mean ADF values increased 4.2% and mean ADL values increased 18.9%. Overall, both ADF and ADL values were greater with the use of the glass microfiber filter than without, indicating that as the type of sample analyzed changed, use of the Gooch crucible without the filtering aid results in particle loss. The adoption of the use of a small pore size (1.5 μm) glass microfiber filter to improve filtration and recovery

  12. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    PubMed

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor.

  13. Purification, sequencing, and phylogenetic analyses of novel Lys-49 phospholipases A(2) from the venoms of rattlesnakes and other pit vipers.

    PubMed

    Tsai, I H; Chen, Y H; Wang, Y M; Tu, M C; Tu, A T

    2001-10-15

    Basic phospholipase A(2) homologs with Lys49 substitution at the essential Ca(2+)-binding site are present in the venom of pit vipers of many genera. However, they have not been found in rattlesnake venoms before. We have now screened for this protein in the venom of rattlesnakes and other less studied pit vipers. By gel filtration chromatography and RP-HPLC, Lys49-phospholipase-like proteins were purified from the venoms of two rattlers, Crotalus atrox and Crotalus m. molossus, and five nonrattlers, Porthidium nummifer, Porthidium godmani, Bothriechis schlegelii, Trimeresurus puniceus, and Trimeresurus albolabris. Their N-terminal amino acid sequences were shown to be characteristic for this phospholipase subfamily. The purified basic proteins from rattlesnakes caused myonecrosis and edema in experimental animals. We have also cloned the cDNAs and solved the complete sequences of four novel Lys49-phospholipases from the venom glands of C. atrox, P. godmani, B. schlegelii, and Deinagkistrodon acutus (hundred-pace). Phylogenetic analyses based on the amino acid sequences of 28 Lys49-phospholipases separate the pit vipers of the New World from those of the Old World, and the arboreal Asiatic species from the terrestrial Asiatic species. The implications of the phylogenetic tree for the systematics of pit vipers, and the structure-function relationship of the Lys49-phospholipases, are discussed.

  14. ADN-Viewer: a 3D approach for bioinformatic analyses of large DNA sequences.

    PubMed

    Hérisson, Joan; Ferey, Nicolas; Gros, Pierre-Emmanuel; Gherbi, Rachid

    2007-01-20

    Most biologists work on textual DNA sequences that are limited to the linear representation of DNA. In this paper, we address the potential offered by Virtual Reality for 3D modeling and immersive visualization of large genomic sequences. The representation of the 3D structure of naked DNA allows biologists to observe and analyze genomes in an interactive way at different levels. We developed a powerful software platform that provides a new point of view for sequence analysis: ADN-Viewer. Nevertheless, a classical eukaryotic chromosome of 40 million base pairs requires about 6 Gbytes of 3D data. In order to manage these huge amounts of data in real-time, we designed various scene management algorithms and immersive human-computer interaction for user-friendly data exploration. In addition, one bioinformatics study scenario is proposed.

  15. Analyses of DNA Base Sequences for Eukaryotes in Terms of Power Spectrum Method

    NASA Astrophysics Data System (ADS)

    Isohata, Yasuhiko; Hayashi, Masaki

    2005-02-01

    By adopting a power spectrum method we have analyzed long-range correlations in the gene base sequences, exons and introns for five or six eukaryote species. As a measure of the long-range correlations, we have used an exponent α in 1/f^α, which is an approximation of a power spectrum in a low-frequency region. We have analyzed frequency distributions of α and the dependence of its average values <α> on the sequence length for the five or six species, paying particular attention to the species dependence. We have shown that long-range correlations have been formed mainly due to the intron's elongation as well as by the sequence structures of introns acquired over the course of evolution.

  16. Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2000-01-01

    A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.

  17. Feasibility of mini-sequencing schemes based on nucleotide polymorphisms for microbial identification and population analyses.

    PubMed

    Araujo, Ricardo; Eusebio, Nadia; Caramalho, Rita

    2015-03-01

    Practical schemes based on single nucleotide polymorphisms (SNP) have been proposed as alternatives to simplify and replace the molecular methodologies based on the extensive sequencing analysis of genes. SNaPshot mini-sequencing has been used increasingly over the last decade and represents a fast and robust strategy to analyze critical polymorphisms. Such assays have been proposed to characterize some bacteria and microbial eukaryotes, and their feasibility is reviewed in the present manuscript. The mini-sequencing schemes showed high discriminatory power and competence for identification of microorganisms, but some specificity errors were still found, particularly for species of the Burkholderia cepacia complex and mycobacteria. SNP assays designed for other goals, e.g., comparison of strains, detection of serotypes, virulence, epidemic, and phylogenetic-related subgroups of isolates, can be very useful by facilitating the investigation of large collections of isolates. The next generation of SNP assays might consider the inclusion of a large number of markers to fully characterize microbial taxonomy and strains; nevertheless, these new technologies are still prone to errors and can largely benefit from integration with well-established mini-sequencing assays. Newly proposed molecular tools should be systematically tested in collections of isolates with high indexes of diversity and should guarantee interlaboratory validation.

  18. Analyses of binding sequences of the two LexA proteins of Xanthomonas axonopodis pathovar citri.

    PubMed

    Yang, Mei-Kwei; Hsu, Chien-Hsiu; Sung, Vin-Long

    2008-07-01

    Xanthomonas axonopodis pv. citri (X. axonopodis pv. citri) possesses two lexA genes, designated lexA1 and lexA2. Electrophoretic mobility shift data show that LexA1 binds to both lexA1 and lexA2 promoters, but LexA2 does not bind to the lexA1 promoter, suggesting that LexA1 and LexA2 play different roles in regulating the expression of SOS genes. In this study, we have determined that LexA2 binds to a 14-bp dyad-spacer-dyad palindromic sequence, 5'-TGTACAAATGTACA-3', located at nucleotides -41 to -28 relative to the translation start site of lexA2 of X. axonopodis pv. citri. The two spacer nucleotides in this sequence can be changed from AA to TT without affecting LexA2 binding; all other base deletions or substitutions abolish LexA2 binding. The LexA1 binding sequence in the promoter region of lexA2 is TTAGTACTAAAGTTATAA and is located at -133 to -116, and that in the lexA1 gene is AGTAGTAATACTACT located at nucleotides -19 to -5 relative to the translation start site of lexA1. Any base change in the latter sequence abolishes LexA1 binding.
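    The sequence facts quoted in this record can be cross-checked with a few lines of code. The sketch below splits the 14-bp LexA2 site into 6 + 2 + 6 nucleotides, which is one reading of "dyad-spacer-dyad" with two spacer nucleotides; the helper names are illustrative and not from the record.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def span_length(start, end):
    """Inclusive length of a site given two upstream (negative) positions."""
    return end - start + 1

lexa2_site = "TGTACAAATGTACA"
left, spacer, right = lexa2_site[:6], lexa2_site[6:8], lexa2_site[8:]

# Both half-sites are identical and each is its own reverse complement.
dyads_match = left == right == reverse_complement(left)

# Reverse-complementing the whole site turns the AA spacer into TT,
# the one substitution reported as not affecting LexA2 binding.
tt_variant = left + "TT" + right
rc_is_tt_variant = reverse_complement(lexa2_site) == tt_variant

# Quoted coordinates agree with the lengths of the quoted sequences.
lengths_agree = (
    span_length(-41, -28) == len(lexa2_site)
    and span_length(-133, -116) == len("TTAGTACTAAAGTTATAA")
    and span_length(-19, -5) == len("AGTAGTAATACTACT")
)
```

    All three checks evaluate to True for the sequences and coordinates as printed above.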

  19. Genome sequencing and analyses of the postharvest fungus Penicillium expansum R21

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Blue mold is the vernacular name of a common postharvest disease of stored apples, pears and quince that is caused by several common species of Penicillium. This study reports the draft genome sequence of Penicillium expansum strain R21, a strain isolated from a Red Delicious apple in 2011 in Pennsy...

  20. Whole genome sequence analyses of Xylella fastidiosa PD strains from different geographical regions

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome sequences were determined for two Pierce’s disease (PD) causing Xylella fastidiosa (Xf) strains, one from Florida and one from Taiwan. The Florida strain was ATCC 35879, the type of strain used as a standard reference for related taxonomy research. By contrast, the Taiwan strain used was only...

  1. Molecular analyses of a repetitive DNA sequence in wheat (Triticum aestivum L.).

    PubMed

    Ueng, P P; Hang, A; Tsang, H; Vega, J M; Wang, L; Burton, C S; He, F T; Liu, B

    2000-06-01

    A repetitive sequence designated WE35 was isolated from wheat genomic DNA. This sequence consists of a 320-bp repeat unit and represents approximately 0.002% of the total wheat DNA. It is unidirectionally distributed either continuously or discretely in the genome. Ladder-like banding patterns were observed in Southern blots when the wheat genomic DNA was restricted with endonuclease enzymes EcoRI, HincII, NciI, and NdeI, which is characteristic for tandemly organized sequences. Two DNA fragments in p451 were frequently associated with the WE35 repetitive unit in a majority of lambda wheat genomic clones. A 475-bp fragment homologous to the 5'-end long terminal repeat (LTR) of cereal retroelements was also found in some lambda wheat genomic clones containing the repetitive unit. Physical mapping by fluorescence in situ hybridization (FISH) indicated that one pair of wheat chromosomes could be specifically detected with the WE35 positive probe p551. WE35 can be considered a chromosome-specific repetitive sequence. This repetitive unit could be used as a molecular marker for genetic, phylogenetic, and evolutionary studies in the tribe Triticeae.

  2. Choice of Reference Sequence and Assembler for Alignment of Listeria monocytogenes Short-Read Sequence Data Greatly Influences Rates of Error in SNP Analyses

    PubMed Central

    Pightling, Arthur W.; Petronella, Nicholas; Pagotto, Franco

    2014-01-01

    The wide availability of whole-genome sequencing (WGS) and an abundance of open-source software have made detection of single-nucleotide polymorphisms (SNPs) in bacterial genomes an increasingly accessible and effective tool for comparative analyses. Thus, ensuring that real nucleotide differences between genomes (i.e., true SNPs) are detected at high rates and that the influences of errors (such as false positive SNPs, ambiguously called sites, and gaps) are mitigated is of utmost importance. The choices researchers make regarding the generation and analysis of WGS data can greatly influence the accuracy of short-read sequence alignments and, therefore, the efficacy of such experiments. We studied the effects of some of these choices, including: i) depth of sequencing coverage, ii) choice of reference-guided short-read sequence assembler, iii) choice of reference genome, and iv) whether to perform read-quality filtering and trimming, on our ability to detect true SNPs and on the frequencies of errors. We performed benchmarking experiments, during which we assembled simulated and real Listeria monocytogenes strain 08-5578 short-read sequence datasets of varying quality with four commonly used assemblers (BWA, MOSAIK, Novoalign, and SMALT), using reference genomes of varying genetic distances, and with or without read pre-processing (i.e., quality filtering and trimming). We found that assemblies of at least 50-fold coverage provided the most accurate results. In addition, MOSAIK yielded the fewest errors when reads were aligned to a nearly identical reference genome, while using SMALT to align reads against a reference sequence that is ∼0.82% distant from 08-5578 at the nucleotide level resulted in the detection of the greatest numbers of true SNPs and the fewest errors. Finally, we show that whether read pre-processing improves SNP detection depends upon the choice of reference sequence and assembler. In total, this study demonstrates that researchers should

  3. The Complete Chloroplast DNA Sequence of Eleutherococcus senticosus (Araliaceae); Comparative Evolutionary Analyses with Other Three Asterids

    PubMed Central

    Yi, Dong-Keun; Lee, Hae-Lim; Sun, Byung-Yun; Chung, Mi Yoon; Kim, Ki-Joong

    2012-01-01

    This study reports the complete chloroplast (cp) DNA sequence of Eleutherococcus senticosus (GenBank: JN 637765), an endangered endemic species. The genome is 156,768 bp in length, and contains a pair of inverted repeat (IR) regions of 25,930 bp each, a large single copy (LSC) region of 86,755 bp and a small single copy (SSC) region of 18,153 bp. The structural organization, gene and intron contents, gene order, AT content, codon usage, and transcription units of the E. senticosus chloroplast genome are similar to that of typical land plant cp DNA. We aligned and analyzed the sequences of 86 coding genes, 19 introns and 113 intergenic spacers (IGS) in three different taxonomic hierarchies; Eleutherococcus vs. Panax, Eleutherococcus vs. Daucus, and Eleutherococcus vs. Nicotiana. The distribution of indels, the number of polymorphic sites and nucleotide diversity indicate that positional constraint is more important than functional constraint for the evolution of cp genome sequences in Asterids. For example, the intron sequences in the LSC region exhibited base substitution rates 5-11-times higher than that of the IR regions, while the intron sequences in the SSC region evolved 7-14-times faster than those in the IR region. Furthermore, the Ka/Ks ratio of the gene coding sequences supports a stronger evolutionary constraint in the IR region than in the LSC or SSC regions. Therefore, our data suggest that selective sweeps by base collection mechanisms more frequently eliminate polymorphisms in the IR region than in other regions. Chloroplast genome regions that have high levels of base substitutions also show higher incidences of indels. Thirty-five simple sequence repeat (SSR) loci were identified in the Eleutherococcus chloroplast genome. Of these, 27 are homopolymers, while six are di-polymers and two are tri-polymers. In addition to the SSR loci, we also identified 18 medium size repeat units ranging from 22 to 79 bp, 11 of which are distributed in the IGS or
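    The region sizes and repeat counts quoted in this record are internally consistent, which can be confirmed with a short arithmetic check. The sketch assumes the usual quadripartite layout (one LSC, one SSC and two IR copies); the variable and function names are illustrative.

```python
# Sizes in bp, as quoted in the abstract.
LSC = 86_755
SSC = 18_153
IR = 25_930            # each of the two inverted repeat copies
REPORTED_TOTAL = 156_768

def quadripartite_length(lsc, ssc, ir):
    """Genome length for one LSC, one SSC and a pair of IR regions."""
    return lsc + ssc + 2 * ir

computed_total = quadripartite_length(LSC, SSC, IR)   # 156768

# SSR loci by repeat type, as quoted in the abstract.
ssr_counts = {"homopolymer": 27, "di-polymer": 6, "tri-polymer": 2}
ssr_total = sum(ssr_counts.values())                  # 35

# Share of the genome occupied by the two IR copies.
ir_fraction = 2 * IR / REPORTED_TOTAL
```

    The computed total equals the reported 156,768 bp, and the three SSR classes sum to the thirty-five loci stated.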

  4. The complete chloroplast DNA sequence of Eleutherococcus senticosus (Araliaceae); comparative evolutionary analyses with other three asterids.

    PubMed

    Yi, Dong-Keun; Lee, Hae-Lim; Sun, Byung-Yun; Chung, Mi Yoon; Kim, Ki-Joong

    2012-05-01

    This study reports the complete chloroplast (cp) DNA sequence of Eleutherococcus senticosus (GenBank: JN 637765), an endangered endemic species. The genome is 156,768 bp in length, and contains a pair of inverted repeat (IR) regions of 25,930 bp each, a large single copy (LSC) region of 86,755 bp and a small single copy (SSC) region of 18,153 bp. The structural organization, gene and intron contents, gene order, AT content, codon usage, and transcription units of the E. senticosus chloroplast genome are similar to that of typical land plant cp DNA. We aligned and analyzed the sequences of 86 coding genes, 19 introns and 113 intergenic spacers (IGS) in three different taxonomic hierarchies; Eleutherococcus vs. Panax, Eleutherococcus vs. Daucus, and Eleutherococcus vs. Nicotiana. The distribution of indels, the number of polymorphic sites and nucleotide diversity indicate that positional constraint is more important than functional constraint for the evolution of cp genome sequences in Asterids. For example, the intron sequences in the LSC region exhibited base substitution rates 5-11-times higher than that of the IR regions, while the intron sequences in the SSC region evolved 7-14-times faster than those in the IR region. Furthermore, the Ka/Ks ratio of the gene coding sequences supports a stronger evolutionary constraint in the IR region than in the LSC or SSC regions. Therefore, our data suggest that selective sweeps by base collection mechanisms more frequently eliminate polymorphisms in the IR region than in other regions. Chloroplast genome regions that have high levels of base substitutions also show higher incidences of indels. Thirty-five simple sequence repeat (SSR) loci were identified in the Eleutherococcus chloroplast genome. Of these, 27 are homopolymers, while six are di-polymers and two are tri-polymers. In addition to the SSR loci, we also identified 18 medium size repeat units ranging from 22 to 79 bp, 11 of which are distributed in the IGS or

  5. Phylogenetic analysis of beta-papillomaviruses as inferred from nucleotide and amino acid sequence data.

    PubMed

    Gottschling, Marc; Köhler, Anja; Stockfleth, Eggert; Nindl, Ingo

    2007-01-01

    Human papillomaviruses (HPV) of the beta-group seem to be involved in the pathogenesis of non-melanoma skin cancer. Papillomaviruses are host specific and are considered closely co-evolving with their hosts. Evolutionary incongruence between early genes and late genes has been reported among oncogenic genital alpha-papillomaviruses and considerably challenges phylogenetic reconstructions. We investigated the relationships of 29 beta-HPV (25 types plus four putative new types, subtypes, or variants) as inferred from codon aligned and amino acid sequence data of the genes E1, E2, E6, E7, L1, and L2 using likelihood, distance, and parsimony approaches. An analysis of a L1 fragment included additional nucleotide and amino acid sequences from seven non-human beta-papillomaviruses. Early genes and late genes evolution did not conflict significantly in beta-papillomaviruses based on partition homogeneity tests (p ≥ 0.001). As inferred from the complete genome analyses, beta-papillomaviruses were monophyletic and segregated into four highly supported monophyletic assemblages corresponding to the species 1, 2, 3, and fused 4/5. They basically split into the species 1 and the remainder of beta-papillomaviruses, whose species 3, 4, and 5 constituted the sistergroup of species 2. beta-Papillomaviruses have been isolated from humans, apes, and monkeys, and phylogenetic analyses of the L1 fragment showed non-human papillomaviruses highly polyphyletic nesting within the HPV species. Thus, host and virus phylogenies were not congruent in beta-papillomaviruses, and multiple invasions across species borders may contribute (additionally to host-linked evolution) to their diversification.

  6. Isotopic and molecular analyses of hydrocarbons and monocarboxylic acids of the Murchison meteorite

    NASA Technical Reports Server (NTRS)

    Krishnamurthy, R. V.; Epstein, S.; Cronin, John R.; Pizzarello, Sandra; Yuen, George U.

    1992-01-01

    The monocarboxylic acids and hydrocarbons of the Murchison meteorite (CM2) were isolated for isotopic analysis. The nonvolatile hydrocarbons were analyzed as crude methanol and benzene-methanol extracts and also after separation by silica gel chromatography into predominantly aliphatic, aromatic, and polar hydrocarbon fractions. The volatile hydrocarbons were obtained after progressive decomposition of the meteorite matrix by freeze-thaw, hot water, and acid treatment. Molecular analyses of the aromatic hydrocarbons showed them to comprise a complex suite of compounds in which pyrene, fluoranthene, phenanthrene, and acenaphthene were the most abundant components, a result similar to earlier analyses. The polar hydrocarbons also comprise a very complex mixture in which aromatic ketones, nitrogen, and sulfur heterocycles were identified. The monocarboxylic acids, aliphatic, aromatic, and polar hydrocarbons, and the indigenous volatile hydrocarbons were found to be D-rich. The deuterium enrichment observed in these compounds is suggestive. In two separate analyses, the delta-D values of the nonvolatile hydrocarbons were observed to increase in the following order: aliphatic-aromatic-polar. This finding is consistent with an early solar system or parent body conversion of aromatic to aliphatic compounds as well as the suggestion of pyrolytic formation of aromatic from aliphatic compounds.

  7. Across the Gap: Geochronological and Sedimentological Analyses from the Late Pleistocene-Holocene Sequence of Goda Buticha, Southeastern Ethiopia.

    PubMed

    Tribolo, Chantal; Asrat, Asfawossen; Bahain, Jean-Jacques; Chapon, Cécile; Douville, Eric; Fragnol, Carole; Hernandez, Marion; Hovers, Erella; Leplongeon, Alice; Martin, Loïc; Pleurdeau, David; Pearson, Osbjorn; Puaud, Simon; Assefa, Zelalem

    2017-01-01

    Goda Buticha is a cave site near Dire Dawa in southeastern Ethiopia that contains an archaeological sequence sampling the late Pleistocene and Holocene of the region. The sedimentary sequence displays complex cultural, chronological and sedimentological histories that seem incongruent with one another. A first set of radiocarbon ages suggested a long sedimentological gap from the end of Marine Isotopic Stage (MIS) 3 to the mid-Holocene. Macroscopic observations suggest that the main sedimentological change does not coincide with the chronostratigraphic hiatus. The cultural sequence shows technological continuity with a late persistence of artifacts that are usually attributed to the Middle Stone Age into the younger parts of the stratigraphic sequence, yet become increasingly associated with lithic artifacts typically related to the Later Stone Age. While not a unique case, this combination of features is unusual in the Horn of Africa. In order to evaluate the possible implications of these observations, sedimentological analyses combined with optically stimulated luminescence (OSL) were conducted. The OSL data now extend the radiocarbon chronology up to 63 ± 7 ka; they also confirm the existence of the chronological gap between 24.8 ± 2.6 ka and 7.5 ± 0.3 ka. The sedimentological analyses suggest that the origin and mode of deposition were largely similar throughout the whole sequence, although the anthropic and faunal activities increased in the younger levels. Regional climatic records are used to support the sedimentological observations and interpretations. We discuss the implications of the sedimentological and dating analyses for understanding cultural processes in the region.

  8. Across the Gap: Geochronological and Sedimentological Analyses from the Late Pleistocene-Holocene Sequence of Goda Buticha, Southeastern Ethiopia

    PubMed Central

    Asrat, Asfawossen; Bahain, Jean-Jacques; Chapon, Cécile; Douville, Eric; Fragnol, Carole; Hernandez, Marion; Hovers, Erella; Leplongeon, Alice; Martin, Loïc; Pleurdeau, David; Pearson, Osbjorn; Puaud, Simon; Assefa, Zelalem

    2017-01-01

    Goda Buticha is a cave site near Dire Dawa in southeastern Ethiopia that contains an archaeological sequence sampling the late Pleistocene and Holocene of the region. The sedimentary sequence displays complex cultural, chronological and sedimentological histories that seem incongruent with one another. A first set of radiocarbon ages suggested a long sedimentological gap from the end of Marine Isotopic Stage (MIS) 3 to the mid-Holocene. Macroscopic observations suggest that the main sedimentological change does not coincide with the chronostratigraphic hiatus. The cultural sequence shows technological continuity with a late persistence of artifacts that are usually attributed to the Middle Stone Age into the younger parts of the stratigraphic sequence, yet become increasingly associated with lithic artifacts typically related to the Later Stone Age. While not a unique case, this combination of features is unusual in the Horn of Africa. In order to evaluate the possible implications of these observations, sedimentological analyses combined with optically stimulated luminescence (OSL) were conducted. The OSL data now extend the radiocarbon chronology up to 63 ± 7 ka; they also confirm the existence of the chronological gap between 24.8 ± 2.6 ka and 7.5 ± 0.3 ka. The sedimentological analyses suggest that the origin and mode of deposition were largely similar throughout the whole sequence, although the anthropic and faunal activities increased in the younger levels. Regional climatic records are used to support the sedimentological observations and interpretations. We discuss the implications of the sedimentological and dating analyses for understanding cultural processes in the region. PMID:28125597

  9. Genetic Analyses of the Internal Transcribed Spacer Sequences Suggest Introgression and Duplication in the Medicinal Mushroom Agaricus subrufescens.

    PubMed

    Chen, Jie; Moinard, Magalie; Xu, Jianping; Wang, Shouxian; Foulongne-Oriol, Marie; Zhao, Ruilin; Hyde, Kevin D; Callac, Philippe

    2016-01-01

    The internal transcribed spacer (ITS) region of the nuclear ribosomal RNA gene cluster is widely used in fungal taxonomy and phylogeographic studies. The medicinal and edible mushroom Agaricus subrufescens has a worldwide distribution with a high level of polymorphism in the ITS region. A previous analysis suggested notable ITS sequence heterogeneity within the wild French isolate CA487. The objective of this study was to investigate the pattern and potential mechanism of ITS sequence heterogeneity within this strain. Using PCR, cloning, and sequencing, we identified three types of ITS sequences, A, B, and C with a balanced distribution, which differed from each other at 13 polymorphic positions. The phylogenetic comparisons with samples from different continents revealed that the type C sequence was similar to those found in Oceanian and Asian specimens of A. subrufescens while types A and B sequences were close to those found in the Americas or in Europe. We further investigated the inheritance of these three ITS sequence types by analyzing their distribution among single-spore isolates from CA487. In this analysis, three co-dominant markers were used firstly to distinguish the homokaryotic offspring from the heterokaryotic offspring. The homokaryotic offspring were then analyzed for their ITS types. Our genetic analyses revealed that types A and B were two alleles segregating at one locus ITSI, while type C was not allelic with types A and B but was located at another unlinked locus ITSII. Furthermore, type C was present in only one of the two constitutive haploid nuclei (n) of the heterokaryotic (n+n) parent CA487. These data suggest that there was a relatively recent introduction of the type C sequence and a duplication of the ITS locus in this strain. Whether other genes were also transferred and duplicated and their impacts on genome structure and stability remain to be investigated.

  10. Genetic Analyses of the Internal Transcribed Spacer Sequences Suggest Introgression and Duplication in the Medicinal Mushroom Agaricus subrufescens

    PubMed Central

    Chen, Jie; Moinard, Magalie; Xu, Jianping; Wang, Shouxian; Foulongne-Oriol, Marie; Zhao, Ruilin; Hyde, Kevin D.; Callac, Philippe

    2016-01-01

    The internal transcribed spacer (ITS) region of the nuclear ribosomal RNA gene cluster is widely used in fungal taxonomy and phylogeographic studies. The medicinal and edible mushroom Agaricus subrufescens has a worldwide distribution with a high level of polymorphism in the ITS region. A previous analysis suggested notable ITS sequence heterogeneity within the wild French isolate CA487. The objective of this study was to investigate the pattern and potential mechanism of ITS sequence heterogeneity within this strain. Using PCR, cloning, and sequencing, we identified three types of ITS sequences, A, B, and C with a balanced distribution, which differed from each other at 13 polymorphic positions. The phylogenetic comparisons with samples from different continents revealed that the type C sequence was similar to those found in Oceanian and Asian specimens of A. subrufescens while types A and B sequences were close to those found in the Americas or in Europe. We further investigated the inheritance of these three ITS sequence types by analyzing their distribution among single-spore isolates from CA487. In this analysis, three co-dominant markers were used firstly to distinguish the homokaryotic offspring from the heterokaryotic offspring. The homokaryotic offspring were then analyzed for their ITS types. Our genetic analyses revealed that types A and B were two alleles segregating at one locus ITSI, while type C was not allelic with types A and B but was located at another unlinked locus ITSII. Furthermore, type C was present in only one of the two constitutive haploid nuclei (n) of the heterokaryotic (n+n) parent CA487. These data suggest that there was a relatively recent introduction of the type C sequence and a duplication of the ITS locus in this strain. Whether other genes were also transferred and duplicated and their impacts on genome structure and stability remain to be investigated. PMID:27228131

  11. Naked but Not Hairless: The Pitfalls of Analyses of Molecular Adaptation Based on Few Genome Sequence Comparisons

    PubMed Central

    Delsuc, Frédéric; Tilak, Marie-Ka

    2015-01-01

    The naked mole-rat (Heterocephalus glaber) is the only rodent species that naturally lacks fur. Genome sequencing of this atypical rodent species recently shed light on a number of its morphological and physiological adaptations. More specifically, its hairless phenotype has been traced back to a single amino acid change (C397W) in the hair growth associated (HR) protein (or Hairless). By considering the available species diversity, we show that this specific position is in fact variable across mammals, including in the horse that was misleadingly reported to have the ancestral Cysteine. Moreover, by sequencing the corresponding HR exon in additional rodent species, we demonstrate that the C397W substitution is actually not a peculiarity of the naked mole-rat. Instead, this specific amino acid substitution is present in all hystricognath rodents investigated, which are all fully furred, including the naked mole-rat's closest relative, the Damaraland mole-rat (Fukomys damarensis). Overall, we found no statistical correlation between amino acid changes at position 397 of the HR protein and reduced pilosity across the mammalian phylogeny. This demonstrates that this single amino acid change does not explain the naked mole-rat hairless phenotype. Our case study calls for caution before making strong claims regarding the molecular basis of phenotypic adaptation based on the screening of specific amino acid substitutions using only a few model species in genome sequence comparisons. It also exposes the more general problem of the dilution of essential information in the supplementary material of genome papers, thereby increasing the probability that misleading results will escape the scrutiny of editors, reviewers, and ultimately readers. PMID:25714745

  12. Naked but not Hairless: the pitfalls of analyses of molecular adaptation based on few genome sequence comparisons.

    PubMed

    Delsuc, Frédéric; Tilak, Marie-Ka

    2015-02-20

    The naked mole-rat (Heterocephalus glaber) is the only rodent species that naturally lacks fur. Genome sequencing of this atypical rodent species recently shed light on a number of its morphological and physiological adaptations. More specifically, its hairless phenotype has been traced back to a single amino acid change (C397W) in the hair growth associated (HR) protein (or Hairless). By considering the available species diversity, we show that this specific position is in fact variable across mammals, including in the horse that was misleadingly reported to have the ancestral Cysteine. Moreover, by sequencing the corresponding HR exon in additional rodent species, we demonstrate that the C397W substitution is actually not a peculiarity of the naked mole-rat. Instead, this specific amino acid substitution is present in all hystricognath rodents investigated, which are all fully furred, including the naked mole-rat's closest relative, the Damaraland mole-rat (Fukomys damarensis). Overall, we found no statistical correlation between amino acid changes at position 397 of the HR protein and reduced pilosity across the mammalian phylogeny. This demonstrates that this single amino acid change does not explain the naked mole-rat hairless phenotype. Our case study calls for caution before making strong claims regarding the molecular basis of phenotypic adaptation based on the screening of specific amino acid substitutions using only a few model species in genome sequence comparisons. It also exposes the more general problem of the dilution of essential information in the supplementary material of genome papers, thereby increasing the probability that misleading results will escape the scrutiny of editors, reviewers, and ultimately readers.

  13. Transcriptome analyses of Sclerotinia sclerotiorum infecting chickpea and lentil using RNA sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Sclerotinia sclerotiorum causes white mold of many important crops. To elucidate its pathogenic mechanisms, transcriptome analyses were used to study its interactions with chickpea and lentil. Five mRNA libraries were constructed from S. sclerotiorum (strain WM-A1), healthy chickpea (cv. Spanish Whit...

  14. Complete nuclear ribosomal DNA sequence amplification and molecular analyses of Bangia (Bangiales, Rhodophyta) from China

    NASA Astrophysics Data System (ADS)

    Xu, Jiajie; Jiang, Bo; Chai, Sanming; He, Yuan; Zhu, Jianyi; Shen, Zonggen; Shen, Songdong

    2016-09-01

    Filamentous Bangia, which are distributed extensively throughout the world, have simple and similar morphological characteristics. Scientists can classify these organisms using molecular markers in combination with morphology. We successfully sequenced the complete nuclear ribosomal DNA, approximately 13 kb in length, from a marine Bangia population. We further analyzed the small subunit ribosomal DNA gene (nrSSU) and the internal transcribed spacer (ITS) sequence regions along with nine other marine, and two freshwater Bangia samples from China. Pairwise distances of the nrSSU and 5.8S ribosomal DNA gene sequences show the marine samples grouping together with low divergences (0-0.003; 0-0.006, respectively) from each other, but high divergences (0.123-0.126; 0.198, respectively) from freshwater samples. An exception is the marine sample collected from Weihai, which shows high divergence from both other marine samples (0.063-0.065; 0.129, respectively) and the freshwater samples (0.097; 0.120, respectively). A maximum likelihood phylogenetic tree based on a combined SSU-ITS dataset shows the samples divided into three clades, with the two marine sample clades containing Bangia spp. from North America, Europe, Asia, and Australia; and one freshwater clade, containing Bangia atropurpurea from North America and China.
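
    Pairwise distances of the kind reported in this record can be illustrated with the simplest such measure, the uncorrected p-distance (the proportion of aligned sites that differ). The Python sketch below is only an illustration: the abstract does not state which distance model the authors used, and the function name and toy sequences are hypothetical.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap ('-') are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # ignore gapped columns
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Toy aligned fragments (hypothetical, not data from the study).
marine_1 = "ACGTACGTAC"
marine_2 = "ACGTACGTAC"
freshwater = "ACGAACGTTC"

print(p_distance(marine_1, marine_2))    # 0.0
print(p_distance(marine_1, freshwater))  # 0.2
```

    Published analyses usually apply a substitution model that corrects for multiple hits, so values from this sketch would not be directly comparable to the divergences quoted above.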

  15. Trichomonas vaginalis acidic phospholipase A2: isolation and partial amino acid sequence.

    PubMed

    Escobedo-Guajardo, Brenda L; González-Salazar, Francisco; Palacios-Corona, Rebeca; Torres de la Cruz, Víctor M; Morales-Vallarta, Mario; Mata-Cárdenas, Benito D; Garza-González, Jesús N; Rivera-Silva, Gerardo; Vargas-Villarreal, Javier

    2013-12-01

    Sexually transmitted diseases are a major cause of acute disease worldwide, and trichomoniasis is the most common and curable disease, generating more than 170 million cases annually worldwide. Trichomonas vaginalis is the causal agent of trichomoniasis and has the ability to destroy in vitro cell monolayers of the vaginal mucosa, where the phospholipases A2 (PLA2) have been reported as potential virulence factors. These enzymes have been partially characterized from the subcellular fraction S30 of pathogenic T. vaginalis strains. The main objective of this study was to purify a phospholipase A2 from T. vaginalis, make a partial characterization, obtain a partial amino acid sequence, and determine its enzymatic participation as hemolytic factor causing lysis of erythrocytes. Trichomonas S30, RF30 and UFF30 sub-fractions from GT-15 strain have the capacity to hydrolyze [2-(14)C-PA]-PC at pH 6.0. Proteins from the UFF30 sub-fraction were separated by affinity chromatography into two eluted fractions with detectable PLA2 activity. The EDTA-eluted fraction was analyzed by HPLC using on-line HPLC-tandem mass spectrometry and two protein peaks were observed at 8.2 and 13 kDa. Peptide sequences were identified from the proteins present in the eluted EDTA UFF30 fraction; bioinformatic analysis using Protein Link Global Server charged with T. vaginalis protein database suggests that eluted peptides correspond to a putative ubiquitin protein in the 8.2 kDa fraction and a phospholipase preserved in the 13 kDa fraction. The EDTA-eluted fraction hydrolyzed [2-(14)C-PA]-PC and lysed erythrocytes from Sprague-Dawley rats in a time- and dose-dependent manner. The acidic hemolytic activity decreased by 84% with the addition of 100 μM of Rosenthal's inhibitor.

  16. Diversity of anaerobic gut fungal populations analysed using ribosomal ITS1 sequences in faeces of wild and domesticated herbivores.

    PubMed

    Nicholson, Matthew J; McSweeney, Christopher S; Mackie, Roderick I; Brookman, Jayne L; Theodorou, Michael K

    2010-04-01

    Gut fungal-specific PCR primers have been used to selectively amplify the ITS1 region of gut fungal rDNA recovered from faeces of domestic and wild animals to investigate population diversity. Two different gel-based methods are described for separating populations of gut fungal rDNA amplicons, namely (1) denaturing gradient gel electrophoresis (DGGE) and (2) separation according to small size differences using Spreadex, a proprietary matrix for electrophoresis. Gut fungal populations were characterised by analysis of rDNA in faeces of seventeen domesticated and ten wild herbivores. Sequences derived from these gel-based characterisations were analysed and classified using a hidden Markov model-based fingerprint matching algorithm. Faecal samples contained a broad spectrum of fungi and sequences from five of the six recognised genera were identified, including Cyllamyces, the most recently described gut fungal genus, which was found to be widely distributed in the samples. Furthermore, four other novel groupings of gut fungal sequences were identified that did not cluster with sequences from any of the previously described genera. Both gel- and sequence-based profiles for gut fungal populations suggested a lack of geographical restriction on occurrence of any individual fungal type.

  17. Gene Sequence Analyses of the Healthy Oral Microbiome in Humans and Companion Animals.

    PubMed

    Davis, Eric M

    2016-06-01

    It has long been accepted that certain oral bacterial species are responsible for the development of periodontal disease. However, the focus of microbial and immunological research is shifting from studying the organisms associated with disease to examining the indigenous microbial inhabitants that are present in health. Microbiome refers to the aggregate genetic material of all microorganisms living in, or on, a defined habitat. Recent developments in gene sequence analysis have enabled detection and identification of bacteria from polymicrobial samples, including subgingival plaque. Diversity surveys utilizing this technology have demonstrated that bacterial culture techniques have vastly underestimated the richness and diversity of microorganisms in vivo, since only certain bacteria grow in vitro. Surveys using gene sequence analysis have demonstrated that the healthy oral microbiome is composed of an unexpectedly high number of diverse species, including putative pathogens. These findings support the view that coevolution of microorganisms and macroscopic hosts has occurred in which certain microorganisms have adapted to survive in the oral cavity and host immune tolerance has allowed the establishment of a symbiotic relationship in which both parties receive benefits (mutualism). This review describes gene sequence analysis as an increasingly common, culture-independent tool for detecting bacteria in vivo and describes the results of recent oral microbiome diversity surveys of clinically healthy humans, dogs, and cats. Six bacterial phyla consistently dominated the healthy oral microbiome of all 3 host species. Previous hypotheses on etiology of periodontitis are reviewed in light of new scientific findings. Finally, the consideration that clinically relevant periodontal disease occurs when immune tolerance of the symbiotic oral microbiome is altered to a proinflammatory response will be discussed.

  18. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-03-24

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
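
    The logic of the two immobilization steps described in this record can be sketched as two successive filters over a pool of fragments. The Python sketch below is a toy model under stated assumptions: each fragment is reduced to the set of sequence types it carries, and the fragment names, chromosome labels and the `capture` helper are all hypothetical, not part of the patented method.

```python
# Each fragment is modelled as the set of sequence types (here, chromosome
# labels) it carries. Hypothetical toy data for illustration only.
fragments = {
    "frag1": {"chr1"},
    "frag2": {"chr2"},
    "frag3": {"chr1", "chr2"},  # carries both types: a candidate aberration
    "frag4": {"chr3"},
}


def capture(pool, sequence_type):
    """Keep only the fragments in `pool` that contain `sequence_type`."""
    return {name: types for name, types in pool.items() if sequence_type in types}


# First immobilization: isolate fragments containing the first sequence type.
first_set = capture(fragments, "chr1")

# Second immobilization: from within the first set, isolate fragments that
# also contain the second sequence type.
second_set = capture(first_set, "chr2")

print(sorted(first_set))   # ['frag1', 'frag3']
print(sorted(second_set))  # ['frag3']
```

    Any fragment that survives both filters carries both sequence types on the same molecule, which is the condition the abstract treats as indicating an aberration.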

  19. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.

  20. Genome sequence analyses show that Neisseria oralis is the same species as ‘Neisseria mucosa var. heidelbergensis’

    PubMed Central

    Jolley, Keith A.; Maiden, Martin C. J.

    2013-01-01

    Phylogenies generated from whole genome sequence (WGS) data provide definitive means of bacterial isolate characterization for typing and taxonomy. The species status of strains recently defined with conventional taxonomic approaches as representing Neisseria oralis was examined by the analysis of sequences derived from WGS data, specifically: (i) 53 Neisseria ribosomal protein subunit (rps) genes (ribosomal multi-locus sequence typing, rMLST); and (ii) 246 Neisseria core genes (core genome MLST, cgMLST). These data were compared with phylogenies derived from 16S and 23S rRNA gene sequences, demonstrating that the N. oralis strains were monophyletic with strains described previously as representing ‘Neisseria mucosa var. heidelbergensis’ and that this group was of equivalent taxonomic status to other well-described species of the genus Neisseria. Phylogenetic analyses also indicated that Neisseria sicca and Neisseria macacae should be considered the same species as Neisseria mucosa and that Neisseria flavescens should be considered the same species as Neisseria subflava. Analyses using rMLST showed that some strains currently defined as belonging to the genus Neisseria were more closely related to species belonging to other genera within the family; however, whole genome analysis of a more comprehensive selection of strains from within the family Neisseriaceae would be necessary to confirm this. We suggest that strains previously identified as representing ‘N. mucosa var. heidelbergensis’ and deposited in culture collections should be renamed N. oralis. Finally, one of the strains of N. oralis was able to ferment lactose, due to the presence of β-galactosidase and lactose permease genes, a characteristic previously thought to be unique to Neisseria lactamica, which therefore cannot be thought of as diagnostic for this species; however, the rMLST and cgMLST analyses confirm that N. oralis is most closely related to N. mucosa. PMID:24097834

  1. The amino acid sequence of protein CM-3 from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J

    1985-01-01

    Protein CM-3 from Dendroaspis polylepis polylepis venom was purified by gel filtration and ion exchange chromatography. It comprises 65 amino acids including eight half-cystines. The complete amino acid sequence of protein CM-3 has been elucidated. The sequence (residues 1-50) resembles that of the N-terminal sequence of the subunits of a synergistic type protein and residues 51-65 that of the C-terminal sequence of an angusticeps type protein. Mixtures of protein CM-3 and angusticeps type proteins showed no apparent synergistic effect, in that their toxicity in combination was no greater than the sum of their individual toxicities.

  2. The amino acid sequences of the Fd fragments of two human γ heavy chains

    PubMed Central

    Press, E. M.; Hogg, N. M.

    1970-01-01

    The amino acid sequences of the Fd fragments of two human pathological immunoglobulins of the immunoglobulin G1 class are reported. Comparison of the two sequences shows that the heavy-chain variable regions are similar in length to those of the light chains. The existence of heavy chain variable region subgroups is also deduced, from a comparison of these two sequences with those of another γ 1 chain, Eu, a μ chain, Ou, and the partial sequence of a fourth γ 1 chain, Ste. Carbohydrate has been found to be linked to an aspartic acid residue in the variable region of one of the γ 1 chains, Cor. PMID:5449120

  3. Complete Genome Sequence and Immunoproteomic Analyses of the Bacterial Fish Pathogen Streptococcus parauberis

    PubMed Central

    Nho, Seong Won; Hikima, Jun-ichi; Cha, In Seok; Park, Seong Bin; Jang, Ho Bin; del Castillo, Carmelo S.; Kondo, Hidehiro; Hirono, Ikuo; Aoki, Takashi; Jung, Tae Sung

    2011-01-01

    Although Streptococcus parauberis is known as a bacterial pathogen associated with bovine udder mastitis, it has recently become one of the major causative agents of olive flounder (Paralichthys olivaceus) streptococcosis in northeast Asia, causing massive mortality resulting in severe economic losses. S. parauberis contains two serotypes, and it is likely that capsular polysaccharide antigens serve to differentiate the serotypes. In the present study, the complete genome sequence of S. parauberis (serotype I) was determined using the GS-FLX system to investigate its phylogeny, virulence factors, and antigenic proteins. S. parauberis possesses a single chromosome of 2,143,887 bp containing 1,868 predicted coding sequences (CDSs), with an average GC content of 35.6%. Whole-genome dot plot analysis and phylogenetic analysis of a 60-kDa chaperonin-encoding gene and the glyceraldehyde-3-phosphate dehydrogenase (GAPDH)-encoding gene showed that the strain was evolutionarily closely related to Streptococcus uberis. S. parauberis antigenic proteins were analyzed using an immunoproteomic technique. Twenty-one antigenic protein spots were identified in S. parauberis, by reaction with an antiserum obtained from S. parauberis-challenged olive flounder. This work provides the foundation needed to understand more clearly the relationship between pathogen and host and to develop new approaches toward prophylactic and therapeutic strategies to deal with streptococcosis in fish. The work also provides a better understanding of the physiology and evolution of a significant representative of the Streptococcaceae. PMID:21531805
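
    The average GC content quoted in this record is the percentage of G and C bases in the sequence. A minimal Python sketch of that calculation follows; the function name and the toy sequence are hypothetical, and the sketch simply ignores ambiguous bases, which is one of several possible conventions.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C among unambiguous bases (A, C, G, T)."""
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    total = gc + seq.count("A") + seq.count("T")
    if total == 0:
        raise ValueError("sequence has no A/C/G/T bases")
    return 100.0 * gc / total


# Toy sequence (hypothetical, not from the S. parauberis genome).
toy = "ATGCATAT"
print(gc_content(toy))  # 25.0
```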

  4. The Chinese hamster Alu-equivalent sequence: a conserved highly repetitious, interspersed deoxyribonucleic acid sequence in mammals has a structure suggestive of a transposable element.

    PubMed Central

    Haynes, S R; Toomey, T P; Leinwand, L; Jelinek, W R

    1981-01-01

    A consensus sequence has been determined for a major interspersed deoxyribonucleic acid repeat in the genome of Chinese hamster ovary cells (CHO cells). This sequence is extensively homologous to (i) the human Alu sequence (P. L. Deininger et al., J. Mol. Biol., in press), (ii) the mouse B1 interspersed repetitious sequence (Krayev et al., Nucleic Acids Res. 8:1201-1215, 1980), (iii) an interspersed repetitious sequence from African green monkey deoxyribonucleic acid (Dhruva et al., Proc. Natl. Acad. Sci. U.S.A. 77:4514-4518, 1980) and (iv) the CHO and mouse 4.5S ribonucleic acid (this report; F. Harada and N. Kato, Nucleic Acids Res. 8:1273-1285, 1980). Because the CHO consensus sequence shows significant homology to the human Alu sequence it is termed the CHO Alu-equivalent sequence. A conserved structure surrounding CHO Alu-equivalent family members can be recognized. It is similar to that surrounding the human Alu and the mouse B1 sequences, and is represented as follows: direct repeat-CHO-Alu-A-rich sequence-direct repeat. A composite interspersed repetitious sequence has been identified. Its structure is represented as follows: direct repeat-residue 47 to 107 of CHO-Alu-non-Alu repetitious sequence-A-rich sequence-direct repeat. Because the Alu flanking sequences resemble those that flank known transposable elements, we think it likely that the Alu sequence dispersed throughout the mammalian genome by transposition. PMID:9279371

  5. A rapid method for manual or automated purification of fluorescently labeled nucleic acids for sequencing, genotyping, and microarrays.

    PubMed

    Springer, Amy L; Booth, Lisa R; Braid, Michael D; Houde, Christiane M; Hughes, Karin A; Kaiser, Robert J; Pedrak, Casandra; Spicer, Douglas A; Stolyar, Sergey

    2003-03-01

    Fluorescent dyes provide specific, sensitive, and multiplexed detection of nucleic acids. To maximize sensitivity, fluorescently labeled reaction products (e.g., cycle sequencing or primer extension products) must be purified away from residual dye-labeled precursors. Successful high-throughput analyses require that this purification be reliable, rapid, and amenable to automation. Common methods for purifying reaction products involve several steps and require processes that are not easily automated. Prolinx, Inc. has developed RapXtract superparamagnetic separation technology affording rapid and easy-to-perform methods that yield high-quality product and are easily automated. The technology uses superparamagnetic particles that specifically remove unincorporated dye-labeled precursors. These particles are efficiently pelleted in the presence of a magnetic field, making them ideal for purification because of the rapid separations that they allow. RapXtract-purified sequencing reactions yield data with good signal and high Phred quality scores, and they work with various sequencing dye chemistries, including BigDye and near-infrared fluorescence IRDyes. RapXtract technology can also be used to purify dye primer sequencing reactions, primer extension reactions for genotyping analysis, and nucleic acid labeling reactions for microarray hybridization. The ease of use and versatility of RapXtract technology make it a good choice for manual or automated purification of fluorescently labeled nucleic acids.

  6. Genome-wide analyses of Epstein-Barr virus reveal conserved RNA structures and a novel stable intronic sequence RNA

    PubMed Central

    2013-01-01

    Background Epstein-Barr virus (EBV) is a human herpesvirus implicated in cancer and autoimmune disorders. Little is known concerning the roles of RNA structure in this important human pathogen. This study provides the first comprehensive genome-wide survey of RNA and RNA structure in EBV. Results Novel EBV RNAs and RNA structures were identified by computational modeling and RNA-Seq analyses of EBV. Scans of the genomic sequences of four EBV strains (EBV-1, EBV-2, GD1, and GD2) and of the closely related Macacine herpesvirus 4 using the RNAz program discovered 265 regions with high probability of forming conserved RNA structures. Secondary structure models are proposed for these regions based on a combination of free energy minimization and comparative sequence analysis. The analysis of RNA-Seq data uncovered the first observation of a stable intronic sequence RNA (sisRNA) in EBV. The abundance of this sisRNA rivals that of the well-known and highly expressed EBV-encoded non-coding RNAs (EBERs). Conclusion This work identifies regions of the EBV genome likely to generate functional RNAs and RNA structures, provides structural models for these regions, and discusses potential functions suggested by the modeled structures. Enhanced understanding of the EBV transcriptome will guide future experimental analyses of the discovered RNAs and RNA structures. PMID:23937650

  7. Genomic analyses of multidrug resistant Pseudomonas aeruginosa PA1 resequenced by single-molecule real-time sequencing

    PubMed Central

    Li, Gang; Shen, Mengyu; Le, Shuai; Tan, Yinling; Li, Ming; Zhao, Xia; Shen, Wei; Yang, Yuhui; Wang, Jing; Zhu, Hongbin; Li, Shu; Rao, Xiancai; Hu, Fuquan; Lu, Shuguang

    2016-01-01

    As a third-generation sequencing (TGS) method, single-molecule real-time (SMRT) technology provides long read length, and it is well suited for resequencing projects and de novo assembly. In the present study, Pseudomonas aeruginosa PA1 was characterized and resequenced using SMRT technology. PA1 was also subjected to genomic, comparative and pan-genomic analyses. The multidrug resistant strain PA1 possesses a 6,498,072 bp genome and a sequence type of ST-782. The genome of PA1 was also visualized, and the results revealed the details of general genome annotations, virulence factors, regulatory proteins (RPs), secretion system proteins, type II toxin–antitoxin (T–A) pairs and genomic islands. Whole genome comparison analysis suggested that PA1 exhibits similarity to other P. aeruginosa strains but differs in terms of horizontal gene transfer (HGT) regions, such as prophages and genomic islands. Phylogenetic analyses based on 16S rRNA sequences demonstrated that PA1 is closely related to PAO1, and P. aeruginosa strains can be divided into two main groups. The pan-genome of P. aeruginosa consists of a core genome of approximately 4,000 genes and an accessory genome of at least 6,600 genes. The present study presented a detailed, visualized and comparative analysis of the PA1 genome, to enhance our understanding of this notorious pathogen. PMID:27765811
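
    The core/accessory split mentioned in the abstract above can be illustrated with set operations. This is only a sketch under simplifying assumptions: each genome is represented as a set of gene-family identifiers, and a gene is "core" only if it is present in every genome. The function name, strain labels and gene labels are invented for illustration and are not the study's pipeline or data.

```python
from typing import Dict, Set, Tuple


def partition_pan_genome(genomes: Dict[str, Set[str]]) -> Tuple[Set[str], Set[str]]:
    """Split the pan-genome into core genes (in all genomes) and accessory genes."""
    if not genomes:
        raise ValueError("at least one genome is required")
    gene_sets = list(genomes.values())
    pan = set().union(*gene_sets)            # every gene seen in any genome
    core = set.intersection(*gene_sets)      # genes shared by all genomes
    return core, pan - core


# Hypothetical toy input: three strains with invented gene-family labels.
toy = {
    "strainA": {"g1", "g2", "g3"},
    "strainB": {"g1", "g2", "g4"},
    "strainC": {"g1", "g2", "g5"},
}
core, accessory = partition_pan_genome(toy)
print(sorted(core), sorted(accessory))
```

    Published pan-genome analyses usually cluster genes by sequence similarity first and may relax the "present in every genome" rule; both steps are outside this sketch.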

  8. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

    SciTech Connect

    Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.; Ryken, Scott A.; Yost, Will; Jha, Ramesh K.; Patterson, Johnathan; Monnat, Raymond J.; Barlow, Steven B.; Starkenburg, Shawn R.; Cattolico, Rose Ann; Richardson, Paul M.

    2015-09-23

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. In conclusion, the Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.

  9. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

    DOE PAGES

    Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.; ...

    2015-09-23

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. In conclusion, the Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.

  10. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae).

    PubMed

    Hovde, Blake T; Deodato, Chloe R; Hunsperger, Heather M; Ryken, Scott A; Yost, Will; Jha, Ramesh K; Patterson, Johnathan; Monnat, Raymond J; Barlow, Steven B; Starkenburg, Shawn R; Cattolico, Rose Ann

    2015-01-01

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two "red" RuBisCO activases that are shared across many algal lineages. The Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.

  11. Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin: Metabolic Tools for Enhanced Algal Fitness in the Prominent Order Prymnesiales (Haptophyceae)

    PubMed Central

    Hovde, Blake T.; Deodato, Chloe R.; Hunsperger, Heather M.; Ryken, Scott A.; Yost, Will; Jha, Ramesh K.; Patterson, Johnathan; Monnat, Raymond J.; Barlow, Steven B.; Starkenburg, Shawn R.; Cattolico, Rose Ann

    2015-01-01

    Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (∼40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two “red” RuBisCO activases that are shared across many algal lineages. The Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes. PMID:26397803

  12. Sequence analyses and chromosomal distribution of the Tc1/Mariner element in Parodontidae fish (Teleostei: Characiformes).

    PubMed

    Schemberger, Michelle Orane; Nogaroto, Viviane; Almeida, Mara Cristina; Artoni, Roberto Ferreira; Valente, Guilherme Targino; Martins, Cesar; Moreira-Filho, Orlando; Cestari, Marta Margarete; Vicari, Marcelo Ricardo

    2016-11-30

    Transposable elements are able to move along eukaryotic genomes. They are divided into two classes according to their transposition intermediate: RNA (class I or retrotransposons) or DNA (class II or DNA transposons). Most of these sequences are inactive or non-autonomous in eukaryotic genomes. Inactive transposons can accumulate mutations at neutral rates until losing their molecular identity. They may either be eliminated from the genome or take on different molecular functions. Transposable elements may also participate in the differentiation of sex chromosomes. Therefore, the structural variations and nucleotide similarity of Tc1/Mariner sequences were analyzed along with their potential participation in the differentiation processes of sex chromosomes in the genomes of Parodontidae fish. All Parodontidae species presented non-autonomous copies of Tc1/Mariner with structural variation, different levels of deterioration (genetic distance), and variations in insertion and deletion patterns. The physical mapping of Tc1/Mariner on chromosomes revealed dispersed signals in euchromatins, with small accumulations in terminal regions and in the sex chromosomes. The gene dosage ratios indicated copy number variations of Tc1/Mariner among the genomes and high transposase open reading frame deterioration in Parodon hilarii and Parodon pongoensis genomes. This transposon presented transcriptional activity in gonads, but there was no significant difference between sexes. This may indicate non-functional protein expression or may correspond to DNA binding proteins derived from Tc1/Mariner. Thus, our results show Tc1/Mariner inactivation along with a diversity in Parodontidae genomes and its participation in the differentiation of the W sex chromosome.

  13. Cytogenetic and Sequence Analyses of Mitochondrial DNA Insertions in Nuclear Chromosomes of Maize

    PubMed Central

    Lough, Ashley N.; Faries, Kaitlyn M.; Koo, Dal-Hoe; Hussain, Abid; Roark, Leah M.; Langewisch, Tiffany L.; Backes, Teresa; Kremling, Karl A. G.; Jiang, Jiming; Birchler, James A.; Newton, Kathleen J.

    2015-01-01

    The transfer of mitochondrial DNA (mtDNA) into nuclear genomes is a regularly occurring process that has been observed in many species. Few studies, however, have focused on the variation of nuclear-mtDNA sequences (NUMTs) within a species. This study examined mtDNA insertions within chromosomes of a diverse set of Zea mays ssp. mays (maize) inbred lines by the use of fluorescence in situ hybridization. A relatively large NUMT on the long arm of chromosome 9 (9L) was identified at approximately the same position in four inbred lines (B73, M825, HP301, and Oh7B). Further examination of the similarly positioned 9L NUMT in two lines, B73 and M825, indicated that the large size of these sites is due to the presence of a majority of the mitochondrial genome; however, only portions of this NUMT (∼252 kb total) were found in the publicly available B73 nuclear sequence for chromosome 9. Fiber-fluorescence in situ hybridization analysis estimated the size of the B73 9L NUMT to be ∼1.8 Mb and revealed that the NUMT is methylated. Two regions of mtDNA (2.4 kb and 3.3 kb) within the 9L NUMT are not present in the B73 mitochondrial NB genome; however, these 2.4-kb and 3.3-kb segments are present in other Zea mitochondrial genomes, including that of Zea mays ssp. parviglumis, a progenitor of domesticated maize. PMID:26333837

  14. Acinetobacter seifertii Isolated from China: Genomic Sequence and Molecular Epidemiology Analyses.

    PubMed

    Yang, Yunxing; Wang, Jianfeng; Fu, Ying; Ruan, Zhi; Yu, Yunsong

    2016-03-01

    Clinical infections caused by Acinetobacter spp. are of increasing public health concern because of their global occurrence and ability to acquire multidrug resistance. The Acinetobacter calcoaceticus-Acinetobacter baumannii (ACB) complex encompasses A. calcoaceticus, A. baumannii, A. pittii (formerly genomic species 3), and A. nosocomialis (formerly genomic species 13TU), which are predominantly responsible for clinical pathogenesis in the Acinetobacter genus. In our previous study, a putative novel species isolated from 385 non-A. baumannii spp. strains based on the rpoB gene phylogenetic tree was reported. Here, the putative novel species was identified as A. seifertii based on the whole-genome phylogenetic tree. A. seifertii was recognized as a novel member of the ACB complex and close to A. baumannii and A. nosocomialis. Furthermore, we studied the characteristics of 10 A. seifertii isolates, which were distributed widely in 6 provinces in China and mainly caused infections in the elderly or children. To define the taxonomic status and characteristics, the biochemical reactions, antimicrobial susceptibility testing, pulsed field gel electrophoresis (PFGE), multilocus sequence typing (MLST), and whole-genome sequence analysis were performed. The phenotypic characteristics failed to distinguish A. seifertii from other species in the ACB complex. Most of the A. seifertii isolates were susceptible to antibiotics commonly used for nosocomial Acinetobacter spp. infections, but one isolate (strain A362) was resistant to ampicillin/sulbactam, ceftazidime and amikacin. The different patterns of MLST and PFGE suggested that the 10 isolates were not identical and lacked clonal relatedness. Our study reported for the first time the molecular epidemiological and genomic features of widely disseminated A. seifertii in China. These observations could enrich the knowledge of infections caused by non-A. baumannii and may provide a scientific basis for future clinical treatment.

  15. Does more sequence data improve estimates of galliform phylogeny? Analyses of a rapid radiation using a complete data matrix

    PubMed Central

    Braun, Edward L.

    2014-01-01

    The resolution of rapid evolutionary radiations or “bushes” in the tree of life has been one of the most difficult and interesting problems in phylogenetics. The avian order Galliformes appears to have undergone several rapid radiations that have limited the resolution of prior studies and obscured the position of taxa important both agriculturally and as model systems (chicken, turkey, Japanese quail). Here we present analyses of a multi-locus data matrix comprising over 15,000 sites, primarily from nuclear introns but also including three mitochondrial regions, from 46 galliform taxa with all gene regions sampled for all taxa. The increased sampling of unlinked nuclear genes provided strong bootstrap support for all but a small number of relationships. Coalescent-based methods to combine individual gene trees and analyses of datasets that are independent of published data indicated that this well-supported topology is likely to reflect the galliform species tree. The inclusion or exclusion of mitochondrial data had a limited impact upon analyses using either concatenated data or multispecies coalescent methods. Some of the key phylogenetic findings include support for a second major clade within the core phasianids that includes the chicken and Japanese quail and clarification of the phylogenetic relationships of turkey. Jackknifed datasets suggested that there is an advantage to sampling many independent regions across the genome rather than obtaining long sequences for a small number of loci, possibly reflecting the differences among gene trees that differ due to incomplete lineage sorting. Despite the novel insights we obtained using this increased sampling of gene regions, some nodes remain unresolved, likely due to periods of rapid diversification. Resolving these remaining groups will likely require sequencing a very large number of gene regions, but our analyses now appear to support a robust backbone for this order. PMID:24795852

  16. The amino acid sequence of goat beta-lactoglobulin.

    PubMed

    Préaux, G; Braunitzer, G; Schrank, B; Stangl, A

    1979-11-01

    The isolation of beta-lactoglobulin from milk of the goat is described. The purified protein was checked for purity and has been characterized by its gross composition and end groups. The native or the modified protein was then degraded by tryptic and cyanogen bromide cleavage. The cleavage products were isolated and sequenced in the sequenator using a Quadrol and propyne program. These data provide the complete sequence of beta-lactoglobulin of the goat. The results are discussed and compared particularly with bovine beta-lactoglobulin components AB. Some biological aspects are described.

  17. Layered materials with coexisting acidic and basic sites for catalytic one-pot reaction sequences.

    PubMed

    Motokura, Ken; Tada, Mizuki; Iwasawa, Yasuhiro

    2009-06-17

    Acidic montmorillonite-immobilized primary amines (H-mont-NH(2)) were found to be excellent acid-base bifunctional catalysts for one-pot reaction sequences, which are the first materials with coexisting acid and base sites active for acid-base tandem reactions. For example, tandem deacetalization-Knoevenagel condensation proceeded successfully with the H-mont-NH(2), affording the corresponding condensation product in a quantitative yield. The acidity of the H-mont-NH(2) was strongly influenced by the preparation solvent, and the base-catalyzed reactions were enhanced by interlayer acid sites.

  18. Isolation and a partial amino acid sequence of insulin from the islet tissue of cod (Gadus callarias)

    PubMed Central

    Grant, P. T.; Reid, K. B. M.

    1968-01-01

    1. Insulin has been isolated by gel filtration and ion-exchange chromatography from extracts of the discrete islet tissue of cod. The final preparation yielded a single band on electrophoresis at two pH values. The biological potency was 11·5 international units/mg. in mouse-convulsion and other assay procedures. 2. Glycine and methionine were shown to be the N-terminal amino acids of the A and B chains respectively. An estimate of the molecular weight together with amino acid analyses indicated that cod insulin, like the bovine hormone, consists of 51 amino acid residues. In contrast, the amino acid composition differs markedly from bovine insulin. 3. Oxidation of insulin with performic acid yielded the A and B peptide chains, which were separated by ion-exchange chromatography. Sequence studies on smaller peptides isolated from enzymic digests or from dilute acetic acid hydrolysates of the two chains have established the sequential order of 14 of the 21 amino acid residues of the A chain and 25 of the 30 amino acid residues of the B chain. PMID:4866431

  19. Synthesis of gamma,delta-unsaturated glycolic acids via sequenced Brook and Ireland-Claisen rearrangements.

    PubMed

    Schmitt, Daniel C; Johnson, Jeffrey S

    2010-03-05

    Organozinc, -magnesium, and -lithium nucleophiles initiate a Brook/Ireland-Claisen rearrangement sequence of allylic silyl glyoxylates resulting in the formation of gamma,delta-unsaturated alpha-silyloxy acids.

  20. Computer Simulation of the Determination of Amino Acid Sequences in Polypeptides

    ERIC Educational Resources Information Center

    Daubert, Stephen D.; Sontum, Stephen F.

    1977-01-01

    Describes a computer program that generates a random string of amino acids and guides the student in determining the correct sequence of a given protein by using experimental analytic data for that protein. (MLH)
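
    The abstract gives no detail on how the program works, so the following is only a rough sketch of the general idea: generate a random peptide and produce the kind of fragment data a student might use to reconstruct it. The cleavage rule (trypsin cutting after K or R unless the next residue is P), the function names and the use of composition counts are assumptions made for illustration, not features of the program described.

```python
import random
from collections import Counter
from typing import List, Optional

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard residues


def random_peptide(length: int, seed: Optional[int] = None) -> str:
    """Generate a random peptide of the given length (seeded for repeatability)."""
    rng = random.Random(seed)
    return "".join(rng.choice(AMINO_ACIDS) for _ in range(length))


def tryptic_fragments(peptide: str) -> List[str]:
    """Cleave after K or R, except when the next residue is P (assumed rule)."""
    fragments, start = [], 0
    for i, residue in enumerate(peptide):
        next_residue = peptide[i + 1] if i + 1 < len(peptide) else ""
        if residue in "KR" and next_residue != "P":
            fragments.append(peptide[start:i + 1])
            start = i + 1
    if start < len(peptide):
        fragments.append(peptide[start:])  # C-terminal fragment without K/R
    return fragments


hidden = random_peptide(30, seed=1977)
# The "student" would see only fragment compositions, not the hidden sequence.
for fragment in tryptic_fragments(hidden):
    print(dict(Counter(fragment)))
```

    A fuller simulation would add a second, overlapping cleavage (for example cyanogen bromide after methionine) so that fragments can be ordered unambiguously.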

  1. Isotopic analyses of nitrogenous compounds from the Murchison meteorite: ammonia, amines, amino acids, and polar hydrocarbons

    NASA Technical Reports Server (NTRS)

    Pizzarello, S.; Feng, X.; Epstein, S.; Cronin, J. R.

    1994-01-01

    The combined volatile bases (ammonia, aliphatic amines, and possibly other bases), ammonia, amino acids, and polar hydrocarbons were prepared from the Murchison meteorite for isotopic analyses. The volatile bases were obtained by cryogenic transfer after acid-hydrolysis of a hot-water extract and analyzed by combined gas chromatography-mass spectrometry of pentafluoropropionyl derivatives. The aliphatic amines present in this preparation comprise a mixture that includes both primary and secondary isomers through C5 at a total concentration of ≥100 nmol g⁻¹. As commonly observed for meteoritic organic compounds, almost all isomers through C5 are present, and the concentrations within homologous series decrease with increasing chain length. Ammonia was chromatographically separated from the other volatile bases and found at a concentration of 1.1-1.3 µmol g⁻¹ meteorite. The ammonia analyzed includes contributions from ammonium salts and the hydrolysis of extractable organic compounds, e.g., carboxamides. Stable isotope analyses showed the volatile bases to be substantially enriched in the heavier isotopes, relative to comparable terrestrial compounds (δD ≤ +1221‰; δ¹³C = +22‰; δ¹⁵N = +93‰). Ammonia, per se, was found to have a somewhat lower δ¹⁵N value (+69‰) than the total volatile bases; consequently, a higher δ¹⁵N (>93‰) can be inferred for the other bases, which include the amines. Solvent-extractable polar hydrocarbons obtained separately were found to be enriched in ¹⁵N (δ¹⁵N = +104‰). Total amino acids, prepared from a hydrolyzed hot-water extract by cation exchange chromatography, gave a δ¹⁵N of +94‰, a value in good agreement with that obtained previously. Nitrogen isotopic data are also given for amino acid fractions separated chromatographically. The δ¹⁵N values of the Murchison soluble organic compounds analyzed to date fall within a rather narrow range (δ¹⁵N = +94 ± 8‰), an observation

  2. Phylogenetic analyses of termite post-embryonic sequences illuminate caste and developmental pathway evolution.

    PubMed

    Legendre, Frédéric; Whiting, Michael F; Grandcolas, Philippe

    2013-01-01

    Termites are highly eusocial insects with a caste polyphenism (i.e., discontinuous morphological differences between castes) and elaborated behaviors. While the developmental pathways leading to caste occurrence are well-known in many species, the evolutionary origin of these pathways is still obscure. Recent molecular phylogenetic studies suggest multiple independent origins of sterile castes in termites, reviving a 30-year-old debate. We demonstrate here that diploid sterile castes ("true" workers) evolved several times independently in this group and that this caste was lost at least once in a lineage with developmentally more flexible workers called pseudergates or "false" workers. We also infer that flexibility in post-embryonic development was acquired multiple times independently during termite evolution. We suggest that focusing on detailed developmental pathways in phylogenetic analyses is essential for elucidating the origin of caste polyphenism in termites.

  3. The complete chloroplast genome sequences of Lychnis wilfordii and Silene capitata and comparative analyses with other Caryophyllaceae genomes

    PubMed Central

    Kang, Jong-Soo; Lee, Byoung Yoon; Kwak, Myounghai

    2017-01-01

    The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods. PMID:28241056

  4. Multilocus sequence analyses reveal extensive diversity and multiple origins of fluconazole resistance in Candida tropicalis from tropical China

    PubMed Central

    Wu, Jin-Yan; Guo, Hong; Wang, Hua-Min; Yi, Guo-Hui; Zhou, Li-Min; He, Xiao-Wen; Zhang, Ying; Xu, Jianping

    2017-01-01

    Candida tropicalis is among the most prevalent human pathogenic yeast species, second only to C. albicans in certain geographic regions such as East Asia and Brazil. However, compared to C. albicans, relatively little is known about the patterns of genetic variation in C. tropicalis. This study analyzed the genetic diversity and relationships among isolates of C. tropicalis from the southern Chinese island of Hainan. A total of 116 isolates were obtained from seven geographic regions located across the island. For each isolate, a total of 2677 bp from six gene loci were sequenced and 79 (2.96%) polymorphic nucleotide sites were found in our sample. Comparisons with strains reported from other parts of the world identified significant novel diversity in Hainan, including an average of six novel sequences (with a range of 1 to 14) per locus and 80 novel diploid sequence types. Most of the genetic variation was found within individual strains and there was abundant evidence for gene flow among the seven geographic locations within Hainan. Interestingly, our analyses identified no significant correlation between the diploid sequence types at the six loci and fluconazole susceptibility, consistent with multiple origins of fluconazole resistance in the Hainan population of C. tropicalis. PMID:28186162

  5. Fatty acid and DNA analyses of Permian bacteria isolated from ancient salt crystals reveal differences with their modern relatives.

    PubMed

    Vreeland, Russell H; Rosenzweig, William D; Lowenstein, Tim; Satterfield, Cindy; Ventosa, Antonio

    2006-02-01

    The isolation of living microorganisms from primary 250-million-year-old (MYA) salt crystals has been questioned by several researchers. The most intense discussion has arisen from questions about the texture and age of the crystals used, the ability of organisms to survive 250 million years when exposed to environmental factors such as radiation, and the close similarity between 16S rRNA sequences in the Permian and modern microbes. The data in this manuscript are not meant to provide support for the antiquity of the isolated bacterial strains. Rather, the data present several comparisons between the Permian microbes and other isolates to which they appear related. The analyses include whole cell fatty acid profiling, DNA-DNA hybridizations, ribotyping, and random amplified polymorphic DNA amplification (RAPD). These data show that the Permian strains studied here differ significantly from their more modern relatives. These differences are accumulating in both phenotypic and molecular areas of the cells. At the fatty acid level the differences are approaching but have not reached separate species status. At the molecular level the variation appears to be distributed across the genome and within the gene regions flanking the highly conserved 16S rRNA itself. The data show that these bacteria are not identical and help to rule out questions of contamination by putatively modern strains.

  6. Genome sequence of the acid-tolerant strain Rhizobium sp. LPU83.

    PubMed

    Wibberg, Daniel; Tejerizo, Gonzalo Torres; Del Papa, María Florencia; Martini, Carla; Pühler, Alfred; Lagares, Antonio; Schlüter, Andreas; Pistorio, Mariano

    2014-04-20

    Rhizobia are important members of the soil microbiome since they enter into nitrogen-fixing symbiosis with different legume host plants. Rhizobium sp. LPU83 is an acid-tolerant Rhizobium strain featuring a broad-host-range. However, it is ineffective in nitrogen fixation. Here, the improved draft genome sequence of this strain is reported. Genome sequence information provides the basis for analysis of its acid tolerance, symbiotic properties and taxonomic classification.

  7. Requirements for Efficient Correction of ΔF508 CFTR Revealed by Analyses of Evolved Sequences

    PubMed Central

    Mendoza, Juan L.; Schmidt, André; Li, Qin; Caspa, Emmanuel; Barrett, Tyler; Bridges, Robert J.; Feranchak, Andrew P.; Brautigam, Chad A.; Thomas, Philip J.

    2012-01-01

    SUMMARY Misfolding of ΔF508 CFTR underlies pathology in most CF patients. F508 resides in the first nucleotide binding domain (NBD1) of CFTR near a predicted interface with the fourth intracellular loop (ICL4). Efforts to identify small molecules that restore function by correcting the folding defect have revealed an apparent efficacy ceiling. To understand the mechanistic basis of this obstacle, positions statistically coupled to 508, in evolved sequences, were identified and assessed for their impact on both NBD1 and CFTR folding. The results indicate that both NBD1 folding and interaction with ICL4 are altered by the ΔF508 mutation and that correction of either individual process is only partially effective. By contrast, combination of mutations that counteract both defects restores ΔF508 maturation and function to wild type levels. These results provide a mechanistic rationale for the limited efficacy of extant corrector compounds and suggest approaches for identifying compounds that correct both defective steps. PMID:22265409

  8. Neotomine-peromyscine rodent systematics based on combined analyses of nuclear and mitochondrial DNA sequences.

    PubMed

    Reeder, Serena A; Carroll, Darin S; Edwards, Cody W; Kilpatrick, C William; Bradley, Robert D

    2006-07-01

    Recently, sequences from two nuclear genes (exon 6 of the dentin matrix protein 1 gene and intron 7 of the beta-fibrinogen gene) and one mitochondrial gene (cytochrome b gene) were used independently in an attempt to resolve phylogenetic relationships within the neotomine-peromyscine complex. Although these studies provided testable hypotheses regarding this group of rodents, the affinities of certain tribes and genera remain uncertain. To elucidate these relationships, the three data partitions were tested for heterogeneity and then concatenated according to conditional data combination and total evidence approaches. Support was found for five clades, four of which correspond to well recognized tribes (the Neotomini, Peromyscini=Reithrodontomyini, Baiomyini, and Tylomyini). Recommendations are made regarding the recognition of Ochrotomys as a tribe of its own, the Ochrotomyini, paralleling other recent findings. The Peromyscini, Baiomyini, and Ochrotomyini are unresolved in relation to each other, but as a whole are sister to the Neotomini. The Tylomyini is basal to all clades. It appears that combined data from the nuclear and mitochondrial genes (analyzing all three partitions simultaneously) resulted in the best phylogenetic hypothesis regarding the complex.

  9. Integrative analyses of transcriptome sequencing identify novel functional lncRNAs in esophageal squamous cell carcinoma

    PubMed Central

    Li, C-Q; Huang, G-W; Wu, Z-Y; Xu, Y-J; Li, X-C; Xue, Y-J; Zhu, Y; Zhao, J-M; Li, M; Zhang, J; Wu, J-Y; Lei, F; Wang, Q-Y; Li, S; Zheng, C-P; Ai, B; Tang, Z-D; Feng, C-C; Liao, L-D; Wang, S-H; Shen, J-H; Liu, Y-J; Bai, X-F; He, J-Z; Cao, H-H; Wu, B-L; Wang, M-R; Lin, D-C; Koeffler, H P; Wang, L-D; Li, X; Li, E-M; Xu, L-Y

    2017-01-01

    Long non-coding RNAs (lncRNAs) have a critical role in cancer initiation and progression, and thus may mediate oncogenic or tumor suppressing effects, as well as be a new class of cancer therapeutic targets. We performed high-throughput sequencing of RNA (RNA-seq) to investigate the expression level of lncRNAs and protein-coding genes in 30 esophageal samples, comprised of 15 esophageal squamous cell carcinoma (ESCC) samples and their 15 paired non-tumor tissues. We further developed an integrative bioinformatics method, denoted URW-LPE, to identify key functional lncRNAs that regulate expression of downstream protein-coding genes in ESCC. A number of known onco-lncRNA and many putative novel ones were effectively identified by URW-LPE. Importantly, we identified lncRNA625 as a novel regulator of ESCC cell proliferation, invasion and migration. ESCC patients with high lncRNA625 expression had significantly shorter survival time than those with low expression. LncRNA625 also showed specific prognostic value for patients with metastatic ESCC. Finally, we identified E1A-binding protein p300 (EP300) as a downstream executor of lncRNA625-induced transcriptional responses. These findings establish a catalog of novel cancer-associated functional lncRNAs, which will promote our understanding of lncRNA-mediated regulation in this malignancy. PMID:28194033

  10. The amino acid sequence of monal pheasant lysozyme and its activity.

    PubMed

    Araki, T; Matsumoto, T; Torikata, T

    1998-10-01

    The amino acid sequence of monal pheasant lysozyme and its activity were analyzed. Carboxymethylated lysozyme was digested with trypsin and the resulting peptides were sequenced. The established amino acid sequence had one amino acid substitution, at position 102 (Arg to Gly), compared with Indian peafowl lysozyme and four amino acid substitutions, at positions 3 (Phe to Tyr), 15 (His to Leu), 41 (Gln to His), and 121 (Gln to His), compared with chicken lysozyme. Analysis of the time-courses of reaction using N-acetylglucosamine pentamer as a substrate showed a difference of binding free energy change (-0.4 kcal/mol) at subsites A between monal pheasant and Indian peafowl lysozyme. This was assumed to be caused by the amino acid substitution at subsite A with loss of a positive charge at position 102 (Arg102 to Gly).
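
    The substitution lists above (for example "3 (Phe to Tyr)") can be produced by a position-by-position comparison of two aligned protein sequences. The following is a minimal sketch, not the authors' method: the function name `substitutions` and the two 15-residue fragments are made up for illustration and are not taken from the source.

```python
# One-letter to three-letter amino acid codes (the 20 standard residues).
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def substitutions(reference, query):
    """List (position, reference residue, query residue) for each mismatch.

    Both sequences must already be aligned and of equal length;
    positions are 1-based, as in the abstract.
    """
    if len(reference) != len(query):
        raise ValueError("sequences must have the same aligned length")
    return [
        (pos, THREE_LETTER[r], THREE_LETTER[q])
        for pos, (r, q) in enumerate(zip(reference, query), start=1)
        if r != q
    ]


# Hypothetical fragments, invented for illustration only.
reference_fragment = "KVFGRCELAAAMKRH"
query_fragment = "KVYGRCELAAAMKRL"

for pos, ref_res, qry_res in substitutions(reference_fragment, query_fragment):
    print(f"{pos} ({ref_res} to {qry_res})")
```

    With real data, the two inputs would be the full-length aligned sequences; gaps and non-standard residues would need extra handling that this sketch omits.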

  11. Molecular phylogenetics of subclass Peritrichia (Ciliophora: Oligohymenophorea) based on expanded analyses of 18S rRNA sequences.

    PubMed

    Utz, Laura R P; Eizirik, Eduardo

    2007-01-01

    Phylogenetic relationships among peritrich ciliates remain unclear in spite of recent progress. To expand the analyses performed in previous studies, and to statistically test hypotheses of monophyly, we analyzed a broad sample of 18S rRNA sequences (including 15 peritrich genera), applying a conservative alignment strategy and several phylogenetic approaches. The main results are that: (i) the monophyly of Peritrichia cannot be rejected; (ii) the two main clades of Sessilida do not correspond to formally recognized taxa; (iii) the monophyly of genera Vorticella and Epistylis is significantly rejected; and (iv) morphological structures commonly used in peritrich taxonomy may be evolutionarily labile.

  12. Reprint of "Sequence and phylogenetic analyses of novel totivirus-like double-stranded RNAs from field-collected powdery mildew fungi".

    PubMed

    Kondo, Hideki; Hisano, Sakae; Chiba, Sotaro; Maruyama, Kazuyuki; Andika, Ida Bagus; Toyoda, Kazuhiro; Fujimori, Fumihiro; Suzuki, Nobuhiro

    2016-07-02

    The identification of mycoviruses contributes greatly to understanding of the diversity and evolutionary aspects of viruses. Powdery mildew fungi are important and widely studied obligate phytopathogenic agents, but there has been no report on mycoviruses infecting these fungi. In this study, we used a deep sequencing approach to analyze the double-stranded RNA (dsRNA) segments isolated from field-collected samples of powdery mildew fungus-infected red clover plants in Japan. Database searches identified the presence of at least ten totivirus (genus Totivirus)-like sequences, termed red clover powdery mildew-associated totiviruses (RPaTVs). The majority of these sequences shared moderate amino acid sequence identity with each other (<44%) and with other known totiviruses (<59%). Nine of these identified sequences (RPaTV1a, 1b and 2-8) resembled the genome of the prototype totivirus, Saccharomyces cerevisiae virus-L-A (ScV-L-A) in that they contained two overlapping open reading frames (ORFs) encoding a putative coat protein (CP) and an RNA dependent RNA polymerase (RdRp), while one sequence (RPaTV9) showed similarity to another totivirus, Ustilago maydis virus H1 (UmV-H1) that encodes a single polyprotein (CP-RdRp fusion). Similar to yeast totiviruses, each ScV-L-A-like RPaTV contains a -1 ribosomal frameshift site downstream of a predicted pseudoknot structure in the overlapping region of these ORFs, suggesting that the RdRp is translated as a CP-RdRp fusion. Moreover, several ScV-L-A-like sequences were also found by searches of the transcriptome shotgun assembly (TSA) libraries from rust fungi, plants and insects. Phylogenetic analyses show that nine ScV-L-A-like RPaTVs along with ScV-L-A-like sequences derived from TSA libraries are clustered with most established members of the genus Totivirus, while one RPaTV forms a new distinct clade with UmV-H1, possibly establishing an additional genus in the family. Taken together, our results indicate the presence of

  13. Single-chain structure of human ceruloplasmin: the complete amino acid sequence of the whole molecule.

    PubMed Central

    Takahashi, N; Ortel, T L; Putnam, F W

    1984-01-01

    We have determined the amino acid sequence of the amino-terminal 67,000-dalton (67-kDa) fragment of human ceruloplasmin and have established overlapping sequences between the 67-kDa and 50-kDa fragments and between the 50-kDa and 19-kDa fragments. The 67-kDa fragment contains 480 amino acid residues and three glucosamine oligosaccharides. These results together with our previous sequence data for the 50-kDa and 19-kDa fragments complete the amino acid sequence of human ceruloplasmin. The polypeptide chain has a total of 1,046 amino acid residues (Mr 120,085) and has attachment sites for four glucosamine oligosaccharides; together these account for the total molecular mass of human ceruloplasmin (132 kDa). The sequence analysis of the peptides overlapping the fragments showed that one additional amino acid, arginine, is present between the 67-kDa and 50-kDa fragments, and another, lysine, is between the 50-kDa and 19-kDa fragments. Only two apparent sites of amino acid interchange have been identified in the polypeptide chain. Both involve a single-point interchange of glycine and lysine that would result in a difference in charge. The results of the complete sequence analysis verified that human ceruloplasmin is composed of a single polypeptide chain and that the subunit-like fragments are produced by proteolytic cleavage during purification (and possibly also in vivo). PMID:6582496

  14. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    PubMed Central

    Geissler, Andreas J.; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. PMID:27795248

  15. Evolution and biogeography of Centaurea section Acrocentron inferred from nuclear and plastid DNA sequence analyses

    PubMed Central

    Font, Mònica; Garcia-Jacas, Núria; Vilatersana, Roser; Roquet, Cristina; Susanna, Alfonso

    2009-01-01

    Background and Aims Section Acrocentron of the genus Centaurea is one of the largest sections of Centaurea with approx. 100 species. The geographic distribution, centred in the Mediterranean, makes it an excellent example for studies of the biogeographic history of this biodiversity-rich region. Methods Plastid (trnH-psbA) and nuclear (ITS and ETS) DNA sequence analysis was used for phylogenetic reconstruction. Ancestral biogeographic patterns were inferred by dispersal-vicariance analysis (DIVA). Key Results The resulting phylogeny has implications for the sectional classification of Acrocentron and confirms merging sect. Chamaecyanus into Acrocentron as a subsection. Previous suggestions of an eastern Mediterranean origin of the group are confirmed. The main centres of diversification established in previous studies are now strongly supported. Expansion of the group in two different radiations that followed patently diverse paths is inferred. Conclusions Radiation followed two waves, widely separated in time scale. The oldest one, from Turkey to Greece and the northern Balkans and then to North Africa and Iberia, should be dated at the end of the Miocene in the Messinian period. It reached the Iberian Peninsula from the south, following a route that is landmarked by several relictic taxa in Sicily and North Africa. A later radiation during the Holocene interglacial periods followed, involving species from the north of the Balkan Peninsula, along a Eurasian pathway running from Central Iberia to the steppes of Kazakhstan. A generalized pattern of reticulation is also evident from the results, indicating past contacts between presently separated species. Molecular data also confirmed the extent of hybridization within Acrocentron and were successful in reconstructing the paleogeography of the section. PMID:19228702

  16. Metabolomic Analyses of Leishmania Reveal Multiple Species Differences and Large Differences in Amino Acid Metabolism

    PubMed Central

    Wang, Lijie; Zhang, Tong; Watson, David G.; Silva, Ana Marta; Coombs, Graham H.

    2015-01-01

    Comparative genomic analyses of Leishmania species have revealed relatively minor heterogeneity amongst recognised housekeeping genes and yet the species cause distinct infections and pathogenesis in their mammalian hosts. To gain greater information on the biochemical variation between species, and insights into possible metabolic mechanisms underpinning visceral and cutaneous leishmaniasis, we have undertaken in this study a comparative analysis of the metabolomes of promastigotes of L. donovani, L. major and L. mexicana. The analysis revealed 64 metabolites with confirmed identity differing 3-fold or more between the cell extracts of species, with 161 putatively identified metabolites differing similarly. Analysis of the media from cultures revealed an at least 3-fold difference in use or excretion of 43 metabolites of confirmed identity and 87 putatively identified metabolites that differed to a similar extent. Strikingly large differences were detected in their extent of amino acid use and metabolism, especially for tryptophan, aspartate, arginine and proline. Major pathways of tryptophan and arginine catabolism were shown to be to indole-3-lactate and arginic acid, respectively, which were excreted. The data presented provide clear evidence on the value of global metabolomic analyses in detecting species-specific metabolic features, thus application of this technology should be a major contributor to gaining greater understanding of how pathogens are adapted to infecting their hosts. PMID:26368322

  17. DNA Sequence and Expression Variation of Hop (Humulus lupulus) Valerophenone Synthase (VPS), a Key Gene in Bitter Acid Biosynthesis

    PubMed Central

    Castro, Consuelo B.; Whittock, Lucy D.; Whittock, Simon P.; Leggett, Grey; Koutoulis, Anthony

    2008-01-01

    Background The hop plant (Humulus lupulus) is a source of many secondary metabolites, with bitter acids essential in the beer brewing industry and others having potential applications for human health. This study investigated variation in DNA sequence and gene expression of valerophenone synthase (VPS), a key gene in the bitter acid biosynthesis pathway of hop. Methods Sequence variation was studied in 12 varieties, and expression was analysed in four of the 12 varieties in a series across the development of the hop cone. Results Nine single nucleotide polymorphisms (SNPs) were detected in VPS, seven of which were synonymous. The two non-synonymous polymorphisms did not appear to be related to typical bitter acid profiles of the varieties studied. However, real-time quantitative reverse-transcription polymerase chain reaction (qRT-PCR) analysis of VPS expression during hop cone development showed a clear link with the bitter acid content. The highest levels of VPS expression were observed in two triploid varieties, ‘Symphony’ and ‘Ember’, which typically have high bitter acid levels. Conclusions In all hop varieties studied, VPS expression was lowest in the leaves and an increase in expression was consistently observed during the early stages of cone development. PMID:18519445

  18. Discrimination of prey species of juvenile swordfish Xiphias gladius (Linnaeus, 1758) using signature fatty acid analyses

    NASA Astrophysics Data System (ADS)

    Young, Jock W.; Guest, Michaela A.; Lansdell, Matt; Phleger, Charles F.; Nichols, Peter D.

    2010-07-01

    Signature lipid and fatty acid analyses were used to discriminate the diet of swordfish (Xiphias gladius, orbital fork length: 60-203 cm) from waters off eastern Australia. The fatty acid (FA) composition of a range of known prey (squid, myctophids, and other fishes) of swordfish, taken from stomach samples and from net tows, was compared with that of the white muscle tissue (WMT) of swordfish from the same region. Swordfish muscle was lipid rich (average 24-42% dry weight), as was the skeleton (28-41%). The robustness of the approach was also tested by comparison against a key squid prey species that was collected and stored using different protocols: (i) fresh frozen, (ii) fresh frozen, then thawed, and (iii) stomach content collection. The FA profiles were generally similar, with the ratio of docosahexaenoic acid (DHA) and palmitic acid (16:0) in particular showing no significant difference. Major fatty acids in swordfish WMT were 18:1ω9c, 16:0, 22:6ω3, and 18:0. Multidimensional scaling showed that the swordfish WMT grouped closely with small fish prey including myctophids, and not with squid. Squid contained markedly higher 22:6ω3 than swordfish. Individual prey species of the Myctophidae could also be separated by the same technique. These results were supported by traditional stomach content analyses (SCA) that showed fish were the dominant prey for small swordfish sampled from southern waters whereas squid were the main prey in more northern waters, matching the FA patterns we found for the two regions. We propose that where general diet patterns are established, signature FA analysis has good potential to complement or, in some cases, replace temporal and spatial monitoring of trophic pathways for swordfish and other marine species.

  19. PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.

    PubMed

    Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong; Warnow, Tandy

    2015-05-01

    We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate--slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory.

  20. RAPD and internal transcribed spacer sequence analyses reveal Zea nicaraguensis as a section Luxuriantes species close to Zea luxurians.

    PubMed

    Wang, Pei; Lu, Yanli; Zheng, Mingmin; Rong, Tingzhao; Tang, Qilin

    2011-04-15

    Genetic relationship of a newly discovered teosinte from Nicaragua, Zea nicaraguensis with waterlogging tolerance, was determined based on randomly amplified polymorphic DNA (RAPD) markers and the internal transcribed spacer (ITS) sequences of nuclear ribosomal DNA using 14 accessions from Zea species. RAPD analysis showed that a total of 5,303 fragments were produced by 136 random decamer primers, of which 84.86% bands were polymorphic. RAPD-based UPGMA analysis demonstrated that the genus Zea can be divided into section Luxuriantes including Zea diploperennis, Zea luxurians, Zea perennis and Zea nicaraguensis, and section Zea including Zea mays ssp. mexicana, Zea mays ssp. parviglumis, Zea mays ssp. huehuetenangensis and Zea mays ssp. mays. ITS sequence analysis showed the lengths of the entire ITS region of the 14 taxa in Zea varied from 597 to 605 bp. The average GC content was 67.8%. In addition to the insertion/deletions, 78 variable sites were recorded in the total ITS region with 47 in ITS1, 5 in 5.8S, and 26 in ITS2. Sequences of these taxa were analyzed with neighbor-joining (NJ) and maximum parsimony (MP) methods to construct the phylogenetic trees, selecting Tripsacum dactyloides L. as the outgroup. The phylogenetic relationships of Zea species inferred from the ITS sequences are highly concordant with the RAPD evidence that resolved two major subgenus clades. Both RAPD and ITS sequence analyses indicate that Zea nicaraguensis is more closely related to Zea luxurians than the other teosintes and cultivated maize, which should be regarded as a section Luxuriantes species.
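
    Two of the summary statistics reported above, GC content and the number of variable sites in an alignment, are simple to compute. The sketch below is illustrative only: the function names and the toy alignment are assumptions, not the authors' data or software, and gaps are ignored here whereas the abstract counts insertion/deletions separately.

```python
def gc_content(seq):
    """Percentage of G and C among the unambiguous A/C/G/T bases of a sequence."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


def variable_sites(alignment):
    """0-based column indices where more than one distinct base occurs.

    Gap characters ('-') are ignored, so a column that differs only by
    an insertion/deletion is not counted as variable.
    """
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all sequences must have the same aligned length")
    sites = []
    for i in range(length):
        column = {seq[i].upper() for seq in alignment} - {"-"}
        if len(column) > 1:
            sites.append(i)
    return sites


# Toy alignment, invented for illustration only.
toy_alignment = ["GCGC-ATGC", "GCGCAATGC", "GCCC-ATGG"]

print(gc_content(toy_alignment[0]))
print(variable_sites(toy_alignment))

# The per-region counts in the abstract add up to the reported total.
print(47 + 5 + 26)
```

    Per-region counts such as those given for ITS1, 5.8S and ITS2 would follow from filtering the returned indices by each region's column range.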

  1. NCI-60 Whole Exome Sequencing and Pharmacological CellMiner Analyses

    PubMed Central

    Reinhold, William C.; Varma, Sudhir; Sousa, Fabricio; Sunshine, Margot; Abaan, Ogan D.; Davis, Sean R.; Reinhold, Spencer W.; Kohn, Kurt W.; Morris, Joel; Meltzer, Paul S.; Doroshow, James H.; Pommier, Yves

    2014-01-01

    Exome sequencing provides unprecedented insights into cancer biology and pharmacological response. Here we assess these two parameters for the NCI-60, which is among the richest genomic and pharmacological publicly available cancer cell line databases. Homozygous genetic variants that putatively affect protein function were identified in 1,199 genes (approximately 6% of all genes). Variants that are either enriched or depleted compared to non-cancerous genomes, and thus may be influential in cancer progression and differential drug response were identified for 2,546 genes. Potential gene knockouts are made available. Assessment of cell line response to 19,940 compounds, including 110 FDA-approved drugs, reveals ≈80-fold range in resistance versus sensitivity response across cell lines. 103,422 gene variants were significantly correlated with at least one compound (at p<0.0002). These include genes of known pharmacological importance such as IGF1R, BRAF, RAD52, MTOR, STAT2 and TSC2 as well as a large number of candidate genes such as NOM1, TLL2, and XDH. We introduce two new web-based CellMiner applications that enable exploration of variant-to-compound relationships for a broad range of researchers, especially those without bioinformatics support. The first tool, “Genetic variant versus drug visualization”, provides a visualization of significant correlations between drug activity-gene variant combinations. Examples are given for the known vemurafenib-BRAF, and novel ifosfamide-RAD52 pairings. The second, “Genetic variant summation” allows an assessment of cumulative genetic variations for up to 150 combined genes together; and is designed to identify the variant burden for molecular pathways or functional grouping of genes. An example of its use is provided for the EGFR-ERBB2 pathway gene variant data and the identification of correlated EGFR, ERBB2, MTOR, BRAF, MEK and ERK inhibitors. The new tools are implemented as an updated web-based Cell

  2. SETG: Nucleic Acid Extraction and Sequencing for In Situ Life Detection on Mars

    NASA Astrophysics Data System (ADS)

    Mojarro, A.; Hachey, J.; Tani, J.; Smith, A.; Bhattaru, S. A.; Pontefract, A.; Doebler, R.; Brown, M.; Ruvkun, G.; Zuber, M. T.; Carr, C. E.

    2016-10-01

    We are developing an integrated nucleic acid extraction and sequencing instrument: the Search for Extra-Terrestrial Genomes (SETG) for in situ life detection on Mars. Our goals are to identify related or unrelated nucleic acid-based life on Mars.

  3. Draft Genome Sequence of Cyanobacterium sp. Strain IPPAS B-1200 with a Unique Fatty Acid Composition

    PubMed Central

    Starikov, Alexander Y.; Usserbaeva, Aizhan A.; Sinetova, Maria A.; Sarsekeyeva, Fariza K.; Zayadan, Bolatkhan K.; Ustinova, Vera V.; Kupriyanova, Elena V.; Los, Dmitry A.

    2016-01-01

    Here, we report the draft genome of Cyanobacterium sp. IPPAS strain B-1200, isolated from Lake Balkhash, Kazakhstan, and characterized by the unique fatty acid composition of its membrane lipids, which are enriched with myristic and myristoleic acids. The approximate genome size is 3.4 Mb, and the predicted number of coding sequences is 3,119. PMID:27856596

  4. Sequencing and computational analysis of complete genome sequences of Citrus yellow mosaic badna virus from acid lime and pummelo.

    PubMed

    Borah, Basanta K; Johnson, A M Anthony; Sai Gopal, D V R; Dasgupta, Indranil

    2009-08-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus, is the causative agent of Citrus mosaic disease in India. Although the virus has been detected in several citrus species, only two full-length genomes, one each from Sweet orange and Rangpur lime, are available in publicly accessible databases. In order to obtain a better understanding of the genetic variability of the virus in other citrus mosaic-affected citrus species, we performed the cloning and sequence analysis of complete genomes of CMBV from two additional citrus species, Acid lime and Pummelo. We show that CMBV genomes from the two hosts share high homology with previously reported CMBV sequences and hence conclude that the new isolates represent variants of the virus present in these species. Based on in silico sequence analysis, we predict the possible function of the protein encoded by one of the five ORFs.

  5. Parvalbumins from coelacanth muscle. III. Amino acid sequence of the major component.

    PubMed

    Jauregui-Adell, J; Pechere, J F

    1978-09-26

    The primary structure of the major parvalbumin (pI = 4.52) from coelacanth muscle (Latimeria chalumnae) has been determined. Sequence analysis of the tryptic peptides, in some cases obtained with beta-trypsin, accounts for the total amino acid content of the protein. Chymotryptic peptides provide appropriate sequence overlaps, to complete the localization of the tryptic peptides. Examination of the amino acid sequence of this protein shows the typical structure of a beta-parvalbumin. Its position in the dendrogram of related calcium-binding proteins corresponds to that usually accepted for crossopterygians.

  6. Ancient DNA analyses of museum specimens from selected Presbytis (primate: Colobinae) based on partial Cyt b sequences

    NASA Astrophysics Data System (ADS)

    Aifat, N. R.; Yaakop, S.; Md-Zain, B. M.

    2016-11-01

    The IUCN Red List of Threatened Species places Malaysian primates in categories ranging from Data Deficient to Critically Endangered. Thus, ancient DNA analyses hold great potential for understanding the phylogeny, phylogeography and population history of extinct and extant species. Museum samples are an alternative source of biological material for a large proportion of ancient DNA studies. In this study, DNA was extracted from a total of six museum skin samples from Presbytis hosei (4 samples) and Presbytis frontata (2 samples), aged between 43 and 124 years. Extraction was done using the QIAGEN QIAamp DNA Investigator Kit, and the ability of this kit to extract DNA from museum skin samples was tested by amplification of a partial Cyt b sequence using specifically designed species-specific primers. Two primer pairs were designed specifically for P. hosei and P. frontata, respectively. These primer pairs proved to be efficient in amplifying 200 bp of the targeted species under the optimized PCR conditions. The sequences were then used to determine genetic distances within the genus Presbytis in Malaysia. From the analyses, P. hosei is closely related to P. chrysomelas and P. frontata, with genetic distance values of 0.095 and 0.106, respectively. Cyt b gave clear data for determining relationships among Bornean species. Thus, with the optimized conditions, museum specimens can be used for molecular systematic studies of the Malaysian primates.
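
    The abstract reports pairwise genetic distances but does not say which distance model was used. As a minimal sketch under that caveat, the simplest such measure, the uncorrected p-distance (the proportion of compared sites that differ), can be computed as follows; the function name and the toy sequences are assumptions made for illustration.

```python
def p_distance(seq_a, seq_b):
    """Uncorrected p-distance between two aligned DNA sequences.

    Only columns where both sequences carry an unambiguous A/C/G/T base
    are compared; gaps and ambiguity codes are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same aligned length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


# Toy sequences, invented for illustration only.
print(p_distance("ACGTACGTAC", "ACGTACGTAT"))  # one difference in ten sites
print(p_distance("ACGT-A", "ACGTTA"))          # the gapped column is skipped
```

    Model-corrected distances (for example Kimura 2-parameter) are commonly used in such studies and would generally give slightly larger values than the p-distance for the same pair of sequences.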

  7. Characterization of the mechanism of prolonged adaptation to osmotic stress of Jeotgalibacillus malaysiensis via genome and transcriptome sequencing analyses

    PubMed Central

    Yaakop, Amira Suriaty; Chan, Kok-Gan; Ee, Robson; Lim, Yan Lue; Lee, Siew-Kim; Manan, Fazilah Abd; Goh, Kian Mau

    2016-01-01

    Jeotgalibacillus malaysiensis, a moderate halophilic bacterium isolated from a pelagic area, can endure higher concentrations of sodium chloride (NaCl) than other Jeotgalibacillus type strains. In this study, we therefore chose to sequence and assemble the entire J. malaysiensis genome. This is the first report to provide a detailed analysis of the genomic features of J. malaysiensis, and to perform genetic comparisons between this microorganism and other halophiles. J. malaysiensis encodes a native megaplasmid (pJeoMA), which is greater than 600 kilobases in size, that is absent from other sequenced species of Jeotgalibacillus. Subsequently, RNA-Seq-based transcriptome analysis was utilised to examine adaptations of J. malaysiensis to osmotic stress. Specifically, the eggNOG (evolutionary genealogy of genes: Non-supervised Orthologous Groups) and KEGG (Kyoto Encyclopaedia of Genes and Genomes) databases were used to elucidate the overall effects of osmotic stress on the organism. Generally, saline stress significantly affected carbohydrate, energy, and amino acid metabolism, as well as fatty acid biosynthesis. Our findings also indicate that J. malaysiensis adopted a combination of approaches, including the uptake or synthesis of osmoprotectants, for surviving salt stress. Among these, proline synthesis appeared to be the preferred method for withstanding prolonged osmotic stress in J. malaysiensis. PMID:27641516

  8. Application of carbon and hydrogen stable isotope analyses to detect exogenous citric acid in Japanese apricot liqueur.

    PubMed

    Akamatsu, Fumikazu; Oe, Takaaki; Hashiguchi, Tomokazu; Hisatsune, Yuri; Kawao, Takafumi; Fujii, Tsutomu

    2017-08-01

    Japanese apricot liqueur manufacturers are required to control the quality and authenticity of their liqueur products. Citric acid made from corn is the main acidulant used in commercial liqueurs. In this study, we conducted spiking experiments and carbon and hydrogen stable isotope analyses to detect exogenous citric acid used as an acidulant in Japanese apricot liqueurs. Our results showed that the δ(13)C values detected exogenous citric acid originating from C4 plants but not from C3 plants. The δ(2)H values of citric acid decreased as the amount of citric acid added increased, whether the citric acid originated from C3 or C4 plants. Commercial liqueurs with declared added acidulant provided higher δ(13)C values and lower δ(2)H values than did authentic liqueurs and commercial liqueurs with no declared added acidulant. Carbon and hydrogen stable isotope analyses are suitable as routine methods for detecting exogenous citric acid in Japanese apricot liqueur.

  9. Purification, characterization and partial amino acid sequence of glycogen synthase from Saccharomyces cerevisiae.

    PubMed Central

    Carabaza, A; Arino, J; Fox, J W; Villar-Palasi, C; Guinovart, J J

    1990-01-01

    Glycogen synthase from Saccharomyces cerevisiae was purified to homogeneity. The enzyme showed a subunit molecular mass of 80 kDa. The holoenzyme appears to be a tetramer. Antibodies developed against purified yeast glycogen synthase inactivated the enzyme in yeast extracts and allowed the detection of the protein in Western blots. Amino acid analysis showed that the enzyme is very rich in glutamate and/or glutamine residues. The N-terminal sequence (11 amino acid residues) was determined. In addition, selected tryptic-digest peptides were purified by reverse-phase h.p.l.c. and submitted to gas-phase sequencing. Up to eight sequences (79 amino acid residues) could be aligned with the human muscle enzyme sequence. Levels of identity range between 37 and 100%, indicating that, although human and yeast glycogen synthases probably share some conserved regions, significant differences in their primary structure should be expected. PMID:2114092

  10. Amino acid sequence of anionic peroxidase from the windmill palm tree Trachycarpus fortunei.

    PubMed

    Baker, Margaret R; Zhao, Hongwei; Sakharov, Ivan Yu; Li, Qing X

    2014-12-10

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications.

  11. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations.

    PubMed

    Abascal, Federico; Zardoya, Rafael; Telford, Maximilian J

    2010-07-01

    We present TranslatorX, a web server designed to align protein-coding nucleotide sequences based on their corresponding amino acid translations. Many comparisons between biological sequences (nucleic acids and proteins) involve the construction of multiple alignments. Alignments represent a statement regarding the homology between individual nucleotides or amino acids within homologous genes. As protein-coding DNA sequences evolve as triplets of nucleotides (codons) and it is known that sequence similarity degrades more rapidly at the DNA than at the amino acid level, alignments are generally more accurate when based on amino acids than on their corresponding nucleotides. TranslatorX novelties include: (i) use of all documented genetic codes and the possibility of assigning different genetic codes for each sequence; (ii) a battery of different multiple alignment programs; (iii) translation of ambiguous codons when possible; (iv) an innovative criterion to clean nucleotide alignments with GBlocks based on protein information; and (v) a rich output, including Jalview-powered graphical visualization of the alignments, codon-based alignments coloured according to the corresponding amino acids, measures of compositional bias and first, second and third codon position specific alignments. The TranslatorX server is freely available at http://translatorx.co.uk.
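
    The core idea described above, aligning at the amino acid level and then carrying that alignment back to the nucleotides, can be illustrated with a minimal sketch. This is not TranslatorX's own code: the function name is hypothetical, and the sketch assumes the protein has already been aligned by some external program and that the coding sequence contains exactly one codon per residue (no stop codon).

```python
def backtranslate_alignment(aligned_protein: str, cds: str) -> str:
    """Thread an unaligned coding sequence onto its aligned protein.

    Each residue in the aligned protein is replaced by its codon and
    each gap character '-' by a three-base gap '---'. The function
    does not check that the codons actually translate to the residues.
    """
    residue_count = len(aligned_protein.replace("-", ""))
    if len(cds) != 3 * residue_count:
        raise ValueError("CDS length must be three times the residue count")

    # Split the coding sequence into consecutive codons.
    codons = iter(cds[i:i + 3] for i in range(0, len(cds), 3))

    pieces = []
    for symbol in aligned_protein:
        pieces.append("---" if symbol == "-" else next(codons))
    return "".join(pieces)
```

    For example, threading the coding sequence ATGAAA onto the aligned protein M-K places a three-base gap between the two codons, so gaps never split a codon.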

  12. Amino acid sequence of homologous rat atrial peptides: natriuretic activity of native and synthetic forms.

    PubMed Central

    Seidah, N G; Lazure, C; Chrétien, M; Thibault, G; Garcia, R; Cantin, M; Genest, J; Nutt, R F; Brady, S F; Lyle, T A

    1984-01-01

    A substance called atrial natriuretic factor (ANF), localized in secretory granules of atrial cardiocytes, was isolated as four homologous natriuretic peptides from homogenates of rat atria. The complete sequence of the longest form showed that it is composed of 33 amino acids. The three other shorter forms (2-33, 3-33, and 8-33) represent amino-terminally truncated versions of the 33 amino acid parent molecule as shown by analysis of sequence, amino acid composition, or both. The proposed primary structure agrees entirely with the amino acid composition and reveals no significant sequence homology with any known protein or segment of protein. The short form ANF-(8-33) was synthesized by a multi-fragment condensation approach and the synthetic product was shown to exhibit specific activity comparable to that of the natural ANF-(3-33). PMID:6232612

  13. Nucleotide and deduced amino acid sequences of a new subtilisin from an alkaliphilic Bacillus isolate.

    PubMed

    Saeki, Katsuhisa; Magallones, Marietta V; Takimura, Yasushi; Hatada, Yuji; Kobayashi, Tohru; Kawai, Shuji; Ito, Susumu

    2003-10-01

    The gene for a new subtilisin from the alkaliphilic Bacillus sp. KSM-LD1 was cloned and sequenced. The open reading frame of the gene encoded a 97 amino-acid prepro-peptide plus a 307 amino-acid mature enzyme that contained a possible catalytic triad of residues, Asp32, His66, and Ser224. The deduced amino acid sequence of the mature enzyme (LD1) showed approximately 65% identity to those of subtilisins SprC and SprD from alkaliphilic Bacillus sp. LG12. The amino acid sequence identities of LD1 to those of previously reported true subtilisins and high-alkaline proteases were below 60%. LD1 was characteristically stable during incubation with surfactants and chemical oxidants. Interestingly, an oxidizable Met residue is located next to the catalytic Ser224 of the enzyme as in the cases of the oxidation-susceptible subtilisins reported to date.

  14. Shark myelin basic protein: amino acid sequence, secondary structure, and self-association.

    PubMed

    Milne, T J; Atkins, A R; Warren, J A; Auton, W P; Smith, R

    1990-09-01

    Myelin basic protein (MBP) from the Whaler shark (Carcharhinus obscurus) has been purified from acid extracts of a chloroform/methanol pellet from whole brains. The amino acid sequence of the majority of the protein has been determined and compared with the sequences of other MBPs. The shark protein has only 44% homology with the bovine protein, but, in common with other MBPs, it has basic residues distributed throughout the sequence and no extensive segments that are predicted to have an ordered secondary structure in solution. Shark MBP lacks the triproline sequence previously postulated to form a hairpin bend in the molecule. The region containing the putative consensus sequence for encephalitogenicity in the guinea pig contains several substitutions, thus accounting for the lack of activity of the shark protein. Studies of the secondary structure and self-association have shown that shark MBP possesses solution properties similar to those of the bovine protein, despite the extensive differences in primary structure.

  15. Complete cDNA and derived amino acid sequence of human factor V

    SciTech Connect

    Jenny, R.J.; Pittman, D.D.; Toole, J.J.; Kriz, R.W.; Aldape, R.A.; Hewick, R.M.; Kaufman, R.J.; Mann, K.G.

    1987-07-01

    cDNA clones encoding human factor V have been isolated from an oligo(dT)-primed human fetal liver cDNA library prepared with vector Charon 21A. The cDNA sequence of factor V from three overlapping clones includes a 6672-base-pair (bp) coding region, a 90-bp 5' untranslated region, and a 163-bp 3' untranslated region within which is a poly(A) tail. The deduced amino acid sequence consists of 2224 amino acids inclusive of a 28-amino acid leader peptide. Direct comparison with human factor VIII reveals considerable homology between proteins in amino acid sequence and domain structure: a triplicated A domain and duplicated C domain show approx. 40% identity with the corresponding domains in factor VIII. As in factor VIII, the A domains of factor V share approx. 40% amino acid-sequence homology with the three highly conserved domains in ceruloplasmin. The B domain of factor V contains 35 tandem and approx. 9 additional semiconserved repeats of nine amino acids of the form Asp-Leu-Ser-Gln-Thr-Thr/Asn-Leu-Ser-Pro and 2 additional semiconserved repeats of 17 amino acids. Factor V contains 37 potential N-linked glycosylation sites, 25 of which are in the B domain, and a total of 19 cysteine residues.

  16. Analyses of transcriptome sequences reveal multiple ancient large-scale duplication events in the ancestor of Sphagnopsida (Bryophyta).

    PubMed

    Devos, Nicolas; Szövényi, Péter; Weston, David J; Rothfels, Carl J; Johnson, Matthew G; Shaw, A Jonathan

    2016-07-01

    The goal of this research was to investigate whether there has been a whole-genome duplication (WGD) in the ancestry of Sphagnum (peatmoss) or the class Sphagnopsida, and to determine if the timing of any such duplication(s) and patterns of paralog retention could help explain the rapid radiation and current ecological dominance of peatmosses. RNA sequencing (RNA-seq) data were generated for nine taxa in Sphagnopsida (Bryophyta). Analyses of frequency plots for synonymous substitutions per synonymous site (Ks) between paralogous gene pairs and reconciliation of 578 gene trees were conducted to assess evidence of large-scale or genome-wide duplication events in each transcriptome. Both Ks frequency plots and gene tree-based analyses indicate multiple duplication events in the history of the Sphagnopsida. The most recent WGD event predates divergence of Sphagnum from the two other genera of Sphagnopsida. Duplicate retention is highly variable across species, which might be best explained by local adaptation. Our analyses indicate that the last WGD could have been an important factor underlying the diversification of peatmosses and facilitated their rise to ecological dominance in peatlands. The timing of the duplication events and their significance in the evolutionary history of peat mosses are discussed.

  17. An analysis of amino acid sequences surrounding archaeal glycoprotein sequons.

    PubMed

    Abu-Qarn, Mehtap; Eichler, Jerry

    2007-05-01

    Despite having provided the first example of a prokaryal glycoprotein, little is known of the rules governing the N-glycosylation process in Archaea. As in Eukarya and Bacteria, archaeal N-glycosylation takes place at the Asn residues of Asn-X-Ser/Thr sequons. Since not all sequons are utilized, it is clear that other factors, including the context in which a sequon exists, affect glycosylation efficiency. As yet, the contribution to N-glycosylation made by sequon-bordering residues and other related factors in Archaea remains unaddressed. In the following, the surroundings of Asn residues confirmed by experiment as modified were analyzed in an attempt to define sequence rules and requirements for archaeal N-glycosylation.
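
    The kind of analysis described, locating Asn-X-Ser/Thr sequons and collecting the residues around them, might be sketched as below. This is an illustration under stated assumptions rather than the authors' procedure: the function name and the `flank` parameter are hypothetical, and the pattern follows the abstract's wording literally, so X is allowed to be any residue.

```python
import re

# Asn-X-Ser/Thr in one-letter code. The lookahead lets overlapping
# sequons (for example in "NNST") each be reported.
SEQUON = re.compile(r"(?=(N[A-Z][ST]))")


def find_sequons(protein: str, flank: int = 5):
    """Return (position, sequon, upstream, downstream) for each sequon.

    position is the 1-based index of the Asn residue; upstream and
    downstream hold up to `flank` residues on either side of the
    three-residue sequon and are shorter near the sequence ends.
    """
    protein = protein.upper()
    hits = []
    for match in SEQUON.finditer(protein):
        start = match.start()
        upstream = protein[max(0, start - flank):start]
        downstream = protein[start + 3:start + 3 + flank]
        hits.append((start + 1, match.group(1), upstream, downstream))
    return hits
```

    Tabulating the upstream and downstream windows separately for sequons confirmed as modified and for unmodified ones is one simple way to look for context preferences.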

  18. An integrated portable hand-held analyser for real-time isothermal nucleic acid amplification.

    PubMed

    Smith, Matthew C; Steimle, George; Ivanov, Stan; Holly, Mark; Fries, David P

    2007-08-29

    A compact hand-held heated fluorometric instrument for performing real-time isothermal nucleic acid amplification and detection is described. The optoelectronic instrument combines a Printed Circuit Board/Micro Electro Mechanical Systems (PCB/MEMS) reaction/detection chamber containing an integrated resistive heater with attached miniature LED light source and photo-detector and a disposable glass waveguide capillary to enable a mini-fluorometer. The fluorometer is fabricated and assembled in planar geometry, rolled into a tubular format and packaged with custom control electronics to form the hand-held reactor. Positive or negative results for each reaction are displayed to the user using an LED interface. Reaction data is stored in FLASH memory for retrieval via an in-built USB connection. Operating on one disposable 3 V lithium battery, more than twelve 60 min reactions can be performed. Maximum dimensions of the system are 150 mm (h) x 48 mm (d) x 40 mm (w); the total instrument weight (with battery) is 140 g. The system produces comparable results to laboratory instrumentation when performing a real-time nucleic acid sequence-based amplification (NASBA) reaction, and also displayed comparable precision, accuracy and resolution to laboratory-based real-time nucleic acid amplification instrumentation. A good linear response (R² = 0.948) to fluorescein gradients ranging from 0.5 to 10 µM was also obtained from the instrument, indicating that it may be utilized for other fluorometric assays. This instrument enables an inexpensive, compact approach to in-field genetic screening, providing results comparable to laboratory equipment with rapid user feedback as to the status of the reaction.

  19. Amino acid and DNA analyses in a family with ornithine transcarbamylase deficiency.

    PubMed

    Hou, J W; Wang, T R

    1996-02-01

    Ornithine transcarbamylase (OTC) is a hepatic mitochondrial enzyme involved in the detoxification of ammonia by the urea cycle. OTC deficiency is an X-linked genetic disorder, usually causing neonatal or infantile hyperammonemia, coma and death. We attended a male newborn who had poor feeding since 30 hours of age, at which time, he then rapidly progressed to a comatose state. Hyperammonemia and liver dysfunction were noted. Analysis of plasma amino acids showed elevated levels of glutamine and alanine, but a decreased level of arginine and no citrulline. OTC deficiency was diagnosed by family history of early death of newborn males on the maternal side and characteristic biochemical findings. In addition, it was proved by Southern blot analysis of genomic DNA. Although OTC deficiency has been described as the most common inborn error of ureagenesis in humans, to our knowledge, this is the first report in a Chinese family confirmed by biochemical and DNA analyses.

  20. Organic Analysis in the Miller Range 090657 CR2 Chondrite: Part 2 Amino Acid Analyses

    NASA Technical Reports Server (NTRS)

    Burton, A. S.; Cao, T.; Nakamura-Messenger, K.; Berger, E. L.; Messenger, S.; Clemett, S. J.; Aponte, J. C.; Elsila, J. E.

    2016-01-01

    Primitive carbonaceous chondrites contain a wide variety of organic material, ranging from soluble discrete molecules to insoluble, unstructured kerogen-like components, as well as structured nano-globules of macromolecular carbon. The relationships among the soluble organic molecules, macromolecular organic material, and host minerals are poorly understood. Due to the differences in extractability of soluble and insoluble organic materials, the analysis methods for each differ and are often performed independently. The combination of soluble and insoluble analyses, when performed concurrently, can provide a wider understanding of spatial distribution, and elemental, structural and isotopic composition of organic material in primitive meteorites. Using macroscale extraction and analysis techniques in combination with in situ microscale observation, we have been studying both insoluble and soluble organic material in the primitive CR2 chondrite Miller Range (MIL) 090657. In accompanying abstracts (Cao et al. and Messenger et al.) we discuss insoluble organic material in the samples. By performing the consortium studies, we aim to improve our understanding of the relationship between the meteorite minerals and the soluble and insoluble organic phases and to delineate which species formed within the meteorite and those that formed in nebular or presolar environments. In this abstract, we present the results of amino acid analyses of MIL 090657 by ultra performance liquid chromatography with fluorescence detection and quadrupole-time of flight mass spectrometry. Amino acids are of interest because they are essential to life on Earth, and because they are present in sufficient structural, enantiomeric and isotopic diversity to allow insights into early solar system chemical processes. Furthermore, these are among the most isotopically anomalous species, yet at least some fraction are thought to have formed by aqueously-mediated processes during parent body alteration.

  1. HPLC and ELISA analyses of larval bile acids from Pacific and western brook lampreys

    USGS Publications Warehouse

    Yun, S.-S.; Scott, A.P.; Bayer, J.M.; Seelye, J.G.; Close, D.A.; Li, W.

    2003-01-01

    Comparative studies were performed on two native lamprey species, Pacific lamprey (Lampetra tridentata) and western brook lamprey (Lampetra richardsoni) from the Pacific coast, along with sea lamprey (Petromyzon marinus) from the Great Lakes, to investigate their bile acid production and release. HPLC and ELISA analyses of the gall bladders and liver extract revealed that the major bile acid compound from Pacific and western brook larval lampreys was petromyzonol sulfate (PZS), previously identified as a migratory pheromone in larval sea lamprey. An ELISA for PZS has been developed with a working range of 20 pg to 10 ng per well. The tissue concentrations of PZS in gall bladder were 127.40, 145.86, and 276.96 µg/g body mass in sea lamprey, Pacific lamprey, and western brook lamprey, respectively. Release rates of PZS in the three species, measured by ELISA, showed that western brook and sea lamprey released PZS at rates 20 times higher than Pacific lamprey did. Further studies are required to determine whether PZS is a chemical cue in Pacific and western brook lampreys.

  2. Classification of mouse VK groups based on the partial amino acid sequence to the first invariant tryptophan: impact of 14 new sequences from IgG myeloma proteins.

    PubMed

    Potter, M; Newell, J B; Rudikoff, S; Haber, E

    1982-12-01

    Fourteen new VK sequences derived from BALB/c IgG myeloma proteins were determined to the first invariant tryptophan (Trp 35). These partial sequences were compared with 65 other published VK sequences using a computer program. The 79 sequences were organized according to the length of the sequence from the amino terminus to the first invariant tryptophan (Trp 35), into seven groups (33, 34, 35, 36, 39, 40 and 41aa). A distance matrix of all 79 sequences was then computed, i.e. the number of amino acid substitutions necessary to convert one sequence to another was determined. From these data a dendrogram was constructed. Most of the VK sequences fell into clusters or closely related groups. The definition of a sequence group is arbitrary but facilitates the classification of VK proteins. We used 12 substitutions as the basis for defining a sequence group based on the known number of substitutions that are found in the VK21 proteins. By this criterion there were 18 groups in the Trp 35 dendrogram. Twelve of the 14 new sequences fell into one of these sequence groups; two formed new sequence groups. Collective amino acid sequencing is still encountering new VK structures indicating more sequences will be required to attain an accurate estimate of the total number of VK groups. Updated dendrograms can be quickly generated to include newly generated sequences.
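
    The procedure outlined above (count the substitutions between every pair of sequences, then place sequences within 12 substitutions of each other in the same group) could be sketched as follows. The abstract does not name the clustering method or say how sequences of different lengths were compared, so this sketch makes two labeled assumptions: distances are computed only between sequences of equal length, and groups are formed by single linkage, meaning a sequence joins a group if it is within the threshold of any member. All function names are hypothetical.

```python
from itertools import combinations


def substitutions(seq_a: str, seq_b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(seq_a, seq_b))


def distance_matrix(seqs):
    """Symmetric matrix of pairwise substitution counts."""
    n = len(seqs)
    matrix = [[0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        matrix[i][j] = matrix[j][i] = substitutions(seqs[i], seqs[j])
    return matrix


def group_by_threshold(seqs, max_subs: int = 12):
    """Single-linkage groups of sequence indices (assumed method).

    Two sequences are linked when they differ by at most max_subs
    substitutions; groups are the connected components of those links.
    """
    parent = list(range(len(seqs)))

    def find(i: int) -> int:
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    matrix = distance_matrix(seqs)
    for i, j in combinations(range(len(seqs)), 2):
        if matrix[i][j] <= max_subs:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(seqs)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())
```

    Because the groups depend only on the pairwise matrix, adding newly determined sequences just means extending the matrix and regrouping, which fits the abstract's remark that updated dendrograms can be generated quickly.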

  3. Stable isotope and signature fatty acid analyses suggest reef manta rays feed on demersal zooplankton.

    PubMed

    Couturier, Lydie I E; Rohner, Christoph A; Richardson, Anthony J; Marshall, Andrea D; Jaine, Fabrice R A; Bennett, Michael B; Townsend, Kathy A; Weeks, Scarla J; Nichols, Peter D

    2013-01-01

    Assessing the trophic role and interaction of an animal is key to understanding its general ecology and dynamics. Conventional techniques used to elucidate diet, such as stomach content analysis, are not suitable for large threatened marine species. Non-lethal sampling combined with biochemical methods provides a practical alternative for investigating the feeding ecology of these species. Stable isotope and signature fatty acid analyses of muscle tissue were used for the first time to examine assimilated diet of the reef manta ray Manta alfredi, and were compared with different zooplankton functional groups (i.e. near-surface zooplankton collected during manta ray feeding events and non-feeding periods, epipelagic zooplankton, demersal zooplankton and several different zooplankton taxa). Stable isotope δ(15)N values confirmed that the reef manta ray is a secondary consumer. This species had relatively high levels of docosahexaenoic acid (DHA) indicating a flagellate-based food source in the diet, which likely reflects feeding on DHA-rich near-surface and epipelagic zooplankton. However, high levels of ω6 polyunsaturated fatty acids and slightly enriched δ(13)C values in reef manta ray tissue suggest that they do not feed solely on pelagic zooplankton, but rather obtain part of their diet from another origin. The closest match was with demersal zooplankton, suggesting it is an important component of the reef manta ray diet. The ability to feed on demersal zooplankton is likely linked to the horizontal and vertical movement patterns of this giant planktivore. These new insights into the habitat use and feeding ecology of the reef manta ray will assist in the effective evaluation of its conservation needs.

  4. Stable Isotope and Signature Fatty Acid Analyses Suggest Reef Manta Rays Feed on Demersal Zooplankton

    PubMed Central

    Couturier, Lydie I. E.; Rohner, Christoph A.; Richardson, Anthony J.; Marshall, Andrea D.; Jaine, Fabrice R. A.; Bennett, Michael B.; Townsend, Kathy A.; Weeks, Scarla J.; Nichols, Peter D.

    2013-01-01

    Assessing the trophic role and interaction of an animal is key to understanding its general ecology and dynamics. Conventional techniques used to elucidate diet, such as stomach content analysis, are not suitable for large threatened marine species. Non-lethal sampling combined with biochemical methods provides a practical alternative for investigating the feeding ecology of these species. Stable isotope and signature fatty acid analyses of muscle tissue were used for the first time to examine assimilated diet of the reef manta ray Manta alfredi, and were compared with different zooplankton functional groups (i.e. near-surface zooplankton collected during manta ray feeding events and non-feeding periods, epipelagic zooplankton, demersal zooplankton and several different zooplankton taxa). Stable isotope δ15N values confirmed that the reef manta ray is a secondary consumer. This species had relatively high levels of docosahexaenoic acid (DHA) indicating a flagellate-based food source in the diet, which likely reflects feeding on DHA-rich near-surface and epipelagic zooplankton. However, high levels of ω6 polyunsaturated fatty acids and slightly enriched δ13C values in reef manta ray tissue suggest that they do not feed solely on pelagic zooplankton, but rather obtain part of their diet from another origin. The closest match was with demersal zooplankton, suggesting it is an important component of the reef manta ray diet. The ability to feed on demersal zooplankton is likely linked to the horizontal and vertical movement patterns of this giant planktivore. These new insights into the habitat use and feeding ecology of the reef manta ray will assist in the effective evaluation of its conservation needs. PMID:24167562

  5. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1997-04-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.

  6. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1997-01-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.

  7. Four Moloney murine leukemia virus-infected rat cell clones producing replication-defective particles: protein and nucleic acid analyses.

    PubMed Central

    Yoshimura, F K; Yamamura, J M

    1981-01-01

    Four cloned rat cell lines (NX-1 to -4) infected with Moloney murine leukemia virus and defective in virus replication were found to be all different by viral protein and nucleic acid analyses. All four clones produced noninfectious particles and, except for NX-2, at about the same level as wild type. Compared with wild-type virions these defective particles contained larger amounts of gag precursor proteins and very little or no p30 or p15. Analysis of intracellular precursor proteins revealed that NX-2 to -4 synthesized normal Pr65gag, whereas NX-1 produced a slightly smaller precursor. Both NX-1 and NX-4 synthesized an intracellular polyprotein with a size similar to that of wild-type Pr180 gag-pol. Restriction endonuclease analysis of NX-1 to -4 cellular DNA showed that each clone contained a single integrated provirus which possessed large terminal repeat sequences at both the 5' and 3' ends. The proviruses of NX-1 to -3 appeared normal by restriction endonuclease analysis, but NX-4 provirus had a deletion of 1,700 base pairs comprising part of the polymerase region. The noninfectious particles produced by all four clones packaged Moloney viral RNAs and rat RNAs of two different sizes. PMID:6165841

  8. Live births after simultaneous avoidance of monogenic diseases and chromosome abnormality by next-generation sequencing with linkage analyses.

    PubMed

    Yan, Liying; Huang, Lei; Xu, Liya; Huang, Jin; Ma, Fei; Zhu, Xiaohui; Tang, Yaqiong; Liu, Mingshan; Lian, Ying; Liu, Ping; Li, Rong; Lu, Sijia; Tang, Fuchou; Qiao, Jie; Xie, X Sunney

    2015-12-29

    In vitro fertilization (IVF), preimplantation genetic diagnosis (PGD), and preimplantation genetic screening (PGS) help patients to select embryos free of monogenic diseases and aneuploidy (chromosome abnormality). Next-generation sequencing (NGS) methods, while experiencing a rapid cost reduction, have improved the precision of PGD/PGS. However, the precision of PGD has been limited by the false-positive and false-negative single-nucleotide variations (SNVs), which are not acceptable in IVF and can be circumvented by linkage analyses, such as short tandem repeats or karyomapping. It is noteworthy that existing methods of detecting SNV/copy number variation (CNV) and linkage analysis often require separate procedures for the same embryo. Here we report an NGS-based PGD/PGS procedure that can simultaneously detect a single-gene disorder and aneuploidy and is capable of linkage analysis in a cost-effective way. This method, called "mutated allele revealed by sequencing with aneuploidy and linkage analyses" (MARSALA), involves multiple annealing and looping-based amplification cycles (MALBAC) for single-cell whole-genome amplification. Aneuploidy is determined by CNVs, whereas SNVs associated with the monogenic diseases are detected by PCR amplification of the MALBAC product. The false-positive and -negative SNVs are avoided by an NGS-based linkage analysis. Two healthy babies, free of the monogenic diseases of their parents, were born after such embryo selection. The monogenic diseases originated from a single base mutation on the autosome and the X-chromosome of the disease-carrying father and mother, respectively.

  9. Amino acid sequence around the active-site serine residue in the acyltransferase domain of goat mammary fatty acid synthetase.

    PubMed Central

    Mikkelsen, J; Højrup, P; Rasmussen, M M; Roepstorff, P; Knudsen, J

    1985-01-01

    Goat mammary fatty acid synthetase was labelled in the acyltransferase domain by formation of O-ester intermediates by incubation with [1-14C]acetyl-CoA and [2-14C]malonyl-CoA. Tryptic-digest and CNBr-cleavage peptides were isolated and purified by high-performance reverse-phase and ion-exchange liquid chromatography. The sequences of the malonyl- and acetyl-labelled peptides were shown to be identical. The results confirm the hypothesis that both acetyl and malonyl groups are transferred to the mammalian fatty acid synthetase complex by the same transferase. The sequence is compared with those of other fatty acid synthetase transferases. PMID:3922356

  10. Ligation with nucleic acid sequence-based amplification.

    PubMed

    Ong, Carmichael; Tai, Warren; Sarma, Aartik; Opal, Steven M; Artenstein, Andrew W; Tripathi, Anubhav

    2012-01-01

    This work presents a novel method for detecting nucleic acid targets using a ligation step along with an isothermal, exponential amplification step. We use an engineered ssDNA with two variable regions on the ends, allowing us to design the probe for optimal reaction kinetics and primer binding. This two-part probe is ligated by T4 DNA Ligase only when both parts bind adjacently to the target. The assay demonstrates that the expected 72-nt RNA product appears only when the synthetic target, T4 ligase, and both probe fragments are present during the ligation step. An extraneous 38-nt RNA product also appears due to linear amplification of unligated probe (P3), but its presence does not cause a false-positive result. In addition, 40 mmol/L KCl in the final amplification mix was found to be optimal. It was also found that increasing P5 in excess of P3 helped with ligation and reduced the extraneous 38-nt RNA product. The assay was also tested with a single nucleotide polymorphism target, changing one base at the ligation site. The assay was able to yield a negative signal despite only a single-base change. Finally, using P3 and P5 with longer binding sites results in increased overall sensitivity of the reaction, showing that increasing ligation efficiency can improve the assay overall. We believe that this method can be used effectively for a number of diagnostic assays.

  11. Hedysarum L. (Fabaceae: Hedysareae) Is Not Monophyletic – Evidence from Phylogenetic Analyses Based on Five Nuclear and Five Plastid Sequences

    PubMed Central

    Liu, Pei-Liang; Wen, Jun; Duan, Lei; Arslan, Emine; Ertuğrul, Kuddisi; Chang, Zhao-Yang

    2017-01-01

    The legume family (Fabaceae) exhibits a high level of species diversity and evolutionary success worldwide. Previous phylogenetic studies of the genus Hedysarum L. (Fabaceae: Hedysareae) showed that the nuclear and the plastid topologies might be incongruent, and the systematic position of the Hedysarum sect. Stracheya clade was uncertain. In this study, phylogenetic relationships of Hedysarum were investigated based on the nuclear ITS, ETS, PGDH, SQD1, TRPT and the plastid psbA-trnH, trnC-petN, trnL-trnF, trnS-trnG, petN-psbM sequences. Both nuclear and plastid data support two major lineages in Hedysarum: the Hedysarum s.s. clade and the Sartoria clade. In the nuclear tree, Hedysarum is biphyletic with the Hedysarum s.s. clade sister to the Corethrodendron + Eversmannia + Greuteria + Onobrychis clade (the CEGO clade), whereas the Sartoria clade is sister to the genus Taverniera DC. In the plastid tree, Hedysarum is monophyletic and sister to Taverniera. The incongruent position of the Hedysarum s.s. clade between the nuclear and plastid trees may be best explained by a chloroplast capture hypothesis via introgression. The Hedysarum sect. Stracheya clade is resolved as sister to the H. sect. Hedysarum clade in both nuclear and plastid trees, and our analyses support merging Stracheya into Hedysarum. Based on our new evidence from multiple sequences, Hedysarum is not monophyletic, and its generic delimitation needs to be reconsidered. PMID:28122062

  12. Deep sequencing and in silico analyses identify MYB-regulated gene networks and signaling pathways in pancreatic cancer

    PubMed Central

    Azim, Shafquat; Zubair, Haseeb; Srivastava, Sanjeev K.; Bhardwaj, Arun; Zubair, Asif; Ahmad, Aamir; Singh, Seema; Khushman, Moh’d.; Singh, Ajay P.

    2016-01-01

    We have recently demonstrated that the transcription factor MYB can modulate several cancer-associated phenotypes in pancreatic cancer. In order to understand the molecular basis of these MYB-associated changes, we conducted deep-sequencing of transcriptome of MYB-overexpressing and -silenced pancreatic cancer cells, followed by in silico pathway analysis. We identified significant modulation of 774 genes upon MYB-silencing (p < 0.05) that were assigned to 25 gene networks by in silico analysis. Further analyses placed genes in our RNA sequencing-generated dataset to several canonical signalling pathways, such as cell-cycle control, DNA-damage and -repair responses, p53 and HIF1α. Importantly, we observed downregulation of the pancreatic adenocarcinoma signaling pathway in MYB-silenced pancreatic cancer cells exhibiting suppression of EGFR and NF-κB. Decreased expression of EGFR and RELA was validated by both qPCR and immunoblotting and they were both shown to be under direct transcriptional control of MYB. These observations were further confirmed in a converse approach wherein MYB was overexpressed ectopically in a MYB-null pancreatic cancer cell line. Our findings thus suggest that MYB potentially regulates growth and genomic stability of pancreatic cancer cells via targeting complex gene networks and signaling pathways. Further in-depth functional studies are warranted to fully understand MYB signaling in pancreatic cancer. PMID:27354262

  13. Comparative genomic analyses identify the Vibrio harveyi genome sequenced strains BAA-1116 and HY01 as Vibrio campbellii.

    PubMed

    Lin, Baochuan; Wang, Zheng; Malanoski, Anthony P; O'Grady, Elizabeth A; Wimpee, Charles F; Vuddhakul, Varaporn; Alves Jr, Nelson; Thompson, Fabiano L; Gomez-Gil, Bruno; Vora, Gary J

    2010-02-01

    Three notable members of the Harveyi clade, Vibrio harveyi, Vibrio alginolyticus and Vibrio parahaemolyticus, are best known as marine pathogens of commercial and medical import. In spite of this fact, the discrimination of Harveyi clade members remains difficult due to genetic and phenotypic similarities, and this has led to misidentifications and inaccurate estimations of a species' involvement in certain environments. To begin to understand the underlying genetics that complicate species level discrimination, we compared the genomes of Harveyi clade members isolated from different environments (seawater, shrimp, corals, oysters, finfish, humans) using microarray-based comparative genomic hybridization (CGH) and multilocus sequence analyses (MLSA). Surprisingly, we found that the only two V. harveyi strains that have had their genomes sequenced (strains BAA-1116 and HY01) have themselves been misidentified. Instead of belonging to the species harveyi, they are actually members of the species campbellii. In total, 28% of the strains tested were found to be misidentified and 42% of these appear to comprise a novel species. Taken together, our findings correct a number of species misidentifications while validating the ability of both CGH and MLSA to distinguish closely related members of the Harveyi clade.

  14. Deep sequencing and transcriptome analyses to identify genes involved in secoiridoid biosynthesis in the Tibetan medicinal plant Swertia mussotii.

    PubMed

    Liu, Yue; Wang, Yi; Guo, Fengxian; Zhan, Lin; Mohr, Toni; Cheng, Prisca; Huo, Naxin; Gu, Ronghui; Pei, Danning; Sun, Jiaqing; Tang, Li; Long, Chunlin; Huang, Luqi; Gu, Yong Q

    2017-02-22

    Swertia mussotii Franch. is an important traditional Tibetan medicinal plant with pharmacological properties effective in the treatment of various ailments including hepatitis. Secoiridoids are the major bioactive compounds in S. mussotii. To better understand the secoiridoid biosynthesis pathway, we generated transcriptome sequences from the root, leaf, stem, and flower tissues, and performed de novo sequence assembly, yielding 98,613 unique transcripts with an N50 of 1,085 bp. Putative functions could be assigned to 35,029 transcripts (35.52%) based on BLAST searches against annotation databases including GO and KEGG. The expression profiles of 39 candidate transcripts encoding the key enzymes for secoiridoid biosynthesis were examined in different S. mussotii tissues, validated by qRT-PCR, and compared with the homologous genes from S. japonica, a species in the same family, unveiling the gene expression, regulation, and conservation of the pathway. The examination of the accumulated levels of three bioactive compounds, sweroside, swertiamarin, and gentiopicroside, revealed their considerable variations in different tissues, with no significant correlation with the expression profiles of key genes in the pathway, suggesting complex biological behaviours in the coordination of metabolite biosynthesis and accumulation. The genomic dataset and analyses presented here lay the foundation for further research on this important medicinal plant.

  15. Deep sequencing and transcriptome analyses to identify genes involved in secoiridoid biosynthesis in the Tibetan medicinal plant Swertia mussotii

    PubMed Central

    Liu, Yue; Wang, Yi; Guo, Fengxian; Zhan, Lin; Mohr, Toni; Cheng, Prisca; Huo, Naxin; Gu, Ronghui; Pei, Danning; Sun, Jiaqing; Tang, Li; Long, Chunlin; Huang, Luqi; Gu, Yong Q.

    2017-01-01

    Swertia mussotii Franch. is an important traditional Tibetan medicinal plant with pharmacological properties effective in the treatment of various ailments including hepatitis. Secoiridoids are the major bioactive compounds in S. mussotii. To better understand the secoiridoid biosynthesis pathway, we generated transcriptome sequences from the root, leaf, stem, and flower tissues, and performed de novo sequence assembly, yielding 98,613 unique transcripts with an N50 of 1,085 bp. Putative functions could be assigned to 35,029 transcripts (35.52%) based on BLAST searches against annotation databases including GO and KEGG. The expression profiles of 39 candidate transcripts encoding the key enzymes for secoiridoid biosynthesis were examined in different S. mussotii tissues, validated by qRT-PCR, and compared with the homologous genes from S. japonica, a species in the same family, unveiling the gene expression, regulation, and conservation of the pathway. The examination of the accumulated levels of three bioactive compounds, sweroside, swertiamarin, and gentiopicroside, revealed their considerable variations in different tissues, with no significant correlation with the expression profiles of key genes in the pathway, suggesting complex biological behaviours in the coordination of metabolite biosynthesis and accumulation. The genomic dataset and analyses presented here lay the foundation for further research on this important medicinal plant. PMID:28225035
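    The N50 figure quoted for the assembly above is a length-weighted median: the transcript length at which the running total of lengths, sorted longest first, reaches half of the assembled bases. The following is a minimal sketch of that calculation. The function name and the toy lengths are illustrative assumptions and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or transcript lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length that pushes the running sum to half the total is the N50.
        if running >= half_total:
            return length


# Hypothetical lengths in bp, for illustration only.
toy_lengths = [2000, 1500, 1085, 900, 400, 300, 200]
print(n50(toy_lengths))
```

    Because the statistic is weighted by length, a handful of long transcripts can raise N50 even when most transcripts are short.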

  16. Sequence and stress-response analyses of the DNA mismatch repair gene hexA in Lactococcus lactis.

    PubMed

    Ren, J; Park, J H; Dunn, N W; Kim, W S

    2001-10-01

    The DNA mismatch repair gene hexA was identified in Lactococcus lactis by PCR amplification by using a pair of primers homologous to the DNA-binding Dps protein. The gene in its entirety, including the regulatory regions, was sequenced, by using a strategy of chromosomal walking based on two PCR protocols. The open reading frame of 2526 bp was preceded by a strong ribosome-binding site (AGGAAG) and was followed by a potential transcription terminator (hairpin loop structure). The 5' terminus of the hexA mRNA was located 135 bp upstream of the start codon, and putative -10 and -35 regions were identified. The deduced amino acid sequence revealed two motifs, the ATP/GTP-binding site (P-loop) and the "MutS family signature". The hexA promoter was cloned into pMU1327, which contained a promoter-less CAT reporter gene, and the promoter activity was examined under oxidative-stress conditions. It appears that the promoter activity is down-shifted by H2O2 at 4 mM.

  17. Thin-film technology for direct visual detection of nucleic acid sequences: applications in clinical research.

    PubMed

    Jenison, Robert D; Bucala, Richard; Maul, Diana; Ward, David C

    2006-01-01

    Certain optical conditions permit the unaided eye to detect thickness changes on surfaces on the order of 20 Å, which are of similar dimensions to monomolecular interactions between proteins or hybridization of complementary nucleic acid sequences. Such detection exploits specific interference of reflected white light, wherein thickness changes are perceived as surface color changes. This technology, termed thin-film detection, allows for the visualization of subattomole amounts of nucleic acid targets, even in complex clinical samples. Thin-film technology has been applied to a broad range of clinically relevant indications, including the detection of pathogenic bacterial and viral nucleic acid sequences and the discrimination of sequence variations in human genes causally related to susceptibility or severity of disease.

  18. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences]

    NASA Technical Reports Server (NTRS)

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.
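    The two randomness measures described above (uneven amino acid usage and uneven usage of adjacent pairs) can be illustrated with a short calculation. This is a simplified sketch and not necessarily the procedure used in the report: it assumes a 20-letter alphabet, estimates probabilities from raw counts, and defines redundancy as one minus the ratio of observed entropy to the maximum entropy log2(20). The function names and the example sequence are hypothetical.

```python
import math
from collections import Counter

ALPHABET_SIZE = 20  # the 20 standard amino acids


def entropy(counts):
    """Shannon entropy (bits) of the empirical distribution in a Counter."""
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def composition_redundancy(seq):
    """Redundancy from single-residue frequencies: 1 - H1 / log2(20)."""
    return 1 - entropy(Counter(seq)) / math.log2(ALPHABET_SIZE)


def pair_redundancy(seq):
    """Redundancy from adjacent pairs, using H(next | current).

    Requires a sequence of at least two residues.
    """
    pairs = Counter(zip(seq, seq[1:]))   # joint counts of (current, next)
    firsts = Counter(seq[:-1])           # marginal counts of the current residue
    conditional = entropy(pairs) - entropy(firsts)
    return 1 - conditional / math.log2(ALPHABET_SIZE)


# Hypothetical sequence, for illustration only.
example = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"
print(round(composition_redundancy(example), 3))
print(round(pair_redundancy(example), 3))
```

    Counts from a short chain sample the 400 possible pairs very sparsely, so the pair-based value is inflated for short sequences. This is in line with the abstract's remark that short chain length constrains the measured redundancy.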

  19. RNA internal standard synthesis by nucleic acid sequence-based amplification for competitive quantitative amplification reactions.

    PubMed

    Lo, Wan-Yu; Baeumner, Antje J

    2007-02-15

    Nucleic acid sequence-based amplification (NASBA) reactions have been demonstrated to successfully synthesize new sequences based on deletion and insertion reactions. Two RNA internal standards were synthesized for use in competitive amplification reactions in which quantitative analysis can be achieved by coamplifying the internal standard with the wild type sample. The sequences were created in two consecutive NASBA reactions using the E. coli clpB mRNA sequence as model analyte. The primer sequences of the wild type sequence were maintained, and a 20-nt-long segment inside the amplicon region was exchanged for a new segment of similar GC content and melting temperature. The new RNA sequence was thus amplifiable using the wild type primers and detectable via a new inserted sequence. In the first reaction, the forwarding primer and an additional 20-nt-long sequence was deleted and replaced by a new 20-nt-long sequence. In the second reaction, a forwarding primer containing as 5' overhang sequence the wild type primer sequence was used. The presence of pure internal standard was verified using electrochemiluminescence and RNA lateral-flow biosensor analysis. Additional sequence deletion in order to shorten the internal standard amplicons and thus generate higher detection signals was found not to be required. Finally, a competitive NASBA reaction between one internal standard and the wild type sequence was carried out proving its functionality. This new rapid construction method via NASBA provides advantages over the traditional techniques since it requires no traditional cloning procedures, no thermocyclers, and can be completed in less than 4 h.
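    The design constraint mentioned above, replacing a 20-nt segment with one of similar GC content and melting temperature, can be checked with a few lines of code. The sketch below uses the Wallace rule (2 °C per A/T, 4 °C per G/C), a common rough estimate for short oligonucleotides; the abstract does not say how melting temperature was estimated, so this choice is an assumption. Both sequences are invented for illustration and are not the clpB segments.

```python
def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rough melting temperature (deg C) by the Wallace rule: 2*(A+T) + 4*(G+C)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T") + seq.count("U")  # U counted for RNA input
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Hypothetical 20-nt segments, for illustration only.
wild_type_segment = "ATGCGTACCGTTAGCTAGCA"
replacement_segment = "TCAGGCTAACGTCGATTCAG"

for name, seq in [("wild type", wild_type_segment), ("replacement", replacement_segment)]:
    print(name, len(seq), gc_content(seq), wallace_tm(seq))
```

    The Wallace rule ignores salt concentration and nearest-neighbour effects, so it is suited only to a first-pass comparison of candidate segments.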

  20. metaBIT, an integrative and automated metagenomic pipeline for analysing microbial profiles from high-throughput sequencing shotgun data.

    PubMed

    Louvel, Guillaume; Der Sarkissian, Clio; Hanghøj, Kristian; Orlando, Ludovic

    2016-11-01

    Micro-organisms account for most of the Earth's biodiversity and yet remain largely unknown. The complexity and diversity of microbial communities present in clinical and environmental samples can now be robustly investigated in record times and prices thanks to recent advances in high-throughput DNA sequencing (HTS). Here, we develop metaBIT, an open-source computational pipeline automatizing routine microbial profiling of shotgun HTS data. Customizable by the user at different stringency levels, it performs robust taxonomy-based assignment and relative abundance calculation of microbial taxa, as well as cross-sample statistical analyses of microbial diversity distributions. We demonstrate the versatility of metaBIT within a range of published HTS data sets sampled from the environment (soil and seawater) and the human body (skin and gut), but also from archaeological specimens. We present the diversity of outputs provided by the pipeline for the visualization of microbial profiles (barplots, heatmaps) and for their characterization and comparison (diversity indices, hierarchical clustering and principal coordinates analyses). We show that metaBIT allows an automatic, fast and user-friendly profiling of the microbial DNA present in HTS shotgun data sets. The applications of metaBIT are vast, from monitoring of laboratory errors and contaminations, to the reconstruction of past and present microbiota, and the detection of candidate species, including pathogens.

  1. Cloning, sequence analysis and expression of the F1F0-ATPase beta-subunit from wine lactic acid bacteria.

    PubMed

    Sievers, Martin; Uermösi, Christina; Fehlmann, Marc; Krieger, Sibylle

    2003-09-01

    The nucleotide sequences of the genes encoding the F1F0-ATPase beta-subunit from Oenococcus oeni, Leuconostoc mesenteroides subsp. mesenteroides, Pediococcus damnosus, Pediococcus parvulus, Lactobacillus brevis and Lactobacillus hilgardii were determined. Their deduced amino acid sequences showed homology values of 79-98%. Data from the alignment and ATPase tree indicated that O. oeni and L. mesenteroides subsp. mesenteroides formed a group well-separated from P. damnosus and P. parvulus and from the group comprising L. brevis and L. hilgardii. The N-terminus of the F1F0-ATPase beta-subunit of O. oeni contains an additional stretch of 38 amino acid residues. The catalytic site of the ATPase beta-subunit of the investigated strains is characterized by the two conserved motifs GGAGVGKT and GERTRE. The amplified atpD coding sequences were inserted into the pCRT7/CT-TOPO vector using a TA-cloning strategy and transformed into Escherichia coli. SDS-PAGE and Western blot analyses confirmed that O. oeni has an ATPase beta-subunit protein which is larger in size than the corresponding molecules from the investigated strains.

  2. Time-resolved detection probe for homogeneous nucleic acid analyses in one-step format.

    PubMed

    Laitala, Ville; Ylikoski, Alice; Raussi, Hanna-Mari; Ollikka, Pia; Hemmilä, Ilkka

    2007-02-01

    We report here an extension of homogeneous assays based on fluorescence intensity and lifetime measuring on DNA hybridization. A novel decay probe that allows simple one-step nucleic acid detection with subnanomolar sensitivity, and is suitable for closed-tube applications, is introduced. The decay probe uses fluorescence resonance energy transfer (FRET) between a europium chelate donor and an organic fluorophore acceptor. The substantial change in the acceptor emission decay time on hybridization with the target sequence allows the direct separation of the hybridized and unhybridized probe populations in a time-resolved measurement. No additional sample manipulation or self-hybridization of the probes is required. The wavelength and decay time of a decay probe can be adjusted according to the selection of probe length and acceptor fluorophore, thereby making the probes applicable to multiplexed assays. Here we demonstrate the decay probe principle and decay probe-based, one-step, dual DNA assay using celiac disease-related target oligonucleotides (single-nucleotide polymorphisms [SNPs]) as model analytes. Decay probes showed specific response for their complementary DNA target and allowed good signal deconvolution based on simultaneous optical and temporal filtering. This technique potentially could be used to further increase the number of simultaneously detected DNA targets in a simple one-step homogeneous assay.

  3. Amino acid sequences of two nonspecific lipid-transfer proteins from germinated castor bean.

    PubMed

    Takishima, K; Watanabe, S; Yamada, M; Suga, T; Mamiya, G

    1988-11-01

    The amino acid sequences of two nonspecific lipid-transfer proteins (nsLTP), B and C, from germinated castor bean seeds have been determined. Both proteins consist of 92 residues, as for the nsLTP previously reported, and their calculated Mr values are 9847 and 9593 for nsLTP-B and nsLTP-C, respectively. The sequences of nsLTP-B and nsLTP-C, compared to the known sequence of nsLTP-A from the same source, are 68% and 35% similar, respectively. No variation was found at the positions of the cysteine residues, indicating that they might be involved in disulfide bridges.

  4. A classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B

    1991-01-01

    The amino acid sequences of 301 glycosyl hydrolases and related enzymes have been compared. A total of 291 sequences corresponding to 39 EC entries could be classified into 35 families. Only ten sequences (less than 5% of the sample) could not be assigned to any family. With the sequences available for this analysis, 18 families were found to be monospecific (containing only one EC number) and 17 were found to be polyspecific (containing at least two EC numbers). Implications on the folding characteristics and mechanism of action of these enzymes and on the evolution of carbohydrate metabolism are discussed. With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised. PMID:1747104

  5. Complete amino acid sequence of the N-terminal extension of calf skin type III procollagen.

    PubMed Central

    Brandt, A; Glanville, R W; Hörlein, D; Bruckner, P; Timpl, R; Fietzek, P P; Kühn, K

    1984-01-01

    The N-terminal extension peptide of type III procollagen, isolated from foetal-calf skin, contains 130 amino acid residues. To determine its amino acid sequence, the peptide was reduced and carboxymethylated or aminoethylated and fragmented with trypsin, Staphylococcus aureus V8 proteinase and bacterial collagenase. Pyroglutamate aminopeptidase was used to deblock the N-terminal collagenase fragment to enable amino acid sequencing. The type III collagen extension peptide is homologous to that of the alpha 1 chain of type I procollagen with respect to a three-domain structure. The N-terminal 79 amino acids, which contain ten of the 12 cysteine residues, form a compact globular domain. The next 39 amino acids are in a collagenous triplet sequence (Gly-Xaa-Yaa)n with a high hydroxyproline content. Finally, another short non-collagenous domain of 12 amino acids ends at the cleavage site for procollagen aminopeptidase, which cleaves a proline-glutamine bond. In contrast with type I procollagen, the type III procollagen extension peptides contain interchain disulphide bridges located at the C-terminus of the triple-helical domain. PMID:6331392

  6. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  7. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  8. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  9. Molecular Biocomputing Suite: a word processor add-in for the analysis and manipulation of nucleic acid and protein sequence data.

    PubMed

    Muller, P Y; Studer, E; Miserez, A R

    2001-12-01

    In all fields of molecular biology, researchers are increasingly challenged by experiments planned and evaluated on the basis of nucleic acid and protein sequence data generally retrieved from public databases. Despite the wide spectrum of available Web-based software tools for sequence analysis, the routine use of these tools has disadvantages, particularly because of the elaborate and heterogeneous ways of data input, output, and storage. Here we present a Visual Basic-encoded Microsoft Word Add-In, the Molecular BioComputing Suite (MBCS), available at the BioTechniques Software Library (www.BioTechniques.com). The MBCS software aims to manage and expedite a wide range of sequence analyses and manipulations using an integrated text editor environment including menu-guided commands. Its independence of sequence formats enables MBCS to be used as a pivotal application between other software tools for sequence analysis, manipulation, annotation, and editing.

  10. Complete amino acid sequence of branched-chain amino acid aminotransferase (transaminase B) of Salmonella typhimurium, identification of the coenzyme-binding site and sequence comparison analysis

    SciTech Connect

    Feild, M.J.

    1988-01-01

    The complete amino acid sequence of the subunit of branched-chain amino acid aminotransferase of Salmonella typhimurium was determined by automated Edman degradation of peptide fragments generated by chemical and enzymatic digestion of S-carboxymethylated and S-pyridylethylated transaminase B. Peptide fragments of transaminase B were generated by treatment of the enzyme with trypsin, Staphylococcus aureus V8 protease, endoproteinase Lys-C, and cyanogen bromide. Protocols were developed for separation of the peptide fragments by reverse-phase high performance liquid chromatography (HPLC), ion-exchange HPLC, and SDS-urea gel electrophoresis. The enzyme subunit contains 308 amino acid residues and has a molecular weight of 33,920 daltons. The coenzyme-binding site was determined by treatment of the enzyme, containing bound pyridoxal 5-phosphate, with tritiated sodium borohydride prior to trypsin digestion. Monitoring radioactivity incorporation and comparing peptide maps with an apoenzyme tryptic digest allowed identification of the pyridoxylated peptide, which was isolated by reverse-phase HPLC and sequenced. The coenzyme-binding site is a lysyl residue at position 159. Some peptides were further characterized by fast atom bombardment mass spectrometry.

  11. Comparative Analyses of Plastid Sequences between Native and Introduced Populations of Aquatic Weeds Elodea canadensis and E. nuttallii

    PubMed Central

    Huotari, Tea; Korpelainen, Helena

    2013-01-01

    Non-indigenous species (NIS) are species living outside their historic or native range. Invasive NIS often cause severe environmental impacts, and may have large economical and social consequences. Elodea (Hydrocharitaceae) is a New World genus with at least five submerged aquatic angiosperm species living in fresh water environments. Our aim was to survey the geographical distribution of cpDNA haplotypes within the native and introduced ranges of invasive aquatic weeds Elodea canadensis and E. nuttallii and to reconstruct the spreading histories of these invasive species. In order to reveal informative chloroplast (cp) genome regions for phylogeographic analyses, we compared the plastid sequences of native and introduced individuals of E. canadensis. In total, we found 235 variable sites (186 SNPs, 47 indels and two inversions) between the two plastid sequences consisting of 112,193 bp and developed primers flanking the most variable genomic areas. These 29 primer pairs were used to compare the level and pattern of intraspecific variation within E. canadensis to interspecific variation between E. canadensis and E. nuttallii. Nine potentially informative primer pairs were used to analyze the phylogeographic structure of both Elodea species, based on 70 E. canadensis and 25 E. nuttallii individuals covering native and introduced distributions. On the whole, the level of variation between the two Elodea species was 53% higher than that within E. canadensis. In our phylogeographic analysis, only a single haplotype was found in the introduced range in both species. These haplotypes H1 (E. canadensis) and A (E. nuttallii) were also widespread in the native range, covering the majority of native populations analyzed. Therefore, we were not able to identify either the geographic origin of the introduced populations or test the hypothesis of single versus multiple introductions. The divergence between E. canadensis haplotypes was surprisingly high, and future research may

  12. Comparative analyses of plastid sequences between native and introduced populations of aquatic weeds Elodea canadensis and E. nuttallii.

    PubMed

    Huotari, Tea; Korpelainen, Helena

    2013-01-01

    Non-indigenous species (NIS) are species living outside their historic or native range. Invasive NIS often cause severe environmental impacts, and may have large economical and social consequences. Elodea (Hydrocharitaceae) is a New World genus with at least five submerged aquatic angiosperm species living in fresh water environments. Our aim was to survey the geographical distribution of cpDNA haplotypes within the native and introduced ranges of invasive aquatic weeds Elodea canadensis and E. nuttallii and to reconstruct the spreading histories of these invasive species. In order to reveal informative chloroplast (cp) genome regions for phylogeographic analyses, we compared the plastid sequences of native and introduced individuals of E. canadensis. In total, we found 235 variable sites (186 SNPs, 47 indels and two inversions) between the two plastid sequences consisting of 112,193 bp and developed primers flanking the most variable genomic areas. These 29 primer pairs were used to compare the level and pattern of intraspecific variation within E. canadensis to interspecific variation between E. canadensis and E. nuttallii. Nine potentially informative primer pairs were used to analyze the phylogeographic structure of both Elodea species, based on 70 E. canadensis and 25 E. nuttallii individuals covering native and introduced distributions. On the whole, the level of variation between the two Elodea species was 53% higher than that within E. canadensis. In our phylogeographic analysis, only a single haplotype was found in the introduced range in both species. These haplotypes H1 (E. canadensis) and A (E. nuttallii) were also widespread in the native range, covering the majority of native populations analyzed. Therefore, we were not able to identify either the geographic origin of the introduced populations or test the hypothesis of single versus multiple introductions. The divergence between E. canadensis haplotypes was surprisingly high, and future research may

  13. Minding the gap: Frequency of indels in mtDNA control region sequence data and influence on population genetic analyses

    USGS Publications Warehouse

    Pearce, J.M.

    2006-01-01

    Insertions and deletions (indels) result in sequences of various lengths when homologous gene regions are compared among individuals or species. Although indels are typically phylogenetically informative, occurrence and incorporation of these characters as gaps in intraspecific population genetic data sets are rarely discussed. Moreover, the impact of gaps on estimates of fixation indices, such as FST, has not been reviewed. Here, I summarize the occurrence and population genetic signal of indels among 60 published studies that involved alignments of multiple sequences from the mitochondrial DNA (mtDNA) control region of vertebrate taxa. Among 30 studies observing indels, an average of 12% of both variable and parsimony-informative sites were composed of these sites. There was no consistent trend between levels of population differentiation and the number of gap characters in a data block. Across all studies, the average influence on estimates of ΦST was small, explaining only an additional 1.8% of among population variance (range 0.0-8.0%). Studies most likely to observe an increase in ΦST with the inclusion of gap characters were those with < 20 variable sites, but a near equal number of studies with few variable sites did not show an increase. In contrast to studies at interspecific levels, the influence of indels for intraspecific population genetic analyses of control region DNA appears small, dependent upon total number of variable sites in the data block, and related to species-specific characteristics and the spatial distribution of mtDNA lineages that contain indels. © 2006 Blackwell Publishing Ltd.

  14. The amino acid sequence of cytochromes c-551 from three species of Pseudomonas

    PubMed Central

    Ambler, R. P.; Wynn, Margaret

    1973-01-01

    The amino acid sequences of the cytochromes c-551 from three species of Pseudomonas have been determined. Each resembles the protein from Pseudomonas strain P6009 (now known to be Pseudomonas aeruginosa, not Pseudomonas fluorescens) in containing 82 amino acids in a single peptide chain, with a haem group covalently attached to cysteine residues 12 and 15. In all four sequences 43 residues are identical. Although by bacteriological criteria the organisms are closely related, the differences between pairs of sequences range from 22% to 39%. These values should be compared with the differences in the sequence of mitochondrial cytochrome c between mammals and amphibians (about 18%) or between mammals and insects (about 33%). Detailed evidence for the amino acid sequences of the proteins has been deposited as Supplementary Publication SUP 50015 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1973), 131, 5. PMID:4352718

  15. Draft Genome Sequence of Sorghum Grain Mold Fungus Epicoccum sorghinum, a Producer of Tenuazonic Acid

    PubMed Central

    Oliveira, Rodrigo C.; Davenport, Karen W.; Hovde, Blake; Silva, Danielle; Chain, Patrick S. G.; Correa, Benedito

    2017-01-01

    ABSTRACT The facultative plant pathogen Epicoccum sorghinum is associated with grain mold of sorghum and produces the mycotoxin tenuazonic acid. This fungus can have serious economic impact on sorghum production. Here, we report the draft genome sequence of E. sorghinum (USPMTOX48). PMID:28126937

  16. Snake venom. The amino acid sequence of protein A from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1980-12-01

    Protein A from Dendroaspis polylepis polylepis venom comprises 81 amino acids, including ten half-cystine residues. The complete primary structures of protein A and its variant A' were elucidated. The sequences of proteins A and A', which differ in a single position, show no homology with various neurotoxins and non-neurotoxic proteins and represent a new type of elapid venom protein.

  17. Draft Genome Sequence of Bacillus coagulans NL01, a Wonderful l-Lactic Acid Producer

    PubMed Central

    Zheng, Zhaojuan; Jiang, Ting; Lin, Xi; Zhou, Jie

    2015-01-01

    Here, we report the draft genome sequence of Bacillus coagulans NL01, which can produce l-lactic acid of high optical purity using xylose as a sole carbon source. The draft genome is 3,505,081 bp, with 144 contigs. About 3,903 protein-coding genes and 92 rRNAs are predicted from this assembly. PMID:26089419

  18. Transcriptome sequencing and genome-wide association analyses reveal lysosomal function and actin cytoskeleton remodeling in schizophrenia and bipolar disorder.

    PubMed

    Zhao, Z; Xu, J; Chen, J; Kim, S; Reimers, M; Bacanu, S-A; Yu, H; Liu, C; Sun, J; Wang, Q; Jia, P; Xu, F; Zhang, Y; Kendler, K S; Peng, Z; Chen, X

    2015-05-01

    Schizophrenia (SCZ) and bipolar disorder (BPD) are severe mental disorders with high heritability. Clinicians have long noticed the similarities in clinical symptoms between these disorders. In recent years, accumulating evidence indicates some shared genetic liabilities. However, what is shared remains elusive. In this study, we conducted whole transcriptome analysis of post-mortem brain tissues (cingulate cortex) from SCZ, BPD and control subjects, and identified differentially expressed genes in these disorders. We found 105 and 153 genes differentially expressed in SCZ and BPD, respectively. By comparing the t-test scores, we found that many of the genes differentially expressed in SCZ and BPD are concordant in their expression level (q⩽0.01, 53 genes; q⩽0.05, 213 genes; q⩽0.1, 885 genes). Using genome-wide association data from the Psychiatric Genomics Consortium, we found that these differentially and concordantly expressed genes were enriched in association signals for both SCZ (P<10⁻⁷) and BPD (P=0.029). To our knowledge, this is the first time that a substantially large number of genes show concordant expression and association for both SCZ and BPD. Pathway analyses of these genes indicated that they are involved in the lysosome, Fc gamma receptor-mediated phagocytosis, regulation of actin cytoskeleton pathways, along with several cancer pathways. Functional analyses of these genes revealed an interconnected pathway network centered on lysosomal function and the regulation of actin cytoskeleton. These pathways and their interacting network were principally confirmed by an independent transcriptome sequencing data set of the hippocampus. Dysregulation of lysosomal function and cytoskeleton remodeling has direct impacts on endocytosis, phagocytosis, exocytosis, vesicle trafficking, neuronal maturation and migration, neurite outgrowth and synaptic density and plasticity, and different aspects of these processes have been implicated in SCZ and BPD.

  19. Amino acid sequences of heterotrophic and photosynthetic ferredoxins from the tomato plant (Lycopersicon esculentum Mill.).

    PubMed

    Kamide, K; Sakai, H; Aoki, K; Sanada, Y; Wada, K; Green, L S; Yee, B C; Buchanan, B B

    1995-11-01

    Several forms (isoproteins) of ferredoxin in roots, leaves, and green and red pericarps in tomato plants (Lycopersicon esculentum Mill.) were earlier identified on the basis of N-terminal amino acid sequence and chromatographic behavior (Green et al. 1991). In the present study, a large scale preparation made possible determination of the full length amino acid sequence of the two ferredoxins from leaves. The ferredoxins characteristic of fruit and root were sequenced from the amino terminus to the 30th residue or beyond. The leaf ferredoxins were confirmed to be expressed in pericarp of both green and red fruit. The ferredoxins characteristic of fruit and root appeared to be restricted to those tissues. The results extend earlier findings in demonstrating that ferredoxin occurs in the major organs of the tomato plant where it appears to function irrespective of photosynthetic competence.

  20. Comparative Analyses of the Lipooligosaccharides from Nontypeable Haemophilus influenzae and Haemophilus haemolyticus Show Differences in Sialic Acid and Phosphorylcholine Modifications

    PubMed Central

    Post, Deborah M. B.; Ketterer, Margaret R.; Coffin, Jeremy E.; Reinders, Lorri M.; Munson, Robert S.; Bair, Thomas; Murphy, Timothy F.; Foster, Eric D.; Gibson, Bradford W.

    2016-01-01

    Haemophilus haemolyticus and nontypeable Haemophilus influenzae (NTHi) are closely related upper airway commensal bacteria that are difficult to distinguish phenotypically. NTHi causes upper and lower airway tract infections in individuals with compromised airways, while H. haemolyticus rarely causes such infections. The lipooligosaccharide (LOS) is an outer membrane component of both species and plays a role in NTHi pathogenesis. In this study, comparative analyses of the LOS structures and corresponding biosynthesis genes were performed. Mass spectrometric and immunochemical analyses showed that NTHi LOS contained terminal sialic acid more frequently and to a higher extent than H. haemolyticus LOS did. Genomic analyses of 10 strains demonstrated that H. haemolyticus lacked the sialyltransferase genes lic3A and lic3B (9/10) and siaA (10/10), but all strains contained the sialic acid uptake genes siaP and siaT (10/10). However, isothermal titration calorimetry analyses of SiaP from two H. haemolyticus strains showed a 3.4- to 7.3-fold lower affinity for sialic acid compared to that of NTHi SiaP. Additionally, mass spectrometric and immunochemical analyses showed that the LOS from H. haemolyticus contained phosphorylcholine (ChoP) less frequently than the LOS from NTHi strains. These differences observed in the levels of sialic acid and ChoP incorporation in the LOS structures from H. haemolyticus and NTHi may explain some of the differences in their propensities to cause disease. PMID:26729761

  1. Comparative Analyses of the Lipooligosaccharides from Nontypeable Haemophilus influenzae and Haemophilus haemolyticus Show Differences in Sialic Acid and Phosphorylcholine Modifications.

    PubMed

    Post, Deborah M B; Ketterer, Margaret R; Coffin, Jeremy E; Reinders, Lorri M; Munson, Robert S; Bair, Thomas; Murphy, Timothy F; Foster, Eric D; Gibson, Bradford W; Apicella, Michael A

    2016-01-04

    Haemophilus haemolyticus and nontypeable Haemophilus influenzae (NTHi) are closely related upper airway commensal bacteria that are difficult to distinguish phenotypically. NTHi causes upper and lower airway tract infections in individuals with compromised airways, while H. haemolyticus rarely causes such infections. The lipooligosaccharide (LOS) is an outer membrane component of both species and plays a role in NTHi pathogenesis. In this study, comparative analyses of the LOS structures and corresponding biosynthesis genes were performed. Mass spectrometric and immunochemical analyses showed that NTHi LOS contained terminal sialic acid more frequently and to a higher extent than H. haemolyticus LOS did. Genomic analyses of 10 strains demonstrated that H. haemolyticus lacked the sialyltransferase genes lic3A and lic3B (9/10) and siaA (10/10), but all strains contained the sialic acid uptake genes siaP and siaT (10/10). However, isothermal titration calorimetry analyses of SiaP from two H. haemolyticus strains showed a 3.4- to 7.3-fold lower affinity for sialic acid compared to that of NTHi SiaP. Additionally, mass spectrometric and immunochemical analyses showed that the LOS from H. haemolyticus contained phosphorylcholine (ChoP) less frequently than the LOS from NTHi strains. These differences observed in the levels of sialic acid and ChoP incorporation in the LOS structures from H. haemolyticus and NTHi may explain some of the differences in their propensities to cause disease.

  2. Nucleotide sequence and the encoded amino acids of human apolipoprotein A-I mRNA.

    PubMed Central

    Law, S W; Brewer, H B

    1984-01-01

    The cDNA clones encoding the precursor form of human liver apolipoprotein A-I (apoA-I), preproapoA-I, have been isolated from a cDNA library. A 17-base synthetic oligonucleotide based on residues 108-113 of apoA-I and a 26-base primer-extended, dideoxynucleotide-terminated cDNA were used as hybridization probes to select for recombinant plasmids bearing the apoA-I sequence. The complete nucleic acid sequence of human liver preproapoA-I has been determined by analysis of the cloned cDNA. The sequence is composed of 801 nucleotides encoding 267 amino acid residues. PreproapoA-I contains an 18-amino-acid prepeptide and a 6-amino-acid propeptide connected to the amino terminus of the 243-amino acid mature apoA-I. Southern blotting analysis of chromosomal DNA obtained from peripheral blood indicated the apoA-I gene is contained in a 2.1-kilobase-pair Pst I fragment and there is no gross difference in structural organization between the normal apoA-I gene and the Tangier disease apoA-I gene. PMID:6198645

  3. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids.

    PubMed

    Das, Jayanta Kumar; Das, Provas; Ray, Korak Kumar; Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as 'FPKATD' and 'Y/FTNEKL' without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids' pattern in different proteins.

  4. Gene Mutation Profiles in Primary Diffuse Large B Cell Lymphoma of Central Nervous System: Next Generation Sequencing Analyses.

    PubMed

    Todorovic Balint, Milena; Jelicic, Jelena; Mihaljevic, Biljana; Kostic, Jelena; Stanic, Bojana; Balint, Bela; Pejanovic, Nadja; Lucic, Bojana; Tosic, Natasa; Marjanovic, Irena; Stojiljkovic, Maja; Karan-Djurasevic, Teodora; Perisic, Ognjen; Rakocevic, Goran; Popovic, Milos; Raicevic, Sava; Bila, Jelena; Antic, Darko; Andjelic, Bosko; Pavlovic, Sonja

    2016-05-06

    The existence of a potential primary central nervous system lymphoma-specific genomic signature that differs from the systemic form of diffuse large B cell lymphoma (DLBCL) has been suggested, but is still controversial. We investigated 19 patients with primary DLBCL of central nervous system (DLBCL CNS) using the TruSeq Amplicon Cancer Panel (TSACP) for 48 cancer-related genes. Next generation sequencing (NGS) analyses have revealed that over 80% of potentially protein-changing mutations were located in eight genes (CTNNB1, PIK3CA, PTEN, ATM, KRAS, PTPN11, TP53 and JAK3), pointing to the potential role of these genes in lymphomagenesis. TP53 was the only gene harboring mutations in all 19 patients. In addition, the presence of mutated TP53 and ATM genes correlated with a higher total number of mutations in other analyzed genes. Furthermore, the presence of mutated ATM correlated with poorer event-free survival (EFS) (p = 0.036). The presence of the mutated SMO gene correlated with earlier disease relapse (p = 0.023), inferior event-free survival (p = 0.011) and overall survival (OS) (p = 0.017), while mutations in the PTEN gene were associated with inferior OS (p = 0.048). Our findings suggest that the TP53 and ATM genes could be involved in the molecular pathophysiology of primary DLBCL CNS, whereas mutations in the PTEN and SMO genes could affect survival regardless of the initial treatment approach.

  5. Gene Mutation Profiles in Primary Diffuse Large B Cell Lymphoma of Central Nervous System: Next Generation Sequencing Analyses

    PubMed Central

    Todorovic Balint, Milena; Jelicic, Jelena; Mihaljevic, Biljana; Kostic, Jelena; Stanic, Bojana; Balint, Bela; Pejanovic, Nadja; Lucic, Bojana; Tosic, Natasa; Marjanovic, Irena; Stojiljkovic, Maja; Karan-Djurasevic, Teodora; Perisic, Ognjen; Rakocevic, Goran; Popovic, Milos; Raicevic, Sava; Bila, Jelena; Antic, Darko; Andjelic, Bosko; Pavlovic, Sonja

    2016-01-01

    The existence of a potential primary central nervous system lymphoma-specific genomic signature that differs from the systemic form of diffuse large B cell lymphoma (DLBCL) has been suggested, but is still controversial. We investigated 19 patients with primary DLBCL of central nervous system (DLBCL CNS) using the TruSeq Amplicon Cancer Panel (TSACP) for 48 cancer-related genes. Next generation sequencing (NGS) analyses have revealed that over 80% of potentially protein-changing mutations were located in eight genes (CTNNB1, PIK3CA, PTEN, ATM, KRAS, PTPN11, TP53 and JAK3), pointing to the potential role of these genes in lymphomagenesis. TP53 was the only gene harboring mutations in all 19 patients. In addition, the presence of mutated TP53 and ATM genes correlated with a higher total number of mutations in other analyzed genes. Furthermore, the presence of mutated ATM correlated with poorer event-free survival (EFS) (p = 0.036). The presence of the mutated SMO gene correlated with earlier disease relapse (p = 0.023), inferior event-free survival (p = 0.011) and overall survival (OS) (p = 0.017), while mutations in the PTEN gene were associated with inferior OS (p = 0.048). Our findings suggest that the TP53 and ATM genes could be involved in the molecular pathophysiology of primary DLBCL CNS, whereas mutations in the PTEN and SMO genes could affect survival regardless of the initial treatment approach. PMID:27164089

  6. Complete genome sequence and transcriptomics analyses reveal pigment biosynthesis and regulatory mechanisms in an industrial strain, Monascus purpureus YY-1.

    PubMed

    Yang, Yue; Liu, Bin; Du, Xinjun; Li, Ping; Liang, Bin; Cheng, Xiaozhen; Du, Liangcheng; Huang, Di; Wang, Lei; Wang, Shuo

    2015-02-09

    Monascus has been used to produce natural colorants and food supplements for more than one thousand years, and more than one billion people eat Monascus-fermented products during their daily life. In this study, using next-generation sequencing and optical mapping approaches, a 24.1-Mb complete genome of an industrial strain, Monascus purpureus YY-1, was obtained. This genome consists of eight chromosomes and 7,491 genes. Phylogenetic analysis at the genome level provides convincing evidence for the evolutionary position of M. purpureus. We provide the first comprehensive prediction of the biosynthetic pathway for Monascus pigment. Comparative genomic analyses show that the genome of M. purpureus is 13.6-40% smaller than those of closely related filamentous fungi and has undergone significant gene losses, most of which likely occurred during its specialized adaptation to starch-based foods. Comparative transcriptome analysis reveals that carbon starvation stress, resulting from the use of relatively low-quality carbon sources, contributes to the high yield of pigments by repressing central carbon metabolism and augmenting the acetyl-CoA pool. Our work provides important insights into the evolution of this economically important fungus and lays a foundation for future genetic manipulation and engineering of this strain.

  7. Phylogenetic analyses of nucleotide sequences confirm a unique plant intercontinental disjunction between tropical Africa, the Caribbean, and the Hawaiian Islands.

    PubMed

    Namoff, Sandra; Luke, Quentin; Jiménez, Francisco; Veloz, Alberto; Lewis, Carl E; Sosa, Victoria; Maunder, Mike; Francisco-Ortega, Javier

    2010-01-01

    Phylogenetic analyses of nucleotide sequences of the internal transcribed spacers and 5.8S regions of the nuclear ribosomal DNA and of the trnH-psbA spacer of the chloroplast genome confirm that the three taxa of the Jacquemontia ovalifolia (Choisy) Hallier f. complex (Convolvulaceae) form a monophyletic group. Levels of nucleotide divergence and morphological differentiation among these taxa support the view that each should be recognized as a distinct species. These three species display unique intercontinental disjunction, with one species endemic to Hawaii (Jacquemontia sandwicensis A. Gray.), another restricted to eastern Mexico and the Antilles [Jacquemontia obcordata (Millspaugh) House], and the third confined to East and West Africa (J. ovalifolia). The Caribbean and Hawaiian species are sister taxa and are another example of a biogeographical link between the Caribbean Basin and Polynesia. We provide a brief conservation review of the three taxa based on our collective field work and investigations; it is apparent that J. obcordata is highly threatened and declining in the Caribbean.

  8. Rat androgen-binding protein: evidence for identical subunits and amino acid sequence homology with human sex hormone-binding globulin.

    PubMed

    Joseph, D R; Hall, S H; French, F S

    1987-01-01

    The cDNA for rat androgen-binding protein (ABP) was previously isolated from a bacteriophage lambda gt11 rat testis cDNA library and its identity was confirmed by epitope selection. Hybrid-arrested translation studies have now demonstrated the identity of the isolates. The nucleotide sequence of a near full-length cDNA encodes a 403-amino acid precursor (Mr = 44,539), which agrees in size with the cell-free translation product (Mr = 45,000) of ABP mRNA. Putative sites of N-glycosylation and signal peptide cleavage were identified. Comparison of the predicted amino acid sequence of rat ABP with the amino-terminal amino acid sequence of human sex hormone-binding globulin revealed that 17 of 25 residues are identical. On the basis of the predicted amino acid sequence the molecular weight of the primary translation product, lacking the signal peptide, was 41,183. Hybridization analyses indicated that the two subunits of ABP are coded for by a single gene and a single mRNA species. Our results suggest that ABP consists of two subunits with identical primary sequences and that differences in post-translational processing result in the production of 47,000 and 41,000 molecular weight monomers.

  9. Amino acid analyses of type 3 chondrites Colony, Ornans, Chainpur, and Bishunpur

    NASA Astrophysics Data System (ADS)

    Chan, H.-S.; Martins, Zita; Sephton, Mark A.

    2012-09-01

    The CO3s Colony and Ornans and LL3s Chainpur and Bishunpur were analyzed for the first time for amino acids using gas chromatography-mass spectrometry (GC-MS). Type 3 chondrites have relatively unaltered metamorphic and petrological histories. Chainpur was the most amino acid rich of the four type 3 chondrites with a total amino acid abundance of 3330 parts per billion (ppb). The other type 3 chondrites had total amino acid abundances that ranged from 660 to 1110 ppb. A D/L ratio of <0.7 for all proteic amino acids suggests at least some amino acid terrestrial contamination. However, a small fraction of indigenous extraterrestrial amino acids cannot be excluded because of the presence of the nonprotein amino acid α-aminoisobutyric acid (α-AIB), and unusually high relative abundances (to glycine) of β-alanine and γ-ABA. The comparisons between the free and total amino acid contents of the samples also indicate a low free/total amino acid ratio (ranging from about 1:4 in CO chondrites to about 1:50 in Chainpur), which indicate that amino acids are present mainly in the bound form and were made detectable after acid hydrolysis.

  10. The human erythrocyte anion-transport protein. Partial amino acid sequence, conformation and a possible molecular mechanism for anion exchange.

    PubMed Central

    Brock, C J; Tanner, M J; Kempf, C

    1983-01-01

    The N-terminal 72 residues of an integral membrane fragment, P5, of the human erythrocyte anion-transport protein, which is known to be directly involved in the anion-exchange process, were shown to have the following amino acid sequence: Met-Val-Pro-Lys-Pro-Gln-Gly-Pro-Leu-Pro-Asn-Thr-Ala-Leu-Leu-Ser-Leu-Val-Leu-Met-Ala-Gly-Thr-Phe-Phe-Phe-Ala-Met-Met-Leu-Arg-Lys-Phe-Lys-Asn-Ser-Ser-Tyr-Phe-Pro-Gly-Lys-Leu-Arg-Arg-Val-Ile-Gly-Asp-Phe-Gly-Val-Pro-Ile-Ser-Ile-Leu-Ile-Met-Val-Leu-Val-Asp-Phe-Phe-Ile-Gln-Asp-Thr-Tyr-Thr-Gln- The structure of this fragment was analysed, with account being taken of the constraints that apply to the folding of integral membrane proteins and the topographical locations of various sites in the sequence. It was concluded that this sequence forms two transmembrane alpha-helices. These are probably part of a cluster of amphipathic transmembrane alpha-helices, which could comprise that part of the protein responsible for transport activity. The presently available evidence relating to the anion-exchange process was considered with the structural features noted in this study and a possible molecular mechanism is proposed. In this model the rearrangement of a network of intramembranous charged pairs mediates the translocation of an anion between anion-binding regions at each surface of the membrane, which are composed of clusters of positively charged amino acids. This model imposes a sequential exchange mechanism on the system. Supplementary material, including Tables and Figures describing the compositions of peptides determined by amino acid analysis and sequence studies, quantitative and qualitative data that provide a residue-by-residue justification for the sequence assignment and a description of modifications to and use of the solid-phase sequencer has been deposited as Supplementary Publication SUP 50123 (12 pages) with the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be

  11. Molecular cloning, nucleotide sequence, and abscisic acid induction of a suberization-associated highly anionic peroxidase.

    PubMed

    Roberts, E; Kolattukudy, P E

    1989-06-01

    A highly anionic peroxidase induced in suberizing cells was suggested to be the key enzyme involved in polymerization of phenolic monomers to generate the aromatic matrix of suberin. The enzyme encoded by a potato cDNA was found to be highly homologous to the anionic peroxidase induced in suberizing tomato fruit. A tomato genomic library was screened using the potato anionic peroxidase cDNA and one genomic clone was isolated that contained two tandemly oriented anionic peroxidase genes. These genes were sequenced and were 96% and 87% identical to the mRNA for potato anionic peroxidase. Both genes consist of three exons with the relative positions of their two introns being conserved between the two genes. Primer extension analysis showed that only one of the genes is expressed in the periderm of 3 day wound-healed tomato fruits. Southern blot analyses suggested that there are two copies each of the two highly homologous genes per haploid genome in both potato and tomato. Abscisic acid (ABA) induced the accumulation of the anionic peroxidase transcripts in potato and tomato callus tissues. Northern blots showed that peroxidase mRNA was detectable at 2 days and was maximal at 8 days after transfer of potato callus to solid agar media containing 10⁻⁴ M ABA. The transcripts induced by ABA in both potato and tomato callus were identical in size to those induced in wound-healing potato tuber and tomato fruit. The anionic peroxidase peptide was detected in extracts of potato callus grown on the ABA-containing media by western blot analysis. The results support the suggestion that stimulation of suberization by ABA involves the induction of the highly anionic peroxidase.

  12. Complete genome sequence of the probiotic lactic acid bacterium Lactobacillus acidophilus NCFM

    PubMed Central

    Altermann, Eric; Russell, W. Michael; Azcarate-Peril, M. Andrea; Barrangou, Rodolphe; Buck, B. Logan; McAuliffe, Olivia; Souther, Nicole; Dobson, Alleson; Duong, Tri; Callanan, Michael; Lick, Sonja; Hamrick, Alice; Cano, Raul; Klaenhammer, Todd R.

    2005-01-01

    Lactobacillus acidophilus NCFM is a probiotic bacterium that has been produced commercially since 1972. The complete genome is 1,993,564 nt and devoid of plasmids. The average GC content is 34.71% with 1,864 predicted ORFs, of which 72.5% were functionally classified. Nine phage-related integrases were predicted, but no complete prophages were found. However, three unique regions designated as potential autonomous units (PAUs) were identified. These units resemble a unique structure and bear characteristics of both plasmids and phages. Analysis of the three PAUs revealed the presence of two R/M systems and a prophage maintenance system killer protein. A spacers interspersed direct repeat locus containing 32 nearly perfect 29-bp repeats was discovered and may provide a unique molecular signature for this organism. In silico analyses predicted 17 transposase genes and a chromosomal locus for lactacin B, a class II bacteriocin. Several mucus- and fibronectin-binding proteins, implicated in adhesion to human intestinal cells, were also identified. Gene clusters for transport of a diverse group of carbohydrates, including fructooligosaccharides and raffinose, were present and often accompanied by transcriptional regulators of the lacI family. For protein degradation and peptide utilization, the organism encoded 20 putative peptidases, homologs for PrtP and PrtM, and two complete oligopeptide transport systems. Nine two-component regulatory systems were predicted, some associated with determinants implicated in bacteriocin production and acid tolerance. Collectively, these features within the genome sequence of L. acidophilus are likely to contribute to the organism's gastric survival and promote interactions with the intestinal mucosa and microbiota. PMID:15671160

  13. From Amino Acid to Glucosinolate Biosynthesis: Protein Sequence Changes in the Evolution of Methylthioalkylmalate Synthase in Arabidopsis

    PubMed Central

    de Kraker, Jan-Willem; Gershenzon, Jonathan

    2011-01-01

    Methylthioalkylmalate synthase (MAM) catalyzes the committed step in the side chain elongation of Met, yielding important precursors for glucosinolate biosynthesis in Arabidopsis thaliana and other Brassicaceae species. MAM is believed to have evolved from isopropylmalate synthase (IPMS), an enzyme involved in Leu biosynthesis, based on phylogenetic analyses and an overlap of catalytic abilities. Here, we investigated the changes in protein structure that have occurred during the recruitment of IPMS from amino acid to glucosinolate metabolism. The major sequence difference between IPMS and MAM is the absence of 120 amino acids at the C-terminal end of MAM that constitute a regulatory domain for Leu-mediated feedback inhibition. Truncation of this domain in Arabidopsis IPMS2 results in loss of Leu feedback inhibition and quaternary structure, two features common to MAM enzymes, plus an 8.4-fold increase in the kcat/Km for a MAM substrate. Additional exchange of two amino acids in the active site resulted in a MAM-like enzyme that had little residual IPMS activity. Hence, combination of the loss of the regulatory domain and a few additional amino acid exchanges can explain the evolution of MAM from IPMS during its recruitment from primary to secondary metabolism. PMID:21205930

  14. Effect of alpha-lipoic acid on relieving ammonia stress and hepatic proteomic analyses of broilers.

    PubMed

    Lu, M; Bai, J; Xu, B; Sun, Q Y; Wei, F X; Tang, X F; Zhang, H F; Li, J; Wang, G L; Yin, Q Q; Li, S Y

    2017-01-01

    Ammonia in poultry houses not only affects worker health but also induces a variety of poultry diseases. Alpha-lipoic acid (LA) is an effective antioxidant that protects cells against oxidative injury during various toxic and pathological processes. This study was designed to evaluate the mitigating effects of LA supplementation on ammonia stress and hepatic proteome changes in broilers. Male broilers (22 d old) were allocated to 3 groups: (1) a control group without ammonia stress (CTRL); (2) exposure to 70 ppm ammonia (AM); and (3) exposure to 70 ppm ammonia and dietary administration of 300 mg/kg LA (AM+LA). Ammonia exposure significantly decreased broiler growth performance and plasma glutathione peroxidase activity (P < 0.05), and increased plasma malondialdehyde content and glutamic-pyruvic transaminase activity (P < 0.05). These negative effects were eliminated by LA supplementation. Comparative proteomic analyses revealed 291 differentially expressed proteins in the AM group compared to the CTRL and AM+LA groups. A total of 30 proteins were differentially expressed between the AM/CTRL and (AM+LA)/AM groups. The addition of LA restored 24 of these proteins to control levels; these proteins were mainly related to transcription regulation, detoxification, protein translation and degradation, and immune and stress responses. The differentially expressed proteins included the high mobility group box (HMGB) and glutathione S-transferase (GST), which are closely related to immune response and oxidative stress, and collagens, which are implicated in liver injury. The addition of LA to broiler diet may reduce ammonia toxicity by maintaining the antioxidant system, xenobiotic metabolism, and metabolic pathways.

  15. Metabolic analyses elucidate non-trivial gene targets for amplifying dihydroartemisinic acid production in yeast

    PubMed Central

    Misra, Ashish; Conway, Matthew F.; Johnnie, Joseph; Qureshi, Tabish M.; Lige, Bao; Derrick, Anne M.; Agbo, Eddy C.; Sriram, Ganesh

    2013-01-01

    Synthetic biology enables metabolic engineering of industrial microbes to synthesize value-added molecules. In this, a major challenge is the efficient redirection of carbon to the desired metabolic pathways. Pinpointing strategies toward this goal requires an in-depth investigation of the metabolic landscape of the organism, particularly primary metabolism, to identify precursor and cofactor availability for the target compound. The potent antimalarial therapeutic artemisinin and its precursors are promising candidate molecules for production in microbial hosts. Recent advances have demonstrated the production of artemisinin precursors in engineered yeast strains as an alternative to extraction from plants. We report the application of in silico and in vivo metabolic pathway analyses to identify metabolic engineering targets to improve the yield of the direct artemisinin precursor dihydroartemisinic acid (DHA) in yeast. First, in silico extreme pathway (ExPa) analysis identified NADPH-malic enzyme and the oxidative pentose phosphate pathway (PPP) as mechanisms to meet NADPH demand for DHA synthesis. Next, we compared key DHA-synthesizing ExPas to the metabolic flux distributions obtained from in vivo 13C metabolic flux analysis of a DHA-synthesizing strain. This comparison revealed that knocking out ethanol synthesis and overexpressing glucose-6-phosphate dehydrogenase in the oxidative PPP (gene YNL241C) or the NADPH-malic enzyme ME2 (YKL029C) are vital steps toward overproducing DHA. Finally, we employed in silico flux balance analysis and minimization of metabolic adjustment on a yeast genome-scale model to identify gene knockouts for improving DHA yields. The best strategy involved knockout of an oxaloacetate transporter (YKL120W) and an aspartate aminotransferase (YKL106W), and was predicted to improve DHA yields by 70-fold. Collectively, our work elucidates multiple non-trivial metabolic engineering strategies for improving DHA yield in yeast. PMID:23898325

  16. Software scripts for quality checking of high-throughput nucleic acid sequencers.

    PubMed

    Lazo, G R; Tong, J; Miller, R; Hsia, C; Rausch, C; Kang, Y; Anderson, O D

    2001-06-01

    We have developed a graphical interface to allow the researcher to view and assess the quality of sequencing results using a series of program scripts developed to process data generated by automated sequencers. The scripts are written in Perl programming language and are executable under the cgibin directory of a Web server environment. The scripts direct nucleic acid sequencing trace file data output from automated sequencers to be analyzed by the phred molecular biology program and are displayed as graphical hypertext mark-up language (HTML) pages. The scripts are mainly designed to handle 96-well microtiter dish samples, but the scripts are also able to read data from 384-well microtiter dishes 96 samples at a time. The scripts may be customized for different laboratory environments and computer configurations. Web links to the sources and discussion page are provided.
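
    The phred program referred to above reports a quality score Q for each base call, conventionally defined as Q = -10 log10(P), where P is the estimated probability that the call is wrong. The Python sketch below illustrates only that conversion; the function names are illustrative and are not taken from the authors' Perl scripts.

```python
import math

def phred_to_error_prob(q: float) -> float:
    """Convert a phred quality score into an estimated error probability."""
    return 10 ** (-q / 10)

def error_prob_to_phred(p: float) -> float:
    """Convert an error probability back into a phred quality score."""
    return -10 * math.log10(p)

def mean_error_rate(qualities):
    """Average the error probabilities (not the raw scores) across a read."""
    probs = [phred_to_error_prob(q) for q in qualities]
    return sum(probs) / len(probs)
```

    Averaging probabilities rather than scores matters because the scale is logarithmic: a read with scores 10 and 20 has a mean error rate of 0.055, not the 0.032 that a mean score of 15 would suggest.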

  17. Leaf waxes, compound-specific D/H and 14C analyses in the Loess Paleosol Sequence Möhlin, Switzerland

    NASA Astrophysics Data System (ADS)

    Wüthrich, Lorenz; Bliedtner, Marcel; Kathrin Schäfer, Imke; Zech, Jana; Gaar, Dorian; Preusser, Frank; Zech, Roland

    2016-04-01

    Leaf waxes, such as long-chain n-alkanes and n-alkanoic acids, and their D/H isotopic composition, are increasingly used for paleoenvironmental and -climate reconstructions. Recent technological innovations now also allow radiocarbon analyses to be performed on leaf waxes. For this study, we analyzed leaf waxes and their δD and 14C composition in the 7 m Loess Paleosol Sequence Möhlin, Switzerland. The chain length patterns in the upper part of the profile indicate n-alkane contribution from deciduous trees, while the underlying loess is dominated by inputs from grasses and herbs. Our δD record does not show depleted, glacial values compared to the Holocene, as we had expected in analogy to the Greenland ice core records. Values are most enriched at 1 m depth, i.e. well below the topsoil. Further research is needed to disentangle source effects and evapotranspirative enrichment, before the δD record can be interpreted robustly. Our radiocarbon ages for the leaf waxes are in very good agreement with independent age control based on luminescence ages, corroborating that massive loess accumulation occurred already at 35 ka. Only the uppermost 3 m were deposited during the last glacial maximum.

  18. Complete Bordetella avium, Bordetella hinzii and Bordetella trematum lipid A structures and genomic sequence analyses of the loci involved in their modifications.

    PubMed

    Novikov, Alexey; Shah, Nita R; AlBitar-Nehme, Sami; Basheer, Soorej M; Trento, Ilaria; Tirsoaga, Alina; Moksa, Michelle; Hirst, Martin; Perry, Malcolm B; Hamidi, Asmaa El; Fernandez, Rachel C; Caroff, Martine

    2014-08-01

    Endotoxin is recognized as one of the virulence factors of the Bordetella avium bird pathogen, and characterization of its structure and corresponding genomic features are important for an understanding of its role in pathogenicity and for an improved general knowledge of Bordetella spp virulence factors. The structure of the biologically active part of B. avium LPS, lipid A, is described and compared to those of another bird pathogen, opportunistic in humans, Bordetella hinzii, and to that of Bordetella trematum, a human pathogen. Sequence analyses showed that the three strains have homologues of acyl-chain modifying enzymes PagL, PagP and LpxO, of the 1-phosphatase LpxE, in addition to LgmA, LgmB and LgmC, which are required for the glucosamine modification. MALDI mass spectrometry identified a high amount of glucosamine substituting the phosphate groups of B. avium lipid A; this modification was absent from B. hinzii and B. trematum. The acylation patterns of the three lipid As were similar, but they differed from those of Bordetella pertussis and Bordetella parapertussis. They were also found to be close to the lipid A structure of Bordetella bronchiseptica, a mammalian pathogen, only differing from the latter by the degree of hydroxylation of the branched fatty acid.

  19. Genome Sequence of Azospirillum brasilense CBG497 and Comparative Analyses of Azospirillum Core and Accessory Genomes provide Insight into Niche Adaptation

    PubMed Central

    Wisniewski-Dyé, Florence; Lozano, Luis; Acosta-Cruz, Erika; Borland, Stéphanie; Drogue, Benoît; Prigent-Combaret, Claire; Rouy, Zoé; Barbe, Valérie; Mendoza Herrera, Alberto; González, Victor; Mavingui, Patrick

    2012-01-01

    Bacteria of the genus Azospirillum colonize roots of important cereals and grasses, and promote plant growth by several mechanisms, notably phytohormone synthesis. The genomes of several Azospirillum strains belonging to different species, isolated from various host plants and locations, were recently sequenced and published. In this study, an additional genome of an A. brasilense strain, isolated from maize grown on an alkaline soil in the northeast of Mexico, strain CBG497, was obtained. Comparative genomic analyses were performed on this new genome and three other genomes (A. brasilense Sp245, A. lipoferum 4B and Azospirillum sp. B510). The Azospirillum core genome was established and consists of 2,328 proteins, representing between 30% to 38% of the total encoded proteins within a genome. It is mainly chromosomally-encoded and contains 74% of genes of ancestral origin shared with some aquatic relatives. The non-ancestral part of the core genome is enriched in genes involved in signal transduction, in transport and in metabolism of carbohydrates and amino-acids, and in surface properties features linked to adaptation in fluctuating environments, such as soil and rhizosphere. Many genes involved in colonization of plant roots, plant-growth promotion (such as those involved in phytohormone biosynthesis), and properties involved in rhizosphere adaptation (such as catabolism of phenolic compounds, uptake of iron) are restricted to a particular strain and/or species, strongly suggesting niche-specific adaptation. PMID:24705077

  20. Amino acid sequence of band-3 protein from rainbow trout erythrocytes derived from cDNA.

    PubMed Central

    Hübner, S; Michel, F; Rudloff, V; Appelhans, H

    1992-01-01

    In this report we present the first complete band-3 cDNA sequence of a poikilothermic lower vertebrate. The primary structure of the anion-exchange protein band 3 (AE1) from rainbow trout erythrocytes was determined by nucleotide sequencing of cDNA clones. The overlapping clones have a total length of 3827 bp with a 5'-terminal untranslated region of 150 bp, a 2754 bp open reading frame and a 3'-untranslated region of 924 bp. Band-3 protein from trout erythrocytes consists of 918 amino acid residues with a calculated molecular mass of 101 827 Da. Comparison of its amino acid sequence revealed a 60-65% identity within the transmembrane spanning sequence of band-3 proteins published so far. An additional insertion of 24 amino acid residues within the membrane-associated domain of trout band-3 protein was identified, which until now was thought to be a general feature only of mammalian band-3-related proteins. PMID:1637296
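
    The lengths quoted in this abstract can be cross-checked with simple arithmetic. The sketch below uses only figures from the abstract; the variable and function names are illustrative. The 2754 bp open reading frame is exactly 918 codons, matching the 918 residues reported. The three segments sum to 3828 bp, one more than the reported total of 3827 bp; the abstract does not explain the difference.

```python
# Figures copied from the abstract; names are illustrative.
FIVE_PRIME_UTR_BP = 150
ORF_BP = 2754
THREE_PRIME_UTR_BP = 924
REPORTED_TOTAL_BP = 3827
REPORTED_RESIDUES = 918
REPORTED_MASS_DA = 101_827

def codon_count(orf_bp: int) -> int:
    """Number of codons in an open reading frame; length must be a multiple of 3."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return orf_bp // 3

segment_sum_bp = FIVE_PRIME_UTR_BP + ORF_BP + THREE_PRIME_UTR_BP
codons = codon_count(ORF_BP)
mean_residue_mass_da = REPORTED_MASS_DA / REPORTED_RESIDUES
```

    The mean residue mass works out to roughly 111 Da, which is in the range usually expected for proteins.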

  1. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Patel, Kamlesh D. (Ken) [SNL]

    2016-07-12

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology" at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  2. Development of SI-traceable C-peptide certified reference material NMIJ CRM 6901-a using isotope-dilution mass spectrometry-based amino acid analyses.

    PubMed

    Kinumi, Tomoya; Goto, Mari; Eyama, Sakae; Kato, Megumi; Kasama, Takeshi; Takatsu, Akiko

    2012-07-01

    A certified reference material (CRM) is a higher-order calibration material used to enable a traceable analysis. This paper describes the development of a C-peptide CRM (NMIJ CRM 6901-a) by the National Metrology Institute of Japan using two independent methods for amino acid analysis based on isotope-dilution mass spectrometry. C-peptide is a 31-mer peptide that is utilized for the evaluation of β-cell function in the pancreas in clinical testing. This CRM is a lyophilized synthetic peptide having the human C-peptide sequence, and contains deamidated and pyroglutamylated forms of C-peptide. Adding (1.00 ± 0.01) g of water to the vial containing the CRM reconstitutes the C-peptide solution in 10 mM phosphate-buffered saline (pH 6.6). We assigned two certified values that represent the concentrations of total C-peptide (mixture of C-peptide, deamidated C-peptide, and pyroglutamylated C-peptide) and C-peptide. The certified concentration of total C-peptide was determined by two amino acid analyses using pre-column derivatization liquid chromatography-mass spectrometry and hydrophilic chromatography-mass spectrometry following acid hydrolysis. The certified concentration of C-peptide was determined by multiplying the concentration of total C-peptide by the ratio of the relative area of C-peptide to that of the total C-peptide measured by liquid chromatography. The certified value of C-peptide, (80.7 ± 5.0) mg/L, represents the concentration of the specific entity of C-peptide; on the other hand, the certified value of total C-peptide, (81.7 ± 5.1) mg/L, can be used for analyses that do not differentiate deamidated and pyroglutamylated C-peptide from C-peptide itself, such as amino acid analyses and immunochemical assays.
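
    The relationship between the two certified values can be written out as a small calculation. The abstract does not state the chromatographic area ratio, so the sketch below back-calculates it from the two certified concentrations. That ratio is an inference for illustration only, the function names are hypothetical, and uncertainty propagation is ignored.

```python
def c_peptide_concentration(total_mg_per_l: float, area_ratio: float) -> float:
    """C-peptide concentration = total C-peptide concentration x relative-area ratio."""
    return total_mg_per_l * area_ratio

def implied_area_ratio(c_peptide_mg_per_l: float, total_mg_per_l: float) -> float:
    """Back-calculate the area ratio from the two certified values (illustrative only)."""
    return c_peptide_mg_per_l / total_mg_per_l

TOTAL_C_PEPTIDE = 81.7   # mg/L, certified value from the abstract
C_PEPTIDE = 80.7         # mg/L, certified value from the abstract

ratio = implied_area_ratio(C_PEPTIDE, TOTAL_C_PEPTIDE)
recomputed = c_peptide_concentration(TOTAL_C_PEPTIDE, ratio)
```

    On these figures the implied ratio is about 0.988, meaning the deamidated and pyroglutamylated forms together would account for a little over 1% of the total.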

  3. Role of the two-component leader sequence and mature amino acid sequences in extracellular export of endoglucanase EGL from Pseudomonas solanacearum.

    PubMed Central

    Huang, J Z; Schell, M A

    1992-01-01

    The egl gene of Pseudomonas solanacearum encodes a 43-kDa extracellular endoglucanase (mEGL) involved in wilt disease caused by this phytopathogen. Egl is initially translated with a 45-residue, two-part leader sequence. The first 19 residues are apparently removed by signal peptidase II during export of Egl across the inner membrane (IM); the remaining residues of the leader sequence (modified with palmitate) are removed during export across the outer membrane (OM). Localization of Egl-PhoA fusion proteins showed that the first 26 residues of the Egl leader sequence are required and sufficient to direct lipid modification, processing, and export of Egl or PhoA across the IM but not the OM. Fusions of the complete 45-residue leader sequence or of the leader and increasing portions of mEgl sequences to PhoA did not cause its export across the OM. In-frame deletion of portions of mEGL-coding sequences blocked export of the truncated polypeptides across the OM without affecting export across the IM. These results indicate that the first part of the leader sequence functions independently to direct export of Egl across the IM while the second part and sequences and structures in mEGL are involved in export across the OM. Computer analysis of the mEgl amino acid sequence obtained from its nucleotide sequence identified a region of mEGL similar in amino acid sequence to regions in other prokaryotic endoglucanases. PMID:1735723

  4. Studies on adenosine triphosphate transphosphorylases. Amino acid sequence of rabbit muscle ATP-AMP transphosphorylase.

    PubMed

    Kuby, S A; Palmieri, R H; Frischat, A; Fischer, A H; Wu, L H; Maland, L; Manship, M

    1984-05-22

    The total amino acid sequence of rabbit muscle adenylate kinase has been determined, and the single polypeptide chain of 194 amino acid residues starts with N-acetylmethionine and ends with leucyllysine at its carboxyl terminus, in agreement with the earlier data on its amino acid composition [Mahowald, T. A., Noltmann, E. A., & Kuby, S. A. (1962) J. Biol. Chem. 237, 1138-1145] and its carboxyl-terminus sequence [Olson, O. E., & Kuby, S. A. (1964) J. Biol. Chem. 239, 460-467]. Elucidation of the primary structure was based on tryptic and chymotryptic cleavages of the performic acid oxidized protein, cyanogen bromide cleavages of the 14C-labeled S-carboxymethylated protein at its five methionine sites (followed by maleylation of peptide fragments), and tryptic cleavages at its 12 arginine sites of the maleylated 14C-labeled S-carboxymethylated protein. Calf muscle myokinase, whose sequence has also been established, differs primarily from the rabbit muscle myokinase's sequence in the following: His-30 is replaced by Gln-30; Lys-56 is replaced by Met-56; Ala-84 and Asp-85 are replaced by Val-84 and Asn-85. A comparison of the four muscle-type adenylate kinases, whose covalent structures have now been determined, viz., rabbit, calf, porcine, and human [for the latter two sequences see Heil, A., Müller, G., Noda, L., Pinder, T., Schirmer, H., Schirmer, I., & Von Zabern, I. (1974) Eur. J. Biochem. 43, 131-144, and Von Zabern, I., Wittmann-Liebold, B., Untucht-Grau, R., Schirmer, R. H., & Pai, E. F. (1976) Eur. J. Biochem. 68, 281-290], demonstrates an extraordinary degree of homology. (ABSTRACT TRUNCATED AT 250 WORDS)

  5. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids

    PubMed Central

    Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as ‘FPKATD’ and ‘Y/FTNEKL’ without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids’ pattern in different proteins. PMID:27930687
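
    The idea of a block "without repeating the same chemical nature" can be sketched as a sliding-window search over chemical-group labels. The eight-group classification below is an assumption made for illustration and may differ from the grouping the authors used; the function names are likewise hypothetical.

```python
# Assumed chemical grouping of the 20 standard amino acids (illustrative only).
GROUPS = {
    "aliphatic": "GAVLI",
    "aromatic": "FWY",
    "sulfur": "CM",
    "hydroxyl": "ST",
    "acidic": "DE",
    "amide": "NQ",
    "basic": "KRH",
    "imino": "P",
}
AA_TO_GROUP = {aa: group for group, aas in GROUPS.items() for aa in aas}

def to_groups(seq: str) -> list:
    """Translate a protein sequence into its list of chemical-group labels."""
    return [AA_TO_GROUP[aa] for aa in seq.upper()]

def longest_distinct_group_block(seq: str) -> str:
    """Return the longest contiguous block in which no chemical group repeats."""
    groups = to_groups(seq)
    last_seen = {}          # group label -> most recent index
    start = 0               # left edge of the current window
    best = (0, 0)           # half-open interval of the best window so far
    for i, group in enumerate(groups):
        if group in last_seen and last_seen[group] >= start:
            start = last_seen[group] + 1   # shrink past the earlier occurrence
        last_seen[group] = i
        if i - start + 1 > best[1] - best[0]:
            best = (start, i + 1)
    return seq[best[0]:best[1]]
```

    Under this assumed grouping, each residue of the 'FPKATD' block quoted in the abstract falls in a different group, so the whole block is returned unchanged.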

  6. The complete amino acid sequence of a trypsin inhibitor from Bauhinia variegata var. candida seeds.

    PubMed

    Di Ciero, L; Oliva, M L; Torquato, R; Köhler, P; Weder, J K; Camillo Novello, J; Sampaio, C A; Oliveira, B; Marangoni, S

    1998-11-01

    Trypsin inhibitors of two varieties of Bauhinia variegata seeds have been isolated and characterized. Bauhinia variegata candida trypsin inhibitor (BvcTI) and B. variegata lilac trypsin inhibitor (BvlTI) are proteins with Mr of about 20,000 without free sulfhydryl groups. Amino acid analysis shows a high content of aspartic acid, glutamic acid, serine, and glycine, and a low content of histidine, tyrosine, methionine, and lysine in both inhibitors. Isoelectric focusing for both varieties detected three isoforms (pI 4.85, 5.00, and 5.15), which were resolved by HPLC procedure. The trypsin inhibitors show Ki values of 6.9 and 1.2 nM for BvcTI and BvlTI, respectively. The N-terminal sequences of the three trypsin inhibitor isoforms from both varieties of Bauhinia variegata and the complete amino acid sequence of B. variegata var. candida L. trypsin inhibitor isoform 3 (BvcTI-3) are presented. The sequences have been determined by automated Edman degradation of the reduced and carboxymethylated proteins of the peptides resulting from Staphylococcus aureus protease and trypsin digestion. BvcTI-3 is composed of 167 residues and has a calculated molecular mass of 18,529. Homology studies with other trypsin inhibitors show that BvcTI-3 belongs to the Kunitz family. The putative active site encompasses Arg (63)-Ile (64).

  7. Multiple site-selective insertions of non-canonical amino acids into sequence-repetitive polypeptides

    PubMed Central

    Wu, I-Lin; Patterson, Melissa A.; Carpenter Desai, Holly E.; Mehl, Ryan A.; Giorgi, Gianluca

    2013-01-01

    A simple and efficient method is described for introduction of non-canonical amino acids at multiple, structurally defined sites within recombinant polypeptide sequences. E. coli MRA30, a bacterial host strain with attenuated activity for release factor 1 (RF1), is assessed for its ability to support the incorporation of a diverse range of non-canonical amino acids in response to multiple encoded amber (TAG) codons within genetic templates derived from superfolder GFP and an elastin-mimetic protein polymer. Suppression efficiency and isolated protein yield were observed to depend on the identity of the orthogonal aminoacyl-tRNA synthetase/tRNACUA pair and the non-canonical amino acid substrate. This approach afforded elastin-mimetic protein polymers containing non-canonical amino acid derivatives at up to twenty-two positions within the repeat sequence with high levels of substitution. The identity and position of the variant residues was confirmed by mass spectrometric analysis of the full-length polypeptides and proteolytic cleavage fragments resulting from thermolysin digestion. The accumulated data suggest that this multi-site suppression approach permits the preparation of protein-based materials in which novel chemical functionality can be introduced at precisely defined positions within the polypeptide sequence. PMID:23625817

  8. Deduced amino acid sequence of human pulmonary surfactant proteolipid: SPL(pVal)

    SciTech Connect

    Whitsett, J.A.; Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.; Clark, J.; Pilot-Matias, T.; Meuth, J.; Fox, J.L.

    1987-05-01

    A hydrophobic, proteolipid-like protein of Mr 6500 was isolated from ether/ethanol extracts of human, canine and bovine pulmonary surfactant. Amino acid composition of the protein demonstrated a remarkable abundance of hydrophobic residues, particularly valine and leucine. The N-terminal amino acid sequence of the human protein was determined: N-Leu-Ile-Pro-Cys-Cys-Pro-Val-Asn-Leu-Lys-Arg-Leu-Leu-Ile-Val4... An oligonucleotide probe was used to screen an adult human lung cDNA library and resulted in detection of cDNA clones with predicted amino acid sequence with close identity to the N-terminal amino acid sequence of the human peptide. SPL(pVal) was found within the reading frame of a larger peptide. SPL(pVal) results from proteolytic processing of a larger preprotein. Northern blot analysis detected a single 1.0 kilobase SPL(pVal) RNA which was less abundant in fetal than in adult lung. Mixtures of purified canine and bovine SPL(pVal) and synthetic phospholipids display properties of rapid adsorption and surface tension lowering activity characteristic of surfactant. Human SPL(pVal) is a pulmonary surfactant proteolipid which may therefore be useful in combination with phospholipids and/or other surfactant proteins for the treatment of surfactant deficiency such as hyaline membrane disease in newborn infants.

  9. SUBGROUPS OF AMINO ACID SEQUENCES IN THE VARIABLE REGIONS OF IMMUNOGLOBULIN HEAVY CHAINS*

    PubMed Central

    Cunningham, Bruce A.; Pflumm, Mollie N.; Rutishauser, Urs; Edelman, Gerald M.

    1969-01-01

    The amino acid sequence of the first 133 residues of the heavy (γ) chain from a human γG immunoglobulin (He) has been determined. This γ-chain is identical in Gm type to that of protein Eu, the complete sequence of which has been reported. Comparison of the two sequences substantiates the previous suggestion that there are subgroups of variable regions of heavy chains. The variable region of Eu has been assigned to subgroup I and that of He to subgroup II; on the other hand, the constant regions of the two proteins appear to be identical. Comparison of the sequence of the heavy chain of He with the heavy chain sequences determined in other laboratories suggests that the variable region of subgroup II is at least 118 residues long. The nature and distribution of amino acid variations in this heavy chain subgroup resemble those observed in light chain subgroups. These studies provide evidence that the translocation hypothesis applies to heavy as well as to light chains, viz., genes for variable regions (V) are somatically translocated to genes for constant regions (C) to form complete VC structural genes. PMID:5264153

  10. Complete nucleic acid sequence of Penaeus stylirostris densovirus (PstDNV) from India.

    PubMed

    Rai, Praveen; Safeena, Muhammed P; Karunasagar, Iddya; Karunasagar, Indrani

    2011-06-01

    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) of shrimp has recently been classified as Penaeus stylirostris densovirus (PstDNV). The complete nucleic acid sequence of PstDNV from India was obtained by cloning and sequencing different DNA fragments of the virus. The genome organisation of PstDNV revealed three major coding domains: a left ORF (NS1) of 2001 bp, a mid ORF (NS2) of 1092 bp and a right ORF (VP) of 990 bp. The complete genome and amino acid sequences of the three proteins, viz., NS1, NS2 and VP, were compared with the genomes of the virus reported from Hawaii, China and Mexico and with partial sequences available from isolates from different regions. The phylogenetic analysis of shrimp, insect and vertebrate parvovirus sequences showed that the Indian PstDNV isolate is phylogenetically more closely related to one of the three isolates from Taiwan (AY355307), and two isolates (AY362547 and AY102034) from Thailand.

  11. DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

    NASA Astrophysics Data System (ADS)

    Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

    1984-08-01

    A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β-lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.
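
    A tandem repeat such as the four-residue unit described above (Pro-Asn-Ala-Asn, written PNAN in one-letter code) can be counted with a short regular-expression search. The sketch below is illustrative only: the test sequence is a synthetic toy built from the repeat unit and is not the actual CS protein sequence, and the function name is hypothetical.

```python
import re

def max_tandem_repeats(seq: str, unit: str) -> int:
    """Return the largest number of back-to-back copies of `unit` found in `seq`."""
    pattern = re.compile(f"(?:{re.escape(unit)})+")
    runs = [len(match.group(0)) // len(unit) for match in pattern.finditer(seq)]
    return max(runs, default=0)

# Synthetic toy sequence: arbitrary flanks around 23 copies of the repeat unit.
toy_sequence = "MK" + "PNAN" * 23 + "GG"
repeat_count = max_tandem_repeats(toy_sequence, "PNAN")
```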

  12. Amino-Acid Sequence of NADP-Specific Glutamate Dehydrogenase of Neurospora crassa

    PubMed Central

    Wootton, John C.; Chambers, Geoffrey K.; Holder, Anthony A.; Baron, Andrew J.; Taylor, John G.; Fincham, John R. S.; Blumenthal, Kenneth M.; Moon, Kenneth; Smith, Emil L.

    1974-01-01

    A tentative primary structure of the NADP-specific glutamate dehydrogenase [L-glutamate: NADP oxidoreductase (deaminating), EC 1.4.1.4] from Neurospora crassa has been determined. The proposed sequence contains 452 amino-acid residues in each of the identical subunits of the hexameric enzyme. Comparison of the sequence with that of the bovine liver enzyme reveals considerable homology in the amino-terminal portion of the chain, including the vicinity of the reactive lysine, with only shorter stretches of homology within the carboxyl-terminal regions. The significance of this distribution of homologous regions is discussed. PMID:4155068

  13. Microarray and Functional Gene Analyses of Sulfate-Reducing Prokaryotes in Low-Sulfate, Acidic Fens Reveal Cooccurrence of Recognized Genera and Novel Lineages

    PubMed Central

    Loy, Alexander; Küsel, Kirsten; Lehner, Angelika; Drake, Harold L.; Wagner, Michael

    2004-01-01

    Low-sulfate, acidic (approximately pH 4) fens in the Lehstenbach catchment in the Fichtelgebirge mountains in Germany are unusual habitats for sulfate-reducing prokaryotes (SRPs) that have been postulated to facilitate the retention of sulfur and protons in these ecosystems. Despite the low in situ availability of sulfate (concentration in the soil solution, 20 to 200 μM) and the acidic conditions (soil and soil solution pHs, approximately 4 and 5, respectively), the upper peat layers of the soils from two fens (Schlöppnerbrunnen I and II) of this catchment displayed significant sulfate-reducing capacities. 16S rRNA gene-based oligonucleotide microarray analyses revealed stable diversity patterns for recognized SRPs in the upper 30 cm of both fens. Members of the family “Syntrophobacteraceae” were detected in both fens, while signals specific for the genus Desulfomonile were observed only in soils from Schlöppnerbrunnen I. These results were confirmed and extended by comparative analyses of environmentally retrieved 16S rRNA and dissimilatory (bi)sulfite reductase (dsrAB) gene sequences; dsrAB sequences from Desulfobacca-like SRPs, which were not identified by microarray analysis, were obtained from both fens. Hypotheses concerning the ecophysiological role of these three SRP groups in the fens were formulated based on the known physiological properties of their cultured relatives. In addition to these recognized SRP lineages, six novel dsrAB types that were phylogenetically unrelated to all known SRPs were detected in the fens. These dsrAB sequences had no features indicative of pseudogenes and likely represent novel, deeply branching, sulfate- or sulfite-reducing prokaryotes that are specialized colonists of low-sulfate habitats. PMID:15574893

  14. Analyses of HTLV-1 sequences suggest interaction between ORF-I mutations and HAM/TSP outcome.

    PubMed

    Barreto, Fernanda Khouri; Khouri, Ricardo; Rego, Filipe Ferreira de Almeida; Santos, Luciane Amorim; Castro-Amarante, Maria Fernanda de; Bialuk, Izabela; Pise-Masison, Cynthia A; Galvão-Castro, Bernardo; Gessain, Antoine; Jacobson, Steven; Franchini, Genoveffa; Alcantara, Luiz Carlos

    2016-11-01

    The region known as pX in the 3' end of the human T-cell lymphotropic virus type 1 (HTLV-1) genome contains four overlapping open reading frames (ORF) that encode regulatory proteins. HTLV-1 ORF-I produces the protein p12 and its cleavage product p8. The functions of these proteins have been linked to immune evasion and viral infectivity and persistence. It is known that the HTLV-1 infection does not necessarily imply the development of pathological processes and here we evaluated whether natural mutations in HTLV-1 ORF-I can influence the proviral load and clinical manifestation of HTLV-I-associated myelopathy/tropical spastic paraparesis (HAM/TSP). For that, we performed molecular characterization, datamining and phylogenetic analysis with HTLV-1 ORF-I sequences from 156 patients with negative or positive diagnosis for HAM/TSP. Our analyses demonstrated that some mutations may be associated with the outcome of HAM/TSP (C39R, L40F, P45L, S69G and R88K) or with proviral load (P34L and F61L). We further examined the presence of mutations in motifs of HBZ and observed that P45L mutation is located within the HBZ nuclear localization signal and was found more frequently between patients with HAM/TSP and high proviral load. These results indicate that some natural mutations are located in functional domains of ORF-I and suggests a potential association between these mutations and the proviral loads and development of HAM/TSP. Therefore it is necessary to conduct functional studies aimed at evaluating the impact of these mutations on the virus persistence and immune evasion.

  15. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

    PubMed Central

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-01-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed). PMID:22638583
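
    The idea of per-position information content with a correction for few observations can be sketched as follows. This is a minimal illustration, not Seq2Logo's own algorithm: it assumes a uniform background over the 20 amino acids and a flat additive pseudocount, and it applies no sequence weighting. The function names and the toy peptides are hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def column_information(column, pseudocount=0.0):
    """Shannon information content (bits) of one alignment column.

    Assumption: uniform background and a flat additive pseudocount,
    which is a simplification of the pseudo counts named in the abstract.
    """
    counts = Counter(aa for aa in column if aa in AMINO_ACIDS)
    total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
    if total == 0:
        return 0.0
    entropy = 0.0
    for aa in AMINO_ACIDS:
        p = (counts[aa] + pseudocount) / total
        if p > 0:
            entropy -= p * math.log2(p)
    # Maximum possible entropy minus observed entropy.
    return math.log2(len(AMINO_ACIDS)) - entropy


def logo_heights(alignment, pseudocount=0.0):
    """Total logo height per position for equal-length sequences."""
    return [column_information("".join(col), pseudocount)
            for col in zip(*alignment)]


# Hypothetical toy peptides, for illustration only.
peptides = ["ALAKV", "ALSKV", "ALGRV", "ALTKL"]
heights = logo_heights(peptides)
```

    In this toy set the first position is fully conserved, so its height equals log2(20), about 4.32 bits. Adding a pseudocount lowers that height, which reflects how little evidence four sequences provide.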

  16. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion.

    PubMed

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-07-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed).

  17. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.
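
    The library sizes quoted above can be put in context with some simple arithmetic. The sketch below is not the patent's own calculation: it assumes a template of uniformly random base composition and counts perfect-match sites on both strands. The function names are hypothetical.

```python
def kmer_space(k):
    """Number of distinct DNA sequences of length k."""
    return 4 ** k


def library_coverage(library_size, k):
    """Fraction of all possible k-mers contained in the library."""
    return library_size / kmer_space(k)


def expected_sites(library_size, k, template_bp):
    """Expected number of perfect-match priming sites on a template.

    Assumption: random template with uniform base composition;
    both strands are counted.
    """
    positions = 2 * (template_bp - k + 1)
    return library_size * positions / kmer_space(k)


# Library sizes from the abstract: primer length -> number of primers.
libraries = {8: 10_000, 9: 14_200, 10: 44_000}
for k, size in libraries.items():
    coverage = library_coverage(size, k)
    sites = expected_sites(size, k, 45_000)
    print(f"k={k}: {coverage:.1%} of all {k}-mers, "
          f"about {sites:.0f} expected sites on 45,000 bp")
```

    Under these assumptions, each of the three libraries covers only a small fraction of the possible primer sequences yet is still expected to match a 45,000 bp template at many positions.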

  18. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

  19. Sequence-specific thermodynamic properties of nucleic acids influence both transcriptional pausing and backtracking in yeast

    PubMed Central

    2017-01-01

    RNA Polymerase II pauses and backtracks during transcription, with many consequences for gene expression and cellular physiology. Here, we show that the energy required to melt double-stranded nucleic acids in the transcription bubble predicts pausing in Saccharomyces cerevisiae far more accurately than nucleosome roadblocks do. In addition, the same energy difference also determines when the RNA polymerase backtracks instead of continuing to move forward. This data-driven model corroborates—in a genome wide and quantitative manner—previous evidence that sequence-dependent thermodynamic features of nucleic acids influence both transcriptional pausing and backtracking. PMID:28301878

  20. Respiratory syncytial virus fusion glycoprotein: nucleotide sequence of mRNA, identification of cleavage activation site and amino acid sequence of N-terminus of F1 subunit.

    PubMed Central

    Elango, N; Satake, M; Coligan, J E; Norrby, E; Camargo, E; Venkatesan, S

    1985-01-01

    The amino acid sequence of respiratory syncytial virus fusion protein (Fo) was deduced from the sequence of a partial cDNA clone of mRNA and from the 5' mRNA sequence obtained by primer extension and dideoxysequencing. The encoded protein of 574 amino acids is extremely hydrophobic and has a molecular weight of 63371 daltons. The site of proteolytic cleavage within this protein was accurately mapped by determining a partial amino acid sequence of the N-terminus of the larger subunit (F1) purified by radioimmunoprecipitation using monoclonal antibodies. Alignment of the N-terminus of the F1 subunit within the deduced amino acid sequence of Fo permitted us to identify a sequence of lys-lys-arg-lys-arg-arg at the C-terminus of the smaller N-terminal F2 subunit that appears to represent the cleavage/activation domain. Five potential sites of glycosylation, four within the F2 subunit, were also identified. Three extremely hydrophobic domains are present in the protein; a) the N-terminal signal sequence, b) the N-terminus of the F1 subunit that is analogous to the N-terminus of the paramyxovirus F1 subunit and the HA2 subunit of influenza virus hemagglutinin, and c) the putative membrane anchorage domain near the C-terminus of F1. PMID:2987829
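
    Locating a basic cleavage motif such as lys-lys-arg-lys-arg-arg in a deduced protein sequence can be sketched as a plain substring search. The test sequence below is an artificial placeholder, not the RSV fusion protein, and the function names are hypothetical.

```python
THREE_TO_ONE = {
    "ala": "A", "arg": "R", "asn": "N", "asp": "D", "cys": "C",
    "gln": "Q", "glu": "E", "gly": "G", "his": "H", "ile": "I",
    "leu": "L", "lys": "K", "met": "M", "phe": "F", "pro": "P",
    "ser": "S", "thr": "T", "trp": "W", "tyr": "Y", "val": "V",
}


def to_one_letter(motif):
    """Convert a hyphen-separated three-letter motif to one-letter code."""
    return "".join(THREE_TO_ONE[res.lower()] for res in motif.split("-"))


def find_motif(sequence, motif):
    """Return 1-based start positions of every occurrence of motif."""
    hits = []
    start = sequence.find(motif)
    while start != -1:
        hits.append(start + 1)
        start = sequence.find(motif, start + 1)
    return hits


motif = to_one_letter("lys-lys-arg-lys-arg-arg")

# Artificial placeholder sequence, for illustration only.
toy_sequence = "GSGSGSGSAA" + "KKRKRR" + "FLGAAGSGS"
for position in find_motif(toy_sequence, motif):
    downstream_start = position + len(motif)
    print(f"motif {motif} at residue {position}; "
          f"downstream fragment would begin at residue {downstream_start}")
```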

  1. Analysis of protein function and its prediction from amino acid sequence.

    PubMed

    Clark, Wyatt T; Radivojac, Predrag

    2011-07-01

    Understanding protein function is one of the keys to understanding life at the molecular level. It is also important in the context of human disease because many conditions arise as a consequence of alterations of protein function. The recent availability of relatively inexpensive sequencing technology has resulted in thousands of complete or partially sequenced genomes with millions of functionally uncharacterized proteins. Such a large volume of data, combined with the lack of high-throughput experimental assays to functionally annotate proteins, contributes to the growing importance of automated function prediction. Here, we study proteins annotated by Gene Ontology (GO) terms and estimate the accuracy of functional transfer from protein sequence only. We find that the transfer of GO terms by pairwise sequence alignments is only moderately accurate, showing a surprisingly small influence of sequence identity (SID) in a broad range (30-100%). We developed and evaluated a new predictor of protein function, functional annotator (FANN), from amino acid sequence. The predictor exploits a multioutput neural network framework which is well suited to simultaneously modeling dependencies between functional terms. Experiments provide evidence that FANN-GO (predictor of GO terms; available from http://www.informatics.indiana.edu/predrag) outperforms standard methods such as transfer by global or local SID as well as GOtcha, a method that incorporates the structure of GO.
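
    The baseline the abstract compares against, transfer of GO terms by pairwise sequence identity, can be sketched roughly as below. This is not FANN-GO and not the authors' evaluation protocol. It assumes the pairwise alignments are already computed, defines identity over columns where neither sequence has a gap, and uses made-up sequences; the function names are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns where neither sequence has a gap.

    Assumption: this is only one of several common definitions of SID.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        matches += (a == b)
    return 100.0 * matches / compared if compared else 0.0


def transfer_terms(alignments, min_identity=30.0):
    """Copy GO terms from the most similar annotated hit.

    alignments: list of (aligned_query, aligned_hit, go_terms) tuples.
    Returns (terms, identity); terms is empty if no hit reaches the
    identity threshold.
    """
    best_terms, best_sid = set(), 0.0
    for aligned_query, aligned_hit, terms in alignments:
        sid = percent_identity(aligned_query, aligned_hit)
        if sid >= min_identity and sid > best_sid:
            best_terms, best_sid = set(terms), sid
    return best_terms, best_sid


# Made-up alignments and annotations, for illustration only.
hits = [
    ("MKTAYIAK", "MKTAYLAK", {"GO:0005524"}),
    ("MKTAYIAK", "M-SAWIGK", {"GO:0003677"}),
]
terms, sid = transfer_terms(hits)
```

    Raising `min_identity` makes the transfer more conservative, although the abstract reports that identity had a surprisingly small influence on accuracy across the 30-100% range.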

  2. The Complete Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis ssp. lactis IL1403

    PubMed Central

    Bolotin, Alexander; Wincker, Patrick; Mauger, Stéphane; Jaillon, Olivier; Malarme, Karine; Weissenbach, Jean; Ehrlich, S. Dusko; Sorokin, Alexei

    2001-01-01

    Lactococcus lactis is a nonpathogenic AT-rich gram-positive bacterium closely related to the genus Streptococcus and is the most commonly used cheese starter. It is also the best-characterized lactic acid bacterium. We sequenced the genome of the laboratory strain IL1403, using a novel two-step strategy that comprises diagnostic sequencing of the entire genome and a shotgun polishing step. The genome contains 2,365,589 base pairs and encodes 2310 proteins, including 293 protein-coding genes belonging to six prophages and 43 insertion sequence (IS) elements. Nonrandom distribution of IS elements indicates that the chromosome of the sequenced strain may be a product of recent recombination between two closely related genomes. A complete set of late competence genes is present, indicating the ability of L. lactis to undergo DNA transformation. Genomic sequence revealed new possibilities for fermentation pathways and for aerobic respiration. It also indicated a horizontal transfer of genetic information from Lactococcus to gram-negative enteric bacteria of Salmonella-Escherichia group. [The sequence data described in this paper has been submitted to the GenBank data library under accession no. AE005176.] PMID:11337471

  3. Amino acid sequence of myoglobin from emu (Dromaius novaehollandiae) skeletal muscle.

    PubMed

    Suman, S P; Joseph, P; Li, S; Beach, C M; Fontaine, M; Steinke, L

    2010-11-01

    The objective of the present study was to characterize the primary structure of emu myoglobin (Mb). Emu Mb was isolated from Iliofibularis muscle employing gel-filtration chromatography. Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry was employed to determine the exact molecular mass of emu Mb in comparison with horse Mb, and Edman degradation was utilized to characterize the amino acid sequence. The molecular mass of emu Mb was 17,380 Da and was close to those reported for ratite and poultry myoglobins. Similar to myoglobins from meat-producing livestock and birds, emu Mb has 153 amino acids. Emu Mb contains 9 histidines. Proximal and distal histidines, responsible for coordinating the oxygen-binding property of Mb, are conserved in emu. Emu Mb shared more than 90% homology with ratite and chicken myoglobins, whereas it demonstrated less than 70% sequence similarity with ruminant myoglobins.

  4. Stereochemical Sequence Ion Selectivity: Proline versus Pipecolic-acid-containing Protonated Peptides

    NASA Astrophysics Data System (ADS)

    Abutokaikah, Maha T.; Guan, Shanshan; Bythell, Benjamin J.

    2017-01-01

    Substitution of proline by pipecolic acid, the six-membered ring congener of proline, results in vastly different tandem mass spectra. The well-known proline effect is eliminated and amide bond cleavage C-terminal to pipecolic acid dominates instead. Why do these two ostensibly similar residues produce dramatically differing spectra? Recent evidence indicates that the proton affinities of these residues are similar, so are unlikely to explain the result [Raulfs et al., J. Am. Soc. Mass Spectrom. 25, 1705-1715 (2014)]. An additional hypothesis based on increased flexibility was also advocated. Here, we provide a computational investigation of the "pipecolic acid effect," to test this and other hypotheses to determine if theory can shed additional light on this fascinating result. Our calculations provide evidence for both the increased flexibility of pipecolic-acid-containing peptides, and structural changes in the transition structures necessary to produce the sequence ions. The most striking computational finding is inversion of the stereochemistry of the transition structures leading to "proline effect"-type amide bond fragmentation between the proline/pipecolic acid-congeners: R (proline) to S (pipecolic acid). Additionally, our calculations predict substantial stabilization of the amide bond cleavage barriers for the pipecolic acid congeners by reduction in deleterious steric interactions and provide evidence for the importance of experimental energy regime in rationalizing the spectra.

  5. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  6. Amino acid sequence of atrial natriuretic peptides in human coronary sinus plasma.

    PubMed

    Yandle, T; Crozier, I; Nicholls, G; Espiner, E; Carne, A; Brennan, S

    1987-07-31

    Two atrial natriuretic peptides were purified from pooled human coronary sinus plasma by Sep-Pak extraction, immunoaffinity chromatography and reverse phase HPLC. The amino acid sequences of the two peptides were homologous with 99-126 human atrial natriuretic peptide (hANP) and 106-126 hANP, the latter being most probably linked to 99-105 ANP by the disulphide bond. The molar ratio of the peptides in plasma, as assessed by radioimmunoassay was 10:3.

  7. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein

    PubMed Central

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequence homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid-containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three-dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer. PMID:24147211

  8. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein.

    PubMed

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequence homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid-containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three-dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  9. Amino acid sequence similarity between rabies virus glycoprotein and snake venom curaremimetic neurotoxins.

    PubMed

    Lentz, T L; Wilson, P T; Hawrot, E; Speicher, D W

    1984-11-16

    Evidence was presented earlier that a host-cell receptor for the highly neurotropic rabies virus might be the acetylcholine receptor. The amino acid sequence of the glycoprotein of rabies virus was compared by computer analysis with that of snake venom curaremimetic neurotoxins, potent ligands of the acetylcholine receptor. A statistically significant sequence relation was found between a segment of the rabies glycoprotein and the entire sequence of long neurotoxins. The greatest identity occurs with residues considered most important in neurotoxicity, including those interacting with the acetylcholine binding site of the acetylcholine receptor. Because of the similarity between the glycoprotein and the receptor-binding region of the neurotoxins, this region of the viral glycoprotein may function as a recognition site for the acetylcholine receptor. Direct binding of the rabies virus glycoprotein to the acetylcholine receptor could contribute to the neurotropism of this virus.

  10. Partial amino acid sequence of human pancreatic stone protein, a novel pancreatic secretory protein.

    PubMed Central

    Montalto, G; Bonicel, J; Multigner, L; Rovery, M; Sarles, H; De Caro, A

    1986-01-01

    Pancreatic stone protein (PSP) is the major organic component of human pancreatic stones. With the use of monoclonal antibody immunoadsorbents, five immunoreactive forms (PSP-S) with close Mr values (14,000-19,000) were isolated from normal pancreatic juice. By CM-Trisacryl M chromatography the lowest-Mr form (PSP-S1) was separated from the others and some of its molecular characteristics were investigated. The Mr of the PSP-S1 polypeptide chain calculated from the amino acid composition was about 16,100. The N-terminal sequences (40 residues) of PSP and PSP-S1 are identical, which suggests that the peptide backbone is the same for both of these polypeptides. The PSP-S1 sequence was determined up to residue 65 and was found to be different from all other known protein sequences. PMID:3541906

  11. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    PubMed

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment.

  12. [MOLECULAR EVOLUTION OF ION CHANNELS: AMINO ACID SEQUENCES AND 3D STRUCTURES].

    PubMed

    Korkosh, V S; Zhorov, B S; Tikhonov, D B

    2016-01-01

    An integral part of modern evolutionary biology is comparative analysis of structure and function of macromolecules such as proteins. The first and critical step to understand evolution of homologous proteins is their amino acid sequence alignment. However, standard algorithms do not provide unambiguous sequence alignments for proteins of poor homology. More reliable results can be obtained by comparing experimental 3D structures obtained at atomic resolution, for instance, with the aid of X-ray structural analysis. If such structures are lacking, homology modeling is used, which may take into account indirect experimental data on functional roles of individual amino-acid residues. An important problem is that the sequence alignment, which reflects genetic modifications, does not necessarily correspond to the functional homology. The latter depends on three-dimensional structures which are critical for natural selection. Since alignment techniques relying only on the analysis of primary structures carry no information on the functional properties of proteins, including 3D structures into consideration is very important. Here we consider several examples involving ion channels and demonstrate that alignment of their three-dimensional structures can significantly improve sequence alignments obtained by traditional methods.

  13. The amino acid sequence of the aspartate aminotransferase from baker's yeast (Saccharomyces cerevisiae).

    PubMed Central

    Cronin, V B; Maras, B; Barra, D; Doonan, S

    1991-01-01

    1. The single (cytosolic) aspartate aminotransferase was purified in high yield from baker's yeast (Saccharomyces cerevisiae). 2. Amino-acid-sequence analysis was carried out by digestion of the protein with trypsin and with CNBr; some of the peptides produced were further subdigested with Staphylococcus aureus V8 proteinase or with pepsin. Peptides were sequenced by the dansyl-Edman method and/or by automated gas-phase methods. The amino acid sequence obtained was complete except for a probable gap of two residues as indicated by comparison with the structures of counterpart proteins in other species. 3. The N-terminus of the enzyme is blocked. Fast-atom-bombardment m.s. was used to identify the blocking group as an acetyl one. 4. Alignment of the sequence of the enzyme with those of vertebrate cytosolic and mitochondrial aspartate aminotransferases and with the enzyme from Escherichia coli showed that about 25% of residues are conserved between these distantly related forms. 5. Experimental details and confirmatory data for the results presented here are given in a Supplementary Publication (SUP 50164, 25 pages) that has been deposited at the British Library Document Supply Centre, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1991) 273, 5. PMID:1859361

  14. Analysis of amino acid sequence variations and immunoglobulin E-binding epitopes of German cockroach tropomyosin.

    PubMed

    Jeong, Kyoung Yong; Lee, Jongweon; Lee, In-Yong; Ree, Han-Il; Hong, Chein-Soo; Yong, Tai-Soon

    2004-09-01

    The allergenicities of tropomyosins from different organisms have been reported to vary. The cDNA encoding German cockroach tropomyosin (Bla g 7) was isolated, expressed, and characterized previously. In the present study, the amino acid sequence variations in German cockroach tropomyosin were analyzed in order to investigate their influence on allergenicity. We also undertook the identification of immunodominant peptides containing immunoglobulin E (IgE) epitopes which may facilitate the development of diagnostic and immunotherapeutic strategies based on the recombinant proteins. Two-dimensional gel electrophoresis and immunoblot analysis with mouse anti-recombinant German cockroach tropomyosin serum were performed to investigate the isoforms at the protein level. Reverse transcriptase PCR (RT-PCR) was applied to examine the sequence diversity. Eleven different variants of the deduced amino acid sequences were identified by RT-PCR. German cockroach tropomyosin has only minor sequence variations that did not seem to affect its allergenicity significantly. These results support the molecular basis underlying the cross-reactivities of arthropod tropomyosins. Recombinant fragments were also generated by PCR, and IgE-binding epitopes were assessed by enzyme-linked immunosorbent assay. Sera from seven patients revealed heterogeneous IgE-binding responses. This study demonstrates multiple IgE-binding epitope regions in a single molecule, suggesting that full-length tropomyosin should be used for the development of diagnostic and therapeutic reagents.

  15. A microcalorimetric sensor for food and cosmetic analyses: l-Malic acid determination.

    PubMed

    Antonelli, Marta Letizia; Spadaro, Claudio; Tornelli, Rosalia Fortunata

    2008-02-15

    Enzymatic microcalorimetry has been successfully employed for the reliable determination of l-malic acid concentration in some foods and cosmetic products. Monitoring the l-malic acid concentration during the wine-making process is particularly useful for controlling the progress of the malo-lactic fermentation. Total acidity, taste and flavour characteristics of wine depend on the quantity of l-malic acid still present. To set up the analytical method, the dehydration of l-malic acid in the presence of the enzyme fumarase was used. The new method has been compared with a common spectrophotometric one. With the proposed calorimetric method, the l-malic acid concentration in different types of food (white and red wines, fruits and soft beverages) has been determined. l-Malic acid was also quantified in some cosmetic products. The method proved simple, direct and reliable (good accuracy and precision); in particular, it does not require any pre-treatment or clean-up of the samples other than dilution in buffer.

  16. Phospholipid Analyses for Microbial Community Composition in Alpine Acid Rock Drainage

    NASA Astrophysics Data System (ADS)

    Webster, C. E.; Tapp, J. B.; Pfiffner, S. M.

    2008-12-01

    This project is examining factors of non-anthropogenic acid rock drainage that influence microbial community composition in the Peekaboo Gulch drainage basin (Sawatch Range, Colorado). At this site, natural acid rock drainage flows from acidic springs (pH=2.6) on Red Mountain. The acid drainage converges with South Fork Lake Creek (pH ~ 7.0, prior to convergence) two miles down gradient. Sediment samples were collected across confluences with gradients of pH, temperature, conductivity and metal concentration. In situ measurements ranged from 2.3 to 7.9 for pH, 3.8 to 16.6 degrees Celsius for temperature, and 34.9 to 1820 for conductivity. Biomass as measured by phospholipids ranged from 280 to 95,900 pmol/g sediment. The only relationship between the in situ parameters and the phospholipid profiles is a weak positive correlation between pH and branched monounsaturated fatty acid methyl esters, in that these fatty acid methyl esters were detected at pH greater than 5.0. The phospholipid profiles were diverse across the samples. These profiles changed with respect to the spatial relationship within the drainage pattern. The highest alpine samples contained greater relative abundances of monounsaturated fatty acid methyl esters compared to the lower alpine samples. Microbial community profiles shifted at each confluence depending on water source chemistry. Continuing research is needed to determine other biogeochemical factors that may influence these community shifts.

  17. Comparative genomic sequence and expression analyses of Medicago truncatula and alfalfa subspecies falcata COLD-ACCLIMATION-SPECIFIC genes.

    PubMed

    Pennycooke, Joyce C; Cheng, Hongmei; Stockinger, Eric J

    2008-03-01

    In Arabidopsis (Arabidopsis thaliana) the low-temperature induction of genes encoding the C-REPEAT BINDING FACTOR (CBF) transcriptional activators is a key step in cold acclimation. CBFs in turn activate a battery of downstream genes known as the CBF regulon, which collectively act to increase tolerance to low temperatures. Fundamental questions are: What determines the size and scope of the CBF regulon, and is this a major determinant of the low-temperature tolerance capacity of individual plant species? Here we have begun to address these questions through comparative analyses of Medicago truncatula and Medicago sativa subsp. falcata. M. truncatula survived to -4 degrees C but did not cold acclimate, whereas Medicago falcata cold acclimated and survived -14 degrees C. Both species possessed low-temperature-induced CBFs but differed in the expression of the COLD-ACCLIMATION-SPECIFIC (CAS) genes, which are candidate CBF targets. M. falcata CAS30 was robustly cold-responsive whereas the MtCAS31 homolog was not. M. falcata also possessed additional CAS30 homologs in comparison to the single CAS31 gene in M. truncatula. MfCAS30 possessed multiple pairs of closely spaced C-REPEAT/DEHYDRATION RESPONSIVE ELEMENT (CRT/DRE) motifs, the cognate CBF binding site, in its upstream region, whereas MtCAS31 lacked one CRT/DRE partner of the two proximal partner pairs. CAS genes also shared a promoter structure comprising modules proximal and distal to the coding sequence. CAS15, highly cold-responsive in both species, harbored numerous CRT/DRE motifs, but only in the distal module. However, fusion of the MtCAS15 promoter, including the distal module, to a reporter gene did not result in low-temperature responsiveness in stably transformed Arabidopsis. In contrast, both MtCAS31 and MfCAS30 promoter fusions were low-temperature responsive, although the MtCAS31 fusion was less robust than the MfCAS30 fusion.
From these studies we conclude that CAS genes harbor CRT/DRE motifs, their

  18. A case study on discovery of novel Citrus leprosis virus cytoplasmic type 2 utilizing small RNA libraries by next generation sequencing and bioinformatic analyses

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The identification of novel plant viruses is a tricky matter. Most plant virus diagnostics are based on immunological or nucleic acid based assays, where prior characterization of the virus (either antibodies or genetic sequence) is required for reagent production. There are no universal nucleic a...

  19. Complete amino acid sequence of a histidine-rich proteolytic fragment of human ceruloplasmin.

    PubMed

    Kingston, I B; Kingston, B L; Putnam, F W

    1979-04-01

    The complete amino acid sequence has been determined for a fragment of human ceruloplasmin [ferroxidase; iron(II):oxygen oxidoreductase, EC 1.16.3.1]. The fragment (designated Cp F5) contains 159 amino acid residues and has a molecular weight of 18,650; it lacks carbohydrate, is rich in histidine, and contains one free cysteine that may be part of a copper-binding site. This fragment is present in most commercial preparations of ceruloplasmin, probably owing to proteolytic degradation, but can also be obtained by limited cleavage of single-chain ceruloplasmin with plasmin. Cp F5 probably is an intact domain attached to the COOH-terminal end of single-chain ceruloplasmin via a labile interdomain peptide bond. A model of the secondary structure predicted by empirical methods suggests that almost one-third of the amino acid residues are distributed in alpha helices, about a third in beta-sheet structure, and the remainder in beta turns and unidentified structures. Computer analysis of the amino acid sequence has not demonstrated a statistically significant relationship between this ceruloplasmin fragment and any other protein, but there is some evidence for an internal duplication.

  20. Beyond H&E: integration of nucleic acid-based analyses into diagnostic pathology.

    PubMed

    Maes, R K; Langohr, I M; Wise, A G; Smedley, R C; Thaiwong, T; Kiupel, M

    2014-01-01

    Veterinary pathology of infectious, particularly viral, and neoplastic diseases has advanced significantly with the advent of newer molecular methodologies that can detect nucleic acid of infectious agents within microscopic lesions, differentiate neoplastic from nonneoplastic cells, or determine the suitability of a targeted therapy by detecting specific mutations in certain cancers. Polymerase chain reaction-based amplification of DNA or RNA and in situ hybridization are currently the most commonly used methods for nucleic acid detection. In contrast, the main methodology used for protein detection within microscopic lesions is immunohistochemistry. Other methods that allow for analysis of nucleic acids within a particular cell type or individual cells, such as laser capture microdissection, are also available in some laboratories. This review gives an overview of the factors that influence the accurate analysis of nucleic acids in formalin-fixed tissues, as well as of different approaches to detect such targets.

  1. THERMAL AND SPECTROSCOPIC ANALYSES OF CAUSTIC SIDE SOLVENT EXTRACTION SOLVENT CONTACTED WITH 16 MOLAR AND 8 MOLAR NITRIC ACID

    SciTech Connect

    Fondeur, F; David Hobbs, D; Samuel Fink, S

    2007-07-12

    Thermal and spectroscopic analyses were performed on multiple layers formed from contacting Caustic Side Solvent Extraction (CSSX) solvent with 1 M or 3 M nitric acid. A slow chemical reaction occurs (i.e., over several weeks) between the solvent and 1 M or 3 M nitric acid as evidenced by color changes and the detection of nitro groups in the infrared spectrum of the aged samples. Thermal analysis revealed that decomposition of the resulting mixture does not meet the definition of explosive or deflagrating material.

  2. THERMAL AND SPECTROSCOPIC ANALYSES OF CAUSTIC SIDE SOLVENT EXTRACTION SOLVENT CONTACTED WITH 1 MOLAR AND 3 MOLAR NITRIC ACID

    SciTech Connect

    Fondeur, F; David Hobbs, D; Samuel Fink, S

    2007-07-23

    Thermal and spectroscopic analyses were performed on multiple layers formed from contacting Caustic Side Solvent Extraction (CSSX) solvent with 1 M or 3 M nitric acid. A slow chemical reaction occurs (i.e., over several weeks) between the solvent and 1 M or 3 M nitric acid as evidenced by color changes and the detection of nitro groups in the infrared spectrum of the aged samples. Thermal analysis revealed that decomposition of the resulting mixture does not meet the definition of explosive or deflagrating material.

  3. Processing and amino acid sequence analysis of the mouse mammary tumor virus env gene product.

    PubMed Central

    Arthur, L O; Copeland, T D; Oroszlan, S; Schochetman, G

    1982-01-01

    The envelope proteins of mouse mammary tumor virus (MMTV) are synthesized from a subgenomic 24S mRNA as a 75,000-dalton glycosylated precursor polyprotein which is eventually processed to the mature glycoproteins gp52 and gp36. In vivo synthesis of this env precursor in the presence of the core glycosylation inhibitor tunicamycin yielded a precursor of approximately 61,000 daltons (P61env). However, a 67,000-dalton protein (P67env) was obtained from cell-free translation with the MMTV 24S mRNA as the template. To determine whether the portion of the protein cleaved from P67env to give P61env was removed from the NH2-terminal end of P67env and as such would represent a leader sequence, the NH2-terminal amino acid sequence of the terminal peptide gp52 was determined. Glutamic acid, and not methionine, was found to be the amino-terminal residue of gp52, indicating that the cleaved portion was derived from the NH2-terminal end of P67env. The NH2-terminal amino acid sequences of gp52's from endogenous and exogenous C3H MMTVs were determined through 46 residues and found to be identical. However, amino acid composition and type-specific gp52 radioimmunoassays from MMTVs grown in heterologous cells indicated primary structure differences between gp52's of the two viruses. The nucleic acid sequence of cloned MMTV DNA fragments (J. Majors and H. E. Varmus, personal communication) in conjunction with the NH2-terminal sequence of gp52 allowed localization of the env gene in the MMTV genome. Nucleotides coding for the NH2 terminus of gp52 begin approximately 0.8 kilobase to the 3' side of the single EcoRI cleavage site. Localization of the env gene at that point agrees with the proposed gene order -gag-pol-env- and also allows sufficient coding potential for the glycoprotein precursor without extending into the long terminal repeat. PMID:6281457

  4. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    PubMed Central

    Rhee, Mun Su; Moritz, Brélan E.; Xie, Gary; Glavina del Rio, T.; Dalin, E.; Tice, H.; Bruce, D.; Goodwin, L.; Chertkov, O.; Brettin, T.; Han, C.; Detter, C.; Pitluck, S.; Land, Miriam L.; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, K. T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed. PMID:22675583

  5. BeadCons: detection of nucleic acid sequences by flow cytometry.

    PubMed

    Horejsh, Douglas; Martini, Federico; Capobianchi, Maria Rosaria

    2005-11-01

    Molecular beacons are single-stranded nucleic acid structures with a terminal fluorophore and a distal, terminal quencher. These molecules are typically used in real-time PCR assays, but have also been conjugated with solid matrices. This unit describes protocols related to molecular beacon-conjugated beads (BeadCons), whose specific hybridization with complementary target sequences can be resolved by cytometry. Assay sensitivity is achieved through the concentration of fluorescence signal on discrete particles. By using molecular beacons with different fluorophores and microspheres of different sizes, it is possible to construct a fluid array system with each bead corresponding to a specific target nucleic acid. Methods are presented for the design, construction, and use of BeadCons for the specific, multiplexed detection of unlabeled nucleic acids in solution. The use of bead-based detection methods will likely lead to the design of new multiplex molecular diagnostic tools.

  6. Measuring nanometer distances in nucleic acids using a sequence-independent nitroxide probe

    PubMed Central

    Qin, Peter Z; Haworth, Ian S; Cai, Qi; Kusnetzow, Ana K; Grant, Gian Paola G; Price, Eric A; Sowa, Glenna Z; Popova, Anna; Herreros, Bruno; He, Honghang

    2008-01-01

    This protocol describes the procedures for measuring nanometer distances in nucleic acids using a nitroxide probe that can be attached to any nucleotide within a given sequence. Two nitroxides are attached to phosphorothioates that are chemically substituted at specific sites of DNA or RNA. Inter-nitroxide distances are measured using a four-pulse double electron–electron resonance technique, and the measured distances are correlated to the parent structures using a Web-accessible computer program. Four to five days are needed for sample labeling, purification and distance measurement. The procedures described herein provide a method for probing global structures and studying conformational changes of nucleic acids and protein/nucleic acid complexes. PMID:17947978

  7. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1.

    PubMed

    Rhee, Mun Su; Moritz, Brélan E; Xie, Gary; Glavina Del Rio, T; Dalin, E; Tice, H; Bruce, D; Goodwin, L; Chertkov, O; Brettin, T; Han, C; Detter, C; Pitluck, S; Land, Miriam L; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O; Shanmugam, K T

    2011-12-31

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  8. The amino acid sequence of Lady Amherst's pheasant (Chrysolophus amherstiae) and golden pheasant (Chrysolophus pictus) egg-white lysozymes.

    PubMed

    Araki, T; Kuramoto, M; Torikata, T

    1990-09-01

    The amino acids of Lady Amherst's pheasant and golden pheasant egg-white lysozymes have been sequenced. The carboxymethylated lysozymes were digested with trypsin followed by sequencing of the tryptic peptides. Lady Amherst's pheasant lysozyme proved to consist of 129 amino acid residues, and a relative molecular mass of 14,423 Da was calculated. This lysozyme had 6 amino acid substitutions when compared with hen egg-white lysozyme: Phe3 to Tyr, His15 to Leu, Gln41 to His, Asn77 to His, Gln121 to Asn, and a newly found substitution of Ile124 to Thr. The amino acid sequence of golden pheasant lysozyme was identical to that of Lady Amherst's pheasant lysozyme. The phylogenetic tree constructed by the comparison of amino acid sequences of phasianoid bird lysozymes revealed a minimum genetic distance between these pheasants and the turkey-peafowl group.

  9. Arrays of complementary oligonucleotides for analysing the hybridisation behaviour of nucleic acids.

    PubMed Central

    Southern, E M; Case-Green, S C; Elder, J K; Johnson, M; Mir, K U; Wang, L; Williams, J C

    1994-01-01

    Arrays of oligonucleotides corresponding to a full set of complements of a known sequence can be made in a single series of base couplings in which each base in the complement is added in turn. Coupling is carried out on the surface of a solid support such as a glass plate, using a device which applies reagents in a defined area. The device is displaced by a fixed movement after each coupling reaction so that consecutive couplings overlap only a portion of previous ones. The shape and size of the device and the amount by which it is displaced at each step determine the length of the oligonucleotides. Certain shapes create arrays of oligonucleotides from mononucleotides up to a given length in a single series of couplings. The array is used in a hybridisation reaction to a labelled target sequence, and shows the hybridisation behaviour of every oligonucleotide in the target sequence with its complement in the array. Applications include sequence comparison to test for mutation, analysis of secondary structure, and optimisation of PCR primer and antisense oligonucleotide design. PMID:7514785

  10. A 25-Amino Acid Sequence of the Arabidopsis TGD2 Protein Is Sufficient for Specific Binding of Phosphatidic Acid*

    PubMed Central

    Lu, Binbin; Benning, Christoph

    2009-01-01

    Genetic analysis suggests that the TGD2 protein of Arabidopsis is required for the biosynthesis of endoplasmic reticulum derived thylakoid lipids. TGD2 is proposed to be the substrate-binding protein of a presumed lipid transporter consisting of the TGD1 (permease) and TGD3 (ATPase) proteins. The TGD1, -2, and -3 proteins are localized in the inner chloroplast envelope membrane. TGD2 appears to be anchored with an N-terminal membrane-spanning domain into the inner envelope membrane, whereas the C-terminal domain faces the intermembrane space. It was previously shown that the C-terminal domain of TGD2 binds phosphatidic acid (PtdOH). To investigate the PtdOH binding site of TGD2 in detail, the C-terminal domain of the TGD2 sequence lacking the transit peptide and transmembrane sequences was fused to the C terminus of the Discosoma sp. red fluorescent protein (DR). This greatly improved the solubility of the resulting DR-TGD2C fusion protein following production in Escherichia coli. The DR-TGD2C protein bound PtdOH with high specificity, as demonstrated by membrane lipid-protein overlay and liposome association assays. Internal deletion and truncation mutagenesis identified a previously undescribed minimal 25-amino acid fragment in the C-terminal domain of TGD2 that is sufficient for PtdOH binding. Binding characteristics of this 25-mer were distinctly different from those of TGD2C, suggesting that additional sequences of TGD2 providing the proper context for this 25-mer are needed for wild type-like PtdOH binding. PMID:19416982

  11. Spectroscopic analyses and studies on respective interaction of cyanuric acid and uric acid with bovine serum albumin and melamine

    NASA Astrophysics Data System (ADS)

    Chen, Dandan; Wu, Qiong; Wang, Jun; Wang, Qi; Qiao, Heng

    2015-01-01

    In this work, fluorescence quenching was used to study the interaction of cyanuric acid (CYA) and uric acid (UA) with bovine serum albumin (BSA) at two different temperatures (283 K and 310 K). The bimolecular quenching constant (Kq), apparent quenching constant (Ksv), effective binding constant (KA) and corresponding dissociation constant (KD), binding site number (n) and binding distance (r) were calculated using the Stern-Volmer, Lineweaver-Burk, double-logarithm and overlap-integral equations. The results show that both CYA and UA clearly bind to BSA, with binding strength in the order BSA + CYA < BSA + UA. The interactions of CYA and UA with melamine (MEL) under the same conditions were then studied using similar methods. The results indicate that both CYA and UA can bind tightly to MEL. It is hoped that these results will facilitate understanding of the formation of kidney stones and gout in the body after ingestion of excess MEL.

  12. Nucleotide sequence of the luxC gene encoding fatty acid reductase of the lux operon from Photobacterium leiognathi.

    PubMed

    Lin, J W; Chao, Y F; Weng, S F

    1993-02-26

    The nucleotide sequence of the luxC gene (EMBL Accession No. 65156) encoding fatty acid reductase (FAR) of the lux operon from Photobacterium leiognathi PL741 was determined and the encoded amino acid sequence deduced. The fatty acid reductase is a component of the fatty acid reductase complex. The complex is responsible for converting fatty acid to aldehyde, which serves as the substrate in the luciferase-catalyzed bioluminescent reaction. The protein comprises 478 amino acid residues and has a calculated M(r) of 53,858. Alignment and comparison of the fatty acid reductase of P. leiognathi with that of Vibrio harveyi B392 and Vibrio fischeri ATCC 7744 show 70% and 59% amino acid residue identity, respectively.

  13. Genome Sequencing of Sulfolobus sp. A20 from Costa Rica and Comparative Analyses of the Putative Pathways of Carbon, Nitrogen, and Sulfur Metabolism in Various Sulfolobus Strains

    PubMed Central

    Dai, Xin; Wang, Haina; Zhang, Zhenfeng; Li, Kuan; Zhang, Xiaoling; Mora-López, Marielos; Jiang, Chengying; Liu, Chang; Wang, Li; Zhu, Yaxin; Hernández-Ascencio, Walter; Dong, Zhiyang; Huang, Li

    2016-01-01

    The genome of Sulfolobus sp. A20 isolated from a hot spring in Costa Rica was sequenced. This circular genome of the strain is 2,688,317 bp in size and 34.8% in G+C content, and contains 2591 open reading frames (ORFs). Strain A20 shares ~95.6% identity at the 16S rRNA gene sequence level and <30% DNA-DNA hybridization (DDH) values with the most closely related known Sulfolobus species (i.e., Sulfolobus islandicus and Sulfolobus solfataricus), suggesting that it represents a novel Sulfolobus species. Comparison of the genome of strain A20 with those of the type strains of S. solfataricus, Sulfolobus acidocaldarius, S. islandicus, and Sulfolobus tokodaii, which were isolated from geographically separated areas, identified 1801 genes conserved among all Sulfolobus species analyzed (core genes). Comparative genome analyses show that central carbon metabolism in Sulfolobus is highly conserved, and enzymes involved in the Entner-Doudoroff pathway, the tricarboxylic acid cycle and the CO2 fixation pathways are predominantly encoded by the core genes. All Sulfolobus species encode genes required for the conversion of ammonium into glutamate/glutamine. Some Sulfolobus strains have gained the ability to utilize additional nitrogen sources such as nitrate (i.e., S. islandicus strain REY15A, LAL14/1, M14.25, and M16.27) or urea (i.e., S. islandicus HEV10/4, S. tokodaii strain 7, and S. metallicus DSM 6482). The strategies for sulfur metabolism are most diverse and least understood. S. tokodaii encodes sulfur oxygenase/reductase (SOR), whereas both S. islandicus and S. solfataricus contain genes for sulfur reductase (SRE). However, neither SOR nor SRE genes exist in the genome of strain A20, raising the possibility that an unknown pathway for the utilization of elemental sulfur may be present in the strain. The ability of Sulfolobus to utilize nitrate or sulfur is encoded by a gene cluster flanked by IS elements or their remnants. 
These clusters appear to have become fixed at a

  14. Genome Sequencing of Sulfolobus sp. A20 from Costa Rica and Comparative Analyses of the Putative Pathways of Carbon, Nitrogen, and Sulfur Metabolism in Various Sulfolobus Strains.

    PubMed

    Dai, Xin; Wang, Haina; Zhang, Zhenfeng; Li, Kuan; Zhang, Xiaoling; Mora-López, Marielos; Jiang, Chengying; Liu, Chang; Wang, Li; Zhu, Yaxin; Hernández-Ascencio, Walter; Dong, Zhiyang; Huang, Li

    2016-01-01

    The genome of Sulfolobus sp. A20 isolated from a hot spring in Costa Rica was sequenced. This circular genome of the strain is 2,688,317 bp in size and 34.8% in G+C content, and contains 2591 open reading frames (ORFs). Strain A20 shares ~95.6% identity at the 16S rRNA gene sequence level and <30% DNA-DNA hybridization (DDH) values with the most closely related known Sulfolobus species (i.e., Sulfolobus islandicus and Sulfolobus solfataricus), suggesting that it represents a novel Sulfolobus species. Comparison of the genome of strain A20 with those of the type strains of S. solfataricus, Sulfolobus acidocaldarius, S. islandicus, and Sulfolobus tokodaii, which were isolated from geographically separated areas, identified 1801 genes conserved among all Sulfolobus species analyzed (core genes). Comparative genome analyses show that central carbon metabolism in Sulfolobus is highly conserved, and enzymes involved in the Entner-Doudoroff pathway, the tricarboxylic acid cycle and the CO2 fixation pathways are predominantly encoded by the core genes. All Sulfolobus species encode genes required for the conversion of ammonium into glutamate/glutamine. Some Sulfolobus strains have gained the ability to utilize additional nitrogen sources such as nitrate (i.e., S. islandicus strain REY15A, LAL14/1, M14.25, and M16.27) or urea (i.e., S. islandicus HEV10/4, S. tokodaii strain 7, and S. metallicus DSM 6482). The strategies for sulfur metabolism are most diverse and least understood. S. tokodaii encodes sulfur oxygenase/reductase (SOR), whereas both S. islandicus and S. solfataricus contain genes for sulfur reductase (SRE). However, neither SOR nor SRE genes exist in the genome of strain A20, raising the possibility that an unknown pathway for the utilization of elemental sulfur may be present in the strain. The ability of Sulfolobus to utilize nitrate or sulfur is encoded by a gene cluster flanked by IS elements or their remnants. 
These clusters appear to have become fixed at a

  15. Spent lead-acid battery recycling in China - A review and sustainable analyses on mass flow of lead.

    PubMed

    Sun, Zhi; Cao, Hongbin; Zhang, Xihua; Lin, Xiao; Zheng, Wenwen; Cao, Guoqing; Sun, Yong; Zhang, Yi

    2017-03-15

    Lead is classified as one of the top heavy metal pollutants in China. The corresponding environmental issues, especially during the management of spent lead-acid batteries, have already caused significant public awareness and concern. This research gives a brief overview of the recycling situation based on an investigation of the lead industry in China and of the development of technologies for spent lead-acid batteries. The main principles and research focuses of different technologies, including pyrometallurgy, hydrometallurgy and greener technologies, are summarized and compared. Subsequently, the circulability of lead is calculated based on analysis of the entire life cycle of the lead-acid battery. By considering different recycling schemes, the recycling situation of spent lead-acid batteries in China can be understood semi-quantitatively. According to this research, 30% of primary lead production could be shut down while lead production would still sustain continuous life-cycle operation of lead-acid batteries, provided that spent lead-acid batteries are properly managed under the current conditions of the lead industry in China. This research provides a methodology for assessing lead circulability over the whole life cycle of a specific product and aims to contribute more quantitative guidelines for efficient organization of the lead industry in China.

  16. Nucleotide sequence of the Klebsiella pneumoniae nifD gene and predicted amino acid sequence of the alpha-subunit of nitrogenase MoFe protein.

    PubMed Central

    Ioannidis, I; Buck, M

    1987-01-01

    The nucleotide sequence of the Klebsiella pneumoniae nifD gene is presented and together with the accompanying paper [Holland, Zilberstein, Zamir & Sussman (1987) Biochem. J. 247, 277-285] completes the sequence of the nifHDK genes encoding the nitrogenase polypeptides. The K. pneumoniae nifD gene encodes the 483-amino acid-residue nitrogenase alpha-subunit polypeptide of Mr 54156. The alpha-subunit has five strongly conserved cysteine residues at positions 63, 89, 155, 184 and 275, some occurring in a region showing both primary sequence and potential structural homology to the K. pneumoniae nitrogenase beta-subunit. A comparison with six other alpha-subunit amino acid sequences has been made, which indicates a number of potentially important domains within alpha-subunits. PMID:3322262

  17. Complete amino acid sequence of the A chain of human complement-classical-pathway enzyme C1r.

    PubMed Central

    Arlaud, G J; Willis, A C; Gagnon, J

    1987-01-01

    The amino acid sequence of human C1r A chain was determined, from sequence analysis performed on fragments obtained from C1r autolytic cleavage, cleavage of methionyl bonds, tryptic cleavages at arginine and lysine residues, and cleavages by staphylococcal proteinase. The polypeptide chain has an N-terminal serine residue and contains 446 amino acid residues (Mr 51,200). The sequence data allow chemical characterization of fragments alpha (positions 1-211), beta (positions 212-279) and gamma (positions 280-446) yielded from C1r autolytic cleavage, and identification of the two major cleavage sites generating these fragments. Position 150 of C1r A chain is occupied by a modified amino acid residue that, upon acid hydrolysis, yields erythro-beta-hydroxyaspartic acid, and that is located in a sequence homologous to the beta-hydroxyaspartic acid-containing regions of Factor IX, Factor X, protein C and protein Z. Sequence comparison reveals internal homology between two segments (positions 10-78 and 186-257). Two carbohydrate moieties are attached to the polypeptide chain, both via asparagine residues at positions 108 and 204. Combined with the previously determined sequence of C1r B chain [Arlaud & Gagnon (1983) Biochemistry 22, 1758-1764], these data give the complete sequence of human C1r. PMID:3036070

  18. Community Genomic and Proteomic Analyses of Chemoautotrophic Iron-Oxidizing "Leptospirillum rubarum" (Group II) and "Leptospirillum ferrodiazotrophum" (Group III) Bacteria in Acid Mine Drainage Biofilms

    SciTech Connect

    Goltsman, Daniela; Denef, Vincent; Singer, Steven; Verberkmoes, Nathan C; Lefsrud, Mark G; Mueller, Ryan; Dick, Gregory J.; Sun, Christine; Wheeler, Korin; Zelma, Adam; Baker, Brett J.; Hauser, Loren John; Land, Miriam L; Shah, Manesh B; Thelen, Michael P.; Hettich, Robert (Bob) L; Banfield, Jillian F.

    2009-01-01

    We analyzed near-complete population (composite) genomic sequences for coexisting acidophilic iron-oxidizing Leptospirillum group II and III bacteria (phylum Nitrospirae) and an extrachromosomal plasmid from a Richmond Mine, Iron Mountain, CA, acid mine drainage biofilm. Community proteomic analysis of the genomically characterized sample and two other biofilms identified 64.6% and 44.9% of the predicted proteins of Leptospirillum groups II and III, respectively, and 20% of the predicted plasmid proteins. The bacteria share 92% 16S rRNA gene sequence identity and >60% of their genes, including integrated plasmid-like regions. The extrachromosomal plasmid carries conjugation genes with detectable sequence similarity to genes in the integrated conjugative plasmid, but only those on the extrachromosomal element were identified by proteomics. Both bacterial groups have genes for community-essential functions, including carbon fixation and biosynthesis of vitamins, fatty acids, and biopolymers (including cellulose); proteomic analyses reveal these activities. Both Leptospirillum types have multiple pathways for osmotic protection. Although both are motile, signal transduction and methyl-accepting chemotaxis proteins are more abundant in Leptospirillum group III, consistent with its distribution in gradients within biofilms. Interestingly, Leptospirillum group II uses a methyl-dependent and Leptospirillum group III a methyl-independent response pathway. Although only Leptospirillum group III can fix nitrogen, these proteins were not identified by proteomics. The abundances of core proteins are similar in all communities, but the abundance levels of unique and shared proteins of unknown function vary. Some proteins unique to one organism were highly expressed and may be key to the functional and ecological differentiation of Leptospirillum groups II and III.

  19. Sequencing and bioinformatics-based analyses of the microRNA transcriptome in hepatitis B-related hepatocellular carcinoma.

    PubMed

    Mizuguchi, Yoshiaki; Mishima, Takuya; Yokomuro, Shigeki; Arima, Yasuo; Kawahigashi, Yutaka; Shigehara, Kengo; Kanda, Tomohiro; Yoshida, Hiroshi; Uchida, Eiji; Tajiri, Takashi; Takizawa, Toshihiro

    2011-01-25

    MicroRNAs (miRNAs) participate in crucial biological processes, and it is now evident that miRNA alterations are involved in the progression of human cancers. Recent studies on miRNA profiling performed with cloning suggest that sequencing is useful for the detection of novel miRNAs, modifications, and precise compositions and that miRNA expression levels calculated by clone count are reproducible. Here we focus on sequencing of miRNA to obtain a comprehensive profile and characterization of these transcriptomes as they relate to human liver. Sequencing using 454 sequencing and conventional cloning from 22 pairs of HCC and adjacent normal liver (ANL) and 3 HCC cell lines identified reliable reads of more than 314000 miRNAs from HCC and more than 268000 from ANL for registered human miRNAs. Computational bioinformatics identified 7 novel miRNAs with high conservation, 15 novel opposite miRNAs, and 3 novel antisense miRNAs. Moreover, sequencing can detect miRNA modifications including adenosine-to-inosine editing in miR-376 families. Expression profiling using clone count analysis was used to identify miRNAs that are expressed aberrantly in liver cancer including miR-122, miR-21, and miR-34a. Furthermore, sequencing-based miRNA clustering, but not individual miRNAs, identified high-risk patients with a high potential for early tumor recurrence after liver surgery (P = 0.006), and it was the only significant variable among the pathological and clinical variables (P = 0.022). We believe that the combination of sequencing and bioinformatics will accelerate the discovery of novel miRNAs and biomarkers involved in human liver cancer.

  20. Sequencing and Bioinformatics-Based Analyses of the microRNA Transcriptome in Hepatitis B–Related Hepatocellular Carcinoma

    PubMed Central

    Mizuguchi, Yoshiaki; Mishima, Takuya; Yokomuro, Shigeki; Arima, Yasuo; Kawahigashi, Yutaka; Shigehara, Kengo; Kanda, Tomohiro; Yoshida, Hiroshi; Uchida, Eiji; Tajiri, Takashi; Takizawa, Toshihiro

    2011-01-01

    MicroRNAs (miRNAs) participate in crucial biological processes, and it is now evident that miRNA alterations are involved in the progression of human cancers. Recent studies on miRNA profiling performed with cloning suggest that sequencing is useful for the detection of novel miRNAs, modifications, and precise compositions and that miRNA expression levels calculated by clone count are reproducible. Here we focus on sequencing of miRNA to obtain a comprehensive profile and characterization of these transcriptomes as they relate to human liver. Sequencing using 454 sequencing and conventional cloning from 22 pairs of HCC and adjacent normal liver (ANL) and 3 HCC cell lines identified reliable reads of more than 314000 miRNAs from HCC and more than 268000 from ANL for registered human miRNAs. Computational bioinformatics identified 7 novel miRNAs with high conservation, 15 novel opposite miRNAs, and 3 novel antisense miRNAs. Moreover, sequencing can detect miRNA modifications including adenosine-to-inosine editing in miR-376 families. Expression profiling using clone count analysis was used to identify miRNAs that are expressed aberrantly in liver cancer including miR-122, miR-21, and miR-34a. Furthermore, sequencing-based miRNA clustering, but not individual miRNAs, identified high-risk patients with a high potential for early tumor recurrence after liver surgery (P = 0.006), and it was the only significant variable among the pathological and clinical variables (P = 0.022). We believe that the combination of sequencing and bioinformatics will accelerate the discovery of novel miRNAs and biomarkers involved in human liver cancer. PMID:21283620

  1. Complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera, and comparative analyses with other grass genomes.

    PubMed

    Saski, Christopher; Lee, Seung-Bum; Fjellheim, Siri; Guda, Chittibabu; Jansen, Robert K; Luo, Hong; Tomkins, Jeffrey; Rognli, Odd Arne; Daniell, Henry; Clarke, Jihong Liu

    2007-08-01

    Comparisons of complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera to six published grass chloroplast genomes reveal that gene content and order are similar but two microstructural changes have occurred. First, the expansion of the IR at the SSC/IRa boundary that duplicates a portion of the 5' end of ndhH is restricted to the three genera of the subfamily Pooideae (Agrostis, Hordeum and Triticum). Second, a 6 bp deletion in ndhK is shared by Agrostis, Hordeum, Oryza and Triticum, and this event supports the sister relationship between the subfamilies Erhartoideae and Pooideae. Repeat analysis identified 19-37 direct and inverted repeats 30 bp or longer with a sequence identity of at least 90%. Seventeen of the 26 shared repeats are found in all the grass chloroplast genomes examined and are located in the same genes or intergenic spacer (IGS) regions. Examination of simple sequence repeats (SSRs) identified 16-21 potential polymorphic SSRs. Five IGS regions have 100% sequence identity among Zea mays, Saccharum officinarum and Sorghum bicolor, whereas no spacer regions were identical among Oryza sativa, Triticum aestivum, H. vulgare and A. stolonifera despite their close phylogenetic relationship. Alignment of EST sequences and DNA coding sequences identified six C-U conversions in both Sorghum bicolor and H. vulgare but only one in A. stolonifera. Phylogenetic trees based on DNA sequences of 61 protein-coding genes of 38 taxa using both maximum parsimony and likelihood methods provide moderate support for a sister relationship between the subfamilies Erhartoideae and Pooideae.

  2. Functional analyses of carnivorous plant-specific amino acid residues in S-like ribonucleases.

    PubMed

    Arai, Naoki; Nishimura, Emi; Kikuchi, Yo; Ohyama, Takashi

    2015-09-11

    Unlike plants with no carnivory, carnivorous plants seem to use S-like ribonucleases (RNases) as an enzyme for carnivory. Carnivorous plant-specific conserved amino acid residues are present at four positions around the conserved active site (CAS). The roles of these conserved amino acid residues in the enzymatic function were explored in the current study by preparing five recombinant variants of DA-I, the S-like RNase of Drosera adelae. The kcat and kcat/Km values of the enzymes revealed that among the four variants with a single mutation, the serine to glycine mutation at position 111 most negatively influenced the enzymatic activity. The change in the bulkiness of the amino acid residue side-chain seemed to be the major cause of the above effect. Modeling of the three dimensional (3D) structures strongly suggested that the S to G mutation at 111 greatly altered the overall enzyme conformation. The conserved four amino acid residues are likely to function in keeping the two histidine residues, which are essential for the cleavage of RNA strands, and the CAS in the most functional enzymatic conformation.

  3. Quantitative analyses of tartaric acid based on terahertz time domain spectroscopy

    NASA Astrophysics Data System (ADS)

    Cao, Binghua; Fan, Mengbao

    2010-10-01

    Terahertz waves occupy the part of the electromagnetic spectrum between the microwave and infrared regions. Quantitative analysis based on terahertz spectroscopy is very important for the application of terahertz techniques, but how to realize it is still under study. L-tartaric acid is widely used as an acidulant in beverages and other foods, such as soft drinks, wine, candy, bread and some colloidal sweetmeats. In this paper, terahertz time-domain spectroscopy is applied to quantify tartaric acid. Two methods are employed to process the terahertz spectra of samples with different tartaric acid contents. The first is linear regression combined with correlation analysis. The second is partial least squares (PLS), in which the absorption spectra in the 0.8-1.4 THz region are used to quantify the tartaric acid. To compare the performance of these two methods, their relative errors are analyzed. For this experiment, the first method does better than the second. However, the first method is suitable for the quantitative analysis of materials that have obvious terahertz absorption peaks, while for materials that have no obvious terahertz absorption peaks, the second is more appropriate.
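
    The first method in the abstract above (linear regression, scored by relative error) can be sketched as follows. This is a minimal illustration and not the authors' procedure: the calibration numbers are invented, and the names `fit_line` and `relative_error` are hypothetical.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def relative_error(predicted, actual):
    """Relative error of a predicted value against the known value."""
    return abs(predicted - actual) / actual


# Invented calibration data (assumption): tartaric acid mass fraction
# versus height of a terahertz absorption peak, arbitrary units.
content = [0.10, 0.20, 0.30, 0.40]
peak = [0.21, 0.39, 0.61, 0.79]

slope, intercept = fit_line(content, peak)

# Invert the calibration line to estimate the content of a sample whose
# true mass fraction is taken to be 0.25 and whose measured peak is 0.50.
estimate = (0.50 - intercept) / slope
print(estimate, relative_error(estimate, 0.25))
```

    A PLS model would replace the single peak height with the whole 0.8-1.4 THz absorption spectrum, which is why it suits materials without a distinct peak.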

  4. Spectroscopic analyses of the noncovalent self-assembly of cyanines upon various nucleic acid scaffolds.

    PubMed

    Achyuthan, Komandoor E; McClain, Jaime L; Zhou, Zhijun; Whitten, David G; Branch, Darren W

    2009-04-01

    We utilized self-assembly of cyanine chromophores to study the conformational changes in various types of nucleic acid scaffolds: single and double stranded DNA, linear or circular DNA and RNA. We identified a chromophore that became highly fluorescent after aggregating upon nucleic acids. Fluorescence from the aggregate was instantaneous after self-assembly. Temporal emission profiles displayed a biphasic trend demonstrating kinetic dependence for assembly and disassembly. Absorption spectra of the aggregate showed a red-shifted "shoulder" peak indicative of J-aggregate. Fluorescence from J-aggregates was also red-shifted. We utilized cyanine self-assembly to quantify various nucleic acids. The limits of detection and quantification for psiX174 DNA were 3 and 9 fmol, respectively. We similarly determined the sensitivity for various nucleic acids and established the optimum conditions for self-assembly. Collectively, the effects of methanol, salt, and full width at half maximum for cyanine fluorescence on DNA or carboxymethylamylose scaffolds, all suggested noncovalent, electrostatic, and hydrophobic forces were involved in supramolecular self-assembly. Our results facilitate a better understanding of supramolecular self-assembly.

  5. Nucleotide sequences of the Pseudomonas savastanoi indoleacetic acid genes show homology with Agrobacterium tumefaciens T-DNA

    PubMed Central

    Yamada, Tetsuji; Palm, Curtis J.; Brooks, Bob; Kosuge, Tsune

    1985-01-01

    We report the nucleotide sequences of iaaM and iaaH, the genetic determinants for, respectively, tryptophan 2-monooxygenase and indoleacetamide hydrolase, the enzymes that catalyze the conversion of L-tryptophan to indoleacetic acid in the tumor-forming bacterium Pseudomonas syringae pv. savastanoi. The sequence analysis indicates that the iaaM locus contains an open reading frame encoding 557 amino acids that would comprise a protein with a molecular weight of 61,783; the iaaH locus contains an open reading frame of 455 amino acids that would comprise a protein with a molecular weight of 48,515. Significant amino acid sequence homology was found between the predicted sequence of the tryptophan monooxygenase of P. savastanoi and the deduced product of the T-DNA tms-1 gene of the octopine-type plasmid pTiA6NC from Agrobacterium tumefaciens. Strong homology was found in the 25 amino acid sequence in the putative FAD-binding region of tryptophan monooxygenase. Homology was also found in the amino acid sequences representing the central regions of the putative products of iaaH and tms-2 T-DNA. The results suggest a strong similarity in the pathways for indoleacetic acid synthesis encoded by genes in P. savastanoi and in A. tumefaciens T-DNA. PMID:16593610

  6. Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

    PubMed Central

    Chen, Ke; Kurgan, Lukasz A; Ruan, Jishou

    2007-01-01

    Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D) protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP); the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM) and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are characterized by accuracies below 70
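
    The "frequencies of k-spaced amino acid pairs" representation named in this abstract can be illustrated with a short sketch. The abstract does not give the exact definition, so this assumes that a k-spaced pair is two residues separated by exactly k intervening residues; the function names and the example fragment are hypothetical, and no feature selection is shown.

```python
from collections import Counter


def k_spaced_pair_counts(sequence, k):
    """Count residue pairs separated by exactly k intervening residues."""
    gap = k + 1
    return Counter(
        sequence[i] + sequence[i + gap]
        for i in range(len(sequence) - gap)
    )


def k_spaced_pair_frequencies(sequence, max_k):
    """Map (k, pair) to the pair's relative frequency, for k = 0..max_k."""
    features = {}
    for k in range(max_k + 1):
        counts = k_spaced_pair_counts(sequence, k)
        total = sum(counts.values())
        for pair, n in counts.items():
            features[(k, pair)] = n / total
    return features


# Invented fragment used only to show the shape of the output.
features = k_spaced_pair_frequencies("MKVLAAGIVK", max_k=2)
print(len(features), features[(0, "AA")])
```

    With 20 amino acids each value of k yields up to 400 features, which is consistent with the abstract's use of feature selection to reach 95 features.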

  7. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (IPP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  8. Gene sequence and predicted amino acid sequence of the motA protein, a membrane-associated protein required for flagellar rotation in Escherichia coli.

    PubMed Central

    Dean, G E; Macnab, R M; Stader, J; Matsumura, P; Burks, C

    1984-01-01

    The motA and motB gene products of Escherichia coli are integral membrane proteins necessary for flagellar rotation. We determined the DNA sequence of the region containing the motA gene and its promoter. Within this sequence, there is an open reading frame of 885 nucleotides, which with high probability (98% confidence level) meets criteria for a coding sequence. The 295-residue amino acid translation product had a molecular weight of 31,974, in good agreement with the value determined experimentally by gel electrophoresis. The amino acid sequence, which was quite hydrophobic, was subjected to a theoretical analysis designed to predict membrane-spanning alpha-helical segments of integral membrane proteins; four such hydrophobic helices were predicted by this treatment. Additional amphipathic helices may also be present. A remarkable feature of the sequence is the existence of two segments of high uncompensated charge density, one positive and the other negative. Possible organization of the protein in the membrane is discussed. Asymmetry in the amino acid composition of translated DNA sequences was used to distinguish between two possible initiation codons. The use of this method as a criterion for authentication of coding regions is described briefly in an Appendix. PMID:6090403
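
    The figures quoted in this abstract are mutually consistent, as a quick arithmetic check shows. The sketch below only relates the stated numbers (885 nucleotides, 295 residues, molecular weight 31,974); whether the 885 nucleotides include a stop codon is not stated in the abstract, and the function name is hypothetical.

```python
def codons_in_orf(nucleotides):
    """Number of whole codons in an open reading frame of the given length."""
    if nucleotides % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return nucleotides // 3


residues = codons_in_orf(885)           # figures taken from the abstract
mean_residue_mass = 31974 / residues    # average mass per residue

print(residues, round(mean_residue_mass, 1))
```

    A mean of roughly 108 per residue is in the range usually seen for proteins (about 110 Da per residue is a common rule of thumb).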

  9. A Bacterial Analysis Platform: An Integrated System for Analysing Bacterial Whole Genome Sequencing Data for Clinical Diagnostics and Surveillance

    PubMed Central

    Ahrenfeldt, Johanne; Cisneros, Jose Luis Bellod; Jurtz, Vanessa; Larsen, Mette Voldby; Hasman, Henrik; Aarestrup, Frank Møller; Lund, Ole

    2016-01-01

    Recent advances in whole genome sequencing have made the technology available for routine use in microbiological laboratories. However, a major obstacle for using this technology is the availability of simple and automatic bioinformatics tools. Based on previously published and already available web-based tools we developed a single pipeline for batch uploading of whole genome sequencing data from multiple bacterial isolates. The pipeline will automatically identify the bacterial species and, if applicable, assemble the genome, identify the multilocus sequence type, plasmids, virulence genes and antimicrobial resistance genes. A short printable report for each sample will be provided and an Excel spreadsheet containing all the metadata and a summary of the results for all submitted samples can be downloaded. The pipeline was benchmarked using datasets previously used to test the individual services. The reported results enable a rapid overview of the major results, and comparing that to the previously found results showed that the platform is reliable and able to correctly predict the species and find most of the expected genes automatically. In conclusion, a combined bioinformatics platform was developed and made publicly available, providing easy-to-use automated analysis of bacterial whole genome sequencing data. The platform may be of immediate relevance as a guide for investigators using whole genome sequencing for clinical diagnostics and surveillance. The platform is freely available at: https://cge.cbs.dtu.dk/services/CGEpipeline-1.1 and it is the intention that it will continue to be expanded with new features as these become available. PMID:27327771

  10. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3

    PubMed Central

    Xiao, Jingfa; Hao, Lirui; Crowley, David E.; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  11. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    PubMed

    Wang, Xiaoyu; Chen, Meili; Xiao, Jingfa; Hao, Lirui; Crowley, David E; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.

  12. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, Heinz-Ulrich G.; Gray, Joe W.

    1995-01-01

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.

  13. Unconventional amino acid sequence of the sun anemone (Stoichactis helianthus) polypeptide neurotoxin

    SciTech Connect

    Kem, W.; Dunn, B.; Parten, B.; Pennington, M.; Price, D.

    1986-05-01

    A 5000 dalton polypeptide neurotoxin (Sh-NI) purified by G50 Sephadex, P-cellulose, and SP-Sephadex chromatography was homogeneous by isoelectric focusing. Sh-NI was highly toxic to crayfish (LD₅₀ 0.6 μg/kg) but without effect upon mice at 15,000 μg/kg (i.p. injection). The reduced, ³H-carboxymethylated toxin and its fragments were subjected to automatic Edman degradation and the resulting PTH-amino acids were identified by HPLC, back hydrolysis, and scintillation counting. Peptides resulting from proteolytic (clostripain, staphylococcal protease) and chemical (tryptophan) cleavage were sequenced. The sequence is: AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK. This sequence differs considerably from the homologous Anemonia and Anthopleura toxins; many of the identical residues (6 half-cystines, G9, P10, R13, G19, G29, W30) are probably critical for folding rather than receptor recognition. However, the Sh-NI sequence closely resembles Radioanthus macrodactylus neurotoxin III and R. paumotensis II. The authors propose that Sh-NI and related Radioanthus toxins act upon a different site on the sodium channel.
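
    The residue labels quoted in this abstract can be checked directly against the printed sequence. The sketch below assumes 1-based numbering from the N-terminus and labels of the form one-letter code plus position; the helper name `residue_matches` is hypothetical.

```python
# Sequence exactly as printed in the abstract.
SH_NI = "AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK"


def residue_matches(sequence, label):
    """Check a label such as 'G9' (glycine at 1-based position 9)."""
    expected, position = label[0], int(label[1:])
    return sequence[position - 1] == expected


conserved = ["G9", "P10", "R13", "G19", "G29", "W30"]

print(len(SH_NI))                                      # residues in the toxin
print(SH_NI.count("C"))                                # half-cystines
print(all(residue_matches(SH_NI, c) for c in conserved))
```

    The printed sequence has 48 residues and six cysteines, and every listed position carries the stated residue, so the sequence and the residue list in the abstract agree with each other.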

  14. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, H.U.G.; Gray, J.W.

    1995-06-27

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.

  15. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    NASA Astrophysics Data System (ADS)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  16. Spectroscopic analyses and studies on respective interaction of cyanuric acid and uric acid with bovine serum albumin and melamine.

    PubMed

    Chen, Dandan; Wu, Qiong; Wang, Jun; Wang, Qi; Qiao, Heng

    2015-01-25

    In this work, the fluorescence quenching was used to study the interaction of cyanuric acid (CYA) and uric acid (UA) with bovine serum albumin (BSA) at two different temperatures (283 K and 310 K). The bimolecular quenching constant (Kq), apparent quenching constant (Ksv), effective binding constant (KA) and corresponding dissociation constant (KD), binding site number (n) and binding distance (r) were calculated by adopting Stern-Volmer, Lineweaver-Burk, Double logarithm and overlap integral equations. The results show that CYA and UA are both able to obviously bind to BSA, but the binding strength order is BSA+CYA
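
    For reference, the Stern-Volmer and double-logarithm relations named in this abstract are usually written as below. These are the common textbook forms and may differ in detail from the equations the authors used. Here $F_0$ and $F$ are the fluorescence intensities without and with quencher, $[Q]$ is the quencher concentration, and $\tau_0$ (a symbol not used in the abstract) is the fluorophore lifetime without quencher.

```latex
\frac{F_0}{F} = 1 + K_{sv}[Q] = 1 + K_q \tau_0 [Q]
\qquad
\log\frac{F_0 - F}{F} = \log K_A + n \log [Q]
\qquad
K_D = \frac{1}{K_A}
```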

  17. Complete amino acid sequence of ananain and a comparison with stem bromelain and other plant cysteine proteases.

    PubMed Central

    Lee, K L; Albee, K L; Bernasconi, R J; Edmunds, T

    1997-01-01

    The amino acid sequences of ananain (EC 3.4.22.31) and stem bromelain (EC 3.4.22.32), two cysteine proteases from pineapple stem, are similar yet ananain and stem bromelain possess distinct specificities towards synthetic peptide substrates and different reactivities towards the cysteine protease inhibitors E-64 and chicken egg white cystatin. We present here the complete amino acid sequence of ananain and compare it with the reported sequences of pineapple stem bromelain, papain and chymopapain from papaya and actinidin from kiwifruit. Ananain is comprised of 216 residues with a theoretical mass of 23464 Da. This primary structure includes a sequence insert between residues 170 and 174 not present in stem bromelain or papain and a hydrophobic series of amino acids adjacent to His-157. It is possible that these sequence differences contribute to the different substrate and inhibitor specificities exhibited by ananain and stem bromelain. PMID:9355753

  18. Microbial community dynamics in bioaugmented sequencing batch reactors for bromoamine acid removal.

    PubMed

    Qu, Yuanyuan; Zhou, Jiti; Wang, Jing; Fu, Xiang; Xing, Linlin

    2005-05-01

    Sphingomonas xenophaga QYY with the ability to degrade bromoamine acid (BAA) was previously isolated from sludge samples. The enhancement of BAA removal by strain QYY in sequencing batch reactors (SBRs) was investigated in this study. The results showed that augmented SBRs exhibited stronger abilities to degrade BAA than the non-augmented control one. In order to estimate the relationship between community dynamics and function of augmented SBRs, a combined method based on fingerprints (ribosomal intergenic spacer analysis, RISA) and 16S rRNA gene sequencing was used. The results indicated that the microbial community dynamics were substantially changed, and the introduced strain QYY was persistent in the augmented systems. This study suggests that it is feasible and potentially useful to enhance BAA removal using BAA-degrading bacteria, such as S. xenophaga QYY.

  19. [Measurement of the amino acid sequence for the fusion protein FP3 with LC-MS/MS].

    PubMed

    Li, Xiang; Gao, Xiang-Dong; Tao, Lei; Pei, De-Ning; Guo, Ying; Rao, Chun-Ming; Wang, Jun-Zhi

    2012-02-01

    The amino acid sequence of the fusion protein FP3 was measured by two types of LC-MS/MS and its primary structure was confirmed. After reduction and alkylation, the protein was digested with trypsin, and the glycosyl groups of the glycopeptides were removed by PNGase F. The mixed peptides were separated by LC, and then Q-TOF and ion trap tandem mass spectrometry were used to measure the b and y fragment ions of each peptide in order to analyze the amino acid sequence of FP3. Seventy-six percent of the full amino acid sequence of FP3 was measured by LC-ESI-Q-TOF, with the remaining 24% completed by LC-ESI-Trap. Because LC-MS and tandem mass spectrometry are rapid, sensitive and accurate for measuring protein amino acid sequences, they are an important approach to the structural analysis and identification of recombinant proteins.
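
    Sequence coverage figures such as the 76% quoted above are typically computed from the positions of the identified peptides. The following is a minimal sketch under assumed inputs: the protein length and peptide spans are invented, spans are 1-based and inclusive, and `coverage_fraction` is a hypothetical name.

```python
def coverage_fraction(protein_length, peptide_spans):
    """Fraction of residues covered by at least one identified peptide."""
    covered = set()
    for start, end in peptide_spans:
        covered.update(range(start, end + 1))
    return len(covered) / protein_length


# Invented example: overlapping peptides are counted only once.
spans = [(1, 50), (40, 76)]
print(coverage_fraction(100, spans))
```

    Combining the span lists from two instruments before calling the function gives the joint coverage, which is how two complementary runs can add up to the full sequence.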

  20. NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents

    PubMed Central

    Liu, Sophia S.; Hockenberry, Adam J.; Lancichinetti, Andrea; Jewett, Michael C.

    2016-01-01

    The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a Python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems. PMID:27835644
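
    To make the idea of a random coding sequence with a fixed amino acid sequence concrete, here is a simplified sketch. It is not the NullSeq package or its API, and it does not implement the maximum-entropy GC constraint described in the abstract: it only picks one synonymous codon uniformly at random per residue (standard genetic code) and reports the resulting GC content. All names are hypothetical.

```python
import random
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Group codons by the amino acid they encode.
SYNONYMS = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def random_coding_sequence(protein, rng):
    """Pick one synonymous codon uniformly at random for each residue."""
    return "".join(rng.choice(SYNONYMS[aa]) for aa in protein)


def translate(dna):
    """Translate an in-frame DNA string back to one-letter amino acids."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_content(dna):
    """Fraction of G and C bases in the sequence."""
    return sum(base in "GC" for base in dna) / len(dna)


rng = random.Random(0)  # fixed seed so the sketch is repeatable
dna = random_coding_sequence("MKTAYW", rng)
print(dna, translate(dna), gc_content(dna))
```

    Uniform codon choice leaves the GC content uncontrolled. Meeting a GC target requires weighting codons by their GC count, which is the part the abstract attributes to the maximum-entropy method.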

  1. Isolation, amino acid sequence and biological characterization of an "aspartic-49" phospholipase A₂ from Bothrops (Rhinocerophis) ammodytoides venom.

    PubMed

    Clement, Herlinda; Costa de Oliveira, Vanessa; Zamudio, Fernando Z; Lago, Néstor R; Valdez-Cruz, Norma A; Bérnard Valle, Melisa; Hajos, Silvia E; Alagón, Alejandro; Possani, Lourival D; de Roodt, Adolfo R

    2012-12-01

    A phospholipase enzyme was separated by chromatography from the venom of the snake Bothrops (Rhinocerophis) ammodytoides and characterized. The experimentally determined molecular weight was 13,853.65 Da, and the full primary structure was determined by Edman degradation and mass spectrometry analysis. The enzyme contains 122 amino acid residues closely stabilized by 7 disulfide bridges with an isoelectric point of 6.13. Sequence comparison with other known secretory PLA2 shows that the enzyme isolated belongs to the group II, presenting an aspartic acid residue at position 48 (numbered by convention as Asp49) of the active site, and accordingly displaying enzymatic activity. The enzyme corresponds to 3% of the total mass of the venom. The enzyme is mildly toxic to mice. The intravenous LD₅₀ of this phospholipase in CD-1 mice was around 6 μg/g of mouse body weight (more exactly 117 μg/mouse of 20 g) and the minimal mortal dose (MMD) was estimated to be close to 10 μg/g. In contrast, the LD₅₀ of the venom was circa 2 μg/g mouse body weight. Toxicological analyses of the purified enzyme were performed in vitro and in vivo using experimental animals (mice and rats). The enzyme at high doses caused pulmonary congestion, intraperitoneal bleeding, inhibition of clot retraction and muscle tissue alterations with increased creatine kinase levels.
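
    The dose figures in this abstract can be related by simple arithmetic. The sketch below only restates numbers given in the abstract (117 μg per 20 g mouse, a venom LD₅₀ of about 2 μg/g); the function name is hypothetical.

```python
def dose_per_gram(total_ug, body_mass_g):
    """Convert a per-animal dose in micrograms to micrograms per gram."""
    return total_ug / body_mass_g


enzyme_ld50 = dose_per_gram(117, 20)   # per-mouse figure from the abstract
venom_ld50 = 2.0                       # approximate value from the abstract

print(enzyme_ld50)                     # close to the "around 6" stated
print(enzyme_ld50 / venom_ld50)        # how much less potent than whole venom
```

    By this arithmetic the purified enzyme is roughly three times less potent per gram than whole venom, in line with the abstract's description of it as mildly toxic.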

  2. Morphological transformation of calcite crystal growth by prismatic "acidic" polypeptide sequences.

    SciTech Connect

    Kim, I; Giocondi, J L; Orme, C A; Collino, J; Evans, J S

    2007-02-13

    Many of the interesting mechanical and materials properties of the mollusk shell are thought to stem from the prismatic calcite crystal assemblies within this composite structure. It is now evident that proteins play a major role in the formation of these assemblies. Recently, a superfamily of 7 conserved prismatic layer-specific mollusk shell proteins, Asprich, were sequenced, and the 42 AA C-terminal sequence region of this protein superfamily was found to introduce surface voids or porosities on calcite crystals in vitro. Using AFM imaging techniques, we further investigate the effect that this 42 AA domain (Fragment-2) and its constituent subdomains, DEAD-17 and Acidic-2, have on the morphology and growth kinetics of calcite dislocation hillocks. We find that Fragment-2 adsorbs on terrace surfaces and pins acute steps, accelerates then decelerates the growth of obtuse steps, forms clusters and voids on terrace surfaces, and transforms calcite hillock morphology from a rhombohedral form to a rounded one. These results mirror yet are distinct from some of the earlier findings obtained for nacreous polypeptides. The subdomains Acidic-2 and DEAD-17 were found to accelerate then decelerate obtuse steps and induce oval rather than rounded hillock morphologies. Unlike DEAD-17, Acidic-2 does form clusters on terrace surfaces and exhibits stronger obtuse velocity inhibition effects than either DEAD-17 or Fragment-2. Interestingly, a 1:1 mixture of both subdomains induces an irregular polygonal morphology to hillocks, and exhibits the highest degree of acute step pinning and obtuse step velocity inhibition. This suggests that there is some interplay between subdomains within an intra (Fragment-2) or intermolecular (1:1 mixture) context, and sequence interplay phenomena may be employed by biomineralization proteins to exert net effects on crystal growth and morphology.

  3. Targeted sequencing for high-resolution evolutionary analyses following genome duplication in salmonid fish: Proof of concept for key components of the insulin-like growth factor axis.

    PubMed

    Lappin, Fiona M; Shaw, Rebecca L; Macqueen, Daniel J

    2016-12-01

    High-throughput sequencing has revolutionised comparative and evolutionary genome biology. It has now become relatively commonplace to generate multiple genomes and/or transcriptomes to characterize the evolution of large taxonomic groups of interest. Nevertheless, such efforts may be unsuited to some research questions or remain beyond the scope of some research groups. Here we show that targeted high-throughput sequencing offers a viable alternative to study genome evolution across a vertebrate family of great scientific interest. Specifically, we exploited sequence capture and Illumina sequencing to characterize the evolution of key components from the insulin-like growth factor (IGF) signalling axis of salmonid fish at unprecedented phylogenetic resolution. The IGF axis represents a central governor of vertebrate growth and its core components were expanded by whole genome duplication in the salmonid ancestor ~95Ma. Using RNA baits synthesised to genes encoding the complete family of IGF binding proteins (IGFBP) and an IGF hormone (IGF2), we captured, sequenced and assembled orthologous and paralogous exons from species representing all ten salmonid genera. This approach generated 299 novel sequences, most as complete or near-complete protein-coding sequences. Phylogenetic analyses confirmed congruent evolutionary histories for all nineteen recognized salmonid IGFBP family members and identified novel salmonid-specific IGF2 paralogues. Moreover, we reconstructed the evolution of duplicated IGF axis paralogues across a replete salmonid phylogeny, revealing complex historic selection regimes - both ancestral to salmonids and lineage-restricted - that frequently involved asymmetric paralogue divergence under positive and/or relaxed purifying selection. Our findings add to an emerging literature highlighting diverse applications for targeted sequencing in comparative-evolutionary genomics. We also set out a viable approach to obtain large sets of nuclear genes for any

  4. Supply Chain Modeling for Fluorspar and Hydrofluoric Acid and Implications for Further Analyses

    DTIC Science & Technology

    2015-04-01

    for other materials in the FY 2015 NDS Requirements Report, potential market responses to the fluorspar and HF shortfalls have been evaluated to...Information Services Limited, Fluorspar: Global Industry Markets and Outlook, 11th ed (London: Roskill, 2013), 19, 143. 7 Thomason, Analyses for the 2015...Industry Markets and Outlook, 142–144. 9 Department of Defense, Strategic and Critical Materials 2013 Report on Stockpile Requirements (Washington, DC

  5. Lignin, cutin, amino acid and carbohydrate analyses of marine particulate organic matter

    NASA Astrophysics Data System (ADS)

    Hedges, John I.

    Our group at the University of Washington has specifically designed methods for the analysis of lignin compounds [Hedges and Ertel, 1982], cutin acids [Goñi and Hedges, 1990a], amino acids [Cowie and Hedges, in press, 1991a, b] and various carbohydrates, including aldoses [Cowie and Hedges, 1984], cyclitols [Hedges and Weliky, 1989] and uronic acids [Walters and Hedges, 1988; Bergamaschi and Hedges, in preparation], in particulate samples from aquatic environments. All of these procedures are derivatives of previous methods that we have adapted for application to complex natural mixtures and tested on a variety of sample types, such as plankton, woods, soils and sediments, for precision, accuracy and yield efficiencies. All the methods are written up in detail elsewhere and will only be summarized in the following sections. The remaining discussion, covering the various compound types in the order given above, will focus on unpublished procedural developments for each technique, special problems that are unique to each method and related tricks of the trade. Lignin analysis will be treated in most detail because it is the method with which we have had the longest and most detailed experience.

  6. Sequence selective recognition of double-stranded RNA using triple helix-forming peptide nucleic acids.

    PubMed

    Zengeya, Thomas; Gupta, Pankaj; Rozners, Eriks

    2014-01-01

    Noncoding RNAs are attractive targets for molecular recognition because of the central role they play in gene expression. Since most noncoding RNAs are in a double-helical conformation, recognition of such structures is a formidable problem. Herein, we describe a method for sequence-selective recognition of biologically relevant double-helical RNA (illustrated on ribosomal A-site RNA) using peptide nucleic acids (PNA) that form a triple helix in the major groove of RNA under physiologically relevant conditions. Protocols for PNA preparation and binding studies using isothermal titration calorimetry are described in detail.

  7. Fast computational methods for predicting protein structure from primary amino acid sequence

    DOEpatents

    Agarwal, Pratul Kumar

    2011-07-19

    The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.

  8. Fluorescence energy transfer as a probe for nucleic acid structures and sequences.

    PubMed Central

    Mergny, J L; Boutorine, A S; Garestier, T; Belloc, F; Rougée, M; Bulychev, N V; Koshkin, A A; Bourson, J; Lebedev, A V; Valeur, B

    1994-01-01

    The primary or secondary structure of single-stranded nucleic acids has been investigated with fluorescent oligonucleotides, i.e., oligonucleotides covalently linked to a fluorescent dye. Five different chromophores were used: 2-methoxy-6-chloro-9-amino-acridine, coumarin 500, fluorescein, rhodamine and ethidium. The chemical synthesis of derivatized oligonucleotides is described. Hybridization of two fluorescent oligonucleotides to adjacent nucleic acid sequences led to fluorescence excitation energy transfer between the donor and the acceptor dyes. This phenomenon was used to probe primary and secondary structures of DNA fragments and the orientation of oligodeoxynucleotides synthesized with the alpha-anomers of nucleoside units. Fluorescence energy transfer can be used to reveal the formation of hairpin structures and the translocation of genes between two chromosomes. PMID:8152922

  9. Amino acid sequence of two neurotoxins from the venom of the Egyptian black snake (Walterinnesia aegyptia).

    PubMed

    Samejima, Y; Aoki-Tomomatsu, Y; Yanagisawa, M; Mebs, D

    1997-02-01

    The venom of the Egyptian black snake Walterinnesia aegyptia contains at least three toxins, which act postsynaptically to block the neuromuscular transmission of isolated rat phrenic nerve-diaphragm and chicken biventer cervicis muscle. The complete amino acid sequence of the two toxins, W-III and W-IV, consisting of 62 amino acid residues, was elucidated by Edman degradation of fragments obtained after Staphylococcus aureus protease and prolylpeptidase digestion. Although the toxins exhibit close structural homology to other short-chain postsynaptic neurotoxins from Elapidae venoms, toxin IV is unique by having a free SH-group (cysteine) at position 16. In position 35 of W-III, which is located at the tip of the central loop, threonine is replaced by lysine, which may alter the interaction of the toxin with the acetylcholine receptor, since the toxin is seven times less lethal than toxin W-IV.

  10. Metabolic sequences of anaerobic fermentation on glucose-based feeding substrates based on correlation analyses of microbial and metabolite profiling.

    PubMed

    Date, Yasuhiro; Iikura, Tomohiro; Yamazawa, Akira; Moriya, Shigeharu; Kikuchi, Jun

    2012-12-07

    Degradation processes in various biomasses are managed by complex metabolic dynamics created by diverse and extensive interactions and competition in microbial communities and their environments. It is important to develop visualization methods to provide a bird's-eye view when characterizing the entire sequential metabolic process in an environmental ecosystem. Here, we describe an approach for the visualization of the metabolic sequences in anaerobic fermentation ecosystems, characterizing the entire metabolic dynamics using a combination of microbial community profiles and metabolic profiles. By evaluating their time-dependent variation, we found that microbial community profiles and metabolite production processes were characteristically affected by the feeding of different glucose-based substrates (glucose, starch, cellulose), although the compositions of the major microbial community and the metabolites detected were likely to be similar in all experiments. This combinatorial approach to variation in microbial communities and metabolic profiles was used successfully to visualize metabolic sequences in anaerobic fermentation ecosystems, in addition to mining candidate microbiota for cellulose degradation. Thus, this approach provides a powerful tool for visualizing and evaluating metabolic sequences within the biomass degradation process in an environmental ecosystem. This is the first report to visualize the entire metabolic dynamic in an anaerobic fermentation ecosystem as metabolic sequences.

  11. Using Synthetic Nanopores for Single-Molecule Analyses: Detecting SNPs, Trapping DNA Molecules, and the Prospects for Sequencing DNA

    ERIC Educational Resources Information Center

    Dimitrov, Valentin V.

    2009-01-01

    This work focuses on studying properties of DNA molecules and DNA-protein interactions using synthetic nanopores, and it examines the prospects of sequencing DNA using synthetic nanopores. We have developed a method for discriminating between alleles that uses a synthetic nanopore to measure the binding of a restriction enzyme to DNA. There exists…

  12. [Sequence and Structural Analyses of the Complete Genome of Bovine Papillomavirus 2 Genotype Aks-01 Strain from Skin Samples of Cows in Southern Xinjiang, China].

    PubMed

    Zhang, Wanqi; Hu, Jianjun; Yan, Shilei; Huang, Yaojie; Xu, Jianping; Huang, Zhongwu; Zheng, Maoliang; Meng, Ziyan; Li, Yuanyuan; Wang, Na; Wang, Qingqing

    2015-07-01

    To study the complete genomic sequence, genomic characteristics, and genetic variation of the bovine papillomavirus 2 genotype (BPV-2) Aks-01 strain at the molecular level, the strain was first genotyped from skin samples of cows in southern Xinjiang (China) by polymerase chain reaction with FAP59/FAP64 primers. Based on the complete genome of the BPV-2 reference strain, specific primers and sequencing primers were designed, and the complete genome of the Aks-01 strain was amplified and sequenced. Sequence analyses showed that the Aks-01 strain belonged to genotype BPV-2 and had the structural characteristics of BPV-2. The 7944-bp full-length genomic sequence of the Aks-01 strain was compiled using DNAStar™. The sequence of the Aks-01 strain had 98% similarity to the reference strain from GenBank. The Aks-01 strain was most closely related to BPV-1 and BPV-13. BPV-2, BPV-1 and BPV-13 were grouped within the genus Deltapapillomavirus. The Aks-01 strain is the first BPV-2 strain reported in southern Xinjiang.

  13. Complete genome sequence of Lactococcus lactis IO-1, a lactic acid bacterium that utilizes xylose and produces high levels of L-lactic acid.

    PubMed

    Kato, Hiroaki; Shiwa, Yuh; Oshima, Kenshiro; Machii, Miki; Araya-Kojima, Tomoko; Zendo, Takeshi; Shimizu-Kadota, Mariko; Hattori, Masahira; Sonomoto, Kenji; Yoshikawa, Hirofumi

    2012-04-01

    We report the complete genome sequence of Lactococcus lactis IO-1 (= JCM7638). It is a nondairy lactic acid bacterium that produces nisin Z, ferments xylose, and produces predominantly L-lactic acid at high xylose concentrations. From ortholog analysis with five other L. lactis strains, IO-1 was identified as L. lactis subsp. lactis.

  14. Complete genome sequence of Bacillus amyloliquefaciens LL3, which exhibits glutamic acid-independent production of poly-γ-glutamic acid.

    PubMed

    Geng, Weitao; Cao, Mingfeng; Song, Cunjiang; Xie, Hui; Liu, Li; Yang, Chao; Feng, Jun; Zhang, Wei; Jin, Yinghong; Du, Yang; Wang, Shufang

    2011-07-01

    Bacillus amyloliquefaciens is one of the most prevalent Gram-positive aerobic spore-forming bacteria with the ability to synthesize polysaccharides and polypeptides. Here, we report the complete genome sequence of B. amyloliquefaciens LL3, which was isolated from fermented food and exhibits glutamic acid-independent production of poly-γ-glutamic acid.

  15. Formation Sequences of Iron Minerals in the Acidic Alteration Products and Variation of Hydrothermal Fluid Conditions

    NASA Astrophysics Data System (ADS)

    Isobe, H.; Yoshizawa, M.

    2008-12-01

    Iron minerals play an important role in environmental issues not only on Earth but also on other terrestrial planets. Iron mineral species formed by alteration of primary minerals by surface or subsurface fluids are characterized by the temperature, acidity and redox conditions of the fluids. Various iron-bearing minerals can be seen in alteration products around fumaroles in geothermal/volcanic areas. In this study, zonal structures of iron minerals in alteration products of a geothermal area are observed to elucidate temporal and spatial variation of hydrothermal fluids. Alteration of the pyroxene-amphibole andesite of Garan-dake volcano, Oita, Japan is caused by acidic hydrothermal fluid, which leaches out elements other than Si to form cristobalite. Hand specimens with an unaltered or weakly altered core and a cristobalite crust show various sequences of layers. XRD analysis revealed that the degree of alteration is represented by the abundance of cristobalite. Intermediately altered layers are characterized by the occurrence of alunite, pyrite, kaolinite, goethite and hematite. A specimen with a reddish brown core surrounded by a cristobalite-rich white crust has brown colored layers at the boundary between the core and the crust. The reddish core is characterized by the occurrence of crystalline hematite, as shown by XRD. Another hand specimen has a light gray core, which represents reduced conditions, and a white cristobalite crust with light brown and reddish brown layers of ferric iron minerals between the core and the crust. On the other hand, hornblende crystals, a typical ferrous iron-bearing mineral of the host rock, are well preserved in some samples with strongly decolorized cristobalite-rich groundmass. Hydrothermal alteration experiments on iron-rich basaltic material show that iron mineral species depend on the acidity and temperature of the fluid. Oxidation states of the iron-bearing mineral species are strongly influenced by the acidity and redox conditions.
Variations of alteration

  16. Design, synthesis, and characterization of a protein sequencing reagent yielding amino acid derivatives with enhanced detectability by mass spectrometry.

    PubMed Central

    Aebersold, R.; Bures, E. J.; Namchuk, M.; Goghari, M. H.; Shushan, B.; Covey, T. C.

    1992-01-01

    We report the design, chemical synthesis, and structural and functional characterization of a novel reagent for protein sequence analysis by the Edman degradation, yielding amino acid derivatives rapidly detectable at high sensitivity by ion-evaporation mass spectrometry. We demonstrate that the reagent 3-[4'(ethylene-N,N,N-trimethylamino)phenyl]-2-isothiocyanate is chemically stable and shows coupling and cyclization/cleavage yields comparable to phenylisothiocyanate, the standard reagent in chemical sequence analysis, under conditions typically encountered in manual or automated sequence analysis. Amino acid derivatives generated with this reagent were detectable by ion-evaporation mass spectrometry at the subfemtomole sensitivity level at a pace of one sample per minute. Furthermore, derivatives were identified by their mass, thus permitting the rapid and highly sensitive determination of the molecular nature of modified amino acids. Derivatives of amino acids with acidic, basic, polar, or hydrophobic side chains were reproducibly detectable at comparable sensitivities. The polar nature of the reagent required covalent immobilization of polypeptides prior to automated sequence analysis. This reagent, used in automated sequence analysis, has the potential for overcoming the limitations in sensitivity, speed, and the ability to characterize modified amino acid residues inherent in the chemical sequencing methods that are currently used. PMID:1304351

  17. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production.

    PubMed

    Coulson, Thomas J D; Patten, Cheryl L

    2015-08-06

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences.

  18. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production

    PubMed Central

    Coulson, Thomas J. D.

    2015-01-01

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences. PMID:26251488

  19. Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis subsp. lactis TOMSC161, Isolated from a Nonscalded Curd Pressed Cheese

    PubMed Central

    Velly, H.; Abraham, A.-L.; Loux, V.; Delacroix-Buchet, A.; Fonseca, F.; Bouix, M.

    2014-01-01

    Lactococcus lactis is a lactic acid bacterium used in the production of many fermented foods, such as dairy products. Here, we report the genome sequence of L. lactis subsp. lactis TOMSC161, isolated from nonscalded curd pressed cheese. This genome sequence provides information in relation to dairy environment adaptation. PMID:25377704

  20. Deoxyribonucleic acid sequence of araBAD promoter mutants of Escherichia coli.

    PubMed

    Horwitz, A H; Morandi, C; Wilcox, G

    1980-05-01

    The controlling site region for the araBAD operon is defined, in part, by two classes of cis-acting constitutive mutations. The araIc mutations allow low-level constitutive expression of araBAD in the absence of the positive regulatory protein coded for by the araC gene, whereas the araXc mutations allow expression of araBAD in the absence of the cyclic adenosine monophosphate receptor protein. Six independently isolated araIc mutations and three independently isolated araXc mutations were cloned onto the plasmid pBR322 using in vitro recombinant deoxyribonucleic acid techniques and in vivo recombination between plasmid and chromosomal deoxyribonucleic acid. The location of these mutations was determined by deoxyribonucleic acid sequence analysis. All of the araIc mutations occurred at position -35 within the araBAD promoter (+1 = messenger ribonucleic acid start for araBAD) and resulted from an AT-to-GC transition. All of the araXc mutations occurred at position -10 within the araBAD promoter and resulted from a GC-to-AT transition. Models are presented to explain the mode of action of the araIc and araXc mutations.

  1. The quest for the best: The impact of different EPI sequences on the sensitivity of random effect fMRI group analyses

    PubMed Central

    Kirilina, Evgeniya; Lutti, Antoine; Poser, Benedikt A.; Blankenburg, Felix; Weiskopf, Nikolaus

    2016-01-01

    We compared the sensitivity of standard single-shot 2D echo planar imaging (EPI) to three advanced EPI sequences, i.e., 2D multi-echo EPI, 3D high resolution EPI and 3D dual-echo fast EPI in fixed effects and random effects group level fMRI analyses at 3 T. The study focused on how well the variance reduction in fixed effects analyses achieved by advanced EPI sequences translates into increased sensitivity in the random effects group level analysis. The sensitivity was estimated in a functional MRI experiment comprising an emotional learning task and a reward-based learning task in a group of 24 volunteers. Each experiment was acquired with the four different sequences. The task-related response amplitude, contrast level and respective t-value were proxies for the functional sensitivity across the brain. All three advanced EPI methods increased the sensitivity in the fixed effects analyses, but standard single-shot 2D EPI provided a comparable performance in random effects group analysis when whole brain coverage and moderate resolution are required. In this experiment inter-subject variability determined the sensitivity of the random effects analysis for most brain regions, making the impact of EPI pulse sequence improvements less relevant or even negligible for random effects analyses. An exception concerns the optimization of EPI to reduce susceptibility-related signal loss, which translates into enhanced sensitivity, e.g. in the orbitofrontal cortex for multi-echo EPI. Thus, future optimization strategies may best aim at reducing inter-subject variability for higher sensitivity in standard fMRI group studies at moderate spatial resolution. PMID:26515905

  2. Ribosomal PCR and DNA sequencing for detection and identification of bacteria: experience from 6 years of routine analyses of patient samples.

    PubMed

    Jensen, Kristine Helander; Dargis, Rimtas; Christensen, Jens Jørgen; Kemp, Michael

    2014-03-01

    The use of broad range PCR and DNA sequencing of bacterial 16S ribosomal RNA genes for routine diagnostics of bacterial infections was evaluated. Here, the results from more than 2600 analyses during a 6-year period (2003-2009) are presented. Almost half of the samples were from joints and bones, and the second most frequent origin of samples was from the central nervous system. Overall, 26% of all samples were positive for bacterial DNA and bacterial identification was obtained in 80% of the PCR-positive samples by subsequent DNA sequencing. Ambiguous species identification was noticed among non-haemolytic streptococci, especially within the mitis group. The data show that ribosomal PCR with subsequent DNA sequencing of the PCR product is a most valuable supplement to culture for identifying bacterial agents of both acute and prolonged infections. However, some bacteria, including non-haemolytic streptococci, may not be precisely identified.

  3. In the TTF-1 homeodomain the contribution of several amino acids to DNA recognition depends on the bound sequence.

    PubMed Central

    Fabbro, D; Tell, G; Leonardi, A; Pellizzari, L; Pucillo, C; Lonigro, R; Formisano, S; Damante, G

    1996-01-01

    The thyroid transcription factor-1 homeodomain (TTF-1HD) shows a peculiar DNA binding specificity, preferentially recognizing sequences containing the 5'-CAAG-3' core motif. Most other homeodomains instead recognize sites containing the 5'-TAAT-3' core motif. Here, we show that TTF-1HD efficiently recognizes another sequence, called D1, devoid of the 5'-CAAG-3' core motif. Different experimental approaches indicate that TTF-1HD contacts the D1 sequence in a manner which is different to that used to interact with sequences containing the 5'-CAAG-3' core motif. The binding activities that mutants of TTF-1HD display with the D1 sequence or with the sequence containing the 5'-CAAG-3' core motif indicate that the role of several DNA-contacting amino acids is different. In particular, during recognition of the D1 sequence, backbone-interacting amino acids not relevant in binding to sequences containing the 5'-CAAG-3' core motif play an important role. In the TTF-1HD, therefore, the contribution of several amino acids to DNA recognition depends on the bound sequence. These data indicate that although a common bonding network exists in all of the HD/DNA complexes, peculiarities important for DNA recognition may occur in single cases. PMID:8811078
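
    The two core motifs named in this abstract, 5'-CAAG-3' and 5'-TAAT-3', can be located in a DNA sequence with a simple scan of both strands. The sketch below is only an illustration of motif searching, not a model of TTF-1HD binding; the function name `find_core_motif` and the example sequence are hypothetical, and the sequence of the D1 site is not given in the abstract, so it is not represented here.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


def find_core_motif(seq, motif):
    """Return sorted (start, strand) hits of motif on either strand of seq.

    Positions are 0-based offsets on the given (top) strand. A hit on the
    bottom strand is reported where the motif's reverse complement occurs.
    """
    seq = seq.upper()
    motif = motif.upper()
    hits = []
    for strand, pattern in (("+", motif), ("-", reverse_complement(motif))):
        start = seq.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(pattern, start + 1)
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical sequence with one CAAG on each strand and no TAAT.
    example = "GGCAAGTTCTTGAA"
    print(find_core_motif(example, "CAAG"))
    print(find_core_motif(example, "TAAT"))
```

    A sequence with no hits for either motif, as the abstract reports for D1 with respect to 5'-CAAG-3', would return an empty list from this scan.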

  4. Enhanced Sampling and Overfitting Analyses in Structural Refinement of Nucleic Acids into Electron Microscopy Maps

    PubMed Central

    Vashisth, Harish; Skiniotis, Georgios; Brooks, Charles L.

    2013-01-01

    Flexible-fitting computational algorithms are often useful to interpret low resolution maps of many macromolecular complexes generated by electron microscopy (EM) imaging. One such atomistic simulation technique is molecular dynamics flexible fitting (MDFF), which has been widely applied to generate structural models of large ribonucleoprotein assemblies such as the ribosome. We have previously shown that MDFF simulations of globular proteins are sensitive to the resolution of the target EM map, and the strength of restraints used to preserve the secondary structure elements during fitting (Vashisth et al. Structure 2012, 20, 1453–1462). In this work, we aim to systematically examine the quality of structural models of various nucleic acids obtained via MDFF by varying the map resolution and the strength of structural restraints. We also demonstrate how an enhanced conformational sampling technique for proteins, temperature-accelerated molecular dynamics (TAMD), can be combined with MDFF for the structural refinement of nucleic acids in EM maps. Finally, we also demonstrate application of TAMD-assisted MDFF (TAMDFF) on an RNA/protein complex and suggest that TAMDFF is a viable strategy for enhanced conformational fitting in target maps of ribonucleoprotein complexes. PMID:23506287

  5. Multicenter quality assessment of 16S ribosomal DNA-sequencing for microbiome analyses reveals high inter-center variability.

    PubMed

    Hiergeist, Andreas; Reischl, Udo; Gessner, Andrè

    2016-08-01

    The composition of human as well as animal microbiota has increasingly gained in interest since metabolites and structural components of endogenous microorganisms fundamentally influence all aspects of host physiology. Since many of the bacteria are still unculturable, molecular techniques such as high-throughput sequencing have dramatically increased our knowledge of microbial communities. The majority of microbiome studies published thus far are based on bacterial 16S ribosomal RNA (rRNA) gene sequencing, so that they can, at least in principle, be compared to determine the role of the microbiome composition for host metabolism and physiology, developmental processes, as well as different diseases. However, differences in DNA preparation and purification, 16S rDNA PCR amplification, sequencing procedures and platforms, as well as bioinformatic analysis and quality control measures may strongly affect the microbiome composition results obtained in different laboratories. To systematically evaluate the comparability of results and identify the most influential methodological factors affecting these differences, identical human stool sample replicates spiked with quantified marker bacteria, and their subsequent DNA sequences were analyzed by nine different centers in an external quality assessment (EQA). While high intra-center reproducibility was observed in repetitive tests, significant inter-center differences of reported microbiota composition were obtained. All steps of the complex analysis workflow significantly influenced microbiome profiles, but the magnitude of variation caused by PCR primers for 16S rDNA amplification was clearly the largest. In order to advance microbiome research to a more standardized and routine medical diagnostic procedure, it is essential to establish uniform standard operating procedures throughout laboratories and to initiate regular proficiency testing.

  6. Molecular cloning, encoding sequence, and expression of vaccinia virus nucleic acid-dependent nucleoside triphosphatase gene.

    PubMed Central

    Rodriguez, J F; Kahn, J S; Esteban, M

    1986-01-01

    A rabbit poxvirus genomic library contained within the expression vector lambda gt11 was screened with polyclonal antiserum prepared against vaccinia virus nucleic acid-dependent nucleoside triphosphatase (NTPase)-I enzyme. Five positive phage clones containing from 0.72- to 2.5-kilobase-pair (kbp) inserts expressed a beta-galactosidase fusion protein that was reactive by immunoblotting with the NTPase-I antibody. Hybridization analysis allowed the location of this gene within the vaccinia HindIIID restriction fragment. From the known nucleotide sequence of the 16-kbp vaccinia HindIIID fragment, we identified a region that contains a 1896-base open reading frame coding for a 631-amino acid protein. Analysis of the complete sequence revealed a highly basic protein, with hydrophilic COOH and NH2 termini, various hydrophobic domains, and no significant homology to other known proteins. Translational studies demonstrate that NTPase-I belongs to a late class of viral genes. This protein is highly conserved among Orthopoxviruses. PMID:3025846
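
    The relationship between the 1896-base open reading frame and the 631-residue protein quoted above is a short piece of arithmetic, sketched below. The helper name `protein_length_from_orf` is hypothetical, and the sketch assumes that the quoted base count includes the stop codon, which the abstract does not state explicitly.

```python
def protein_length_from_orf(orf_bases, includes_stop=True):
    """Number of amino acids encoded by an open reading frame of orf_bases bases.

    Assumes the ORF length is a whole number of codons; if the count
    includes the terminating stop codon, that codon encodes no residue.
    """
    if orf_bases % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bases // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # 1896 bases is the ORF length quoted in the abstract.
    print(protein_length_from_orf(1896))
```

    Under that assumption the 1896 bases correspond to 632 codons, one of which is the stop codon, which matches the 631 amino acids reported.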

  7. The amino acid sequences and activities of synergistic hemolysins from Staphylococcus cohnii.

    PubMed

    Mak, Pawel; Maszewska, Agnieszka; Rozalska, Malgorzata

    2008-10-01

    Staphylococcus cohnii ssp. cohnii and S. cohnii ssp. urealyticus are coagulase-negative staphylococci that were long considered unable to cause infections. This view changed recently, when pathogenic strains of these bacteria were isolated from hospital environments, patients and medical staff. Most of the isolated strains were resistant to many antibiotics. The present work describes the isolation and characterization of several synergistic peptide hemolysins produced by these bacteria and acting as virulence factors responsible for hemolytic and cytotoxic activities. Amino acid sequences of the respective hemolysins from S. cohnii ssp. cohnii (named H1C, H2C and H3C) and S. cohnii ssp. urealyticus (H1U, H2U and H3U) were identical. Peptides H1 and H3 possessed significant amino acid homology to three synergistic hemolysins secreted by Staphylococcus lugdunensis and to a putative antibacterial peptide produced by Staphylococcus saprophyticus ssp. saprophyticus. On the other hand, hemolysin H2 had a unique sequence. All isolated peptides lysed red cells from different mammalian species and exerted a cytotoxic effect on human fibroblasts.

  8. Complete amino acid sequence of a Lolium perenne (perennial rye grass) pollen allergen, Lol p II.

    PubMed

    Ansari, A A; Shenbagamurthi, P; Marsh, D G

    1989-07-05

    The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p II was determined by automated Edman degradation of the protein and selected fragments. Cleavage of the protein by enzymatic and chemical techniques established an unambiguous sequence for the protein. Lol p II contains 97 amino acid residues, with a calculated molecular weight of 10,882. The protein lacks cysteine and glutamine and shows no evidence of glycosylation. Theoretical predictions by Fraga's (Fraga, S. (1982) Can. J. Chem. 60, 2606-2610) and Hopp and Woods' (Hopp, T. P., and Woods, K. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 3824-3828) methods indicate the presence of four hydrophilic regions, which may contribute to sequential or parts of conformational B-cell epitopes. Analysis of amphipathic regions by Berzofsky's method indicates the presence of a highly amphipathic region, which may contain, or contribute to, an Ia/T-cell epitope. This latter segment of Lol p II was found to be highly homologous with an antibody-binding segment of the major rye allergen Lol p I and may explain why immune responsiveness to both the allergens is associated with HLA-DR3.

  9. The Sequence-Specific Cellular Uptake of Spherical Nucleic Acid Nanoparticle Conjugates

    PubMed Central

    Narayan, Suguna P.; Choi, Chung Hang J.; Hao, Liangliang; Calabrese, Colin M.; Auyeung, Evelyn; Zhang, Chuan; Goor, Olga J.G.M.

    2015-01-01

    We investigated the sequence-dependent cellular uptake of spherical nucleic acid nanoparticle conjugates (SNAs). This process occurs by interaction with class A scavenger receptors (SR-A) and caveolae-mediated endocytosis. It is known that linear poly(guanine) (poly G) is a natural ligand for SR-A, and it has been proposed that interaction of poly G with SR-A is dependent on the formation of G-quadruplexes. Since G-rich oligonucleotides are known to interact strongly with SR-A, we hypothesized that SNAs with higher G contents would be able to enter cells in larger amounts than SNAs composed of other nucleotides, and as such we measured cellular internalization of SNAs as a function of constituent oligonucleotide sequence. Indeed, SNAs with enriched G content show the highest cellular uptake. Using this hypothesis, we chemically conjugated a small molecule (camptothecin) with SNAs to create drug-SNA conjugates and observed that poly G SNAs deliver the most camptothecin to cells and have the highest cytotoxicity in cancer cells. Our data elucidate important design considerations for enhancing the intracellular delivery of spherical nucleic acids. PMID:26097111

  10. Partial amino acid sequences around sulfhydryl groups of soybean beta-amylase.

    PubMed

    Nomura, K; Mikami, B; Morita, Y

    1987-08-01

    Sulfhydryl (SH) groups of soybean beta-amylase were modified with 5-(iodoaceto-amidoethyl)aminonaphthalene-1-sulfonate (IAEDANS) and the SH-containing peptides exhibiting fluorescence were purified after chymotryptic digestion of the modified enzyme. The sequence analysis of the peptides derived from the modification of all SH groups in the denatured enzyme revealed the existence of six SH groups, in contrast to five reported previously. One of them was found to have extremely low reactivity toward SH-reagents without reduction. In the native state, IAEDANS reacted with 2 mol of SH groups per mol of the enzyme (SH1 and SH2) accompanied with inactivation of the enzyme owing to the modification of SH2 located near the active site of this enzyme. The selective modification of SH2 with IAEDANS was attained after the blocking of SH1 with 5,5'-dithiobis-(2-nitrobenzoic acid). The amino acid sequences of the peptides containing SH1 and SH2 were determined to be Cys-Ala-Asn-Pro-Gln and His-Gln-Cys-Gly-Gly-Asn-Val-Gly-Asp-Ile-Val-Asn-Ile-Pro-Ile-Pro-Gln-Trp, respectively.

  11. Mitochondrial DNA sequence analyses and phylogenetic relationships among two Nigerian goat breeds and the South African Kalahari Red.

    PubMed

    Awotunde, Esther O; Bemji, Martha N; Olowofeso, Olajide; James, Ikechukwu J; Ajayi, O O; Adebambo, Ayotunde O

    2015-01-01

    The first hypervariable (HV1) region of mitochondrial DNA (mtDNA) of two popular Nigerian goat breeds, West African Dwarf (WAD) (n=35) and Red Sokoto (RS) (n=37), and one exotic breed, Kalahari Red (KR) (n=38) imported from South Africa, was sequenced to investigate sequence diversity, genetic structure, origin, and demographic history of the populations. A total of 68 polymorphic sites were found in 110 sequences that grouped into 68 haplotypes. Average haplotype and nucleotide diversities for all breeds were 0.982±0.005 and 0.02350±0.00213, respectively. Phylogenetic analysis revealed two mtDNA lineages (A and B). Lineage A was predominant and included all haplotypes from WAD and RS and 5 out of 11 haplotypes of KR goats. The remaining haplotypes (6) of KR belong to lineage B. The analysis of molecular variance revealed a high within-breed genetic variance of 82.4% and a low between-breed genetic variance of 17.6%. The three breeds clustered with Capra aegagrus as their wild ancestor. Mismatch distribution analysis showed that WAD, RS and haplogroup A have experienced population expansion events. The study has revealed very high diversity within the three breeds, which are not strongly separated from each other based on mtDNA analysis. The information obtained on the genetic structure of the breeds will be useful in planning improvement and conservation programs for the local populations.
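
    For readers unfamiliar with the haplotype diversity statistic quoted above, the following is a minimal sketch. The abstract does not say which estimator the authors used; the code assumes the commonly used unbiased form H = n/(n-1) * (1 - sum of squared haplotype frequencies). The function name and the input labels are hypothetical and are not data from the study.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Unbiased haplotype (gene) diversity for a list of haplotype labels.

    Assumed estimator: H = n / (n - 1) * (1 - sum(p_i ** 2)),
    where p_i is the sample frequency of haplotype i and n the sample size.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("at least two sequences are required")
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


# Hypothetical labels: four sequences, two haplotypes seen twice each.
print(haplotype_diversity(["h1", "h1", "h2", "h2"]))
```

    The value approaches 1 when almost every sampled sequence carries a different haplotype, which is the situation the abstract describes (68 haplotypes among 110 sequences).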

  12. De novo assembly and next-generation sequencing to analyse full-length gene variants from codon-barcoded libraries

    PubMed Central

    Cho, Namjin; Hwang, Byungjin; Yoon, Jung-ki; Park, Sangun; Lee, Joongoo; Seo, Han Na; Lee, Jeewon; Huh, Sunghoon; Chung, Jinsoo; Bang, Duhee

    2015-01-01

    Interpreting epistatic interactions is crucial for understanding evolutionary dynamics of complex genetic systems and unveiling structure and function of genetic pathways. Although high-resolution mapping of en masse variant libraries enables molecular biologists to address genotype-phenotype relationships, long-read sequencing technology remains indispensable to assess functional relationships between mutations that lie far apart. Here, we introduce JigsawSeq for multiplexed sequence identification of pooled gene variant libraries by combining a codon-based molecular barcoding strategy and de novo assembly of short-read data. We first validated JigsawSeq on small sub-pools and observed high precision and recall at various experimental settings. With extensive simulations, we then applied JigsawSeq to large-scale gene variant libraries to show that our method can be reliably scaled using next-generation sequencing. JigsawSeq may serve as a rapid screening tool for functional genomics and offer the opportunity to explore evolutionary trajectories of protein variants. PMID:26387459

  13. Authentication of Cordyceps sinensis by DNA Analyses: Comparison of ITS Sequence Analysis and RAPD-Derived Molecular Markers.

    PubMed

    Lam, Kelly Y C; Chan, Gallant K L; Xin, Gui-Zhong; Xu, Hong; Ku, Chuen-Fai; Chen, Jian-Ping; Yao, Ping; Lin, Huang-Quan; Dong, Tina T X; Tsim, Karl W K

    2015-12-15

    Cordyceps sinensis is an endoparasitic fungus widely used as a tonic and medicinal food in the practice of traditional Chinese medicine (TCM). In historical usage, Cordyceps refers specifically to the species C. sinensis. However, a number of closely related species are also named Cordyceps, and they are commonly sold as C. sinensis. Substitutes and adulterants of C. sinensis are often introduced, either intentionally or accidentally, in the herbal market, which seriously affects the therapeutic effects or even leads to life-threatening poisoning. Here, we aim to identify Cordyceps by DNA sequencing technology. Two different DNA-based approaches were compared. The internal transcribed spacer (ITS) sequences and the random amplified polymorphic DNA (RAPD)-sequence characterized amplified region (SCAR) were developed here to authenticate different species of Cordyceps. Both approaches generally enabled discrimination of C. sinensis from others. The application of the two methods, supporting each other, increases the security of identification. For better reproducibility and faster analysis, the SCAR markers derived from the RAPD results provide a new method for quick authentication of Cordyceps.

  14. Genome Sequence of Lactobacillus rhamnosus Strain CASL, an Efficient l-Lactic Acid Producer from Cheap Substrate Cassava

    PubMed Central

    Yu, Bo; Su, Fei; Wang, Limin; Zhao, Bo; Qin, Jiayang; Ma, Cuiqing; Xu, Ping; Ma, Yanhe

    2011-01-01

    Lactobacillus rhamnosus is a type of probiotic bacteria with industrial potential for l-lactic acid production. We announce the draft genome sequence of L. rhamnosus CASL (2,855,156 bp with a G+C content of 46.6%), which is an efficient producer of l-lactic acid from cheap, nonfood substrate cassava with a high production titer. PMID:22123765
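
    The G+C content quoted above is the percentage of bases in the sequence that are guanine or cytosine. A minimal sketch of that calculation follows; the function name is hypothetical, the example string is a toy sequence rather than genome data, and ignoring ambiguous bases such as N is an assumption made here, not something the abstract specifies.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C among the unambiguous bases (A, C, G, T) of a sequence."""
    counted = [base for base in sequence.upper() if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


# Toy sequence, not taken from the L. rhamnosus CASL genome.
print(gc_content_percent("ATGCGCTA"))
```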

  15. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-03-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus.

  16. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed Central

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-01-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus. PMID:3355530

  17. Determination of the complete amino acid sequence for the coat protein of brome mosaic virus by time-of-flight mass spectrometry. Evidence for mutations associated with change of propagation host.

    PubMed

    She, Y M; Haber, S; Seifers, D L; Loboda, A; Chernushevich, I; Perreault, H; Ens, W; Standing, K G

    2001-06-08

    Time-of-flight mass spectrometry (TOFMS) has been applied to determine the complete coat protein amino acid sequences of a number of distinct brome mosaic virus (BMV) isolates. Ionization was carried out by both electrospray ionization and matrix-assisted laser desorption/ionization (MALDI). After determining overall coat protein masses, the proteins were digested with trypsin or Lys-C proteinases, and the digestion products were analyzed in a MALDI QqTOF mass spectrometer. The N terminus of the coat protein was found to be acetylated in each BMV isolate analyzed. In one isolate (BMV-Valverde), the amino acid sequence was identical to that predicted from the cDNA sequence of the "type" isolate, but deviations from the predicted amino acid sequence were observed for all the other isolates analyzed. When isolates were propagated in different host taxa, modified coat protein sequences were observed in some cases, along with the original sequence. Sequencing by TOFMS may therefore provide a basis for monitoring the effects of host passaging on a virus at the molecular level. Such TOFMS-based analyses assess the complete profiles of coat protein sequences actually present in infected tissues. They are therefore not subject to the selection biases inherent in deducing such sequences from reverse-transcribed viral RNA and cloning the resulting cDNA.

  18. Nucleic acid hybridization analyses confirm the presence of a hitherto unknown morbillivirus in Mediterranean dolphins.

    PubMed

    Bolt, G; Blixenkrone-Møller, M

    1994-08-15

    In 1990 an epidemic caused by a morbillivirus was noticed among Mediterranean dolphins. RNA was extracted from the tissues of dolphins and from cell cultures infected with a corresponding dolphin morbillivirus isolate. By nucleic acid hybridization this RNA was compared to RNA extracted from animal tissue or cell cultures infected with canine distemper virus (CDV), phocine distemper virus (PDV) or measles virus (MV). The presence of morbillivirus RNA in the dolphin tissue was demonstrated. Morbillivirus N, P, M and F gene mRNAs were detected in the RNA from dolphin morbillivirus infected cells. These mRNA species seemed to be of approximately the same size as the corresponding mRNA species of CDV, PDV and MV. The results of the comparison demonstrated that the dolphin morbillivirus is genetically different from CDV, PDV and MV. No indication of a close relationship between the dolphin isolate and either CDV, PDV or MV was found.

  19. Clostridium sticklandii, a specialist in amino acid degradation: revisiting its metabolism through its genome sequence

    PubMed Central

    2010-01-01

    Background Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabolic aspects such as the Stickland reaction, coenzyme-B12- and selenium-dependent reactions of amino acids. With the goal of revisiting its carbon, nitrogen, and energy metabolism, and comparing studies with other clostridia, its genome has been sequenced and analyzed. Results C. sticklandii is one of the best biochemically studied proteolytic clostridial species. Useful additional information has been obtained from the sequencing and annotation of its genome, which is presented in this paper. Besides, experimental procedures reveal that C. sticklandii degrades amino acids in a preferential and sequential way. The organism prefers threonine, arginine, serine, cysteine, proline, and glycine, whereas glutamate, aspartate and alanine are excreted. Energy conservation is primarily obtained by substrate-level phosphorylation in fermentative pathways. The reactions catalyzed by different ferredoxin oxidoreductases and the exergonic NADH-dependent reduction of crotonyl-CoA point to a possible chemiosmotic energy conservation via the Rnf complex. C. sticklandii possesses both the F-type and V-type ATPases. The discovery of an as yet unrecognized selenoprotein in the D-proline reductase operon suggests a more detailed mechanism for NADH-dependent D-proline reduction. A rather unusual metabolic feature is the presence of genes for all the enzymes involved in two different CO2-fixation pathways: C. sticklandii harbours both the glycine synthase/glycine reductase and the Wood-Ljungdahl pathways. This unusual pathway combination has retrospectively been observed in only four other sequenced microorganisms. Conclusions Analysis of the C. sticklandii genome and

  20. Amino acid sequence of neurotoxin III of the scorpion Androctonus australis Hector.

    PubMed

    Kopeyan, C; Martinez, G; Rochat, H

    1979-03-01

    The amino acid sequence of neurotoxin III, purified from the venom of the North African scorpion Androctonus australis Hector, has been determined by Edman degradation using a liquid-phase sequencer. Carboxypeptidase A hydrolyses confirmed not only the sequence of the five last residues but also the presence of a free alpha-carboxylic group at the C-terminus. Edman degradation was conducted on the one hand with the Quadrol [N,N,N',N'-tetrakis(2-hydroxypropyl)ethylene diamine] program and S-alkylated protein before or after coupling with sulfophenylisothiocyanate (the first 34 residues were thus identified), and on the other hand on tryptic and chymotryptic peptides with a dimethylbenzylamine program (residues 1-23 and 31-34 were confirmed, and the positions of residues 35-64 were established). Neurotoxin III was found to belong to the same group of scorpion toxins active on mammals as neurotoxin I purified from the same venom (50 homologous positions exist in the two proteins).

  1. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon

    SciTech Connect

    Yu, Jinghua; Eng, J.; Yalow, R.S. (City Univ. of New York, NY)

    1990-12-01

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled pork insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report the authors describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. They demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in their immunoassay system is only a few percent of that of human insulin. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species.

  2. Purification, amino acid sequence and characterisation of kangaroo IGF-I.

    PubMed

    Yandell, C A; Francis, G L; Wheldrake, J F; Upton, Z

    1998-01-01

    Insulin-like growth factor-I (IGF-I) and IGF-II have been purified to homogeneity from kangaroo (Macropus fuliginosus) serum, thus this represents the first report of the purification, sequencing and characterisation of marsupial IGFs. N-Terminal protein sequencing reveals that there are six amino acid differences between kangaroo and human IGF-I. Kangaroo IGF-II has been partially sequenced and no differences were found between human and kangaroo IGF-II in the 53 residues identified. Thus the IGFs appear to be remarkably structurally conserved during mammalian radiation. In addition, in vitro characterisation of kangaroo IGF-I demonstrated that the functional properties of human, kangaroo and chicken IGF-I are very similar. In an assay measuring the ability of the proteins to stimulate protein synthesis in rat L6 myoblasts, all IGF-I proteins were found to be equally potent. The ability of all three proteins to compete for binding with radiolabelled human IGF-I to type-1 IGF receptors in L6 myoblasts and in Sminthopsis crassicaudata transformed lung fibroblasts, a marsupial cell line, was comparable. Furthermore, kangaroo and human IGF-I react equally in a human IGF-I RIA using a human reference standard, radiolabelled human IGF-I and a polyclonal antibody raised against recombinant human IGF-I. This study indicates that not only is the primary structure of eutherian and metatherian IGF-I conserved, but also the proteins appear to be functionally similar.

  3. Complete Genome Sequence of the Prototype Lactic Acid Bacterium Lactococcus lactis subsp. cremoris MG1363

    PubMed Central

    Wegmann, Udo; O'Connell-Motherway, Mary; Zomer, Aldert; Buist, Girbe; Shearman, Claire; Canchaya, Carlos; Ventura, Marco; Goesmann, Alexander; Gasson, Michael J.; Kuipers, Oscar P.; van Sinderen, Douwe; Kok, Jan

    2007-01-01

    Lactococcus lactis is of great importance for the nutrition of hundreds of millions of people worldwide. This paper describes the genome sequence of Lactococcus lactis subsp. cremoris MG1363, the lactococcal strain most intensively studied throughout the world. The 2,529,478-bp genome contains 81 pseudogenes and encodes 2,436 proteins. Of the 530 unique proteins, 47 belong to the COG (clusters of orthologous groups) functional category “carbohydrate metabolism and transport,” by far the largest category of novel proteins in comparison with L. lactis subsp. lactis IL1403. Nearly one-fifth of the 71 insertion elements are concentrated in a specific 56-kb region. This integration hot-spot region carries genes that are typically associated with lactococcal plasmids and a repeat sequence specifically found on plasmids and in the “lateral gene transfer hot spot” in the genome of Streptococcus thermophilus. Although the parent of L. lactis MG1363 was used to demonstrate lysogeny in Lactococcus, L. lactis MG1363 carries four remnant/satellite phages and two apparently complete prophages. The availability of the L. lactis MG1363 genome sequence will reinforce its status as the prototype among lactic acid bacteria through facilitation of further applied and fundamental research. PMID:17307855

  4. Global trophic position comparison of two dominant mesopelagic fish families (Myctophidae, Stomiidae) using amino acid nitrogen isotopic analyses.

    PubMed

    Choy, C Anela; Davison, Peter C; Drazen, Jeffrey C; Flynn, Adrian; Gier, Elizabeth J; Hoffman, Joel C; McClain-Counts, Jennifer P; Miller, Todd W; Popp, Brian N; Ross, Steve W; Sutton, Tracey T

    2012-01-01

    The δ(15)N values of organisms are commonly used across diverse ecosystems to estimate trophic position and infer trophic connectivity. We undertook a novel cross-basin comparison of trophic position in two ecologically well-characterized and different groups of dominant mid-water fish consumers using amino acid nitrogen isotope compositions. We found that trophic positions estimated from the δ(15)N values of individual amino acids are nearly uniform within both families of these fishes across five global regions despite great variability in bulk tissue δ(15)N values. Regional differences in the δ(15)N values of phenylalanine confirmed that bulk tissue δ(15)N values reflect region-specific water mass biogeochemistry controlling δ(15)N values at the base of the food web. Trophic positions calculated from amino acid isotopic analyses (AA-TP) for lanternfishes (family Myctophidae) (AA-TP ∼2.9) largely align with expectations from stomach content studies (TP ∼3.2), while AA-TPs for dragonfishes (family Stomiidae) (AA-TP ∼3.2) were lower than TPs derived from stomach content studies (TP∼4.1). We demonstrate that amino acid nitrogen isotope analysis can overcome shortcomings of bulk tissue isotope analysis across biogeochemically distinct systems to provide globally comparative information regarding marine food web structure.
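
    The abstract does not give the equation used to turn amino acid δ15N values into a trophic position. A widely used formulation, assumed here purely for illustration, compares a "trophic" amino acid (glutamic acid) with a "source" amino acid (phenylalanine): TP = (δ15N_Glu - δ15N_Phe - β) / TDF + 1, with β = 3.4 per mil and a trophic discrimination factor TDF = 7.6 per mil. The function name, the constants as defaults, and the input values below are assumptions and not data from the study.

```python
def trophic_position(d15n_glu: float, d15n_phe: float,
                     beta: float = 3.4, tdf: float = 7.6) -> float:
    """Trophic position from glutamic acid and phenylalanine d15N values (per mil).

    Assumed formulation: TP = (d15N_Glu - d15N_Phe - beta) / tdf + 1.
    beta is the Glu-Phe offset in primary producers; tdf is the per-step
    enrichment of Glu relative to Phe. Both defaults are assumptions.
    """
    if tdf <= 0:
        raise ValueError("tdf must be positive")
    return (d15n_glu - d15n_phe - beta) / tdf + 1.0


# Hypothetical values chosen only to illustrate the arithmetic.
print(round(trophic_position(22.0, 4.0), 2))
```

    Because phenylalanine changes little between diet and consumer, its δ15N tracks the base of the food web, which is why this kind of estimate can be compared across regions whose bulk tissue values differ.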

  5. Molar ratio iron: zinc and folic acid in Brazilian biscuits and snacks and test for classification using principal component analyses.

    PubMed

    Godoy, Adriana Teixeira; Rebelatto, Ana Paula; Borin-Nogueira, Alessandra; Lima-Pallone, Juliana Azevedo

    2014-06-01

    The aim of the present work was to evaluate molar ratio iron: zinc and the levels of folic acid in biscuits and snacks commercialized in Brazil, prepared with folic acid and iron fortified flours. These nutrients are important for human nutrition; however, iron can have a negative effect on zinc absorption. Molar ratio iron:zinc can indicate if there will be any problems for absorption of these nutrients. The folic acid content varied from 58 to 433 μg/100 g and iron and zinc levels varied from 2.9 to 9.4 mg/100 g and from 0.2 to 1.3 mg/100 g, respectively, for 75 analyzed samples. The average iron contents observed in the products and molar ratio iron:zinc (in average 8:1 for biscuits and 12.8:1 for snacks) could result in problems with zinc absorption. Moreover, principal component analyses (PCA) indicated low uniformity in the distribution of minerals and vitamin in the majority of the samples, mainly among brands. The results indicated that for the majority of the samples tested folic acid and iron content was higher than expected for flours and could be useful to governmental authorities in their evaluation program of flour fortification.
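
    A molar ratio differs from a simple mass ratio because each mass is first divided by the element's molar mass. The sketch below shows the conversion from contents in mg/100 g; the function name and the example contents are hypothetical (chosen within the ranges reported above, not measured samples), and the molar masses are standard atomic weights.

```python
FE_MOLAR_MASS = 55.845  # g/mol, iron
ZN_MOLAR_MASS = 65.38   # g/mol, zinc


def iron_zinc_molar_ratio(fe_mg_per_100g: float, zn_mg_per_100g: float) -> float:
    """Molar ratio Fe:Zn computed from mass contents given in the same units."""
    if zn_mg_per_100g <= 0:
        raise ValueError("zinc content must be positive")
    fe_mmol = fe_mg_per_100g / FE_MOLAR_MASS  # mg divided by g/mol gives mmol
    zn_mmol = zn_mg_per_100g / ZN_MOLAR_MASS
    return fe_mmol / zn_mmol


# Hypothetical sample: 5.0 mg Fe and 0.6 mg Zn per 100 g.
print(round(iron_zinc_molar_ratio(5.0, 0.6), 1))
```

    Since zinc has the larger molar mass, the molar ratio is always somewhat higher than the mass ratio of the same sample.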

  6. Global trophic position comparison of two dominant mesopelagic fish families (Myctophidae, Stomiidae) using amino acid nitrogen isotopic analyses

    USGS Publications Warehouse

    Choy, C. Anela; Davison, Peter C.; Drazen, Jeffrey C.; Flynn, Adrian; Gier, Elizabeth J.; Hoffman, Joel C.; McClain-Counts, Jennifer P.; Miller, Todd W.; Popp, Brian N.; Ross, Steve W.; Sutton, Tracey T.

    2012-01-01

    The δ15N values of organisms are commonly used across diverse ecosystems to estimate trophic position and infer trophic connectivity. We undertook a novel cross-basin comparison of trophic position in two ecologically well-characterized and different groups of dominant mid-water fish consumers using amino acid nitrogen isotope compositions. We found that trophic positions estimated from the δ15N values of individual amino acids are nearly uniform within both families of these fishes across five global regions despite great variability in bulk tissue δ15N values. Regional differences in the δ15N values of phenylalanine confirmed that bulk tissue δ15N values reflect region-specific water mass biogeochemistry controlling δ15N values at the base of the food web. Trophic positions calculated from amino acid isotopic analyses (AA-TP) for lanternfishes (family Myctophidae) (AA-TP ~2.9) largely align with expectations from stomach content studies (TP ~3.2), while AA-TPs for dragonfishes (family Stomiidae) (AA-TP ~3.2) were lower than TPs derived from stomach content studies (TP~4.1). We demonstrate that amino acid nitrogen isotope analysis can overcome shortcomings of bulk tissue isotope analysis across biogeochemically distinct systems to provide globally comparative information regarding marine food web structure.

  7. The ABRF Edman Sequencing Research Group 2008 Study: Investigation into Homopolymeric Amino Acid N-Terminal Sequence Tags and Their Effects on Automated Edman Degradation

    PubMed Central

    Thoma, R. S.; Smith, J. S.; Sandoval, W.; Leone, J. W.; Hunziker, P.; Hampton, B.; Linse, K. D.; Denslow, N. D.

    2009-01-01

    The Edman Sequence Research Group (ESRG) of the Association of Biomolecular Resource Facilities designs and executes interlaboratory studies investigating the use of automated Edman degradation for protein and peptide analysis. In 2008, the ESRG enlisted the help of core sequencing facilities to investigate the effects of a repeating amino acid tag at the N-terminus of a protein. Commonly, to facilitate protein purification, an affinity tag containing a polyhistidine sequence is conjugated to the N-terminus of the protein. After expression, polyhistidine-tagged protein is readily purified via chelation with an immobilized metal affinity resin. The addition of the polyhistidine tag presents unique challenges for the determination of protein identity using Edman degradation chemistry. Participating laboratories were asked to sequence one protein engineered in three configurations: with an N-terminal polyhistidine tag; with an N-terminal polyalanine tag; or with no tag. Study participants were asked to return a data file containing the uncorrected amino acid picomole yields for the first 17 cycles. Initial and repetitive yield (R.Y.) information and the amount of lag were evaluated. Information about instrumentation and sample treatment was also collected as part of the study. For this study, the majority of participating laboratories successfully called the amino acid sequence for 17 cycles for all three test proteins. In general, laboratories found it more difficult to call the sequence containing the polyhistidine tag. Lag was observed earlier and more consistently with the polyhistidine-tagged protein than the polyalanine-tagged protein. Histidine yields were significantly less than the alanine yields in the tag portion of each analysis. The polyhistidine and polyalanine protein-R.Y. calculations were found to be equivalent. These calculations showed that the nontagged portion from each protein was equivalent. The terminal histidines from the tagged portion of the protein

  8. The amino acid sequence around the active-site cysteine and histidine residues, and the buried cysteine residue in ficin.

    PubMed

    Husain, S S; Lowe, G

    1970-04-01

    Ficin that had been prepared from the latex of Ficus glabrata by salt fractionation and chromatography on carboxymethylcellulose was completely and irreversibly inhibited with 1,3-dibromo[2-(14)C]acetone and then treated with N-(4-dimethylamino-3,5-dinitrophenyl)maleimide in 6m-guanidinium chloride. After reduction and carboxymethylation of the labelled protein, it was digested with trypsin and alpha-chymotrypsin. Two radioactive peptides and two coloured peptides were isolated chromatographically and their sequences determined. The radioactive peptides revealed the amino acid sequences around the active-site cysteine and histidine residues and showed a high degree of homology with the amino acid sequence around the active-site cysteine and histidine residues in papain. The coloured peptides allowed the amino acid sequence around the buried cysteine residue in ficin to be determined.

  9. The `heavy' subunit of the photosynthetic reaction centre from Rhodopseudomonas viridis: isolation of the gene, nucleotide and amino acid sequence

    PubMed Central

    Michel, H.; Weyer, K. A.; Gruenberg, H.; Lottspeich, F.

    1985-01-01

    The gene coding for the `heavy' subunit of the photosynthetic reaction centre from Rhodopseudomonas viridis was isolated in an expression vector. Expression of the heavy subunit in Escherichia coli was detected with antibodies raised against crystalline reaction centres. The entire subunit, and not a fusion protein, was expressed in E. coli. The protein coding region of the gene was sequenced and the amino acid sequence derived. Part of the amino acid sequence was confirmed by chemical sequence analysis of the protein. The heavy subunit consists of 258 amino acids and its mol. wt. is 28 345. It possesses one membrane-spanning α-helical segment, as was revealed by the concomitant X-ray structure analysis. PMID:16453623

  10. Purification, amino acid sequence and immunological characterization of Ole e 6, a cysteine-enriched allergen from olive tree pollen.

    PubMed

    Batanero, E; Ledesma, A; Villalba, M; Rodríguez, R

    1997-06-30

    The Ole e 6 allergen from olive tree pollen has been isolated by combining gel permeation and reverse-phase chromatographies. It is a single and highly acidic (pI 4.2) polypeptide chain protein. Its NH2-terminal amino acid sequence has been determined by Edman degradation. Total RNA from the olive tree pollen was isolated, and a specific cDNA was amplified by the polymerase chain reaction using a degenerate oligonucleotide primer designed according to the NH2-terminal sequence of the protein. The nucleotide sequencing of the cDNA rendered an open reading frame encoding a 50 amino acid polypeptide chain, in which two sets of the sequential motif Cys-X3-Cys-X3-Cys are present. No sequence similarity has been found between this protein and other previously described polypeptides.
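
    The sequential motif Cys-X3-Cys-X3-Cys mentioned above can be located in a protein sequence with a regular expression. The following is a minimal sketch, not the authors' method: the function name and the toy sequence are hypothetical (the toy string is made up and is not the Ole e 6 sequence), and X is assumed to mean any single residue.

```python
import re

# Cys-X3-Cys-X3-Cys: cysteine, any three residues, cysteine, any three residues, cysteine.
# The pattern sits inside a lookahead so that overlapping occurrences are also reported.
CYS_MOTIF = re.compile(r"(?=(C.{3}C.{3}C))")


def find_cys_motif(protein):
    """Return (0-based start, 9-residue segment) for every motif occurrence."""
    return [(m.start(), m.group(1)) for m in CYS_MOTIF.finditer(protein.upper())]


# Made-up toy sequence for illustration only (NOT the Ole e 6 sequence).
toy = "MKCAAACGGGCDDDDDCEEECFFFCGG"
hits = find_cys_motif(toy)
for start, segment in hits:
    print(start, segment)
```

    The lookahead is a design choice: a plain pattern would skip a second motif that begins inside the first one, which matters for cysteine-rich sequences.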

  11. Nucleotide and derived amino acid sequences of the major porin of Comamonas acidovorans and comparison of porin primary structures.

    PubMed Central

    Gerbl-Rieger, S; Peters, J; Kellermann, J; Lottspeich, F; Baumeister, W

    1991-01-01

    The DNA sequence of the gene which codes for the major outer membrane porin (Omp32) of Comamonas acidovorans has been determined. The structural gene encodes a precursor consisting of 351 amino acid residues with a signal peptide of 19 amino acid residues. Comparisons with amino acid sequences of outer membrane proteins and porins from several other members of the class Proteobacteria and of the Chlamydia trachomatis porin and the Neurospora crassa mitochondrial porin revealed a motif of eight regions of local homology. The results of this analysis are discussed with regard to common structural features of porins. PMID:1848840

  12. The evolution of proteins from random amino acid sequences: II. Evidence from the statistical distributions of the lengths of modern protein sequences.

    PubMed

    White, S H

    1994-04-01

    This paper continues an examination of the hypothesis that modern proteins evolved from random heteropeptide sequences. In support of the hypothesis, White and Jacobs (1993, J Mol Evol 36:79-95) have shown that any sequence chosen randomly from a large collection of nonhomologous proteins has a 90% or better chance of having a lengthwise distribution of amino acids that is indistinguishable from the random expectation regardless of amino acid type. The goal of the present study was to investigate the possibility that the random-origin hypothesis could explain the lengths of modern protein sequences without invoking specific mechanisms such as gene duplication or exon splicing. The sets of sequences examined were taken from the 1989 PIR database and consisted of 1,792 "super-family" proteins selected to have little sequence identity, 623 E. coli sequences, and 398 human sequences. The length distributions of the proteins could be described with high significance by either of two closely related probability density functions: The gamma distribution with parameter 2 or the distribution for the sum of two exponential random independent variables. A simple theory for the distributions was developed which assumes that (1) protoprotein sequences had exponentially distributed random independent lengths, (2) the length dependence of protein stability determined which of these protoproteins could fold into compact primitive proteins and thereby attain the potential for biochemical activity, (3) the useful protein sequences were preserved by the primitive genome, and (4) the resulting distribution of sequence lengths is reflected by modern proteins. The theory successfully predicts the two observed distributions which can be distinguished by the functional form of the dependence of protein stability on length. The theory leads to three interesting conclusions. First, it predicts that a tetra-nucleotide was the signal for primitive translation termination. 
This prediction is

  13. Relationships in the Caryophyllales as suggested by phylogenetic analyses of partial chloroplast DNA ORF2280 homolog sequences.

    PubMed

    Downie, S; Katz-Downie, D; Cho, K

    1997-02-01

    Phylogenetic relationships within the angiosperm order Caryophyllales were investigated by comparative sequencing of two portions of the highly conserved inverted repeat (totaling some 1100 base pairs) coinciding with the region occupied by ORF2280 in Nicotiana, the largest gene in the plastid genomes of most land plants. Data were obtained for 33 species in 11 families within the order and for one species each of Plumbaginaceae, Polygonaceae, and Nepenthaceae. These data, when analyzed along with previously published ORF (open reading frame) sequences from Nicotiana, Spinacia, Epifagus, and Pelargonium using parsimony, neighbor-joining, and maximum likelihood methods, reveal that: (1) Amaranthus, Celosia, and Froelichia (all Amaranthaceae) do not comprise a monophyletic group; (2) Amaranthus may be nested within a paraphyletic Chenopodiaceae; (3) Sarcobatus (Chenopodiaceae) is allied with Nyctaginaceae + Phytolaccaceae (the latter family excluding Stegnosperma but including Petiveria); and (4) Caryophyllaceae (with Corrigiola basal within the clade) are sister group to Chenopodiaceae + Amaranthaceae. Basal relations within the order remain obscure. Sequence divergence values in pairwise comparisons across all Caryophyllales taxa ranged from 0.1 to 5% of nucleotides. However, despite these low values, 23 insertion and deletion events were apparent, of which five were informative phylogenetically and bolstered several of the relationships listed above. A polymerase chain reaction (PCR) survey for ORF homolog length variants in representatives from 70 additional angiosperm families revealed major deletions, of 100 to 1400 base pairs, in 19 of these families. Although the ORF is located within the mutationally retarded inverted repeat region of most angiosperm chloroplast DNAs, this gene appears particularly prone to length mutation.

  14. The genetic diversity of genus Bacillus and the related genera revealed by 16S rRNA gene sequences and ARDRA analyses isolated from geothermal regions of Turkey

    PubMed Central

    Cihan, Arzu Coleri; Tekin, Nilgun; Ozcan, Birgul; Cokmus, Cumhur

    2012-01-01

    A total of 115 previously isolated endospore-forming bacilli were grouped according to their temperature requirements for growth: the thermophiles (74%), the facultative thermophiles (14%) and the mesophiles (12%). These isolates were subjected to 16S rRNA gene sequence analyses, and they clustered among 7 genera: Anoxybacillus, Aeribacillus, Bacillus, Brevibacillus, Geobacillus, Paenibacillus, and Thermoactinomyces. Of these bacilli, only the thirty-two isolates belonging to the genera Bacillus (16), Brevibacillus (13), Paenibacillus (1) and Thermoactinomyces (2) were selected and presented in this paper. The comparative sequence analyses revealed that the similarity values were 91.4–100%, 91.8–99.2%, 92.6–99.8% and 90.7–99.8% between the isolates and the related type strains from these four genera, respectively. Twenty-nine of them were found to be related to the validly published type strains. The most abundant species was B. thermoruber with 9 isolates, followed by B. pumilus (6), B. licheniformis (3), B. subtilis (3), B. agri (3), B. smithii (2), T. vulgaris (2) and finally P. barengoltzii (1). In addition, isolates A391a, B51a and D295 were proposed as novel species, as their 16S rRNA gene sequences displayed similarities ≤ 97% to their closely related type strains. The AluI-, HaeIII- and TaqI-ARDRA results were in congruence with the 16S rRNA gene sequence analyses. The ARDRA results allowed us to differentiate these isolates, and their discriminative restriction fragments could be determined. Some of their phenotypic characters and their amylase, chitinase and protease production were also studied, and isolates producing biotechnologically valuable enzymes were identified for use in further studies. PMID:24031834

  15. Fragmentation Characteristics of Deprotonated N-linked Glycopeptides: Influences of Amino Acid Composition and Sequence

    NASA Astrophysics Data System (ADS)

    Nishikaze, Takashi; Kawabata, Shin-ichirou; Tanaka, Koichi

    2014-06-01

    Glycopeptide structural analysis using tandem mass spectrometry is becoming a common approach for elucidating site-specific N-glycosylation. The analysis is generally performed in positive-ion mode. Therefore, fragmentation of protonated glycopeptides has been extensively investigated; however, few studies are available on deprotonated glycopeptides, despite the usefulness of negative-ion mode analysis in detecting glycopeptide signals. Here, large sets of glycopeptides derived from well-characterized glycoproteins were investigated to understand the fragmentation behavior of deprotonated N-linked glycopeptides under low-energy collision-induced dissociation (CID) conditions. The fragment ion species were found to be significantly variable depending on their amino acid sequence and could be classified into three types: (i) glycan fragment ions, (ii) glycan-lost fragment ions and their secondary cleavage products, and (iii) fragment ions with intact glycan moiety. The CID spectra of glycopeptides having a short peptide sequence were dominated by type (i) glycan fragments (e.g., 2,4AR, 2,4AR-1, D, and E ions). These fragments define detailed structural features of the glycan moiety such as branching. For glycopeptides with medium or long peptide sequences, the major fragments were type (ii) ions (e.g., [peptide + 0,2X0-H]- and [peptide-NH3-H]-). The appearance of type (iii) ions strongly depended on the peptide sequence, and especially on the presence of Asp, Asn, and Glu. When a glycosylated Asn is located on the C-terminus, an interesting fragment having an Asn residue with intact glycan moiety, [glycan + Asn-36]-, was abundantly formed. Observed fragments are reasonably explained by a combination of existing fragmentation rules suggested for N-glycans and peptides.

  16. An amino acid sequence motif sufficient for subnuclear localization of an arginine/serine-rich splicing factor.

    PubMed

    Hedley, M L; Amrein, H; Maniatis, T

    1995-12-05

    We have identified an amino acid sequence in the Drosophila Transformer (Tra) protein that is capable of directing a heterologous protein to nuclear speckles, regions of the nucleus previously shown to contain high concentrations of spliceosomal small nuclear RNAs and splicing factors. This sequence contains a nucleoplasmin-like bipartite nuclear localization signal (NLS) and a repeating arginine/serine (RS) dipeptide sequence adjacent to a short stretch of basic amino acids. Sequence comparisons from a number of other splicing factors that colocalize to nuclear speckles reveal the presence of one or more copies of this motif. We propose a two-step subnuclear localization mechanism for splicing factors. The first step is transport across the nuclear envelope via the nucleoplasmin-like NLS, while the second step is association with components in the speckled domain via the RS dipeptide sequence.

  17. Purification and partial amino acid sequence of the chloroplast cytochrome b-559.

    PubMed

    Widger, W R; Cramer, W A; Hermodson, M; Meyer, D; Gullifor, M

    1984-03-25

    The hydrophobic cytochrome b-559, purified from unstacked, ethanol-washed spinach thylakoid membranes, using extraction with 2% Triton X-100 in 4 M urea and three chromatographic steps in the presence of protease inhibitors, has a dominant band on sodium dodecyl sulfate-urea gels corresponding to Mr = 10,000. The yield of this preparation is 30-50% (5-10 mg) starting with 600 mg of chlorophyll. The heme content yields a calculated molecular weight of no more than 17,500/heme, and perhaps somewhat smaller after correction for impurities. The Mr = 10,000 band is stained by the tetramethylbenzidine-H2O2 heme reagent on lithium dodecyl sulfate gels run at 0 degrees C. The Mr = 10,000 protein, further separated by high performance liquid chromatography, contains a unique NH2 terminus that is not blocked, and the amino acid sequence for the first 27 residues is NH2-Ser-Gly-Ser-Thr-Gly-Glu-Arg-Ser-Phe-Ala-Asp-Ile-Ile-Thr-Ser-Ile-Arg-Tyr-Trp-Val-Ile-X-Ser-Ile-Thr-Ile-Pro. . . COOH. Approximately 55% of the amino acids are hydrophobic, based on amino acid analysis of the Mr = 10,000 peptide, which also indicated the presence of at least one histidine. Only one cytochrome b-559 component could be identified, whose yield indicated that it arises from a single b-559 protein in chloroplasts corresponding to the in situ high potential cytochrome of the chloroplast photosystem II.

  18. Molecular phylogeography of the brown bear (Ursus arctos) in Northeastern Asia based on analyses of complete mitochondrial DNA sequences.

    PubMed

    Hirata, Daisuke; Mano, Tsutomu; Abramov, Alexei V; Baryshnikov, Gennady F; Kosintsev, Pavel A; Vorobiev, Alexandr A; Raichev, Evgeny G; Tsunoda, Hiroshi; Kaneko, Yayoi; Murata, Koichi; Fukui, Daisuke; Masuda, Ryuichi

    2013-07-01

    To further elucidate the migration history of the brown bears (Ursus arctos) on Hokkaido Island, Japan, we analyzed the complete mitochondrial DNA (mtDNA) sequences of 35 brown bears from Hokkaido, the southern Kuril Islands (Etorofu and Kunashiri), Sakhalin Island, and the Eurasian Continent (continental Russia, Bulgaria, and Tibet), and those of four polar bears. Based on these sequences, we reconstructed the maternal phylogeny of the brown bear and estimated divergence times to investigate the timing of brown bear migrations, especially in northeastern Eurasia. Our gene tree showed the mtDNA haplotypes of all 73 brown and polar bears to be divided into eight divergent lineages. The brown bear on Hokkaido was divided into three lineages (central, eastern, and southern). The Sakhalin brown bear grouped with eastern European and western Alaskan brown bears. Etorofu and Kunashiri brown bears were closely related to eastern Hokkaido brown bears and could have diverged from the eastern Hokkaido lineage after formation of the channel between Hokkaido and the southern Kuril Islands. Tibetan brown bears diverged early in the eastern lineage. Southern Hokkaido brown bears were closely related to North American brown bears.

  19. Cultural studies coupled with DNA based sequence analyses and its implication on pigmentation as a phylogenetic marker in Pestalotiopsis taxonomy.

    PubMed

    Liu, Ai-Rong; Chen, Shuang-Chen; Wu, Shang-Ying; Xu, Tong; Guo, Liang-Dong; Jeewon, Rajesh; Wei, Ji-Guang

    2010-11-01

    Previous phylogenetic studies based on DNA sequence data have partially resolved taxonomic relationships among Pestalotiopsis species. There are still some morphological characters whose phylogenetic significance has not been assessed properly due to limited taxon sampling, in particular the degree of pigmentation of median cells. In this study, the stability of pigmentation of median cells of conidia in Pestalotiopsis species was evaluated in subculture, and a molecular phylogenetic analysis was conducted on 45 strains belonging to 26 species in order to reappraise the pigmentation of median cells for its significance in the taxonomy of Pestalotiopsis. Phylogenetic relationships were inferred from nucleotide sequences in ITS regions (ITS1, 5.8S and ITS2) and β-tubulin 2 gene (tub2). The results showed that pigmentation of median cells was stable and it could be a key character in the taxonomy of Pestalotiopsis species. Instead of "concolorous" and "versicolor" proposed by Steyaert (1949), "brown to olivaceous" and "umber to fuliginous" are described and proposed in this paper.

  20. Sequence and Copy Number Analyses of HEXB Gene in Patients Affected by Sandhoff Disease: Functional Characterization of 9 Novel Sequence Variants

    PubMed Central

    Zampieri, Stefania; Cattarossi, Silvia; Oller Ramirez, Ana Maria; Rosano, Camillo; Lourenco, Charles Marques; Passon, Nadia; Moroni, Isabella; Uziel, Graziella; Pettinari, Antonella; Stanzial, Franco; de Kremer, Raquel Dodelson; Azar, Nydia Beatriz; Hazan, Filiz; Filocamo, Mirella; Bembi, Bruno; Dardis, Andrea

    2012-01-01

    Sandhoff disease (SD) is a lysosomal disorder caused by mutations in the HEXB gene. To date, 43 mutations of HEXB have been described, including 3 large deletions. Here, we have characterized 14 unrelated SD patients and developed a Multiplex Ligation-dependent Probe Amplification (MLPA) assay to investigate the presence of large HEXB deletions. Overall, we identified 16 alleles, 9 of which were novel, including 4 sequence variations leading to amino acid changes [c.626C>T (p.T209I), c.634C>A (p.H212N), c.926G>T (p.C309F), c.1451G>A (p.G484E)], 3 intronic mutations (c.1082+5G>A, c.1242+1G>A, c.1169+5G>A), 1 nonsense mutation c.146C>A (p.S49X) and 1 small in-frame deletion c.1260_1265delAGTTGA (p.V421_E422del). Using the new MLPA assay, 2 previously described deletions were identified. In vitro expression studies showed that proteins bearing amino acid changes p.T209I and p.G484E presented a very low or absent activity, while proteins bearing the p.H212N and p.C309F changes retained a significant residual activity. The detrimental effect of the 3 novel intronic mutations on the HEXB mRNA processing was demonstrated using a minigene assay. Unprecedentedly, minigene studies revealed the presence of a novel alternative spliced HEXB mRNA variant also present in normal cells. In conclusion, we provided new insights into the molecular basis of SD and validated an MLPA assay for detecting large HEXB deletions. PMID:22848519
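
    The paired cDNA and protein changes listed above can be cross-checked arithmetically: with coding positions numbered from the A of the start codon, nucleotide n falls in codon (n + 2) // 3. The sketch below is an illustrative sanity check only, not part of the study; the function names are hypothetical, and the regular expressions cover only simple single-nucleotide substitutions written with one-letter amino acid codes.

```python
import re

# Substitution pairs quoted in the abstract: (cDNA change, protein change).
variants = [
    ("c.626C>T", "p.T209I"),
    ("c.634C>A", "p.H212N"),
    ("c.926G>T", "p.C309F"),
    ("c.1451G>A", "p.G484E"),
    ("c.146C>A", "p.S49X"),
]

CDNA = re.compile(r"c\.(\d+)([ACGT])>([ACGT])")
PROT = re.compile(r"p\.([A-Z])(\d+)([A-Z])")


def codon_number(cdna_pos):
    """Codon index (1-based) containing a 1-based coding-sequence position."""
    return (cdna_pos + 2) // 3


def is_consistent(cdna, protein):
    """True if the cDNA position lies in the codon named by the protein change."""
    c = CDNA.fullmatch(cdna)
    p = PROT.fullmatch(protein)
    if c is None or p is None:
        raise ValueError("unsupported notation: %s / %s" % (cdna, protein))
    return codon_number(int(c.group(1))) == int(p.group(2))


results = {cdna: is_consistent(cdna, prot) for cdna, prot in variants}
for cdna, ok in results.items():
    print(cdna, "consistent" if ok else "MISMATCH")
```

    Intronic changes such as c.1082+5G>A and the in-frame deletion are deliberately left out, since they do not map to a single codon in this simple way.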

  1. Microbial Response to Soil Liming of Damaged Ecosystems Revealed by Pyrosequencing and Phospholipid Fatty Acid Analyses

    PubMed Central

    Narendrula-Kotha, Ramya; Nkongolo, Kabwe K.

    2017-01-01

    Aims To assess the effects of dolomitic limestone applications on soil microbial communities’ dynamics and bacterial and fungal biomass, relative abundance, and diversity in metal reclaimed regions. Methods and Results The study was conducted in reclaimed mining sites and metal uncontaminated areas. The limestone applications were performed over 35 years ago. Total microbial biomass was determined by phospholipid fatty acid analysis. Bacterial and fungal relative abundance and diversity were assessed using 454 pyrosequencing. There was a significant increase of total microbial biomass in limed sites (342 ng/g) compared to unlimed areas (149 ng/g). Chao1 estimates followed the same trend. But the total number of OTUs (Operational Taxonomic Units) in limed (463 OTUs) and unlimed (473 OTUs) soil samples for bacteria were similar. For fungi, OTUs were 96 and 81 for limed and unlimed soil samples, respectively. Likewise, Simpson and Shannon diversity indices revealed no significant differences between limed and unlimed sites. Bacterial and fungal groups specific to either limed or unlimed sites were identified. Five major bacterial phyla including Actinobacteria, Acidobacteria, Chloroflexi, Firmicutes, and Proteobacteria were found. The latter was the most prevalent phylum in all the samples with a relative abundance of 50%. Bradyrhizobiaceae family with 12 genera including the nitrogen fixing Bradyrhizobium genus was more abundant in limed sites compared to unlimed areas. For fungi, Ascomycota was the most predominant phylum in unlimed soils (46%) while Basidiomycota phylum represented 86% of all fungi in the limed areas. Conclusion Detailed analysis of the data revealed that although soil liming increases significantly the amount of microbial biomass, the level of species diversity remains statistically unchanged even though the microbial compositions of the damaged and restored sites are different. Significance and Impact of the study Soil liming still have a significant
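
    The Chao1, Shannon and Simpson measures named in this abstract are all computed from a vector of per-OTU read counts. The following is a minimal sketch under stated assumptions: the abstract does not say which variants were used, so the classic Chao1 estimator (with the usual fallback when no doubletons are present), the natural-log Shannon index and the Gini-Simpson form (1 minus the sum of squared proportions) are assumed here, and the counts are invented for illustration.

```python
import math


def shannon(counts):
    """Shannon index H' = -sum(p * ln p) over OTUs with non-zero counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def gini_simpson(counts):
    """Gini-Simpson index: 1 - sum(p^2)."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)


def chao1(counts):
    """Classic Chao1 richness estimate from singleton and doubleton counts."""
    s_obs = sum(1 for c in counts if c > 0)
    f1 = sum(1 for c in counts if c == 1)  # OTUs seen exactly once
    f2 = sum(1 for c in counts if c == 2)  # OTUs seen exactly twice
    if f2 > 0:
        return s_obs + f1 * f1 / (2 * f2)
    # Fallback form used when there are no doubletons.
    return s_obs + f1 * (f1 - 1) / 2


# Invented per-OTU read counts, for illustration only.
toy_counts = [5, 3, 1, 1, 2, 2]
print(shannon(toy_counts), gini_simpson(toy_counts), chao1(toy_counts))
```

    Because Chao1 depends only on rare OTUs while Shannon and Simpson weight abundant ones, the three measures can move differently between samples with similar OTU totals.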

  2. Promoter analyses and transcriptional profiling of eggplant polyphenol oxidase 1 gene (SmePPO1) reveal differential response to exogenous methyl jasmonate and salicylic acid.

    PubMed

    Shetty, Santoshkumar M; Chandrashekar, Arun; Venkatesh, Yeldur P

    2012-05-01

    The transcriptional regulation of multigenic eggplant (Solanum melongena) polyphenol oxidase genes (SmePPO) is orchestrated by their corresponding promoters which mediate developmentally regulated expression in response to myriad biotic and abiotic factors. However, information on structural features of SmePPO promoters and modulation of their expression by plant defense signals are lacking. In the present study, SmePPOPROMOTERs were cloned by genome walking, and their transcription start sites (TSS) were determined by RLM-RACE. Extensive sequence analyses revealed the presence of evolutionarily conserved and over-represented putative cis-acting elements involved in light-regulated transcription, biosynthetic pathways (phenylpropanoid/flavonoid), hormone signaling (abscisic acid, gibberellic acid, jasmonate and salicylate), elicitor and stress responses (cold/dehydration responses), sugar metabolism and plant defense signaling (W-BOX/WRKY) that are common to SmePPOPROMOTER1 and 2. The TSS for SmePPO genes are located 9-15bp upstream of ATG with variable lengths of 5' untranslated regions. Transcriptional profiling of SmePPOs in eggplant seedlings has indicated differential response to methyl jasmonate (MeJA) or salicylic acid (SA) treatment. In planta, while MeJA elicited expression of all the six SmePPOs, SA was only able to induce the expression of SmePPO4-6. Interestingly, in dual treatment, SA considerably repressed the MeJA-induced expression of SmePPOs. Functional dissection of SmePPOPROMOTER1 by deletion analyses using Agrobacterium-mediated transient expression in tobacco leaves has shown that MeJA enhances the SmePPOPROMOTER1-β-glucuronidase (GUS) expression in vivo, while SA does not. Histochemical and quantitative GUS assays have also indicated the negative effect of SA on MeJA-induced expression of SmePPOPROMOTER1. 
By combining in silico analyses, transcriptional profiling and expression of SmePPOPROMOTER1-GUS fusions, the role of SA on the modulation

  3. Complete mitochondrial genome sequences of three bats species and whole genome mitochondrial analyses reveal patterns of codon bias and lend support to a basal split in Chiroptera.

    PubMed

    Meganathan, P R; Pagan, Heidi J T; McCulloch, Eve S; Stevens, Richard D; Ray, David A

    2012-01-15

    Order Chiroptera is a unique group of mammals whose members have attained self-powered flight as their main mode of locomotion. Much speculation persists regarding bat evolution; however, lack of sufficient molecular data hampers evolutionary and conservation studies. Of ~1200 species, complete mitochondrial genome sequences are available for only eleven. Additional sequences should be generated if we are to resolve many questions concerning these fascinating mammals. Herein, we describe the complete mitochondrial genomes of three bats: Corynorhinus rafinesquii, Lasiurus borealis and Artibeus lituratus. We also compare the currently available mitochondrial genomes and analyze codon usage in Chiroptera. C. rafinesquii, L. borealis and A. lituratus mitochondrial genomes are 16438 bp, 17048 bp and 16709 bp, respectively. Genome organization and gene arrangements are similar to other bats. Phylogenetic analyses using complete mitochondrial genome sequences support previously established phylogenetic relationships and suggest utility in future studies focusing on the evolutionary aspects of these species. Comprehensive analyses of available bat mitochondrial genomes reveal distinct nucleotide patterns and synonymous codon preferences corresponding to different chiropteran families. These patterns suggest that mutational and selection forces are acting to different extents within Chiroptera and shape their mitochondrial genomes.

  4. Taxonomic relationships among Turkish water frogs as revealed by phylogenetic analyses using mtDNA gene sequences.

    PubMed

    Bülbül, Ufuk; Matsui, Masafumi; Kutrup, Bilal; Eto, Koshiro

    2011-12-01

    We assessed taxonomic relationships among Turkish water frogs through estimation of phylogenetic relationships among 62 adult specimens from 44 distinct populations inhabiting seven main geographical regions of Turkey using 2897 bp sequences of the mitochondrial Cytb, 12S rRNA and 16S rRNA genes with equally-weighted parsimony, likelihood, and Bayesian methods of inference. A monophyletic clade (Clade A) of the northwesternmost (Thrace) samples is identified as Pelophylax ridibundus. The other clade (Clade B) consisted of two monophyletic subclades. One of these contains specimens from southernmost populations that are regarded as an unnamed species. The other subclade consists of two lineages, of which one corresponds to P. caralitanus and another to P. bedriagae. Taxonomic relationships of these two species are discussed and recognition of P. caralitanus as a subspecies of P. bedriagae is proposed.

  5. Water stress-responsive genes in loblolly pine (Pinus taeda) roots identified by analyses of expressed sequence tag libraries.

    PubMed

    Lorenz, W Walter; Sun, Feng; Liang, Chun; Kolychev, Dmitri; Wang, Haiming; Zhao, Xin; Cordonnier-Pratt, Marie-Michele; Pratt, Lee H; Dean, Jeffrey F D

    2006-01-01

    Drought stress is the principal cause of seedling mortality in pine forests of the southeastern United States and in many other forested regions around the globe. As part of a larger effort to discover loblolly pine genes, this study subjected rooted cuttings of three unrelated pine genotypes to three watering regimens. Expressed sequence tags (ESTs) were obtained from both the 3' and 5' ends of 12,918 randomly selected cDNAs generated from root tissues. These ESTs were clustered to identify 6,765 unique transcripts (UniScripts) derived from 6,202 putative unique genes (UniGenes-S). Tentative annotations were assigned on the basis of BLASTX comparisons to the Protein Information Resource Nonredundant Reference (PIR-NREF) database. Expression levels of 42 UniScripts varied with high statistical significance with respect to treatment. Many of them resembled gene products shown to be important for drought tolerance in other species, including dehydrins, endochitinases, cytochrome P450 enzymes, pathogenesis-related proteins and various late-embryogenesis abundant (LEA) gene products. Similarly, expression levels of 110 UniScripts varied with high statistical significance among genotypes, indicating that gene expression patterns in this species are much more dependent on genotype than on treatment. Most of the water stress-induced pine UniScripts that appeared to encode products resembling drought tolerance factors in other species were most highly induced in a single genotype, suggesting that particularly useful adaptive alleles for drought tolerance might exist within the collection of cDNAs characterized from this genotype. Mining and visualizing the complete data set, as well as downloading of both EST and UniScript contig sequences, are possible using MAGIC Gene Discovery at http://fungen.org/genediscovery/.

  6. Identification of medicinal Dendrobium species by phylogenetic analyses using matK and rbcL sequences.

    PubMed

    Asahina, Haruka; Shinozaki, Junichi; Masuda, Kazuo; Morimitsu, Yasujiro; Satake, Motoyoshi

    2010-04-01

    Species identification of five Dendrobium plants was conducted using phylogenetic analysis and the validity of the method was verified. Some Dendrobium plants (Orchidaceae) have been used as herbal medicines but the difficulty in identifying their botanical origin by traditional methods prevented their full modern utilization. Based on the emerging field of molecular systematics as a powerful classification tool, a phylogenetic analysis was conducted using sequences of two plastid genes, the maturase-coding gene (matK) and the large subunit of ribulose 1,5-bisphosphate carboxylase-coding gene (rbcL), as DNA barcodes for species identification of Dendrobium plants. We investigated five medicinal Dendrobium species, Dendrobium fimbriatum, D. moniliforme, D. nobile, D. pulchellum, and D. tosaense. The phylogenetic trees constructed from matK data successfully distinguished each species from each other. On the other hand, rbcL, as a single-locus barcode, offered less species discriminating power than matK, possibly due to its being present with little variation. When results using matK sequences of D. officinale that was deposited in the DNA database were combined, D. officinale and D. tosaense showed a close genetic relationship, which brought us closer to resolving the question of their taxonomic identity. Identification of the plant source as well as the uniformity of the chemical components is critical for the quality control of herbal medicines and it is important that the processed materials be validated. The methods presented here could be applied to the analysis of processed Dendrobium plants and be a promising tool for the identification of botanical origins of crude drugs.

  7. Sequence and expression analyses of Cytophaga-like hydrolases in a Western arctic metagenomic library and the Sargasso Sea.

    PubMed

    Cottrell, Matthew T; Yu, Liying; Kirchman, David L

    2005-12-01

    Sequence analysis of environmental DNA promises to provide new insights into the ecology and biogeochemistry of uncultured marine microbes. In this study we used the Sargasso Sea Whole Genome Sequence (WGS) data set to search for hydrolases used by Cytophaga-like bacteria to degrade biopolymers such as polysaccharides and proteins. Analysis of the Sargasso WGS data for contigs bearing both the 16S rRNA genes of Cytophaga-like bacteria and hydrolase genes revealed a cellulase gene (celM) most similar to the gene found in Cytophaga hutchinsonii. A BLAST search of the entire Sargasso Sea WGS data set indicated that celM was the most abundant cellulase-like gene in the Sargasso Sea. However, the similarity between CelM-like cellulases and peptidases belonging to metalloprotease family M42 led us to question whether CelM is involved in the degradation of polysaccharides or proteins. PCR primers were designed for the celM genes in the Sargasso Sea WGS data set and used to identify celM in a fosmid library constructed with prokaryotic DNA from the western Arctic Ocean. Expression analysis of the Cytophaga-like Arctic CelM, which is 63% identical and 77% similar to CelM in C. hutchinsonii, indicated that there was peptidase activity, whereas cellulase activity was not detected. Our analysis suggests that the celM gene plays a role in the degradation of protein by Cytophaga-like bacteria. The abundance of peptidase genes in the Cytophaga-like fosmid clone provides further evidence for the importance of Cytophaga-like bacteria in the degradation of protein in high-molecular-weight dissolved organic matter.
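
    Figures such as "63% identical and 77% similar" are derived from a pairwise alignment by counting identical columns and columns holding conservative substitutions. The sketch below only illustrates that counting. The residue groups are a simplified, hypothetical stand-in for the scoring matrix a real alignment tool would use, the denominator is assumed to be the full alignment length including gapped columns, and the two aligned strings are invented.

```python
# Simplified conservative-substitution groups (an assumption for illustration;
# alignment programs normally count positive-scoring matrix pairs instead).
SIMILAR_GROUPS = ["DE", "KRH", "ST", "NQ", "AILMV", "FWY"]
GROUP_OF = {aa: i for i, group in enumerate(SIMILAR_GROUPS) for aa in group}


def identity_and_similarity(a, b):
    """Percent identity and percent similarity of two pre-aligned sequences.

    '-' marks a gap. Both percentages are taken over the alignment length.
    """
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be aligned, equal-length and non-empty")
    identical = similar = 0
    for x, y in zip(a, b):
        if x == "-" or y == "-":
            continue  # gapped columns count toward the length only
        if x == y:
            identical += 1
            similar += 1
        elif x in GROUP_OF and GROUP_OF[x] == GROUP_OF.get(y):
            similar += 1
    n = len(a)
    return 100 * identical / n, 100 * similar / n


# Invented aligned fragments, not CelM sequences.
ident, sim = identity_and_similarity("ACDE-K", "ACEEFR")
print(ident, sim)
```

    Similarity is always at least as large as identity, since every identical column also counts as similar.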

  8. Alignment of 700 globin sequences: extent of amino acid substitution and its correlation with variation in volume.

    PubMed Central

    Kapp, O. H.; Moens, L.; Vanfleteren, J.; Trotman, C. N.; Suzuki, T.; Vinogradov, S. N.

    1995-01-01

    Seven-hundred globin sequences, including 146 nonvertebrate sequences, were aligned on the basis of conservation of secondary structure and the avoidance of gap penalties. Of the 182 positions needed to accommodate all the globin sequences, only 84 are common to all, including the absolutely conserved PheCD1 and HisF8. The mean number of amino acid substitutions per position ranges from 8 to 13 for all globins and 5 to 9 for internal positions. Although the total sequence volumes have a variation of approximately 2-3%, the variation in volume per position ranges from approximately 13% for the internal to approximately 21% for the surface positions. Plausible correlations exist between amino acid substitution and the variation in volume per position for the 84 common and the internal but not the surface positions. The amino acid substitution matrix derived from the 84 common positions was used to evaluate sequence similarity within the globins and between the globins and phycocyanins C and colicins A, via calculation of pairwise similarity scores. The scores for globin-globin comparisons over the 84 common positions overlap the globin-phycocyanin and globin-colicin scores, with the former being intermediate. For the subset of internal positions, overlap is minimal between the three groups of scores. These results imply a continuum of amino acid sequences able to assume the common three-on-three alpha-helical structure and suggest that the determinants of the latter include sites other than those inaccessible to solvent. PMID:8535255

  9. Amino acid substitutions in genetic variants of human serum albumin and in sequences inferred from molecular cloning

    SciTech Connect

    Takahashi, N.; Takahashi, Y.; Blumberg, B.S.; Putnam, F.W.

    1987-07-01

    The structural changes in four genetic variants of human serum albumin were analyzed by tandem high-pressure liquid chromatography (HPLC) of the tryptic peptides, HPLC mapping and isoelectric focusing of the CNBr fragments, and amino acid sequence analysis of the purified peptides. Lysine-372 of normal (common) albumin A was changed to glutamic acid both in albumin Naskapi, a widespread polymorphic variant of North American Indians, and in albumin Mersin found in Eti Turks. The two variants also exhibited anomalous migration in NaDodSO4/PAGE, which is attributed to a conformational change. The identity of albumins Naskapi and Mersin may have originated through descent from a common mid-Asiatic founder of the two migrating ethnic groups, or it may represent identical but independent mutations of the albumin gene. In albumin Adana, from Eti Turks, the substitution site was not identified but was localized to the region from positions 447 through 548. The substitution of aspartic acid-550 by glycine was found in albumin Mexico-2 from four individuals of the Pima tribe. Although only single-point substitutions have been found in these and in certain other genetic variants of human albumin, five differences exist in the amino acid sequences inferred from cDNA sequences by workers in three other laboratories. However, our results on albumin A and on 14 different genetic variants accord with the amino acid sequence of albumin deduced from the genomic sequence. The apparent amino acid substitutions inferred from comparison of individual cDNA sequences probably reflect artifacts in cloning or in cDNA sequence analysis rather than polymorphism of the coding sections of the albumin gene.

  10. Identification of grass-associated and toluene-degrading diazotrophs, Azoarcus spp., by analyses of partial 16S ribosomal DNA sequences

    SciTech Connect

    Hurek, T.; Reinhold-Hurek, B.

    1995-06-01

    The genus Azoarcus includes nitrogen-fixing, grass-associated strains as well as denitrifying toluene degraders. In order to identify and group members of the genus Azoarcus, phylogenetic analysis based on partial sequences of 16S rRNA genes (16S rDNAs) is proposed. 16S rRNA-targeted PCR using specific primers to exclude amplification in the majority of other members of the beta subclass of the class Proteobacteria was combined with direct sequencing of the PCR products. Tree inference from comparisons of 446-bp rDNA fragments yielded similar results for the three known Azoarcus spp. sequences and for analysis of the complete 16S rDNA sequence. These three species formed a phylogenetically coherent group with representatives of two other Azoarcus species which were subjected to 16S rRNA sequencing in this study. This group was related to Rhodocyclus purpureus and Thauera selenatis. New isolates and also sequences of so far uncultured bacteria from roots of Kallar grass were assigned to the genus Azoarcus as well. Also, strains degrading monoaromatic hydrocarbons anaerobically in the presence of nitrate clustered within this genus, albeit not with grass-associated isolates. All representative members of the five species harboring rhizospheric bacteria were able to form N2O from nitrate and showed anaerobic growth on malic acid with nitrate but not on toluene. In order to visualize different Azoarcus spp. by whole-cell in situ hybridizations, we generated 16S rRNA-targeted, fluorescent probes by in vitro transcription directly from PCR products which spanned the variable region V2. Hybridization was species specific for Azoarcus communis and Azoarcus indigens. The proposed scheme of phylogenetic analysis of PCR-generated 16S rDNA segments will facilitate studies on ecological distribution, host range, and diversity of Azoarcus spp. and may even allow rapid identification of uncultured strains from environmental DNAs. 30 refs., 3 figs.

  11. Real-Time Nucleic Acid Sequence-Based Amplification Assay for Detection of Hepatitis A Virus

    PubMed Central

    Abd El Galil, Khaled H.; El Sokkary, M. A.; Kheira, S. M.; Salazar, Andre M.; Yates, Marylynn V.; Chen, Wilfred; Mulchandani, Ashok

    2005-01-01

    A nucleic acid sequence-based amplification (NASBA) assay in combination with a molecular beacon was developed for the real-time detection and quantification of hepatitis A virus (HAV). A 202-bp, highly conserved 5′ noncoding region of HAV was targeted. The sensitivity of the real-time NASBA assay was tested with 10-fold dilutions of viral RNA, and a detection limit of 1 PFU was obtained. The specificity of the assay was demonstrated by testing with other environmental pathogens and indicator microorganisms, with only HAV positively identified. When combined with immunomagnetic separation, the NASBA assay successfully detected as few as 10 PFU from seeded lake water samples. Due to its isothermal nature, its speed, and its similar sensitivity compared to the real-time RT-PCR assay, this newly reported real-time NASBA method will have broad applications for the rapid detection of HAV in contaminated food or water. PMID:16269748

  12. Evolutionary connections of biological kingdoms based on protein and nucleic acid sequence evidence

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.

    1983-01-01

    Prokaryotic and eukaryotic evolutionary trees are developed from protein and nucleic-acid sequences by the methods of numerical taxonomy. Trees are presented for bacterial ferredoxins, 5S ribosomal RNA, c-type cytochromes, cytochromes c2 and c', and 5.8S ribosomal RNA; the implications for early evolution are discussed; and a composite tree showing the branching of the anaerobes, aerobes, archaebacteria, and eukaryotes is shown. Single lines are found for all oxygen-evolving photosynthetic forms and for the salt-loving and high-temperature forms of archaebacteria. It is argued that the eukaryote mitochondria, chloroplasts, and cytoplasmic host material are descended from free-living prokaryotes that formed symbiotic associations, with more than one symbiotic event involved in the evolution of each organelle.

  13. Sequence-defined shuttles for targeted nucleic acid and protein delivery.

    PubMed

    Röder, Ruth; Wagner, Ernst

    2014-01-01

    Molecular medicine opens into a space of novel specific therapeutic agents: intracellularly active drugs such as peptides, proteins or nucleic acids, which are not able to cross cell membranes and enter the intracellular space on their own. Through the development of cell-targeted shuttles for specific delivery, this restriction in delivery has the potential to be converted into an advantage. On the one hand, due to the multiple extra- and intracellular barriers, such carrier systems need to be multifunctional. On the other hand, they must be precise and reproducibly manufactured due to pharmaceutical reasons. Here we review the design of precise sequence-defined delivery carriers, including solid-phase synthesized peptides and nonpeptidic oligomers, or nucleotide-based carriers such as aptamers and origami nanoboxes.

  14. Identification of amino acid sequences in the polyomavirus capsid proteins that serve as nuclear localization signals

    NASA Technical Reports Server (NTRS)

    Chang, D.; Haynes, J. I. Jr; Brady, J. N.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)

    1993-01-01

    The molecular mechanism participating in the transport of newly synthesized proteins from the cytoplasm to the nucleus in mammalian cells is poorly understood. Recently, the nuclear localization signal sequences (NLS) of many nuclear proteins have been identified, and most have been found to be composed of a highly basic amino acid stretch. A genetic "subtractive" and a biochemical "additive" approach were used in our studies to identify the NLS's of the polyomavirus structural capsid proteins. An NLS was identified at the N-terminus (Ala1-Pro-Lys-Arg-Lys-Ser-Gly-Val-Ser-Lys-Cys11) of the major capsid protein VP1 and at the C-terminus (Glu307-Glu-Asp-Gly-Pro-Glu-Lys-Lys-Lys-Arg-Arg-Leu318) of the VP2/VP3 minor capsid proteins.
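
    The "highly basic amino acid stretch" can be made concrete with a short scan of the two peptides quoted above. The following is a minimal sketch, not the authors' method: the one-letter transcriptions, the function names, and the choice to count only lysine and arginine as basic (histidine excluded) are assumptions made for illustration.

```python
# Peptides from the abstract, transcribed from three-letter to one-letter codes
# (the transcription is ours; the residues are those listed in the abstract).
VP1_NTERM = "APKRKSGVSKC"        # Ala1 ... Cys11 of VP1
VP2_VP3_CTERM = "EEDGPEKKKRRL"   # Glu307 ... Leu318 of VP2/VP3

# Assumption: only Lys (K) and Arg (R) are treated as basic residues.
BASIC = set("KR")


def longest_basic_run(seq):
    """Return (start_index, length) of the longest run of consecutive K/R residues."""
    best_start, best_len = 0, 0
    run_start, run_len = 0, 0
    for i, aa in enumerate(seq):
        if aa in BASIC:
            if run_len == 0:
                run_start = i
            run_len += 1
            if run_len > best_len:
                best_start, best_len = run_start, run_len
        else:
            run_len = 0
    return best_start, best_len


def basic_fraction(seq):
    """Fraction of residues in seq that are K or R."""
    return sum(aa in BASIC for aa in seq) / len(seq)


if __name__ == "__main__":
    for name, seq in (("VP1 N-terminus", VP1_NTERM), ("VP2/VP3 C-terminus", VP2_VP3_CTERM)):
        start, length = longest_basic_run(seq)
        print(name, seq[start:start + length], round(basic_fraction(seq), 2))
```

    Indices are zero-based within each peptide, so they need to be offset (by 1 for VP1, by 307 for VP2/VP3) to recover the residue numbering used in the abstract.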

  15. The amino acid sequence of a carbohydrate-containing fragment of hen ovotransferrin.

    PubMed Central

    Kingston, I B; Williams, J

    1975-01-01

    1. Hen ovotransferrin was treated with CNBr and fractionated by gel filtration. 2. After further treatment by reduction and carboxymethylation a carbohydrate-containing fragment of molecular weight 11990 was obtained (fragment BCd). 3. The amino acid sequence of this fragment was determined. It consists of a single chain of 94 residues. 4. The structure of a tryptic glycopeptide derived from whole ovotransferrin permitted a further eight residues to be assigned at the N-terminus of fragment BCd. 5. Heterogeneity was found at two positions. 6. Further evidence has been deposited as Supplementary Publication SUP 50045 (19 pages) at the British Library (Lending Division), Boston Spa, Wetherby, W. Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1975), 145, 5. PMID:1172663

  16. Comparative sequence analyses on the 16S rRNA (rDNA) of Bacillus acidocaldarius, Bacillus acidoterrestris, and Bacillus cycloheptanicus and proposal for creation of a new genus, Alicyclobacillus gen. nov

    NASA Technical Reports Server (NTRS)

    Wisotzkey, J. D.; Jurtshuk, P. Jr; Fox, G. E.; Deinhard, G.; Poralla, K.

    1992-01-01

    Comparative 16S rRNA (rDNA) sequence analyses performed on the thermophilic Bacillus species Bacillus acidocaldarius, Bacillus acidoterrestris, and Bacillus cycloheptanicus revealed that these organisms are sufficiently different from the traditional Bacillus species to warrant reclassification in a new genus, Alicyclobacillus gen. nov. An analysis of 16S rRNA sequences established that these three thermoacidophiles cluster in a group that differs markedly from both the obligately thermophilic organisms Bacillus stearothermophilus and the facultatively thermophilic organism Bacillus coagulans, as well as many other common mesophilic and thermophilic Bacillus species. The thermoacidophilic Bacillus species B. acidocaldarius, B. acidoterrestris, and B. cycloheptanicus also are unique in that they possess omega-alicyclic fatty acid as the major natural membranous lipid component, which is a rare phenotype that has not been found in any other Bacillus species characterized to date. This phenotype, along with the 16S rRNA sequence data, suggests that these thermoacidophiles are biochemically and genetically unique and supports the proposal that they should be reclassified in the new genus Alicyclobacillus.

  17. The amino acid sequences of two alpha chains of hemoglobins from Komodo dragon Varanus komodoensis and phylogenetic relationships of amniotes.

    PubMed

    Fushitani, K; Higashiyama, K; Moriyama, E N; Imai, K; Hosokawa, K

    1996-09-01

    To elucidate phylogenetic relationships among amniotes and the evolution of alpha globins, hemoglobins were analyzed from the Komodo dragon (Komodo monitor lizard) Varanus komodoensis, the world's largest extant lizard, inhabiting Komodo Islands, Indonesia. Four unique globin chains (alpha A, alpha D, beta B, and beta C) were isolated in an equal molar ratio by high performance liquid chromatography from the hemolysate. The amino acid sequences of two alpha chains were determined. The alpha D chain has a glutamine at E7 as does an alpha chain of a snake, Liophis miliaris, but the alpha A chain has a histidine at E7 like the majority of hemoglobins. Phylogenetic analyses of 19 globins including two alpha chains of Komodo dragon and ones from representative amniotes showed the following results: (1) The alpha chains of squamates (snakes and lizards), which have a glutamine at E7, are clustered with the embryonic alpha globin family, which typically includes the alpha D chain from birds; (2) birds form a sister group with other reptiles but not with mammals; (3) the genes for embryonic and adult types of alpha globins were possibly produced by duplication of the ancestral alpha gene before ancestral amniotes diverged, indicating that each of the present amniotes might carry descendants of the two types of alpha globin genes; (4) squamates first split off from the ancestor of other reptiles and birds.

  18. Amino acid sequence homology between rat and human C-reactive protein.

    PubMed Central

    Taylor, J A; Bruton, C J; Anderson, J K; Mole, J E; De Beer, F C; Baltz, M L; Pepys, M B

    1984-01-01

    The rat serum protein that undergoes Ca2+-dependent binding to pneumococcal C-polysaccharide and to phosphocholine residues, and that is evidently a member of the pentraxin family of proteins by virtue of its appearance under the electron microscope, has been variously designated as rat C-reactive protein (CRP) [de Beer, Baltz, Munn, Feinstein, Taylor, Bruton, Clamp & Pepys (1982) Immunology 45, 55-70], 'phosphoryl choline-binding protein' [Nagpurkar & Mookerjea (1981) J. Biol. Chem. 256, 7440-7448] and rat serum amyloid P component (SAP) [Pontet, D'Asnieres, Gache, Escaig & Engler (1981) Biochim. Biophys. Acta 671, 202-210]. The partial amino acid sequence (45 residues) towards the C-terminus of this protein was determined, and it showed 71.7% identity with the known sequence of human CRP but only 54.3% identity with human SAP. Since human CRP and SAP are themselves approximately 50% homologous, the level of identity between the rat protein and human SAP is evidence only of membership of the pentraxin family. In contrast, the much greater resemblance to human CRP confirms that the rat C-polysaccharide-binding/phosphocholine-binding protein is in fact rat CRP. PMID:6477504
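
    The identity figures quoted above are position-by-position comparisons of aligned sequences. The following is a minimal sketch of such a calculation; the function name, the gap character, and the convention of excluding gapped positions from the denominator are assumptions, since the abstract does not state how its percentages were computed.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned, ungapped positions at which two sequences agree.

    Both inputs must already be aligned and therefore of equal length.
    Assumption: positions with a gap in either sequence are skipped and do
    not count toward the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped positions to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical toy sequences, not the rat or human pentraxin sequences.
    print(percent_identity("ACDEFGHIKL", "ACDEFGHIKV"))
```

    Other conventions (for example, dividing by the full alignment length or by the shorter sequence) give different percentages for the same alignment, so the convention should be reported alongside the number.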

  19. Integrative analyses of RNA editing, alternative splicing, and expression of young genes in human brain transcriptome by deep RNA sequencing.

    PubMed

    Wu, Dong-Dong; Ye, Ling-Qun; Li, Yan; Sun, Yan-Bo; Shao, Yi; Chen, Chunyan; Zhu, Zhu; Zhong, Li; Wang, Lu; Irwin, David M; Zhang, Yong E; Zhang, Ya-Ping

    2015-08-01

    Next-generation RNA sequencing has been successfully used for identification of transcript assembly, evaluation of gene expression levels, and detection of post-transcriptional modifications. Despite these large-scale studies, additional comprehensive RNA-seq data from different subregions of the human brain are required to fully evaluate the evolutionary patterns experienced by the human brain transcriptome. Here, we provide a total of 6.5 billion RNA-seq reads from different subregions of the human brain. A significant correlation was observed between the levels of alternative splicing and RNA editing, which might be explained by a competition between the molecular machineries responsible for the splicing and editing of RNA. Young human protein-coding genes demonstrate biased expression to the neocortical and non-neocortical regions during evolution on the lineage leading to humans. We also found that a significantly greater number of young human protein-coding genes are expressed in the putamen, a tissue that was also observed to have the highest level of RNA-editing activity. The putamen, which previously received little attention, plays an important role in cognitive ability, and our data suggest a potential contribution of the putamen to human evolution.

  20. Phylogenetic Analysis of Bolivian Bat Trypanosomes of the Subgenus Schizotrypanum Based on Cytochrome b Sequence and Minicircle Analyses

    PubMed Central

    García, Lineth; Ortiz, Sylvia; Osorio, Gonzalo; Torrico, Mary Cruz; Torrico, Faustino; Solari, Aldo

    2012-01-01

    The aim of this study was to establish the phylogenetic relationships of trypanosomes present in blood samples of Bolivian Carollia bats. Eighteen cloned stocks were isolated from 115 bats belonging to Carollia perspicillata (Phyllostomidae) from three Amazonian areas of the Chapare Province of Bolivia and studied by xenodiagnosis using the vectors Rhodnius robustus and Triatoma infestans (Trypanosoma cruzi marinkellei) or haemoculture (Trypanosoma dionisii). The PCR-amplified DNA was analyzed by nucleotide sequences of maxicircles encoding cytochrome b and by means of the molecular size of hypervariable regions of minicircles. Ten samples were classified as Trypanosoma cruzi marinkellei and 8 samples as Trypanosoma dionisii. The two species have a different molecular size profile with respect to the amplified regions of minicircles and also with respect to Trypanosoma cruzi and Trypanosoma rangeli used for comparative purpose. We conclude the presence of two species of bat trypanosomes in these samples, which can clearly be identified by the methods used in this study. The presence of these trypanosomes in Amazonian bats is discussed. PMID:22590570

  1. Compact variant-rich customized sequence database and a fast and sensitive database search for efficient proteogenomic analyses.

    PubMed

    Park, Heejin; Bae, Junwoo; Kim, Hyunwoo; Kim, Sangok; Kim, Hokeun; Mun, Dong-Gi; Joh, Yoonsung; Lee, Wonyeop; Chae, Sehyun; Lee, Sanghyuk; Kim, Hark Kyun; Hwang, Daehee; Lee, Sang-Won; Paek, Eunok

    2014-12-01

    In proteogenomic analysis, construction of a compact, customized database from mRNA-seq data and a sensitive search of both reference and customized databases are essential to accurately determine protein abundances and structural variations at the protein level. However, these tasks have not been systematically explored, but rather performed in an ad-hoc fashion. Here, we present an effective method for constructing a compact database containing comprehensive sequences of sample-specific variants--single nucleotide variants, insertions/deletions, and stop-codon mutations derived from Exome-seq and RNA-seq data. It, however, occupies less space by storing variant peptides, not variant proteins. We also present an efficient search method for both customized and reference databases. The separate searches of the two databases increase the search time, and a unified search is less sensitive to identify variant peptides due to the smaller size of the customized database, compared to the reference database, in the target-decoy setting. Our method searches the unified database once, but performs target-decoy validations separately. Experimental results show that our approach is as fast as the unified search and as sensitive as the separate searches. Our customized database includes mutation information in the headers of variant peptides, thereby facilitating the inspection of peptide-spectrum matches.
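
    The idea of searching once but validating the two databases separately can be illustrated with a small target-decoy filter. The following is a rough sketch under stated assumptions, not the authors' implementation: the tuple layout, the group labels, the function names, and the simple decoys/targets FDR estimate are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def accept_at_fdr(psms, max_fdr=0.01):
    """Return accepted target scores from a list of (score, is_decoy) pairs.

    PSMs are ranked by descending score. The estimated FDR at each rank is
    decoys / targets seen so far (an assumed, commonly used estimate); the
    deepest rank whose estimate is within max_fdr sets the cutoff.
    """
    ranked = sorted(psms, key=lambda p: p[0], reverse=True)
    targets = 0
    decoys = 0
    cutoff = 0  # number of top-ranked PSMs to keep
    for rank, (_, is_decoy) in enumerate(ranked, start=1):
        if is_decoy:
            decoys += 1
        else:
            targets += 1
        if targets and decoys / targets <= max_fdr:
            cutoff = rank
    return [score for score, is_decoy in ranked[:cutoff] if not is_decoy]


def validate_separately(psms, max_fdr=0.01):
    """Split (score, is_decoy, group) PSMs by group and filter each on its own."""
    by_group = defaultdict(list)
    for score, is_decoy, group in psms:
        by_group[group].append((score, is_decoy))
    return {group: accept_at_fdr(items, max_fdr) for group, items in by_group.items()}


if __name__ == "__main__":
    # Hypothetical scores from one unified search, labeled by database of origin.
    psms = [
        (10.0, False, "reference"), (9.0, False, "reference"),
        (8.0, True, "reference"), (7.0, False, "reference"),
        (9.5, False, "custom"), (6.0, True, "custom"), (5.0, False, "custom"),
    ]
    print(validate_separately(psms, max_fdr=0.4))
```

    Because each group gets its own cutoff, a small customized database is not held to a threshold dominated by the much larger reference database, which is the sensitivity concern the abstract raises for a single unified validation.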

  2. Non-LTE Abundance Analyses of Nitrogen and Sulfur in Chemically Peculiar Stars of the Upper Main Sequence

    NASA Astrophysics Data System (ADS)

    Takada-Hidai, Masahide; Takeda, Yoichi

    1996-10-01

    The LTE and non-LTE abundances of nitrogen (N) and sulfur (S) in chemically peculiar stars of the upper main sequence were derived from the NI and SI lines observed in a near-infrared spectral region. The sample consisted of 11 stars: three HgMn stars, two Am stars, three magnetic Ap (SrCrEu) stars, two weak-lined stars, and one normal star. The following results were obtained: (1) the LTE abundances of N suffer a large non-LTE effect with correction factors of up to -0.6 dex, while those of S suffer a minor non-LTE effect with correction factors of up to -0.2 dex; (2) the non-LTE abundances of N are systematically below solar value among the sample stars. Although the deficiencies of N are mild in the normal and weak-lined stars, they are enhanced by a factor of up to 2 dex in HgMn stars. A star-to-star variation with a range of 1 dex or more in the N deficiency is shown in Am and SrCrEu stars; (3) the non-LTE abundances of S are solar or slightly overabundant among the sample stars, except for SrCrEu stars. S is systematically deficient relative to the Sun by a factor of ≳ 0.7 dex in SrCrEu stars.

  3. Transcriptome Sequencing Analyses between the Cytoplasmic Male Sterile Line and Its Maintainer Line in Welsh Onion (Allium fistulosum L.)

    PubMed Central

    Liu, Qianchun; Lan, Yanping; Wen, Changlong; Zhao, Hong; Wang, Jian; Wang, Yongqin

    2016-01-01

    Cytoplasmic male sterility (CMS) is important for exploiting heterosis in crop plants and also serves as a model for investigating nuclear–cytoplasmic interaction. The molecular mechanism of cytoplasmic male sterility and fertility restoration was investigated in several important economic crops but remains poorly understood in the Welsh onion. Therefore, we compared the differences between the CMS line 64-2 and its maintainer line 64-1 using transcriptome sequencing with the aim of determining critical genes and pathways associated with male sterility. This study combined two years of RNA-seq data; there were 1504 unigenes (in May 2013) and 2928 unigenes (in May 2014) that were differentially expressed between the CMS and cytoplasmic male maintainer Welsh onion varieties. Known CMS-related genes were found in the set of differentially expressed genes and checked by qPCR. These genes included F-type ATPase, NADH dehydrogenase, cytochrome c oxidase, etc. Overall, this study demonstrated that the CMS regulatory genes and pathways may be associated with the mitochondria and nucleus in the Welsh onion. We believe that this transcriptome dataset will accelerate the research on CMS gene clones and other functional genomics research on A. fistulosum L. PMID:27376286

  4. Amino acid sequences of alpha-helical segments from S-carboxymethylkerateine-A. Complete sequence of a type-I segment.

    PubMed Central

    Gough, K H; Inglis, A S; Crewther, W G

    1978-01-01

    The amino acid sequence of a type-I helical segment from the low-sulphur protein (S-carboxymethylkerateine-A) of wool was determined by combining automatic and manual-sequencing data. Whereas in the type-II helical segment most of the cationic groups occur in pairs, 11 of the 22 anionic residues in the sequence of the type-I segment were situated next to a second anionic residue. This suggests possible interactions between type-I and type-II helical segments in alpha-keratin. As observed with the sequence of a type-II helical segment a model constructed on 3.6 residues per turn of helix shows a line of hydrophobic residues along the helix, thereby supporting the physicochemical evidence that the molecule is predominantly helical and forms part of a coiled-coil structure. Examination of the sequence data by predictive methods indicates the possibility of extensive sections of alpha-helix interspersed with discontinuities. The molecule contains a number of regions with peptide sequences identical with those found by other workers after enzymic digestion of fractions from oxidized wool. PMID:697725

  5. Spermatogenesis of the lizard Lacerta vivipara: histological studies and amino acid sequence of a protamine lacertine 1.

    PubMed

    Martinage, A; Depeiges, A; Wouters, D; Morel, L; Sautière, P

    1996-06-01

    The lizard Lacerta vivipara is a seasonal breeder with a well characterized reproductive cycle. An histological study of the lizard testis has been performed at different stages of spermatogenesis and the nuclear basic proteins content was assessed by electrophoretical analysis. Two protamines, lacertines 1 and 2, are present in spermatozoa in April and May. We have isolated lacertine 1 and characterized a protamine with a mass of 4,963.7 Da. Amino acid sequence of this protamine (41 residues) was established from data provided by automated Edman degradation. It is characterized by a basic amino acid stretch in the N- and C-terminal regions and by a central part which only consists of 3 different intermingled amino acids. This protamine presents 62% homology with scylliorhinine Z3 from dog-fish Scylliorhinus caniculus and 58% homology with quail protamine. The reported lizard protamine sequence is the first reptilian protamine sequence available so far.

  6. The amino acid sequence of the cytochrome c-554(547) from the chemolithotrophic bacterium Thiobacillus neapolitanus.

    PubMed Central

    Ambler, R P; Meyer, T E; Trudinger, P A; Kamen, M D

    1985-01-01

    An amino acid sequence is proposed for the cytochrome c-554(547) from the bacterium Thiobacillus neapolitanus (N.C.I.B. 8539). It consists of a polypeptide chain of 91 residues, with a pair of haem-attachment cysteine residues at positions 15 and 18. There is similarity in sequence with each of the halves of the sequence of the dihaem cytochromes c4 and with a cytochrome c-554(548) from a halophilic strain of Paracoccus. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50127 (11 pages) at the British Library (Lending Division), Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1985) 225, 5. PMID:2988504

  7. Statistical analyses of soil properties on a quaternary terrace sequence in the upper sava river valley, Slovenia, Yugoslavia

    USGS Publications Warehouse

    Vidic, N.; Pavich, M.; Lobnik, F.

    1991-01-01

    Alpine glaciations, climatic changes and tectonic movements have created a Quaternary sequence of gravelly carbonate sediments in the upper Sava River Valley, Slovenia, Yugoslavia. The names for terraces, assigned in this model, Günz, Mindel, Riss and Würm in order of decreasing age, are used as morphostratigraphic terms. Soil chronosequence on the terraces was examined to evaluate which soil properties are time dependent and can be used to help constrain the ages of glaciofluvial sedimentation. Soil thickness, thickness of Bt horizons, amount and continuity of clay coatings and amount of Fe and Mn concretions increase with soil age. The main source of variability consists of solutions of carbonate, leaching of basic cations and acidification of soils, which are time dependent and increase with the age of soils. The second source of variability is the content of organic matter, which is less time dependent, but varies more within soil profiles. Textural changes are significant, represented by solution of carbonate pebbles and sand and formation of a silt loam matrix, which with age becomes finer, with clay loam or clayey texture. The oldest, Günz, terrace shows slight deviation from general progressive trends of changes of soil properties with time. The hypothesis of single versus multiple periods of deposition was tested with one-way analysis of variance (ANOVA) on a staggered, nested hierarchical sampling design on a terrace of largest extent and greatest gravel volume, the Würm terrace. The variability of soil properties is generally higher within subareas than between areas of the terrace, except for the soil thickness. Observed differences in soil thickness between the areas of the terrace could be due to multiple periods of gravel deposition, or to the initial differences of texture of the deposits. © 1991.
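
    The within-versus-between comparison that underlies a one-way ANOVA can be sketched in a few lines. This is a simplified illustration only: it computes a plain one-way F statistic and ignores the staggered, nested hierarchical design used in the study, and the function name and soil-thickness values are hypothetical.

```python
def one_way_anova_f(groups):
    """Return the one-way ANOVA F statistic for a list of groups of numbers.

    F = (between-group mean square) / (within-group mean square), with
    k - 1 and n - k degrees of freedom for k groups and n observations.
    """
    k = len(groups)
    n = sum(len(g) for g in groups)
    if k < 2 or n <= k:
        raise ValueError("need at least two groups and more observations than groups")
    grand_mean = sum(sum(g) for g in groups) / n
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n - k)
    return ms_between / ms_within


if __name__ == "__main__":
    # Hypothetical soil thickness readings from three areas of one terrace.
    areas = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [5.0, 6.0, 7.0]]
    print(one_way_anova_f(areas))
```

    A large F indicates that variation between areas exceeds variation within them, as the abstract reports for soil thickness; an F near or below 1 corresponds to the pattern reported for the other soil properties.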

  8. Nucleic acid sequence of an internal image-bearing monoclonal anti-idiotype and its comparison to the sequence of the external antigen.

    PubMed Central

    Bruck, C; Co, M S; Slaoui, M; Gaulton, G N; Smith, T; Fields, B N; Mullins, J I; Greene, M I

    1986-01-01

    The monoclonal anti-idiotypic antibody (mAb2) 87.92.6 directed against the 9B.G5 antibody specific for the virus neutralizing epitope on the mammalian reovirus type 3 hemagglutinin was previously demonstrated to express an internal image of the receptor binding epitope of the reovirus type 3. Furthermore, this mAb2 has autoimmune reactivity to the cell surface receptor of the reovirus. The nucleotide and deduced amino acid sequences of the 87.92.6 mAb2 heavy and light chains are described in this report. The sequence analysis reveals that the same heavy chain variable and joining (VH and JH) gene segments are used by the 87.92.6 anti-idiotypic mAb2 and by the dominant idiotypes of the BALB/c anti-GAT (cGAT) and anti-NP (NPa) responses. [GAT; random polymer that is 60% glutamic acid, 30% alanine, and 10% tyrosine. NP; (4-hydroxy-3-nitrophenyl)-acetyl.] Despite extensive homology at the level of the heavy chain variable regions, the NPa positive BALB/c anti-NP monoclonal antibody 17.2.25 binds neither 9B.G5 nor the cellular receptor for the hemagglutinin. Amino acid sequence comparison between the viral hemagglutinin and the 87.92.6 mAb2 light chain "internal image," reveals an area of significant homology indicating that antigen mimicry by antibodies may be achieved by sharing primary structure. PMID:2428036

  9. Draft Genome Sequence of Escherichia coli O157:H7 ATCC 35150 and a Nalidixic Acid-Resistant Mutant Derivative

    PubMed Central

    Markell, James A.; Koziol, Adam G.

    2015-01-01

    Shiga toxin-producing Escherichia coli strains, occasionally isolated from food, are of public health importance. Here, we report on the 5.30-Mbp draft genome sequence of E. coli O157:H7 EDL931 (strain ATCC 35150) and the 5.32-Mbp draft genome sequence of a nalidixic acid-resistant mutant derivative used as a distinguishable control strain in food-testing laboratories. PMID:26205873

  10. Microwave-assisted acid and base hydrolysis of intact proteins containing disulfide bonds for protein sequence analysis by mass spectrometry.

    PubMed

    Reiz, Bela; Li, Liang

    2010-09-01

    Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein.
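
    Reading a sequence from a peptide ladder amounts to matching the mass difference between neighbouring ladder peaks to a residue mass. The following is a minimal sketch of that step, not the authors' software: the function name, the tolerance, and the synthetic ladder are assumptions, while the table holds standard monoisotopic residue masses in Da.

```python
# Standard monoisotopic residue masses (Da). Leucine and isoleucine share a
# mass and cannot be told apart by this approach, so only "L" is listed.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "N": 114.04293, "D": 115.02694, "Q": 128.05858, "K": 128.09496,
    "E": 129.04259, "M": 131.04049, "H": 137.05891, "F": 147.06841,
    "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}


def read_ladder(masses, tol=0.01):
    """Translate a peptide ladder into residues, lightest peptide first.

    Each gap between consecutive ladder masses is matched against the
    residue table; "?" marks a gap with no unique match within tol (Da).
    Assumption: tol is small enough to separate Q from K (0.036 Da apart).
    """
    ordered = sorted(masses)
    residues = []
    for lighter, heavier in zip(ordered, ordered[1:]):
        delta = heavier - lighter
        hits = [aa for aa, m in RESIDUE_MASS.items() if abs(delta - m) <= tol]
        residues.append(hits[0] if len(hits) == 1 else "?")
    return "".join(residues)


if __name__ == "__main__":
    # Synthetic ladder: an arbitrary 500 Da peptide extended by G, then A, then K.
    ladder = [500.0]
    for aa in "GAK":
        ladder.append(ladder[-1] + RESIDUE_MASS[aa])
    print(read_ladder(ladder))
```

    Whether the string reads N-to-C or C-to-N depends on which terminus the ladder peptides share. A "?" in the output corresponds to the situation the abstract describes, where cleavage at some amide bonds is not detected and the ladder has a gap.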

  11. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    NASA Astrophysics Data System (ADS)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonaphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  12. Purification, characterization, gene cloning and nucleotide sequencing of D-stereospecific amino acid amidase from soil bacterium: Delftia acidovorans.

    PubMed

    Hongpattarakere, Tipparat; Komeda, Hidenobu; Asano, Yasuhisa

    2005-12-01

    The D-amino acid amidase-producing bacterium was isolated from soil samples using an enrichment culture technique in medium broth containing D-phenylalanine amide as a sole source of nitrogen. The strain exhibiting the strongest activity was identified as Delftia acidovorans strain 16. This strain produced intracellular D-amino acid amidase constitutively. The enzyme was purified about 380-fold to homogeneity and its molecular mass was estimated to be about 50 kDa, on sodium dodecyl sulfate polyacrylamide gel electrophoresis. The enzyme was active preferentially toward D-amino acid amides rather than their L-counterparts. It exhibited strong amino acid amidase activity toward aromatic amino acid amides including D-phenylalanine amide, D-tryptophan amide and D-tyrosine amide, yet it was not specifically active toward low-molecular-weight D-amino acid amides such as D-alanine amide, L-alanine amide and L-serine amide. Moreover, it was not specifically active toward oligopeptides. The enzyme showed maximum activity at 40 degrees C and pH 8.5 and appeared to be very stable, with 92.5% remaining activity after the reaction was performed at 45 degrees C for 30 min. However, it was mostly inactivated in the presence of phenylmethanesulfonyl fluoride or Cd2+, Ag+, Zn2+, Hg2+ and As3+. The NH2 terminal and internal amino acid sequences of the enzyme were determined, and the gene was cloned and sequenced. The enzyme gene damA encodes a 466-amino-acid protein (molecular mass 49,860.46 Da), and the deduced amino acid sequence exhibits homology to the D-amino acid amidase from Variovorax paradoxus (67.9% identity), the amidotransferase A subunit from Burkholderia fungorum (50% identity) and other enantioselective amidases.

  13. The phylogenetic position of the Loimoidae Price, 1936 (Monogenoidea: Monocotylidea) based on analyses of partial rDNA sequences and morphological data.

    PubMed

    Boeger, W A; Kritsky, D C; Domingues, M V; Bueno-Silva, M

    2014-06-01

    Phylogenetic analyses of partial sequences of 18S and 28S rDNA of some monogenoids, including monocotylids and a specimen of Loimosina sp. collected from a hammerhead shark off Brazil, indicated that the Loimoidae (as represented by the specimen of Loimosina sp.) represents an in-group taxon of the Monocotylidae. In all analyses, the Loimoidae fell within a major monocotylid clade including species of the Heterocotylinae, Decacotylinae, and Monocotylinae. The Loimoidae formed a terminal clade with two heterocotyline species, Troglocephalus rhinobatidis and Neoheterocotyle rhinobatis, for which it represented the sister taxon. The following morphological characters supported the clade comprising the Loimoidae, Heterocotylinae, Decacotylinae and Monocotylinae: single vagina present, presence of a narrow deep anchor root, and presence of a marginal haptoral membrane. The presence of cephalic pits was identified as a putative synapomorphy for the clade (Loimoidae (T. rhinobatidis, N. rhinobatis)). Although rDNA sequence data support the rejection of the Loimoidae and incorporating its species into the Monocotylidae, this action was not recommended pending a full phylogenetic analysis of morphological data.

  14. Effects of Acidic Peptide Size and Sequence on Trivalent Praseodymium Adduction and Electron Transfer Dissociation Mass Spectrometry.

    PubMed

    Commodore, Juliette J; Cassady, Carolyn J

    2017-02-07

    Using the lanthanide ion praseodymium, Pr(III), metallated ion formation and electron transfer dissociation (ETD) were studied for 25 biological and model acidic peptides. For chain lengths of seven or more residues, even highly acidic peptides that can be difficult to protonate by electrospray ionization will metallate and undergo abundant ETD fragmentation. Peptides composed of predominantly acidic residues form only the deprotonated ion, [M + Pr - H](2+); this ion yields near complete ETD sequence coverage for larger peptides. Peptides with a mixture of acidic and neutral residues generate [M + Pr](3+), which cleaves between every residue for many peptides. Acidic peptides that contain at least one residue with a basic side chain also produce the protonated ion, [M + Pr + H](4+); this ion undergoes the most extensive sequence coverage by ETD. Primarily metallated and non-metallated c- and z-ions form for all peptides investigated. Metal adducted product ions are only present when at least half of the peptide sequence can be incorporated into the ion; this suggests that the metal ion simultaneously attaches to more than one acidic site. The only site consistently lacking dissociation is at the N-terminal side of a proline residue. Increasing peptide chain length generates more backbone cleavage for metal-peptide complexes with the same charge state. For acidic peptides with the same length, increasing the precursor ion charge state from 2+ to 3+ also leads to more cleavage. The results of this study indicate that highly acidic peptides can be sequenced by ETD of complexes formed with Pr(III).

  15. Effects of simple acid leaching of crushed and powdered geological materials on high-precision Pb isotope analyses

    NASA Astrophysics Data System (ADS)

    Todd, Erin; Stracke, Andreas; Scherer, Erik E.

    2015-07-01

    We present new results of simple acid leaching experiments on the Pb isotope composition of USGS standard reference material powders and on ocean island basalt whole rock splits and powders. Rock samples were leached with cold 6 N HCl in an ultrasonic bath, then on a hot plate, and washed with ultrapure H2O before sample digestion in HF-HNO3 and chromatographic purification of Pb. Lead isotope analyses were measured by Tl-doped MC-ICPMS. Intrasession and intersession analytical reproducibilities of repeated analyses of both synthetic Pb solutions and Pb from single digests of chemically processed natural samples were generally better than 100 ppm (2 SD). The comparison of leached and unleached samples shows that leaching consistently removes variable amounts of contaminants that differ in Pb isotopic composition for different starting materials. For repeated digests of a single sample, analyses of leached samples reproduce better than those of unleached ones, confirming that leaching effectively removes most of the heterogeneously distributed extraneous Pb. Nevertheless, the external reproducibility of leached samples is still up to an order of magnitude worse than that of Pb solution standards (~100 ppm). More complex leaching methods employed by earlier studies yield Pb isotope ratios within error of those produced by our method and at similar levels of reproducibility, demonstrating that our simple leaching method is as effective as more complex leaching techniques. Therefore, any Pb isotope heterogeneity among multiple leached digests of samples in excess of the external reproducibility is attributed to inherent isotopic heterogeneity of the sample. The external precision of ~100 ppm (2 SD) achieved for Pb isotope ratio determination by Tl-doped MC-ICPMS is thus sufficient for most rocks. The full advantage of the most precise Pb isotope analytical methods is only realized in cases where the natural isotopic heterogeneity among samples in a studied suite is

  16. Unifying bacteria from decaying wood with various ubiquitous Gibbsiella species as G. acetica sp. nov. based on nucleotide sequence similarities and their acetic acid secretion.

    PubMed

    Geider, Klaus; Gernold, Marina; Jock, Susanne; Wensing, Annette; Völksch, Beate; Gross, Jürgen; Spiteller, Dieter

    2015-12-01

    Bacteria were isolated from necrotic apple and pear tree tissue and from dead wood in Germany and Austria as well as from pear tree exudate in China. They were selected for growth at 37 °C, screened for levan production and then characterized as Gram-negative, facultatively anaerobic rods. Nucleotide sequences from 16S rRNA genes, the housekeeping genes dnaJ, gyrB, recA and rpoB alignments, BLAST searches and phenotypic data confirmed by MALDI-TOF analysis showed that these bacteria belong to the genus Gibbsiella and resembled strains isolated from diseased oaks in Britain and Spain. Gibbsiella-specific PCR primers were designed from the proline isomerase and the levansucrase genes. Acid secretion was investigated by screening for halo formation on calcium carbonate agar and the compound identified by NMR as acetic acid. Its production by Gibbsiella spp. strains was also determined in culture supernatants by GC/MS analysis after derivatization with pentafluorobenzyl bromide. Some strains were differentiated by the PFGE patterns of SpeI digests and by sequence analyses of the lsc and the ppiD genes, and the Chinese Gibbsiella strain was most divergent. The newly investigated bacteria as well as Gibbsiella quercinecans, Gibbsiella dentisursi and Gibbsiella papilionis, isolated in Britain, Spain, Korea and Japan, are taxonomically related Enterobacteriaceae that tolerate and secrete acetic acid. We therefore propose to unify them in the species Gibbsiella acetica sp. nov.

  17. Analyses of Methylomes Derived from Meso-American Common Bean (Phaseolus vulgaris L.) Using MeDIP-Seq and Whole Genome Sodium Bisulfite-Sequencing

    PubMed Central

    Crampton, Mollee; Sripathi, Venkateswara R.; Hossain, Khwaja; Kalavacharla, Venu

    2016-01-01

    Common bean (Phaseolus vulgaris L.) is economically important for its high protein, fiber, and micronutrient contents, with a relatively small genome size of ∼587 Mb. Common bean is genetically diverse with two major gene pools, Meso-American and Andean. The phenotypic variability within common bean is partly attributed to the genetic diversity and epigenetic changes that are largely influenced by environmental factors. It is well established that an important epigenetic regulator of gene expression is DNA methylation. Here, we present results generated from two high-throughput sequencing technologies, methylated DNA immunoprecipitation-sequencing (MeDIP-seq) and whole genome bisulfite-sequencing (BS-Seq). Our analyses revealed that this Meso-American common bean displays methylation patterns similar to those of other previously published plant methylomes, with CG ∼50%, CHG ∼30%, and CHH ∼2.7% methylation; however, these differ from the common bean reference methylome of Andean origin. We identified higher CG methylation levels in both promoter and genic regions than CHG and CHH contexts. Moreover, we found relatively higher CG methylation levels in genes than in promoters. Conversely, the CHG and CHH methylation levels were higher in promoters than in genes. This is the first genome-wide DNA methylation profiling study in a Meso-American common bean cultivar (“Sierra”) using NGS approaches. Our long-term goal is to generate genome-wide epigenomic maps in common bean focusing on chromatin accessibility, histone modifications, and DNA methylation. PMID:27199997
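
The CG, CHG and CHH contexts reported above are defined by the two bases downstream of a cytosine, where H stands for A, C or T. The following is a minimal sketch of that classification for the forward strand only; the function names are hypothetical and this is not the pipeline used in the study, which would also handle the reverse strand and read-level methylation calls.

```python
from collections import Counter


def cytosine_context(seq: str, pos: int):
    """Classify the cytosine at 0-based index pos as 'CG', 'CHG' or 'CHH'.

    Returns None when the context cannot be determined (sequence end or an
    ambiguous base downstream)."""
    seq = seq.upper()
    if seq[pos] != "C":
        raise ValueError("position does not hold a cytosine")
    downstream = seq[pos + 1:pos + 3]
    if downstream[:1] == "G":
        return "CG"
    if len(downstream) < 2 or any(b not in "ACGT" for b in downstream):
        return None
    # The first downstream base is A, C or T here, i.e. an 'H'.
    return "CHG" if downstream[1] == "G" else "CHH"


def context_counts(seq: str) -> Counter:
    """Count forward-strand cytosines in each methylation context."""
    counts = Counter()
    for i, base in enumerate(seq.upper()):
        if base == "C":
            ctx = cytosine_context(seq, i)
            if ctx is not None:
                counts[ctx] += 1
    return counts


# Hypothetical nine-base fragment, used only for illustration.
print(context_counts("CCGCATCAG"))
```

A per-context methylation level would then be the number of methylated cytosine calls divided by the total calls in that context.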

  18. Analyses of Methylomes Derived from Meso-American Common Bean (Phaseolus vulgaris L.) Using MeDIP-Seq and Whole Genome Sodium Bisulfite-Sequencing.

    PubMed

    Crampton, Mollee; Sripathi, Venkateswara R; Hossain, Khwaja; Kalavacharla, Venu

    2016-01-01

    Common bean (Phaseolus vulgaris L.) is economically important for its high protein, fiber, and micronutrient contents, with a relatively small genome size of ∼587 Mb. Common bean is genetically diverse with two major gene pools, Meso-American and Andean. The phenotypic variability within common bean is partly attributed to the genetic diversity and epigenetic changes that are largely influenced by environmental factors. It is well established that an important epigenetic regulator of gene expression is DNA methylation. Here, we present results generated from two high-throughput sequencing technologies, methylated DNA immunoprecipitation-sequencing (MeDIP-seq) and whole genome bisulfite-sequencing (BS-Seq). Our analyses revealed that this Meso-American common bean displays methylation patterns similar to those of other previously published plant methylomes, with CG ∼50%, CHG ∼30%, and CHH ∼2.7% methylation; however, these differ from the common bean reference methylome of Andean origin. We identified higher CG methylation levels in both promoter and genic regions than CHG and CHH contexts. Moreover, we found relatively higher CG methylation levels in genes than in promoters. Conversely, the CHG and CHH methylation levels were higher in promoters than in genes. This is the first genome-wide DNA methylation profiling study in a Meso-American common bean cultivar ("Sierra") using NGS approaches. Our long-term goal is to generate genome-wide epigenomic maps in common bean focusing on chromatin accessibility, histone modifications, and DNA methylation.

  19. Method for the detection of specific nucleic acid sequences by polymerase nucleotide incorporation

    DOEpatents

    Castro, Alonso

    2004-06-01

    A method for rapid and efficient detection of a target DNA or RNA sequence is provided. A primer having a 3'-hydroxyl group at one end and having a sequence of nucleotides sufficiently homologous with an identifying sequence of nucleotides in the target DNA is selected. The primer is hybridized to the identifying sequence of nucleotides on the DNA or RNA sequence and a reporter molecule is synthesized on the target sequence by progressively binding complementary nucleotides to the primer, where the complementary nucleotides include nucleotides labeled with a fluorophore. Fluorescence emitted by fluorophores on single reporter molecules is detected to identify the target DNA or RNA sequence.

  20. Transcriptome de novo assembly from next-generation sequencing and comparative analyses in the hexaploid salt marsh species Spartina maritima and Spartina alterniflora (Poaceae)

    PubMed Central

    Ferreira de Carvalho, J; Poulain, J; Da Silva, C; Wincker, P; Michon-Coudouel, S; Dheilly, A; Naquin, D; Boutte, J; Salmon, A; Ainouche, M

    2013-01-01

    Spartina species have a critical ecological role in salt marshes and represent an excellent system to investigate recurrent polyploid speciation. Using the 454 GS-FLX pyrosequencer, we assembled and annotated the first reference transcriptome (from roots and leaves) for two related hexaploid Spartina species that hybridize in Western Europe, the East American invasive Spartina alterniflora and the Euro-African S. maritima. The de novo read assembly generated 38 478 consensus sequences and 99% found an annotation using Poaceae databases, representing a total of 16 753 non-redundant genes. Spartina expressed sequence tags were mapped onto the Sorghum bicolor genome, where they were distributed among the subtelomeric arms of the 10 S. bicolor chromosomes, with high gene density correlation. Normalization of the complementary DNA library improved the number of annotated genes. Ecologically relevant genes were identified among GO biological function categories in salt and heavy metal stress response, C4 photosynthesis and in lignin and cellulose metabolism. Expression of some of these genes had been found to be altered by hybridization and genome duplication in a previous microarray-based study in Spartina. As these species are hexaploid, up to three duplicated homoeologs may be expected per locus. When analyzing sequence polymorphism at four different loci in S. maritima and S. alterniflora, we found up to four haplotypes per locus, suggesting the presence of two expressed homoeologous sequences with one or two allelic variants each. This reference transcriptome will allow analysis of specific Spartina genes of ecological or evolutionary interest, estimation of homoeologous gene expression variation using RNA-seq and further gene expression evolution analyses in natural populations. PMID:23149455

  1. Mitochondrial DNA and retroviral RNA analyses of archival oral polio vaccine (OPV CHAT) materials: evidence of macaque nuclear sequences confirms substrate identity.

    PubMed

    Berry, Neil; Jenkins, Adrian; Martin, Javier; Davis, Clare; Wood, David; Schild, Geoffrey; Bottiger, Margareta; Holmes, Harvey; Minor, Philip; Almond, Neil

    2005-02-25

    Inoculation of live experimental oral poliovirus vaccines (OPV CHAT) during the 1950s in central Africa has been proposed to account for the introduction of HIV into human populations. For this to have occurred, it would have been necessary for chimpanzee rather than macaque kidney epithelial cells to have been included in the preparation of early OPV materials. Theoretically, this could have led to contamination with a progenitor of HIV-1 derived from a related simian immunodeficiency virus of chimpanzees (SIVCPZ). In this article we present further detailed analyses of two samples of OPV, CHAT 10A-11 and CHAT 6039/Yugo, which were used in early human trials of poliovirus vaccination. Recovery of poliovirus by culture techniques confirmed the biological viability of the vaccines and sequence analysis of poliovirus RNA specifically identified the presence of the CHAT strain. Independent nested sets of oligonucleotide primers specific for HIV-1/SIVCPZ and HIV-2/SIVMAC/SIVSM phylogenetic lineages, respectively, indicated no evidence of HIV/SIV RNA in either vaccine preparation, at a sensitivity of 100 RNA equivalents/ml. Analysis of cellular substrate by the amplification of two distinct regions of mitochondrial DNA (D-loop control region and 12S ribosomal sequences) revealed no evidence of chimpanzee cellular sequences. However, this approach positively identified rhesus and cynomolgus macaque DNA for the CHAT 10A-11 and CHAT 6039/Yugo vaccine preparations, respectively. Analysis of multiple clones of mtDNA 12S rDNA indicated a relatively high number of nuclear mitochondrial DNA sequences (numts) in the CHAT 10A-11 material, but confirmed the macaque origin of cellular substrate used in vaccine preparation. These data reinforce earlier findings on this topic providing no evidence to support the contention that poliovirus vaccination was responsible for the introduction of HIV into humans and sparking the AIDS pandemic.

  2. Identification of tropomyosins as major allergens in antarctic krill and mantis shrimp and their amino acid sequence characteristics.

    PubMed

    Motoyama, Kanna; Suma, Yota; Ishizaki, Shoichiro; Nagashima, Yuji; Lu, Ying; Ushio, Hideki; Shiomi, Kazuo

    2008-01-01

    Tropomyosin represents a major allergen of decapod crustaceans such as shrimps and crabs, and its highly conserved amino acid sequence (>90% identity) is a molecular basis of the immunoglobulin E (IgE) cross-reactivity among decapods. At present, however, little information is available about allergens in edible crustaceans other than decapods. In this study, the major allergen in two species of edible crustaceans, Antarctic krill Euphausia superba and mantis shrimp Oratosquilla oratoria that are taxonomically distinct from decapods, was demonstrated to be tropomyosin by IgE-immunoblotting using patient sera. The cross-reactivity of the tropomyosins from both species with decapod tropomyosins was also confirmed by inhibition IgE immunoblotting. Sequences of the tropomyosins from both species were determined by complementary deoxyribonucleic acid cloning. The mantis shrimp tropomyosin has high sequence identity (>90% identity) with decapod tropomyosins, especially with fast-type tropomyosins. On the other hand, the Antarctic krill tropomyosin is characterized by diverse alterations in region 13-42, the amino acid sequence of which is highly conserved for decapod tropomyosins, and hence, it shares somewhat lower sequence identity (82.4-89.8% identity) with decapod tropomyosins than the mantis shrimp tropomyosin. Quantification by enzyme-linked immunosorbent assay revealed that Antarctic krill contains tropomyosin at almost the same level as decapods, suggesting that its allergenicity is equivalent to decapods. However, mantis shrimp was assumed to be substantially not allergenic because of the extremely low content of tropomyosin.

  3. Molecular cloning and sequencing of a cDNA encoding the thioesterase domain of the rat fatty acid synthetase.

    PubMed

    Naggert, J; Witkowski, A; Mikkelsen, J; Smith, S

    1988-01-25

    A cloned cDNA containing the entire coding sequence for the long-chain S-acyl fatty acid synthetase thioester hydrolase (thioesterase I) component as well as the 3'-noncoding region of the fatty acid synthetase has been isolated using an expression vector and domain-specific antibodies. The coding region was assigned to the thioesterase I domain by identification of sequences coding for characterized peptide fragments, amino-terminal analysis of the isolated thioesterase I domain and the presence of the serine esterase active-site sequence motif. The thioesterase I domain is 306 amino acids long with a calculated molecular mass of 33,476 daltons; its DNA is flanked at the 5'-end by a region coding for the acyl carrier protein domain and at the 3'-end by a 1,537-base pairs-long noncoding sequence with a poly(A) tail. The thioesterase I domain exhibits a low, albeit discernible, homology with the discrete medium-chain S-acyl fatty acid synthetase thioester hydrolases (thioesterase II) from rat mammary gland and duck uropygial gland, suggesting a distant but common evolutionary ancestry for these proteins.

  4. Human parainfluenza type 3 virus hemagglutinin-neuraminidase glycoprotein: nucleotide sequence of mRNA and limited amino acid sequence of the purified protein.

    PubMed Central

    Elango, N; Coligan, J E; Jambou, R C; Venkatesan, S

    1986-01-01

    The nucleotide sequence of mRNA for the hemagglutinin-neuraminidase (HN) protein of human parainfluenza type 3 virus obtained from the corresponding cDNA clone had a single long open reading frame encoding a putative protein of 64,254 daltons consisting of 572 amino acids. The deduced protein sequence was confirmed by limited N-terminal amino acid microsequencing of CNBr cleavage fragments of native HN that was purified by immunoprecipitation. The HN protein is moderately hydrophobic and has four potential sites (Asn-X-Ser/Thr) of N-glycosylation in the C-terminal half of the molecule. It is devoid of both the N-terminal signal sequence and the C-terminal membrane anchorage domain characteristic of the hemagglutinin of influenza virus and the fusion (F0) protein of the paramyxoviruses. Instead, it has a single prominent hydrophobic region capable of membrane insertion beginning at 32 residues from the N terminus. This N-terminal membrane insertion is similar to that of influenza virus neuraminidase and the recently reported structures of HN proteins of Sendai virus and simian virus 5. PMID:3003381
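
Potential N-glycosylation sites of the Asn-X-Ser/Thr form can be located in a protein sequence with a short pattern search. The sketch below is illustrative only: the function name and the test sequence are hypothetical, and excluding proline at position X is a commonly used convention that is assumed here rather than stated in the abstract.

```python
import re

# Asn, then any residue except Pro (assumed convention), then Ser or Thr.
# The lookahead lets overlapping sequons such as N-N-T-S both be reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(protein: str) -> list[int]:
    """Return 0-based positions of Asn residues in N-X-S/T sequons."""
    return [m.start() for m in SEQUON.finditer(protein.upper())]


# Hypothetical sequence, not the HN protein itself.
print(find_sequons("MNGSANPTNKT"))
```

Restricting the reported positions to the C-terminal half of a sequence, as described for HN, would only require filtering on `pos >= len(protein) // 2`.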

  5. Sequence dependent N-terminal rearrangement and degradation of peptide nucleic acid (PNA) in aqueous solution

    NASA Technical Reports Server (NTRS)

    Eriksson, M.; Christensen, L.; Schmidt, J.; Haaima, G.; Orgel, L.; Nielsen, P. E.

    1998-01-01

    The stability of the PNA (peptide nucleic acid) thymine monomer {N-[2-(thymin-1-ylacetyl)]-N-(2-aminoaminoethyl)glycine} and those of various PNA oligomers (5-8-mers) have been measured at room temperature (20 degrees C) as a function of pH. The thymine monomer undergoes N-acyl transfer rearrangement with a half-life of 34 days at pH 11 as analyzed by 1H NMR; and two reactions, the N-acyl transfer and a sequential degradation, are found by HPLC analysis to occur at measurable rates for the oligomers at pH 9 or above. Dependent on the amino-terminal sequence, half-lives of 350 h to 163 days were found at pH 9. At pH 12 the half-lives ranged from 1.5 h to 21 days. The results are discussed in terms of PNA as a gene therapeutic drug as well as a possible prebiotic genetic material.
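
To put the reported half-lives in perspective, the fraction of intact material after a given time can be estimated as follows. This is a minimal sketch that assumes simple first-order kinetics, which the abstract does not state explicitly; the function names are hypothetical.

```python
import math


def rate_constant(half_life: float) -> float:
    """First-order rate constant k = ln(2) / t_half (same time unit)."""
    return math.log(2) / half_life


def fraction_remaining(t: float, half_life: float) -> float:
    """Fraction of starting material left after time t, assuming
    first-order decay: 0.5 ** (t / t_half)."""
    return 0.5 ** (t / half_life)


# Thymine monomer at pH 11: reported half-life of 34 days.
print(fraction_remaining(34, 34))   # one half-life
print(fraction_remaining(68, 34))   # two half-lives
```

Time and half-life must be given in the same unit (for example, both in days or both in hours).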

  6. Solubility Challenges in High Concentration Monoclonal Antibody Formulations: Relationship with Amino Acid Sequence and Intermolecular Interactions.

    PubMed

    Pindrus, Mariya; Shire, Steven J; Kelley, Robert F; Demeule, Barthélemy; Wong, Rita; Xu, Yiren; Yadav, Sandeep

    2015-11-02

    The purpose of this work was to elucidate the molecular interactions leading to monoclonal antibody self-association and precipitation and utilize biophysical measurements to predict solubility behavior at high protein concentration. Two monoclonal antibodies (mAb-G and mAb-R) binding to overlapping epitopes were investigated. Precipitation of mAb-G solutions was most prominent at high ionic strength conditions and demonstrated strong dependence on ionic strength, as well as slight dependence on solution pH. At similar conditions no precipitation was observed for mAb-R solutions. Intermolecular interactions (interaction parameter, kD) related well with high concentration solubility behavior of both antibodies. Upon increasing buffer ionic strength, interactions of mAb-R tended to weaken, while those of mAb-G became more attractive. To investigate the role of amino acid sequence on precipitation behavior, mutants were designed by substituting the CDR of mAb-R into the mAb-G framework (GM-1) or deleting two hydrophobic residues in the CDR of mAb-G (GM-2). No precipitation was observed at high ionic strength for either mutant. The molecular interactions of mutants were similar in magnitude to those of mAb-R. The results suggest that presence of hydrophobic groups in the CDR of mAb-G may be responsible for compromising its solubility at high ionic strength conditions since deleting these residues mitigated the solubility issue.

  7. Frequencies of amino acid strings in globular protein sequences indicate suppression of blocks of consecutive hydrophobic residues

    PubMed Central

    Schwartz, Russell; Istrail, Sorin; King, Jonathan

    2001-01-01

    Patterns of hydrophobic and hydrophilic residues play a major role in protein folding and function. Long, predominantly hydrophobic strings of 20–22 amino acids each are associated with transmembrane helices and have been used to identify such sequences. Much less attention has been paid to hydrophobic sequences within globular proteins. In prior work on computer simulations of the competition between on-pathway folding and off-pathway aggregate formation, we found that long sequences of consecutive hydrophobic residues promoted aggregation within the model, even controlling for overall hydrophobic content. We report here on an analysis of the frequencies of different lengths of contiguous blocks of hydrophobic residues in a database of amino acid sequences of proteins of known structure. Sequences of three or more consecutive hydrophobic residues are found to be significantly less common in actual globular proteins than would be predicted if residues were selected independently. The result may reflect selection against long blocks of hydrophobic residues within globular proteins relative to what would be expected if residue hydrophobicities were independent of those of nearby residues in the sequence. PMID:11316883
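
The comparison described above, observed frequencies of consecutive hydrophobic blocks against what independent residue selection would predict, can be sketched as follows. The hydrophobic residue set, the function names and the example sequence are assumptions made for illustration and are not the classification or code used by the authors. The expected count treats each residue as hydrophobic independently with probability p.

```python
from collections import Counter
from itertools import groupby

# Assumed hydrophobic set; the paper's own classification may differ.
HYDROPHOBIC = set("AVLIMFWC")


def run_length_counts(seq: str) -> Counter:
    """Count maximal blocks of consecutive hydrophobic residues by length."""
    counts = Counter()
    for is_hydrophobic, block in groupby(seq, key=lambda r: r in HYDROPHOBIC):
        if is_hydrophobic:
            counts[sum(1 for _ in block)] += 1
    return counts


def expected_runs(n: int, p: float, k: int) -> float:
    """Expected number of maximal hydrophobic blocks of exactly length k in
    a sequence of length n when residues are independent with P(H) = p."""
    if k > n:
        return 0.0
    if k == n:
        return p ** n
    # Two end positions need one flanking non-hydrophobic residue;
    # the (n - k - 1) interior positions need two.
    return 2 * p ** k * (1 - p) + (n - k - 1) * p ** k * (1 - p) ** 2


# Hypothetical sequence, used only for illustration.
seq = "AVLKGIIDA"
observed = run_length_counts(seq)
p = sum(r in HYDROPHOBIC for r in seq) / len(seq)
for k in sorted(observed):
    print(k, observed[k], expected_runs(len(seq), p, k))
```

Applied to a whole database, the observed and expected counts would be summed over all sequences before comparing them for each block length.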

  8. Amino acid sequence of rabbit kidney neutral endopeptidase 24.11 (enkephalinase) deduced from a complementary DNA.

    PubMed Central

    Devault, A; Lazure, C; Nault, C; Le Moual, H; Seidah, N G; Chrétien, M; Kahn, P; Powell, J; Mallet, J; Beaumont, A

    1987-01-01

    Neutral endopeptidase (EC 3.4.24.11) is a major constituent of kidney brush border membranes. It is also present in the brain where it has been shown to be involved in the inactivation of opioid peptides, methionine- and leucine-enkephalins. For this reason this enzyme is often called 'enkephalinase'. In order to characterize the primary structure of the enzyme, oligonucleotide probes were designed from partial amino acid sequences and used to isolate clones from kidney cDNA libraries. Sequencing of the cDNA inserts revealed the complete primary structure of the enzyme. Neutral endopeptidase consists of 750 amino acids. It contains a short N-terminal cytoplasmic domain (27 amino acids), a single membrane-spanning segment (23 amino acids) and an extracellul