Science.gov

Sample records for acid consensus sequence

  1. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome.

    PubMed

    Kumar, Ashutosh; Singh, Himanshu N; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)-microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker "retinoic acid" in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5'-AGGTCA-3') in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause fetal brain defects (for which maternal-fetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although, bioinformatic evidence and embryological

  2. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome

    PubMed Central

    Kumar, Ashutosh; Singh, Himanshu N.; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A.

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)—microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker “retinoic acid” in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5′–AGGTCA–3′) in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause fetal brain defects (for which maternal-fetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although, bioinformatic evidence and

  3. [Creation of DNA vaccine vector based on codon-optimized gene of rabies virus glycoprotein (G protein) with consensus amino acid sequence].

    PubMed

    Starodubova, E S; Kuzmenko, Y V; Latanova, A A; Preobrazhenskaya, O V; Karpov, V L

    2016-01-01

    An optimized design of the rabies virus glycoprotein (G protein) for use within DNA vaccines has been suggested. The design represents a territorially adapted antigen constructed taking into account glycoprotein amino acid sequences of the rabies viruses registered in the Russian Federation and the vaccine Vnukovo-32 strain. Based on the created consensus amino acid sequence, the nucleotide codon-optimized sequence of this modified glycoprotein was obtained and cloned into the pVAX1 plasmid (a vector of the last generation used in the creation of DNA vaccines). A twofold increase in this gene expression compared to the expression of the Vnukovo-32 strain viral glycoprotein gene in a similar vector was registered in the transfected cell culture. It has been demonstrated that the accumulation of modified G protein exceeds the number of the control protein synthesized using the plasmid with the Vnukovo-32 strain viral glycoprotein gene by 20 times. Thus, the obtained modified rabies virus glycoprotein can be considered to be a promising DNA vaccine antigen. PMID:27239860

  4. Sequence logos: a new way to display consensus sequences.

    PubMed Central

    Schneider, T D; Stephens, R M

    1990-01-01

    A graphical method is presented for displaying the patterns in a set of aligned sequences. The characters representing the sequence are stacked on top of each other for each position in the aligned sequences. The height of each letter is made proportional to its frequency, and the letters are sorted so the most common one is on top. The height of the entire stack is then adjusted to signify the information content of the sequences at that position. From these 'sequence logos', one can determine not only the consensus sequence but also the relative frequency of bases and the information content (measured in bits) at every position in a site or sequence. The logo displays both significant residues and subtle sequence patterns. PMID:2172928

  5. Definition of the bacterial N-glycosylation site consensus sequence.

    PubMed

    Kowarik, Michael; Young, N Martin; Numao, Shin; Schulz, Benjamin L; Hug, Isabelle; Callewaert, Nico; Mills, Dominic C; Watson, David C; Hernandez, Marcela; Kelly, John F; Wacker, Michael; Aebi, Markus

    2006-05-01

    The Campylobacter jejuni pgl locus encodes an N-linked protein glycosylation machinery that can be functionally transferred into Escherichia coli. In this system, we analyzed the elements in the C. jejuni N-glycoprotein AcrA required for accepting an N-glycan. We found that the eukaryotic primary consensus sequence for N-glycosylation is N terminally extended to D/E-Y-N-X-S/T (Y, X not equalP) for recognition by the bacterial oligosaccharyltransferase (OST) PglB. However, not all consensus sequences were N-glycosylated when they were either artificially introduced or when they were present in non-C. jejuni proteins. We were able to produce recombinant glycoproteins with engineered N-glycosylation sites and confirmed the requirement for a negatively charged side chain at position -2 in C. jejuni N-glycoproteins. N-glycosylation of AcrA by the eukaryotic OST in Saccharomyces cerevisiae occurred independent of the acidic residue at the -2 position. Thus, bacterial N-glycosylation site selection is more specific than the eukaryotic equivalent with respect to the polypeptide acceptor sequence. PMID:16619027

  6. Newly Exerted T Cell Pressures on Mutated Epitopes following Transmission Help Maintain Consensus HIV-1 Sequences

    PubMed Central

    Eriksson, Emily M.; Liegler, Teri; Keh, Chris E.; Karlsson, Annika C.; Holditch, Sara J.; Pilcher, Christopher D.; Loeb, Lisa; Nixon, Douglas F.; Hecht, Frederick M.

    2015-01-01

    CD8+ T cells are important for HIV-1 virus control, but are also a major contributing factor that drives HIV-1 virus sequence evolution. Although HIV-1 cytotoxic T cell (CTL) escape mutations are a common aspect during HIV-1 infection, less is known about the importance of T cell pressure in reversing HIV-1 virus back to a consensus sequences. In this study we aimed to assess the frequency with which reversion of transmitted mutations in T cell epitopes were associated with T cell responses to the mutation. This study included 14 HIV-1 transmission pairs consisting of a ‘source’ (virus-donor) and a ‘recipient’ (newly infected individual). Non-consensus B sequence amino acids (mutations) in T cell epitopes in HIV-1 gag regions p17, p24, p2 and p7 were identified in each pair and transmission of mutations to the recipient was verified with population viral sequencing. Longitudinal analyses of the recipient’s viral sequence were used to identify whether reversion of mutations back to the consensus B sequence occurred. Autologous 12-mer peptides overlapping by 11 were synthesized, representing the sequence region surrounding each reversion and longitudinal analysis of T cell responses to source-derived mutated and reverted epitopes were assessed. We demonstrated that mutations in the source were frequently transmitted to the new host and on an average 17 percent of mutated epitopes reverted to consensus sequence in the recipient. T cell responses to these mutated epitopes were detected in 7 of the 14 recipients in whom reversion occurred. Overall, these findings indicate that transmitted non-consensus B epitopes are frequently immunogenic in HLA-mismatched recipients and new T cell pressures to T cell escape mutations following transmission play a significant role in maintaining consensus HIV-1 sequences. PMID:25919393

  7. Reconstruction and applications of consensus yeast metabolic network based on RNA sequencing.

    PubMed

    Zhao, Yuqi; Wang, Yanjie; Zou, Lei; Huang, Jingfei

    2016-04-01

    One practical application of genome-scale metabolic reconstructions is to interrogate multispecies relationships. Here, we report a consensus metabolic model in four yeast species (Saccharomyces cerevisiae, S. paradoxus, S. mikatae, and S. bayanus) by integrating metabolic network simulations with RNA sequencing (RNA-seq) datasets. We generated high-resolution transcriptome maps of four yeast species through de novo assembly and genome-guided approaches. The transcriptomes were annotated and applied to build the consensus metabolic network, which was verified using independent RNA-seq experiments. The expression profiles reveal that the genes involved in amino acid and lipid metabolism are highly coexpressed. The diverse phenotypic characteristics, such as cellular growth and gene deletions, can be simulated using the metabolic model. We also explored the applications of the consensus model in metabolic engineering using yeast-specific reactions and biofuel production as examples. Similar strategies will benefit communities studying genome-scale metabolic networks of other organisms. PMID:27239440

  8. Mutational analysis of the consensus sequence of a replication origin from yeast chromosome III.

    PubMed Central

    Van Houten, J V; Newlon, C S

    1990-01-01

    Yeast autonomously replicating sequence (ARS) elements contain an 11-base-pair core consensus sequence (5'-[A/T]TTTAT[A/G]TTT[A/T]-3') that is required for function. The contribution of each position within this sequence to ARS activity was tested by creating all possible single-base mutations within the core consensus sequence of ARS307 (formerly called the C2G1 ARS) and testing their effects on high-frequency transformation and on plasmid stability. Of the 33 mutations, 22 abolished ARS function as measured by high-frequency transformation, 7 caused more than twofold reductions in plasmid stability, and 4 had no effect on plasmid stability. Mutations that reduced or abolished ARS activity occurred at each position in the consensus sequence, demonstrating that each position of this sequence contributes to ARS function. Of the four mutations that had no effect on ARS activity, three created alternative perfect matches to the core consensus sequence, demonstrating that the alternate bases allowed by the consensus sequence are, indeed, interchangeable. In addition, a change from T to C at position 6 did not perturb wild-type efficiency. To test whether the essential region extends beyond the 11-base-pair consensus sequence, the effects on plasmid stability of point mutations one base 3' to the T-rich strand of the core consensus sequence (position 12) and deletion mutations that altered bases 5' to the T-rich strand of the core consensus sequence were examined. An A at position 12 or the removal of three T residues 5' to the core consensus sequence severely diminished ARS efficiency, showing that the region required for full ARS efficiency extends beyond the core consensus sequence in both directions. PMID:2196439

  9. Application of circular consensus sequencing and network analysis to characterize the bovine IgG repertoire

    PubMed Central

    2012-01-01

    Background Vertebrate immune systems generate diverse repertoires of antibodies capable of mediating response to a variety of antigens. Next generation sequencing methods provide unique approaches to a number of immuno-based research areas including antibody discovery and engineering, disease surveillance, and host immune response to vaccines. In particular, single-molecule circular consensus sequencing permits the sequencing of antibody repertoires at previously unattainable depths of coverage and accuracy. We approached the bovine immunoglobulin G (IgG) repertoire with the objective of characterizing diversity of expressed IgG transcripts. Here we present single-molecule real-time sequencing data of expressed IgG heavy-chain repertoires of four individual cattle. We describe the diversity observed within antigen binding regions and visualize this diversity using a network-based approach. Results We generated 49,945 high quality cDNA sequences, each spanning the entire IgG variable region from four Bos taurus calves. From these sequences we identified 49,521 antigen binding regions using the automated Paratome web server. Approximately 9% of all unique complementarity determining 2 (CDR2) sequences were of variable lengths. A bimodal distribution of unique CDR3 sequence lengths was observed, with common lengths of 5–6 and 21–25 amino acids. The average number of cysteine residues in CDR3s increased with CDR3 length and we observed that cysteine residues were centrally located in CDR3s. We identified 19 extremely long CDR3 sequences (up to 62 amino acids in length) within IgG transcripts. Network analyses revealed distinct patterns among the expressed IgG antigen binding repertoires of the examined individuals. Conclusions We utilized circular consensus sequencing technology to provide baseline data of the expressed bovine IgG repertoire that can be used for future studies important to livestock research. Somatic mutation resulting in base insertions and

  10. Interferon Consensus Sequence Binding Protein Confers Resistance against Yersinia enterocolitica

    PubMed Central

    Hein, Joachim; Kempf, Volkhard A. J.; Diebold, Joachim; Bücheler, Nicole; Preger, Sonja; Horak, Ivan; Sing, Andreas; Kramer, Uwe; Autenrieth, Ingo B.

    2000-01-01

    Interferon consensus sequence binding protein (ICSBP)-deficient mice display enhanced susceptibility to intracellular pathogens. At least two distinct immunoregulatory defects are responsible for this phenotype. First, diminished production of reactive oxygen intermediates in macrophages results in impaired intracellular killing of microorganisms. Second, defective early interleukin-12 (IL-12) production upon microbial challenge leads to a failure in gamma interferon (IFN-γ) induction and subsequently in T helper 1 immune responses. Here, we investigated the role of ICSBP in resistance against the extracellular bacterium Yersinia enterocolitica. ICSBP−/− mice failed to produce IL-12 and IFN-γ, but also IL-4, after Yersinia challenge. In addition, granuloma formation was highly disturbed in infected ICSBP−/− mice, leading to multiple necrotic abscesses in affected organs. Consequently, ICSBP−/− mice rapidly succumbed to acute Yersinia infection. In vitro treatment of spleen cells from ICSBP−/− mice with recombinant IL-12 (rIL-12) or rIL-18 in combination with a second stimulus resulted in IFN-γ induction. In experimental therapy of infected ICSBP−/− mice, we observed that administration of rIL-12 induced IFN-γ production which was associated with improved resistance to Yersinia. In contrast, treatment with rIL-18 failed to enhance endogenous IFN-γ production but nevertheless reduced bacterial burden in ICSBP−/− mice. Although cytokine therapy with rIL-12 or rIL-18 ameliorated the course of Yersinia infection in ICSBP−/− mice, both cytokines failed to completely restore impaired immunity. Taken together, the results indicate that the transcription factor ICSBP is essential for efficient host immune defense against Yersinia. These results are important for understanding the complex host immune responses in bacterial infections. PMID:10678954

  11. Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads.

    PubMed

    Ye, Chengxi; Ma, Zhanshan Sam

    2016-01-01

    Motivation. The third generation sequencing (3GS) technology generates long sequences of thousands of bases. However, its current error rates are estimated in the range of 15-40%, significantly higher than those of the prevalent next generation sequencing (NGS) technologies (less than 1%). Fundamental bioinformatics tasks such as de novo genome assembly and variant calling require high-quality sequences that need to be extracted from these long but erroneous 3GS sequences. Results. We describe a versatile and efficient linear complexity consensus algorithm Sparc to facilitate de novo genome assembly. Sparc builds a sparse k-mer graph using a collection of sequences from a targeted genomic region. The heaviest path which approximates the most likely genome sequence is searched through a sparsity-induced reweighted graph as the consensus sequence. Sparc supports using NGS and 3GS data together, which leads to significant improvements in both cost efficiency and computational efficiency. Experiments with Sparc show that our algorithm can efficiently provide high-quality consensus sequences using both PacBio and Oxford Nanopore sequencing technologies. With only 30× PacBio data, Sparc can reach a consensus with error rate <0.5%. With the more challenging Oxford Nanopore data, Sparc can also achieve similar error rate when combined with NGS data. Compared with the existing approaches, Sparc calculates the consensus with higher accuracy, and uses approximately 80% less memory and time. Availability. The source code is available for download at https://github.com/yechengxi/Sparc. PMID:27330851

  12. Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads

    PubMed Central

    2016-01-01

    Motivation. The third generation sequencing (3GS) technology generates long sequences of thousands of bases. However, its current error rates are estimated in the range of 15–40%, significantly higher than those of the prevalent next generation sequencing (NGS) technologies (less than 1%). Fundamental bioinformatics tasks such as de novo genome assembly and variant calling require high-quality sequences that need to be extracted from these long but erroneous 3GS sequences. Results. We describe a versatile and efficient linear complexity consensus algorithm Sparc to facilitate de novo genome assembly. Sparc builds a sparse k-mer graph using a collection of sequences from a targeted genomic region. The heaviest path which approximates the most likely genome sequence is searched through a sparsity-induced reweighted graph as the consensus sequence. Sparc supports using NGS and 3GS data together, which leads to significant improvements in both cost efficiency and computational efficiency. Experiments with Sparc show that our algorithm can efficiently provide high-quality consensus sequences using both PacBio and Oxford Nanopore sequencing technologies. With only 30× PacBio data, Sparc can reach a consensus with error rate <0.5%. With the more challenging Oxford Nanopore data, Sparc can also achieve similar error rate when combined with NGS data. Compared with the existing approaches, Sparc calculates the consensus with higher accuracy, and uses approximately 80% less memory and time. Availability. The source code is available for download at https://github.com/yechengxi/Sparc. PMID:27330851

  13. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  14. Haplotype and minimum-chimerism consensus determination using short sequence data

    PubMed Central

    2012-01-01

    Background Assembling haplotypes given sequence data derived from a single individual is a well studied problem, but only recently has haplotype assembly been considered for population-sampled data. We discuss a software tool called Hapler, which is designed specifically for low-diversity, low-coverage data such as ecological samples derived from natural populations. Because such data may contain error as well as ambiguous haplotype information, we developed methods that increase confidence in these assemblies. Hapler also reconstructs full consensus sequences while minimizing and identifying possible chimeric points. Results Experiments on simulated data indicate that Hapler is effective at assembling haplotypes from gene-sized alignments of short reads. Further, in our tests Hapler-generated consensus sequences are less chimeric than the alternative consensus approaches of majority vote and viral quasispecies estimation regardless of error rate, read length, or population haplotype bias. Conclusions The analysis of genetically diverse sequence data is increasingly common, particularly in the field of ecoinformatics where transcriptome sequencing of natural populations is a cost effective alternative to genome sequencing. For such studies, it is important to consider and identify haplotype diversity. Hapler provides robust haplotype information and identifies possible phasing errors in consensus sequences, providing valuable information for population studies and downstream usage of resulting assemblies. PMID:22537299

  15. Scoring consensus of multiple ECG annotators by optimal sequence alignment.

    PubMed

    Haghpanahi, Masoumeh; Sameni, Reza; Borkholder, David A

    2014-01-01

    Development of ECG delineation algorithms has been an area of intense research in the field of computational cardiology for the past few decades. However, devising evaluation techniques for scoring and/or merging the results of such algorithms, both in the presence or absence of gold standards, still remains as a challenge. This is mainly due to existence of missed or erroneous determination of fiducial points in the results of different annotation algorithms. The discrepancy between different annotators increases when the reference signal includes arrhythmias or significant noise and its morphology deviates from a clean ECG signal. In this work, we propose a new approach to evaluate and compare the results of different annotators under such conditions. Specifically, we use sequence alignment techniques similar to those used in bioinformatics for the alignment of gene sequences. Our approach is based on dynamic programming where adequate mismatch penalties, depending on the type of the fiducial point and the underlying signal, are defined to optimally align the annotation sequences. We also discuss how to extend the algorithm for more than two sequences by using suitable data structures to align multiple annotation sequences with each other. Once the sequences are aligned, different heuristics are devised to evaluate the performance against a gold standard annotation, or to merge the results of multiple annotations when no gold standard exists. PMID:25570339

  16. Splice site consensus sequences are preferentially accessible to nucleases in isolated adenovirus RNA.

    PubMed Central

    Munroe, S H; Duthie, R S

    1986-01-01

    The conformation of RNA sequences spanning five 3' splice sites and two 5' splice sites in adenovirus mRNA was probed by partial digestion with single-strand specific nucleases. Although cleavage of nucleotides near both 3' and 5' splice sites was observed, most striking was the preferential digestion of sequences near the 3' splice site. At each 3' splice site a region of very strong cleavage is observed at low concentrations of enzyme near the splice site consensus sequence or the upstream branch point consensus sequence. Additional sites of moderately strong cutting near the branch point consensus sequence were observed in those sequences where the splice site was the preferred target. Since recognition of the 3' splice site and branch site appear to be early events in mRNA splicing these observations may indicate that the local conformation of the splice site sequences may play a direct or indirect role in enhancing the accessibility of sequences important for splicing. Images PMID:3024107

  17. Flagellar transcriptional activators FlbB and FlaI: gene sequences and 5' consensus sequences of operons under FlbB and FlaI control.

    PubMed Central

    Bartlett, D H; Frantz, B B; Matsumura, P

    1988-01-01

    The regulation of the expression of the operons in the flagella-chemotaxis regulon in Escherichia coli has been shown to be a highly ordered cascade which closely parallels the assembly of the flagellar structure and the chemotaxis machinery (T. Iino, Annu. Rev. Genet. 11:161-182, 1977; Y. Komeda, J. Bacteriol. 168: 1315-1318). The master operon, flbB, has been sequenced, and one of its gene products (FlaI) has been identified. On the basis of the deduced amino acid sequence, the FlbB protein has similarity to an alternate sigma factor which is responsible for expression of flagella in Bacillus subtilis. In addition, we have sequenced the 5' regions of a number of flagellar operons and compared these sequences with the 5' region of flagellar operons directly and indirectly under FlbB and FlaI control. We found both a consensus sequence which has been shown to be in all other flagellar operons (J. D. Helmann and M. J. Chamberlin, Proc. Natl. Acad. Sci. USA 84:6422-6424) and a derivative consensus sequence, which is found only in the 5' region of operons directly under FlbB and FlaI control. Images PMID:2832369

  18. Defining a Conformational Consensus Motif in Cotransin-Sensitive Signal Sequences: A Proteomic and Site-Directed Mutagenesis Study

    PubMed Central

    Klein, Wolfgang; Westendorf, Carolin; Schmidt, Antje; Conill-Cortés, Mercè; Rutz, Claudia; Blohs, Marcus; Beyermann, Michael; Protze, Jonas; Krause, Gerd; Krause, Eberhard; Schülein, Ralf

    2015-01-01

    The cyclodepsipeptide cotransin was described to inhibit the biosynthesis of a small subset of proteins by a signal sequence-discriminatory mechanism at the Sec61 protein-conducting channel. However, it was not clear how selective cotransin is, i.e. how many proteins are sensitive. Moreover, a consensus motif in signal sequences mediating cotransin sensitivity has yet not been described. To address these questions, we performed a proteomic study using cotransin-treated human hepatocellular carcinoma cells and the stable isotope labelling by amino acids in cell culture technique in combination with quantitative mass spectrometry. We used a saturating concentration of cotransin (30 micromolar) to identify also less-sensitive proteins and to discriminate the latter from completely resistant proteins. We found that the biosynthesis of almost all secreted proteins was cotransin-sensitive under these conditions. In contrast, biosynthesis of the majority of the integral membrane proteins was cotransin-resistant. Cotransin sensitivity of signal sequences was neither related to their length nor to their hydrophobicity. Instead, in the case of signal anchor sequences, we identified for the first time a conformational consensus motif mediating cotransin sensitivity. PMID:25806945

  19. High speed nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.

  20. The iceLogo web server and SOAP service for determining protein consensus sequences.

    PubMed

    Maddelein, Davy; Colaert, Niklaas; Buchanan, Iain; Hulstaert, Niels; Gevaert, Kris; Martens, Lennart

    2015-07-01

    The iceLogo web server and SOAP service implement the previously published iceLogo algorithm. iceLogo builds on probability theory to visualize protein consensus sequences in a format resembling sequence logos. Peptide sequences are compared against a reference sequence set that can be tailored to the studied system and the used protocol. As such, not only over- but also underrepresented residues can be visualized in a statistically sound manner, which further allows the user to easily analyse and interpret conserved sequence patterns in proteins. The web application and SOAP service can be found free and open to all users without the need for a login on http://iomics.ugent.be/icelogoserver/main.html. PMID:25897125

  1. Consensus protein design.

    PubMed

    Porebski, Benjamin T; Buckle, Ashley M

    2016-07-01

    A popular and successful strategy in semi-rational design of protein stability is the use of evolutionary information encapsulated in homologous protein sequences. Consensus design is based on the hypothesis that at a given position, the respective consensus amino acid contributes more than average to the stability of the protein than non-conserved amino acids. Here, we review the consensus design approach, its theoretical underpinnings, successes, limitations and challenges, as well as providing a detailed guide to its application in protein engineering. PMID:27274091

  2. Consensus protein design

    PubMed Central

    Porebski, Benjamin T.; Buckle, Ashley M.

    2016-01-01

    A popular and successful strategy in semi-rational design of protein stability is the use of evolutionary information encapsulated in homologous protein sequences. Consensus design is based on the hypothesis that at a given position, the respective consensus amino acid contributes more than average to the stability of the protein than non-conserved amino acids. Here, we review the consensus design approach, its theoretical underpinnings, successes, limitations and challenges, as well as providing a detailed guide to its application in protein engineering. PMID:27274091

  3. Origin of the human L1 elements: proposed progenitor genes deduced from a consensus DNA sequence.

    PubMed

    Scott, A F; Schmeckpeper, B J; Abdelrazik, M; Comey, C T; O'Hara, B; Rossiter, J P; Cooley, T; Heath, P; Smith, K D; Margolet, L

    1987-10-01

    A consensus sequence for the human long interspersed repeated DNA element, L1Hs (LINE or KpnI sequence), is presented. The sequence contains two open reading frames (ORFs) which are homologous to ORFs in corresponding regions of L1 elements in other species. The L1Hs ORFs are separated by a small evolutionarily nonconserved region. The 5' end of the consensus contains frequent terminators in all three reading frames and has a relatively high GC content with numerous stretches of weak homology with AluI repeats. The 5' ORF extends for a minimum of 723 bp (241 codons). The 3' ORF is 3843 bp (1281 codons) and predicts a protein of 149 kD which has regions of weak homology to the polymerase domain of various reverse transcriptases. The 3' end of the consensus has a 208-bp nonconserved region followed by an adenine-rich end. The organization of the L1Hs consensus sequence resembles the structure of eukaryotic mRNAs except for the noncoding region between ORFs. However, due to base substitutions or truncation most elements appear incapable of producing mRNA that can be translated. Our observation that individual elements cluster into subfamilies on the basis of the presence or absence of blocks of sequence, or by the linkage of alternative bases at multiple positions, suggests that most L1 sequences were derived from a small number of structural genes. An estimate of the mammalian L1 substitution rate was derived and used to predict the age of individual human elements. From this it follows that the majority of human L1 sequences have been generated within the last 30 million years. The human elements studied here differ from each other, yet overall the L1Hs sequences demonstrate a pattern of species-specificity when compared to the L1 families of other mammals. Possible mechanisms that may account for the origin and evolution of the L1 family are discussed. These include pseudogene formation (retroposition), transposition, gene conversion, and RNA recombination. PMID

  4. Is There Scientific Consensus on Acid Rain? -- Excerpts from Six Governmental Reports.

    ERIC Educational Resources Information Center

    Environmental Education Report and Newsletter, 1986

    1986-01-01

    Compiles a series of direct quotations from six governmental reports that reflect a scientific consensus on major aspects of acid deposition. Presents the statements in a question and answer format. Also reviews the sources, extent, and effects of acid rain. (ML)

  5. Stimulatory and inhibitory protein kinase C consensus sequences regulate the cystic fibrosis transmembrane conductance regulator.

    PubMed

    Chappe, Valerie; Hinkson, Deborah A; Howell, L Daniel; Evagelidis, Alexandra; Liao, Jie; Chang, Xiu-Bao; Riordan, John R; Hanrahan, John W

    2004-01-01

    Protein kinase C (PKC) phosphorylation stimulates the cystic fibrosis transmembrane conductance regulator (CFTR) channel and enhances its activation by protein kinase A (PKA) through mechanisms that remain poorly understood. We have examined the effects of mutating consensus sequences for PKC phosphorylation and report here evidence for both stimulatory and inhibitory sites. Sequences were mutated in subsets and the mutants characterized by patch clamping. Activation of a 4CA mutant (S707A/S790A/T791A/S809A) by PKA was similar to that of wild-type CFTR and was enhanced by PKC, whereas responses of 3CA (T582A/T604A/S641A) and 2CA (T682A/S686A) channels to PKA were both drastically reduced (>90%). When each mutation in the 3CA and 2CA constructs was studied individually in a wild-type background, T582, T604, and S686 were found to be essential for PKA activation. Responses were restored when these three residues were reintroduced simultaneously into a 9CA mutant lacking all nine PKC consensus sequences (R6CA revertant); however, PKC phosphorylation was not required for this rescue. Nevertheless, two of the sites (T604 and S686) were phosphorylated in vitro, and PKC alone partially activated wild-type CFTR, the 4CA mutant, and the point mutants T582A and T604A, but not S686A channels, indicating that PKC does act at S686. The region encompassing S641 and T682 is inhibitory, because S641A enhanced activation by PKA, and T682A channels had 4-fold larger responses to PKC compared to wild-type channels. These results identify functionally important PKC consensus sequences on CFTR and will facilitate studies of its convergent regulation by PKC and PKA. PMID:14695900

  6. Stimulatory and inhibitory protein kinase C consensus sequences regulate the cystic fibrosis transmembrane conductance regulator

    PubMed Central

    Chappe, Valerie; Hinkson, Deborah A.; Howell, L. Daniel; Evagelidis, Alexandra; Liao, Jie; Chang, Xiu-Bao; Riordan, John R.; Hanrahan, John W.

    2004-01-01

    Protein kinase C (PKC) phosphorylation stimulates the cystic fibrosis transmembrane conductance regulator (CFTR) channel and enhances its activation by protein kinase A (PKA) through mechanisms that remain poorly understood. We have examined the effects of mutating consensus sequences for PKC phosphorylation and report here evidence for both stimulatory and inhibitory sites. Sequences were mutated in subsets and the mutants characterized by patch clamping. Activation of a 4CA mutant (S707A/S790A/T791A/S809A) by PKA was similar to that of wild-type CFTR and was enhanced by PKC, whereas responses of 3CA (T582A/T604A/S641A) and 2CA (T682A/S686A) channels to PKA were both drastically reduced (>90%). When each mutation in the 3CA and 2CA constructs was studied individually in a wild-type background, T582, T604, and S686 were found to be essential for PKA activation. Responses were restored when these three residues were reintroduced simultaneously into a 9CA mutant lacking all nine PKC consensus sequences (R6CA revertant); however, PKC phosphorylation was not required for this rescue. Nevertheless, two of the sites (T604 and S686) were phosphorylated in vitro, and PKC alone partially activated wild-type CFTR, the 4CA mutant, and the point mutants T582A and T604A, but not S686A channels, indicating that PKC does act at S686. The region encompassing S641 and T682 is inhibitory, because S641A enhanced activation by PKA, and T682A channels had 4-fold larger responses to PKC compared to wild-type channels. These results identify functionally important PKC consensus sequences on CFTR and will facilitate studies of its convergent regulation by PKC and PKA. PMID:14695900

  7. Structure elucidation of the Pribnow box consensus promoter sequence by racemic DNA crystallography.

    PubMed

    Mandal, Pradeep K; Collie, Gavin W; Srivastava, Suresh C; Kauffmann, Brice; Huc, Ivan

    2016-07-01

    It has previously been shown that the use of racemic mixtures of naturally chiral macromolecules such as protein and DNA can significantly aid the crystallogenesis process, thereby addressing one of the major bottlenecks to structure determination by X-ray crystallographic methods-that of crystal growth. Although previous studies have provided convincing evidence of the applicability of the racemic crystallization technique to DNA through the study of well-characterized DNA structures, we sought to apply this method to a historically challenging DNA sequence. For this purpose we chose a non-self-complementary DNA duplex containing the biologically-relevant Pribnow box consensus sequence 'TATAAT'. Four racemic crystal structures of this previously un-crystallizable DNA target are reported (with resolutions in the range of 1.65-2.3 Å), with further crystallographic studies and structural analysis providing insight into the racemic crystallization process as well as structural details of this highly pertinent DNA sequence. PMID:27137886

  8. PDP-CON: prediction of domain/linker residues in protein sequences using a consensus approach.

    PubMed

    Chatterjee, Piyali; Basu, Subhadip; Zubek, Julian; Kundu, Mahantapas; Nasipuri, Mita; Plewczynski, Dariusz

    2016-04-01

    The prediction of domain/linker residues in protein sequences is a crucial task in the functional classification of proteins, homology-based protein structure prediction, and high-throughput structural genomics. In this work, a novel consensus-based machine-learning technique was applied for residue-level prediction of the domain/linker annotations in protein sequences using ordered/disordered regions along protein chains and a set of physicochemical properties. Six different classifiers-decision tree, Gaussian naïve Bayes, linear discriminant analysis, support vector machine, random forest, and multilayer perceptron-were exhaustively explored for the residue-level prediction of domain/linker regions. The protein sequences from the curated CATH database were used for training and cross-validation experiments. Test results obtained by applying the developed PDP-CON tool to the mutually exclusive, independent proteins of the CASP-8, CASP-9, and CASP-10 databases are reported. An n-star quality consensus approach was used to combine the results yielded by different classifiers. The average PDP-CON accuracy and F-measure values for the CASP targets were found to be 0.86 and 0.91, respectively. The dataset, source code, and all supplementary materials for this work are available at https://cmaterju.org/cmaterbioinfo/ for noncommercial use. PMID:26969678

  9. Identification of Phosphorylation Consensus Sequences and Endogenous Neuronal Substrates of the Psychiatric Risk Kinase TNIK.

    PubMed

    Wang, Qi; Amato, Stephen P; Rubitski, David M; Hayward, Matthew M; Kormos, Bethany L; Verhoest, Patrick R; Xu, Lan; Brandon, Nicholas J; Ehlers, Michael D

    2016-02-01

    Traf2- and Nck-interacting kinase (TNIK) is a serine/threonine kinase highly expressed in the brain and enriched in the postsynaptic density of glutamatergic synapses in the mammalian brain. Accumulating genetic evidence and functional data have implicated TNIK as a risk factor for psychiatric disorders. However, the endogenous substrates of TNIK in neurons are unknown. Here, we describe a novel selective small molecule inhibitor of the TNIK kinase family. Using this inhibitor, we report the identification of endogenous neuronal TNIK substrates by immunoprecipitation with a phosphomotif antibody followed by mass spectrometry. Phosphorylation consensus sequences were defined by phosphopeptide sequence analysis. Among the identified substrates were members of the delta-catenin family including p120-catenin, δ-catenin, and armadillo repeat gene deleted in velo-cardio-facial syndrome (ARVCF), each of which is linked to psychiatric or neurologic disorders. Using p120-catenin as a representative substrate, we show TNIK-induced p120-catenin phosphorylation in cells requires intact kinase activity and phosphorylation of TNIK at T181 and T187 in the activation loop. Addition of the small molecule TNIK inhibitor or knocking down TNIK by two shRNAs reduced endogenous p120-catenin phosphorylation in cells. Together, using a TNIK inhibitor and phosphomotif antibody, we identify endogenous substrates of TNIK in neurons, define consensus sequences for TNIK, and suggest signaling pathways by which TNIK influences synaptic development and function linked to psychiatric and neurologic disorders. PMID:26645429

  10. Deduced consensus sequence of Sindbis virus strain AR339: mutations contained in laboratory strains which affect cell culture and in vivo phenotypes.

    PubMed Central

    McKnight, K L; Simpson, D A; Lin, S C; Knott, T A; Polo, J M; Pence, D F; Johannsen, D B; Heidner, H W; Davis, N L; Johnston, R E

    1996-01-01

    The consensus sequence of the Sindbis virus AR339 isolate, the prototype alphavirus, has been deduced. THe results presented here suggest (i) that a substantial proportion of the sequence divergence evident between the consensus sequence and sequences of laboratory strains of AR339 has resulted from selection for efficient growth in cell culture, (ii) that many of these changes affect the virulence of the virus in animal models, and (iii) that such modified genetic backgrounds present in laboratory strains can exert a significant influence on genetic studies of virus pathogenesis and host range. A laboratory strain of Sindbis virus AR339 was sequenced and cloned as a cDNA (pTRSB) from which infectious virus (TRSB) could be derived. The consensus sequence was deduced from the complete sequences of pTRSB and HRsp (E. G. Strauss, C. M. Rice, and J. H. Strauss, Virology 133:92-110, 1984), from partial sequences of the glycoprotein genes of three other AR339 laboratory strains, and by comparison with the sequences of the glycoprotein genes of three other AR339 sequence. HRsp differed form the consensus sequence by eight coding changes, and TRSB differed by three coding changes. In the 5' untranslated region, HRsp differed from the consensus sequence at nucleotide (nt) 5. These differences were likely the result of cell culture passage of the original AR339 isolate. At three of the difference loci (one in TRSB and two in HRsp), selection of cell-culture-adaptive mutations was documented with Sindbis virus or other alphaviruses. Selection in cell culture often results in attenuation of virulence in animals. Considering the TRSB and HRsp sequences together, one noncoding difference from the consensus (an A-for-G substitution in the 5' untranslated region at nt 5) and six coding differences in the glycoprotein genes (at E2 amino acids 1, 3, 70, and 172 and at E1 amino acids 72 and 237) were at loci which, either individually or in combination, significantly affected

  11. Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM

    PubMed Central

    Liang, Yunyun; Liu, Sanyang; Zhang, Shengli

    2015-01-01

    Prediction of protein structural classes for low-similarity sequences is useful for understanding fold patterns, regulation, functions, and interactions of proteins. It is well known that feature extraction is significant to prediction of protein structural class and it mainly uses protein primary sequence, predicted secondary structure sequence, and position-specific scoring matrix (PSSM). Currently, prediction solely based on the PSSM has played a key role in improving the prediction accuracy. In this paper, we propose a novel method called CSP-SegPseP-SegACP by fusing consensus sequence (CS), segmented PsePSSM, and segmented autocovariance transformation (ACT) based on PSSM. Three widely used low-similarity datasets (1189, 25PDB, and 640) are adopted in this paper. Then a 700-dimensional (700D) feature vector is constructed and the dimension is decreased to 224D by using principal component analysis (PCA). To verify the performance of our method, rigorous jackknife cross-validation tests are performed on 1189, 25PDB, and 640 datasets. Comparison of our results with the existing PSSM-based methods demonstrates that our method achieves the favorable and competitive performance. This will offer an important complementary to other PSSM-based methods for prediction of protein structural classes for low-similarity sequences. PMID:26788119

  12. Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM.

    PubMed

    Liang, Yunyun; Liu, Sanyang; Zhang, Shengli

    2015-01-01

    Prediction of protein structural classes for low-similarity sequences is useful for understanding fold patterns, regulation, functions, and interactions of proteins. It is well known that feature extraction is significant to prediction of protein structural class and it mainly uses protein primary sequence, predicted secondary structure sequence, and position-specific scoring matrix (PSSM). Currently, prediction solely based on the PSSM has played a key role in improving the prediction accuracy. In this paper, we propose a novel method called CSP-SegPseP-SegACP by fusing consensus sequence (CS), segmented PsePSSM, and segmented autocovariance transformation (ACT) based on PSSM. Three widely used low-similarity datasets (1189, 25PDB, and 640) are adopted in this paper. Then a 700-dimensional (700D) feature vector is constructed and the dimension is decreased to 224D by using principal component analysis (PCA). To verify the performance of our method, rigorous jackknife cross-validation tests are performed on 1189, 25PDB, and 640 datasets. Comparison of our results with the existing PSSM-based methods demonstrates that our method achieves the favorable and competitive performance. This will offer an important complementary to other PSSM-based methods for prediction of protein structural classes for low-similarity sequences. PMID:26788119

  13. Consensus Sequence of 27 African Horse Sickness Virus Genomes from Viruses Collected over a 76-Year Period (1933 to 2009)

    PubMed Central

    Wright, Isabella M.; van Dijk, Alberdina A.

    2015-01-01

    We announce the complete consensus genome sequence of 27 African horse sickness viruses, representing all nine African horse sickness virus (AHSV) serotypes from historical and recent isolates collected over a 76-year period (1933 to 2009). The data set includes the sequence of the virulent Office International des Epizooties AHSV reference strains which are not adapted to cell culture. PMID:26358586

  14. A highly conserved G-rich consensus sequence in hepatitis C virus core gene represents a new anti–hepatitis C target

    PubMed Central

    Wang, Shao-Ru; Min, Yuan-Qin; Wang, Jia-Qi; Liu, Chao-Xing; Fu, Bo-Shi; Wu, Fan; Wu, Ling-Yu; Qiao, Zhi-Xian; Song, Yan-Yan; Xu, Guo-Hua; Wu, Zhi-Guo; Huang, Gai; Peng, Nan-Fang; Huang, Rong; Mao, Wu-Xiang; Peng, Shuang; Chen, Yu-Qi; Zhu, Ying; Tian, Tian; Zhang, Xiao-Lian; Zhou, Xiang

    2016-01-01

    G-quadruplex (G4) is one of the most important secondary structures in nucleic acids. Until recently, G4 RNAs have not been reported in any ribovirus, such as the hepatitis C virus. Our bioinformatics analysis reveals highly conserved guanine-rich consensus sequences within the core gene of hepatitis C despite the high genetic variability of this ribovirus; we further show using various methods that such consensus sequences can fold into unimolecular G4 RNA structures, both in vitro and under physiological conditions. Furthermore, we provide direct evidences that small molecules specifically targeting G4 can stabilize this structure to reduce RNA replication and inhibit protein translation of intracellular hepatitis C. Ultimately, the stabilization of G4 RNA in the genome of hepatitis C represents a promising new strategy for anti–hepatitis C drug development. PMID:27051880

  15. A highly conserved G-rich consensus sequence in hepatitis C virus core gene represents a new anti-hepatitis C target.

    PubMed

    Wang, Shao-Ru; Min, Yuan-Qin; Wang, Jia-Qi; Liu, Chao-Xing; Fu, Bo-Shi; Wu, Fan; Wu, Ling-Yu; Qiao, Zhi-Xian; Song, Yan-Yan; Xu, Guo-Hua; Wu, Zhi-Guo; Huang, Gai; Peng, Nan-Fang; Huang, Rong; Mao, Wu-Xiang; Peng, Shuang; Chen, Yu-Qi; Zhu, Ying; Tian, Tian; Zhang, Xiao-Lian; Zhou, Xiang

    2016-04-01

    G-quadruplex (G4) is one of the most important secondary structures in nucleic acids. Until recently, G4 RNAs have not been reported in any ribovirus, such as the hepatitis C virus. Our bioinformatics analysis reveals highly conserved guanine-rich consensus sequences within the core gene of hepatitis C despite the high genetic variability of this ribovirus; we further show using various methods that such consensus sequences can fold into unimolecular G4 RNA structures, both in vitro and under physiological conditions. Furthermore, we provide direct evidences that small molecules specifically targeting G4 can stabilize this structure to reduce RNA replication and inhibit protein translation of intracellular hepatitis C. Ultimately, the stabilization of G4 RNA in the genome of hepatitis C represents a promising new strategy for anti-hepatitis C drug development. PMID:27051880

  16. A Comprehensive Approach to Clustering of Expressed Human Gene Sequence: The Sequence Tag Alignment and Consensus Knowledge Base

    PubMed Central

    Miller, Robert T.; Christoffels, Alan G.; Gopalakrishnan, Chella; Burke, John; Ptitsyn, Andrey A.; Broveak, Tania R.; Hide, Winston A.

    1999-01-01

    The expressed human genome is being sequenced and analyzed by disparate groups producing disparate data. The majority of the identified coding portion is in the form of expressed sequence tags (ESTs). The need to discover exonic representation and expression forms of full-length cDNAs for each human gene is frustrated by the partial and variable quality nature of this data delivery. A highly redundant human EST data set has been processed into integrated and unified expressed transcript indices that consist of hierarchically organized human transcript consensi reflecting gene expression forms and genetic polymorphism within an index class. The expression index and its intermediate outputs include cleaned transcript sequence, expression, and alignment information and a higher fidelity subset, SANIGENE. The STACK_PACK clustering system has been applied to dbEST release 121598 (GenBank version 110). Sixty-four percent of 1,313,103 Homo sapiens ESTs are condensed into 143,885 tissue level multiple sequence clusters; linking through clone-ID annotations produces 68,701 total assemblies, such that 81% of the original input set is captured in a STACK multiple sequence or linked cluster. Indexing of alignments by substituent EST accession allows browsing of the data structure and its cross-links to UniGene. STACK metaclusters consolidate a greater number of ESTs by a factor of 1.86 with respect to the corresponding UniGene build. Fidelity comparison with genome reference sequence AC004106 demonstrates consensus expression clusters that reflect significantly lower spurious repeat sequence content and capture alternate splicing within a whole body index cluster and three STACK v.2.3 tissue-level clusters. Statistics of a staggered release whole body index build of STACK v.2.0 are presented. PMID:10568754

  17. Context based computational analysis and characterization of ARS consensus sequences (ACS) of Saccharomyces cerevisiae genome.

    PubMed

    Singh, Vinod Kumar; Krishnamachari, Annangarachari

    2016-09-01

    Genome-wide experimental studies in Saccharomyces cerevisiae reveal that autonomous replicating sequence (ARS) requires an essential consensus sequence (ACS) for replication activity. Computational studies identified thousands of ACS like patterns in the genome. However, only a few hundreds of these sites act as replicating sites and the rest are considered as dormant or evolving sites. In a bid to understand the sequence makeup of replication sites, a content and context-based analysis was performed on a set of replicating ACS sequences that binds to origin-recognition complex (ORC) denoted as ORC-ACS and non-replicating ACS sequences (nrACS), that are not bound by ORC. In this study, DNA properties such as base composition, correlation, sequence dependent thermodynamic and DNA structural profiles, and their positions have been considered for characterizing ORC-ACS and nrACS. Analysis reveals that ORC-ACS depict marked differences in nucleotide composition and context features in its vicinity compared to nrACS. Interestingly, an A-rich motif was also discovered in ORC-ACS sequences within its nucleosome-free region. Profound changes in the conformational features, such as DNA helical twist, inclination angle and stacking energy between ORC-ACS and nrACS were observed. Distribution of ACS motifs in the non-coding segments points to the locations of ORC-ACS which are found far away from the adjacent gene start position compared to nrACS thereby enabling an accessible environment for ORC-proteins. Our attempt is novel in considering the contextual view of ACS and its flanking region along with nucleosome positioning in the S. cerevisiae genome and may be useful for any computational prediction scheme. PMID:27508123

  18. The effect of two closely inserted transcription consensus sequences on coronavirus transcription.

    PubMed Central

    Joo, M; Makino, S

    1995-01-01

    Insertion of an intergenic region from the murine coronavirus mouse hepatitis virus into a mouse hepatitis virus defective interfering (DI) RNA led to transcription of subgenomic DI RNA in helper virus-infected cells. Using this system, we studied how two intergenic regions in close proximity affected subgenomic RNA synthesis. When two intergenic regions were separated by more than 100 nucleotides, slightly less of the larger subgenomic DI RNA (synthesized from the upstream intergenic region) was made; this difference was significant when the intergenic region separation was less than about 35 nucleotides. Deletion of sequences flanking the two intergenic regions inserted in close proximity did not affect transcription. No significant change in the ratio of the two subgenomic DI RNAs was observed when the sequence between the two intergenic regions was altered. Removal of the downstream intergenic region restored transcription of the larger subgenomic DI RNA. The UCUAAAC consensus sequence was needed for efficient suppression of the larger subgenomic DI RNA synthesis. These results demonstrated that the downstream intergenic sequence was suppressing subgenomic DI RNA synthesis from the upstream intergenic region. We discuss possible mechanisms to account for the regulation of this suppression of subgenomic DI RNA synthesis and the ways in which they relate to the general regulation of coronavirus transcription. PMID:7983719

  19. A High Quality Draft Consensus Sequence of the Genome of a Heterozygous Grapevine Variety

    PubMed Central

    Cartwright, Dustin A.; Cestaro, Alessandro; Pruss, Dmitry; Pindo, Massimo; FitzGerald, Lisa M.; Vezzulli, Silvia; Reid, Julia; Malacarne, Giulia; Iliev, Diana; Coppola, Giuseppina; Wardell, Bryan; Micheletti, Diego; Macalma, Teresita; Facci, Marco; Mitchell, Jeff T.; Perazzolli, Michele; Eldredge, Glenn; Gatto, Pamela; Oyzerski, Rozan; Moretto, Marco; Gutin, Natalia; Stefanini, Marco; Chen, Yang; Segala, Cinzia; Davenport, Christine; Demattè, Lorenzo; Mraz, Amy; Battilana, Juri; Stormo, Keith; Costa, Fabrizio; Tao, Quanzhou; Si-Ammour, Azeddine; Harkins, Tim; Lackey, Angie; Perbost, Clotilde; Taillon, Bruce; Stella, Alessandra; Solovyev, Victor; Fawcett, Jeffrey A.; Sterck, Lieven; Vandepoele, Klaas; Grando, Stella M.; Toppo, Stefano; Moser, Claudio; Lanchbury, Jerry; Bogden, Robert; Skolnick, Mark; Sgaramella, Vittorio; Bhatnagar, Satish K.; Fontana, Paolo; Gutin, Alexander; Van de Peer, Yves; Salamini, Francesco; Viola, Roberto

    2007-01-01

    Background Worldwide, grapes and their derived products have a large market. The cultivated grape species Vitis vinifera has potential to become a model for fruit trees genetics. Like many plant species, it is highly heterozygous, which is an additional challenge to modern whole genome shotgun sequencing. In this paper a high quality draft genome sequence of a cultivated clone of V. vinifera Pinot Noir is presented. Principal Findings We estimate the genome size of V. vinifera to be 504.6 Mb. Genomic sequences corresponding to 477.1 Mb were assembled in 2,093 metacontigs and 435.1 Mb were anchored to the 19 linkage groups (LGs). The number of predicted genes is 29,585, of which 96.1% were assigned to LGs. This assembly of the grape genome provides candidate genes implicated in traits relevant to grapevine cultivation, such as those influencing wine quality, via secondary metabolites, and those connected with the extreme susceptibility of grape to pathogens. Single nucleotide polymorphism (SNP) distribution was consistent with a diffuse haplotype structure across the genome. Of around 2,000,000 SNPs, 1,751,176 were mapped to chromosomes and one or more of them were identified in 86.7% of anchored genes. The relative age of grape duplicated genes was estimated and this made possible to reveal a relatively recent Vitis-specific large scale duplication event concerning at least 10 chromosomes (duplication not reported before). Conclusions Sanger shotgun sequencing and highly efficient sequencing by synthesis (SBS), together with dedicated assembly programs, resolved a complex heterozygous genome. A consensus sequence of the genome and a set of mapped marker loci were generated. Homologous chromosomes of Pinot Noir differ by 11.2% of their DNA (hemizygous DNA plus chromosomal gaps). SNP markers are offered as a tool with the potential of introducing a new era in the molecular breeding of grape. PMID:18094749

  20. Chip-based sequencing nucleic acids

    DOEpatents

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  1. Structure elucidation of the Pribnow box consensus promoter sequence by racemic DNA crystallography

    PubMed Central

    Mandal, Pradeep K.; Collie, Gavin W.; Srivastava, Suresh C.; Kauffmann, Brice; Huc, Ivan

    2016-01-01

    It has previously been shown that the use of racemic mixtures of naturally chiral macromolecules such as protein and DNA can significantly aid the crystallogenesis process, thereby addressing one of the major bottlenecks to structure determination by X-ray crystallographic methods—that of crystal growth. Although previous studies have provided convincing evidence of the applicability of the racemic crystallization technique to DNA through the study of well-characterized DNA structures, we sought to apply this method to a historically challenging DNA sequence. For this purpose we chose a non-self-complementary DNA duplex containing the biologically-relevant Pribnow box consensus sequence ‘TATAAT’. Four racemic crystal structures of this previously un-crystallizable DNA target are reported (with resolutions in the range of 1.65–2.3 Å), with further crystallographic studies and structural analysis providing insight into the racemic crystallization process as well as structural details of this highly pertinent DNA sequence. PMID:27137886

  2. Distinguishing Proteins From Arbitrary Amino Acid Sequences

    PubMed Central

    Yau, Stephen S.-T.; Mao, Wei-Guang; Benson, Max; He, Rong Lucy

    2015-01-01

    What kinds of amino acid sequences could possibly be protein sequences? From all existing databases that we can find, known proteins are only a small fraction of all possible combinations of amino acids. Beginning with Sanger's first detailed determination of a protein sequence in 1952, previous studies have focused on describing the structure of existing protein sequences in order to construct the protein universe. No one, however, has developed a criteria for determining whether an arbitrary amino acid sequence can be a protein. Here we show that when the collection of arbitrary amino acid sequences is viewed in an appropriate geometric context, the protein sequences cluster together. This leads to a new computational test, described here, that has proved to be remarkably accurate at determining whether an arbitrary amino acid sequence can be a protein. Even more, if the results of this test indicate that the sequence can be a protein, and it is indeed a protein sequence, then its identity as a protein sequence is uniquely defined. We anticipate our computational test will be useful for those who are attempting to complete the job of discovering all proteins, or constructing the protein universe. PMID:25609314

  3. Surveying determinants of protein structure designability across different energy models and amino-acid alphabets: A consensus

    NASA Astrophysics Data System (ADS)

    Buchler, Nicolas E. G.; Goldstein, Richard A.

    2000-02-01

    A variety of analytical and computational models have been proposed to answer the question of why some protein structures are more "designable" (i.e., have more sequences folding into them) than others. One class of analytical and statistical-mechanical models has approached the designability problem from a thermodynamic viewpoint. These models highlighted specific structural features important for increased designability. Furthermore, designability was shown to be inherently related to thermodynamically relevant energetic measures of protein folding, such as the foldability F and energy gap Δ10. However, many of these models have been done within a very narrow focus: Namely, pair-contact interactions and two-letter amino-acid alphabets. Recently, two-letter amino-acid alphabets for pair-contact models have been shown to contain designability artifacts which disappear for larger-letter amino-acid alphabets. In addition, a solvation model was demonstrated to give identical designability results to previous two-letter amino-acid alphabet pair-contact models. In light of these discordant results, this report synthesizes a broad consensus regarding the relationship between specific structural features, foldability F, energy gap Δ10, and structure designability for different energy models (pair-contact vs solvation) across a wide range of amino-acid alphabets. We also propose a novel measure Zdk which is shown to be well correlated to designability. Finally, we conclusively demonstrate that two-letter amino-acid alphabets for pair-contact models appear to be solvation models in disguise.

  4. The Chinese hamster Alu-equivalent sequence: a conserved highly repetitious, interspersed deoxyribonucleic acid sequence in mammals has a structure suggestive of a transposable element.

    PubMed Central

    Haynes, S R; Toomey, T P; Leinwand, L; Jelinek, W R

    1981-01-01

    A consensus sequence has been determined for a major interspersed deoxyribonucleic acid repeat in the genome of Chinese hamster ovary cells (CHO cells). This sequence is extensively homologous to (i) the human Alu sequence (P. L. Deininger et al., J. Mol. Biol., in press), (ii) the mouse B1 interspersed repetitious sequence (Krayev et al., Nucleic Acids Res. 8:1201-1215, 1980) (iii) an interspersed repetitious sequence from African green monkey deoxyribonucleic acid (Dhruva et al., Proc. Natl. Acad. Sci. U.S.A. 77:4514-4518, 1980) and (iv) the CHO and mouse 4.5S ribonucleic acid (this report; F. Harada and N. Kato, Nucleic Acids Res. 8:1273-1285, 1980). Because the CHO consensus sequence shows significant homology to the human Alu sequence it is termed the CHO Alu-equivalent sequence. A conserved structure surrounding CHO Alu-equivalent family members can be recognized. It is similar to that surrounding the human Alu and the mouse B1 sequences, and is represented as follows: direct repeat-CHO-Alu-A-rich sequence-direct repeat. A composite interspersed repetitious sequence has been identified. Its structure is represented as follows: direct repeat-residue 47 to 107 of CHO-Alu-non-Alu repetitious sequence-A-rich sequence-direct repeat. Because the Alu flanking sequences resemble those that flank known transposable elements, we think it likely that the Alu sequence dispersed throughout the mammalian genome by transposition. Images PMID:9279371

  5. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  6. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  7. Improved metagenome assemblies and taxonomic binning using long-read circular consensus sequence data

    PubMed Central

    Frank, J. A.; Pan, Y.; Tooming-Klunderud, A.; Eijsink, V. G. H.; McHardy, A. C.; Nederbragt, A. J.; Pope, P. B.

    2016-01-01

    DNA assembly is a core methodological step in metagenomic pipelines used to study the structure and function within microbial communities. Here we investigate the utility of Pacific Biosciences long and high accuracy circular consensus sequencing (CCS) reads for metagenomic projects. We compared the application and performance of both PacBio CCS and Illumina HiSeq data with assembly and taxonomic binning algorithms using metagenomic samples representing a complex microbial community. Eight SMRT cells produced approximately 94 Mb of CCS reads from a biogas reactor microbiome sample that averaged 1319 nt in length and 99.7% accuracy. CCS data assembly generated a comparative number of large contigs greater than 1 kb, to those assembled from a ~190x larger HiSeq dataset (~18 Gb) produced from the same sample (i.e approximately 62% of total contigs). Hybrid assemblies using PacBio CCS and HiSeq contigs produced improvements in assembly statistics, including an increase in the average contig length and number of large contigs. The incorporation of CCS data produced significant enhancements in taxonomic binning and genome reconstruction of two dominant phylotypes, which assembled and binned poorly using HiSeq data alone. Collectively these results illustrate the value of PacBio CCS reads in certain metagenomics applications. PMID:27156482

  8. Eliciting neutralizing antibodies with gp120 outer domain constructs based on M-group consensus sequence.

    PubMed

    Qin, Yali; Banasik, Marisa; Kim, SoonJeung; Penn-Nicholson, Adam; Habte, Habtom H; LaBranche, Celia; Montefiori, David C; Wang, Chong; Cho, Michael W

    2014-08-01

    One strategy being evaluated for HIV-1 vaccine development is focusing immune responses towards neutralizing epitopes on the gp120 outer domain (OD) by removing the immunodominant, but non-neutralizing, inner domain. Previous OD constructs have not elicited strong neutralizing antibodies (nAbs). We constructed two immunogens, a monomeric gp120-OD and a trimeric gp120-OD×3, based on an M group consensus sequence (MCON6). Their biochemical and immunological properties were compared with intact gp120. Results indicated better preservation of critical neutralizing epitopes on gp120-OD×3. In contrast to previous studies, our immunogens induced potent, cross-reactive nAbs in rabbits. Although nAbs primarily targeted Tier 1 viruses, they exhibited significant breadth. Epitope mapping analyses indicated that nAbs primarily targeted conserved V3 loop elements. Although the potency and breadth of nAbs were similar for all three immunogens, nAb induction kinetics indicated that gp120-OD×3 was superior to gp120-OD, suggesting that gp120-OD×3 is a promising prototype for further gp120 OD-based immunogen development. PMID:25046154

  9. Eliciting Neutralizing Antibodies with gp120 Outer Domain Constructs Based on M-Group Consensus Sequence

    PubMed Central

    Qin, Yali; Banasik, Marisa; Kim, SoonJeung; Penn-Nicholson, Adam; Habte, Habtom H; Labranche, Celia; Montefiori, David C; Wang, Chong; Cho, Michael W

    2014-01-01

    One strategy being evaluated for HIV-1 vaccine development is focusing immune responses towards neutralizing epitopes on the gp120 outer domain (OD) by removing the immunodominant, but non-neutralizing, inner domain. Previous OD constructs have not elicited strong neutralizing antibodies (nAbs). We constructed two immunogens, a monomeric gp120-OD and a trimeric gp120-OD×3, based on an M group consensus sequence (MCON6). Their biochemical and immunological properties were compared with intact gp120. Results indicated better preservation of critical neutralizing epitopes on gp120-OD×3. In contrast to previous studies, our immunogens induced potent, cross-reactive nAbs in rabbits. Although nAbs primarily targeted Tier 1 viruses, they exhibited significant breadth. Epitope mapping analyses indicated that nAbs primarily targeted conserved V3 loop elements. Although the potency and breadth of nAbs were similar for all three immunogens, nAb induction kinetics indicated that gp120-OD×3 was superior to gp120-OD, suggesting that gp120-OD×3 is a promising prototype for further gp120 OD-based immunogen development. PMID:25046154

  10. Improved metagenome assemblies and taxonomic binning using long-read circular consensus sequence data.

    PubMed

    Frank, J A; Pan, Y; Tooming-Klunderud, A; Eijsink, V G H; McHardy, A C; Nederbragt, A J; Pope, P B

    2016-01-01

    DNA assembly is a core methodological step in metagenomic pipelines used to study the structure and function within microbial communities. Here we investigate the utility of Pacific Biosciences long and high accuracy circular consensus sequencing (CCS) reads for metagenomic projects. We compared the application and performance of both PacBio CCS and Illumina HiSeq data with assembly and taxonomic binning algorithms using metagenomic samples representing a complex microbial community. Eight SMRT cells produced approximately 94 Mb of CCS reads from a biogas reactor microbiome sample that averaged 1319 nt in length and 99.7% accuracy. CCS data assembly generated a comparative number of large contigs greater than 1 kb, to those assembled from a ~190x larger HiSeq dataset (~18 Gb) produced from the same sample (i.e approximately 62% of total contigs). Hybrid assemblies using PacBio CCS and HiSeq contigs produced improvements in assembly statistics, including an increase in the average contig length and number of large contigs. The incorporation of CCS data produced significant enhancements in taxonomic binning and genome reconstruction of two dominant phylotypes, which assembled and binned poorly using HiSeq data alone. Collectively these results illustrate the value of PacBio CCS reads in certain metagenomics applications. PMID:27156482

  11. Energy-based RNA consensus secondary structure prediction in multiple sequence alignments.

    PubMed

    Washietl, Stefan; Bernhart, Stephan H; Kellis, Manolis

    2014-01-01

    Many biologically important RNA structures are conserved in evolution leading to characteristic mutational patterns. RNAalifold is a widely used program to predict consensus secondary structures in multiple alignments by combining evolutionary information with traditional energy-based RNA folding algorithms. Here we describe the theory and applications of the RNAalifold algorithm. Consensus secondary structure prediction not only leads to significantly more accurate structure models, but it also allows to study structural conservation of functional RNAs. PMID:24639158

  12. A consensus linkage map for sugi (Cryptomeria japonica) from two pedigrees, based on microsatellites and expressed sequence tags.

    PubMed Central

    Tani, Naoki; Takahashi, Tomokazu; Iwata, Hiroyoshi; Mukai, Yuzuru; Ujino-Ihara, Tokuko; Matsumoto, Asako; Yoshimura, Kensuke; Yoshimaru, Hiroshi; Murai, Masafumi; Nagasaka, Kazutoshi; Tsumura, Yoshihiko

    2003-01-01

    A consensus map for sugi (Cryptomeria japonica) was constructed by integrating linkage data from two unrelated third-generation pedigrees, one derived from a full-sib cross and the other by self-pollination of F1 individuals. The progeny segregation data of the first pedigree were derived from cleaved amplified polymorphic sequences, microsatellites, restriction fragment length polymorphisms, and single nucleotide polymorphisms. The data of the second pedigree were derived from cleaved amplified polymorphic sequences, isozyme markers, morphological traits, random amplified polymorphic DNA markers, and restriction fragment length polymorphisms. Linkage analyses were done for the first pedigree with JoinMap 3.0, using its parameter set for progeny derived by cross-pollination, and for the second pedigree with the parameter set for progeny derived from selfing of F1 individuals. The 11 chromosomes of C. japonica are represented in the consensus map. A total of 438 markers were assigned to 11 large linkage groups, 1 small linkage group, and 1 nonintegrated linkage group from the second pedigree; their total length was 1372.2 cM. On average, the consensus map showed 1 marker every 3.0 cM. PCR-based codominant DNA markers such as cleaved amplified polymorphic sequences and microsatellite markers were distributed in all linkage groups and occupied about half of mapped loci. These markers are very useful for integration of different linkage maps, QTL mapping, and comparative mapping for evolutional study, especially for species with a large genome size such as conifers. PMID:14668402

  13. Phenolic acid esterases, coding sequences and methods

    DOEpatents

    Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.

    2002-01-01

    Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.

  14. Amino-Acid Sequence of Porcine Pepsin

    PubMed Central

    Tang, J.; Sepulveda, P.; Marciniszyn, J.; Chen, K. C. S.; Huang, W-Y.; Tao, N.; Liu, D.; Lanier, J. P.

    1973-01-01

    As the culmination of several years of experiments, we propose a complete amino-acid sequence for porcine pepsin, an enzyme containing 327 amino-acid residues in a single polypeptide chain. In the sequence determination, the enzyme was treated with cyanogen bromide. Five resulting fragments were purified. The amino-acid sequence of four of the fragments accounted for 290 residues. Because the structure of a 37-residue carboxyl-terminal fragment was already known, it was not studied. The alignment of these fragments was determined from the sequence of methionyl-peptides we had previously reported. We also discovered the locations of activesite aspartyl residues, as well as the pairing of the three disulfide bridges. A minor component of commercial crystalline pepsin was found to contain two extra amino-acid residues, Ala-Leu-, at the amino-terminus of the molecule. This minor component was apparently derived from a different site of cleavage during the activation of porcine pepsinogen. PMID:4587252

  15. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

  16. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-07-21

    A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

  17. Epistasis effects of multiple ancestral-consensus amino acid substitutions on the thermal stability of glycerol kinase from Cellulomonas sp. NT3060.

    PubMed

    Fukuda, Yasuhisa; Abe, Asuka; Tamura, Takashi; Kishimoto, Takahide; Sogabe, Atsushi; Akanuma, Satoshi; Yokobori, Shin-Ichi; Yamagishi, Akihiko; Imada, Katsumi; Inagaki, Kenji

    2016-05-01

    Thermostable variants of the Cellulomonas sp. NT3060 glycerol kinase have been constructed by through the introduction of ancestral-consensus mutations. We produced seven mutants, each having an ancestral-consensus amino acid residue that might be present in the common ancestors of both bacteria and of archaea, and that appeared most frequently at the position of 17 glycerol kinase sequences in the multiple sequence alignment. The thermal stabilities of the resulting mutants were assessed by determining their melting temperatures (Tm), which was defined as the temperature at which 50% of the initial catalytic activity is lost after 15 min of incubation, as well as when the half-life of the catalytic activity occurs at a temperature of 60°C (t1/2). Three mutants showed increased stabilities compared to the wild-type protein. We then produced five more mutants with multiple amino acid substitutions. Some of the resulting mutants showed thermal stabilities much greater than those expected given the stabilities of the respective mutants with single mutations. Therefore, the effects of mutations are not always simply additive and some amino acid substitutions, which do not affect or only slightly improve stability when individually introduced into the protein, show substantial stabilizing effects in combination with other mutations. PMID:26493633

  18. Methods for analyzing nucleic acid sequences

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support Fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  19. Plasmodium falciparum Variability and Immune Evasion Proceed from Antigenicity of Consensus Sequences from DBL6ε; Generalization to All DBL from VAR2CSA

    PubMed Central

    Deloron, Philippe; Milet, Jacqueline; Badaut, Cyril

    2013-01-01

    We studied all consensus sequences within the four least ‘variable blocks’ (VB) present in the DBL6ε domain of VAR2CSA, the protein involved in the adhesion of infected red blood cells by Plasmodium falciparum that causes the Pregnancy-Associated Malaria (PAM). Characterising consensus sequences with respect to recognition of antibodies and percentage of responders among pregnant women living in areas where P. falciparum is endemic allows the identification of the most antigenic sequences within each VB. When combining these consensus sequences among four serotypes from VB1 or VB5, the most often recognized ones are expected to induce pan-reactive antibodies recognizing VAR2CSA from all plasmodial strains. These sequences are of main interest in the design of an immunogenic molecule. Using a similar approach than for DBL6ε, we studied the five other DBL and the CIDRpam from VAR2CSA, and again identified VB segments with highly conserved consensus sequences. In addition, we identified consensus sequences in other var genes expressed by non-PAM parasites. This finding paves the way for vaccine design against other pathologies caused by P. falciparum. PMID:23372786

  20. International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons.

    PubMed

    Olson, Nathan D; Lund, Steven P; Zook, Justin M; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B

    2015-03-01

    This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing(®), or Ion Torrent PGM(®). The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030

  1. Nucleic acids encoding modified human immunodeficiency virus type 1 (HIV-1) group M consensus envelope glycoproteins

    DOEpatents

    Haynes, Barton F.; Gao, Feng; Korber, Bette T.; Hahn, Beatrice H.; Shaw, George M.; Kothe, Denise; Li, Ying Ying; Decker, Julie; Liao, Hua-Xin

    2011-12-06

    The present invention relates, in general, to an immunogen and, in particular, to an immunogen for inducing antibodies that neutralizes a wide spectrum of HIV primary isolates and/or to an immunogen that induces a T cell immune response. The invention also relates to a method of inducing anti-HIV antibodies, and/or to a method of inducing a T cell immune response, using such an immunogen. The invention further relates to nucleic acid sequences encoding the present immunogens.

  2. Accuracy of sequence alignment and fold assessment using reduced amino acid alphabets.

    PubMed

    Melo, Francisco; Marti-Renom, Marc A

    2006-06-01

    Reduced or simplified amino acid alphabets group the 20 naturally occurring amino acids into a smaller number of representative protein residues. To date, several reduced amino acid alphabets have been proposed, which have been derived and optimized by a variety of methods. The resulting reduced amino acid alphabets have been applied to pattern recognition, generation of consensus sequences from multiple alignments, protein folding, and protein structure prediction. In this work, amino acid substitution matrices and statistical potentials were derived based on several reduced amino acid alphabets and their performance assessed in a large benchmark for the tasks of sequence alignment and fold assessment of protein structure models, using as a reference frame the standard alphabet of 20 amino acids. The results showed that a large reduction in the total number of residue types does not necessarily translate into a significant loss of discriminative power for sequence alignment and fold assessment. Therefore, some definitions of a few residue types are able to encode most of the relevant sequence/structure information that is present in the 20 standard amino acids. Based on these results, we suggest that the use of reduced amino acid alphabets may allow to increasing the accuracy of current substitution matrices and statistical potentials for the prediction of protein structure of remote homologs. PMID:16506243

  3. Consensus Rules in Variant Detection from Next-Generation Sequencing Data

    PubMed Central

    Jia, Peilin; Li, Fei; Xia, Jufeng; Chen, Haiquan; Ji, Hongbin; Pao, William; Zhao, Zhongming

    2012-01-01

    A critical step in detecting variants from next-generation sequencing data is post hoc filtering of putative variants called or predicted by computational tools. Here, we highlight four critical parameters that could enhance the accuracy of called single nucleotide variants and insertions/deletions: quality and deepness, refinement and improvement of initial mapping, allele/strand balance, and examination of spurious genes. Use of these sequence features appropriately in variant filtering could greatly improve validation rates, thereby saving time and costs in next-generation sequencing projects. PMID:22715385

  4. Detection of nucleic acid sequences by invader-directed cleavage

    DOEpatents

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.

  5. Application of circular consensus sequencing and network analysis to characterize the bovine IgG repertoire

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Vertebrate immune systems generate diverse repertoires of antibodies capable of mediating response to a variety of antigens. Next generation sequencing methods provide unique approaches to a number of immuno-based research areas including antibody discovery and engineering, disease surve...

  6. Chimaeric virus-like particles derived from consensus genome sequences of human rotavirus strains co-circulating in Africa.

    PubMed

    Jere, Khuzwayo C; O'Neill, Hester G; Potgieter, A Christiaan; van Dijk, Alberdina A

    2014-01-01

    Rotavirus virus-like particles (RV-VLPs) are potential alternative non-live vaccine candidates due to their high immunogenicity. They mimic the natural conformation of native viral proteins but cannot replicate because they do not contain genomic material which makes them safe. To date, most RV-VLPs have been derived from cell culture adapted strains or common G1 and G3 rotaviruses that have been circulating in communities for some time. In this study, chimaeric RV-VLPs were generated from the consensus sequences of African rotaviruses (G2, G8, G9 or G12 strains associated with either P[4], P[6] or P[8] genotypes) characterised directly from human stool samples without prior adaptation of the wild type strains to cell culture. Codon-optimised sequences for insect cell expression of genome segments 2 (VP2), 4 (VP4), 6 (VP6) and 9 (VP7) were cloned into a modified pFASTBAC vector, which allowed simultaneous expression of up to four genes using the Bac-to-Bac Baculovirus Expression System (BEVS; Invitrogen). Several combinations of the genome segments originating from different field strains were cloned to produce double-layered RV-VLPs (dRV-VLP; VP2/6), triple-layered RV-VLPs (tRV-VLP; VP2/6/7 or VP2/6/7/4) and chimaeric tRV-VLPs. The RV-VLPs were produced by infecting Spodoptera frugiperda 9 and Trichoplusia ni cells with recombinant baculoviruses using multi-cistronic, dual co-infection and stepwise-infection expression strategies. The size and morphology of the RV-VLPs, as determined by transmission electron microscopy, revealed successful production of RV-VLPs. The novel approach of producing tRV-VLPs, by using the consensus insect cell codon-optimised nucleotide sequence derived from dsRNA extracted directly from clinical specimens, should speed-up vaccine research and development by by-passing the need to adapt rotaviruses to cell culture. Other problems associated with cell culture adaptation, such as possible changes in epitopes, can also be circumvented

  7. Hybridization and sequencing of nucleic acids using base pair mismatches

    DOEpatents

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  8. Enterocytozoon bieneusi genotype nomenclature based on the internal transcribed spacer sequence: a consensus.

    PubMed

    Santín, Mónica; Fayer, Ronald

    2009-01-01

    The standard method for determining the genotypes of Enterocytozoon bieneusi is based on the DNA sequence of the internal transcribed spacer (ITS) region of the rRNA gene. There are 81 genotypes with 111 genotype names: 26 genotypes have been identified exclusively in humans, eight have been identified in humans and in other hosts, 27 have been identified exclusively in cattle and pigs, six have been identified exclusively in cats and dogs, and 14 have been identified in miscellaneous hosts. Because none of these genotypes has taxonomic status and therefore do not adhere to the International Code of Zoological Nomenclature regarding naming, some genotypes have received multiple names, each different and in separate publications by different authors. Because of the proliferation of genotypes with overlapping names and multiple hosts the scientific literature has become confusing and difficult to efficiently utilize. To reduce confusion and provide guidance for future publications we tabulated all names, GenBank accession numbers, and author citations and propose that the first published name has precedence and should become the primary name used in all subsequent publications in which genotyping is based on ITS sequencing. In those publications the names and GenBank numbers that were submitted at later dates should also be provided by the authors as synonyms to aid readers and reviewers. PMID:19335772

  9. Genomic mapping of phosphorothioates reveals partial modification of short consensus sequences

    PubMed Central

    Cao, Bo; Chen, Chao; DeMott, Michael S.; Cheng, Qiuxiang; Clark, Tyson A.; Xiong, Xiaolin; Zheng, Xiaoqing; Butty, Vincent; Levine, Stuart S.; Yuan, George; Boitano, Matthew; Luong, Khai; Song, Yi; Zhou, Xiufen; Deng, Zixin; Turner, Stephen W.; Korlach, Jonas; You, Delin; Wang, Lianrong; Chen, Shi; Dedon, Peter C.

    2015-01-01

    Bacterial phosphorothioate (PT) DNA modifications are incorporated by Dnd proteins A-E and often function with DndF-H as a restriction-modification (R-M) system, as in Escherichia coli B7A. However, bacteria such as Vibrio cyclitrophicus FF75 lack dndF-H, which points to other PT functions. To better understand PT biology, we report two novel, orthogonal technologies to map PTs across the genomes of B7A and FF75 with >90% agreement: real-time (SMRT) sequencing and deep sequencing of iodine-induced cleavage at PT (ICDS). In B7A, we detect PT on both strands of GpsAAC/GpsTTC motifs, but with only 18% of 40,701 possible sites modified. In contrast, PT in FF75 occurs as a single-strand modification at CpsCA, again with only 14% of 160,541 sites modified. Single-molecule analysis indicates that modification could be partial at any particular genomic site even with active restriction by DndF-H, with direct interaction of modification proteins with GAAC/GTTC sites demonstrated with oligonucleotides. These results point to highly unusual target selection by PT modification proteins and rule out known R-M mechanisms. PMID:24899568

  10. A multiwavelength consensus on the main sequence of star-forming galaxies at z ˜ 2

    NASA Astrophysics Data System (ADS)

    Rodighiero, G.; Renzini, A.; Daddi, E.; Baronchelli, I.; Berta, S.; Cresci, G.; Franceschini, A.; Gruppioni, C.; Lutz, D.; Mancini, C.; Santini, P.; Zamorani, G.; Silverman, J.; Kashino, D.; Andreani, P.; Cimatti, A.; Sánchez, H. Domínguez; Le Floch, E.; Magnelli, B.; Popesso, P.; Pozzi, F.

    2014-09-01

    We compare various star formation rate (SFR) indicators for star-forming galaxies at 1.4 < z < 2.5 in the COSMOS field. The main focus is on the SFRs from the far-IR (PACS-Herschel data) with those from the ultraviolet, for galaxies selected according to the BzK criterion. FIR-selected samples lead to a vastly different slope of the SFR-stellar mass (M*) relation, compared to that of the dominant main-sequence population as measured from the UV, since the FIR selection picks predominantly only a minority of outliers. However, there is overall agreement between the main sequences derived with the two SFR indicators, when stacking on the PACS maps the BzK-selected galaxies. The resulting logarithmic slope of the SFR-M* relation is ˜0.8-0.9, in agreement with that derived from the dust-corrected UV luminosity. Exploiting deeper 24 μm Spitzer data, we have characterized a subsample of galaxies with reddening and SFRs poorly constrained, as they are very faint in the B band. The combination of Herschel with Spitzer data has allowed us to largely break the age/reddening degeneracy for these intriguing sources, by distinguishing whether a galaxy is very red in B-z because of being heavily dust reddened, or whether because star formation has been (or is being) quenched. Finally, we have compared our SFR(UV) to the SFRs derived by stacking the radio data and to those derived from the Hα luminosity of a sample of star-forming galaxies at 1.4 < z < 1.7. The two sets of SFRs are broadly consistent as they are with the SFRs derived from the UV and by stacking the corresponding PACS data in various mass bins.

  11. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of the sequence listing in accordance with the requirements in 37 CFR...

  12. Characterization of Synthetic Chikungunya Viruses Based on the Consensus Sequence of Recent E1-226V Isolates

    PubMed Central

    Scholte, Florine E. M.; Tas, Ali; Martina, Byron E. E.; Cordioli, Paolo; Narayanan, Krishna; Makino, Shinji; Snijder, Eric J.; van Hemert, Martijn J.

    2013-01-01

    Chikungunya virus (CHIKV) is a mosquito-borne alphavirus that re-emerged in 2004 and has caused massive outbreaks in recent years. The lack of a licensed vaccine or treatment options emphasize the need to obtain more insight into the viral life cycle and CHIKV-host interactions. Infectious cDNA clones are important tools for such studies, and for mechanism of action studies on antiviral compounds. Existing CHIKV cDNA clones are based on a single genome from an individual clinical isolate, which is expected to have evolved specific characteristics in response to the host environment, and possibly also during subsequent cell culture passaging. To obtain a virus expected to have the general characteristics of the recent E1-226V CHIKV isolates, we have constructed a new CHIKV full-length cDNA clone, CHIKV LS3, based on the consensus sequence of their aligned genomes. Here we report the characterization of this synthetic virus and a green fluorescent protein-expressing variant (CHIKV LS3-GFP). Their characteristics were compared to those of natural strain ITA07-RA1, which was isolated during the 2007 outbreak in Italy. In cell culture the synthetic viruses displayed phenotypes comparable to the natural isolate, and in a mouse model they caused lethal infections that were indistinguishable from infections with a natural strain. Compared to ITA07-RA1 and clinical isolate NL10/152, the synthetic viruses displayed similar sensitivities to several antiviral compounds. 3-deaza-adenosine was identified as a new inhibitor of CHIKV replication. Cyclosporin A had no effect on CHIKV replication, suggesting that cyclophilins -opposite to what was found for other +RNA viruses- do not play an essential role in CHIKV replication. The characterization of the consensus sequence-based synthetic viruses and their comparison to natural isolates demonstrated that CHIKV LS3 and LS3-GFP are suitable and representative tools to study CHIKV-host interactions, screen for antiviral compounds and

  13. Predicting intrinsic disorder from amino acid sequence.

    PubMed

    Obradovic, Zoran; Peng, Kang; Vucetic, Slobodan; Radivojac, Predrag; Brown, Celeste J; Dunker, A Keith

    2003-01-01

    Blind predictions of intrinsic order and disorder were made on 42 proteins subsequently revealed to contain 9,044 ordered residues, 284 disordered residues in 26 segments of length 30 residues or less, and 281 disordered residues in 2 disordered segments of length greater than 30 residues. The accuracies of the six predictors used in this experiment ranged from 77% to 91% for the ordered regions and from 56% to 78% for the disordered segments. The average of the order and disorder predictions ranged from 73% to 77%. The prediction of disorder in the shorter segments was poor, from 25% to 66% correct, while the prediction of disorder in the longer segments was better, from 75% to 95% correct. Four of the predictors were composed of ensembles of neural networks. This enabled them to deal more efficiently with the large asymmetry in the training data through diversified sampling from the significantly larger ordered set and achieve better accuracy on ordered and long disordered regions. The exclusive use of long disordered regions for predictor training likely contributed to the disparity of the predictions on long versus short disordered regions, while averaging the output values over 61-residue windows to eliminate short predictions of order or disorder probably contributed to the even greater disparity for three of the predictors. This experiment supports the predictability of intrinsic disorder from amino acid sequence. PMID:14579347

  14. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2002-01-01

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  15. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2006-07-04

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  16. Kit for detecting nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2001-01-01

    A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the

  17. Rice MEL2, the RNA recognition motif (RRM) protein, binds in vitro to meiosis-expressed genes containing U-rich RNA consensus sequences in the 3'-UTR.

    PubMed

    Miyazaki, Saori; Sato, Yutaka; Asano, Tomoya; Nagamura, Yoshiaki; Nonomura, Ken-Ichi

    2015-10-01

    Post-transcriptional gene regulation by RNA recognition motif (RRM) proteins through binding to cis-elements in the 3'-untranslated region (3'-UTR) is widely used in eukaryotes to complete various biological processes. Rice MEIOSIS ARRESTED AT LEPTOTENE2 (MEL2) is the RRM protein that functions in the transition to meiosis in proper timing. The MEL2 RRM preferentially associated with the U-rich RNA consensus, UUAGUU[U/A][U/G][A/U/G]U, dependently on sequences and proportionally to MEL2 protein amounts in vitro. The consensus sequences were located in the putative looped structures of the RNA ligand. A genome-wide survey revealed a tendency of MEL2-binding consensus appearing in 3'-UTR of rice genes. Of 249 genes that conserved the consensus in their 3'-UTR, 13 genes spatiotemporally co-expressed with MEL2 in meiotic flowers, and included several genes whose function was supposed in meiosis; such as Replication protein A and OsMADS3. The proteome analysis revealed that the amounts of small ubiquitin-related modifier-like protein and eukaryotic translation initiation factor3-like protein were dramatically altered in mel2 mutant anthers. Taken together with transcriptome and gene ontology results, we propose that the rice MEL2 is involved in the translational regulation of key meiotic genes on 3'-UTRs to achieve the faithful transition of germ cells to meiosis. PMID:26319516

  18. Solid phase sequencing of double-stranded nucleic acids

    DOEpatents

    Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

    2002-01-01

    This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probe comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.

  19. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    States, David J.

    2004-07-28

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  20. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    David J. States

    1998-08-01

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  1. The Interferon Consensus Sequence Binding Protein (Icsbp/Irf8) Is Required for Termination of Emergency Granulopoiesis.

    PubMed

    Hu, Liping; Huang, Weiqi; Hjort, Elizabeth E; Bei, Ling; Platanias, Leonidas C; Eklund, Elizabeth A

    2016-02-19

    Emergency granulopoiesis occurs in response to infectious or inflammatory challenge and is a component of the innate immune response. Some molecular events involved in initiating emergency granulopoiesis are known, but termination of this process is less well defined. In this study, we found that the interferon consensus sequence binding protein (Icsbp/Irf8) was required to terminate emergency granulopoiesis. Icsbp is an interferon regulatory transcription factor with leukemia suppressor activity. Expression of Icsbp is decreased in chronic myeloid leukemia, and Icsbp(-/-) mice exhibit progressive granulocytosis with evolution to blast crisis, similar to the course of human chronic myeloid leukemia. In this study, we found aberrantly sustained granulocyte production in Icsbp(-/-) mice after stimulation of an emergency granulopoiesis response. Icsbp represses transcription of the genes encoding Fas-associated phosphatase 1 (Fap1) and growth arrest-specific 2 (Gas2) and activates genes encoding Fanconi C and F. After stimulation of emergency granulopoiesis, we found increased and sustained expression of Fap1 and Gas2 in bone marrow myeloid progenitor cells from Icsbp(-/-) mice in comparison with the wild type. This was associated with resistance to Fas-induced apoptosis and increased β-catenin activity in these cells. We also found that repeated episodes of emergency granulopoiesis accelerated progression to acute myeloid leukemia in Icsbp(-/-) mice. This was associated with impaired Fanconi C and F expression and increased sensitivity to DNA damage in bone marrow myeloid progenitors. Our results suggest that impaired Icsbp expression enhances leukemogenesis by deregulating processes that normally limit granulocyte expansion during the innate immune response. PMID:26683374

  2. First full-length genome sequence of the polerovirus luffa aphid-borne yellows virus (LABYV) reveals the presence of at least two consensus sequences in an isolate from Thailand.

    PubMed

    Knierim, Dennis; Maiss, Edgar; Kenyon, Lawrence; Winter, Stephan; Menzel, Wulf

    2015-10-01

    Luffa aphid-borne yellows virus (LABYV) was proposed as the name for a previously undescribed polerovirus based on partial genome sequences obtained from samples of cucurbit plants collected in Thailand between 2008 and 2013. In this study, we determined the first full-length genome sequence of LABYV. Based on phylogenetic analysis and genome properties, it is clear that this virus represents a distinct species in the genus Polerovirus. Analysis of sequences from sample TH24, which was collected in 2010 from a luffa plant in Thailand, reveals the presence of two different full-length genome consensus sequences. PMID:26195192

  3. From Artificial Amino Acids to Sequence-Defined Targeted Oligoaminoamides.

    PubMed

    Morys, Stephan; Wagner, Ernst; Lächelt, Ulrich

    2016-01-01

    Artificial oligoamino acids with appropriate protecting groups can be used for the sequential assembly of oligoaminoamides on solid-phase. With the help of these oligoamino acids multifunctional nucleic acid (NA) carriers can be designed and produced in highly defined topologies. Here we describe the synthesis of the artificial oligoamino acid Fmoc-Stp(Boc3)-OH, the subsequent assembly into sequence-defined oligomers and the formulation of tumor-targeted plasmid DNA (pDNA) polyplexes. PMID:27436323

  4. Detecting frame shifts by amino acid sequence comparison.

    PubMed

    Claverie, J M

    1993-12-20

    Various amino acid substitution scoring matrices are used in conjunction with local alignments programs to detect regions of similarity and infer potential common ancestry between proteins. The usual scoring schemes derive from the implicit hypothesis that related proteins evolve from a common ancestor by the accumulation of point mutations and that amino acids tend to be progressively substituted by others with similar properties. However, other frequent single mutation events, like nucleotide insertion or deletion and gene inversion, change the translation reading frame and cause previously encoded amino acid sequences to become unrecognizable at once. Here, I derive five new types of scoring matrix, each capable of detecting a specific frame shift (deletion, insertion and inversion in 3 frames) and use them with a regular local alignments program to detect amino acid sequences that may have derived from alternative reading frames of the same nucleotide sequence. Frame shifts are inferred from the sole comparison of the protein sequences. The five scoring matrices were used with the BLASTP program to compare all the protein sequences in the Swissprot database. Surprisingly, the searches revealed hundreds of highly significant frame shift matches, of which many are likely to represent sequencing errors. Others provide some evidence that frame shift mutations might be used in protein evolution as a way to create new amino acid sequences from pre-existing coding regions. PMID:7903399

  5. Consensus recommendations on the use of injectable poly-L-lactic acid for facial and nonfacial volumization.

    PubMed

    Vleggaar, Danny; Fitzgerald, Rebecca; Lorenc, Z Paul; Andrews, J Todd; Butterwick, Kimberly; Comstock, Jody; Hanke, C William; O'Daniel, T Gerald; Palm, Melanie D; Roberts, Wendy E; Sadick, Neil; Teller, Craig F

    2014-04-01

    Poly-L-lactic acid (PLLA) was approved for use in Europe in 1999. In the United States, it was approved by the Food and Drug Administration in 2004 for the treatment of facial lipoatrophy associated with human immunodeficiency virus, and in 2009 for cosmetic indications in immune-competent patients. The need for consistent, effective PLLA usage recommendations is heightened by an increased consumer demand for soft tissue augmentation and a shift toward a younger demographic. Over the past 14 years, considerable experience has been gained with this agent, and we have come to better understand the clinical, technical, and mechanistic aspects of PLLA use that need to be considered to optimize patient outcomes. These consensus recommendations regarding patient selection, proper preparation and storage, optimal injection techniques, and other practical considerations reflect the body of evidence in the medical literature, as well as the collective experience of this author group. PMID:24719078

  6. Consensus Report of the 4th International Forum for Gadolinium-Ethoxybenzyl-Diethylenetriamine Pentaacetic Acid Magnetic Resonance Imaging

    PubMed Central

    Zech, Christoph J; Bolondi, Luigi; Jonas, Eduard; Kim, Myeong-Jin; Matsui, Osamu; Merkle, Elmar M.; Sakamoto, Michiie; Choi, Byung Ihn

    2011-01-01

    This paper reports on issues relating to the optimal use of gadolinium-ethoxybenzyl-diethylenetriamine pentaacetic acid magnetic resonance imaging (Gd-EOB-DTPA MR imaging) together with the generation of consensus statements from a working group meeting, which was held in Seoul, Korea (2010). Gd-EOB-DTPA has been shown to improve the detection and characterization of liver lesions, and the information provided by the hepatobiliary phase is proving particularly useful in differential diagnoses and in the characterization of small lesions (around 1-1.5 cm). Discussion also focused on advances in the role of organic anion-transporting polypeptide 8 (OATP8) transporters. Gd-EOB-DTPA is also emerging as a promising tool for functional analysis, enabling the calculation of post-surgical liver function in the remaining segments. Updates to current algorithms were also discussed. PMID:21852900

  7. Segments of amino acid sequence similarity in beta-amylases.

    PubMed

    Friedberg, F; Rhodes, C

    1988-01-01

    In alpha-amylases from animals, plants and bacteria and in beta-amylases from plants and bacteria a number of segments exhibit amino acid sequence similarity specific to the alpha or to the beta type, respectively. In the case of the beta-amylases the similar sequence regions are extensive and they are disrupted only by short interspersed dissimilar regions. Close to the C terminus, however, no such sequence similarity exist. PMID:2464171

  8. Measuring consensus

    SciTech Connect

    Kurstedt, H.A. Jr.; Brubaker, D.M.; Doss, A.R.; Koelling, C.P.

    1989-10-01

    For this paper, I wanted to compare mathematical techniques against group interaction in generating consensus for a ranking decision. I convened a group to come to consensus on ranking items needed for survival on the moon. I chose this problem because NASA has an approved solution. I solicited the group's individual rankings before and after discussion. I used Kendall's coefficient of concordance to measure the level of consensus before and after discussion and compared the results against individual qualitative responses to a questionnaire designed to also measure consensus. The approved solution allowed me to see if group felt more or less in agreement as they moved closer or farther from the approved solution. As background for this experiment, I researched the existing knowledge on measuring consensus. I make a distinction between consensus and successful consensus, define them, and operationalize them for the purposes of this study. I define different levels of consensus which can be reached regardless of the success of the consensus. In this experiment, I determined the interactive discussion produced consensus, but not successful consensus. The mathematical technique produced a ranking closer to the accepted answer than the group discussion did. 15 refs., 1 tab.

  9. Dispelling the North American acid rain clouds: Developing a framework for political consensus through the identification of elite viewpoints

    SciTech Connect

    Bhatti, N.

    1988-01-01

    Acidic deposition has simultaneously been referred to as an environmental curiosity and as an ecological holocaust. This polarization of opinion on this pollutant has resulted in the policy stalemate in Congress over this issue and is responsible for the major part of the friction which currently besets Canada-United States relations. This study identified the distinctive viewpoints which characterize opposing attitudes. In addition, the specific areas of consensus and disagreement among these elite groups were determined. All of these objectives were carried out using the results of the Q-sort technique and interviews with members of the acid rain elite in both Canada and the United States (i.e. politicians, scientists, regulators, environmental/advocacy groups, and industry/utility personnel). Furthermore, a comprehensive, in-depth review of the scientific, legal, economic, social and political aspects of this tissue was conducted. Results show that implementation of the Acid Rain Experimental Control Program (ARECP) and the Clean Coal Technology project has the potential to break the existing stalemates over this issue and, at the same, could avert damage to many ecosystems, man-made structures and human health.

  10. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... acids are not intended to be embraced by this definition. Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated... sequence of four or more amino acids or an unbranched sequence of ten or more nucleotides....

  11. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... acids are not intended to be embraced by this definition. Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated... sequence of four or more amino acids or an unbranched sequence of ten or more nucleotides....

  12. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... acids are not intended to be embraced by this definition. Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated... sequence of four or more amino acids or an unbranched sequence of ten or more nucleotides....

  13. Treatment recommendations in long-chain fatty acid oxidation defects: consensus from a workshop.

    PubMed

    Spiekerkoetter, U; Lindner, M; Santer, R; Grotzke, M; Baumgartner, M R; Boehles, H; Das, A; Haase, C; Hennermann, J B; Karall, D; de Klerk, H; Knerr, I; Koch, H G; Plecko, B; Röschinger, W; Schwab, K O; Scheible, D; Wijburg, F A; Zschocke, J; Mayatepek, E; Wendel, U

    2009-08-01

    Published data on treatment of fatty acid oxidation defects are scarce. Treatment recommendations have been developed on the basis of observations in 75 patients with long-chain fatty acid oxidation defects from 18 metabolic centres in Central Europe. Recommendations are based on expert practice and are suggested to be the basis for further multicentre prospective studies and the development of approved treatment guidelines. Considering that disease complications and prognosis differ between different disorders of long-chain fatty acid oxidation and also depend on the severity of the underlying enzyme deficiency, treatment recommendations have to be disease-specific and depend on individual disease severity. Disorders of the mitochondrial trifunctional protein are associated with the most severe clinical picture and require a strict fat-reduced and fat-modified (medium-chain triglyceride-supplemented) diet. Many patients still suffer acute life-threatening events or long-term neuropathic symptoms despite adequate treatment, and newborn screening has not significantly changed the prognosis for these severe phenotypes. Very long-chain acyl-CoA dehydrogenase deficiency recognized in neonatal screening, in contrast, frequently has a less severe disease course and dietary restrictions in many patients may be loosened. On the basis of the collected data, recommendations are given with regard to the fat and carbohydrate content of the diet, the maximal length of fasting periods and the use of l-carnitine in long-chain fatty acid oxidation defects. PMID:19452263

  14. Predicting the functional consequences of non-synonymous DNA sequence variants--evaluation of bioinformatics tools and development of a consensus strategy.

    PubMed

    Frousios, Kimon; Iliopoulos, Costas S; Schlitt, Thomas; Simpson, Michael A

    2013-10-01

    The study of DNA sequence variation has been transformed by recent advances in DNA sequencing technologies. Determination of the functional consequences of sequence variant alleles offers potential insight as to how genotype may influence phenotype. Even within protein coding regions of the genome, establishing the consequences of variation on gene and protein function is challenging and requires substantial laboratory investigation. However, a series of bioinformatics tools have been developed to predict whether non-synonymous variants are neutral or disease-causing. In this study we evaluate the performance of nine such methods (SIFT, PolyPhen2, SNPs&GO, PhD-SNP, PANTHER, Mutation Assessor, MutPred, Condel and CAROL) and developed CoVEC (Consensus Variant Effect Classification), a tool that integrates the prediction results from four of these methods. We demonstrate that the CoVEC approach outperforms most individual methods and highlights the benefit of combining results from multiple tools. PMID:23831115

  15. A method to find palindromes in nucleic acid sequences.

    PubMed

    Anjana, Ramnath; Shankar, Mani; Vaishnavi, Marthandan Kirti; Sekar, Kanagaraj

    2013-01-01

    Various types of sequences in the human genome are known to play important roles in different aspects of genomic functioning. Among these sequences, palindromic nucleic acid sequences are one such type that have been studied in detail and found to influence a wide variety of genomic characteristics. For a nucleotide sequence to be considered as a palindrome, its complementary strand must read the same in the opposite direction. For example, both the strands i.e the strand going from 5' to 3' and its complementary strand from 3' to 5' must be complementary. A typical nucleotide palindromic sequence would be TATA (5' to 3') and its complimentary sequence from 3' to 5' would be ATAT. Thus, a new method has been developed using dynamic programming to fetch the palindromic nucleic acid sequences. The new method uses less memory and thereby it increases the overall speed and efficiency. The proposed method has been tested using the bacterial (3891 KB bases) and human chromosomal sequences (Chr-18: 74366 kb and Chr-Y: 25554 kb) and the computation time for finding the palindromic sequences is in milli seconds. PMID:23515654

  16. Animal Protection and Structural Studies of a Consensus Sequence VaccineTargeting the Receptor Binding Domain of the Type IV Pilus of Pseudomonas aeruginosa

    PubMed Central

    Kao, Daniel J.; Churchill, Mair E. A.; Irvin, Randall T.; Hodges, Robert S.

    2010-01-01

    One of the main obstacles in the development of a vaccine against Pseudomonas aeruginosa is the requirement that it is protective against a wide range of virulent strains. We have developed a synthetic-peptide consensus-sequence vaccine (Cs1) that targets the host receptor-binding domain (RBD) of the type IV pilus of P. aeruginosa. Here, we show that this vaccine provides increased protection against challenge by the four piliated strains that we have examined (PAK, PAO, KB7 and P1) in the A.BY/SnJ mouse model of acute P. aeruginosa infection. To further characterize the consensus sequence, we engineered Cs1 into the PAK monomeric pilin protein and determined the crystal structure of the chimeric Cs1 pilin to 1.35 Å resolution. The substitutions (T130K and E135P) used to create Cs1 do not disrupt the conserved backbone conformation of the pilin RBD. In fact, based on the Cs1 pilin structure, we hypothesize that the E135P substitution bolsters the conserved backbone conformation and may partially explain the immunological activity of Cs1. Structural analysis of Cs1, PAK and K122-4 pilins reveal substitutions of non-conserved residues in the RBD are compensated for by complementary changes in the rest of the pilin monomer. Thus, the interactions between the RBD and the rest of the pilin can either be mediated by polar interactions of a hydrogen bond network in some strains or by hydrophobic interactions in others. Both configurations maintain a conserved backbone conformation of the RBD. Thus, the backbone conformation is critical in our consensus-sequence vaccine design and that cross-reactivity of the antibody response may be modulated by the composition of exposed side-chains on the surface of the RBD. This structure will guide our future vaccine design by focusing our investigation on the four variable residue positions that are exposed on the RBD surface. PMID:17936788

  17. Animal Protection and Structural Studies of a Consensus Sequence Vaccine Targeting the Receptor Binding Domain of the Type IV Pilus of Pseudomonas aeruginosa

    SciTech Connect

    Kao, Daniel J.; Churchill, Mair E.A.; Irvin, Randall T.; Hodges, Robert S.

    2008-09-23

    One of the main obstacles in the development of a vaccine against Pseudomonas aeruginosa is the requirement that it is protective against a wide range of virulent strains. We have developed a synthetic-peptide consensus-sequence vaccine (Cs1) that targets the host receptor-binding domain (RBD) of the type IV pilus of P. aeruginosa. Here, we show that this vaccine provides increased protection against challenge by the four piliated strains that we have examined (PAK, PAO, KB7 and P1) in the A.BY/SnJ mouse model of acute P. aeruginosa infection. To further characterize the consensus sequence, we engineered Cs1 into the PAK monomeric pilin protein and determined the crystal structure of the chimeric Cs1 pilin to 1.35 {angstrom} resolution. The substitutions (T130K and E135P) used to create Cs1 do not disrupt the conserved backbone conformation of the pilin RBD. In fact, based on the Cs1 pilin structure, we hypothesize that the E135P substitution bolsters the conserved backbone conformation and may partially explain the immunological activity of Cs1. Structural analysis of Cs1, PAK and K122-4 pilins reveal substitutions of non-conserved residues in the RBD are compensated for by complementary changes in the rest of the pilin monomer. Thus, the interactions between the RBD and the rest of the pilin can either be mediated by polar interactions of a hydrogen bond network in some strains or by hydrophobic interactions in others. Both configurations maintain a conserved backbone conformation of the RBD. Thus, the backbone conformation is critical in our consensus-sequence vaccine design and that cross-reactivity of the antibody response may be modulated by the composition of exposed side-chains on the surface of the RBD. This structure will guide our future vaccine design by focusing our investigation on the four variable residue positions that are exposed on the RBD surface.

  18. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

    PubMed

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D; Adir, Noam

    2016-06-28

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel. PMID:27307442

  19. easyPAC: A Tool for Fast Prediction, Testing and Reference Mapping of Degenerate PCR Primers from Alignments or Consensus Sequences

    PubMed Central

    Rosenkranz, David

    2012-01-01

    The PCR-amplification of unknown homologous or paralogous genes generally relies on PCR primers predicted from multi sequence alignments. But increasing sequence divergence can induce the need to use degenerate primers which entails the problem of testing the characteristics, unwanted interactions and potential mispriming of degenerate primers. Here I introduce easyPAC, a new software for the prediction of degenerate primers from multi sequence alignments or single consensus sequences. As a major innovation, easyPAC allows to apply all customary primer test procedures to degenerate primer sequences including fast mapping to reference files. Thus, easyPAC simplifies and expedites the designing of specific degenerate primers enormously. Degenerate primers suggested by easyPAC were used in PCR amplification with subsequent de novo sequencing of TDRD1 exon 11 homologs from several representatives of the haplorrhine primate phylogeny. The results demonstrate the efficient performance of the suggested primers and therefore show that easyPAC can advance upcoming comparative genetic studies.

  20. On Quantum Algorithm for Multiple Alignment of Amino Acid Sequences

    NASA Astrophysics Data System (ADS)

    Iriyama, Satoshi; Ohya, Masanori

    2009-02-01

    The alignment of genome sequences or amino acid sequences is one of fundamental operations for the study of life. Usual computational complexity for the multiple alignment of N sequences with common length L by dynamic programming is O(LN). This alignment is considered as one of the NP problems, so that it is desirable to find a nice algorithm of the multiple alignment. Thus in this paper we propose the quantum algorithm for the multiple alignment based on the works12,1,2 in which the NP complete problem was shown to be the P problem by means of quantum algorithm and chaos information dynamics.

  1. Prebiotically plausible mechanisms increase compositional diversity of nucleic acid sequences

    PubMed Central

    Derr, Julien; Manapat, Michael L.; Rajamani, Sudha; Leu, Kevin; Xulvi-Brunet, Ramon; Joseph, Isaac; Nowak, Martin A.; Chen, Irene A.

    2012-01-01

    During the origin of life, the biological information of nucleic acid polymers must have increased to encode functional molecules (the RNA world). Ribozymes tend to be compositionally unbiased, as is the vast majority of possible sequence space. However, ribonucleotides vary greatly in synthetic yield, reactivity and degradation rate, and their non-enzymatic polymerization results in compositionally biased sequences. While natural selection could lead to complex sequences, molecules with some activity are required to begin this process. Was the emergence of compositionally diverse sequences a matter of chance, or could prebiotically plausible reactions counter chemical biases to increase the probability of finding a ribozyme? Our in silico simulations using a two-letter alphabet show that template-directed ligation and high concatenation rates counter compositional bias and shift the pool toward longer sequences, permitting greater exploration of sequence space and stable folding. We verified experimentally that unbiased DNA sequences are more efficient templates for ligation, thus increasing the compositional diversity of the pool. Our work suggests that prebiotically plausible chemical mechanisms of nucleic acid polymerization and ligation could predispose toward a diverse pool of longer, potentially structured molecules. Such mechanisms could have set the stage for the appearance of functional activity very early in the emergence of life. PMID:22319215

  2. The amino-acid sequence of kangaroo pancreatic ribonuclease.

    PubMed

    Gaastra, W; Welling, G W; Beintema, J J

    1978-05-01

    Red kangaroo (Macropus rufus) ribonuclease was isolated from pancreatic tissue by affinity chromatography. The amino acid sequence was determined by automatic sequencing of overlapping large fragments and by analysis of shorter peptides obtained by digestion with a number of proteolytic enzymes. The polypeptide chain consists of 122 amino acid residues. Compared to other ribonucleases, the N-terminal residue and residue 114 are deleted. In other pancreatic ribonucleases position 114 is occupied by a cis proline residue in an external loop at the surface of the molecule. Other remarkable substitutions are the presence of a tyrosine residue at position 123 instead of a serine which forms a hydrogen bond with the pyrimidine ring of a nucleotide substrate, and a number of hydrophobichydrophilic interchanges in the sequence 51-55, which forms part of an alpha-helix in bovine ribonuclease and exhibits few substitutions in the placental mammals. Kangaroo ribonuclease contains no carbohydrate, although the enzyme possesses a recognition site for carbohydrate attachment in the sequence Asn-Val-Thr (62-64). The enzyme differs at about 35-40% of the positions from all other mammalian pancreatic ribonucleases sequenced to date, which is in agreement with the early divergence between the marsupials and the placental mammals. From fragmentary data a tentative sequence of red-necked wallaby (Macropus rufogriseus) pancreatic ribonuclease has been derived. Eight differences with the kangaroo sequence were found. PMID:658039

  3. beta-Cyclodextrin derivatives as carriers to enhance the antiviral activity of an antisense oligonucleotide directed toward a coronavirus intergenic consensus sequence.

    PubMed

    Abdou, S; Collomb, J; Sallas, F; Marsura, A; Finance, C

    1997-01-01

    The ability of cyclodextrins to enhance the antiviral activity of a phosphodiester oligodeoxynucleotide has been investigated. A 18-mer oligodeoxynucleotide complementary to the initiation region of the mRNA coding for the spike protein and containing the intergenic consensus sequence of an enteric coronavirus has been tested for antiviral action against virus growth in human adenocarcinoma cells. The phosphodiester oligodeoxynucleotide only showed a limited effect on virus growth rate (from 12 to 34% viral inhibition in cells treated with 7.5 to 25 microM oligodeoxynucleotide, respectively, at a multiplicity of infection of 0.1 infectious particle per cell). In the same conditions, the phosphorothioate analogue exhibited stronger antiviral activity, the inhibition increased from 56 to 90%. The inhibitory effect of this analogue was antisense and sequence-specific. Northern blot analysis showed that the sequence-dependent mechanism of action appears to be the inhibition of mRNA transcription. We conclude that the coronavirus intergenic consensus sequence is a good target for an antisense oligonucleotide antiviral action. The properties of the phosphodiester oligonucleotide was improved after its complexation with cyclodextrins. The most important increase of the antiviral activity (90% inhibition) was obtained with only 7.5 microM oligonucleotide complexed to a cyclodextrin derivative, 6-deoxy-6-S-beta-D-galactopyranosyl-6-thio-cyclomalto-heptaose+ ++ in a molar ratio of 1:100. These studies suggest that the use of cyclodextrin derivatives as carrier for phosphodiester oligonucleotides delivery may be an effective method for increasing the therapeutic potential of these compounds in viral infections. PMID:9672621

  4. Homology analyses of the protein sequences of fatty acid synthases from chicken liver, rat mammary gland, and yeast

    SciTech Connect

    Chang, Soo-Ik ); Hammes, G.G. )

    1989-11-01

    Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homologies exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The DADPH-binding dinucleotide folds of the {beta}-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the DADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution.

  5. Chicken interferon consensus sequence-binding protein (ICSBP) and interferon regulatory factor (IRF) 1 genes reveal evolutionary conservation in the IRF gene family.

    PubMed Central

    Jungwirth, C; Rebbert, M; Ozato, K; Degen, H J; Schultz, U; Dawid, I B

    1995-01-01

    Members of the IRF family mediate transcriptional responses to interferons (IFNs) and to virus infection. So far, proteins of this family have been studied only among mammalian species. Here we report the isolation of cDNA clones encoding two members of this family from chicken, interferon consensus sequence-binding protein (ICSBP) and IRF-1. The predicted chicken ICSBP and IRF-1 proteins show high levels of sequence similarity to their corresponding human and mouse counterparts. Sequence identities in the putative DNA-binding domains of chicken and human ICSBP and IRF-1 were 97% and 89%, respectively, whereas the C-terminal regions showed identities of 64% and 51%; sequence relationships with mouse ICSBP and IRF-1 are very similar. Chicken ICSBP was found to be expressed in several embryonic tissues, and both chicken IRF-1 and ICSBP were strongly induced in chicken fibroblasts by IFN treatment, supporting the involvement of these factors in IFN-regulated gene expression. The presence of proteins homologous to mammalian IRF family members, together with earlier observations on the occurrence of functionally homologous IFN-responsive elements in chicken and mammalian genes, highlights the conservation of transcriptional mechanisms in the IFN system, a finding that contrasts with the extensive sequence and functional divergence of the IFNs. Images Fig. 3 Fig. 4 Fig. 5 PMID:7536924

  6. Amino acid sequence of Salmonella typhimurium branched-chain amino acid aminotransferase.

    PubMed

    Feild, M J; Nguyen, D C; Armstrong, F B

    1989-06-13

    The complete amino acid sequence of the subunit of branched-chain amino acid aminotransferase (transaminase B, EC 2.6.1.42) of Salmonella typhimurium was determined. An Escherichia coli recombinant containing the ilvGEDAY gene cluster of Salmonella was used as the source of the hexameric enzyme. The peptide fragments used for sequencing were generated by treatment with trypsin, Staphylococcus aureus V8 protease, endoproteinase Lys-C, and cyanogen bromide. The enzyme subunit contains 308 residues and has a molecular weight of 33,920. To determine the coenzyme-binding site, the pyridoxal 5-phosphate containing enzyme was treated with tritiated sodium borohydride prior to trypsin digestion. Peptide map comparisons with an apoenzyme tryptic digest and monitoring radioactivity incorporation allowed identification of the pyridoxylated peptide, which was then isolated and sequenced. The coenzyme-binding site is the lysyl residue at position 159. The amino acid sequence of Salmonella transaminase B is 97.4% identical with that of Escherichia coli, differing in only eight amino acid positions. Sequence comparisons of transaminase B to other known aminotransferase sequences revealed limited sequence similarity (24-33%) when conserved amino acid substitutions are allowed and alignments were forced to occur on the coenzyme-binding site. PMID:2669973

  7. Amino acid sequence of bovine heart coupling factor 6.

    PubMed Central

    Fang, J K; Jacobs, J W; Kanner, B I; Racker, E; Bradshaw, R A

    1984-01-01

    The amino acid sequence of bovine heart mitochondrial coupling factor 6 (F6) has been determined by automated Edman degradation of the whole protein and derived peptides. Preparations based on heat precipitation and ethanol extraction showed allotypic variation at three positions while material further purified by HPLC yielded only one sequence that also differed by a Phe-Thr replacement at residue 62. The mature protein contains 76 amino acids with a calculated molecular weight of 9006 and a pI of approximately equal to 5, in good agreement with experimentally measured values. The charged amino acids are mainly clustered at the termini and in one section in the middle; these three polar segments are separated by two segments relatively rich in nonpolar residues. Chou-Fasman analysis suggests three stretches of alpha-helix coinciding (or within) the high-charge-density sequences with a single beta-turn at the first polar-nonpolar junction. Comparison of the F6 sequence with those of other proteins did not reveal any homologous structures. PMID:6149548

  8. Towards a consensus Y-chromosomal phylogeny and Y-SNP set in forensics in the next-generation sequencing era.

    PubMed

    Larmuseau, Maarten H D; Van Geystelen, Anneleen; Kayser, Manfred; van Oven, Mannis; Decorte, Ronny

    2015-03-01

    Currently, several different Y-chromosomal phylogenies and haplogroup nomenclatures are presented in scientific literature and at conferences demonstrating the present diversity in Y-chromosomal phylogenetic trees and Y-SNP sets used within forensic and anthropological research. This situation can be ascribed to the exponential growth of the number of Y-SNPs discovered due to mostly next-generation sequencing (NGS) studies. As Y-SNPs and their respective phylogenetic positions are important in forensics, such as for male lineage characterization and paternal bio-geographic ancestry inference, there is a need for forensic geneticists to know how to deal with these newly identified Y-SNPs and phylogenies, especially since these phylogenies are often created with other aims than to carry out forensic genetic research. Therefore, we give here an overview of four categories of currently used Y-chromosomal phylogenies and the associated Y-SNP sets in scientific research in the current NGS era. We compare these categories based on the construction method, their advantages and disadvantages, the disciplines wherein the phylogenetic tree can be used, and their specific relevance for forensic geneticists. Based on this overview, it is clear that an up-to-date reduced tree with a consensus Y-SNP set and a stable nomenclature will be the most appropriate reference resource for forensic research. Initiatives to reach such an international consensus are therefore highly recommended. PMID:25488610

  9. Sequences Of Amino Acids For Human Serum Albumin

    NASA Technical Reports Server (NTRS)

    Carter, Daniel C.

    1992-01-01

    Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.

  10. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  11. A Screen for Dominant Negative Mutants of SEC18 Reveals a Role for the AAA Protein Consensus Sequence in ATP Hydrolysis

    PubMed Central

    Steel, Gregor J.; Harley, Carol; Boyd, Alan; Morgan, Alan

    2000-01-01

    An evolutionarily ancient mechanism is used for intracellular membrane fusion events ranging from endoplasmic reticulum–Golgi traffic in yeast to synaptic vesicle exocytosis in the human brain. At the heart of this mechanism is the core complex of N-ethylmaleimide-sensitive fusion protein (NSF), soluble NSF attachment proteins (SNAPs), and SNAP receptors (SNAREs). Although these proteins are accepted as key players in vesicular traffic, their molecular mechanisms of action remain unclear. To illuminate important structure–function relationships in NSF, a screen for dominant negative mutants of yeast NSF (Sec18p) was undertaken. This involved random mutagenesis of a GAL1-regulated SEC18 yeast expression plasmid. Several dominant negative alleles were identified on the basis of galactose-inducible growth arrest, of which one, sec18-109, was characterized in detail. The sec18-109 phenotype (abnormal membrane trafficking through the biosynthetic pathway, accumulation of a membranous tubular network, growth suppression, increased cell density) is due to a single A-G substitution in SEC18 resulting in a missense mutation in Sec18p (Thr394→Pro). Thr394 is conserved in most AAA proteins and indeed forms part of the minimal AAA consensus sequence that serves as a signature of this large protein family. Analysis of recombinant Sec18-109p indicates that the mutation does not prevent hexamerization or interaction with yeast α-SNAP (Sec17p), but instead results in undetectable ATPase activity that cannot be stimulated by Sec17p. This suggests a role for the AAA protein consensus sequence in regulating ATP hydrolysis. Furthermore, this approach of screening for dominant negative mutants in yeast can be applied to other conserved proteins so as to highlight important functional domains in their mammalian counterparts. PMID:10749934

  12. Amino acid sequence of the Amur tiger prion protein.

    PubMed

    Wu, Changde; Pang, Wanyong; Zhao, Deming

    2006-10-01

    Prion diseases are fatal neurodegenerative disorders in human and animal associated with conformational conversion of a cellular prion protein (PrP(C)) into the pathologic isoform (PrP(Sc)). Various data indicate that the polymorphisms within the open reading frame (ORF) of PrP are associated with the susceptibility and control the species barrier in prion diseases. In the present study, partial Prnp from 25 Amur tigers (tPrnp) were cloned and screened for polymorphisms. Four single nucleotide polymorphisms (T423C, A501G, C511A, A610G) were found; the C511A and A610G nucleotide substitutions resulted in the amino acid changes Lysine171Glutamine and Alanine204Threoine, respectively. The tPrnp amino acid sequence is similar to house cat (Felis catus ) and sheep, but differs significantly from other two cat Prnp sequences that were previously deposited in GenBank. PMID:16780982

  13. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy has extensively been used in physical surface sciences to study quantum tunneling to measure electronic local density of states of nanomaterials and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecule of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique ``electronic fingerprints'' for all nucleotides on DNA and RNA using Q-Seq along their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  14. Amino acid sequence of mouse nidogen, a multidomain basement membrane protein with binding activity for laminin, collagen IV and cells.

    PubMed Central

    Mann, K; Deutzmann, R; Aumailley, M; Timpl, R; Raimondi, L; Yamada, Y; Pan, T C; Conway, D; Chu, M L

    1989-01-01

    The whole amino acid sequence of nidogen was deduced from cDNA clones isolated from expression libraries and confirmed to approximately 50% by Edman degradation of peptides. The protein consists of some 1217 amino acid residues and a 28-residue signal peptide. The data support a previously proposed dumb-bell model of nidogen by demonstrating a large N-terminal globular domain (641 residues), five EGF-like repeats constituting the rod-like domain (248 residues) and a smaller C-terminal globule (328 residues). Two more EGF-like repeats interrupt the N-terminal and terminate the C-terminal sequences. Weak sequence homologies (25%) were detected between some regions of nidogen, the LDL receptor, thyroglobulin and the EGF precursor. Nidogen contains two consensus sequences for tyrosine sulfation and for asparagine beta-hydroxylation, two N-linked carbohydrate acceptor sites and, within one of the EGF-like repeats an Arg-Gly-Asp sequence. The latter was shown to be functional in cell attachment to nidogen. Binding sites for laminin and collagen IV are present on the C-terminal globule but not yet precisely localized. Images PMID:2496973

  15. Correlation between fibroin amino acid sequence and physical silk properties.

    PubMed

    Fedic, Robert; Zurovec, Michal; Sehnal, Frantisek

    2003-09-12

    The fiber properties of lepidopteran silk depend on the amino acid repeats that interact during H-fibroin polymerization. The aim of our research was to relate repeat composition to insect biology and fiber strength. Representative regions of the H-fibroin genes were sequenced and analyzed in three pyralid species: wax moth (Galleria mellonella), European flour moth (Ephestia kuehniella), and Indian meal moth (Plodia interpunctella). The amino acid repeats are species-specific, evidently a diversification of an ancestral region of 43 residues, and include three types of regularly dispersed motifs: modifications of GSSAASAA sequence, stretches of tripeptides GXZ where X and Z represent bulky residues, and sequences similar to PVIVIEE. No concatenations of GX dipeptide or alanine, which are typical for Bombyx silkworms and Antheraea silk moths, respectively, were found. Despite different repeat structure, the silks of G. mellonella and E. kuehniella exhibit similar tensile strength as the Bombyx and Antheraea silks. We suggest that in these latter two species, variations in the repeat length obstruct repeat alignment, but sufficiently long stretches of iterated residues get superposed to interact. In the pyralid H-fibroins, interactions of the widely separated and diverse motifs depend on the precision of repeat matching; silk is strong in G. mellonella and E. kuehniella, with 2-3 types of long homogeneous repeats, and nearly 10 times weaker in P. interpunctella, with seven types of shorter erratic repeats. The high proportion of large amino acids in the H-fibroin of pyralids has probably evolved in connection with the spinning habit of caterpillars that live in protective silk tubes and spin continuously, enlarging the tubes on one end and partly devouring the other one. The silk serves as a depot of energetically rich and essential amino acids that may be scarce in the diet. PMID:12816957

  16. Amino acid sequence of the nonsecretory ribonuclease of human urine.

    PubMed

    Beintema, J J; Hofsteenge, J; Iwama, M; Morita, T; Ohgi, K; Irie, M; Sugiyama, R H; Schieven, G L; Dekker, C A; Glitz, D G

    1988-06-14

    The amino acid sequence of a nonsecretory ribonuclease isolated from human urine was determined except for the identity of the residue at position 7. Sequence information indicates that the ribonucleases of human liver and spleen and an eosinophil-derived neurotoxin are identical or very closely related gene products. The sequence is identical at about 30% of the amino acid positions with those of all of the secreted mammalian ribonucleases for which information is available. Identical residues include active-site residues histidine-12, histidine-119, and lysine-41, other residues known to be important for substrate binding and catalytic activity, and all eight half-cystine residues common to these enzymes. Major differences include a deletion of six residues in the (so-called) S-peptide loop, insertions of two, and nine residues, respectively, in three other external loops of the molecule, and an addition of three residues at the amino terminus. The sequence shows the human nonsecretory ribonuclease to belong to the same ribonuclease superfamily as the mammalian secretory ribonucleases, turtle pancreatic ribonuclease, and human angiogenin. Sequence data suggest that a gene duplication occurred in an ancient vertebrate ancestor; one branch led to the nonsecretory ribonuclease, while the other branch led to a second duplication, with one line leading to the secretory ribonucleases (in mammals) and the second line leading to pancreatic ribonuclease in turtle and an angiogenic factor in mammals (human angiogenin). The nonsecretory ribonuclease has five short carbohydrate chains attached via asparagine residues at the surface of the molecule; these chains may have been shortened by exoglycosidase action.(ABSTRACT TRUNCATED AT 250 WORDS) PMID:3166997

  17. Characterization and amino acid sequence of a fatty acid-binding protein from human heart.

    PubMed

    Offner, G D; Brecher, P; Sawlivich, W B; Costello, C E; Troxler, R F

    1988-05-15

    The complete amino acid sequence of a fatty acid-binding protein from human heart was determined by automated Edman degradation of CNBr, BNPS-skatole [3'-bromo-3-methyl-2-(2-nitrobenzenesulphenyl)indolenine], hydroxylamine, Staphylococcus aureus V8 proteinase, tryptic and chymotryptic peptides, and by digestion of the protein with carboxypeptidase A. The sequence of the blocked N-terminal tryptic peptide from citraconylated protein was determined by collisionally induced decomposition mass spectrometry. The protein contains 132 amino acid residues, is enriched with respect to threonine and lysine, lacks cysteine, has an acetylated valine residue at the N-terminus, and has an Mr of 14768 and an isoelectric point of 5.25. This protein contains two short internal repeated sequences from residues 48-54 and from residues 114-119 located within regions of predicted beta-structure and decreasing hydrophobicity. These short repeats are contained within two longer repeated regions from residues 48-60 and residues 114-125, which display 62% sequence similarity. These regions could accommodate the charged and uncharged moieties of long-chain fatty acids and may represent fatty acid-binding domains consistent with the finding that human heart fatty acid-binding protein binds 2 mol of oleate or palmitate/mol of protein. Detailed evidence for the amino acid sequences of the peptides has been deposited as Supplementary Publication SUP 50143 (23 pages) at the British Library Lending Division, Boston Spa, Yorkshire LS23 7BQ, U.K., from whom copies may be obtained as indicated in Biochem. J. (1988) 249, 5. PMID:3421901

  18. Accurate prediction of protein secondary structure and solvent accessibility by consensus combiners of sequence and structure information

    PubMed Central

    Pollastri, Gianluca; Martin, Alberto JM; Mooney, Catherine; Vullo, Alessandro

    2007-01-01

    Background Structural properties of proteins such as secondary structure and solvent accessibility contribute to three-dimensional structure prediction, not only in the ab initio case but also when homology information to known structures is available. Structural properties are also routinely used in protein analysis even when homology is available, largely because homology modelling is lower throughput than, say, secondary structure prediction. Nonetheless, predictors of secondary structure and solvent accessibility are virtually always ab initio. Results Here we develop high-throughput machine learning systems for the prediction of protein secondary structure and solvent accessibility that exploit homology to proteins of known structure, where available, in the form of simple structural frequency profiles extracted from sets of PDB templates. We compare these systems to their state-of-the-art ab initio counterparts, and with a number of baselines in which secondary structures and solvent accessibilities are extracted directly from the templates. We show that structural information from templates greatly improves secondary structure and solvent accessibility prediction quality, and that, on average, the systems significantly enrich the information contained in the templates. For sequence similarity exceeding 30%, secondary structure prediction quality is approximately 90%, close to its theoretical maximum, and 2-class solvent accessibility roughly 85%. Gains are robust with respect to template selection noise, and significant for marginal sequence similarity and for short alignments, supporting the claim that these improved predictions may prove beneficial beyond the case in which clear homology is available. Conclusion The predictive system are publicly available at the address . PMID:17570843

  19. Molecular cloning and amino acid sequence of human 5-lipoxygenase

    SciTech Connect

    Matsumoto, T.; Funk, C.D.; Radmark, O.; Hoeoeg, J.O.; Joernvall, H.; Samuelsson, B.

    1988-01-01

    5-Lipoxygenase (EC 1.13.11.34), a Ca/sup 2 +/- and ATP-requiring enzyme, catalyzes the first two steps in the biosynthesis of the peptidoleukotrienes and the chemotactic factor leukotriene B/sub 4/. A cDNA clone corresponding to 5-lipoxygenase was isolated from a human lung lambda gt11 expression library by immunoscreening with a polyclonal antibody. Additional clones from a human placenta lambda gt11 cDNA library were obtained by plaque hybridization with the /sup 32/P-labeled lung cDNA clone. Sequence data obtained from several overlapping clones indicate that the composite DNAs contain the complete coding region for the enzyme. From the deduced primary structure, 5-lipoxygenase encodes a 673 amino acid protein with a calculated molecular weight of 77,839. Direct analysis of the native protein and its proteolytic fragments confirmed the deduced composition, the amino-terminal amino acid sequence, and the structure of many internal segments. 5-Lipoxygenase has no apparent sequence homology with leukotriene A/sub 4/ hydrolase or Ca/sup 2 +/-binding proteins. RNA blot analysis indicated substantial amounts of an mRNA species of approx. = 2700 nucleotides in leukocytes, lung, and placenta.

  20. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    DOEpatents

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  1. The amino acid sequence of rabbit muscle triose phosphate isomerase.

    PubMed Central

    Corran, P H; Waley, S G

    1975-01-01

    The amino acid sequence of rabbit muscle triose phosphate isomerase was deduced by characterizing peptides that overlap the tryptic peptides. Thiol groups were modified by oxidation, carboxymethylation or aminoen. About 50 peptides that provided information about overlaps were isolated; the peptides were mostly characterized by their compositions and N-terminal residues. The peptide chains contain 248 amino acid residues, and no evidence for dissimilarity of the two subunits that comprise the native enzyme was found. The sequence of the rabbit muscle enzyme may be compared with that of the coelacanth enzyme (Kolb et al., 1974): 84% of the residues are in identical positions. Similarly, comparison of the sequence with that inferred for the chicken enzyme (Furth et al., 1974) shows that 87% of the residues are in identical positions. Limited though these comparisons are, they suggest that triose phosphate isomerase has one of the lowest rates of evolutionary change. An extended version of the present paper has been deposited as Supplementary Publication SUP 50040 (42 pages) at the British Library (Lending Division) (formerly the National Lending Library for Science and Technology), Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms given in Biochem. J. (1975) 145, 5. PMID:1171682

  2. The amino acid sequence of chymopapain from Carica papaya.

    PubMed Central

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-01-01

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  3. The amino acid sequence of chymopapain from Carica papaya.

    PubMed

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-02-15

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  4. The N-X-S/T consensus sequence is required but not sufficient for bacterial N-linked protein glycosylation.

    PubMed

    Nita-Lazar, Mihai; Wacker, Michael; Schegg, Belinda; Amber, Saba; Aebi, Markus

    2005-04-01

    In the Gram-negative bacterium Campylobacter jejuni there is a pgl (protein glycosylation) locus-dependent general N-glycosylation system of proteins. One of the proteins encoded by pgl locus, PglB, a homolog of the eukaryotic oligosaccharyltransferase component Stt3p, is proposed to function as an oligosaccharyltransferase in this prokaryotic system. The sequence requirements of the acceptor polypeptide for N-glycosylation were analyzed by reverse genetics using the reconstituted glycosylation of the model protein AcrA in Escherichia coli. As in eukaryotes, the N-X-S/T sequon is an essential but not a sufficient determinant for N-linked protein glycosylation. This conclusion was supported by the analysis of a novel C. jejuni glycoprotein, HisJ. Export of the polypeptide to the periplasm was required for glycosylation. Our data support the hypothesis that eukaryotic and bacterial N-linked protein glycosylation are homologous processes. PMID:15574802

  5. Amino acid sequence prerequisites for the formation of cn ions.

    PubMed

    Downard, K M; Biemann, K

    1993-11-01

    Ammo acid sequence prerequisites are described for the formation of c, ions observed in high-energy collision-induced decomposition spectra of peptides. It is shown that the formation of cn ions is promoted by the nature of the amino acid C-terminal to the cleavage site. A propensity for cn cleavage preceding threonine, and to a lesser extent tryptophan, lysine, and serine, is demonstrated where fragmentation is directed N-terminally at these residues. In addition, the nature of the residue N-terminal to the cleavage site is shown to have little effect on cn ion formation. A mechanism for cn ion formation is proposed and its applicability to the results observed is discussed. PMID:24227531

  6. Ultrasensitive nucleic acid sequence detection by single-molecule electrophoresis

    SciTech Connect

    Castro, A; Shera, E.B.

    1996-09-01

    This is the final report of a one-year laboratory-directed research and development project at Los Alamos National Laboratory. There has been considerable interest in the development of very sensitive clinical diagnostic techniques over the last few years. Many pathogenic agents are often present in extremely small concentrations in clinical samples, especially at the initial stages of infection, making their detection very difficult. This project sought to develop a new technique for the detection and accurate quantification of specific bacterial and viral nucleic acid sequences in clinical samples. The scheme involved the use of novel hybridization probes for the detection of nucleic acids combined with our recently developed technique of single-molecule electrophoresis. This project is directly relevant to the DOE`s Defense Programs strategic directions in the area of biological warfare counter-proliferation.

  7. A consensus procedure for predicting the location of alpha-helical transmembrane segments in proteins.

    PubMed

    Parodi, L A; Granatir, C A; Maggiora, G M

    1994-09-01

    To aid in the development of three-dimensional models of membrane-bound proteins, a consensus procedure for predicting alpha-helical transmembrane segments from amino acid sequence is presented. The algorithm combines the results of six individual prediction methods and some basic properties of membrane-spanning helices to obtain a final consensus prediction. Comparison with experiment and several other recently developed methods shows that the consensus procedure performs quite well in comparison to other recent methods. A FORTRAN program has been developed which takes an input file containing an amino acid sequence in one-letter code and outputs a list of the alpha-helical transmembrane segments predicted by the consensus algorithm. PMID:7828069

  8. Interaction of the Heparin-Binding Consensus Sequence of β-Amyloid Peptides with Heparin and Heparin-Derived Oligosaccharides.

    PubMed

    Nguyen, Khanh; Rabenstein, Dallas L

    2016-03-10

    Alzheimer's disease (AD) is characterized by the presence of amyloid plaques in the AD brain. Comprised primarily of the 40- and 42-residue β-amyloid (Aβ) peptides, there is evidence that the heparan sulfate (HS) of heparan sulfate proteoglycans (HSPGs) plays a role in amyloid plaque formation and stability; however, details of the interaction of Aβ peptides with HS are not known. We have characterized the interaction of heparin and heparin-derived oligosaccharides with a model peptide for the heparin- and HS-binding domain of Aβ peptides (Ac-VHHQKLV-NH2; Aβ(12-18)), with mutants of Aβ(12-18), and with additional histidine-containing peptides. The nature of the binding interaction was characterized by NMR, binding constants and other thermodynamic parameters were determined by isothermal titration calorimetry (ITC), and relative binding affinities were determined by heparin affinity chromatography. The binding of Aβ(12-18) by heparin and heparin-derived oligosaccharides is pH-dependent, with the imidazolium groups of the histidine side chains interacting site-specifically within a cleft created by a trisaccharide sequence of heparin, the binding is mediated by electrostatic interactions, and there is a significant entropic contribution to the binding free energy as a result of displacement of Na(+) ions from heparin upon binding of cationic Aβ(12-18). The binding constant decreases as the size of the heparin-derived oligosaccharide decreases and as the concentration of Na(+) ion in the bulk solution increases. Structure-binding relationships characterized in this study are analyzed and discussed in terms of the counterion condensation theory of the binding of cationic peptides by anionic polyelectrolytes. PMID:26872053

  9. Minimal residual disease quantification using consensus primers and high-throughput IGH sequencing predicts post-transplant relapse in chronic lymphocytic leukemia

    PubMed Central

    Logan, A C; Zhang, B; Narasimhan, B; Carlton, V; Zheng, J; Moorhead, M; Krampf, M R; Jones, C D; Waqar, A N; Faham, M; Zehnder, J L; Miklos, D B

    2013-01-01

    Quantification of minimal residual disease (MRD) following allogeneic hematopoietic cell transplantation (allo-HCT) predicts post-transplant relapse in patients with chronic lymphocytic leukemia (CLL). We utilized an MRD-quantification method that amplifies immunoglobulin heavy chain (IGH) loci using consensus V and J segment primers followed by high-throughput sequencing (HTS), enabling quantification with a detection limit of one CLL cell per million mononuclear cells. Using this IGH–HTS approach, we analyzed MRD patterns in over 400 samples from 40 CLL patients who underwent reduced-intensity allo-HCT. Nine patients relapsed within 12 months post-HCT. Of the 31 patients in remission at 12 months post-HCT, disease-free survival was 86% in patients with MRD <10−4 and 20% in those with MRD ⩾10−4 (relapse hazard ratio (HR) 9.0; 95% confidence interval (CI) 2.5–32; P<0.0001), with median follow-up of 36 months. Additionally, MRD predicted relapse at other time points, including 9, 18 and 24 months post-HCT. MRD doubling time <12 months with disease burden ⩾10−5 was associated with relapse within 12 months of MRD assessment in 50% of patients, and within 24 months in 90% of patients. This IGH–HTS method may facilitate routine MRD quantification in clinical trials. PMID:23419792

  10. Phenotypic comparisons of consensus variants versus laboratory resurrections of Precambrian proteins.

    PubMed

    Risso, Valeria A; Gavira, Jose A; Gaucher, Eric A; Sanchez-Ruiz, Jose M

    2014-06-01

    Consensus-sequence engineering has generated protein variants with enhanced stability, and sometimes, with modulated biological function. Consensus mutations are often interpreted as the introduction of ancestral amino acid residues. However, the precise relationship between consensus engineering and ancestral protein resurrection is not fully understood. Here, we report the properties of proteins encoded by consensus sequences derived from a multiple sequence alignment of extant, class A β-lactamases, as compared with the properties of ancient Precambrian β-lactamases resurrected in the laboratory. These comparisons considered primary sequence, secondary, and tertiary structure, as well as stability and catalysis against different antibiotics. Out of the three consensus variants generated, one could not be expressed and purified (likely due to misfolding and/or low stability) and only one displayed substantial stability having substrate promiscuity, although to a lower extent than ancient β-lactamases. These results: (i) highlight the phenotypic differences between consensus variants and laboratory resurrections of ancestral proteins; (ii) question interpretations of consensus proteins as phenotypic proxies of ancestral proteins; and (iii) support the notion that ancient proteins provide a robust approach toward the preparation of protein variants having large numbers of mutational changes while possessing unique biomolecular properties. PMID:24710963

  11. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... in the sequence. (4) The enumeration of amino acids may start at the first amino acid of the first..., counting backwards starting with the amino acid next to number 1. Otherwise, the enumeration of amino acids... sequence every 5 amino acids. The enumeration method for amino acid sequences that is set forth......

  12. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... in the sequence. (4) The enumeration of amino acids may start at the first amino acid of the first..., counting backwards starting with the amino acid next to number 1. Otherwise, the enumeration of amino acids... sequence every 5 amino acids. The enumeration method for amino acid sequences that is set forth......

  13. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... in the sequence. (4) The enumeration of amino acids may start at the first amino acid of the first..., counting backwards starting with the amino acid next to number 1. Otherwise, the enumeration of amino acids... sequence every 5 amino acids. The enumeration method for amino acid sequences that is set forth......

  14. Predicting protein disorder by analyzing amino acid sequence

    PubMed Central

    Yang, Jack Y; Yang, Mary Qu

    2008-01-01

    Background Many protein regions and some entire proteins have no definite tertiary structure, presenting instead as dynamic, disorder ensembles under different physiochemical circumstances. These proteins and regions are known as Intrinsically Unstructured Proteins (IUP). IUP have been associated with a wide range of protein functions, along with roles in diseases characterized by protein misfolding and aggregation. Results Identifying IUP is important task in structural and functional genomics. We exact useful features from sequences and develop machine learning algorithms for the above task. We compare our IUP predictor with PONDRs (mainly neural-network-based predictors), disEMBL (also based on neural networks) and Globplot (based on disorder propensity). Conclusion We find that augmenting features derived from physiochemical properties of amino acids (such as hydrophobicity, complexity etc.) and using ensemble method proved beneficial. The IUP predictor is a viable alternative software tool for identifying IUP protein regions and proteins. PMID:18831799

  15. A Research and Discussion Note: The Macrostructure of Consensus Statements

    ERIC Educational Resources Information Center

    Mungra, Philippa

    2007-01-01

    This research note presents a preliminary study of the structure of consensus statements (CSs). The consensus statement is released by a medical association after calling a consensus development conference on a pertinent medical issue. Using a very small corpus, this note attempts to characterize consensus statements by identifying the sequence of…

  16. Interferon Consensus Sequence Binding Protein–deficient Mice Display Impaired Resistance to Intracellular Infection Due to a Primary Defect in Interleukin 12 p40 Induction

    PubMed Central

    Scharton-Kersten, Tanya; Contursi, Cristina; Masumi, Atsuko; Sher, Alan; Ozato, Keiko

    1997-01-01

    Mice lacking the transcription factor interferon consensus sequence binding protein (ICSBP), a member of the interferon regulatory factor family of transcription proteins, were infected with the intracellular protozoan, Toxoplasma gondii. ICSBP-deficient mice exhibited unchecked parasite replication in vivo and rapidly succumbed within 14 d after inoculation with an avirulent Toxoplasma strain. In contrast, few intracellular parasites were observed in wild-type littermates and these animals survived for at least 60 d after infection. Analysis of cytokine synthesis in vitro and in vivo revealed a major deficiency in the expression of both interferon (IFN)-γ and interleukin (IL)-12 p40 in the T. gondii exposed ICSBP−/− animals. In related experiments, macrophages from uninfected ICSBP−/− mice were shown to display a selective impairment in the mRNA expression of IL-12 p40 but not IL-1α, IL-1β, IL-1Ra, IL-6, IL-10, or TNF-α in response to live parasites, parasite antigen, lipopolysaccharide, or Staphylococcus aureus. This selective defect in IL-12 p40 production was observed regardless of whether the macrophages had been primed with IFN-γ. We hypothesize that the impaired synthesis of IL-12 p40 in ICSBP−/− animals is the primary lesion responsible for the loss in resistance to T. gondii because IFN-γ–induced parasite killing was unimpaired in vitro and, more importantly, administration of exogenous IL-12 in vivo significantly prolonged survival of the infected mice. Together these findings implicate ICSBP as a major transcription factor which directly or indirectly regulates IL-12 p40 gene activation and, as a consequence, IFN-γ–dependent host resistance. PMID:9348310

  17. Interferon (IFN) Consensus Sequence-binding Protein, a Transcription Factor of the IFN Regulatory Factor Family, Regulates Immune Responses In Vivo through Control of Interleukin 12 Expression

    PubMed Central

    Giese, Nathalia A.; Gabriele, Lucia; Doherty, T. Mark; Klinman, Dennis M.; Tadesse-Heath, Lekidelu; Contursi, Christina; Epstein, Suzanne L.; Morse, Herbert C.

    1997-01-01

    Mice with a null mutation of the gene encoding interferon consensus sequence-binding protein (ICSBP) develop a chronic myelogenous leukemia-like syndrome and mount impaired responses to certain viral and bacterial infections. To gain a mechanistic understanding of the contributions of ICSBP to humoral and cellular immunity, we characterized the responses of control and ICSBP−/− mice to infection with influenza A (flu) and Leishmania major (L. major). Mice of both genotypes survived infections with flu, but differed markedly in the isotype distribution of antiflu antibodies. In sera of normal mice, immunoglobulin (Ig)G2a antibodies were dominant over IgG1 antibodies, a pattern indicative of a T helper cell type 1 (Th1)-driven response. In sera of ICSBP−/− mice, however, IgG1 antibodies dominated over IgG2a antibodies, a pattern indicative of a Th2-driven response. The dominance of IgG1 and IgE over IgG2a was detected in the sera of uninfected mice as well. A seeming Th2 bias of ICSBP-deficient mice was also uncovered in their inability to control infection with L. major, where resistance is known to be dependent on IL-12 and IFN-γ as components of a Th1 response. Infected ICSBP-deficient mice developed fulminant, disseminated leishmaniasis as a result of failure to mount a Th1-mediated curative response, although T cells remained capable of secreting IFN-γ and macrophages of producing nitric oxide. Compromised Th1 differentiation in ICSBP−/− mice could not be attributed to hyporesponsiveness of CD4+ T cells to interleukin (IL)-12; however, the ability of uninfected and infected ICSBP-deficient mice to produce IL-12 was markedly impaired. This indicates that ICSBP is a deciding factor in Th responses governing humoral and cellular immunity through its role in regulating IL-12 expression. PMID:9348311

  18. Structural gene and complete amino acid sequence of Pseudomonas aeruginosa IFO 3455 elastase.

    PubMed Central

    Fukushima, J; Yamamoto, S; Morihara, K; Atsumi, Y; Takeuchi, H; Kawamoto, S; Okuda, K

    1989-01-01

    The DNA encoding the elastase of Pseudomonas aeruginosa IFO 3455 was cloned, and its complete nucleotide sequence was determined. When the cloned gene was ligated to pUC18, the Escherichia coli expression vector, bacteria carrying the gene exhibited high levels of both elastase activity and elastase antigens. The amino acid sequence, deduced from the nucleotide sequence, revealed that the mature elastase consisted of 301 amino acids with a relative molecular mass of 32,926 daltons. The amino acid composition predicted from the DNA sequence was quite similar to the chemically determined composition of purified elastase reported previously. We also observed nucleotide sequence encoding a signal peptide and "pro" sequence consisting of 197 amino acids upstream from the mature elastase protein gene. The amino acid sequence analysis revealed that both the N-terminal sequence of the purified elastase and the N-terminal side sequences of the C-terminal tryptic peptide as well as the internal lysyl peptide fragment were completely identical to the deduced amino acid sequences. The pattern of identity of amino acid sequences was quite evident in the regions that include structurally and functionally important residues of Bacillus subtilis thermolysin. PMID:2493453

  19. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Foley, B.; Korber, B.; Mellors, J.W.; Jeang, K.T.; Wain-Hobson, S.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nuclear Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  20. Evolution of a "conserved" amino acid sequence: a model study of an in silico investigation of the phylogenesis of some immune receptors.

    PubMed

    Panaro, M A; Acquafredda, A; Sisto, M; Lisi, S; Saccia, M; Mitolo, V

    2006-01-01

    In this paper we analyze a 55-amino acid (aa) sequence which is relatively well conserved in several seven-transmembrane receptor families (from Insects to Mammals) and in some Viruses. This sequence, which covers the second transmembrane domain, the first extracellular loop and the third transmembrane domain, appears in its complete configuration in most of the seven-transmembrane receptor families, as well as in the protein products of some viruses. Other seven-transmembrane receptors and viruses exhibit reduced configurations of the conserved sequence, lacking either aa 31 or aa 30-31. 53-aa configurations are typically found in most chemokine receptor (CKR) subfamilies, as well as in some viral protein products. However, the CCR1, CCR3, and CCR6 subfamilies comprise a 54-aa configuration and the CKR-related protein products, ChemR23 and RDC1, include the complete 55-aa sequence. For each CKR subfamily the "modal sequence" of the conserved segment was constructed by selecting the most frequently occurring aa at each position. Then, pairwise alignments were made between: (i) the modal CKR sequences, and (ii) the sequence (53-aa) of the Yaba-like disease virus - 7L protein. From the alignments two consensus matrices were derived: (i) the consensus 1 matrix with reference to the whole conserved segment, and (ii) the consensus 2 matrix with reference to aa 22-29, which appear to be the most variable segment of the sequence. Based on the obtained consensus values and with reference to this specific conserved segment, the following conclusions are proposed: (1) ChemR23 and RDC1 are probably the more primitive CKR forms; (2) CCR1 and CCR3 may be grouped in a single cluster; (3) CCRs 2, 4, and 5 are closely related to each other and may be grouped in a cluster; CCR7 is likely to be evolutionarily related to this cluster; (4) CXCRs 2, 3, and 4 and CCX CKR appear to be evolutionarily related to each other and very likely derived from an CCR6-like gene; (5) CCR2/4/5 and

  1. Natural vs. random protein sequences: Discovering combinatorics properties on amino acid words.

    PubMed

    Santoni, Daniele; Felici, Giovanni; Vergni, Davide

    2016-02-21

    Casual mutations and natural selection have driven the evolution of protein amino acid sequences that we observe at present in nature. The question about which is the dominant force of proteins evolution is still lacking of an unambiguous answer. Casual mutations tend to randomize protein sequences while, in order to have the correct functionality, one expects that selection mechanisms impose rigid constraints on amino acid sequences. Moreover, one also has to consider that the space of all possible amino acid sequences is so astonishingly large that it could be reasonable to have a well tuned amino acid sequence indistinguishable from a random one. In order to study the possibility to discriminate between random and natural amino acid sequences, we introduce different measures of association between pairs of amino acids in a sequence, and apply them to a dataset of 1047 natural protein sequences and 10,470 random sequences, carefully generated in order to preserve the relative length and amino acid distribution of the natural proteins. We analyze the multidimensional measures with machine learning techniques and show that, to a reasonable extent, natural protein sequences can be differentiated from random ones. PMID:26656109

  2. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza

    PubMed Central

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  3. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza.

    PubMed

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  4. Matrix genes of measles virus and canine distemper virus: cloning, nucleotide sequences, and deduced amino acid sequences.

    PubMed Central

    Bellini, W J; Englund, G; Richardson, C D; Rozenblatt, S; Lazzarini, R A

    1986-01-01

    The nucleotide sequences encoding the matrix (M) proteins of measles virus (MV) and canine distemper virus (CDV) were determined from cDNA clones containing these genes in their entirety. In both cases, single open reading frames specifying basic proteins of 335 amino acid residues were predicted from the nucleotide sequences. Both viral messages were composed of approximately 1,450 nucleotides and contained 400 nucleotides of presumptive noncoding sequences at their respective 3' ends. MV and CDV M-protein-coding regions were 67% homologous at the nucleotide level and 76% homologous at the amino acid level. Only chance homology was observed in the 400-nucleotide trailer sequences. Comparisons of the M protein sequences of MV and CDV with the sequence reported for Sendai virus (B. M. Blumberg, K. Rose, M. G. Simona, L. Roux, C. Giorgi, and D. Kolakofsky, J. Virol. 52:656-663; Y. Hidaka, T. Kanda, K. Iwasaki, A. Nomoto, T. Shioda, and H. Shibuta, Nucleic Acids Res. 12:7965-7973) indicated the greatest homology among these M proteins in the carboxyterminal third of the molecule. Secondary-structure analyses of this shared region indicated a structurally conserved, hydrophobic sequence which possibly interacted with the lipid bilayer. Images PMID:3754588

  5. Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2000-01-01

    A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.

  6. The highly conserved amino acid sequence motif Tyr-Gly-Asp-Thr-Asp-Ser in alpha-like DNA polymerases is required by phage phi 29 DNA polymerase for protein-primed initiation and polymerization.

    PubMed Central

    Bernad, A; Lázaro, J M; Salas, M; Blanco, L

    1990-01-01

    The alpha-like DNA polymerases from bacteriophage phi 29 and other viruses, prokaryotes and eukaryotes contain an amino acid consensus sequence that has been proposed to form part of the dNTP binding site. We have used site-directed mutants to study five of the six highly conserved consecutive amino acids corresponding to the most conserved C-terminal segment (Tyr-Gly-Asp-Thr-Asp-Ser). Our results indicate that in phi 29 DNA polymerase this consensus sequence, although irrelevant for the 3'----5' exonuclease activity, is essential for initiation and elongation. Based on these results and on its homology with known or putative metal-binding amino acid sequences, we propose that in phi 29 DNA polymerase the Tyr-Gly-Asp-Thr-Asp-Ser consensus motif is part of the dNTP binding site, involved in the synthetic activities of the polymerase (i.e., initiation and polymerization), and that it is involved particularly in the metal binding associated with the dNTP site. Images PMID:2191296

  7. Partial amino acid sequence of human factor D:homology with serine proteases.

    PubMed Central

    Volanakis, J E; Bhown, A; Bennett, J C; Mole, J E

    1980-01-01

    Human factor D purified to homogeneity by a modified procedure was subjected to NH2-terminal amino acid sequence analysis by using a modified automated Beckman sequencer. We identified 48 of the first 57 NH2-terminal amino acids in a single sequencer run, using microgram quantities of factor D. The deduced amino acid sequence represents approximately 25% of the primary structure of factor D. This extended NH2-terminal amino acid sequence of factor D was compared to that of other trypsin-related serine proteases. By visual inspection, strong homologies (33--50% identity) were observed with all the serine proteases included in the comparison. Interestingly, factor D showed a higher degree of homology to serine proteases of pancreatic origin than to those of serum origin. Images PMID:6987665

  8. Amino acid sequence of Japanese quail (Coturnix japonica) and northern bobwhite (Colinus virginianus) myoglobin.

    PubMed

    Goodson, John; Beckstead, Robert B; Payne, Jason; Singh, Rakesh K; Mohan, Anand

    2015-08-15

    Myoglobin has an important physiological role in vertebrates, and as the primary sarcoplasmic pigment in meat, influences quality perception and consumer acceptability. In this study, the amino acid sequences of Japanese quail and northern bobwhite myoglobin were deduced by cDNA cloning of the coding sequence from mRNA. Japanese quail myoglobin was isolated from quail cardiac muscles, purified using ammonium sulphate precipitation and gel-filtration, and subjected to multiple enzymatic digestions. Mass spectrometry corroborated the deduced protein amino acid sequence at the protein level. Sequence analysis revealed both species' myoglobin structures consist of 153 amino acids, differing at only three positions. When compared with chicken myoglobin, Japanese quail showed 98% sequence identity, and northern bobwhite 97% sequence identity. The myoglobin in both quail species contained eight histidine residues instead of the nine present in chicken and turkey. PMID:25794748

  9. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.

  10. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-03-24

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.

  11. tax and rex Sequences of bovine leukaemia virus from globally diverse isolates: rex amino acid sequence more variable than tax.

    PubMed

    McGirr, K M; Buehring, G C

    2005-02-01

    Bovine leukaemia virus (BLV) is an important agricultural problem with high costs to the dairy industry. Here, we examine the variation of the tax and rex genes of BLV. The tax and rex genes share 420 bases and have overlapping reading frames. The tax gene encodes a protein that functions as a transactivator of the BLV promoter, is required for viral replication, acts on cellular promoters, and is responsible for oncogenesis. The rex facilitates the export of viral mRNAs from the nucleus and regulates transcription. We have sequenced five new isolates of the tax/rex gene. We examined the five new and three previously published tax/rex DNA and predicted amino acid sequences of BLV isolates from cattle in representative regions worldwide. The highest variation among nucleic acid sequences for tax and rex was 7% and 5%, respectively; among predicted amino acid sequences for Tax and Rex, 9% and 11%, respectively. Significantly more nucleotide changes resulted in predicted amino acid changes in the rex gene than in the tax gene (P < or = 0.0006). This variability is higher than previously reported for any region of the viral genome. This research may also have implications for the development of Tax-based vaccines. PMID:15702995

  12. The amino acid sequence of protein CM-3 from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J

    1985-01-01

    Protein CM-3 from Dendroaspis polylepis polylepis venom was purified by gel filtration and ion exchange chromatography. It comprises 65 amino acids including eight half-cystines. The complete amino acid sequence of protein CM-3 has been elucidated. The sequence (residues 1-50) resembles that of the N-terminal sequence of the subunits of a synergistic type protein and residues 51-65 that of the C-terminal sequence of an angusticeps type protein. Mixtures of protein CM-3 and angusticeps type proteins showed no apparent synergistic effect, in that their toxicity in combination was no greater than the sum of their individual toxicities. PMID:4029488

  13. Three-dimensional structure and mimetic-membrane association of consensus 11-amino-acid motif from soybean LEA3 protein.

    PubMed

    Xue, Rong; Liu, Yun; Zheng, Yizhi; Wu, Yijie; Li, Xiaojing; Pei, Fengkui; Ni, Jiazuan

    2012-01-01

    The occurrence of a highly conserved 11-mer repeating motif in the primary sequence is a major characteristic of group 3 late embryogenesis abundant (LEA3) proteins, which are strongly associated with abiotic stress tolerance of the plants. In this study, the three-dimensional structure, mimetic membrane association, and salt effect for consensus 11-mer motif from soybean PM2 protein (LEA3) were investigated in sodium dodecyl sulfate (SDS) micelles by NMR techniques. It was shown that the 11-mer motif was disordered in aqueous solution, but adopted an α-helix in SDS micelles. NMR diffusion measurements demonstrated that the 11-mer motif was associated with SDS micelles. Paramagnetic quenching NMR experiments further revealed the orientation of the 11-mer motif with respect to the mimetic membrane: the ordered N-terminal segment was inserted into the mimetic membrane, and the disordered C-terminal segment was exposed to water. In addition, salt addition could not change the secondary structure of the 11-mer motif, but might slightly alter the relative spatial position of some N-terminal residue atoms. These results implied that the 11-mer motif would take an important role in structural plasticity and membrane stabilization for LEA3 proteins. PMID:23325560

  14. Nucleotide sequences and characterization of liv genes encoding components of the high-affinity branched-chain amino acid transport system in Salmonella typhimurium.

    PubMed

    Matsubara, K; Ohnishi, K; Kiritani, K

    1992-07-01

    A 7.6-kb fragment of Salmonella typhimurium LT2 containing the liv gene cluster, which specifies the high-affinity branched-chain amino acid transport system (LIV-I), has been isolated. The upstream region contains the livB and livC genes encoding the leucine-isoleucine-valine-threonine and leucine-specific binding proteins, respectively. In this study, the nucleotide sequence of the 4-kb downstream segment was determined and found to contain four reading frames, designated as livA, livE, livF, and livG, that encode putative membrane-associated proteins. The livA and livE genes encode hydrophobic proteins composed of 308 and 425 amino acid residues, respectively. The livF and livG genes encode hydrophilic proteins of 255 and 237 amino acids, respectively; both the proteins contain consensus amino acid sequences found in proteins with ATP-binding sites. These four genes linked together have a potential rho-independent transcriptional terminator adjacent to the 3'-end of livG. No promoter sequence was found in the immediate upstream region of the livAEFG cluster. The livA, livE, livF, and livG gene products were identified as proteins with apparent M(r)s of 25,500, 34,500, 28,000, and 26,000, respectively, by SDS-polyacryl-amide gel electrophoresis. The deduced amino acid sequences of these four proteins showed strong homology to those of the corresponding membrane-associated proteins required for the high-affinity branched-chain amino acid transport systems from both Escherichia coli and Pseudomonas aeruginosa. PMID:1429514

  15. Computer Simulation of the Determination of Amino Acid Sequences in Polypeptides

    ERIC Educational Resources Information Center

    Daubert, Stephen D.; Sontum, Stephen F.

    1977-01-01

    Describes a computer program that generates a random string of amino acids and guides the student in determining the correct sequence of a given protein by using experimental analytic data for that protein. (MLH)

  16. Characterization of mouse cellular deoxyribonucleic acid homologous to Abelson murine leukemia virus-specific sequences.

    PubMed Central

    Dale, B; Ozanne, B

    1981-01-01

    The genome of Abelson murine leukemia virus (A-MuLV) consists of sequences derived from both BALB/c mouse deoxyribonucleic acid and the genome of Moloney murine leukemia virus. Using deoxyribonucleic acid linear intermediates as a source of retroviral deoxyribonucleic acid, we isolated a recombinant plasmid which contained 1.9 kilobases of the 3.5-kilobase mouse-derived sequences found in A-MuLV (A-MuLV-specific sequences). We used this clone, designated pSA-17, as a probe restriction enzyme and Southern blot analyses to examine the arrangement of homologous sequences in BALB/c deoxyribonucleic acid (endogenous Abelson sequences). The endogenous Abelson sequences within the mouse genome were interrupted by noncoding regions, suggesting that a rearrangement of the cell sequences was required to produce the sequence found in the virus. Endogenous Abelson sequences were arranged similarly in mice that were susceptible to A-MuLV tumors and in mice that were resistant to A-MuLV tumors. An examination of three BALB/c plasmacytomas and a BALB/c early B-cell tumor likewise revealed no alteration in the arrangement of the endogenous Abelson sequences. Homology to pSA-17 was also observed in deoxyribonucleic acids prepared from rat, hamster, chicken, and human cells. An isolate of A-MuLV which encoded a 160,000-dalton transforming protein (P160) contained 700 more base pairs of mouse sequences than the standard A-MuLV isolate, which encoded a 120,000-dalton transforming protein (P120). Images PMID:9279386

  17. The amino acid sequence of monal pheasant lysozyme and its activity.

    PubMed

    Araki, T; Matsumoto, T; Torikata, T

    1998-10-01

    The amino acid sequence of monal pheasant lysozyme and its activity were analyzed. Carboxymethylated lysozyme was digested with trypsin and the resulting peptides were sequenced. The established amino acid sequence had one amino acid substitution at position 102 (Arg to Gly) comparing with Indian peafowl lysozyme and four amino acid substitutions at positions 3 (Phe to Tyr), 15 (His to Leu), 41 (Gln to His), and 121 (Gln to His) with chicken lysozyme. Analysis of the time-courses of reaction using N-acetylglucosamine pentamer as a substrate showed a difference of binding free energy change (-0.4 kcal/mol) at subsites A between monal pheasant and Indian peafowl lysozyme. This was assumed to be caused by the amino acid substitution at subsite A with loss of a positive charge at position 102 (Arg102 to Gly). PMID:9836434

  18. Studies on monotreme proteins. VII. Amino acid sequence of myoglobin from the platypus, Ornithoryhynchus anatinus.

    PubMed

    Fisher, W K; Thompson, E O

    1976-03-01

    Myoglobin isolated from skeletal muscle of the platypus contains 153 amino acid residues. The complete amino acid sequence has been determined following cleavage with cyanogen bromide and further digestion of the four fragments with trypsin, chymotrypsin, pepsin and thermolysin. Sequences of the purified peptides were determined by the dansyl-Edman procedure. The amino acid sequence showed 25 differences from human myoglobin and 24 from kangaroo myoglobin. Amino acid sequences in myoglobins are more conserved than sequences in the alpha- and beta-globin chains, and platypus myoglobin shows a similar number of variations in sequence to kangaroo myoglobin when compared with myoglobin of other species. The date of divergence of the platypus from other mammals was estimated at 102 +/- 31 million years, based on the number of amino acid differences between species and allowing for mutations during the evolutionary period. This estimate differs widely from the estimate given by similar treatment of the alpha- and beta-chain sequences and a constant rate of mutation of globin chains is not supported. PMID:962722

  19. cDNA-derived amino acid sequences of myoglobins from nine species of whales and dolphins.

    PubMed

    Iwanami, Kentaro; Mita, Hajime; Yamamoto, Yasuhiko; Fujise, Yoshihiro; Yamada, Tadasu; Suzuki, Tomohiko

    2006-10-01

    We determined the myoglobin (Mb) cDNA sequences of nine cetaceans, of which six are the first reports of Mb sequences: sei whale (Balaenoptera borealis), Bryde's whale (Balaenoptera edeni), pygmy sperm whale (Kogia breviceps), Stejneger's beaked whale (Mesoplodon stejnegeri), Longman's beaked whale (Indopacetus pacificus), and melon-headed whale (Peponocephala electra), and three confirm the previously determined chemical amino acid sequences: sperm whale (Physeter macrocephalus), common minke whale (Balaenoptera acutorostrata) and pantropical spotted dolphin (Stenella attenuata). We found two types of Mb in the skeletal muscle of pantropical spotted dolphin: Mb I with the same amino acid sequence as that deposited in the protein database, and Mb II, which differs at two amino acid residues compared with Mb I. Using an alignment of the amino acid or cDNA sequences of cetacean Mb, we constructed a phylogenetic tree by the NJ method. Clustering of cetacean Mb amino acid and cDNA sequences essentially follows the classical taxonomy of cetaceans, suggesting that Mb sequence data is valid for classification of cetaceans at least to the family level. PMID:16962803

  20. Consensus among Economists Revisited.

    ERIC Educational Resources Information Center

    Fuller, Dan; Geide-Stevenson, Doris

    2003-01-01

    Explores consensus among economists on specific propositions on the basis of a fall 2000 survey of American Economic Association members. Finds consensus generally within the profession, although the degree of consensus varies among propositions that are international, macroeconomic, and microeconomic in nature. States the profession displays…

  1. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome.

    PubMed

    Pinto, Ameet J; Sharp, Jonathan O; Yoder, Michael J; Almstrand, Robert

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome. PMID:26769942

  2. Draft Genome Sequences of Two Novel Acidimicrobiaceae Members from an Acid Mine Drainage Biofilm Metagenome

    PubMed Central

    Pinto, Ameet J.; Sharp, Jonathan O.; Yoder, Michael J.

    2016-01-01

    Bacteria belonging to the family Acidimicrobiaceae are frequently encountered in heavy metal-contaminated acidic environments. However, their phylogenetic and metabolic diversity is poorly resolved. We present draft genome sequences of two novel and phylogenetically distinct Acidimicrobiaceae members assembled from an acid mine drainage biofilm metagenome. PMID:26769942

  3. Two distinct ferredoxins from Rhodobacter capsulatus: complete amino acid sequences and molecular evolution.

    PubMed

    Saeki, K; Suetsugu, Y; Yao, Y; Horio, T; Marrs, B L; Matsubara, H

    1990-09-01

    Two distinct ferredoxins were purified from Rhodobacter capsulatus SB1003. Their complete amino acid sequences were determined by a combination of protease digestion, BrCN cleavage and Edman degradation. Ferredoxins I and II were composed of 64 and 111 amino acids, respectively, with molecular weights of 6,728 and 12,549 excluding iron and sulfur atoms. Both contained two Cys clusters in their amino acid sequences. The first cluster of ferredoxin I and the second cluster of ferredoxin II had a sequence, CxxCxxCxxxCP, in common with the ferredoxins found in Clostridia. The second cluster of ferredoxin I had a sequence, CxxCxxxxxxxxCxxxCM, with extra amino acids between the second and third Cys, which has been reported for other photosynthetic bacterial ferredoxins and putative ferredoxins (nif-gene products) from nitrogen-fixing bacteria, and with a unique occurrence of Met. The first cluster of ferredoxin II had a CxxCxxxxCxxxCP sequence, with two additional amino acids between the second and third Cys, a characteristics feature of Azotobacter-[3Fe-4S] [4Fe-4S]-ferredoxin. Ferredoxin II was also similar to Azotobacter-type ferredoxins with an extended carboxyl (C-) terminal sequence compared to the common Clostridium-type. The evolutionary relationship of the two together with a putative one recently found to be encoded in nifENXQ region in this bacterium [Moreno-Vivian et al. (1989) J. Bacteriol. 171, 2591-2598] is discussed. PMID:2277040

  4. Amino Acid Sequence of Anionic Peroxidase from the Windmill Palm Tree Trachycarpus fortunei

    PubMed Central

    2015-01-01

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications. PMID:25383699

  5. Protein chemotaxonomy. XIII. Amino acid sequence of ferredoxin from Panax ginseng.

    PubMed

    Mino, Yoshiki

    2006-08-01

    The complete amino acid sequence of [2Fe-2S] ferredoxin from Panax ginseng (Araliaceae) has been determined by automated Edman degradation of the entire S-carboxymethylcysteinyl protein and of the peptides obtained by enzymatic digestion. This ferredoxin has a unique amino acid sequence, which includes an insertion of Tyr at the 3rd position from the amino-terminus and a deletion of two amino acid residues at the carboxyl terminus. This ferredoxin had 18 differences in its amino acid sequence compared to that of Petroselinum sativum (Umbelliferae). In contrast, 23-33 differences were observed compared to other dicotyledonous plants. This suggests that Panax ginseng is related taxonomically to umbelliferous plants. PMID:16880642

  6. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    PubMed

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor. PMID:2708331

  7. Complete cDNA and derived amino acid sequence of human factor V

    SciTech Connect

    Jenny, R.J.; Pittman, D.D.; Toole, J.J.; Kriz, R.W.; Aldape, R.A.; Hewick, R.M.; Kaufman, R.J.; Mann, K.G.

    1987-07-01

    cDNA clones encoding human factor V have been isolated from an oligo(dT)-primed human fetal liver cDNA library prepared with vector Charon 21A. The cDNA sequence of factor V from three overlapping clones includes a 6672-base-pair (bp) coding region, a 90-bp 5' untranslated region, and a 163-bp 3' untranslated region within which is a poly(A)tail. The deduced amino acid sequence consists of 2224 amino acids inclusive of a 28-amino acid leader peptide. Direct comparison with human factor VIII reveals considerable homology between proteins in amino acid sequence and domain structure: a triplicated A domain and duplicated C domain show approx. 40% identity with the corresponding domains in factor VIII. As in factor VIII, the A domains of factor V share approx. 40% amino acid-sequence homology with the three highly conserved domains in ceruloplasmin. The B domain of factor V contains 35 tandem and approx. 9 additional semiconserved repeats of nine amino acids of the form Asp-Leu-Ser-Gln-Thr-Thr/Asn-Leu-Ser-Pro and 2 additional semiconserved repeats of 17 amino acids. Factor V contains 37 potential N-linked glycosylation sites, 25 of which are in the B domain, and a total of 19 cysteine residues.

  8. N-terminal sequence of amino acids and some properties of an acid-stable alpha-amylase from citric acid-koji (Aspergillus usamii var.).

    PubMed

    Suganuma, T; Tahara, N; Kitahara, K; Nagahama, T; Inuzuka, K

    1996-01-01

    An acid-stable alpha-amylase (AA) was purified from an acidic extract of citric acid-koji (A. usamii var.). The N-terminal sequence of the first 20 amino acids of the enzyme was identical with that of AA from A. niger, but the two enzymes differed in molecular weight. HPLC analysis for identifying the anomers of products indicated that the AA hydrolyzed maltopentaose (G5) at the third glycoside bond predominantly, which differed from Taka-amylase A and the neutral alpha-amylase (NA) from the citric acid-koji. PMID:8824843

  9. Analysis of a nucleotide-binding site of 5-lipoxygenase by affinity labelling: binding characteristics and amino acid sequences.

    PubMed Central

    Zhang, Y Y; Hammarberg, T; Radmark, O; Samuelsson, B; Ng, C F; Funk, C D; Loscalzo, J

    2000-01-01

    5-Lipoxygenase (5LO) catalyses the first two steps in the biosynthesis of leukotrienes, which are inflammatory mediators derived from arachidonic acid. 5LO activity is stimulated by ATP; however, a consensus ATP-binding site or nucleotide-binding site has not been found in its protein sequence. In the present study, affinity and photoaffinity labelling of 5LO with 5'-p-fluorosulphonylbenzoyladenosine (FSBA) and 2-azido-ATP showed that 5LO bound to the ATP analogues quantitatively and specifically and that the incorporation of either analogue inhibited ATP stimulation of 5LO activity. The stoichiometry of the labelling was 1.4 mol of FSBA/mol of 5LO (of which ATP competed with 1 mol/mol) or 0.94 mol of 2-azido-ATP/mol of 5LO (of which ATP competed with 0.77 mol/mol). Labelling with FSBA prevented further labelling with 2-azido-ATP, indicating that the same binding site was occupied by both analogues. Other nucleotides (ADP, AMP, GTP, CTP and UTP) also competed with 2-azido-ATP labelling, suggesting that the site was a general nucleotide-binding site rather than a strict ATP-binding site. Ca(2+), which also stimulates 5LO activity, had no effect on the labelling of the nucleotide-binding site. Digestion with trypsin and peptide sequencing showed that two fragments of 5LO were labelled by 2-azido-ATP. These fragments correspond to residues 73-83 (KYWLNDDWYLK, in single-letter amino acid code) and 193-209 (FMHMFQSSWNDFADFEK) in the 5LO sequence. Trp-75 and Trp-201 in these peptides were modified by the labelling, suggesting that they were immediately adjacent to the C-2 position of the adenine ring of ATP. Given the stoichiometry of the labelling, the two peptide sequences of 5LO were probably near each other in the enzyme's tertiary structure, composing or surrounding the ATP-binding site of 5LO. PMID:11042125

  10. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1997-01-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.

  11. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1997-04-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.

  12. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology.

    PubMed

    Richards, Sue; Aziz, Nazneen; Bale, Sherri; Bick, David; Das, Soma; Gastier-Foster, Julie; Grody, Wayne W; Hegde, Madhuri; Lyon, Elaine; Spector, Elaine; Voelkerding, Karl; Rehm, Heidi L

    2015-05-01

    The American College of Medical Genetics and Genomics (ACMG) previously developed guidance for the interpretation of sequence variants.(1) In the past decade, sequencing technology has evolved rapidly with the advent of high-throughput next-generation sequencing. By adopting and leveraging next-generation sequencing, clinical laboratories are now performing an ever-increasing catalogue of genetic testing spanning genotyping, single genes, gene panels, exomes, genomes, transcriptomes, and epigenetic assays for genetic disorders. By virtue of increased complexity, this shift in genetic testing has been accompanied by new challenges in sequence interpretation. In this context the ACMG convened a workgroup in 2013 comprising representatives from the ACMG, the Association for Molecular Pathology (AMP), and the College of American Pathologists to revisit and revise the standards and guidelines for the interpretation of sequence variants. The group consisted of clinical laboratory directors and clinicians. This report represents expert opinion of the workgroup with input from ACMG, AMP, and College of American Pathologists stakeholders. These recommendations primarily apply to the breadth of genetic tests used in clinical laboratories, including genotyping, single genes, panels, exomes, and genomes. This report recommends the use of specific standard terminology-"pathogenic," "likely pathogenic," "uncertain significance," "likely benign," and "benign"-to describe variants identified in genes that cause Mendelian disorders. Moreover, this recommendation describes a process for classifying variants into these five categories based on criteria using typical types of variant evidence (e.g., population data, computational data, functional data, segregation data). Because of the increased complexity of analysis and interpretation of clinical genetic testing described in this report, the ACMG strongly recommends that clinical molecular genetic testing should be performed in a

  13. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences

    NASA Technical Reports Server (NTRS)

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.

  14. Conversion of amino-acid sequence in proteins to classical music: search for auditory patterns

    PubMed Central

    2007-01-01

    We have converted genome-encoded protein sequences into musical notes to reveal auditory patterns without compromising musicality. We derived a reduced range of 13 base notes by pairing similar amino acids and distinguishing them using variations of three-note chords and codon distribution to dictate rhythm. The conversion will help make genomic coding sequences more approachable for the general public, young children, and vision-impaired scientists. PMID:17477882

  15. Vaccination of Cattle Persistently Infected with BVDV Does Not Cause a Change in the Consensus Sequence of the Structural Proteins of the Quasispecies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Bovine viral diarrhea virus (BVDV) is a ubiquitous viral pathogen of cattle worldwide. An interesting aspect of these viruses is the great amount of sequence diversity that exists amongst strains in circulation in livestock herds that can impact diagnostic testing. The driving force behind change in...

  16. The Lactobacillus acidophilus S-layer protein gene expression site comprises two consensus promoter sequences, one of which directs transcription of stable mRNA.

    PubMed Central

    Boot, H J; Kolen, C P; Andreadaki, F J; Leer, R J; Pouwels, P H

    1996-01-01

    S-proteins are proteins which form a regular structure (S-layer) on the outside of the cell walls of many bacteria. Two S-protein-encoding genes are located in opposite directions on a 6.0-kb segment of the chromosome of Lactobacillus acidophilus ATCC 4356 bacteria. Inversion of this chromosomal segment occurs through recombination between two regions with identical sequences, thereby interchanging the expressed and the silent genes. In this study, we show that the region involved in recombination also has a function in efficient S-protein production. Two promoter sequences are present in the S-protein gene expression site, although only the most downstream promoter (P-1) is used to direct mRNA synthesis. S-protein mRNA directed by this promoter has a half-life of 15 min. Its untranslated leader can form a stable secondary structure in which the 5' end is base paired, whereas the ribosome-binding site is exposed. Truncation of this leader sequence results in a reduction in protein production, as shown by reporter gene analysis of Lactobacillus casei. The results obtained indicate that the untranslated leader sequence of S-protein mRNA is involved in efficient S-protein production. PMID:8808926

  17. Protein location prediction using atomic composition and global features of the amino acid sequence

    SciTech Connect

    Cherian, Betsy Sheena; Nair, Achuthsankar S.

    2010-01-22

    Subcellular location of protein is constructive information in determining its function, screening for drug candidates, vaccine design, annotation of gene products and in selecting relevant proteins for further studies. Computational prediction of subcellular localization deals with predicting the location of a protein from its amino acid sequence. For a computational localization prediction method to be more accurate, it should exploit all possible relevant biological features that contribute to the subcellular localization. In this work, we extracted the biological features from the full length protein sequence to incorporate more biological information. A new biological feature, distribution of atomic composition is effectively used with, multiple physiochemical properties, amino acid composition, three part amino acid composition, and sequence similarity for predicting the subcellular location of the protein. Support Vector Machines are designed for four modules and prediction is made by a weighted voting system. Our system makes prediction with an accuracy of 100, 82.47, 88.81 for self-consistency test, jackknife test and independent data test respectively. Our results provide evidence that the prediction based on the biological features derived from the full length amino acid sequence gives better accuracy than those derived from N-terminal alone. Considering the features as a distribution within the entire sequence will bring out underlying property distribution to a greater detail to enhance the prediction accuracy.

  18. Defining the Plasticity of Transcription Factor Binding Sites by Deconstructing DNA Consensus Sequences: The PhoP-Binding Sites among Gamma/Enterobacteria

    PubMed Central

    Harari, Oscar; Park, Sun-Yang; Huang, Henry; Groisman, Eduardo A.; Zwir, Igor

    2010-01-01

    Transcriptional regulators recognize specific DNA sequences. Because these sequences are embedded in the background of genomic DNA, it is hard to identify the key cis-regulatory elements that determine disparate patterns of gene expression. The detection of the intra- and inter-species differences among these sequences is crucial for understanding the molecular basis of both differential gene expression and evolution. Here, we address this problem by investigating the target promoters controlled by the DNA-binding PhoP protein, which governs virulence and Mg2+ homeostasis in several bacterial species. PhoP is particularly interesting; it is highly conserved in different gamma/enterobacteria, regulating not only ancestral genes but also governing the expression of dozens of horizontally acquired genes that differ from species to species. Our approach consists of decomposing the DNA binding site sequences for a given regulator into families of motifs (i.e., termed submotifs) using a machine learning method inspired by the “Divide & Conquer” strategy. By partitioning a motif into sub-patterns, computational advantages for classification were produced, resulting in the discovery of new members of a regulon, and alleviating the problem of distinguishing functional sites in chromatin immunoprecipitation and DNA microarray genome-wide analysis. Moreover, we found that certain partitions were useful in revealing biological properties of binding site sequences, including modular gains and losses of PhoP binding sites through evolutionary turnover events, as well as conservation in distant species. The high conservation of PhoP submotifs within gamma/enterobacteria, as well as the regulatory protein that recognizes them, suggests that the major cause of divergence between related species is not due to the binding sites, as was previously suggested for other regulators. Instead, the divergence may be attributed to the fast evolution of orthologous target genes and/or the

  19. Ab initio detection of fuzzy amino acid tandem repeats in protein sequences

    PubMed Central

    2012-01-01

    Background Tandem repetitions within protein amino acid sequences often correspond to regular secondary structures and form multi-repeat 3D assemblies of varied size and function. Developing internal repetitions is one of the evolutionary mechanisms that proteins employ to adapt their structure and function under evolutionary pressure. While there is keen interest in understanding such phenomena, detection of repeating structures based only on sequence analysis is considered an arduous task, since structure and function is often preserved even under considerable sequence divergence (fuzzy tandem repeats). Results In this paper we present PTRStalker, a new algorithm for ab-initio detection of fuzzy tandem repeats in protein amino acid sequences. In the reported results we show that by feeding PTRStalker with amino acid sequences from the UniProtKB/Swiss-Prot database we detect novel tandemly repeated structures not captured by other state-of-the-art tools. Experiments with membrane proteins indicate that PTRStalker can detect global symmetries in the primary structure which are then reflected in the tertiary structure. Conclusions PTRStalker is able to detect fuzzy tandem repeating structures in protein sequences, with performance beyond the current state-of-the art. Such a tool may be a valuable support to investigating protein structural properties when tertiary X-ray data is not available. PMID:22536906

  20. Multimodal phylogeny for taxonomy: integrating information from nucleotide and amino acid sequences.

    PubMed

    Bicego, Manuele; Dellaglio, Franco; Felis, Giovanna E

    2007-10-01

    The crucial role played by the analysis of microbial diversity in biotechnology-based innovations has increased the interest in the microbial taxonomy research area. Phylogenetic sequence analyses have contributed significantly to the advances in this field, also in the view of the large amount of sequence data collected in recent years. Phylogenetic analyses could be realized on the basis of protein-encoding nucleotide sequences or encoded amino acid molecules: these two mechanisms present different peculiarities, still starting from two alternative representations of the same information. This complementarity could be exploited to achieve a multimodal phylogenetic scheme that is able to integrate gene and protein information in order to realize a single final tree. This aspect has been poorly addressed in the literature. In this paper, we propose to integrate the two phylogenetic analyses using basic schemes derived from the multimodality fusion theory (or multiclassifier systems theory), a well-founded and rigorous branch for which its powerfulness has already been demonstrated in other pattern recognition contexts. The proposed approach could be applied to distance matrix-based phylogenetic techniques (like neighbor joining), resulting in a smart and fast method. The proposed methodology has been tested in a real case involving sequences of some species of lactic acid bacteria. With this dataset, both nucleotide sequence- and amino acid sequence-based phylogenetic analyses present some drawbacks, which are overcome with the multimodal analysis. PMID:17933011

  1. The amino-acid sequence of leghemoglobin component a from Phaseolus vulgaris (kidney bean).

    PubMed

    Lehtovaara, P; Ellfolk, N

    1975-06-01

    1. Leghemoglobin component a from Phaseolus vulgaris (kidney bean) was digested with trypsin; 15 tryptic peptides and free lysine were purified and the amino acid sequences of the peptides determined. 2. The internal order of the tryptic peptides was determined by the bridge peptides obtained from the thermolytic digest and the dilute acid hydrolyzate of kidney bean leghemoglobin a; 12 thermolytic peptides and two acid hydrolysis peptides were purified and the sequences were partially or completely determined. 3. The complete amino acid sequence of kidney bean leghemoglobin a is compared to that of leghemoglobin a from soybean (Glycine max) and to some animal globins. As regards sequence, the kidney bean globin has 79% identity with the soybean globin and 21% identity with human hemoglobin gamma-chain. Seven of the 14 amino acid residues common to most globins are found in the kidney bean globin. Trp-15 and Tyr-145 are evolutionarily conserved in this globin, which confirms the concept of a common origin of animal and plant globins. PMID:809270

  2. Draft genome sequence of the docosahexaenoic acid producing thraustochytrid Aurantiochytrium sp. T66.

    PubMed

    Liu, Bin; Ertesvåg, Helga; Aasen, Inga Marie; Vadstein, Olav; Brautaset, Trygve; Heggeset, Tonje Marita Bjerkan

    2016-06-01

    Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA). Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276), with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data has been deposited at DDBJ/EMBL/Genbank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids. PMID:27222814

  3. A classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B

    1991-01-01

    The amino acid sequences of 301 glycosyl hydrolases and related enzymes have been compared. A total of 291 sequences corresponding to 39 EC entries could be classified into 35 families. Only ten sequences (less than 5% of the sample) could not be assigned to any family. With the sequences available for this analysis, 18 families were found to be monospecific (containing only one EC number) and 17 were found to be polyspecific (containing at least two EC numbers). Implications on the folding characteristics and mechanism of action of these enzymes and on the evolution of carbohydrate metabolism are discussed. With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised. PMID:1747104

  4. New families in the classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B; Bairoch, A

    1993-01-01

    301 glycosyl hydrolases and related enzymes corresponding to 39 EC entries of the I.U.B. classification system have been classified into 35 families on the basis of amino-acid-sequence similarities [Henrissat (1991) Biochem. J. 280, 309-316]. Approximately half of the families were found to be monospecific (containing only one EC number), whereas the other half were found to be polyspecific (containing at least two EC numbers). A > 60% increase in sequence data for glycosyl hydrolases (181 additional enzymes or enzyme domains sequences have since become available) allowed us to update the classification not only by the addition of more members to already identified families, but also by the finding of ten new families. On the basis of a comparison of 482 sequences corresponding to 52 EC entries, 45 families, out of which 22 are polyspecific, can now be defined. This classification has been implemented in the SWISS-PROT protein sequence data bank. PMID:8352747

  5. Sequence-specific purification of nucleic acids by PNA-controlled hybrid selection.

    PubMed

    Orum, H; Nielsen, P E; Jørgensen, M; Larsson, C; Stanley, C; Koch, T

    1995-09-01

    Using an oligohistidine peptide nucleic acids (oligohistidine-PNA) chimera, we have developed a rapid hybrid selection method that allows efficient, sequence-specific purification of a target nucleic acid. The method exploits two fundamental features of PNA. First, that PNA binds with high affinity and specificity to its complementary nucleic acid. Second, that amino acids are easily attached to the PNA oligomer during synthesis. We show that a (His)6-PNA chimera exhibits strong binding to chelated Ni2+ ions without compromising its native PNA hybridization properties. We further show that these characteristics allow the (His)6-PNA/DNA complex to be purified by the well-established method of metal ion affinity chromatography using a Ni(2+)-NTA (nitrilotriactic acid) resin. Specificity and efficiency are the touchstones of any nucleic acid purification scheme. We show that the specificity of the (His)6-PNA selection approach is such that oligonucleotides differing by only a single nucleotide can be selectively purified. We also show that large RNAs (2224 nucleotides) can be captured with high efficiency by using multiple (His)6-PNA probes. PNA can hybridize to nucleic acids in low-salt concentrations that destabilize native nucleic acid structures. We demonstrate that this property of PNA can be utilized to purify an oligonucleotide in which the target sequence forms part of an intramolecular stem/loop structure. PMID:7495562

  6. In silico comparative analysis of DNA and amino acid sequences for prion protein gene.

    PubMed

    Kim, Y; Lee, J; Lee, C

    2008-01-01

    Genetic variability might contribute to species specificity of prion diseases in various organisms. In this study, structures of the prion protein gene (PRNP) and its amino acids were compared among species of which sequence data were available. Comparisons of PRNP DNA sequences among 12 species including human, chimpanzee, monkey, bovine, ovine, dog, mouse, rat, wallaby, opossum, chicken and zebrafish allowed us to identify candidate regulatory regions in intron 1 and 3'-untranslated region (UTR) in addition to the coding region. Highly conserved putative binding sites for transcription factors, such as heat shock factor 2 (HSF2) and myocite enhancer factor 2 (MEF2), were discovered in the intron 1. In 3'-UTR, the functional sequence (ATTAAA) for nucleus-specific polyadenylation was found in all the analysed species. The functional sequence (TTTTTAT) for maturation-specific polyadenylation was identically observed only in ovine, and one or two nucleotide mismatches in the other species. A comparison of the amino acid sequences in 53 species revealed a large sequence identity. Especially the octapeptide repeat region was observed in all the species but frog and zebrafish. Functional changes and susceptibility to prion diseases with various isoforms of prion protein could be caused by numeric variability and conformational changes discovered in the repeat sequences. PMID:18397498

  7. Antibody-specific model of amino acid substitution for immunological inferences from alignments of antibody sequences.

    PubMed

    Mirsky, Alexander; Kazandjian, Linda; Anisimova, Maria

    2015-03-01

    Antibodies are glycoproteins produced by the immune system as a dynamically adaptive line of defense against invading pathogens. Very elegant and specific mutational mechanisms allow B lymphocytes to produce a large and diversified repertoire of antibodies, which is modified and enhanced throughout all adulthood. One of these mechanisms is somatic hypermutation, which stochastically mutates nucleotides in the antibody genes, forming new sequences with different properties and, eventually, higher affinity and selectivity to the pathogenic target. As somatic hypermutation involves fast mutation of antibody sequences, this process can be described using a Markov substitution model of molecular evolution. Here, using large sets of antibody sequences from mice and humans, we infer an empirical amino acid substitution model AB, which is specific to antibody sequences. Compared with existing general amino acid models, we show that the AB model provides significantly better description for the somatic evolution of mice and human antibody sequences, as demonstrated on large next generation sequencing (NGS) antibody data. General amino acid models are reflective of conservation at the protein level due to functional constraints, with most frequent amino acids exchanges taking place between residues with the same or similar physicochemical properties. In contrast, within the variable part of antibody sequences we observed an elevated frequency of exchanges between amino acids with distinct physicochemical properties. This is indicative of a sui generis mutational mechanism, specific to antibody somatic hypermutation. We illustrate this property of antibody sequences by a comparative analysis of the network modularity implied by the AB model and general amino acid substitution models. We recommend using the new model for computational studies of antibody sequence maturation, including inference of alignments and phylogenetic trees describing antibody somatic hypermutation in

  8. Working toward Consensus.

    ERIC Educational Resources Information Center

    Brown, Harold

    1998-01-01

    A California high school English teacher uses, with students, a culturally sensitive process of facilitating classroom decision making through consensus. He correlates communication and language skills with consensus building, the facilitation of which is a slow process implemented in small portions over the school year. Sidebar provides a…

  9. Amino acid sequence of a vitamin K-dependent Ca2+-binding peptide from bovine prothrombin.

    PubMed

    Howard, J B; Fausch, M D

    1975-08-10

    The amino acid sequence of a 31-residue peptide from bovine prothrombin has been determined. This peptide has been shown to contain the vitamin K-dependent modification required for Ca2+ binding (Nelsestuen, G. L., and Suttie, J. W. (1973) Proc. Natl. Acad. Sci. U. S. A. 70, 3366-3370) and the modified amino acid, gamma-carboxyglutamic acid (Nelsestuen, G. L., Zytkovicz, T., and Howard, J. B. (1974) J. Biol. Chem. 249, 6347-6350). The peptide was shown to correspond to residues 12 to 42 of prothrombin. PMID:807581

  10. Amino acid sequences around the cysteine residues of rabbit muscle triose phosphate isomerase

    PubMed Central

    Miller, Janet C.; Waley, S. G.

    1971-01-01

    1. The nature of the subunits in rabbit muscle triose phosphate isomerase has been investigated. 2. Amino acid analyses show that there are five cysteine residues and two methionine residues/subunit. 3. The amino acid sequences around the cysteine residues have been determined; these account for about 75 residues. 4. Cleavage at the methionine residues with cyanogen bromide gave three fragments. 5. These results show that the subunits correspond to polypeptide chains, containing about 230 amino acid residues. The chains in triose phosphate isomerase seem to be shorter than those of other glycolytic enzymes. PMID:5165707

  11. Complete amino acid sequence of the Mu heavy chain of a human IgM immunoglobulin.

    PubMed

    Putnam, F W; Florent, G; Paul, C; Shinoda, T; Shimizu, A

    1973-10-19

    The amino acid sequence of the micro, chain of a human IgM immunoglobulin, including the location of all disulfide bridges and oligosaccharides, has been determined. The homology of the constant regions of immunoglobulin micro, gamma, alpha, and epsilon heavy chains reveals evolutionary relationships and suggests that two genes code for each heavy chain. PMID:4742735

  12. Draft Genome Sequence of the Butyric Acid Producer Clostridium tyrobutyricum Strain CIP I-776 (IFP923)

    PubMed Central

    Clément, Benjamin; Lopes Ferreira, Nicolas

    2016-01-01

    Here, we report the draft genome sequence of Clostridium tyrobutyricum CIP I-776 (IFP923), an efficient producer of butyric acid. The genome consists of a single chromosome of 3.19 Mb and provides useful data concerning the metabolic capacities of the strain. PMID:26941139

  13. Draft Genome Sequence of Perfluorooctane Acid-Degrading Bacterium Pseudomonas parafulva YAB-1

    PubMed Central

    Tang, Chongjian; Peng, Qingjing; Peng, Qingzhong

    2015-01-01

    Pseudomonas parafulva YAB-1, isolated from perfluorinated compound-contaminated soil, has the ability to degrade perfluorooctane acid (PFOA) compound. Here, we report the draft genome sequence and annotation of the PFOA-degrading bacterium P. parafulva YAB-1. The data provide the basis to investigate the molecular mechanism of PFOA metabolism. PMID:26337877

  14. The amino acid sequence of cytochrome c-555 from the methane-oxidizing bacterium Methylococcus capsulatus.

    PubMed Central

    Ambler, R P; Dalton, H; Meyer, T E; Bartsch, R G; Kamen, M D

    1986-01-01

    The amino acid sequence of the cytochrome c-555 from the obligate methanotroph Methylococcus capsulatus strain Bath (N.C.I.B. 11132) was determined. It is a single polypeptide chain of 96 residues, binding a haem group through the cysteine residues at positions 19 and 22, and the only methionine residue is a position 59. The sequence does not closely resemble that of any other cytochrome c that has yet been characterized. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50131 (12 pages) at the British Library Lending Division, Boston Spa, West Yorkshire LS23 7BQ, U.K., from whom copies are available on prepayment. PMID:3006666

  15. Allelic polymorphism in arabian camel ribonuclease and the amino acid sequence of bactrian camel ribonuclease.

    PubMed

    Welling, G W; Mulder, H; Beintema, J J

    1976-04-01

    Pancreatic ribonucleases from several species (whitetail deer, roe deer, guinea pig, and arabian camel) exhibit more than one amino acid at particular positions in their amino acid sequences. Since these enzymes were isolated from pooled pancreas, the origin of this heterogeneity is not clear. The pancreatic ribonucleases from 11 individual arabian camels (Camelus dromedarius) have been investigated with respect to the lysine-glutamine heterogeneity at position 103 (Welling et al., 1975). Six ribonucleases showed only one basic band and five showed two bands after polyacrylamide gel electrophoresis, suggesting a gene frequency of about 0.75 for the Lys gene and about 0.25 for the Gln gene. The amino acid sequence of bactrian camel (Camelus bactrianus) ribonuclease isolated from individual pancreatic tissue was determined and compared with that of arabian camel ribonuclease. The only difference was observed at position 103. In the ribonucleases from two unrelated bactrian camels, only glutamine was observed at that position. PMID:962846

  16. A single amino acid change, Q114R, in the cleavage-site sequence of Newcastle disease virus fusion protein attenuates viral replication and pathogenicity.

    PubMed

    Samal, Sweety; Kumar, Sachin; Khattar, Sunil K; Samal, Siba K

    2011-10-01

    A key determinant of Newcastle disease virus (NDV) virulence is the amino acid sequence at the fusion (F) protein cleavage site. The NDV F protein is synthesized as an inactive precursor, F(0), and is activated by proteolytic cleavage between amino acids 116 and 117 to produce two disulfide-linked subunits, F(1) and F(2). The consensus sequence of the F protein cleavage site of virulent [(112)(R/K)-R-Q-(R/K)-R↓F-I(118)] and avirulent [(112)(G/E)-(K/R)-Q-(G/E)-R↓L-I(118)] strains contains a conserved glutamine residue at position 114. Recently, some NDV strains from Africa and Madagascar were isolated from healthy birds and have been reported to contain five basic residues (R-R-R-K-R↓F-I/V or R-R-R-R-R↓F-I/V) at the F protein cleavage site. In this study, we have evaluated the role of this conserved glutamine residue in the replication and pathogenicity of NDV by using the moderately pathogenic Beaudette C strain and by making Q114R, K115R and I118V mutants of the F protein in this strain. Our results showed that changing the glutamine to a basic arginine residue reduced viral replication and attenuated the pathogenicity of the virus in chickens. The pathogenicity was further reduced when the isoleucine at position 118 was substituted for valine. PMID:21677091

  17. Use of a structural alphabet to find compatible folds for amino acid sequences

    PubMed Central

    Mahajan, Swapnil; de Brevern, Alexandre G; Sanejouand, Yves-Henri; Srinivasan, Narayanaswamy; Offmann, Bernard

    2015-01-01

    The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence-search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino-acid sequence of a protein to fit to known protein 3D structures using a structural alphabet, known as “Protein Blocks” (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. It is used to encode 3D protein structures into 1D PB sequences and to capture sequence to structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence-search methods or explicit incorporation of hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z-score cutoff that achieved 95% specificity (i.e., less than 5% false positives), global and local threading showed sensitivity of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in PDB: 38 compatible templates were identified by our approach and 66% of these hits yielded correctly predicted structures. This method scales-up well and offers promising perspectives for structural annotations at genomic level. It has been implemented in the form of a web-server that is freely available at http://www.bo-protscience.fr/forsa. PMID:25297700

  18. Use of a structural alphabet to find compatible folds for amino acid sequences.

    PubMed

    Mahajan, Swapnil; de Brevern, Alexandre G; Sanejouand, Yves-Henri; Srinivasan, Narayanaswamy; Offmann, Bernard

    2015-01-01

    The structural annotation of proteins with no detectable homologs of known 3D structure identified using sequence-search methods is a major challenge today. We propose an original method that computes the conditional probabilities for the amino-acid sequence of a protein to fit to known protein 3D structures using a structural alphabet, known as "Protein Blocks" (PBs). PBs constitute a library of 16 local structural prototypes that approximate every part of protein backbone structures. It is used to encode 3D protein structures into 1D PB sequences and to capture sequence to structure relationships. Our method relies on amino acid occurrence matrices, one for each PB, to score global and local threading of query amino acid sequences to protein folds encoded into PB sequences. It does not use any information from residue contacts or sequence-search methods or explicit incorporation of hydrophobic effect. The performance of the method was assessed with independent test datasets derived from SCOP 1.75A. With a Z-score cutoff that achieved 95% specificity (i.e., less than 5% false positives), global and local threading showed sensitivity of 64.1% and 34.2%, respectively. We further tested its performance on 57 difficult CASP10 targets that had no known homologs in PDB: 38 compatible templates were identified by our approach and 66% of these hits yielded correctly predicted structures. This method scales-up well and offers promising perspectives for structural annotations at genomic level. It has been implemented in the form of a web-server that is freely available at http://www.bo-protscience.fr/forsa. PMID:25297700

  19. Pan-Serotype Diagnostic for Foot-and-Mouth Disease Using the Consensus Antigen of Nonstructural Protein 3B

    PubMed Central

    Van Dreumel, Alyssa K.; Michalski, Wojtek P.; McNabb, Leanne M.; Shiell, Brian J.; Singanallur, Nagendrakumar B.

    2015-01-01

    An amino acid consensus sequence for the seven serotypes of foot-and-mouth disease virus (FMDV) nonstructural protein 3B, including all three contiguous repeats, and its use in the development of a pan-serotype diagnostic test for all seven FMDV serotypes are described. The amino acid consensus sequence of the 3B protein was determined from a multiple-sequence alignment of 125 sequences of 3B. The consensus 3B (c3B) protein was expressed as a soluble recombinant fusion protein with maltose-binding protein (MBP) using a bacterial expression system and was affinity purified using amylose resin. The MBP-c3B protein was used as the antigen in the development of a competition enzyme-linked immunosorbent assay (cELISA) for detection of anti-3B antibodies in bovine sera. The comparative diagnostic sensitivity and specificity at 47% inhibition were estimated to be 87.22% and 93.15%, respectively. Reactivity of c3B with bovine sera representing the seven FMDV serotypes demonstrated the pan-serotype diagnostic capability of this bioreagent. The consensus antigen and competition ELISA are described here as candidates for a pan-serotype diagnostic test for FMDV infection. PMID:25788546

  20. Software scripts for quality checking of high-throughput nucleic acid sequencers.

    PubMed

    Lazo, G R; Tong, J; Miller, R; Hsia, C; Rausch, C; Kang, Y; Anderson, O D

    2001-06-01

    We have developed a graphical interface to allow the researcher to view and assess the quality of sequencing results using a series of program scripts developed to process data generated by automated sequencers. The scripts are written in Perl programming language and are executable under the cgibin directory of a Web server environment. The scripts direct nucleic acid sequencing trace file data output from automated sequencers to be analyzed by the phred molecular biology program and are displayed as graphical hypertext mark-up language (HTML) pages. The scripts are mainly designed to handle 96-well microtiter dish samples, but the scripts are also able to read data from 384-well microtiter dishes 96 samples at a time. The scripts may be customized for different laboratory environments and computer configurations. Web links to the sources and discussion page are provided. PMID:11414222

  1. Nucleotide and predicted amino acid sequences of cloned human and mouse preprocathepsin B cDNAs.

    PubMed Central

    Chan, S J; San Segundo, B; McCormick, M B; Steiner, D F

    1986-01-01

    Cathepsin B is a lysosomal thiol proteinase that may have additional extralysosomal functions. To further our investigations on the structure, mode of biosynthesis, and intracellular sorting of this enzyme, we have determined the complete coding sequences for human and mouse preprocathepsin B by using cDNA clones isolated from human hepatoma and kidney phage libraries. The nucleotide sequences predict that the primary structure of preprocathepsin B contains 339 amino acids organized as follows: a 17-residue NH2-terminal prepeptide sequence followed by a 62-residue propeptide region, 254 residues in mature (single chain) cathepsin B, and a 6-residue extension at the COOH terminus. A comparison of procathepsin B sequences from three species (human, mouse, and rat) reveals that the homology between the propeptides is relatively conserved with a minimum of 68% sequence identity. In particular, two conserved sequences in the propeptide that may be functionally significant include a potential glycosylation site and the presence of a single cysteine at position 59. Comparative analysis of the three sequences also suggests that processing of procathepsin B is a multistep process, during which enzymatically active intermediate forms may be generated. The availability of the cDNA clones will facilitate the identification of possible active or inactive intermediate processive forms as well as studies on the transcriptional regulation of the cathepsin B gene. PMID:3463996

  2. Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization.

    PubMed

    Anahtar, Melis N; Bowman, Brittany A; Kwon, Douglas S

    2016-01-01

    There is a growing appreciation for the role of microbial communities as critical modulators of human health and disease. High throughput sequencing technologies have allowed for the rapid and efficient characterization of bacterial communities using 16S rRNA gene sequencing from a variety of sources. Although readily available tools for 16S rRNA sequence analysis have standardized computational workflows, sample processing for DNA extraction remains a continued source of variability across studies. Here we describe an efficient, robust, and cost effective method for extracting nucleic acid from swabs. We also delineate downstream methods for 16S rRNA gene sequencing, including generation of sequencing libraries, data quality control, and sequence analysis. The workflow can accommodate multiple samples types, including stool and swabs collected from a variety of anatomical locations and host species. Additionally, recovered DNA and RNA can be separated and used for other applications, including whole genome sequencing or RNA-seq. The method described allows for a common processing approach for multiple sample types and accommodates downstream analysis of genomic, metagenomic and transcriptional information. PMID:27168460

  3. Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization

    PubMed Central

    Anahtar, Melis N.; Bowman, Brittany A.; Kwon, Douglas S.

    2016-01-01

    There is a growing appreciation for the role of microbial communities as critical modulators of human health and disease. High throughput sequencing technologies have allowed for the rapid and efficient characterization of bacterial communities using 16S rRNA gene sequencing from a variety of sources. Although readily available tools for 16S rRNA sequence analysis have standardized computational workflows, sample processing for DNA extraction remains a continued source of variability across studies. Here we describe an efficient, robust, and cost effective method for extracting nucleic acid from swabs. We also delineate downstream methods for 16S rRNA gene sequencing, including generation of sequencing libraries, data quality control, and sequence analysis. The workflow can accommodate multiple samples types, including stool and swabs collected from a variety of anatomical locations and host species. Additionally, recovered DNA and RNA can be separated and used for other applications, including whole genome sequencing or RNA-seq. The method described allows for a common processing approach for multiple sample types and accommodates downstream analysis of genomic, metagenomic and transcriptional information. PMID:27168460

  4. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Patel, Kamlesh D [Ken]; SNL,

    2013-01-25

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology " at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  5. The amino acid sequence of ribonuclease U2 from Ustilago sphaerogena.

    PubMed Central

    Sato, S; Uchida, T

    1975-01-01

    1. RNAase (ribonuclease) U2, a purine-specific RNAase, was reduced, aminoethylated and hydrolysed with trypsin, chymotrypsin and thermolysin. On the basis of the analyses of the resulting peptides, the complete amino acid sequence of RNAase U2 was determined, 2. When the sequence was compared with the amino acid sequence of RNAase T1 (EC 3.1.4.8), the following regions were found to be similar in the two enzymes; Tyr-Pro-His-Gln-Tyr (38-42) in RNAase U2 and Tyr-Pro-His-Lys-Tyr (38-42) in RNAase T1, Glu-Phe-Pro-Leu-Val (61-65) in RNAase U2 and Glu-Trp-Pro-Ile-Leu (58-62) in RNAase T1, Asp-Arg-Val-Ile-Tyr-Gln (83-88) in RNAase U2 and Asp-Arg-Val-Phe-Asn (76-81) in RNAase T1 and Val-Thr-His-Thr-Gly-Ala (98-103) in RNAase U2 and Ile-Thr-His-Thr-Gly-Ala (90-95) in RNAase T1. All of the amino acid residues, histidine-40, glutamate-58, arginine-77 and histidine-92, which were found to play a crucial role in the biological activity of RNAase T1, were included in the regions cited here. 3. Detailed evidence for the amino acid sequence of the sequence of the proteins has been deposited as Supplementary Publication SUP 50041 (33 PAGES) AT THE British Library (Lending Division)(formerly the National Lending Library for Science and Technology), Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1975), 145, 5. PMID:1156364

  6. Deduced amino acid sequence of human pulmonary surfactant proteolipid: SPL(pVal)

    SciTech Connect

    Whitsett, J.A.; Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.; Clark, J.; Pilot-Matias, T.; Meuth, J.; Fox, J.L.

    1987-05-01

    Hydrophobic, proteolipid-like protein of Mr 6500 was isolated from ether/ethanol extracts of human, canine and bovine pulmonary surfactant. Amino acid composition of the protein demonstrated a remarkable abundance of hydrophobic residues, particularly valine and leucine. The N-terminal amino acid sequence of the human protein was determined: N-Leu-Ile-Pro-Cys-Cys-Pro-Val-Asn-Leu-Lys-Arg-Leu-Leu-Ile-Val4... An oligonucleotide probe was used to screen an adult human lung cDNA library and resulted in detection of cDNA clones with predicted amino acid sequence with close identity to the N-terminal amino acid sequence of the human peptide. SPL(pVal) was found within the reading frame of a larger peptide. SPL(pVal) results from proteolytic processing of a larger preprotein. Northern blot analysis detected in a single 1.0 kilobase SPL(pVal) RNA which was less abundant in fetal than in adult lung. Mixtures of purified canine and bovine SPL(pVal) and synthetic phospholipids display properties of rapid adsorption and surface tension lowering activity characteristic of surfactant. Human SPL(pVal) is a pulmonary surfactant proteolipid which may therefore be useful in combination with phospholipids and/or other surfactant proteins for the treatment of surfactant deficiency such as hyaline membrane disease in newborn infants.

  7. Complete nucleic acid sequence of Penaeus stylirostris densovirus (PstDNV) from India.

    PubMed

    Rai, Praveen; Safeena, Muhammed P; Karunasagar, Iddya; Karunasagar, Indrani

    2011-06-01

    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) of shrimp, recently been classified as Penaeus stylirostris densovirus (PstDNV). The complete nucleic acid sequence of PstDNV from India was obtained by cloning and sequencing of different DNA fragment of the virus. The genome organisation of PstDNV revealed that there were three major coding domains: a left ORF (NS1) of 2001 bp, a mid ORF (NS2) of 1092 bp and a right ORF (VP) of 990 bp. The complete genome and amino acid sequences of three proteins viz., NS1, NS2 and VP were compared with the genomes of the virus reported from Hawaii, China and Mexico and with partial sequence available from isolates from different regions. The phylogenetic analysis of shrimp, insect and vertebrate parvovirus sequences showed that the Indian PstDNV isolate is phylogenetically more closely related to one of the three isolates from Taiwan (AY355307), and two isolates (AY362547 and AY102034) from Thailand. PMID:21402111

  8. Human liver type pyruvate kinase: complete amino acid sequence and the expression in mammalian cells.

    PubMed Central

    Tani, K; Fujii, H; Nagata, S; Miwa, S

    1988-01-01

    Pyruvate kinase (PK) has four isozymes (L, R, M1, M2) that are encoded by two different genes. Among these isozymes, abnormalities of liver (L)-type PK is considered to be associated with hereditary nonspherocytic hemolytic anemia in humans. We isolated and determined the full-length sequence of human L-type PK cDNA. The cDNA contains 1629 base pairs encoding 543 amino acids, 68 base pairs of 5'-noncoding sequence, and 734 base pairs of 3'-noncoding sequence. The similarity between human and rat L-type PK was 86.9% at the nucleotide sequence level and 92.4% at the amino acid sequence level. The full-length L-type PK cDNA was placed under the promoter of simian virus 40 and introduced into monkey COS cells. Human L-type PK activity was detected in the extract of COS cells by the classical PK electrophoresis method. Images PMID:3126495

  9. Human liver type pyruvate kinase: Complete amino acid sequence and the expression in mammalian cells

    SciTech Connect

    Tani, Kenzaburo; Nagata, Shigekazu ); Fujii, Hisaichi ); Miwa, Shiro )

    1988-03-01

    Pyruvate kinase (PK) has four isozymes (L, R, M{sub 1}, M{sub 2}) that are encoded by two different genes. Among these isozymes, abnormalities of liver (L)-type PK is considered to be associated with hereditary nonspherocytic hemolytic anemia in humans. The authors isolated and determined the full-length sequence of human L-type PK cDNA. The cDNA contains 1,629 base pairs encoding 543 amino acids, 68 base pairs of 5{prime}-noncoding sequence, and 734 base pairs of 3{prime}-noncoding sequence. The similarity between human and rat L-type PK was 86.9% at the nucleotide sequence level and 92.4% at the amino acid sequence level. The full-length L-type PK cDNA was placed under the promoter of simian virus 40 and introduced into monkey COS cells. Human L-type PK activity was detected in the extract of COS cells by the classical PK electrophoresis method.

  10. Molecular cytogenetics by polymerase catalyzed amplification or in situ labelling of specific nucleic acid sequences

    SciTech Connect

    Bolund, L.; Brandt, C.; Hindkjaer, J.; Koch, J.; Koelvraa, S.; Pedersen, S. )

    1993-01-01

    The Polymerase Chain Reaction (PCR) can be performed on isolated cells or chromosomes and the product can be analyzed by DNA technology or by FISH to test metaphases. The authors have good experiences analyzing aberrant chromosomes by FACS sorting, PCR with degenerated primers and painting of test metaphases with the PCR product. They also utilize polymerases for PRimed IN Situ labelling (PRINS) of specific nucleic acid sequences. In PRINS oligonucleotides are hybridized to their target sequences and labeled nucleotides are incorporated at the site of hybridization with the oligonucleotide as primer. PRINS may eventually allow the study of individual genes, gene expression and even somatic mutations (in mRNA) in single cells.

  11. DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

    NASA Astrophysics Data System (ADS)

    Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

    1984-08-01

    A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.

  12. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.

  13. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

  14. Partial amino acid sequence of apolipoprotein(a) shows that it is homologous to plasminogen

    SciTech Connect

    Eaton, D.L.; Fless, G.M.; Kohr, W.J.; McLean, J.W.; Xu, Q.T.; Miller, C.G.; Lawn, R.M.; Scanu, A.M.

    1987-05-01

    Apolipoprotein(a) (apo(a)) is a glycoprotein with M/sub r/ approx. 280,000 that is disulfide linked to apolipoprotein B in lipoprotein(a) particles. Elevated plasma levels of lipoprotein(a) are correlated with atherosclerosis. Partial amino acid sequence of apo(a) shows that it has striking homology to plasminogen. Plasminogen is a plasma serine protease zymogen that consists of five homologous and tandemly repeated domains called kringles and a trypsin-like protease domain. The amino-terminal sequence obtained for apo(a) is homologous to the beginning of kringle 4 but not the amino terminus of plasminogen. Apo(a) was subjected to limited proteolysis by trypsin or V8 protease, and fragments generated were isolated and sequenced. Sequences obtained from several of these fragments are highly (77-100%) homologous to plasminogen residues 391-421, which reside within kringle 4. Analysis of these internal apo(a) sequences revealed that apo(a) may contain at least two kringle 4-like domains. A sequence obtained from another tryptic fragment also shows homology to the end of kringle 4 and the beginning of kringle 5. Sequence data obtained from the two tryptic fragments shows homology with the protease domain of plasminogen. One of these sequences is homologous to the sequences surrounding the activation site of plasminogen. Plasminogen is activated by the cleavage of a specific arginine residue by urokinase and tissue plasminogen activator; however, the corresponding site in apo(a) is a serine that would not be cleaved by tissue plasminogen activator or urokinase. Using a plasmin-specific assay, no proteolytic activity could be demonstrated for lipoprotein(a) particles. These results suggest that apo(a) contains kringle-like domains and an inactive protease domain.

  15. The Complete Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis ssp. lactis IL1403

    PubMed Central

    Bolotin, Alexander; Wincker, Patrick; Mauger, Stéphane; Jaillon, Olivier; Malarme, Karine; Weissenbach, Jean; Ehrlich, S. Dusko; Sorokin, Alexei

    2001-01-01

    Lactococcus lactis is a nonpathogenic AT-rich gram-positive bacterium closely related to the genus Streptococcus and is the most commonly used cheese starter. It is also the best-characterized lactic acid bacterium. We sequenced the genome of the laboratory strain IL1403, using a novel two-step strategy that comprises diagnostic sequencing of the entire genome and a shotgun polishing step. The genome contains 2,365,589 base pairs and encodes 2310 proteins, including 293 protein-coding genes belonging to six prophages and 43 insertion sequence (IS) elements. Nonrandom distribution of IS elements indicates that the chromosome of the sequenced strain may be a product of recent recombination between two closely related genomes. A complete set of late competence genes is present, indicating the ability of L. lactis to undergo DNA transformation. Genomic sequence revealed new possibilities for fermentation pathways and for aerobic respiration. It also indicated a horizontal transfer of genetic information from Lactococcus to gram-negative enteric bacteria of Salmonella-Escherichia group. [The sequence data described in this paper has been submitted to the GenBank data library under accession no. AE005176.] PMID:11337471

  16. On human disease-causing amino acid variants: statistical study of sequence and structural patterns

    PubMed Central

    Alexov, Emil

    2015-01-01

    Statistical analysis was carried out on large set of naturally occurring human amino acid variations and it was demonstrated that there is a preference for some amino acid substitutions to be associated with diseases. At an amino acid sequence level, it was shown that the disease-causing variants frequently involve drastic changes of amino acid physico-chemical properties of proteins such as charge, hydrophobicity and geometry. Structural analysis of variants involved in diseases and being frequently observed in human population showed similar trends: disease-causing variants tend to cause more changes of hydrogen bond network and salt bridges as compared with harmless amino acid mutations. Analysis of thermodynamics data reported in literature, both experimental and computational, indicated that disease-causing variants tend to destabilize proteins and their interactions, which prompted us to investigate the effects of amino acid mutations on large databases of experimentally measured energy changes in unrelated proteins. Although the experimental datasets were linked neither to diseases nor exclusory to human proteins, the observed trends were the same: amino acid mutations tend to destabilize proteins and their interactions. Having in mind that structural and thermodynamics properties are interrelated, it is pointed out that any large change of any of them is anticipated to cause a disease. PMID:25689729

  17. Self-sequencing of amino acids and origins of polyfunctional protocells.

    PubMed

    Fox, S W

    1984-01-01

    The primal role of the origins of proteins in molecular evolution is discussed. On the basis of this premise, the significance of the experimentally established self-sequencing of amino acids under simulated geological conditions is explained as due to the fact that the products are highly nonrandom and accordingly contain many kinds of information. When such thermal proteins are aggregated into laboratory protocells, an action that occurs readily, the resultant protocells also contain many kinds of information. Residue-by-residue order, enzymic activities, and lipid quality accordingly occur within each preparation of proteinoid (thermal protein). In this paper are reviewed briefly the phenomenon of self-sequencing of amino acids, its relationship to evolutionary processes, other significance of such self-ordering, and the experimental evidence for original polyfunctional protocells. PMID:6462684

  18. Self-Sequencing of Amino Acids and Origins of Polyfunctional Protocells

    NASA Astrophysics Data System (ADS)

    Fox, Sidney W.

    1984-12-01

    The primal role of the origins of proteins in molecular evolution is discussed. On the basis of this premise, the significance of the experimentally established self-sequencing of amino acids under simulated geological conditions is explained as due to the fact that the products are highly nonrandom and accordingly contain many kinds of information. When such thermal proteins are aggregated into laboratory protocells, an action that occurs readily, the resultant protocells also contain many kinds of information. Residue-by-residue order, enzymic activities, and lipid quality accordingly occur within each preparation of proteinoid (thermal protein). In this paper are reviewed briefly the phenomenon of self-sequencing of amino acids, its relationship to evolutionary processes, other significance of such self-ordering, and the experimental evidence for original polyfunctional protocells.

  19. 7-Benzylamino-6-chloro-2-piperazino-4-pyrrolidino-pteridine, a potent inhibitor of cAMP-specific phosphodiesterase, enhancing nuclear protein binding to the CRE consensus sequence in human tumour cells.

    PubMed

    Wagner, Barbara; Jakobs, Sandra; Habermeyer, Michael; Hippe, Frankie; Cho-Chung, Yoon Sang; Eisenbrand, Gerhard; Marko, Doris

    2002-02-15

    The cAMP-specific phosphodiesterase isoenzyme family PDE4 represents the highest cAMP-hydrolysing activity in many human cancer cell lines including the human large cell lung carcinoma cell line LXFL529L. Treatment of LXFL529L cells with the potent PDE4 inhibitor 7-benzylamino-6-chloro-2-piperazino-4-pyrrolidino-pteridine (DC-TA-46) induces dose-dependent growth inhibition. Cells are arrested in the G(1)-phase of the cell cycle and the induction of apoptosis is observed. In this study, we investigated the effect of DC-TA-46 on downstream elements of the cAMP-pathway. DC-TA-46 mediated inhibition of PDE4 activity in LXFL529L cells resulted in an increase of the intracellular cAMP level and significant induction of the activity of protein kinase A (PKA). The regulatory PKA subunit RIalpha was predominantly expressed in LXFL529L cells. In contrast to effects induced by cAMP analogues like 8-Cl-cAMP, the expression of the regulatory subunits of PKA remained unaffected by DC-TA-46. Treatment of LXFL529L cells with DC-TA-46 enhanced the binding of nuclear proteins to the cAMP-responsive element (CRE) consensus sequence TGACGTCA in a time- and dose-dependent manner, indicating the activation of transcription factors by PKA phosphorylation. PMID:11992633

  20. Sequence of morphological transitions in two-dimensional pattern growth from aqueous ascorbic Acid solutions.

    PubMed

    Paranjpe, A S

    2002-08-12

    A sequence of morphological transitions in two-dimensional dehydration patterns of aqueous solutions of ascorbic acid is observed with humidity as a control parameter. Change in morphology occurs due to humidity induced variation in the concentration of the metastable supersaturated solution phase formed after initial solvent evaporation. As percent humidity is varied from 40 to 80, patterns change from compact circular --> radial --> density modulated radial (a new morphology) --> density modulated circular --> density modulated dendritic (a new morphology) --> dense branching. PMID:12190528

  1. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  2. Snake venom. The amino acid sequence of protein A from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1980-12-01

    Protein A from Dendroaspis polylepis polylepis venom comprises 81 amino acids, including ten half-cystine residues. The complete primary structures of protein A and its variant A' were elucidated. The sequences of proteins A and A', which differ in a single position, show no homology with various neurotoxins and non-neurotoxic proteins and represent a new type of elapid venom protein. PMID:7461607

  3. DNA-binding and transactivation properties of Pax-6: three amino acids in the paired domain are responsible for the different sequence recognition of Pax-6 and BSAP (Pax-5).

    PubMed Central

    Czerny, T; Busslinger, M

    1995-01-01

    Pax-6 is known to be a key regulator of vertebrate eye development. We have now isolated cDNA for an invertebrate Pax-6 protein from sea urchin embryos. Transcripts of this gene first appear during development at the gastrula stage and are later expressed at high levels in the tube foot of the adult sea urchin. The sea urchin Pax-6 protein is highly homologous throughout the whole protein to its vertebrate counterpart with the paired domain and homeodomain being virtually identical. Consequently, we found that the DNA-binding and transactivation properties of the sea urchin and mouse Pax-6 proteins are very similar, if not identical. A potent activation domain capable of stimulating transcription from proximal promoter and distal enhancer positions was localized within the C-terminal sequences of both the sea urchin and mouse Pax-6 proteins. The homeodomain of Pax-6 was shown to cooperatively dimerize on DNA sequences consisting of an inverted repeat of the TAAT motif with a preferred spacing of 3 nucleotides. The consensus recognition sequence of the Pax-6 paired domain deviates primarily only at one position from that of BSAP (Pax-5), and yet the two proteins exhibit largely different binding specificities for individual, naturally occurring sites. By creating Pax-6-BSAP fusion proteins, we were able to identify a short amino acid stretch in the N-terminal part of the paired domain which is responsible for these differences in DNA-binding specificity. Mutation of three Pax-6-specific residues in this region (at positions 42, 44, and 47 of the paired domain) to the corresponding amino acids of BSAP resulted in a complete switch of the DNA-binding specificity from Pax-6 to BSAP. These three amino acids were furthermore shown to discriminate between the Pax-6- and BSAP-specific nucleotide at the divergent position of the two consensus recognition sequences. PMID:7739566

  4. Polyphasic taxonomy, a consensus approach to bacterial systematics.

    PubMed Central

    Vandamme, P; Pot, B; Gillis, M; de Vos, P; Kersters, K; Swings, J

    1996-01-01

    Over the last 25 years, a much broader range of taxonomic studies of bacteria has gradually replaced the former reliance upon morphological, physiological, and biochemical characterization. This polyphasic taxonomy takes into account all available phenotypic and genotypic data and integrates them in a consensus type of classification, framed in a general phylogeny derived from 16S rRNA sequence analysis. In some cases, the consensus classification is a compromise containing a minimum of contradictions. It is thought that the more parameters that will become available in the future, the more polyphasic classification will gain stability. In this review, the practice of polyphasic taxonomy is discussed for four groups of bacteria chosen for their relevance, complexity, or both: the genera Xanthomonas and Campylobacter, the lactic acid bacteria, and the family Comamonadaceae. An evaluation of our present insights, the conclusions derived from it, and the perspectives of polyphasic taxonomy are discussed, emphasizing the keystone role of the species. Taxonomists did not succeed in standardizing species delimitation by using percent DNA hybridization values. Together with the absence of another "gold standard" for species definition, this has an enormous repercussion on bacterial taxonomy. This problem is faced in polyphasic taxonomy, which does not depend on a theory, a hypothesis, or a set of rules, presenting a pragmatic approach to a consensus type of taxonomy, integrating all available data maximally. In the future, polyphasic taxonomy will have to cope with (i) enormous amounts of data, (ii) large numbers of strains, and (iii) data fusion (data aggregation), which will demand efficient and centralized data storage. In the future, taxonomic studies will require collaborative efforts by specialized laboratories even more than now is the case. Whether these future developments will guarantee a more stable consensus classification remains an open question. PMID

  5. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    PubMed

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment. PMID:23485423

  6. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... approved by the Director of the Federal Register in accordance with 5 U.S.C. 552(a) and 1 CFR part 51... base or modified or unusual amino acid may be presented in a given sequence as the corresponding unmodified base or amino acid if the modified base or modified or unusual amino acid is one of those...

  7. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... approved by the Director of the Federal Register in accordance with 5 U.S.C. 552(a) and 1 CFR part 51... base or modified or unusual amino acid may be presented in a given sequence as the corresponding unmodified base or amino acid if the modified base or modified or unusual amino acid is one of those...

  8. Nanopore Analysis of Nucleic Acids: Single-Molecule Studies of Molecular Dynamics, Structure, and Base Sequence

    NASA Astrophysics Data System (ADS)

    Olasagasti, Felix; Deamer, David W.

    Nucleic acids are linear polynucleotides in which each base is covalently linked to a pentose sugar and a phosphate group carrying a negative charge. If a pore having roughly the crosssectional diameter of a single-stranded nucleic acid is embedded in a thin membrane and a voltage of 100 mV or more is applied, individual nucleic acids in solution can be captured by the electrical field in the pore and translocated through by single-molecule electrophoresis. The dimensions of the pore cannot accommodate anything larger than a single strand, so each base in the molecule passes through the pore in strict linear sequence. The nucleic acid strand occupies a large fraction of the pore's volume during translocation and therefore produces a transient blockade of the ionic current created by the applied voltage. If it could be demonstrated that each nucleotide in the polymer produced a characteristic modulation of the ionic current during its passage through the nanopore, the sequence of current modulations would reflect the sequence of bases in the polymer. According to this basic concept, nanopores are analogous to a Coulter counter that detects nanoscopic molecules rather than microscopic [1,2]. However, the advantage of nanopores is that individual macromolecules can be characterized because different chemical and physical properties affect their passage through the pore. Because macromolecules can be captured in the pore as well as translocated, the nanopore can be used to detect individual functional complexes that form between a nucleic acid and an enzyme. No other technique has this capability.

  9. Complete amino acid sequence of a histidine-rich proteolytic fragment of human ceruloplasmin.

    PubMed

    Kingston, I B; Kingston, B L; Putnam, F W

    1979-04-01

    The complete amino acid sequence has been determined for a fragment of human ceruloplasmin [ferroxidase; iron(II):oxygen oxidoreductase, EC 1.16.3.1]. The fragment (designated Cp F5) contains 159 amino acid residues and has a molecular weight of 18,650; it lacks carbohydrate, is rich in histidine, and contains one free cysteine that may be part of a copper-binding site. This fragment is present in most commercial preparations of ceruloplasmin, probably owing to proteolytic degradation, but can also be obtained by limited cleavage of single-chain ceruloplasmin with plasmin. Cp F5 probably is an intact domain attached to the COOH-terminal end of single-chain ceruloplasmin via a labile interdomain peptide bond. A model of the secondary structure predicted by empirical methods suggests that almost one-third of the amino acid residues are distributed in alpha helices, about a third in beta-sheet structure, and the remainder in beta turns and unidentified structures. Computer analysis of the amino acid sequence has not demonstrated a statistically significant relationship between this ceruloplasmin fragment and any other protein, but there is some evidence for an internal duplication. PMID:287005

  10. The amino acid sequence of Lady Amherst's pheasant (Chrysolophus amherstiae) and golden pheasant (Chrysolophus pictus) egg-white lysozymes.

    PubMed

    Araki, T; Kuramoto, M; Torikata, T

    1990-09-01

    The amino acids of Lady Amherst's pheasant and golden pheasant egg-white lysozymes have been sequenced. The carboxymethylated lysozymes were digested with trypsin followed by sequencing of the tryptic peptides. Lady Amherst's pheasant lysozyme proved to consist of 129 amino acid residues, and a relative molecular mass of 14,423 Da was calculated. This lysozyme had 6 amino acids substitutions when compared with hen egg-white lysozyme: Phe3 to Tyr, His15 to Leu, Gln41 to His, Asn77 to His, Gln 121 to Asn, and a newly found substitution of Ile124 to Thr. The amino acid sequence of golden pheasant lysozyme was identical to that of Lady Amherst's phesant lysozyme. The phylogenetic tree constructured by the comparison of amino acid sequences of phasianoid birds lysozymes revealed a minimum genetic distance between these pheasants and the turkey-peafowl group. PMID:1368578

  11. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    PubMed Central

    Rhee, Mun Su; Moritz, Brélan E.; Xie, Gary; Glavina del Rio, T.; Dalin, E.; Tice, H.; Bruce, D.; Goodwin, L.; Chertkov, O.; Brettin, T.; Han, C.; Detter, C.; Pitluck, S.; Land, Miriam L.; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, K. T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed. PMID:22675583

  12. Complete amino acid sequence of globin chains and biological activity of fragmented crocodile hemoglobin (Crocodylus siamensis).

    PubMed

    Srihongthong, Saowaluck; Pakdeesuwan, Anawat; Daduang, Sakda; Araki, Tomohiro; Dhiravisit, Apisak; Thammasirirak, Sompong

    2012-08-01

    Hemoglobin, α-chain, β-chain and fragmented hemoglobin of Crocodylus siamensis demonstrated both antibacterial and antioxidant activities. Antibacterial and antioxidant properties of the hemoglobin did not depend on the heme structure but could result from the compositions of amino acid residues and structures present in their primary structure. Furthermore, thirteen purified active peptides were obtained by RP-HPLC analyses, corresponding to fragments in the α-globin chain and the β-globin chain which are mostly located at the N-terminal and C-terminal parts. These active peptides operate on the bacterial cell membrane. The globin chains of Crocodylus siamensis showed similar amino acids to the sequences of Crocodylus niloticus. The novel amino acid substitutions of α-chain and β-chain are not associated with the heme binding site or the bicarbonate ion binding site, but could be important through their interactions with membranes of bacteria. PMID:22648692

  13. [Partial sequence homology of FtsZ in phylogenetics analysis of lactic acid bacteria].

    PubMed

    Zhang, Bin; Dong, Xiu-zhu

    2005-10-01

    FtsZ is a structurally conserved protein, which is universal among the prokaryotes. It plays a key role in prokaryote cell division. A partial fragment of the ftsZ gene about 800bp in length was amplified and sequenced and a partial FtsZ protein phylogenetic tree for the lactic acid bacteria was constructed. By comparing the FtsZ phylogenetic tree with the 16S rDNA tree, it was shown that the two trees were similar in topology. Both trees revealed that Pediococcus spp. were closely related with L. casei group of Lactobacillus spp. , but less related with other lactic acid cocci such as Enterococcus and Streptococcus. The results also showed that the discriminative power of FtsZ was higher than that of 16S rDNA for either inter-species or inter-genus and could be a very useful tool in species identification of lactic acid bacteria. PMID:16342751

  14. Comparative characterization of random-sequence proteins consisting of 5, 12, and 20 kinds of amino acids.

    PubMed

    Tanaka, Junko; Doi, Nobuhide; Takashima, Hideaki; Yanagawa, Hiroshi

    2010-04-01

    Screening of functional proteins from a random-sequence library has been used to evolve novel proteins in the field of evolutionary protein engineering. However, random-sequence proteins consisting of the 20 natural amino acids tend to aggregate, and the occurrence rate of functional proteins in a random-sequence library is low. From the viewpoint of the origin of life, it has been proposed that primordial proteins consisted of a limited set of amino acids that could have been abundantly formed early during chemical evolution. We have previously found that members of a random-sequence protein library constructed with five primitive amino acids show high solubility (Doi et al., Protein Eng Des Sel 2005;18:279-284). Although such a library is expected to be appropriate for finding functional proteins, the functionality may be limited, because they have no positively charged amino acid. Here, we constructed three libraries of 120-amino acid, random-sequence proteins using alphabets of 5, 12, and 20 amino acids by preselection using mRNA display (to eliminate sequences containing stop codons and frameshifts) and characterized and compared the structural properties of random-sequence proteins arbitrarily chosen from these libraries. We found that random-sequence proteins constructed with the 12-member alphabet (including five primitive amino acids and positively charged amino acids) have higher solubility than those constructed with the 20-member alphabet, though other biophysical properties are very similar in the two libraries. Thus, a library of moderate complexity constructed from 12 amino acids may be a more appropriate resource for functional screening than one constructed from 20 amino acids. PMID:20162614

  15. Detection and analysis of diverse herpesviral species by consensus primer PCR.

    PubMed Central

    VanDevanter, D R; Warrener, P; Bennett, L; Schultz, E R; Coulter, S; Garber, R L; Rose, T M

    1996-01-01

    A consensus primer PCR method which amplifies a region of herpesviral DNA-directed DNA polymerase (EC 2.7.7.7) and which uses degenerate primers in a nested format was developed. Primers were designed to target sequences coding for highly conserved amino acid motifs covering a region of approximately 800 bp. The assay was applied to 22 species of herpesviruses (8 human and 14 animal viruses), with PCR products obtained for 21 of 22 viruses. In the process, 14 previously unreported amino acid-coding sequences from herpesviral DNA polymerases were obtained, including regions of human herpesviruses 7 and 8. The 50 to 60 amino acid-coding sequences recovered in the present study were determined to be unique to each viral species studied, with very little sequence variation between strains of a single species when studied. Template dilution studies in the presence of human carrier DNA demonstrated that six human herpesviruses (herpesviruses 1, 2, 3, 4, 5, and 6B) could be detected at levels at or below 100 genome equivalents per 100 ng of carrier DNA. These data suggest that consensus primer PCR targeted to herpesviral DNA polymerase may prove to be useful in the detection and identification of known herpesviruses in clinical samples and the initial characterization of new herpesviral genomes. PMID:8784566

  16. N-Terminal Amino Acid Sequence Determination of Proteins by N-Terminal Dimethyl Labeling: Pitfalls and Advantages When Compared with Edman Degradation Sequence Analysis.

    PubMed

    Chang, Elizabeth; Pourmal, Sergei; Zhou, Chun; Kumar, Rupesh; Teplova, Marianna; Pavletich, Nikola P; Marians, Kenneth J; Erdjument-Bromage, Hediye

    2016-07-01

    In recent history, alternative approaches to Edman sequencing have been investigated, and to this end, the Association of Biomolecular Resource Facilities (ABRF) Protein Sequencing Research Group (PSRG) initiated studies in 2014 and 2015, looking into bottom-up and top-down N-terminal (Nt) dimethyl derivatization of standard quantities of intact proteins with the aim to determine Nt sequence information. We have expanded this initiative and used low picomole amounts of myoglobin to determine the efficiency of Nt-dimethylation. Application of this approach on protein domains, generated by limited proteolysis of overexpressed proteins, confirms that it is a universal labeling technique and is very sensitive when compared with Edman sequencing. Finally, we compared Edman sequencing and Nt-dimethylation of the same polypeptide fragments; results confirm that there is agreement in the identity of the Nt amino acid sequence between these 2 methods. PMID:27006647

  17. N-Terminal Amino Acid Sequence Determination of Proteins by N-Terminal Dimethyl Labeling: Pitfalls and Advantages When Compared with Edman Degradation Sequence Analysis

    PubMed Central

    Chang, Elizabeth; Pourmal, Sergei; Zhou, Chun; Kumar, Rupesh; Teplova, Marianna; Pavletich, Nikola P.; Marians, Kenneth J.

    2016-01-01

    In recent history, alternative approaches to Edman sequencing have been investigated, and to this end, the Association of Biomolecular Resource Facilities (ABRF) Protein Sequencing Research Group (PSRG) initiated studies in 2014 and 2015, looking into bottom-up and top-down N-terminal (Nt) dimethyl derivatization of standard quantities of intact proteins with the aim to determine Nt sequence information. We have expanded this initiative and used low picomole amounts of myoglobin to determine the efficiency of Nt-dimethylation. Application of this approach on protein domains, generated by limited proteolysis of overexpressed proteins, confirms that it is a universal labeling technique and is very sensitive when compared with Edman sequencing. Finally, we compared Edman sequencing and Nt-dimethylation of the same polypeptide fragments; results confirm that there is agreement in the identity of the Nt amino acid sequence between these 2 methods. PMID:27006647

  18. Partial amino acid sequence of fructose-1,6-bisphosphatase from the blue-green algae Synechococcus leopoliensis.

    PubMed

    Marcus, F; Latshaw, S P; Steup, M; Gerbling, K P

    1989-08-01

    Purified fructose-1,6-bisphosphatase from the cyanobacterium Synechococcus leopoliensis was S-carboxymethylated and cleaved with trypsin. The resulting peptides were purified by reversed-phase high performance liquid chromatography and the amino acid sequence of six of the purified peptides was determined by gas-phase microsequencing. The results revealed sequence homology with other fructose-1,6-bisphosphatases. The obtained sequence data provides information required for the design of oligonucleotide hybridization probes to screen existing libraries of cyanobacterial DNA. The determination of the amino acid sequence of cyanobacterial proteins may yield important information with respect to the endosymbiotic theory of evolution. PMID:2550924

  19. Protein sequence analysis by incorporating modified chaos game and physicochemical properties into Chou's general pseudo amino acid composition.

    PubMed

    Xu, Chunrui; Sun, Dandan; Liu, Shenghui; Zhang, Yusen

    2016-10-01

    In this contribution we introduced a novel graphical method to compare protein sequences. By mapping a protein sequence into 3D space based on codons and physicochemical properties of 20 amino acids, we are able to get a unique P-vector from the 3D curve. This approach is consistent with wobble theory of amino acids. We compute the distance between sequences by their P-vectors to measure similarities/dissimilarities among protein sequences. Finally, we use our method to analyze four datasets and get better results compared with previous approaches. PMID:27375218

  20. Nucleotide sequence of the phosphoglycerate kinase gene from the extreme thermophile Thermus thermophilus. Comparison of the deduced amino acid sequence with that of the mesophilic yeast phosphoglycerate kinase.

    PubMed Central

    Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L

    1988-01-01

    Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. Images Fig. 1. PMID:3052437

  1. Bacteria obtained from a sequencing batch reactor that are capable of growth on dehydroabietic acid.

    PubMed Central

    Mohn, W W

    1995-01-01

    Eleven isolates capable of growth on the resin acid dehydroabietic acid (DhA) were obtained from a sequencing batch reactor designed to treat a high-strength process stream from a paper mill. The isolates belonged to two groups, represented by strains DhA-33 and DhA-35, which were characterized. In the bioreactor, bacteria like DhA-35 were more abundant than those like DhA-33. The population in the bioreactor of organisms capable of growth on DhA was estimated to be 1.1 x 10(6) propagules per ml, based on a most-probable-number determination. Analysis of small-subunit rRNA partial sequences indicated that DhA-33 was most closely related to Sphingomonas yanoikuyae (Sab = 0.875) and that DhA-35 was most closely related to Zoogloea ramigera (Sab = 0.849). Both isolates additionally grew on other abietanes, i.e., abietic and palustric acids, but not on the pimaranes, pimaric and isopimaric acids. For DhA-33 and DhA-35 with DhA as the sole organic substrate, doubling times were 2.7 and 2.2 h, respectively, and growth yields were 0.30 and 0.25 g of protein per g of DhA, respectively. Glucose as a cosubstrate stimulated growth of DhA-33 on DhA and stimulated DhA degradation by the culture. Pyruvate as a cosubstrate did not stimulate growth of DhA-35 on DhA and reduced the specific rate of DhA degradation of the culture. DhA induced DhA and abietic acid degradation activities in both strains, and these activities were heat labile. Cell suspensions of both strains consumed DhA at a rate of 6 mumol mg of protein-1 h-1.(ABSTRACT TRUNCATED AT 250 WORDS) PMID:7793937

  2. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  3. Novel method for PIK3CA mutation analysis: locked nucleic acid--PCR sequencing.

    PubMed

    Ang, Daphne; O'Gara, Rebecca; Schilling, Amy; Beadling, Carol; Warrick, Andrea; Troxell, Megan L; Corless, Christopher L

    2013-05-01

    Somatic mutations in PIK3CA are commonly seen in invasive breast cancer and several other carcinomas, occurring in three hotspots: codons 542 and 545 of exon 9 and in codon 1047 of exon 20. We designed a locked nucleic acid (LNA)-PCR sequencing assay to detect low levels of mutant PIK3CA DNA with attention to avoiding amplification of a pseudogene on chromosome 22 that has >95% homology to exon 9 of PIK3CA. We tested 60 FFPE breast DNA samples with known PIK3CA mutation status (48 cases had one or more PIK3CA mutations, and 12 were wild type) as identified by PCR-mass spectrometry. PIK3CA exons 9 and 20 were amplified in the presence or absence of LNA-oligonucleotides designed to bind to the wild-type sequences for codons 542, 545, and 1047, and partially suppress their amplification. LNA-PCR sequencing confirmed all 51 PIK3CA mutations; however, the mutation detection rate by standard Sanger sequencing was only 69% (35 of 51). Of the 12 PIK3CA wild-type cases, LNA-PCR sequencing detected three additional H1047R mutations in "normal" breast tissue and one E545K in usual ductal hyperplasia. Histopathological review of these three normal breast specimens showed columnar cell change in two (both with known H1047R mutations) and apocrine metaplasia in one. The novel LNA-PCR shows higher sensitivity than standard Sanger sequencing and did not amplify the known pseudogene. PMID:23541593

  4. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    PubMed

    Wang, Xiaoyu; Chen, Meili; Xiao, Jingfa; Hao, Lirui; Crowley, David E; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  5. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3

    PubMed Central

    Xiao, Jingfa; Hao, Lirui; Crowley, David E.; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  6. Bile acid sulfotransferase I from rat liver sulfates bile acids and 3-hydroxy steroids: purification, N-terminal amino acid sequence, and kinetic properties.

    PubMed

    Barnes, S; Buchina, E S; King, R J; McBurnett, T; Taylor, K B

    1989-04-01

    A bile acid:3'phosphoadenosine-5'phosphosulfate:sulfotransferase (BAST I) from adult female rat liver cytosol has been purified 157-fold by a two-step isolation procedure. The N-terminal amino acid sequence of the 30,000 subunit has been determined for the first 35 residues. The Vmax of purified BAST I is 18.7 nmol/min per mg protein with N-(3-hydroxy-5 beta-cholanoyl)glycine (glycolithocholic acid) as substrate, comparable to that of the corresponding purified human BAST (Chen, L-J., and I. H. Segel, 1985. Arch. Biochem. Biophys. 241: 371-379). BAST I activity has a broad pH optimum from 5.5-7.5. Although maximum activity occurs with 5 mM MgCl2, Mg2+ is not essential for BAST I activity. The greatest sulfotransferase activity and the highest substrate affinity is observed with bile acids or steroids that have a steroid nucleus containing a 3 beta-hydroxy group and a 5-6 double bond or a trans A-B ring junction. These substrates have normal hyperbolic initial velocity curves with substrate inhibition occurring above 5 microM. Of the saturated 5 beta-bile acids, those with a single 3-hydroxy group are the most active. The addition of a second hydroxy group at the 6- or 7-position eliminates more than 99% of the activity. In contrast, 3 alpha,12 alpha-dihydroxy-5 beta-cholan-24-oic acid (deoxycholic acid) is an excellent substrate. The initial velocity curves for glycolithocholic and deoxycholic acid conjugates are sigmoidal rather than hyperbolic, suggestive of an allosteric effect. Maximum activity is observed at 80 microM for glycolithocholic acid. All substrates, bile acids and steroids, are inhibited by the 5 beta-bile acid, 3-keto-5 beta-cholanoic acid. The data suggest that BAST I is the same protein as hydrosteroid sulfotransferase 2 (Marcus, C. J., et al. 1980. Anal. Biochem. 107: 296-304). PMID:2754334

  7. Phosphorylation of Simian Cytomegalovirus Assembly Protein Precursor (pAPNG.5) and Proteinase Precursor (pAPNG1): Multiple Attachment Sites Identified, Including Two Adjacent Serines in a Casein Kinase II Consensus Sequence

    PubMed Central

    Plafker, Scott M.; Woods, Amina S.; Gibson, Wade

    1999-01-01

    The assembly protein precursor (pAP) of cytomegalovirus (CMV), and its homologs in other herpesviruses, functions at several key steps during the process of capsid formation. This protein, and the genetically related maturational proteinase, is distinguished from the other capsid proteins by posttranslational modifications, including phosphorylation. The objective of this study was to identify sites at which pAP is phosphorylated so that the functional significance of this modification and the enzyme(s) responsible for it can be determined. In the work reported here, we used peptide mapping, mass spectrometry, and site-directed mutagenesis to identify two sets of pAP phosphorylation sites. One is a casein kinase II (CKII) consensus sequence that contains two adjacent serines, both of which are phosphorylated. The other site(s) is in a different domain of the protein, is phosphorylated less frequently than the CKII site, does not require preceding CKII-site phosphorylation, and causes an electrophoretic mobility shift when phosphorylated. Transfection/expression assays for proteolytic activity showed no gross effect of CKII-site phosphorylation on the enzymatic activity of the proteinase or on the substrate behavior of pAP. Evidence is presented that both the CKII sites and the secondary sites are phosphorylated in virus-infected cells and plasmid-transfected cells, indicating that these modifications can be made by a cellular enzyme(s). Apparent compartmental differences in phosphorylation of the CKII-site (cytoplasmic) and secondary-site (nuclear) serines suggest the involvement of more that one enzyme in these modifications. PMID:10516011

  8. Expression of Interferon Consensus Sequence Binding Protein (ICSBP) Is Downregulated in Bcr-Abl-Induced Murine Chronic Myelogenous Leukemia-Like Disease, and Forced Coexpression of ICSBP Inhibits Bcr-Abl-Induced Myeloproliferative Disorder

    PubMed Central

    Hao, Sheryl X.; Ren, Ruibao

    2000-01-01

    Chronic myelogenous leukemia (CML) is a clonal myeloproliferative disorder resulting from the neoplastic transformation of a hematopoietic stem cell. The majority of cases of CML are associated with the (9;22) chromosome translocation that generates the bcr-abl chimeric gene. Alpha interferon (IFN-α) treatment induces hematological remission and prolongs life in 75% of CML patients in the chronic phase. It has been shown that mice deficient in interferon consensus sequence binding protein (ICSBP), a member of the interferon regulatory factor family, manifest a CML-like syndrome. We have shown that expression of Bcr-Abl in bone marrow (BM) cells from 5-fluorouracil (5-FU)-treated mice by retroviral transduction efficiently induces a myeloproliferative disease in mice resembling human CML. To directly test whether icsbp can function as a tumor suppressor gene, we examined the effect of ICSBP on Bcr-Abl-induced CML-like disease using this murine model for CML. We found that expression of the ICSBP protein was significantly decreased in Bcr-Abl-induced CML-like disease. Forced coexpression of ICSBP inhibited the Bcr-Abl-induced colony formation of BM cells from 5-FU-treated mice in vitro and Bcr-Abl-induced CML-like disease in vivo. Interestingly, coexpression of ICSBP and Bcr-Abl induced a transient B-lymphoproliferative disorder in the murine model of Bcr-Abl-induced CML-like disease. Overexpression of ICSBP consistently promotes rather than inhibits Bcr-Abl-induced B lymphoproliferation in a murine model where BM cells from non-5-FU-treated donors were used, indicating that ICSBP has a specific antitumor activity toward myeloid neoplasms. We also found that overexpression of ICSBP negatively regulated normal hematopoiesis. These data provide direct evidence that ICSBP can act as a tumor suppressor that regulates normal and neoplastic proliferation of hematopoietic cells. PMID:10648600

  9. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    NASA Astrophysics Data System (ADS)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  10. Unconventional amino acid sequence of the sun anemone (Stoichactis helianthus) polypeptide neurotoxin

    SciTech Connect

    Kem, W.; Dunn, B.; Parten, B.; Pennington, M.; Price, D.

    1986-05-01

    A 5000 dalton polypeptide neurotoxin (Sh-NI) purified by G50 Sephadex, P-cellulose, and SP-Sephadex chromatography was homogeneous by isoelectric focusing. Sh-NI was highly toxic to crayfish (LD/sub 50/ 0.6 ..mu..g/kg) but without effect upon mice at 15,000 ..mu..g/kg (i.p. injection). The reduced, /sup 3/H-carboxymethylated toxin and its fragments were subjected to automatic Edman degradation and the resulting PTH-amino acids were identified by HPLC, back hydrolysis, and scintillation counting. Peptides resulting from proteolytic (clostripain, staphylococcal protease) and chemical (tryptophan) cleavage were sequenced. The sequence is: AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK. This sequence differs considerably from the homologous Anemonia and Anthopleura toxins; many of the identical residues (6 half-cystines, G9, P10, R13, G19, G29, W30) are probably critical for folding rather than receptor recognition. However, the Sh-NI sequence closely resembles Radioanthus macrodactylus neurotoxin III and r. paumotensis II. The authors propose that Sh-NI and related Radioanthus toxins act upon a different site on the sodium channel.

  11. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, H.U.G.; Gray, J.W.

    1995-06-27

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.

  12. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, Heinz-Ulrich G.; Gray, Joe W.

    1995-01-01

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers ard probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.

  13. Detection of Nucleic Acids with Graphene Nanopores: Ab Initio Characterization of a Novel Sequencing Device

    NASA Astrophysics Data System (ADS)

    Nelson, Tammie; Zhang, Bo; Prezhdo, Oleg

    2010-03-01

    We report an ab initio study of the interaction of two nucleobases, cytosine and adenine, with a novel graphene nanopore device for detecting the base sequence of a single-stranded nucleic acid (ssDNA or RNA). The nucleobases were inserted into a pore in a graphene nanoribbon, and the electrical current and conductance spectra were calculated as functions of voltage applied across the nanoribbon. The conductance spectra and charge densities were analyzed in the presence of each nucleobase in the graphene nanopore. The results indicate that, due to significant differences in the conductance spectra, the proposed device has adequate sensitivity to discriminate between different nucleotides. Moreover, we show that the nucleotide conductance spectra is not affected by its orientation inside the graphene nanopore. The proposed technique may be extremely useful for real applications in developing ultrafast, low cost DNA sequencing methods.

  14. Morphological tranformation of calcite crystal growth by prismatic "acidic" polypeptide sequences.

    SciTech Connect

    Kim, I; Giocondi, J L; Orme, C A; Collino, J; Evans, J S

    2007-02-13

    Many of the interesting mechanical and materials properties of the mollusk shell are thought to stem from the prismatic calcite crystal assemblies within this composite structure. It is now evident that proteins play a major role in the formation of these assemblies. Recently, a superfamily of 7 conserved prismatic layer-specific mollusk shell proteins, Asprich, were sequenced, and the 42 AA C-terminal sequence region of this protein superfamily was found to introduce surface voids or porosities on calcite crystals in vitro. Using AFM imaging techniques, we further investigate the effect that this 42 AA domain (Fragment-2) and its constituent subdomains, DEAD-17 and Acidic-2, have on the morphology and growth kinetics of calcite dislocation hillocks. We find that Fragment-2 adsorbs on terrace surfaces and pins acute steps, accelerates then decelerates the growth of obtuse steps, forms clusters and voids on terrace surfaces, and transforms calcite hillock morphology from a rhombohedral form to a rounded one. These results mirror yet are distinct from some of the earlier findings obtained for nacreous polypeptides. The subdomains Acidic-2 and DEAD-17 were found to accelerate then decelerate obtuse steps and induce oval rather than rounded hillock morphologies. Unlike DEAD-17, Acidic-2 does form clusters on terrace surfaces and exhibits stronger obtuse velocity inhibition effects than either DEAD-17 or Fragment-2. Interestingly, a 1:1 mixture of both subdomains induces an irregular polygonal morphology to hillocks, and exhibits the highest degree of acute step pinning and obtuse step velocity inhibition. This suggests that there is some interplay between subdomains within an intra (Fragment-2) or intermolecular (1:1 mixture) context, and sequence interplay phenomena may be employed by biomineralization proteins to exert net effects on crystal growth and morphology.

  15. Fast computational methods for predicting protein structure from primary amino acid sequence

    DOEpatents

    Agarwal, Pratul Kumar

    2011-07-19

    The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.

  16. Amino-terminal amino acid sequence of the major structural polypeptides of avian retroviruses: sequence homology between reticuloendotheliosis virus p30 and p30s of mammalian retroviruses.

    PubMed Central

    Hunter, E; Bhown, A S; Bennett, J C

    1978-01-01

    The major structural polypeptides, p30 of reticuloendotheliosis virus (REV) (strain T) and p27 of avian sarcoma virus B77, have been compared with regard to amino acid composition. NH2-terminal amino acid sequence, and immunological crossreactions. The amino acid composition of the two polypeptides is distinct, and a comparison of the first 30 NH2-terminal amino acids of REV p30 with that for the first 25 of B77 p27 yields only three homologous residues. In competition radioimmunoassays the polypeptides show no crossreactivity. A comparison of the amino acid composition and NH2-terminal amino acid sequence of REV p30 with those reported for several mammalian retrovirus p30s shows remarkable similarities. Both REV and mammalian p30s contain a large number of polar residues in their amino acid composition and show approximately 40% homology in the first 30 NH2-terminal amino acids. No crossreactivity could be observed, however, in competition radioimmunoassays between Rauscher murine leukemia virus p30 and that of REV. The observations reported here suggest a close evolutionary relationship between REV and the mammalian retroviruses. Images PMID:208072

  17. Purification and amino acid sequence of aminopeptidase P from pig kidney.

    PubMed

    Vergas Romero, C; Neudorfer, I; Mann, K; Schäfer, W

    1995-04-01

    Aminopeptidase P from kidney cortex was purified in high yield (recovery greater than or equal to 20%) by a series of column chromatographic steps after solubilization of the membrane-bound glycoprotein with n-butanol. A coupled enzymic assay, using Gly-Pro-Pro-NH-Nap as substrate and dipeptidyl-peptidase IV as auxilliary enzyme, was used to monitor the purification. The purification procedure yielded two forms of aminopeptidase P differing in their carbohydrate composition (glycoforms). Both enzyme preparations were homogeneous as assessed by SDS/PAGE silver staining, and isoelectric focusing. Both forms possessed the same substrate specificity, catalysed the same reaction, and consisted of identical protein chains. The amino acid sequence determined by Edman degradation and mass spectrometry consisted of 623 amino acids. Six N-glycosylation sites, all contained in the N-terminal half of the protein, were characterized. PMID:7744038

  18. Draft Genome Sequence of Cupriavidus sp. Strain SK-3, a 4-Chlorobiphenyl- and 4-Clorobenzoic Acid-Degrading Bacterium

    PubMed Central

    Vilo, Claudia; Benedik, Michael J.; Ilori, Matthew

    2014-01-01

    We report the draft genome sequence of Cupriavidus sp. strain SK-3, which can use 4-chlorobiphenyl and 4-clorobenzoic acid as the sole carbon source for growth. The draft genome sequence allowed the study of the polychlorinated biphenyl degradation mechanism and the recharacterization of the strain SK-3 as a Cupriavidus species. PMID:24994805

  19. Draft Genome Sequence of Bacillus subtilis subsp. natto Strain CGMCC 2108, a High Producer of Poly-γ-Glutamic Acid

    PubMed Central

    Tan, Siyuan; Su, Anping; Zhang, Chen; Ren, Yuanyuan

    2016-01-01

    Here, we report the 4.1-Mb draft genome sequence of Bacillus subtilis subsp. natto strain CGMCC 2108, a high producer of poly-γ-glutamic acid (γ-PGA). This sequence will provide further help for the biosynthesis of γ-PGA and will greatly facilitate research efforts in metabolic engineering of B. subtilis subsp. natto strain CGMCC 2108. PMID:27231363

  20. New monoclonal antibodies to the Ebola virus glycoprotein: Identification and analysis of the amino acid sequence of the variable domains.

    PubMed

    Panina, A A; Aliev, T K; Shemchukova, O B; Dement'yeva, I G; Varlamov, N E; Pozdnyakova, L P; Bokov, M N; Dolgikh, D A; Sveshnikov, P G; Kirpichnikov, M P

    2016-03-01

    We determined the nucleotide and amino acid sequences of variable domains of three new monoclonal antibodies to the glycoprotein of Ebola virus capsid. The framework and hypervariable regions of immunoglobulin heavy and light chains were identified. The primary structures were confirmed using massspectrometry analysis. Immunoglobulin database search showed the uniqueness of the sequences obtained. PMID:27193713

  1. Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis subsp. lactis TOMSC161, Isolated from a Nonscalded Curd Pressed Cheese

    PubMed Central

    Velly, H.; Abraham, A.-L.; Loux, V.; Delacroix-Buchet, A.; Fonseca, F.; Bouix, M.

    2014-01-01

    Lactococcus lactis is a lactic acid bacterium used in the production of many fermented foods, such as dairy products. Here, we report the genome sequence of L. lactis subsp. lactis TOMSC161, isolated from nonscalded curd pressed cheese. This genome sequence provides information in relation to dairy environment adaptation. PMID:25377704

  2. Draft Genome Sequence of Bacillus subtilis subsp. natto Strain CGMCC 2108, a High Producer of Poly-γ-Glutamic Acid.

    PubMed

    Tan, Siyuan; Meng, Yonghong; Su, Anping; Zhang, Chen; Ren, Yuanyuan

    2016-01-01

    Here, we report the 4.1-Mb draft genome sequence of Bacillus subtilis subsp. natto strain CGMCC 2108, a high producer of poly-γ-glutamic acid (γ-PGA). This sequence will provide further help for the biosynthesis of γ-PGA and will greatly facilitate research efforts in metabolic engineering of B. subtilis subsp. natto strain CGMCC 2108. PMID:27231363

  3. ANTICALIgN: visualizing, editing and analyzing combined nucleotide and amino acid sequence alignments for combinatorial protein engineering.

    PubMed

    Jarasch, Alexander; Kopp, Melanie; Eggenstein, Evelyn; Richter, Antonia; Gebauer, Michaela; Skerra, Arne

    2016-07-01

    ANTIC ALIGN: is an interactive software developed to simultaneously visualize, analyze and modify alignments of DNA and/or protein sequences that arise during combinatorial protein engineering, design and selection. ANTIC ALIGN: combines powerful functions known from currently available sequence analysis tools with unique features for protein engineering, in particular the possibility to display and manipulate nucleotide sequences and their translated amino acid sequences at the same time. ANTIC ALIGN: offers both template-based multiple sequence alignment (MSA), using the unmutated protein as reference, and conventional global alignment, to compare sequences that share an evolutionary relationship. The application of similarity-based clustering algorithms facilitates the identification of duplicates or of conserved sequence features among a set of selected clones. Imported nucleotide sequences from DNA sequence analysis are automatically translated into the corresponding amino acid sequences and displayed, offering numerous options for selecting reading frames, highlighting of sequence features and graphical layout of the MSA. The MSA complexity can be reduced by hiding the conserved nucleotide and/or amino acid residues, thus putting emphasis on the relevant mutated positions. ANTIC ALIGN: is also able to handle suppressed stop codons or even to incorporate non-natural amino acids into a coding sequence. We demonstrate crucial functions of ANTIC ALIGN: in an example of Anticalins selected from a lipocalin random library against the fibronectin extradomain B (ED-B), an established marker of tumor vasculature. Apart from engineered protein scaffolds, ANTIC ALIGN: provides a powerful tool in the area of antibody engineering and for directed enzyme evolution. PMID:27261456

  4. Formation Sequences of Iron Minerals in the Acidic Alteration Products and Variation of Hydrothermal Fluid Conditions

    NASA Astrophysics Data System (ADS)

    Isobe, H.; Yoshizawa, M.

    2008-12-01

    Iron minerals have important role in environmental issues not only on the Earth but also other terrestrial planets. Iron mineral species related to alteration products of primary minerals with surface or subsurface fluids are characterized by temperature, acidity and redox conditions of the fluids. We can see various iron- bearing alteration products in alteration products around fumaroles in geothermal/volcanic areas. In this study, zonal structures of iron minerals in alteration products of the geothermal area are observed to elucidate temporal and spatial variation of hydrothermal fluids. Alteration of the pyroxene-amphibole andesite of Garan-dake volcano, Oita, Japan occurs by the acidic hydrothermal fluid to form cristobalite leaching out elements other than Si. Hand specimens with unaltered or weakly altered core and cristobalite crust show various sequences of layers. XRD analysis revealed that the alteration degree is represented by abundance of cristobalite. Intermediately altered layers are characterized by occurrence including alunite, pyrite, kaolinite, goethite and hematite. A specimen with reddish brown core surrounded by cristobalite-rich white crust has brown colored layers at the boundary of core and the crust. Reddish core is characterized by occurrence of crystalline hematite by XRD. Another hand specimen has light gray core, which represents reduced conditions, and white cristobalite crust with light brown and reddish brown layers of ferric iron minerals between the core and the crust. On the other hand, hornblende crystals, typical ferrous iron-bearing mineral of the host rock, are well preserved in some samples with strongly decolorized cristobalite-rich groundmass. Hydrothermal alteration experiments of iron-rich basaltic material shows iron mineral species depend on acidity and temperature of the fluid. Oxidation states of the iron-bearing mineral species are strongly influenced by the acidity and redox conditions. Variations of alteration

  5. Sequence-specific interaction between HIV-1 matrix protein and viral genomic RNA revealed by in vitro genetic selection.

    PubMed Central

    Purohit, P; Dupont, S; Stevenson, M; Green, M R

    2001-01-01

    The human immunodeficiency virus type-1 matrix protein (HIV-1 MA) is a multifunctional structural protein synthesized as part of the Pr55 gag polyprotein. We have used in vitro genetic selection to identify an RNA consensus sequence that specifically interacts with MA (Kd = 5 x 10(-7) M). This 13-nt MA binding consensus sequence bears a high degree of homology (77%) to a region (nt 1433-1446) within the POL open reading frame of the HIV-1 genome (consensus sequence from 38 HIV-1 strains). Chemical interference experiments identified the nucleotides within the MA binding consensus sequence involved in direct contact with MA. We further demonstrate that this RNA-protein interaction is mediated through a stretch of basic amino acids within MA. Mutations that disrupt the interaction between MA and its RNA binding site within the HIV-1 genome resulted in a measurable decrease in viral replication. PMID:11345436

  6. Multiple Amino Acid Sequence Alignment Nitrogenase Component 1: Insights into Phylogenetics and Structure-Function Relationships

    PubMed Central

    Howard, James B.; Kechris, Katerina J.; Rees, Douglas C.; Glazer, Alexander N.

    2013-01-01

    Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as “core” for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf) yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification provides the bases

  7. Draft Genome Sequences of Gluconobacter cerinus CECT 9110 and Gluconobacter japonicus CECT 8443, Acetic Acid Bacteria Isolated from Grape Must

    PubMed Central

    Sainz, Florencia

    2016-01-01

    We report here the draft genome sequences of Gluconobacter cerinus strain CECT9110 and Gluconobacter japonicus CECT8443, acetic acid bacteria isolated from grape must. Gluconobacter species are well known for their ability to oxidize sugar alcohols into the corresponding acids. Our objective was to select strains to oxidize effectively d-glucose. PMID:27365351

  8. Swfoldrate: predicting protein folding rates from amino acid sequence with sliding window method.

    PubMed

    Cheng, Xiang; Xiao, Xuan; Wu, Zhi-cheng; Wang, Pu; Lin, Wei-zhong

    2013-01-01

    Protein folding is the process by which a protein processes from its denatured state to its specific biologically active conformation. Understanding the relationship between sequences and the folding rates of proteins remains an important challenge. Most previous methods of predicting protein folding rate require the tertiary structure of a protein as an input. In this study, the long-range and short-range contact in protein were used to derive extended version of the pseudo amino acid composition based on sliding window method. This method is capable of predicting the protein folding rates just from the amino acid sequence without the aid of any structural class information. We systematically studied the contributions of individual features to folding rate prediction. The optimal feature selection procedures are adopted by means of combining the forward feature selection and sequential backward selection method. Using the jackknife cross validation test, the method was demonstrated on the large dataset. The predictor was achieved on the basis of multitudinous physicochemical features and statistical features from protein using nonlinear support vector machine (SVM) regression model, the method obtained an excellent agreement between predicted and experimentally observed folding rates of proteins. The correlation coefficient is 0.9313 and the standard error is 2.2692. The prediction server is freely available at http://www.jci-bioinfo.cn/swfrate/input.jsp. PMID:22933332

  9. From amino acid sequence to bioactivity: The biomedical potential of antitumor peptides.

    PubMed

    Blanco-Míguez, Aitor; Gutiérrez-Jácome, Alberto; Pérez-Pérez, Martín; Pérez-Rodríguez, Gael; Catalán-García, Sandra; Fdez-Riverola, Florentino; Lourenço, Anália; Sánchez, Borja

    2016-06-01

    Chemoprevention is the use of natural and/or synthetic substances to block, reverse, or retard the process of carcinogenesis. In this field, the use of antitumor peptides is of interest as, (i) these molecules are small in size, (ii) they show good cell diffusion and permeability, (iii) they affect one or more specific molecular pathways involved in carcinogenesis, and (iv) they are not usually genotoxic. We have checked the Web of Science Database (23/11/2015) in order to collect papers reporting on bioactive peptide (1691 registers), which was further filtered searching terms such as "antiproliferative," "antitumoral," or "apoptosis" among others. Works reporting the amino acid sequence of an antiproliferative peptide were kept (60 registers), and this was complemented with the peptides included in CancerPPD, an extensive resource for antiproliferative peptides and proteins. Peptides were grouped according to one of the following mechanism of action: inhibition of cell migration, inhibition of tumor angiogenesis, antioxidative mechanisms, inhibition of gene transcription/cell proliferation, induction of apoptosis, disorganization of tubulin structure, cytotoxicity, or unknown mechanisms. The main mechanisms of action of those antiproliferative peptides with known amino acid sequences are presented and finally, their potential clinical usefulness and future challenges on their application is discussed. PMID:27010507

  10. The amino acid sequences and activities of synergistic hemolysins from Staphylococcus cohnii.

    PubMed

    Mak, Pawel; Maszewska, Agnieszka; Rozalska, Malgorzata

    2008-10-01

    Staphylococcus cohnii ssp. cohnii and S. cohnii ssp. urealyticus are a coagulase-negative staphylococci considered for a long time as unable to cause infections. This situation changed recently and pathogenic strains of these bacteria were isolated from hospital environments, patients and medical staff. Most of the isolated strains were resistant to many antibiotics. The present work describes isolation and characterization of several synergistic peptide hemolysins produced by these bacteria and acting as virulence factors responsible for hemolytic and cytotoxic activities. Amino acid sequences of respective hemolysins from S. cohnii ssp. cohnii (named as H1C, H2C and H3C) and S. cohnii ssp. urealyticus (H1U, H2U and H3U) were identical. Peptides H1 and H3 possessed significant amino acid homology to three synergistic hemolysins secreted by Staphylococcus lugdunensis and to putative antibacterial peptide produced by Staphylococcus saprophyticus ssp. saprophyticus. On the other hand, hemolysin H2 had a unique sequence. All isolated peptides lysed red cells from different mammalian species and exerted a cytotoxic effect on human fibroblasts. PMID:18752624

  11. Clostridium sticklandii, a specialist in amino acid degradation:revisiting its metabolism through its genome sequence

    PubMed Central

    2010-01-01

    Background Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabolic aspects such as the Stickland reaction, coenzyme-B12- and selenium-dependent reactions of amino acids. With the goal of revisiting its carbon, nitrogen, and energy metabolism, and comparing studies with other clostridia, its genome has been sequenced and analyzed. Results C. sticklandii is one of the best biochemically studied proteolytic clostridial species. Useful additional information has been obtained from the sequencing and annotation of its genome, which is presented in this paper. Besides, experimental procedures reveal that C. sticklandii degrades amino acids in a preferential and sequential way. The organism prefers threonine, arginine, serine, cysteine, proline, and glycine, whereas glutamate, aspartate and alanine are excreted. Energy conservation is primarily obtained by substrate-level phosphorylation in fermentative pathways. The reactions catalyzed by different ferredoxin oxidoreductases and the exergonic NADH-dependent reduction of crotonyl-CoA point to a possible chemiosmotic energy conservation via the Rnf complex. C. sticklandii possesses both the F-type and V-type ATPases. The discovery of an as yet unrecognized selenoprotein in the D-proline reductase operon suggests a more detailed mechanism for NADH-dependent D-proline reduction. A rather unusual metabolic feature is the presence of genes for all the enzymes involved in two different CO2-fixation pathways: C. sticklandii harbours both the glycine synthase/glycine reductase and the Wood-Ljungdahl pathways. This unusual pathway combination has retrospectively been observed in only four other sequenced microorganisms. Conclusions Analysis of the C. sticklandii genome and

  12. Towards Consensus Gene Ages

    PubMed Central

    Liebeskind, Benjamin J.; McWhite, Claire D.; Marcotte, Edward M.

    2016-01-01

    Correctly estimating the age of a gene or gene family is important for a variety of fields, including molecular evolution, comparative genomics, and phylogenetics, and increasingly for systems biology and disease genetics. However, most studies use only a point estimate of a gene’s age, neglecting the substantial uncertainty involved in this estimation. Here, we characterize this uncertainty by investigating the effect of algorithm choice on gene-age inference and calculate consensus gene ages with attendant error distributions for a variety of model eukaryotes. We use 13 orthology inference algorithms to create gene-age datasets and then characterize the error around each age-call on a per-gene and per-algorithm basis. Systematic error was found to be a large factor in estimating gene age, suggesting that simple consensus algorithms are not enough to give a reliable point estimate. We also found that different sources of error can affect downstream analyses, such as gene ontology enrichment. Our consensus gene-age datasets, with associated error terms, are made fully available at so that researchers can propagate this uncertainty through their analyses (geneages.org). PMID:27259914

  13. Spanish Consensus Statement

    PubMed Central

    Rey, Guillermo Álvarez; Cuesta, Jordi Ardevol; Loureda, Rafael Arriaza; España, Fernando Ávila; Matas, Ramón Balius; Pazos, Fernando Baró; de Dios Beas Jiménez, Juan; Rosell, Jorge Candel; Fernandez, César Cobián; Ros, Francisco Esparza; Colmenero, Josefina Espejo; de Prado, Jorge Fernández; Cota, Juan José García; González, Jose Ignacio Garrido; Santander, Manuela González; Munilla, Miguel Ángel Herrador; Ruiz, Francisco Ivorra; Díaz, Fernando Jiménez; Marqueta, Pedro Manonelles; Fernandez, Antonio Maestro; Benito, Juan José Muñoz; Vilás, Ramón Olivé; Teres, Xavier Peirau; Amaro, José Peña; Roque, Juan Pérez San; Parenteu, Christophe Ramírez; Serna, Juan Ribas; Álvarez, Mikel Sánchez; Marchori, Carlos Sanchez; Soto, Miguel del Valle; Alonso, José María Villalón; García, Pedro Guillen; de la Iglesia, Nicolas Hugo; Alcorocho, Juan Manuel Lopez

    2016-01-01

    On the 21st of March, 2015, experts met at Clínica CEMTRO in Madrid, Spain, under the patronage of The Spanish Society for Sports Traumatology (SETRADE), The Spanish Federation of Sports Medicine (FEMEDE), The Spanish Association of Medical Services for Football Clubs (AEMEF), and The Spanish Association of Medical Services for Basketball Clubs (AEMB) with the aim of establishing a round table that would allow specialists to consider the most appropriate current general actions to be taken when treating muscle tears in sport, based on proven scientific data described in the medical literature. Each expert received a questionnaire prior to the aforementioned meeting comprising a set of questions concerning therapeutic indications generally applied in the different stages present during muscle repair. The present Consensus Document is the result of the answers to the questionnaire and resulting discussion and consensus over which are the best current indications in the treatment of muscle tears in sport. Avoiding immobilization, not taking nonsteroidal anti-inflammatory drugs (NSAIDs) randomly, fostering early mobilization, increasing vascularization of injured, site and regulating inflammatory mechanisms—without inhibiting these from the early stages of the recovery period—all stood out as main points of the Consensus Document. Additionally, there is controversy concerning cell stimulation techniques and the use of growth factors or cell inhibitors. The decision concerning discharge was unanimous, as was the criteria considered when it came to performing sport techniques without pain. PMID:27213161

  14. Towards Consensus Gene Ages.

    PubMed

    Liebeskind, Benjamin J; McWhite, Claire D; Marcotte, Edward M

    2016-01-01

    Correctly estimating the age of a gene or gene family is important for a variety of fields, including molecular evolution, comparative genomics, and phylogenetics, and increasingly for systems biology and disease genetics. However, most studies use only a point estimate of a gene's age, neglecting the substantial uncertainty involved in this estimation. Here, we characterize this uncertainty by investigating the effect of algorithm choice on gene-age inference and calculate consensus gene ages with attendant error distributions for a variety of model eukaryotes. We use 13 orthology inference algorithms to create gene-age datasets and then characterize the error around each age-call on a per-gene and per-algorithm basis. Systematic error was found to be a large factor in estimating gene age, suggesting that simple consensus algorithms are not enough to give a reliable point estimate. We also found that different sources of error can affect downstream analyses, such as gene ontology enrichment. Our consensus gene-age datasets, with associated error terms, are made fully available at so that researchers can propagate this uncertainty through their analyses (geneages.org). PMID:27259914

  15. Complete amino acid sequence of the myoglobin from the Pacific spotted dolphin, Stenella attenuata graffmani.

    PubMed

    Jones, B N; Wang, C C; Dwulet, F E; Lehman, L D; Meuth, J L; Bogardt, R A; Gurd, F R

    1979-04-25

    The complete amino acid sequence of the major component myoglobin from the Pacific spotted dolphin, Stenella attenuata graffmani, was determined by the automated Edman degradation of several large peptides obtained by specific cleavage of the protein. The acetimidated apomyoglobin was selectively cleaved at its two methionyl residues with cyanogen bromide and at its three arginyl residues by trypsin. By subjecting four of these peptides and the apomyoglobin to automated Edman degradation, over 80% of the primary structure of the protein was obtained. The remainder of the covalent structure was determined by the sequence analysis of peptides that resulted from further digestion of the central cyanogen bromide fragment. This fragment was cleaved at its glutamyl residues with staphylococcal protease and its lysyl residues with trypsin. The action of trypsin was restricted to the lysyl residues by chemical modification of the single arginyl residue of the fragment with 1,2-cyclohexanedione. The primary structure of this myoglobin proved to be identical with that from the Atlantic bottlenosed dolphin and Pacific common dolphin but differs from the myoglobins of the killer whale and pilot whale at two positions. The above sequence identities and differences reflect the close taxonomic relationship of these five species of Cetacea. PMID:454657

  16. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon.

    PubMed Central

    Yu, J H; Eng, J; Yalow, R S

    1990-01-01

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled pork insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report we describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. We demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in our immunoassay system is only a few percent of that of human insulin. Squirrel monkey glucagon is identical with the usual glucagon found in Old World mammals, which predicts that the glucagons of other New World monkeys would not differ from the usual Old World mammalian glucagon. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species. PMID:2263627

  17. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon

    SciTech Connect

    Yu, Jinghua ); Eng, J.; Yalow, R.S. City Univ. of New York, NY )

    1990-12-01

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled park insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report the authors describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. They demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in their immunoassay system is only a few percent of that of human insulin. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species.

  18. Binding site discovery from nucleic acid sequences by discriminative learning of hidden Markov models

    PubMed Central

    Maaskola, Jonas; Rajewsky, Nikolaus

    2014-01-01

    We present a discriminative learning method for pattern discovery of binding sites in nucleic acid sequences based on hidden Markov models. Sets of positive and negative example sequences are mined for sequence motifs whose occurrence frequency varies between the sets. The method offers several objective functions, but we concentrate on mutual information of condition and motif occurrence. We perform a systematic comparison of our method and numerous published motif-finding tools. Our method achieves the highest motif discovery performance, while being faster than most published methods. We present case studies of data from various technologies, including ChIP-Seq, RIP-Chip and PAR-CLIP, of embryonic stem cell transcription factors and of RNA-binding proteins, demonstrating practicality and utility of the method. For the alternative splicing factor RBM10, our analysis finds motifs known to be splicing-relevant. The motif discovery method is implemented in the free software package Discrover. It is applicable to genome- and transcriptome-scale data, makes use of available repeat experiments and aside from binary contrasts also more complex data configurations can be utilized. PMID:25389269

  19. Nucleotide and derived amino acid sequences of the major porin of Comamonas acidovorans and comparison of porin primary structures.

    PubMed Central

    Gerbl-Rieger, S; Peters, J; Kellermann, J; Lottspeich, F; Baumeister, W

    1991-01-01

    The DNA sequence of the gene which codes for the major outer membrane porin (Omp32) of Comamonas acidovorans has been determined. The structural gene encodes a precursor consisting of 351 amino acid residues with a signal peptide of 19 amino acid residues. Comparisons with amino acid sequences of outer membrane proteins and porins from several other members of the class Proteobacteria and of the Chlamydia trachomatis porin and the Neurospora crassa mitochondrial porin revealed a motif of eight regions of local homology. The results of this analysis are discussed with regard to common structural features of porins. PMID:1848840

  20. Amino acid sequence analysis and characterization of a ribonuclease from starfish Asterias amurensis.

    PubMed

    Motoyoshi, Naomi; Kobayashi, Hiroko; Itagaki, Tadashi; Inokuchi, Norio

    2016-09-01

    The aim of this study was to phylogenetically characterize the location of the RNase T2 enzyme in the starfish (Asterias amurensis). We isolated an RNase T2 ribonuclease (RNase Aa) from the ovaries of starfish and determined its amino acid sequence by protein chemistry and cloning cDNA encoding RNase Aa. The isolated protein had 231 amino acid residues, a predicted molecular mass of 25,906 Da, and an optimal pH of 5.0. RNase Aa preferentially released guanylic acid from the RNA. The catalytic sites of the RNase T2 family are conserved in RNase Aa; furthermore, the distribution of the cysteine residues in RNase Aa is similar to that in other animal and plant T2 RNases. RNase Aa is cleaved at two points: 21 residues from the N-terminus and 29 residues from the C-terminus; however, both fragments may remain attached to the protein via disulfide bridges, leading to the maintenance of its conformation, as suggested by circular dichroism spectrum analysis. The phylogenetic analysis revealed that starfish RNase Aa is evolutionarily an intermediate between protozoan and oyster RNases. PMID:26920046

  1. Achieving consensus in environmental programs

    SciTech Connect

    Kurstedt, Jr., H. A.; Jones, R. M.; Walker, J. A.; Middleman, L. I.

    1989-01-01

    In this paper, we describe a new research effort on consensus tied to the Environmental Restoration Program (ERP) within the US Department of Energy's Office of Defense Waste and Transportation Management (DWTM). We define consensus and explain why consensus decisions are not merely desirable but necessary in furthering ERP activities. As examples of our planned applied research, we first discuss Nominal Group Technique as a representative consensus-generating tool, and we conclude by describing the consensus-related mission of the Waste Management Review Group, established at Virginia Tech to conduct independent, third-party review of DWTM/ERP plans and activities. 10 refs.

  2. Full Genome Virus Detection in Fecal Samples Using Sensitive Nucleic Acid Preparation, Deep Sequencing, and a Novel Iterative Sequence Classification Algorithm

    PubMed Central

    Cotten, Matthew; Oude Munnink, Bas; Canuti, Marta; Deijs, Martin; Watson, Simon J.; Kellam, Paul; van der Hoek, Lia

    2014-01-01

    We have developed a full genome virus detection process that combines sensitive nucleic acid preparation optimised for virus identification in fecal material with Illumina MiSeq sequencing and a novel post-sequencing virus identification algorithm. Enriched viral nucleic acid was converted to double-stranded DNA and subjected to Illumina MiSeq sequencing. The resulting short reads were processed with a novel iterative Python algorithm SLIM for the identification of sequences with homology to known viruses. De novo assembly was then used to generate full viral genomes. The sensitivity of this process was demonstrated with a set of fecal samples from HIV-1 infected patients. A quantitative assessment of the mammalian, plant, and bacterial virus content of this compartment was generated and the deep sequencing data were sufficient to assembly 12 complete viral genomes from 6 virus families. The method detected high levels of enteropathic viruses that are normally controlled in healthy adults, but may be involved in the pathogenesis of HIV-1 infection and will provide a powerful tool for virus detection and for analyzing changes in the fecal virome associated with HIV-1 progression and pathogenesis. PMID:24695106

  3. Microscopic enteritis: Bucharest consensus.

    PubMed

    Rostami, Kamran; Aldulaimi, David; Holmes, Geoffrey; Johnson, Matt W; Robert, Marie; Srivastava, Amitabh; Fléjou, Jean-François; Sanders, David S; Volta, Umberto; Derakhshan, Mohammad H; Going, James J; Becheanu, Gabriel; Catassi, Carlo; Danciu, Mihai; Materacki, Luke; Ghafarzadegan, Kamran; Ishaq, Sauid; Rostami-Nejad, Mohammad; Peña, A Salvador; Bassotti, Gabrio; Marsh, Michael N; Villanacci, Vincenzo

    2015-03-01

    Microscopic enteritis (ME) is an inflammatory condition of the small bowel that leads to gastrointestinal symptoms, nutrient and micronutrient deficiency. It is characterised by microscopic or sub-microscopic abnormalities such as microvillus changes and enterocytic alterations in the absence of definite macroscopic changes using standard modern endoscopy. This work recognises a need to characterize disorders with microscopic and submicroscopic features, currently regarded as functional or non-specific entities, to obtain further understanding of their clinical relevance. The consensus working party reviewed statements about the aetiology, diagnosis and symptoms associated with ME and proposes an algorithm for its investigation and treatment. Following the 5(th) International Course in Digestive Pathology in Bucharest in November 2012, an international group of 21 interested pathologists and gastroenterologists formed a working party with a view to formulating a consensus statement on ME. A five-step agreement scale (from strong agreement to strong disagreement) was used to score 21 statements, independently. There was strong agreement on all statements about ME histology (95%-100%). Statements concerning diagnosis achieved 85% to 100% agreement. A statement on the management of ME elicited agreement from the lowest rate (60%) up to 100%. The remaining two categories showed general agreement between experts on clinical presentation (75%-95%) and pathogenesis (80%-90%) of ME. There was strong agreement on the histological definition of ME. Weaker agreement on management indicates a need for further investigations, better definitions and clinical trials to produce quality guidelines for management. This ME consensus is a step toward greater recognition of a significant entity affecting symptomatic patients previously labelled as non-specific or functional enteropathy. PMID:25759526

  4. Microscopic enteritis: Bucharest consensus

    PubMed Central

    Rostami, Kamran; Aldulaimi, David; Holmes, Geoffrey; Johnson, Matt W; Robert, Marie; Srivastava, Amitabh; Fléjou, Jean-François; Sanders, David S; Volta, Umberto; Derakhshan, Mohammad H; Going, James J; Becheanu, Gabriel; Catassi, Carlo; Danciu, Mihai; Materacki, Luke; Ghafarzadegan, Kamran; Ishaq, Sauid; Rostami-Nejad, Mohammad; Peña, A Salvador; Bassotti, Gabrio; Marsh, Michael N; Villanacci, Vincenzo

    2015-01-01

    Microscopic enteritis (ME) is an inflammatory condition of the small bowel that leads to gastrointestinal symptoms, nutrient and micronutrient deficiency. It is characterised by microscopic or sub-microscopic abnormalities such as microvillus changes and enterocytic alterations in the absence of definite macroscopic changes using standard modern endoscopy. This work recognises a need to characterize disorders with microscopic and submicroscopic features, currently regarded as functional or non-specific entities, to obtain further understanding of their clinical relevance. The consensus working party reviewed statements about the aetiology, diagnosis and symptoms associated with ME and proposes an algorithm for its investigation and treatment. Following the 5th International Course in Digestive Pathology in Bucharest in November 2012, an international group of 21 interested pathologists and gastroenterologists formed a working party with a view to formulating a consensus statement on ME. A five-step agreement scale (from strong agreement to strong disagreement) was used to score 21 statements, independently. There was strong agreement on all statements about ME histology (95%-100%). Statements concerning diagnosis achieved 85% to 100% agreement. A statement on the management of ME elicited agreement from the lowest rate (60%) up to 100%. The remaining two categories showed general agreement between experts on clinical presentation (75%-95%) and pathogenesis (80%-90%) of ME. There was strong agreement on the histological definition of ME. Weaker agreement on management indicates a need for further investigations, better definitions and clinical trials to produce quality guidelines for management. This ME consensus is a step toward greater recognition of a significant entity affecting symptomatic patients previously labelled as non-specific or functional enteropathy. PMID:25759526

  5. Aspartyl-tRNA synthetase from Escherichia coli: cloning and characterisation of the gene, homologies of its translated amino acid sequence with asparaginyl- and lysyl-tRNA synthetases.

    PubMed Central

    Eriani, G; Dirheimer, G; Gangloff, J

    1990-01-01

    By screening of an Escherichia coli plasmidic library using antibodies against aspartyl-tRNA synthetase (AspRS) several clones were obtained containing aspS, the gene coding for AspRS. We report here the nucleotide sequence of aspS and the corresponding primary structure of the aspartyl-tRNA synthetase, a protein of 590 amino acid residues with a Mr 65,913, a value in close agreement with that observed for the purified protein. Primer extension analysis of the aspS mRNA using reverse transcriptase located its 5'-end at 94 nucleotides upstream of the translation initiation AUG; nuclease S1 analysis located the 3'-end at 126 nucleotides downstream of the stop codon UGA. Comparison of the DNA-derived protein sequence with known aminoacyl-tRNA sequences revealed important homologies with asparaginyl- and lysyl-tRNA synthetases from E.coli; more than 25% of their amino acid residues are identical, the homologies being distributed preferencially in the first part and the carboxy-terminal end of the molecule. Mutagenesis directed towards a consensus tetrapeptide (Gly-Leu-Asp-Arg) and the carboxy-terminal end showed that both domains could be implicated in catalysis as well as in ATP binding. Images PMID:2129559

  6. Evolutionary connections of biological kingdoms based on protein and nucleic acid sequence evidence

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.

    1983-01-01

    Prokaryotic and eukaryotic evolutionary trees are developed from protein and nucleic-acid sequences by the methods of numerical taxonomy. Trees are presented for bacterial ferredoxins, 5S ribosomal RNA, c-type cytochromes , cytochromes c2 and c', and 5.8S ribosomal RNA; the implications for early evolution are discussed; and a composite tree showing the branching of the anaerobes, aerobes, archaebacteria, and eukaryotes is shown. Single lines are found for all oxygen-evolving photosynthetic forms and for the salt-loving and high-temperature forms of archaebacteria. It is argued that the eukaryote mitochondria, chloroplasts, and cytoplasmic host material are descended from free-living prokaryotes that formed symbiotic associations, with more than one symbiotic event involved in the evolution of each organelle.

  7. The amino acid alphabet and the architecture of the protein sequence-structure map. I. Binary alphabets.

    PubMed

    Ferrada, Evandro

    2014-12-01

    The correspondence between protein sequences and structures, or sequence-structure map, relates to fundamental aspects of structural, evolutionary and synthetic biology. The specifics of the mapping, such as the fraction of accessible sequences and structures, or the sequences' ability to fold fast, are dictated by the type of interactions between the monomers that compose the sequences. The set of possible interactions between monomers is encapsulated by the potential energy function. In this study, I explore the impact of the relative forces of the potential on the architecture of the sequence-structure map. My observations rely on simple exact models of proteins and random samples of the space of potential energy functions of binary alphabets. I adopt a graph perspective and study the distribution of viable sequences and the structures they produce, as networks of sequences connected by point mutations. I observe that the relative proportion of attractive, neutral and repulsive forces defines types of potentials, that induce sequence-structure maps of vastly different architectures. I characterize the properties underlying these differences and relate them to the structure of the potential. Among these properties are the expected number and relative distribution of sequences associated to specific structures and the diversity of structures as a function of sequence divergence. I study the types of binary potentials observed in natural amino acids and show that there is a strong bias towards only some types of potentials, a bias that seems to characterize the folding code of natural proteins. I discuss implications of these observations for the architecture of the sequence-structure map of natural proteins, the construction of random libraries of peptides, and the early evolution of the natural amino acid alphabet. PMID:25473967

  8. The Amino Acid Alphabet and the Architecture of the Protein Sequence-Structure Map. I. Binary Alphabets

    PubMed Central

    Ferrada, Evandro

    2014-01-01

    The correspondence between protein sequences and structures, or sequence-structure map, relates to fundamental aspects of structural, evolutionary and synthetic biology. The specifics of the mapping, such as the fraction of accessible sequences and structures, or the sequences' ability to fold fast, are dictated by the type of interactions between the monomers that compose the sequences. The set of possible interactions between monomers is encapsulated by the potential energy function. In this study, I explore the impact of the relative forces of the potential on the architecture of the sequence-structure map. My observations rely on simple exact models of proteins and random samples of the space of potential energy functions of binary alphabets. I adopt a graph perspective and study the distribution of viable sequences and the structures they produce, as networks of sequences connected by point mutations. I observe that the relative proportion of attractive, neutral and repulsive forces defines types of potentials, that induce sequence-structure maps of vastly different architectures. I characterize the properties underlying these differences and relate them to the structure of the potential. Among these properties are the expected number and relative distribution of sequences associated to specific structures and the diversity of structures as a function of sequence divergence. I study the types of binary potentials observed in natural amino acids and show that there is a strong bias towards only some types of potentials, a bias that seems to characterize the folding code of natural proteins. I discuss implications of these observations for the architecture of the sequence-structure map of natural proteins, the construction of random libraries of peptides, and the early evolution of the natural amino acid alphabet. PMID:25473967

  9. Trypsin inhibitors from ridged gourd (Luffa acutangula Linn.) seeds: purification, properties, and amino acid sequences.

    PubMed

    Haldar, U C; Saha, S K; Beavis, R C; Sinha, N K

    1996-02-01

    Two trypsin inhibitors, LA-1 and LA-2, have been isolated from ridged gourd (Luffa acutangula Linn.) seeds and purified to homogeneity by gel filtration followed by ion-exchange chromatography. The isoelectric point is at pH 4.55 for LA-1 and at pH 5.85 for LA-2. The Stokes radius of each inhibitor is 11.4 A. The fluorescence emission spectrum of each inhibitor is similar to that of the free tyrosine. The biomolecular rate constant of acrylamide quenching is 1.0 x 10(9) M-1 sec-1 for LA-1 and 0.8 x 10(9) M-1 sec-1 for LA-2 and that of K2HPO4 quenching is 1.6 x 10(11) M-1 sec-1 for LA-1 and 1.2 x 10(11) M-1 sec-1 for LA-2. Analysis of the circular dichroic spectra yields 40% alpha-helix and 60% beta-turn for La-1 and 45% alpha-helix and 55% beta-turn for LA-2. Inhibitors LA-1 and LA-2 consist of 28 and 29 amino acid residues, respectively. They lack threonine, alanine, valine, and tryptophan. Both inhibitors strongly inhibit trypsin by forming enzyme-inhibitor complexes at a molar ratio of unity. A chemical modification study suggests the involvement of arginine of LA-1 and lysine of LA-2 in their reactive sites. The inhibitors are very similar in their amino acid sequences, and show sequence homology with other squash family inhibitors. PMID:8924202

  10. Microfluidic platform for isolating nucleic acid targets using sequence specific hybridization

    PubMed Central

    Wang, Jingjing; Morabito, Kenneth; Tang, Jay X.; Tripathi, Anubhav

    2013-01-01

    The separation of target nucleic acid sequences from biological samples has emerged as a significant process in today's diagnostics and detection strategies. In addition to the possible clinical applications, the fundamental understanding of target and sequence specific hybridization on surface modified magnetic beads is of high value. In this paper, we describe a novel microfluidic platform that utilizes a mobile magnetic field in static microfluidic channels, where single stranded DNA (ssDNA) molecules are isolated via nucleic acid hybridization. We first established efficient isolation of biotinylated capture probe (BP) using streptavidin-coated magnetic beads. Subsequently, we investigated the hybridization of target ssDNA with BP bound to beads and explained these hybridization kinetics using a dual-species kinetic model. The number of hybridized target ssDNA molecules was determined to be about 6.5 times less than that of BP on the bead surface, due to steric hindrance effects. The hybridization of target ssDNA with non-complementary BP bound to bead was also examined, and non-specific hybridization was found to be insignificant. Finally, we demonstrated highly efficient capture and isolation of target ssDNA in the presence of non-target ssDNA, where as low as 1% target ssDNA can be detected from mixture. The microfluidic method described in this paper is significantly relevant and is broadly applicable, especially towards point-of-care biological diagnostic platforms that require binding and separation of known target biomolecules, such as RNA, ssDNA, or protein. PMID:24404041

  11. Characterization of N-glycosylation and amino acid sequence features of immunoglobulins from swine.

    PubMed

    Lopez, Paul G; Girard, Lauren; Buist, Marjorie; de Oliveira, Andrey Giovanni Gomes; Bodnar, Edward; Salama, Apolline; Soulillou, Jean-Paul; Perreault, Hélène

    2016-02-01

    The primary goal of this study was to develop a method to study the N-glycosylation of IgG from swine in order to detect epitopes containing N-glycolylneuraminic acid (Neu5Gc) and/or terminal galactose residues linked in α1-3 susceptible to cause xenograft-related problems. Samples of immunoglobulin were isolated from porcine serum using protein-A affinity chromatography. The eluate was then separated on electrophoretic gel, and bands corresponding to the N-glycosylated heavy chains were cut off the gel and subjected to tryptic digestion. Peptides and glycopeptides were separated by reversed phase liquid chromatography and fractions were collected for matrix-assisted laser desorption/ionization time-of-flight mass spectrometric (MALDI-TOF-MS) analysis. Overall no α1-3 galactose was detected, as demonstrated by complete susceptibility of terminal galactose residues to β-galactosidase digestion. Neu5Gc was detected on singly sialylated structures. Two major N-glycopeptides were found, EEQFNSTYR and EAQFNSTYR as determined by tandem MS (MS/MS), as previously reported by Butler et al. (Immunogenetics, 61, 2009, 209-230), who found 11 subclasses for porcine IgG. Out of the 11, ten include the sequence corresponding to EEQFNSTYR, and only one codes for EAQFNSTYR. In this study, glycosylation patterns associated with both chains were slightly different, in that EEQFNSTYR had a higher content of galactose. The last step of this study consisted of peptide-mapping the 11 reported porcine IgG sequences. Although there was considerable overlap, at least one unique tryptic peptide was found per IgG sequence. The workflow presented in this manuscript constitutes the first study to use MALDI-TOF-MS in the investigation of porcine IgG structural features. PMID:26586247

  12. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    SciTech Connect

    Myers, G.; Korber, B.; Wain-Hobson, S.; Smith, R.F.; Pavlakis, G.N.

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  13. Lactic acid production from potato peel waste by anaerobic sequencing batch fermentation using undefined mixed culture.

    PubMed

    Liang, Shaobo; McDonald, Armando G; Coats, Erik R

    2015-11-01

    Lactic acid (LA) is a necessary industrial feedstock for producing the bioplastic, polylactic acid (PLA), which is currently produced by pure culture fermentation of food carbohydrates. This work presents an alternative to produce LA from potato peel waste (PPW) by anaerobic fermentation in a sequencing batch reactor (SBR) inoculated with undefined mixed culture from a municipal wastewater treatment plant. A statistical design of experiments approach was employed using set of 0.8L SBRs using gelatinized PPW at a solids content range from 30 to 50 g L(-1), solids retention time of 2-4 days for yield and productivity optimization. The maximum LA production yield of 0.25 g g(-1) PPW and highest productivity of 125 mg g(-1) d(-1) were achieved. A scale-up SBR trial using neat gelatinized PPW (at 80 g L(-1) solids content) at the 3 L scale was employed and the highest LA yield of 0.14 g g(-1) PPW and a productivity of 138 mg g(-1) d(-1) were achieved with a 1 d SRT. PMID:25708409

  14. Bacterial community compositions in sediment polluted by perfluoroalkyl acids (PFAAs) using Illumina high-throughput sequencing.

    PubMed

    Sun, Yajun; Wang, Tieyu; Peng, Xiawei; Wang, Pei; Lu, Yonglong

    2016-06-01

    The characterization of bacterial community compositions and the change in perfluoroalkyl acids (PFAAs) along a natural river distribution system were explored in the present study. Illumina high-throughput sequencing was used to explore bacterial community diversity and structure in sediment polluted by PFAAs from the Xiaoqing River, the area with concentrated fluorochemical facilities in China. The concentration of PFAAs was in the range of 8.44-465.60 ng/g dry weight (dw) in sediment. Perfluorooctanoic acid (PFOA) was the dominant PFAA in all samples, which accounted for 94.2 % of total PFAAs. High-level PFOA could lead to an obvious increase in relative abundance of Proteobacteria, ε-Proteobacteria, Thiobacillus, and Sulfurimonas and the decrease in relative abundance of other bacteria. Redundancy analysis revealed that PFOA played an important role in the formation of bacterial community, and PFOA at higher concentration could reduce the diversity of bacterial community. When the concentration of PFOA was below 100 ng/g dw in sediment, no significant effect on microbial community structure was observed. Thiobacillus and Sulfurimonas were positively correlated with the concentration of PFOA, suggesting that both genera were resistant to PFOA contamination. PMID:26780047

  15. Mass spectrometric detection of the amino acid sequence polymorphism of the hepatitis C virus antigen.

    PubMed

    Kaysheva, A L; Ivanov, Yu D; Frantsuzov, P A; Krohin, N V; Pavlova, T I; Uchaikin, V F; Konev, V А; Kovalev, O B; Ziborov, V S; Archakov, A I

    2016-03-01

    A method for detection and identification of the hepatitis C virus antigen (HCVcoreAg) in human serum with consideration for possible amino acid substitutions is proposed. The method is based on a combination of biospecific capturing and concentrating of the target protein on the surface of the chip for atomic force microscope (AFM chip) with subsequent protein identification by tandem mass spectrometric (MS/MS) analysis. Biospecific AFM-capturing of viral particles containing HCVcoreAg from serum samples was performed by use of AFM chips with monoclonal antibodies (anti-HCVcore) covalently immobilized on the surface. Biospecific complexes were registered and counted by AFM. Further MS/MS analysis allowed to reliably identify the HCVcoreAg in the complexes formed on the AFM chip surface. Analysis of MS/MS spectra, with the account taken of the possible polymorphisms in the amino acid sequence of the HCVcoreAg, enabled us to increase the number of identified peptides. PMID:26773170

  16. Peptide sequencing by using a combination of partial acid hydrolysis and fast-atom-bombardment mass spectrometry.

    PubMed Central

    De Angelis, F; Botta, M; Ceccarelli, S; Nicoletti, R

    1986-01-01

    To overcome the limit of the intensity of ions carrying sequence information in structural determinations of peptides by fast-atom-bombardment m.s., we have developed a method that consists in taking spectra of the peptide acid hydrolysates at different hydrolysis times. Peaks correspond to the oligomers arising from the peptide partial hydrolysis. The sequence can then be identified from the structurally overlapping fragments. PMID:2428356

  17. Canine preprorelaxin: nucleic acid sequence and localization within the canine placenta.

    PubMed

    Klonisch, T; Hombach-Klonisch, S; Froehlich, C; Kauffold, J; Steger, K; Steinetz, B G; Fischer, B

    1999-03-01

    Employing uteroplacental tissue at Day 35 of gestation, we determined the nucleic acid sequence of canine preprorelaxin using reverse transcription- and rapid amplification of cDNA ends-polymerase chain reaction. Canine preprorelaxin cDNA consisted of 534 base pairs encoding a protein of 177 amino acids with a signal peptide of 25 amino acids (aa), a B domain of 35 aa, a C domain of 93 aa, and an A domain of 24 aa. The putative receptor binding region in the N'-terminal part of the canine relaxin B domain GRDYVR contained two substitutions from the classical motif (E-->D and L-->Y). Canine preprorelaxin shared highest homology with porcine and equine preprorelaxin. Northern analysis revealed a 1-kilobase transcript present in total RNA of canine uteroplacental tissue but not of kidney tissue. Uteroplacental tissue from two bitches each at Days 30 and 35 of gestation were studied by in situ hybridization to localize relaxin mRNA. Immunohistochemistry for relaxin, cytokeratin, vimentin, and von Willebrand factor was performed on uteroplacental tissue at Day 30 of gestation. The basal cell layer at the core of the chorionic villi was devoid of relaxin mRNA and immunoreactive relaxin or vimentin but was immunopositive for cytokeratin and identified as cytotrophoblast cells. The cell layer surrounding the chorionic villi displayed specific hybridization signals for relaxin mRNA and immunoreactivity for relaxin and cytokeratin but not for vimentin, and was identified as syncytiotrophoblast. Those areas of the chorioallantoic tissue with most intense relaxin immunoreactivity were highly vascularized as demonstrated by immunoreactive von Willebrand factor expressed on vascular endothelium. The uterine glands and nonplacental uterine areas of the canine zonary girdle placenta were devoid of relaxin mRNA and relaxin. We conclude that the syncytiotrophoblast is the source of relaxin in the canine placenta. PMID:10026098

  18. Purification and partial amino acid sequence of the chloroplast cytochrome b-559.

    PubMed

    Widger, W R; Cramer, W A; Hermodson, M; Meyer, D; Gullifor, M

    1984-03-25

    The hydrophobic cytochrome b-559, purified from unstacked, ethanol-washed spinach thylakoid membranes, using extraction with 2% Triton X-100 in 4 M urea and three chromatographic steps in the presence of protease inhibitors, has a dominant band on sodium dodecyl sulfate-urea gels corresponding to Mr = 10,000. The yield of this preparation is 30-50% (5-10 mg) starting with 600 mg of chlorophyll. The heme content yields a calculated molecular weight of no more than 17,500/heme, and perhaps somewhat smaller after correction for impurities. The Mr = 10,000 band is stained by the tetramethylbenzidine-H2O2 heme reagent on lithium dodecyl sulfate gels run at 0 degrees C. The Mr = 10,000 protein, further separated by high performance liquid chromatography, contains a unique NH2 terminus that is not blocked, and the amino acid sequence for the first 27 residues is NH2-Ser-Gly-Ser-Thr-Gly-Glu-Arg-Ser-Phe-Ala-Asp-Ile-Ile-Thr-Ser-Ile-Arg-Tyr-Trp -Val-Ile-X-Ser-Ile-Thr-Ile-Pro. . . COOH. Approximately 55% of the amino acids are hydrophobic, based on amino acid analysis of the Mr = 10,000 peptide, which also indicated the presence of at least one histidine. Only one cytochrome b-559 component could be identified, whose yield indicated that it arises from a single b-559 protein in chloroplasts corresponding to the in situ high potential cytochrome of the chloroplast photosystem II. PMID:6706983

  19. Sequence-Specific Electrical Purification of Nucleic Acids with Nanoporous Gold Electrodes.

    PubMed

    Daggumati, Pallavi; Appelt, Sandra; Matharu, Zimple; Marco, Maria L; Seker, Erkin

    2016-06-22

    Nucleic-acid-based biosensors have enabled rapid and sensitive detection of pathogenic targets; however, these devices often require purified nucleic acids for analysis since the constituents of complex biological fluids adversely affect sensor performance. This purification step is typically performed outside the device, thereby increasing sample-to-answer time and introducing contaminants. We report a novel approach using a multifunctional matrix, nanoporous gold (np-Au), which enables both detection of specific target sequences in a complex biological sample and their subsequent purification. The np-Au electrodes modified with 26-mer DNA probes (via thiol-gold chemistry) enabled sensitive detection and capture of complementary DNA targets in the presence of complex media (fetal bovine serum) and other interfering DNA fragments in the range of 50-1500 base pairs. Upon capture, the noncomplementary DNA fragments and serum constituents of varying sizes were washed away. Finally, the surface-bound DNA-DNA hybrids were released by electrochemically cleaving the thiol-gold linkage, and the hybrids were iontophoretically eluted from the nanoporous matrix. The optical and electrophoretic characterization of the analytes before and after the detection-purification process revealed that low target DNA concentrations (80 pg/μL) can be successfully detected in complex biological fluids and subsequently released to yield pure hybrids free of polydisperse digested DNA fragments and serum biomolecules. Taken together, this multifunctional platform is expected to enable seamless integration of detection and purification of nucleic acid biomarkers of pathogens and diseases in miniaturized diagnostic devices. PMID:27244455

  20. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    NASA Astrophysics Data System (ADS)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  1. Surface Hopping by Consensus.

    PubMed

    Martens, Craig C

    2016-07-01

    We present a new stochastic surface hopping method for modeling molecular dynamics with electronic transitions. The approach, consensus surface hopping (CSH), is a numerical framework for solving the semiclassical limit Liouville equation describing nuclear dynamics on coupled electronic surfaces using ensembles of trajectories. In contrast to existing techniques based on propagating independent classical trajectories that undergo stochastic hops between the electronic states, the present method determines the probabilities of transition of each trajectory collectively with input from the entire ensemble. The full coherent dynamics of the coupled system arise naturally at the ensemble level and ad hoc corrections, such as momentum rescaling to impose strict trajectory energy conservation and artificial decoherence to avoid the overcoherence of the quantum states associated with independent trajectories, are avoided. PMID:27345103

  2. Complete amino acid sequence of the medium-chain S-acyl fatty acid synthetase thio ester hydrolase from rat mammary gland

    SciTech Connect

    Randhawa, Z.I.; Smith, S.

    1987-03-10

    The complete amino acid sequence of the medium-chain S-acyl fatty acid synthetase thio ester hydrolase (thioesterase II) from rat mammary gland is presented. Most of the sequence was derived by analysis of (/sup 14/C)-labelled peptide fragments produced by cleavage at methionyl, glutamyl, lysyl, arginyl, and tryptophanyl residues. A small section of the sequence was deduced from a previously analyzed cDNA clone. The protein consists of 260 residues and has a blocked amino-terminal methionine and calculated M/sub r/ of 29,212. The carboxy-terminal sequence, verified by Edman degradation of the carboxy-terminal cyanogen bromide fragment and carboxypeptidase Y digestion of the intact thioesterase II, terminates with a serine residue and lacks three additional residues predicted by the cDNA sequence. The native enzyme contains three cysteine residues but no disulfide bridges. The active site serine residue is located at position 101. The rat mammary gland thioesterase II exhibits approximately 40% homology with a thioesterase from mallard uropygial gland, the sequence of which was recently determined by cDNA analysis. Thus the two enzymes may share similar structural features and a common evolutionary origin. The location of the active site in these thioesterases differs from that of other serine active site esterases; indeed, the enzymes do not exhibit any significant homology with other serine esterases, suggesting that they may constitute a separate new family of serine active site enzymes.

  3. The complete amino acid sequence of the A-chain of human plasma alpha 2HS-glycoprotein.

    PubMed

    Yoshioka, Y; Gejyo, F; Marti, T; Rickli, E E; Bürgi, W; Offner, G D; Troxler, R F; Schmid, K

    1986-02-01

    Normal human plasma alpha 2HS-glycoprotein has earlier been shown to be comprised of two polypeptide chains. Recently, the amino acid and carbohydrate sequences of the short chain were elucidated (Gejyo, F., Chang, J.-L., Bürgi, W., Schmid, K., Offner, G. D., Troxler, R.F., van Halbeck, H., Dorland, L., Gerwig, G. J., and Vliegenthart, J.F.G. (1983) J. Biol. Chem. 258, 4966-4971). In the present study, the amino acid sequence of the long chain of this protein, designated A-chain, was determined and found to consist of 282 amino acid residues. Twenty-four amino acid doublets were found; the most abundant of these are Pro-Pro and Ala-Ala which each occur five times. Of particular interest is the presence of three Gly-X-Pro and one Gly-Pro-X sequences that are characteristic of the repeating sequences of collagens. Chou-Fasman evaluation of the secondary structure suggested that the A-chain contains 29% alpha-helix, 24% beta-pleated sheet, and 26% reverse turns and, thus, approximately 80% of the polypeptide chain may display ordered structure. Four glycosylation sites were identified. The two N-glycosidic oligosaccharides were found in the center region (residues 138 and 158), whereas the two O-glycosidic heterosaccharides, both linked to threonine (residues 238 and 252), occur within the carboxyl-terminal region. The N-glycans are linked to Asn residues in beta-turns, while the O-glycans are located in short random segments. Comparison of the sequence of the amino- and carboxyl-terminal 30 residues with protein sequences in a data bank demonstrated that the A-chain is not significantly related to any known proteins. However, the proline-rich carboxyl-terminal region of the A-chain displays some sequence similarity to collagens and the collagen-like domains of complement subcomponent C1q. PMID:3944104

  4. Analysis of the functional domains of biosynthetic threonine deaminase by comparison of the amino acid sequences of three wild-type alleles to the amino acid sequence of biodegradative threonine deaminase.

    PubMed

    Taillon, B E; Little, R; Lawther, R P

    1988-03-31

    The nucleotide sequence of the gene, ilvA, for biosynthetic threonine deaminase (Tda) from Salmonella typhimurium was determined. The deduced amino acid sequence was compared with the deduced amino acid sequences of the biosynthetic Tda from Escherichia coli K-12 (ilvA) and Saccharomyces cerevisiae (ILV1) and the biodegradative Tda from E. coli K-12 (tdc). The comparison indicated the presence of two types of blocks of homologous amino acids. The first type of homology is in the N-terminal portion of all four isozymes of Tda and probably indicates amino acids involved in catalysis. The second type of homology is found in the C-terminal portion of the three biosynthetic isozymes and presumably is involved in either (i) the binding or interaction of the allosteric effector isoleucine with the enzyme, or (ii) subunit interactions. The sites of amino acid changes of two E. coli K-12 ilvA alleles with altered response to isoleucine are consistent with the conclusion that the C-terminal portion of biosynthetic Tda is involved in allosteric regulation. PMID:3290055

  5. The developmental transcriptome landscape of bovine skeletal muscle defined by Ribo-Zero ribonucleic acid sequencing.

    PubMed

    Sun, X; Li, M; Sun, Y; Cai, H; Li, R; Wei, X; Lan, X; Huang, Y; Lei, C; Chen, H

    2015-12-01

    Ribonucleic acid sequencing (RNA-Seq) libraries are normally prepared with oligo(dT) selection of poly(A)+ mRNA, but it depends on intact total RNA samples. Recent studies have described Ribo-Zero technology, a novel method that can capture both poly(A)+ and poly(A)- transcripts from intact or fragmented RNA samples. We report here the first application of Ribo-Zero RNA-Seq for the analysis of the bovine embryonic, neonatal, and adult skeletal muscle whole transcriptome at an unprecedented depth. Overall, 19,893 genes were found to be expressed, with a high correlation of expression levels between the calf and the adult. Hundreds of genes were found to be highly expressed in the embryo and decreased at least 10-fold after birth, indicating their potential roles in embryonic muscle development. In addition, we present for the first time the analysis of global transcript isoform discovery in bovine skeletal muscle and identified 36,694 transcript isoforms. Transcriptomic data were also analyzed to unravel sequence variations; 185,036 putative SNP and 12,428 putative short insertions-deletions (InDel) were detected. Specifically, many stop-gain, stop-loss, and frameshift mutations were identified that probably change the relative protein production and sequentially affect the gene function. Notably, the numbers of stage-specific transcripts, alternative splicing events, SNP, and InDel were greater in the embryo than in the calf and the adult, suggesting that gene expression is most active in the embryo. The resulting view of the transcriptome at a single-base resolution greatly enhances the comprehensive transcript catalog and uncovers the global trends in gene expression during bovine skeletal muscle development. PMID:26641174

  6. Method for the detection of specific nucleic acid sequences by polymerase nucleotide incorporation

    DOEpatents

    Castro, Alonso

    2004-06-01

    A method for rapid and efficient detection of a target DNA or RNA sequence is provided. A primer having a 3'-hydroxyl group at one end and having a sequence of nucleotides sufficiently homologous with an identifying sequence of nucleotides in the target DNA is selected. The primer is hybridized to the identifying sequence of nucleotides on the DNA or RNA sequence and a reporter molecule is synthesized on the target sequence by progressively binding complementary nucleotides to the primer, where the complementary nucleotides include nucleotides labeled with a fluorophore. Fluorescence emitted by fluorophores on single reporter molecules is detected to identify the target DNA or RNA sequence.

  7. Characterization and cDNA sequence of Bothriechis schlegeliil-amino acid oxidase with antibacterial activity.

    PubMed

    Vargas Muñoz, Leidy Johana; Estrada-Gomez, Sebastian; Núñez, Vitelbina; Sanz, Libia; Calvete, Juan J

    2014-08-01

    Snake venoms are complex mixtures of proteins including l-amino acid oxidase (lAAO). A lAAO (named BslAAO) with a mass of 56kDa and a theoretical Ip of 5.79, was purified from Bothriechis schlegelii venom through size-exclusion, ion exchange and affinity chromatography. The entire protein sequence of 498 amino acids, was determined from cDNA using reverse-transcribed mRNA isolated from venom gland. The enzyme showed dose-dependent inhibition of bacterial growth. BslAAO showed inhibitory effect against S. aureus with a MIC of 4μg/mL and a MBC of 8μg/mL. Against Acinetobacter baumannii, showed a MIC of 2μg/mL and MBC of 4μg/mL, No effect was observed in Escherichia coli. This antibacterial activity was inhibited by catalase, indicating that antimicrobial activity was due to H2O2 production. BslAAO did not show any cytotoxic activity toward mouse myoblast cell line C2C12 or peripheral blood mononuclear cells. The enzyme oxidated l-Leu, with a Km of 16.37μM and a Vmax of 0.39μM/min. Snake venoms lAAOs, are potential frames of different therapeutics molecules since these enzymes exhibit low MICs and MBCs and show to be harmless to human cells due to microorganisms being generally several fold more sensitive to reactive oxygen species than human tissues. PMID:24875315

  8. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing

    PubMed Central

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G.

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  9. Genome Sequence of Schizochytrium sp. CCTCC M209059, an Effective Producer of Docosahexaenoic Acid-Rich Lipids

    PubMed Central

    Ji, Xiao-Jun; Mo, Kai-Qiang; Ren, Lu-Jing; Li, Gan-Lu; Huang, Jian-Zhong

    2015-01-01

    Schizochytrium is an effective species for producing omega-3 docosahexaenoic acid (DHA). Here, we report a genome sequence of Schizochytrium sp. CCTCC M209059, which has a genome size of 39.09 Mb. It will provide the genomic basis for further insights into the metabolic and regulatory mechanisms underlying the DHA formation. PMID:26251485

  10. Evolutionary Distance of Amino Acid Sequence Orthologs across Macaque Subspecies: Identifying Candidate Genes for SIV Resistance in Chinese Rhesus Macaques

    PubMed Central

    Ross, Cody T.; Roodgar, Morteza; Smith, David Glenn

    2015-01-01

    We use the Reciprocal Smallest Distance (RSD) algorithm to identify amino acid sequence orthologs in the Chinese and Indian rhesus macaque draft sequences and estimate the evolutionary distance between such orthologs. We then use GOanna to map gene function annotations and human gene identifiers to the rhesus macaque amino acid sequences. We conclude methodologically by cross-tabulating a list of amino acid orthologs with large divergence scores with a list of genes known to be involved in SIV or HIV pathogenesis. We find that many of the amino acid sequences with large evolutionary divergence scores, as calculated by the RSD algorithm, have been shown to be related to HIV pathogenesis in previous laboratory studies. Four of the strongest candidate genes for SIVmac resistance in Chinese rhesus macaques identified in this study are CDK9, CXCL12, TRIM21, and TRIM32. Additionally, ANKRD30A, CTSZ, GORASP2, GTF2H1, IL13RA1, MUC16, NMDAR1, Notch1, NT5M, PDCD5, RAD50, and TM9SF2 were identified as possible candidates, among others. We failed to find many laboratory experiments contrasting the effects of Indian and Chinese orthologs at these sites on SIVmac pathogenesis, but future comparative studies might hold fertile ground for research into the biological mechanisms underlying innate resistance to SIVmac in Chinese rhesus macaques. PMID:25884674

  11. Evolutionary distance of amino acid sequence orthologs across macaque subspecies: identifying candidate genes for SIV resistance in Chinese rhesus macaques.

    PubMed

    Ross, Cody T; Roodgar, Morteza; Smith, David Glenn

    2015-01-01

    We use the Reciprocal Smallest Distance (RSD) algorithm to identify amino acid sequence orthologs in the Chinese and Indian rhesus macaque draft sequences and estimate the evolutionary distance between such orthologs. We then use GOanna to map gene function annotations and human gene identifiers to the rhesus macaque amino acid sequences. We conclude methodologically by cross-tabulating a list of amino acid orthologs with large divergence scores with a list of genes known to be involved in SIV or HIV pathogenesis. We find that many of the amino acid sequences with large evolutionary divergence scores, as calculated by the RSD algorithm, have been shown to be related to HIV pathogenesis in previous laboratory studies. Four of the strongest candidate genes for SIVmac resistance in Chinese rhesus macaques identified in this study are CDK9, CXCL12, TRIM21, and TRIM32. Additionally, ANKRD30A, CTSZ, GORASP2, GTF2H1, IL13RA1, MUC16, NMDAR1, Notch1, NT5M, PDCD5, RAD50, and TM9SF2 were identified as possible candidates, among others. We failed to find many laboratory experiments contrasting the effects of Indian and Chinese orthologs at these sites on SIVmac pathogenesis, but future comparative studies might hold fertile ground for research into the biological mechanisms underlying innate resistance to SIVmac in Chinese rhesus macaques. PMID:25884674

  12. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk.

    PubMed

    Meneghel, Julie; Dugat-Bony, Eric; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine; Fonseca, Fernanda

    2016-01-01

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. PMID:26941141

  13. Draft Genome Sequence of Cutaneotrichosporon curvatus DSM 101032 (Formerly Cryptococcus curvatus), an Oleaginous Yeast Producing Polyunsaturated Fatty Acids.

    PubMed

    Hofmeyer, Thomas; Hackenschmidt, Silke; Nadler, Florian; Thürmer, Andrea; Daniel, Rolf; Kabisch, Johannes

    2016-01-01

    Cutaneotrichosporon curvatus DSM 101032 is an oleaginous yeast that can be isolated from various habitats and is capable of producing substantial amounts of polyunsaturated fatty acids. Here, we present the first draft genome sequence of any C. curvatus species. PMID:27174275

  14. Complete genome sequence of Lactobacillus plantarum ZS2058, a probiotic strain with high conjugated linoleic acid production ability.

    PubMed

    Yang, Bo; Chen, Haiqin; Tian, Fengwei; Zhao, Jianxin; Gu, Zhennan; Zhang, Hao; Chen, Yong Q; Chen, Wei

    2015-11-20

    Lactobacillus plantarum ZS2058 was isolated from sauerkraut and identified to synthesize the beneficial metabolite conjugated linoleic acid. The genome contains a 319,7363-bp chromosome and three plasmids. The sequence will facilitate identification and characterization of the genetic determinants for its putative biological benefits. PMID:26439428

  15. Draft Genome Sequence of Burkholderia stabilis LA20W, a Trehalose Producer That Uses Levulinic Acid as a Substrate

    PubMed Central

    Sato, Yuya; Koike, Hideaki; Kondo, Susumu; Hori, Tomoyuki; Kanno, Manabu; Kimura, Nobutada; Morita, Tomotake; Kirimura, Kohtaro

    2016-01-01

    Burkholderia stabilis LA20W produces trehalose using levulinic acid (LA) as a substrate. Here, we report the 7.97-Mb draft genome sequence of B. stabilis LA20W, which will be useful in investigations of the enzymes involved in LA metabolism and the mechanism of LA-induced trehalose production. PMID:27491978

  16. Draft Genome Sequence of Acetobacter tropicalis Type Strain NBRC16470, a Producer of Optically Pure d-Glyceric Acid.

    PubMed

    Koike, Hideaki; Sato, Shun; Morita, Tomotake; Fukuoka, Tokuma; Habe, Hiroshi

    2014-01-01

    Here we report the 3.7-Mb draft genome sequence of Acetobacter tropicalis NBRC16470(T), which can produce optically pure d-glyceric acid (d-GA; 99% enantiomeric excess) from raw glycerol feedstock derived from biodiesel fuel production processes. PMID:25523780

  17. Genome Sequence of a Candidate World Health Organization Reference Strain of Zika Virus for Nucleic Acid Testing.

    PubMed

    Trösemeier, Jan-Hendrik; Musso, Didier; Blümel, Johannes; Thézé, Julien; Pybus, Oliver G; Baylis, Sally A

    2016-01-01

    We report here the sequence of a candidate reference strain of Zika virus (ZIKV) developed on behalf of the World Health Organization (WHO). The ZIKV reference strain is intended for use in nucleic acid amplification (NAT)-based assays for the detection and quantification of ZIKV RNA. PMID:27587826

  18. Draft Genome Sequence of Burkholderia stabilis LA20W, a Trehalose Producer That Uses Levulinic Acid as a Substrate.

    PubMed

    Sato, Yuya; Koike, Hideaki; Kondo, Susumu; Hori, Tomoyuki; Kanno, Manabu; Kimura, Nobutada; Morita, Tomotake; Kirimura, Kohtaro; Habe, Hiroshi

    2016-01-01

    Burkholderia stabilis LA20W produces trehalose using levulinic acid (LA) as a substrate. Here, we report the 7.97-Mb draft genome sequence of B. stabilis LA20W, which will be useful in investigations of the enzymes involved in LA metabolism and the mechanism of LA-induced trehalose production. PMID:27491978

  19. Draft Genome Sequence of Cutaneotrichosporon curvatus DSM 101032 (Formerly Cryptococcus curvatus), an Oleaginous Yeast Producing Polyunsaturated Fatty Acids

    PubMed Central

    Hofmeyer, Thomas; Hackenschmidt, Silke; Nadler, Florian; Thürmer, Andrea; Daniel, Rolf

    2016-01-01

    Cutaneotrichosporon curvatus DSM 101032 is an oleaginous yeast that can be isolated from various habitats and is capable of producing substantial amounts of polyunsaturated fatty acids. Here, we present the first draft genome sequence of any C. curvatus species. PMID:27174275

  20. Ultra high-throughput nucleic acid sequencing as a tool for virus discovery in the turkey gut.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Recently, the use of the next generation of nucleic acid sequencing technology (i.e., 454 pyrosequencing, as developed by Roche/454 Life Sciences) has allowed an in-depth look at the uncultivated microorganisms present in complex environmental samples, including samples with agricultural importance....

  1. Draft Genome Sequence of Lactobacillus delbrueckii subsp. bulgaricus CFL1, a Lactic Acid Bacterium Isolated from French Handcrafted Fermented Milk

    PubMed Central

    Meneghel, Julie; Irlinger, Françoise; Loux, Valentin; Vidal, Marie; Passot, Stéphanie; Béal, Catherine; Layec, Séverine

    2016-01-01

    Lactobacillus delbrueckii subsp. bulgaricus (L. bulgaricus) is a lactic acid bacterium widely used for the production of yogurt and cheeses. Here, we report the genome sequence of L. bulgaricus CFL1 to improve our knowledge on its stress-induced damages following production and end-use processes. PMID:26941141

  2. Multipolar consensus for phylogenetic trees.

    PubMed

    Bonnard, Cécile; Berry, Vincent; Lartillot, Nicolas

    2006-10-01

    Collections of phylogenetic trees are usually summarized using consensus methods. These methods build a single tree, supposed to be representative of the collection. However, in the case of heterogeneous collections of trees, the resulting consensus may be poorly resolved (strict consensus, majority-rule consensus, ...), or may perform arbitrary choices among mutually incompatible clades, or splits (greedy consensus). Here, we propose an alternative method, which we call the multipolar consensus (MPC). Its aim is to display all the splits having a support above a predefined threshold, in a minimum number of consensus trees, or poles. We show that the problem is equivalent to a graph-coloring problem, and propose an implementation of the method. Finally, we apply the MPC to real data sets. Our results indicate that, typically, all the splits down to a weight of 10% can be displayed in no more than 4 trees. In addition, in some cases, biologically relevant secondary signals, which would not have been present in any of the classical consensus trees, are indeed captured by our method, indicating that the MPC provides a convenient exploratory method for phylogenetic analysis. The method was implemented in a package freely available at http://www.lirmm.fr/~cbonnard/MPC.html PMID:17060203

  3. Sequence-Specific Recognition of MicroRNAs and Other Short Nucleic Acids with Solid-State Nanopores.

    PubMed

    Zahid, Osama K; Wang, Fanny; Ruzicka, Jan A; Taylor, Ethan W; Hall, Adam R

    2016-03-01

    The detection and quantification of short nucleic acid sequences has many potential applications in studying biological processes, monitoring disease initiation and progression, and evaluating environmental systems, but is challenging by nature. We present here an assay based on the solid-state nanopore platform for the identification of specific sequences in solution. We demonstrate that hybridization of a target nucleic acid with a synthetic probe molecule enables discrimination between duplex and single-stranded molecules with high efficacy. Our approach requires limited preparation of samples and yields an unambiguous translocation event rate enhancement that can be used to determine the presence and abundance of a single sequence within a background of nontarget oligonucleotides. PMID:26824296

  4. Amino acid sequence of rabbit kidney neutral endopeptidase 24.11 (enkephalinase) deduced from a complementary DNA.

    PubMed Central

    Devault, A; Lazure, C; Nault, C; Le Moual, H; Seidah, N G; Chrétien, M; Kahn, P; Powell, J; Mallet, J; Beaumont, A

    1987-01-01

    Neutral endopeptidase (EC 3.4.24.11) is a major constituent of kidney brush border membranes. It is also present in the brain where it has been shown to be involved in the inactivation of opioid peptides, methionine- and leucine-enkephalins. For this reason this enzyme is often called 'enkephalinase'. In order to characterize the primary structure of the enzyme, oligonucleotide probes were designed from partial amino acid sequences and used to isolate clones from kidney cDNA libraries. Sequencing of the cDNA inserts revealed the complete primary structure of the enzyme. Neutral endopeptidase consists of 750 amino acids. It contains a short N-terminal cytoplasmic domain (27 amino acids), a single membrane-spanning segment (23 amino acids) and an extracellular domain that comprises most of the protein mass. The comparison of the primary structure of neutral endopeptidase with that of thermolysin, a bacterial Zn-metallopeptidase, indicates that most of the amino acid residues involved in Zn coordination and catalytic activity in thermolysin are found within highly honmologous sequences in neutral endopeptidase. Images Fig. 1. Fig. 3. PMID:2440677

  5. Human parainfluenza type 3 virus hemagglutinin-neuraminidase glycoprotein: nucleotide sequence of mRNA and limited amino acid sequence of the purified protein.

    PubMed Central

    Elango, N; Coligan, J E; Jambou, R C; Venkatesan, S

    1986-01-01

    The nucleotide sequence of mRNA for the hemagglutinin-neuraminidase (HN) protein of human parainfluenza type 3 virus obtained from the corresponding cDNA clone had a single long open reading frame encoding a putative protein of 64,254 daltons consisting of 572 amino acids. The deduced protein sequence was confirmed by limited N-terminal amino acid microsequencing of CNBr cleavage fragments of native HN that was purified by immunoprecipitation. The HN protein is moderately hydrophobic and has four potential sites (Asn-X-Ser/Thr) of N-glycosylation in the C-terminal half of the molecule. It is devoid of both the N-terminal signal sequence and the C-terminal membrane anchorage domain characteristic of the hemagglutinin of influenza virus and the fusion (F0) protein of the paramyxoviruses. Instead, it has a single prominent hydrophobic region capable of membrane insertion beginning at 32 residues from the N terminus. This N-terminal membrane insertion is similar to that of influenza virus neuraminidase and the recently reported structures of HN proteins of Sendai virus and simian virus 5. Images PMID:3003381

  6. Sequence of cDNA for rat cystathionine gamma-lyase and comparison of deduced amino acid sequence with related Escherichia coli enzymes.

    PubMed Central

    Erickson, P F; Maxwell, I H; Su, L J; Baumann, M; Glode, L M

    1990-01-01

    A cDNA clone for cystathionine gamma-lyase was isolated from a rat cDNA library in lambda gt11 by screening with a monospecific antiserum. The identity of this clone, containing 600 bp proximal to the 3'-end of the gene, was confirmed by positive hybridization selection. Northern-blot hybridization showed the expected higher abundance of the corresponding mRNA in liver than in brain. Two further cDNA clones from a plasmid pcD library were isolated by colony hybridization with the first clone and were found to contain inserts of 1600 and 1850 bp. One of these was confirmed as encoding cystathionine gamma-lyase by hybridization with two independent pools of oligodeoxynucleotides corresponding to partial amino acid sequence information for cystathionine gamma-lyase. The other clone (estimated to represent all but 8% of the 5'-end of the mRNA) was sequenced and its deduced amino acid sequence showed similarity to those of the Escherichia coli enzymes cystathionine beta-lyase and cystathionine gamma-synthase throughout its length, especially to that of the latter. Images Fig. 1. Fig. 2. Fig. 3. Fig. 5. PMID:2201285

  7. Sequence dependent N-terminal rearrangement and degradation of peptide nucleic acid (PNA) in aqueous solution

    NASA Technical Reports Server (NTRS)

    Eriksson, M.; Christensen, L.; Schmidt, J.; Haaima, G.; Orgel, L.; Nielsen, P. E.

    1998-01-01

    The stability of the PNA (peptide nucleic acid) thymine monomer inverted question markN-[2-(thymin-1-ylacetyl)]-N-(2-aminoaminoethyl)glycine inverted question mark and those of various PNA oligomers (5-8-mers) have been measured at room temperature (20 degrees C) as a function of pH. The thymine monomer undergoes N-acyl transfer rearrangement with a half-life of 34 days at pH 11 as analyzed by 1H NMR; and two reactions, the N-acyl transfer and a sequential degradation, are found by HPLC analysis to occur at measurable rates for the oligomers at pH 9 or above. Dependent on the amino-terminal sequence, half-lives of 350 h to 163 days were found at pH 9. At pH 12 the half-lives ranged from 1.5 h to 21 days. The results are discussed in terms of PNA as a gene therapeutic drug as well as a possible prebiotic genetic material.

  8. Structural analysis of complementary DNA and amino acid sequences of human and rat androgen receptors

    SciTech Connect

    Chang, C.; Kokontis, J.; Liao, S. )

    1988-10-01

    Structural analysis of cDNAs for human and rat androgen receptors (ARs) indicates that the amino-terminal regions of ARs are rich in oligo- and poly(amino acid) motifs as in some homeotic genes. The human AR has a long stretch of repeated glycines, whereas rat AR has a long stretch of glutamines. There is a considerable sequence similarity among ARs and the receptors for glucocorticoids, progestins, and mineralocorticoids within the steroid-binding domains. The cysteine-rich DNA-binding domains are well conserved. Translation of mRNA transcribed from AR cDNAs yielded 94- and 76-kDa proteins and smaller forms that bind to DNA and have high affinity toward androgens. These rat or human ARs were recognized by human autoantibodies to natural Ars. Molecular hybridization studies, using AR cDNAs as probes, indicated that the ventral prostate and other male accessory organs are rich in AR mRNA and that the production of AR mRNA in the target organs may be autoregulated by androgens.

  9. Rapid and Sensitive Isothermal Detection of Nucleic-acid Sequence by Multiple Cross Displacement Amplification

    PubMed Central

    Wang, Yi; Wang, Yan; Ma, Ai-Jing; Li, Dong-Xun; Luo, Li-Juan; Liu, Dong-Xin; Jin, Dong; Liu, Kai; Ye, Chang-Yun

    2015-01-01

    We have devised a novel amplification strategy based on isothermal strand-displacement polymerization reaction, which was termed multiple cross displacement amplification (MCDA). The approach employed a set of ten specially designed primers spanning ten distinct regions of target sequence and was preceded at a constant temperature (61–65 °C). At the assay temperature, the double-stranded DNAs were at dynamic reaction environment of primer-template hybrid, thus the high concentration of primers annealed to the template strands without a denaturing step to initiate the synthesis. For the subsequent isothermal amplification step, a series of primer binding and extension events yielded several single-stranded DNAs and single-stranded single stem-loop DNA structures. Then, these DNA products enabled the strand-displacement reaction to enter into the exponential amplification. Three mainstream methods, including colorimetric indicators, agarose gel electrophoresis and real-time turbidity, were selected for monitoring the MCDA reaction. Moreover, the practical application of the MCDA assay was successfully evaluated by detecting the target pathogen nucleic acid in pork samples, which offered advantages on quick results, modest equipment requirements, easiness in operation, and high specificity and sensitivity. Here we expounded the basic MCDA mechanism and also provided details on an alternative (Single-MCDA assay, S-MCDA) to MCDA technique. PMID:26154567

  10. Snake venoms. The amino acid sequences of two proteinase inhibitor homologues from Dendroaspis angusticeps venom.

    PubMed

    Joubert, F J; Taljaard, N

    1980-05-01

    Toxins C13S1C3 and C13S2C3 from D. angusticeps venom were purified by gel filtration and ion exchange chromatography. Whereas C13S1C3 contains 57 amino acids, C13S2C3 contains 59 but each include six half-cystine residues. The complete primary structure of the low toxicity proteins have been elucidated. The sequences and the invariant residues of toxins C13S1C3 and C13S2C3 from D. angusticeps venom resemble, respectively, those of the proteinase inhibitor homologues K and I from D. polylepis polylepis venom and they are also homologous to the active proteinase inhibitors from various sources. In C13S1C3 and K the active site lysyl residue of active bovine pancreatic proteinase inhibitor is conserved but the site residue alanine, is replaced by lysine. In C13S2C3 and I the active site residue is replaced by tyrosine. PMID:7429422

  11. Cloning, sequencing and characterization of lipase from a polyhydroxyalkanoate- (PHA-) synthesizing Pseudomonas resinovorans

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Lipase gene (lip) of a biodegradable polyhydroxyalkanoate- (PHA-) synthesizing bacterium P. resinovorans NRRL B-2649 was cloned, sequenced and characterized by using consensus primers and PCR-based genome walking method. The ORF of the putative Lip (314 amino acids) and its active site (Ser111, Asp...

  12. Brazilian Consensus on Photoprotection

    PubMed Central

    Schalka, Sérgio; Steiner, Denise; Ravelli, Flávia Naranjo; Steiner, Tatiana; Terena, Aripuanã Cobério; Marçon, Carolina Reato; Ayres, Eloisa Leis; Addor, Flávia Alvim Sant'anna; Miot, Helio Amante; Ponzio, Humberto; Duarte, Ida; Neffá, Jane; da Cunha, José Antônio Jabur; Boza, Juliana Catucci; Samorano, Luciana de Paula; Corrêa, Marcelo de Paula; Maia, Marcus; Nasser, Nilton; Leite, Olga Maria Rodrigues Ribeiro; Lopes, Otávio Sergio; Oliveira, Pedro Dantas; Meyer, Renata Leal Bregunci; Cestari, Tânia; dos Reis, Vitor Manoel Silva; Rego, Vitória Regina Pedreira de Almeida

    2014-01-01

    Brazil is a country of continental dimensions with a large heterogeneity of climates and massive mixing of the population. Almost the entire national territory is located between the Equator and the Tropic of Capricorn, and the Earth axial tilt to the south certainly makes Brazil one of the countries of the world with greater extent of land in proximity to the sun. The Brazilian coastline, where most of its population lives, is more than 8,500 km long. Due to geographic characteristics and cultural trends, Brazilians are among the peoples with the highest annual exposure to the sun. Epidemiological data show a continuing increase in the incidence of non-melanoma and melanoma skin cancers. Photoprotection can be understood as a set of measures aimed at reducing sun exposure and at preventing the development of acute and chronic actinic damage. Due to the peculiarities of Brazilian territory and culture, it would not be advisable to replicate the concepts of photoprotection from other developed countries, places with completely different climates and populations. Thus the Brazilian Society of Dermatology has developed the Brazilian Consensus on Photoprotection, the first official document on photoprotection developed in Brazil for Brazilians, with recommendations on matters involving photoprotection. PMID:25761256

  13. Between consensus and contestation.

    PubMed

    Weale, Albert

    2016-08-15

    Purpose - Noting that discussions of public participation and priority setting typically presuppose certain political theories of democracy, the purpose of this paper is to discuss two theories: the consensual and the agonistic. The distinction is illuminating when considering the difference between institutionalized public participation and contestatory participation. Design/methodology/approach - The approach is a theoretical reconstruction of two ways of thinking about public participation in relation to priority setting in health care, drawing on the work of Habermas, a deliberative theorist, and Mouffe, a theorist of agonism. Findings - The different theoretical approaches can be associated with different ways of understanding priority setting. In particular, agonistic democratic theory would understand priority setting as system of inclusions and exclusions rather than the determination of a consensus of social values, which is the typical deliberative way of thinking about the issues. Originality/value - The paper shows the value of drawing out explicitly the tacit assumptions of practices of political participation in order to reveal their scope and limitations. It suggests that making such theoretical presuppositions explicit has value for health services management in recognizing these implicit choices. PMID:27468774

  14. Hohenheim consensus workshop: copper.

    PubMed

    Schümann, K; Classen, H G; Dieter, H H; König, J; Multhaup, G; Rükgauer, M; Summer, K H; Bernhardt, J; Biesalski, H K

    2002-06-01

    Copper (Cu) is an essential trace element with many physiological functions. Homeostatic mechanisms exist to allow Cu to act as a cofactor in enzymatic processes and to prevent accumulation of Cu to toxic levels. The aim of this commentary is to better understand the role of dietary Cu supply in deficiency and under physiological and pathological conditions. The essentiality of Cu can be attributed to its role as a cofactor in a number of enzymes that are involved in the defence against oxidative stress. Cu, however, has a second face, that of a toxic compound as it is observed with accumulating evidence in hepatic, neurodegenerative and cardiovascular diseases. The destructive potential of Cu can be attributed to inherent physico-chemical properties. The main property is its ability to take part in Fenton-like reactions in which the highly reactive and extremely deleterious hydroxyl radical is formed. Diseases caused by dietary Cu overload could be based on a genetic predisposition. Thus, an assessment of risk-groups, such as infants with impaired mechanisms of Cu homeostasis regarding detoxification, is of special interest, as their Cu intake with resuspended formula milk may be very high. This implies the need for reliable diagnostic markers to determine the Cu status. These topics were introduced at the workshop by the participants followed by extensive group discussion. The consensus statements were agreed on by all members. One of the conclusions is that a re-assessment of published data is necessary and future research is required. PMID:12032645

  15. Acinetobacter cyclohexanone monooxygenase: gene cloning and sequence determination.

    PubMed Central

    Chen, Y C; Peoples, O P; Walsh, C T

    1988-01-01

    The gene coding for cyclohexanone monooxygenase from Acinetobacter sp. strain NCIB 9871 was isolated by immunological screening methods. We located and determined the nucleotide sequence of the gene. The structural gene is 1,626 nucleotides long and codes for a polypeptide of 542 amino acids; 389 nucleotides 5' and 108 nucleotides 3' of the coding region are also reported. The complete amino acid sequence of the enzyme was derived by translation of the nucleotide sequence. From a comparison of the amino acid sequence with consensus sequences of nucleotide-binding folds, we identified a potential flavin-binding site at the NH2 terminus of the enzyme (residues 6 to 18) and a potential nicotinamide-binding site extending from residue 176 to residue 208 of the protein. An overproduction system for the gene to facilitate genetic manipulations was also constructed by using the tac promoter vector pKK223-3 in Escherichia coli. Images PMID:3338974

  16. Nucleotide and predicted amino acid sequence of a cDNA clone encoding part of human transketolase.

    PubMed

    Abedinia, M; Layfield, R; Jones, S M; Nixon, P F; Mattick, J S

    1992-03-31

    Transketolase is a key enzyme in the pentose-phosphate pathway which has been implicated in the latent human genetic disease, Wernicke-Korsakoff syndrome. Here we report the cloning and partial characterisation of the coding sequences encoding human transketolase from a human brain cDNA library. The library was screened with oligonucleotide probes based on the amino acid sequence of proteolytic fragments of the purified protein. Northern blots showed that the transketolase mRNA is approximately 2.2 kb, close to the minimum expected, of which approximately 60% was represented in the largest cDNA clone. Sequence analysis of the transketolase coding sequences reveals a number of homologies with related enzymes from other species. PMID:1567394

  17. 5S ribosomal ribonucleic acid sequences in Bacteroides and Fusobacterium: evolutionary relationships within these genera and among eubacteria in general

    NASA Technical Reports Server (NTRS)

    Van den Eynde, H.; De Baere, R.; Shah, H. N.; Gharbia, S. E.; Fox, G. E.; Michalik, J.; Van de Peer, Y.; De Wachter, R.

    1989-01-01

    The 5S ribosomal ribonucleic acid (rRNA) sequences were determined for Bacteroides fragilis, Bacteroides thetaiotaomicron, Bacteroides capillosus, Bacteroides veroralis, Porphyromonas gingivalis, Anaerorhabdus furcosus, Fusobacterium nucleatum, Fusobacterium mortiferum, and Fusobacterium varium. A dendrogram constructed by a clustering algorithm from these sequences, which were aligned with all other hitherto known eubacterial 5S rRNA sequences, showed differences as well as similarities with respect to results derived from 16S rRNA analyses. In the 5S rRNA dendrogram, Bacteroides clustered together with Cytophaga and Fusobacterium, as in 16S rRNA analyses. Intraphylum relationships deduced from 5S rRNAs suggested that Bacteroides is specifically related to Cytophaga rather than to Fusobacterium, as was suggested by 16S rRNA analyses. Previous taxonomic considerations concerning the genus Bacteroides, based on biochemical and physiological data, were confirmed by the 5S rRNA sequence analysis.

  18. Consensus on consensus: a synthesis of consensus estimates on human-caused global warming

    NASA Astrophysics Data System (ADS)

    Cook, John; Oreskes, Naomi; Doran, Peter T.; Anderegg, William R. L.; Verheggen, Bart; Maibach, Ed W.; Carlton, J. Stuart; Lewandowsky, Stephan; Skuce, Andrew G.; Green, Sarah A.; Nuccitelli, Dana; Jacobs, Peter; Richardson, Mark; Winkler, Bärbel; Painting, Rob; Rice, Ken

    2016-04-01

    The consensus that humans are causing recent global warming is shared by 90%–100% of publishing climate scientists according to six independent studies by co-authors of this paper. Those results are consistent with the 97% consensus reported by Cook et al (Environ. Res. Lett. 8 024024) based on 11 944 abstracts of research papers, of which 4014 took a position on the cause of recent global warming. A survey of authors of those papers (N = 2412 papers) also supported a 97% consensus. Tol (2016 Environ. Res. Lett. 11 048001) comes to a different conclusion using results from surveys of non-experts such as economic geologists and a self-selected group of those who reject the consensus. We demonstrate that this outcome is not unexpected because the level of consensus correlates with expertise in climate science. At one point, Tol also reduces the apparent consensus by assuming that abstracts that do not explicitly state the cause of global warming (‘no position’) represent non-endorsement, an approach that if applied elsewhere would reject consensus on well-established theories such as plate tectonics. We examine the available studies and conclude that the finding of 97% consensus in published climate research is robust and consistent with other surveys of climate scientists and peer-reviewed studies.

  19. Definition of a consensus transportin-specific nucleocytoplasmic transport signal.

    PubMed

    Bogerd, H P; Benson, R E; Truant, R; Herold, A; Phingbodhipakkiya, M; Cullen, B R

    1999-04-01

    The low cytoplasmic and high nuclear concentration of the GTP-bound form of Ran provides directionality for both nuclear protein import and export. Both import and export factors bind RanGTP directly, yet this interaction produces opposite effects; in the former case, RanGTP binding induces nuclear cargo release, whereas in the latter, RanGTP binding induces nuclear cargo assembly. Therefore, nuclear import and export receptors and their protein recognition sites are predicted to be distinct. Nevertheless, the approximately 38-amino acid M9 sequence present in heterogeneous nuclear ribonucleoprotein A1 has been reported to serve as both a nuclear localization signal and a nuclear export signal, even though only one protein, the nuclear import factor transportin, has been shown to bind M9 directly. We have used a combination of mutational randomization followed by selection for transportin binding to exhaustively define amino acids in M9 that are critical for transportin binding in vivo. As expected, the resultant approximately 12-amino acid transportin-binding consensus sequence is also predictive of nuclear localization signal activity. Surprisingly, however, this extensive mutational analysis failed to dissect M9 nuclear localization signal and nuclear export signal function. Nevertheless, transportin appears unlikely to be the M9 export receptor, as RanGTP can be shown to block M9 binding by transportin not only in vitro, but also in the nucleus in vivo. This analysis therefore predicts the existence of a nuclear export receptor distinct from transportin that nevertheless shares a common protein-binding site on heterogeneous nuclear ribonucleoprotein A1. PMID:10092666

  20. Sample Prep, Workflow Automation and Nucleic Acid Fractionation for Next Generation Sequencing

    SciTech Connect

    Roskey, Mark

    2010-06-03

    Mark Roskey of Caliper LifeSciences discusses how the company's technologies fit into the next generation sequencing workflow on June 3, 2010 at the "Sequencing, Finishing, Analysis in the Future" meeting in Santa Fe, NM

  1. Evolution of vertebrate IgM: complete amino acid sequence of the constant region of Ambystoma mexicanum mu chain deduced from cDNA sequence.

    PubMed

    Fellah, J S; Wiles, M V; Charlemagne, J; Schwager, J

    1992-10-01

    cDNA clones coding for the constant region of the Mexican axolotl (Ambystoma mexicanum) mu heavy immunoglobulin chain were selected from total spleen RNA, using a cDNA polymerase chain reaction technique. The specific 5'-end primer was an oligonucleotide homologous to the JH segment of Xenopus laevis mu chain. One of the clones, JHA/3, corresponded to the complete constant region of the axolotl mu chain, consisting of a 1362-nucleotide sequence coding for a polypeptide of 454 amino acids followed in 3' direction by a 179-nucleotide untranslated region and a polyA+ tail. The axolotl C mu is divided into four typical domains (C mu 1-C mu 4) and can be aligned with the Xenopus C mu with an overall identity of 56% at the nucleotide level. Percent identities were particularly high between C mu 1 (59%) and C mu 4 (71%). The C-terminal 20-amino acid segment which constitutes the secretory part of the mu chain is strongly homologous to the equivalent sequences of chondrichthyans and of other tetrapods, including a conserved N-linked oligosaccharide, the penultimate cysteine and the C-terminal lysine. The four C mu domains of 13 vertebrate species ranging from chondrichthyans to mammals were aligned and compared at the amino acid level. The significant number of mu-specific residues which are conserved into each of the four C mu domains argues for a continuous line of evolution of the vertebrate mu chain. This notion was confirmed by the ability to reconstitute a consistent vertebrate evolution tree based on the phylogenic parsimony analysis of the C mu 4 sequences. PMID:1382992

  2. Mosaic protein and nucleic acid vaccines against hepatitis C virus

    DOEpatents

    Yusim, Karina; Korber, Bette T. M.; Kuiken, Carla L.; Fischer, William M.

    2013-06-11

    The invention relates to immunogenic compositions useful as HCV vaccines. Provided are HCV mosaic polypeptide and nucleic acid compositions which provide higher levels of T-cell epitope coverage while minimizing the occurrence of unnatural and rare epitopes compared to natural HCV polypeptides and consensus HCV sequences.

  3. Low levels of haptoglobin and putative amino acid sequence in Taiwanese Lanyu miniature pigs.

    PubMed

    Yueh, Sunny C H; Wang, Yao Horng; Lin, Kuan Yu; Tseng, Chi Feng; Chu, Hsien Pin; Chen, Kuen Jaw; Wang, Shih Sheng; Lai, I Hsiang; Mao, Simon J T

    2008-04-01

    Porcine haptoglobin (Hp) is an acute phase protein. Its plasma level increases significantly during inflammation and infection. One of the main functions of Hp is to bind free hemoglobin (Hb) and inhibit its oxidative activity. In the present report, we studied the Hp phenotype of Taiwanese Lanyu miniature pigs (TLY minipigs; n=43) and found their Hp structure to be a homodimer (beta-alpha-alpha-beta) similar to human Hp 1-1. Interestingly, Western blot and high performance liquid chromatographic (HPLC) analysis showed that 25% of the TLY minipigs possessed low or no plasma Hp level (<0.05 mg/ml). The Hp cDNA of these TLY minipigs was then cloned, and the translated amino acid sequence was analyzed. No sequences were found to be deficient; they showed a 99.7% identity with domestic pigs (NP_999165). The mean overall Hp level of the TLY minipigs (0.21 +/- 0.25 mg/ml; n=43) determined by enzyme-linked immunosorbent assay (ELISA) was markedly lower than that of domestic pigs (0.78 +/- 0.45 mg/ml; p<0.001), while 25% of the TLY minipigs had an Hp level that was extremely low (<0.05 mg/ml). In addition, the initial recovery rate (first 40 min) in the circulation of infused fluorescein isothiocyanate (FITC)-Hb was significantly higher in the TLY minipigs with extremely low Hp levels than those with high levels. This data suggests that the low concentration of Hp-Hb complex is responsible for the higher recovery rate of Hb in the circulation. TLY minipigs have been used as an experimental model for cardiovascular diseases; whether they can be used as a model for inflammatory diseases, with Hp as a marker, remains a topic of interest. However, since the Hp level varies significantly among individual TLY minipigs, it is necessary to prescreen the Hp levels of the animals to minimize variation in the experimental baseline. The present study may provide a reference value for future use of the TLY minipig as an animal model for inflammation-associated diseases. PMID:18460833

  4. Sequence Comparison and Phylogeny of Nucleotide Sequence of Coat Protein and Nucleic Acid Binding Protein of a Distinct Isolate of Shallot virus X from India.

    PubMed

    Majumder, S; Baranwal, V K

    2011-06-01

    Shallot virus X (ShVX), a type species in the genus Allexivirus of the family Alfaflexiviridae has been associated with shallot plants in India and other shallot growing countries like Russia, Germany, Netherland, and New Zealand. Coat protein (CP) and nucleic acid binding protein (NB) region of the virus was obtained by reverse transcriptase polymerase chain reaction from scales leaves of shallot bulbs. The partial cDNA contained two open reading frames encoding proteins of molecular weights of 28.66 and 14.18 kDa belonging to Flexi_CP super-family and viral NB super-family, respectively. The percent identity and phylogenetic analysis of amino acid sequences of CP and NB region of the virus associated with shallot indicated that it was a distinct isolate of ShVX. PMID:23637504

  5. Jack bean α-mannosidase: amino acid sequencing and N-glycosylation analysis of a valuable glycomics tool.

    PubMed

    Gnanesh Kumar, B S; Pohlentz, Gottfried; Schulte, Mona; Mormann, Michael; Siva Kumar, Nadimpalli

    2014-03-01

    Jack bean (Canavalia ensiformis) seeds contain several biologically important proteins among which α-mannosidase (EC 3.2.1.24) has been purified, its biochemical properties studied and widely used in glycan analysis. In the present study, we have used the purified enzyme and derived its amino acid sequence covering both the known subunits (molecular mass of ∼66,000 and ∼44,000 Da) hitherto not known in its entirety. Peptide de novo sequencing and structural elucidation of N-glycopeptides obtained either directly from proteolytic digestion or after zwitterionic hydrophilic interaction liquid chromatography solid phase extraction-based separation were performed by use of nanoelectrospray ionization quadrupole time-of-flight mass spectrometry and low-energy collision-induced dissociation experiments. De novo sequencing provided new insights into the disulfide linkage organization, intersection of subunits and complete N-glycan structures along with site specificities. The primary sequence suggests that the enzyme belongs to glycosyl hydrolase family 38 and the N-glycan sequence analysis revealed high-mannose oligosaccharides, which were found to be heterogeneous with varying number of hexoses viz, Man8-9GlcNAc2 and Glc1Man9GlcNAc2 in an evolutionarily conserved N-glycosylation site. This site with two proximal cysteines is present in all the acidic α-mannosidases reported so far in eukaryotes. Further, a truncated paucimannose type was identified to be lacking terminal two mannose, Man1(Xyl)GlcNAc2 (Fuc). PMID:24295789

  6. Complete Genome Sequence of Enterococcus mundtii QU 25, an Efficient l-(+)-Lactic Acid-Producing Bacterium

    PubMed Central

    Shiwa, Yuh; Yanase, Hiroaki; Hirose, Yuu; Satomi, Shohei; Araya-Kojima, Tomoko; Watanabe, Satoru; Zendo, Takeshi; Chibazakura, Taku; Shimizu-Kadota, Mariko; Yoshikawa, Hirofumi; Sonomoto, Kenji

    2014-01-01

    Enterococcus mundtii QU 25, a non-dairy bacterial strain of ovine faecal origin, can ferment both cellobiose and xylose to produce l-lactic acid. The use of this strain is highly desirable for economical l-lactate production from renewable biomass substrates. Genome sequence determination is necessary for the genetic improvement of this strain. We report the complete genome sequence of strain QU 25, primarily determined using Pacific Biosciences sequencing technology. The E. mundtii QU 25 genome comprises a 3 022 186-bp single circular chromosome (GC content, 38.6%) and five circular plasmids: pQY182, pQY082, pQY039, pQY024, and pQY003. In all, 2900 protein-coding sequences, 63 tRNA genes, and 6 rRNA operons were predicted in the QU 25 chromosome. Plasmid pQY024 harbours genes for mundticin production. We found that strain QU 25 produces a bacteriocin, suggesting that mundticin-encoded genes on plasmid pQY024 were functional. For lactic acid fermentation, two gene clusters were identified—one involved in the initial metabolism of xylose and uptake of pentose and the second containing genes for the pentose phosphate pathway and uptake of related sugars. This is the first complete genome sequence of an E. mundtii strain. The data provide insights into lactate production in this bacterium and its evolution among enterococci. PMID:24568933

  7. Gastropod arginine kinases from Cellana grata and Aplysia kurodai. Isolation and cDNA-derived amino acid sequences.

    PubMed

    Suzuki, T; Inoue, N; Higashi, T; Mizobuchi, R; Sugimura, N; Yokouchi, K; Furukohri, T

    2000-12-01

    Arginine kinase (AK) was isolated from the radular muscle of the gastropod molluscs Cellana grata (subclass Prosobranchia) and Aplysia kurodai (subclass Opisthobranchia), respectively, by ammonium sulfate fractionation, Sephadex G-75 gel filtration and DEAE-ion exchange chromatography. The denatured relative molecular mass values were estimated to be 40 kDa by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The isolated enzyme from Aplysia gave a Km value of 0.6 mM for arginine and a Vmax value of 13 micromole Pi min(-1) mg protein(-1) for the forward reaction. These values are comparable to other molluscan AKs. The cDNAs encoding Cellana and Aplysia AKs were amplified by polymerase chain reaction, and the nucleotide sequences of 1,608 and 1,239 bp, respectively, were determined. The open reading frame for Cellana AK is 1044 nucleotides in length and encodes a protein with 347 amino acid residues, and that for A. kurodai is 1077 nucleotides and 354 residues. The cDNA-derived amino acid sequences were validated by chemical sequencing of internal lysyl endopeptidase peptides. The amino acid sequences of Cellana and Aplysia AKs showed the highest percent identity (66-73%) with those of the abalone Nordotis and turbanshell Battilus belonging to the same class Gastropoda. These AK sequences still have a strong homology (63-71%) with that of the chiton Liolophura (class Polyplacophora), which is believed to be one of the most primitive molluscs. On the other hand, these AK sequences are less homologous (55-57%) with that of the clam Pseudocardium (class Bivalvia), suggesting that the biological position of the class Polyplacophora should be reconsidered. PMID:11281267

  8. Studies on the high-sulphur proteins of reduced Merino wool. Amino acid sequence of protein SCMKB-IIIB4

    PubMed Central

    Swart, L. S.; Haylett, T.

    1971-01-01

    The complete amino acid sequence of protein SCMKB-IIIB4 is presented. It is closely related to the sequence of protein SCMKB-IIIB3 (Haylett, Swart & Parris, 1971) differing in only four positions. The peptic and thermolysin peptides of protein SCMKB-IIIB4 were analysed by the dansyl–Edman method (Gray, 1967) and by tritium-labelling of C-terminal residues (Matsuo, Fujimoto & Tatsuno, 1966). This protein is the third member of a group of high-sulphur wool proteins with molecular weight of about 11400. It consists of 98 residues and has acetylalanine and carboxymethylcysteine as N- and C-terminal residues respectively. PMID:4942536

  9. A computer program for the estimation of protein and nucleic acid sequence diversity in random point mutagenesis libraries

    PubMed Central

    Volles, Michael J.; Lansbury, Peter T.

    2005-01-01

    A computer program for the generation and analysis of in silico random point mutagenesis libraries is described. The program operates by mutagenizing an input nucleic acid sequence according to mutation parameters specified by the user for each sequence position and type of point mutation. The program can mimic almost any type of random mutagenesis library, including those produced via error-prone PCR (ep-PCR), mutator Escherichia coli strains, chemical mutagenesis, and doped or random oligonucleotide synthesis. The program analyzes the generated nucleic acid sequences and/or the associated protein library to produce several estimates of library diversity (number of unique sequences, point mutations, and single point mutants) and the rate of saturation of these diversities during experimental screening or selection of clones. This information allows one to select the optimal screen size for a given mutagenesis library, necessary to efficiently obtain a certain coverage of the sequence-space. The program also reports the abundance of each specific protein mutation at each sequence position, which is useful as a measure of the level and type of mutation bias in the library. Alternatively, one can use the program to evaluate the relative merits of preexisting libraries, or to examine various hypothetical mutation schemes to determine the optimal method for creating a library that serves the screen/selection of interest. Simulated libraries of at least 109 sequences are accessible by the numerical algorithm with currently available personal computers; an analytical algorithm is also available which can rapidly calculate a subset of the numerical statistics in libraries of arbitrarily large size. A multi-type double-strand stochastic model of ep-PCR is developed in an appendix to demonstrate the applicability of the algorithm to amplifying mutagenesis procedures. Estimators of DNA polymerase mutation-type-specific error rates are derived using the model. Analyses of an

  10. DNA Sequence and Expression Variation of Hop (Humulus lupulus) Valerophenone Synthase (VPS), a Key Gene in Bitter Acid Biosynthesis

    PubMed Central

    Castro, Consuelo B.; Whittock, Lucy D.; Whittock, Simon P.; Leggett, Grey; Koutoulis, Anthony

    2008-01-01

    Background The hop plant (Humulus lupulus) is a source of many secondary metabolites, with bitter acids essential in the beer brewing industry and others having potential applications for human health. This study investigated variation in DNA sequence and gene expression of valerophenone synthase (VPS), a key gene in the bitter acid biosynthesis pathway of hop. Methods Sequence variation was studied in 12 varieties, and expression was analysed in four of the 12 varieties in a series across the development of the hop cone. Results Nine single nucleotide polymorphisms (SNPs) were detected in VPS, seven of which were synonymous. The two non-synonymous polymorphisms did not appear to be related to typical bitter acid profiles of the varieties studied. However, real-time quantitative reverse-transcription polymerase chain reaction (qRT-PCR) analysis of VPS expression during hop cone development showed a clear link with the bitter acid content. The highest levels of VPS expression were observed in two triploid varieties, ‘Symphony’ and ‘Ember’, which typically have high bitter acid levels. Conclusions In all hop varieties studied, VPS expression was lowest in the leaves and an increase in expression was consistently observed during the early stages of cone development. PMID:18519445

  11. The amino acid sequence of protein SCMK-B2A from the high-sulphur fraction of wool keratin

    PubMed Central

    Elleman, T. C.

    1972-01-01

    1. The amino acid sequence of protein SCMK-B2A, a reduced and S-carboxymethylated protein from the high-sulphur fraction of wool, has been determined. 2. This protein of 171 amino acid residues displays both a high degree of internal homology and extensive external homology with other members of the SCMK-B2 group of proteins. 3. Evidence is presented which suggests that the SCMK-B2 group of proteins are produced by separate non-allelic genes. ImagesPLATE 1 PMID:4679226

  12. Consensus development for healthcare professionals

    PubMed Central

    Kea, Bory; Sun, Benjamin C.

    2015-01-01

    Consensus development sprang from a desire to synthesize clinician and expert opinions on clinical practice and research agendas in the 1950s. And since the American Institute of Medicine formally defined “guidelines” in 1990, there has been a proliferation of clinical practice guidelines (CPG) both formally and informally. This modern decision making tool used by both physicians and patients, requires extensive planning to meet the challenges of consensus development while reaping its rewards. Consensus allows for a group approach with multiple experts sharing ideas to form consensus on topics ranging from appropriateness of procedures to research agenda development. Disagreements can shed light on areas of controversy and launch further discussions. It has five main components: three inputs (defining the task, participant identification and recruitment, and information synthesis), the approach (consensus development by explicit or implicit means), and the output (dissemination of results). Each aspect requires extensive planning a priori as they influence the entire process, from how information will be interpreted, the interaction of participants, the resulting judgment, to whether there will be uptake of results. Implicit approaches utilize qualitative methods and/or a simple voting structure of majority wins, and are used in informal consensus development methods and consensus development conferences. Explicit approaches aggregate results or judgments using explicit rules set a priori with definitions of “agreement” or consensus. Because the implicit process can be more opaque, unforeseen challenges can emerge such as the undue influence of a minority. And yet, the logistics of explicit approaches may be more time consuming and not appropriate when speed is a priority. In determining which method to use, it is important to understand the pros and cons of the different approaches and how it will affect the overall input, approach, and outcome. PMID

  13. High-affinity homologous peptide nucleic acid probes for targeting a quadruplex-forming sequence from a MYC promoter element.

    PubMed

    Roy, Subhadeep; Tanious, Farial A; Wilson, W David; Ly, Danith H; Armitage, Bruce A

    2007-09-18

    Guanine-rich DNA and RNA sequences are known to fold into secondary structures known as G-quadruplexes. Recent biochemical evidence along with the discovery of an increasing number of sequences in functionally important regions of the genome capable of forming G-quadruplexes strongly indicates important biological roles for these structures. Thus, molecular probes that can selectively target quadruplex-forming sequences (QFSs) are envisioned as tools to delineate biological functions of quadruplexes as well as potential therapeutic agents. Guanine-rich peptide nucleic acids have been previously shown to hybridize to homologous DNA or RNA sequences forming PNA-DNA (or RNA) quadruplexes. For this paper we studied the hybridization of an eight-mer G-rich PNA to a quadruplex-forming sequence derived from the promoter region of the MYC proto-oncogene. UV melting analysis, fluorescence assays, and surface plasmon resonance experiments reveal that this PNA binds to the MYC QFS in a 2:1 stoichiometry and with an average binding constant Ka = (2.0 +/- 0.2) x 10(8) M(-1) or Kd = 5.0 nM. In addition, experiments carried out with short DNA targets revealed a dependence of the affinity on the sequence of bases in the loop region of the DNA. A structural model for the hybrid quadruplex is proposed, and implications for gene targeting by G-rich PNAs are discussed. PMID:17718513

  14. A knowledge engineering approach to recognizing and extracting sequences of nucleic acids from scientific literature.

    PubMed

    García-Remesal, Miguel; Maojo, Victor; Crespo, José

    2010-01-01

    In this paper we present a knowledge engineering approach to automatically recognize and extract genetic sequences from scientific articles. To carry out this task, we use a preliminary recognizer based on a finite state machine to extract all candidate DNA/RNA sequences. The latter are then fed into a knowledge-based system that automatically discards false positives and refines noisy and incorrectly merged sequences. We created the knowledge base by manually analyzing different manuscripts containing genetic sequences. Our approach was evaluated using a test set of 211 full-text articles in PDF format containing 3134 genetic sequences. For such set, we achieved 87.76% precision and 97.70% recall respectively. This method can facilitate different research tasks. These include text mining, information extraction, and information retrieval research dealing with large collections of documents containing genetic sequences. PMID:21096556

  15. Ferredoxin:NADP oxidoreductase of Cyanophora paradoxa: purification, partial characterization, and N-terminal amino acid sequence.

    PubMed

    Gebhart, U B; Maier, T L; Stevanović, S; Bayer, M G; Schenk, H E

    1992-06-01

    The ferredoxin:NADP+ oxidoreductase of the protist Cyanophora paradoxa, as a descendant of a former symbiotic consortium, an important model organism in view of the Endosymbiosis Theory, is the first enzyme purified from a formerly original endocytobiont (cyanelle) that is found to be encoded in the nucleus of the host. This cyanoplast enzyme was isolated by FPLC (19% yield) and characterized with respect to the uv-vis spectrum, pH optimum (pH 9), molecular mass of 34 kDa, and an N-terminal amino acid sequence (24 residues). The enzyme shows, as known from other organisms, molecular heterogeneity. The N-terminus of a further ferredoxin:NADP+ oxidoreductase polypeptide represents a shorter sequence missing the first four amino acids of the mature enzyme. PMID:1392619

  16. Purification, characterization, and amino acid sequencing of a. delta. /sup 5/-3-oxosteroid isomerase from Pseudomonas putida biotype B

    SciTech Connect

    Linden, K.G.

    1986-01-01

    Studies were performed on the ..delta../sup 5/-3-oxosteroid isomerase from Pseudomonas putida biotype B. The studies have involved three broad areas: improvement in the purification of the enzyme, further characterization of the purified enzyme, and completion of the amino acid sequence of the enzyme. For the purification of the enzyme, techniques for removing the isomerase from whole cells were studied, the effects of ionic strength on the binding of the isomerase to steroidal affinity resins was explored, and a new affinity resin was developed. Absorption spectra and the proton NMR spectra of the isomerase were obtained. Amino acid sequencing of the oxosteroid isomerase indicates that the enzyme is a dimeric protein consisting of two identical subunits each consisting of a polypeptide chain of 131 residues and a M/sub r/ = 14,536.

  17. Identification of novel rice low phytic acid mutations via TILLING by sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Phytic acid (myo-inositol-1,2,3,4,5,6-hexakisphosphate or InsP6) accounts for 75-85% of the total phosphorus in seeds. Low phytic acid (lpa) mutants exhibit decreases in seed InsP6 with corresponding increases in inorganic P which, unlike phytic acid P, is readily utilized by humans and monogastric ...

  18. Snake venoms. The amino-acid sequence of trypsin inhibitor E of Dendroaspis polylepis polylepis (Black Mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1978-06-01

    Trypsin inhibitor E from black mamba venom comprises 59 amino acid residues in a single polypeptide chain, cross-linked by three intrachain disulphide bridges. The complete primary structure of inhibitor E was elucidated. The sequence is homologous with trypsin inhibitors from different sources. Unique among this homologous series of proteinase inhibitors, inhibitor E has an affinity for transition metal ions, exemplified here by Cu2 and Co2+. PMID:668688

  19. Draft Genome Sequence of Escherichia coli Strain VKPM B-10182, Producing the Enzyme for Synthesis of Cephalosporin Acids

    PubMed Central

    Mardanov, Andrey V.; Eldarov, Mikhail A.; Sklyarenko, Anna V.; Dumina, Maria V.; Beletsky, Alexey V.; Yarotsky, Sergey V.

    2014-01-01

    Escherichia coli strain VKPM B-10182, obtained by chemical mutagenesis from E. coli strain ATCC 9637, produces cephalosporin acid synthetase employed in the synthesis of β-lactam antibiotics, such as cefazolin. The draft genome sequence of strain VKPM B-10182 revealed 32 indels and 1,780 point mutations that might account for the improvement in antibiotic synthesis that we observed. PMID:25414512

  20. First draft genome sequencing of indole acetic acid producing and plant growth promoting fungus Preussia sp. BSL10.

    PubMed

    Khan, Abdul Latif; Asaf, Sajjad; Khan, Abdur Rahim; Al-Harrasi, Ahmed; Al-Rawahi, Ahmed; Lee, In-Jung

    2016-05-10

    Preussia sp. BSL10, family Sporormiaceae, was actively producing phytohormone (indole-3-acetic acid) and extra-cellular enzymes (phosphatases and glucosidases). The fungus was also promoting the growth of arid-land tree-Boswellia sacra. Looking at such prospects of this fungus, we sequenced its draft genome for the first time. The Illumina based sequence analysis reveals an approximate genome size of 31.4Mbp for Preussia sp. BSL10. Based on ab initio gene prediction, total 32,312 coding sequences were annotated consisting of 11,967 coding genes, pseudogenes, and 221 tRNA genes. Furthermore, 321 carbohydrate-active enzymes were predicted and classified into many functional families. PMID:26995610

  1. A simple ligation-based method to increase the information density in sequencing reactions used to deconvolute nucleic acid selections

    PubMed Central

    Childs-Disney, Jessica L.; Disney, Matthew D.

    2008-01-01

    Herein, a method is described to increase the information density of sequencing experiments used to deconvolute nucleic acid selections. The method is facile and should be applicable to any selection experiment. A critical feature of this method is the use of biotinylated primers to amplify and encode a BamHI restriction site on both ends of a PCR product. After amplification, the PCR reaction is captured onto streptavidin resin, washed, and digested directly on the resin. Resin-based digestion affords clean product that is devoid of partially digested products and unincorporated PCR primers. The product's complementary ends are annealed and ligated together with T4 DNA ligase. Analysis of ligation products shows formation of concatemers of different length and little detectable monomer. Sequencing results produced data that routinely contained three to four copies of the library. This method allows for more efficient formulation of structure-activity relationships since multiple active sequences are identified from a single clone. PMID:18065718

  2. A novel T-cell-defined HLA-DR polymorphism not predicted from the linear amino acid sequence.

    PubMed

    Termijtelen, A; van den Elsen, P; Koning, F; de Koster, S; Schroeijers, W; Vanderkerckhove, B

    1989-09-01

    Recent investigations have shown that alloreactive T cells are capable of responding to structures defined by specific linear amino acid sequences on class II molecules. In the present study we show that also a polymorphism can be recognized that is not defined by such linear amino acid sequences. Two human T-cell clones, sensitized to DRw13 haplotypes, are described. The description of clone c50 serves to exemplify the first model. This DRB1-specific clone responds to stimulator cells that carry DR molecules, different in their DRB1 first and second hypervariable regions (HV1 and HV2) but identical in their HV3 regions (i.e., DRw13,Dw18; DRw13,Dw19; DR4,Dw10; and DRw11,LDVII). The second clone, c1443, behaves nonconventionally. It responds to DRw13,Dw18; DRw13,Dw19; and DR4,Dw4 stimulator cells, although no specific amino acid sequence is shared between these specificities. The latter pattern of reactivity suggests the existence of a novel polymorphism recognized by alloreactive T cells. This particular polymorphism may also be biologically significant. PMID:2476425

  3. cDNA-derived amino-acid sequence of a land turtle (Geochelone carbonaria) beta-chain hemoglobin.

    PubMed

    Bordin, S; Meza, A N; Saad, S T; Ogo, S H; Costa, F F

    1997-06-01

    The cDNA sequence encoding the turtle Geochelone carbonaria beta-chain was determinated. The isolation of hemoglobin mRNA was based on degenerate primers' PCR in combination with 5'- and 3'-RACE protocol. The full length cDNA is 615 bp with the ATG start codon at position 53 and TGA stop codon at position 495; The AATAAA polyadenylation signal is found at position 599. The deduced polypeptyde contains 146 amino-acid residues. The predicted amino acid sequence shares 83% identity with the beta-globin of a related specie, the aquatic turtle C. p. belli. Otherwise, identity is higher when compared with chicken beta-Hb (80%) than with other reptilian orders (Squamata, 69%, and Crocodilia, 61%). Compared with human HbA, there is 67% identity, and at least three amino acid substitutions could be of some functional significance (Glu43 beta-->Ser, His116 beta-->Thr and His143 beta-->Leu). To our knowledge this represents the first cDNA sequence of a reptile globin gene described. PMID:9238523

  4. Amino acid sequence of the serine-repeat antigen (SERA) of Plasmodium falciparum determined from cloned cDNA.

    PubMed

    Bzik, D J; Li, W B; Horii, T; Inselburg, J

    1988-09-01

    We report the isolation of cDNA clones for a Plasmodium falciparum gene that encodes the complete amino acid sequence of a previously identified exported blood stage antigen. The Mr of this antigen protein had been determined by sodium dodecylsulphate-polyacrylamide gel electrophoresis analysis, by different workers, to be 113,000, 126,000, and 140,000. We show, by cDNA nucleotide sequence analysis, that this antigen gene encodes a 989 amino acid protein (111 kDa) that contains a potential signal peptide, but not a membrane anchor domain. In the FCR3 strain the serine content of the protein was 11%, of which 57% of the serine residues were localized within a 201 amino acid sequence that included 35 consecutive serine residues. The protein also contained three possible N-linked glycosylation sites and numerous possible O-linked glycosylation sites. The mRNA was abundant during late trophozoite-schizont parasite stages. We propose to identity this antigen, which had been called p126, by the acronym SERA, serine-repeat antigen, based on its complete structure. The usefulness of the cloned cDNA as a source of a possible malaria vaccine is considered in view of the previously demonstrated ability of the antigen to induce parasite-inhibitory antibodies and a protective immune response in Saimiri monkeys. PMID:2847041

  5. Amino acid sequences of lysozymes newly purified from invertebrates imply wide distribution of a novel class in the lysozyme family.

    PubMed

    Ito, Y; Yoshikawa, A; Hotani, T; Fukuda, S; Sugimura, K; Imoto, T

    1999-01-01

    Lysozymes were purified from three invertebrates: a marine bivalve, a marine conch, and an earthworm. The purified lysozymes all showed a similar molecular weight of 13 kDa on SDS/PAGE. Their N-terminal sequences up to the 33rd residue determined here were apparently homologous among them; in addition, they had a homology with a partial sequence of a starfish lysozyme which had been reported before. The complete sequence of the bivalve lysozyme was determined by peptide mapping and subsequent sequence analysis. This was composed of 123 amino acids including as many as 14 cysteine residues and did not show a clear homology with the known types of lysozymes. However, the homology search of this protein on the protein or nucleic acid database revealed two homologous proteins. One of them was a gene product, CELF22 A3.6 of C. elegans, which was a functionally unknown protein. The other was an isopeptidase of a medicinal leech, named destabilase. Thus, a new type of lysozyme found in at least four species across the three classes of the invertebrates demonstrates a novel class of protein/lysozyme family in invertebrates. The bivalve lysozyme, first characterized here, showed extremely high protein stability and hen lysozyme-like enzymatic features. PMID:9914527

  6. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance

    PubMed Central

    Baranzoni, Gian Marco; Reichenberger, Erin R.; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-01-01

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here. PMID:27469964

  7. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance.

    PubMed

    Baranzoni, Gian Marco; Fratamico, Pina M; Reichenberger, Erin R; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-01-01

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here. PMID:27469964

  8. Complete genome sequences of Escherichia coli O157:H7 strains SRCC 1675 and 28RC that vary in acid resistance

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented....

  9. Fad7 gene identification and fatty acids phenotypic variation in an olive collection by EcoTILLING and sequencing approaches.

    PubMed

    Sabetta, Wilma; Blanco, Antonio; Zelasco, Samanta; Lombardo, Luca; Perri, Enzo; Mangini, Giacomo; Montemurro, Cinzia

    2013-08-01

    The ω-3 fatty acid desaturases (FADs) are enzymes responsible for catalyzing the conversion of linoleic acid to α-linolenic acid localized in the plastid or in the endoplasmic reticulum. In this research we report the genotypic and phenotypic variation of Italian Olea europaea L. germoplasm for the fatty acid composition. The phenotypic oil characterization was followed by the molecular analysis of the plastidial-type ω-3 FAD gene (fad7) (EC 1.14.19), whose full-length sequence has been here identified in cultivar Leccino. The gene consisted of 2635 bp with 8 exons and 5'- and 3'-UTRs of 336 and 282 bp respectively, and showed a high level of heterozygousity (1/110 bp). The natural allelic variation was investigated both by a LiCOR EcoTILLING assay and the PCR product direct sequencing. Only three haplotypes were identified among the 96 analysed cultivars, highlighting the strong degree of conservation of this gene. PMID:23685785

  10. Sequence-independent and reversible photocontrol of transcription/expression systems using a photosensitive nucleic acid binder

    PubMed Central

    Estévez-Torres, André; Crozatier, Cécile; Diguet, Antoine; Hara, Tomoaki; Saito, Hirohide; Yoshikawa, Kenichi; Baigl, Damien

    2009-01-01

    To understand non-trivial biological functions, it is crucial to develop minimal synthetic models that capture their basic features. Here, we demonstrate a sequence-independent, reversible control of transcription and gene expression using a photosensitive nucleic acid binder (pNAB). By introducing a pNAB whose affinity for nucleic acids is tuned by light, in vitro RNA production, EGFP translation, and GFP expression (a set of reactions including both transcription and translation) were successfully inhibited in the dark and recovered after a short illumination at 365 nm. Our results indicate that the accessibility of the protein machinery to one or several nucleic acid binding sites can be efficiently regulated by changing the conformational/condensation state of the nucleic acid (DNA conformation or mRNA aggregation), thus regulating gene activity in an efficient, reversible, and sequence-independent manner. The possibility offered by our approach to use light to trigger various gene expression systems in a system-independent way opens interesting perspectives to study gene expression dynamics as well as to develop photocontrolled biotechnological procedures. PMID:19617550

  11. Enzymatic generation of peptides flanked by basic amino acids to obtain MS/MS spectra with 2× sequence coverage

    PubMed Central

    Ebhardt, H Alexander; Nan, Jie; Chaulk, Steven G; Fahlman, Richard P; Aebersold, Ruedi

    2014-01-01

    RATIONALE Tandem mass (MS/MS) spectra generated by collision-induced dissociation (CID) typically lack redundant peptide sequence information in the form of e.g. b- and y-ion series due to frequent use of sequence-specific endopeptidases cleaving C- or N-terminal to Arg or Lys residues. METHODS Here we introduce arginyl-tRNA protein transferase (ATE, EC 2.3.2.8) for proteomics. ATE recognizes acidic amino acids or oxidized Cys at the N-terminus of a substrate peptide and conjugates an arginine from an aminoacylated tRNAArg onto the N-terminus of the substrate peptide. This enzymatic reaction is carried out under physiological conditions and, in combination with Lys-C/Asp-N double digest, results in arginylated peptides with basic amino acids on both termini. RESULTS We demonstrate that in vitro arginylation of peptides using yeast arginyl tRNA protein transferase 1 (yATE1) is a robust enzymatic reaction, specific to only modifying N-terminal acidic amino acids. Precursors originating from arginylated peptides generally have an increased protonation state compared with their non-arginylated forms. Furthermore, the product ion spectra of arginylated peptides show near complete 2× fragment ladders within the same MS/MS spectrum using commonly available electrospray ionization peptide fragmentation modes. Unexpectedly, arginylated peptides generate complete y- and c-ion series using electron transfer dissociation (ETD) despite having an internal proline residue. CONCLUSIONS We introduce a rapid enzymatic method to generate peptides flanked on either terminus by basic amino acids, resulting in a rich, redundant MS/MS fragment pattern. © 2014 The Authors. Rapid Communications in Mass Spectrometry published by John Wiley & Sons Ltd. PMID:25380496

  12. Site-directed gene mutation at mixed sequence targets by psoralen-conjugated pseudo-complementary peptide nucleic acids.

    PubMed

    Kim, Ki-Hyun; Nielsen, Peter E; Glazer, Peter M

    2007-01-01

    Sequence-specific DNA-binding molecules such as triple helix-forming oligonucleotides (TFOs) provide a means for inducing site-specific mutagenesis and recombination at chromosomal sites in mammalian cells. However, the utility of TFOs is limited by the requirement for homopurine stretches in the target duplex DNA. Here, we report the use of pseudo-complementary peptide nucleic acids (pcPNAs) for intracellular gene targeting at mixed sequence sites. Due to steric hindrance, pcPNAs are unable to form pcPNA-pcPNA duplexes but can bind to complementary DNA sequences by Watson-Crick pairing via double duplex-invasion complex formation. We show that psoralen-conjugated pcPNAs can deliver site-specific photoadducts and mediate targeted gene modification within both episomal and chromosomal DNA in mammalian cells without detectable off-target effects. Most of the induced psoralen-pcPNA mutations were single-base substitutions and deletions at the predicted pcPNA-binding sites. The pcPNA-directed mutagenesis was found to be dependent on PNA concentration and UVA dose and required matched pairs of pcPNAs. Neither of the individual pcPNAs alone had any effect nor did complementary PNA pairs of the same sequence. These results identify pcPNAs as new tools for site-specific gene modification in mammalian cells without purine sequence restriction, thereby providing a general strategy for designing gene targeting molecules. PMID:17977869

  13. Comparison of the nucleotide and amino acid sequences of the RsrI and EcoRI restriction endonucleases.

    PubMed

    Stephenson, F H; Ballard, B T; Boyer, H W; Rosenberg, J M; Greene, P J

    1989-12-21

    The RsrI endonuclease, a type-II restriction endonuclease (ENase) found in Rhodobacter sphaeroides, is an isoschizomer of the EcoRI ENase. A clone containing an 11-kb BamHI fragment was isolated from an R. sphaeroides genomic DNA library by hybridization with synthetic oligodeoxyribonucleotide probes based on the N-terminal amino acid (aa) sequence of RsrI. Extracts of E. coli containing a subclone of the 11-kb fragment display RsrI activity. Nucleotide sequence analysis reveals an 831-bp open reading frame encoding a polypeptide of 277 aa. A 50% identity exists within a 266-aa overlap between the deduced aa sequences of RsrI and EcoRI. Regions of 75-100% aa sequence identity correspond to key structural and functional regions of EcoRI. The type-II ENases have many common properties, and a common origin might have been expected. Nevertheless, this is the first demonstration of aa sequence similarity between ENases produced by different organisms. PMID:2695392

  14. Complete amino acid sequence of human plasma Zn-. cap alpha. /sub 2/-glycoprotein and its homology to histocompatibility antigens

    SciTech Connect

    Araki, T.; Gejyo, F.; Takagaki, K.; Haupt, H.; Schwick, H.G.; Buergi, W.; Marti, T.; Schaller, J.; Rickli, E.; Brossmer, R.

    1988-02-01

    In the present study the complete amino acid sequence of human plasma Zn-..cap alpha../sub 2/-glycoprotein was determined. This protein whose biological function is unknown consists of a single polypeptide chain of 276 amino acid residues including 8 tryptophan residues and has a pyroglutamyl residue at the amino terminus. The location of the two disulfide bonds in the polypeptide chain was also established. The three glycans, whose structure was elucidated with the aid of 500 MHz /sup 1/H NMR spectroscopy, were sialylated N-biantennas. The molecular weight calculated from the polypeptide and carbohydrate structure is 38,478, which is close to the reported value of approx. = 41,000 based on physicochemical measurements. The predicted secondary structure appeared to comprised of 23% ..cap alpha..-helix, 27% ..beta..-sheet, and 22% ..beta..-turns. The three N-glycans were found to be located in ..beta..-turn regions. An unexpected finding was made by computer analysis of the sequence data; this revealed that Zn-..cap alpha../sub 2/-glycoprotein is closely related to antigens of the major histocompatibility complex in amino acid sequence and in domain structure. There was an unusually high degree of sequence homology with the ..cap alpha.. chains of class I histocompatibility antigens. Moreover, this plasma protein was shown to be a member of the immunoglobulin gene superfamily. Zn-..cap alpha../sub 2/-glycoprotein appears to be truncated secretory major histocompatibility complex-related molecule, and it may have a role in the expression of the immune response.

  15. Distributed consensus on camera pose.

    PubMed

    Jorstad, Anne; DeMenthon, Daniel; Wang, I-Jeng; Burlina, Philippe

    2010-09-01

    Our work addresses pose estimation in a distributed camera framework. We examine how processing cameras can best reach a consensus about the pose of an object when they are each given a model of the object, defined by a set of point coordinates in the object frame of reference. The cameras can only see a subset of the object feature points in the midst of background clutter points, not knowing which image points match with which object points, nor which points are object points or background points. The cameras individually recover a prediction of the object's pose using their knowledge of the model, and then exchange information with their neighbors, performing consensus updates locally to obtain a single estimate consistent across all cameras, without requiring a common centralized processor. Our main contributions are: 1) we present a novel algorithm performing consensus updates in 3-D world coordinates penalized by a 3-D model, and 2) we perform a thorough comparison of our method with other current consensus methods. Our method is consistently the most accurate, and we confirm that the existing consensus method based upon calculating the Karcher mean of rotations is also reliable and fast. Experiments on simulated and real imagery are reported. PMID:20363678

  16. Primordia Vita. Deconvolution from Modern Sequences.

    NASA Astrophysics Data System (ADS)

    Trifonov, Edward N.; Gabdank, Idan; Barash, Danny; Sobolevsky, Yehoshua

    2006-12-01

    Evolution of the triplet code is reconstructed on the basis of consensus temporal order of appearance of amino acids. Several important predictions are confirmed by computational sequence analyses. The earliest amino acids, alanine and glycine, have been encoded by GCC and GGC codons, as today. They were succeeded, respectively, by A- and G-series of amino acids, encoded by pyrimidine-central and purine-central codons. The length of the earliest proteins is estimated to be 6 7 residues. The earliest mRNAs were short G+C-rich molecules. These short sequences could have formed hairpins. This is confirmed by analysis of modern prokaryotic mRNA sequences. Predominant size of detected ancient hairpins also corresponds to 6 7 amino acids, as above. Vestiges of last common ancestor can be found in extant proteins in form of entirely conserved short sequences of size six to nine residues present in all or almost all sequenced prokaryotic proteomes (omnipresent motifs). The functions of the topmost conserved octamers are not involved in the basic elementary syntheses. This suggests an initial abiotic supply of amino acids, bases and sugars.

  17. ENTPRISE: An Algorithm for Predicting Human Disease-Associated Amino Acid Substitutions from Sequence Entropy and Predicted Protein Structures

    PubMed Central

    Zhou, Hongyi; Gao, Mu; Skolnick, Jeffrey

    2016-01-01

    The advance of next-generation sequencing technologies has made exome sequencing rapid and relatively inexpensive. A major application of exome sequencing is the identification of genetic variations likely to cause Mendelian diseases. This requires processing large amounts of sequence information and therefore computational approaches that can accurately and efficiently identify the subset of disease-associated variations are needed. The accuracy and high false positive rates of existing computational tools leave much room for improvement. Here, we develop a boosted tree regression machine-learning approach to predict human disease-associated amino acid variations by utilizing a comprehensive combination of protein sequence and structure features. On comparing our method, ENTPRISE, to the state-of-the-art methods SIFT, PolyPhen-2, MUTATIONASSESSOR, MUTATIONTASTER, FATHMM, ENTPRISE exhibits significant improvement. In particular, on a testing dataset consisting of only proteins with balanced disease-associated and neutral variations defined as having the ratio of neutral/disease-associated variations between 0.3 and 3, the Mathews Correlation Coefficient by ENTPRISE is 0.493 as compared to 0.432 by PPH2-HumVar, 0.406 by SIFT, 0.403 by MUTATIONASSESSOR, 0.402 by PPH2-HumDiv, 0.305 by MUTATIONTASTER, and 0.181 by FATHMM. ENTPRISE is then applied to nucleic acid binding proteins in the human proteome. Disease-associated predictions are shown to be highly correlated with the number of protein-protein interactions. Both these predictions and the ENTPRISE server are freely available for academic users as a web service at http://cssb.biology.gatech.edu/entprise/. PMID:26982818

  18. The sequence diversity and expression among genes of the folic acid biosynthesis pathway in industrial Saccharomyces strains.

    PubMed

    Goncerzewicz, Anna; Misiewicz, Anna

    2015-01-01

    Folic acid is an important vitamin in human nutrition and its deficiency in pregnant women's diets results in neural tube defects and other neurological damage to the fetus. Additionally, DNA synthesis, cell division and intestinal absorption are inhibited in case of adults. Since this discovery, governments and health organizations worldwide have made recommendations concerning folic acid supplementation of food for women planning to become pregnant. In many countries this has led to the introduction of fortifications, where synthetic folic acid is added to flour. It is known that Saccharomyces strains (brewing and bakers' yeast) are one of the main producers of folic acid and they can be used as a natural source of this vitamin. Proper selection of the most efficient strains may enhance the folate content in bread, fermented vegetables, dairy products and beer by 100% and may be used in the food industry. The objective of this study was to select the optimal producing yeast strain by determining the differences in nucleotide sequences in the FOL2, FOL3 and DFR1 genes of folic acid biosynthesis pathway. The Multitemperature Single Strand Conformation Polymorphism (MSSCP) method and further nucleotide sequencing for selected strains were applied to indicate SNPs in selected gene fragments. The RT qPCR technique was also applied to examine relative expression of the FOL3 gene. Furthermore, this is the first time ever that industrial yeast strains were analysed regarding genes of the folic acid biosynthesis pathway. It was observed that a correlation exists between the folic acid amount produced by industrial yeast strains and changes in the nucleotide sequence of adequate genes. The most significant changes occur in the DFR1 gene, mostly in the first part, which causes major protein structure modifications in KKP 232, KKP 222 and KKP 277 strains. Our study shows that the large amount of SNP contributes to impairment of the selected enzymes and S. cerevisiae and S

  19. Fatty acid profile and Unigene-derived simple sequence repeat markers in tung tree (Vernicia fordii)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Tung tree (Vernicia fordii) provides the sole source of tung oil widely used in industry. Lack of fatty acid composition and molecular markers hinders biochemical, genetic and breeding research. The objectives of this study were to determine fatty acid profiles and develop unigene-derived simple se...

  20. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... Director of the Federal Register in accordance with 5 U.S.C. 552(a) and 1 CFR part 51. Copies of WIPO... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid...

  1. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... Director of the Federal Register in accordance with 5 U.S.C. 552(a) and 1 CFR part 51. Copies of WIPO... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid...

  2. Analysis of human immunodeficiency virus type 1 nef gene sequences present in vivo.

    PubMed Central

    Shugars, D C; Smith, M S; Glueck, D H; Nantermet, P V; Seillier-Moiseiwitsch, F; Swanstrom, R

    1993-01-01

    The nef genes of the human immunodeficiency viruses type 1 and 2 (HIV-1 and HIV-2) and the related simian immunodeficiency viruses (SIVs) encode a protein (Nef) whose role in virus replication and cytopathicity remains uncertain. As an attempt to elucidate the function of nef, we characterized the nucleotide and corresponding protein sequences of naturally occurring nef genes obtained from several HIV-1-infected individuals. A consensus Nef sequence was derived and used to identify several features that were highly conserved among the Nef sequences. These features included a nearly invariant myristylation signal, regions of sequence polymorphism and variable duplication, a region with an acidic charge, a (Pxx)4 repeat sequence, and a potential protein kinase C phosphorylation site. Clustering of premature stop codons at position 124 was noted in 6 of the 54 Nef sequences. Further analysis revealed four stretches of residues that were highly conserved not only among the patient-derived HIV-1 Nef sequences, but also among the Nef sequences of HIV-2 and the SIVs, suggesting that Nef proteins expressed by these retroviruses are functionally equivalent. The "Nef-defining" sequences were used to evaluate the sequence alignments of known proteins reported to share sequence similarity with Nef sequences and to conduct additional computer-based searches for similar protein sequences. A gene encoding the consensus Nef sequence was also generated. This gene encodes a full-length Nef protein that should be a valuable tool in further studies of Nef function. Images PMID:8043040

  3. "De-novo" amino acid sequence elucidation of protein G'e by combined "Top-Down" and "Bottom-Up" mass spectrometry

    NASA Astrophysics Data System (ADS)

    Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F. M.; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L.; Glocker, Michael O.

    2015-03-01

    Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein Ǵ with great scientific and economic importance. Substantial deviations to the published amino acid sequence (Uniprot Q54181) were found by the presence of 46 additional amino acids at the N-terminus, including a so-called "His-tag" as well as an N-terminal partial α- N-gluconoylation and α- N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α- N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to determine the expression vector pET-28b from Novagen with the Xho I restriction enzyme cleavage site as the best option that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant ( K d ) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.

  4. "De-novo" amino acid sequence elucidation of protein G'e by combined "top-down" and "bottom-up" mass spectrometry.

    PubMed

    Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F M; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L; Glocker, Michael O

    2015-03-01

    Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein G´ with great scientific and economic importance. Substantial deviations to the published amino acid sequence (Uniprot Q54181) were found by the presence of 46 additional amino acids at the N-terminus, including a so-called "His-tag" as well as an N-terminal partial α-N-gluconoylation and α-N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α-N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to determine the expression vector pET-28b from Novagen with the Xho I restriction enzyme cleavage site as the best option that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant (K(d)) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins. PMID:25560987

  5. Dynamic behavior of an intrinsically unstructured linker domain is conserved in the face of negligible amino acid sequence conservation.

    PubMed

    Daughdrill, Gary W; Narayanaswami, Pranesh; Gilmore, Sara H; Belczyk, Agniezka; Brown, Celeste J

    2007-09-01

    Proteins or regions of proteins that do not form compact globular structures are classified as intrinsically unstructured proteins (IUPs). IUPs are common in nature and have essential molecular functions, but even a limited understanding of the evolution of their dynamic behavior is lacking. The primary objective of this work was to test the evolutionary conservation of dynamic behavior for a particular class of IUPs that form intrinsically unstructured linker domains (IULD) that tether flanking folded domains. This objective was accomplished by measuring the backbone flexibility of several IULD homologues using nuclear magnetic resonance (NMR) spectroscopy. The backbone flexibility of five IULDs, representing three kingdoms, was measured and analyzed. Two IULDs from animals, one IULD from fungi, and two IULDs from plants showed similar levels of backbone flexibility that were consistent with the absence of a compact globular structure. In contrast, the amino acid sequences of the IULDs from these three taxa showed no significant similarity. To investigate how the dynamic behavior of the IULDs could be conserved in the absence of detectable sequence conservation, evolutionary rate studies were performed on a set of nine mammalian IULDs. The results of this analysis showed that many sites in the IULD are evolving neutrally, suggesting that dynamic behavior can be maintained in the absence of natural selection. This work represents the first experimental test of the evolutionary conservation of dynamic behavior and demonstrates that amino acid sequence conservation is not required for the conservation of dynamic behavior and presumably molecular function. PMID:17721672

  6. Cloning and nucleotide sequencing of genes for three small, acid-soluble proteins from Bacillus subtilis spores.

    PubMed Central

    Connors, M J; Mason, J M; Setlow, P

    1986-01-01

    Three Bacillus subtilis genes (termed sspA, sspB, and sspD) which code for small, acid-soluble spore proteins (SASPs) have been cloned, and their complete nucleotide sequence has been determined. The amino acid sequences of the SASPs coded for by these genes are similar to each other and to those of the SASP-1 of B. subtilis (coded for by the sspC gene) and the SASP-A/C family of B. megaterium. The sspA and sspB genes are expressed only in sporulation, in parallel with each other and with the sspC gene. Two regions upstream of the postulated transcription start sites for the sspA and B genes have significant homology with the analogous regions of the sspC gene and the SASP-A/C gene family. Purification of two of the three major B, subtilis SASPs (alpha and beta) and determination of their amino-terminal sequences indicated that the sspA gene codes for SASP-alpha and that the sspB gene codes for SASP-beta. This was confirmed by the introduction of deletion mutations into the cloned sspA and sspB genes and transfer of these deletions into the B. subtilis chromosome with concomitant loss of the wild-type gene. Images PMID:3009398

  7. Nucleotide sequence of the fadR gene, a multifunctional regulator of fatty acid metabolism in Escherichia coli.

    PubMed Central

    DiRusso, C C

    1988-01-01

    The Escherichia coli fadR gene is a multifunctional regulator of fatty acid and acetate metabolism. In the present work the nucleotide sequence of the 1.3 kb DNA fragment which encodes FadR has been determined. The coding sequence of the fadR gene is 714 nucleotides long and is preceded by a typical E. coli ribosome binding site and is followed by a sequence predicted to be sufficient for factor-independent chain termination. Primer extension experiments demonstrated that the transcription of the fadR gene initiates with an adenine nucleotide 33 nucleotides upstream from the predicted start of translation. The derived fadR peptide has a calculated molecular weight of 26,972. This is in reasonable agreement with the apparent molecular weight of 29,000 previously estimated on the basis of maxi-cell analysis of plasmid encoded proteins. There is a segment of twenty amino acids within the predicted peptide which resembles the DNA recognition and binding site of many transcriptional regulatory proteins. Images PMID:2843809

  8. The amino acid sequence of protein SCMK-B2C from the high-sulphur fraction of wool keratin

    PubMed Central

    Elleman, T. C.

    1972-01-01

    1. The amino acid sequence of a protein from the reduced and carboxymethylated high-sulphur fraction of wool has been determined. 2. The sequence of this S-carboxymethylkerateine (SCMK-B2C) of 151 amino acid residues displays much internal homology and an unusual residue distribution. Thus a ten-residue sequence occurs four times near the N-terminus and five times near the C-terminus with few changes. These regions contain much of the molecule's half-cystine, whereas between them there is a region of 19 residues that are mainly small and devoid of cystine and proline. 3. Certain models of the wool fibre based on its mechanical and physical properties propose a matrix of small compact globular units linked together to form beaded chains. The unusual distribution of the component residues of protein SCMK-B2C suggests structures in the wool-fibre matrix compatible with certain features of the proposed models. PMID:4678578

  9. The nucleotide sequence of HLA-B{sup *}2704 reveals a new amino acid substitution in exon 4 which is also present in HLA-B{sup *}2706

    SciTech Connect

    Rudwaleit, M.; Bowness, P.; Wordsworth, P.

    1996-12-31

    The HLA-B27 subtype HLA-B{sup *}2704 is virtually absent in Caucasians but common in Orientals, where it is associated with ankylosing spondylitis. The amino acid sequence of HLA-B{sup *}2704 has been established by peptide mapping and was shown to differ by two amino acids from HLA-B{sup *}2705, HLA-B{sup *}2704 is characterized by a serine for aspartic acid substitution at position 77 and glutamic acid for valine at position 152. To date, however, no nucleotide sequence confirming these changes at the DNA level has been published. 13 refs., 2 figs.

  10. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.

    PubMed

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/. PMID:26424080

  11. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms

    PubMed Central

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/ PMID:26424080

  12. C3 glomerulopathy: consensus report

    PubMed Central

    Pickering, Matthew C; D'Agati, Vivette D; Nester, Carla M; Smith, Richard J; Haas, Mark; Appel, Gerald B; Alpers, Charles E; Bajema, Ingeborg M; Bedrosian, Camille; Braun, Michael; Doyle, Mittie; Fakhouri, Fadi; Fervenza, Fernando C; Fogo, Agnes B; Frémeaux-Bacchi, Véronique; Gale, Daniel P; Goicoechea de Jorge, Elena; Griffin, Gene; Harris, Claire L; Holers, V Michael; Johnson, Sally; Lavin, Peter J; Medjeral-Thomas, Nicholas; Paul Morgan, B; Nast, Cynthia C; Noel, Laure-Hélène; Peters, D Keith; Rodríguez de Córdoba, Santiago; Servais, Aude; Sethi, Sanjeev; Song, Wen-Chao; Tamburini, Paul; Thurman, Joshua M; Zavros, Michael; Cook, H Terence

    2013-01-01

    C3 glomerulopathy is a recently introduced pathological entity whose original definition was glomerular pathology characterized by C3 accumulation with absent or scanty immunoglobulin deposition. In August 2012, an invited group of experts (comprising the authors of this document) in renal pathology, nephrology, complement biology, and complement therapeutics met to discuss C3 glomerulopathy in the first C3 Glomerulopathy Meeting. The objectives were to reach a consensus on: the definition of C3 glomerulopathy, appropriate complement investigations that should be performed in these patients, and how complement therapeutics should be explored in the condition. This meeting report represents the current consensus view of the group. PMID:24172683

  13. A molecular mechanism realizing sequence-specific recognition of nucleic acids by TDP-43

    PubMed Central

    Furukawa, Yoshiaki; Suzuki, Yoh; Fukuoka, Mami; Nagasawa, Kenichi; Nakagome, Kenta; Shimizu, Hideaki; Mukaiyama, Atsushi; Akiyama, Shuji

    2016-01-01

    TAR DNA-binding protein 43 (TDP-43) is a DNA/RNA-binding protein containing two consecutive RNA recognition motifs (RRM1 and RRM2) in tandem. Functional abnormality of TDP-43 has been proposed to cause neurodegeneration, but it remains obscure how the physiological functions of this protein are regulated. Here, we show distinct roles of RRM1 and RRM2 in the sequence-specific substrate recognition of TDP-43. RRM1 was found to bind a wide spectrum of ssDNA sequences, while no binding was observed between RRM2 and ssDNA. When two RRMs are fused in tandem as in native TDP-43, the fused construct almost exclusively binds ssDNA with a TG-repeat sequence. In contrast, such sequence-specificity was not observed in a simple mixture of RRM1 and RRM2. We thus propose that the spatial arrangement of multiple RRMs in DNA/RNA binding proteins provides steric effects on the substrate-binding site and thereby controls the specificity of its substrate nucleotide sequences. PMID:26838063

  14. Application of combined mass spectrometry and partial amino acid sequence to the identification of gel-separated proteins.

    PubMed

    Patterson, S D; Thomas, D; Bradshaw, R A

    1996-05-01

    The combined use of peptide mass information with amino acid sequence information derived by chemical sequencing or mass spectrometry (MS)-based approaches provides a powerful means of protein identification. We have used a two-part strategy to identify proteins from nerve growth factor (NGF)-stimulated rat adrenal pheochromocytoma cell line PC-12 cell lysates that associate with the adaptor protein Shc (Shc homologous and collagen protein). Initial experiments with metabolically radiolabeled cell extracts separated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) revealed a number of proteins that coimmunoprecipitated with anti-Shc antibody compared with control (unstimulated) cell extracts. The experiment was scaled up and cell lysate from NGF-stimulated PC-12 cells was applied to a glutathione-S-transferase (GST)-Shc affinity column, eluted, separated by SDS-PAGE and blotted to Immobilon-CD. The blotted proteins were proteolytically digested in situ, and the masses obtained from the extracted peptides were used in a peptide-mass search program in an attempt to identify the protein. Even if a strong candidate was found using this search, an additional step was performed to confirm the identification. The mixtures were fractionated by reversed-phase high-performance liquid chromatography (RP-HPLC) and subjected to chemical sequencing to obtain (partial) sequence information, or post-source decay (PSD-) matrix-assisted laser-desorption ionization (MALDI)-MS to obtain sequence-specific fragment ions. This data was used in a peptide-sequence tag search to confirm the identity of the proteins. This combined approach allowed identification of four proteins of M(r) 43,000 to 200,000. In one case the identified protein clearly did not correspond to the radiolabeled band, but to a protein contaminant from the column. The advantages and pitfalls of the approach are discussed. PMID:8783013

  15. Peptide mapping and amino acid sequencing of two catechol 1,2-dioxygenases (CD I1 and CD I2) from Acinetobacter lwoffii K24.

    PubMed

    Kim, S I; Ha, K S

    1997-10-31

    The partial amino acid sequences of two catechol 1,2-dioxygenases (CD I1 and CD I2) from Acinetobacter lwoffii K24 have been determined by analysis of peptides after cleavages with endopeptidase Lys-C, endopeptidase Glu-C, trypsin, and chemicals (cyanogen bromide and BNPS-skatole). They include 248 amino acid sequences (4 fragments) of CD I1 and 211 amino acid sequences (5 fragments) of CD I2. Two enzymes have more than 50% sequence homology with type I catechol 1,2-dioxygenases and less than 30% sequence homology with type II catechol 1,2-dioxygenases. Two enzymes have similar hydropathy profiles in the N-terminal region, suggesting that they have similar secondary structures. PMID:9387151

  16. The Role of HIV-1 gp41 Glycoprotein in Infectious Tropism Inferred from Physico-Chemical Properties of its Amino Acid Sequence

    NASA Astrophysics Data System (ADS)

    Figueroa, E.; Villarreal, C.; Huerta, L.; Cocho, G.

    2006-09-01

    We performed a statistical analysis of the amino acid sequence of the gp41 ectodomain of the Human Immunodeficiency Virus type 1. We found strong correlations between physicochemical properties of highly variable residues and the viral infectious tropism.

  17. Characterization of fatty acid-producing wastewater microbial communities using next generation sequencing technologies

    EPA Science Inventory

    While wastewater represents a viable source of bacterial biodiesel production, very little is known on the composition of these microbial communities. We studied the taxonomic diversity and succession of microbial communities in bioreactors accumulating fatty acids using 454-pyro...

  18. Complete Genome Sequence of Amino Acid-Utilizing Eubacterium acidaminophilum al-2 (DSM 3953)

    PubMed Central

    Poehlein, Anja; Andreesen, Jan R.

    2014-01-01

    Eubacterium acidaminophilum is a strictly anaerobic, Gram-positive, rod-shaped bacterium which belongs to cluster XI of the Clostridia. It ferments amino acids by a Stickland reaction. The genome harbors a chromosome (2.25 Mb) and a megaplasmid (0.8 Mb). It contains several gene clusters coding for selenocysteine-containing, glycine-derived, and amino acid-degrading reductases. PMID:24926057

  19. Using Chou's pseudo amino acid composition to predict protein quaternary structure: a sequence-segmented PseAAC approach.

    PubMed

    Zhang, Shao-Wu; Chen, Wei; Yang, Feng; Pan, Quan

    2008-10-01

    In the protein universe, many proteins are composed of two or more polypeptide chains, generally referred to as subunits, which associate through noncovalent interactions and, occasionally, disulfide bonds to form protein quaternary structures. It has long been known that the functions of proteins are closely related to their quaternary structures; some examples include enzymes, hemoglobin, DNA polymerase, and ion channels. However, it is extremely labor-expensive and even impossible to quickly determine the structures of hundreds of thousands of protein sequences solely from experiments. Since the number of protein sequences entering databanks is increasing rapidly, it is highly desirable to develop computational methods for classifying the quaternary structures of proteins from their primary sequences. Since the concept of Chou's pseudo amino acid composition (PseAAC) was introduced, a variety of approaches, such as residue conservation scores, von Neumann entropy, multiscale energy, autocorrelation function, moment descriptors, and cellular automata, have been utilized to formulate the PseAAC for predicting different attributes of proteins. Here, in a different approach, a sequence-segmented PseAAC is introduced to represent protein samples. Meanwhile, multiclass SVM classifier modules were adopted to classify protein quaternary structures. As a demonstration, the dataset constructed by Chou and Cai [(2003) Proteins 53:282-289] was adopted as a benchmark dataset. The overall jackknife success rates thus obtained were 88.2-89.1%, indicating that the new approach is quite promising for predicting protein quaternary structure. PMID:18427713

  20. Effects of the amino acid sequence on thermal conduction through β-sheet crystals of natural silk protein.

    PubMed

    Zhang, Lin; Bai, Zhitong; Ban, Heng; Liu, Ling

    2015-11-21

    Recent experiments have discovered very different thermal conductivities between the spider silk and the silkworm silk. Decoding the molecular mechanisms underpinning the distinct thermal properties may guide the rational design of synthetic silk materials and other biomaterials for multifunctionality and tunable properties. However, such an understanding is lacking, mainly due to the complex structure and phonon physics associated with the silk materials. Here, using non-equilibrium molecular dynamics, we demonstrate that the amino acid sequence plays a key role in the thermal conduction process through β-sheets, essential building blocks of natural silks and a variety of other biomaterials. Three representative β-sheet types, i.e. poly-A, poly-(GA), and poly-G, are shown to have distinct structural features and phonon dynamics leading to different thermal conductivities. A fundamental understanding of the sequence effects may stimulate the design and engineering of polymers and biopolymers for desired thermal properties. PMID:26455593

  1. Ribonuclease "XlaI," an activity from Xenopus laevis oocytes that excises intervening sequences from yeast transfer ribonucleic acid precursors.

    PubMed Central

    Otsuka, A; de Paolis, A; Tocchini-Valentini, G P

    1981-01-01

    A ribonuclease (RNase) activity, RNase "XlaI," responsible for the excision of intervening sequences from two yeast transfer ribonucleic acid (tRNA) precursors, pre-tRNA(Tyr) and pre-tRNA(3Leu), has been purified 54-fold from nuclear extracts of Xenopus laevis oocytes. The RNase preparation is essentially free of contaminating RNase. A quantitative assay for RNase XlaI was developed, and the reaction products were characterized. RNase XlaI cleavage sites in the yeast tRNA precursors were identical to those made by yeast extracts (including 3'-phosphate and 5'-hydroxyl termini). Cleavage of pre-tRNA(3Leu) by RNase XlaI and subsequent ligation of the half-tRNA molecules do not require removal of the 5' leader or 3' trailer sequences. Images PMID:6765601

  2. Purification, amino acid sequence and mode of action of bifidocin B produced by Bifidobacterium bifidum NCFB 1454.

    PubMed

    Yildirim, Z; Winters, D K; Johnson, M G

    1999-01-01

    Bifidocin B produced by Bifidobacterium bifidum NCFB 1454 was purified to homogeneity by a rapid and simple three step purification procedure which included freeze drying, Micro-Cel adsorption/desorption and cation exchange chromatography. The purification resulted in 18% recovery and an approximately 1900-fold increase in the specific activity and purity of bifidocin B. Treatment with bifidocin B caused sensitive cells to lose high amounts of intracellular K+ ions and u.v.-absorbing materials, and to become more permeable to ONPG. Bifidocin B adsorbed to the Gram-positive bacteria but not the Gram-negative bacteria tested. Its adsorption was pH-dependent but not time-dependent. For sensitive cells, the adsorption and lethal action of bifidocin B was very rapid. In 5 min, 95% of bifidocin B adsorbed onto sensitive cells. Several salts inhibited the binding of bifidocin B, which could be overcome by increasing the amount of bifidocin B added. Pre-treatment of sensitive cells and cell walls with detergents, organic solvents or enzymes did not cause a reduction in subsequent cellular binding of bifidocin B, but cell wall preparations treated with methanol:chloroform and hot 20% (w/v) TCA lost the ability to adsorb bifidocin B. Also, the addition of purified heterologous lipoteichoic acid to sensitive cells completely blocked the adsorption of bifidocin B. The amino acid sequence indicated that the bacteriocin contained 36 residues. N-terminal amino acid sequence analysis yielded a sequence of KYYGNGVTCGLHDCRVDRGKATCGIINNGGMWGDIG. Curing experiments with 20 micrograms ml-1 acriflavine yielded cell derivatives that no longer produced bifidocin B but retained immunity to bifidocin B. Production of bifidocin B, but not immunity to bifidocin B, was associated with a plasmid of about 8 kb in this strain. PMID:10030011

  3. Amino acid sequences of two novel long-chain neurotoxins from the venom of the sea snake Laticauda colubrina.

    PubMed

    Kim, H S; Tamiya, N

    1982-11-01

    From the venom of a population of the sea snake Laticauda colubrina from the Solomon Islands, a neurotoxic component, Laticauda colubrina a (toxin Lc a), was isolated in 16.6% (A280) yield. Similarly, from the venom of a population of L. colubrina from the Philippines, a neurotoxic component, Laticauda colubrina b (toxin Lc b), was obtained in 10.0% (A280) yield. The LD50 values of these toxins were 0.12 microgram/g body wt. on intramuscular injection in mice. Toxins Lc a and Lc b were each composed of molecules containing 69 amino acid residues with eight half-cystine residues. The complete amino acid sequences of these two toxins were elucidated. Toxins Lc a and Lc b are different from each other at five positions of their sequences, namely at positions 31 (Phe/Ser), 32 (Leu/Ile), 33 (Lys/Arg), 50 (Pro/Arg) and 53 (Asp/His) (residues in parentheses give the residues in toxins Lc a and Lc b respectively). Toxins Lc a and Lc b have a novel structure in that they have only four disulphide bridges, although the whole amino acid sequences are homologous to those of other known long-chain neurotoxins. It is remarkable that toxins Lc a and Lc b are not coexistent at the detection error of 6% of the other toxin. Populations of Laticauda colubrina from the Solomon Islands and from the Philippines have either toxin Lc a or toxin Lc b and not both of them. PMID:7159381

  4. Evolutionary origin of asymptotically stable consensus.

    PubMed

    Tang, Chang-Bing; Wu, Bin; Wang, Jian-Bo; Li, Xiang

    2014-01-01

    Consensus is widely observed in nature as well as in society. Up to now, many works have focused on what kind of (and how) isolated single structures lead to consensus, while the dynamics of consensus in interdependent populations remains unclear, although interactive structures are everywhere. For such consensus in interdependent populations, we refer that the fraction of population adopting a specified strategy is the same across different interactive structures. A two-strategy game as a conflict is adopted to explore how natural selection affects the consensus in such interdependent populations. It is shown that when selection is absent, all the consensus states are stable, but none are evolutionarily stable. In other words, the final consensus state can go back and forth from one to another. When selection is present, there is only a small number of stable consensus state which are evolutionarily stable. Our study highlights the importance of evolution on stabilizing consensus in interdependent populations. PMID:24699444

  5. The sequence of rat leukosialin (W3/13 antigen) reveals a molecule with O-linked glycosylation of one third of its extracellular amino acids.

    PubMed Central

    Killeen, N; Barclay, A N; Willis, A C; Williams, A F

    1987-01-01

    Leukosialin is one of the major glycoproteins of thymocytes and T lymphocytes and is notable for a very high content of O-linked carbohydrate structures. The full protein sequence for rat leukosialin as translated from cDNA clones is now reported. The molecule contains 371 amino acids with 224 residues outside the cell, one transmembrane sequence and 124 cytoplasmic residues. Data from the peptide sequence and carbohydrate composition suggest that one in three of the extracellular amino acids may be O-glycosylated with no N-linked glycosylation sites. The cDNA sequence contained a CpG rich region in the 3' coding sequence and a large 3' non-coding region which included tandem repeats of the sequence GGAT. Images Fig. 4. PMID:2965006

  6. Amorphous/nanocrystalline silicon biosensor for the specific identification of unamplified nucleic acid sequences using gold nanoparticle probes

    NASA Astrophysics Data System (ADS)

    Martins, Rodrigo; Baptista, Pedro; Raniero, Leandro; Doria, Gonçalo; Silva, Leonardo; Franco, Ricardo; Fortunato, Elvira

    2007-01-01

    Amorphous/nanocrystalline silicon pi 'ii'n devices fabricated on micromachined glass substrates are integrated with oligonucleotide-derivatized gold nanoparticles for a colorimetric detection method. The method enables the specific detection and quantification of unamplified nucleic acid sequences (DNA and RNA) without the need to functionalize the glass surface, allowing for resolution of single nucleotide differences between DNA and RNA sequences—single nucleotide polymorphism and mutation detection. The detector's substrate is glass and the sample is directly applied on the back side of the biosensor, ensuring a direct optical coupling of the assays with a concomitant maximum photon capture and the possibility to reuse the sensor.

  7. International consensus on allergy immunotherapy.

    PubMed

    Jutel, Marek; Agache, Ioana; Bonini, Sergio; Burks, A Wesley; Calderon, Moises; Canonica, Walter; Cox, Linda; Demoly, Pascal; Frew, Antony J; O'Hehir, Robin; Kleine-Tebbe, Jörg; Muraro, Antonella; Lack, Gideon; Larenas, Désirée; Levin, Michael; Nelson, Harald; Pawankar, Ruby; Pfaar, Oliver; van Ree, Ronald; Sampson, Hugh; Santos, Alexandra F; Du Toit, George; Werfel, Thomas; Gerth van Wijk, Roy; Zhang, Luo; Akdis, Cezmi A

    2015-09-01

    Allergen immunotherapy (AIT) has been used to treat allergic disease since the early 1900s. Despite numerous clinical trials and meta-analyses proving AIT efficacious, it remains underused and is estimated to be used in less than 10% of patients with allergic rhinitis or asthma worldwide. In addition, there are large differences between regions, which are not only due to socioeconomic status. There is practically no controversy about the use of AIT in the treatment of allergic rhinitis and allergic asthma, but for atopic dermatitis or food allergy, the indications for AIT are not well defined. The elaboration of a wider consensus is of utmost importance because AIT is the only treatment that can change the course of allergic disease by preventing the development of asthma and new allergen sensitizations and by inducing allergen-specific immune tolerance. Safer and more effective AIT strategies are being continuously developed both through elaboration of new allergen preparations and adjuvants and alternate routes of administration. A number of guidelines, consensus documents, or both are available on both the international and national levels. The international community of allergy specialists recognizes the need to develop a comprehensive consensus report to harmonize, disseminate, and implement the best AIT practice. Consequently, the International Collaboration in Asthma, Allergy and Immunology, formed by the European Academy of Allergy and Clinical Immunology; the American Academy of Allergy, Asthma & Immunology; the American College of Allergy, Asthma & Immunology; and the World Allergy Organization, has decided to issue an international consensus on AIT. PMID:26162571

  8. An Interpretation of the Ancestral Codon from Miller’s Amino Acids and Nucleotide Correlations in Modern Coding Sequences

    PubMed Central

    Carels, Nicolas; de Leon, Miguel Ponce

    2015-01-01

    Purine bias, which is usually referred to as an “ancestral codon”, is known to result in short-range correlations between nucleotides in coding sequences, and it is common in all species. We demonstrate that RWY is a more appropriate pattern than the classical RNY, and purine bias (Rrr) is the product of a network of nucleotide compensations induced by functional constraints on the physicochemical properties of proteins. Through deductions from universal correlation properties, we also demonstrate that amino acids from Miller’s spark discharge experiment are compatible with functional primeval proteins at the dawn of living cell radiation on earth. These amino acids match the hydropathy and secondary structures of modern proteins. PMID:25922573

  9. Rapid Nucleic Acid Sequencing Methods--Alternative Approaches to Facilitating Learning.

    ERIC Educational Resources Information Center

    Bryce, Charles F. A.

    1982-01-01

    Because advanced students had difficulty in interpreting cleavage patterns obtained by gel electrophoresis related to rapid sequencing techniques for DNA and RNA, several formats were developed to aid in understanding this topic. Formats included print, print plus scrambled print, interactive computer-based instruction, and high-resolution…

  10. Draft Genome Sequence of Ustilago trichophora RK089, a Promising Malic Acid Producer

    PubMed Central

    Zambanini, Thiemo; Buescher, Joerg M.; Meurer, Guido; Blank, Lars M.

    2016-01-01

    The basidiomycetous smut fungus Ustilago trichophora RK089 produces malate from glycerol. De novo genome sequencing revealed a 20.7-Mbp genome (301 gap-closed contigs, 246 scaffolds). A comparison to the genome of Ustilago maydis 521 revealed all essential genes for malate production from glycerol contributing to metabolic engineering for improving malate production. PMID:27469969

  11. Draft Genome Sequence of Ustilago trichophora RK089, a Promising Malic Acid Producer.

    PubMed

    Zambanini, Thiemo; Buescher, Joerg M; Meurer, Guido; Wierckx, Nick; Blank, Lars M

    2016-01-01

    The basidiomycetous smut fungus Ustilago trichophora RK089 produces malate from glycerol. De novo genome sequencing revealed a 20.7-Mbp genome (301 gap-closed contigs, 246 scaffolds). A comparison to the genome of Ustilago maydis 521 revealed all essential genes for malate production from glycerol contributing to metabolic engineering for improving malate production. PMID:27469969

  12. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... submissions in computer readable form. (a) The computer readable form required by § 1.821(e) shall meet the following requirements: (1) The computer readable form shall contain a single “Sequence Listing” as either...

  13. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... submissions in computer readable form. (a) The computer readable form required by § 1.821(e) shall meet the following requirements: (1) The computer readable form shall contain a single “Sequence Listing” as either...

  14. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... submissions in computer readable form. (a) The computer readable form required by § 1.821(e) shall meet the following requirements: (1) The computer readable form shall contain a single “Sequence Listing” as either...

  15. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... submissions in computer readable form. (a) The computer readable form required by § 1.821(e) shall meet the following requirements: (1) The computer readable form shall contain a single “Sequence Listing” as either...

  16. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... submissions in computer readable form. (a) The computer readable form required by § 1.821(e) shall meet the following requirements: (1) The computer readable form shall contain a single “Sequence Listing” as either...

  17. Mutation-selection models of coding sequence evolution with site-heterogeneous amino acid fitness profiles.

    PubMed

    Rodrigue, Nicolas; Philippe, Hervé; Lartillot, Nicolas

    2010-03-01

    Modeling the interplay between mutation and selection at the molecular level is key to evolutionary studies. To this end, codon-based evolutionary models have been proposed as pertinent means of studying long-range evolutionary patterns and are widely used. However, these approaches have not yet consolidated results from amino acid level phylogenetic studies showing that selection acting on proteins displays strong site-specific effects, which translate into heterogeneous amino acid propensities across the columns of alignments; related codon-level studies have instead focused on either modeling a single selective context for all codon columns, or a separate selective context for each codon column, with the former strategy deemed too simplistic and the latter deemed overparameterized. Here, we integrate recent developments in nonparametric statistical approaches to propose a probabilistic model that accounts for the heterogeneity of amino acid fitness profiles across the coding positions of a gene. We apply the model to a dozen real protein-coding gene alignments and find it to produce biologically plausible inferences, for instance, as pertaining to site-specific amino acid constraints, as well as distributions of scaled selection coefficients. In their account of mutational features as well as the heterogeneous regimes of selection at the amino acid level, the modeling approaches studied here can form a backdrop for several extensions, accounting for other selective features, for variable population size, or for subtleties of mutational features, all with parameterizations couched within population-genetic theory. PMID:20176949

  18. Terminal sequence studies of high-molecular-weight ribonucleic acid. The 3′-termini of rabbit globin messenger ribonucleic acid

    PubMed Central

    Hunt, John A.

    1973-01-01

    Haemoglobin mRNA isolated from EDTA-treated polyribosomes has an apparent molecular weight of 120000–180000 estimated by condensation with 3H-labelled isoniazid after periodate oxidation. Analysis of the ribonuclease digests of isoniazid-labelled RNA by paper electrophoresis and column chromatography enables the amount of contaminating 18S, 7S, 5S and 4S RNA to be estimated, and a corrected molecular weight of globin mRNA as the acid is 161000 or 500 nucleotides in length. This molecule contains two groups of 3′-terminal sequences in equal yield; G-Y-A6 and G-Y-A7 in the ratio 3:2, and G-N9–16-Y-A2 and G-N9–16-Y-N3 in the ratio 3:2. The significance of these sequences is discussed in relation to the poly(A) content of globin mRNA, the specificity of the sequences, and possible function in processing and biosynthesis of mRNA. PMID:4737318

  19. Consensus algorithms in decentralized networks

    NASA Astrophysics Data System (ADS)

    Coduti, Leonardo Phillip

    We consider a decentralized network with the following goal: the state at each node of the network iteratively converges to the same value. Ensuring that this goal is achieved requires certain properties of the topology of the network and the function describing the evolution of the network. We will present these properties for deterministic systems, extending current results in the literature. As an additional contribution, we will show how the convergence results for stochastic systems are direct consequences of the corresponding deterministic systems, drastically simplifying many other current results. In general, these consensus systems can be both time invariant and time varying, and we will extend all our deterministic and stochastic results to include time varying systems as well. We will then consider a more complex consensus problem, the resource allocation problem. In this situation each node of the network has both a state and a capacity. The capacity is a monotone increasing function of the state, and the goal is for the nodes to exchange capacity in a decentralized manner in order to drive all of the states to the same value. Conditions ensuring consensus in the deterministic setting will be presented, and we will show how convergence in this system also comes from the fundamental deterministic result for consensus algorithms. The main results will again be extended to stochastic and time varying systems. The linear consensus system requires the construction of a matrix of weighting parameters with specific properties. We present an iterative algorithm for determining the weighting parameters in a decentralized fashion; the weighting parameters are specified by the nodes and each node only specifies the weighting parameters as sociated with that node. The results assume that the communication graph of the network is directed, and we consider both synchronous communication, and stochastic asynchronous networks.

  20. Identification of the amino acid sequence that targets peroxiredoxin 6 to lysosome-like structures of lung epithelial cells.

    PubMed

    Sorokina, Elena M; Feinstein, Sheldon I; Milovanova, Tatyana N; Fisher, Aron B

    2009-11-01

    Peroxiredoxin 6 (Prdx6), an enzyme with glutathione peroxidase and PLA2 (aiPLA2) activities, is highly expressed in respiratory epithelium, where it participates in phospholipid turnover and antioxidant defense. Prdx6 has been localized by immunocytochemistry and subcellular fractionation to acidic organelles (lung lamellar bodies and lysosomes) and cytosol. On the basis of their pH optima, we have postulated that protein subcellular localization determines the balance between the two activities of Prdx6. Using green fluorescent protein-labeled protein expression in alveolar epithelial cell lines, we showed Prdx6 localization to organellar structures resembling lamellar bodies in mouse lung epithelial (MLE-12) cells and lysosomes in A549 cells. Localization within lamellar bodies/lysosomes was in the luminal compartment. Targeting to lysosome-like organelles was abolished by the deletion of amino acids 31-40 from the Prdx6 NH2-terminal region; deletion of the COOH-terminal region had no effect. A green fluorescent protein-labeled peptide containing only amino acids 31-40 showed lysosomal targeting that was abolished by mutation of S32 or G34 within the peptide. Studies with mutated protein indicated that lipid binding was not necessary for Prdx6 targeting. This peptide sequence has no homology to known organellar targeting motifs. These studies indicate that the localization of Prdx6 in acidic organelles and consequent PLA2 activity depend on a novel 10-aa peptide located at positions 31-40 of the protein. PMID:19700648

  1. From Amino Acid to Glucosinolate Biosynthesis: Protein Sequence Changes in the Evolution of Methylthioalkylmalate Synthase in Arabidopsis[W][OA

    PubMed Central

    de Kraker, Jan-Willem; Gershenzon, Jonathan

    2011-01-01

    Methylthioalkylmalate synthase (MAM) catalyzes the committed step in the side chain elongation of Met, yielding important precursors for glucosinolate biosynthesis in Arabidopsis thaliana and other Brassicaceae species. MAM is believed to have evolved from isopropylmalate synthase (IPMS), an enzyme involved in Leu biosynthesis, based on phylogenetic analyses and an overlap of catalytic abilities. Here, we investigated the changes in protein structure that have occurred during the recruitment of IPMS from amino acid to glucosinolate metabolism. The major sequence difference between IPMS and MAM is the absence of 120 amino acids at the C-terminal end of MAM that constitute a regulatory domain for Leu-mediated feedback inhibition. Truncation of this domain in Arabidopsis IPMS2 results in loss of Leu feedback inhibition and quaternary structure, two features common to MAM enzymes, plus an 8.4-fold increase in the kcat/Km for a MAM substrate. Additional exchange of two amino acids in the active site resulted in a MAM-like enzyme that had little residual IPMS activity. Hence, combination of the loss of the regulatory domain and a few additional amino acid exchanges can explain the evolution of MAM from IPMS during its recruitment from primary to secondary metabolism. PMID:21205930

  2. Templated synthesis of peptide nucleic acids via sequence-selective base-filling reactions.

    PubMed

    Heemstra, Jennifer M; Liu, David R

    2009-08-19

    The templated synthesis of nucleic acids has previously been achieved through the backbone ligation of preformed nucleotide monomers or oligomers. In contrast, here we demonstrate templated nucleic acid synthesis using a base-filling approach in which individual bases are added to abasic sites of a peptide nucleic acid (PNA). Because nucleobase substrates in this approach are not self-reactive, a base-filling approach may reduce the formation of nontemplated reaction products. Using either reductive amination or amine acylation chemistries, we observed efficient and selective addition of each of the four nucleobases to an abasic site in the middle of the PNA strand. We also describe the addition of single nucleobases to the end of a PNA strand through base filling, as well as the tandem addition of two bases to the middle of the PNA strand. These findings represent an experimental foundation for nonenzymatic information transfer through base filling. PMID:19722647

  3. Templated Synthesis of Peptide Nucleic Acids via Sequence-Selective Base-Filling Reactions

    PubMed Central

    2009-01-01

    The templated synthesis of nucleic acids has previously been achieved through the backbone ligation of preformed nucleotide monomers or oligomers. In contrast, here we demonstrate templated nucleic acid synthesis using a base-filling approach in which individual bases are added to abasic sites of a peptide nucleic acid (PNA). Because nucleobase substrates in this approach are not self-reactive, a base-filling approach may reduce the formation of nontemplated reaction products. Using either reductive amination or amine acylation chemistries, we observed efficient and selective addition of each of the four nucleobases to an abasic site in the middle of the PNA strand. We also describe the addition of single nucleobases to the end of a PNA strand through base filling, as well as the tandem addition of two bases to the middle of the PNA strand. These findings represent an experimental foundation for nonenzymatic information transfer through base filling. PMID:19722647

  4. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    SciTech Connect

    Xie, Gary; Dalin, Eileen; Tice, Hope; Chertkov, Olga; Land, Miriam L

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 C and pH 5.0 and fer-ments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemi-cellulose. This bacterium is also considered as a potential probiotic. Complete genome squence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  5. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    SciTech Connect

    Rhee, Mun Su; Moritz, Brelan E.; Xie, Gary; Glavina Del Rio, Tijana; Dalin, Eileen; Tice, Hope; Bruce, David; Goodwin, Lynne A.; Chertkov, Olga; Brettin, Thomas S; Han, Cliff; Detter, J. Chris; Pitluck, Sam; Land, Miriam L; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, Keelnathan T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 C and pH 5.0 and fer- ments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this spo- rogenic lactic acid bacterium to grow at 50-55 C and pH 5.0 makes this organism an attrac- tive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemi- cellulose. This bacterium is also considered as a potential probiotic. Complete genome se- quence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  6. New consensus nomenclature for mammalian keratins

    PubMed Central

    Schweizer, Jürgen; Bowden, Paul E.; Coulombe, Pierre A.; Langbein, Lutz; Lane, E. Birgitte; Magin, Thomas M.; Maltais, Lois; Omary, M. Bishr; Parry, David A.D.; Rogers, Michael A.; Wright, Mathew W.

    2006-01-01

    Keratins are intermediate filament–forming proteins that provide mechanical support and fulfill a variety of additional functions in epithelial cells. In 1982, a nomenclature was devised to name the keratin proteins that were known at that point. The systematic sequencing of the human genome in recent years uncovered the existence of several novel keratin genes and their encoded proteins. Their naming could not be adequately handled in the context of the original system. We propose a new consensus nomenclature for keratin genes and proteins that relies upon and extends the 1982 system and adheres to the guidelines issued by the Human and Mouse Genome Nomenclature Committees. This revised nomenclature accommodates functional genes and pseudogenes, and although designed specifically for the full complement of human keratins, it offers the flexibility needed to incorporate additional keratins from other mammalian species. PMID:16831889

  7. Diverse Bacterial PKS Sequences Derived From Okadaic Acid-Producing Dinoflagellates

    PubMed Central

    Perez, Roberto; Liu, Li; Lopez, Jose; An, Tianying; Rein, Kathleen S.

    2008-01-01

    Okadaic acid (OA) and the related dinophysistoxins are isolated from dinoflagellates of the genus Prorocentrum and Dinophysis. Bacteria of the Roseobacter group have been associated with okadaic acid producing dinoflagellates and have been previously implicated in OA production. Analysis of 16S rRNA libraries reveals that Roseobacter are the most abundant bacteria associated with OA producing dinoflagellates of the genus Prorocentrum and are not found in association with non-toxic dinoflagellates. While some polyketide synthase (PKS) genes form a highly supported Prorocentrum clade, most appear to be bacterial, but unrelated to Roseobacter or Alpha-Proteobacterial PKSs or those derived from other Alveolates Karenia brevis or Crytosporidium parvum. PMID:18728765

  8. Complete Genome Sequence of Moraxella osloensis Strain KMC41, a Producer of 4-Methyl-3-Hexenoic Acid, a Major Malodor Compound in Laundry.

    PubMed

    Goto, Takatsugu; Hirakawa, Hideki; Morita, Yuji; Tomida, Junko; Sato, Jun; Matsumura, Yuta; Mitani, Asako; Niwano, Yu; Takeuchi, Kohei; Kubota, Hiromi; Kawamura, Yoshiaki

    2016-01-01

    We report the complete genome sequence of Moraxella osloensis strain KMC41, isolated from laundry with malodor. The KMC41 genome comprises a 2,445,556-bp chromosome and three plasmids. A fatty acid desaturase and at least four β-oxidation-related genes putatively associated with 4-methyl-3-hexenoic acid generation were detected in the KMC41 chromosome. PMID:27445387

  9. Complete Genome Sequence of Moraxella osloensis Strain KMC41, a Producer of 4-Methyl-3-Hexenoic Acid, a Major Malodor Compound in Laundry

    PubMed Central

    Hirakawa, Hideki; Morita, Yuji; Tomida, Junko; Sato, Jun; Matsumura, Yuta; Mitani, Asako; Niwano, Yu; Takeuchi, Kohei; Kubota, Hiromi; Kawamura, Yoshiaki

    2016-01-01

    We report the complete genome sequence of Moraxella osloensis strain KMC41, isolated from laundry with malodor. The KMC41 genome comprises a 2,445,556-bp chromosome and three plasmids. A fatty acid desaturase and at least four β-oxidation-related genes putatively associated with 4-methyl-3-hexenoic acid generation were detected in the KMC41 chromosome. PMID:27445387

  10. cDNA cloning and structural characterization of a lectin from the mussel Crenomytilus grayanus with a unique amino acid sequence and antibacterial activity.

    PubMed

    Kovalchuk, Svetlana N; Chikalovets, Irina V; Chernikov, Oleg V; Molchanova, Valentina I; Li, Wei; Rasskazov, Valery A; Lukyanov, Pavel A

    2013-10-01

    An amino acid sequence of GalNAc/Gal-specific lectin from the mussel Crenomytilus grayanus (CGL) was determined by cDNA sequencing. CGL consists of 150 amino acid residues, contains three tandem repeats with high sequence similarities to each other (up to 73%) and does not belong to any known lectins family. According to circular dichroism results CGL is a β/α-protein with the predominance of β-structure. CGL was predicted to adopt a ß-trefoil fold. The lectin exhibits antibacterial activity and might be involved in the recognition and clearance of bacterial pathogens in the shellfish. PMID:23886951

  11. Snake venom toxins. The amino acid sequence of toxin Vi2, a homologue of pancreatic trypsin inhibitor, from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Strydom, D J

    1977-04-25

    The amino acid sequence of venom component Vi2, a protein of low toxicity from Dendroaspis polylepis polylepis venom was determined by automatic sequence analysis in combination with sequence studies on tryptic peptides. This protein, the most retarded fraction of this venom on a cation-exchange resin, is a homologue of bovine pancreatic trypsin inhibitor consisting of a single chain of 57 amino acid residues containing six half-cystine residues. The active site lysyl residue of bovine trypsin inhibitor is conserved in Vi2 although large differences are found in the rest of the molecule. PMID:857902

  12. The complete amino acid sequence of the major Kunitz trypsin inhibitor from the seeds of Prosopsis juliflora.

    PubMed

    Negreiros, A N; Carvalho, M M; Xavier Filho, J; Blanco-Labra, A; Shewry, P R; Richardson, M

    1991-01-01

    The major inhibitor of trypsin in seeds of Prosopsis juliflora was purified by precipitation with ammonium sulphate, ion-exchange column chromatography on DEAE- and CM-Sepharose and preparative reverse phase HPLC on a Vydac C-18 column. The protein inhibited trypsin in the stoichiometric ratio of 1:1, but had only weak activity against chymotrypsin and did not inhibit human salivary or porcine pancreatic alpha-amylases. SDS-PAGE indicated that the inhibitor has a Mr of ca 20,000, and IEF-PAGE showed that the pI is 8.8. The complete amino acid sequence was determined by automatic degradation, and by DABITC/PITC microsequence analysis of peptides obtained from enzyme digestions of the reduced and S-carboxymethylated protein with trypsin, chymotrypsin, elastase, the Glu-specific protease from S. aureus and the Lys-specific protease from Lysobacter enzymogenes. The inhibitor consisted of two polypeptide chains, of 137 residues (alpha chain) and 38 residues (beta chain) linked together by a single disulphide bond. The amino acid sequence of the protein exhibited homology with a number of Kunitz proteinase inhibitors from other legume seeds, the bifunctional subtilisin/alpha-amylase inhibitors from cereals and the taste-modifying protein miraculin. PMID:1367792

  13. Isolation and complete amino acid sequence of two fibrinolytic proteinases from the toxic Saturnid caterpillar Lonomia achelous.

    PubMed

    Amarant, T; Burkhart, W; LeVine, H; Arocha-Pinango, C L; Parikh, I

    1991-08-30

    The major toxic and fibrinolytic activity of the saliva and hemolymph of the larval form of Lonomia achelous was purified to homogeneity by a combination of metal chelate and affinity chromatography. Two apparent isozymes, Achelase I (213 amino acids, pIcalc = 10.55) and Achelase II (214 amino acids, pIcalc = 8.51), were sequenced by automated Edman degradation, and their C-termini confirmed by Fourier-transform mass spectrometry. The calculated molecular weights (22,473 and 22,727) correspond well to Mr estimates of 24,000 by SDS-PAGE. No carbohydrate was detected during sequencing. The enzymes degraded all three chains of fibrin, alpha greater than beta much greater than gamma, yielding a fragmentation pattern indistinguishable from that produced by trypsin. Chromogenic peptides S-2222 (Factor Xa and trypsin), S-2251 (plasmin), S-2302 (kallikrein) and S-2444 (urokinase) were substrates while S-2288 (broad range of serine proteinases including thrombin) was not hydrolyzed. Among a range of inhibitors Hg+2, aminophenylmercuriacetate, leupeptin, antipain and E-64 but not N-ethylmaleimide or iodoacetate abolished the activity of the purified isozymes against S-2444. Phenylmethylsulfonyl fluoride, soybean trypsin inhibitor and aprotinin were less effective. The presence of the classic catalytic triad (histidine-41, aspartate-86 and serine-189) suggests that Achelases I and II may be serine proteinases, but with a potentially free cysteine-185 which could react with thiol proteinase-directed reagents. PMID:1911844

  14. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2232 from Karijini National Park, Australia

    PubMed Central

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O’Hara, Graham; Goodwin, Lynne; Han, James; Reddy, Tatiparthi; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2013-01-01

    Burkholderia sp. strain WSM2232 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod that was trapped in 2001 from acidic soil collected from Karijini National Park (Australia) using Gastrolobium capitatum as a host. WSM2232 was effective in nitrogen fixation with G. capitatum but subsequently lost symbiotic competence during long-term storage. Here we describe the features of Burkholderia sp. strain WSM2232, together with genome sequence information and its annotation. The 7,208,311 bp standard-draft genome is arranged into 72 scaffolds of 72 contigs containing 6,322 protein-coding genes and 61 RNA-only encoding genes. The loss of symbiotic capability can now be attributed to the loss of nodulation and nitrogen fixation genes from the genome. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project. PMID:25197442

  15. Cloning and nucleotide sequencing of a novel 7 beta-(4-carboxybutanamido)cephalosporanic acid acylase gene of Bacillus laterosporus and its expression in Escherichia coli and Bacillus subtilis.

    PubMed

    Aramori, I; Fukagawa, M; Tsumura, M; Iwami, M; Ono, H; Kojo, H; Kohsaka, M; Ueda, Y; Imanaka, H

    1991-12-01

    A strain of Bacillus species which produced an enzyme named glutaryl 7-ACA acylase which converts 7 beta-(4-carboxybutanamido)cephalosporanic acid (glutaryl 7-ACA) to 7-amino cephalosporanic acid (7-ACA) was isolated from soil. The gene for the glutaryl 7-ACA acylase was cloned with pHSG298 in Escherichia coli JM109, and the nucleotide sequence was determined by the M13 dideoxy chain termination method. The DNA sequence revealed only one large open reading frame composed of 1,902 bp corresponding to 634 amino acid residues. The deduced amino acid sequence contained a potential signal sequence in its amino-terminal region. Expression of the gene for glutaryl 7-ACA acylase was performed in both E. coli and Bacillus subtilis. The enzyme preparations purified from either recombinant strain of E. coli or B. subtilis were shown to be identical with each other as regards the profile of sodium dodecyl sulfate-polyacrylamide gel electrophoresis and were composed of a single peptide with the molecular size of 70 kDa. Determination of the amino-terminal sequence of the two enzyme preparations revealed that both amino-terminal sequences (the first nine amino acids) were identical and completely coincided with residues 28 to 36 of the open reading frame. Extracellular excretion of the enzyme was observed in a recombinant strain of B. subtilis. PMID:1744041

  16. Consensus-Degenerate Hybrid Oligonucleotide Primers for Amplification of Priming Glycosyltransferase Genes of the Exopolysaccharide Locus in Strains of the Lactobacillus casei Group

    PubMed Central

    Provencher, Cathy; LaPointe, Gisèle; Sirois, Stéphane; Van Calsteren, Marie-Rose; Roy, Denis

    2003-01-01

    A primer design strategy named CODEHOP (consensus-degenerate hybrid oligonucleotide primer) for amplification of distantly related sequences was used to detect the priming glycosyltransferase (GT) gene in strains of the Lactobacillus casei group. Each hybrid primer consisted of a short 3′ degenerate core based on four highly conserved amino acids and a longer 5′ consensus clamp region based on six sequences of the priming GT gene products from exopolysaccharide (EPS)-producing bacteria. The hybrid primers were used to detect the priming GT gene of 44 commercial isolates and reference strains of Lactobacillus rhamnosus, L. casei, Lactobacillus zeae, and Streptococcus thermophilus. The priming GT gene was detected in the genome of both non-EPS-producing (EPS−) and EPS-producing (EPS+) strains of L. rhamnosus. The sequences of the cloned PCR products were similar to those of the priming GT gene of various gram-negative and gram-positive EPS+ bacteria. Specific primers designed from the L. rhamnosus RW-9595M GT gene were used to sequence the end of the priming GT gene in selected EPS+ strains of L. rhamnosus. Phylogenetic analysis revealed that Lactobacillus spp. form a distinctive group apart from other lactic acid bacteria for which GT genes have been characterized to date. Moreover, the sequences show a divergence existing among strains of L. rhamnosus with respect to the terminal region of the priming GT gene. Thus, the PCR approach with consensus-degenerate hybrid primers designed with CODEHOP is a practical approach for the detection of similar genes containing conserved motifs in different bacterial genomes. PMID:12788729

  17. Nucleic acid-binding molecules with high affinity and base sequence specificity: intercalating agents covalently linked to oligodeoxynucleotides.

    PubMed Central

    Asseline, U; Delarue, M; Lancelot, G; Toulmé, F; Thuong, N T; Montenay-Garestier, T; Hélène, C

    1984-01-01

    Oligodeoxyribonucleotides covalently linked to an intercalating agent via a polymethylene linker were synthesized. Oligothymidylates attached to an acridine dye (Acr) through the 3'-phosphate group [(Tp)n(CH2) mAcr ] specifically interact with the complementary sequence. The interaction is strongly stabilized by the intercalating agent. By using absorption and fluorescence spectroscopies, it is shown that complex formation between (Tp)n(CH2) mAcr and poly(rA) involves the formation of n A X T base pairs, where n is the number of thymines in the oligonucleotide. The acridine ring intercalates between A X T base pairs. Fluorescence excitation spectra reveal the existence of two environments for the acridine ring, whose relative contributions depend on the linker length (m). The binding of (Tp)4(CH2) mAcr to poly(rA) is analyzed in terms of site binding and cooperative interactions between oligonucleotides along the polynucleotide lattice. Thermodynamic parameters show that the covalent attachment of the acridine ring strongly stabilizes the binding of the oligonucleotide to its complementary sequence. The stabilization depends on the linker length; the compound with m = 5 gives a more stable complex than that with m = 3. These results open the way to the synthesis of a family of molecules exhibiting both high-affinity and high-specificity for a nucleic acid base sequence. PMID:6587350

  18. Using Consensus Groups in Online Learning

    ERIC Educational Resources Information Center

    Smith, Regina O.; Dirkx, John M.

    2007-01-01

    This chapter describes online consensus group work, a form of collaborative learning. It discusses collaborative learning, small group work, and consensus learning, with recommendations for their use in online contexts.

  19. Metallothionein cDNA, promoter, and genomic sequences of the tropical green mussel, Perna viridis.

    PubMed

    Khoo, H W; Patel, K H

    1999-09-01

    The primary structure of the cDNA and metallothionein (MT) genomic sequences of the tropical green mussel (Perna viridis) was determined. The complete cDNA sequences were obtained using degenerate primers designed from known metallothionein consensus amino acid sequences from the temperate species Mytilus edulis. The amino acid sequences of P. viridis metallothionein deduced from the coding region consisted of 72 amino acids with 21 cysteine residues and 9 Cys-X-Cys motifs corresponding to Type I MT class of other species. Two different genomic sequences coding for the same mRNA were obtained. Each putative gene contained a unique 5'UTR and two unique introns located at the same splice sites. The promoters for both genes were different in length and both contained metal responsive elements and active protein-binding sites. The structures of the genomic clones were compared with those of other species. J. Exp. Zool. 284:445-453, 1999. PMID:10451422

  20. Amino acid sequence and posttranslational modifications of human factor VII sub a from plasma and transfected baby hamster kidney cells

    SciTech Connect

    Thim, L.; Bjoern, S.; Christensen, M.; Nicolaisen, E.M.; Lund-Hansen, T.; Pedersen, A.H.; Hedner, U. )

    1988-10-04

    Blood coagulation factor VII is a vitamin K dependent glycoprotein which in its activated form, factor VII{sub a}, participates in the coagulation process by activating factor X and/or factor IX in the presence of Ca{sup 2+} and tissue factor. Three types of potential posttranslational modifications exist in the human factor VII{sub a} molecule, namely, 10 {gamma}-carboxylated, N-terminally located glutamic acid residues, 1 {beta}-hydroxylated aspartic acid residue, and 2 N-glycosylated asparagine residues. In the present study, the amino acid sequence and posttranslational modifications of recombinant factor VII{sub a} as purified from the culture medium of a transfected baby hamster kidney cell line have been compared to human plasma factor VII{sub a}. By use of HPLC, amino acid analysis, peptide mapping, and automated Edman degradation, the protein backbone of recombinant factor VII{sub a} was found to be identical with human factor VII{sub a}. Asparagine residues 145 and 322 were found to be fully N-glycosylated in human plasma factor VII{sub a}. In the recombinant factor VII{sub a}, asparagine residue 322 was fully glycosylated whereas asparagine residue 145 was only partially (approximately 66%) glycosylated. Besides minor differences in the sialic acid and fucose contents, the overall carbohydrate compositions were nearly identical in recombinant factor VII{sub a} and human plasma factor VII{sub a}. These results show that factor VII{sub a} as produced in the transfected baby hamster kidney cells is very similar to human plasma factor VII{sub a} and that this cell line thus might represent an alternative source for human factor VII{sub a}.

  1. Complete genome sequence of probiotic Bacillus coagulans HM-08: A potential lactic acid producer.

    PubMed

    Yao, Guoqiang; Gao, Pengfei; Zhang, Wenyi

    2016-06-20

    Bacillus coagulans HM-08 is a commercialized probiotic strain in China. Its genome contains a 3.62Mb circular chromosome with an average GC content of 46.3%. In silico analysis revealed the presence of one xyl operon as well as several other genes that are correlated to xylose utilization. The genetic information provided here may help to expand its future biotechnology potential in lactic acid production. PMID:27130497

  2. Complete Genome Sequence of the Amino Acid-Fermenting Clostridium propionicum X2 (DSM 1682)

    PubMed Central

    Poehlein, Anja; Schlien, Katja; Chowdhury, Nilanjan Pal; Gottschalk, Gerhard; Buckel, Wolfgang

    2016-01-01

    Clostridium propionicum is a strict anaerobic, Gram positive, rod-shaped bacterium that belongs to the clostridial cluster XIVb. The genome consists of one replicon (3.1 Mb) and harbors 2,936 predicted protein-encoding genes. The genome encodes all enzymes required for fermentation of the amino acids α-alanine, β-alanine, serine, threonine, and methionine. PMID:27081148

  3. Purification, characterization, and complete amino acid sequence of a trypsin inhibitor from amaranth (Amaranthus hypochondriacus) seeds.

    PubMed Central

    Valdes-Rodriguez, S; Segura-Nieto, M; Chagolla-Lopez, A; Verver y Vargas-Cortina, A; Martinez-Gallardo, N; Blanco-Labra, A

    1993-01-01

    A protein proteinase inhibitor was purified from a seed extract of amaranth (Amaranthus hypochondriacus) by precipitation with (NH4)2SO4, gel-filtration chromatography, ion-exchange chromatography, and reverse-phase high-performance liquid chromatography. It is a 69-amino acid protein with a high content of valine, arginine, and glutamic acid, but lacking in methionine. The inhibitor has a relative molecular weight of 7400 and an isoelectric point of 7.5. It is a serine proteinase inhibitor that recognizes chymotrypsin, trypsin, and trypsin-like proteinase activities extracted from larvae of the insect Prostephanus truncatus. This inhibitor belongs to the potato-I inhibitor family, showing the closest homology (59.5%) with the Lycopersicum peruvianum trypsin inhibitor, and (51%) with the proteinase inhibitor 5 extracted from the seeds of Cucurbita maxima. The position of the lysine-aspartic acid residues present in the active site of the amaranth inhibitor are found in almost the same relative position as in the inhibitor from C. maxima. PMID:8290633

  4. Energy strategy: Roadmap to consensus

    SciTech Connect

    Not Available

    1990-11-01

    The United States lacks a comprehensive approach to policy-making in the energy realm. Today, as in the past, individual constituency groups tend to focus on their particular aspect of the energy challenge. Many employ a ``decide-announce-defend`` approach to policy-making, setting out to secure a unilateral advantage for themselves. By so doing, they inevitably pit interest against interest. The result is a polarization of constituencies, and shortsighted policies designed to address the issue of the moment. The American Energy Assurance Council (AEAC) is a non-profit organization founded in 1987 for the sole purpose of facilitating progress toward a fair efficient wise, stable, and consensus-based national energy strategy. AEAC does not have a substantive policy agencies. Rather, we are committed to supporting a process whereby the many stakeholders and policy makers concerned with energy-related issues can come together in productive discourse, thereby overcoming ignorance of each other`s positions. The Council seeks to act as a facilitative body, providing a ``safe`` context for inventive and creative thinking. We attempt to build a store of common knowledge, and to build on that store according to mutually agreed-upon groundrules, and employing sophisticated approaches to facilitation and mediation. This report, the National Energy Consensus Experiment (NECE), was an ambitious experiment in consensus-building. We learned a great deal from it, both in terms of substance and process, and we are convinced that it holds important lessons for others who may seek to build consensus in the public policy realm.

  5. Energy strategy: Roadmap to consensus

    SciTech Connect

    Not Available

    1990-11-01

    The United States lacks a comprehensive approach to policy-making in the energy realm. Today, as in the past, individual constituency groups tend to focus on their particular aspect of the energy challenge. Many employ a decide-announce-defend'' approach to policy-making, setting out to secure a unilateral advantage for themselves. By so doing, they inevitably pit interest against interest. The result is a polarization of constituencies, and shortsighted policies designed to address the issue of the moment. The American Energy Assurance Council (AEAC) is a non-profit organization founded in 1987 for the sole purpose of facilitating progress toward a fair efficient wise, stable, and consensus-based national energy strategy. AEAC does not have a substantive policy agencies. Rather, we are committed to supporting a process whereby the many stakeholders and policy makers concerned with energy-related issues can come together in productive discourse, thereby overcoming ignorance of each other's positions. The Council seeks to act as a facilitative body, providing a safe'' context for inventive and creative thinking. We attempt to build a store of common knowledge, and to build on that store according to mutually agreed-upon groundrules, and employing sophisticated approaches to facilitation and mediation. This report, the National Energy Consensus Experiment (NECE), was an ambitious experiment in consensus-building. We learned a great deal from it, both in terms of substance and process, and we are convinced that it holds important lessons for others who may seek to build consensus in the public policy realm.

  6. Liberal Education: An Overlapping Pragmatic Consensus.

    ERIC Educational Resources Information Center

    Paris, David C.; Kimball, Bruce A.

    2000-01-01

    Suggests in Bruce Kimball's thesis that a pragmatic consensus was emerging about the understanding of liberal education offers that it might be best understood by comparing it to J. Rawl's idea of an "overlapping consensus." States that by comparing and contrasting these ideas that the emerging consensus is pragmatic in nature. (CMK)

  7. Inferences from protein and nucleic acid sequences - Early molecular evolution, divergence of kingdoms and rates of change

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.; Barker, W. C.; Mclaughlin, P. J.

    1974-01-01

    Description of new sensitive, objective methods for establishing the probable common ancestry of very distantly related sequences and the quantitative evolutionary change which has taken place. These methods are applied to four families of proteins and nucleic acids and evolutionary trees will be derived where possible. Of the three families containing duplications of genetic material, two are nucleic acids: transfer RNA and 5S ribosomal RNA. Both of these structures are functional in the synthesis of coded proteins, and prototypes must have been present in the cell at the inception of the fundamental coding process that all living things share. There are many types of tRNA which recognize the various nucleotide triplets and the 20 amino acids. These types are thought to have arisen as a result of many gene duplications. Relationships among these types are discussed. The 5S ribosomal RNA, presently functional in both eukaryotes and prokaryotes, is very likely descended from an early form incorporating almost a complete duplication of genetic material. The amount of evolution in the various lines can again be compared. The other two families containing duplications are proteins; ferredoxin and cytochrome c.

  8. Species specific amino acid sequence-protein local structure relationships: An analysis in the light of a structural alphabet.

    PubMed

    de Brevern, Alexandre G; Joseph, Agnel Praveen

    2011-05-01

    Protein structure analysis and prediction methods are based on non-redundant data extracted from the available protein structures, regardless of the species from which the protein originates. Hence, these datasets represent the global knowledge on protein folds, which constitutes a generic distribution of amino acid sequence-protein structure (AAS-PS) relationships. In this study, we try to elucidate whether the AAS-PS relationship could possess specificities depending on the specie. For this purpose, we have chosen three different species: Saccharomyces cerevisiae, Plasmodium falciparum and Arabidopsis thaliana. We analyzed the AAS-PS behaviors of the proteins from these three species and compared it to the "expected" distribution of a classical non-redundant databank. With the classical secondary structure description, only slight differences in amino acid preferences could be observed. With a more precise description of local protein structures (Protein Blocks), significant changes could be highlighted. S. cerevisiae's AAS-PS relationship is close to the general distribution, while striking differences are observed in the case of A. thaliana. P. falciparum is the most distant one. This study presents some interesting view-points on AAS-PS relationship. Certain species exhibit unique preferences for amino acids to be associated with protein local structural elements. Thus, AAS-PS relationships are species dependent. These results can give useful insights for improving prediction methodologies which take the species specific information into account. PMID:21333657

  9. Amino acid sequence alignment of bacterial and mammalian pancreatic serine proteases based on topological equivalences.

    PubMed

    James, M N; Delbaere, L T; Brayer, G D

    1978-06-01

    The three-dimensional structures of the bacterial serine proteases SGPA, SGPB, and alpha-lytic protease have been compared with those of the pancreatic enzymes alpha-chymotrypsin and elastase. This comparison shows that approximately 60% (55-64%) of the alpha-carbon atom positions of the bacterial serine proteases are topologically equivalent to the alpha-carbon atom positions of the pancreatic enzymes. The corresponding value for a comparison of the bacterial enzymes among themselves is approximately 84%. The results of these topological comparisons have been used to deduce an experimentally sound sequence alignment for these several enzymes. This alignment shows that there is extensive tertiary structural homology among the bacteria and pancreatic enzymes without significant primary sequence identity (less than 21%). The acquisition of a zymogen function by the pancreatic enzymes is accompanied by two major changes to the bacterial enzymes' architecture: an insertion of 9 residues to increase the length of the N-terminal loop, and one of 12 residues to a loop near the activation salt bridge. In addition, in these two enzyme families, the methionine loop (residues 164-182) adopts very different comformations which are associated with their altered substrate specificities. PMID:96920

  10. DNA sequence of the control region of phage D108: the N-terminal amino acid sequences of repressor and transposase are similar both in phage D108 and in its relative, phage Mu.

    PubMed Central

    Mizuuchi, M; Weisberg, R A; Mizuuchi, K

    1986-01-01

    We have determined the DNA sequence of the control region of phage D108 up to position 1419 at the left end of the phage genome. Open reading frames for the repressor gene, ner gene, and the 5' part of the A gene (which codes for transposase) are found in the sequence. The genetic organization of this region of phage D108 is quite similar to that of phage Mu in spite of considerable divergence, both in the nucleotide sequence and in the amino acid sequences of the regulatory proteins of the two phages. The N-terminal amino acid sequences of the transposases of the two phages also share only limited homology. On the other hand, a significant amino acid sequence homology was found within each phage between the N-terminal parts of the repressor and transposase. We propose that the N-terminal domains of the repressor and transposase of each phage interact functionally in the process of making the decision between the lytic and the lysogenic mode of growth. PMID:3012481

  11. A logical sequence search for S100B target proteins.

    PubMed Central

    McClintock, K. A.; Shaw, G. S.

    2000-01-01

    The EF-hand calcium-binding protein S100B has been shown to interact in vitro in a calcium-sensitive manner with many substrates. These potential S100B target proteins have been screened for the preservation of a previously identified consensus sequence across species. The results were compared to known structural and in vitro properties of the proteins to rationalize choices for potential binding partners. Our approach uncovered four oligomeric proteins tubulin (alpha and beta), glial fibrillary acidic protein (GFAP), desmin, and vimentin that have conserved regions matching the consensus sequence. In the type III intermediate filament proteins (GFAP, vimentin, and desmin), this region corresponds to a portion of a coiled-coil (helix 2A), the structural element responsible for their assembly. In tubulin, the sequence matches correspond to regions of alpha and beta tubulin found at the alpha beta tubulin interface. In both cases, these consensus sequence matches provide a logical explanation for in vitro observations that S100B is able to inhibit oligomerization of these proteins. PMID:11106180

  12. Isolation and amino acid sequences of opossum vasoactive intestinal polypeptide and cholecystokinin octapeptide.

    PubMed Central

    Eng, J; Yu, J; Rattan, S; Yalow, R S

    1992-01-01

    Evolutionary history suggests that the marsupials entered South America from North America about 75 million years ago and subsequently dispersed into Australia before the separation between South America and Antarctica-Australia. A question of interest is whether marsupial peptides resemble the corresponding peptides of Old or New World mammals. Previous studies had shown that "little" gastrin of the North American marsupial, the opossum, is identical in length to that of the New World mammals, the guinea pig and chinchilla. In this report, we demonstrate that opossum cholecystokinin octapeptide, like that of the Australian marsupials, the Eastern quoll and the Tamar wallaby, is identical to the cholecystokinin octapeptide of Old World mammals and differs from that of the guinea pig and chinchilla. However, opossum vasoactive intestinal polypeptide differs from the usual Old World mammalian vasoactive intestinal polypeptide in five sites: [sequence; see text]. PMID:1542675

  13. Evolution of early life inferred from protein and ribonucleic acid sequences

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.; Schwartz, R. M.

    1978-01-01

    The chemical structures of ferredoxin, 5S ribosomal RNA, and c-type cytochrome sequences have been employed to construct a phylogenetic tree which connects all major photosynthesizing organisms: the three types of bacteria, blue-green algae, and chloroplasts. Anaerobic and aerobic bacteria, eukaryotic cytoplasmic components and mitochondria are also included in the phylogenetic tree. Anaerobic nonphotosynthesizing bacteria similar to Clostridium were the earliest organisms, arising more than 3.2 billion years ago. Bacterial photosynthesis evolved nearly 3.0 billion years ago, while oxygen-evolving photosynthesis, originating in the blue-green algal line, came into being about 2.0 billion years ago. The phylogenetic tree supports the symbiotic theory of the origin of eukaryotes.

  14. Analyses of mitochondrial amino acid sequence datasets support the proposal that specimens of Hypodontus macropi from three species of macropodid hosts represent distinct species

    PubMed Central

    2013-01-01

    Background Hypodontus macropi is a common intestinal nematode of a range of kangaroos and wallabies (macropodid marsupials). Based on previous multilocus enzyme electrophoresis (MEE) and nuclear ribosomal DNA sequence data sets, H. macropi has been proposed to be complex of species. To test this proposal using independent molecular data, we sequenced the whole mitochondrial (mt) genomes of individuals of H. macropi from three different species of hosts (Macropus robustus robustus, Thylogale billardierii and Macropus [Wallabia] bicolor) as well as that of Macropicola ocydromi (a related nematode), and undertook a comparative analysis of the amino acid sequence datasets derived from these genomes. Results The mt genomes sequenced by next-generation (454) technology from H. macropi from the three host species varied from 13,634 bp to 13,699 bp in size. Pairwise comparisons of the amino acid sequences predicted from these three mt genomes revealed differences of 5.8% to 18%. Phylogenetic analysis of the amino acid sequence data sets using Bayesian Inference (BI) showed that H. macropi from the three different host species formed distinct, well-supported clades. In addition, sliding window analysis of the mt genomes defined variable regions for future population genetic studies of H. macropi in different macropodid hosts and geographical regions around Australia. Conclusions The present analyses of inferred mt protein sequence datasets clearly supported the hypothesis that H. macropi from M. robustus robustus, M. bicolor and T. billardierii represent distinct species. PMID:24261823

  15. Using Triple Helix Forming Peptide Nucleic Acids for Sequence-selective Recognition of Double-stranded RNA

    PubMed Central

    Hnedzko, Dziyana; Cheruiyot, Samwel K.; Rozners, Eriks

    2014-01-01

    Non-coding RNAs play important roles in regulation of gene expression. Specific recognition and inhibition of these biologically important RNAs that form complex double-helical structures will be highly useful for fundamental studies in biology and practical applications in medicine. This protocol describes a strategy developed in our laboratory for sequence-selective recognition of double-stranded RNA (dsRNA) using triple helix forming peptide nucleic acids (PNAs) that bind in the major grove of RNA helix. The strategy developed uses chemically modified nucleobases, such as 2-aminopyridine (M) that enables strong triple helical binding at physiologically relevant conditions, and 2-pyrimidinone (P) and 3-oxo-2,3-dihydropyridazine (E) that enable recognition of isolated pyrimidines in the purine rich strand of the RNA duplex. Detailed protocols for preparation of modified PNA monomers, solid-phase synthesis and HPLC purification of PNA oligomers, and measuring dsRNA binding affinity using isothermal titration calorimetry are included. PMID:25199637

  16. Nucleic acid sequences encoding D1 and D1/D2 domains of human coxsackievirus and adenovirus receptor (CAR)

    DOEpatents

    Freimuth, Paul I.

    2010-04-06

    The invention provides recombinant human CAR (coxsackievirus and adenovirus receptor) polypeptides which bind adenovirus. Specifically, polypeptides corresponding to adenovirus binding domain D1 and the entire extracellular domain of human CAR protein comprising D1 and D2 are provided. In another aspect, the invention provides nucleic acid sequences encoding these domains and expression vectors for producing the domains and bacterial cells containing such vectors. The invention also includes an isolated fusion protein comprised of the D1 polypeptide fused to a polypeptide which facilitates folding of D1 when expressed in bacteria. The functional D1 domain finds application in a therapeutic method for treating a patient infected with a CAR D1-binding virus, and also in a method for identifying an antiviral compound which interferes with viral attachment. The invention also provides a method for specifically targeting a cell for infection by a virus which binds to D1.

  17. Prediction of Residue Status to Be Protected or Not Protected From Hy-drogen Exchange Using Amino Acid Sequence Only.

    PubMed

    Nikita V, Dovidchenko; Oxana V, Galzitskaya

    2008-01-01

    We have outlined here some structural aspects of local flexibility. Important functional properties are related to flexible segments. We try to predict regions that have been shown to exhibit the highest probability of being folded in the equilibrium intermediate or native state and will be protected from hydrogen exchange using amino acid sequence only. Our approach FoldUnfold for the prediction of unstructured regions has been applied to seven different proteins. For 80% of the residues considered in this paper we can predict correctly their status: will they be protected or not from hydrogen exchange. An additional goal of our study is to assess whether properties inferred using the bioinformatics approach are easily applicable to predict behavior of proteins in solution. PMID:18949078

  18. Prediction of Residue Status to Be Protected or Not Protected From Hy-drogen Exchange Using Amino Acid Sequence Only

    PubMed Central

    Dovidchenko, Nikita V; Galzitskaya, Oxana V

    2008-01-01

    We have outlined here some structural aspects of local flexibility. Important functional properties are related to flexible segments. We try to predict regions that have been shown to exhibit the highest probability of being folded in the equilibrium intermediate or native state and will be protected from hydrogen exchange using amino acid sequence only. Our approach FoldUnfold for the prediction of unstructured regions has been applied to seven different proteins. For 80% of the residues considered in this paper we can predict correctly their status: will they be protected or not from hydrogen exchange. An additional goal of our study is to assess whether properties inferred using the bioinformatics approach are easily applicable to predict behavior of proteins in solution. PMID:18949078

  19. Primary structure of a histidine-rich proteolytic fragment of human ceruloplasmin. II. Amino acid sequence of the tryptic peptides.

    PubMed

    Kingston, I B; Kingston, B L; Putnam, F W

    1980-04-10

    Amino acid sequence studies of tryptic peptides isolated from a histidine-rich fragment (Cp F5) of human ceruloplasmin are described. Nineteen tryptic peptides were isolated from unmodified Cp F5 and five tryptic peptides were isolated from citraconylated Cp F5. These peptides, together with the cyanogen bromide fragments reported previously, allowed the assembly of the complete sequence of Cp F5. The fragment has 159 residues and a molecular weight of 18,650; it lacks carbohydrate, is rich in histidine, and contains 1 free cysteine that may be part of a copper-binding site. Human ceruloplasmin is a single polypeptide chain with a molecular weight of about 130,000 that is readily cleaved to large fragments by proteolytic enzymes; the relationships of Cp F5 to intact ceruloplasmin and to structural subunits earlier proposed is described. Cp F5 probably is an intact globular domain that is attached to the COOH-terminal end of ceruloplasmin by a labile interdomain peptide bond. PMID:6987230

  20. Immunoreactivity of polyclonal antibodies generated against the carboxy terminus of the predicted amino acid sequence of the Huntington disease gene

    SciTech Connect

    Alkatib, G.; Graham, R.; Pelmear-Telenius, A.

    1994-09-01

    A cDNA fragment spanning the 3{prime}-end of the Huntington disease gene (from 8052 to 9252) was cloned into a prokaryotic expression vector containing the E. Coli lac promoter and a portion of the coding sequence for {beta}-galactosidase. The truncated {beta}-galactosidase gene was cleaved with BamHl and fused in frame to the BamHl fragment of the Huntington disease gene 3{prime}-end. Expression analysis of proteins made in E. Coli revealed that 20-30% of the total cellular proteins was represented by the {beta}-galactosidase-huntingtin fusion protein. The identity of the Huntington disease protein amino acid sequences was confirmed by protein sequence analysis. Affinity chromatography was used to purify large quantities of the fusion protein from bacterial cell lysates. Affinity-purified proteins were used to immunize New Zealand white rabbits for antibody production. The generated polyclonal antibodies were used to immunoprecipitate the Huntington disease gene product expressed in a neuroblastoma cell line. In this cell line the antibodies precipitated two protein bands of apparent gel migrations of 200 and 150 kd which together, correspond to the calculated molecular weight of the Huntington disease gene product (350 kd). Immunoblotting experiments revealed the presence of a large precursor protein in the range of 350-750 kd which is in agreement with the predicted molecular weight of the protein without post-translational modifications. These results indicate that the huntingtin protein is cleaved into two subunits in this neuroblastoma cell line and implicate that cleavage of a large precursor protein may contribute to its biological activity. Experiments are ongoing to determine the precursor-product relationship and to examine the synthesis of the huntingtin protein in freshly isolated rat brains, and to determine cellular and subcellular distribution of the gene product.

  1. Ambient temperature detection of PCR amplicons with a novel sequence-specific nucleic acid lateral flow biosensor.

    PubMed

    Ang, Geik Yong; Yu, Choo Yee; Yean, Chan Yean

    2012-01-01

    In the field of diagnostics, molecular amplification targeting unique genetic signature sequences has been widely used for rapid identification of infectious agents, which significantly aids physicians in determining the choice of treatment as well as providing important epidemiological data for surveillance and disease control assessment. We report the development of a rapid nucleic acid lateral flow biosensor (NALFB) in a dry-reagent strip format for the sequence-specific detection of single-stranded polymerase chain reaction (PCR) amplicons at ambient temperature (22-25°C). The NALFB was developed in combination with a linear-after-the-exponential PCR assay and the applicability of this biosensor was demonstrated through detection of the cholera toxin gene from diarrheal-causing toxigenic Vibrio cholerae. Amplification using the advanced asymmetric PCR boosts the production of fluorescein-labeled single-stranded amplicons, allowing capture probes immobilized on the NALFB to hybridize specifically with complementary targets in situ on the strip. Subsequent visual formation of red lines is achieved through the binding of conjugated gold nanoparticles to the fluorescein label of the captured amplicons. The visual detection limit observed with synthetic target DNA was 0.3 ng and 1 pg with pure genomic DNA. Evaluation of the NALFB with 164 strains of V. cholerae and non-V. cholerae bacteria recorded 100% for both sensitivity and specificity. The whole procedure of the low-cost NALFB, which is performed at ambient temperature, eliminates the need for preheated buffers or additional equipment, greatly simplifying the protocol for sequence-specific PCR amplicon analysis. PMID:22705404

  2. Primary structure of a histidine-rich proteolytic fragment of human ceruloplasmin. I. Amino acid sequence of the cyanogen bromide peptides.

    PubMed

    Kingston, I B; Kingston, B L; Putnam, F W

    1980-04-10

    A histidine-rich fragment, Cp F5, with a molecular weight of 18,650 was isolated from human ceruloplasmin. It consists of 159 amino acids and contains a possible copper-binding site. The sequence of the first 18 NH2-terminal residues of Cp F5 was determined by automated Edman degradation. Cp F5 was cleaved by cyanogen bromide to produce nine fragments of from 2 to 63 residues. The amino acid sequence of all of the cyanogen bromide fragments was investigated using automated and manual Edman degradation, the fragments being digested with trypsin, chymotrypsin, thermolysin, staphylococcal protease, and pepsin as appropriate. The results, in conjunction with the data on the tryptic peptides reported in the accompanying paper (Kingston, I.B., Kingston, B.L., and Putnam, F.L. (1980) J. Biol. Chem. 255, 2886-2896), establish the complete amino acid sequence of Cp F5. PMID:6987229

  3. Sequence of the Proteus mirabilis urease accessory gene ureG.

    PubMed

    Sriwanthana, B; Island, M D; Mobley, H L

    1993-07-15

    We report the sequence of ureG, an accessory gene that is a part of the ure gene cluster of uropathogenic Proteus mirabilis and required for full enzymatic activity of urease. The 615-bp open reading frame predicts a M(r) 22,374 polypeptide, which contains a consensus amino acid (aa) sequence for ATP-binding. The polypeptide shares sequence homology with UreG of Escherichia coli (93% of identical aa), Klebsiella aerogenes (59%) and Helicobacter pylori (59%). PMID:8335248

  4. Protective immunogenicity of two synthetic peptides selected from the amino acid sequence of Bordetella pertussis toxin subunit S1.

    PubMed Central

    Askelöf, P; Rodmalm, K; Wrangsell, G; Larsson, U; Svenson, S B; Cowell, J L; Undén, A; Bartfai, T

    1990-01-01

    Two peptides, corresponding to amino acids 1-17 and 169-186 of the amino acid sequence of pertussis toxin (PT) subunit S1, were synthesized and coupled to the diphtheria toxin cross-reactive mutant protein CRM 197 and evaluated for immunogenicity and protective capacity against PT challenge in vivo. The peptide-CRM conjugates induced high antibody titers against native toxin in mice (BALB/c, C57/Black, and outbred NMRI) as measured by ELISA. Upon PT challenge (0.5 microgram of toxin) of the NMRI mice, the CRM conjugates of peptides 1-17 and 169-186 fully protected the mice from PT-induced leukocytosis. Immunization with the corresponding bovine serum albumin conjugates of these two peptides also fully protected mice. Rabbit antiserum to the peptide 1-17-CRM conjugate was highly efficient in inhibiting the ADP-ribosylating activity of PT but did not neutralize the clustering effect of PT on Chinese hamster ovary cells. In contrast, the rabbit antiserum raised against the peptide 169-186-CRM conjugate neutralized the clustering effect of PT on Chinese hamster ovary cells but did not inhibit the enzymatic activity of PT. Peptide 169-186-CRM conjugates mimic the immunoglobulin binding properties of PT and also cause clustering of Chinese hamster ovary cells. The CRM conjugates of these two peptides constitute a synthetic pertussis vaccine candidate with the ability to provide a chemically well-defined, safe, and efficient pertussis vaccine. Images PMID:2304902

  5. Hybridization properties of long nucleic acid probes for detection of variable target sequences, and development of a hybridization prediction algorithm

    PubMed Central

    Öhrmalm, Christina; Jobs, Magnus; Eriksson, Ronnie; Golbob, Sultan; Elfaitouri, Amal; Benachenhou, Farid; Strømme, Maria; Blomberg, Jonas

    2010-01-01

    One of the main problems in nucleic acid-based techniques for detection of infectious agents, such as influenza viruses, is that of nucleic acid sequence variation. DNA probes, 70-nt long, some including the nucleotide analog deoxyribose-Inosine (dInosine), were analyzed for hybridization tolerance to different amounts and distributions of mismatching bases, e.g. synonymous mutations, in target DNA. Microsphere-linked 70-mer probes were hybridized in 3M TMAC buffer to biotinylated single-stranded (ss) DNA for subsequent analysis in a Luminex® system. When mismatches interrupted contiguous matching stretches of 6 nt or longer, it had a strong impact on hybridization. Contiguous matching stretches are more important than the same number of matching nucleotides separated by mismatches into several regions. dInosine, but not 5-nitroindole, substitutions at mismatching positions stabilized hybridization remarkably well, comparable to N (4-fold) wobbles in the same positions. In contrast to shorter probes, 70-nt probes with judiciously placed dInosine substitutions and/or wobble positions were remarkably mismatch tolerant, with preserved specificity. An algorithm, NucZip, was constructed to model the nucleation and zipping phases of hybridization, integrating both local and distant binding contributions. It predicted hybridization more exactly than previous algorithms, and has the potential to guide the design of variation-tolerant yet specific probes. PMID:20864443

  6. Nucleic acid amplification in vitro: detection of sequences with low copy numbers and application to diagnosis of human immunodeficiency virus type 1 infection.

    PubMed Central

    Guatelli, J C; Gingeras, T R; Richman, D D

    1989-01-01

    The enzymatic amplification of specific nucleic acid sequences in vitro has revolutionized the use of nucleic acid hybridization assays for viral detection. With this method, the copy number of a pathogen-specific sequence is increased several orders of magnitude before detection is attempted. The sensitivity and specificity of detection are thus markedly improved. Mullis and Faloona devised the first method of sequence amplification in vitro, the polymerase chain reaction (K.B. Mullis and F.A. Faloona, Methods Enzymol. 155:355-350, 1987). By this method, synthetic oligonucleotide primers direct repeated, target-specific, deoxyribonucleic acid-synthetic reactions, resulting in an exponential increase in the amount of the specific target sequence. The application of sequence amplification to viral detection was initially performed with human immunodeficiency virus type 1 and human T-cell lymphoma virus type I. In principle, however, this approach can be applied to the detection of any deoxyribonucleic or ribonucleic acid virus; the only requirement is that sufficient nucleotide sequence data exist to allow the synthesis of target-specific oligonucleotide primers. The use of target amplification in vitro will permit a variety of studies of viral pathogenesis which have not been feasible because of the low copy number of the viral nucleic acids in infected material. This approach is particularly applicable to the study of human retroviral infections, which are chronic and persistent and are characterized by low titers of virus in tissues. In addition, target amplification in vitro will facilitate the development of new methods of sequence detection, which will be useful for rapid viral diagnosis in the clinical laboratory. PMID:2650862

  7. PRECISE: a Database of Predicted and Consensus Interaction Sites in Enzymes

    PubMed Central

    Sheu, Shu-Hsien; Lancia, David R.; Clodfelter, Karl H.; Landon, Melissa R.; Vajda, Sandor

    2005-01-01

    PRECISE (Predicted and Consensus Interaction Sites in Enzymes) is a database of interactions between the amino acid residues of an enzyme and its ligands (substrate and transition state analogs, cofactors, inhibitors and products). It is available online at http://precise.bu.edu/. In the current version, all information on interactions is extracted from the enzyme–ligand complexes in the Protein Data Bank (PDB) by performing the following steps: (i) clustering homologous enzyme chains such that, in each cluster, the proteins have the same EC number and all sequences are similar; (ii) selecting a representative chain for each cluster; (iii) selecting ligand types; (iv) finding non-bonded interactions and hydrogen bonds; and (v) summing the interactions for all chains within the cluster. The output of the search is the color-coded sequence of the representative. The colors indicate the total number of interactions found at each amino acid position in all chains of the cluster. Clicking on a residue displays a detailed list of interactions for that residue. Optional filters allow restricting the output to selected chains in the cluster, to non-bonded or hydrogen bonding interactions, and to selected ligand types. The binding site information is essential for understanding and altering substrate specificity and for the design of enzyme inhibitors. PMID:15608178

  8. Amino acid sequence homology between Piv, an essential protein in site-specific DNA inversion in Moraxella lacunata, and transposases of an unusual family of insertion elements.

    PubMed Central

    Lenich, A G; Glasgow, A C

    1994-01-01

    Deletion analysis of the subcloned DNA inversion region of Moraxella lacunata indicates that Piv is the only M. lacunata-encoded factor required for site-specific inversion of the tfpQ/tfpI pilin segment. The predicted amino acid sequence of Piv shows significant homology solely with the transposases/integrases of a family of insertion sequence elements, suggesting that Piv is a novel site-specific recombinase. Images PMID:8021196

  9. The outer capsid protein VP4 of equine rotavirus strain H-2 represents a unique VP4 type by amino acid sequence analysis.

    PubMed

    Hardy, M E; Gorziglia, M; Woode, G N

    1993-03-01

    The nucleotide and deduced amino acid sequence of G serotype 3 equine rotavirus strain H-2 was determined. A predicted 776-amino-acid H-2 VP4 shows less than or equal to 85.3% identity to other rotavirus VP4 types sequenced to date and thus represents a new P serotype. A PCR-generated probe derived from a cDNA clone of H-2 gene 4 hybridized to gene 4 of several tissue-culture-adapted equine rotavirus isolates, demonstrating that the gene 4 allele present in the H-2 strain is present in the equine rotavirus population. PMID:8382410

  10. Single Amino Acid Substitutions in the Chemotactic Sequence of Urokinase Receptor Modulate Cell Migration and Invasion

    PubMed Central

    Franco, Paola; Pavone, Vincenzo; Mugione, Pietro; Di Carluccio, Gioconda; Masucci, Maria Teresa; Arra, Claudio; Pirozzi, Giuseppe; Stoppelli, Maria Patrizia; Carriero, Maria Vincenza

    2012-01-01

    The receptor for urokinase-type plasminogen activator (uPAR) plays an important role in controlling cell migration. uPAR binds urokinase and vitronectin extracellular ligands, and signals in complex with transmembrane receptors such as Formyl-peptide Receptors (FPR)s and integrins. Previous work from this laboratory has shown that synthetic peptides, corresponding to the uPAR88–92 chemotactic sequence, when carrying the S90P or S90E substitutions, up- or down-regulate cell migration, respectively. To gain mechanistic insights into these opposite cell responses, the functional consequences of S90P and S90E mutations in full-length uPAR were evaluated. First, (HEK)-293 embryonic kidney cells expressing uPARS90P exhibit enhanced FPR activation, increased random and directional cell migration, long-lasting Akt phosphorylation, and increased adhesion to vitronectin, as well as uPAR/vitronectin receptor association. In contrast, the S90E substitution prevents agonist-triggered FPR activation and internalization, decreases binding and adhesion to vitronectin, and inhibits uPAR/vitronectin receptor association. Also, 293/uPARS90P cells appear quite elongated and their cytoskeleton well organized, whereas 293/uPARS90E cells assume a large flattened morphology, with random orientation of actin filaments. Interestingly, when HT1080 cells co-express wild type uPAR with uPAR S90E, the latter behaves as a dominant-negative, impairing uPAR-mediated signaling and reducing cell wound repair as well as lung metastasis in nude mice. In contrast, signaling, wound repair and in vivo lung metastasis of HT1080 cells bearing wild type uPAR are enhanced when they co-express uPARS90P. In conclusion, our findings indicate that Ser90 is a critical residue for uPAR signaling and that the S90P and S90E exert opposite effects on uPAR activities. These findings may be accommodated in a molecular model, in which uPARS90E and uPARS90P are forced into inactive and active forms, respectively

  11. Complete Genome Sequence of the d-Amino Acid Catabolism Bacterium Phaeobacter sp. Strain JL2886, Isolated from Deep Seawater of the South China Sea.

    PubMed

    Fu, Yingnan; Wang, Rui; Zhang, Zilian; Jiao, Nianzhi

    2016-01-01

    Phaeobacter sp. strain JL2886, isolated from deep seawater of the South China Sea, can catabolize d-amino acids. Here, we report the complete genome sequence of Phaeobacter sp. JL2886. It comprises ~4.06 Mbp, with a G+C content of 61.52%. A total of 3,913 protein-coding genes and 10 genes related to d-amino acid catabolism were obtained. PMID:27587825

  12. Complete Genome Sequence of the d-Amino Acid Catabolism Bacterium Phaeobacter sp. Strain JL2886, Isolated from Deep Seawater of the South China Sea

    PubMed Central

    Fu, Yingnan; Wang, Rui

    2016-01-01

    Phaeobacter sp. strain JL2886, isolated from deep seawater of the South China Sea, can catabolize d-amino acids. Here, we report the complete genome sequence of Phaeobacter sp. JL2886. It comprises ~4.06 Mbp, with a G+C content of 61.52%. A total of 3,913 protein-coding genes and 10 genes related to d-amino acid catabolism were obtained. PMID:27587825

  13. Partial molecular cloning of the JHK retrovirus using gammaretrovirus consensus PCR primers

    PubMed Central

    Halligan, Brian D; Sun, Hai-Yuan; Kushnaryov, Vladimir M; Grossberg, Sidney E

    2013-01-01

    The JHK virus (JHKV) was previously described as a type C retrovirus that has some distinctive ultrastructural features and replicates constitutively in a human B-lymphoblastoid cell line, JHK-3. In order to facilitate the cloning of sequences from JHKV, a series of partially degenerate consensus retroviral PCR primers were created by a data-driven design approach based on an alignment of 14 diverse gammaretroviral genomes. These primers were used in the PCR amplification of purified JHK virion cDNA, and ana lysis of the resulting amplified sequence indicates that the JHKV is in the murine leukemia virus (MLV) family. The JHK sequence is nearly identical to the corresponding region of the Bxv-1 endogenous mouse retrovirus (GenBank accession AC115959) and distinct from XMRV. JHKV gag-specific amplification was demonstrated with nucleic acids from uncultivated, frozen, peripheral blood mononuclear cells (PBMCs) of the index patient, but not in PBMCs from nine healthy blood donors. Unlike earlier reports, in which MLV-like sequences were identified in human source material, which may have been due to murine contamination, budding retrovirions were demonstrated repeatedly by electron microscopy in uncultivated lymphocytes of the index patient that were morphologically identical in their development to the virions in the JHK-3 cells, and immunological evidence was obtained that the index patient produced IgG antibodies that bound to the budding viral particles in patient PBMCs and in the JHK-3 cells. These data indicate that the patient had been infected by JHKV, lending significance to the demonstration of JHKV amplicons in nucleic acids of the patient’s PBMCs. In future studies, the PCR primer sets described herein may expand the detection of an amplifiable subset of viruses related to MLV. PMID:24159361

  14. Canadian asthma consensus report, 1999

    PubMed Central

    Boulet, L P; Becker, A; Bérubé, D; Beveridge, R; Ernst, P

    1999-01-01

    OBJECTIVES: To provide physicians with current guidelines for the diagnosis and optimal management of asthma in children and adults, including pregnant women and the elderly, in office, emergency department, hospital and clinic settings. OPTIONS: The consensus group considered the roles of education, avoidance of provocative environmental and other factors, diverse pharmacotherapies, delivery devices and emergency and in-hospital management of asthma. OUTCOMES: Provision of the best control of asthma by confirmation of the diagnosis using objective measures, rapid achievement and maintenance of control and regular follow-up. EVIDENCE: The key diagnostic and therapeutic recommendations are based on the 1995 Canadian guidelines and a critical review of the literature by small groups before a full meeting of the consensus group. Recommendations are graded according to 5 levels of evidence. Differences of opinion were resolved by consensus following discussion. VALUES: Respirologists, immunoallergists, pediatricians and emergency and family physicians gave prime consideration to the achievement and maintenance of optimal control of asthma through avoidance of environmental inciters, education of patients and the lowest effective regime of pharmacotherapy to reduce morbidity and mortality. BENEFITS, HARMS AND COSTS: Adherence to the guidelines should be accompanied by significant reduction in patients' symptoms, reduced morbidity and mortality, fewer emergency and hospital admissions, fewer adverse side-effects from medications, better quality of life for patients and reduced costs. RECOMMENDATIONS: Recommendations are included in each section of the report. In summary, after a diagnosis of asthma is made based on clinical evaluation, including demonstration of variable airflow obstruction, and contributing factors are identified, a treatment plan is established to obtain and maintain optimal asthma control. The main components of treatment are patient education

  15. Extremely Acidophilic Protists from Acid Mine Drainage Host Rickettsiales-Lineage Endosymbionts That Have Intervening Sequences in Their 16S rRNA Genes

    PubMed Central

    Baker, Brett J.; Hugenholtz, Philip; Dawson, Scott C.; Banfield, Jillian F.

    2003-01-01

    During a molecular phylogenetic survey of extremely acidic (pH < 1), metal-rich acid mine drainage habitats in the Richmond Mine at Iron Mountain, Calif., we detected 16S rRNA gene sequences of a novel bacterial group belonging to the order Rickettsiales in the Alphaproteobacteria. The closest known relatives of this group (92% 16S rRNA gene sequence identity) are endosymbionts of the protist Acanthamoeba. Oligonucleotide 16S rRNA probes were designed and used to observe members of this group within acidophilic protists. To improve visualization of eukaryotic populations in the acid mine drainage samples, broad-specificity probes for eukaryotes were redesigned and combined to highlight this component of the acid mine drainage community. Approximately 4% of protists in the acid mine drainage samples contained endosymbionts. Measurements of internal pH of the protists showed that their cytosol is close to neutral, indicating that the endosymbionts may be neutrophilic. The endosymbionts had a conserved 273-nucleotide intervening sequence (IVS) in variable region V1 of their 16S rRNA genes. The IVS does not match any sequence in current databases, but the predicted secondary structure forms well-defined stem loops. IVSs are uncommon in rRNA genes and appear to be confined to bacteria living in close association with eukaryotes. Based on the phylogenetic novelty of the endosymbiont sequences and initial culture-independent characterization, we propose the name “Candidatus Captivus acidiprotistae.” To our knowledge, this is the first report of an endosymbiotic relationship in an extremely acidic habitat. PMID:12957940

  16. Predicting Secretory Proteins of Malaria Parasite by Incorporating Sequence Evolution Information into Pseudo Amino Acid Composition via Grey System Model

    PubMed Central

    Lin, Wei-Zhong; Fang, Jian-An; Xiao, Xuan; Chou, Kuo-Chen

    2012-01-01

    The malaria disease has become a cause of poverty and a major hindrance to economic development. The culprit of the disease is the parasite, which secretes an array of proteins within the host erythrocyte to facilitate its own survival. Accordingly, the secretory proteins of malaria parasite have become a logical target for drug design against malaria. Unfortunately, with the increasing resistance to the drugs thus developed, the situation has become more complicated. To cope with the drug resistance problem, one strategy is to timely identify the secreted proteins by malaria parasite, which can serve as potential drug targets. However, it is both expensive and time-consuming to identify the secretory proteins of malaria parasite by experiments alone. To expedite the process for developing effective drugs against malaria, a computational predictor called “iSMP-Grey” was developed that can be used to identify the secretory proteins of malaria parasite based on the protein sequence information alone. During the prediction process a protein sample was formulated with a 60D (dimensional) feature vector formed by incorporating the sequence evolution information into the general form of PseAAC (pseudo amino acid composition) via a grey system model, which is particularly useful for solving complicated problems that are lack of sufficient information or need to process uncertain information. It was observed by the jackknife test that iSMP-Grey achieved an overall success rate of 94.8%, remarkably higher than those by the existing predictors in this area. As a user-friendly web-server, iSMP-Grey is freely accessible to the public at http://www.jci-bioinfo.cn/iSMP-Grey. Moreover, for the convenience of most experimental scientists, a step-by-step guide is provided on how to use the web-server to get the desired results without the need to follow the complicated mathematical equations involved in this paper. PMID:23189138

  17. Next-generation re-sequencing of genes involved in increased platelet reactivity in diabetic patients on acetylsalicylic acid.

    PubMed

    Postula, Marek; Janicki, Piotr K; Eyileten, Ceren; Rosiak, Marek; Kaplon-Cieslicka, Agnieszka; Sugino, Shigekazu; Wilimski, Radosław; Kosior, Dariusz A; Opolski, Grzegorz; Filipiak, Krzysztof J; Mirowska-Guzel, Dagmara

    2016-06-01

    The objective of this study was to investigate whether rare missense genetic variants in several genes related to platelet functions and acetylsalicylic acid (ASA) response are associated with the platelet reactivity in patients with diabetes type 2 (T2D) on ASA therapy. Fifty eight exons and corresponding introns of eight selected genes, including PTGS1, PTGS2, TXBAS1, PTGIS, ADRA2A, ADRA2B, TXBA2R, and P2RY1 were re-sequenced in 230 DNA samples from T2D patients by using a pooled PCR amplification and next-generation sequencing by Illumina HiSeq2000. The observed non-synonymous variants were confirmed by individual genotyping of 384 DNA samples comprising of the individuals from the original discovery pools and additional verification cohort of 154 ASA-treated T2DM patients. The association between investigated phenotypes (ASA induced changes in platelets reactivity by PFA-100, VerifyNow and serum thromboxane B2 level [sTxB2]), and accumulation of rare missense variants (genetic burden) in investigated genes was tested using statistical collapsing tests. We identified a total of 35 exonic variants, including 3 common missense variants, 15 rare missense variants, and 17 synonymous variants in 8 investigated genes. The rare missense variants exhibited statistically significant difference in the accumulation pattern between a group of patients with increased and normal platelet reactivity based on PFA-100 assay. Our study suggests that genetic burden of the rare functional variants in eight genes may contribute to differences in the platelet reactivity measured with the PFA-100 assay in the T2DM patients treated with ASA. PMID:26599574

  18. Identification of G and P genotype-specific motifs in the predicted VP7 and VP4 amino acid sequences.

    PubMed

    Ma, Yongping

    2015-12-01

    Equine rotavirus (ERV) strain L338 (G13P[18]) has a unique G and P genotype. However, the evolutionary relationship of L338 with other ERVs is still unknown. Here whole genome analysis of the L338 ERV strain was independently performed. Its genotype constellations were determined as G13-P[18]-I6-R9-C9-M6-A6-N9-T12-E14-H11, confirming previous genotype assignments. The L338 strain only shared the P[18] and I6 genotypes with other ERVs. The nucleotide sequences of the other 9 RNA segments were different from those of cogent genes of all other group A rotavirus (RVA) strains including ERVs and formed unique phylogenetic lineages. The L338 evolutionary footprints were tentatively identified in both VP7 and VP4 amino acid sequences: two regions were found in VP7 and twelve in VP4. The conserved regions shared between L338 and other group A rotavirus strains (RVAs) indicated that L338 was more closely related genomically to animal and human RVAs other than ERVs, suggesting that L338 may not be an endogenous equine RV but have emerged as an interspecies reassortant with other RVA strains. Furthermore, genotype-specific motifs of all 27 G and 37 P types were identified in regions 7-1a (aa 91-100) of VP7 and regions 8-1 (aa146-151) and 8-3 (aa113-118 and 125-135) of VP4 (VP8*). PMID:26321159

  19. Learning consensus in adversarial environments

    NASA Astrophysics Data System (ADS)

    Vamvoudakis, Kyriakos G.; García Carrillo, Luis R.; Hespanha, João. P.

    2013-05-01

    This work presents a game theory-based consensus problem for leaderless multi-agent systems in the presence of adversarial inputs that are introducing disturbance to the dynamics. Given the presence of enemy components and the possibility of malicious cyber attacks compromising the security of networked teams, a position agreement must be reached by the networked mobile team based on environmental changes. The problem is addressed under a distributed decision making framework that is robust to possible cyber attacks, which has an advantage over centralized decision making in the sense that a decision maker is not required to access information from all the other decision makers. The proposed framework derives three tuning laws for every agent; one associated with the cost, one associated with the controller, and one with the adversarial input.

  20. Subclinical hypothyroidism: Controversies to consensus

    PubMed Central

    Raza, Syed Abbas; Mahmood, Nasir

    2013-01-01

    Diagnoses of subclinicaal hypothyroidism (SCH) is biochemically made, when serum thyroid stimulating hormone (TSH) levels is elevated while free thyroid hormone levels are within normal reference range. SCH is diagnosed after excluding all other causes of elevated TSH levels. Symptoms of SCH may vary from being asymptomatic to having mild nonspecific symptoms. The risk of progression to overt hypothyroidism is related to number of factors including initial serum TSH concentration, presence of auto antibodies, family history and presence goiter. Various screening recommendations for thyroid function assessment are in practice. There are still controversies surrounding SCH and associated risk of various cardiovascular diseases (CVDs), pregnancy outcomes, neuropsychiatric issues, metabolic syndrome, and dyslipidemia. Consensus will require more large randomized clinical studies involving various age groups and medical condition, especially in developing countries. All these efforts will definitely improve our understanding of disease and ultimately patient outcomes. PMID:24910826

  1. Consensus in a Precambrian garden

    NASA Astrophysics Data System (ADS)

    Maggs, William Ward

    At the Precambrian-Cambrian boundary, the course of life on Earth underwent a dramatic change that culminated in the rise of predators and other complex animals, a group of paleontologists agreed at a conferece last week.Just prior to 590 million years ago, the ecology of life in the oceans was very simple; soft-shelled multicellular animals called Ediacara lived in apparent harmony with vast mats o f bacteria and algae that covered the seafloor, dependent on the photosynthesis or chemosynthesis of their one-celled hosts for their existence. According to the consensus reached by the scientists, this symbiotic and apparently global “Garden of Ediacara” fell early in the Cambrian Period, as the mats declined and food chains multiplied with new animals that, for the first time in Earth's history, preyed on other living things.

  2. Data publication consensus and controversies

    PubMed Central

    Kratz, John; Strasser, Carly

    2014-01-01

    The movement to bring datasets into the scholarly record as first class research products (validated, preserved, cited, and credited) has been inching forward for some time, but now the pace is quickening. As data publication venues proliferate, significant debate continues over formats, processes, and terminology. Here, we present an overview of data publication initiatives underway and the current conversation, highlighting points of consensus and issues still in contention. Data publication implementations differ in a variety of factors, including the kind of documentation, the location of the documentation relative to the data, and how the data is validated. Publishers may present data as supplemental material to a journal article, with a descriptive “data paper,” or independently. Complicating the situation, different initiatives and communities use the same terms to refer to distinct but overlapping concepts. For instance, the term published means that the data is publicly available and citable to virtually everyone, but it may or may not imply that the data has been peer-reviewed. In turn, what is meant by data peer review is far from defined; standards and processes encompass the full range employed in reviewing the literature, plus some novel variations. Basic data citation is a point of consensus, but the general agreement on the core elements of a dataset citation frays if the data is dynamic or part of a larger set. Even as data publication is being defined, some are looking past publication to other metaphors, notably “data as software,” for solutions to the more stubborn problems. PMID:25075301

  3. Hilar Cholangiocarcinoma: expert consensus statement

    PubMed Central

    Mansour, John C; Aloia, Thomas A; Crane, Christopher H; Heimbach, Julie K; Nagino, Masato; Vauthey, Jean-Nicolas

    2015-01-01

    An American Hepato-Pancreato-Biliary Association (AHPBA)-sponsored consensus meeting of expert panellists met on 15 January 2014 to review current evidence on the management of hilar cholangiocarcinoma in order to establish practice guidelines and to agree consensus statements. It was established that the treatment of patients with hilar cholangiocarcinoma requires a coordinated, multidisciplinary approach to optimize the chances for both durable survival and effective palliation. An adequate diagnostic and staging work-up includes high-quality cross-sectional imaging; however, pathologic confirmation is not required prior to resection or initiation of a liver transplant trimodal treatment protocol. The ideal treatment for suitable patients with resectable hilar malignancy is resection of the intra- and extrahepatic bile ducts, as well as resection of the involved ipsilateral liver. Preoperative biliary drainage is best achieved with percutaneous transhepatic approaches and may be indicated for patients with cholangitis, malnutrition or hepatic insufficiency. Portal vein embolization is a safe and effective strategy for increasing the future liver remnant (FLR) and is particularly useful for patients with an FLR of <30%. Selected patients with unresectable hilar cholangiocarcinoma should be evaluated for a standard trimodal protocol incorporating external beam and endoluminal radiation therapy, systemic chemotherapy and liver transplantation. Post-resection chemoradiation should be offered to patients who show high-risk features on surgical pathology. Chemoradiation is also recommended for patients with locally advanced, unresectable hilar cancers. For patients with locally recurrent or metastatic hilar cholangiocarcinoma, first-line chemotherapy with gemcitabine and cisplatin is recommended based on multiple Phase II trials and a large randomized controlled trial including a heterogeneous population of patients with biliary cancers. PMID:26172136

  4. Hilar cholangiocarcinoma: expert consensus statement.

    PubMed

    Mansour, John C; Aloia, Thomas A; Crane, Christopher H; Heimbach, Julie K; Nagino, Masato; Vauthey, Jean-Nicolas

    2015-08-01

    An American Hepato-Pancreato-Biliary Association (AHPBA)-sponsored consensus meeting of expert panellists met on 15 January 2014 to review current evidence on the management of hilar cholangiocarcinoma in order to establish practice guidelines and to agree consensus statements. It was established that the treatment of patients with hilar cholangiocarcinoma requires a coordinated, multidisciplinary approach to optimize the chances for both durable survival and effective palliation. An adequate diagnostic and staging work-up includes high-quality cross-sectional imaging; however, pathologic confirmation is not required prior to resection or initiation of a liver transplant trimodal treatment protocol. The ideal treatment for suitable patients with resectable hilar malignancy is resection of the intra- and extrahepatic bile ducts, as well as resection of the involved ipsilateral liver. Preoperative biliary drainage is best achieved with percutaneous transhepatic approaches and may be indicated for patients with cholangitis, malnutrition or hepatic insufficiency. Portal vein embolization is a safe and effective strategy for increasing the future liver remnant (FLR) and is particularly useful for patients with an FLR of <30%. Selected patients with unresectable hilar cholangiocarcinoma should be evaluated for a standard trimodal protocol incorporating external beam and endoluminal radiation therapy, systemic chemotherapy and liver transplantation. Post-resection chemoradiation should be offered to patients who show high-risk features on surgical pathology. Chemoradiation is also recommended for patients with locally advanced, unresectable hilar cancers. For patients with locally recurrent or metastatic hilar cholangiocarcinoma, first-line chemotherapy with gemcitabine and cisplatin is recommended based on multiple Phase II trials and a large randomized controlled trial including a heterogeneous population of patients with biliary cancers. PMID:26172136

  5. Intrahepatic cholangiocarcinoma: expert consensus statement.

    PubMed

    Weber, Sharon M; Ribero, Dario; O'Reilly, Eileen M; Kokudo, Norihiro; Miyazaki, Masaru; Pawlik, Timothy M

    2015-08-01

    An American Hepato-Pancreato-Biliary Association (AHPBA)-sponsored consensus meeting of expert panellists met on 15 January 2014 to review current evidence on the management of intrahepatic cholangiocarcinoma (ICC) in order to establish practice guidelines and to agree on consensus statements. The treatment of ICC requires a coordinated, multidisciplinary approach to optimize survival. Biopsy is not necessary if the surgeon suspects ICC and is planning curative resection, although biopsy should be obtained before systemic or locoregional therapies are initiated. Assessment of resectability is best accomplished using cross-sectional imaging [computed tomography (CT) or magnetic resonance imaging (MRI)], but the role of positron emission tomography (PET) is unclear. Resectability in ICC is defined by the ability to completely remove the disease while leaving an adequate liver remnant. Extrahepatic disease, multiple bilobar or multicentric tumours, and lymph node metastases beyond the primary echelon are contraindications to resection. Regional lymphadenectomy should be considered a standard part of surgical therapy. In patients with high-risk features, the routine use of diagnostic laparoscopy is recommended. The preoperative diagnosis of combined hepatocellular carcinoma and cholangiocarcinoma (cHCC-CC) by imaging studies is extremely difficult. Surgical resection remains the mainstay of treatment, but survival is worse than in HCC alone. There are no adequately powered, randomized Phase III trials that can provide definitive recommendations for adjuvant therapy for ICC. Patients with high-risk features (lymphovascular invasion, multicentricity or satellitosis, large tumours) should be encouraged to enrol in clinical trials and to consider adjuvant therapy. Cisplatin plus gemcitabine represents the standard-of-care, front-line systemic therapy for metastatic ICC. Genomic analyses of biliary cancers support the development of targeted therapeutic interventions. PMID

  6. International Consensus on drug allergy.

    PubMed

    Demoly, P; Adkinson, N F; Brockow, K; Castells, M; Chiriac, A M; Greenberger, P A; Khan, D A; Lang, D M; Park, H-S; Pichler, W; Sanchez-Borges, M; Shiohara, T; Thong, B Y- H

    2014-04-01

    When drug reactions resembling allergy occur, they are called drug hypersensitivity reactions (DHRs) before showing the evidence of either drug-specific antibodies or T cells. DHRs may be allergic or nonallergic in nature, with drug allergies being immunologically mediated DHRs. These reactions are typically unpredictable. They can be life-threatening, may require or prolong hospitalization, and may necessitate changes in subsequent therapy. Both underdiagnosis (due to under-reporting) and overdiagnosis (due to an overuse of the term ‘allergy’) are common. A definitive diagnosis of such reactions is required in order to institute adequate treatment options and proper preventive measures. Misclassification based solely on the DHR history without further testing may affect treatment options, result in adverse consequences, and lead to the use of more-expensive or less-effective drugs, in contrast to patients who had undergone a complete drug allergy workup. Several guidelines and/or consensus documents on general or specific drug class-induced DHRs are available to support the medical decision process. The use of standardized systematic approaches for the diagnosis and management of DHRs carries the potential to improve outcomes and should thus be disseminated and implemented. Consequently, the International Collaboration in Asthma, Allergy and Immunology (iCAALL), formed by the European Academy of Allergy and Clinical Immunology (EAACI), the American Academy of Allergy, Asthma and Immunology (AAAAI), the American College of Allergy, Asthma and Immunology (ACAAI), and the World Allergy Organization (WAO), has decided to issue an International CONsensus (ICON) on drug allergy. The purpose of this document is to highlight the key messages that are common to many of the existing guidelines, while critically reviewing and commenting on any differences and deficiencies of evidence, thus providing a comprehensive reference document for the diagnosis and management of

  7. Peptides Composed of Alternating L- and D-Amino Acids Inhibit Amyloidogenesis in Three Distinct Amyloid Systems Independent of Sequence.

    PubMed

    Kellock, Jackson; Hopping, Gene; Caughey, Byron; Daggett, Valerie

    2016-06-01

    There is now substantial evidence that soluble oligomers are primary toxic agents in amyloid diseases. The development of an antibody recognizing the toxic soluble oligomeric forms of different and unrelated amyloid species suggests a common conformational intermediate during amyloidogenesis. We previously observed a common occurrence of a novel secondary structure element, which we call α-sheet, in molecular dynamics (MD) simulations of various amyloidogenic proteins, and we hypothesized that the toxic conformer is composed of α-sheet structure. As such, α-sheet may represent a conformational signature of the misfolded intermediates of amyloidogenesis and a potential unique binding target for peptide inhibitors. Recently, we reported the design and characterization of a novel hairpin peptide (α1 or AP90) that adopts stable α-sheet structure and inhibits the aggregation of the β-Amyloid Peptide Aβ42 and transthyretin. AP90 is a 23-residue hairpin peptide featuring alternating D- and L-amino acids with favorable conformational propensities for α-sheet formation, and a designed turn. For this study, we reverse engineered AP90 to identify which of its design features is most responsible for conferring α-sheet stability and inhibitory activity. We present experimental characterization (CD and FTIR) of seven peptides designed to accomplish this. In addition, we measured their ability to inhibit aggregation in three unrelated amyloid species: Aβ42, transthyretin, and human islet amylin polypeptide. We found that a hairpin peptide featuring alternating L- and D-amino acids, independent of sequence, is sufficient for conferring α-sheet structure and inhibition of aggregation. Additionally, we show a correlation between α-sheet structural stability and inhibitory activity. PMID:27012425

  8. The delta EEG (sleep)-inducing peptide (DSIP). XI. Amino-acid analysis, sequence, synthesis and activity of the nonapeptide.

    PubMed

    Schoenenberger, G A; Maier, P F; Tobler, H J; Wilson, K; Monnier, M

    1978-09-01

    A peptide which induces slow-wave EEG (sleep) after intraventricular infusion into the brain has been isolated from the extracorporeal dialysate of cerebral venous blood in rabbits submitted to hypnogenic electrical stimulation of the intralaminar thalamic area. It was shown by amino-acid analysis and sequence determination to be Trp-Ala-Gly-Gly-Asp-Ala-Ser-Gly-Glu and named "Delta Sleep-Inducing Peptide" (DSIP). This compound was synthesized as well as 5 possible metabolic products (1--8, 2--9, 2--8, 1--4 and 5--9), 2 nonapeptide analogues (with one and two amino-acids exchanged) and a related tripeptide (Trp-Ser-Glu). All 9 synthetic peptides were infused intraventricularly in rabbits (6 nmol/kg in 0.05 ml of CSF-like solution over 3.5 min) and tested under double-blind conditions. A total of 61 rabbits including controls were used. The EEG from the frontal neocortex and the limbic archicortex were subjected to direct fast-Fourier transformation and analyzed by an 1108 computer system. A highly specific delta and spindle EEG-enhancing effect of the synthetic DSIP could be demonstrated. The mean increase of EEG delta activity reached 35% in the neocortex and limbic cortex as compared to control animals receiving CSF-like solution or any of the other 8 peptides. The final chemical characterization of the synthetic DSIP revealed that only the pure alpha-aspartyl peptide is highly active in contrast to its beta-Asp isomer. A neurohumoral modulating and programming activity was suggested. PMID:568769

  9. Species specific identification of spore-producing microbes using the gene sequence of small acid-soluble spore coat proteins for amplification based diagnostics

    DOEpatents

    McKinney, Nancy

    2002-01-01

    PCR (polymerase chain reaction) primers for the detection of certain Bacillus species, such as Bacillus anthracis. The primers specifically amplify only DNA found in the target species and can distinguish closely related species. Species-specific PCR primers for Bacillus anthracis, Bacillus globigii and Clostridium perfringens are disclosed. The primers are directed to unique sequences within sasp (small acid soluble protein) genes.

  10. Draft Genome Sequences of Salmonella enterica subsp. enterica Serovar Berta ATCC 8392 and a Nalidixic Acid-Resistant Isolate of This Strain

    PubMed Central

    Cooper, Ashley; Koziol, Adam G.; Carrillo, Catherine D.

    2016-01-01

    Salmonella enterica subspecies enterica serovar Berta has been isolated in multiple animal species and has been implicated in human disease. Here, we report a 4.7-Mbp draft genome sequence of S. enterica serovar Berta (ATCC strain 8392) and a nalidixic acid-resistant isolate derived from this strain. PMID:27103707

  11. COMPARISON OF PHYLOGENETIC RELATIONSHIPS BASED ON PHOSPHOLIPID FATTY ACID PROFILES AND RIBOSOMAL RNA SEQUENCE SIMILARITIES AMONG DISSIMILATORY SULFATE-REDUCING BACTERIA

    EPA Science Inventory

    Twenty-five isolates of dissimilatory sulfate-reducing bacteria were clustered based on similarity analysis of their phospholipid ester-linked fatty acids (PLFA). f these, twenty-three showed the phylogenetic relationships based on the sequence similarity of their 16S rRNA direct...

  12. Nucleotide sequence of the gene encoding the nitrogenase iron protein of Thiobacillus ferrooxidans

    SciTech Connect

    Pretorius, I.M.; Rawlings, D.E.; O'Neill, E.G.; Jones, W.A.; Kirby, R.; Woods, D.R.

    1987-01-01

    The DNA sequence was determined for the cloned Thiobacillus ferrooxidans nifH and part of the nifD genes. The DNA chains were radiolabeled with (..cap alpha..-/sup 32/P)dCTP (3000 Ci/mmol) or (..cap alpha..-/sup 35/S)dCTP (400 Ci/mmol). A putative T. ferrooxidans nifH promoter was identified whose sequences showed perfect consensus with those of the Klebsiella pneumoniae nif promoter. Two putative consensus upstream activator sequences were also identified. The amino acid sequence was deduced from the DNA sequence. In a comparison of nifH DNA sequences from T. ferrooxidans and eight other nitrogen-fixing microbes, a Rhizobium sp. isolated from Parasponia andersonii showed the greatest homology (74%) and Clostridium pasteurianum (nifH1) showed the least homology (54%). In the comparison of the amino acid sequences of the Fe proteins, the Rhizobium sp. and Rhizobium japonicum showed the greatest homology (both 86%) and C. pasteurianum (nifH1 gene product) demonstrated the least homology (56%) to the T. ferrooxidans Fe protein.

  13. Amino acid sequences of peptides from a tryptic digest of a urea-soluble protein fraction (U.S.3) from oxidized wool

    PubMed Central

    Corfield, M. C.; Fletcher, J. C.; Robson, A.

    1967-01-01

    1. A tryptic digest of the protein fraction U.S.3 from oxidized wool has been separated into 32 peptide fractions by cation-exchange resin chromatography. 2. Most of these fractions have been resolved into their component peptides by a combination of the techniques of cation-exchange resin chromatography, paper chromatography and paper electrophoresis. 3. The amino acid compositions of 58 of the peptides in the digest present in the largest amounts have been determined. 4. The amino acid sequences of 38 of these have been completely elucidated and those of six others partially derived. 5. These findings indicate that the parent protein in wool from which the protein fraction U.S.3 is derived has a minimum molecular weight of 74000. 6. The structures of wool proteins are discussed in the light of the peptide sequences determined, and, in particular, of those sequences in fraction U.S.3 that could not be elucidated. PMID:16742497

  14. A Single Amino Acid Substitution in 1918 Influenza Virus Hemagglutinin Changes Receptor Binding Specificity

    PubMed Central

    Glaser, Laurel; Stevens, James; Zamarin, Dmitriy; Wilson, Ian A.; García-Sastre, Adolfo; Tumpey, Terrence M.; Basler, Christopher F.; Taubenberger, Jeffery K.; Palese, Peter

    2005-01-01

    The receptor binding specificity of influenza viruses may be important for host restriction of human and avian viruses. Here, we show that the hemagglutinin (HA) of the virus that caused the 1918 influenza pandemic has strain-specific differences in its receptor binding specificity. The A/South Carolina/1/18 HA preferentially binds the α2,6 sialic acid (human) cellular receptor, whereas the A/New York/1/18 HA, which differs by only one amino acid, binds both the α2,6 and the α2,3 sialic acid (avian) cellular receptors. Compared to the conserved consensus sequence in the receptor binding site of avian HAs, only a single amino acid at position 190 was changed in the A/New York/1/18 HA. Mutation of this single amino acid back to the avian consensus resulted in a preference for the avian receptor. PMID:16103207

  15. RoboOligo: software for mass spectrometry data to support manual and de novo sequencing of post-transcriptionally modified ribonucleic acids

    PubMed Central

    Sample, Paul J.; Gaston, Kirk W.; Alfonzo, Juan D.; Limbach, Patrick A.

    2015-01-01

    Ribosomal ribonucleic acid (RNA), transfer RNA and other biological or synthetic RNA polymers can contain nucleotides that have been modified by the addition of chemical groups. Traditional Sanger sequencing methods cannot establish the chemical nature and sequence of these modified-nucleotide containing oligomers. Mass spectrometry (MS) has become the conventional approach for determining the nucleotide composition, modification status and sequence of modified RNAs. Modified RNAs are analyzed by MS using collision-induced dissociation tandem mass spectrometry (CID MS/MS), which produces a complex dataset of oligomeric fragments that must be interpreted to identify and place modified nucleosides within the RNA sequence. Here we report the development of RoboOligo, an interactive software program for the robust analysis of data generated by CID MS/MS of RNA oligomers. There are three main functions of RoboOligo: (i) automated de novo sequencing via the local search paradigm. (ii) Manual sequencing with real-time spectrum labeling and cumulative intensity scoring. (iii) A hybrid approach, coined ‘variable sequencing’, which combines the user intuition of manual sequencing with the high-throughput sampling of automated de novo sequencing. PMID:25820423

  16. Nucleic acid sequence-based amplification assays for rapid detection of West Nile and St. Louis encephalitis viruses.

    PubMed

    Lanciotti, R S; Kerst, A J

    2001-12-01

    The development and application of nucleic acid sequence-based amplification (NASBA) assays for the detection of West Nile (WN) and St. Louis encephalitis (SLE) viruses are reported. Two unique detection formats were developed for the NASBA assays: a postamplification detection step with a virus-specific internal capture probe and electrochemiluminescence (NASBA-ECL assay) and a real-time assay with 6-carboxyfluorescein-labeled virus-specific molecular beacon probes (NASBA-beacon assay). The sensitivities and specificities of these NASBA assays were compared to those of a newly described standard reverse transcription (RT)-PCR and TaqMan assays for SLE virus and to a previously published TaqMan assay for WN virus. The NASBA assays demonstrated exceptional sensitivities and specificities compared to those of virus isolation, the TaqMan assays, and standard RT-PCR, with the NASBA-beacon assay yielding results in less than 1 h. These assays should be of utility in the diagnostic laboratory to complement existing diagnostic testing methodologies and as a tool in conducting flavivirus surveillance in the United States. PMID:11724870

  17. Purification and complete amino acid sequence of a new type of sweet protein taste-modifying activity, curculin.

    PubMed

    Yamashita, H; Theerasilp, S; Aiuchi, T; Nakaya, K; Nakamura, Y; Kurihara, Y

    1990-09-15

    A new taste-modifying protein named curculin was extracted with 0.5 M NaCl from the fruits of Curculigo latifolia and purified by ammonium sulfate fractionation, CM-Sepharose ion-exchange chromatography, and gel filtration. Purified curculin thus obtained gave a single band having a Mr of 12,000 on sodium dodecyl sulfate-polyacrylamide gel electrophoresis in the presence of 8 M urea. The molecular weight determined by low-angle laser light scattering was 27,800. These results suggest that native curculin is a dimer of a 12,000-Da polypeptide. The complete amino acid sequence of curculin was determined by automatic Edman degradation. Curculin consists of 114 residues. Curculin itself elicits a sweet taste. After curculin, water elicits a sweet taste, and sour substances induce a stronger sense of sweetness. No protein with both sweet-tasting and taste-modifying activities has ever been found. There are five sets of tripeptides common to miraculin (a taste-modifying protein), six sets of tripeptides common to thaumatin (a sweet protein), and two sets of tripeptides common to monellin (a sweet protein). Anti-miraculin serum was not immunologically reactive with curculin. The mechanism of the taste-modifying action of curculin is discussed. PMID:2394746

  18. Sequencing around 5-Hydroxyconiferyl Alcohol-Derived Units in Caffeic Acid O-Methyltransferase-Deficient Poplar Lignins1[OA

    PubMed Central

    Lu, Fachuang; Marita, Jane M.; Lapierre, Catherine; Jouanin, Lise; Morreel, Kris; Boerjan, Wout; Ralph, John

    2010-01-01

    Caffeic acid O-methyltransferase (COMT) is a bifunctional enzyme that methylates the 5- and 3-hydroxyl positions on the aromatic ring of monolignol precursors, with a preference for 5-hydroxyconiferaldehyde, on the way to producing sinapyl alcohol. Lignins in COMT-deficient plants contain benzodioxane substructures due to the incorporation of 5-hydroxyconiferyl alcohol (5-OH-CA), as a monomer, into the lignin polymer. The derivatization followed by reductive cleavage method can be used to detect and determine benzodioxane structures because of their total survival under this degradation method. Moreover, partial sequencing information for 5-OH-CA incorporation into lignin can be derived from detection or isolation and structural analysis of the resulting benzodioxane products. Results from a modified derivatization followed by reductive cleavage analysis of COMT-deficient lignins provide evidence that 5-OH-CA cross couples (at its β-position) with syringyl and guaiacyl units (at their O-4-positions) in the growing lignin polymer and then either coniferyl or sinapyl alcohol, or another 5-hydroxyconiferyl monomer, adds to the resulting 5-hydroxyguaiacyl terminus, producing the benzodioxane. This new terminus may also become etherified by coupling with further monolignols, incorporating the 5-OH-CA integrally into the lignin structure. PMID:20427467

  19. Sequencing around 5-hydroxyconiferyl alcohol-derived units in caffeic acid O-methyltransferase-deficient poplar lignins.

    PubMed

    Lu, Fachuang; Marita, Jane M; Lapierre, Catherine; Jouanin, Lise; Morreel, Kris; Boerjan, Wout; Ralph, John

    2010-06-01

    Caffeic acid O-methyltransferase (COMT) is a bifunctional enzyme that methylates the 5- and 3-hydroxyl positions on the aromatic ring of monolignol precursors, with a preference for 5-hydroxyconiferaldehyde, on the way to producing sinapyl alcohol. Lignins in COMT-deficient plants contain benzodioxane substructures due to the incorporation of 5-hydroxyconiferyl alcohol (5-OH-CA), as a monomer, into the lignin polymer. The derivatization followed by reductive cleavage method can be used to detect and determine benzodioxane structures because of their total survival under this degradation method. Moreover, partial sequencing information for 5-OH-CA incorporation into lignin can be derived from detection or isolation and structural analysis of the resulting benzodioxane products. Results from a modified derivatization followed by reductive cleavage analysis of COMT-deficient lignins provide evidence that 5-OH-CA cross couples (at its beta-position) with syringyl and guaiacyl units (at their O-4-positions) in the growing lignin polymer and then either coniferyl or sinapyl alcohol, or another 5-hydroxyconiferyl monomer, adds to the resulting 5-hydroxyguaiacyl terminus, producing the benzodioxane. This new terminus may also become etherified by coupling with further monolignols, incorporating the 5-OH-CA integrally into the lignin structure. PMID:20427467

  20. Transposition of a plasmid deoxyribonucleic acid sequence that mediates ampicillin resistance: independence from host rec functions and orientation of insertion.

    PubMed Central

    Rubens, C; Heffron, F; Falkow, S

    1976-01-01

    Insertion of the transposable deoxyribonucleic acid sequence that specifies the TEM beta-lactamase (TnA) occurred in at least 19 sites on the 5.5 x 10(6)-dalton plasmid RSF1010. There was no significant difference in the frequency of transposition or in the distribution of TnA insertion sites for recombinant plasmids isolated from recombination-proficient (rec+) or recombination-deficient (rec-) bacterial host cells. The site and orientation of TnA insertions were determined by both heteroduplex analysis and enzymatic digestion with restriction endonucleases. Insertion in the gene encoding for sulfonamide resistance occurred without circular permutation in one or the other of two distinct orientations. Insertions in orientation P were strongly polar on distal gene expression, whereas insertions in orientation M were mutagenic but not polar. In addition, we have observed that TnA elements from different R plasmids show fine structural heterogeneity, and that TnA insertion at a site adjacent to the origin of replication causes an increase in plasmid copy number. Images PMID:789346