Science.gov

Sample records for acids sequence comparison

  1. Nucleotide and derived amino acid sequences of the major porin of Comamonas acidovorans and comparison of porin primary structures.

    PubMed Central

    Gerbl-Rieger, S; Peters, J; Kellermann, J; Lottspeich, F; Baumeister, W

    1991-01-01

    The DNA sequence of the gene which codes for the major outer membrane porin (Omp32) of Comamonas acidovorans has been determined. The structural gene encodes a precursor consisting of 351 amino acid residues with a signal peptide of 19 amino acid residues. Comparisons with amino acid sequences of outer membrane proteins and porins from several other members of the class Proteobacteria and of the Chlamydia trachomatis porin and the Neurospora crassa mitochondrial porin revealed a motif of eight regions of local homology. The results of this analysis are discussed with regard to common structural features of porins. PMID:1848840

  2. Complete amino acid sequence of branched-chain amino acid aminotransferase (transaminase B) of Salmonella typhimurium, identification of the coenzyme-binding site and sequence comparison analysis

    SciTech Connect

    Feild, M.J.

    1988-01-01

    The complete amino acid sequence of the subunit of branched-chain amino acid aminotransferase of Salmonella typhimurium was determined by automated Edman degradation of peptide fragments generated by chemical and enzymatic digestion of S-carboxymethylated and S-pyridylethylated transaminase B. Peptide fragments of transaminase B were generated by treatment of the enzyme with trypsin, Staphylococcus aureus V8 protease, endoproteinase Lys-C, and cyanogen bromide. Protocols were developed for separation of the peptide fragments by reverse-phase high performance liquid chromatography (HPLC), ion-exchange HPLC, and SDS-urea gel electrophoresis. The enzyme subunit contains 308 amino acid residues and has a molecular weight of 33,920 daltons. The coenzyme-binding site was determined by treatment of the enzyme, containing bound pyridoxal 5-phosphate, with tritiated sodium borohydride prior to trypsin digestion. Monitoring radioactivity incorporation and peptide map comparisons with an apoenzyme tryptic digest, allowed identification of the pyridoxylated-peptide which was isolated by reverse-phase HPLC and sequenced. The coenzyme-binding site is a lysyl residue at position 159. Some peptides were further characterized by fast atom bombardment mass spectrometry.

  3. Nucleic acid sequence of an internal image-bearing monoclonal anti-idiotype and its comparison to the sequence of the external antigen.

    PubMed Central

    Bruck, C; Co, M S; Slaoui, M; Gaulton, G N; Smith, T; Fields, B N; Mullins, J I; Greene, M I

    1986-01-01

    The monoclonal anti-idiotypic antibody (mAb2) 87.92.6 directed against the 9B.G5 antibody specific for the virus neutralizing epitope on the mammalian reovirus type 3 hemagglutinin was previously demonstrated to express an internal image of the receptor binding epitope of the reovirus type 3. Furthermore, this mAb2 has autoimmune reactivity to the cell surface receptor of the reovirus. The nucleotide and deduced amino acid sequences of the 87.92.6 mAb2 heavy and light chains are described in this report. The sequence analysis reveals that the same heavy chain variable and joining (VH and JH) gene segments are used by the 87.92.6 anti-idiotypic mAb2 and by the dominant idiotypes of the BALB/c anti-GAT (cGAT) and anti-NP (NPa) responses. [GAT; random polymer that is 60% glutamic acid, 30% alanine, and 10% tyrosine. NP; (4-hydroxy-3-nitrophenyl)-acetyl.] Despite extensive homology at the level of the heavy chain variable regions, the NPa positive BALB/c anti-NP monoclonal antibody 17.2.25 binds neither 9B.G5 nor the cellular receptor for the hemagglutinin. Amino acid sequence comparison between the viral hemagglutinin and the 87.92.6 mAb2 light chain "internal image," reveals an area of significant homology indicating that antigen mimicry by antibodies may be achieved by sharing primary structure. PMID:2428036

  4. Complete amino acid sequence of ananain and a comparison with stem bromelain and other plant cysteine proteases.

    PubMed Central

    Lee, K L; Albee, K L; Bernasconi, R J; Edmunds, T

    1997-01-01

    The amino acid sequences of ananain (EC3.4.22.31) and stem bromelain (3.4.22.32), two cysteine proteases from pineapple stem, are similar yet ananain and stem bromelain possess distinct specificities towards synthetic peptide substrates and different reactivities towards the cysteine protease inhibitors E-64 and chicken egg white cystatin. We present here the complete amino acid sequence of ananain and compare it with the reported sequences of pineapple stem bromelain, papain and chymopapain from papaya and actinidin from kiwifruit. Ananain is comprised of 216 residues with a theoretical mass of 23464 Da. This primary structure includes a sequence insert between residues 170 and 174 not present in stem bromelain or papain and a hydrophobic series of amino acids adjacent to His-157. It is possible that these sequence differences contribute to the different substrate and inhibitor specificities exhibited by ananain and stem bromelain. PMID:9355753

  5. Amino acid sequence and structural comparison of BACE1 and BACE2 using evolutionary trace method.

    PubMed

    Mirsafian, Hoda; Mat Ripen, Adiratna; Merican, Amir Feisal; Bin Mohamad, Saharuddin

    2014-01-01

    Beta-amyloid precursor protein cleavage enzyme 1 (BACE1) and beta-amyloid precursor protein cleavage enzyme 2 (BACE2), members of aspartyl protease family, are close homologues and have high similarity in their protein crystal structures. However, their enzymatic properties differ leading to disparate clinical consequences. In order to identify the residues that are responsible for such differences, we used evolutionary trace (ET) method to compare the amino acid conservation patterns of BACE1 and BACE2 in several mammalian species. We found that, in BACE1 and BACE2 structures, most of the ligand binding sites are conserved which indicate their enzymatic property of aspartyl protease family members. The other conserved residues are more or less randomly localized in other parts of the structures. Four group-specific residues were identified at the ligand binding site of BACE1 and BACE2. We postulated that these residues would be essential for selectivity of BACE1 and BACE2 biological functions and could be sites of interest for the design of selective inhibitors targeting either BACE1 or BACE2.

  6. The complete amino acid sequence of prochymosin.

    PubMed Central

    Foltmann, B; Pedersen, V B; Jacobsen, H; Kauffman, D; Wybrandt, G

    1977-01-01

    The total sequence of 365 amino acid residues in bovine prochymosin is presented. Alignment with the amino acid sequence of porcine pepsinogen shows that 204 amino acid residues are common to the two zymogens. Further comparison and alignment with the amino acid sequence of penicillopepsin shows that 66 residues are located at identical positions in all three proteases. The three enzymes belong to a large group of proteases with two aspartate residues in the active center. This group forms a family derived from one common ancestor. PMID:329280

  7. Large-Scale Sequence Comparison.

    PubMed

    Lal, Devi; Verma, Mansi

    2017-01-01

    There are millions of sequences deposited in genomic databases, and it is an important task to categorize them according to their structural and functional roles. Sequence comparison is a prerequisite for proper categorization of both DNA and protein sequences, and helps in assigning a putative or hypothetical structure and function to a given sequence. There are various methods available for comparing sequences, alignment being first and foremost for sequences with a small number of base pairs as well as for large-scale genome comparison. Various tools are available for performing pairwise large sequence comparison. The best known tools either perform global alignment or generate local alignments between the two sequences. In this chapter we first provide basic information regarding sequence comparison. This is followed by the description of the PAM and BLOSUM matrices that form the basis of sequence comparison. We also give a practical overview of currently available methods such as BLAST and FASTA, followed by a description and overview of tools available for genome comparison including LAGAN, MumMER, BLASTZ, and AVID.

  8. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  9. Sequence Comparison and Phylogeny of Nucleotide Sequence of Coat Protein and Nucleic Acid Binding Protein of a Distinct Isolate of Shallot virus X from India.

    PubMed

    Majumder, S; Baranwal, V K

    2011-06-01

    Shallot virus X (ShVX), a type species in the genus Allexivirus of the family Alfaflexiviridae has been associated with shallot plants in India and other shallot growing countries like Russia, Germany, Netherland, and New Zealand. Coat protein (CP) and nucleic acid binding protein (NB) region of the virus was obtained by reverse transcriptase polymerase chain reaction from scales leaves of shallot bulbs. The partial cDNA contained two open reading frames encoding proteins of molecular weights of 28.66 and 14.18 kDa belonging to Flexi_CP super-family and viral NB super-family, respectively. The percent identity and phylogenetic analysis of amino acid sequences of CP and NB region of the virus associated with shallot indicated that it was a distinct isolate of ShVX.

  10. High speed nucleic acid sequencing

    SciTech Connect

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.

  11. Comparison of 5 monoclonal antibodies for immunopurification of human butyrylcholinesterase on Dynabeads: KD values, binding pairs, and amino acid sequences

    PubMed Central

    Peng, Hong; Brimijoin, Stephen; Hrabovska, Anna; Targosova, Katarina; Krejci, Eric; Blake, Thomas A.; Johnson, Rudolph C.; Masson, Patrick; Lockridge, Oksana

    2016-01-01

    Human butyrylcholinesterase (HuBChE) is a stoichiometric bioscavenger of nerve agents and organophosphorus pesticides. Mass spectrometry methods detect stable nerve agent adducts on the active site serine of HuBChE. The first step in sample preparation is immunopurification of HuBChE from plasma. Our goal was to identify monoclonal antibodies that could be used to immunopurify HuBChE on Dynabeads Protein G. Mouse anti-HuBChE monoclonal antibodies were obtained in the form of ascites fluid, dead hybridoma cells stored frozen at −80°C for 30 years, or recently frozen hybridoma cells. RNA from 4 hybridoma cell lines was amplified by PCR for determination of their nucleotide and amino acid sequences. Full-length light and heavy chains were expressed, and the antibodies purified from culture medium. A fifth monoclonal was purchased. The 5 monoclonal antibodies were compared for ability to capture HuBChE from human plasma on Dynabeads Protein G. In addition, they were evaluated for binding affinity by Biacore and ELISA. Epitope mapping by pairing analysis was performed on the Octet Red96 instrument. The 5 monoclonal antibodies, B2 12-1, B2 18-5, 3E8, mAb2, and 11D8, had similar KD values of 10−9 M for HuBChE. Monoclonal B2 18-5 outperformed the others in the Dynabeads Protein G assay where it captured 97% of the HuBChE in 0.5 ml plasma. Pairing analysis showed that 3E8 and B2 12-1 share the same epitope, 11D8 and B2 18-5 share the same epitope, but mAb2 and B2 12-1 or mAb2 and 3E8 bind to different epitopes on HuBChE. B2 18-5 was selected for establishment of a stable CHO cell line for production of mouse anti-HuBChE monoclonal. PMID:26343001

  12. Comparison of 5 monoclonal antibodies for immunopurification of human butyrylcholinesterase on Dynabeads: KD values, binding pairs, and amino acid sequences.

    PubMed

    Peng, Hong; Brimijoin, Stephen; Hrabovska, Anna; Targosova, Katarina; Krejci, Eric; Blake, Thomas A; Johnson, Rudolph C; Masson, Patrick; Lockridge, Oksana

    2015-10-05

    Human butyrylcholinesterase (HuBChE) is a stoichiometric bioscavenger of nerve agents and organophosphorus pesticides. Mass spectrometry methods detect stable nerve agent adducts on the active site serine of HuBChE. The first step in sample preparation is immunopurification of HuBChE from plasma. Our goal was to identify monoclonal antibodies that could be used to immunopurify HuBChE on Dynabeads Protein G. Mouse anti-HuBChE monoclonal antibodies were obtained in the form of ascites fluid, dead hybridoma cells stored frozen at -80 °C for 30 years, or recently frozen hybridoma cells. RNA from 4 hybridoma cell lines was amplified by PCR for determination of their nucleotide and amino acid sequences. Full-length light and heavy chains were expressed, and the antibodies purified from culture medium. A fifth monoclonal was purchased. The 5 monoclonal antibodies were compared for ability to capture HuBChE from human plasma on Dynabeads Protein G. In addition, they were evaluated for binding affinity by Biacore and ELISA. Epitope mapping by pairing analysis was performed on the Octet Red96 instrument. The 5 monoclonal antibodies, B2 12-1, B2 18-5, 3E8, mAb2, and 11D8, had similar KD values of 10(-9) M for HuBChE. Monoclonal B2 18-5 outperformed the others in the Dynabeads Protein G assay where it captured 97% of the HuBChE in 0.5 ml plasma. Pairing analysis showed that 3E8 and B2 12-1 share the same epitope, 11D8 and B2 18-5 share the same epitope, but mAb2 and B2 12-1 or mAb2 and 3E8 bind to different epitopes on HuBChE. B2 18-5 was selected for establishment of a stable CHO cell line for production of mouse anti-HuBChE monoclonal.

  13. Structural repertoire in human V{sub L} pseudogenes of immunoglobulins: Comparison with functional germline genes and amino acid sequences

    SciTech Connect

    Almagro, J.C.; Lara-Ochoa, F.; Dominguez-Martinez, V.

    1996-06-01

    Antibody molecules are highly antigen-specific receptors of the immune system. Antigen-antibody interaction involves the antibody V{sub L} and V{sub H} domains, each composed of a framework whose structure is well conserved. The antigen-binding site is composed of six hypervariable loops, three from the V{sub L} domain and three from the V{sub H} domain: L1, L2, L3, and H1, H2, H3, respectively. Genetically, L1 and L2 are encoded by the V{sub L} gene, while L3 is produced by the recombination of an additional gene segment, A. In a similar way, H1 and H2 are encoded by the V{sub H} gene, and H3 is a result of the recombination of two additional gene segments, D and J{sub H}. Analysis of antibodies of known atomic structure has revealed a small number of main-chain conformations or canonical structures for L1, L2, and L3, as well as for H1 and H2. Canonical structures in five of six hypervariable loops imply that only a few main-chain conformations are present in a large set of antibody molecules with different loop sequences. Examination of the known human IGHV, IGKV, and IGLV functional germline genes indicates that most of these sequences have canonical structures. This finding provides evidence concerning structural restrictions at work in the process of antigen recognition. 28 refs., 1 fig., 4 tabs.

  14. Purification and N-terminal amino acid sequence comparisons of structural proteins from retrovirus-D/Washington and Mason-Pfizer monkey virus.

    PubMed Central

    Henderson, L E; Sowder, R; Smythers, G; Benveniste, R E; Oroszlan, S

    1985-01-01

    A new D-type retrovirus originally designated SAIDS-D/Washington and here referred to as retrovirus-D/Washington (R-D/W) was recently isolated at the University of Washington Primate Center, Seattle, Wash., from a rhesus monkey with an acquired immunodeficiency syndrome and retroperitoneal fibromatosis. To better establish the relationship of this new D-type virus to the prototype D-type virus, Mason-Pfizer monkey virus (MPMV), we have purified and compared six structural proteins from each virus. The proteins purified from each D-type retrovirus include p4, p10, p12, p14, p27, and a phosphoprotein designated pp18 for MPMV and pp20 for R-D/W. Amino acid analysis and N-terminal amino acid sequence analysis show that the p4, p12, p14, and p27 proteins of R-D/W are distinct from the homologous proteins of MPMV but that these proteins from the two different viruses share a high degree of amino acid sequence homology. The p10 proteins from the two viruses have similar amino acid compositions, and both are blocked to N-terminal Edman degradation. The phosphoproteins from the two viruses each contain phosphoserine but are different from each other in amino acid composition, molecular weight, and N-terminal amino acid sequence. The data thus show that each of the R-D/W proteins examined is distinguishable from its MPMV homolog and that a major difference between these two D-type retroviruses is found in the viral phosphoproteins. The N-terminal amino acid sequences of D-type retroviral proteins were used to search for sequence homologies between D-type and other retroviral amino acid sequences. An unexpected amino acid sequence homology was found between R-D/W pp20 (a gag protein) and a 28-residue segment of the env precursor polyprotein of Rous sarcoma virus. The N-terminal amino acid sequences of the D-type major gag protein (p27) and the nucleic acid-binding protein (p14) show only limited amino acid sequence homology to functionally homologous proteins of C

  15. Chip-based sequencing nucleic acids

    DOEpatents

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  16. Method and apparatus for biological sequence comparison

    DOEpatents

    Marr, Thomas G.; Chang, William I-Wei

    1997-01-01

    A method and apparatus for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence.

  17. Method and apparatus for biological sequence comparison

    DOEpatents

    Marr, T.G.; Chang, W.I.

    1997-12-23

    A method and apparatus are disclosed for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence. 5 figs.

  18. Sequence comparisons via algorithmic mutual information

    SciTech Connect

    Milosavijevic, A.

    1994-12-31

    One of the main problems in DNA and protein sequence comparisons is to decide whether observed similarity of two sequences should be explained by their relatedness or by mere presence of some shared internal structure, e.g., shared internal tandem repeats. The standard methods that are based on statistics or classical information theory can be used to discover either internal structure or mutual sequence similarity, but cannot take into account both. Consequently, currently used methods for sequence comparison employ {open_quotes}masking{close_quotes} techniques that simply eliminate sequences that exhibit internal repetitive structure prior to sequence comparisons. The {open_quotes}masking{close_quotes} approach precludes discovery of homologous sequences of moderate or low complexity, which abound at both DNA and protein levels. As a solution to this problem, we propose a general method that is based on algorithmic information theory and minimal length encoding. We show that algorithmic mutual information factors out the sequence similarity that is due to shared internal structure and thus enables discovery of truly related sequences. We extend the recently developed algorithmic significance method to show that significance depends exponentially on algorithmic mutual information.

  19. A case of orthologous sequences of hemocyanin subunits for an evolutionary study of horseshoe crabs: amino acid sequence comparison of immunologically identical subunits of Carcinoscorpius rotundicauda and Tachypleus tridentatus.

    PubMed

    Sugita, H; Shishikura, F

    1995-10-01

    About 83% of the amino acid sequence of hemocyanin subunit HR6 from the Southeast Asian horseshoe crab, Carcinoscorpius rotundicauda, has been determined. There is a difference of about 43% between HR6 and complete sequences of chelicerate hemocyanin subunits from the American horseshoe crab, Limulus polyphemus, and a tarantula, Eurypelma californicum. However, the immunologically identical subunits HR6 and HT6 from Tachypleus tridentatus (Japanese horseshoe crab) show 2.7% sequence difference. Based on the amino acid sequences of HR6 and HT6, the divergence between C. rotundicauda and T. tridentatus occurred about 9.6 million years ago. In the case of horseshoe crab hemocyanin subunits, it seems that the orthologous homologues in many homologous subunits between species are immunologically detectable.

  20. Distinguishing Proteins From Arbitrary Amino Acid Sequences

    PubMed Central

    Yau, Stephen S.-T.; Mao, Wei-Guang; Benson, Max; He, Rong Lucy

    2015-01-01

    What kinds of amino acid sequences could possibly be protein sequences? From all existing databases that we can find, known proteins are only a small fraction of all possible combinations of amino acids. Beginning with Sanger's first detailed determination of a protein sequence in 1952, previous studies have focused on describing the structure of existing protein sequences in order to construct the protein universe. No one, however, has developed a criteria for determining whether an arbitrary amino acid sequence can be a protein. Here we show that when the collection of arbitrary amino acid sequences is viewed in an appropriate geometric context, the protein sequences cluster together. This leads to a new computational test, described here, that has proved to be remarkably accurate at determining whether an arbitrary amino acid sequence can be a protein. Even more, if the results of this test indicate that the sequence can be a protein, and it is indeed a protein sequence, then its identity as a protein sequence is uniquely defined. We anticipate our computational test will be useful for those who are attempting to complete the job of discovering all proteins, or constructing the protein universe. PMID:25609314

  1. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  2. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  3. Amino acid sequence of mouse submaxillary gland renin.

    PubMed Central

    Misono, K S; Chang, J J; Inagami, T

    1982-01-01

    The complete amino acid sequences of the heavy chain and light chain of mouse submaxillary gland renin have been determined. The heavy chain consists of 288 amino acid residues having a Mr of 31,036 calculated from the sequence. The light chain contains 48 amino acid residues with a Mr of 5,458. The sequence of the heavy chain was determined by automated Edman degradations of the cyanogen bromide peptides and tryptic peptides generated after citraconylation, as well as other peptides generated therefrom. The sequence of the light chain was derived from sequence analyses of the peptides generated by cyanogen bromide cleavage or by digestion with Staphylococcus aureus protease. The sequences in the active site regions in renin containing two catalytically essential aspartyl residues 32 and 215 were found identical with those in pepsin, chymosin, and penicillopepsin. Comparison of the amino acid sequence of renin with that of porcine pepsin indicated a 42% sequence identity of the heavy chain with the amino-terminal and middle regions and a 46% identity of the light chain with the carboxyl-terminal region of the porcine pepsin sequence. Residues identical in renin and pepsin are distributed throughout the length of the molecules, suggesting a similarity in their overall structures. PMID:6812055

  4. Ct shift: A novel and accurate real-time PCR quantification model for direct comparison of different nucleic acid sequences and its application for transposon quantifications.

    PubMed

    Kolacsek, Orsolya; Pergel, Enikő; Varga, Nóra; Apáti, Ágota; Orbán, Tamás I

    2017-01-20

    There are numerous applications of quantitative PCR for both diagnostic and basic research. As in many other techniques the basis of quantification is that comparisons are made between different (unknown and known or reference) specimens of the same entity. When the aim is to compare real quantities of different species in samples, one cannot escape their separate precise absolute quantification. We have established a simple and reliable method for this purpose (Ct shift method) which combines the absolute and the relative approach. It requires a plasmid standard containing both sequences of amplicons to be compared (e.g. the target of interest and the endogenous control). It can serve as a reference sample with equal copies of templates for both targets. Using the ΔΔCt formula we can quantify the exact ratio of the two templates in each unknown sample. The Ct shift method has been successfully applied for transposon gene copy measurements, as well as for comparison of different mRNAs in cDNA samples. This study provides the proof of concept and introduces some potential applications of the method; the absolute nature of results even without the need for real reference samples can contribute to the universality of the method and comparability of different studies.

  5. Comparison and evaluation of real-time PCR, real-time nucleic acid sequence-based amplification, conventional PCR, and serology for diagnosis of Mycoplasma pneumoniae.

    PubMed

    Templeton, Kate E; Scheltinga, Sitha A; Graffelman, A Willy; Van Schie, Jolanda M; Crielaard, Jantine W; Sillekens, Peter; Van Den Broek, Peterhans J; Goossens, Herman; Beersma, Matthias F C; Claas, Eric C J

    2003-09-01

    Mycoplasma pneumoniae is a common cause of community-acquired pneumonia and lower-respiratory-tract infections. Diagnosis has traditionally been obtained by serological diagnosis, but increasingly, molecular techniques have been applied. However, the number of studies actually comparing these assays is limited. The development of a novel duplex real-time PCR assay for detection of M. pneumoniae in the presence of an internal control real-time PCR is described. In addition, real-time nucleic acid sequence-based amplification (NASBA) on an iCycler apparatus is evaluated. These assays were compared to serology and a conventional PCR assay for 106 clinical samples from patients with lower-respiratory-tract infection. Of the 106 samples, 12 (11.3%) were positive by all the molecular methods whereas serology with acute sample and convalescent samples detected 6 (5.6%) and 9 (8.5%), respectively. Clinical symptoms of the patients with Mycoplasma-positive results were compared to those of the other patients with lower-respiratory-tract infections, and it was found that the results for mean lower age numbers as well as the presence of chills, increased erythrocyte sedimentation rate, and raised C-reactive protein levels showed significant differences. Molecular methods are superior for diagnosis of M. pneumoniae, providing more timely diagnosis. In addition, using real-time methods involves less hands-on time and affords the ability to monitor the reaction in the same tube.

  6. Phenolic acid esterases, coding sequences and methods

    DOEpatents

    Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.

    2002-01-01

    Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.

  7. Amino acid sequence and comparative antigenicity of chicken metallothionein.

    PubMed Central

    McCormick, C C; Fullmer, C S; Garvey, J S

    1988-01-01

    The complete amino acid sequence of metallothionein (MT) from chicken liver is reported. The primary structure was determined by automated sequence analysis of peptides produced by limited acid hydrolysis and by trypsin digestion. The comparative antigenicity of chicken MT was determined by radioimmunoassay using rabbit anti-rat MT polyclonal antibody. Chicken MT consists of 63 amino acids as compared to 61 found in MTs from mammals. One insertion (and two substitutions) occurs in the amino-terminal region, a region considered invariant among mammalian MTs. Eighteen of the 20 cysteines in chicken MT were aligned with cysteines from other mammalian sequences. Two cysteines near the carboxyl terminus are shifted by one residue due to the insertion of proline in that region. Overall, the chicken protein showed approximately equal to 68% sequence identity in a comparison with various mammalian MTs. The affinity of the polyclonal antibody for chicken MT was decreased by 2 orders of magnitude in comparison to that of a mammalian MT (rat MT isoforms). This reduced affinity is attributed to major substitutions in chicken MT in the regions of the principal determinants of mammalian MTs. Theoretical analysis of the primary structure predicted the secondary structure to consist of reverse turns and random coils with no stable beta or helix conformations. There is no evidence that chicken MT differs functionally from mammalian MTs. PMID:2448773

  8. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-07-21

    A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

  9. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

  10. Methods for analyzing nucleic acid sequences

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support Fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  11. Protein sequence comparison and protein evolution

    SciTech Connect

    Pearson, W.R.

    1995-12-31

    This tutorial was one of eight tutorials selected to be presented at the Third International Conference on Intelligent Systems for Molecular Biology which was held in the United Kingdom from July 16 to 19, 1995. This tutorial examines how the information conserved during the evolution of a protein molecule can be used to infer reliably homology, and thus a shared proteinfold and possibly a shared active site or function. The authors start by reviewing a geological/evolutionary time scale. Next they look at the evolution of several protein families. During the tutorial, these families will be used to demonstrate that homologous protein ancestry can be inferred with confidence. They also examine different modes of protein evolution and consider some hypotheses that have been presented to explain the very earliest events in protein evolution. The next part of the tutorial will examine the technical aspects of protein sequence comparison. Both optimal and heuristic algorithms and their associated parameters that are used to characterize protein sequence similarities are discussed. Perhaps more importantly, they survey the statistics of local similarity scores, and how these statistics can both be used to improve the selectivity of a search and to evaluate the significance of a match. They them examine distantly related members of three protein families, the serine proteases, the glutathione transferases, and the G-protein-coupled receptors (GCRs). Finally, the discuss how sequence similarity can be used to examine internal repeated or mosaic structures in proteins.

  12. The amino acid sequences of the Fd fragments of two human γ heavy chains

    PubMed Central

    Press, E. M.; Hogg, N. M.

    1970-01-01

    The amino acid sequences of the Fd fragments of two human pathological immunoglobulins of the immunoglobulin G1 class are reported. Comparison of the two sequences shows that the heavy-chain variable regions are similar in length to those of the light chains. The existence of heavy chain variable region subgroups is also deduced, from a comparison of these two sequences with those of another γ 1 chain, Eu, a μ chain, Ou, and the partial sequence of a fourth γ 1 chain, Ste. Carbohydrate has been found to be linked to an aspartic acid residue in the variable region of one of the γ 1 chains, Cor. PMID:5449120

  13. Amino acid sequence of bovine heart coupling factor 6.

    PubMed Central

    Fang, J K; Jacobs, J W; Kanner, B I; Racker, E; Bradshaw, R A

    1984-01-01

    The amino acid sequence of bovine heart mitochondrial coupling factor 6 (F6) has been determined by automated Edman degradation of the whole protein and derived peptides. Preparations based on heat precipitation and ethanol extraction showed allotypic variation at three positions while material further purified by HPLC yielded only one sequence that also differed by a Phe-Thr replacement at residue 62. The mature protein contains 76 amino acids with a calculated molecular weight of 9006 and a pI of approximately equal to 5, in good agreement with experimentally measured values. The charged amino acids are mainly clustered at the termini and in one section in the middle; these three polar segments are separated by two segments relatively rich in nonpolar residues. Chou-Fasman analysis suggests three stretches of alpha-helix coinciding (or within) the high-charge-density sequences with a single beta-turn at the first polar-nonpolar junction. Comparison of the F6 sequence with those of other proteins did not reveal any homologous structures. PMID:6149548

  14. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid...

  15. Detection of nucleic acid sequences by invader-directed cleavage

    DOEpatents

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.

  16. Los Alamos sequence analysis package for nucleic acids and proteins.

    PubMed Central

    Kanehisa, M I

    1982-01-01

    An interactive system for computer analysis of nucleic acid and protein sequences has been developed for the Los Alamos DNA Sequence Database. It provides a convenient way to search or verify various sequence features, e.g., restriction enzyme sites, protein coding frames, and properties of coded proteins. Further, the comprehensive analysis package on a large-scale database can be used for comparative studies on sequence and structural homologies in order to find unnoted information stored in nucleic acid sequences. PMID:6174934

  17. Hybridization and sequencing of nucleic acids using base pair mismatches

    DOEpatents

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  18. Amino acid sequence of a mouse immunoglobulin mu chain.

    PubMed Central

    Kehry, M; Sibley, C; Fuhrman, J; Schilling, J; Hood, L E

    1979-01-01

    The complete amino acid sequence of the mouse mu chain from the BALB/c myeloma tumor MOPC 104E is reported. The C mu region contains four consecutive homology regions of approximately 110 residues and a COOH-terminal region of 19 residues. A comparison of this mu chain from mouse with a complete mu sequence from human (Ou) and a partial mu chain sequence from dog (Moo) reveals a striking gradient of increasing homology from the NH2-terminal to the COOH-terminal portion of these mu chains, with the former being the least and the latter the most highly conserved. Four of the five sites of carbohydrate attachment appear to be at identical residue positions when the constant regions of the mouse and human mu chains are compared. The mu chain of MOPC 104E has a carbohydrate moiety attached in the second hypervariable region. This is particularly interesting in view of the fact that MOPC 104E binds alpha-(1 leads to 3)-dextran, a simple carbohydrate. The structural and functional constraints imposed by these comparative sequence analyses are discussed. PMID:111247

  19. Comparison of Next-Generation Sequencing Systems

    PubMed Central

    Liu, Lin; Li, Yinhu; Li, Siliang; Hu, Ni; He, Yimin; Pong, Ray; Lin, Danni; Lu, Lihua; Law, Maggie

    2012-01-01

    With fast development and wide applications of next-generation sequencing (NGS) technologies, genomic sequence information is within reach to aid the achievement of goals to decode life mysteries, make better crops, detect pathogens, and improve life qualities. NGS systems are typically represented by SOLiD/Ion Torrent PGM from Life Sciences, Genome Analyzer/HiSeq 2000/MiSeq from Illumina, and GS FLX Titanium/GS Junior from Roche. Beijing Genomics Institute (BGI), which possesses the world's biggest sequencing capacity, has multiple NGS systems including 137 HiSeq 2000, 27 SOLiD, one Ion Torrent PGM, one MiSeq, and one 454 sequencer. We have accumulated extensive experience in sample handling, sequencing, and bioinformatics analysis. In this paper, technologies of these systems are reviewed, and first-hand data from extensive experience is summarized and analyzed to discuss the advantages and specifics associated with each sequencing system. At last, applications of NGS are summarized. PMID:22829749

  20. New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing

    PubMed Central

    Song, Kai; Ren, Jie; Reinert, Gesine; Deng, Minghua

    2014-01-01

    With the development of next-generation sequencing (NGS) technologies, a large amount of short read data has been generated. Assembly of these short reads can be challenging for genomes and metagenomes without template sequences, making alignment-based genome sequence comparison difficult. In addition, sequence reads from NGS can come from different regions of various genomes and they may not be alignable. Sequence signature-based methods for genome comparison based on the frequencies of word patterns in genomes and metagenomes can potentially be useful for the analysis of short reads data from NGS. Here we review the recent development of alignment-free genome and metagenome comparison based on the frequencies of word patterns with emphasis on the dissimilarity measures between sequences, the statistical power of these measures when two sequences are related and the applications of these measures to NGS data. PMID:24064230

  1. The amino acid sequence of rabbit cardiac troponin I.

    PubMed Central

    Grand, R J; Wilkinson, J M

    1976-01-01

    The complete amino acid sequence of troponin I from rabbit cardiac muscle was determined by the isolation of four unique CNBr fragments, together with overlapping tryptic peptides containing radioactive methionine residues. Overlap data for residues 35-36, 93-94 and 140-145 are incomplete, the sequence at these positions being based on homology with the sequence of the fast-skeletal-muscle protein. Cardiac troponin I is a single polypeptide chain of 206 residues with mol.wt. 23550 and an extinction coefficient, E 1%,1cm/280, of 4.37. The protein has a net positive charge of 14 and is thus somewhat more basic than troponin I from fast-skeletal muscle. Comparison of the sequences of troponin I from cardiac and fast skeletal muscle show that the cardiac protein has 26 extra residues at the N-terminus which account for the larger size of the protein. In the remainder of sequence there is a considerable degree of homology, this being greater in the C-terminal two-thirds of the molecule. The region in the cardiac protein corresponding to the peptide with inhibitory activity from the fast-skeletal-muscle protein is very similar and it seems unlikely that this is the cause of the difference in inhibitory activity between the two proteins. The region responsible for binding troponin C, however, possesses a lower degree of homology. Detailed evidence on which the sequence is based has been deposited as Supplementary Publication SUP 50072 (20 pages), at the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7QB, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1976) 153, 5. PMID:1008822

  2. Parallel Computation of Multiple Biological Sequence Comparisons

    DTIC Science & Technology

    1989-07-01

    Stearothermophilus 408 Bacillus Megaterium 411 Bacillus Brevis 354 Pseudomonas Fluorescens 375 Salmonella Typhi 377 Escherichia Coli 282 Saccharomyces Octosporus...This included implied secondary structure and conservation of pairs of nucleotides that are complementary. The first four sequences are all Bacillus ...need to obtain sequences of ribonuclease P RNA from additional species to provide a more 13 Length Name 401 Bacillus Subtilis 417 Bacillus

  3. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2006-07-04

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  4. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2002-01-01

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  5. Kit for detecting nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2001-01-01

    A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the

  6. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    States, David J.

    2004-07-28

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  7. Solid phase sequencing of double-stranded nucleic acids

    DOEpatents

    Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

    2002-01-01

    This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probe comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.

  8. Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC

    NASA Astrophysics Data System (ADS)

    Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.

    2000-02-01

    Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.

  9. Amino acid sequence of anionic peroxidase from the windmill palm tree Trachycarpus fortunei.

    PubMed

    Baker, Margaret R; Zhao, Hongwei; Sakharov, Ivan Yu; Li, Qing X

    2014-12-10

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications.

  10. Intra-species sequence comparisons for annotating genomes

    SciTech Connect

    Boffelli, Dario; Weer, Claire V.; Weng, Li; Lewis, Keith D.; Shoukry, Malak I.; Pachter, Lior; Keys, David N.; Rubin, Edward M.

    2004-07-15

    Analysis of sequence variation among members of a single species offers a potential approach to identify functional DNA elements responsible for biological features unique to that species. Due to its high rate of allelic polymorphism and ease of genetic manipulability, we chose the sea squirt, Ciona intestinalis, to explore intra-species sequence comparisons for genome annotation. A large number of C. intestinalis specimens were collected from four continents and a set of genomic intervals amplified, resequenced and analyzed to determine the mutation rates at each nucleotide in the sequence. We found that regions with low mutation rates efficiently demarcated functionally constrained sequences: these include a set of noncoding elements, which we showed in C intestinalis transgenic assays to act as tissue-specific enhancers, as well as the location of coding sequences. This illustrates that comparisons of multiple members of a species can be used for genome annotation, suggesting a path for the annotation of the sequenced genomes of organisms occupying uncharacterized phylogenetic branches of the animal kingdom and raises the possibility that the resequencing of a large number of Homo sapiens individuals might be used to annotate the human genome and identify sequences defining traits unique to our species. The sequence data from this study has been submitted to GenBank under accession nos. AY667278-AY667407.

  11. Alignment-free sequence comparison based on next-generation sequencing reads.

    PubMed

    Song, Kai; Ren, Jie; Zhai, Zhiyuan; Liu, Xuemei; Deng, Minghua; Sun, Fengzhu

    2013-02-01

    Next-generation sequencing (NGS) technologies have generated enormous amounts of shotgun read data, and assembly of the reads can be challenging, especially for organisms without template sequences. We study the power of genome comparison based on shotgun read data without assembly using three alignment-free sequence comparison statistics, D(2), D(*)(2) and D(s)(2), both theoretically and by simulations. Theoretical formulas for the power of detecting the relationship between two sequences related through a common motif model are derived. It is shown that both D(*)(2) and D(s)(2), outperform D(2) for detecting the relationship between two sequences based on NGS data. We then study the effects of length of the tuple, read length, coverage, and sequencing error on the power of D(*)(2) and D(s)(2). Finally, variations of these statistics, d(2), d(*)(2) and d(s)(2), respectively, are used to first cluster five mammalian species with known phylogenetic relationships, and then cluster 13 tree species whose complete genome sequences are not available using NGS shotgun reads. The clustering results using d(s)(2) are consistent with biological knowledge for the 5 mammalian and 13 tree species, respectively. Thus, the statistic d(s)(2) provides a powerful alignment-free comparison tool to study the relationships among different organisms based on NGS read data without assembly.

  12. Amino Acid Sequence of Human Cholinesterase

    DTIC Science & Technology

    1985-10-01

    liquid chromatography (HPLC). Activity testing of the aged, DFP-labeled cholinesterase showed that 99.8% of the active sites had been labeled, since...acids were quantitated by ninhydrin at the AAA Labs, or by derivatization with phenylisothiocyanate at the University of Michigan. The latter method

  13. Cystatin. Amino acid sequence and possible secondary structure.

    PubMed Central

    Schwabe, C; Anastasi, A; Crow, H; McDonald, J K; Barrett, A J

    1984-01-01

    The amino acid sequence of cystatin, the protein from chicken egg-white that is a tight-binding inhibitor of many cysteine proteinases, is reported. Cystatin is composed of 116 amino acid residues, and the Mr is calculated to be 13 143. No striking similarity to any other known sequence has been detected. The results of computer analysis of the sequence and c.d. spectrometry indicate that the secondary structure includes relatively little alpha-helix (about 20%) and that the remainder is mainly beta-structure. PMID:6712597

  14. Comparison of mitochondrial genome sequences of pangolins (Mammalia, Pholidota).

    PubMed

    Hassanin, Alexandre; Hugot, Jean-Pierre; van Vuuren, Bettine Jansen

    2015-04-01

    The complete mitochondrial genome was sequenced for three species of pangolins, Manis javanica, Phataginus tricuspis, and Smutsia temminckii, and comparisons were made with two other species, Manis pentadactyla and Phataginus tetradactyla. The genome of Manidae contains the 37 genes found in a typical mammalian genome, and the structure of the control region is highly conserved among species. In Manis, the overall base composition differs from that found in African genera. Phylogenetic analyses support the monophyly of the genera Manis, Phataginus, and Smutsia, as well as the basal division between Maninae and Smutsiinae. Comparisons with GenBank sequences reveal that the reference genomes of M. pentadactyla and P. tetradactyla (accession numbers NC_016008 and NC_004027) were sequenced from misidentified taxa, and that a new species of tree pangolin should be described in Gabon.

  15. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations.

    PubMed

    Abascal, Federico; Zardoya, Rafael; Telford, Maximilian J

    2010-07-01

    We present TranslatorX, a web server designed to align protein-coding nucleotide sequences based on their corresponding amino acid translations. Many comparisons between biological sequences (nucleic acids and proteins) involve the construction of multiple alignments. Alignments represent a statement regarding the homology between individual nucleotides or amino acids within homologous genes. As protein-coding DNA sequences evolve as triplets of nucleotides (codons) and it is known that sequence similarity degrades more rapidly at the DNA than at the amino acid level, alignments are generally more accurate when based on amino acids than on their corresponding nucleotides. TranslatorX novelties include: (i) use of all documented genetic codes and the possibility of assigning different genetic codes for each sequence; (ii) a battery of different multiple alignment programs; (iii) translation of ambiguous codons when possible; (iv) an innovative criterion to clean nucleotide alignments with GBlocks based on protein information; and (v) a rich output, including Jalview-powered graphical visualization of the alignments, codon-based alignments coloured according to the corresponding amino acids, measures of compositional bias and first, second and third codon position specific alignments. The TranslatorX server is freely available at http://translatorx.co.uk.

  16. Mouse Vk gene classification by nucleic acid sequence similarity.

    PubMed

    Strohal, R; Helmberg, A; Kroemer, G; Kofler, R

    1989-01-01

    Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.

  17. Amphioxus mitochondrial DNA, chordate phylogeny, and the limits of inference based on comparisons of sequences.

    PubMed

    Naylor, G J; Brown, W M

    1998-03-01

    Analyses of both the nucleotide and amino acid sequences derived from all 13 mitochondrial protein-encoding genes (12,234 bp) of 19 metazoan species, including that of the lancelet Branchiostoma floridae ("amphioxus"), fail to yield the widely accepted phylogeny for chordates and, within chordates, for vertebrates. Given the breadth and the compelling nature of the data supporting that phylogeny, relationships supported by the mitochondrial sequence comparisons are almost certainly incorrect, despite their being supported by equally weighted parsimony, distance, and maximum-likelihood analyses. The incorrect groupings probably result in part from convergent base-compositional similarities among some of the taxa, similarities that are strong enough to overwhelm the historical signal. Comparisons among very distantly related taxa are likely to be particularly susceptible to such artifacts, because the historical signal is already greatly attenuated. Empirical results underscore the need for approaches to phylogenetic inference that go beyond simple site-by-site comparison of aligned sequences. This study and others indicate that, once a sequence sample of reasonable size has been obtained, accurate phylogenetic estimation may be better served by incorporating knowledge of molecular structures and processes into inference models and by seeking additional higher order characters embedded in those sequences, than by gathering ever larger sequence samples from the same organisms in he hope that the historical signal will eventually prevail.

  18. Complete cDNA and derived amino acid sequence of human factor V

    SciTech Connect

    Jenny, R.J.; Pittman, D.D.; Toole, J.J.; Kriz, R.W.; Aldape, R.A.; Hewick, R.M.; Kaufman, R.J.; Mann, K.G.

    1987-07-01

    cDNA clones encoding human factor V have been isolated from an oligo(dT)-primed human fetal liver cDNA library prepared with vector Charon 21A. The cDNA sequence of factor V from three overlapping clones includes a 6672-base-pair (bp) coding region, a 90-bp 5' untranslated region, and a 163-bp 3' untranslated region within which is a poly(A)tail. The deduced amino acid sequence consists of 2224 amino acids inclusive of a 28-amino acid leader peptide. Direct comparison with human factor VIII reveals considerable homology between proteins in amino acid sequence and domain structure: a triplicated A domain and duplicated C domain show approx. 40% identity with the corresponding domains in factor VIII. As in factor VIII, the A domains of factor V share approx. 40% amino acid-sequence homology with the three highly conserved domains in ceruloplasmin. The B domain of factor V contains 35 tandem and approx. 9 additional semiconserved repeats of nine amino acids of the form Asp-Leu-Ser-Gln-Thr-Thr/Asn-Leu-Ser-Pro and 2 additional semiconserved repeats of 17 amino acids. Factor V contains 37 potential N-linked glycosylation sites, 25 of which are in the B domain, and a total of 19 cysteine residues.

  19. In silico comparative analysis of DNA and amino acid sequences for prion protein gene.

    PubMed

    Kim, Y; Lee, J; Lee, C

    2008-01-01

    Genetic variability might contribute to species specificity of prion diseases in various organisms. In this study, structures of the prion protein gene (PRNP) and its amino acids were compared among species of which sequence data were available. Comparisons of PRNP DNA sequences among 12 species including human, chimpanzee, monkey, bovine, ovine, dog, mouse, rat, wallaby, opossum, chicken and zebrafish allowed us to identify candidate regulatory regions in intron 1 and 3'-untranslated region (UTR) in addition to the coding region. Highly conserved putative binding sites for transcription factors, such as heat shock factor 2 (HSF2) and myocite enhancer factor 2 (MEF2), were discovered in the intron 1. In 3'-UTR, the functional sequence (ATTAAA) for nucleus-specific polyadenylation was found in all the analysed species. The functional sequence (TTTTTAT) for maturation-specific polyadenylation was identically observed only in ovine, and one or two nucleotide mismatches in the other species. A comparison of the amino acid sequences in 53 species revealed a large sequence identity. Especially the octapeptide repeat region was observed in all the species but frog and zebrafish. Functional changes and susceptibility to prion diseases with various isoforms of prion protein could be caused by numeric variability and conformational changes discovered in the repeat sequences.

  20. Beyond Linear Sequence Comparisons: The use of genome-levelcharacters for phylogenetic reconstruction

    SciTech Connect

    Boore, Jeffrey L.

    2004-11-27

    Although the phylogenetic relationships of many organisms have been convincingly resolved by the comparisons of nucleotide or amino acid sequences, others have remained equivocal despite great effort. Now that large-scale genome sequencing projects are sampling many lineages, it is becoming feasible to compare large data sets of genome-level features and to develop this as a tool for phylogenetic reconstruction that has advantages over conventional sequence comparisons. Although it is unlikely that these will address a large number of evolutionary branch points across the broad tree of life due to the infeasibility of such sampling, they have great potential for convincingly resolving many critical, contested relationships for which no other data seems promising. However, it is important that we recognize potential pitfalls, establish reasonable standards for acceptance, and employ rigorous methodology to guard against a return to earlier days of scenario-driven evolutionary reconstructions.

  1. Metabolic pathways variability and sequence/networks comparisons

    PubMed Central

    Tun, Kyaw; Dhar, Pawan K; Palumbo, Maria Concetta; Giuliani, Alessandro

    2006-01-01

    Background In this work a simple method for the computation of relative similarities between homologous metabolic network modules is presented. The method is similar to classical sequence alignment and allows for the generation of phenotypic trees amenable to be compared with correspondent sequence based trees. The procedure can be applied to both single metabolic modules and whole metabolic network data without the need of any specific assumption. Results We demonstrate both the ability of the proposed method to build reliable biological classification of a set of microrganisms and the strong correlation between the metabolic network wiringand involved enzymes sequence space. Conclusion The method represents a valuable tool for the investigation of genotype/phenotype correlationsallowing for a direct comparison of different species as for their metabolic machinery. In addition the detection of enzymes whose sequence space is maximally correlated with the metabolicnetwork space gives an indication of the most crucial (on an evolutionary viewpoint) steps of the metabolic process. PMID:16420696

  2. Sequence information signal processor for local and global string comparisons

    DOEpatents

    Peterson, John C.; Chow, Edward T.; Waterman, Michael S.; Hunkapillar, Timothy J.

    1997-01-01

    A sequence information signal processing integrated circuit chip designed to perform high speed calculation of a dynamic programming algorithm based upon the algorithm defined by Waterman and Smith. The signal processing chip of the present invention is designed to be a building block of a linear systolic array, the performance of which can be increased by connecting additional sequence information signal processing chips to the array. The chip provides a high speed, low cost linear array processor that can locate highly similar global sequences or segments thereof such as contiguous subsequences from two different DNA or protein sequences. The chip is implemented in a preferred embodiment using CMOS VLSI technology to provide the equivalent of about 400,000 transistors or 100,000 gates. Each chip provides 16 processing elements, and is designed to provide 16 bit, two's compliment operation for maximum score precision of between -32,768 and +32,767. It is designed to provide a comparison between sequences as long as 4,194,304 elements without external software and between sequences of unlimited numbers of elements with the aid of external software. Each sequence can be assigned different deletion and insertion weight functions. Each processor is provided with a similarity measure device which is independently variable. Thus, each processor can contribute to maximum value score calculation using a different similarity measure.

  3. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences

    PubMed Central

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D.; Adir, Noam

    2016-01-01

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel. PMID:27307442

  4. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

    PubMed

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D; Adir, Noam

    2016-06-28

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel.

  5. Cytochrome oxidase subunit III from Arbacia lixula: detection of functional constraints by comparison with homologous sequences.

    PubMed

    De Giorgi, C; Martiradonna, A; Saccone, C

    1993-01-01

    In this paper we report the comparison of the sequences of the cytochrome oxidase subunit III from three different sea urchin species. Both nucleotide and amino acid sequences have been analyzed. The nucleotide sequence analysis reveals that the sea urchin sequences obey some rules already found in mammals. The base substitution analysis carried out on the sequences of the three species pairs, shows that the evolutionary dynamics of the first and the second codon positions are so slow that do not allow a quantitative measurement of their genetic distances, thus demonstrating that also in these species the COIII gene is strongly conserved during evolution. Changes occurring at the third codon positions indicate that the three species evolved from a common ancestor under different directional mutational pressure. The multi-alignment of the sea urchin proteins indicates the existence of the amino acid sequence motif N R T that represents a possible glycosylation site. Another glycosylation site has been detected in the mammalian cytochrome oxidase subunit III, in a position slightly different. Such an analysis revealed, for the first time, a new functional aspect of this sequence.

  6. Amino acid sequences of proteins from Leptospira serovar pomona.

    PubMed

    Alves, S F; Lefebvre, R B; Probert, W

    2000-01-01

    This report describes a partial amino acid sequences from three putative outer envelope proteins from Leptospira serovar pomona. In order to obtain internal fragments for protein sequencing, enzymatic and chemical digestion was performed. The enzyme clostripain was used to digest the proteins 32 and 45 kDa. In situ digestion of 40 kDa molecular weight protein was accomplished using cyanogen bromide. The 32 kDa protein generated two fragments, one of 21 kDa and another of 10 kDa that yielded five residues. A fragment of 24 kDa that yielded nineteen residues of amino acids was obtained from 45 kDa protein. A fragment with a molecular weight of 20 kDa, yielding a twenty amino acids sequence from the 40 kDa protein.

  7. Extensive amino acid sequence homologies between animal lectins

    SciTech Connect

    Paroutaud, P.; Levi, G.; Teichberg, V.I.; Strosberg, A.D.

    1987-09-01

    The authors have established the amino acid sequence of the ..beta..-D-galactoside binding lectin from the electric eel and the sequences of several peptides from a similar lectin isolated from human placenta. These sequences were compared with the published sequences of peptides derived from the ..beta..-D-galactoside binding lectin from human lung and with sequences deduced from cDNAs assigned to the ..beta..-D-galactoside binding lectins from chicken embryo skin and human hepatomas. Significant homologies were observed. One of the highly conserved regions that contains a tryptophan residue and two glutamic acid resides is probably part of the ..beta..-D-galactoside binding site, which, on the basis of spectroscopic studies of the electric eel lectin, is expected to contain such residues. The similarity of the hydropathy profiles and the predicted secondary structure of the lectins from chicken skin and electric eel, in spite of differences in their amino acid sequences, strongly suggests that these proteins have maintained structural homologies during evolution and together with the other ..beta..-D-galactoside binding lectins were derived form a common ancestor gene.

  8. Amino acid sequence of porcine spleen cathepsin D.

    PubMed Central

    Shewale, J G; Tang, J

    1984-01-01

    The amino acid sequence of porcine spleen cathepsin D heavy chain has been determined and, hence, the complete structure of this enzyme is now known. The sequence of heavy chain was constructed by aligning the structures of peptides generated by cyanogen bromide, trypsin, and endo-proteinase Lys C cleavages. The structure of the light chain has been published previously. The cathepsin D molecule contains 339 amino acid residues in two polypeptide chains: a 97-residue light chain and a 242-residue heavy chain, with a combined Mr of 36,779 (without carbohydrate). There are two carbohydrate units linked to asparagine residues 70 and 192. The disulfide bond arrangement in cathepsin D is probably similar to that of pepsin, because the positions of six half-cystine residues are conserved. The active site aspartyl residues, corresponding to aspartic acid-32 and -215 of pepsin, are located at residues 33 and 224 in the cathepsin D molecule. The amino acid sequence around these aspartyl residues is strongly conserved. Cathepsin D shows a strong homology with other acid proteases. When the sequence of cathepsin D, renin, and pepsin are aligned, 32.7% of the residues are identical. The homology is observed throughout the length of the molecules, indicating that three-dimensional structures of all three molecules are similar. PMID:6587385

  9. The amino acid sequence of Lady Amherst's pheasant (Chrysolophus amherstiae) and golden pheasant (Chrysolophus pictus) egg-white lysozymes.

    PubMed

    Araki, T; Kuramoto, M; Torikata, T

    1990-09-01

    The amino acids of Lady Amherst's pheasant and golden pheasant egg-white lysozymes have been sequenced. The carboxymethylated lysozymes were digested with trypsin followed by sequencing of the tryptic peptides. Lady Amherst's pheasant lysozyme proved to consist of 129 amino acid residues, and a relative molecular mass of 14,423 Da was calculated. This lysozyme had 6 amino acids substitutions when compared with hen egg-white lysozyme: Phe3 to Tyr, His15 to Leu, Gln41 to His, Asn77 to His, Gln 121 to Asn, and a newly found substitution of Ile124 to Thr. The amino acid sequence of golden pheasant lysozyme was identical to that of Lady Amherst's phesant lysozyme. The phylogenetic tree constructured by the comparison of amino acid sequences of phasianoid birds lysozymes revealed a minimum genetic distance between these pheasants and the turkey-peafowl group.

  10. Comparison of latent and nominal rabbit Ig VHa1 allotype cDNA sequences.

    PubMed

    McCormack, W T; Dhanarajan, P; Roux, K H

    1988-09-15

    The genetic basis for the expression of a latent VH allotype in the rabbit was investigated. VH region cDNA libraries were produced from spleen mRNA derived from a homozygous a2a2 rabbit expressing an induced latent VHa1 allotype and, for comparison, from a normal homozygus a1a1 rabbit expressing nominal VHa1 allotype. The deduced amino acid sequences of the nominal VHa1 cDNA were concordant with previously published VHa1 protein sequences. A comparison of two complete VH-DH-JH and six partial VHa1 sequences reveals highly conserved sequence within VH framework regions (FR) and considerable diversity in complementarity-determining regions and D region sequences. Two functional JH genes or alleles are evident. Amino acid sequencing of the N-terminal 15 residues of pooled affinity-purified latent VHa1 H chain showed complete sequence identity with the nominal VHa1 sequences. Possible latent VHa1-encoding cDNA clones, derived from the a2a2 rabbit, were selected by hybridization with oligonucleotide probes corresponding to the VHa1 allotype-associated segments of the first and third framework regions (FR1 and FR3). cDNA sequence analysis reveals that the 5' untranslated regions of nominal and latent VHa1 cDNA were virtually identical to each other and to previously reported sequences associated with VHa2 and VHa-negative genes. Moreover, some latent VHa1 genes encode FR1 segments that are essentially homologous to the corresponding segment of a nominal VHa1 allotype. In contrast, other putative latent genes display blocks of VHa1 sequence in either FR1 or FR3 that are flanked by blocks of sequence identical to other rabbit VH genes (i.e., VHa2 or VHa-negative). These composite sequences may be directly encoded by composite germ-line VH genes or may be the products of somatically generated recombination or gene conversion between genes encoding latent and nominal allotypes. The data do not support the hypothesis that latent genes are the result of extensive modification

  11. Amino-Acid Sequence of NADP-Specific Glutamate Dehydrogenase of Neurospora crassa

    PubMed Central

    Wootton, John C.; Chambers, Geoffrey K.; Holder, Anthony A.; Baron, Andrew J.; Taylor, John G.; Fincham, John R. S.; Blumenthal, Kenneth M.; Moon, Kenneth; Smith, Emil L.

    1974-01-01

    A tentative primary structure of the NADP-specific glutamate dehydrogenase [L-glutamate: NADP oxidoreductase (deaminating), EC 1.4.1.4] from Neurospora crassa has been determined. The proposed sequence contains 452 amino-acid residues in each of the identical subunits of the hexameric enzyme. Comparison of the sequence with that of the bovine liver enzyme reveals considerable homology in the amino-terminal portion of the chain, including the vicinity of the reactive lysine, with only shorter stretches of homology within the carboxyl-terminal regions. The significance of this distribution of homologous regions is discussed. PMID:4155068

  12. Active site amino acid sequence of human factor D.

    PubMed

    Davis, A E

    1980-08-01

    Factor D was isolated from human plasma by chromatography on CM-Sephadex C50, Sephadex G-75, and hydroxylapatite. Digestion of reduced, S-carboxymethylated factor D with cyanogen bromide resulted in three peptides which were isolated by chromatography on Sephadex G-75 (superfine) equilibrated in 20% formic acid. NH2-Terminal sequences were determined by automated Edman degradation with a Beckman 890C sequencer using a 0.1 M Quadrol program. The smallest peptide (CNBr III) consisted of the NH2-terminal 14 amino acids. The other two peptides had molecular weights of 17,000 (CNBr I) and 7000 (CNBr II). Overlap of the NH2-terminal sequence of factor D with the NH2-terminal sequence of CNBr I established the order of the peptides. The NH2-terminal 53 residues of factor D are somewhat more homologous with the group-specific protease of rat intestine than with other serine proteases. The NH2-terminal sequence of CNBr II revealed the active site serine of factor D. The typical serine protease active site sequence (Gly-Asp-Ser-Gly-Gly-Pro was found at residues 12-17. The region surrounding the active site serine does not appear to be more highly homologous with any one of the other serine proteases. The structural data obtained point out the similarities between factor D and the other proteases. However, complete definition of the degree of relationship between factor D and other proteases will require determination of the remainder of the primary structure.

  13. SUBGROUPS OF AMINO ACID SEQUENCES IN THE VARIABLE REGIONS OF IMMUNOGLOBULIN HEAVY CHAINS*

    PubMed Central

    Cunningham, Bruce A.; Pflumm, Mollie N.; User, Urs Rutisha; Edelman, Gerald M.

    1969-01-01

    The amino acid sequence of the first 133 residues of the heavy (γ) chain from a human γG immunoglobulin (He) has been determined. This γ-chain is identical in Gm type to that of protein Eu, the complete sequence of which has been reported. Comparison of the two sequences substantiates the previous suggestion that there are subgroups of variable regions of heavy chains. The variable region of Eu has been assigned to subgroup I and that of He to subgroup II; on the other hand, the constant regions of the two proteins appear to be identical. Comparison of the sequence of the heavy chain of He with the heavy chain sequences determined in other laboratories suggests that the variable region of subgroup II is at least 118 residues long. The nature and distribution of amino acid variations in this heavy chain subgroup resemble those observed in light chain subgroups. These studies provide evidence that the translocation hypothesis applies to heavy as well as to light chains, viz., genes for variable regions (V) are somatically translocated to genes for constant regions (C) to form complete VC structural genes. Images PMID:5264153

  14. Nucleotide sequence of the luxC gene encoding fatty acid reductase of the lux operon from Photobacterium leiognathi.

    PubMed

    Lin, J W; Chao, Y F; Weng, S F

    1993-02-26

    The nucleotide sequence of the luxC gene (EMBL Accession No. 65156) encoding fatty acid reductase (FAR) of the lux operon from Photobacterium leiognathi PL741 was determined and the encoded amino acid sequence deduced. The fatty acid reductase is a component of the fatty acid reductase complex. The complex is responsible for converting fatty acid to aldehyde which serves as the substrate in the luciferase-catalyzed bioluminescent reaction. The protein comprises 478 amino acid residues and has a calculated M(r) of 53,858. Alignment and comparison of the fatty acid reductase of P. leiognathi with that of Vibrio harveyi B392 and Vibrio fischeri ATCC 7744 shows that there is 70% and 59% amino acid residues identity, respectively.

  15. The amino acid sequence of iguana (Iguana iguana) pancreatic ribonuclease.

    PubMed

    Zhao, W; Beintema, J J; Hofsteenge, J

    1994-01-15

    The pyrimidine-specific ribonuclease superfamily constitutes a group of homologous proteins so far found only in higher vertebrates. Four separate families are found in mammals, which have resulted from gene duplications in mammalian ancestors. To learn more about the evolutionary history of this superfamily, the primary structure and other characteristics of the pancreatic enzyme from iguana (Iguana iguana), a herbivorous lizard species belonging to the reptiles, have been determined. The polypeptide chain consists of 119 amino acid residues. The positions of insertions and deletions in the sequence are identical to those in the enzyme from snapping turtle. However, the two enzymes differ at 54% of the amino acid positions. Iguana ribonuclease contains no carbohydrate, although the enzyme possesses three recognition sites for carbohydrate attachment, and has a high number of acidic residues in a localized part of the sequence.

  16. Evaluation of integrated anaerobic/aerobic fixed-bed sequencing batch biofilm reactor for decolorization and biodegradation of azo dye acid red 18: comparison of using two types of packing media.

    PubMed

    Hosseini Koupaie, E; Alavi Moghaddam, M R; Hashemi, S H

    2013-01-01

    Two integrated anaerobic/aerobic fixed-bed sequencing batch biofilm reactor (FB-SBBR) were operated to evaluate decolorization and biodegradation of azo dye Acid Red 18 (AR18). Volcanic pumice stones and a type of plastic media made of polyethylene were used as packing media in FB-SBBR1 and FB-SBBR2, respectively. Decolorization of AR18 in both reactors followed first-order kinetic with respect to dye concentration. More than 63.7% and 71.3% of anaerobically formed 1-naphthylamine-4-sulfonate (1N-4S), as one of the main sulfonated aromatic constituents of AR18 was removed during the aerobic reaction phase in FB-SBBR1 and FB-SBBR2, respectively. Based on statistical analysis, performance of FB-SBBR2 in terms of COD removal as well as biodegradation of 1N-4S was significantly higher than that of FB-SBBR1. Spherical and rod shaped bacteria were the dominant species of bacteria in the biofilm grown on the pumice stones surfaces, while, the biofilm grown on surfaces of the polyethylene media had a fluffy structure.

  17. Comparison of Levels of Human Immunodeficiency Virus Type 1 RNA in Plasma as Measured by the NucliSens Nucleic Acid Sequence-Based Amplification and Quantiplex Branched-DNA Assays

    PubMed Central

    Ginocchio, C. C.; Tetali, S.; Washburn, D.; Zhang, F.; Kaplan, M. H.

    1999-01-01

    This study compared levels of human immunodeficiency virus type 1 RNA in plasma as measured by the Quantiplex branched-DNA and NucliSens nucleic acid sequence-based amplification assays. RNA was detectable in 118 of 184 samples (64.13%) by the Quantiplex assay and in 171 of 184 samples (92.94%) by the NucliSens assay. Regression analysis indicated that a linear relationship existed between the two sets of values (P < 0.0001), although the Quantiplex and NucliSens values were significantly different (P < 0.001), with the NucliSens values being approximately 0.323 log higher. Spearman correlation analysis indicated that the overall changes in patient viral load patterns were highly correlative between the two assays: r = 0.912, P < 0.0001. The lower limits of sensitivity were determined to be approximately 100 copies/ml and 1,200 to 1,400 copies/ml for the NucliSens and Quantiplex assays, respectively. PMID:10074556

  18. Comparison of levels of human immunodeficiency virus type 1 RNA in plasma as measured by the NucliSens nucleic acid sequence-based amplification and Quantiplex branched-DNA assays.

    PubMed

    Ginocchio, C C; Tetali, S; Washburn, D; Zhang, F; Kaplan, M H

    1999-04-01

    This study compared levels of human immunodeficiency virus type 1 RNA in plasma as measured by the Quantiplex branched-DNA and NucliSens nucleic acid sequence-based amplification assays. RNA was detectable in 118 of 184 samples (64.13%) by the Quantiplex assay and in 171 of 184 samples (92.94%) by the NucliSens assay. Regression analysis indicated that a linear relationship existed between the two sets of values (P < 0.0001), although the Quantiplex and NucliSens values were significantly different (P < 0.001), with the NucliSens values being approximately 0.323 log higher. Spearman correlation analysis indicated that the overall changes in patient viral load patterns were highly correlative between the two assays: r = 0.912, P < 0.0001. The lower limits of sensitivity were determined to be approximately 100 copies/ml and 1,200 to 1,400 copies/ml for the NucliSens and Quantiplex assays, respectively.

  19. Bioinformatics comparison of sulfate-reducing metabolism nucleotide sequences

    NASA Astrophysics Data System (ADS)

    Tremberger, G.; Dehipawala, Sunil; Nguyen, A.; Cheung, E.; Sullivan, R.; Holden, T.; Lieberman, D.; Cheung, T.

    2015-09-01

    The sulfate-reducing bacteria can be traced back to 3.5 billion years ago. The thermodynamics details of the sulfur cycle have been well documented. A recent sulfate-reducing bacteria report (Robator, Jungbluth, et al , 2015 Jan, Front. Microbiol) with Genbank nucleotide data has been analyzed in terms of the sulfite reductase (dsrAB) via fractal dimension and entropy values. Comparison to oil field sulfate-reducing sequences was included. The AUCG translational mass fractal dimension versus ATCG transcriptional mass fractal dimension for the low temperature dsrB and dsrA sequences reported in Reference Thirteen shows correlation R-sq ~ 0.79 , with a probably of about 3% in simulation. A recent report of using Cystathionine gamma-lyase sequence to produce CdS quantum dot in a biological method, where the sulfur is reduced just like in the H2S production process, was included for comparison. The AUCG mass fractal dimension versus ATCG mass fractal dimension for the Cystathionine gamma-lyase sequences was found to have R-sq of 0.72, similar to the low temperature dissimilatory sulfite reductase dsr group with 3% probability, in contrary to the oil field group having R-sq ~ 0.94, a high probable outcome in the simulation. The other two simulation histograms, namely, fractal dimension versus entropy R-sq outcome values, and di-nucleotide entropy versus mono-nucleotide entropy R-sq outcome values are also discussed in the data analysis focusing on low probability outcomes.

  20. Sequences Of Amino Acids For Human Serum Albumin

    NASA Technical Reports Server (NTRS)

    Carter, Daniel C.

    1992-01-01

    Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.

  1. rasbhari: Optimizing Spaced Seeds for Database Searching, Read Mapping and Alignment-Free Sequence Comparison

    PubMed Central

    Hahn, Lars; Leimeister, Chris-André; Morgenstern, Burkhard

    2016-01-01

    Many algorithms for sequence analysis rely on word matching or word statistics. Often, these approaches can be improved if binary patterns representing match and don’t-care positions are used as a filter, such that only those positions of words are considered that correspond to the match positions of the patterns. The performance of these approaches, however, depends on the underlying patterns. Herein, we show that the overlap complexity of a pattern set that was introduced by Ilie and Ilie is closely related to the variance of the number of matches between two evolutionarily related sequences with respect to this pattern set. We propose a modified hill-climbing algorithm to optimize pattern sets for database searching, read mapping and alignment-free sequence comparison of nucleic-acid sequences; our implementation of this algorithm is called rasbhari. Depending on the application at hand, rasbhari can either minimize the overlap complexity of pattern sets, maximize their sensitivity in database searching or minimize the variance of the number of pattern-based matches in alignment-free sequence comparison. We show that, for database searching, rasbhari generates pattern sets with slightly higher sensitivity than existing approaches. In our Spaced Words approach to alignment-free sequence comparison, pattern sets calculated with rasbhari led to more accurate estimates of phylogenetic distances than the randomly generated pattern sets that we previously used. Finally, we used rasbhari to generate patterns for short read classification with CLARK-S. Here too, the sensitivity of the results could be improved, compared to the default patterns of the program. We integrated rasbhari into Spaced Words; the source code of rasbhari is freely available at http://rasbhari.gobics.de/ PMID:27760124

  2. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  3. Nucleotide sequence of the Klebsiella pneumoniae nifD gene and predicted amino acid sequence of the alpha-subunit of nitrogenase MoFe protein.

    PubMed Central

    Ioannidis, I; Buck, M

    1987-01-01

    The nucleotide sequence of the Klebsiella pneumoniae nifD gene is presented and together with the accompanying paper [Holland, Zilberstein, Zamir & Sussman (1987) Biochem. J. 247, 277-285] completes the sequence of the nifHDK genes encoding the nitrogenase polypeptides. The K. pneumoniae nifD gene encodes the 483-amino acid-residue nitrogenase alpha-subunit polypeptide of Mr 54156. The alpha-subunit has five strongly conserved cysteine residues at positions 63, 89, 155, 184 and 275, some occurring in a region showing both primary sequence and potential structural homology to the K. pneumoniae nitrogenase beta-subunit. A comparison with six other alpha-subunit amino acid sequences has been made, which indicates a number of potentially important domains within alpha-subunits. PMID:3322262

  4. Complete amino acid sequence of the A chain of human complement-classical-pathway enzyme C1r.

    PubMed Central

    Arlaud, G J; Willis, A C; Gagnon, J

    1987-01-01

    The amino acid sequence of human C1r A chain was determined, from sequence analysis performed on fragments obtained from C1r autolytic cleavage, cleavage of methionyl bonds, tryptic cleavages at arginine and lysine residues, and cleavages by staphylococcal proteinase. The polypeptide chain has an N-terminal serine residue and contains 446 amino acid residues (Mr 51,200). The sequence data allow chemical characterization of fragments alpha (positions 1-211), beta (positions 212-279) and gamma (positions 280-446) yielded from C1r autolytic cleavage, and identification of the two major cleavage sites generating these fragments. Position 150 of C1r A chain is occupied by a modified amino acid residue that, upon acid hydrolysis, yields erythro-beta-hydroxyaspartic acid, and that is located in a sequence homologous to the beta-hydroxyaspartic acid-containing regions of Factor IX, Factor X, protein C and protein Z. Sequence comparison reveals internal homology between two segments (positions 10-78 and 186-257). Two carbohydrate moieties are attached to the polypeptide chain, both via asparagine residues at positions 108 and 204. Combined with the previously determined sequence of C1r B chain [Arlaud & Gagnon (1983) Biochemistry 22, 1758-1764], these data give the complete sequence of human C1r. PMID:3036070

  5. Amino acid sequence of myoglobin from emu (Dromaius novaehollandiae) skeletal muscle.

    PubMed

    Suman, S P; Joseph, P; Li, S; Beach, C M; Fontaine, M; Steinke, L

    2010-11-01

    The objective of the present study was to characterize the primary structure of emu myoglobin (Mb). Emu Mb was isolated from Iliofibularis muscle employing gel-filtration chromatography. Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry was employed to determine the exact molecular mass of emu Mb in comparison with horse Mb, and Edman degradation was utilized to characterize the amino acid sequence. The molecular mass of emu Mb was 17,380 Da and was close to those reported for ratite and poultry myoglobins. Similar to myoglobins from meat-producing livestock and birds, emu Mb has 153 amino acids. Emu Mb contains 9 histidines. Proximal and distal histidines, responsible for coordinating oxygen-binding property of Mb, are conserved in emu. Emu Mb shared more than 90% homology with ratite and chicken myoglobins, whereas it demonstrated only less than 70% sequence similarity with ruminant myoglobins.

  6. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids.

    PubMed

    Das, Jayanta Kumar; Das, Provas; Ray, Korak Kumar; Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as 'FPKATD' and 'Y/FTNEKL' without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids' pattern in different proteins.

  7. Amino acid sequence of band-3 protein from rainbow trout erythrocytes derived from cDNA.

    PubMed Central

    Hübner, S; Michel, F; Rudloff, V; Appelhans, H

    1992-01-01

    In this report we present the first complete band-3 cDNA sequence of a poikilothermic lower vertebrate. The primary structure of the anion-exchange protein band 3 (AE1) from rainbow trout erythrocytes was determined by nucleotide sequencing of cDNA clones. The overlapping clones have a total length of 3827 bp with a 5'-terminal untranslated region of 150 bp, a 2754 bp open reading frame and a 3'-untranslated region of 924 bp. Band-3 protein from trout erythrocytes consists of 918 amino acid residues with a calculated molecular mass of 101 827 Da. Comparison of its amino acid sequence revealed a 60-65% identity within the transmembrane spanning sequence of band-3 proteins published so far. An additional insertion of 24 amino acid residues within the membrane-associated domain of trout band-3 protein was identified, which until now was thought to be a general feature only of mammalian band-3-related proteins. PMID:1637296

  8. Thermal adaptation analyzed by comparison of protein sequences from mesophilic and extremely thermophilic Methanococcus species

    NASA Technical Reports Server (NTRS)

    Haney, P. J.; Badger, J. H.; Buldak, G. L.; Reich, C. I.; Woese, C. R.; Olsen, G. J.

    1999-01-01

    The genome sequence of the extremely thermophilic archaeon Methanococcus jannaschii provides a wealth of data on proteins from a thermophile. In this paper, sequences of 115 proteins from M. jannaschii are compared with their homologs from mesophilic Methanococcus species. Although the growth temperatures of the mesophiles are about 50 degrees C below that of M. jannaschii, their genomic G+C contents are nearly identical. The properties most correlated with the proteins of the thermophile include higher residue volume, higher residue hydrophobicity, more charged amino acids (especially Glu, Arg, and Lys), and fewer uncharged polar residues (Ser, Thr, Asn, and Gln). These are recurring themes, with all trends applying to 83-92% of the proteins for which complete sequences were available. Nearly all of the amino acid replacements most significantly correlated with the temperature change are the same relatively conservative changes observed in all proteins, but in the case of the mesophile/thermophile comparison there is a directional bias. We identify 26 specific pairs of amino acids with a statistically significant (P < 0.01) preferred direction of replacement.

  9. The complementary deoxyribonucleic acid sequence of guinea pig endometrial prorelaxin.

    PubMed

    Lee, Y A; Bryant-Greenwood, G D; Mandel, M; Greenwood, F C

    1992-03-01

    The nucleotide sequence of the relaxin gene transcript in the endometrium of the late pregnant guinea pig has been determined. The strategy used was a combination of polymerase chain reaction (PCR) with primers designed from the mRNA sequence of porcine preprorelaxin, rapid amplification of cDNA ends-PCR, and blunt end cloning in M13 mp18. With heterologous primers, a 226-basepair (bp) segment of the guinea pig relaxin gene sequence was obtained and was used to design a guinea pig-specific primer for use with the rapid amplification of cDNA ends-PCR method. The latter allowed completion of the sequence of 336 bp, with a 96-bp overlap. The sequence obtained shows greater homology at both the nucleotide and amino acid levels with porcine and human relaxins H1 and H2 than with rat relaxin, supporting the thesis that the guinea pig is not a rodent. The transcription of the guinea pig endometrial relaxin gene during pregnancy was confirmed by Northern analysis of guinea pig endometrial tissues with a species-specific cDNA probe. The endometrial relaxin gene is transcribed during pregnancy, but not in lactation, consistent with the observed immunostaining for relaxin.

  10. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy has extensively been used in physical surface sciences to study quantum tunneling to measure electronic local density of states of nanomaterials and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecule of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique ``electronic fingerprints'' for all nucleotides on DNA and RNA using Q-Seq along their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  11. A cytochrome ba3 functions as a quinol oxidase in Paracoccus denitrificans. Purification, cloning, and sequence comparison.

    PubMed

    Richter, O M; Tao, J S; Turba, A; Ludwig, B

    1994-09-16

    A quinol oxidase has been purified from the cytoplasmic membrane of Paracoccus denitrificans; its heme composition and CO binding properties identify it as a cytochrome ba3. On SDS gels, the purified enzyme complex is separated into five polypeptides. Using partial peptide sequence information for subunit II, the gene locus has been cloned and sequenced. In a typical operon pattern, four genes were identified: qoxA, -B, -C, and -D, coding for subunits II, I, III, and IV. DNA-derived amino acid sequence comparisons reveal extensive similarities to other members of the terminal oxidase superfamily.

  12. Molecular cloning and amino acid sequence of human 5-lipoxygenase

    SciTech Connect

    Matsumoto, T.; Funk, C.D.; Radmark, O.; Hoeoeg, J.O.; Joernvall, H.; Samuelsson, B.

    1988-01-01

    5-Lipoxygenase (EC 1.13.11.34), a Ca/sup 2 +/- and ATP-requiring enzyme, catalyzes the first two steps in the biosynthesis of the peptidoleukotrienes and the chemotactic factor leukotriene B/sub 4/. A cDNA clone corresponding to 5-lipoxygenase was isolated from a human lung lambda gt11 expression library by immunoscreening with a polyclonal antibody. Additional clones from a human placenta lambda gt11 cDNA library were obtained by plaque hybridization with the /sup 32/P-labeled lung cDNA clone. Sequence data obtained from several overlapping clones indicate that the composite DNAs contain the complete coding region for the enzyme. From the deduced primary structure, 5-lipoxygenase encodes a 673 amino acid protein with a calculated molecular weight of 77,839. Direct analysis of the native protein and its proteolytic fragments confirmed the deduced composition, the amino-terminal amino acid sequence, and the structure of many internal segments. 5-Lipoxygenase has no apparent sequence homology with leukotriene A/sub 4/ hydrolase or Ca/sup 2 +/-binding proteins. RNA blot analysis indicated substantial amounts of an mRNA species of approx. = 2700 nucleotides in leukocytes, lung, and placenta.

  13. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    DOEpatents

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  14. The amino acid sequence of chymopapain from Carica papaya.

    PubMed Central

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-01-01

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  15. The sequence of carnation etched ring virus DNA: comparison with cauliflower mosaic virus and retroviruses

    PubMed Central

    Hull, R.; Sadler, J.; Longstaff, M.

    1986-01-01

    Carnation etched ring virus (CERV) DNA comprises 7932 bp. CERV primer binding sites and overall genome organization are similar to those of the related cauliflower mosaic virus (CaMV). The six open reading frames of CERV showed amino acid homology (50-80%) with CaMV ORFs I-VI; no homologues of CaMV ORFs VII or VIII were found. CERV ORFs 1-5 interface each other with the sequence ATGA. The comparison of CERV ORF5 with CaMV ORFV highlighted regions which show homologies to retrovirus gag/pol protease, RNase H and DNA polymerase domains; the possibility that the DNA polymerase domain comprises two subdomains, operating off different templates, is discussed. Both CERV and CaMV ORFs I have sequence homology to tobacco mosaic virus P30 and plastocyanin. PMID:16453731

  16. An amino acid sequence motif sufficient for subnuclear localization of an arginine/serine-rich splicing factor.

    PubMed

    Hedley, M L; Amrein, H; Maniatis, T

    1995-12-05

    We have identified an amino acid sequence in the Drosophila Transformer (Tra) protein that is capable of directing a heterologous protein to nuclear speckles, regions of the nucleus previously shown to contain high concentrations of spliceosomal small nuclear RNAs and splicing factors. This sequence contains a nucleoplasmin-like bipartite nuclear localization signal (NLS) and a repeating arginine/serine (RS) dipeptide sequence adjacent to a short stretch of basic amino acids. Sequence comparisons from a number of other splicing factors that colocalize to nuclear speckles reveal the presence of one or more copies of this motif. We propose a two-step subnuclear localization mechanism for splicing factors. The first step is transport across the nuclear envelope via the nucleoplasmin-like NLS, while the second step is association with components in the speckled domain via the RS dipeptide sequence.

  17. Studies on adenosine triphosphate transphosphorylases. Amino acid sequence of rabbit muscle ATP-AMP transphosphorylase.

    PubMed

    Kuby, S A; Palmieri, R H; Frischat, A; Fischer, A H; Wu, L H; Maland, L; Manship, M

    1984-05-22

    The total amino acid sequence of rabbit muscle adenylate kinase has been determined, and the single polypeptide chain of 194 amino acid residues starts with N-acetylmethionine and ends with leucyllysine at its carboxyl terminus, in agreement with the earlier data on its amino acid composition [Mahowald, T. A., Noltmann, E. A., & Kuby, S. A. (1962) J. Biol. Chem. 237, 1138-1145] and its carboxyl-terminus sequence [Olson, O. E., & Kuby, S. A. (1964) J. Biol. Chem. 239, 460-467]. Elucidation of the primary structure was based on tryptic and chymotryptic cleavages of the performic acid oxidized protein, cyanogen bromide cleavages of the 14C-labeled S-carboxymethylated protein at its five methionine sites (followed by maleylation of peptide fragments), and tryptic cleavages at its 12 arginine sites of the maleylated 14C-labeled S-carboxymethylated protein. Calf muscle myokinase, whose sequence has also been established, differs primarily from the rabbit muscle myokinase's sequence in the following: His-30 is replaced by Gln-30; Lys-56 is replaced by Met-56; Ala-84 and Asp 85 are replaced by Val-84 and Asn-85. A comparison of the four muscle-type adenylate kinases, whose covalent structures have now been determined, viz., rabbit, calf, porcine, and human [for the latter two sequences see Heil, A., Müller, G., Noda, L., Pinder, T., Schirmer, H., Schirmer, I., & Von Zabern, I. (1974) Eur. J. Biochem. 43, 131-144, and Von Zabern, I., Wittmann-Liebold, B., Untucht-Grau, R., Schirmer, R. H., & Pai, E. F. (1976) Eur. J. Biochem. 68, 281-290], demonstrates an extraordinary degree of homology.(ABSTRACT TRUNCATED AT 250 WORDS)

  18. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids

    PubMed Central

    Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as ‘FPKATD’ and ‘Y/FTNEKL’ without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids’ pattern in different proteins. PMID:27930687

  19. Comparison of Buffer Effect of Different Acids During Sandstone Acidizing

    NASA Astrophysics Data System (ADS)

    Umer Shafiq, Mian; Khaled Ben Mahmud, Hisham; Hamid, Mohamed Ali

    2015-04-01

    The most important concern of sandstone matrix acidizing is to increase the formation permeability by removing the silica particles. To accomplish this, the mud acid (HF: HCl) has been utilized successfully for many years to stimulate the sandstone formations, but still it has many complexities. This paper presents the results of laboratory investigations of different acid combinations (HF: HCl, HF: H3PO4 and HF: HCOOH). Hydrofluoric acid and fluoboric acid are used to dissolve clays and feldspar. Phosphoric and formic acids are added as a buffer to maintain the pH of the solution; also it allows the maximum penetration of acid into the core sample. Different tests have been performed on the core samples before and after the acidizing to do the comparative study on the buffer effect of these acids. The analysis consists of permeability, porosity, color change and pH value tests. There is more increase in permeability and porosity while less change in pH when phosphoric and formic acids were used compared to mud acid. From these results it has been found that the buffer effect of phosphoric acid and formic acid is better than hydrochloric acid.

  20. Ultrasensitive nucleic acid sequence detection by single-molecule electrophoresis

    SciTech Connect

    Castro, A; Shera, E.B.

    1996-09-01

    This is the final report of a one-year laboratory-directed research and development project at Los Alamos National Laboratory. There has been considerable interest in the development of very sensitive clinical diagnostic techniques over the last few years. Many pathogenic agents are often present in extremely small concentrations in clinical samples, especially at the initial stages of infection, making their detection very difficult. This project sought to develop a new technique for the detection and accurate quantification of specific bacterial and viral nucleic acid sequences in clinical samples. The scheme involved the use of novel hybridization probes for the detection of nucleic acids combined with our recently developed technique of single-molecule electrophoresis. This project is directly relevant to the DOE`s Defense Programs strategic directions in the area of biological warfare counter-proliferation.

  1. The amino acid sequence of the aspartate aminotransferase from baker's yeast (Saccharomyces cerevisiae).

    PubMed Central

    Cronin, V B; Maras, B; Barra, D; Doonan, S

    1991-01-01

    1. The single (cytosolic) aspartate aminotransferase was purified in high yield from baker's yeast (Saccharomyces cerevisiae). 2. Amino-acid-sequence analysis was carried out by digestion of the protein with trypsin and with CNBr; some of the peptides produced were further subdigested with Staphylococcus aureus V8 proteinase or with pepsin. Peptides were sequenced by the dansyl-Edman method and/or by automated gas-phase methods. The amino acid sequence obtained was complete except for a probable gap of two residues as indicated by comparison with the structures of counterpart proteins in other species. 3. The N-terminus of the enzyme is blocked. Fast-atom-bombardment m.s. was used to identify the blocking group as an acetyl one. 4. Alignment of the sequence of the enzyme with those of vertebrate cytosolic and mitochondrial aspartate aminotransferases and with the enzyme from Escherichia coli showed that about 25% of residues are conserved between these distantly related forms. 5. Experimental details and confirmatory data for the results presented here are given in a Supplementary Publication (SUP 50164, 25 pages) that has been deposited at the British Library Document Supply Centre, Boston Spa. Wetherby, West Yorkshire LS23 7 BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1991) 273, 5. PMID:1859361

  2. Nucleic acid (cDNA) and amino acid sequences of alpha-type gliadins from wheat (Triticum aestivum).

    PubMed Central

    Kasarda, D D; Okita, T W; Bernardin, J E; Baecker, P A; Nimmo, C C; Lew, E J; Dietler, M D; Greene, F C

    1984-01-01

    The complete amino acid sequence for an alpha-type gliadin protein of wheat (Triticum aestivum Linnaeus) endosperm has been derived from a cloned cDNA sequence. An additional cDNA clone that corresponds to about 75% of a similar alpha-type gliadin has been sequenced and shows some important differences. About 97% of the composite sequence of A-gliadin (an alpha-type gliadin fraction) has also been obtained by direct amino acid sequencing. This sequence shows a high degree of similarity with amino acid sequences derived from both cDNA clones and is virtually identical to one of them. On the basis of sequence information, after loss of the signal sequence, the mature alpha-type gliadins may be divided into five different domains, two of which may have evolved from an ancestral gliadin gene, whereas the remaining three contain repeating sequences that may have developed independently. Images PMID:6589619

  3. Comparison of exon 5 sequences from 35 class I genes of the BALB/c mouse

    PubMed Central

    1989-01-01

    DNA sequences of the fifth exon, which encodes the transmembrane domain, were determined for the BALB/c mouse class I MHC genes and used to study the relationships between them. Based on nucleotide sequence similarity, the exon 5 sequences can be divided into seven groups. Although most members within each group are at least 80% similar to each other, comparison between groups reveals that the groups share little similarity. However, in spite of the extensive variation of the fifth exon sequences, analysis of their predicted amino acid translations reveals that only four class I gene fifth exons have frameshifts or stop codons that terminate their translation and prevent them from encoding a domain that is both hydrophobic and long enough to span a lipid bilayer. Exactly 27 of the remaining fifth exons could encode a domain that is similar to those of the transplantation antigens in that it consists of a proline-rich connecting peptide, a transmembrane segment, and a cytoplasmic portion with membrane- anchoring basic residues. The conservation of this motif in the majority of the fifth exon translations in spite of extensive variation suggests that selective pressure exists for these exons to maintain their ability to encode a functional transmembrane domain, raising the possibility that many of the nonclassical class I genes encode functionally important products. PMID:2584927

  4. Structural gene and complete amino acid sequence of Vibrio alginolyticus collagenase.

    PubMed Central

    Takeuchi, H; Shibano, Y; Morihara, K; Fukushima, J; Inami, S; Keil, B; Gilles, A M; Kawamoto, S; Okuda, K

    1992-01-01

    The DNA encoding the collagenase of Vibrio alginolyticus was cloned, and its complete nucleotide sequence was determined. When the cloned gene was ligated to pUC18, the Escherichia coli expression vector, bacteria carrying the gene exhibited both collagenase antigen and collagenase activity. The open reading frame from the ATG initiation codon was 2442 bp in length for the collagenase structural gene. The amino acid sequence, deduced from the nucleotide sequence, revealed that the mature collagenase consists of 739 amino acids with an Mr of 81875. The amino acid sequences of 20 polypeptide fragments were completely identical with the deduced amino acid sequences of the collagenase gene. The amino acid composition predicted from the DNA sequence was similar to the chemically determined composition of purified collagenase reported previously. The analyses of both the DNA and amino acid sequences of the collagenase gene were rigorously performed, but we could not detect any significant sequence similarity to other collagenases. Images Fig. 2. PMID:1311172

  5. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    PubMed

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment.

  6. A comparison of chromic acid and sulfuric acid anodizing

    NASA Technical Reports Server (NTRS)

    Danford, M. D.

    1992-01-01

    Because of federal and state mandates restricting the use of hexavalent chromium, it was deemed worthwhile to compare the corrosion protection afforded 2219-T87 aluminum alloy by both Type I chromic acid and Type II sulfuric acid anodizing per MIL-A-8625. Corrosion measurements were made on large, flat 2219-T87 aluminum alloy sheet material with an area of 1 cm(exp 2) exposed to a corrosive medium of 3.5-percent sodium chloride at pH 5.5. Both ac electrochemical impedance spectroscopy and the dc polarization resistance techniques were employed. The results clearly indicate that the corrosion protection obtained by Type II sulfuric acid anodizing is superior, and no problems should result by substituting Type II sulfuric acid anodizing for Type I chromic acid anodizing.

  7. Alignment of 700 globin sequences: extent of amino acid substitution and its correlation with variation in volume.

    PubMed Central

    Kapp, O. H.; Moens, L.; Vanfleteren, J.; Trotman, C. N.; Suzuki, T.; Vinogradov, S. N.

    1995-01-01

    Seven-hundred globin sequences, including 146 nonvertebrate sequences, were aligned on the basis of conservation of secondary structure and the avoidance of gap penalties. Of the 182 positions needed to accommodate all the globin sequences, only 84 are common to all, including the absolutely conserved PheCD1 and HisF8. The mean number of amino acid substitutions per position ranges from 8 to 13 for all globins and 5 to 9 for internal positions. Although the total sequence volumes have a variation approximately 2-3%, the variation in volume per position ranges from approximately 13% for the internal to approximately 21% for the surface positions. Plausible correlations exist between amino acid substitution and the variation in volume per position for the 84 common and the internal but not the surface positions. The amino acid substitution matrix derived from the 84 common positions was used to evaluate sequence similarity within the globins and between the globins and phycocyanins C and colicins A, via calculation of pairwise similarity scores. The scores for globin-globin comparisons over the 84 common positions overlap the globin-phycocyanin and globin-colicin scores, with the former being intermediate. For the subset of internal positions, overlap is minimal between the three groups of scores. These results imply a continuum of amino acid sequences able to assume the common three-on-three alpha-helical structure and suggest that the determinants of the latter include sites other than those inaccessible to solvent. PMID:8535255

  8. Nucleic acid (cDNA) and amino acid sequences of the maize endosperm protein glutelin-2.

    PubMed Central

    Prat, S; Cortadas, J; Puigdomènech, P; Palau, J

    1985-01-01

    The cDNA coding for a glutelin-2 protein from maize endosperm has been cloned and the complete amino acid sequence of the protein derived for the first time. An immature maize endosperm cDNA bank was screened for the expression of a beta-lactamase:glutelin-2 (G2) fusion polypeptide by using antibodies against the purified 28 kd G2 protein. A clone corresponding to the 28 kd G2 protein was sequenced and the primary structure of this protein was derived. Five regions can be defined in the protein sequence: an 11 residue N-terminal part, a repeated region formed by eight units of the sequence Pro-Pro-Pro-Val-His-Leu, an alternating Pro-X stretch 21 residues long, a Cys rich domain and a C-terminal part rich in Gln. The protein sequence is preceded by 19 residues which have the characteristics of the signal peptide found in secreted proteins. Unlike zeins, the main maize storage proteins, 28 kd glutelin-2 has several homologous sequences in common with other cereal storage proteins. Images PMID:3839076

  9. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  10. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2012-07-01 2012-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  11. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2014-07-01 2014-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  12. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  13. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2013-07-01 2013-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  14. Genomic Sequence Comparisons, 1987-2003 Final Report

    SciTech Connect

    George M. Church

    2004-07-29

    This project was to develop new DNA sequencing and RNA and protein quantitation methods and related genome annotation tools. The project began in 1987 with the development of multiplex sequencing (published in Science in 1988), and one of the first automated sequencing methods. This lead to the first commercial genome sequence in 1994 and to the establishment of the main commercial participants (GTC then Agencourt) in the public DOE/NIH genome project. In collaboration with GTC we contributed to one of the first complete DOE genome sequences, in 1997, that of Methanobacterium thermoautotropicum, a species of great relevance to energy-rich gas production.

  15. Amino acid substitutions in genetic variants of human serum albumin and in sequences inferred from molecular cloning

    SciTech Connect

    Takahashi, N.; Takahashi, Y.; Blumberg, B.S.; Putnam, F.W.

    1987-07-01

    The structural changes in four genetic variants of human serum albumin were analyzed by tandem high-pressure liquid chromatography (HPLC) of the tryptic peptides, HPLC mapping and isoelectric focusing of the CNBr fragments, and amino acid sequence analysis of the purified peptides. Lysine-372 of normal (common) albumin A was changed to glutamic acid both in albumin Naskapi, a widespread polymorphic variant of North American Indians, and in albumin Mersin found in Eti Turks. The two variants also exhibited anomalous migration in NaDodSO/sub 4//PAGE, which is attributed to a conformational change. The identity of albumins Naskapi and Mersin may have originated through descent from a common mid-Asiatic founder of the two migrating ethnic groups, or it may represent identical but independent mutations of the albumin gene. In albumin Adana, from Eti Turks, the substitution site was not identified but was localized to the region from positions 447 through 548. The substitution of aspartic acid-550 by glycine was found in albumin Mexico-2 from four individuals of the Pima tribe. Although only single-point substitutions have been found in these and in certain other genetic variants of human albumin, five differences exist in the amino acid sequences inferred from cDNA sequences by workers in three other laboratories. However, our results on albumin A and on 14 different genetic variants accord with the amino acid sequence of albumin deduced from the genomic sequence. The apparent amino acid substitutions inferred from comparison of individual cDNA sequences probably reflect artifacts in cloning or in cDNA sequence analysis rather than polymorphism of the coding sections of the albumin gene.

  16. Human liver apolipoprotein B-100 cDNA: complete nucleic acid and derived amino acid sequence.

    PubMed Central

    Law, S W; Grant, S M; Higuchi, K; Hospattankar, A; Lackner, K; Lee, N; Brewer, H B

    1986-01-01

    Human apolipoprotein B-100 (apoB-100), the ligand on low density lipoproteins that interacts with the low density lipoprotein receptor and initiates receptor-mediated endocytosis and low density lipoprotein catabolism, has been cloned, and the complete nucleic acid and derived amino acid sequences have been determined. ApoB-100 cDNAs were isolated from normal human liver cDNA libraries utilizing immunoscreening as well as filter hybridization with radiolabeled apoB-100 oligodeoxynucleotides. The apoB-100 mRNA is 14.1 kilobases long encoding a mature apoB-100 protein of 4536 amino acids with a calculated amino acid molecular weight of 512,723. ApoB-100 contains 20 potential glycosylation sites, and 12 of a total of 25 cysteine residues are located in the amino-terminal region of the apolipoprotein providing a potential globular structure of the amino terminus of the protein. ApoB-100 contains relatively few regions of amphipathic helices, but compared to other human apolipoproteins it is enriched in beta-structure. The delineation of the entire human apoB-100 sequence will now permit a detailed analysis of the conformation of the protein, the low density lipoprotein receptor binding domain(s), and the structural relationship between apoB-100 and apoB-48 and will provide the basis for the study of genetic defects in apoB-100 in patients with dyslipoproteinemias. PMID:3464946

  17. Computer selection of oligonucleotide probes from amino acid sequences for use in gene library screening.

    PubMed

    Yang, J H; Ye, J H; Wallace, D C

    1984-01-11

    We present a computer program, FINPROBE, which utilizes known amino acid sequence data to deduce minimum redundancy oligonucleotide probes for use in screening cDNA or genomic libraries or in primer extension. The user enters the amino acid sequence of interest, the desired probe length, the number of probes sought, and the constraints on oligonucleotide synthesis. The computer generates a table of possible probes listed in increasing order of redundancy and provides the location of each probe in the protein and mRNA coding sequence. Activation of a next function provides the amino acid and mRNA sequences of each probe of interest as well as the complementary sequence and the minimum dissociation temperature of the probe. A final routine prints out the amino acid sequence of the protein in parallel with the mRNA sequence listing all possible codons for each amino acid.

  18. Fmoc/Trt-amino acids: comparison to Fmoc/tBu-amino acids in peptide synthesis.

    PubMed

    Barlos, K; Gatos, D; Koutsogianni, S

    1998-03-01

    Model peptides containing the nucleophilic amino acids Trp and Met have been synthesized with the application of Fmoc/Trt- and Fmoc/tBu-amino acids, for comparison. The deprotection of the peptides synthesized using Fmoc/Trt-amino acids in all cases leads to crude peptides of higher purity than that of the same peptides synthesized using Fmoc/tBu-amino acids.

  19. Numerical characteristics of word frequencies and their application to dissimilarity measure for sequence comparison.

    PubMed

    Dai, Qi; Liu, Xiaoqing; Yao, Yuhua; Zhao, Fukun

    2011-05-07

    Sequence comparison is one of the major tasks in bioinformatics, which can be used to study structural and functional conservation, as well as evolutionary relations among the sequences. Numerous dissimilarity measures achieve promising results in sequence comparison, but challenges remain. This paper studied numerical characteristics of word frequencies and proposed a novel dissimilarity measure for sequence comparison. Instead of using the word frequencies directly, the proposed measure considers both the word frequencies and overlapping structures of words. To verify the effectiveness of the proposed measure, we tested it with two experiments and further compared it with alignment-based and alignment-free measures. The results demonstrate that the proposed measure extracting more information on the overlapping structures of the words improves the efficiency of sequence comparison.

  20. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  1. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  2. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  3. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  4. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  5. Direct Chloroplast Sequencing: Comparison of Sequencing Platforms and Analysis Tools for Whole Chloroplast Barcoding

    PubMed Central

    Brozynska, Marta; Furtado, Agnelo; Henry, Robert James

    2014-01-01

    Direct sequencing of total plant DNA using next generation sequencing technologies generates a whole chloroplast genome sequence that has the potential to provide a barcode for use in plant and food identification. Advances in DNA sequencing platforms may make this an attractive approach for routine plant identification. The HiSeq (Illumina) and Ion Torrent (Life Technology) sequencing platforms were used to sequence total DNA from rice to identify polymorphisms in the whole chloroplast genome sequence of a wild rice plant relative to cultivated rice (cv. Nipponbare). Consensus chloroplast sequences were produced by mapping sequence reads to the reference rice chloroplast genome or by de novo assembly and mapping of the resulting contigs to the reference sequence. A total of 122 polymorphisms (SNPs and indels) between the wild and cultivated rice chloroplasts were predicted by these different sequencing and analysis methods. Of these, a total of 102 polymorphisms including 90 SNPs were predicted by both platforms. Indels were more variable with different sequencing methods, with almost all discrepancies found in homopolymers. The Ion Torrent platform gave no apparent false SNP but was less reliable for indels. The methods should be suitable for routine barcoding using appropriate combinations of sequencing platform and data analysis. PMID:25329378

  6. Comparison of antioxidant effectiveness of lipoic acid and dihydrolipoic acid.

    PubMed

    Zhao, Feng; Liu, Zai-Qun

    2011-01-01

    The abilities of dihydrolipoic acid (DHLA) to scavenge peroxynitrite (ONOO(-) ), galvinoxyl radical, 2,2'-azinobis(3-ethylbenzothiazoline-6-sulfonate) cation radical (ABTS(+•) ), and 2,2'-diphenyl-1-picrylhydrazyl radical (DPPH) were higher than those of lipoic acid (LA). The effectiveness of DHLA to protect methyl linoleate against 2,2'-azobis(2-amidinopropane hydrochloride) (AAPH)-induced oxidation was about 2.2-fold higher than that of LA, and DHLA can retard the autoxidation of linoleic acid (LH) in the β-carotene-bleaching test. DHLA can also trap ∼0.6 radicals in AAPH-induced oxidation of LH. Moreover, DHLA can scavenge ∼2.0 radicals in AAPH-induced oxidation of DNA and AAPH-induced hemolysis of erythrocytes, whereas LA can scavenge ∼1.5 radicals at the same experimental conditions. DHLA can protect erythrocytes against hemin-induced hemolysis, but accelerate the degradation of DNA in the presence of Cu(2+) . Therefore, the antioxidant capacity of -SH in DHLA is higher than S-S in LA.

  7. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Foley, B.; Korber, B.; Mellors, J.W.; Jeang, K.T.; Wain-Hobson, S.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nuclear Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  8. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza

    PubMed Central

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  9. Purification of a marsupial insulin: amino-acid sequence of insulin from the eastern grey kangaroo Macropus giganteus.

    PubMed

    Treacy, G B; Shaw, D C; Griffiths, M E; Jeffrey, P D

    1989-03-24

    Insulin has been purified from kangaroo pancreas by acidic ethanol extraction, diethyl ether precipitation and gel filtration. The amino-acid sequence of this, the first marsupial insulin to be studied, is reported. It differs from human insulin by only four amino-acid substitutions, all in regions of the molecule previously known to be variable. However, it should be noted that one of these, asparagine for threonine at A8, has not been reported before. Computer comparisons of all 43 insulin sequences reported to date with kangaroo insulin show it to be most closely related to a group of mammalian insulins (dog, pig, cow, human) known to be of high biological potency. The measurement of blood glucose lowering in the rabbit by kangaroo insulin is consistent with this conclusion. Comparisons of amino-acid sequences of other proteins with their kangaroo counterparts show a greater difference, in line with the time of divergence of marsupials. The limited differences observed in insulin and cytochrome c suggest that their structures need to be closely conserved in order to maintain function.

  10. A parallel computing approach to genetic sequence comparison: the master-worker paradigm with interworker communication.

    PubMed

    Sittig, D F; Foulser, D; Carriero, N; McCorkle, G; Miller, P L

    1991-04-01

    We have implemented a parallel version of a dynamic programming biological sequence comparison algorithm to study the potential applicability of using parallel computers for genetic sequence comparisons. Our parallel program is built using C-Linda, a machine-independent parallel programming language, and was tested on both a 10 CPU Sequent Symmetry and a 64 CPU Intel Hypercube. C-Linda implements a shared associative memory model, "tuple space," through which multiple processes can communicate and coordinate control. In our master-worker (MW) parallel implementation, a master process creates several worker processes, extracts a test sequence and multiple library sequences from a database and stores them in tuple space. Each worker reads the test sequence and then repeatedly extracts library strings from tuple space, performs pairwise sequence comparison using a local comparison algorithm to generate a similarity score, and returns the similarity scores to tuple space. The master collects the scores from tuple space and identifies the best match over all library sequences. We also implemented a method of global interworker communication to reduce the total search time by stopping those string comparisons that had no chance of improving on the current best match. Comparisons of the total run time, speedup, and efficiency were made for parallel and sequential versions of a basic MW implementation as well as versions with the global abort threshold.

  11. Human retroviruses and aids, 1992. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Korber, B.; Berzofsky, J.A.; Pavlakis, G.N.; Smith, R.F.

    1992-10-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) HIV and SIV Nucleotide Sequences; (H) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions below of the parts of the compendium, the user should read the individual introductions for each part.

  12. Complete Genome Sequence of the Prototype Lactic Acid Bacterium Lactococcus lactis subsp. cremoris MG1363▿

    PubMed Central

    Wegmann, Udo; O'Connell-Motherway, Mary; Zomer, Aldert; Buist, Girbe; Shearman, Claire; Canchaya, Carlos; Ventura, Marco; Goesmann, Alexander; Gasson, Michael J.; Kuipers, Oscar P.; van Sinderen, Douwe; Kok, Jan

    2007-01-01

    Lactococcus lactis is of great importance for the nutrition of hundreds of millions of people worldwide. This paper describes the genome sequence of Lactococcus lactis subsp. cremoris MG1363, the lactococcal strain most intensively studied throughout the world. The 2,529,478-bp genome contains 81 pseudogenes and encodes 2,436 proteins. Of the 530 unique proteins, 47 belong to the COG (clusters of orthologous groups) functional category “carbohydrate metabolism and transport,” by far the largest category of novel proteins in comparison with L. lactis subsp. lactis IL1403. Nearly one-fifth of the 71 insertion elements are concentrated in a specific 56-kb region. This integration hot-spot region carries genes that are typically associated with lactococcal plasmids and a repeat sequence specifically found on plasmids and in the “lateral gene transfer hot spot” in the genome of Streptococcus thermophilus. Although the parent of L. lactis MG1363 was used to demonstrate lysogeny in Lactococcus, L. lactis MG1363 carries four remnant/satellite phages and two apparently complete prophages. The availability of the L. lactis MG1363 genome sequence will reinforce its status as the prototype among lactic acid bacteria through facilitation of further applied and fundamental research. PMID:17307855

  13. Completion of the amino acid sequence of the alpha 1 chain from type I calf skin collagen. Amino acid sequence of alpha 1(I)B8.

    PubMed Central

    Glanville, R W; Breitkreutz, D; Meitinger, M; Fietzek, P P

    1983-01-01

    The complete amino acid sequence of the 279-residue CNBr peptide CB8 from the alpha 1 chain of type I calf skin collagen is presented. It was determined by sequencing overlapping fragments of CB8 produced by Staphylococcus aureus V8 proteinase, trypsin, Endoproteinase Arg-C and hydroxylamine. Tryptic cleavages were also made specific for lysine by blocking arginine residues with cyclohexane-1,2-dione. This completes the amino acid sequence analysis of the 1054-residues-long alpha (I) chain of calf skin collagen. PMID:6354180

  14. Multiple Comparison Analysis of Two New Genomic Sequences of ILTV Strains from China with Other Strains from Different Geographic Regions.

    PubMed

    Zhao, Yan; Kong, Congcong; Wang, Yunfeng

    2015-01-01

    To date, twenty complete genome sequences of ILTV strains have been published in GenBank, including one strain from China, and nineteen strains from Australian and the United States. To investigate the genomic information on ILTVs from different geographic regions, two additional individual complete genome sequences of WG and K317 strains from China were determined. The genomes of WG and K317 strains were 153,505 and 153,639 bp in length, respectively. Alignments performed on the amino acid sequences of the twelve glycoproteins showed that 13 out of 116 mutational sites were present only among the Chinese strain WG and the Australian strains SA2 and A20. The phylogenetic tree analysis suggested that the WG strain established close relationships with the Australian strain SA2. The recombination events were detected and confirmed in different subregions of the WG strain with the sequences of SA2 and K317 strains as parental. In this study, two new complete genome sequences of Chinese ILTV strains were used in comparative analysis with other complete genome sequences of ILTV strains from China, the United States, and Australia. The analysis of genome comparison, phylogenetic trees, and recombination events showed close relationships among the Chinese strain WG and the Australian strains SA2. The information of the two new complete genome sequences from China will help to facilitate the analysis of phylogenetic relationships and the molecular differences among ILTV strains from different geographic regions.

  15. Comparison of simple sequence repeats in 19 Archaea.

    PubMed

    Trivedi, S

    2006-12-05

    All organisms that have been studied until now have been found to have differential distribution of simple sequence repeats (SSRs), with more SSRs in intergenic than in coding sequences. SSR distribution was investigated in Archaea genomes where complete chromosome sequences of 19 Archaea were analyzed with the program SPUTNIK to find di- to penta-nucleotide repeats. The number of repeats was determined for the complete chromosome sequences and for the coding and non-coding sequences. Different from what has been found for other groups of organisms, there is an abundance of SSRs in coding regions of the genome of some Archaea. Dinucleotide repeats were rare and CG repeats were found in only two Archaea. In general, trinucleotide repeats are the most abundant SSR motifs; however, pentanucleotide repeats are abundant in some Archaea. Some of the tetranucleotide and pentanucleotide repeat motifs are organism specific. In general, repeats are short and CG-rich repeats are present in Archaea having a CG-rich genome. Among the 19 Archaea, SSR density was not correlated with genome size or with optimum growth temperature. Pentanucleotide density had an inverse correlation with the CG content of the genome.

  16. An Integrated Sequence-Structure Database incorporating matching mRNA sequence, amino acid sequence and protein three-dimensional structure data.

    PubMed Central

    Adzhubei, I A; Adzhubei, A A; Neidle, S

    1998-01-01

    We have constructed a non-homologous database, termed the Integrated Sequence-Structure Database (ISSD) which comprises the coding sequences of genes, amino acid sequences of the corresponding proteins, their secondary structure and straight phi,psi angles assignments, and polypeptide backbone coordinates. Each protein entry in the database holds the alignment of nucleotide sequence, amino acid sequence and the PDB three-dimensional structure data. The nucleotide and amino acid sequences for each entry are selected on the basis of exact matches of the source organism and cell environment. The current version 1.0 of ISSD is available on the WWW at http://www.protein.bio.msu.su/issd/ and includes 107 non-homologous mammalian proteins, of which 80 are human proteins. The database has been used by us for the analysis of synonymous codon usage patterns in mRNA sequences showing their correlation with the three-dimensional structure features in the encoded proteins. Possible ISSD applications include optimisation of protein expression, improvement of the protein structure prediction accuracy, and analysis of evolutionary aspects of the nucleotide sequence-protein structure relationship. PMID:9399866

  17. Revision of Begomovirus taxonomy based on pairwise sequence comparisons.

    PubMed

    Brown, Judith K; Zerbini, F Murilo; Navas-Castillo, Jesús; Moriones, Enrique; Ramos-Sobrinho, Roberto; Silva, José C F; Fiallo-Olivé, Elvira; Briddon, Rob W; Hernández-Zepeda, Cecilia; Idris, Ali; Malathi, V G; Martin, Darren P; Rivera-Bustamante, Rafael; Ueda, Shigenori; Varsani, Arvind

    2015-06-01

    Viruses of the genus Begomovirus (family Geminiviridae) are emergent pathogens of crops throughout the tropical and subtropical regions of the world. By virtue of having a small DNA genome that is easily cloned, and due to the recent innovations in cloning and low-cost sequencing, there has been a dramatic increase in the number of available begomovirus genome sequences. Even so, most of the available sequences have been obtained from cultivated plants and are likely a small and phylogenetically unrepresentative sample of begomovirus diversity, a factor constraining taxonomic decisions such as the establishment of operationally useful species demarcation criteria. In addition, problems in assigning new viruses to established species have highlighted shortcomings in the previously recommended mechanism of species demarcation. Based on the analysis of 3,123 full-length begomovirus genome (or DNA-A component) sequences available in public databases as of December 2012, a set of revised guidelines for the classification and nomenclature of begomoviruses are proposed. The guidelines primarily consider a) genus-level biological characteristics and b) results obtained using a standardized classification tool, Sequence Demarcation Tool, which performs pairwise sequence alignments and identity calculations. These guidelines are consistent with the recently published recommendations for the genera Mastrevirus and Curtovirus of the family Geminiviridae. Genome-wide pairwise identities of 91 % and 94 % are proposed as the demarcation threshold for begomoviruses belonging to different species and strains, respectively. Procedures and guidelines are outlined for resolving conflicts that may arise when assigning species and strains to categories wherever the pairwise identity falls on or very near the demarcation threshold value.

  18. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    PubMed

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor.

  19. Amino acid sequence of rabbit kidney neutral endopeptidase 24.11 (enkephalinase) deduced from a complementary DNA.

    PubMed Central

    Devault, A; Lazure, C; Nault, C; Le Moual, H; Seidah, N G; Chrétien, M; Kahn, P; Powell, J; Mallet, J; Beaumont, A

    1987-01-01

    Neutral endopeptidase (EC 3.4.24.11) is a major constituent of kidney brush border membranes. It is also present in the brain where it has been shown to be involved in the inactivation of opioid peptides, methionine- and leucine-enkephalins. For this reason this enzyme is often called 'enkephalinase'. In order to characterize the primary structure of the enzyme, oligonucleotide probes were designed from partial amino acid sequences and used to isolate clones from kidney cDNA libraries. Sequencing of the cDNA inserts revealed the complete primary structure of the enzyme. Neutral endopeptidase consists of 750 amino acids. It contains a short N-terminal cytoplasmic domain (27 amino acids), a single membrane-spanning segment (23 amino acids) and an extracellular domain that comprises most of the protein mass. The comparison of the primary structure of neutral endopeptidase with that of thermolysin, a bacterial Zn-metallopeptidase, indicates that most of the amino acid residues involved in Zn coordination and catalytic activity in thermolysin are found within highly honmologous sequences in neutral endopeptidase. Images Fig. 1. Fig. 3. PMID:2440677

  20. Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2000-01-01

    A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.

  1. A Guaranteed Similarity Metric Learning Framework for Biological Sequence Comparison.

    PubMed

    Hua, Keru; Yu, Qin; Zhang, Ruiming

    2016-01-01

    Similarity of sequences is a key mathematical notion for Classification and Phylogenetic studies in Biology. The distance and similarity between two sequence are very important and widely studied. During the last decades, Similarity(distance) metric learning is one of the hottest topics of machine learning/data mining as well as their applications in the bioinformatics field. It is feasible to introduce machine learning technology to learn similarity metric from biological data. In this paper, we propose a novel framework of guaranteed similarity metric learning (GMSL) to perform alignment of biology sequences in any feature vector space. It introduces the (ϵ, γ, τ)-goodness similarity theory to Mahalanobis metric learning. As a theoretical guaranteed similarity metric learning approach, GMSL guarantees that the learned similarity function performs well in classification and clustering. Our experiments on the most used datasets demonstrate that our approach outperforms the state-of-the-art biological sequences alignment methods and other similarity metric learning algorithms in both accuracy and stability.

  2. Complete genome sequence of virulent duck enteritis virus (DEV) strain 2085 and comparison with genome sequences of virulent and attenuated DEV strains.

    PubMed

    Wang, Jichun; Höper, Dirk; Beer, Martin; Osterrieder, Nikolaus

    2011-09-01

    We here report the complete genome sequence of the duck enteritis virus (DEV) wild-type strain 2085, an avian herpesvirus (GenBank ID: JF999965). The nucleotide sequence was derived from the 2085 genome cloned as an infectious bacterial artificial chromosome (BAC) clone. The DEV 2085 genome is 160,649-bp in length and encodes 78 predicted open reading frames (ORFs), a number identical to that identified for the attenuated DEV VAC strain (GenBank ID: EU082088.2). Comparison of the genome sequences DEV 2085 and VAC with partial sequences of the virulent CHv strain and the attenuated strain Clone-03 was carried out to identify nucleotide or amino acid polymorphisms that potentially contribute to DEV virulence. No amino acid changes were identified in 24 of the 78 ORFs, a result indicating high conservation in DEV independently of strain origin or virulence. In addition, 39 ORFs contain non-synonymous nucleotide substitutions, while 15 ORFs had nucleotide insertions or deletions, frame-shift mutations and/or non-synonymous nucleotide substitutions with an effect on ORF initiation or termination. In 7 of the 15 ORFs with high and 27 of the 39 ORFs with low variability, polymorphisms were exclusively found in DEV 2085, a finding that likely is a result of a different origin of 2085 (Europe) or VAC, Clone-03 and CHv (Eastern Asia). Five ORFs (UL2, UL12, US10, UL47 and UL41) with polymorphisms were identical between the virulent DEV 2085 and CHv but different from VAC or Clone-03. They, individually or in combination, may therefore represent DEV virulence factors. Our comparative analysis of four DEV sequences provides a comprehensive overview of DEV genome structure and identifies ORFs that are changed during serial virus passage.

  3. Weighting in sequence space: A comparison of methods in terms of generalized sequences

    SciTech Connect

    Vingron, M. ); Sibbald, P.R. )

    1993-10-01

    Four methods for weighting aligned biological sequences have recently appeared that differ mathematically, philosophically, and in their results. Thus, while there is consensus about the need to weight sequences, the method to use is contentious. A geometric analysis based on a continuous sequence space is presented that provides a common framework in which to compare the methods. It is concluded that there are two best' methods. When the sequences are known to be phylogenetically related and a tree can be generated without introducing excessive stress into the data, the method of Altschul et al. [Altschul, S.F., Carroll, R.J. Lipman, D.J. (1989) J. Mol. Biol. 207, 647-653] is appropriate. When the sequences are not known to be phylogenetically related or a tree cannot be produced without unduly distorting the distances between the sequences, a modification of the method of Sibbald and Argos [Sibbald, P.R. Argos, p. (1990) J. Mol. Biol. 216, 813-818] is preferable. 29 refs., 3 figs., 2 tabs.

  4. 3D reconstruction software comparison for short sequences

    NASA Astrophysics Data System (ADS)

    Strupczewski, Adam; Czupryński, BłaŻej

    2014-11-01

    Large scale multiview reconstruction is recently a very popular area of research. There are many open source tools that can be downloaded and run on a personal computer. However, there are few, if any, comparisons between all the available software in terms of accuracy on small datasets that a single user can create. The typical datasets for testing of the software are archeological sites or cities, comprising thousands of images. This paper presents a comparison of currently available open source multiview reconstruction software for small datasets. It also compares the open source solutions with a simple structure from motion pipeline developed by the authors from scratch with the use of OpenCV and Eigen libraries.

  5. Sequence comparison and classification of beet luteovirus isolates.

    PubMed

    de Miranda, J R; Stevens, M; de Bruyne, E; Smith, H G; Bird, C; Hull, R

    1995-01-01

    Three distinct sequence groups were found among partial nucleotide sequences of 38 isolates of beet western yellows virus (BWYV) and beet mild yellowing virus (BMYV) from Europe, Iran and the USA. The first group contains both sugar beet and oilseed rape specific isolates, and the differentiating characteristic linked to this host range specificity are 2 single base pair changes in a 1,200 nucleotide region of the genome. It is proposed that the European BWYV strains that can be transferred at low frequency between rape and sugar beet belong to this group. Also belonging to this group are the published BWYV sequences of Veidt et al. and of the California BWYV-ST9 isolate. The second group contains mostly rape-derived isolates which have an intergenic region highly distinct from that of group-1 isolates but similar polymerase and coat protein regions. It is proposed that the rape-specific BWYV isolates which cannot be transmitted to sugar beet belong to this group. The third group contains mostly beet-specific isolates from Southern Europe and Iran, and may be adapted to the Mediterranean climate and flora. It is distinct from groups 1 and 2 in all three genome regions investigated and its polymerase and intergenic regions are as much related to those of potato leafroll virus (PLRV) and curcurbit aphid borne yellows virus (CABYV) as they are to those of group-1 and group-2. On the basis of sequence similarities and established nomenclature it is proposed to use BWYV for groups 1 and 2 (BWYV-1 and BWYV-2 respectively) and to use BMYV for group-3 isolates, which are distinct enough from the other two groups to merit a separate nomenclature.

  6. Trichomonas vaginalis acidic phospholipase A2: isolation and partial amino acid sequence.

    PubMed

    Escobedo-Guajardo, Brenda L; González-Salazar, Francisco; Palacios-Corona, Rebeca; Torres de la Cruz, Víctor M; Morales-Vallarta, Mario; Mata-Cárdenas, Benito D; Garza-González, Jesús N; Rivera-Silva, Gerardo; Vargas-Villarreal, Javier

    2013-12-01

    Sexually transmitted diseases are a major cause of acute disease worldwide, and trichomoniasis is the most common and curable disease, generating more than 170 million cases annually worldwide. Trichomonas vaginalis is the causal agent of trichomoniasis and has the ability to destroy in vitro cell monolayers of the vaginal mucosa, where the phospholipases A2 (PLA2) have been reported as potential virulence factors. These enzymes have been partially characterized from the subcellular fraction S30 of pathogenic T. vaginalis strains. The main objective of this study was to purify a phospholipase A2 from T. vaginalis, make a partial characterization, obtain a partial amino acid sequence, and determine its enzymatic participation as hemolytic factor causing lysis of erythrocytes. Trichomonas S30, RF30 and UFF30 sub-fractions from GT-15 strain have the capacity to hydrolyze [2-(14)C-PA]-PC at pH 6.0. Proteins from the UFF30 sub-fraction were separated by affinity chromatography into two eluted fractions with detectable PLA A2 activity. The EDTA-eluted fraction was analyzed by HPLC using on-line HPLC-tandem mass spectrometry and two protein peaks were observed at 8.2 and 13 kDa. Peptide sequences were identified from the proteins present in the eluted EDTA UFF30 fraction; bioinformatic analysis using Protein Link Global Server charged with T. vaginalis protein database suggests that eluted peptides correspond a putative ubiquitin protein in the 8.2 kDa fraction and a phospholipase preserved in the 13 kDa fraction. The EDTA-eluted fraction hydrolyzed [2-(14)C-PA]-PC lyses erythrocytes from Sprague-Dawley in a time and dose-dependent manner. The acidic hemolytic activity decreased by 84% with the addition of 100 μM of Rosenthal's inhibitor.

  7. The amino acid sequence of Ole e I, the major allergen from olive tree (Olea europaea) pollen.

    PubMed

    Villalba, M; Batanero, E; López-Otín, C; Sánchez, L M; Monsalve, R I; González de la Peña, M A; Lahoz, C; Rodríguez, R

    1993-09-15

    The complete primary structure of the major allergen from Olea europaea (olive tree) pollen, Ole e I (IUIS nomenclature), has been determined. The amino acid sequence was established by automated Edman degradation of the reduced and alkylated molecule as well as of selected fragments obtained by proteolytic digestions. Ole e I contains a single polypeptide chain of 145 amino acid residues with a calculated molecular mass of 16331 Da. No free sulfhydryl groups have been detected in the native protein. The molecule contains a putative glycosylation site. A high degree of microheterogeneity has been observed, mainly centered in the first 33% of the molecule. Comparison of Ole e I sequence with protein sequence databases showed no similarity with other known allergens. However, it has a 36% and 38% sequence identity with the putative polypeptide structures, deduced, respectively, from nucleotide sequences of genes isolated from tomato anthers and corn pollen, which have been suggested to be involved in the growing of the pollen tube. Therefore, the olive tree allergen may be a constitutive protein of the pollen involved in reproductive functions.

  8. Close Sequence Comparisons are Sufficient to Identify Humancis-Regulatory Elements

    SciTech Connect

    Prabhakar, Shyam; Poulin, Francis; Shoukry, Malak; Afzal, Veena; Rubin, Edward M.; Couronne, Olivier; Pennacchio, Len A.

    2005-12-01

    Cross-species DNA sequence comparison is the primary method used to identify functional noncoding elements in human and other large genomes. However, little is known about the relative merits of evolutionarily close and distant sequence comparisons, due to the lack of a universal metric for sequence conservation, and also the paucity of empirically defined benchmark sets of cis-regulatory elements. To address this problem, we developed a general-purpose algorithm (Gumby) that detects slowly-evolving regions in primate, mammalian and more distant comparisons without requiring adjustment of parameters, and ranks conserved elements by P-value using Karlin-Altschul statistics. We benchmarked Gumby predictions against previously identified cis-regulatory elements at diverse genomic loci, and also tested numerous extremely conserved human-rodent sequences for transcriptional enhancer activity using reporter-gene assays in transgenic mice. Human regulatory elements were identified with acceptable sensitivity and specificity by comparison with 1-5 other eutherian mammals or 6 other simian primates. More distant comparisons (marsupial, avian, amphibian and fish) failed to identify many of the empirically defined functional noncoding elements. We derived an intuitive relationship between ancient and recent noncoding sequence conservation from whole genome comparative analysis, which explains some of these findings. Lastly, we determined that, in addition to strength of conservation, genomic location and/or density of surrounding conserved elements must also be considered in selecting candidate enhancers for testing at embryonic time points.

  9. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-03-24

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.

  10. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.

  11. Basal Murphy belt and Chilhowee Group -- Sequence stratigraphic comparison

    SciTech Connect

    Aylor, J.G. Jr. . Dept. of Geology)

    1994-03-01

    The lower Murphy belt in the central western Blue Ridge is interpreted to be correlative to the Early Cambrian Chilhowee Group of the westernmost Blue Ridge and Appalachian fold and thrust belt. Basal Murphy belt depositional sequence stratigraphy represents a second-order, type-2 transgressive systems tract initiated with deposition of lowstand turbidites of the Dean Formation. These transgressive deposits of the Nantahala and Brasstown Formations are interpreted as middle to outer continental shelf deposits. Cyclic and stacked third-order regressive, coarsening upwards sequences of the Nantahala Formation display an overall increase in feldspar content stratigraphically upsection. These transgressive siliciclastic deposits are interpreted to be conformably overlain by a carbonate highstand systems tract of the Murphy Marble. Palinspastic reconstruction indicates that the Nantahala and Brasstown Formations possibly represent a basinward extension of up to 3 km thick siliciclastic wedge. The wedge tapers to the southwest along the strike of the Murphy belt at 10[degree] and thins northwestward to 2 km in the Tennessee depocenter where it is represented by the Chilhowee Group. The Murphy belt basin is believed to represent a transitional rift-to-drift facies deposited on the lower plate of the southern Blue Ridge rift zone.

  12. Molecular Evolution of the Escherichia Coli Chromosome. IV. Sequence Comparisons

    PubMed Central

    Milkman, R.; Bridges, M. M.

    1993-01-01

    DNA sequences have been compared in a 4,400-bp region for Escherichia coli K12 and 36 ECOR strains. Discontinuities in degree of similarity, previously inferred, are confirmed in detail. Three clonal frames are described on the basis of the present local high-resolution data, as well as previous analyses of restriction fragment length polymorphism (RFLP) and of multilocus enzyme electrophoresis (MLEE) covering small regions more widely dispersed on the chromosome. These three approaches show important consistency. The data illustrate the fact that, in the limited context of intraspecific genomic sequence variation, clonality and homology are synonymous. Two estimable quantitative properties are defined: recency of common ancestry (the reciprocal of the log(10) of the number of generations since the most recent common ancestor), and the number of nucleotide pairs over which a given recency of common ancestry applies. In principle, these parameters are measures of the degree and physical extent of homology. The small size of apparent recombinational replacements, together with the observation that they occasionally occur in discontinuous series, raises the question of whether they result from the superimposition of replacements of much larger size (as expected from an elementary interpretation of conjugation and transduction in experimental E. coli systems) or via an alternative mechanism. Length polymorphisms of several sorts are described. PMID:8095913

  13. The amino acid sequence of protein CM-3 from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J

    1985-01-01

    Protein CM-3 from Dendroaspis polylepis polylepis venom was purified by gel filtration and ion exchange chromatography. It comprises 65 amino acids including eight half-cystines. The complete amino acid sequence of protein CM-3 has been elucidated. The sequence (residues 1-50) resembles that of the N-terminal sequence of the subunits of a synergistic type protein and residues 51-65 that of the C-terminal sequence of an angusticeps type protein. Mixtures of protein CM-3 and angusticeps type proteins showed no apparent synergistic effect, in that their toxicity in combination was no greater than the sum of their individual toxicities.

  14. Alignment-free Transcriptomic and Metatranscriptomic Comparison Using Sequencing Signatures with Variable Length Markov Chains

    PubMed Central

    Liao, Weinan; Ren, Jie; Wang, Kun; Wang, Shun; Zeng, Feng; Wang, Ying; Sun, Fengzhu

    2016-01-01

    The comparison between microbial sequencing data is critical to understand the dynamics of microbial communities. The alignment-based tools analyzing metagenomic datasets require reference sequences and read alignments. The available alignment-free dissimilarity approaches model the background sequences with Fixed Order Markov Chain (FOMC) yielding promising results for the comparison of microbial communities. However, in FOMC, the number of parameters grows exponentially with the increase of the order of Markov Chain (MC). Under a fixed high order of MC, the parameters might not be accurately estimated owing to the limitation of sequencing depth. In our study, we investigate an alternative to FOMC to model background sequences with the data-driven Variable Length Markov Chain (VLMC) in metatranscriptomic data. The VLMC originally designed for long sequences was extended to apply to high-throughput sequencing reads and the strategies to estimate the corresponding parameters were developed. The flexible number of parameters in VLMC avoids estimating the vast number of parameters of high-order MC under limited sequencing depth. Different from the manual selection in FOMC, VLMC determines the MC order adaptively. Several beta diversity measures based on VLMC were applied to compare the bacterial RNA-Seq and metatranscriptomic datasets. Experiments show that VLMC outperforms FOMC to model the background sequences in transcriptomic and metatranscriptomic samples. A software pipeline is available at https://d2vlmc.codeplex.com. PMID:27876823

  15. Reconstruction of an ancestral Yersinia pestis genome and comparison with an ancient sequence

    PubMed Central

    2015-01-01

    Background We propose the computational reconstruction of a whole bacterial ancestral genome at the nucleotide scale, and its validation by a sequence of ancient DNA. This rare possibility is offered by an ancient sequence of the late middle ages plague agent. It has been hypothesized to be ancestral to extant Yersinia pestis strains based on the pattern of nucleotide substitutions. But the dynamics of indels, duplications, insertion sequences and rearrangements has impacted all genomes much more than the substitution process, which makes the ancestral reconstruction task challenging. Results We use a set of gene families from 13 Yersinia species, construct reconciled phylogenies for all of them, and determine gene orders in ancestral species. Gene trees integrate information from the sequence, the species tree and gene order. We reconstruct ancestral sequences for ancestral genic and intergenic regions, providing nearly a complete genome sequence for the ancestor, containing a chromosome and three plasmids. Conclusion The comparison of the ancestral and ancient sequences provides a unique opportunity to assess the quality of ancestral genome reconstruction methods. But the quality of the sequencing and assembly of the ancient sequence can also be questioned by this comparison. PMID:26450112

  16. A comparison of several similarity indices used in the classification of protein sequences: a multivariate analysis.

    PubMed Central

    Landès, C; Hénaut, A; Risler, J L

    1992-01-01

    The present work describes an attempt to identify reliable criteria which could be used as distance indices between protein sequences. Seven different criteria have been tested: i and ii) the scores of the alignments as given by the BESTFIT and the FASTA programs; iii) the ratio parameter, i.e. the BESTFIT score divided by the length of the aligned peptides; iv and v) the statistical significance (Z-scores) of the scores calculated by BESTFIT and FASTA, as obtained by comparison with shuffled sequences; vi) the Z-scores provided by the program RELATE which performs a segment-by-segment comparison of 2 sequences, and vii) an original distance index calculated by the program DOCMA from all the pairwise dotplots between the sequences. These 7 criteria have been tested against the aminoacid sequences of 39 globins and those of the 20 aminoacyl-tRNA synthetases from E. coli. The distances between the sequences were analyzed by the multivariate analysis techniques. The results show that the distances calculated from the scores of the pairwise alignments are not adequately sensitive. The Z-score from RELATE is not selective enough and too demanding in computer time. Three criteria gave a classification consistent with the known similarities between the sequences in the sets, namely the Z-scores from BESTFIT and FASTA and the multiple dotplot comparison distance index from DOCMA. PMID:1641329

  17. Primary structure of rat secretory acid phosphatase and comparison to other acid phosphatases.

    PubMed

    Roiko, K; Jänne, O A; Vihko, P

    1990-05-14

    Overlapping cDNA clones encoding rat prostatic acid phosphatase (rPAP) were isolated by using two human prostatic acid phosphatase (hPAP)-encoding cDNAs to screen rat prostatic cDNA libraries. The isolated cDNAs encompassed a total of 1626 nucleotides (nt), of which 1143 nt corresponded to the protein coding sequence encoding a mature polypeptide of 350 amino acids (aa) and a 31-aa long signal peptide-like sequence. The deduced Mr of the mature rPAP was 40,599. RNA blot analysis indicated the presence of three mRNA species (4.9, 2.3 and 1.5 kb in size) in the rat prostate. The deduced aa sequences of rPAP and hPAP show 75% identity, whereas the similarity between rPAP and human lysosomal acid phosphatase (hLAP) is only 45%. Furthermore, the sequence similarity between rPAP and rat lysosomal acid phosphatase (rLAP) is 46% at the aa level. Similar to hPAP, but unlike hLAP and rLAP, the rPAP sequence lacks a membrane-anchoring domain indicating the secretory character of this phosphatase. All six cysteines present in the overlapping areas of the mature rPAP, hPAP, rLAP and hLAP proteins are positionally conserved, suggesting that these residues are important for the tertiary structure of acid phosphatases (APs). The previously reported active site residues, two arginines and one histidine, are also conserved in these APs.

  18. The Chinese hamster Alu-equivalent sequence: a conserved highly repetitious, interspersed deoxyribonucleic acid sequence in mammals has a structure suggestive of a transposable element.

    PubMed Central

    Haynes, S R; Toomey, T P; Leinwand, L; Jelinek, W R

    1981-01-01

    A consensus sequence has been determined for a major interspersed deoxyribonucleic acid repeat in the genome of Chinese hamster ovary cells (CHO cells). This sequence is extensively homologous to (i) the human Alu sequence (P. L. Deininger et al., J. Mol. Biol., in press), (ii) the mouse B1 interspersed repetitious sequence (Krayev et al., Nucleic Acids Res. 8:1201-1215, 1980) (iii) an interspersed repetitious sequence from African green monkey deoxyribonucleic acid (Dhruva et al., Proc. Natl. Acad. Sci. U.S.A. 77:4514-4518, 1980) and (iv) the CHO and mouse 4.5S ribonucleic acid (this report; F. Harada and N. Kato, Nucleic Acids Res. 8:1273-1285, 1980). Because the CHO consensus sequence shows significant homology to the human Alu sequence it is termed the CHO Alu-equivalent sequence. A conserved structure surrounding CHO Alu-equivalent family members can be recognized. It is similar to that surrounding the human Alu and the mouse B1 sequences, and is represented as follows: direct repeat-CHO-Alu-A-rich sequence-direct repeat. A composite interspersed repetitious sequence has been identified. Its structure is represented as follows: direct repeat-residue 47 to 107 of CHO-Alu-non-Alu repetitious sequence-A-rich sequence-direct repeat. Because the Alu flanking sequences resemble those that flank known transposable elements, we think it likely that the Alu sequence dispersed throughout the mammalian genome by transposition. Images PMID:9279371

  19. Triose phosphate isomerase from the coelacanth. An approach to the rapid determination of an amino acid sequence with small amounts of material.

    PubMed

    Kolb, E; Harris, J I; Bridgen, J

    1974-02-01

    The preparation and purification of cyanogen bromide fragments from [(14)C]carboxymethylated coelacanth triose phosphate isomerase is presented. The automated sequencing of these fragments, the lysine-blocked tryptic peptides derived from them, and also of the intact protein, is described. Combination with results from manual sequence analysis has given the 247-residue amino acid sequence of coelacanth triose phosphate isomerase in 4 months, by using 100mg of enzyme. (Two small adjacent peptides were placed by homology with the rabbit enzyme.) Comparison of this sequence with that of the rabbit muscle enzyme shows that 207 (84%) of the residues are identical. This slow rate of evolutionary change (corresponding to two amino acid substitutions per 100 residues per 100 million years) is similar to that found for glyceraldehyde 3-phosphate dehydrogenase. The reliability of sequence information obtained by automated methods is discussed.

  20. Nucleotide sequence of a cloned woodchuck hepatitis virus genome: comparison with the hepatitis B virus sequence.

    PubMed Central

    Galibert, F; Chen, T N; Mandart, E

    1982-01-01

    The complete nucleotide sequence of a woodchuck hepatitis virus genome cloned in Escherichia coli was determined by the method of Maxam and Gilbert. This sequence was found to be 3,308 nucleotides long. Potential ATG initiator triplets and nonsense codons were identified and used to locate regions with a substantial coding capacity. A striking similarity was observed between the organization of human hepatitis B virus and woodchuck hepatitis virus. Nucleotide sequences of these open regions in the woodchuck virus were compared with corresponding regions present in hepatitis B virus. This allowed the location of four viral genes on the L strand and indicated the absence of protein coded by the S strand. Evolution rates of the various parts of the genome as well as of the four different proteins coded by hepatitis B virus and woodchuck hepatitis virus were compared. These results indicated that: (i) the core protein has evolved slightly less rapidly than the other proteins; and (ii) when a region of DNA codes for two different proteins, there is less freedom for the DNA to evolve and, moreover, one of the proteins can evolve more rapidly than the other. A hairpin structure, very well conserved in the two genomes, was located in the only region devoid of coding function, suggesting the location of the origin of replication of the viral DNA. Images PMID:7086958

  1. Protein Sequence Comparison Based on Physicochemical Properties and the Position-Feature Energy Matrix

    PubMed Central

    Yu, Lulu; Zhang, Yusen; Gutman, Ivan; Shi, Yongtang; Dehmer, Matthias

    2017-01-01

    We develop a novel position-feature-based model for protein sequences by employing physicochemical properties of 20 amino acids and the measure of graph energy. The method puts the emphasis on sequence order information and describes local dynamic distributions of sequences, from which one can get a characteristic B-vector. Afterwards, we apply the relative entropy to the sequences representing B-vectors to measure their similarity/dissimilarity. The numerical results obtained in this study show that the proposed methods leads to meaningful results compared with competitors such as Clustal W. PMID:28393857

  2. The amino acid sequence of goat beta-lactoglobulin.

    PubMed

    Préaux, G; Braunitzer, G; Schrank, B; Stangl, A

    1979-11-01

    The isolation of beta-lactoglobulin from milk of the goat is described. The purified protein was checked for purity and has been characterized by its gross composition and end groups. The native or the modified protein was then degraded by tryptic and cyanogen bromide cleavage. The cleavage products were isolated and sequenced in the sequenator using a Quadrol and propyne program. These data provide the complete sequence of beta-lactoglobulin of the goat. The results are discussed and compared particularly with bovine beta-lactoglobulin components AB. Some biological aspects are described.

  3. Layered materials with coexisting acidic and basic sites for catalytic one-pot reaction sequences.

    PubMed

    Motokura, Ken; Tada, Mizuki; Iwasawa, Yasuhiro

    2009-06-17

    Acidic montmorillonite-immobilized primary amines (H-mont-NH(2)) were found to be excellent acid-base bifunctional catalysts for one-pot reaction sequences, which are the first materials with coexisting acid and base sites active for acid-base tamdem reactions. For example, tandem deacetalization-Knoevenagel condensation proceeded successfully with the H-mont-NH(2), affording the corresponding condensation product in a quantitative yield. The acidity of the H-mont-NH(2) was strongly influenced by the preparation solvent, and the base-catalyzed reactions were enhanced by interlayer acid sites.

  4. alpha. -Amylase of Clostridium thermosulfurogenes EM1: Nucleotide sequence of the gene, processing of the enzyme, and comparison to other. alpha. -amylases

    SciTech Connect

    Bahl, H.; Burchhardt, G.; Spreinat, A.; Haeckel, K.; Wienecke, A.; Antranikian, G.; Schmidt, B. )

    1991-05-01

    The nucleotide sequence of the {alpha}-amylase gene (amyA) from Clostridium thermosulfurogenes EM1 cloned in Escherichia coli was determined. The reading frame of the gene consisted of 2,121 bp. Comparison of the DNA sequence data with the amino acid sequence of the N terminus of the purified secreted protein of C. thermosulfurogenes Em1 suggested that the {alpha}-amylase is translated form mRNA as a secretory precursor with a signal peptide of 27 amino acid residues. The deduced amino acid sequence of the mature {alpha}-amylase contained 679 residues, resulting in a protein with a molecular mass of 75,112 Da. In E. coli the enzyme was transported to the periplasmic space and the signal peptide was cleaved at exactly the same site between two alanine residues. Comparison of the amino acid sequence of the C. thermosulfurogenes EM1 {alpha}-amylase with those from other bacterial and eukaryotic {alpha}-amylases showed several homologous regions, probably in the enzymatically functioning regions. The tentative Ca{sup 2+}-binding site (consensus region I) of this Ca{sub 2+}-independent enzyme showed only limited homology. The deduced amino acid sequence of a second obviously truncated open reading frame showed significant homology to the malG gene product of E. coli. Comparison of the {alpha}-amylase gene region of C. thermosulfurogenes EM1 (DSM3896) with the {beta}-amylase gene region of C. thermosulfurogenes (ATCC 33743) indicated that both genes have been exchanged with each other at identical sites in the chromosomes of these strains.

  5. alpha-Amylase of Clostridium thermosulfurogenes EM1: nucleotide sequence of the gene, processing of the enzyme, and comparison of other alpha-amylases.

    PubMed Central

    Bahl, H; Burchhardt, G; Spreinat, A; Haeckel, K; Wienecke, A; Schmidt, B; Antranikian, G

    1991-01-01

    The nucleotide sequence of the alpha-amylase gene (amyA) from Clostridium thermosulfurogenes EM1 cloned in Escherichia coli was determined. The reading frame of the gene consisted of 2,121 bp. Comparison of the DNA sequence data with the amino acid sequence of the N terminus of the purified secreted protein of C. thermosulfurogenes EM1 suggested that the alpha-amylase is translated from mRNA as a secretory precursor with a signal peptide of 27 amino acid residues. The deduced amino acid sequence of the mature alpha-amylase contained 679 residues, resulting in a protein with a molecular mass of 75,112 Da. In E. coli the enzyme was transported to the periplasmic space and the signal peptide was cleaved at exactly the same site between two alanine residues. Comparison of the amino acid sequence of the C. thermosulfurogenes EM1 alpha-amylase with those from other bacterial and eucaryotic alpha-amylases showed several homologous regions, probably in the enzymatically functioning regions. The tentative Ca(2+)-binding site (consensus region I) of this Ca(2+)-independent enzyme showed only limited homology. The deduced amino acid sequence of a second obviously truncated open reading frame showed significant homology to the malG gene product of E. coli. Comparison of the alpha-amylase gene region of C. thermosulfurogenes EM1 (DSM3896) with the beta-amylase gene region of C. thermosulfurogenes (ATCC 33743) indicated that both genes have been exchanged with each other at identical sites in the chromosomes of these strains. PMID:1854207

  6. Nucleotide sequence of murine PCNA: interspecies comparison of the cDNA and the 5' flanking region of the gene.

    PubMed

    Shipman-Appasamy, P M; Cohen, K S; Prystowsky, M B

    1991-01-01

    Proliferating cell nuclear antigen (PCNA) RNA levels are regulated by transcription as well as changes in stability, in growing cells. We have cloned the murine PCNA cDNA and a fragment of the murine PCNA gene flanking the transcription initiation site. Comparison of the murine deduced amino acid sequence with the PCNA sequence from rat, human, Drosophila, Saccharomyces cerevisiae, and higher plants, reveals extensive homology between species. The homology is likely to be related to the fundamental role of PCNA as an auxiliary protein for DNA replication. Consensus sequences for transcriptional regulatory factors identified within 520 bp 5' of the cap site of the murine PCNA gene include: an inverted CCAAT site, an enhancer core element (EBP-1), three cAMP-response elements (CRE-BP), one AP-2 site, three Sp1 sites, and two octamer sequences. The first 20 bp of the transcriptional unit are homologous to an initiator element, which may direct transcription from RNA polymerase II in the absence of a TATAA box. The consensus elements in the murine PCNA gene are similar in sequence and/or location to elements identified in the genes for human, Drosophilia, and yeast PCNA.

  7. Improvements to pairwise sequence comparison (PASC): a genome-based web tool for virus classification.

    PubMed

    Bao, Yiming; Chetvernin, Vyacheslav; Tatusova, Tatiana

    2014-12-01

    The number of viral genome sequences in the public databases is increasing dramatically, and these sequences are playing an important role in virus classification. Pairwise sequence comparison is a sequence-based virus classification method. A program using this method calculates the pairwise identities of virus sequences within a virus family and displays their distribution, and visual analysis helps to determine demarcations at different taxonomic levels such as strain, species, genus and subfamily. Subsequent comparison of new sequences against existing ones allows viruses from which the new sequences were derived to be classified. Although this method cannot be used as the only criterion for virus classification in some cases, it is a quantitative method and has many advantages over conventional virus classification methods. It has been applied to several virus families, and there is an increasing interest in using this method for other virus families/groups. The Pairwise Sequence Comparison (PASC) classification tool was created at the National Center for Biotechnology Information. The tool's database stores pairwise identities for complete genomes/segments of 56 virus families/groups. Data in the system are updated every day to reflect changes in virus taxonomy and additions of new virus sequences to the public database. The web interface of the tool ( http://www.ncbi.nlm.nih.gov/sutils/pasc/ ) makes it easy to navigate and perform analyses. Multiple new viral genome sequences can be tested simultaneously with this system to suggest the taxonomic position of virus isolates in a specific family. PASC eliminates potential discrepancies in the results caused by different algorithms and/or different data used by researchers.

  8. Synthesis of gamma,delta-unsaturated glycolic acids via sequenced brook and Ireland--claisen rearrangements.

    PubMed

    Schmitt, Daniel C; Johnson, Jeffrey S

    2010-03-05

    Organozinc, -magnesium, and -lithium nucleophiles initiate a Brook/Ireland-Claisen rearrangement sequence of allylic silyl glyoxylates resulting in the formation of gamma,delta-unsaturated alpha-silyloxy acids.

  9. Computer Simulation of the Determination of Amino Acid Sequences in Polypeptides

    ERIC Educational Resources Information Center

    Daubert, Stephen D.; Sontum, Stephen F.

    1977-01-01

    Describes a computer program that generates a random string of amino acids and guides the student in determining the correct sequence of a given protein by using experimental analytic data for that protein. (MLH)

  10. An efficient binomial model-based measure for sequence comparison and its application.

    PubMed

    Liu, Xiaoqing; Dai, Qi; Li, Lihua; He, Zerong

    2011-04-01

    Sequence comparison is one of the major tasks in bioinformatics, which could serve as evidence of structural and functional conservation, as well as of evolutionary relations. There are several similarity/dissimilarity measures for sequence comparison, but challenges remains. This paper presented a binomial model-based measure to analyze biological sequences. With help of a random indicator, the occurrence of a word at any position of sequence can be regarded as a random Bernoulli variable, and the distribution of a sum of the word occurrence is well known to be a binomial one. By using a recursive formula, we computed the binomial probability of the word count and proposed a binomial model-based measure based on the relative entropy. The proposed measure was tested by extensive experiments including classification of HEV genotypes and phylogenetic analysis, and further compared with alignment-based and alignment-free measures. The results demonstrate that the proposed measure based on binomial model is more efficient.

  11. Naked but Not Hairless: The Pitfalls of Analyses of Molecular Adaptation Based on Few Genome Sequence Comparisons

    PubMed Central

    Delsuc, Frédéric; Tilak, Marie-Ka

    2015-01-01

    The naked mole-rat (Heterocephalus glaber) is the only rodent species that naturally lacks fur. Genome sequencing of this atypical rodent species recently shed light on a number of its morphological and physiological adaptations. More specifically, its hairless phenotype has been traced back to a single amino acid change (C397W) in the hair growth associated (HR) protein (or Hairless). By considering the available species diversity, we show that this specific position is in fact variable across mammals, including in the horse that was misleadingly reported to have the ancestral Cysteine. Moreover, by sequencing the corresponding HR exon in additional rodent species, we demonstrate that the C397W substitution is actually not a peculiarity of the naked mole-rat. Instead, this specific amino acid substitution is present in all hystricognath rodents investigated, which are all fully furred, including the naked mole-rat closest relative, the Damaraland mole-rat (Fukomys damarensis). Overall, we found no statistical correlation between amino acid changes at position 397 of the HR protein and reduced pilosity across the mammalian phylogeny. This demonstrates that this single amino acid change does not explain the naked mole-rat hairless phenotype. Our case study calls for caution before making strong claims regarding the molecular basis of phenotypic adaptation based on the screening of specific amino acid substitutions using only few model species in genome sequence comparisons. It also exposes the more general problem of the dilution of essential information in the supplementary material of genome papers thereby increasing the probability that misleading results will escape the scrutiny of editors, reviewers, and ultimately readers. PMID:25714745

  12. Naked but not Hairless: the pitfalls of analyses of molecular adaptation based on few genome sequence comparisons.

    PubMed

    Delsuc, Frédéric; Tilak, Marie-Ka

    2015-02-20

    The naked mole-rat (Heterocephalus glaber) is the only rodent species that naturally lacks fur. Genome sequencing of this atypical rodent species recently shed light on a number of its morphological and physiological adaptations. More specifically, its hairless phenotype has been traced back to a single amino acid change (C397W) in the hair growth associated (HR) protein (or Hairless). By considering the available species diversity, we show that this specific position is in fact variable across mammals, including in the horse that was misleadingly reported to have the ancestral Cysteine. Moreover, by sequencing the corresponding HR exon in additional rodent species, we demonstrate that the C397W substitution is actually not a peculiarity of the naked mole-rat. Instead, this specific amino acid substitution is present in all hystricognath rodents investigated, which are all fully furred, including the naked mole-rat closest relative, the Damaraland mole-rat (Fukomys damarensis). Overall, we found no statistical correlation between amino acid changes at position 397 of the HR protein and reduced pilosity across the mammalian phylogeny. This demonstrates that this single amino acid change does not explain the naked mole-rat hairless phenotype. Our case study calls for caution before making strong claims regarding the molecular basis of phenotypic adaptation based on the screening of specific amino acid substitutions using only few model species in genome sequence comparisons. It also exposes the more general problem of the dilution of essential information in the supplementary material of genome papers thereby increasing the probability that misleading results will escape the scrutiny of editors, reviewers, and ultimately readers.

  13. Cloning, sequence analysis, and expression in Escherichia coli of the gene encoding an alpha-amino acid ester hydrolase from Acetobacter turbidans.

    PubMed

    Polderman-Tijmes, Jolanda J; Jekel, Peter A; de Vries, Erik J; van Merode, Annet E J; Floris, René; van der Laan, Jan-Metske; Sonke, Theo; Janssen, Dick B

    2002-01-01

    The alpha-amino acid ester hydrolase from Acetobacter turbidans ATCC 9325 is capable of hydrolyzing and synthesizing beta-lactam antibiotics, such as cephalexin and ampicillin. N-terminal amino acid sequencing of the purified alpha-amino acid ester hydrolase allowed cloning and genetic characterization of the corresponding gene from an A. turbidans genomic library. The gene, designated aehA, encodes a polypeptide with a molecular weight of 72,000. Comparison of the determined N-terminal sequence and the deduced amino acid sequence indicated the presence of an N-terminal leader sequence of 40 amino acids. The aehA gene was subcloned in the pET9 expression plasmid and expressed in Escherichia coli. The recombinant protein was purified and found to be dimeric with subunits of 70 kDa. A sequence similarity search revealed 26% identity with a glutaryl 7-ACA acylase precursor from Bacillus laterosporus, but no homology was found with other known penicillin or cephalosporin acylases. There was some similarity to serine proteases, including the conservation of the active site motif, GXSYXG. Together with database searches, this suggested that the alpha-amino acid ester hydrolase is a beta-lactam antibiotic acylase that belongs to a class of hydrolases that is different from the Ntn hydrolase superfamily to which the well-characterized penicillin acylase from E. coli belongs. The alpha-amino acid ester hydrolase of A. turbidans represents a subclass of this new class of beta-lactam antibiotic acylases.

  14. Genome sequence of the acid-tolerant strain Rhizobium sp. LPU83.

    PubMed

    Wibberg, Daniel; Tejerizo, Gonzalo Torres; Del Papa, María Florencia; Martini, Carla; Pühler, Alfred; Lagares, Antonio; Schlüter, Andreas; Pistorio, Mariano

    2014-04-20

    Rhizobia are important members of the soil microbiome since they enter into nitrogen-fixing symbiosis with different legume host plants. Rhizobium sp. LPU83 is an acid-tolerant Rhizobium strain featuring a broad-host-range. However, it is ineffective in nitrogen fixation. Here, the improved draft genome sequence of this strain is reported. Genome sequence information provides the basis for analysis of its acid tolerance, symbiotic properties and taxonomic classification.

  15. The amino acid sequence of monal pheasant lysozyme and its activity.

    PubMed

    Araki, T; Matsumoto, T; Torikata, T

    1998-10-01

    The amino acid sequence of monal pheasant lysozyme and its activity were analyzed. Carboxymethylated lysozyme was digested with trypsin and the resulting peptides were sequenced. The established amino acid sequence had one amino acid substitution at position 102 (Arg to Gly) comparing with Indian peafowl lysozyme and four amino acid substitutions at positions 3 (Phe to Tyr), 15 (His to Leu), 41 (Gln to His), and 121 (Gln to His) with chicken lysozyme. Analysis of the time-courses of reaction using N-acetylglucosamine pentamer as a substrate showed a difference of binding free energy change (-0.4 kcal/mol) at subsites A between monal pheasant and Indian peafowl lysozyme. This was assumed to be caused by the amino acid substitution at subsite A with loss of a positive charge at position 102 (Arg102 to Gly).

  16. Single-chain structure of human ceruloplasmin: the complete amino acid sequence of the whole molecule.

    PubMed Central

    Takahashi, N; Ortel, T L; Putnam, F W

    1984-01-01

    We have determined the amino acid sequence of the amino-terminal 67,000-dalton (67-kDa) fragment of human ceruloplasmin and have established overlapping sequences between the 67-kDa and 50-kDa fragments and between the 50-kDa and 19-kDa fragments. The 67-kDa fragment contains 480 amino acid residues and three glucosamine oligosaccharides. These results together with our previous sequence data for the 50-kDa and 19-kDa fragments complete the amino acid sequence of human ceruloplasmin. The polypeptide chain has a total of 1,046 amino acid residues (Mr 120,085) and has attachment sites for four glucosamine oligosaccharides; together these account for the total molecular mass of human ceruloplasmin (132 kDa). The sequence analysis of the peptides overlapping the fragments showed that one additional amino acid, arginine, is present between the 67-kDa and 50-kDa fragments, and another, lysine, is between the 50-kDa and 19-kDa fragments. Only two apparent sites of amino acid interchange have been identified in the polypeptide chain. Both involve a single-point interchange of glycine and lysine that would result in a difference in charge. The results of the complete sequence analysis verified that human ceruloplasmin is composed of a single polypeptide chain and that the subunit-like fragments are produced by proteolytic cleavage during purification (and possibly also in vivo). PMID:6582496

  17. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    PubMed Central

    Geissler, Andreas J.; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. PMID:27795248

  18. PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.

    PubMed

    Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong; Warnow, Tandy

    2015-05-01

    We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate--slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory.

  19. SETG: Nucleic Acid Extraction and Sequencing for In Situ Life Detection on Mars

    NASA Astrophysics Data System (ADS)

    Mojarro, A.; Hachey, J.; Tani, J.; Smith, A.; Bhattaru, S. A.; Pontefract, A.; Doebler, R.; Brown, M.; Ruvkun, G.; Zuber, M. T.; Carr, C. E.

    2016-10-01

    We are developing an integrated nucleic acid extraction and sequencing instrument: the Search for Extra-Terrestrial Genomes (SETG) for in situ life detection on Mars. Our goals are to identify related or unrelated nucleic acid-based life on Mars.

  20. Draft Genome Sequence of Cyanobacterium sp. Strain IPPAS B-1200 with a Unique Fatty Acid Composition

    PubMed Central

    Starikov, Alexander Y.; Usserbaeva, Aizhan A.; Sinetova, Maria A.; Sarsekeyeva, Fariza K.; Zayadan, Bolatkhan K.; Ustinova, Vera V.; Kupriyanova, Elena V.; Los, Dmitry A.

    2016-01-01

    Here, we report the draft genome of Cyanobacterium sp. IPPAS strain B-1200, isolated from Lake Balkhash, Kazakhstan, and characterized by the unique fatty acid composition of its membrane lipids, which are enriched with myristic and myristoleic acids. The approximate genome size is 3.4 Mb, and the predicted number of coding sequences is 3,119. PMID:27856596

  1. Iranian johnsongrass mosaic virus: the complete genome sequence, molecular and biological characterization, and comparison of coat protein gene sequences.

    PubMed

    Moradi, Zohreh; Mehrvar, Mohsen; Nazifi, Ehsan; Zakiaghl, Mohammad

    2017-02-01

    Iranian johnsongrass mosaic virus (IJMV) is one of the most prevalent viruses causing maize mosaic disease in Iran. An IJMV isolate, Maz-Bah, was obtained from the maize showing mosaic symptoms in Mazandaran, north of Iran. The complete genomic sequence of Maz-Bah is 9544 nucleotides, excluding the poly(A) tail. It contains one single open reading frame of 9165 nucleotides and encodes a large polyprotein of 3054 amino acids, flanked by a 5'-untranslated region (UTR) of 143 nucleotides and a 3'-UTR of 236 nucleotides. The entire genomic sequence of Maz-Bah isolate shares identities of 84.9 and 94.2 % with the IJMV (Shz) isolate, the lone complete genome sequence available in the GenBank at the nucleotide (nt) and deduced amino acid (aa) levels, respectively. The whole genome sequences share identities of 51.5-69.8 and 44.9-74.3 % with those of other Sugarcane mosaic virus (SCMV) subgroup potyviruses at nt and aa levels, respectively. In phylogenetic trees based on the multiple alignments of the entire nt and aa sequences, IJMV isolates formed a separate sublineage of the tree with potyviruses infecting monocotyledons of cereals, indicating that IJMV is a member of SCMV subgroup of potyviruses. IJMV is most closely related to Sorghum mosaic virus and Maize dwarf mosaic virus and less closely related to the Johnsongrass mosaic virus and Cocksfoot streak virus. To further investigate the genetic relationship of IJMV, 9 other isolates from different hosts were cloned and sequenced. The identity of IJMV CP nt and aa sequences of 11 Iranian isolates ranged from 86.4 to 99.8 % and 90.5 to 99.7 %, respectively, indicating a high nt variability in CP gene. Furthermore, in the CP-based phylogenetic tree, IJMV isolates were clustered together with a maize potyvirus described as Zea mosaic virus from Israel (with 86-89 % nt identity), indicating that both isolates probably are the strains of the same virus.

  2. Sequencing and computational analysis of complete genome sequences of Citrus yellow mosaic badna virus from acid lime and pummelo.

    PubMed

    Borah, Basanta K; Johnson, A M Anthony; Sai Gopal, D V R; Dasgupta, Indranil

    2009-08-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus, is the causative agent of Citrus mosaic disease in India. Although the virus has been detected in several citrus species, only two full-length genomes, one each from Sweet orange and Rangpur lime, are available in publicly accessible databases. In order to obtain a better understanding of the genetic variability of the virus in other citrus mosaic-affected citrus species, we performed the cloning and sequence analysis of complete genomes of CMBV from two additional citrus species, Acid lime and Pummelo. We show that CMBV genomes from the two hosts share high homology with previously reported CMBV sequences and hence conclude that the new isolates represent variants of the virus present in these species. Based on in silico sequence analysis, we predict the possible function of the protein encoded by one of the five ORFs.

  3. Parvalbumins from coelacanth muscle. III. Amino acid sequence of the major component.

    PubMed

    Jauregui-Adell, J; Pechere, J F

    1978-09-26

    The primary structure of the major parvalbumin (pI = 4.52) from coelacanth muscle (Latimeria chalumnae) has been determined. Sequence analysis of the tryptic peptides, in some cases obtained with beta-trypsin, accounts for the total amino acid content of the protein. Chymotryptic peptides provide appropriate sequence overlaps, to complete the localization of the tryptic peptides. Examination of the amino acid sequence of this protein shows the typical structure of a beta-parvalbumin. Its position in the dendrogram of related calcium-binding proteins corresponds to that usually accepted for crossopterygians.

  4. Comparison of alignment software for genome-wide bisulphite sequence data

    PubMed Central

    Chatterjee, Aniruddha; Stockwell, Peter A.; Rodger, Euan J.; Morison, Ian M.

    2012-01-01

    Recent advances in next generation sequencing (NGS) technology now provide the opportunity to rapidly interrogate the methylation status of the genome. However, there are challenges in handling and interpretation of the methylation sequence data because of its large volume and the consequences of bisulphite modification. We sequenced reduced representation human genomes on the Illumina platform and efficiently mapped and visualized the data with different pipelines and software packages. We examined three pipelines for aligning bisulphite converted sequencing reads and compared their performance. We also comment on pre-processing and quality control of Illumina data. This comparison highlights differences in methods for NGS data processing and provides guidance to advance sequence-based methylation data analysis for molecular biologists. PMID:22344695

  5. A Comparison of Base-calling Algorithms for Illumina Sequencing Technology.

    PubMed

    Cacho, Ashley; Smirnova, Ekaterina; Huzurbazar, Snehalata; Cui, Xinping

    2016-09-01

    Recent advances in next-generation sequencing technology have yielded increasing cost-effectiveness and higher throughput produced per run, in turn, greatly influencing the analysis of DNA sequences. Among the various sequencing technologies, Illumina is by far the most widely used platform. However, the Illumina sequencing platform suffers from several imperfections that can be attributed to the chemical processes inherent to the sequencing-by-synthesis technology. With the enormous amounts of reads produced, statistical methodologies and computationally efficient algorithms are required to improve the accuracy and speed of base-calling. Over the past few years, several papers have proposed methods to model the various imperfections, giving rise to accurate and/or efficient base-calling algorithms. In this article, we provide a comprehensive comparison of the performance of recently developed base-callers and we present a general statistical model that unifies a large majority of these base-callers.

  6. Method comparison study for weak acid dissociation cyanide analysis.

    PubMed

    Evans, Joseph D; Thompson, Leslie; Clark, Patrick J; Beckman, Scott W

    2003-02-01

    Method comparison studies of two different methods for the analysis of weak acid dissociable (WAD) cyanide revealed analytical flaws and/or matrix interference problems with both procedures. EPA "draft" method 1677 using a Perstorp 3202 CN analyzer was compared to Standard Method 4500 CN I. It was discovered that the Perstorp analyzer produced more precise and more accurate results once appropriate and necessary procedural steps from the EPA draft method were modified. Comparison of these two methods, was based on "real world" samples collected from a mine-tailing solution. The mine-tailing solution contained high concentrations of cyanide and metals. Inconsistencies in method procedures were traced to sulfide interferences and high concentrations of WAD metals. Conclusions were based upon a large sample base collected from a mine site over a 90-day period.

  7. Analysis of cloned cDNA and genomic sequences for phytochrome: complete amino acid sequences for two gene products expressed in etiolated Avena.

    PubMed Central

    Hershey, H P; Barker, R F; Idler, K B; Lissemore, J L; Quail, P H

    1985-01-01

    Cloned cDNA and genomic sequences have been analyzed to deduce the amino acid sequence of phytochrome from etiolated Avena. Restriction endonuclease site polymorphism between clones indicates that at least four phytochrome genes are expressed in this tissue. Sequence analysis of two complete and one partial coding region shows approximately 98% homology at both the nucleotide and amino acid levels, with the majority of amino acid changes being conservative. High sequence homology is also found in the 5'-untranslated region but significant divergence occurs in the 3'-untranslated region. The phytochrome polypeptides are 1128 amino acid residues long corresponding to a molecular mass of 125 kdaltons. The known protein sequence at the chromophore attachment site occurs only once in the polypeptide, establishing that phytochrome has a single chromophore per monomer covalently linked to Cys-321. Computer analyses of the amino acid sequences have provided predictions regarding a number of structural features of the phytochrome molecule. PMID:3001642

  8. Comparison and quantitative verification of mapping algorithms for whole genome bisulfite sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Coupling bisulfite conversion with next-generation sequencing (Bisulfite-seq) enables genome-wide measurement of DNA methylation, but poses unique challenges for mapping. However, despite a proliferation of Bisulfite-seq mapping tools, no systematic comparison of their genomic coverage and quantitat...

  9. Comparison of solution-based exome capture methods for next generation sequencing

    PubMed Central

    2011-01-01

    Background Techniques enabling targeted re-sequencing of the protein coding sequences of the human genome on next generation sequencing instruments are of great interest. We conducted a systematic comparison of the solution-based exome capture kits provided by Agilent and Roche NimbleGen. A control DNA sample was captured with all four capture methods and prepared for Illumina GAII sequencing. Sequence data from additional samples prepared with the same protocols were also used in the comparison. Results We developed a bioinformatics pipeline for quality control, short read alignment, variant identification and annotation of the sequence data. In our analysis, a larger percentage of the high quality reads from the NimbleGen captures than from the Agilent captures aligned to the capture target regions. High GC content of the target sequence was associated with poor capture success in all exome enrichment methods. Comparison of mean allele balances for heterozygous variants indicated a tendency to have more reference bases than variant bases in the heterozygous variant positions within the target regions in all methods. There was virtually no difference in the genotype concordance compared to genotypes derived from SNP arrays. A minimum of 11× coverage was required to make a heterozygote genotype call with 99% accuracy when compared to common SNPs on genome-wide association arrays. Conclusions Libraries captured with NimbleGen kits aligned more accurately to the target regions. The updated NimbleGen kit most efficiently covered the exome with a minimum coverage of 20×, yet none of the kits captured all the Consensus Coding Sequence annotated exons. PMID:21955854

  10. Purification, characterization and partial amino acid sequence of glycogen synthase from Saccharomyces cerevisiae.

    PubMed Central

    Carabaza, A; Arino, J; Fox, J W; Villar-Palasi, C; Guinovart, J J

    1990-01-01

    Glycogen synthase from Saccharomyces cerevisiae was purified to homogeneity. The enzyme showed a subunit molecular mass of 80 kDa. The holoenzyme appears to be a tetramer. Antibodies developed against purified yeast glycogen synthase inactivated the enzyme in yeast extracts and allowed the detection of the protein in Western blots. Amino acid analysis showed that the enzyme is very rich in glutamate and/or glutamine residues. The N-terminal sequence (11 amino acid residues) was determined. In addition, selected tryptic-digest peptides were purified by reverse-phase h.p.l.c. and submitted to gas-phase sequencing. Up to eight sequences (79 amino acid residues) could be aligned with the human muscle enzyme sequence. Levels of identity range between 37 and 100%, indicating that, although human and yeast glycogen synthases probably share some conserved regions, significant differences in their primary structure should be expected. Images Fig. 1. Fig. 2. Fig. 3. PMID:2114092

  11. Microsequence analysis of electroblotted proteins. II. Comparison of sequence performance on different types of PVDF membranes.

    PubMed

    Reim, D F; Speicher, D W

    1992-11-15

    The influence of different types of polyvinylidene difluoride (PVDF) membranes on gas phase sequence performance has been evaluated. These PVDF membranes have been classified as either high retention (Trans-Blot and ProBlott) or low retention membranes (Immobilon-P) based on their ability to bind proteins during electroblotting from gels. Initial yields, repetitive yields, and extraction efficiency of the anilinothiazolinone amino acid derivatives have been compared for several standard proteins that have been either electroblotted or loaded onto PVDF membranes by direct adsorption. These results show that the major differences in initial sequence yields between membranes arise from differences in the amount of protein actually transferred to the membrane rather than sequencer-related factors. In contrast to several previous observations from other laboratories, more tightly bound proteins do not sequence with lower initial yields and initial yields are not affected by the ratio of surface area to protein. The stronger binding on high retention PVDF membranes does not adversely affect recoveries of difficult to extract, or very hydrophobic, amino acid derivatives. Several amino acids, especially tryptophan, are actually recovered in dramatically higher yield on high retention membranes compared with either Immobilon or glass filters. At the same time, the protein and peptide binding properties of high retention membranes will frequently improve the repetitive yield by minimizing sample extraction during the sequencer cycle. Stronger protein binding together with improved electroblotting yields offer substantially improved sequence performance when high retention PVDF membranes are used.

  12. Comparison and trend study on acidity and acidic buffering capacity of particulate matter in China

    NASA Astrophysics Data System (ADS)

    Ren, Lihong; Wang, Wei; Wang, Qingyue; Yang, XiaoYang; Tang, Dagang

    2011-12-01

    The acidity of about 2000 particulate matter samples from aircraft and ground-based monitoring is analyzed by the method similar to soil acidity determination. The ground-based samples were collected at about 50 urban or background sites in northern and southern China. Moreover, the acidic buffering capacity of those samples is also analyzed by the method of micro acid-base titration. Results indicate that the acidity level is lower in most northern areas than those in the south, and the acidic buffering capacity showed inverse tendency, correspondingly. This is the most important reason why the pollution of acidic-precipitation is much more serious in Southern China than that in Northern China. The acidity increases and the acidic buffering capacity drops with the decreasing of the particle sizes, indicating that fine particle is the main influencing factor of the acidification. The ionic results show that Ca salt is the main alkaline substance in particulate matter, whereas the acidification of particulate matter is due to the SO 2 and NO x emitted from the fossil fuel burning. And among of them, coal burning is the main contributor of SO 2, however the contribution of NO x that emitted from fuel burning of motor vehicles has increased in recent years. By comparison of the experimental results during the past 20 years, it can be concluded that the acid precipitation of particulate matter has not been well controlled, and it even shows an increasing tendency in China lately. The acid precipitation of particulate matter has begun to frequently attack in part of the northern areas. Multiple regression analysis indicates that coefficient value of the ions is the lowest at the urban sites and the highest at the regional sites, whereas the aircraft measurement results are intermediate between those two kinds of sites.

  13. Amino acid sequence of homologous rat atrial peptides: natriuretic activity of native and synthetic forms.

    PubMed Central

    Seidah, N G; Lazure, C; Chrétien, M; Thibault, G; Garcia, R; Cantin, M; Genest, J; Nutt, R F; Brady, S F; Lyle, T A

    1984-01-01

    A substance called atrial natriuretic factor (ANF), localized in secretory granules of atrial cardiocytes, was isolated as four homologous natriuretic peptides from homogenates of rat atria. The complete sequence of the longest form showed that it is composed of 33 amino acids. The three other shorter forms (2-33, 3-33, and 8-33) represent amino-terminally truncated versions of the 33 amino acid parent molecule as shown by analysis of sequence, amino acid composition, or both. The proposed primary structure agrees entirely with the amino acid composition and reveals no significant sequence homology with any known protein or segment of protein. The short form ANF-(8-33) was synthesized by a multi-fragment condensation approach and the synthetic product was shown to exhibit specific activity comparable to that of the natural ANF-(3-33). PMID:6232612

  14. Nucleotide and deduced amino acid sequences of a new subtilisin from an alkaliphilic Bacillus isolate.

    PubMed

    Saeki, Katsuhisa; Magallones, Marietta V; Takimura, Yasushi; Hatada, Yuji; Kobayashi, Tohru; Kawai, Shuji; Ito, Susumu

    2003-10-01

    The gene for a new subtilisin from the alkaliphilic Bacillus sp. KSM-LD1 was cloned and sequenced. The open reading frame of the gene encoded a 97 amino-acid prepro-peptide plus a 307 amino-acid mature enzyme that contained a possible catalytic triad of residues, Asp32, His66, and Ser224. The deduced amino acid sequence of the mature enzyme (LD1) showed approximately 65% identity to those of subtilisins SprC and SprD from alkaliphilic Bacillus sp. LG12. The amino acid sequence identities of LD1 to those of previously reported true subtilisins and high-alkaline proteases were below 60%. LD1 was characteristically stable during incubation with surfactants and chemical oxidants. Interestingly, an oxidizable Met residue is located next to the catalytic Ser224 of the enzyme as in the cases of the oxidation-susceptible subtilisins reported to date.

  15. Shark myelin basic protein: amino acid sequence, secondary structure, and self-association.

    PubMed

    Milne, T J; Atkins, A R; Warren, J A; Auton, W P; Smith, R

    1990-09-01

    Myelin basic protein (MBP) from the Whaler shark (Carcharhinus obscurus) has been purified from acid extracts of a chloroform/methanol pellet from whole brains. The amino acid sequence of the majority of the protein has been determined and compared with the sequences of other MBPs. The shark protein has only 44% homology with the bovine protein, but, in common with other MBPs, it has basic residues distributed throughout the sequence and no extensive segments that are predicted to have an ordered secondary structure in solution. Shark MBP lacks the triproline sequence previously postulated to form a hairpin bend in the molecule. The region containing the putative consensus sequence for encephalitogenicity in the guinea pig contains several substitutions, thus accounting for the lack of activity of the shark protein. Studies of the secondary structure and self-association have shown that shark MBP possesses solution properties similar to those of the bovine protein, despite the extensive differences in primary structure.

  16. An analysis of amino acid sequences surrounding archaeal glycoprotein sequons.

    PubMed

    Abu-Qarn, Mehtap; Eichler, Jerry

    2007-05-01

    Despite having provided the first example of a prokaryal glycoprotein, little is known of the rules governing the N-glycosylation process in Archaea. As in Eukarya and Bacteria, archaeal N-glycosylation takes place at the Asn residues of Asn-X-Ser/Thr sequons. Since not all sequons are utilized, it is clear that other factors, including the context in which a sequon exists, affect glycosylation efficiency. As yet, the contribution to N-glycosylation made by sequon-bordering residues and other related factors in Archaea remains unaddressed. In the following, the surroundings of Asn residues confirmed by experiment as modified were analyzed in an attempt to define sequence rules and requirements for archaeal N-glycosylation.

  17. Detection of Weakly Conserved Ancestral Mammalian RegulatorySequences by Primate Comparisons

    SciTech Connect

    Wang, Qian-fei; Prabhakar, Shyam; Chanan, Sumita; Cheng,Jan-Fang; Rubin, Edward M.; Boffelli, Dario

    2006-06-01

    Genomic comparisons between human and distant, non-primatemammals are commonly used to identify cis-regulatory elements based onconstrained sequence evolution. However, these methods fail to detectcryptic functional elements, which are too weakly conserved among mammalsto distinguish from nonfunctional DNA. To address this problem, weexplored the potential of deep intra-primate sequence comparisons. Wesequenced the orthologs of 558 kb of human genomic sequence, coveringmultiple loci involved in cholesterol homeostasis, in 6 nonhumanprimates. Our analysis identified 6 noncoding DNA elements displayingsignificant conservation among primates, but undetectable in more distantcomparisons. In vitro and in vivo tests revealed that at least three ofthese 6 elements have regulatory function. Notably, the mouse orthologsof these three functional human sequences had regulatory activity despitetheir lack of significant sequence conservation, indicating that they arecryptic ancestral cis-regulatory elements. These regulatory elementscould still be detected in a smaller set of three primate speciesincluding human, rhesus and marmoset. Since the human and rhesus genomesequences are already available, and the marmoset genome is activelybeing sequenced, the primate-specific conservation analysis describedhere can be applied in the near future on a whole-genome scale, tocomplement the annotation provided by more distant speciescomparisons.

  18. Comparison of methods for acid quantification: impact of resist components on acid-generating efficiency

    NASA Astrophysics Data System (ADS)

    Cameron, James F.; Fradkin, Leslie; Moore, Kathryn; Pohlers, Gerd

    2000-06-01

    Chemically amplified deep UV (CA-DUV) positive resists are the enabling materials for manufacture of devices at and below 0.18 micrometer design rules in the semiconductor industry. CA-DUV resists are typically based on a combination of an acid labile polymer and a photoacid generator (PAG). Upon UV exposure, a catalytic amount of a strong Bronsted acid is released and is subsequently used in a post-exposure bake step to deprotect the acid labile polymer. Deprotection transforms the acid labile polymer into a base soluble polymer and ultimately enables positive tone image development in dilute aqueous base. As CA-DUV resist systems continue to mature and are used in increasingly demanding situations, it is critical to develop a fundamental understanding of how robust these materials are. One of the most important factors to quantify is how much acid is photogenerated in these systems at key exposure doses. For the purpose of quantifying photoacid generation several methods have been devised. These include spectrophotometric methods, ion conductivity methods and most recently an acid-base type titration similar to the standard addition method. This paper compares many of these techniques. First, comparisons between the most commonly used acid sensitive dye, tetrabromophenol blue sodium salt (TBPB) and a less common acid sensitive dye, Rhodamine B base (RB) are made in several resist systems. Second, the novel acid-base type titration based on the standard addition method is compared to the spectrophotometric titration method. During these studies, the make up of the resist system is probed as follows: the photoacid generator and resist additives are varied to understand the impact of each of these resist components on the acid generation process.

  19. Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification

    PubMed Central

    Sinclair, Robert M.; Ravantti, Janne J.

    2017-01-01

    ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids

  20. Classification of mouse VK groups based on the partial amino acid sequence to the first invariant tryptophan: impact of 14 new sequences from IgG myeloma proteins.

    PubMed

    Potter, M; Newell, J B; Rudikoff, S; Haber, E

    1982-12-01

    Fourteen new VK sequences derived from BALB/c IgG myeloma proteins were determined to the first invariant tryptophan (Trp 35). These partial sequences were compared with 65 other published VK sequences using a computer program. The 79 sequences were organized according to the length of the sequence from the amino terminus to the first invariant tryptophan (Trp 35), into seven groups (33, 34, 35, 36, 39, 40 and 41aa). A distance matrix of all 79 sequences was then computed, i.e. the number of amino acid substitutions necessary to convert one sequence to another was determined. From these data a dendrogram was constructed. Most of the VK sequences fell into clusters or closely related groups. The definition of a sequence group is arbitrary but facilitates the classification of VK proteins. We used 12 substitutions as the basis for defining a sequence group based on the known number of substitutions that are found in the VK21 proteins. By this criterion there were 18 groups in the Trp 35 dendrogram. Twelve of the 14 new sequences fell into one of these sequence groups; two formed new sequence groups. Collective amino acid sequencing is still encountering new VK structures indicating more sequences will be required to attain an accurate estimate of the total number of VK groups. Updated dendrograms can be quickly generated to include newly generated sequences.

  1. PAirwise Sequence Comparison (PASC) and its application in the classification of filoviruses.

    PubMed

    Bao, Yiming; Chetvernin, Vyacheslav; Tatusova, Tatiana

    2012-08-01

    PAirwise Sequence Comparison (PASC) is a tool that uses genome sequence similarity to help with virus classification. The PASC tool at NCBI uses two methods: local alignment based on BLAST and global alignment based on Needleman-Wunsch algorithm. It works for complete genomes of viruses of several families/groups, and for the family of Filoviridae, it currently includes 52 complete genomes available in GenBank. It has been shown that BLAST-based alignment approach works better for filoviruses, and therefore is recommended for establishing taxon demarcations criteria. When more genome sequences with high divergence become available, these demarcation will most likely become more precise. The tool can compare new genome sequences of filoviruses with the ones already in the database, and propose their taxonomic classification.

  2. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1997-04-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.

  3. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1997-01-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.

  4. iPBA: a tool for protein structure comparison using sequence alignment strategies

    PubMed Central

    Gelly, Jean-Christophe; Joseph, Agnel Praveen; Srinivasan, Narayanaswamy; de Brevern, Alexandre G.

    2011-01-01

    With the immense growth in the number of available protein structures, fast and accurate structure comparison has been essential. We propose an efficient method for structure comparison, based on a structural alphabet. Protein Blocks (PBs) is a widely used structural alphabet with 16 pentapeptide conformations that can fairly approximate a complete protein chain. Thus a 3D structure can be translated into a 1D sequence of PBs. With a simple Needleman–Wunsch approach and a raw PB substitution matrix, PB-based structural alignments were better than many popular methods. iPBA web server presents an improved alignment approach using (i) specialized PB Substitution Matrices (SM) and (ii) anchor-based alignment methodology. With these developments, the quality of ∼88% of alignments was improved. iPBA alignments were also better than DALI, MUSTANG and GANGSTA+ in >80% of the cases. The webserver is designed to for both pairwise comparisons and database searches. Outputs are given as sequence alignment and superposed 3D structures displayed using PyMol and Jmol. A local alignment option for detecting subs-structural similarity is also embedded. As a fast and efficient ‘sequence-based’ structure comparison tool, we believe that it will be quite useful to the scientific community. iPBA can be accessed at http://www.dsimb.inserm.fr/dsimb_tools/ipba/. PMID:21586582

  5. Amino acid sequence around the active-site serine residue in the acyltransferase domain of goat mammary fatty acid synthetase.

    PubMed Central

    Mikkelsen, J; Højrup, P; Rasmussen, M M; Roepstorff, P; Knudsen, J

    1985-01-01

    Goat mammary fatty acid synthetase was labelled in the acyltransferase domain by formation of O-ester intermediates by incubation with [1-14C]acetyl-CoA and [2-14C]malonyl-CoA. Tryptic-digest and CNBr-cleavage peptides were isolated and purified by high-performance reverse-phase and ion-exchange liquid chromatography. The sequences of the malonyl- and acetyl-labelled peptides were shown to be identical. The results confirm the hypothesis that both acetyl and malonyl groups are transferred to the mammalian fatty acid synthetase complex by the same transferase. The sequence is compared with those of other fatty acid synthetase transferases. PMID:3922356

  6. Ligation with nucleic acid sequence-based amplification.

    PubMed

    Ong, Carmichael; Tai, Warren; Sarma, Aartik; Opal, Steven M; Artenstein, Andrew W; Tripathi, Anubhav

    2012-01-01

    This work presents a novel method for detecting nucleic acid targets using a ligation step along with an isothermal, exponential amplification step. We use an engineered ssDNA with two variable regions on the ends, allowing us to design the probe for optimal reaction kinetics and primer binding. This two-part probe is ligated by T4 DNA Ligase only when both parts bind adjacently to the target. The assay demonstrates that the expected 72-nt RNA product appears only when the synthetic target, T4 ligase, and both probe fragments are present during the ligation step. An extraneous 38-nt RNA product also appears due to linear amplification of unligated probe (P3), but its presence does not cause a false-positive result. In addition, 40 mmol/L KCl in the final amplification mix was found to be optimal. It was also found that increasing P5 in excess of P3 helped with ligation and reduced the extraneous 38-nt RNA product. The assay was also tested with a single nucleotide polymorphism target, changing one base at the ligation site. The assay was able to yield a negative signal despite only a single-base change. Finally, using P3 and P5 with longer binding sites results in increased overall sensitivity of the reaction, showing that increasing ligation efficiency can improve the assay overall. We believe that this method can be used effectively for a number of diagnostic assays.

  7. Analyses of mitochondrial amino acid sequence datasets support the proposal that specimens of Hypodontus macropi from three species of macropodid hosts represent distinct species

    PubMed Central

    2013-01-01

    Background Hypodontus macropi is a common intestinal nematode of a range of kangaroos and wallabies (macropodid marsupials). Based on previous multilocus enzyme electrophoresis (MEE) and nuclear ribosomal DNA sequence data sets, H. macropi has been proposed to be complex of species. To test this proposal using independent molecular data, we sequenced the whole mitochondrial (mt) genomes of individuals of H. macropi from three different species of hosts (Macropus robustus robustus, Thylogale billardierii and Macropus [Wallabia] bicolor) as well as that of Macropicola ocydromi (a related nematode), and undertook a comparative analysis of the amino acid sequence datasets derived from these genomes. Results The mt genomes sequenced by next-generation (454) technology from H. macropi from the three host species varied from 13,634 bp to 13,699 bp in size. Pairwise comparisons of the amino acid sequences predicted from these three mt genomes revealed differences of 5.8% to 18%. Phylogenetic analysis of the amino acid sequence data sets using Bayesian Inference (BI) showed that H. macropi from the three different host species formed distinct, well-supported clades. In addition, sliding window analysis of the mt genomes defined variable regions for future population genetic studies of H. macropi in different macropodid hosts and geographical regions around Australia. Conclusions The present analyses of inferred mt protein sequence datasets clearly supported the hypothesis that H. macropi from M. robustus robustus, M. bicolor and T. billardierii represent distinct species. PMID:24261823

  8. Thin-film technology for direct visual detection of nucleic acid sequences: applications in clinical research.

    PubMed

    Jenison, Robert D; Bucala, Richard; Maul, Diana; Ward, David C

    2006-01-01

    Certain optical conditions permit the unaided eye to detect thickness changes on surfaces on the order of 20 A, which are of similar dimensions to monomolecular interactions between proteins or hybridization of complementary nucleic acid sequences. Such detection exploits specific interference of reflected white light, wherein thickness changes are perceived as surface color changes. This technology, termed thin-film detection, allows for the visualization of subattomole amounts of nucleic acid targets, even in complex clinical samples. Thin-film technology has been applied to a broad range of clinically relevant indications, including the detection of pathogenic bacterial and viral nucleic acid sequences and the discrimination of sequence variations in human genes causally related to susceptibility or severity of disease.

  9. mtDNAprofiler: a Web application for the nomenclature and comparison of human mitochondrial DNA sequences.

    PubMed

    Yang, In Seok; Lee, Hwan Young; Yang, Woo Ick; Shin, Kyoung-Jin

    2013-07-01

    Mitochondrial DNA (mtDNA) is a valuable tool in the fields of forensic, population, and medical genetics. However, recording and comparing mtDNA control region or entire genome sequences would be difficult if researchers are not familiar with mtDNA nomenclature conventions. Therefore, mtDNAprofiler, a Web application, was designed for the analysis and comparison of mtDNA sequences in a string format or as a list of mtDNA single-nucleotide polymorphisms (mtSNPs). mtDNAprofiler which comprises four mtDNA sequence-analysis tools (mtDNA nomenclature, mtDNA assembly, mtSNP conversion, and mtSNP concordance-check) supports not only the accurate analysis of mtDNA sequences via an automated nomenclature function, but also consistent management of mtSNP data via direct comparison and validity-check functions. Since mtDNAprofiler consists of four tools that are associated with key steps of mtDNA sequence analysis, mtDNAprofiler will be helpful for researchers working with mtDNA. mtDNAprofiler is freely available at http://mtprofiler.yonsei.ac.kr.

  10. Enzyme sequence similarity improves the reaction alignment method for cross-species pathway comparison

    SciTech Connect

    Ovacik, Meric A.; Androulakis, Ioannis P.

    2013-09-15

    Pathway-based information has become an important source of information for both establishing evolutionary relationships and understanding the mode of action of a chemical or pharmaceutical among species. Cross-species comparison of pathways can address two broad questions: comparison in order to inform evolutionary relationships and to extrapolate species differences used in a number of different applications including drug and toxicity testing. Cross-species comparison of metabolic pathways is complex as there are multiple features of a pathway that can be modeled and compared. Among the various methods that have been proposed, reaction alignment has emerged as the most successful at predicting phylogenetic relationships based on NCBI taxonomy. We propose an improvement of the reaction alignment method by accounting for sequence similarity in addition to reaction alignment method. Using nine species, including human and some model organisms and test species, we evaluate the standard and improved comparison methods by analyzing glycolysis and citrate cycle pathways conservation. In addition, we demonstrate how organism comparison can be conducted by accounting for the cumulative information retrieved from nine pathways in central metabolism as well as a more complete study involving 36 pathways common in all nine species. Our results indicate that reaction alignment with enzyme sequence similarity results in a more accurate representation of pathway specific cross-species similarities and differences based on NCBI taxonomy.

  11. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences

    NASA Technical Reports Server (NTRS)

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.

  12. RNA internal standard synthesis by nucleic acid sequence-based amplification for competitive quantitative amplification reactions.

    PubMed

    Lo, Wan-Yu; Baeumner, Antje J

    2007-02-15

    Nucleic acid sequence-based amplification (NASBA) reactions have been demonstrated to successfully synthesize new sequences based on deletion and insertion reactions. Two RNA internal standards were synthesized for use in competitive amplification reactions in which quantitative analysis can be achieved by coamplifying the internal standard with the wild type sample. The sequences were created in two consecutive NASBA reactions using the E. coli clpB mRNA sequence as model analyte. The primer sequences of the wild type sequence were maintained, and a 20-nt-long segment inside the amplicon region was exchanged for a new segment of similar GC content and melting temperature. The new RNA sequence was thus amplifiable using the wild type primers and detectable via a new inserted sequence. In the first reaction, the forwarding primer and an additional 20-nt-long sequence was deleted and replaced by a new 20-nt-long sequence. In the second reaction, a forwarding primer containing as 5' overhang sequence the wild type primer sequence was used. The presence of pure internal standard was verified using electrochemiluminescence and RNA lateral-flow biosensor analysis. Additional sequence deletion in order to shorten the internal standard amplicons and thus generate higher detection signals was found not to be required. Finally, a competitive NASBA reaction between one internal standard and the wild type sequence was carried out proving its functionality. This new rapid construction method via NASBA provides advantages over the traditional techniques since it requires no traditional cloning procedures, no thermocyclers, and can be completed in less than 4 h.

  13. Definition and Analysis of a System for the Automated Comparison of Curriculum Sequencing Algorithms in Adaptive Distance Learning

    ERIC Educational Resources Information Center

    Limongelli, Carla; Sciarrone, Filippo; Temperini, Marco; Vaste, Giulia

    2011-01-01

    LS-Lab provides automatic support to comparison/evaluation of the Learning Object Sequences produced by different Curriculum Sequencing Algorithms. Through this framework a teacher can verify the correspondence between the behaviour of different sequencing algorithms and her pedagogical preferences. In fact the teacher can compare algorithms…

  14. The new sequencer on the block: comparison of Life Technology's Proton sequencer to an Illumina HiSeq for whole-exome sequencing.

    PubMed

    Boland, Joseph F; Chung, Charles C; Roberson, David; Mitchell, Jason; Zhang, Xijun; Im, Kate M; He, Ji; Chanock, Stephen J; Yeager, Meredith; Dean, Michael

    2013-10-01

    We assessed the performance of the new Life Technologies Proton sequencer by comparing whole-exome sequence data in a Centre d'Etude du Polymorphisme Humain trio (family 1463) to the Illumina HiSeq instrument. To simulate a typical user's results, we utilized the standard capture, alignment and variant calling methods specific to each platform. We restricted data analysis to include the capture region common to both methods. The Proton produced high quality data at a comparable average depth and read length, and the Ion Reporter variant caller identified 96 % of single nucleotide polymorphisms (SNPs) detected by the HiSeq and GATK pipeline. However, only 40 % of small insertion and deletion variants (indels) were identified by both methods. Usage of the trio structure and segregation of platform-specific alleles supported this result. Further comparison of the trio data with Complete Genomics sequence data and Illumina SNP microarray genotypes documented high concordance and accurate SNP genotyping of both Proton and Illumina platforms. However, our study underscored the problem of accurate detection of indels for both the Proton and HiSeq platforms.

  15. Comparison of the sequence of the gene encoding African swine fever virus attachment protein p12 from field virus isolates and viruses passaged in tissue culture.

    PubMed Central

    Angulo, A; Viñuela, E; Alcamí, A

    1992-01-01

    Comparison of the amino acid sequence of the African swine fever virus attachment protein p12 from different field virus isolates, deduced from the nucleotide sequence of the gene, revealed a high degree of conservation. No mutations were found after adaptation to Vero cells, and a polypeptide with similar characteristics was present in an IBRS2-adapted virus. The sequence of the 5' flanking region was conserved among the isolates, whereas sequences downstream of the gene were highly variable in length and contained direct repeats in tandem that may account for the deletions found in different isolates. Protein p12 was synthesized in swine macrophages infected with all of the viruses tested. PMID:1583733

  16. Amino acid sequences of two nonspecific lipid-transfer proteins from germinated castor bean.

    PubMed

    Takishima, K; Watanabe, S; Yamada, M; Suga, T; Mamiya, G

    1988-11-01

    The amino acid sequence of two nonspecific lipid-transfer proteins (nsLTP) B and C from germinated castor bean seeds have been determined. Both the proteins consist of 92 residues, as for nsLTP previously reported, and their calculated Mr values are 9847 and 9593 for nsLTP-B and nsLTP-C, respectively. The sequences of nsLTP-B and nsLTP-C, compared to the known sequence of nsLTP-A from the same source, are 68% and 35% similar, respectively. No variation was found at the positions of the cysteine residues, indicating that they might be involved in disulfide bridges.

  17. A classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B

    1991-01-01

    The amino acid sequences of 301 glycosyl hydrolases and related enzymes have been compared. A total of 291 sequences corresponding to 39 EC entries could be classified into 35 families. Only ten sequences (less than 5% of the sample) could not be assigned to any family. With the sequences available for this analysis, 18 families were found to be monospecific (containing only one EC number) and 17 were found to be polyspecific (containing at least two EC numbers). Implications on the folding characteristics and mechanism of action of these enzymes and on the evolution of carbohydrate metabolism are discussed. With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised. PMID:1747104

  18. Complete amino acid sequence of the N-terminal extension of calf skin type III procollagen.

    PubMed Central

    Brandt, A; Glanville, R W; Hörlein, D; Bruckner, P; Timpl, R; Fietzek, P P; Kühn, K

    1984-01-01

    The N-terminal extension peptide of type III procollagen, isolated from foetal-calf skin, contains 130 amino acid residues. To determine its amino acid sequence, the peptide was reduced and carboxymethylated or aminoethylated and fragmented with trypsin, Staphylococcus aureus V8 proteinase and bacterial collagenase. Pyroglutamate aminopeptidase was used to deblock the N-terminal collagenase fragment to enable amino acid sequencing. The type III collagen extension peptide is homologous to that of the alpha 1 chain of type I procollagen with respect to a three-domain structure. The N-terminal 79 amino acids, which contain ten of the 12 cysteine residues, form a compact globular domain. The next 39 amino acids are in a collagenase triplet sequence (Gly- Xaa - Yaa )n with a high hydroxyproline content. Finally, another short non-collagenous domain of 12 amino acids ends at the cleavage site for procollagen aminopeptidase, which cleaves a proline-glutamine bond. In contrast with type I procollagen, the type III procollagen extension peptides contain interchain disulphide bridges located at the C-terminus of the triple-helical domain. PMID:6331392

  19. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  20. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  1. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  2. A putative carbohydrate-binding domain of the lactose-binding Cytisus sessilifolius anti-H(O) lectin has a similar amino acid sequence to that of the L-fucose-binding Ulex europaeus anti-H(O) lectin.

    PubMed

    Konami, Y; Yamamoto, K; Osawa, T; Irimura, T

    1995-04-01

    The complete amino acid sequence of a lactose-binding Cytisus sessilifolius anti-H(O) lectin II (CSA-II) was determined using a protein sequencer. After digestion of CSA-II with endoproteinase Lys-C or Asp-N, the resulting peptides were purified by reversed-phase high performance liquid chromatography (HPLC) and then subjected to sequence analysis. Comparison of the complete amino acid sequence of CSA-II with the sequences of other leguminous seed lectins revealed regions of extensive homology. The amino acid sequence of a putative carbohydrate-binding domain of CSA-II was found to be similar to those of several anti-H(O) leguminous lectins, especially to that of the L-fucose-binding Ulex europaeus lectin I (UEA-I).

  3. A Comparison of the First Two Sequenced Chloroplast Genomes in Asteraceae: Lettuce and Sunflower

    SciTech Connect

    Timme, Ruth E.; Kuehl, Jennifer V.; Boore, Jeffrey L.; Jansen, Robert K.

    2006-01-20

    Asteraceae is the second largest family of plants, with over 20,000 species. For the past few decades, numerous phylogenetic studies have contributed to our understanding of the evolutionary relationships within this family, including comparisons of the fast evolving chloroplast gene, ndhF, rbcL, as well as non-coding DNA from the trnL intron plus the trnLtrnF intergenic spacer, matK, and, with lesser resolution, psbA-trnH. This culminated in a study by Panero and Funk in 2002 that used over 13,000 bp per taxon for the largest taxonomic revision of Asteraceae in over a hundred years. Still, some uncertainties remain, and it would be very useful to have more information on the relative rates of sequence evolution among various genes and on genome structure as a potential set of phylogenetic characters to help guide future phylogenetic structures. By way of contributing to this, we report the first two complete chloroplast genome sequences from members of the Asteraceae, those of Helianthus annuus and Lactuca sativa. These plants belong to two distantly related subfamilies, Asteroideae and Cichorioideae, respectively. In addition to these, there is only one other published chloroplast genome sequence for any plant within the larger group called Eusterids II, that of Panax ginseng (Araliaceae, 156,318 bps, AY582139). Early chloroplast genome mapping studies demonstrated that H. annuus and L. sativa share a 22 kb inversion relative to members of the subfamily Barnadesioideae. By comparison to outgroups, this inversion was shown to be derived, indicating that the Asteroideae and Cichorioideae are more closely related than either is to the Barnadesioideae. Later sequencing study found that taxa that share this 22 kb inversion also contain within this region a second, smaller, 3.3 kb inversion. These sequences also enable an analysis of patterns of shared repeats in the genomes at fine level and of RNA editing by comparison to available EST sequences. In addition, since

  4. Rat androgen-binding protein: evidence for identical subunits and amino acid sequence homology with human sex hormone-binding globulin.

    PubMed

    Joseph, D R; Hall, S H; French, F S

    1987-01-01

    The cDNA for rat androgen-binding protein (ABP) was previously isolated from a bacteriophage lambda gt11 rat testis cDNA library and its identity was confirmed by epitope selection. Hybrid-arrested translation studies have now demonstrated the identity of the isolates. The nucleotide sequence of a near full-length cDNA encodes a 403-amino acid precursor (Mr = 44,539), which agrees in size with the cell-free translation product (Mr = 45,000) of ABP mRNA. Putative sites of N-glycosylation and signal peptide cleavage were identified. Comparison of the predicted amino acid sequence of rat ABP with the amino-terminal amino acid sequence of human sex hormone-binding globulin revealed that 17 of 25 residues are identical. On the basis of the predicted amino acid sequence the molecular weight of the primary translation product, lacking the signal peptide, was 41,183. Hybridization analyses indicated that the two subunits of ABP are coded for by a single gene and a single mRNA species. Our results suggest that ABP consists of two subunits with identical primary sequences and that differences in post-translational processing result in the production of 47,000 and 41,000 molecular weight monomers.

  5. Protein sequence comparison and fold recognition: progress and good-practice benchmarking.

    PubMed

    Söding, Johannes; Remmert, Michael

    2011-06-01

    Protein sequence comparison methods have grown increasingly sensitive during the last decade and can often identify distantly related proteins sharing a common ancestor some 3 billion years ago. Although cellular function is not conserved so long, molecular functions and structures of protein domains often are. In combination with a domain-centered approach to function and structure prediction, modern remote homology detection methods have a great and largely underexploited potential for elucidating protein functions and evolution. Advances during the last few years include nonlinear scoring functions combining various sequence features, the use of sequence context information, and powerful new software packages. Since progress depends on realistically assessing new and existing methods and published benchmarks are often hard to compare, we propose 10 rules of good-practice benchmarking.

  6. The amino-acid sequence of the 2S sulphur-rich proteins from seeds of Brazil nut (Bertholletia excelsa H.B.K.).

    PubMed

    Ampe, C; Van Damme, J; de Castro, L A; Sampaio, M J; Van Montagu, M; Vandekerckhove, J

    1986-09-15

    Storage proteins of the albumin solubility fraction from seeds of Bertholletia excelsa H.B.K. were separated by reversed-phase high-performance liquid chromatography and their primary structures were determined by gas-phase sequencing on intact polypeptides and on the overlapping tryptic and thermolysin peptides. The 2S storage proteins consist of two subunits linked by disulphide bridges. The large subunit (8.5 kDa) is expressed in at least six different isoforms while the small subunit (3.6 kDa) consists of only one form. These proteins are extremely rich in glutamine, glutamic acid, arginine and the sulphur-containing amino acids cysteine and methionine. One of the variants even contains a sequence of six methionine residues in a row. Comparison with known sequences of 2S proteins of other dicotyledonous plants shows limited but distinct sequence homology. In particular, the positions of the cysteine residues relative to each other appear to be completely conserved, suggesting that tertiary structure constraints imposed by disulphide bridges dominate sequence conservation. It has been proposed that the two subunits of a related protein (the Brassica napus storage protein) is cleaved from a precursor polypeptide [Crouch, M. L., Tenbarge, K. M., Simon, A. E. & Ferl, R. (1983) J. Mol. Appl. Genet. 2,273-283]. The amino acid sequence homology of the Brazil nut protein with the former suggests that a similar protein processing event could occur.

  7. The amino acid sequence of cytochromes c-551 from three species of Pseudomonas

    PubMed Central

    Ambler, R. P.; Wynn, Margaret

    1973-01-01

    The amino acid sequences of the cytochromes c-551 from three species of Pseudomonas have been determined. Each resembles the protein from Pseudomonas strain P6009 (now known to be Pseudomonas aeruginosa, not Pseudomonas fluorescens) in containing 82 amino acids in a single peptide chain, with a haem group covalently attached to cysteine residues 12 and 15. In all four sequences 43 residues are identical. Although by bacteriological criteria the organisms are closely related, the differences between pairs of sequences range from 22% to 39%. These values should be compared with the differences in the sequence of mitochondrial cytochrome c between mammals and amphibians (about 18%) or between mammals and insects (about 33%). Detailed evidence for the amino acid sequences of the proteins has been deposited as Supplementary Publication SUP 50015 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1973), 131, 5. PMID:4352718

  8. Draft Genome Sequence of Sorghum Grain Mold Fungus Epicoccum sorghinum, a Producer of Tenuazonic Acid

    PubMed Central

    Oliveira, Rodrigo C.; Davenport, Karen W.; Hovde, Blake; Silva, Danielle; Chain, Patrick S. G.; Correa, Benedito

    2017-01-01

    ABSTRACT The facultative plant pathogen Epicoccum sorghinum is associated with grain mold of sorghum and produces the mycotoxin tenuazonic acid. This fungus can have serious economic impact on sorghum production. Here, we report the draft genome sequence of E. sorghinum (USPMTOX48). PMID:28126937

  9. Snake venom. The amino acid sequence of protein A from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1980-12-01

    Protein A from Dendroaspis polylepis polylepis venom comprises 81 amino acids, including ten half-cystine residues. The complete primary structures of protein A and its variant A' were elucidated. The sequences of proteins A and A', which differ in a single position, show no homology with various neurotoxins and non-neurotoxic proteins and represent a new type of elapid venom protein.

  10. Draft Genome Sequence of Bacillus coagulans NL01, a Wonderful l-Lactic Acid Producer

    PubMed Central

    Zheng, Zhaojuan; Jiang, Ting; Lin, Xi; Zhou, Jie

    2015-01-01

    Here, we report the draft genome sequence of Bacillus coagulans NL01, which could produce high optically pure l-lactic acid using xylose as a sole carbon source. The draft genome is 3,505,081 bp, with 144 contigs. About 3,903 protein-coding genes and 92 rRNAs are predicted from this assembly. PMID:26089419

  11. Amino acid sequences of heterotrophic and photosynthetic ferredoxins from the tomato plant (Lycopersicon esculentum Mill.).

    PubMed

    Kamide, K; Sakai, H; Aoki, K; Sanada, Y; Wada, K; Green, L S; Yee, B C; Buchanan, B B

    1995-11-01

    Several forms (isoproteins) of ferredoxin in roots, leaves, and green and red pericarps in tomato plants (Lycopersicon esculentum Mill.) were earlier identified on the basis of N-terminal amino acid sequence and chromatographic behavior (Green et al. 1991). In the present study, a large scale preparation made possible determination of the full length amino acid sequence of the two ferredoxins from leaves. The ferredoxins characteristic of fruit and root were sequenced from the amino terminus to the 30th residue or beyond. The leaf ferredoxins were confirmed to be expressed in pericarp of both green and red fruit. The ferredoxins characteristic of fruit and root appeared to be restricted to those tissue. The results extend earlier findings in demonstrating that ferredoxin occurs in the major organs of the tomato plant where it appears to function irrespective of photosynthetic competence.

  12. Amino acid sequence of myoglobin from white-tailed deer (Odocoileus virginianus).

    PubMed

    Joseph, Poulson; Suman, Surendranath P; Li, Shuting; Fontaine, Michele; Steinke, Laurey

    2012-10-01

    Our objective was to determine the primary structure of white-tailed deer myoglobin (Mb). White-tailed deer Mb was isolated from cardiac muscles employing ammonium sulfate precipitation and gel-filtration chromatography. The amino acid sequence was determined by Edman degradation. Sequence analyses of intact Mb as well as tryptic- and cyanogen bromide-peptides yielded the complete primary structure of white-tailed deer Mb, which shared 100% similarity with red deer Mb. White-tailed deer Mb consists of 153 amino acid residues and shares more than 96% sequence similarity with myoglobins from meat-producing ruminants, such as cattle, buffalo, sheep, and goat. Similar to sheep and goat myoglobins, white-tailed deer Mb contains 12 histidine residues. Proximal (position 93) and distal (position 64) histidine residues responsible for maintaining the stability of heme are conserved in white-tailed deer Mb.

  13. Nucleotide sequence and the encoded amino acids of human apolipoprotein A-I mRNA.

    PubMed Central

    Law, S W; Brewer, H B

    1984-01-01

    The cDNA clones encoding the precursor form of human liver apolipoprotein A-I (apoA-I), preproapoA-I, have been isolated from a cDNA library. A 17-base synthetic oligonucleotide based on residues 108-113 of apoA-I and a 26-base primer-extended, dideoxynucleotide-terminated cDNA were used as hybridization probes to select for recombinant plasmids bearing the apoA-I sequence. The complete nucleic acid sequence of human liver preproapoA-I has been determined by analysis of the cloned cDNA. The sequence is composed of 801 nucleotides encoding 267 amino acid residues. PreproapoA-I contains an 18-amino-acid prepeptide and a 6-amino-acid propeptide connected to the amino terminus of the 243-amino acid mature apoA-I. Southern blotting analysis of chromosomal DNA obtained from peripheral blood indicated the apoA-I gene is contained in a 2.1-kilobase-pair Pst I fragment and there is no gross difference in structural organization between the normal apoA-I gene and the Tangier disease apoA-I gene. Images PMID:6198645

  14. Pleistocene glaciation of volcano Ajusco, central Mexico, and comparison with the standard Mexican glacial sequence

    NASA Astrophysics Data System (ADS)

    White, Sidney E.; Valastro, Salvatore

    1984-01-01

    Three Pleistocene glaciations and two Holocene Neoglacial advances occurred on volcano Ajusco in central Mexico. Lateral moraines of the oldest glaciation, the Marqués, above 3250 m are made of light-gray indurated till and are extensively modified by erosion. Below 3200 m the till is dark red, decomposed, and buried beneath volcanic colluvium and tephra. Very strongly to strongly developed soil profiles (Inceptisols) have formed in the Marqués till and in overlying colluvia and tephra. Large sharp-crested moraines of the second glaciation, the Santo Tomás, above 3300 m are composed of pale-brown firm till and are somewhat eroded by gullies. Below 3250 m the till is light reddish brown, cemented, and weathered. Less-strongly developed soil profiles (Inceptisols) have formed in the Santo Tomás till and in overlying colluvia and tephra. Narrow-crested moraines of yellowish-brown loose till of the third glaciation, the Albergue, are uneroded. Weakly developed soil profiles (Inceptisols) in the Albergue till have black ash in the upper horizon. Two small Neoglacial moraines of yellowish-brown bouldery till on the cirque floor of the largest valley support weakly developed soil profiles with only A and Cox horizons and no ash in the upper soil horizons. Radiocarbon dating of organic matter of the B horizons developed in tills, volcanic ash, and colluvial volcanic sand includes ages for both the soil-organic residue and the humic-acid fraction, with differences from 140 to 660 yr. The dating provides minimum ages of about 27,000 yr for the Marqués glaciation and about 25,000 yr for the Santo Tomás glaciation. Dates for the overlying tephra indicate a complex volcanic history for at least another 15,000 yr. Comparison of the Ajusco glacial sequence with that on Iztaccíhuatl to the east suggests that the Marqués and Santo Tomás glaciations may be equivalent to the Diamantes glaciation First and Second advances, the Albergue to the Alcalican glaciations, and the

  15. The amino-acid sequence of alpha A- and beta-chains from the major hemoglobin component of American flamingo (Phoenicopterus ruber ruber).

    PubMed

    Godovac-Zimmermann, J; Braunitzer, G

    1984-04-01

    The complete amino-acid sequence of alpha A- and beta-chains from the major hemoglobin component (HbA) of American Flamingo ( Phoenicopterus ruber ruber) is presented. The minor component (HbD) with alpha D-chains was detected in similar amounts (25%) as in chicken and pheasant hemoglobins. The comparison of American Flamingo and Greylag Goose (Anser anser) hemoglobins shows that alpha A-chains differ by 22 exchanges and beta-chains by only 4 exchanges. Two substitutions modify alpha 1 beta 1-contacts. Amino-acid replacements between American Flamingo and other bird hemoglobins are discussed.

  16. Software scripts for quality checking of high-throughput nucleic acid sequencers.

    PubMed

    Lazo, G R; Tong, J; Miller, R; Hsia, C; Rausch, C; Kang, Y; Anderson, O D

    2001-06-01

    We have developed a graphical interface to allow the researcher to view and assess the quality of sequencing results using a series of program scripts developed to process data generated by automated sequencers. The scripts are written in Perl programming language and are executable under the cgibin directory of a Web server environment. The scripts direct nucleic acid sequencing trace file data output from automated sequencers to be analyzed by the phred molecular biology program and are displayed as graphical hypertext mark-up language (HTML) pages. The scripts are mainly designed to handle 96-well microtiter dish samples, but the scripts are also able to read data from 384-well microtiter dishes 96 samples at a time. The scripts may be customized for different laboratory environments and computer configurations. Web links to the sources and discussion page are provided.

  17. Sequence comparison of pepsin-resistant segments of basement-membrane collagen alpha 1(IV) chains from bovine lens capsule and mouse tumour.

    PubMed Central

    Schuppan, D; Glanville, R W; Timpl, R; Dixit, S N; Kang, A H

    1984-01-01

    The C-terminal peptic fragment P1 (about 518 amino acid residues) of bovine lens-capsule collagen alpha 1(IV) chain was cleaved with CNBr and trypsin. The peptides were purified and characterized, allowing their ordering within the P1 fragment by comparison with a corresponding section of mouse collagen alpha 1(IV) chain [Schuppan, Glanville & Timpl (1982) Eur. J. Biochem. 123, 505-512]. About 67% of the sequence of bovine collagen fragment P1 was determined by Edman degradation. Comparison with the sequence of the corresponding mouse collagen fragment P1 showed 76% identity for positions Xaa and Yaa of the triplet structures Gly-Xaa-Yaa. Invariance was found for the positions of two non-triplet interruptions and of 3-hydroxyproline residues, pointing to the functional importance of these structures. PMID:6430279

  18. Structure-based evaluation of sequence comparison and fold recognition alignment accuracy.

    PubMed

    Domingues, F S; Lackner, P; Andreeva, A; Sippl, M J

    2000-04-07

    The biological role, biochemical function, and structure of uncharacterized protein sequences is often inferred from their similarity to known proteins. A constant goal is to increase the reliability, sensitivity, and accuracy of alignment techniques to enable the detection of increasingly distant relationships. Development, tuning, and testing of these methods benefit from appropriate benchmarks for the assessment of alignment accuracy.Here, we describe a benchmark protocol to estimate sequence-to-sequence and sequence-to-structure alignment accuracy. The protocol consists of structurally related pairs of proteins and procedures to evaluate alignment accuracy over the whole set. The set of protein pairs covers all the currently known fold types. The benchmark is challenging in the sense that it consists of proteins lacking clear sequence similarity. Correct target alignments are derived from the three-dimensional structures of these pairs by rigid body superposition. An evaluation engine computes the accuracy of alignments obtained from a particular algorithm in terms of alignment shifts with respect to the structure derived alignments. Using this benchmark we estimate that the best results can be obtained from a combination of amino acid residue substitution matrices and knowledge-based potentials.

  19. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Patel, Kamlesh D [Ken; SNL,

    2016-07-12

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology " at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  20. Amino acid racemization dating of fossil bones, I. inter-laboratory comparison of racemization measurements

    USGS Publications Warehouse

    Bada, J.L.; Hoopes, E.; Darling, D.; Dungworth, G.; Kessels, H.J.; Kvenvolden, K.A.; Blunt, D.J.

    1979-01-01

    Enantiomeric measurements for aspartic acid, glutamic acid, and alanine in twenty-one different fossil bone samples have been carried out by three different laboratories using different analytical methods. These inter-laboratory comparisons demonstrate that D/L aspartic acid measurements are highly reproducible, whereas the enantiomeric measurements for the other amino acids show a wide variation between the three laboratories. At present, aspartic acid measurements are the most suitable for racemization dating of bone because of their superior analytical precision. ?? 1979.

  1. Role of the two-component leader sequence and mature amino acid sequences in extracellular export of endoglucanase EGL from Pseudomonas solanacearum.

    PubMed Central

    Huang, J Z; Schell, M A

    1992-01-01

    The egl gene of Pseudomonas solanacearum encodes a 43-kDa extracellular endoglucanase (mEGL) involved in wilt disease caused by this phytopathogen. Egl is initially translated with a 45-residue, two-part leader sequence. The first 19 residues are apparently removed by signal peptidase II during export of Egl across the inner membrane (IM); the remaining residues of the leader sequence (modified with palmitate) are removed during export across the outer membrane (OM). Localization of Egl-PhoA fusion proteins showed that the first 26 residues of the Egl leader sequence are required and sufficient to direct lipid modification, processing, and export of Egl or PhoA across the IM but not the OM. Fusions of the complete 45-residue leader sequence or of the leader and increasing portions of mEgl sequences to PhoA did not cause its export across the OM. In-frame deletion of portions of mEGL-coding sequences blocked export of the truncated polypeptides across the OM without affecting export across the IM. These results indicate that the first part of the leader sequence functions independently to direct export of Egl across the IM while the second part and sequences and structures in mEGL are involved in export across the OM. Computer analysis of the mEgl amino acid sequence obtained from its nucleotide sequence identified a region of mEGL similar in amino acid sequence to regions in other prokaryotic endoglucanases. Images PMID:1735723

  2. The complete amino acid sequence of a trypsin inhibitor from Bauhinia variegata var. candida seeds.

    PubMed

    Di Ciero, L; Oliva, M L; Torquato, R; Köhler, P; Weder, J K; Camillo Novello, J; Sampaio, C A; Oliveira, B; Marangoni, S

    1998-11-01

    Trypsin inhibitors of two varieties of Bauhinia variegata seeds have been isolated and characterized. Bauhinia variegata candida trypsin inhibitor (BvcTI) and B. variegata lilac trypsin inhibitor (BvlTI) are proteins with Mr of about 20,000 without free sulfhydryl groups. Amino acid analysis shows a high content of aspartic acid, glutamic acid, serine, and glycine, and a low content of histidine, tyrosine, methionine, and lysine in both inhibitors. Isoelectric focusing for both varieties detected three isoforms (pI 4.85, 5.00, and 5.15), which were resolved by HPLC procedure. The trypsin inhibitors show Ki values of 6.9 and 1.2 nM for BvcTI and BvlTI, respectively. The N-terminal sequences of the three trypsin inhibitor isoforms from both varieties of Bauhinia variegata and the complete amino acid sequence of B. variegata var. candida L. trypsin inhibitor isoform 3 (BvcTI-3) are presented. The sequences have been determined by automated Edman degradation of the reduced and carboxymethylated proteins of the peptides resulting from Staphylococcus aureus protease and trypsin digestion. BvcTI-3 is composed of 167 residues and has a calculated molecular mass of 18,529. Homology studies with other trypsin inhibitors show that BvcTI-3 belongs to the Kunitz family. The putative active site encompasses Arg (63)-Ile (64).

  3. Multiple site-selective insertions of non-canonical amino acids into sequence-repetitive polypeptides

    PubMed Central

    Wu, I-Lin; Patterson, Melissa A.; Carpenter Desai, Holly E.; Mehl, Ryan A.; Giorgi, Gianluca

    2013-01-01

    A simple and efficient method is described for introduction of non-canonical amino acids at multiple, structurally defined sites within recombinant polypeptide sequences. E. coli MRA30, a bacterial host strain with attenuated activity for release factor 1 (RF1), is assessed for its ability to support the incorporation of a diverse range of non-canonical amino acids in response to multiple encoded amber (TAG) codons within genetic templates derived from superfolder GFP and an elastin-mimetic protein polymer. Suppression efficiency and isolated protein yield were observed to depend on the identity of the orthogonal aminoacyl-tRNA synthetase/tRNACUA pair and the non-canonical amino acid substrate. This approach afforded elastin-mimetic protein polymers containing non-canonical amino acid derivatives at up to twenty-two positions within the repeat sequence with high levels of substitution. The identity and position of the variant residues was confirmed by mass spectrometric analysis of the full-length polypeptides and proteolytic cleavage fragments resulting from thermolysin digestion. The accumulated data suggest that this multi-site suppression approach permits the preparation of protein-based materials in which novel chemical functionality can be introduced at precisely defined positions within the polypeptide sequence. PMID:23625817

  4. Deduced amino acid sequence of human pulmonary surfactant proteolipid: SPL(pVal)

    SciTech Connect

    Whitsett, J.A.; Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.; Clark, J.; Pilot-Matias, T.; Meuth, J.; Fox, J.L.

    1987-05-01

    Hydrophobic, proteolipid-like protein of Mr 6500 was isolated from ether/ethanol extracts of human, canine and bovine pulmonary surfactant. Amino acid composition of the protein demonstrated a remarkable abundance of hydrophobic residues, particularly valine and leucine. The N-terminal amino acid sequence of the human protein was determined: N-Leu-Ile-Pro-Cys-Cys-Pro-Val-Asn-Leu-Lys-Arg-Leu-Leu-Ile-Val4... An oligonucleotide probe was used to screen an adult human lung cDNA library and resulted in detection of cDNA clones with predicted amino acid sequence with close identity to the N-terminal amino acid sequence of the human peptide. SPL(pVal) was found within the reading frame of a larger peptide. SPL(pVal) results from proteolytic processing of a larger preprotein. Northern blot analysis detected in a single 1.0 kilobase SPL(pVal) RNA which was less abundant in fetal than in adult lung. Mixtures of purified canine and bovine SPL(pVal) and synthetic phospholipids display properties of rapid adsorption and surface tension lowering activity characteristic of surfactant. Human SPL(pVal) is a pulmonary surfactant proteolipid which may therefore be useful in combination with phospholipids and/or other surfactant proteins for the treatment of surfactant deficiency such as hyaline membrane disease in newborn infants.

  5. Complete nucleic acid sequence of Penaeus stylirostris densovirus (PstDNV) from India.

    PubMed

    Rai, Praveen; Safeena, Muhammed P; Karunasagar, Iddya; Karunasagar, Indrani

    2011-06-01

    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) of shrimp, recently been classified as Penaeus stylirostris densovirus (PstDNV). The complete nucleic acid sequence of PstDNV from India was obtained by cloning and sequencing of different DNA fragment of the virus. The genome organisation of PstDNV revealed that there were three major coding domains: a left ORF (NS1) of 2001 bp, a mid ORF (NS2) of 1092 bp and a right ORF (VP) of 990 bp. The complete genome and amino acid sequences of three proteins viz., NS1, NS2 and VP were compared with the genomes of the virus reported from Hawaii, China and Mexico and with partial sequence available from isolates from different regions. The phylogenetic analysis of shrimp, insect and vertebrate parvovirus sequences showed that the Indian PstDNV isolate is phylogenetically more closely related to one of the three isolates from Taiwan (AY355307), and two isolates (AY362547 and AY102034) from Thailand.

  6. DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

    NASA Astrophysics Data System (ADS)

    Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

    1984-08-01

    A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.

  7. Purification to homogeneity and partial amino acid sequence of a fragment which includes the methyl acceptor site of the human DNA repair protein for O6-methylguanine.

    PubMed

    Major, G N; Gardner, E J; Carne, A F; Lawley, P D

    1990-03-25

    DNA repair by O6-methylguanine-DNA methyltransferase (O6-MT) is accomplished by removal by the enzyme of the methyl group from premutagenic O6-methylguanine-DNA, thereby restoring native guanine in DNA. The methyl group is transferred to an acceptor site cysteine thiol group in the enzyme, which causes the irreversible inactivation of O6-MT. We detected a variety of different forms of the methylated, inactivated enzyme in crude extracts of human spleen of molecular weights higher and lower than the usually observed 21-24kDa for the human O6-MT. Several apparent fragments of the methylated form of the protein were purified to homogeneity following reaction of partially-purified extract enzyme with O6-[3H-CH3]methylguanine-DNA substrate. One of these fragments yielded amino acid sequence information spanning fifteen residues, which was identified as probably belonging to human methyltransferase by virtue of both its significant sequence homology to three procaryote forms of O6-MT encoded by the ada, ogt (both from E. coli) and dat (B. subtilis) genes, and sequence position of the radiolabelled methyl group which matched the position of the conserved procaryote methyl acceptor site cysteine residue. Statistical prediction of secondary structure indicated good homologies between the human fragment and corresponding regions of the constitutive form of O6-MT in procaryotes (ogt and dat gene products), but not with the inducible ada protein, indicating the possibility that we had obtained partial amino acid sequence for a non-inducible form of the human enzyme. The identity of the fragment sequence as belonging to human methyltransferase was more recently confirmed by comparison with cDNA-derived amino acid sequence from the cloned human O6-MT gene from HeLa cells (1). The two sequences compared well, with only three out of fifteen amino acids being different (and two of them by only one nucleotide in each codon).

  8. Gene tree discordance of wild and cultivated Asian rice deciphered by genome-wide sequence comparison.

    PubMed

    Yang, Ching-chia; Sakai, Hiroaki; Numa, Hisataka; Itoh, Takeshi

    2011-05-15

    Although a large number of genes are expected to correctly solve a phylogenetic relationship, inconsistent gene tree topologies have been observed. This conflicting evidence in gene tree topologies, known as gene tree discordance, becomes increasingly important as advanced sequencing technologies produce an enormous amount of sequence information for phylogenomic studies among closely related species. Here, we aim to characterize the gene tree discordance of the Asian cultivated rice Oryza sativa and its progenitor, O. rufipogon, which will be an ideal case study of gene tree discordance. Using genome and cDNA sequences of O. sativa and O. rufipogon, we have conducted the first in-depth analyses of gene tree discordance in Asian rice. Our comparison of full-length cDNA sequences of O. rufipogon with the genome sequences of the japonica and indica cultivars of O. sativa revealed that 60% of the gene trees showed a topology consistent with the expected one, whereas the remaining genes supported significantly different topologies. Moreover, the proportions of the topologies deviated significantly from expectation, suggesting at least one hybridization event between the two subgroups of O. sativa, japonica and indica. In fact, a genome-wide alignment between japonica and indica indicated that significant portions of the indica genome are derived from japonica. In addition, literature concerning the pedigree of the indica cultivar strongly supported the hybridization hypothesis. Our molecular evolutionary analyses deciphered complicated evolutionary processes in closely related species. They also demonstrated the importance of gene tree discordance in the era of high-speed DNA sequencing.

  9. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

    PubMed Central

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-01-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed). PMID:22638583

  10. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion.

    PubMed

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-07-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed).

  11. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.

  12. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

  13. RNA editing events in mitochondrial genes by ultra-deep sequencing methods: a comparison of cytoplasmic male sterile, fertile and restored genotypes in cotton.

    PubMed

    Suzuki, Hideaki; Yu, Jiwen; Ness, Scott A; O'Connell, Mary A; Zhang, Jinfa

    2013-09-01

    Cytoplasmic male sterility (CMS) is a maternally inherited trait resulting in failure to produce functional pollen and is widely used in the production of hybrid seed. Improper RNA editing is implicated as the molecular basis for some CMS systems. However, the mechanism of CMS in cotton is unknown. This study compared RNA editing events in eight mitochondrial genes (atp1, 4, 6, 8, 9, and cox1, 2, 3) among three lines (maintainer B, CMS A, and restorer R). These events were quantified by ultra-deep sequencing of mitochondrial transcripts and sequencing of cloned versions of these genes as cDNAs. A comparison of genomic PCR and RT-PCR products detected 72 editing sites in coding sequences in the eight genes and four partial editing sites in the 3'-untranslated region of atp6. The most frequent alteration (61.4 %) resulted in changes of hydrophilic amino acids to hydrophobic amino acids and the most common alteration was proline (P) to leucine (L) (26.7 %). In atp6, RNA editing created a stop codon from a glutamine in the genomic sequence. Statistical analysis of the frequencies of RNA editing events detected differences between mtDNA genes, but no differences between cotton cytoplasms that could account for the CMS phenotype or restoration. This study represents the first work to use next-generation sequencing to identify RNA editing positions and efficiency, and possible association with CMS and restoration in plants.

  14. Sequence-specific thermodynamic properties of nucleic acids influence both transcriptional pausing and backtracking in yeast

    PubMed Central

    2017-01-01

    RNA Polymerase II pauses and backtracks during transcription, with many consequences for gene expression and cellular physiology. Here, we show that the energy required to melt double-stranded nucleic acids in the transcription bubble predicts pausing in Saccharomyces cerevisiae far more accurately than nucleosome roadblocks do. In addition, the same energy difference also determines when the RNA polymerase backtracks instead of continuing to move forward. This data-driven model corroborates—in a genome wide and quantitative manner—previous evidence that sequence-dependent thermodynamic features of nucleic acids influence both transcriptional pausing and backtracking. PMID:28301878

  15. Respiratory syncytial virus fusion glycoprotein: nucleotide sequence of mRNA, identification of cleavage activation site and amino acid sequence of N-terminus of F1 subunit.

    PubMed Central

    Elango, N; Satake, M; Coligan, J E; Norrby, E; Camargo, E; Venkatesan, S

    1985-01-01

    The amino acid sequence of respiratory syncytial virus fusion protein (Fo) was deduced from the sequence of a partial cDNA clone of mRNA and from the 5' mRNA sequence obtained by primer extension and dideoxysequencing. The encoded protein of 574 amino acids is extremely hydrophobic and has a molecular weight of 63371 daltons. The site of proteolytic cleavage within this protein was accurately mapped by determining a partial amino acid sequence of the N-terminus of the larger subunit (F1) purified by radioimmunoprecipitation using monoclonal antibodies. Alignment of the N-terminus of the F1 subunit within the deduced amino acid sequence of Fo permitted us to identify a sequence of lys-lys-arg-lys-arg-arg at the C-terminus of the smaller N-terminal F2 subunit that appears to represent the cleavage/activation domain. Five potential sites of glycosylation, four within the F2 subunit, were also identified. Three extremely hydrophobic domains are present in the protein; a) the N-terminal signal sequence, b) the N-terminus of the F1 subunit that is analogous to the N-terminus of the paramyxovirus F1 subunit and the HA2 subunit of influenza virus hemagglutinin, and c) the putative membrane anchorage domain near the C-terminus of F1. Images PMID:2987829

  16. Analysis of protein function and its prediction from amino acid sequence.

    PubMed

    Clark, Wyatt T; Radivojac, Predrag

    2011-07-01

    Understanding protein function is one of the keys to understanding life at the molecular level. It is also important in the context of human disease because many conditions arise as a consequence of alterations of protein function. The recent availability of relatively inexpensive sequencing technology has resulted in thousands of complete or partially sequenced genomes with millions of functionally uncharacterized proteins. Such a large volume of data, combined with the lack of high-throughput experimental assays to functionally annotate proteins, attributes to the growing importance of automated function prediction. Here, we study proteins annotated by Gene Ontology (GO) terms and estimate the accuracy of functional transfer from protein sequence only. We find that the transfer of GO terms by pairwise sequence alignments is only moderately accurate, showing a surprisingly small influence of sequence identity (SID) in a broad range (30-100%). We developed and evaluated a new predictor of protein function, functional annotator (FANN), from amino acid sequence. The predictor exploits a multioutput neural network framework which is well suited to simultaneously modeling dependencies between functional terms. Experiments provide evidence that FANN-GO (predictor of GO terms; available from http://www.informatics.indiana.edu/predrag) outperforms standard methods such as transfer by global or local SID as well as GOtcha, a method that incorporates the structure of GO.

  17. The Complete Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis ssp. lactis IL1403

    PubMed Central

    Bolotin, Alexander; Wincker, Patrick; Mauger, Stéphane; Jaillon, Olivier; Malarme, Karine; Weissenbach, Jean; Ehrlich, S. Dusko; Sorokin, Alexei

    2001-01-01

    Lactococcus lactis is a nonpathogenic AT-rich gram-positive bacterium closely related to the genus Streptococcus and is the most commonly used cheese starter. It is also the best-characterized lactic acid bacterium. We sequenced the genome of the laboratory strain IL1403, using a novel two-step strategy that comprises diagnostic sequencing of the entire genome and a shotgun polishing step. The genome contains 2,365,589 base pairs and encodes 2310 proteins, including 293 protein-coding genes belonging to six prophages and 43 insertion sequence (IS) elements. Nonrandom distribution of IS elements indicates that the chromosome of the sequenced strain may be a product of recent recombination between two closely related genomes. A complete set of late competence genes is present, indicating the ability of L. lactis to undergo DNA transformation. Genomic sequence revealed new possibilities for fermentation pathways and for aerobic respiration. It also indicated a horizontal transfer of genetic information from Lactococcus to gram-negative enteric bacteria of Salmonella-Escherichia group. [The sequence data described in this paper has been submitted to the GenBank data library under accession no. AE005176.] PMID:11337471

  18. Stereochemical Sequence Ion Selectivity: Proline versus Pipecolic-acid-containing Protonated Peptides

    NASA Astrophysics Data System (ADS)

    Abutokaikah, Maha T.; Guan, Shanshan; Bythell, Benjamin J.

    2017-01-01

    Substitution of proline by pipecolic acid, the six-membered ring congener of proline, results in vastly different tandem mass spectra. The well-known proline effect is eliminated and amide bond cleavage C-terminal to pipecolic acid dominates instead. Why do these two ostensibly similar residues produce dramatically differing spectra? Recent evidence indicates that the proton affinities of these residues are similar, so are unlikely to explain the result [Raulfs et al., J. Am. Soc. Mass Spectrom. 25, 1705-1715 (2014)]. An additional hypothesis based on increased flexibility was also advocated. Here, we provide a computational investigation of the "pipecolic acid effect," to test this and other hypotheses to determine if theory can shed additional light on this fascinating result. Our calculations provide evidence for both the increased flexibility of pipecolic-acid-containing peptides, and structural changes in the transition structures necessary to produce the sequence ions. The most striking computational finding is inversion of the stereochemistry of the transition structures leading to "proline effect"-type amide bond fragmentation between the proline/pipecolic acid-congeners: R (proline) to S (pipecolic acid). Additionally, our calculations predict substantial stabilization of the amide bond cleavage barriers for the pipecolic acid congeners by reduction in deleterious steric interactions and provide evidence for the importance of experimental energy regime in rationalizing the spectra.

  19. Implicit Sequence Learning in Dyslexia: A Within-Sequence Comparison of First- and Higher-Order Information

    ERIC Educational Resources Information Center

    Du, Wenchong; Kelly, Steve W.

    2013-01-01

    The present study examines implicit sequence learning in adult dyslexics with a focus on comparing sequence transitions with different statistical complexities. Learning of a 12-item deterministic sequence was assessed in 12 dyslexic and 12 non-dyslexic university students. Both groups showed equivalent standard reaction time increments when the…

  20. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  1. Amino acid sequence of atrial natriuretic peptides in human coronary sinus plasma.

    PubMed

    Yandle, T; Crozier, I; Nicholls, G; Espiner, E; Carne, A; Brennan, S

    1987-07-31

    Two atrial natriuretic peptides were purified from pooled human coronary sinus plasma by Sep-Pak extraction, immunoaffinity chromatography and reverse phase HPLC. The amino acid sequences of the two peptides were homologous with 99-126 human atrial natriuretic peptide (hANP) and 106-126 hANP, the latter being most probably linked to 99-105 ANP by the disulphide bond. The molar ratio of the peptides in plasma, as assessed by radioimmunoassay was 10:3.

  2. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein

    PubMed Central

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer. PMID:24147211

  3. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein.

    PubMed

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  4. Amino acid sequence similarity between rabies virus glycoprotein and snake venom curaremimetic neurotoxins.

    PubMed

    Lentz, T L; Wilson, P T; Hawrot, E; Speicher, D W

    1984-11-16

    Evidence was presented earlier that a host-cell receptor for the highly neurotropic rabies virus might be the acetylcholine receptor. The amino acid sequence of the glycoprotein of rabies virus was compared by computer analysis with that of snake venom curaremimetic neurotoxins, potent ligands of the acetylcholine receptor. A statistically significant sequence relation was found between a segment of the rabies glycoprotein and the entire sequence of long neurotoxins. The greatest identity occurs with residues considered most important in neurotoxicity, including those interacting with the acetylcholine binding site of the acetylcholine receptor. Because of the similarity between the glycoprotein and the receptor-binding region of the neurotoxins, this region of the viral glycoprotein may function as a recognition site for the acetylcholine receptor. Direct binding of the rabies virus glycoprotein to the acetylcholine receptor could contribute to the neurotropism of this virus.

  5. Partial amino acid sequence of human pancreatic stone protein, a novel pancreatic secretory protein.

    PubMed Central

    Montalto, G; Bonicel, J; Multigner, L; Rovery, M; Sarles, H; De Caro, A

    1986-01-01

    Pancreatic stone protein (PSP) is the major organic component of human pancreatic stones. With the use of monoclonal antibody immunoadsorbents, five immunoreactive forms (PSP-S) with close Mr values (14,000-19,000) were isolated from normal pancreatic juice. By CM-Trisacryl M chromatography the lowest-Mr form (PSP-S1) was separated from the others and some of its molecular characteristics were investigated. The Mr of the PSP-S1 polypeptide chain calculated from the amino acid composition was about 16,100. The N-terminal sequences (40 residues) of PSP and PSP-S1 are identical, which suggests that the peptide backbone is the same for both of these polypeptides. The PSP-S1 sequence was determined up to residue 65 and was found to be different from all other known protein sequences. Images Fig. 1. PMID:3541906

  6. Comparison of the complete genome sequences of Pseudomonas syringae pv. syringae B728a and pv. tomato DC3000

    SciTech Connect

    Feil, H; Feil, W S; Chain, P; Larimer, F; DiBartolo, G; Copeland, A; Lykidis, A; Trong, S; Nolan, M; Goltsman, E; Thiel, J; Malfatti, S; Loper, J E; Lapidus, A; Detter, J C; Land, M; Richardson, P M; Kyrpides, N C; Ivanova, N; Lindow, S E

    2005-07-14

    The complete genomic sequence of Pseudomonas syringae pathovar syringae B728a (Pss B728a), has been determined and is compared with that of Pseudomonas syringae pv. tomato DC3000 (Pst DC3000). The two pathovars of this economically important species of plant pathogenic bacteria differ in host range and other interactions with plants, with Pss having a more pronounced epiphytic stage of growth and higher abiotic stress tolerance and Pst DC3000 having a more pronounced apoplastic growth habitat. The Pss B728a genome (6.1 megabases) contains a circular chromosome and no plasmid, whereas the Pst DC3000 genome is 6.5 mbp in size, composed of a circular chromosome and two plasmids. While a high degree of similarity exists between the two sequenced Pseudomonads, 976 protein-encoding genes are unique to Pss B728a when compared to Pst DC3000, including large genomic islands likely to contribute to virulence and host specificity. Over 375 repetitive extragenic palindromic sequences (REPs) unique to Pss B728a when compared to Pst DC3000 are widely distributed throughout the chromosome except in 14 genomic islands, which generally had lower GC content than the genome as a whole. Content of the genomic islands vary, with one containing a prophage and another the plasmid pKLC102 of P. aeruginosa PAO1. Among the 976 genes of Pss B728a with no counterpart in Pst DC3000 are those encoding for syringopeptin (SP), syringomycin (SR), indole acetic acid biosynthesis, arginine degradation, and production of ice nuclei. The genomic comparison suggests that several unique genes for Pss B728a such as ectoine synthase, DNA repair, and antibiotic production may contribute to epiphytic fitness and stress tolerance of this organism.

  7. Comparison of the complete genome sequences of Pseudomonas syringae pv. syringae B728a and pv. tomato DC3000

    SciTech Connect

    Feil, Helene; Feil, William; Chain, Patrick S. G.; Larimer, Frank W; DiBartolo, Genevieve; Copeland, A; Lykidis, A; Trong, Stephen; Nolan, Matt; Goltsman, Eugene; Thiel, James; Malfatti, Stephanie; Loper, Joyce E.; Detter, J C; Lapidus, Alla L.; Land, Miriam L; Richardson, P M; Kyrpides, Nikos C; Ivanova, N; Lindow, Steven E.

    2005-01-01

    The complete genomic sequence of Pseudomonas syringae pv. syringae B728a (Pss B728a) has been determined and is compared with that of A syringae pv. tomato DC3000 (Pst DC3000). The two pathovars of this economically important species of plant pathogenic bacteria differ in host range and other interactions with plants, with Pss having a more pronounced epiphytic stage of growth and higher abiotic stress tolerance and Pst DC3000 having a more pronounced apoplastic growth habitat. The Pss B728a genome (6.1 Mb) contains a circular chromosome and no plasmid, whereas the Pst DC3000 genome is 6.5 mbp in size, composed of a circular chromosome and two plasmids. Although a high degree of similarity exists between the two sequenced Pseudomonads, 976 protein-encoding genes are unique to Pss B728a when compared with Pst DC3000, including large genomic islands likely to contribute to virulence and host specificity. Over 375 repetitive extragenic palindromic sequences unique to Pss B728a when compared with Pst DC3000 are widely distributed throughout the chromosome except in 14 genomic islands, which generally had lower GC content than the genome as a whole. Content of the genomic islands varies, with one containing a prophage and another the plasmid pKLC102 of Pseudomonas aeruginosa PAO1. Among the 976 genes of Pss B728a with no counterpart in Pst DC3000 are those encoding for syringopeptin, syringomycin, indole acetic acid biosynthesis, arginine degradation, and production of ice nuclei. The genomic comparison suggests that several unique genes for Pss B728a such as ectoine synthase, DNA repair, and antibiotic production may contribute to the epiphytic fitness and stress tolerance of this organism.

  8. The Evidence for α-Linolenic Acid and Cardiovascular Disease Benefits: Comparisons with Eicosapentaenoic Acid and Docosahexaenoic Acid12

    PubMed Central

    Fleming, Jennifer A.; Kris-Etherton, Penny M.

    2014-01-01

    Our understanding of the cardiovascular disease (CVD) benefits of α-linolenic acid (ALA, 18:3n–3) has advanced markedly during the past decade. It is now evident that ALA benefits CVD risk. The expansion of the ALA evidence base has occurred in parallel with ongoing research on eicosapentaenoic acid (EPA, 20:5n–3) and docosahexaenoic acid (DHA, 22:6n–3) and CVD. The available evidence enables comparisons to be made for ALA vs. EPA + DHA for CVD risk reduction. The epidemiologic evidence suggests comparable benefits of plant-based and marine-derived n–3 (omega-3) PUFAs. The clinical trial evidence for ALA is not as extensive; however, there have been CVD event benefits reported. Those that have been reported for EPA + DHA are stronger because only EPA + DHA differed between the treatment and control groups, whereas in the ALA studies there were diet differences beyond ALA between the treatment and control groups. Despite this, the evidence suggests many comparable CVD benefits of ALA vs. EPA + DHA. Thus, we believe that it is time to revisit what the contemporary dietary recommendation should be for ALA to decrease the risk of CVD. Our perspective is that increasing dietary ALA will decrease CVD risk; however, randomized controlled clinical trials are necessary to confirm this and to determine what the recommendation should be. With a stronger evidence base, the nutrition community will be better positioned to revise the dietary recommendation for ALA for CVD risk reduction. PMID:25398754

  9. [MOLECULAR EVOLUTION OF ION CHANNELS: AMINO ACID SEQUENCES AND 3D STRUCTURES].

    PubMed

    Korkosh, V S; Zhorov, B S; Tikhonov, D B

    2016-01-01

    An integral part of modern evolutionary biology is comparative analysis of structure and function of macromolecules such as proteins. The first and critical step to understand evolution of homologous proteins is their amino acid sequence alignment. However, standard algorithms fop not provide unambiguous sequence alignments for proteins of poor homology. More reliable results can be obtained by comparing experimental 3D structures obtained at atomic resolution, for instance, with the aid of X-ray structural analysis. If such structures are lacking, homology modeling is used, which may take into account indirect experimental data on functional roles of individual amino-acid residues. An important problem is that the sequence alignment, which reflects genetic modifications, does not necessarily correspond to the functional homology. The latter depends on three-dimensional structures which are critical for natural selection. Since alignment techniques relying only on the analysis of primary structures carry no information on the functional properties of proteins, including 3D structures into consideration is very important. Here we consider several examples involving ion channels and demonstrate that alignment of their three-dimensional structures can significantly improve sequence alignments obtained by traditional methods.

  10. Analysis of amino acid sequence variations and immunoglobulin E-binding epitopes of German cockroach tropomyosin.

    PubMed

    Jeong, Kyoung Yong; Lee, Jongweon; Lee, In-Yong; Ree, Han-Il; Hong, Chein-Soo; Yong, Tai-Soon

    2004-09-01

    The allergenicities of tropomyosins from different organisms have been reported to vary. The cDNA encoding German cockroach tropomyosin (Bla g 7) was isolated, expressed, and characterized previously. In the present study, the amino acid sequence variations in German cockroach tropomyosin were analyzed in order to investigate its influence on allergenicity. We also undertook the identification of immunodominant peptides containing immunoglobulin E (IgE) epitopes which may facilitate the development of diagnostic and immunotherapeutic strategies based on the recombinant proteins. Two-dimensional gel electrophoresis and immunoblot analysis with mouse anti-recombinant German cockroach tropomyosin serum was performed to investigate the isoforms at the protein level. Reverse transcriptase PCR (RT-PCR) was applied to examine the sequence diversity. Eleven different variants of the deduced amino acid sequences were identified by RT-PCR. German cockroach tropomyosin has only minor sequence variations that did not seem to affect its allergenicity significantly. These results support the molecular basis underlying the cross-reactivities of arthropod tropomyosins. Recombinant fragments were also generated by PCR, and IgE-binding epitopes were assessed by enzyme-linked immunosorbent assay. Sera from seven patients revealed heterogeneous IgE-binding responses. This study demonstrates multiple IgE-binding epitope regions in a single molecule, suggesting that full-length tropomyosin should be used for the development of diagnostic and therapeutic reagents.

  11. Complete amino acid sequence of a histidine-rich proteolytic fragment of human ceruloplasmin.

    PubMed

    Kingston, I B; Kingston, B L; Putnam, F W

    1979-04-01

    The complete amino acid sequence has been determined for a fragment of human ceruloplasmin [ferroxidase; iron(II):oxygen oxidoreductase, EC 1.16.3.1]. The fragment (designated Cp F5) contains 159 amino acid residues and has a molecular weight of 18,650; it lacks carbohydrate, is rich in histidine, and contains one free cysteine that may be part of a copper-binding site. This fragment is present in most commercial preparations of ceruloplasmin, probably owing to proteolytic degradation, but can also be obtained by limited cleavage of single-chain ceruloplasmin with plasmin. Cp F5 probably is an intact domain attached to the COOH-terminal end of single-chain ceruloplasmin via a labile interdomain peptide bond. A model of the secondary structure predicted by empirical methods suggests that almost one-third of the amino acid residues are distributed in alpha helices, about a third in beta-sheet structure, and the remainder in beta turns and unidentified structures. Computer analysis of the amino acid sequence has not demonstrated a statistically significant relationship between this ceruloplasmin fragment and any other protein, but there is some evidence for an internal duplication.

  12. Substrate-Driven Mapping of the Degradome by Comparison of Sequence Logos

    PubMed Central

    Fuchs, Julian E.; von Grafenstein, Susanne; Huber, Roland G.; Kramer, Christian; Liedl, Klaus R.

    2013-01-01

    Sequence logos are frequently used to illustrate substrate preferences and specificity of proteases. Here, we employed the compiled substrates of the MEROPS database to introduce a novel metric for comparison of protease substrate preferences. The constructed similarity matrix of 62 proteases can be used to intuitively visualize similarities in protease substrate readout via principal component analysis and construction of protease specificity trees. Since our new metric is solely based on substrate data, we can engraft the protease tree including proteolytic enzymes of different evolutionary origin. Thereby, our analyses confirm pronounced overlaps in substrate recognition not only between proteases closely related on sequence basis but also between proteolytic enzymes of different evolutionary origin and catalytic type. To illustrate the applicability of our approach we analyze the distribution of targets of small molecules from the ChEMBL database in our substrate-based protease specificity trees. We observe a striking clustering of annotated targets in tree branches even though these grouped targets do not necessarily share similarity on protein sequence level. This highlights the value and applicability of knowledge acquired from peptide substrates in drug design of small molecules, e.g., for the prediction of off-target effects or drug repurposing. Consequently, our similarity metric allows to map the degradome and its associated drug target network via comparison of known substrate peptides. The substrate-driven view of protein-protein interfaces is not limited to the field of proteases but can be applied to any target class where a sufficient amount of known substrate data is available. PMID:24244149

  13. Processing and amino acid sequence analysis of the mouse mammary tumor virus env gene product.

    PubMed Central

    Arthur, L O; Copeland, T D; Oroszlan, S; Schochetman, G

    1982-01-01

    The envelope proteins of mouse mammary tumor virus (MMTV) are synthesized from a subgenomic 24S mRNA as a 75,000-dalton glycosylated precursor polyprotein which is eventually processed to the mature glycoproteins gp52 and gp36. In vivo synthesis of this env precursor in the presence of the core glycosylation inhibitor tunicamycin yielded a precursor of approximately 61,000 daltons (P61env). However, a 67,000-dalton protein (P67env) was obtained from cell-free translation with the MMTV 24S mRNA as the template. To determine whether the portion of the protein cleaved from P67env to give P61env was removed from the NH2-terminal end of P67env and as such would represent a leader sequence, the NH2-terminal amino acid sequence of the terminal peptide gp52 was determined. Glutamic acid, and not methionine, was found to be the amino-terminal residue of gp52, indicating that the cleaved portion was derived from the NH2-terminal end of P67env. The NH2-terminal amino acid sequences of gp52's from endogenous and exogenous C3H MMTVs were determined though 46 residues and found to be identical. However, amino acid composition and type-specific gp52 radioimmunoassays from MMTVs grown in heterologous cells indicated primary structure differences between gp52's of the two viruses. The nucleic acid sequence of cloned MMTV DNA fragments (J. Majors and H. E. Varmus, personal communication) in conjunction with the NH2-terminal sequence of gp52 allowed localization of the env gene in the MMTV genome. Nucleotides coding for the NH2 terminus of gp52 begin approximately 0.8 kilobase to the 3' side of the single EcoRI cleavage site. Localization of the env gene at that point agrees with the proposed gene order -gag-pol-env- and also allows sufficient coding potential for the glycoprotein precursor without extending into the long terminal repeat. Images PMID:6281457

  14. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    PubMed Central

    Rhee, Mun Su; Moritz, Brélan E.; Xie, Gary; Glavina del Rio, T.; Dalin, E.; Tice, H.; Bruce, D.; Goodwin, L.; Chertkov, O.; Brettin, T.; Han, C.; Detter, C.; Pitluck, S.; Land, Miriam L.; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, K. T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed. PMID:22675583

  15. BeadCons: detection of nucleic acid sequences by flow cytometry.

    PubMed

    Horejsh, Douglas; Martini, Federico; Capobianchi, Maria Rosaria

    2005-11-01

    Molecular beacons are single-stranded nucleic acid structures with a terminal fluorophore and a distal, terminal quencher. These molecules are typically used in real-time PCR assays, but have also been conjugated with solid matrices. This unit describes protocols related to molecular beacon-conjugated beads (BeadCons), whose specific hybridization with complementary target sequences can be resolved by cytometry. Assay sensitivity is achieved through the concentration of fluorescence signal on discrete particles. By using molecular beacons with different fluorophores and microspheres of different sizes, it is possible to construct a fluid array system with each bead corresponding to a specific target nucleic acid. Methods are presented for the design, construction, and use of BeadCons for the specific, multiplexed detection of unlabeled nucleic acids in solution. The use of bead-based detection methods will likely lead to the design of new multiplex molecular diagnostic tools.

  16. Measuring nanometer distances in nucleic acids using a sequence-independent nitroxide probe

    PubMed Central

    Qin, Peter Z; Haworth, Ian S; Cai, Qi; Kusnetzow, Ana K; Grant, Gian Paola G; Price, Eric A; Sowa, Glenna Z; Popova, Anna; Herreros, Bruno; He, Honghang

    2008-01-01

    This protocol describes the procedures for measuring nanometer distances in nucleic acids using a nitroxide probe that can be attached to any nucleotide within a given sequence. Two nitroxides are attached to phosphorothioates that are chemically substituted at specific sites of DNA or RNA. Inter-nitroxide distances are measured using a four-pulse double electron–electron resonance technique, and the measured distances are correlated to the parent structures using a Web-accessible computer program. Four to five days are needed for sample labeling, purification and distance measurement. The procedures described herein provide a method for probing global structures and studying conformational changes of nucleic acids and protein/nucleic acid complexes. PMID:17947978

  17. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1.

    PubMed

    Rhee, Mun Su; Moritz, Brélan E; Xie, Gary; Glavina Del Rio, T; Dalin, E; Tice, H; Bruce, D; Goodwin, L; Chertkov, O; Brettin, T; Han, C; Detter, C; Pitluck, S; Land, Miriam L; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O; Shanmugam, K T

    2011-12-31

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  18. Comparison of Mycoplasma pneumoniae Genome Sequences from Strains Isolated from Symptomatic and Asymptomatic Patients

    PubMed Central

    Spuesens, Emiel B. M.; Brouwer, Rutger W. W.; Mol, Kristin H. J. M.; Hoogenboezem, Theo; Kockx, Christel E. M.; Jansen, Ruud; Van IJcken, Wilfred F. J.; Van Rossum, Annemarie M. C.; Vink, Cornelis

    2016-01-01

    Mycoplasma pneumoniae is a common cause of respiratory tract infections (RTIs) in children. We recently demonstrated that this bacterium can be carried asymptomatically in the respiratory tract of children. To identify potential genetic differences between M. pneumoniae strains that are carried asymptomatically and those that cause symptomatic infections, we performed whole-genome sequence analysis of 20 M. pneumoniae strains. The analyzed strains included 3 reference strains, 3 strains isolated from asymptomatic children, 13 strains isolated from clinically well-defined patients suffering from an upper (n = 4) or lower (n = 9) RTI, and one strain isolated from a follow-up patient who recently recovered from an RTI. The obtained sequences were each compared to the sequences of the reference strains. To find differences between strains isolated from asymptomatic and symptomatic individuals, a variant comparison was performed between the different groups of strains. Irrespective of the group (asymptomatic vs. symptomatic) from which the strains originated, subtype 1 and subtype 2 strains formed separate clusters. We could not identify a specific genotype associated with M. pneumoniae virulence. However, we found marked genetic differences between clinical isolates and the reference strains, which indicated that the latter strains may not be regarded as appropriate representatives of circulating M. pneumoniae strains. PMID:27833597

  19. International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

    PubMed Central

    Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.

    2015-01-01

    This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030

  20. Performance evaluation of Sanger sequencing for the diagnosis of primary hyperoxaluria and comparison with targeted next generation sequencing

    PubMed Central

    Williams, Emma L; Bagg, Eleanor A L; Mueller, Michael; Vandrovcova, Jana; Aitman, Timothy J; Rumsby, Gill

    2015-01-01

    Definitive diagnosis of primary hyperoxaluria (PH) currently utilizes sequential Sanger sequencing of the AGXT, GRPHR, and HOGA1 genes but efficacy is unproven. This analysis is time-consuming, relatively expensive, and delays in diagnosis and inappropriate treatment can occur if not pursued early in the diagnostic work-up. We reviewed testing outcomes of Sanger sequencing in 200 consecutive patient samples referred for analysis. In addition, the Illumina Truseq custom amplicon system was evaluated for paralleled next-generation sequencing (NGS) of AGXT,GRHPR, and HOGA1 in 90 known PH patients. AGXT sequencing was requested in all patients, permitting a diagnosis of PH1 in 50%. All remaining patients underwent targeted exon sequencing of GRHPR and HOGA1 with 8% diagnosed with PH2 and 8% with PH3. Complete sequencing of both GRHPR and HOGA1 was not requested in 25% of patients referred leaving their diagnosis in doubt. NGS analysis showed 98% agreement with Sanger sequencing and both approaches had 100% diagnostic specificity. Diagnostic sensitivity of Sanger sequencing was 98% and for NGS it was 97%. NGS has comparable diagnostic performance to Sanger sequencing for the diagnosis of PH and, if implemented, would screen for all forms of PH simultaneously ensuring prompt diagnosis at decreased cost. PMID:25629080

  1. A 25-Amino Acid Sequence of the Arabidopsis TGD2 Protein Is Sufficient for Specific Binding of Phosphatidic Acid*

    PubMed Central

    Lu, Binbin; Benning, Christoph

    2009-01-01

    Genetic analysis suggests that the TGD2 protein of Arabidopsis is required for the biosynthesis of endoplasmic reticulum derived thylakoid lipids. TGD2 is proposed to be the substrate-binding protein of a presumed lipid transporter consisting of the TGD1 (permease) and TGD3 (ATPase) proteins. The TGD1, -2, and -3 proteins are localized in the inner chloroplast envelope membrane. TGD2 appears to be anchored with an N-terminal membrane-spanning domain into the inner envelope membrane, whereas the C-terminal domain faces the intermembrane space. It was previously shown that the C-terminal domain of TGD2 binds phosphatidic acid (PtdOH). To investigate the PtdOH binding site of TGD2 in detail, the C-terminal domain of the TGD2 sequence lacking the transit peptide and transmembrane sequences was fused to the C terminus of the Discosoma sp. red fluorescent protein (DR). This greatly improved the solubility of the resulting DR-TGD2C fusion protein following production in Escherichia coli. The DR-TGD2C protein bound PtdOH with high specificity, as demonstrated by membrane lipid-protein overlay and liposome association assays. Internal deletion and truncation mutagenesis identified a previously undescribed minimal 25-amino acid fragment in the C-terminal domain of TGD2 that is sufficient for PtdOH binding. Binding characteristics of this 25-mer were distinctly different from those of TGD2C, suggesting that additional sequences of TGD2 providing the proper context for this 25-mer are needed for wild type-like PtdOH binding. PMID:19416982

  2. Isolation, amino acid sequence and biological characterization of an "aspartic-49" phospholipase A₂ from Bothrops (Rhinocerophis) ammodytoides venom.

    PubMed

    Clement, Herlinda; Costa de Oliveira, Vanessa; Zamudio, Fernando Z; Lago, Néstor R; Valdez-Cruz, Norma A; Bérnard Valle, Melisa; Hajos, Silvia E; Alagón, Alejandro; Possani, Lourival D; de Roodt, Adolfo R

    2012-12-01

    A phospholipase enzyme was separated by chromatography from the venom of the snake Bothrops (Rhinocerophis) ammodytoides and characterized. The experimentally determined molecular weight was 13,853.65 Da, and the full primary structure was determined by Edman degradation and mass spectrometry analysis. The enzyme contains 122 amino acids residues closely stabilized by 7 disulfide bridges with an isoelectric point of 6.13. Sequence comparison with other known secretory PLA2 shows that the enzyme isolated belongs to the group II, presenting an aspartic acid residue at position 48 (numbered by convention as Asp49) of the active site, and accordingly displaying enzymatic activity. The enzyme corresponds to 3% of the total mass of the venom. The enzyme is mildly toxic to mice. The intravenous LD₅₀ of this phospholipase in CD-1 mice was around 6 μg/g of mouse body weight (more exactly 117 μg/mouse of 20 g) and the minimal mortal dose (MMD) was estimated to be close to 10 μg/g. In contrast, the LD₅₀ of the venom was circa 2 μg/g mouse body weight. Toxicological analyses of the purified enzyme were performed in vitro and in vivo using experimental animals (mice and rats). The enzyme at high doses caused pulmonary congestion, intraperitoneal bleeding, inhibition of clot retraction and muscle tissue alterations with increasing of creatine kinase levels.

  3. Comparison of the effects of three different (-)-hydroxycitric acid preparations on food intake in rats: response

    PubMed Central

    Preuss, Harry G; Bagchi, Manashi; Bagchi, Debasis

    2006-01-01

    A response to Louter-van de Haar J, Wielinga PY, Scheurink AJ, Nieuwenhuizen AG: Comparison of the effects of three different (-)-hydroxycitric acid preparations on food intake in rats. Nutr Metabol 2005, 2:23 PMID:16846513

  4. Analysis of expressed sequence tags from Uromyces appendiculatus hyphae and haustoria and their comparison to sequences from other rust fungi

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Two separate cDNA libraries were prepared for RNA extracted from bean rust (Uromyces appendiculatus) hyphae and haustoria isolated from infected leaves bean leaves (Phaseolus vulgaris cv Pint 111) between 2 and 8 dpi. Approximately 13,000 clones were sequenced from both ends and the sequences assem...

  5. Nucleotide sequences of the Pseudomonas savastanoi indoleacetic acid genes show homology with Agrobacterium tumefaciens T-DNA

    PubMed Central

    Yamada, Tetsuji; Palm, Curtis J.; Brooks, Bob; Kosuge, Tsune

    1985-01-01

    We report the nucleotide sequences of iaaM and iaaH, the genetic determinants for, respectively, tryptophan 2-monooxygenase and indoleacetamide hydrolase, the enzymes that catalyze the conversion of L-tryptophan to indoleacetic acid in the tumor-forming bacterium Pseudomonas syringae pv. savastanoi. The sequence analysis indicates that the iaaM locus contains an open reading frame encoding 557 amino acids that would comprise a protein with a molecular weight of 61,783; the iaaH locus contains an open reading frame of 455 amino acids that would comprise a protein with a molecular weight of 48,515. Significant amino acid sequence homology was found between the predicted sequence of the tryptophan monooxygenase of P. savastanoi and the deduced product of the T-DNA tms-1 gene of the octopine-type plasmid pTiA6NC from Agrobacterium tumefaciens. Strong homology was found in the 25 amino acid sequence in the putative FAD-binding region of tryptophan monooxygenase. Homology was also found in the amino acid sequences representing the central regions of the putative products of iaaH and tms-2 T-DNA. The results suggest a strong similarity in the pathways for indoleacetic acid synthesis encoded by genes in P. savastanoi and in A. tumefaciens T-DNA. Images PMID:16593610

  6. Large-scale sequence and structural comparisons of human naive and antigen-experienced antibody repertoires

    PubMed Central

    DeKosky, Brandon J.; Lungu, Oana I.; Park, Daechan; Johnson, Erik L.; Charab, Wissam; Chrysostomou, Constantine; Kuroda, Daisuke; Ellington, Andrew D.; Ippolito, Gregory C.; Gray, Jeffrey J.; Georgiou, George

    2016-01-01

    Elucidating how antigen exposure and selection shape the human antibody repertoire is fundamental to our understanding of B-cell immunity. We sequenced the paired heavy- and light-chain variable regions (VH and VL, respectively) from large populations of single B cells combined with computational modeling of antibody structures to evaluate sequence and structural features of human antibody repertoires at unprecedented depth. Analysis of a dataset comprising 55,000 antibody clusters from CD19+CD20+CD27− IgM-naive B cells, >120,000 antibody clusters from CD19+CD20+CD27+ antigen–experienced B cells, and >2,000 RosettaAntibody-predicted structural models across three healthy donors led to a number of key findings: (i) VH and VL gene sequences pair in a combinatorial fashion without detectable pairing restrictions at the population level; (ii) certain VH:VL gene pairs were significantly enriched or depleted in the antigen-experienced repertoire relative to the naive repertoire; (iii) antigen selection increased antibody paratope net charge and solvent-accessible surface area; and (iv) public heavy-chain third complementarity-determining region (CDR-H3) antibodies in the antigen-experienced repertoire showed signs of convergent paired light-chain genetic signatures, including shared light-chain third complementarity-determining region (CDR-L3) amino acid sequences and/or Vκ,λ–Jκ,λ genes. The data reported here address several longstanding questions regarding antibody repertoire selection and development and provide a benchmark for future repertoire-scale analyses of antibody responses to vaccination and disease. PMID:27114511

  7. Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

    PubMed Central

    Chen, Ke; Kurgan, Lukasz A; Ruan, Jishou

    2007-01-01

    Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D) protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP); the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM) and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are characterized by accuracies below 70

  8. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  9. Initial sequence of the chimpanzee genome and comparison with the human genome.

    PubMed

    2005-09-01

    Here we present a draft genome sequence of the common chimpanzee (Pan troglodytes). Through comparison with the human genome, we have generated a largely complete catalogue of the genetic differences that have accumulated since the human and chimpanzee species diverged from our common ancestor, constituting approximately thirty-five million single-nucleotide changes, five million insertion/deletion events, and various chromosomal rearrangements. We use this catalogue to explore the magnitude and regional variation of mutational forces shaping these two genomes, and the strength of positive and negative selection acting on their genes. In particular, we find that the patterns of evolution in human and chimpanzee protein-coding genes are highly correlated and dominated by the fixation of neutral and slightly deleterious alleles. We also use the chimpanzee genome as an outgroup to investigate human population genetics and identify signatures of selective sweeps in recent human evolution.

  10. Identification of Campylobacter spp. and discrimination from Helicobacter and Arcobacter spp. by direct sequencing of PCR-amplified cpn60 sequences and comparison to cpnDB, a chaperonin reference sequence database.

    PubMed

    Hill, Janet E; Paccagnella, Ana; Law, Kee; Melito, Pasquale L; Woodward, David L; Price, Lawrence; Leung, Amy H; Ng, Lai-King; Hemmingsen, Sean M; Goh, Swee Han

    2006-04-01

    A robust method for the identification of Campylobacter spp. based on direct sequencing of PCR-amplified partial cpn60 sequences and comparison of these to a reference database of cpn60 sequences is reported. A total of 53 Campylobacter isolates, representing 15 species, were identified and distinguished from phenotypically similar Helicobacter and Arcobacter strains. Pairwise cpn60 sequence identities between Campylobacter spp. ranged from 71 to 92 %, with most between 71 and 79 %, making discrimination of these species obvious. The method described overcomes limitations of existing PCR-based methods, which require time-consuming and complex post-amplification steps such as the cloning of amplification products. The results of this study demonstrate the potential for use of the reference chaperonin sequence database, cpnDB, as a tool for identification of bacterial isolates based on cpn60 sequences amplified with universal primers.

  11. Gene sequence and predicted amino acid sequence of the motA protein, a membrane-associated protein required for flagellar rotation in Escherichia coli.

    PubMed Central

    Dean, G E; Macnab, R M; Stader, J; Matsumura, P; Burks, C

    1984-01-01

    The motA and motB gene products of Escherichia coli are integral membrane proteins necessary for flagellar rotation. We determined the DNA sequence of the region containing the motA gene and its promoter. Within this sequence, there is an open reading frame of 885 nucleotides, which with high probability (98% confidence level) meets criteria for a coding sequence. The 295-residue amino acid translation product had a molecular weight of 31,974, in good agreement with the value determined experimentally by gel electrophoresis. The amino acid sequence, which was quite hydrophobic, was subjected to a theoretical analysis designed to predict membrane-spanning alpha-helical segments of integral membrane proteins; four such hydrophobic helices were predicted by this treatment. Additional amphipathic helices may also be present. A remarkable feature of the sequence is the existence of two segments of high uncompensated charge density, one positive and the other negative. Possible organization of the protein in the membrane is discussed. Asymmetry in the amino acid composition of translated DNA sequences was used to distinguish between two possible initiation codons. The use of this method as a criterion for authentication of coding regions is described briefly in an Appendix. PMID:6090403

  12. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3

    PubMed Central

    Xiao, Jingfa; Hao, Lirui; Crowley, David E.; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  13. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    PubMed

    Wang, Xiaoyu; Chen, Meili; Xiao, Jingfa; Hao, Lirui; Crowley, David E; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.

  14. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, Heinz-Ulrich G.; Gray, Joe W.

    1995-01-01

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers ard probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.

  15. Unconventional amino acid sequence of the sun anemone (Stoichactis helianthus) polypeptide neurotoxin

    SciTech Connect

    Kem, W.; Dunn, B.; Parten, B.; Pennington, M.; Price, D.

    1986-05-01

    A 5000 dalton polypeptide neurotoxin (Sh-NI) purified by G50 Sephadex, P-cellulose, and SP-Sephadex chromatography was homogeneous by isoelectric focusing. Sh-NI was highly toxic to crayfish (LD/sub 50/ 0.6 ..mu..g/kg) but without effect upon mice at 15,000 ..mu..g/kg (i.p. injection). The reduced, /sup 3/H-carboxymethylated toxin and its fragments were subjected to automatic Edman degradation and the resulting PTH-amino acids were identified by HPLC, back hydrolysis, and scintillation counting. Peptides resulting from proteolytic (clostripain, staphylococcal protease) and chemical (tryptophan) cleavage were sequenced. The sequence is: AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK. This sequence differs considerably from the homologous Anemonia and Anthopleura toxins; many of the identical residues (6 half-cystines, G9, P10, R13, G19, G29, W30) are probably critical for folding rather than receptor recognition. However, the Sh-NI sequence closely resembles Radioanthus macrodactylus neurotoxin III and r. paumotensis II. The authors propose that Sh-NI and related Radioanthus toxins act upon a different site on the sodium channel.

  16. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, H.U.G.; Gray, J.W.

    1995-06-27

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.

  17. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    NASA Astrophysics Data System (ADS)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  18. Microbial community dynamics in bioaugmented sequencing batch reactors for bromoamine acid removal.

    PubMed

    Qu, Yuanyuan; Zhou, Jiti; Wang, Jing; Fu, Xiang; Xing, Linlin

    2005-05-01

    Sphingomonas xenophaga QYY with the ability to degrade bromoamine acid (BAA) was previously isolated from sludge samples. The enhancement of BAA removal by strain QYY in sequencing batch reactors (SBRs) was investigated in this study. The results showed that augmented SBRs exhibited stronger abilities to degrade BAA than the non-augmented control one. In order to estimate the relationship between community dynamics and function of augmented SBRs, a combined method based on fingerprints (ribosomal intergenic spacer analysis, RISA) and 16S rRNA gene sequencing was used. The results indicated that the microbial community dynamics were substantially changed, and the introduced strain QYY was persistent in the augmented systems. This study suggests that it is feasible and potentially useful to enhance BAA removal using BAA-degrading bacteria, such as S. xenophaga QYY.

  19. [Measurement of the amino acid sequence for the fusion protein FP3 with LC-MS/MS].

    PubMed

    Li, Xiang; Gao, Xiang-Dong; Tao, Lei; Pei, De-Ning; Guo, Ying; Rao, Chun-Ming; Wang, Jun-Zhi

    2012-02-01

    The amino acid sequence of the fusion protein FP3 was measured by two types of LC-MS/MS and its primary structure was confirmed. After reduction and alkylation, the protein was digested with trypsin and glycosyl groups in glycopeptide were removed by PNGase F. The mixed peptides were separated by LC, then Q-TOF and Ion trap tandem mass spectrometry were used to measure b, y fragment ions of each peptide to analyze the amino acid sequence of fusion protein FP3. Seventy-six percent of full amino acid sequence of the fusion protein FP3 was measured by LC-ESI-Q-TOF with the remaining 24% completed by LC-ESI-Trap. As LC-MS and tandem mass spectrometry are rapid, sensitive, accurate to measure the protein amino acid sequence, they are important approach to structure analysis and identification of recombinant protein.

  20. NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents

    PubMed Central

    Liu, Sophia S.; Hockenberry, Adam J.; Lancichinetti, Andrea; Jewett, Michael C.

    2016-01-01

    The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems. PMID:27835644

  1. Biosynthesis of D-alanyl-lipoteichoic acid: cloning, nucleotide sequence, and expression of the Lactobacillus casei gene for the D-alanine-activating enzyme.

    PubMed Central

    Heaton, M P; Neuhaus, F C

    1992-01-01

    The D-alanine-activating enzyme (Dae; EC 6.3.2.4) encoded by the dae gene from Lactobacillus casei ATCC 7469 is a cytosolic protein essential for the formation of the D-alanyl esters of membrane-bound lipoteichoic acid. The gene has been cloned, sequenced, and expressed in Escherichia coli, an organism which does not possess Dae activity. The open reading frame is 1,518 nucleotides and codes for a protein of 55.867 kDa, a value in agreement with the 56 kDa obtained by electrophoresis. A putative promoter and ribosome-binding site immediately precede the dae gene. A second open reading frame contiguous with the dae gene has also been partially sequenced. The organization of these genetic elements suggests that more than one enzyme necessary for the biosynthesis of D-alanyl-lipoteichoic acid may be present in this operon. Analysis of the amino acid sequence deduced from the dae gene identified three regions with significant homology to proteins in the following groups of ATP-utilizing enzymes: (i) the acid-thiol ligases, (ii) the activating enzymes for the biosynthesis of enterobactin, and (iii) the synthetases for tyrocidine, gramicidin S, and penicillin. From these comparisons, a common motif (GXXGXPK) has been identified that is conserved in the 19 protein domains analyzed. This motif may represent the phosphate-binding loop of an ATP-binding site for this class of enzymes. A DNA fragment (1,568 nucleotides) containing the dae gene and its putative ribosome-binding site has been subcloned and expressed in E. coli. Approximately 0.5% of the total cell protein is active Dae, whereas 21% is in the form of inclusion bodies. The isolation of this minimal fragment without a native promoter sequence provides the basis for designing a genetic system for modulating the D-alanine ester content of lipoteichoic acid. PMID:1385594

  2. Morphological tranformation of calcite crystal growth by prismatic "acidic" polypeptide sequences.

    SciTech Connect

    Kim, I; Giocondi, J L; Orme, C A; Collino, J; Evans, J S

    2007-02-13

    Many of the interesting mechanical and materials properties of the mollusk shell are thought to stem from the prismatic calcite crystal assemblies within this composite structure. It is now evident that proteins play a major role in the formation of these assemblies. Recently, a superfamily of 7 conserved prismatic layer-specific mollusk shell proteins, Asprich, were sequenced, and the 42 AA C-terminal sequence region of this protein superfamily was found to introduce surface voids or porosities on calcite crystals in vitro. Using AFM imaging techniques, we further investigate the effect that this 42 AA domain (Fragment-2) and its constituent subdomains, DEAD-17 and Acidic-2, have on the morphology and growth kinetics of calcite dislocation hillocks. We find that Fragment-2 adsorbs on terrace surfaces and pins acute steps, accelerates then decelerates the growth of obtuse steps, forms clusters and voids on terrace surfaces, and transforms calcite hillock morphology from a rhombohedral form to a rounded one. These results mirror yet are distinct from some of the earlier findings obtained for nacreous polypeptides. The subdomains Acidic-2 and DEAD-17 were found to accelerate then decelerate obtuse steps and induce oval rather than rounded hillock morphologies. Unlike DEAD-17, Acidic-2 does form clusters on terrace surfaces and exhibits stronger obtuse velocity inhibition effects than either DEAD-17 or Fragment-2. Interestingly, a 1:1 mixture of both subdomains induces an irregular polygonal morphology to hillocks, and exhibits the highest degree of acute step pinning and obtuse step velocity inhibition. This suggests that there is some interplay between subdomains within an intra (Fragment-2) or intermolecular (1:1 mixture) context, and sequence interplay phenomena may be employed by biomineralization proteins to exert net effects on crystal growth and morphology.

  3. Sequence selective recognition of double-stranded RNA using triple helix-forming peptide nucleic acids.

    PubMed

    Zengeya, Thomas; Gupta, Pankaj; Rozners, Eriks

    2014-01-01

    Noncoding RNAs are attractive targets for molecular recognition because of the central role they play in gene expression. Since most noncoding RNAs are in a double-helical conformation, recognition of such structures is a formidable problem. Herein, we describe a method for sequence-selective recognition of biologically relevant double-helical RNA (illustrated on ribosomal A-site RNA) using peptide nucleic acids (PNA) that form a triple helix in the major grove of RNA under physiologically relevant conditions. Protocols for PNA preparation and binding studies using isothermal titration calorimetry are described in detail.

  4. Fast computational methods for predicting protein structure from primary amino acid sequence

    DOEpatents

    Agarwal, Pratul Kumar

    2011-07-19

    The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.

  5. Fluorescence energy transfer as a probe for nucleic acid structures and sequences.

    PubMed Central

    Mergny, J L; Boutorine, A S; Garestier, T; Belloc, F; Rougée, M; Bulychev, N V; Koshkin, A A; Bourson, J; Lebedev, A V; Valeur, B

    1994-01-01

    The primary or secondary structure of single-stranded nucleic acids has been investigated with fluorescent oligonucleotides, i.e., oligonucleotides covalently linked to a fluorescent dye. Five different chromophores were used: 2-methoxy-6-chloro-9-amino-acridine, coumarin 500, fluorescein, rhodamine and ethidium. The chemical synthesis of derivatized oligonucleotides is described. Hybridization of two fluorescent oligonucleotides to adjacent nucleic acid sequences led to fluorescence excitation energy transfer between the donor and the acceptor dyes. This phenomenon was used to probe primary and secondary structures of DNA fragments and the orientation of oligodeoxynucleotides synthesized with the alpha-anomers of nucleoside units. Fluorescence energy transfer can be used to reveal the formation of hairpin structures and the translocation of genes between two chromosomes. PMID:8152922

  6. Amino acid sequence of two neurotoxins from the venom of the Egyptian black snake (Walterinnesia aegyptia).

    PubMed

    Samejima, Y; Aoki-Tomomatsu, Y; Yanagisawa, M; Mebs, D

    1997-02-01

    The venom of the Egyptian black snake Walterinnesia aegyptia contains at least three toxins, which act postsynaptically to block the neuromuscular transmission of isolated rat phrenic nerve-diaphragm and chicken biventer cervicis muscle. The complete amino acid sequence of the two toxins, W-III and W-IV, consisting of 62 amino acid residues, was elucidated by Edman degradation of fragments obtained after Staphylococcus aureus protease and prolylpeptidase digestion. Although the toxins exhibit close structural homology to other short-chain postsynaptic neurotoxins from Elapidae venoms, toxin IV is unique by having a free SH-group (cysteine) at position 16. In position 35 of W-III, which is located at the tip of the central loop, threonine is replaced by lysine, which may alter the interaction of the toxin with the acetylcholine receptor, since the toxin is seven times less lethal than toxin W-IV.

  7. Complete genome sequence of Lactococcus lactis IO-1, a lactic acid bacterium that utilizes xylose and produces high levels of L-lactic acid.

    PubMed

    Kato, Hiroaki; Shiwa, Yuh; Oshima, Kenshiro; Machii, Miki; Araya-Kojima, Tomoko; Zendo, Takeshi; Shimizu-Kadota, Mariko; Hattori, Masahira; Sonomoto, Kenji; Yoshikawa, Hirofumi

    2012-04-01

    We report the complete genome sequence of Lactococcus lactis IO-1 (= JCM7638). It is a nondairy lactic acid bacterium, produces nisin Z, ferments xylose, and produces predominantly L-lactic acid at high xylose concentrations. From ortholog analysis with other five L. lactis strains, IO-1 was identified as L. lactis subsp. lactis.

  8. Complete genome sequence of Bacillus amyloliquefaciens LL3, which exhibits glutamic acid-independent production of poly-γ-glutamic acid.

    PubMed

    Geng, Weitao; Cao, Mingfeng; Song, Cunjiang; Xie, Hui; Liu, Li; Yang, Chao; Feng, Jun; Zhang, Wei; Jin, Yinghong; Du, Yang; Wang, Shufang

    2011-07-01

    Bacillus amyloliquefaciens is one of most prevalent Gram-positive aerobic spore-forming bacteria with the ability to synthesize polysaccharides and polypeptides. Here, we report the complete genome sequence of B. amyloliquefaciens LL3, which was isolated from fermented food and presents the glutamic acid-independent production of poly-γ-glutamic acid.

  9. Formation Sequences of Iron Minerals in the Acidic Alteration Products and Variation of Hydrothermal Fluid Conditions

    NASA Astrophysics Data System (ADS)

    Isobe, H.; Yoshizawa, M.

    2008-12-01

    Iron minerals have important role in environmental issues not only on the Earth but also other terrestrial planets. Iron mineral species related to alteration products of primary minerals with surface or subsurface fluids are characterized by temperature, acidity and redox conditions of the fluids. We can see various iron- bearing alteration products in alteration products around fumaroles in geothermal/volcanic areas. In this study, zonal structures of iron minerals in alteration products of the geothermal area are observed to elucidate temporal and spatial variation of hydrothermal fluids. Alteration of the pyroxene-amphibole andesite of Garan-dake volcano, Oita, Japan occurs by the acidic hydrothermal fluid to form cristobalite leaching out elements other than Si. Hand specimens with unaltered or weakly altered core and cristobalite crust show various sequences of layers. XRD analysis revealed that the alteration degree is represented by abundance of cristobalite. Intermediately altered layers are characterized by occurrence including alunite, pyrite, kaolinite, goethite and hematite. A specimen with reddish brown core surrounded by cristobalite-rich white crust has brown colored layers at the boundary of core and the crust. Reddish core is characterized by occurrence of crystalline hematite by XRD. Another hand specimen has light gray core, which represents reduced conditions, and white cristobalite crust with light brown and reddish brown layers of ferric iron minerals between the core and the crust. On the other hand, hornblende crystals, typical ferrous iron-bearing mineral of the host rock, are well preserved in some samples with strongly decolorized cristobalite-rich groundmass. Hydrothermal alteration experiments of iron-rich basaltic material shows iron mineral species depend on acidity and temperature of the fluid. Oxidation states of the iron-bearing mineral species are strongly influenced by the acidity and redox conditions. Variations of alteration

  10. Design, synthesis, and characterization of a protein sequencing reagent yielding amino acid derivatives with enhanced detectability by mass spectrometry.

    PubMed Central

    Aebersold, R.; Bures, E. J.; Namchuk, M.; Goghari, M. H.; Shushan, B.; Covey, T. C.

    1992-01-01

    We report the design, chemical synthesis, and structural and functional characterization of a novel reagent for protein sequence analysis by the Edman degradation, yielding amino acid derivatives rapidly detectable at high sensitivity by ion-evaporation mass spectrometry. We demonstrate that the reagent 3-[4'(ethylene-N,N,N-trimethylamino)phenyl]-2-isothiocyanate is chemically stable and shows coupling and cyclization/cleavage yields comparable to phenylisothiocyanate, the standard reagent in chemical sequence analysis, under conditions typically encountered in manual or automated sequence analysis. Amino acid derivatives generated with this reagent were detectable by ion-evaporation mass spectrometry at the subfemtomole sensitivity level at a pace of one sample per minute. Furthermore, derivatives were identified by their mass, thus permitting the rapid and highly sensitive determination of the molecular nature of modified amino acids. Derivatives of amino acids with acidic, basic, polar, or hydrophobic side chains were reproducibly detectable at comparable sensitivities. The polar nature of the reagent required covalent immobilization of polypeptides prior to automated sequence analysis. This reagent, used in automated sequence analysis, has the potential for overcoming the limitations in sensitivity, speed, and the ability to characterize modified amino acid residues inherent in the chemical sequencing methods that are currently used. PMID:1304351

  11. Sequencing and phylogenetic analysis of neurotoxin gene from an environmental isolate of Clostridium sp.: comparison with other clostridial neurotoxins.

    PubMed

    Dixit, Aparna; Alam, Syed Imteyaz; Singh, Lokendra

    2006-07-01

    A Clostridium sp. isolated from intestine of decaying fish exhibited 99% sequence identity with C. tetani at 16S rRNA level. It produced a neurotoxin that was neutralized by botulinum antitoxin (A+B+E) as well as tetanus antitoxin. The gene fragments for light chain, C-terminal and N-terminal regions of the heavy chain of the toxin were amplified using three reported primer sets for tetanus neurotoxin (TeNT). The neurotoxin gene fragments were cloned in Escherichia coli and sequenced. The sequences obtained exhibited approximately 98, 99 and 98% sequence identity with reported gene sequences of TeNT/LC, TeNT/HC and TeNT/HN, respectively. The phylogenetic interrelationship between the neurotoxin gene of Clostridium sp. with previously reported gene sequences of Clostridium botulinum A to G and C. tetani was examined by analysis of differences in the nucleotide sequences. Six amino acids were substituted at four different positions in the light chain of neurotoxin from the isolate when compared with the reported closest sequence of TeNT. Of these, four were located in the beta15 motif at a solvent inaccessible, buried region of the protein molecule. One of these substitutions were on the solvent accessible surface residue of alpha1 motif, previously shown to have strong sequence conservation. A substitution of two amino acids observed in N-terminal region of heavy chain were buried residues, located in the beta21 and beta37 motifs showing variability in other related sequences. The C-terminal region responsible for binding to receptor was conserved, showing no changes in the amino acid sequence.

  12. Nonprotein Amino Acids from Spark Discharges and Their Comparison with the Murchison Meteorite Amino Acids

    PubMed Central

    Wolman, Yecheskel; Haverland, William J.; Miller, Stanley L.

    1972-01-01

    All the nonprotein amino acids found in the Murchison meteorite are products of the action of electric discharge on a mixture of methane, nitrogen, and water with traces of ammonia. These amino acids include α-amino-n-butyric acid, α-aminoisobutyric acid, norvaline, isovaline, pipecolic acid, β-alanine, β-amino-n-butyric acid, β-aminoisobutyric acid, γ-aminobutyric acid, sarcosine, N-ethylglycine, and N-methylalanine. In addition, norleucine, alloisoleucine, N-propylglycine, N-isopropylglycine, N-methyl-β-alanine, N-ethyl-β-alanine α,β-diaminopropionic acid, isoserine, α,γ-diaminobutyric acid, and α-hydroxy-γ-aminobutyric acid are produced by the electric discharge, but have not been found in the meteorite. PMID:16591973

  13. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production.

    PubMed

    Coulson, Thomas J D; Patten, Cheryl L

    2015-08-06

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences.

  14. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production

    PubMed Central

    Coulson, Thomas J. D.

    2015-01-01

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences. PMID:26251488

  15. Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis subsp. lactis TOMSC161, Isolated from a Nonscalded Curd Pressed Cheese

    PubMed Central

    Velly, H.; Abraham, A.-L.; Loux, V.; Delacroix-Buchet, A.; Fonseca, F.; Bouix, M.

    2014-01-01

    Lactococcus lactis is a lactic acid bacterium used in the production of many fermented foods, such as dairy products. Here, we report the genome sequence of L. lactis subsp. lactis TOMSC161, isolated from nonscalded curd pressed cheese. This genome sequence provides information in relation to dairy environment adaptation. PMID:25377704

  16. Deoxyribonucleic acid sequence of araBAD promoter mutants of Escherichia coli.

    PubMed

    Horwitz, A H; Morandi, C; Wilcox, G

    1980-05-01

    The controlling site region for the araBAD operon is defined, in part, by two classes of cis-acting constitutive mutations. The aralc mutations allow low-level constitutive expression of ara-BAD in the absence of the positive regulatory protein coded for by the araC gene, whereas the araXc mutations allow expression of araBAD in the absence of the cyclic adenosine monophosphate receptor protein. Six independently isolated aralc mutations and three independently isolated araXc mutations were cloned onto the plasmid pBR322 using in vitro recombinant deoxyribonucleic acid techniques and in vivo recombination between plasmid and chromosomal deoxyribonucleic acid. The location of these mutations was determined by deoxyribonucleic acid sequence analysis. All of the aralc mutations occurred at position -35 within the araBAD promoter (+1 = messenger ribonucleic acid start for araBAD) and resulted from an AT leads to GC transition. All of the araXc mutations occurred at position -10 within the araBAD promoter and resulted from a GC leads to AT transition. Models are presented to explain the mode of action of the aralc and araXc mutations.

  17. Paenungulata: a comparison of the hemoglobin sequences from elephant, hyrax, and manatee.

    PubMed

    Kleinschmidt, T; Czelusniak, J; Goodman, M; Braunitzer, G

    1986-09-01

    Inspection of the amino acid differences among hemoglobin sequences of a wide range of mammalian species suggested that at alpha 19, alpha 110, alpha 111, beta 23, beta 44, and beta 56, synapomorphies group manatee (Trichechus inungius, Sirenia), Indian and African elephant (Elephas maximus and Loxodonta africana, Proboscidea), and rock hyrax (Procavia habessinica, Hyracoidea) into a monophyletic clade. Results obtained by parsimony analysis provide evidence for this grouping--and thus support for the genealogical validity of Simpson's superorder Paenungulata, which contains as the extant orders Proboscidea, Sirenia, and Hyracoidea. All of the 39 most, or nearly most, parsimonious of 10,395 trees constructed from a tandemly combined alpha- and beta-hemoglobin sequence for 103 vertebrate species (of which 79 were mammals from 16 extant orders), depicted Paenungulata as one of the most anciently separated branches of Eutheria. It was found on examining thousands of alternative trees that to not group Proboscidea, Hyracoidea, and Sirenia in a monophyletic clade required at least four additional substitutions.

  18. In the TTF-1 homeodomain the contribution of several amino acids to DNA recognition depends on the bound sequence.

    PubMed Central

    Fabbro, D; Tell, G; Leonardi, A; Pellizzari, L; Pucillo, C; Lonigro, R; Formisano, S; Damante, G

    1996-01-01

    The thyroid transcription factor-1 homeodomain (TTF-1HD) shows a peculiar DNA binding specificity, preferentially recognizing sequences containing the 5'-CAAG-3' core motif. Most other homeodomains instead recognize sites containing the 5'-TAAT-3' core motif. Here, we show that TTF-1HD efficiently recognizes another sequence, called D1, devoid of the 5'-CAAG-3' core motif. Different experimental approaches indicate that TTF-1HD contacts the D1 sequence in a manner which is different to that used to interact with sequences containing the 5'-CAAG-3' core motif. The binding activities that mutants of TTF-1HD display with the D1 sequence or with the sequence containing the 5'-CAAG-3' core motif indicate that the role of several DNA-contacting amino acids is different. In particular, during recognition of the D1 sequence, backbone-interacting amino acids not relevant in binding to sequences containing the 5'-CAAG-3' core motif play an important role. In the TTF-1HD, therefore, the contribution of several amino acids to DNA recognition depends on the bound sequence. These data indicate that although a common bonding network exists in all of the HD/DNA complexes, peculiarities important for DNA recognition may occur in single cases. PMID:8811078

  19. Diversification of mitochondrial genome of Daphnia galeata (Cladocera, Crustacea): Comparison with phylogenetic consideration of the complete sequences of clones isolated from five lakes in Japan.

    PubMed

    Tokishita, Shin-Ichi; Shibuya, Hiroyuki; Kobayashi, Taku; Sakamoto, Masaki; Ha, Jin-Yong; Yokobori, Shin-Ichi; Yamagata, Hideo; Hanazato, Takayuki

    2017-02-17

    To characterize genetic diversity and gene flow among Daphnia galeata populations, the complete nucleotide (nt) sequences of the mitochondrial (mt) DNAs of D. galeata clones isolated from five lakes in Japan (Lakes Shirakaba, Suwa, Kizaki, Kasumigaura, and Biwa) were determined. Comparison of non-synonymous (amino acid altering) substitution rates with synonymous substitution rates of D. galeata mt protein-coding genes demonstrated that ATPase8 and COI genes were the most and least susceptible, respectively, to the evolutional forces selecting the aa substitutions. Several non-synonymous substitutions were found in ATPase8 and ATPase6 even in the comparison that no synonymous substitution was found. Comparison of the total number of nt variations among the mt DNAs suggested the phylogenetic relationship ((((Shirakaba/Suwa, Kizaki), Kasumigaura), Biwa), D. pulex). Maximum-likelihood analysis using the total nt sequences of mt protein-coding genes confirmed this relationship with bootstrap values higher than 98%. All the mtDNAs of the analyzed Japanese D. galeata clones contained a control region of essentially the same structure that is distinct from those of the previously reported European Daphnia species of the D. longispina complex. The two control regions of different structures spread among mtDNAs of the Japanese and European Daphnia species, respectively, probably after the divergence of the Japanese D. galeata under different selection pressures associated with their habitats.

  20. Molecular cloning, encoding sequence, and expression of vaccinia virus nucleic acid-dependent nucleoside triphosphatase gene.

    PubMed Central

    Rodriguez, J F; Kahn, J S; Esteban, M

    1986-01-01

    A rabbit poxvirus genomic library contained within the expression vector lambda gt11 was screened with polyclonal antiserum prepared against vaccinia virus nucleic acid-dependent nucleoside triphosphatase (NTPase)-I enzyme. Five positive phage clones containing from 0.72- to 2.5-kilobase-pair (kbp) inserts expressed a beta-galactosidase fusion protein that was reactive by immunoblotting with the NTPase-I antibody. Hybridization analysis allowed the location of this gene within the vaccinia HindIIID restriction fragment. From the known nucleotide sequence of the 16-kbp vaccinia HindIIID fragment, we identified a region that contains a 1896-base open reading frame coding for a 631-amino acid protein. Analysis of the complete sequence revealed a highly basic protein, with hydrophilic COOH and NH2 termini, various hydrophobic domains, and no significant homology to other known proteins. Translational studies demonstrate that NTPase-I belongs to a late class of viral genes. This protein is highly conserved among Orthopoxviruses. Images PMID:3025846

  1. The amino acid sequences and activities of synergistic hemolysins from Staphylococcus cohnii.

    PubMed

    Mak, Pawel; Maszewska, Agnieszka; Rozalska, Malgorzata

    2008-10-01

    Staphylococcus cohnii ssp. cohnii and S. cohnii ssp. urealyticus are a coagulase-negative staphylococci considered for a long time as unable to cause infections. This situation changed recently and pathogenic strains of these bacteria were isolated from hospital environments, patients and medical staff. Most of the isolated strains were resistant to many antibiotics. The present work describes isolation and characterization of several synergistic peptide hemolysins produced by these bacteria and acting as virulence factors responsible for hemolytic and cytotoxic activities. Amino acid sequences of respective hemolysins from S. cohnii ssp. cohnii (named as H1C, H2C and H3C) and S. cohnii ssp. urealyticus (H1U, H2U and H3U) were identical. Peptides H1 and H3 possessed significant amino acid homology to three synergistic hemolysins secreted by Staphylococcus lugdunensis and to putative antibacterial peptide produced by Staphylococcus saprophyticus ssp. saprophyticus. On the other hand, hemolysin H2 had a unique sequence. All isolated peptides lysed red cells from different mammalian species and exerted a cytotoxic effect on human fibroblasts.

  2. Complete amino acid sequence of a Lolium perenne (perennial rye grass) pollen allergen, Lol p II.

    PubMed

    Ansari, A A; Shenbagamurthi, P; Marsh, D G

    1989-07-05

    The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p II was determined by automated Edman degradation of the protein and selected fragments. Cleavage of the protein by enzymatic and chemical techniques established an unambiguous sequence for the protein. Lol p II contains 97 amino acid residues, with a calculated molecular weight of 10,882. The protein lacks cysteine and glutamine and shows no evidence of glycosylation. Theoretical predictions by Fraga's (Fraga, S. (1982) Can. J. Chem. 60, 2606-2610) and Hopp and Woods' (Hopp, T. P., and Woods, K. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 3824-3828) methods indicate the presence of four hydrophilic regions, which may contribute to sequential or parts of conformational B-cell epitopes. Analysis of amphipathic regions by Berzofsky's method indicates the presence of a highly amphipathic region, which may contain, or contribute to, an Ia/T-cell epitope. This latter segment of Lol p II was found to be highly homologous with an antibody-binding segment of the major rye allergen Lol p I and may explain why immune responsiveness to both the allergens is associated with HLA-DR3.

  3. The Sequence-Specific Cellular Uptake of Spherical Nucleic Acid Nanoparticle Conjugates

    PubMed Central

    Narayan, Suguna P.; Choi, Chung Hang J.; Hao, Liangliang; Calabrese, Colin M.; Auyeung, Evelyn; Zhang, Chuan; Goor, Olga J.G.M.

    2015-01-01

    We investigated the sequence-dependent cellular uptake of spherical nucleic acid nanoparticle conjugates (SNAs). This process occurs by interaction with class A scavenger receptors (SR-A) and caveolae-mediated endocytosis. It is known that linear poly(guanine) (poly G) is a natural ligand for SR-A, and it has been proposed that interaction of poly G with SR-A is dependent on the formation of G-quadruplexes. Since G-rich oligonucleotides are known to interact strongly with SR-A, we hypothesized that SNAs with higher G contents would be able to enter cells in larger amounts than SNAs composed of other nucleotides, and as such we measured cellular internalization of SNAs as a function of constituent oligonucleotide sequence. Indeed, SNAs with enriched G content show the highest cellular uptake. Using this hypothesis, we chemically conjugated a small molecule (camptothecin) with SNAs to create drug-SNA conjugates and observed that poly G SNAs deliver the most camptothecin to cells and have the highest cytotoxicity in cancer cells. Our data elucidate important design considerations for enhancing the intracellular delivery of spherical nucleic acids. PMID:26097111

  4. Partial amino acid sequences around sulfhydryl groups of soybean beta-amylase.

    PubMed

    Nomura, K; Mikami, B; Morita, Y

    1987-08-01

    Sulfhydryl (SH) groups of soybean beta-amylase were modified with 5-(iodoaceto-amidoethyl)aminonaphthalene-1-sulfonate (IAEDANS) and the SH-containing peptides exhibiting fluorescence were purified after chymotryptic digestion of the modified enzyme. The sequence analysis of the peptides derived from the modification of all SH groups in the denatured enzyme revealed the existence of six SH groups, in contrast to five reported previously. One of them was found to have extremely low reactivity toward SH-reagents without reduction. In the native state, IAEDANS reacted with 2 mol of SH groups per mol of the enzyme (SH1 and SH2) accompanied with inactivation of the enzyme owing to the modification of SH2 located near the active site of this enzyme. The selective modification of SH2 with IAEDANS was attained after the blocking of SH1 with 5,5'-dithiobis-(2-nitrobenzoic acid). The amino acid sequences of the peptides containing SH1 and SH2 were determined to be Cys-Ala-Asn-Pro-Gln and His-Gln-Cys-Gly-Gly-Asn-Val-Gly-Asp-Ile-Val-Asn-Ile-Pro-Ile-Pro-Gln-Trp, respectively.

  5. Genome Sequence of Lactobacillus rhamnosus Strain CASL, an Efficient l-Lactic Acid Producer from Cheap Substrate Cassava

    PubMed Central

    Yu, Bo; Su, Fei; Wang, Limin; Zhao, Bo; Qin, Jiayang; Ma, Cuiqing; Xu, Ping; Ma, Yanhe

    2011-01-01

    Lactobacillus rhamnosus is a type of probiotic bacteria with industrial potential for l-lactic acid production. We announce the draft genome sequence of L. rhamnosus CASL (2,855,156 bp with a G+C content of 46.6%), which is an efficient producer of l-lactic acid from cheap, nonfood substrate cassava with a high production titer. PMID:22123765

  6. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-03-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus.

  7. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed Central

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-01-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus. PMID:3355530

  8. Clostridium sticklandii, a specialist in amino acid degradation:revisiting its metabolism through its genome sequence

    PubMed Central

    2010-01-01

    Background Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabolic aspects such as the Stickland reaction, coenzyme-B12- and selenium-dependent reactions of amino acids. With the goal of revisiting its carbon, nitrogen, and energy metabolism, and comparing studies with other clostridia, its genome has been sequenced and analyzed. Results C. sticklandii is one of the best biochemically studied proteolytic clostridial species. Useful additional information has been obtained from the sequencing and annotation of its genome, which is presented in this paper. Besides, experimental procedures reveal that C. sticklandii degrades amino acids in a preferential and sequential way. The organism prefers threonine, arginine, serine, cysteine, proline, and glycine, whereas glutamate, aspartate and alanine are excreted. Energy conservation is primarily obtained by substrate-level phosphorylation in fermentative pathways. The reactions catalyzed by different ferredoxin oxidoreductases and the exergonic NADH-dependent reduction of crotonyl-CoA point to a possible chemiosmotic energy conservation via the Rnf complex. C. sticklandii possesses both the F-type and V-type ATPases. The discovery of an as yet unrecognized selenoprotein in the D-proline reductase operon suggests a more detailed mechanism for NADH-dependent D-proline reduction. A rather unusual metabolic feature is the presence of genes for all the enzymes involved in two different CO2-fixation pathways: C. sticklandii harbours both the glycine synthase/glycine reductase and the Wood-Ljungdahl pathways. This unusual pathway combination has retrospectively been observed in only four other sequenced microorganisms. Conclusions Analysis of the C. sticklandii genome and

  9. Amino acid sequence of neurotoxin III of the scorpion Androctonus austrialis Hector.

    PubMed

    Kopeyan, C; Martinez, G; Rochat, H

    1979-03-01

    The amino acid sequence of neurotoxin III, purified from the venom of the North African scorpion Androctonus australis Hector, has been determined by Edman degradation using a liquid-phase sequencer. Carboxypeptidase A hydrolyses confirmed not only the sequence of the five last residues but also the presence of a free alpha-carboxylic group at the C-terminus. Edman degradation was conducted on one hand with the Quadrol [N,N,N',N'-tetrakis(2-hydroxypropyl)ethylene diamine] program and S-alkylated protein before or after coupling with sulfophenylisothiocynate (the first 34 residues were thus identified), on the other hand on tryptic and chymotryptic peptides with a dimethylbenzylamine program (residues 1--23 and 31--34 were confirmed, the positions of residues 35-64 were established). Neurotoxin III was found to belong to the same group of scorpion toxins active on mammals as neurotoxin I purified from the same venom (50 homologous positions exist in the two proteins).

  10. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon

    SciTech Connect

    Yu, Jinghua ); Eng, J.; Yalow, R.S. City Univ. of New York, NY )

    1990-12-01

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled park insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report the authors describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. They demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in their immunoassay system is only a few percent of that of human insulin. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species.

  11. Purification, amino acid sequence and characterisation of kangaroo IGF-I.

    PubMed

    Yandell, C A; Francis, G L; Wheldrake, J F; Upton, Z

    1998-01-01

    Insulin-like growth factor-I (IGF-I) and IGF-II have been purified to homogeneity from kangaroo (Macropus fuliginosus) serum, thus this represents the first report of the purification, sequencing and characterisation of marsupial IGFs. N-Terminal protein sequencing reveals that there are six amino acid differences between kangaroo and human IGF-I. Kangaroo IGF-II has been partially sequenced and no differences were found between human and kangaroo IGF-II in the 53 residues identified. Thus the IGFs appear to be remarkably structurally conserved during mammalian radiation. In addition, in vitro characterisation of kangaroo IGF-I demonstrated that the functional properties of human, kangaroo and chicken IGF-I are very similar. In an assay measuring the ability of the proteins to stimulate protein synthesis in rat L6 myoblasts, all IGF-I proteins were found to be equally potent. The ability of all three proteins to compete for binding with radiolabelled human IGF-I to type-1 IGF receptors in L6 myoblasts and in Sminthopsis crassicaudata transformed lung fibroblasts, a marsupial cell line, was comparable. Furthermore, kangaroo and human IGF-I react equally in a human IGF-I RIA using a human reference standard, radiolabelled human IGF-I and a polyclonal antibody raised against recombinant human IGF-I. This study indicates that not only is the primary structure of eutherian and metatherian IGF-I conserved, but also the proteins appear to be functionally similar.

  12. The ABRF Edman Sequencing Research Group 2008 Study: Investigation into Homopolymeric Amino Acid N-Terminal Sequence Tags and Their Effects on Automated Edman Degradation

    PubMed Central

    Thoma, R. S.; Smith, J. S.; Sandoval, W.; Leone, J. W.; Hunziker, P.; Hampton, B.; Linse, K. D.; Denslow, N. D.

    2009-01-01

    The Edman Sequence Research Group (ESRG) of the Association of Biomolecular Resource designs and executes interlaboratory studies investigating the use of automated Edman degradation for protein and peptide analysis. In 2008, the ESRG enlisted the help of core sequencing facilities to investigate the effects of a repeating amino acid tag at the N-terminus of a protein. Commonly, to facilitate protein purification, an affinity tag containing a polyhistidine sequence is conjugated to the N-terminus of the protein. After expression, polyhistidine-tagged protein is readily purified via chelation with an immobilized metal affinity resin. The addition of the polyhistidine tag presents unique challenges for the determination of protein identity using Edman degradation chemistry. Participating laboratories were asked to sequence one protein engineered in three configurations: with an N-terminal polyhistidine tag; with an N-terminal polyalanine tag; or with no tag. Study participants were asked to return a data file containing the uncorrected amino acid picomole yields for the first 17 cycles. Initial and repetitive yield (R.Y.) information and the amount of lag were evaluated. Information about instrumentation and sample treatment was also collected as part of the study. For this study, the majority of participating laboratories successfully called the amino acid sequence for 17 cycles for all three test proteins. In general, laboratories found it more difficult to call the sequence containing the polyhistidine tag. Lag was observed earlier and more consistently with the polyhistidine-tagged protein than the polyalanine-tagged protein. Histidine yields were significantly less than the alanine yields in the tag portion of each analysis. The polyhistidine and polyalanine protein-R.Y. calculations were found to be equivalent. These calculations showed that the nontagged portion from each protein was equivalent. The terminal histidines from the tagged portion of the protein

  13. The amino acid sequence around the active-site cysteine and histidine residues, and the buried cysteine residue in ficin.

    PubMed

    Husain, S S; Lowe, G

    1970-04-01

    Ficin that had been prepared from the latex of Ficus glabrata by salt fractionation and chromatography on carboxymethylcellulose was completely and irreversibly inhibited with 1,3-dibromo[2-(14)C]acetone and then treated with N-(4-dimethylamino-3,5-dinitrophenyl)maleimide in 6m-guanidinium chloride. After reduction and carboxymethylation of the labelled protein, it was digested with trypsin and alpha-chymotrypsin. Two radioactive peptides and two coloured peptides were isolated chromatographically and their sequences determined. The radioactive peptides revealed the amino acid sequences around the active-site cysteine and histidine residues and showed a high degree of homology with the omino acid sequence around the active-site cysteine and histidine residues in papain. The coloured peptides allowed the amino acid sequence around the buried cysteine residue in ficin to be determined.

  14. The `heavy' subunit of the photosynthetic reaction centre from Rhodopseudomonas viridis: isolation of the gene, nucleotide and amino acid sequence

    PubMed Central

    Michel, H.; Weyer, K. A.; Gruenberg, H.; Lottspeich, F.

    1985-01-01

    The gene coding for the `heavy' subunit of the photosynthetic reaction centre from Rhodopseudomonas viridis was isolated in an expression vector. Expression of the heavy subunit in Escherichia coli was detected with antibodies raised against crystalline reaction centres. The entire subunit, and not a fusion protein, was expressed in E. coli. The protein coding region of the gene was sequenced and the amino acid sequence derived. Part of the amino acid sequence was confirmed by chemical sequence analysis of the protein. The heavy subunit consists of 258 amino acids and its mol. wt. is 28 345. It possesses one membrane-spanning α-helical segment, as was revealed by the concomitant X-ray structure analysis. ImagesFig. 1.Fig. 2. PMID:16453623

  15. Purification, amino acid sequence and immunological characterization of Ole e 6, a cysteine-enriched allergen from olive tree pollen.

    PubMed

    Batanero, E; Ledesma, A; Villalba, M; Rodríguez, R

    1997-06-30

    The Ole e 6 allergen from olive tree pollen has been isolated by combining gel permeation and reverse-phase chromatographies. It is a single and highly acidic (pI 4.2) polypeptide chain protein. Its NH2-terminal amino acid sequence has been determined by Edman degradation. Total RNA from the olive tree pollen was isolated, and a specific cDNA was amplified by the polymerase chain reaction using a degenerate oligonucleotide primer designed according to the NH2-terminal sequence of the protein. The nucleotide sequencing of the cDNA rendered an open reading frame encoding a 50 amino acid polypeptide chain, in which two sets of the sequential motif Cys-X3-Cys-X3-Cys are present. No sequence similarity has been found between this protein and other previously described polypeptides.

  16. The evolution of proteins from random amino acid sequences: II. Evidence from the statistical distributions of the lengths of modern protein sequences.

    PubMed

    White, S H

    1994-04-01

    This paper continues an examination of the hypothesis that modern proteins evolved from random heteropeptide sequences. In support of the hypothesis, White and Jacobs (1993, J Mol Evol 36:79-95) have shown that any sequence chosen randomly from a large collection of nonhomologous proteins has a 90% or better chance of having a lengthwise distribution of amino acids that is indistinguishable from the random expectation regardless of amino acid type. The goal of the present study was to investigate the possibility that the random-origin hypothesis could explain the lengths of modern protein sequences without invoking specific mechanisms such as gene duplication or exon splicing. The sets of sequences examined were taken from the 1989 PIR database and consisted of 1,792 "super-family" proteins selected to have little sequence identity, 623 E. coli sequences, and 398 human sequences. The length distributions of the proteins could be described with high significance by either of two closely related probability density functions: The gamma distribution with parameter 2 or the distribution for the sum of two exponential random independent variables. A simple theory for the distributions was developed which assumes that (1) protoprotein sequences had exponentially distributed random independent lengths, (2) the length dependence of protein stability determined which of these protoproteins could fold into compact primitive proteins and thereby attain the potential for biochemical activity, (3) the useful protein sequences were preserved by the primitive genome, and (4) the resulting distribution of sequence lengths is reflected by modern proteins. The theory successfully predicts the two observed distributions which can be distinguished by the functional form of the dependence of protein stability on length. The theory leads to three interesting conclusions. First, it predicts that a tetra-nucleotide was the signal for primitive translation termination. This prediction is

  17. A Comparison between Transcriptome Sequencing and 16S Metagenomics for Detection of Bacterial Pathogens in Wildlife

    PubMed Central

    Razzauti, Maria; Galan, Maxime; Bernard, Maria; Maman, Sarah; Klopp, Christophe; Charbonnel, Nathalie; Vayssier-Taussat, Muriel; Eloit, Marc; Cosson, Jean-François

    2015-01-01

    Background Rodents are major reservoirs of pathogens responsible for numerous zoonotic diseases in humans and livestock. Assessing their microbial diversity at both the individual and population level is crucial for monitoring endemic infections and revealing microbial association patterns within reservoirs. Recently, NGS approaches have been employed to characterize microbial communities of different ecosystems. Yet, their relative efficacy has not been assessed. Here, we compared two NGS approaches, RNA-Sequencing (RNA-Seq) and 16S-metagenomics, assessing their ability to survey neglected zoonotic bacteria in rodent populations. Methodology/Principal Findings We first extracted nucleic acids from the spleens of 190 voles collected in France. RNA extracts were pooled, randomly retro-transcribed, then RNA-Seq was performed using HiSeq. Assembled bacterial sequences were assigned to the closest taxon registered in GenBank. DNA extracts were analyzed via a 16S-metagenomics approach using two sequencers: the 454 GS-FLX and the MiSeq. The V4 region of the gene coding for 16S rRNA was amplified for each sample using barcoded universal primers. Amplicons were multiplexed and processed on the distinct sequencers. The resulting datasets were de-multiplexed, and each read was processed through a pipeline to be taxonomically classified using the Ribosomal Database Project. Altogether, 45 pathogenic bacterial genera were detected. The bacteria identified by RNA-Seq were comparable to those detected by 16S-metagenomics approach processed with MiSeq (16S-MiSeq). In contrast, 21 of these pathogens went unnoticed when the 16S-metagenomics approach was processed via 454-pyrosequencing (16S-454). In addition, the 16S-metagenomics approaches revealed a high level of coinfection in bank voles. Conclusions/Significance We concluded that RNA-Seq and 16S-MiSeq are equally sensitive in detecting bacteria. Although only the 16S-MiSeq method enabled identification of bacteria in each

  18. Differences in acid tolerance between Bifidobacterium breve BB8 and its acid-resistant derivative B. breve BB8dpH, revealed by RNA-sequencing and physiological analysis.

    PubMed

    Yang, Xu; Hang, Xiaomin; Tan, Jing; Yang, Hong

    2015-06-01

    Bifidobacteria are common inhabitants of the human gastrointestinal tract, and their application has increased dramatically in recent years due to their health-promoting effects. The ability of bifidobacteria to tolerate acidic environments is particularly important for their function as probiotics because they encounter such environments in food products and during passage through the gastrointestinal tract. In this study, we generated a derivative, Bifidobacterium breve BB8dpH, which displayed a stable, acid-resistant phenotype. To investigate the possible reasons for the higher acid tolerance of B. breve BB8dpH, as compared with its parental strain B. breve BB8, a combined transcriptome and physiological approach was used to characterize differences between the two strains. An analysis of the transcriptome by RNA-sequencing indicated that the expression of 121 genes was increased by more than 2-fold, while the expression of 146 genes was reduced more than 2-fold, in B. breve BB8dpH. Validation of the RNA-sequencing data using real-time quantitative PCR analysis demonstrated that the RNA-sequencing results were highly reliable. The comparison analysis, based on differentially expressed genes, suggested that the acid tolerance of B. breve BB8dpH was enhanced by regulating the expression of genes involved in carbohydrate transport and metabolism, energy production, synthesis of cell envelope components (peptidoglycan and exopolysaccharide), synthesis and transport of glutamate and glutamine, and histidine synthesis. Furthermore, an analysis of physiological data showed that B. breve BB8dpH displayed higher production of exopolysaccharide and lower H(+)-ATPase activity than B. breve BB8. The results presented here will improve our understanding of acid tolerance in bifidobacteria, and they will lead to the development of new strategies to enhance the acid tolerance of bifidobacterial strains.

  19. Fragmentation Characteristics of Deprotonated N-linked Glycopeptides: Influences of Amino Acid Composition and Sequence

    NASA Astrophysics Data System (ADS)

    Nishikaze, Takashi; Kawabata, Shin-ichirou; Tanaka, Koichi

    2014-06-01

    Glycopeptide structural analysis using tandem mass spectrometry is becoming a common approach for elucidating site-specific N-glycosylation. The analysis is generally performed in positive-ion mode. Therefore, fragmentation of protonated glycopeptides has been extensively investigated; however, few studies are available on deprotonated glycopeptides, despite the usefulness of negative-ion mode analysis in detecting glycopeptide signals. Here, large sets of glycopeptides derived from well-characterized glycoproteins were investigated to understand the fragmentation behavior of deprotonated N-linked glycopeptides under low-energy collision-induced dissociation (CID) conditions. The fragment ion species were found to be significantly variable depending on their amino acid sequence and could be classified into three types: (i) glycan fragment ions, (ii) glycan-lost fragment ions and their secondary cleavage products, and (iii) fragment ions with intact glycan moiety. The CID spectra of glycopeptides having a short peptide sequence were dominated by type (i) glycan fragments (e.g., 2,4AR, 2,4AR-1, D, and E ions). These fragments define detailed structural features of the glycan moiety such as branching. For glycopeptides with medium or long peptide sequences, the major fragments were type (ii) ions (e.g., [peptide + 0,2X0-H]- and [peptide-NH3-H]-). The appearance of type (iii) ions strongly depended on the peptide sequence, and especially on the presence of Asp, Asn, and Glu. When a glycosylated Asn is located on the C-terminus, an interesting fragment having an Asn residue with intact glycan moiety, [glycan + Asn-36]-, was abundantly formed. Observed fragments are reasonably explained by a combination of existing fragmentation rules suggested for N-glycans and peptides.

  20. Microbial analysis of bite marks by sequence comparison of streptococcal DNA.

    PubMed

    Kennedy, Darnell M; Stanton, Jo-Ann L; García, José A; Mason, Chris; Rand, Christy J; Kieser, Jules A; Tompkins, Geoffrey R

    2012-01-01

    Bite mark injuries often feature in violent crimes. Conventional morphometric methods for the forensic analysis of bite marks involve elements of subjective interpretation that threaten the credibility of this field. Human DNA recovered from bite marks has the highest evidentiary value, however recovery can be compromised by salivary components. This study assessed the feasibility of matching bacterial DNA sequences amplified from experimental bite marks to those obtained from the teeth responsible, with the aim of evaluating the capability of three genomic regions of streptococcal DNA to discriminate between participant samples. Bite mark and teeth swabs were collected from 16 participants. Bacterial DNA was extracted to provide the template for PCR primers specific for streptococcal 16S ribosomal RNA (16S rRNA) gene, 16S-23S intergenic spacer (ITS) and RNA polymerase beta subunit (rpoB). High throughput sequencing (GS FLX 454), followed by stringent quality filtering, generated reads from bite marks for comparison to those generated from teeth samples. For all three regions, the greatest overlaps of identical reads were between bite mark samples and the corresponding teeth samples. The average proportions of reads identical between bite mark and corresponding teeth samples were 0.31, 0.41 and 0.31, and for non-corresponding samples were 0.11, 0.20 and 0.016, for 16S rRNA, ITS and rpoB, respectively. The probabilities of correctly distinguishing matching and non-matching teeth samples were 0.92 for ITS, 0.99 for 16S rRNA and 1.0 for rpoB. These findings strongly support the tenet that bacterial DNA amplified from bite marks and teeth can provide corroborating information in the identification of assailants.

  1. Purification and partial amino acid sequence of the chloroplast cytochrome b-559.

    PubMed

    Widger, W R; Cramer, W A; Hermodson, M; Meyer, D; Gullifor, M

    1984-03-25

    The hydrophobic cytochrome b-559, purified from unstacked, ethanol-washed spinach thylakoid membranes, using extraction with 2% Triton X-100 in 4 M urea and three chromatographic steps in the presence of protease inhibitors, has a dominant band on sodium dodecyl sulfate-urea gels corresponding to Mr = 10,000. The yield of this preparation is 30-50% (5-10 mg) starting with 600 mg of chlorophyll. The heme content yields a calculated molecular weight of no more than 17,500/heme, and perhaps somewhat smaller after correction for impurities. The Mr = 10,000 band is stained by the tetramethylbenzidine-H2O2 heme reagent on lithium dodecyl sulfate gels run at 0 degrees C. The Mr = 10,000 protein, further separated by high performance liquid chromatography, contains a unique NH2 terminus that is not blocked, and the amino acid sequence for the first 27 residues is NH2-Ser-Gly-Ser-Thr-Gly-Glu-Arg-Ser-Phe-Ala-Asp-Ile-Ile-Thr-Ser-Ile-Arg-Tyr-Trp -Val-Ile-X-Ser-Ile-Thr-Ile-Pro. . . COOH. Approximately 55% of the amino acids are hydrophobic, based on amino acid analysis of the Mr = 10,000 peptide, which also indicated the presence of at least one histidine. Only one cytochrome b-559 component could be identified, whose yield indicated that it arises from a single b-559 protein in chloroplasts corresponding to the in situ high potential cytochrome of the chloroplast photosystem II.

  2. Complete Genome Sequence of the Mosquitocidal Bacterium Bacillus sphaericus C3-41 and Comparison with Those of Closely Related Bacillus Species▿ †

    PubMed Central

    Hu, Xiaomin; Fan, Wei; Han, Bei; Liu, Haizhou; Zheng, Dasheng; Li, Qibin; Dong, Wei; Yan, Jianping; Gao, Meiying; Berry, Colin; Yuan, Zhiming

    2008-01-01

    Bacillus sphaericus strain C3-41 is an aerobic, mesophilic, spore-forming bacterium that has been used with great success in mosquito control programs worldwide. Genome sequencing revealed that the complete genome of this entomopathogenic bacterium is composed of a chromosomal replicon of 4,639,821 bp and a plasmid replicon of 177,642 bp, containing 4,786 and 186 potential protein-coding sequences, respectively. Comparison of the genome with other published sequences indicated that the B. sphaericus C3-41 chromosome is most similar to that of Bacillus sp. strain NRRL B-14905, a marine species that, like B. sphaericus, is unable to metabolize polysaccharides. The lack of key enzymes and sugar transport systems in the two bacteria appears to be the main reason for this inability, and the abundance of proteolytic enzymes and transport systems may endow these bacteria with exclusive metabolic pathways for a wide variety of organic compounds and amino acids. The genes shared between B. sphaericus C3-41 and Bacillus sp. strain NRRL B-14905, including mobile genetic elements, membrane-associated proteins, and transport systems, demonstrated that these two species are a biologically and phylogenetically divergent group. Knowledge of the genome sequence of B. sphaericus C3-41 thus increases our understanding of the bacilli and may also offer prospects for future genetic improvement of this important biological control agent. PMID:18296527

  3. Evaluation of global sequence comparison and one-to-one FASTA local alignment in regulatory allergenicity assessment of transgenic proteins in food crops.

    PubMed

    Song, Ping; Herman, Rod A; Kumpatla, Siva

    2014-09-01

    To address the high false positive rate using >35% identity over 80 amino acids in the regulatory assessment of transgenic proteins for potential allergenicity and the change of E-value with database size, the Needleman-Wunsch global sequence alignment and a one-to-one (1:1) local FASTA search (one protein in the target database at a time) using FASTA were evaluated by comparing proteins randomly selected from Arabidopsis, rice, corn, and soybean with known allergens in a peer-reviewed allergen database (http://www.allergenonline.org/). Compared with the approach of searching >35%/80aa+, the false positive rate measured by specificity rate for identification of true allergens was reduced by a 1:1 global sequence alignment with a cut-off threshold of ≧30% identity and a 1:1 FASTA local alignment with a cut-off E-value of ≦1.0E-09 while maintaining the same sensitivity. Hence, a 1:1 sequence comparison, especially using the FASTA local alignment tool with a biological relevant E-value of 1.0E-09 as a threshold, is recommended for the regulatory assessment of sequence identities between transgenic proteins in food crops and known allergens.

  4. Comparison of natural adsorbents for metal removal from acidic effluent.

    PubMed

    Blais, J F; Shen, S; Meunier, N; Tyagi, R D

    2003-02-01

    Adsorption tests were carried out in acidic synthetic solutions (pH 2.0) using 20 g l(-1) of various natural adsorbents and 0.25 mM of 11 different metals. In decreasing order, the most efficient adsorbents tested were: oyster shells, cedar bark, vermiculite, cocoa shells and peanut shells. In contrast, weak metal adsorption was demonstrated by: red cedar wood, peat moss, pine wood, corn cobs and perlite. Metal adsorption capacities in acidic synthetic solution followed the order: Pb2+> Cr3+> Cu2+> Fe2+> Al3+> Ni2+> Cd2+ > Mn2+ > Zn2+ > Ca2+, Mg2+. Alkaline treatment (0.75 M NaOH) increased the effectiveness of metal removal for the majority of adsorbents. In contrast, acid treatment (0.75 M H2SO4) either reduced or did not affect the adsorption capacity of the materials tested. Finally, oyster shells, red cedar wood, vermiculite, cocoa shells and peanut shells, were effective natural adsorbents for the selective recovery of lead and trivalent chromium from acidic effluent.

  5. Complexation of NpO2+ with N-methyl-iminodiacetic Acid: in Comparison with Iminodiacetic and Dipicolinic Acids

    SciTech Connect

    Tian, Guoxin; Rao, Linfeng

    2010-10-01

    Complexation of Np(V) with N-methyl-iminodiacetic acid (MIDA) in 1 M NaClO{sub 4} solution was studied with multiple techniques including potentiometry, spectrophotometry, and microcalorimetry. The 1:2 complex, NpO{sub 2}(MIDA){sub 2}{sup 3-} was identified for the first time in aqueous solution. The correlation between its optical absorption properties and symmetry was discussed, in comparison with Np(V) complexes with two structurally related nitrilo-dicarboxylic acids, iminodiacetic acid (IDA) and dipicolinic acid (DPA). The order of the binding strength (DPA > MIDA > IDA) is explained by the difference in the structural and electronic properties of the ligands. In general, the nitrilo-dicarboxylates form stronger complexes with Np(V) than oxy-dicarboxylates due to a much more favorable enthalpy of complexation.

  6. Comparison study between fasting total serum bile acid and post prandial bile acid in hepatic diseases: a preliminary study.

    PubMed

    Boonyapisit, S; Lekhakula, S; Amornkittichareon, B; Shumnumsirivath, D

    1994-01-01

    Fasting bile acid, two-hour post prandial bile acid and other liver function tests (Bili, AST, ALT, ALB, Glob, ALP) were measured in 22 normal and 28 liver diseased patients. In normal volunteers, the mean value of fasting total serum bile acid (FTBA) and postprandial serum bile acid (PTBA) were 3.08 mumole/L (S.D. 1.65) range 0.21-6.26 mumol/L, and 8.07 mumole/L (S.D. 2.99) range 4.06-15.65 mumole/L. Comparison between FTBA, PTBA and other liver function tests in various liver diseases from this study the PTBA was not statistically significant superior to FTBA. Therefore, it is not necessary to do the PTBA at this time until more data is available.

  7. A structural and functional comparison of nematode and crustacean PDH-like sequences.

    PubMed

    Meelkop, E; Marco, H G; Janssen, T; Temmerman, L; Vanhove, M P M; Schoofs, L

    2012-03-01

    The elucidation of the whole genome of the nematode Caenorhabditis elegans allowed for the identification of ortholog genes belonging to the pigment dispersing hormone/factor (PDH/PDF) peptide family. Members of this peptide family are known from crustaceans, insects and nematodes and seem to exist exclusively in ecdysozoans where they play a role in different processes, ranging from the dispersion of integumental and eye (retinal) pigments in decapod crustaceans to circadian rhythms in insects and locomotion in C. elegans. Two pdf genes (pdf-1 and pdf-2) encoding three different peptides: PDF-1a, PDF-1b and PDF-2 have been identified in C. elegans. These three C. elegans PDH-like peptides are similar but not identical in primary structure to PDHs from decapod crustaceans. We investigate whether this divergence has an influence on the pigment dispersing function of the peptides in a decapod crustacean, namely the shrimp Palaemon pacificus. We show that C. elegans PDF-1a and b peptides display cross-functional activity by dispersing pigments in the epithelium of P. pacificus at physiological doses. Moreover, by means of a comparative amino acid sequence analysis of nematode and crustacean PDH-like peptides, we can pinpoint several potentially important residues for eliciting pigment dispersing activity in decapod crustaceans. Although there is no sequence information on a receptor for PDH in decapod crustaceans, we postulate that there is general conservation of the PDH/PDF signaling system based on structural similarities of precursor proteins and receptors (including those from a branchiopod crustacean and from C. elegans).

  8. [Comparison research on two-stage sequencing batch MBR and one-stage MBR].

    PubMed

    Yuan, Xin-Yan; Shen, Heng-Gen; Sun, Lei; Wang, Lin; Li, Shi-Feng

    2011-01-01

    Aiming at resolving problems in MBR operation, like low nitrogen and phosphorous removal efficiency, severe membrane fouling and etc, comparison research on two-stage sequencing batch MBR (TSBMBR) and one-stage aerobic MBR has been done in this paper. The results indicated that TSBMBR owned advantages of SBR in removing nitrogen and phosphorous, which could make up the deficiency of traditional one-stage aerobic MBR in nitrogen and phosphorous removal. During steady operation period, effluent average NH4(+) -N, TN and TP concentration is 2.83, 12.20, 0.42 mg/L, which could reach domestic scenic environment use. From membrane fouling control point of view, TSBMBR has lower SMP in supernatant, specific trans-membrane flux deduction rate, membrane fouling resistant than one-stage aerobic MBR. The sedimentation and gel layer resistant of TSBMBR was only 6.5% and 33.12% of one-stage aerobic MBR. Besides high efficiency in removing nitrogen and phosphorous, TSBMBR could effectively reduce sedimentation and gel layer pollution on membrane surface. Comparing with one-stage MBR, TSBMBR could operate with higher trans-membrane flux, lower membrane fouling rate and better pollutants removal effects.

  9. Removal of typical endocrine disrupting chemicals by membrane bioreactor: in comparison with sequencing batch reactor.

    PubMed

    Zhou, Yingjun; Huang, Xia; Zhou, Haidong; Chen, Jianhua; Xue, Wenchao

    2011-01-01

    The removal of endocrine disrupting chemicals (EDCs) by a laboratory-scale membrane bioreactor (MBR) fed with synthetic sewage was evaluated and moreover, compared with that by a sequencing batch reactor (SBR) operated under same conditions in parallel. Eight kinds of typical EDCs, including 17β-estradiol (E2), estrone (E1), estriol (E3), 17α-ethynilestradiol (EE2), 4-octylphenol (4-OP), 4-nonylphenol (4-NP), bisphenol A (BPA) and nonylphenol ethoxylates (NPnEO), were spiked into the feed. Their concentrations in influent, effluent and supernatant were determined by gas chromatography-mass spectrometry method. The overall estrogenecity was evaluated as 17β-estradiol equivalent quantity (EEQ), determined via yeast estrogen screen (YES) assay. E2, E3, BPA and 4-OP were well removed by both MBR and SBR, with removal rates more than 95% and no significant differences between the two reactors. However, with regard to the other four EDCs, of which the removal rates were lower, MBR performed better. Comparison between supernatant and effluent of the two reactors indicated that membrane separation of sludge and effluent, compared with sedimentation, can relatively improve elimination of target EDCs and total estrogenecity. By applying different solids retention times (SRTs) (5, 10, 20 and 40 d) to the MBR, 10 and 5 d were found to be the lower critical SRTs for efficient target EDCs and EEQ removal, respectively.

  10. Comparison of Two Massively Parallel Sequencing Platforms using 83 Single Nucleotide Polymorphisms for Human Identification.

    PubMed

    Apaga, Dame Loveliness T; Dennis, Sheila E; Salvador, Jazelyn M; Calacal, Gayvelline C; De Ungria, Maria Corazon A

    2017-03-24

    The potential of Massively Parallel Sequencing (MPS) technology to vastly expand the capabilities of human identification led to the emergence of different MPS platforms that use forensically relevant genetic markers. Two of the MPS platforms that are currently available are the MiSeq(®) FGx™ Forensic Genomics System (Illumina) and the HID-Ion Personal Genome Machine (PGM)™ (Thermo Fisher Scientific). These are coupled with the ForenSeq™ DNA Signature Prep kit (Illumina) and the HID-Ion AmpliSeq™ Identity Panel (Thermo Fisher Scientific), respectively. In this study, we compared the genotyping performance of the two MPS systems based on 83 SNP markers that are present in both MPS marker panels. Results show that MiSeq(®) FGx™ has greater sample-to-sample variation than the HID-Ion PGM™ in terms of read counts for all the 83 SNP markers. Allele coverage ratio (ACR) values show generally balanced heterozygous reads for both platforms. Two and four SNP markers from the MiSeq(®) FGx™ and HID-Ion PGM™, respectively, have average ACR values lower than the recommended value of 0.67. Comparison of genotype calls showed 99.7% concordance between the two platforms.

  11. Real-Time Nucleic Acid Sequence-Based Amplification Assay for Detection of Hepatitis A Virus

    PubMed Central

    Abd El Galil, Khaled H.; El Sokkary, M. A.; Kheira, S. M.; Salazar, Andre M.; Yates, Marylynn V.; Chen, Wilfred; Mulchandani, Ashok

    2005-01-01

    A nucleic acid sequence-based amplification (NASBA) assay in combination with a molecular beacon was developed for the real-time detection and quantification of hepatitis A virus (HAV). A 202-bp, highly conserved 5′ noncoding region of HAV was targeted. The sensitivity of the real-time NASBA assay was tested with 10-fold dilutions of viral RNA, and a detection limit of 1 PFU was obtained. The specificity of the assay was demonstrated by testing with other environmental pathogens and indicator microorganisms, with only HAV positively identified. When combined with immunomagnetic separation, the NASBA assay successfully detected as few as 10 PFU from seeded lake water samples. Due to its isothermal nature, its speed, and its similar sensitivity compared to the real-time RT-PCR assay, this newly reported real-time NASBA method will have broad applications for the rapid detection of HAV in contaminated food or water. PMID:16269748

  12. Evolutionary connections of biological kingdoms based on protein and nucleic acid sequence evidence

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.

    1983-01-01

    Prokaryotic and eukaryotic evolutionary trees are developed from protein and nucleic-acid sequences by the methods of numerical taxonomy. Trees are presented for bacterial ferredoxins, 5S ribosomal RNA, c-type cytochromes , cytochromes c2 and c', and 5.8S ribosomal RNA; the implications for early evolution are discussed; and a composite tree showing the branching of the anaerobes, aerobes, archaebacteria, and eukaryotes is shown. Single lines are found for all oxygen-evolving photosynthetic forms and for the salt-loving and high-temperature forms of archaebacteria. It is argued that the eukaryote mitochondria, chloroplasts, and cytoplasmic host material are descended from free-living prokaryotes that formed symbiotic associations, with more than one symbiotic event involved in the evolution of each organelle.

  13. Sequence-defined shuttles for targeted nucleic acid and protein delivery.

    PubMed

    Röder, Ruth; Wagner, Ernst

    2014-01-01

    Molecular medicine opens into a space of novel specific therapeutic agents: intracellularly active drugs such as peptides, proteins or nucleic acids, which are not able to cross cell membranes and enter the intracellular space on their own. Through the development of cell-targeted shuttles for specific delivery, this restriction in delivery has the potential to be converted into an advantage. On the one hand, due to the multiple extra- and intracellular barriers, such carrier systems need to be multifunctional. On the other hand, they must be precise and reproducibly manufactured due to pharmaceutical reasons. Here we review the design of precise sequence-defined delivery carriers, including solid-phase synthesized peptides and nonpeptidic oligomers, or nucleotide-based carriers such as aptamers and origami nanoboxes.

  14. Identification of amino acid sequences in the polyomavirus capsid proteins that serve as nuclear localization signals

    NASA Technical Reports Server (NTRS)

    Chang, D.; Haynes, J. I. Jr; Brady, J. N.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)

    1993-01-01

    The molecular mechanism participating in the transport of newly synthesized proteins from the cytoplasm to the nucleus in mammalian cells is poorly understood. Recently, the nuclear localization signal sequences (NLS) of many nuclear proteins have been identified, and most have been found to be composed of a highly basic amino acid stretch. A genetic "subtractive" and a biochemical "additive" approach were used in our studies to identify the NLS's of the polyomavirus structural capsid proteins. An NLS was identified at the N-terminus (Ala1-Pro-Lys-Arg-Lys-Ser-Gly-Val-Ser-Lys-Cys11) of the major capsid protein VP1 and at the C-terminus (Glu307 -Glu-Asp-Gly-Pro-Glu-Lys-Lys-Lys-Arg-Arg-Leu318) of the VP2/VP3 minor capsid proteins.

  15. The amino acid sequence of a carbohydrate-containing fragment of hen ovotransferrin.

    PubMed Central

    Kingston, I B; Williams, J

    1975-01-01

    1. Hen ovotransferrin was treated with CNBr and fractionated by gel filtration. 2. After further treatment by reduction and carboxymethylation a carbohydrate-containing fragment of molecular weight 11990 was obtained (fragment BCd). 3. The amino acid sequence of this fragment was determined. It consists of a single chain of 94 residues. 4. The structure of a tryptic glycopeptide derived from whole ovotransferrin permitted a further eight residues to be assigned at the N-terminus of fragment BCd. 5. Heterogeneity was found at two positions. 6. Further evidence has been deposited as Supplementary Publication SUP 50045 (19 pages) at the British Library (Lending Division), Boston Spa, Wetherby, W. Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1975), 145, 5. PMID:1172663

  16. Molecular cloning and sequence analysis of the Sta58 major antigen gene of Rickettsia tsutsugamushi: sequence homology and antigenic comparison of Sta58 to the 60-kilodalton family of stress proteins.

    PubMed Central

    Stover, C K; Marana, D P; Dasch, G A; Oaks, E V

    1990-01-01

    The scrub typhus 58-kilodalton (kDa) antigen (Sta58) of Rickettsia tsutsugamushi is a major protein antigen often recognized by humans infected with scrub typhus rickettsiae. A 2.9-kilobase HindIII fragment containing a complete sta58 gene was cloned in Escherichia coli and found to express the entire Sta58 antigen and a smaller protein with an apparent molecular mass of 11 kDa (Stp11). DNA sequence analysis of the 2.9-kilobase HindIII fragment revealed two adjacent open reading frames encoding proteins of 11 (Stp11) and 60 (Sta58) kDa. Comparisons of deduced amino acid sequences disclosed a high degree of homology between the R. tsutsugamushi proteins Stp11 and Sta58 and the E. coli proteins GroES and GroEL, respectively, and the family of primordial heat shock proteins designated Hsp10 Hsp60. Although the sequence homology between the Sta58 antigen and the Hsp60 protein family is striking, the Sta58 protein appeared to be antigenically distinct among a sample of other bacterial Hsp60 homologs, including the typhus group of rickettsiae. The antigenic uniqueness of the Sta58 antigen indicates that this protein may be a potentially protective antigen and a useful diagnostic reagent for scrub typhus fever. Images PMID:2108930

  17. Fatty acid mobilization and comparison to milk fatty acid content in northern elephant seals.

    PubMed

    Fowler, Melinda A; Debier, Cathy; Mignolet, Eric; Linard, Clementine; Crocker, Daniel E; Costa, Daniel P

    2014-01-01

    A fundamental feature of the life history of true seals, bears and baleen whales is lactation while fasting. This study examined the mobilization of fatty acids from blubber and their subsequent partitioning into maternal metabolism and milk production in northern elephant seals (Mirounga angustirostris). The fatty acid composition of blubber and milk was measured in both early and late lactation. Proportions of fatty acids in milk and blubber were found to display a high degree of similarity both early and late in lactation. Seals mobilized an enormous amount of lipid (~66 kg in 17 days), but thermoregulatory fatty acids, those that remain fluid at low temperatures, were relatively conserved in the outer blubber layer. Despite the stratification, the pattern of mobilization of specific fatty acids conforms to biochemical predictions. Long chain (>20C) monounsaturated fatty acids (MUFAs) were the least mobilized from blubber and the only class of fatty acids that showed a proportional increase in milk in late lactation. Polyunsaturated fatty acids (PUFAs) and saturated fatty acids (SFAs) were more mobilized from the blubber, but neither proportion increased in milk at late lactation. These data suggest that of the long chain MUFA mobilized, the majority is directed to milk synthesis. The mother may preferentially use PUFA and SFA for her own metabolism, decreasing the availability for deposition into milk. The potential impacts of milk fatty acid delivery on pup diving development and thermoregulation are exciting avenues for exploration.

  18. Phylogenetic analysis of beta-papillomaviruses as inferred from nucleotide and amino acid sequence data.

    PubMed

    Gottschling, Marc; Köhler, Anja; Stockfleth, Eggert; Nindl, Ingo

    2007-01-01

    Human papillomaviruses (HPV) of the beta-group seem to be involved in the pathogenesis of non-melanoma skin cancer. Papillomaviruses are host specific and are considered closely co-evolving with their hosts. Evolutionary incongruence between early genes and late genes has been reported among oncogenic genital alpha-papillomaviruses and considerably challenge phylogenetic reconstructions. We investigated the relationships of 29 beta-HPV (25 types plus four putative new types, subtypes, or variants) as inferred from codon aligned and amino acid sequence data of the genes E1, E2, E6, E7, L1, and L2 using likelihood, distance, and parsimony approaches. An analysis of a L1 fragment included additional nucleotide and amino acid sequences from seven non-human beta-papillomaviruses. Early genes and late genes evolution did not conflict significantly in beta-papillomaviruses based on partition homogeneity tests (p > or = 0.001). As inferred from the complete genome analyses, beta-papillomaviruses were monophyletic and segregated into four highly supported monophyletic assemblages corresponding to the species 1, 2, 3, and fused 4/5. They basically split into the species 1 and the remainder of beta-papillomaviruses, whose species 3, 4, and 5 constituted the sistergroup of species 2. beta-Papillomaviruses have been isolated from humans, apes, and monkeys, and phylogenetic analyses of the L1 fragment showed non-human papillomaviruses highly polyphyletic nesting within the HPV species. Thus, host and virus phylogenies were not congruent in beta-papillomaviruses, and multiple invasions across species borders may contribute (additionally to host-linked evolution) to their diversification.

  19. The amino acid sequence of Canada goose (Branta canadensis) and mute swan (Cygnus olor) hemoglobins. Two different species with identical beta-chains.

    PubMed

    Oberthür, W; Godovac-Zimmermann, J; Braunitzer, G; Wiesner, H

    1982-08-01

    The amino acid sequences of the alpha- and beta-chains from the major hemoglobin component (HbA) of Canada goose (Branta canadensis) and mute swan (Cygnus olor) are given. The alpha-chains are of the alpha A-type, since alpha D-type was expressed but only found in low concentrations. By homologous comparison, greylag goose hemoglobin (Anser anser) and Canada goose hemoglobin alpha-chains differ by two exchanges, and beta-chains by three exchanges. A valine substitution for threonine was found at position alpha 34 (B15). This exchange is a result of a two point mutation. Thus, there are three nucleotide mutations in alpha-chains, as in beta-chains. Substitutions in positions alpha 34 (B15) and beta 125 (H3) have modified intersubunit contacts (alpha 1 beta 1-contacts). A comparison of mute swan hemoglobin with greylag goose hemoglobin shows four exchanges in alpha-chains and three in beta-chains. Canada goose and mute swan have identical beta-chains, while alpha-chains differ in two amino acids. One of these exchanges is implicated in one of the alpha 1 beta 1-contact points (alpha 34) where isoleucine substitution for valine was found. Comparison of hemoglobins from different species in the same tribe (Anserini) shows a high homology between Canada goose and mute swan hemoglobins.

  20. Amino acid sequence homology between rat and human C-reactive protein.

    PubMed Central

    Taylor, J A; Bruton, C J; Anderson, J K; Mole, J E; De Beer, F C; Baltz, M L; Pepys, M B

    1984-01-01

    The rat serum protein that undergoes Ca2+-dependent binding to pneumococcal C-polysaccharide and to phosphocholine residues, and that is evidently a member of the pentraxin family of proteins by virtue of its appearance under the electron microscope, has been variously designated as rat C-reactive protein (CRP) [de Beer, Baltz, Munn, Feinstein, Taylor, Bruton, Clamp & Pepys (1982) Immunology 45, 55-70], 'phosphoryl choline-binding protein' [Nagpurkar & Mookerjea (1981) J. Biol. Chem. 256, 7440-7448] and rat serum amyloid P component (SAP) [Pontet, D'Asnieres, Gache, Escaig & Engler (1981) Biochim. Biophys. Acta 671, 202-210]. The partial amino acid sequence (45 residues) towards the C-terminus of this protein was determined, and it showed 71.7% identity with the known sequence of human CRP but only 54.3% identity with human SAP. Since human CRP and SAP are themselves approximately 50% homologous, the level of identity between the rat protein and human SAP is evidence only of membership of the pentraxin family. In contrast, the much greater resemblance to human CRP confirms that the rat C-polysaccharide-binding/phosphocholine-binding protein is in fact rat CRP. PMID:6477504

  1. Comparison and analysis of the complete nucleotide sequence of foot-and-mouth disease viruses from animals in Korea and other PanAsia strains.

    PubMed

    Oem, Jae Ku; Lee, Kwang Nyeong; Cho, In Soo; Kye, Soo Jeong; Park, Jong Hyeon; Joo, Yi Seok

    2004-08-01

    During the last 3 years, foot-and-mouth disease virus serotype O, named PanAsia, caused two outbreaks in the Republic of Korea. To determine if there was an obvious genetic relationship between the virus isolated in 2002 (O/SKR/2002) and the O/SKR/2000, and to further analyze the epidemiological relationships between the PanAsia viruses and the viruses identified in Korea, the complete nucleotide sequence of the O/SKR/2002 and the O/SKR/2000 were determined by automatic cycling sequencing and primer walking. The nucleotides and the deduced amino acid (aa) sequences of the strains identified in Korea were compared with each other and also those enrolled in the GenBank database. In comparison and analysis of the viruses identified in Korea, any deletions or insertions in the specific fragment gene of both the O/SKR/2002 and O/SKR/2000 were not identified. However, comparison of the aa sequence of the identified virus in 2002 from pigs with those of other PanAsia strains revealed significant substitutions of 4 aa in the VPI region and 8 aa in the 3A region. In phylogenetic analysis based on the translated region, the identified virus in 2002 appeared to be the divergence of approximately 1% degree with other PanAsia viruses. Also, animal experiments indicated that O/SKR/2000 is not host-restricted and develop the clinical signs in the main susceptible livestock species (cattle and pigs). However, O/SKR/2002 did not develop the clinical signs in cattle and showed severe clinical signs only in pigs. These analytic data suggest that 2002 outbreaks in Korea is not re-occurred but re-introduced from nowhere.

  2. Exon-intron organization and sequence comparison of human and murine T11 (CD2) genes

    SciTech Connect

    Diamond, D.J.; Clayton, L.K.; Sayre, P.H.; Reinherz, E.L.

    1988-03-01

    Genomic DNA clones containing the human and murine genes coding for the 50-kDa T11 (CD2) T-cell surface glycoprotein were characterized. The human T11 gene is approx. = 12 kilobases long and comprised of five exons. A leader exon (L) contains the 5'-untranslated region and most of the nucleotides defining the signal peptide (amino acids (aa) -24 to -5). Two exons encode the extracellular segment; exon Ex1 is 321 base pairs (bp) long and codes for four residues of the leader peptide and aa 1-103 of the mature protein, and exon Ex2 is 231 bp long and encodes aa 104-180. Exon TM is 123 bp long and codes for the single transmembrane region of the molecule (aa 181-221). Exon C is a large 765-bp exon encoding virtually the entire cytoplasmic domain (aa 222-327) and the 3'-untranslated region. The murine region T11 gene has a similar organization with exon-intron boundaries essentially identical to the human gene. Substantial conservation of nucleotide sequences between species in both 5'- and 3'-gene flanking regions equivalent to that among homologous exons suggests that murine and human genes may be regulated in a similar fashion. The probable relationship of the individual T11 exons to functional and structural protein domains is discussed.

  3. Amino acid sequences of alpha-helical segments from S-carbosymethylkerateine-A. Complete sequence of a type-I segment.

    PubMed Central

    Gough, K H; Inglis, A S; Crewther, W G

    1978-01-01

    The amino acid sequence of a type-I helical segment from the low-sulphur protein (S-carboxymethylkerateine-A) of wool was determined by combining automatic and manual-sequencing data. Whereas in the type-II helical segment most of the cationic groups occur in pairs, 11 of the 22 anionic residues in the sequence of the type-I segment were situated next to a second anionic residue. This suggests possible interactions between type-I and type-II helical segments in alpha-keratin. As observed with the sequence of a type-II helical segment a model constructed on 3.6 residues per turn of helix shows a line of hydrophobic residues along the helix, thereby supporting the physicochemical evidence that the molecule is predominantly helical and forms part of a coiled-coil structure. Examination of the sequence data by predictive methods indicates the possibilty of extensive sections of alpha-helix interspersed with discontinuities. The molecule contains a number of regions with peptide sequences identical with those found by other workers after enzymic digestion of fractions from oxidized wool. Images Fig. 1. PMID:697725

  4. Spermatogenesis of the lizard Lacerta vivipara: histological studies and amino acid sequence of a protamine lacertine 1.

    PubMed

    Martinage, A; Depeiges, A; Wouters, D; Morel, L; Sautière, P

    1996-06-01

    The lizard Lacerta vivipara is a seasonal breeder with a well characterized reproductive cycle. An histological study of the lizard testis has been performed at different stages of spermatogenesis and the nuclear basic proteins content was assessed by electrophoretical analysis. Two protamines, lacertines 1 and 2, are present in spermatozoa in April and May. We have isolated lacertine1 and characterized a protamine with a mass of 4,963.7 Da. Amino acid sequence of this protamine (41 residues) was established from data provided by automated Edman degradation. It is characterized by a basic amino acid stretch in the N- and C-terminal regions and by a central part which only consists of 3 different intermingled amino acids. This protamine presents 62% homology with scylliorhinine Z3 from dog-fish Scylliorhinus caniculus and 58% homology with quail protamine. The reported lizard protamine sequence is the first reptilian protamine sequence available so far.

  5. The amino acid sequence of the cytochrome c-554(547) from the chemolithotrophic bacterium Thiobacillus neapolitanus.

    PubMed Central

    Ambler, R P; Meyer, T E; Trudinger, P A; Kamen, M D

    1985-01-01

    An amino acid sequence is proposed for the cytochrome c-554(547) from the bacterium Thiobacillus neapolitanus N.C.I.B. 8539). It consists of a polypeptide chain of 91 residues, with a pair of haem-attachment cysteine residues at positions 15 and 18. There is similarity in sequence with each of the halves of the sequence of the dihaem cytochromes c4 and with a cytochrome c-554(548) from a halophilic strain of Paracoccus. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50127 (11 pages) at the British Library (Lending Division), Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1985) 225, 5. PMID:2988504

  6. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    SciTech Connect

    Myers, G.; Korber, B.; Wain-Hobson, S.; Smith, R.F.; Pavlakis, G.N.

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  7. Draft Genome Sequence of Escherichia coli O157:H7 ATCC 35150 and a Nalidixic Acid-Resistant Mutant Derivative

    PubMed Central

    Markell, James A.; Koziol, Adam G.

    2015-01-01

    Shiga toxin-producing Escherichia coli strains, occasionally isolated from food, are of public health importance. Here, we report on the 5.30-Mbp draft genome sequence of E. coli O157:H7 EDL931 (strain ATCC 35150) and the 5.32-Mbp draft genome sequence of a nalidixic acid-resistant mutant derivative used as a distinguishable control strain in food-testing laboratories. PMID:26205873

  8. Anaerobic organic acid metabolism of Candida zemplinina in comparison with Saccharomyces wine yeasts.

    PubMed

    Magyar, Ildikó; Nyitrai-Sárdy, Diána; Leskó, Annamária; Pomázi, Andrea; Kállay, Miklós

    2014-05-16

    Organic acid production under oxygen-limited conditions has been thoroughly studied in the Saccharomyces species, but practically never investigated in Candida zemplinina, which seems to be an acidogenic species under oxidative laboratory conditions. In this study, several strains of C. zemplinina were tested for organic acid metabolism, in comparison with Saccharomyces cerevisiae, Saccharomyces uvarum and Candida stellata, under fermentative conditions. Only C. stellata produced significantly higher acidity in simple minimal media (SM) with low sugar content and two different nitrogen sources (ammonia or glutamic acid) at low level. However, the acid profile differed largely between the Saccharomyces and Candida species and showed inverse types of N-dependence in some cases. Succinic acid production was strongly enhanced on glutamic acid in Saccharomyces species, but not in Candida species. 2-oxoglutarate production was strongly supported on ammonium nitrogen in Candida species, but remained low in Saccharomyces. Candida species, C. stellata in particular, produced more pyruvic acid regardless of N-sources. From the results, we concluded that the anaerobic organic acid metabolisms of C. zemplinina and C. stellata are different from each other and also from that of the Saccharomyces species. In the formation of succinic acid, the oxidative pathway from glutamic acid seems to play little or no role in C. zemplinina. The reductive branch of the TCA cycle, however, produces acidic intermediates (malic, fumaric, and succinic acid) in a level comparable with the production of the Saccharomyces species. An unidentified organic acid, which was produced on glutamic acid only by the Candida species, needs further investigation.

  9. Microwave-assisted acid and base hydrolysis of intact proteins containing disulfide bonds for protein sequence analysis by mass spectrometry.

    PubMed

    Reiz, Bela; Li, Liang

    2010-09-01

    Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein.

  10. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    NASA Astrophysics Data System (ADS)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  11. Comparison of sequencing-based methods to profile DNA methylation and identification of monoallelic epigenetic modifications

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Analysis of DNA methylation patterns relies increasingly on sequencing-based profiling methods. The four most frequently used sequencing-based technologies are the bisulfite-based methods MethylC-seq and reduced representation bisulfite sequencing (RRBS), and the enrichment-based techniques methylat...

  12. Comparison of intravenous acetylsalicylic acid and dipyrone in postoperative pain

    PubMed Central

    Blendinger, I.; Eberlein, H. J.

    1980-01-01

    1 In 17 gynaecological patients with postoperative pain the analgesic efficacy of intravenous lysine salicylate 1.8 g (corresponding to acetylsalicyclic acid (ASA) 1.0 g) and dipyrone 1.0 g were compared in a double-blind randomized study. 2 In the ASA group, mean pain relief and pain intensity difference scores reached a maximum 30 min after drug administration and remained at this level for the next 90 minutes. 3 In the dipyrone group, these scores reached their peak 60 min after drug administration and seemed to fall off during the next hour. 4 The mean pain relief and intensity difference scores were greater following aspirin than dipyrone. However, firm conclusions cannot be drawn from the results of this small study. PMID:7437274

  13. Thermochemical comparisons of homogeneous and heterogeneous acids and bases. 1. Sulfonic acid solutions and resins as prototype Broensted acids

    SciTech Connect

    Arnett, E.M.; Haaksma, R.A.; Chawla, B.; Healy, M.H.

    1986-08-06

    Heats of ionization by thermometric titration for a series of bases (or acids) can be used to compare solid acids (or bases) with liquid analogues bearing the same functionalities in homogeneous solutions. The method is demonstrated for Broensted acids by reacting a series of substituted nitrogen bases with solutions of p-toluenesulfonic acid (PTSA) in acetonitrile and with suspensions of the microporous polymeric arylsulfonic acid resin-Dowex 50W-X8 in the same solvent. Under well-controlled anhydrous conditions there is a good correlation (r = 0.992) between the heats of reaction of the bases with the homogeneous and heterogeneous acid systems, but the homogeneous system gives a more exothermic interaction by 3-4 kcal mol/sup -1/ for a series of 29 substituted pyrimidines, anilines, and some other amines. This difference may be attributed to homohydrogen bonding interactions between excess acid and sulfonate anion sites which are more restricted geometrically in the resin than in solution. Other factors which affect the enthalpy change for the acid-base interaction are the acid/base ratio, the water content of the sulfonic acid, the solvent, and the resin structure (e.g., microporous vs. macroporous). Steric hindrance in the base does not differentiate solid from homogeneous acid. In addition to the use of titration calorimetry, heats of immersion are reported for the Dowex-arylsulfonic acid resins and the Nafion-perfluorinated sulfonic acid resin in a series of basic liquids. The results are compared with each other and with those from a previous study of several varieties of coal.

  14. Purification, characterization, gene cloning and nucleotide sequencing of D: -stereospecific amino acid amidase from soil bacterium: Delftia acidovorans.

    PubMed

    Hongpattarakere, Tipparat; Komeda, Hidenobu; Asano, Yasuhisa

    2005-12-01

    The D-amino acid amidase-producing bacterium was isolated from soil samples using an enrichment culture technique in medium broth containing D-phenylalanine amide as a sole source of nitrogen. The strain exhibiting the strongest activity was identified as Delftia acidovorans strain 16. This strain produced intracellular D-amino acid amidase constitutively. The enzyme was purified about 380-fold to homogeneity and its molecular mass was estimated to be about 50 kDa, on sodium dodecyl sulfate polyacrylamide gel electrophoresis. The enzyme was active preferentially toward D-amino acid amides rather than their L-counterparts. It exhibited strong amino acid amidase activity toward aromatic amino acid amides including D-phenylalanine amide, D-tryptophan amide and D-tyrosine amide, yet it was not specifically active toward low-molecular-weight D-amino acid amides such as D-alanine amide, L-alanine amide and L-serine amide. Moreover, it was not specifically active toward oligopeptides. The enzyme showed maximum activity at 40 degrees C and pH 8.5 and appeared to be very stable, with 92.5% remaining activity after the reaction was performed at 45 degrees C for 30 min. However, it was mostly inactivated in the presence of phenylmethanesulfonyl fluoride or Cd2+, Ag+, Zn2+, Hg2+ and As3+ . The NH2 terminal and internal amino acid sequences of the enzyme were determined; and the gene was cloned and sequenced. The enzyme gene damA encodes a 466-amino-acid protein (molecular mass 49,860.46 Da); and the deduced amino acid sequence exhibits homology to the D-amino acid amidase from Variovorax paradoxus (67.9% identity), the amidotransferase A subunit from Burkholderia fungorum (50% identity) and other enantioselective amidases.

  15. Comparison of methods of extracting messenger Ribonucleic Acid from ejaculated Porcine (Sus Scrofa) Spermatozoa

    Technology Transfer Automated Retrieval System (TEKTRAN)

    H. D. Guthrie, G.R. Welch, and L. A. Blomberg. Comparison of Methods of Extracting Messenger Ribonucleic Acid from Ejaculated Porcine (Sus Scrofa) Spermatozoa. Biotechnology and Germplasm Laboratory, Agricultural Research Service U. S. Department of Agriculture, Beltsville, MD 20705 The purpos...

  16. Effects of Acidic Peptide Size and Sequence on Trivalent Praseodymium Adduction and Electron Transfer Dissociation Mass Spectrometry.

    PubMed

    Commodore, Juliette J; Cassady, Carolyn J

    2017-02-07

    Using the lanthanide ion praseodymium, Pr(III), metallated ion formation and electron transfer dissociation (ETD) were studied for 25 biological and model acidic peptides. For chain lengths of seven or more residues, even highly acidic peptides that can be difficult to protonate by electrospray ionization will metallate and undergo abundant ETD fragmentation. Peptides composed of predominantly acidic residues form only the deprotonated ion, [M + Pr - H](2+) ; this ion yields near complete ETD sequence coverage for larger peptides. Peptides with a mixture of acidic and neutral residues, generate [M + Pr](3+) , which cleaves between every residue for many peptides. Acidic peptides that contain at least one residue with a basic side chain also produce the protonated ion, [M + Pr + H](4+) ; this ion undergoes the most extensive sequence coverage by ETD. Primarily metallated and non-metallated c- and z-ions form for all peptides investigated. Metal adducted product ions are only present when at least half of the peptide sequence can be incorporated into the ion; this suggests that the metal ion simultaneously attaches to more than one acidic site. The only site consistently lacking dissociation is at the N-terminal side of a proline residue. Increasing peptide chain length generates more backbone cleavage for metal-peptide complexes with the same charge state. For acidic peptides with the same length, increasing the precursor ion charge state from 2+ to 3+ also leads to more cleavage. The results of this study indicate that highly acidic peptides can be sequenced by ETD of complexes formed with Pr(III).

  17. Extended amino acid sequences around the active-site lysine residue of class-I fructose 1,6-bisphosphate aldolases from rabbit muscle, sturgeon muscle, trout muscle and ox liver.

    PubMed Central

    Benfield, P A; Forcina, B G; Gibbons, I; Perham, R N

    1979-01-01

    1. Amino acid sequences covering the region between residues 173 and 248 [adopting the numbering system proposed by Lai, Nakai & Chang (1974) Science 183, 1204-1206] were derived for trout (Salmo trutta) muscle aldolase and for ox liver aldolase. A comparable sequence was derived for residues 180-248 of sturgeon (Acipenser transmontanus) muscle aldolase. The close homology with the rabbit muscle enzyme was used to align the peptides of the other aldolases from which the sequences were derived. The results also allowed a partial sequence for the N-terminal 39 residues for the ox liver enzyme to be deduced. 2. In the light of the strong homology evinced for these enzymes, a re-investigation of the amino acid sequence of rabbit muscle aldolase between residues 181 and 185 was undertaken. This indicated the presence of a hitherto unsuspected -Ile-Val-sequence between residues 181 and 182 and the need to invert the sequence -Glu-Val- to -Val-Glx- at positions 184 and 185. 3. Comparison of the available amino acid sequences of these enzymes suggested an early evolutionary divergence of the genes for muscle and liver aldolases. It was also consistent with other evidence that the central region of the primary structure of these enzymes (which includes the active-site lysine-227) forms part of a conserved folding domain in the protein subunit. 4. Detailed evidence for the amino acid sequences proposed has been deposited as Suy Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1978) 169, 5. PMID:534504

  18. From First Base: The Sequence of the Tip of the X Chromosome of Drosophila melanogaster, a Comparison of Two Sequencing Strategies

    PubMed Central

    Benos, Panayiotis V.; Gatt, Melanie K.; Murphy, Lee; Harris, David; Barrell, Bart; Ferraz, Concepcion; Vidal, Sophie; Brun, Christine; Demaille, Jacques; Cadieu, Edouard; Dreano, Stephane; Gloux, Stéphanie; Lelaure, Valerie; Mottier, Stephanie; Galibert, Francis; Borkova, Dana; Miñana, Belen; Kafatos, Fotis C.; Bolshakov, Slava; Sidén-Kiamos, Inga; Papagiannakis, George; Spanos, Lefteris; Louis, Christos; Madueño, Encarnación; de Pablos, Beatriz; Modolell, Juan; Peter, Annette; Schöttler, Petra; Werner, Meike; Mourkioti, Fotini; Beinert, Nicole; Dowe, Gordon; Schäfer, Ulrich; Jäckle, Herbert; Bucheton, Alain; Callister, Debbie; Campbell, Lorna; Henderson, Nadine S.; McMillan, Paul J.; Salles, Cathy; Tait, Evelyn; Valenti, Phillipe; Saunders, Robert D.C.; Billaud, Alain; Pachter, Lior; Glover, David M.; Ashburner, Michael

    2001-01-01

    We present the sequence of a contiguous 2.63 Mb of DNA extending from the tip of the X chromosome of Drosophila melanogaster. Within this sequence, we predict 277 protein coding genes, of which 94 had been sequenced already in the course of studying the biology of their gene products, and examples of 12 different transposable elements. We show that an interval between bands 3A2 and 3C2, believed in the 1970s to show a correlation between the number of bands on the polytene chromosomes and the 20 genes identified by conventional genetics, is predicted to contain 45 genes from its DNA sequence. We have determined the insertion sites of P-elements from 111 mutant lines, about half of which are in a position likely to affect the expression of novel predicted genes, thus representing a resource for subsequent functional genomic analysis. We compare the European Drosophila Genome Project sequence with the corresponding part of the independently assembled and annotated Joint Sequence determined through “shotgun” sequencing. Discounting differences in the distribution of known transposable elements between the strains sequenced in the two projects, we detected three major sequence differences, two of which are probably explained by errors in assembly; the origin of the third major difference is unclear. In addition there are eight sequence gaps within the Joint Sequence. At least six of these eight gaps are likely to be sites of transposable elements; the other two are complex. Of the 275 genes in common to both projects, 60% are identical within 1% of their predicted amino-acid sequence and 31% show minor differences such as in choice of translation initiation or termination codons; the remaining 9% show major differences in interpretation. [All of the sequences analyzed in this paper have been deposited in the EMBL-Bank database under the following accession nos.: AL009146, AL009147, AL009171, AL009188–AL009196, AL021067, AL021086, AL021106–AL021108, AL021726, AL

  19. Allelic diversity of the MHC class II DRB genes in brown bears (Ursus arctos) and a comparison of DRB sequences within the family Ursidae.

    PubMed

    Goda, N; Mano, T; Kosintsev, P; Vorobiev, A; Masuda, R

    2010-11-01

    The allelic diversity of the DRB locus in major histocompatibility complex (MHC) genes was analyzed in the brown bear (Ursus arctos) from the Hokkaido Island of Japan, Siberia, and Kodiak of Alaska. Nineteen alleles of the DRB exon 2 were identified from a total of 38 individuals of U. arctos and were highly polymorphic. Comparisons of non-synonymous and synonymous substitutions in the antigen-binding sites of deduced amino acid sequences indicated evidence for balancing selection on the bear DRB locus. The phylogenetic analysis of the DRB alleles among three genera (Ursus, Tremarctos, and Ailuropoda) in the family Ursidae revealed that DRB allelic lineages were not separated according to species. This strongly shows trans-species persistence of DRB alleles within the Ursidae.

  20. Method for the detection of specific nucleic acid sequences by polymerase nucleotide incorporation

    DOEpatents

    Castro, Alonso

    2004-06-01

    A method for rapid and efficient detection of a target DNA or RNA sequence is provided. A primer having a 3'-hydroxyl group at one end and having a sequence of nucleotides sufficiently homologous with an identifying sequence of nucleotides in the target DNA is selected. The primer is hybridized to the identifying sequence of nucleotides on the DNA or RNA sequence and a reporter molecule is synthesized on the target sequence by progressively binding complementary nucleotides to the primer, where the complementary nucleotides include nucleotides labeled with a fluorophore. Fluorescence emitted by fluorophores on single reporter molecules is detected to identify the target DNA or RNA sequence.

  1. Numerical encoding of DNA sequences by chaos game representation with application in similarity comparison.

    PubMed

    Hoang, Tung; Yin, Changchuan; Yau, Stephen S-T

    2016-10-01

    Numerical encoding plays an important role in DNA sequence analysis via computational methods, in which numerical values are associated with corresponding symbolic characters. After numerical representation, digital signal processing methods can be exploited to analyze DNA sequences. To reflect the biological properties of the original sequence, it is vital that the representation is one-to-one. Chaos Game Representation (CGR) is an iterative mapping technique that assigns each nucleotide in a DNA sequence to a respective position on the plane that allows the depiction of the DNA sequence in the form of image. Using CGR, a biological sequence can be transformed one-to-one to a numerical sequence that preserves the main features of the original sequence. In this research, we propose to encode DNA sequences by considering 2D CGR coordinates as complex numbers, and apply digital signal processing methods to analyze their evolutionary relationship. Computational experiments indicate that this approach gives comparable results to the state-of-the-art multiple sequence alignment method, Clustal Omega, and is significantly faster. The MATLAB code for our method can be accessed from: www.mathworks.com/matlabcentral/fileexchange/57152.

  2. [The hemoglobin of adult Andean condors (Vultur gryphus, Cathartiformes). The amino acid sequence of the major (HbA) and minor component (HbD)].

    PubMed

    Bauer, H; Braunitzer, G; Oberthür, W; Kösters, J; Grimm, F

    1985-12-01

    The complete amino-acid sequence of the alpha A- and the beta-chains of the major component (HbA) and the alpha D- and the beta-chains of the minor component (HbD) of Andean Condor (Vultur gryphus) is presented. The minor component with the alpha D-chains is present in smaller amounts (17%) than in other birds (25%). The comparison with the corresponding chains of Greylag Goose (Anser anser) shows 17 different amino acids (17 nucleotides, only one-point mutations) in the alpha A-chains and 8 (8 nucleotides) in the beta-chains. The alpha D-chains differ from those of the pheasant (Phasanius cholchicus cholchicus) in 24 amino acids (27 nucl., 3 two-point mutations). Seven alpha 1 beta 1-, one alpha 1 beta 2-, three alpha 1 alpha 1-contacts and one beta 1 beta 1-contact are exchanged. The systematy of Cathartiformes, Ciconiiformes and Phoenicopteriformes is discussed, based on the amino-acid exchanges of all known adult hemoglobins of birds.

  3. Comparison the effectiveness of pyruvic acid 50% and salicylic acid 30% in the treatment of acne

    PubMed Central

    Jaffary, Fariba; Faghihi, Gita; Saraeian, Sara; Hosseini, Sayed Mohsen

    2016-01-01

    Background: Acne vulgaris is a chronic inflammatory disease of the pilosebaceous follicles and one of the most common skin diseases. The peeling method has been recently found to be effective for acne treatment. This study aimed to compare the efficacy of pyruvic acid 50% and salicylic acid 30% peeling in the treatment of mild to moderate acne. Materials and Methods: In a prospective single-blinded clinical trial, 86 patients with acne were randomly assigned into two groups. In both groups, the routine treatment of acne (topical solution of erythromycin 4%, triclorocarban soap, and sunscreen) were used twice a day for 8 weeks. In addition, salicylic acid 30% for the control group and pyruvic acid 50% for the case group were used. In both groups, acne severity index (ASI) was calculated before and at week 2, 4, 6, and 8 of the treatment. Patient satisfaction was assessed at the end of the treatment. Side effects were recorded using a checklist. Results: In both groups, the reduction in the number of comedones, papules, and ASI were statistically significant (P < 0.001) in the course of treatment. However, it was not significant regarding the number of pustules (P = 0.09). None of the number of comedone, papules, pustules, and ASI was statistically different between study groups. Both treatment groups had similar side effects except for scaling in the fifth session, which was significantly lower in salicylic acid – treated patients (P = 0.015). Conclusion: Both pyruvic acid 50% and salicylic acid 30% are effective in the improvement of mild to moderate acne with no significant difference in efficacy and side effects. PMID:27904577

  4. Comparison of sequences of RNAs 3 and 4 of rice stripe virus from China with those of Japanese isolates.

    PubMed

    Qu, Z; Liang, D; Harper, G; Hull, R

    1997-01-01

    The sequences were determined of RNAs 3 and 4 of a Chinese isolate (Y) of rice stripe tenuivirus (RStV) and were compared with those of two RStV isolates (M and T) from Japan. Both RNAs of the Y isolate were longer than those of the M and T isolates. There was almost complete conservation in the 5' and 3' non-coding regions for each RNA between the isolates. The analogous ambisense coding regions for each isolate were exactly the same size and the sequences were highly conserved. The major differences were in the intergenic regions, the sizes of which accounted for the differences in size of each RNA of the three isolates. There were no obvious patterns of differences in comparisons of the two RNA over the three isolates. The significance of the similarities and differences in sequences of isolates of RStV separated by more than 3500 km is discussed.

  5. Comparison of base composition analysis and Sanger sequencing of mitochondrial DNA for four U.S. population groups.

    PubMed

    Kiesler, Kevin M; Coble, Michael D; Hall, Thomas A; Vallone, Peter M

    2014-01-01

    A set of 711 samples from four U.S. population groups was analyzed using a novel mass spectrometry based method for mitochondrial DNA (mtDNA) base composition profiling. Comparison of the mass spectrometry results with Sanger sequencing derived data yielded a concordance rate of 99.97%. Length heteroplasmy was identified in 46% of samples and point heteroplasmy was observed in 6.6% of samples in the combined mass spectral and Sanger data set. Using discrimination capacity as a metric, Sanger sequencing of the full control region had the highest discriminatory power, followed by the mass spectrometry base composition method, which was more discriminating than Sanger sequencing of just the hypervariable regions. This trend is in agreement with the number of nucleotides covered by each of the three assays.

  6. Sequence heterogeneity of cannabidiolic- and tetrahydrocannabinolic acid-synthase in Cannabis sativa L. and its relationship with chemical phenotype.

    PubMed

    Onofri, Chiara; de Meijer, Etienne P M; Mandolino, Giuseppe

    2015-08-01

    Sequence variants of THCA- and CBDA-synthases were isolated from different Cannabis sativa L. strains expressing various wild-type and mutant chemical phenotypes (chemotypes). Expressed and complete sequences were obtained from mature inflorescences. Each strain was shown to have a different specificity and/or ability to convert the precursor CBGA into CBDA and/or THCA type products. The comparison of the expressed sequences led to the identification of different mutations, all of them due to SNPs. These SNPs were found to relate to the cannabinoid composition of the inflorescence at maturity and are therefore proposed to have a functional significance. The amount of variation was found to be higher within the CBDAS sequence family than in the THCAS family, suggesting a more recent evolution of THCA-forming enzymes from the CBDAS group. We therefore consider CBDAS as the ancestral type of these synthases.

  7. Identification of tropomyosins as major allergens in antarctic krill and mantis shrimp and their amino acid sequence characteristics.

    PubMed

    Motoyama, Kanna; Suma, Yota; Ishizaki, Shoichiro; Nagashima, Yuji; Lu, Ying; Ushio, Hideki; Shiomi, Kazuo

    2008-01-01

    Tropomyosin represents a major allergen of decapod crustaceans such as shrimps and crabs, and its highly conserved amino acid sequence (>90% identity) is a molecular basis of the immunoglobulin E (IgE) cross-reactivity among decapods. At present, however, little information is available about allergens in edible crustaceans other than decapods. In this study, the major allergen in two species of edible crustaceans, Antarctic krill Euphausia superba and mantis shrimp Oratosquilla oratoria that are taxonomically distinct from decapods, was demonstrated to be tropomyosin by IgE-immunoblotting using patient sera. The cross-reactivity of the tropomyosins from both species with decapod tropomyosins was also confirmed by inhibition IgE immunoblotting. Sequences of the tropomyosins from both species were determined by complementary deoxyribonucleic acid cloning. The mantis shrimp tropomyosin has high sequence identity (>90% identity) with decapod tropomyosins, especially with fast-type tropomyosins. On the other hand, the Antarctic krill tropomyosin is characterized by diverse alterations in region 13-42, the amino acid sequence of which is highly conserved for decapod tropomyosins, and hence, it shares somewhat lower sequence identity (82.4-89.8% identity) with decapod tropomyosins than the mantis shrimp tropomyosin. Quantification by enzyme-linked immunosorbent assay revealed that Antarctic krill contains tropomyosin at almost the same level as decapods, suggesting that its allergenicity is equivalent to decapods. However, mantis shrimp was assumed to be substantially not allergenic because of the extremely low content of tropomyosin.

  8. Molecular cloning and sequencing of a cDNA encoding the thioesterase domain of the rat fatty acid synthetase.

    PubMed

    Naggert, J; Witkowski, A; Mikkelsen, J; Smith, S

    1988-01-25

    A cloned cDNA containing the entire coding sequence for the long-chain S-acyl fatty acid synthetase thioester hydrolase (thioesterase I) component as well as the 3'-noncoding region of the fatty acid synthetase has been isolated using an expression vector and domain-specific antibodies. The coding region was assigned to the thioesterase I domain by identification of sequences coding for characterized peptide fragments, amino-terminal analysis of the isolated thioesterase I domain and the presence of the serine esterase active-site sequence motif. The thioesterase I domain is 306 amino acids long with a calculated molecular mass of 33,476 daltons; its DNA is flanked at the 5'-end by a region coding for the acyl carrier protein domain and at the 3'-end by a 1,537-base pairs-long noncoding sequence with a poly(A) tail. The thioesterase I domain exhibits a low, albeit discernible, homology with the discrete medium-chain S-acyl fatty acid synthetase thioester hydrolases (thioesterase II) from rat mammary gland and duck uropygial gland, suggesting a distant but common evolutionary ancestry for these proteins.

  9. Comparison of Exome and Genome Sequencing Technologies for the Complete Capture of Protein‐Coding Regions

    PubMed Central

    Lelieveld, Stefan H.; Spielmann, Malte; Mundlos, Stefan; Veltman, Joris A.

    2015-01-01

    ABSTRACT For next‐generation sequencing technologies, sufficient base‐pair coverage is the foremost requirement for the reliable detection of genomic variants. We investigated whether whole‐genome sequencing (WGS) platforms offer improved coverage of coding regions compared with whole‐exome sequencing (WES) platforms, and compared single‐base coverage for a large set of exome and genome samples. We find that WES platforms have improved considerably in the last years, but at comparable sequencing depth, WGS outperforms WES in terms of covered coding regions. At higher sequencing depth (95x–160x), WES successfully captures 95% of the coding regions with a minimal coverage of 20x, compared with 98% for WGS at 87‐fold coverage. Three different assessments of sequence coverage bias showed consistent biases for WES but not for WGS. We found no clear differences for the technologies concerning their ability to achieve complete coverage of 2,759 clinically relevant genes. We show that WES performs comparable to WGS in terms of covered bases if sequenced at two to three times higher coverage. This does, however, go at the cost of substantially more sequencing biases in WES approaches. Our findings will guide laboratories to make an informed decision on which sequencing platform and coverage to choose. PMID:25973577

  10. Comparison of Exome and Genome Sequencing Technologies for the Complete Capture of Protein-Coding Regions.

    PubMed

    Lelieveld, Stefan H; Spielmann, Malte; Mundlos, Stefan; Veltman, Joris A; Gilissen, Christian

    2015-08-01

    For next-generation sequencing technologies, sufficient base-pair coverage is the foremost requirement for the reliable detection of genomic variants. We investigated whether whole-genome sequencing (WGS) platforms offer improved coverage of coding regions compared with whole-exome sequencing (WES) platforms, and compared single-base coverage for a large set of exome and genome samples. We find that WES platforms have improved considerably in the last years, but at comparable sequencing depth, WGS outperforms WES in terms of covered coding regions. At higher sequencing depth (95x-160x), WES successfully captures 95% of the coding regions with a minimal coverage of 20x, compared with 98% for WGS at 87-fold coverage. Three different assessments of sequence coverage bias showed consistent biases for WES but not for WGS. We found no clear differences for the technologies concerning their ability to achieve complete coverage of 2,759 clinically relevant genes. We show that WES performs comparable to WGS in terms of covered bases if sequenced at two to three times higher coverage. This does, however, go at the cost of substantially more sequencing biases in WES approaches. Our findings will guide laboratories to make an informed decision on which sequencing platform and coverage to choose.

  11. Human parainfluenza type 3 virus hemagglutinin-neuraminidase glycoprotein: nucleotide sequence of mRNA and limited amino acid sequence of the purified protein.

    PubMed Central

    Elango, N; Coligan, J E; Jambou, R C; Venkatesan, S

    1986-01-01

    The nucleotide sequence of mRNA for the hemagglutinin-neuraminidase (HN) protein of human parainfluenza type 3 virus obtained from the corresponding cDNA clone had a single long open reading frame encoding a putative protein of 64,254 daltons consisting of 572 amino acids. The deduced protein sequence was confirmed by limited N-terminal amino acid microsequencing of CNBr cleavage fragments of native HN that was purified by immunoprecipitation. The HN protein is moderately hydrophobic and has four potential sites (Asn-X-Ser/Thr) of N-glycosylation in the C-terminal half of the molecule. It is devoid of both the N-terminal signal sequence and the C-terminal membrane anchorage domain characteristic of the hemagglutinin of influenza virus and the fusion (F0) protein of the paramyxoviruses. Instead, it has a single prominent hydrophobic region capable of membrane insertion beginning at 32 residues from the N terminus. This N-terminal membrane insertion is similar to that of influenza virus neuraminidase and the recently reported structures of HN proteins of Sendai virus and simian virus 5. Images PMID:3003381

  12. Sequence dependent N-terminal rearrangement and degradation of peptide nucleic acid (PNA) in aqueous solution

    NASA Technical Reports Server (NTRS)

    Eriksson, M.; Christensen, L.; Schmidt, J.; Haaima, G.; Orgel, L.; Nielsen, P. E.

    1998-01-01

    The stability of the PNA (peptide nucleic acid) thymine monomer inverted question markN-[2-(thymin-1-ylacetyl)]-N-(2-aminoaminoethyl)glycine inverted question mark and those of various PNA oligomers (5-8-mers) have been measured at room temperature (20 degrees C) as a function of pH. The thymine monomer undergoes N-acyl transfer rearrangement with a half-life of 34 days at pH 11 as analyzed by 1H NMR; and two reactions, the N-acyl transfer and a sequential degradation, are found by HPLC analysis to occur at measurable rates for the oligomers at pH 9 or above. Dependent on the amino-terminal sequence, half-lives of 350 h to 163 days were found at pH 9. At pH 12 the half-lives ranged from 1.5 h to 21 days. The results are discussed in terms of PNA as a gene therapeutic drug as well as a possible prebiotic genetic material.

  13. Solubility Challenges in High Concentration Monoclonal Antibody Formulations: Relationship with Amino Acid Sequence and Intermolecular Interactions.

    PubMed

    Pindrus, Mariya; Shire, Steven J; Kelley, Robert F; Demeule, Barthélemy; Wong, Rita; Xu, Yiren; Yadav, Sandeep

    2015-11-02

    The purpose of this work was to elucidate the molecular interactions leading to monoclonal antibody self-association and precipitation and utilize biophysical measurements to predict solubility behavior at high protein concentration. Two monoclonal antibodies (mAb-G and mAb-R) binding to overlapping epitopes were investigated. Precipitation of mAb-G solutions was most prominent at high ionic strength conditions and demonstrated strong dependence on ionic strength, as well as slight dependence on solution pH. At similar conditions no precipitation was observed for mAb-R solutions. Intermolecular interactions (interaction parameter, kD) related well with high concentration solubility behavior of both antibodies. Upon increasing buffer ionic strength, interactions of mAb-R tended to weaken, while those of mAb-G became more attractive. To investigate the role of amino acid sequence on precipitation behavior, mutants were designed by substituting the CDR of mAb-R into the mAb-G framework (GM-1) or deleting two hydrophobic residues in the CDR of mAb-G (GM-2). No precipitation was observed at high ionic strength for either mutant. The molecular interactions of mutants were similar in magnitude to those of mAb-R. The results suggest that presence of hydrophobic groups in the CDR of mAb-G may be responsible for compromising its solubility at high ionic strength conditions since deleting these residues mitigated the solubility issue.

  14. Frequencies of amino acid strings in globular protein sequences indicate suppression of blocks of consecutive hydrophobic residues

    PubMed Central

    Schwartz, Russell; Istrail, Sorin; King, Jonathan

    2001-01-01

    Patterns of hydrophobic and hydrophilic residues play a major role in protein folding and function. Long, predominantly hydrophobic strings of 20–22 amino acids each are associated with transmembrane helices and have been used to identify such sequences. Much less attention has been paid to hydrophobic sequences within globular proteins. In prior work on computer simulations of the competition between on-pathway folding and off-pathway aggregate formation, we found that long sequences of consecutive hydrophobic residues promoted aggregation within the model, even controlling for overall hydrophobic content. We report here on an analysis of the frequencies of different lengths of contiguous blocks of hydrophobic residues in a database of amino acid sequences of proteins of known structure. Sequences of three or more consecutive hydrophobic residues are found to be significantly less common in actual globular proteins than would be predicted if residues were selected independently. The result may reflect selection against long blocks of hydrophobic residues within globular proteins relative to what would be expected if residue hydrophobicities were independent of those of nearby residues in the sequence. PMID:11316883

  15. Comparison of ribosomal RNA removal methods for transcriptome sequencing workflows in teleost fish

    Technology Transfer Automated Retrieval System (TEKTRAN)

    RNA sequencing (RNA-Seq) is becoming the standard for transcriptome analysis. Removal of contaminating ribosomal RNA (rRNA) is a priority in the preparation of libraries suitable for sequencing. rRNAs are commonly removed from total RNA via either mRNA selection or rRNA depletion. These methods have...

  16. Genomic sequence and virulence comparison of four type 2 porcine reproductive and respiratory syndrome virus strains

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Porcine reproductive and respiratory syndrome virus (PRRSV) is a ubiquitous and costly virus that exhibits substantial sequence and virulence disparity among diverse isolates. In this study, we compared the whole genomic sequence and virulence of 4 North American Type 2 PRRSV isolates. Among the 4 i...

  17. Sequence Tolerance of a Highly Stable Single Domain Antibody: Comparison of Computational and Experimental Profiles

    DTIC Science & Technology

    2016-09-09

    unusually high thermal stability is explored by a combined computational and experimental study. Starting with the crystallographic structure ...RosettaBackrub simulations were applied to model sequence- structure tolerance profiles and identify key substitution sites. Experimental site-directed...1]. From an experimental perspective, sequence- structure tolerance is typically probed by site-directed mutagenesis and the landscape is thermal

  18. Comparison of ancient and modern Clonorchis sinensis based on ITS1 and ITS2 sequences.

    PubMed

    Liu, Wen-Qi; Liu, Juan; Zhang, Jun-Hua; Long, Xiao-Chun; Lei, Jia-Hui; Li, Yong-Long

    2007-02-01

    In 1975, an ancient corpse buried in 167 BC was found at Jiangling County, Hubei Province of China. The eggs of Clonorchis sinensis found in the gall bladder of the corpse were preserved well. In the present paper, we extracted the genomic DNA from the ancient eggs and modern eggs, respectively, and the internal transcribed spacer 1 and 2 (ITS1 and ITS2) at ribosomal RNA genes were studied. The results show that ITS2 sequences from the ancient sample were identical with those from modern samples, but in ITS1 differences in 15 nucleotide positions were found between the ancient and modern samples. The results demonstrated that it is possible to extract and sequence DNA from ancient parasite eggs. The ITS1 sequence obtained differed from all modern ones available to date. This might indicate sequence divergence through time, or might reflect a sequence polymorphism that may eventually be found also in modern samples.

  19. Classifying nucleic acid sub-sequences as introns or exons using genetic programming

    SciTech Connect

    Handley, S.

    1995-12-31

    An evolutionary computation technique, genetic programming, created programs that classify messenger RNA sequences into one of two classes: (1) the sequence is expressed as (part of) a protein (an exon), or (2) not expressed as protein (an intron).

  20. 5S ribosomal ribonucleic acid sequences in Bacteroides and Fusobacterium: evolutionary relationships within these genera and among eubacteria in general

    NASA Technical Reports Server (NTRS)

    Van den Eynde, H.; De Baere, R.; Shah, H. N.; Gharbia, S. E.; Fox, G. E.; Michalik, J.; Van de Peer, Y.; De Wachter, R.

    1989-01-01

    The 5S ribosomal ribonucleic acid (rRNA) sequences were determined for Bacteroides fragilis, Bacteroides thetaiotaomicron, Bacteroides capillosus, Bacteroides veroralis, Porphyromonas gingivalis, Anaerorhabdus furcosus, Fusobacterium nucleatum, Fusobacterium mortiferum, and Fusobacterium varium. A dendrogram constructed by a clustering algorithm from these sequences, which were aligned with all other hitherto known eubacterial 5S rRNA sequences, showed differences as well as similarities with respect to results derived from 16S rRNA analyses. In the 5S rRNA dendrogram, Bacteroides clustered together with Cytophaga and Fusobacterium, as in 16S rRNA analyses. Intraphylum relationships deduced from 5S rRNAs suggested that Bacteroides is specifically related to Cytophaga rather than to Fusobacterium, as was suggested by 16S rRNA analyses. Previous taxonomic considerations concerning the genus Bacteroides, based on biochemical and physiological data, were confirmed by the 5S rRNA sequence analysis.

  1. Clickable Nucleic Acids: Sequence-Controlled Periodic Copolymer/Oligomer Synthesis by Orthogonal Thiol-X Reactions.

    PubMed

    Xi, Weixian; Pattanayak, Sankha; Wang, Chen; Fairbanks, Benjamin; Gong, Tao; Wagner, Justine; Kloxin, Christopher J; Bowman, Christopher N

    2015-11-23

    Synthetic polymer approaches generally lack the ability to control the primary sequence, with sequence control referred to as the holy grail. Two click chemistry reactions were now combined to form nucleobase-containing sequence-controlled polymers in simple polymerization reactions. Two distinct approaches are used to form these click nucleic acid (CNA) polymers. These approaches employ thiol-ene and thiol-Michael reactions to form homopolymers of a single nucleobase (e.g., poly(A)n ) or homopolymers of specific repeating nucleobase sequences (e.g., poly(ATC)n). Furthermore, the incorporation of monofunctional thiol-terminated polymers into the polymerization system enables the preparation of multiblock copolymers in a single reaction vessel; the length of the diblock copolymer can be tuned by the stoichiometric ratio and/or the monomer functionality. These polymers are also used for organogel formation where complementary CNA-based polymers form reversible crosslinks.

  2. Genome sequence of Clostridium sporogenes DSM 795(T), an amino acid-degrading, nontoxic surrogate of neurotoxin-producing Clostridium botulinum.

    PubMed

    Poehlein, Anja; Riegel, Karin; König, Sandra M; Leimbach, Andreas; Daniel, Rolf; Dürre, Peter

    2015-01-01

    Clostridium sporogenes DSM 795 is the type strain of the species Clostridium sporogenes, first described by Metchnikoff in 1908. It is a Gram-positive, rod-shaped, anaerobic bacterium isolated from human faeces and belongs to the proteolytic branch of clostridia. C. sporogenes attracts special interest because of its potential use in a bacterial therapy for certain cancer types. Genome sequencing and annotation revealed several gene clusters coding for proteins involved in anaerobic degradation of amino acids, such as glycine and betaine via Stickland reaction. Genome comparison showed that C. sporogenes is closely related to C. botulinum. The genome of C. sporogenes DSM 795 consists of a circular chromosome of 4.1 Mb with an overall GC content of 27.81 mol% harboring 3,744 protein-coding genes, and 80 RNAs.

  3. Differentiation of acetic acid bacteria based on sequence analysis of 16S-23S rRNA gene internal transcribed spacer sequences.

    PubMed

    González, Angel; Mas, Albert

    2011-06-30

    The 16S-23S gene internal transcribed spacer sequence of sixty-four strains belonging to different acetic acid bacteria genera were analyzed, and phylogenetic trees were generated for each genera. The topologies of the different trees were in accordance with the 16S rRNA gene trees, although the similarity percentages obtained between the species was shown to be much lower. These values suggest the usefulness of including the 16S-23S gene internal transcribed spacer region as a part of the polyphasic approach required for the further classification of acetic acid bacteria. Furthermore, the region could be a good target for primer and probe design. It has also been validated for use in the identification of unknown samples of this bacterial group from wine vinegar and fruit condiments.

  4. Magnetic resonance elastography: A comparison between pulse sequences across field strengths

    NASA Astrophysics Data System (ADS)

    Owen, Graham

    Several Magnetic Resonance Elastography (MRE) techniques have been developed to non-invasively measure tissue stiffness which can be altered by disease processes such as liver fibrosis. Different MRE sequences are needed to fill various roles clinically such as spin-echo based sequences for patients with iron overload, or rapid sequences for patients who cannot execute long breath holds. The purpose of this study was to compare the mean stiffness, variance, and presence of artifacts using three MRE sequences at 1.5T and 3T in phantoms and healthy volunteers. In the phantom study variance was found to decrease with increasing slice thickness as well as at higher field strength. The SE-EPI sequence tended to overestimate low stiffness and underestimate high stiffness while the rapid sequence significantly overestimated stiffness of both the soft and stiff phantom. In the volunteers no significant difference was found between the sequences in terms of measured stiffness. The variability between acquisitions in a single setup as well as between setups was minimal, showing that MRE is a very robust technique.

  5. Comparison with Magnetic Resonance Three-Dimensional Sequence for Lumbar Nerve Root with Intervertebral Foramen

    PubMed Central

    Takashima, Hiroyuki; Shishido, Hiroki; Yoshimoto, Mitsunori; Imamura, Rui; Akatsuka, Yoshihiro; Terashima, Yoshinori; Fujiwara, Hiroyoshi; Nagae, Masateru; Kubo, Toshikazu; Yamashita, Toshihiko

    2016-01-01

    Study Design Prospective study based on magnetic resonance (MR) imaging of the lumbar spinal root of the intervertebral foramen. Purpose This study was to compare MR three-dimensional (3D) sequences for the evaluation of the lumbar spinal root of the intervertebral foramen. Overview of Literature The diagnosis of spinal disorders by MR imaging is commonly performed using two-dimensional T1- and T2-weighted images, whereas 3D MR images can be used for acquiring further detailed data using thin slices with multi-planar reconstruction. Methods On twenty healthy volunteers, we investigated the contrast-to-noise ratio (CNR) of the lumbar spinal root of the intervertebral foramen with a 3D balanced sequence. The sequences used were the fast imaging employing steady state acquisition and the coherent oscillatory state acquisition for the manipulation of image contrast (COSMIC). COSMIC can be used with or without fat suppression (FS). We compared these sequence to determine the optimized visualization sequence for the lumbar spinal root of the intervertebral foramen. Results For the CNR between the nerve root and the peripheral tissue, these were no significant differences between the sequences at the entry of foramen. There was a significant difference and the highest CNR was seen with COSMIC-FS for the intra- and extra-foramen. Conclusions In this study, the findings suggest that the COSMIC-FS sequences should be used for the internal or external foramen for spinal root disorders. PMID:26949459

  6. Human ribosomal RNA gene: nucleotide sequence of the transcription initiation region and comparison of three mammalian genes.

    PubMed Central

    Financsek, I; Mizumoto, K; Mishima, Y; Muramatsu, M

    1982-01-01

    The transcription initiation site of the human ribosomal RNA gene (rDNA) was located by using the single-strand specific nuclease protection method and by determining the first nucleotide of the in vitro capped 45S preribosomal RNA. The sequence of 1,211 nucleotides surrounding the initiation site was determined. The sequenced region was found to consist of 75% G and C and to contain a number of short direct and inverted repeats and palindromes. By comparison of the corresponding initiation regions of three mammalian species, several conserved sequences were found upstream and downstream from the transcription starting point. Two short A + T-rich sequences are present on human, mouse, and rat ribosomal RNA genes between the initiation site and 40 nucleotides upstream, and a C + T cluster is located at a position around -60. At and downstream from the initiation site, a common sequence, T-AG-C-T-G-A-C-A-C-G-C-T-G-T-C-C-T-CT-T, was found in the three genes from position -1 through +18. The strong conservation of these sequences suggests their functional significance in rDNA. The S1 nuclease protection experiments with cloned rDNA fragments indicated the presence in human 45S RNA of molecules several hundred nucleotides shorter than the supposed primary transcript. The first 19 nucleotides of these molecules appear identical--except for one mismatch--to the nucleotide sequence of the 5' end of a supposed early processing product of the mouse 45S RNA. Images PMID:6954460

  7. A rapid method for manual or automated purification of fluorescently labeled nucleic acids for sequencing, genotyping, and microarrays.

    PubMed

    Springer, Amy L; Booth, Lisa R; Braid, Michael D; Houde, Christiane M; Hughes, Karin A; Kaiser, Robert J; Pedrak, Casandra; Spicer, Douglas A; Stolyar, Sergey

    2003-03-01

    Fluorescent dyes provide specific, sensitive, and multiplexed detection of nucleic acids. To maximize sensitivity, fluorescently labeled reaction products (e.g., cycle sequencing or primer extension products) must be purified away from residual dye-labeled precursors. Successful high-throughput analyses require that this purification be reliable, rapid, and amenable to automation. Common methods for purifying reaction products involve several steps and require processes that are not easily automated. Prolinx, Inc. has devel oped RapXtract superparamagnetic separation technology affording rapid and easy-to-perform methods that yield high-quality product and are easily automated. The technology uses superparamagnetic particles that specifically remove unincorporated dye-labeled precursors. These particles are efficiently pelleted in the presence of a magnetic field, making them ideal for purification because of the rapid separations that they allow. RapXtract-purified sequencing reactions yield data with good signal and high Phred quality scores, and they work with various sequencing dye chemistries, including BigDye and near-infrared fluorescence IRDyes. RapXtract technology can also be used to purify dye primer sequencing reactions, primer extension reactions for genotyping analysis, and nucleic acid labeling reactions for microarray hybridization. The ease of use and versatility of RapXtract technology makes it a good choice for manual or automated purification of fluorescently labeled nucleic acids.

  8. Gene structure and amino acid sequence of Latimeria chalumnae (coelacanth) myelin DM20: phylogenetic relation of the fish.

    PubMed

    Tohyama, Y; Kasama-Yoshida, H; Sakuma, M; Kobayashi, Y; Cao, Y; Hasegawa, M; Kojima, H; Tamai, Y; Tanokura, M; Kurihara, T

    1999-07-01

    The structure of Latimeria chalumnae (coelacanth) proteolipid protein/DM20 gene excluding exon 1 was determined, and the amino acid sequence of Latimeria DM20 corresponding to exons 2-7 was deduced. The nucleotide sequence of exon 3 suggests that only DM20 isoform is expressed in Latimeria. The structure of proteolipid protein/DM20 gene is well preserved among human, dog, mouse, and Latimeria. Southern blot analysis indicates that Latimeria DM20 gene is a single-copy gene. When the amino acid sequences of DM20 were compared among various species, Latimeria was more similar to tetrapods than other fishes including lungfish, confirming the previous finding by immunoreactivity (Waehneldt and Malotka 1989 J. Neurochem. 52:1941-1943). However, when phylogenetic trees were constructed from the DM20 sequences, lungfish was clearly the closest to tetrapods. Latimeria was situated outside of lungfish by the maximum likelihood method. The apparent similarity of Latimeria DM20 to tetrapod proteolipid protein/DM20 is explained by the slow amino acid substitution rate of Latimeria DM20.

  9. Heat*seq: an interactive web tool for high-throughput sequencing experiment comparison with public data

    PubMed Central

    Devailly, Guillaume; Mantsoki, Anna; Joshi, Anagha

    2016-01-01

    Summary: Better protocols and decreasing costs have made high-throughput sequencing experiments now accessible even to small experimental laboratories. However, comparing one or few experiments generated by an individual lab to the vast amount of relevant data freely available in the public domain might be limited due to lack of bioinformatics expertise. Though several tools, including genome browsers, allow such comparison at a single gene level, they do not provide a genome-wide view. We developed Heat*seq, a web-tool that allows genome scale comparison of high throughput experiments chromatin immuno-precipitation followed by sequencing, RNA-sequencing and Cap Analysis of Gene Expression) provided by a user, to the data in the public domain. Heat*seq currently contains over 12 000 experiments across diverse tissues and cell types in human, mouse and drosophila. Heat*seq displays interactive correlation heatmaps, with an ability to dynamically subset datasets to contextualize user experiments. High quality figures and tables are produced and can be downloaded in multiple formats. Availability and Implementation: Web application: http://www.heatstarseq.roslin.ed.ac.uk/. Source code: https://github.com/gdevailly. Contact: Guillaume.Devailly@roslin.ed.ac.uk or Anagha.Joshi@roslin.ed.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27378302

  10. Massively parallel rRNA gene sequencing exacerbates the potential for biased community diversity comparisons due to variable library sizes

    SciTech Connect

    Gihring, Thomas; Green, Stefan; Schadt, Christopher Warren

    2011-01-01

    Technologies for massively parallel sequencing are revolutionizing microbial ecology and are vastly increasing the scale of ribosomal RNA (rRNA) gene studies. Although pyrosequencing has increased the breadth and depth of possible rRNA gene sampling, one drawback is that the number of reads obtained per sample is difficult to control. Pyrosequencing libraries typically vary widely in the number of sequences per sample, even within individual studies, and there is a need to revisit the behaviour of richness estimators and diversity indices with variable gene sequence library sizes. Multiple reports and review papers have demonstrated the bias in non-parametric richness estimators (e.g. Chao1 and ACE) and diversity indices when using clone libraries. However, we found that biased community comparisons are accumulating in the literature. Here we demonstrate the effects of sample size on Chao1, ACE, CatchAll, Shannon, Chao-Shen and Simpson's estimations specifically using pyrosequencing libraries. The need to equalize the number of reads being compared across libraries is reiterated, and investigators are directed towards available tools for making unbiased diversity comparisons.

  11. Cloning and sequencing of the Bet v 1-homologous allergen Fra a 1 in strawberry (Fragaria ananassa) shows the presence of an intron and little variability in amino acid sequence.

    PubMed

    Musidlowska-Persson, Anna; Alm, Rikard; Emanuelsson, Cecilia

    2007-02-01

    The Fra a 1 allergen in strawberry (Fragaria ananassa) is homologous to the major birch pollen allergen Bet v 1, which has numerous isoforms differing in terms of amino acid sequence and immunological impact. To map the extent of sequence differences in the Fra a 1 allergen, PCR cloning and sequencing was applied. Several genomic sequences of Fra a 1, with a length of either 584, 591 or 594 nucleotides, were obtained from three different strawberry varieties. All contained one intron, with the length of either 101 or 110 nucleotides. By sequencing 30 different clones, eight different DNA sequences were obtained, giving in total five potential Fra a 1 protein isoforms, with high sequence similarity (>97% sequence identity) and only seven positions of amino acid variability, which were largely confirmed by mass spectrometry of expressed proteins. We conclude that the sequence variability in the strawberry allergen Fra a 1 is small, within and between strawberry varieties, and that multiple spots, previously detected in 2DE, are presumably due to differences in post-translational modification rather than differences in amino acid sequence. The most abundant Fra a 1 isoform sequence, recombinantly expressed in Escherichia coli after removal of the intron, was recognized by IgE from strawberry allergic patients. It cross-reacted with antibodies to Bet v 1 and the homologous apple allergen Mal d 1 (61 and 78% sequence identity, respectively), and will be used in further analyses of variation in Fra a 1-expression.

  12. Comparison of phenolic acids profile and antioxidant potential of six varieties of spelt (Triticum spelta L.).

    PubMed

    Gawlik-Dziki, Urszula; Świeca, Michał; Dziki, Dariusz

    2012-05-09

    Phenolic acids profile and antioxidant activity of six diverse varieties of spelt are reported. Antioxidant activity was assessed using eight methods based on different mechanism of action. Phenolic acids composition of spelt differed significantly between varieties and ranged from 506.6 to 1257.4 μg/g DW. Ferulic and sinapinic acids were the predominant phenolic acids found in spelt. Total ferulic acid content ranged from 144.2 to 691.5 μg/g DW. All analyzed spelt varieties possessed high antioxidant potential. In spite of the fact that bound phenolic acids possessed higher antioxidant activities, analysis of antioxidant potential and their relationship with phenolic acid content showed that free phenolics were more effective. Eight antioxidant methods were integrated to obtain a total antioxidant capacity index that may be used for comparison of total antioxidant capacity of spelt varieties. Total antioxidant potential of spelt cultivars were ordered as follows: Ceralio > Spelt INZ ≈ Ostro > Oberkulmer Rotkorn > Schwabenspelz > Schwabenkorn.

  13. K-Pax2: Bayesian identification of cluster-defining amino acid positions in large sequence datasets

    PubMed Central

    Grad, Yonatan; Cobey, Sarah; Puranen, Juha Santeri; Corander, Jukka

    2015-01-01

    The recent growth in publicly available sequence data has introduced new opportunities for studying microbial evolution and spread. Because the pace of sequence accumulation tends to exceed the pace of experimental studies of protein function and the roles of individual amino acids, statistical tools to identify meaningful patterns in protein diversity are essential. Large sequence alignments from fast-evolving micro-organisms are particularly challenging to dissect using standard tools from phylogenetics and multivariate statistics because biologically relevant functional signals are easily masked by neutral variation and noise. To meet this need, a novel computational method is introduced that is easily executed in parallel using a cluster environment and can handle thousands of sequences with minimal subjective input from the user. The usefulness of this kind of machine learning is demonstrated by applying it to nearly 5000 haemagglutinin sequences of influenza A/H3N2.Antigenic and 3D structural mapping of the results show that the method can recover the major jumps in antigenic phenotype that occurred between 1968 and 2013 and identify specific amino acids associated with these changes. The method is expected to provide a useful tool to uncover patterns of protein evolution. PMID:28348810

  14. Complete genome sequence of Enterococcus mundtii QU 25, an efficient L-(+)-lactic acid-producing bacterium.

    PubMed

    Shiwa, Yuh; Yanase, Hiroaki; Hirose, Yuu; Satomi, Shohei; Araya-Kojima, Tomoko; Watanabe, Satoru; Zendo, Takeshi; Chibazakura, Taku; Shimizu-Kadota, Mariko; Yoshikawa, Hirofumi; Sonomoto, Kenji

    2014-08-01

    Enterococcus mundtii QU 25, a non-dairy bacterial strain of ovine faecal origin, can ferment both cellobiose and xylose to produce l-lactic acid. The use of this strain is highly desirable for economical l-lactate production from renewable biomass substrates. Genome sequence determination is necessary for the genetic improvement of this strain. We report the complete genome sequence of strain QU 25, primarily determined using Pacific Biosciences sequencing technology. The E. mundtii QU 25 genome comprises a 3 022 186-bp single circular chromosome (GC content, 38.6%) and five circular plasmids: pQY182, pQY082, pQY039, pQY024, and pQY003. In all, 2900 protein-coding sequences, 63 tRNA genes, and 6 rRNA operons were predicted in the QU 25 chromosome. Plasmid pQY024 harbours genes for mundticin production. We found that strain QU 25 produces a bacteriocin, suggesting that mundticin-encoded genes on plasmid pQY024 were functional. For lactic acid fermentation, two gene clusters were identified-one involved in the initial metabolism of xylose and uptake of pentose and the second containing genes for the pentose phosphate pathway and uptake of related sugars. This is the first complete genome sequence of an E. mundtii strain. The data provide insights into lactate production in this bacterium and its evolution among enterococci.

  15. Gastropod arginine kinases from Cellana grata and Aplysia kurodai. Isolation and cDNA-derived amino acid sequences.

    PubMed

    Suzuki, T; Inoue, N; Higashi, T; Mizobuchi, R; Sugimura, N; Yokouchi, K; Furukohri, T

    2000-12-01

    Arginine kinase (AK) was isolated from the radular muscle of the gastropod molluscs Cellana grata (subclass Prosobranchia) and Aplysia kurodai (subclass Opisthobranchia), respectively, by ammonium sulfate fractionation, Sephadex G-75 gel filtration and DEAE-ion exchange chromatography. The denatured relative molecular mass values were estimated to be 40 kDa by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The isolated enzyme from Aplysia gave a Km value of 0.6 mM for arginine and a Vmax value of 13 micromole Pi min(-1) mg protein(-1) for the forward reaction. These values are comparable to other molluscan AKs. The cDNAs encoding Cellana and Aplysia AKs were amplified by polymerase chain reaction, and the nucleotide sequences of 1,608 and 1,239 bp, respectively, were determined. The open reading frame for Cellana AK is 1044 nucleotides in length and encodes a protein with 347 amino acid residues, and that for A. kurodai is 1077 nucleotides and 354 residues. The cDNA-derived amino acid sequences were validated by chemical sequencing of internal lysyl endopeptidase peptides. The amino acid sequences of Cellana and Aplysia AKs showed the highest percent identity (66-73%) with those of the abalone Nordotis and turbanshell Battilus belonging to the same class Gastropoda. These AK sequences still have a strong homology (63-71%) with that of the chiton Liolophura (class Polyplacophora), which is believed to be one of the most primitive molluscs. On the other hand, these AK sequences are less homologous (55-57%) with that of the clam Pseudocardium (class Bivalvia), suggesting that the biological position of the class Polyplacophora should be reconsidered.

  16. Isolation and a partial amino acid sequence of insulin from the islet tissue of cod (Gadus callarias)

    PubMed Central

    Grant, P. T.; Reid, K. B. M.

    1968-01-01

    1. Insulin has been isolated by gel filtration and ion-exchange chromatography from extracts of the discrete islet tissue of cod. The final preparation yielded a single band on electrophoresis at two pH values. The biological potency was 11·5 international units/mg. in mouse-convulsion and other assay procedures. 2. Glycine and methionine were shown to be the N-terminal amino acids of the A and B chains respectively. An estimate of the molecular weight together with amino acid analyses indicated that cod insulin, like the bovine hormone, consists of 51 amino acid residues. In contrast, the amino acid composition differs markedly from bovine insulin. 3. Oxidation of insulin with performic acid yielded the A and B peptide chains, which were separated by ion-exchange chromatography. Sequence studies on smaller peptides isolated from enzymic digests or from dilute acetic acid hydrolysates of the two chains have established the sequential order of 14 of the 21 amino acid residues of the A chain and 25 of the 30 amino acid residues of the B chain. PMID:4866431

  17. Oxygen affinity and amino acid sequence of myoglobins from endothermic and ectothermic fish.

    PubMed

    Marcinek, D J; Bonaventura, J; Wittenberg, J B; Block, B A

    2001-04-01

    Myoglobin (Mb) buffers intracellular O2 and facilitates diffusion of O2 through the cell. These functions of Mb will be most effective when intracellular PO2 is near the partial pressure of oxygen at which Mb is half saturated (P50) of the molecule. We test the hypothesis that Mb oxygen affinity has evolved such that it is conserved when adjusted for body temperature among closely related animals. We measure oxygen P50s tonometrically and oxygen dissociation rate constants with stopped flow and generate amino acid sequence from cDNA of Mbs from fish with different body temperatures. P50s for the endothermic bluefin tuna, skipjack tuna, and blue marlin at 20 degrees C were 0.62 +/- 0.02, 0.59 +/- 0.01, 0.58 +/- 0.04 mmHg, respectively, and were significantly lower than those for ectothermic bonito (1.03 +/- 0.07 mmHg) and mackerel (1.39 +/- 0.03 mmHg). Because the oxygen affinity of Mb decreases with increasing temperature, the above differences in oxygen affinity between endothermic and ectothermic fish are reduced when adjusted for the in vivo muscle temperature of the animal. Oxygen dissociation rate constants at 20 degrees C for the endothermic species ranged from 34.1 to 49.3 s(-1), whereas those for mackerel and bonito were 102 and 62 s(-1), respectively. Correlated with the low oxygen affinity and fast dissociation kinetics of mackerel Mb is a substitution of alanine for proline that would likely result in a more flexible mackerel protein.

  18. Molecular cloning, nucleotide sequence, and abscisic acid induction of a suberization-associated highly anionic peroxidase.

    PubMed

    Roberts, E; Kolattukudy, P E

    1989-06-01

    A highly anionic peroxidase induced in suberizing cells was suggested to be the key enzyme involved in polymerization of phenolic monomers to generate the aromatic matrix of suberin. The enzyme encoded by a potato cDNA was found to be highly homologous to the anionic peroxidase induced in suberizing tomato fruit. A tomato genomic library was screened using the potato anionic peroxidase cDNA and one genomic clone was isolated that contained two tandemly oriented anionic peroxidase genes. These genes were sequenced and were 96% and 87% identical to the mRNA for potato anionic peroxidase. Both genes consist of three exons with the relative positions of their two introns being conserved between the two genes. Primer extension analysis showed that only one of the genes is expressed in the periderm of 3 day wound-healed tomato fruits. Southern blot analyses suggested that there are two copies each of the two highly homologous genes per haploid genome in both potato and tomato. Abscisic acid (ABA) induced the accumulation of the anionic peroxidase transcripts in potato and tomato callus tissues. Northern blots showed that peroxidase mRNA was detectable at 2 days and was maximal at 8 days after transfer of potato callus to solid agar media containing 10(-4) M ABA. The transcripts induced by ABA in both potato and tomato callus were identical in size to those induced in wound-healing potato tuber and tomato fruit. The anionic peroxidase peptide was detected in extracts of potato callus grown on the ABA-containing media by western blot analysis. The results support the suggestion that stimulation of suberization by ABA involves the induction of the highly anionic peroxidase.

  19. Complete genome sequence of the probiotic lactic acid bacterium Lactobacillus acidophilus NCFM

    PubMed Central

    Altermann, Eric; Russell, W. Michael; Azcarate-Peril, M. Andrea; Barrangou, Rodolphe; Buck, B. Logan; McAuliffe, Olivia; Souther, Nicole; Dobson, Alleson; Duong, Tri; Callanan, Michael; Lick, Sonja; Hamrick, Alice; Cano, Raul; Klaenhammer, Todd R.

    2005-01-01

    Lactobacillus acidophilus NCFM is a probiotic bacterium that has been produced commercially since 1972. The complete genome is 1,993,564 nt and devoid of plasmids. The average GC content is 34.71% with 1,864 predicted ORFs, of which 72.5% were functionally classified. Nine phage-related integrases were predicted, but no complete prophages were found. However, three unique regions designated as potential autonomous units (PAUs) were identified. These units resemble a unique structure and bear characteristics of both plasmids and phages. Analysis of the three PAUs revealed the presence of two R/M systems and a prophage maintenance system killer protein. A spacers interspersed direct repeat locus containing 32 nearly perfect 29-bp repeats was discovered and may provide a unique molecular signature for this organism. In silico analyses predicted 17 transposase genes and a chromosomal locus for lactacin B, a class II bacteriocin. Several mucus- and fibronectin-binding proteins, implicated in adhesion to human intestinal cells, were also identified. Gene clusters for transport of a diverse group of carbohydrates, including fructooligosaccharides and raffinose, were present and often accompanied by transcriptional regulators of the lacI family. For protein degradation and peptide utilization, the organism encoded 20 putative peptidases, homologs for PrtP and PrtM, and two complete oligopeptide transport systems. Nine two-component regulatory systems were predicted, some associated with determinants implicated in bacteriocin production and acid tolerance. Collectively, these features within the genome sequence of L. acidophilus are likely to contribute to the organisms' gastric survival and promote interactions with the intestinal mucosa and microbiota. PMID:15671160

  20. Comparison of serological and sequence-based methods for typing feline calcivirus isolates from vaccine failures.

    PubMed

    Radford, A D; Dawson, S; Wharmby, C; Ryvar, R; Gaskell, R M

    2000-01-29

    Feline calicivirus (FCV) can be typed by exploiting antigenic differences between isolates or, more recently, by the sequence analysis of a hypervariable region of the virus's capsid gene. These two methods were used to characterise FCV isolates from 20 vaccine failures which occurred after the use of a commercial, live-attenuated vaccine. Using virus neutralisation, the isolates showed a spectrum of relatedness to the vaccine; depending on the criterion adopted for identity, 10 to 40 per cent of them appeared to be similar to the vaccine virus. Using sequence analysis, the isolates fell into one of two categories; 20 per cent had a similar sequence to the vaccine (0-67 to 2-67 per cent distant), and the remainder had a dissimilar sequence (21-3 to 36-0 per cent distant). Sequence analysis identified one cat that appeared to be infected with two distinct FCVs. The serological and sequence-based typing methods gave the same result in 80 to 95 per cent of individual cases, depending on the criterion adopted for serological identity. It is suggested that molecular typing is a more definitive method for characterising the relatedness of FCV isolates.

  1. Sequence Comparison for Non-Enhanced MRA of the Lower Extremity Arteries at 7 Tesla

    PubMed Central

    Johst, Sören; Orzada, Stephan; Fischer, Anja; Schäfer, Lena C.; Nassenstein, Kai; Umutlu, Lale; Lauenstein, Thomas C.; Ladd, Mark E.; Maderwald, Stefan

    2014-01-01

    In this study three sequences for non-contrast-enhanced MRA of the lower extremity arteries at 7T were compared. Cardiac triggering was used with the aim to reduce signal variations in the arteries. Two fast single-shot 2D sequences, a modified Ultrafast Spoiled Gradient Echo (UGRE) sequence and a variant of the Quiescent-Interval Single-Shot (QISS) sequence were triggered via phonocardiogram and compared in volunteer examinations to a non-triggered 2D gradient echo (GRE) sequence. For image acquisition, a 16-channel transmit/receive coil and a manually positionable AngioSURF table were used. To tackle B1 inhomogeneities at 7T, Time-Interleaved Acquisition of Modes (TIAMO) was integrated in GRE and UGRE. To compare the three sequences quantitatively, a vessel-to-background ratio (VBR) was measured in all volunteers and stations. In conclusion, cardiac triggering was able to suppress flow artifacts satisfactorily. The modified UGRE showed only moderate image artifacts. Averaged over all volunteers and stations, GRE reached a VBR of 4.18±0.05, UGRE 5.20±0.06, and QISS 2.72±0.03. Using cardiac triggering and TIAMO imaging technique was essential to perform non-enhanced MRA of the lower extremities vessels at 7T. The modified UGRE performed best, as observed artifacts were only moderate and the highest average VBR was reached. PMID:24454963

  2. Interlaboratory comparison of measurements of acid-volatile sulfide and simultaneously extracted nickel in spiked sediments

    USGS Publications Warehouse

    Brumbaugh, William G.; Hammerschmidt, Chad R.; Zanella, Luciana; Rogevich, Emily; Salata, Gregory; Bolek, Radoslaw

    2011-01-01

    An interlaboratory comparison of acid-volatile sulfide (AVS) and simultaneously extracted nickel (SEM_Ni) measurements of sediments was conducted among five independent laboratories. Relative standard deviations for the seven test samples ranged from 5.6 to 71% (mean = 25%) for AVS and from 5.5 to 15% (mean = 10%) for SEM_Ni. These results are in stark contrast to a recently published study that indicated AVS and SEM analyses were highly variable among laboratories.

  3. Interlaboratory comparison of measurements of acid-volatile sulfide and simultaneously extracted nickel in spiked sediments

    USGS Publications Warehouse

    Brumbaugh, W.G.; Hammerschmidt, C.R.; Zanella, L.; Rogevich, E.; Salata, G.; Bolek, R.

    2011-01-01

    An interlaboratory comparison of acid-volatile sulfide (AVS) and simultaneously extracted nickel (SEM-Ni) measurements of sediments was conducted among five independent laboratories. Relative standard deviations for the seven test samples ranged from 5.6 to 71% (mean=25%) for AVS and from 5.5 to 15% (mean=10%) for SEM-Ni. These results are in stark contrast to a recently published study that indicated AVS and SEM analyses were highly variable among laboratories. ?? 2011 SETAC.

  4. Interlaboratory comparison of measurements of acid-volatile sulfide and simultaneously extracted nickel in spiked sediments

    USGS Publications Warehouse

    Brumbaugh, William G.; Hammerschmidt, Chad R.; Zanella, Luciana; Rogevich, Emily; Salata, Gregory; Bolek, Radoslaw

    2011-01-01

    An interlaboratory comparison of acid-volatile sulfide (AVS) and simultaneously extracted nickel (SEM_Ni) measurements of sediments was conducted among five independent laboratories. Relative standard deviations for the seven test samples ranged from 5.6 to 71% (mean?=?25%) for AVS and from 5.5 to 15% (mean?=?10%) for SEM_Ni. These results are in stark contrast to a recently published study that indicated AVS and SEM analyses were highly variable among laboratories.

  5. Complete nucleotide and derived amino acid sequence of cDNA encoding the mitochondrial uncoupling protein of rat brown adipose tissue: lack of a mitochondrial targeting presequence.

    PubMed Central

    Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B

    1986-01-01

    A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. Images PMID:3012461

  6. Amino acid sequence analysis and identification of mutations in the NS gene of 2009 influenza A (H1N1) isolates from Kenya.

    PubMed

    George, Gachara; Samuel, Symekher; John, Mbithi; James, Simwa; Musa, Ng'ayo; Japheth, Magana; Wallace, Bulimo

    2011-08-01

    Although the important role of the nonstructural (NS) gene of influenza A virus in virulence and replication is well-established, the knowledge about the extent of variation in the NS gene of 2009 influenza A (H1N1) viruses in Kenya and Africa is scanty. This study analysed the NS gene of 31 isolates from Kenya in order to obtain a more detailed knowledge about the genetic variation of NS gene of 2009 influenza A (H1N1) isolates from Kenya. A comparison with the vaccine strain and viruses isolated elsewhere in Africa was also made. The amino acid sequences of the non-structural protein, NS1 of the viruses from this study and the vaccine strain revealed 18 differences. Conversely, the nuclear export protein (NEP) of the isolates in this study had 11 differences from the vaccine strain. Analysis of the NS1 protein showed only one fixed amino acid change I123V which is one of the characteristics of clade 7 viruses. In the NEP, the amino acid at position 77 was the most mutable with 9 (39%) of all mutations seen in this protein. A mutation A115T which is a characteristic of clade 5 viruses was noted in the isolates from Lagos, Nigeria. The study shows a substantial number of mutations in the NS gene that has not been reported elsewhere and gives a glimpse of the evolution of this gene in the region.

  7. Peptide Mass Fingerprinting and N-Terminal Amino Acid Sequencing of Glycosylated Cysteine Protease of Euphorbia nivulia Buch.-Ham.

    PubMed Central

    Badgujar, Shamkant B.; Mahajan, Raghunath T.

    2013-01-01

    A new cysteine protease named Nivulian-II has been purified from the latex of Euphorbia nivulia Buch.-Ham. The apparent molecular mass of Nivulian-II is 43670.846 Da (MALDI TOF/MS). Peptide mass fingerprint analysis revealed peptide matches to Maturase K (Q52ZV1_9MAGN) of Banksia quercifolia. The N-terminal sequence (DFPPNTCCCICC) showed partial homology with those of other cysteine proteinases of biological origin. This is the first paper to characterize a Nivulian-II of E. nivulia latex with respect to amino acid sequencing. PMID:23476742

  8. Binary nucleation in acid-water systems. II. Sulfuric acid-water and a comparison with methanesulfonic acid-water

    NASA Astrophysics Data System (ADS)

    Wyslouzil, B. E.; Seinfeld, J. H.; Flagan, R. C.; Okuyama, K.

    1991-05-01

    This work presents a systematic investigation of binary nucleation rates for sulfuric acid and water and the effect of temperature on these rates at isothermal, subsaturated conditions. The results from nucleation rate measurements for the sulfuric acid (H2SO4) -water system are discussed and compared to those previously presented for methanesulfonic acid (MSA)-water [B. E. Wyslouzil, J. H. Seinfeld, R. C. Flagan, and K. Okuyama, J. Chem. Phys. (submitted)]. Experiments were conducted at relative humidities (Rh) ranging from 0.006acidities (Ra) in the range of 0.04acid molecules in the critical nucleus for both the H2SO4 -water and MSA-water systems.

  9. Isolation of chicken major histocompatibility complex class II (B-L) beta chain sequences: comparison with mammalian beta chains and expression in lymphoid organs.

    PubMed Central

    Bourlet, Y; Béhar, G; Guillemot, F; Fréchin, N; Billault, A; Chaussé, A M; Zoorob, R; Auffray, C

    1988-01-01

    By cross-hybridization in low stringency conditions, using a probe derived from an HLA-DQ beta cDNA clone, we have isolated several chicken genomic DNA clones. These clones were mapped to the major histocompatibility complex (MHC) of the chick (B complex) by virtue of their ability to detect restriction enzyme length polymorphisms between congenic lines of chicken. Evidence was obtained for the presence of at least three B-L beta genes in the chicken genome. The B-L beta genes are transcribed specifically in tissues containing cells of the B lymphocyte and myeloid lineages and expressing the B-L antigens. Exons encoding the beta 1, beta 2 and transmembrane domains of a B-L beta chain have been identified with 63, 66 and 62% similarity with the HLA-DQ beta sequence. This first isolation of an MHC class II gene outside of the mammalian class provides insight into the evolution of MHC genes based on the comparison of avian and mammalian class II beta chain amino acid and nucleotide sequences. Images PMID:2841107

  10. Comparison between phosphoric acid and hydrochloric acid in microabrasion technique for the treatment of dental fluorosis

    PubMed Central

    Bassir, Mahshid Mohammadi; Bagheri, Golnaz

    2013-01-01

    Purpose: To compare the effectiveness of phosphoric acid (H3PO4)-pumice compound with conventional hydrochloric acid (HCl)-pumice compound in treating different severities of dental fluorosis with the microabrasion technique. Materials and Methods: Sixty-seven anterior teeth from seven patients with different severities of dental fluorosis were treated. In each patient, half of the teeth were treated with HCl-pumice compound and the other half with H3PO4-pumice compound (split-mouth design). Both treatment compounds were applied for 30-second periods and treatment continued up to 10 minutes. Before and after treatment, standardized photographs were taken. The photographs were compared by two experienced observers unaware of the modality of treatment. Two indices of aesthetics, improvement in appearance (IA) and degree of stain removal (DSR), were determined according to a visual analog scale. The inter- and intra-correlation coefficients were made; then, statistical analyses were calculated using Mann-Whitney and t-test. Results: There were no significant differences in interobserver evaluation. Improvements in aesthetic indices were observed in all fluorotic teeth by both compounds; however, the mean treatment time with HCl-pumice was significantly lower than H3PO4-pumice. Conclusion: The H3PO4-pumice compound improved aesthetic indices in fluorotic teeth similar to the HCl-pumice compound. PMID:23349575

  11. Comparison of next-generation sequencing and clone-based sequencing in analysis of hepatitis B virus reverse transcriptase quasispecies heterogeneity.

    PubMed

    Gong, Ling; Han, Yue; Chen, Li; Liu, Feng; Hao, Pei; Sheng, Jia; Li, Xin-Hua; Yu, De-Min; Gong, Qi-Ming; Tian, Fei; Guo, Xiao-Kui; Zhang, Xin-Xin

    2013-12-01

    We previously reported that, based on clone-based sequencing (CBS), hepatitis B virus (HBV) heterogeneity within the reverse transcriptase (RT) region was a predictor of antiviral efficacy. Here, by comparing ultradeep pyrosequencing (UDPS), i.e., next-generation sequencing (NGS), with CBS in characterizing the genetic heterogeneity of HBV quasispecies within the RT region, we evaluated the performance of UDPS in the analysis of HBV viral populations. HBV genomic DNA was extracted from serum samples from 31 antiviral treatment-naive patients with chronic hepatitis B. The RT region quasispecies were analyzed in parallel using CBS and UDPS. Characterization of quasispecies heterogeneity was conducted using bioinformatics analysis. Quasispecies complexity values were calculated with the formula Sn = -Σi(pilnpi)/lnN. The number of qualified strains obtained by UDPS was much larger than that obtained by CBS (P < 0.001). Pearson analysis showed that there was a positive correlation of quasispecies complexity values at the nucleotide level for the two methods (P < 0.05), while the complexity value derived from UDPS data was higher than that derived from CBS data (P < 0.001). Study of the prevalences of variations within the RT region showed that CBS detected an average of 9.7 ± 1.1 amino acid substitutions/sample and UDPS detected an average of 16.2 ± 1.4 amino acid substitutions/sample. The phylogenetic analysis based on UDPS data showed more genetic entities than did that based on CBS data. Viral heterogeneity determination by the UDPS technique is more sensitive and efficient in terms of low-abundance variation detection and quasispecies simulation than that by the CBS method, although imperfect, and thus sheds light on the future clinical application of NGS in HBV quasispecies studies.

  12. Comparison of hepatocellular carcinoma miRNA expression profiling as evaluated by next generation sequencing and microarray.

    PubMed

    Murakami, Yoshiki; Tanahashi, Toshihito; Okada, Rina; Toyoda, Hidenori; Kumada, Takashi; Enomoto, Masaru; Tamori, Akihiro; Kawada, Norifumi; Taguchi, Y-h; Azuma, Takeshi

    2014-01-01

    MicroRNA (miRNA) expression profiling has proven useful in diagnosing and understanding the development and progression of several diseases. Microarray is the standard method for analyzing miRNA expression profiles; however, it has several disadvantages, including its limited detection of miRNAs. In recent years, advances in genome sequencing have led to the development of next-generation sequencing (NGS) technologies, which significantly advance genome sequencing speed and discovery. In this study, we compared the expression profiles obtained by next generation sequencing (NGS) with the profiles created using microarray to assess if NGS could produce a more accurate and complete miRNA profile. Total RNA from 14 hepatocellular carcinoma tumors (HCC) and 6 matched non-tumor control tissues were sequenced with Illumina MiSeq 50-bp single-end reads. Micro RNA expression profiles were estimated using miRDeep2 software. As a comparison, miRNA expression profiles for 11 out of 14 HCCs were also established by microarray (Agilent human microRNA microarray). The average total sequencing exceeded 2.2 million reads per sample and of those reads, approximately 57% mapped to the human genome. The average correlation for miRNA expression between microarray and NGS and subtraction were 0.613 and 0.587, respectively, while miRNA expression between technical replicates was 0.976. The diagnostic accuracy of HCC, p-value, and AUC were 90.0%, 7.22×10(-4), and 0.92, respectively. In summary, NGS created an miRNA expression profile that was reproducible and comparable to that produced by microarray. Moreover, NGS discovered novel miRNAs that were otherwise undetectable by microarray. We believe that miRNA expression profiling by NGS can be a useful diagnostic tool applicable to multiple fields of medicine.

  13. Next-Generation Sequencing of the Bacterial 16S rRNA Gene for Forensic Soil Comparison: A Feasibility Study.

    PubMed

    Jesmok, Ellen M; Hopkins, James M; Foran, David R

    2016-05-01

    Soil has the potential to be valuable forensic evidence linking a person or item to a crime scene; however, there is no established soil individualization technique. In this study, the utility of soil bacterial profiling via next-generation sequencing of the 16S rRNA gene was examined for associating soils with their place of origin. Soil samples were collected from ten diverse and nine similar habitats over time, and within three habitats at various horizontal and vertical distances. Bacterial profiles were analyzed using four methods: abundance charts and nonmetric multidimensional scaling provided simplification and visualization of the massive datasets, potentially aiding in expert testimony, while analysis of similarities and k-nearest neighbor offered objective statistical comparisons. The vast majority of soil bacterial profiles (95.4%) were classified to their location of origin, highlighting the potential of bacterial profiling via next-generation sequencing for the forensic analysis of soil samples.

  14. Phylogenetic relationships of Salmonella based on DNA sequence comparison of atpD encoding the beta subunit of ATP synthase.

    PubMed

    Christensen, H; Olsen, J E

    1998-04-01

    DNA sequences covering 57% of atpD encoding the beta subunit of ATP synthase were determined for 16 strains of Salmonella enterica, two strains of S. bongori, and one strain each of Citrobacter freundii and Yersinia enterocolitica, and comparison was made with the published Escherichia coli and Enterobacter aerogenes sequences. The phylogenetic tree based on maximum-likelihood analysis showed separation of the subspecies of S. enterica except for two serotypes of subspecies II which were unsupported by a common node. The two serotypes of S. bongori were separated from S. enterica and related to the serotypes of subspecies II. A tight relationship was found between S. enterica subspecies IIIa consisting of monophasic serotypes and subspecies IIIb consisting of diphasic serotypes. This is in conflict with results obtained for most other housekeeping genes and the 23S rRNA gene separating mono- from diphasic subspecies.

  15. DNA Sequence and Expression Variation of Hop (Humulus lupulus) Valerophenone Synthase (VPS), a Key Gene in Bitter Acid Biosynthesis

    PubMed Central

    Castro, Consuelo B.; Whittock, Lucy D.; Whittock, Simon P.; Leggett, Grey; Koutoulis, Anthony

    2008-01-01

    Background The hop plant (Humulus lupulus) is a source of many secondary metabolites, with bitter acids essential in the beer brewing industry and others having potential applications for human health. This study investigated variation in DNA sequence and gene expression of valerophenone synthase (VPS), a key gene in the bitter acid biosynthesis pathway of hop. Methods Sequence variation was studied in 12 varieties, and expression was analysed in four of the 12 varieties in a series across the development of the hop cone. Results Nine single nucleotide polymorphisms (SNPs) were detected in VPS, seven of which were synonymous. The two non-synonymous polymorphisms did not appear to be related to typical bitter acid profiles of the varieties studied. However, real-time quantitative reverse-transcription polymerase chain reaction (qRT-PCR) analysis of VPS expression during hop cone development showed a clear link with the bitter acid content. The highest levels of VPS expression were observed in two triploid varieties, ‘Symphony’ and ‘Ember’, which typically have high bitter acid levels. Conclusions In all hop varieties studied, VPS expression was lowest in the leaves and an increase in expression was consistently observed during the early stages of cone development. PMID:18519445

  16. A knowledge engineering approach to recognizing and extracting sequences of nucleic acids from scientific literature.

    PubMed

    García-Remesal, Miguel; Maojo, Victor; Crespo, José

    2010-01-01

    In this paper we present a knowledge engineering approach to automatically recognize and extract genetic sequences from scientific articles. To carry out this task, we use a preliminary recognizer based on a finite state machine to extract all candidate DNA/RNA sequences. The latter are then fed into a knowledge-based system that automatically discards false positives and refines noisy and incorrectly merged sequences. We created the knowledge base by manually analyzing different manuscripts containing genetic sequences. Our approach was evaluated using a test set of 211 full-text articles in PDF format containing 3134 genetic sequences. For such set, we achieved 87.76% precision and 97.70% recall respectively. This method can facilitate different research tasks. These include text mining, information extraction, and information retrieval research dealing with large collections of documents containing genetic sequences.

  17. Zucchini yellow mosaic virus: biological properties, detection procedures and comparison of coat protein gene sequences.

    PubMed

    Coutts, B A; Kehoe, M A; Webster, C G; Wylie, S J; Jones, R A C

    2011-12-01

    Between 2006 and 2010, 5324 samples from at least 34 weed, two cultivated legume and 11 native species were collected from three cucurbit-growing areas in tropical or subtropical Western Australia. Two new alternative hosts of zucchini yellow mosaic virus (ZYMV) were identified, the Australian native cucurbit Cucumis maderaspatanus, and the naturalised legume species Rhyncosia minima. Low-level (0.7%) seed transmission of ZYMV was found in seedlings grown from seed collected from zucchini (Cucurbita pepo) fruit infected with isolate Cvn-1. Seed transmission was absent in >9500 pumpkin (C. maxima and C. moschata) seedlings from fruit infected with isolate Knx-1. Leaf samples from symptomatic cucurbit plants collected from fields in five cucurbit-growing areas in four Australian states were tested for the presence of ZYMV. When 42 complete coat protein (CP) nucleotide (nt) sequences from the new ZYMV isolates obtained were compared to those of 101 complete CP nt sequences from five other continents, phylogenetic analysis of the 143 ZYMV sequences revealed three distinct groups (A, B and C), with four subgroups in A (I-IV) and two in B (I-II). The new Australian sequences grouped according to collection location, fitting within A-I, A-II and B-II. The 16 new sequences from one isolated location in tropical northern Western Australia all grouped into subgroup B-II, which contained no other isolates. In contrast, the three sequences from the Northern Territory fitted into A-II with 94.6-99.0% nt identities with isolates from the United States, Iran, China and Japan. The 23 new sequences from the central west coast and two east coast locations all fitted into A-I, with 95.9-98.9% nt identities to sequences from Europe and Japan. These findings suggest that (i) there have been at least three separate ZYMV introductions into Australia and (ii) there are few changes to local isolate CP sequences following their establishment in remote growing areas. Isolates from A-I and B

  18. Cloning, sequence, and developmental expression of a type 5, tartrate-resistant, acid phosphatase of rat bone.

    PubMed

    Ek-Rylander, B; Bill, P; Norgård, M; Nilsson, S; Andersson, G

    1991-12-25

    Tartrate-resistant acid phosphatase (TRAP) is a characteristic constituent of osteoclasts and some mononuclear preosteoclasts and, therefore, used as a histochemical and biochemical marker for osteoclasts and bone resorption. We now report the isolation of a 1397-base pair (bp) full-length TRAP/tartrate-resistant acid ATPase (TrATPase) cDNA clone from a neonatal rat calvaria lambda gt11 cDNA library. The cDNA clone consists of a 92-bp untranslated 5'-flank, an open reading frame of 981 bp and a 324-bp untranslated 3'-poly(A)-containing region. The deduced protein sequence of 327 amino acids contains a putative cleavable signal sequence of 21 amino acids. The mature polypeptide of 306 amino acids has a calculated Mr of 34,350 Da and a pI of 9.18, and it contains two potential N-glycosylation sites and the lysosomal targeting sequence DKRFQ. At the protein level, the sequence displays 89-94% homology to TRAP enzymes from human placenta, beef spleen, and uteroferrin and identity to the N terminus of purified rat bone TRAP/TrATPase. An N-terminal amino acid segment is strikingly homologous to the corresponding region in lysosomal and prostatic acid phosphatases. The cDNA recognized a 1.5-kilobase mRNA in long bones and calvaria, and in vitro translation using, as template, mRNA transcribed from the full-length insert yielded an immunoprecipitated product of 34 kDa. In neonatal rats, TRAP/TrATPase mRNA was highly expressed in skeletal tissues, with much lower (less than 10%) levels detected in spleen, thymus, liver, skin, brain, kidney, brain, lung, and heart. In situ hybridization demonstrated specific labeling of osteoclasts at endostal surfaces and bone trabeculae of long bones. Thus, despite the apparent similarity of this osteoclastic TRAP/TrATPase with type 5, tartrate-resistant and purple, acid phosphatases expressed in other mammalian tissues, this gene appears to be preferentially expressed at skeletal sites.

  19. Snake venoms. The amino-acid sequence of protein S5C4 from Dendroaspis jamesoni kaimosae (Jameson's mamba) venom.

    PubMed

    Joubert, F J; Strydom, A J; Taljaard, N

    1978-06-01

    A major component (S5C4) was purified from Jameson's mamba by gel filtration on Sephadex G-50 and by ion-exchange chromotography on CM-cellulose. Protein S5C4 contains 60 amino acid residues and is cross-linked by four intrachain disulphide bridges. The complete primary structure of the protein has been elucidated. The toxicities, the immunochemical properties, the sequence and the invariant amino acid residues of protein S5C4 resemble subgroup II of the angusticeps-type proteins.

  20. A comparison of the VP7 gene sequences of human and bovine rotaviruses.

    PubMed

    Gerna, G; Steele, A D; Hoshino, Y; Sereno, M; Garcia, D; Sarasini, A; Flores, J

    1994-07-01

    The sequences of the gene encoding VP7 (the major outer capsid protein) from one bovine and three human rotavirus strains were determined because of their unusual VP7 specificities. Two of the human strains (PA 169 and PA 151) had VP7 serotype 6 specificity whereas the two other strains, recovered from a child (HAL 1166) and a calf (678) belonged to VP7 serotype 8. The serotype 8 strains exhibited a high degree of sequence conservation when compared with each other and with other serotype 8 strains previously sequenced. The serotype 6 human strains shared a greater degree of sequence similarity with previously reported serotype 6 bovine strains than with other rotavirus serotypes; however the degree of sequence similarity among PA 169, PA 151 and the bovine strains was lower than had been previously reported for strains belonging to the same serotype. The demonstration of rotavirus serotypes that are shared between human and animal species supports the concept that interspecies transmission occurs and may play a role in rotavirus evolution.

  1. Comparison of sequences formed in Marine sabkha (subaerial) and salina (Subaqueous) settings-modern and ancient

    SciTech Connect

    Warren, J.K.; St. Kendall, C.G.

    1985-06-01

    Marine evaporites occurring in modern subaqueous (salina) settings and subaerial (sabkha) settings are different. Subaqueous Holocene evaporites occur as shoalingupward lacustrine sequences up to 10 m thick. They are evaporite dominated and are composed primarily of bottom-nucleated crystals that may be deposited as massive, laminated, or rippled units. Each coastal lake is dominated by laminated evaporites with subordinate carbonate sediments. In plan view, they show a well-developed bull's-eye pattern with a sulfate center and a carbonate rim. In contrast, subaerial (sabkha) evaporites occur as part of a laterally prograding, shoaling-upward, peritidal sequence in which the supratidal unit is usually no more than 1 m thick. Sabkha sequences are matrix dominated, not evaporite dominated, with the bulk of the sulfate phase occurring as diagenetic nodules, enteroliths, or diapirlike structures. These sulfates were formed during syndepositional diagenesis by replacement and displacement processes. The various facies of the sequence tend to accumulate in belts parallel with the shoreline. Relative to the sea level or the brine level, sabkhas tend to form over paleotopographic highs whereas salinas tend to occur in paleotopographic lows. Some of the characteristics that distinguish Holocene subaerial and subaqueous evaporite sequences can be used to do the same for similar ancient facies, even when gypsum has been converted to nodular anhydrite. The distinction is important for it can be used by explorationists in the oil industry to define the paleotopography of the associated underlying porous and nonporous carbonates.

  2. Identification of novel rice low phytic acid mutations via TILLING by sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Phytic acid (myo-inositol-1,2,3,4,5,6-hexakisphosphate or InsP6) accounts for 75-85% of the total phosphorus in seeds. Low phytic acid (lpa) mutants exhibit decreases in seed InsP6 with corresponding increases in inorganic P which, unlike phytic acid P, is readily utilized by humans and monogastric ...

  3. Comparison of a neonatal versus general-purpose amino acid formulation in preterm neonates.

    PubMed

    Adamkin, D D; Radmacher, P; Rosen, P

    1995-01-01

    Twenty preterm infants were randomly assigned to receive either a general-purpose intravenous amino acid solution or a neonatal formula at 1.5 gm/kg/day with 50 nonprotein energy calories for 1 week. Both groups demonstrated similar rates of weight gain and similar liver function test results, but comparison of changes from baseline amino acid values disclosed significant intergroup differences for levels of plasma glycine, taurine, valine, and methionine after 7 days of infusion. Comparison of plasma aminograms of study patients with those of normally growing, enterally fed preterm infants showed that infants who received the neonatal solution had similar amino acid profiles, with the exception of low plasma concentrations of tyrosine and elevated concentrations of threonine. The general-purpose solution, as compared with findings in historic enteral controls, produced low plasma valine, tyrosine, and taurine levels, and elevated levels of glycine and methionine. Advantages with the neonatal amino acid solution include reductions in plasma glycine and methionine levels, plus the provision of taurine. Providing tyrosine remains a problem.

  4. Comparison of sodium acid sulfate to citric acid to inhibit browning of fresh-cut potatoes.

    PubMed

    Calder, Beth L; Kash, Emily A; Davis-Dentici, Katherine; Bushway, Alfred A

    2011-04-01

    Sodium acid sulfate (SAS) dip treatments were evaluated against a distilled water control and citric acid (CA) to compare its effectiveness in reducing enzymatic browning of raw, French-fry cut potatoes. Two separate studies were conducted with dip concentrations ranging from 0%, 1%, and 3% in experiment 1 to 0%, 2%, and 2.5% in experiment 2 to determine optimal dip concentrations. Russet Burbank potatoes were peeled, sliced, and dipped for 1 min and stored at 3 °C. Color, texture, fry surface pH, and microbiological analyses were conducted on days 0, 7, and 14. The 3% SAS- and CA-treated samples had significantly (p<0.0001) lower pH levels on fry surfaces than all other treatments. Both acidulants had significantly (p≤0.05) lower aerobic plate counts compared to controls in both studies by day 7. However, SAS appeared to be the most effective at the 3% level in maintaining a light fry color up to day 14 and had the highest L-values than all other treatments. The 3% SAS-treated fry slices appeared to have the least change in textural properties over storage time, having a significantly (p=0.0002) higher force value (kg force [kgf]) than the other treatments during experiment 1, without any signs of case-hardening that appeared in the control and CA-treated samples. SAS was just as comparable to CA in reducing surface fry pH and also lowering microbial counts over storage time. According to the results, SAS may be another viable acidulant to be utilized in the fresh-cut fruit and vegetable industry.

  5. A comparison of virus genome sequences with their host silkworm, Bombyx mori.

    PubMed

    Tang, Xu-Dong; Yue, Ya-Jie; Wang, Wei; Li, Nan; Shen, Zhong-Yuan

    2016-01-15

    With the recent availability of the genomes of many viruses and the silkworm, Bombyx mori, as well as a variety of Basic Local Alignment Search Tool (BLAST) programs, a new opportunity to gain insight into the interaction of viruses with the silkworm is possible. This study aims to determine the possible existence of sequence identities between the genomes of viruses and the silkworm and attempts to explain this phenomenon. BLAST searches of the genomes of viruses against the silkworm genome were performed using the resources of the National Center for Biotechnology Information. All studied viruses contained variable numbers of short regions with sequence identity to the genome of the silkworm. The short regions of sequence identity in the genome of the silkworm may be derived from the genomes of viruses in the long history of silkworm-virus interaction. This study is the first to compare these genomes, and may contribute to research on the interaction between viruses and the silkworm.

  6. Comparison of whole genome amplification techniques for human single cell exome sequencing

    PubMed Central

    Borgström, Erik; Paterlini, Marta; Mold, Jeff E.; Frisen, Jonas; Lundeberg, Joakim

    2017-01-01

    Background Whole genome amplification (WGA) is currently a prerequisite for single cell whole genome or exome sequencing. Depending on the method used the rate of artifact formation, allelic dropout and sequence coverage over the genome may differ significantly. Results The largest difference between the evaluated protocols was observed when analyzing the target coverage and read depth distribution. These differences also had impact on the downstream variant calling. Conclusively, the products from the AMPLI1 and MALBAC kits were shown to be most similar to the bulk samples and are therefore recommended for WGA of single cells. Discussion In this study four commercial kits for WGA (AMPLI1, MALBAC, Repli-G and PicoPlex) were used to amplify human single cells. The WGA products were exome sequenced together with non-amplified bulk samples from the same source. The resulting data was evaluated in terms of genomic coverage, allelic dropout and SNP calling. PMID:28207771

  7. Environmental comparison of biobased chemicals from glutamic acid with their petrochemical equivalents.

    PubMed

    Lammens, Tijs M; Potting, José; Sanders, Johan P M; De Boer, Imke J M

    2011-10-01

    Glutamic acid is an important constituent of waste streams from biofuels production. It is an interesting starting material for the synthesis of biobased chemicals, thereby decreasing the dependency on fossil fuels. The objective of this paper was to compare the environmental impact of four biobased chemicals from glutamic acid with their petrochemical equivalents, that is, N-methylpyrrolidone (NMP), N-vinylpyrrolidone (NVP), acrylonitrile (ACN), and succinonitrile (SCN). A consequential life cycle assessment was performed, wherein glutamic acid was obtained from sugar beet vinasse. The removed glutamic acid was substituted with cane molasses and ureum. The comparison between the four biobased and petrochemical products showed that for NMP and NVP the biobased version had less impact on the environment, while for ACN and SCN the petrochemical version had less impact on the environment. For the latter two an optimized scenario was computed, which showed that the process for SCN can be improved to a level at which it can compete with the petrochemical process. For biobased ACN large improvements are required to make it competitive with its petrochemical equivalent. The results of this LCA and the research preceding it also show that glutamic acid can be a building block for a variety of molecules that are currently produced from petrochemical resources. Currently, most methods to produce biobased products are biotechnological processes based on sugar, but this paper demonstrates that the use of amino acids from low-value byproducts can certainly be a method as well.

  8. Complete amino acid sequence of an acidic, cardiotoxic phospholipase A2 from the venom of Ophiophagus hannah (King Cobra): a novel cobra venom enzyme with "pancreatic loop".

    PubMed

    Huang, M Z; Gopalakrishnakone, P; Chung, M C; Kini, R M

    1997-02-15

    A phospholipase A2 (OHV A-PLA2) from the venom of Ophiophagus hannah (King cobra) is an acidic protein exhibiting cardiotoxicity, myotoxicity, and antiplatelet activity. The complete amino acid sequence of OHV A-PLA2 has been determined using a combination of Edman degradation and mass spectrometric techniques. OHV A-PLA2 is composed of a single chain of 124 amino acid residues with 14 cysteines and a calculated molecular weight of 13719 Da. It contains the loop of residues (62-66) found in pancreatic PLA2s and hence belongs to class IB enzymes. This pancreatic loop is between two proline residues (Pro 59 and Pro 68) and contains several hydrophilic amino acids (Ser and Asp). This region has high degree of conformational flexibility and is on the surface of the molecule, and hence it may be a potential protein-protein interaction site. A relatively low sequence homology is found between OHV A-PLA2 and other known cardiotoxic PLA2s, and hence a contiguous segment could not be identified as a site responsible for the cardiotoxic activity.

  9. Snake venoms. The amino-acid sequence of trypsin inhibitor E of Dendroaspis polylepis polylepis (Black Mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1978-06-01

    Trypsin inhibitor E from black mamba venom comprises 59 amino acid residues in a single polypeptide chain, cross-linked by three intrachain disulphide bridges. The complete primary structure of inhibitor E was elucidated. The sequence is homologous with trypsin inhibitors from different sources. Unique among this homologous series of proteinase inhibitors, inhibitor E has an affinity for transition metal ions, exemplified here by Cu2 and Co2+.

  10. The amino acid sequence of the zinc-requiring beta-lactamase II from the bacterium Bacillus cereus 569.

    PubMed

    Ambler, R P; Daniel, M; Fleming, J; Hermoso, J M; Pang, C; Waley, S G

    1985-09-23

    The amino acid sequence of the zinc-requiring beta-lactamase II from Bacillus cereus strain 569 has been determined. It consists of a single polypeptide chain of 227 residues. It is the only example so far fully characterized of a class B beta-lactamase, and is structurally and mechanistically distinct from both the widely distributed class A beta-lactamases (such as the Escherichia coli RTEM enzyme) and from the chromosomally encoded class C enzymes from Gram-negative bacteria.

  11. A simple ligation-based method to increase the information density in sequencing reactions used to deconvolute nucleic acid selections

    PubMed Central

    Childs-Disney, Jessica L.; Disney, Matthew D.

    2008-01-01

    Herein, a method is described to increase the information density of sequencing experiments used to deconvolute nucleic acid selections. The method is facile and should be applicable to any selection experiment. A critical feature of this method is the use of biotinylated primers to amplify and encode a BamHI restriction site on both ends of a PCR product. After amplification, the PCR reaction is captured onto streptavidin resin, washed, and digested directly on the resin. Resin-based digestion affords clean product that is devoid of partially digested products and unincorporated PCR primers. The product's complementary ends are annealed and ligated together with T4 DNA ligase. Analysis of ligation products shows formation of concatemers of different length and little detectable monomer. Sequencing results produced data that routinely contained three to four copies of the library. This method allows for more efficient formulation of structure-activity relationships since multiple active sequences are identified from a single clone. PMID:18065718

  12. The isolation, purification and amino-acid sequence of insulin from the teleost fish Cottus scorpius (daddy sculpin).

    PubMed

    Cutfield, J F; Cutfield, S M; Carne, A; Emdin, S O; Falkmer, S

    1986-07-01

    Insulin from the principal islets of the teleost fish, Cottus scorpius (daddy sculpin), has been isolated and sequenced. Purification involved acid/alcohol extraction, gel filtration, and reverse-phase high-performance liquid chromatography to yield nearly 1 mg pure insulin/g wet weight islet tissue. Biological potency was estimated as 40% compared to porcine insulin. The sculpin insulin crystallised in the absence of zinc ions although zinc is known to be present in the islets in significant amounts. Two other hormones, glucagon and pancreatic polypeptide, were copurified with the insulin, and an N-terminal sequence for pancreatic polypeptide was determined. The primary structure of sculpin insulin shows a number of sequence changes unique so far amongst teleost fish. These changes occur at A14 (Arg), A15 (Val), and B2 (Asp). The B chain contains 29 amino acids and there is no N-terminal extension as seen with several other fish. Presumably as a result of the amino acid substitutions, sculpin insulin does not readily form crystals containing zinc-insulin hexamers, despite the presence of the coordinating B10 His.

  13. Sequence of the canine herpesvirus thymidine kinase gene: taxon-preferred amino acid residues in the alphaherpesviral thymidine kinases.

    PubMed

    Rémond, M; Sheldrick, P; Lebreton, F; Foulon, T

    1995-12-01

    Multiple sequence alignments of evolutionarily related proteins are finding increasing use as indicators of critical amino acid residues necessary for structural stability or involved in functional domains responsible for catalytic activities. In the past, a number of alignments have provided such information for the herpesviral thymidine kinases, for which three-dimensional structures are not yet available. We have sequenced the thymidine kinase gene of a canine herpesvirus, and with a multiple alignment have identified amino acids preferentially conserved in either of two taxons, the genera Varicellovirus and Simplexvirus, of the subfamily Alphaherpesvirinae. Since some regions of the thymidine kinases show otherwise elevated levels of substitutional tolerance, these conserved amino acids are candidates for critical residues which have become fixed through selection during the evolutionary divergence of these enzymes. Several pairs with distinctive patterns of distribution among the various viruses occur in or near highly conserved sequence motifs previously proposed to form the catalytic site, and we speculate that they may represent interacting, co-ordinately variable residues.

  14. The amino acid sequence of a carbohydrate-containing immunoglobulin-light-chain-type amyloid-fibril protein.

    PubMed Central

    Tveteraas, T; Sletten, K; Westermark, P

    1985-01-01

    The amino acid sequence of an amyloid-fibril protein Es492 of immunoglobulin-lambda-light-chain origin (AL) was elucidated. The amyloid fibrils were obtained from the spleen of a patient who died from systemic amyloidosis. The amino acid sequence was elucidated from structural studies of peptides derived from digestion of the protein with trypsin, thermolysin, chymotrypsin and Staphylococcus aureus V8 proteinase and from cleavage of the protein with CNBr and BNPS-skatole. A heterogeneity in the length of the polypeptide was seen in the C-terminal region. The protein was by sequence homology to other lambda-chains shown to be of the V lambda II subgroup. Although an extensive homology was seen, some amino acid residues in positions 26, 31, 32, 40, 44, 93, 97, 98 and 99 have not previously been reported in these positions of V lambda II proteins. The significance of these residues in the fibril formation is unclear. The protein was found to contain carbohydrate, with glycosylation sites in two of the hypervariable regions. PMID:3936482

  15. Coronavirus genome: prediction of putative functional domains in the non-structural polyprotein by comparative amino acid sequence analysis.

    PubMed Central

    Gorbalenya, A E; Koonin, E V; Donchenko, A P; Blinov, V M

    1989-01-01

    Amino acid sequences of 2 giant non-structural polyproteins (F1 and F2) of infectious bronchitis virus (IBV), a member of Coronaviridae, were compared, by computer-assisted methods, to sequences of a number of other positive strand RNA viral and cellular proteins. By this approach, juxtaposed putative RNA-dependent RNA polymerase, nucleic acid binding ("finger"-like) and RNA helicase domains were identified in F2. Together, these domains might constitute the core of the protein complex involved in the primer-dependent transcription, replication and recombination of coronaviruses. In F1, two cysteine protease-like domains and a growth factor-like one were revealed. One of the putative proteases of IBV is similar to 3C proteases of picornaviruses and related enzymes of como- nepo- and potyviruses. Search of IBV F1 and F2 sequences for sites similar to those cleaved by the latter proteases and intercomparison of the surrounding sequence stretches revealed 13 dipeptides Q/S(G) which are probably cleaved by the coronavirus 3C-like protease. Based on these observations, a partial tentative scheme for the functional organization and expression strategy of the non-structural polyproteins of IBV was proposed. It implies that, despite the general similarity to other positive strand RNA viruses, and particularly to potyviruses, coronaviruses possess a number of unique structural and functional features. PMID:2526320

  16. Recognition of 5'-YpG-3' sequences by coupled stacking/hydrogen bonding interactions with amino acid residues.

    PubMed

    Lamoureux, Jason S; Maynes, Jason T; Glover, J N Mark

    2004-01-09

    The combined biochemical and structural study of hundreds of protein-DNA complexes has indicated that sequence-specific interactions are mediated by two mechanisms termed direct and indirect readout. Direct readout involves direct interactions between the protein and base-specific atoms exposed in the major and minor grooves of DNA. For indirect readout, the protein recognizes DNA by sensing conformational variations in the structure dependent on nucleotide sequence, typically through interactions with the phosphodiester backbone. Based on our recent structure of Ndt80 bound to DNA in conjunction with a search of the existing PDB database, we propose a new method of sequence-specific recognition that utilizes both direct and indirect readout. In this mode, a single amino acid side-chain recognizes two consecutive base-pairs. The 3'-base is recognized by canonical direct readout, while the 5'-base is recognized through a variation of indirect readout, whereby the conformational flexibility of the particular dinucleotide step, namely a 5'-pyrimidine-purine-3' step, facilitates its recognition by the amino acid via cation-pi interactions. In most cases, this mode of DNA recognition helps explain the sequence specificity of the protein for its target DNA.

  17. Comparison of Ribotyping and sequence-based typing for discriminating among isolates of Bordetella bronchiseptica

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Aims: Our goal was to compare the discriminatory power of PvuII ribotyping and MLST using a single set of diverse Bordetella bronchiseptica isolates and to determine whether subtyping based on repeat region sequences of the pertactin gene (prn) provides additional resolution. Methods and Results: ...

  18. Next-Generation Sequencing of Aquatic Oligochaetes: Comparison of Experimental Communities.

    PubMed

    Vivien, Régis; Lejzerowicz, Franck; Pawlowski, Jan

    2016-01-01

    Aquatic oligochaetes are a common group of freshwater benthic invertebrates known to be very sensitive to environmental changes and currently used as bioindicators in some countries. However, more extensive application of oligochaetes for assessing the ecological quality of sediments in watercourses and lakes would require overcoming the difficulties related to morphology-based identification of oligochaetes species. This study tested the Next-Generation Sequencing (NGS) of a standard cytochrome c oxydase I (COI) barcode as a tool for the rapid assessment of oligochaete diversity in environmental samples, based on mixed specimen samples. To know the composition of each sample we Sanger sequenced every specimen present in these samples. Our study showed that a large majority of OTUs (Operational Taxonomic Unit) could be detected by NGS analyses. We also observed congruence between the NGS and specimen abundance data for several but not all OTUs. Because the differences in sequence abundance data were consistent across samples, we exploited these variations to empirically design correction factors. We showed that such factors increased the congruence between the values of oligochaetes-based indices inferred from the NGS and the Sanger-sequenced specimen data. The validation of these correction factors by further experimental studies will be needed for the adaptation and use of NGS technology in biomonitoring studies based on oligochaete communities.

  19. Genomic Analysis of a Marine Bacterium: Bioinformatics for Comparison, Evaluation, and Interpretation of DNA Sequences

    PubMed Central

    Khobragade, Chandrahasya N.

    2016-01-01

    A total of five highly related strains of an unidentified marine bacterium were analyzed through their short genome sequences (AM260709–AM260713). Genome-to-Genome Distance (GGDC) showed high similarity to Pseudoalteromonas haloplanktis (X67024). The generated unique Quick Response (QR) codes indicated no identity to other microbial species or gene sequences. Chaos Game Representation (CGR) showed the number of bases concentrated in the area. Guanine residues were highest in number followed by cytosine. Frequency of Chaos Game Representation (FCGR) indicated that CC and GG blocks have higher frequency in the sequence from the evaluated marine bacterium strains. Maximum GC content for the marine bacterium strains ranged 53-54%. The use of QR codes, CGR, FCGR, and GC dataset helped in identifying and interpreting short genome sequences from specific isolates. A phylogenetic tree was constructed with the bootstrap test (1000 replicates) using MEGA6 software. Principal Component Analysis (PCA) was carried out using EMBL-EBI MUSCLE program. Thus, generated genomic data are of great assistance for hierarchical classification in Bacterial Systematics which combined with phenotypic features represents a basic procedure for a polyphasic approach on unambiguous bacterial isolate taxonomic classification. PMID:27882328

  20. The 5'-flanking regions of three pea legumin genes: comparison of the DNA sequences.

    PubMed Central

    Lycett, G W; Croy, R R; Shirsat, A H; Richards, D M; Boulter, D

    1985-01-01

    Approximately 1200 nucleotides of sequence data from the promoter and 5'-flanking regions of each of three pea (Pisum sativum L.) legumin genes (legA, legB and legC) are presented. The promoter regions of all three genes were found to be identical including the "TATA box", and "CAAT box', and sequences showing homology to the SV40 enhancers. The legA sequence begins to diverge from the others about 300bp from the start codon, whereas the other two genes remain identical for another 550bp. The regions of partial homology exhibit deletions or insertions and some short, comparatively well conserved sequences. The significance of these features is discussed in terms of evolutionary mechanisms and their possible functional roles. The legC gene contains a region that may potentially form either of two mutually exclusive stem-loop structures, one of which has a stem 42bp long, which suggests that it could be fairly stable. We suggest that a mechanism of switching between such alternative structures may play some role in gene control or may represent the insertion of a transposable element. PMID:2997721

  1. Comparison of Computer Vision and Photogrammetric Approaches for Epipolar Resampling of Image Sequence

    PubMed Central

    Kim, Jae-In; Kim, Taejung

    2016-01-01

    Epipolar resampling is the procedure of eliminating vertical disparity between stereo images. Due to its importance, many methods have been developed in the computer vision and photogrammetry field. However, we argue that epipolar resampling of image sequences, instead of a single pair, has not been studied thoroughly. In this paper, we compare epipolar resampling methods developed in both fields for handling image sequences. Firstly we briefly review the uncalibrated and calibrated epipolar resampling methods developed in computer vision and photogrammetric epipolar resampling methods. While it is well known that epipolar resampling methods developed in computer vision and in photogrammetry are mathematically identical, we also point out differences in parameter estimation between them. Secondly, we tested representative resampling methods in both fields and performed an analysis. We showed that for epipolar resampling of a single image pair all uncalibrated and photogrammetric methods tested could be used. More importantly, we also showed that, for image sequences, all methods tested, except the photogrammetric Bayesian method, showed significant variations in epipolar resampling performance. Our results indicate that the Bayesian method is favorable for epipolar resampling of image sequences. PMID:27011186

  2. Comparison of Synchronization Techniques for the AFIT Direct Sequence Spread Spectrum System

    DTIC Science & Technology

    1992-04-21

    13 Sklar , Bernard . Digital Communications - Fundamentals and Applications. Englewood Cliffs, N.J.: Prentice Hall, 1988. 14. Stephens, James P. Direct...Transmitted Reference Synchronization Diagram ..... ... 17 3. Baseband Digital Matched Filter ...... ............ .. 22 4. Delay Line Matched Filter...direct sequence technique uses a pseudo-noise (PN) digital code to directly modulate a conventional frequency modulated (FM) carrier in the AFIT system

  3. Next-Generation Sequencing of Aquatic Oligochaetes: Comparison of Experimental Communities

    PubMed Central

    Vivien, Régis; Lejzerowicz, Franck; Pawlowski, Jan

    2016-01-01

    Aquatic oligochaetes are a common group of freshwater benthic invertebrates known to be very sensitive to environmental changes and currently used as bioindicators in some countries. However, more extensive application of oligochaetes for assessing the ecological quality of sediments in watercourses and lakes would require overcoming the difficulties related to morphology-based identification of oligochaetes species. This study tested the Next-Generation Sequencing (NGS) of a standard cytochrome c oxydase I (COI) barcode as a tool for the rapid assessment of oligochaete diversity in environmental samples, based on mixed specimen samples. To know the composition of each sample we Sanger sequenced every specimen present in these samples. Our study showed that a large majority of OTUs (Operational Taxonomic Unit) could be detected by NGS analyses. We also observed congruence between the NGS and specimen abundance data for several but not all OTUs. Because the differences in sequence abundance data were consistent across samples, we exploited these variations to empirically design correction factors. We showed that such factors increased the congruence between the values of oligochaetes-based indices inferred from the NGS and the Sanger-sequenced specimen data. The validation of these correction factors by further experimental studies will be needed for the adaptation and use of NGS technology in biomonitoring studies based on oligochaete communities. PMID:26866802

  4. Complete genome sequence and comparison of two Shiga toxin-producing Escherichia coli O104 isolates

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Shiga toxin-producing Escherichia coli (STEC) O104 strains have been associated with sporadic cases of illness and have caused outbreaks associated with milk and sprouts. E. coli O104:H21 caused an outbreak associated with milk in the U.S. in 1994. In this study, next generation sequencing techno...

  5. Bringing Next-Generation Sequencing into the Classroom through a Comparison of Molecular Biology Techniques

    ERIC Educational Resources Information Center

    Bowling, Bethany; Zimmer, Erin; Pyatt, Robert E.

    2014-01-01

    Although the development of next-generation (NextGen) sequencing technologies has revolutionized genomic research and medicine, the incorporation of these topics into the classroom is challenging, given an implied high degree of technical complexity. We developed an easy-to-implement, interactive classroom activity investigating the similarities…

  6. The Complete Mitochondrial Genome Sequence of Bactericera cockerelli and Comparison with Three Other Psylloidea Species.

    PubMed

    Wu, Fengnian; Cen, Yijing; Wallis, Christopher M; Trumble, John T; Prager, Sean; Yokomi, Ray; Zheng, Zheng; Deng, Xiaoling; Chen, Jianchi; Liang, Guangwen

    2016-01-01

    Potato psyllid (Bactericera cockerelli) is an important pest of potato, tomato and pepper. Not only could a toxin secreted by nymphs results in serious phytotoxemia in some host plants, but also over the past few years B. cockerelli was shown to transmit "Candidatus Liberibacter solanacearum", the putative bacterial pathogen of potato zebra chip (ZC) disease, to potato and tomato. ZC has caused devastating losses to potato production in the western U.S., Mexico, and elsewhere. New knowledge of the genetic diversity of the B. cockerelli is needed to develop improved strategies to manage pest populations. Mitochondrial genome (mitogenome) sequencing provides important knowledge about insect evolution and diversity in and among populations. This report provides the first complete B. cockerelli mitogenome sequence as determined by next generation sequencing technology (Illumina MiSeq). The circular B. cockerelli mitogenome had a size of 15,220 bp with 13 protein-coding gene (PCGs), 2 ribosomal RNA genes (rRNAs), 22 transfer RNA genes (tRNAs), and a non-coding region of 975 bp. The overall gene order of the B. cockerelli mitogenome is identical to three other published Psylloidea mitogenomes: one species from the Triozidae, Paratrioza sinica; and two species from the Psyllidae, Cacopsylla coccinea and Pachypsylla venusta. This suggests all of these species share a common ancestral mitogenome. However, sequence analyses revealed differences between and among the insect families, in particular a unique region that can be folded into three stem-loop secondary structures present only within the B. cockerelli mitogenome. A phylogenetic tree based on the 13 PCGs matched an existing taxonomy scheme that was based on morphological characteristics. The available complete mitogenome sequence makes it accessible to all genes for future population diversity evaluation of B. cockerelli.

  7. The Complete Mitochondrial Genome Sequence of Bactericera cockerelli and Comparison with Three Other Psylloidea Species

    PubMed Central

    Wu, Fengnian; Cen, Yijing; Wallis, Christopher M.; Trumble, John T.; Prager, Sean; Yokomi, Ray; Zheng, Zheng; Deng, Xiaoling; Chen, Jianchi; Liang, Guangwen

    2016-01-01

    Potato psyllid (Bactericera cockerelli) is an important pest of potato, tomato and pepper. Not only could a toxin secreted by nymphs results in serious phytotoxemia in some host plants, but also over the past few years B. cockerelli was shown to transmit “Candidatus Liberibacter solanacearum”, the putative bacterial pathogen of potato zebra chip (ZC) disease, to potato and tomato. ZC has caused devastating losses to potato production in the western U.S., Mexico, and elsewhere. New knowledge of the genetic diversity of the B. cockerelli is needed to develop improved strategies to manage pest populations. Mitochondrial genome (mitogenome) sequencing provides important knowledge about insect evolution and diversity in and among populations. This report provides the first complete B. cockerelli mitogenome sequence as determined by next generation sequencing technology (Illumina MiSeq). The circular B. cockerelli mitogenome had a size of 15,220 bp with 13 protein-coding gene (PCGs), 2 ribosomal RNA genes (rRNAs), 22 transfer RNA genes (tRNAs), and a non-coding region of 975 bp. The overall gene order of the B. cockerelli mitogenome is identical to three other published Psylloidea mitogenomes: one species from the Triozidae, Paratrioza sinica; and two species from the Psyllidae, Cacopsylla coccinea and Pachypsylla venusta. This suggests all of these species share a common ancestral mitogenome. However, sequence analyses revealed differences between and among the insect families, in particular a unique region that can be folded into three stem-loop secondary structures present only within the B. cockerelli mitogenome. A phylogenetic tree based on the 13 PCGs matched an existing taxonomy scheme that was based on morphological characteristics. The available complete mitogenome sequence makes it accessible to all genes for future population diversity evaluation of B. cockerelli. PMID:27227976

  8. Comparison of classic with novel in situ extraction of soil amino acids from grassland soil

    NASA Astrophysics Data System (ADS)

    Chen, J.; Williams, D. G.

    2012-12-01

    Characterization of organic and inorganic soil nitrogen availability is important for determining ecosystem response to global change, as nitrogen limitation is often a major constraint on ecosystem productivity. Classic methods of soil nitrogen extraction involve field collection of soil samples and disturbance of soil aggregates during processing. A novel method of soil amino acid extraction is described that allows the collection of semi-sterile soil water extracts in situ with minimal disturbance to soils. Comparison of samples collected using this novel method to samples collected in parallel using classic methods developed by Brookes et al. 1985 and Kielland 1994 revealed different detectable amino acid N pools relative to ammonium. Glutamate and arginine comprised the highest amino acid N pools from extracts collected from a semiarid grassland site using this new method of extraction. In contrast, samples collected and extracted using the classic method contained higher relative levels of serine, glycine and glutamate. The amounts of dominant amino acids relative to ammonium were significantly greater using the classic method compared to the new method. These observed higher ratios of amino acids to ammonium are likely the result of additional amino acid inputs by lysis of microorganisms which are not removed when filtering in the classic method. Disturbance associated with classic methods of soil N determination may have led to alterations in the quantity and distribution of ammonium and amino acids in extracts. Minimizing disturbance of soil aggregates when sampling nitrogen pools and selection of an appropriate filter for collecting free amino acids may be important for accurately determining nitrogen availability to plant roots and soil microbes.

  9. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance

    PubMed Central

    Baranzoni, Gian Marco; Reichenberger, Erin R.; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-01-01

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here. PMID:27469964

  10. Complete Genome Sequences of Escherichia coli O157:H7 Strains SRCC 1675 and 28RC, Which Vary in Acid Resistance.

    PubMed

    Baranzoni, Gian Marco; Fratamico, Pina M; Reichenberger, Erin R; Kim, Gwang-Hee; Breidt, Frederick; Kay, Kathryn; Oh, Deog-Hwan

    2016-07-28

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented here.

  11. Complete genome sequences of Escherichia coli O157:H7 strains SRCC 1675 and 28RC that vary in acid resistance

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The level of acid resistance among Escherichia coli O157:H7 strains varies, and strains with higher resistance to acid may have a lower infectious dose. The complete genome sequences belonging to two strains of Escherichia coli O157:H7 with different levels of acid resistance are presented....

  12. Genomic sequence of 'Candidatus Liberibacter solanacearum' haplotype C and its comparison with haplotype A and B genomes

    PubMed Central

    Haapalainen, Minna; Schott, Thomas; Thompson, Sarah M.; Smith, Grant R.; Nissinen, Anne I.; Pirhonen, Minna

    2017-01-01

    Haplotypes A and B of ‘Candidatus Liberibacter solanacearum’ (CLso) are associated with diseases of solanaceous plants, especially Zebra chip disease of potato, and haplotypes C, D and E are associated with symptoms on apiaceous plants. To date, one complete genome of haplotype B and two high quality draft genomes of haplotype A have been obtained for these unculturable bacteria using metagenomics from the psyllid vector Bactericera cockerelli. Here, we present the first genomic sequences obtained for the carrot-associated CLso. These two genomic sequences of haplotype C, FIN114 (1.24 Mbp) and FIN111 (1.20 Mbp), were obtained from carrot psyllids (Trioza apicalis) harboring CLso. Genomic comparisons between the haplotypes A, B and C revealed that the genome organization differs between these haplotypes, due to large inversions and other recombinations. Comparison of protein-coding genes indicated that the core genome of CLso consists of 885 ortholog groups, with the pan-genome consisting of 1327 ortholog groups. Twenty-seven ortholog groups are unique to CLso haplotype C, whilst 11 ortholog groups shared by the haplotypes A and B, are not found in the haplotype C. Some of these ortholog groups that are not part of the core genome may encode functions related to interactions with the different host plant and psyllid species. PMID:28158295

  13. Amino acid sequence and some properties of phytolacain G, a cysteine protease from growing fruit of pokeweed, Phytolacca americana.

    PubMed

    Uchikoba, T; Arima, K; Yonezawa, H; Shimada, M; Kaneda, M

    2000-10-18

    A protease, phytolacain G, has been found to appear on CM-Sepharose ion-exchange chromatography of greenish small-size fruits of pokeweed, Phytolacca americana L, from ca. 2 weeks after flowering, and increases during fruit enlargement. Reddish ripe fruit of the pokeweed contained both phytolacain G and R. The molecular mass of phytolacain G was estimated to be 25.5 kDa by SDS-PAGE. Its amino acid sequence was reconstructed by automated sequence analysis of the peptides obtained after cleavage with Achromobacter protease I, chymotrypsin, and cyanogen bromide. The enzyme is composed of 216 amino acid residues, of which it shares 152 identical amino acid residues (70%) with phytolacain R, 126 (58%) with melain G, 108 (50%) with papain, 106 (49%) with actinidain, and 96 (44%) with stem bromelain. The amino acid residues forming the substrate binding S(2) pocket of papain, Tyr67, Pro68, Trp69, Val133, and Phe207, were predicted to be replaced by Trp, Met, His, Ala, and Ser in phytolacain G, respectively. As a consequence of these substitutions, the S(2) pocket is expected to be less hydrophobic in phytolacain G than in papain.

  14. The Main Sequence of Explosive Solar Active Regions: Comparison of Emerging and Mature Active Regions

    NASA Technical Reports Server (NTRS)

    Falconer, David; Moore, Ron

    2011-01-01

    For mature active regions, an active region s magnetic flux content determines the maximum free energy the active region can have. Most Large flares and CMEs occur in active regions that are near their free-energy limit. Active-region flare power radiated in the GOES 1-8 band increases steeply as the free-energy limit is approached. We infer that the free-energy limit is set by the rate of release of an active region s free magnetic energy by flares, CMEs and coronal heating balancing the maximum rate the Sun can put free energy into the active region s magnetic field. This balance of maximum power results in explosive active regions residing in a "mainsequence" in active-region (flux content, free energy content) phase space, which sequence is analogous to the main sequence of hydrogen-burning stars in (mass, luminosity) phase space.

  15. Comparison and correlation of Simple Sequence Repeats distribution in genomes of Brucella species

    PubMed Central

    Kiran, Jangampalli Adi Pradeep; Chakravarthi, Veeraraghavulu Praveen; Kumar, Yellapu Nanda; Rekha, Somesula Swapna; Kruti, Srinivasan Shanthi; Bhaskar, Matcha

    2011-01-01

    Computational genomics is one of the important tools to understand the distribution of closely related genomes including simple sequence repeats (SSRs) in an organism, which gives valuable information regarding genetic variations. The central objective of the present study was to screen the SSRs distributed in coding and non-coding regions among different human Brucella species which are involved in a range of pathological disorders. Computational analysis of the SSRs in the Brucella indicates few deviations from expected random models. Statistical analysis also reveals that tri-nucleotide SSRs are overrepresented and tetranucleotide SSRs underrepresented in Brucella genomes. From the data, it can be suggested that over expressed tri-nucleotide SSRs in genomic and coding regions might be responsible in the generation of functional variation of proteins expressed which in turn may lead to different pathogenicity, virulence determinants, stress response genes, transcription regulators and host adaptation proteins of Brucella genomes. Abbreviations SSRs - Simple Sequence Repeats, ORFs - Open Reading Frames. PMID:21738309

  16. Relating Promoter Sequences to the Proteins that Bind to Them: A Comparison Study.

    NASA Astrophysics Data System (ADS)

    Glass, Kimberly

    2007-03-01

    Chromatin Immunoprecipitation (ChIP-on-ChIP) microarray data reveals that the proteins H3K9dimethyl and RNA-Polymerase II are exclusive regarding their binding to the promoter region of genes. When comparing the base pair sequences of the promoters that bind to Pol2 versus H3K9, striking differences appear. The mononucleotides have fundamentally different behaviors in each group. In addition, motifs that cluster before the transcriptional start site also generally have a strong enrichment in one group compared to the other. Using this knowledge a model can be developed that allows one to calculate a probability that a promoter will bind to either H3K9 or Pol2 based on its base pair sequence.

  17. FMRI/ERP of musical syntax: comparison of melodies and unstructured note sequences.

    PubMed

    Minati, Ludovico; Rosazza, Cristina; D'Incerti, Ludovico; Pietrocini, Emanuela; Valentini, Laura; Scaioli, Vidmer; Loveday, Catherine; Bruzzone, Maria Grazia

    2008-09-17

    To date, the neural correlates of musical syntax processing have been investigated mainly by means of paradigms in which isolated chords are made incongruent with the harmonic context. Here, we present results obtained contrasting unfamiliar one-part piano melodies with unstructured note sequences, comparable in pitch and rhythm but devoid of any syntactic structure. This paradigm indexes a superset of the cognitive functions involved in processing of harmonic rules. Using functional magnetic resonance imaging, differential activation of a bilateral cortical network comprising the inferior frontal gyrus, superior temporal gyrus and premotor cortex was found. Using event-related potentials, the N2 evoked by each note in melodies was found to have longer latency and a more frontal distribution than that evoked in unstructured sequences.

  18. Phylogenetic structure of Xanthomonas determined by comparison of gyrB sequences.

    PubMed

    Parkinson, Neil; Cowie, Claire; Heeney, John; Stead, David

    2009-02-01

    Previously, we have produced a phylogeny of species type strains from the plant-pathogenic genus Xanthomonas based on gyrB sequences. To evaluate this locus further for species and infraspecies identification, we sequenced an additional 203 strains comprising all the pathovar reference strains (which have defined plant hosts), 67 poorly characterized pathovars, currently classified as Xanthomonas campestris, and 59 unidentified xanthomonads. The well-characterized pathovars grouped either in clades containing their respective species type strain or in clades containing species related to Xanthomonas axonopodis. The Xanthomonas euvesicatoria, Xanthomonas perforans and Xanthomonas alfalfae species complex, Xanthomonas fuscans and Xanthomonas citri were discriminated as X. axonopodis-related clades and comprised a large proportion of unidentified strains as well as 80 pathovars representing all the X. axonopodis pathovars and many poorly characterized pathovars, greatly increasing the plant host ranges of the constituent species. Most xanthomonads from these three large clades were isolated from a taxonomically diverse range of plant hosts, including many weed species, from field systems in India, suggesting that these lineages became established and diversified in agricultural areas in this region. The majority of these xanthomonads had minimal sequence diversity, consistent with rapid and highly extensive pathovar diversification that has occurred in relatively recent times. Low-intensity farming practices may have provided conditions conducive to pathovar development, and evidence for pathovar diversification within other regional angiosperm floras is discussed. The gyrB locus was sufficiently discriminating to identify diversity within many species. Seven branches or clades were sufficiently distinct to be considered as potential novel species. This study has provided a comprehensive xanthomonad classification framework and has firmly established gyrB sequencing

  19. On the comparison between equilibrium and disequilibrium condensation sequences of meteorites

    NASA Technical Reports Server (NTRS)

    De, B. R.

    1977-01-01

    A definition is presented of a simple formalism for calculating the condensation sequence of meteoritic minerals in a cooling vapor phase in temperature disequilibrium (between the vapor phase and the condensed phase) when such minerals condense congruently. Certain preliminary comments are made about the possibility of judging the relative plausibility of the equilibrium and the disequilibrium models from the observations in meteorites if a monotonic cooling were indeed the course of condensation in either case.

  20. Nucleotide sequence of the pyruvate decarboxylase gene from Zymomonas mobilis.

    PubMed

    Neale, A D; Scopes, R K; Wettenhall, R E; Hoogenraad, N J

    1987-02-25

    Pyruvate decarboxylase (EC 4.1.1.1), the penultimate enzyme in the alcoholic fermentation pathway of Zymomonas mobilis, converts pyruvate to acetaldehyde and carbon dioxide. The complete nucleotide sequence of the structural gene encoding pyruvate decarboxylase from Zymomonas mobilis has been determined. The coding region is 1704 nucleotides long and encodes a polypeptide of 567 amino acids with a calculated subunit mass of 60,790 daltons. The amino acid sequence was confirmed by comparison with the amino acid sequence of a selection of tryptic fragments of the enzyme. The amino acid composition obtained from the nucleotide sequence is in good agreement with that obtained experimentally.

  1. Mesoscopic modeling of DNA denaturation rates: Sequence dependence and experimental comparison

    SciTech Connect

    Dahlen, Oda Erp, Titus S. van

    2015-06-21

    Using rare event simulation techniques, we calculated DNA denaturation rate constants for a range of sequences and temperatures for the Peyrard-Bishop-Dauxois (PBD) model with two different parameter sets. We studied a larger variety of sequences compared to previous studies that only consider DNA homopolymers and DNA sequences containing an equal amount of weak AT- and strong GC-base pairs. Our results show that, contrary to previous findings, an even distribution of the strong GC-base pairs does not always result in the fastest possible denaturation. In addition, we applied an adaptation of the PBD model to study hairpin denaturation for which experimental data are available. This is the first quantitative study in which dynamical results from the mesoscopic PBD model have been compared with experiments. Our results show that present parameterized models, although giving good results regarding thermodynamic properties, overestimate denaturation rates by orders of magnitude. We believe that our dynamical approach is, therefore, an important tool for verifying DNA models and for developing next generation models that have higher predictive power than present ones.

  2. Sequence comparison and phylogenetic analysis by the Maximum Likelihood method of ribosome-inactivating proteins from angiosperms.

    PubMed

    Di Maro, Antimo; Citores, Lucía; Russo, Rosita; Iglesias, Rosario; Ferreras, José Miguel

    2014-08-01

    Ribosome-inactivating proteins (RIPs) from angiosperms are rRNA N-glycosidases that have been proposed as defence proteins against virus and fungi. They have been classified as type 1 RIPs, consisting of single-chain proteins, and type 2 RIPs, consisting of an A chain with RIP properties covalently linked to a B chain with lectin properties. In this work we have carried out a broad search of RIP sequence data banks from angiosperms in order to study their main structural characteristics and phylogenetic evolution. The comparison of the sequences revealed the presence, outside of the active site, of a novel structure that might be involved in the internal protein dynamics linked to enzyme catalysis. Also the B-chains presented another conserved structure that might function either supporting the beta-trefoil structure or in the communication between both sugar-binding sites. A systematic phylogenetic analysis of RIP sequences revealed that the most primitive type 1 RIPs were similar to that of the actual monocots (Poaceae and Asparagaceae). The primitive RIPs evolved to the dicot type 1 related RIPs (like those from Caryophyllales, Lamiales and Euphorbiales). The gene of a type 1 RIP related with the actual Euphorbiaceae type 1 RIPs fused with a double beta trefoil lectin gene similar to the actual Cucurbitaceae lectins to generate the type 2 RIPs and finally this gene underwent deletions rendering either type 1 RIPs (like those from Cucurbitaceae, Rosaceae and Iridaceae) or lectins without A chain (like those from Adoxaceae).

  3. Comparison Study between North Texas Earthquake Sequences from 2008-2015

    NASA Astrophysics Data System (ADS)

    DeShon, H. R.; Hayward, C.; Scales, M. M.; Magnani, M. B.; Hornbach, M. J.; Stump, B. W.

    2015-12-01

    High-resolution earthquake locations and fault plane solutions are combined with information on subsurface geology, fault structure, well data, and 3D pore pressure modeling to provide further insight into the relationship between fluid migration at depth and modern seismicity in North Texas. Since 2008, the USGS has reported over 165 felt earthquakes in the Fort Worth (Barnett Shale) Basin located in northern Texas. Five earthquake sequences warranted deployment of local seismic networks, and there are currently 30+ temporary seismic stations operating in the basin. Event size has increased over time. The 2008/2012 DFW Airport and 2009 Cleburne sequences had maximum event magnitudes of 3.3 and 2.8, respectively. The 2013/2014 Azle-Reno and 2014/2015 Irving-Dallas swarms contain multiple M3.5 earthquakes, and the M4.0 May 2015 Venus earthquake is the largest event recorded to date. Causative faults strike NNE-SSW to NE-SW and are associated with normal faulting, consistent with fault reactivation in the current stress regime. The active faults range from 2-5 km in length, are steeply dipping (45-70º) and can dip to the SE (DFW Airport; Irving-Dallas) or NW (Azle-Reno; Venus). Events occur between 2-6 km depth in the DFW, Cleburne, and Venus sequences but extend to 8 km in the Irving-Dallas and Azle-Reno earthquake swarms. These depths are consistent with reactivation of ancient faults located in the basement granites and/or overlying sedimentary units. The top of the Ordovician Ellenburger group, which serves as a wastewater injection unit in the basin, ranges from 1.4-2.7 km depth across the seismogenic area, is ~1 km thick on average, and generally overlies the Precambrian basement. The DFW Airport, Cleburne, and Azle-Reno sequences have been linked to nearby wastewater injection and/or production activity associated with shale gas extraction. Causal studies of the 2014/2015 Irving-Dallas and 2015 Venus sequences are ongoing and will be updated here.

  4. Application of genotyping-by-sequencing on semiconductor sequencing platforms: A comparison of genetic and reference-based marker ordering in barley

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The rapid development of next generation sequencing platforms has enabled the use of sequencing for routine genotyping across a range of genetics studies and breeding applications. Genotyping-by-sequencing (GBS), a low-cost, reduced representation sequencing method, is becoming a common approach fo...

  5. Modulation of anti-endotoxin property of Temporin L by minor amino acid substitution in identified phenylalanine zipper sequence.

    PubMed

    Srivastava, Saurabh; Kumar, Amit; Tripathi, Amit Kumar; Tandon, Anshika; Ghosh, Jimut Kanti

    2016-11-01

    A 13-residue frog antimicrobial peptide Temporin L (TempL) possesses versatile antimicrobial activities and is considered a lead molecule for the development of new antimicrobial agents. To find out the amino acid sequences that influence the anti-microbial property of TempL, a phenylalanine zipper-like sequence was identified in it which was not reported earlier. Several alanine-substituted analogs and a scrambled peptide having the same composition of TempL were designed for evaluating the role of this motif. To investigate whether leucine residues instead of phenylalanine residues at 'a' and/or 'd' position(s) of the heptad repeat sequence could alter its antimicrobial property, several TempL analogs were synthesized after replacing these phenylalanine residues with leucine residues. Replacing phenylalanine residues with alanine residues in the phenylalanine zipper sequence significantly compromised the anti-endotoxin property of TempL. This is evident from the higher production of tumor necrosis factor-α and interleukin-6 in lipopolysaccharide (LPS)-stimulated rat bone-marrow-derived macrophage cells in the presence of its alanine-substituted analogs than TempL itself. However, replacement of these phenylalanine residues with leucine residues significantly augmented anti-endotoxin property of TempL. A single alanine-substituted TempL analog (F8A-TempL) showed significantly reduced cytotoxicity but retained the antibacterial activity of TempL, while the two single leucine-substituted analogs (F5L-TempL and F8L-TempL), although exhibiting lower cytotoxicity, were able to retain the antibacterial activity of the parent peptide. The results demonstrate how minor amino acid substitutions in the identified phenylalanine zipper sequence in TempL could yield analogs with better antibacterial and/or anti-endotoxin properties with their plausible mechanism of action.

  6. Genome Sequencing of the Pyruvate-producing Strain Candida glabrata CCTCC M202019 and Genomic Comparison with Strain CBS138

    PubMed Central

    Xu, Nan; Ye, Chao; Chen, Xiulai; Liu, Jia; Liu, Liming; Chen, Jian

    2016-01-01

    Candida glabrata CCTCC M202019 as an industrial yeast strain that is widely used to produce α-oxocarboxylic acid. Strain M202019 has been proven to have a higher pyruvate-producing capacity than the reference strain CBS138. To characterize the genotype of the M202019 strain, we generated a draft sequence of its genome, which has a size of 12.1 Mbp and a GC content of 38.47%. Evidence accumulated during genome annotation suggests that strain M202019 has strong capacities for glucose transport and pyruvate biosynthesis, defects in pyruvate catabolism, as well as variations in genes involved in nutrient and dicarboxylic acid transport, oxidative phosphorylation, and other relevant aspects of carbon metabolism, which might promote pyruvate accumulation. In addition to differences in its central carbon metabolism, a genomic analysis revealed genetic differences in adhesion metabolism. Forty-nine adhesin-like proteins of strain M202019 were identified classified into seven subfamilies. Decreased amounts of adhesive proteins, and deletions or changes of low-complexity repeats and functional domains might lead to lower adhesion and reduced pathogenicity. Further virulence experiments validated the biological safety of strain M202019. Analysis of the C. glabrata CCTCC M202019 genome sequence provides useful insights into its genetic context, physical characteristics, and potential metabolic capacity. PMID:27713500

  7. Crystal structure of transglutaminase 2 with GTP complex and amino acid sequence evidence of evolution of GTP binding site.

    PubMed

    Jang, Tae-Ho; Lee, Dong-Sup; Choi, Kihang; Jeong, Eui Man; Kim, In-Gyu; Kim, Young Whan; Chun, Jung Nyeo; Jeon, Ju-Hong; Park, Hyun Ho

    2014-01-01

    Transglutaminase2 (TG2) is a multi-functional protein involved in various cellular processes, including apoptosis, differentiation, wound healing, and angiogenesis. The malfunction of TG2 causes many human disease including inflammatory disease, celiac disease, neurodegenerative diseases, tissue fibrosis, and cancers. Protein cross-linking activity, which is representative of TG2, is activated by calcium ions and suppressed by GTP. Here, we elucidated the structure of TG2 in complex with its endogenous inhibitor, GTP. Our structure showed why GTP is the optimal nucleotide for interacting with and inhibiting TG2. In addition, sequence comparison provided information describing the evolutionary scenario of GTP usage for controlling the activity of TG2.

  8. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms

    PubMed Central

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/ PMID:26424080

  9. JRC GMO-Amplicons: a collection of nucleic acid sequences related to genetically modified organisms.

    PubMed

    Petrillo, Mauro; Angers-Loustau, Alexandre; Henriksson, Peter; Bonfini, Laura; Patak, Alex; Kreysa, Joachim

    2015-01-01

    The DNA target sequence is the key element in designing detection methods for genetically modified organisms (GMOs). Unfortunately this information is frequently lacking, especially for unauthorized GMOs. In addition, patent sequences are generally poorly annotated, buried in complex and extensive documentation and hard to link to the corresponding GM event. Here, we present the JRC GMO-Amplicons, a database of amplicons collected by screening public nucleotide sequence databanks by in silico determination of PCR amplification with reference methods for GMO analysis. The European Union Reference Laboratory for Genetically Modified Food and Feed (EU-RL GMFF) provides these methods in the GMOMETHODS database to support enforcement of EU legislation and GM food/feed control. The JRC GMO-Amplicons database is composed of more than 240 000 amplicons, which can be easily accessed and screened through a web interface. To our knowledge, this is the first attempt at pooling and collecting publicly available sequences related to GMOs in food and feed. The JRC GMO-Amplicons supports control laboratories in the design and assessment of GMO methods, providing inter-alia in silico prediction of primers specificity and GM targets coverage. The new tool can assist the laboratories in the analysis of complex issues, such as the detection and identification of unauthorized GMOs. Notably, the JRC GMO-Amplicons database allows the retrieval and characterization of GMO-related sequences included in patents documentation. Finally, it can help annotating poorly described GM sequences and identifying new relevant GMO-related sequences in public databases. The JRC GMO-Amplicons is freely accessible through a web-based portal that is hosted on the EU-RL GMFF website. Database URL: http://gmo-crl.jrc.ec.europa.eu/jrcgmoamplicons/.

  10. A comparison of tools for the simulation of genomic next-generation sequencing data

    PubMed Central

    Escalona, Merly; Rocha, Sara; Posada, David

    2017-01-01

    Computer simulation of genomic data has become increasingly popular for assessing and validating biological models or to gain understanding about specific datasets. Multiple computational tools for the simulation of next-generation sequencing (NGS) data have been developed in recent years, which could be used to compare existing and new NGS analytical pipelines. Here we review 23 of these tools, highlighting their distinct functionality, requirements and potential applications. We also provide a decision tree for the informed selection of an appropriate NGS simulation tool for the specific question at hand. PMID:27320129

  11. A comparison of tools for the simulation of genomic next-generation sequencing data.

    PubMed

    Escalona, Merly; Rocha, Sara; Posada, David

    2016-08-01

    Computer simulation of genomic data has become increasingly popular for assessing and validating biological models or for gaining an understanding of specific data sets. Several computational tools for the simulation of next-generation sequencing (NGS) data have been developed in recent years, which could be used to compare existing and new NGS analytical pipelines. Here we review 23 of these tools, highlighting their distinct functionality, requirements and potential applications. We also provide a decision tree for the informed selection of an appropriate NGS simulation tool for the specific question at hand.

  12. A structural and primary sequence comparison of the viral RNA-dependent RNA polymerases

    PubMed Central

    Bruenn, Jeremy A.

    2003-01-01

    A systematic bioinformatic approach to identifying the evolutionarily conserved regions of proteins has verified the universality of a newly described conserved motif in RNA-dependent RNA polymerases (motif F). In combination with structural comparisons, this approach has defined two regions that may be involved in unwinding double-stranded RNA (dsRNA) for transcription. One of these is the N-terminal portion of motif F and the second is a large insertion in motif F present in the RNA-dependent RNA polymerases of some dsRNA viruses. PMID:12654997

  13. Applicability Comparison of Methods for Acid Generation Assessment of Rock Samples

    NASA Astrophysics Data System (ADS)

    Oh, Chamteut; Ji, Sangwoo; Yim, Giljae; Cheong, Youngwook

    2014-05-01

    Minerals including various forms of sulfur could generate AMD (Acid Mine Drainage) or ARD (Acid Rock Drainage), which can have serious effects on the ecosystem and even on human when exposed to air and/or water. To minimize the hazards by acid drainage, it is necessary to assess in advance the acid generation possibility of rocks and estimate the amount of acid generation. Because of its relatively simple and effective experiment procedure, the method of combining the results of ABA (Acid Base Accounting) and NAG (Net Acid Generation) tests have been commonly used in determining acid drainage conditions. The simplicity and effectiveness of the above method however, are derived from massive assumptions of simplified chemical reactions and this often leads to results of classifying the samples as UC (Uncertain) which would then require additional experimental or field data to reclassify them properly. This paper therefore, attempts to find the reasons that cause samples to be classified as UC and suggest new series of experiments where samples can be reclassified appropriately. Study precedents on evaluating potential acid generation and neutralization capacity were reviewed and as a result three individual experiments were selected in the light of applicability and compatibility of minimizing unnecessary influence among other experiments. The proposed experiments include sulfur speciation, ABCC (Acid Buffering Characteristic Curve), and Modified NAG which are all improved versions of existing experiments of Total S, ANC (Acid Neutralizing Capacity), and NAG respectively. To assure the applicability of the experiments, 36 samples from 19 sites with diverse geologies, field properties, and weathering conditions were collected. The samples were then subject to existing experiments and as a result, 14 samples which either were classified as UC or could be used as a comparison group had been selected. Afterwards, the selected samples were used to conduct the suggested

  14. The human erythrocyte anion-transport protein. Partial amino acid sequence, conformation and a possible molecular mechanism for anion exchange.

    PubMed Central

    Brock, C J; Tanner, M J; Kempf, C

    1983-01-01

    The N-terminal 72 residues of an integral membrane fragment, P5, of the human erythrocyte anion-transport protein, which is known to be directly involved in the anion-exchange process, was shown to have the following amino acid sequence: Met-Val-Pro-Lys-Pro-Gln-Gly-Pro-Leu-Pro-Asn-Thr-Ala-Leu-Leu-Ser-Leu-Val-Leu-Met -Ala-Gly-Thr-Phe-Phe-Phe-Ala-Met-Met-Leu-Arg-Lys-Phe-Lys-Asn-Ser-Ser-Tyr-Phe-Pro-Gly-Lys-Leu-Arg-Arg-Val-Ile-Gly-Asp-Phe-Gly-Val-Pro-Ile-Ser-Ile-Leu-Ile-Met-Val-Leu-Val-Asp-Phe-Phe-Ile-Gln-Asp-Thr-Tyr-Thr-Gln- The structure of this fragment was analysed, with account being taken of the constraints that apply to the folding of integral membrane proteins and the topographical locations of various sites in the sequence. It was concluded that this sequence forms two transmembrane alpha-helices. These are probably part of a cluster of amphipathic transmembrane alpha-helices, which could comprise that part of the protein responsible for transport activity. The presently available evidence relating to the anion-exchange process was considered with the structural features noted in this study and a possible molecular mechanism is proposed. In this model the rearrangement of a network of intramembranous charged pairs mediates the translocation of an anion between anion-binding regions at each surface of the membrane, which are composed of clusters of positively charged amino acids. This model imposes a sequential exchange mechanism on the system. Supplementary material, including Tables and Figures describing the compositions of peptides determined by amino acid analysis and sequence studies, quantitative and qualitative data that provide a residue-by-residue justification for the sequence assignment and a description of modifications to and use of the solid-phase sequencer has been deposited as Supplementary Publication SUP 50123 (12 pages) with the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be

  15. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2230 from Karijini National Park, Australia

    PubMed Central

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O’Hara, Graham; Goodwin, Lynne; Han, James; Lobos, Elizabeth; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2013-01-01

    Burkholderia sp. strain WSM2230 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod isolated from acidic soil collected in 2001 from Karijini National Park, Western Australia, using Kennedia coccinea (Coral Vine) as a host. WSM2230 was initially effective in nitrogen-fixation with K. coccinea, but subsequently lost symbiotic competence. Here we describe the features of Burkholderia sp. strain WSM2230, together with genome sequence information and its annotation. The 6,309,801 bp high-quality-draft genome is arranged into 33 scaffolds of 33 contigs containing 5,590 protein-coding genes and 63 RNA-only encoding genes. The genome sequence of WSM2230 failed to identify nodulation genes and provides an explanation for the observed failure of the laboratory grown strain to nodulate. The genome of this strain is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project. PMID:25197440

  16. Genome sequence of the acid-tolerant Burkholderia sp. strain WSM2230 from Karijini National Park, Australia.

    PubMed

    Walker, Robert; Watkin, Elizabeth; Tian, Rui; Bräu, Lambert; O'Hara, Graham; Goodwin, Lynne; Han, James; Lobos, Elizabeth; Huntemann, Marcel; Pati, Amrita; Woyke, Tanja; Mavromatis, Konstantinos; Markowitz, Victor; Ivanova, Natalia; Kyrpides, Nikos; Reeve, Wayne

    2014-06-15

    Burkholderia sp. strain WSM2230 is an aerobic, motile, Gram-negative, non-spore-forming acid-tolerant rod isolated from acidic soil collected in 2001 from Karijini National Park, Western Australia, using Kennedia coccinea (Coral Vine) as a host. WSM2230 was initially effective in nitrogen-fixation with K. coccinea, but subsequently lost symbiotic competence. Here we describe the features of Burkholderia sp. strain WSM2230, together with genome sequence information and its annotation. The 6,309,801 bp high-quality-draft genome is arranged into 33 scaffolds of 33 contigs containing 5,590 protein-coding genes and 63 RNA-only encoding genes. The genome sequence of WSM2230 failed to identify nodulation genes and provides an explanation for the observed failure of the laboratory grown strain to nodulate. The genome of this strain is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.

  17. Cloning, sequence analysis and expression of the F1F0-ATPase beta-subunit from wine lactic acid bacteria.

    PubMed

    Sievers, Martin; Uermösi, Christina; Fehlmann, Marc; Krieger, Sibylle

    2003-09-01

    The nucleotide sequences of the genes encoding the F1F0-ATPase beta-subunit from Oenococcus oeni, Leuconostoc mesenteroides subsp. mesenteroides, Pediococcus damnosus, Pediococcus parvulus, Lactobacillus brevis and Lactobacillus hilgardii were determined. Their deduced amino acid sequences showed homology values of 79-98%. Data from the alignment and ATPase tree indicated that O. oeni and L. mesenteroides subsp. mesenteroides formed a group well-separated from P. damnosus and P. parvulus and from the group comprises L. brevis and L. hilgardii. The N-terminus of the F1F0-ATPase beta-subunit of O. oeni contains a stretch of additional 38 amino acid residues. The catalytic site of the ATPase beta-subunit of the investigated strains is characterized by the two conserved motifs GGAGVGKT and GERTRE. The amplified atpD coding sequences were inserted into the pCRT7/CT-TOPO vector using TA-cloning strategy and transformed in Escherichia coli. SDS-PAGE and Western blot analyses confirmed that O. oeni has an ATPase beta-subunit protein which is larger in size than the corresponding molecules from the investigated strains.

  18. The amino acid sequence of protein SCMK-B2C from the high-sulphur fraction of wool keratin

    PubMed Central

    Elleman, T. C.

    1972-01-01

    1. The amino acid sequence of a protein from the reduced and carboxymethylated high-sulphur fraction of wool has been determined. 2. The sequence of this S-carboxymethylkerateine (SCMK-B2C) of 151 amino acid residues displays much internal homology and an unusual residue distribution. Thus a ten-residue sequence occurs four times near the N-terminus and five times near the C-terminus with few changes. These regions contain much of the molecule's half-cystine, whereas between them there is a region of 19 residues that are mainly small and devoid of cystine and proline. 3. Certain models of the wool fibre base