Science.gov

Sample records for acid sequence identities

  1. Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

    PubMed Central

    Axelsen, Jacob Bock; Yan, Koon-Kiu; Maslov, Sergei

    2007-01-01

    Background The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results We introduce a simple mathematical framework based on a stochastic birth-and-death model that allows one to extract some of this information and apply it to the set of all pairs of paralogous proteins in H. pylori, E. coli, S. cerevisiae, C. elegans, D. melanogaster, and H. sapiens. It was found that the histogram of sequence identities p generated by an all-to-all alignment of all protein sequences encoded in a genome is well fitted with a power-law form ~ p-γ with the value of the exponent γ around 4 for the majority of organisms used in this study. This implies that the intra-protein variability of substitution rates is best described by the Gamma-distribution with the exponent α ≈ 0.33. Different features of the shape of such histograms allow us to quantify the ratio between the genome-wide average deletion/duplication rates and the amino-acid substitution rate. Conclusion We separately measure the short-term ("raw") duplication and deletion rates rdup∗, rdel∗ which include gene copies that will be removed soon after the duplication event and their dramatically reduced long-term counterparts rdup, rdel. High deletion rate among recently duplicated proteins is consistent with a scenario in which they didn't have enough time to significantly change their functional roles and thus are to a large degree disposable. Systematic trends of each of the four duplication/deletion rates with the total number of genes in the genome were analyzed. All but the deletion rate of recent duplicates rdel∗ were shown to systematically increase with Ngenes. Abnormally flat shapes

  2. Rat androgen-binding protein: evidence for identical subunits and amino acid sequence homology with human sex hormone-binding globulin.

    PubMed

    Joseph, D R; Hall, S H; French, F S

    1987-01-01

    The cDNA for rat androgen-binding protein (ABP) was previously isolated from a bacteriophage lambda gt11 rat testis cDNA library and its identity was confirmed by epitope selection. Hybrid-arrested translation studies have now demonstrated the identity of the isolates. The nucleotide sequence of a near full-length cDNA encodes a 403-amino acid precursor (Mr = 44,539), which agrees in size with the cell-free translation product (Mr = 45,000) of ABP mRNA. Putative sites of N-glycosylation and signal peptide cleavage were identified. Comparison of the predicted amino acid sequence of rat ABP with the amino-terminal amino acid sequence of human sex hormone-binding globulin revealed that 17 of 25 residues are identical. On the basis of the predicted amino acid sequence the molecular weight of the primary translation product, lacking the signal peptide, was 41,183. Hybridization analyses indicated that the two subunits of ABP are coded for by a single gene and a single mRNA species. Our results suggest that ABP consists of two subunits with identical primary sequences and that differences in post-translational processing result in the production of 47,000 and 41,000 molecular weight monomers.

  3. A case of orthologous sequences of hemocyanin subunits for an evolutionary study of horseshoe crabs: amino acid sequence comparison of immunologically identical subunits of Carcinoscorpius rotundicauda and Tachypleus tridentatus.

    PubMed

    Sugita, H; Shishikura, F

    1995-10-01

    About 83% of the amino acid sequence of hemocyanin subunit HR6 from the Southeast Asian horseshoe crab, Carcinoscorpius rotundicauda, has been determined. There is a difference of about 43% between HR6 and complete sequences of chelicerate hemocyanin subunits from the American horseshoe crab, Limulus polyphemus, and a tarantula, Eurypelma californicum. However, the immunologically identical subunits HR6 and HT6 from Tachypleus tridentatus (Japanese horseshoe crab) show 2.7% sequence difference. Based on the amino acid sequences of HR6 and HT6, the divergence between C. rotundicauda and T. tridentatus occurred about 9.6 million years ago. In the case of horseshoe crab hemocyanin subunits, it seems that the orthologous homologues in many homologous subunits between species are immunologically detectable.

  4. The amino acid sequence of Canada goose (Branta canadensis) and mute swan (Cygnus olor) hemoglobins. Two different species with identical beta-chains.

    PubMed

    Oberthür, W; Godovac-Zimmermann, J; Braunitzer, G; Wiesner, H

    1982-08-01

    The amino acid sequences of the alpha- and beta-chains from the major hemoglobin component (HbA) of Canada goose (Branta canadensis) and mute swan (Cygnus olor) are given. The alpha-chains are of the alpha A-type, since alpha D-type was expressed but only found in low concentrations. By homologous comparison, greylag goose hemoglobin (Anser anser) and Canada goose hemoglobin alpha-chains differ by two exchanges, and beta-chains by three exchanges. A valine substitution for threonine was found at position alpha 34 (B15). This exchange is a result of a two point mutation. Thus, there are three nucleotide mutations in alpha-chains, as in beta-chains. Substitutions in positions alpha 34 (B15) and beta 125 (H3) have modified intersubunit contacts (alpha 1 beta 1-contacts). A comparison of mute swan hemoglobin with greylag goose hemoglobin shows four exchanges in alpha-chains and three in beta-chains. Canada goose and mute swan have identical beta-chains, while alpha-chains differ in two amino acids. One of these exchanges is implicated in one of the alpha 1 beta 1-contact points (alpha 34) where isoleucine substitution for valine was found. Comparison of hemoglobins from different species in the same tribe (Anserini) shows a high homology between Canada goose and mute swan hemoglobins.

  5. My Identical Twin Sequenced our Genome.

    PubMed

    Schilit, Samantha L P; Schilit Nitenson, Arielle

    2017-04-01

    With rapidly declining costs, whole genome sequencing is becoming feasible for widespread use. Although cost-effectiveness is driving increased use of the technology, comprehensive recommendations on how to handle ethical dilemmas have yet to reach a consensus. In this article, Sam shares her experience of undergoing whole genome sequencing. Despite the deeply private nature of the test, the results do not solely belong to Sam; her identical twin sister, Arielle, shares virtually the same genome and received results without a formal consent process. This article explores their parallel experiences as a way of highlighting the controversial ethics of a private test with familial implications.

  6. Screening of transgenic proteins expressed in transgenic food crops for the presence of short amino acid sequences identical to potential, IgE – binding linear epitopes of allergens

    PubMed Central

    Kleter, Gijs A; Peijnenburg, Ad ACM

    2002-01-01

    Background Transgenic proteins expressed by genetically modified food crops are evaluated for their potential allergenic properties prior to marketing, among others by identification of short identical amino acid sequences that occur both in the transgenic protein and allergenic proteins. A strategy is proposed, in which the positive outcomes of the sequence comparison with a minimal length of six amino acids are further screened for the presence of potential linear IgE-epitopes. This double track approach involves the use of literature data on IgE-epitopes and an antigenicity prediction algorithm. Results Thirty-three transgenic proteins have been screened for identities of at least six contiguous amino acids shared with allergenic proteins. Twenty-two transgenic proteins showed positive results of six- or seven-contiguous amino acids length. Only a limited number of identical stretches shared by transgenic proteins (papaya ringspot virus coat protein, acetolactate synthase GH50, and glyphosate oxidoreductase) and allergenic proteins could be identified as (part of) potential linear epitopes. Conclusion Many transgenic proteins have identical stretches of six or seven amino acids in common with allergenic proteins. Most identical stretches are likely to be false positives. As shown in this study, identical stretches can be further screened for relevance by comparison with linear IgE-binding epitopes described in literature. In the absence of literature data on epitopes, antigenicity prediction by computer aids to select potential antibody binding sites that will need verification of IgE binding by sera binding tests. Finally, the positive outcomes of this approach warrant further clinical testing for potential allergenicity. PMID:12477382

  7. The complete amino acid sequence of prochymosin.

    PubMed Central

    Foltmann, B; Pedersen, V B; Jacobsen, H; Kauffman, D; Wybrandt, G

    1977-01-01

    The total sequence of 365 amino acid residues in bovine prochymosin is presented. Alignment with the amino acid sequence of porcine pepsinogen shows that 204 amino acid residues are common to the two zymogens. Further comparison and alignment with the amino acid sequence of penicillopepsin shows that 66 residues are located at identical positions in all three proteases. The three enzymes belong to a large group of proteases with two aspartate residues in the active center. This group forms a family derived from one common ancestor. PMID:329280

  8. Identical amino acid sequence of the aroA(G) gene products of Bacillus subtilis 168 and B. subtilis Marburg strain.

    PubMed

    Bolotin, A; Khazak, V; Stoynova, N; Ratmanova, K; Yomantas, Y; Kozlov, Y

    1995-09-01

    A DNA fragment containing the aroA(G) gene of Bacillus subtilis 168, encoding 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase-chorismate mutase, was cloned and sequenced. The N-terminus of the protein encoded by aroA(G) showed homology with chorismate mutase encoded by aroH of B. subtilis and with the chorismate mutase parts of proteins encoded by the pheA and tyrA genes of Escherichia coli. The C-terminus of the aroA(G) product has sequence similarity with 3-deoxy-D-manno-octulosonate 8-phosphate synthase of E. coli. It was shown that the proteins encoded by the aroA(G) gene of B. subtilis 168 and the aroA gene of B. subtilis ATCC 6051 Marburg strain are identical, so the observed differences in DAHP synthase activity from these two strains must result from other changes.

  9. Distinguishing Proteins From Arbitrary Amino Acid Sequences

    PubMed Central

    Yau, Stephen S.-T.; Mao, Wei-Guang; Benson, Max; He, Rong Lucy

    2015-01-01

    What kinds of amino acid sequences could possibly be protein sequences? From all existing databases that we can find, known proteins are only a small fraction of all possible combinations of amino acids. Beginning with Sanger's first detailed determination of a protein sequence in 1952, previous studies have focused on describing the structure of existing protein sequences in order to construct the protein universe. No one, however, has developed a criteria for determining whether an arbitrary amino acid sequence can be a protein. Here we show that when the collection of arbitrary amino acid sequences is viewed in an appropriate geometric context, the protein sequences cluster together. This leads to a new computational test, described here, that has proved to be remarkably accurate at determining whether an arbitrary amino acid sequence can be a protein. Even more, if the results of this test indicate that the sequence can be a protein, and it is indeed a protein sequence, then its identity as a protein sequence is uniquely defined. We anticipate our computational test will be useful for those who are attempting to complete the job of discovering all proteins, or constructing the protein universe. PMID:25609314

  10. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  11. Some identities of generalized Fibonacci sequence

    NASA Astrophysics Data System (ADS)

    Chong, Chin-Yoon; Cheah, C. L.; Ho, C. K.

    2014-07-01

    We introduced the generalized Fibonacci sequence {Un} defined by U0 = 0, U1 = 1, and Un+2 = pUn+1+qUn for all p, q∈Z+ and for all non-negative integers n. In this paper, we obtained some recursive formulas of the sequence.

  12. Amino acid sequence of mouse submaxillary gland renin.

    PubMed Central

    Misono, K S; Chang, J J; Inagami, T

    1982-01-01

    The complete amino acid sequences of the heavy chain and light chain of mouse submaxillary gland renin have been determined. The heavy chain consists of 288 amino acid residues having a Mr of 31,036 calculated from the sequence. The light chain contains 48 amino acid residues with a Mr of 5,458. The sequence of the heavy chain was determined by automated Edman degradations of the cyanogen bromide peptides and tryptic peptides generated after citraconylation, as well as other peptides generated therefrom. The sequence of the light chain was derived from sequence analyses of the peptides generated by cyanogen bromide cleavage or by digestion with Staphylococcus aureus protease. The sequences in the active site regions in renin containing two catalytically essential aspartyl residues 32 and 215 were found identical with those in pepsin, chymosin, and penicillopepsin. Comparison of the amino acid sequence of renin with that of porcine pepsin indicated a 42% sequence identity of the heavy chain with the amino-terminal and middle regions and a 46% identity of the light chain with the carboxyl-terminal region of the porcine pepsin sequence. Residues identical in renin and pepsin are distributed throughout the length of the molecules, suggesting a similarity in their overall structures. PMID:6812055

  13. High speed nucleic acid sequencing

    DOEpatents

    Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.

  14. Sequence of the human 40-kDa keratin reveals an unusual structure with very high sequence identity to the corresponding bovine keratin

    SciTech Connect

    Eckert, R.L.

    1988-02-01

    The complete amino acid and DNA sequences of the human 40-kDa keratin are reported. The DNA sequence encodes a protein of 44,098 Da, which is unique in that it lacks the terminal non-..cap alpha..-helical tail segment found in all other keratins. When the human 40-kDa keratin amino acid sequence is compared to the corresponding bovine keratin, the overall identity is 89%. The coil-forming regions are 89% identical and the head regions are 88% identical. This similarity is also evident in the DNA sequence of the coding region, the 5' upstream sequences, and the 3' noncoding sequences. The high degree of cross-species identity between bovine and human 40-kDa keratins suggests that there is strong evolutionary pressure to conserve the structure of this keratin. This in turn suggests an important and universal role for this intermediate filament subunit in all species.

  15. SDT: a virus classification tool based on pairwise sequence alignment and identity calculation.

    PubMed

    Muhire, Brejnev Muhizi; Varsani, Arvind; Martin, Darren Patrick

    2014-01-01

    The perpetually increasing rate at which viral full-genome sequences are being determined is creating a pressing demand for computational tools that will aid the objective classification of these genome sequences. Taxonomic classification approaches that are based on pairwise genetic identity measures are potentially highly automatable and are progressively gaining favour with the International Committee on Taxonomy of Viruses (ICTV). There are, however, various issues with the calculation of such measures that could potentially undermine the accuracy and consistency with which they can be applied to virus classification. Firstly, pairwise sequence identities computed based on multiple sequence alignments rather than on multiple independent pairwise alignments can lead to the deflation of identity scores with increasing dataset sizes. Also, when gap-characters need to be introduced during sequence alignments to account for insertions and deletions, methodological variations in the way that these characters are introduced and handled during pairwise genetic identity calculations can cause high degrees of inconsistency in the way that different methods classify the same sets of sequences. Here we present Sequence Demarcation Tool (SDT), a free user-friendly computer program that aims to provide a robust and highly reproducible means of objectively using pairwise genetic identity calculations to classify any set of nucleotide or amino acid sequences. SDT can produce publication quality pairwise identity plots and colour-coded distance matrices to further aid the classification of sequences according to ICTV approved taxonomic demarcation criteria. Besides a graphical interface version of the program for Windows computers, command-line versions of the program are available for a variety of different operating systems (including a parallel version for cluster computing platforms).

  16. Molecular identity of ramie germplasms using simple sequence repeat markers.

    PubMed

    Luan, M B; Chen, B F; Zou, Z Z; Zhu, J J; Wang, X F; Xu, Y; Sun, Z M; Chen, J H

    2015-03-27

    DNA identity is highly effective and efficient for distinguishing crop varieties regardless of their phenotypic similarities. To establish DNA identity in ramie, 21 simple sequence repeat primers were amplified in 108 accessions of domestic and exotic ramie germplasms. Sixty polymorphic bands were obtained, with an average of 2.9 bands per locus and 2-8 band types per primer locus (average of 5.19 band types). The Simpson's diversity index of the 21 simple sequence repeat loci ranged from 0.158 to 0.808 with an average of 0.612. There was large difference in the specific index in the germplasm tested, from 44.082 to 218.163, with an average of 83.620. Based on allele band type, 8 primer pairs were selected for DNA fingerprinting of the 108 genotypes. The combination of the 8 primer pairs were found to be very effective for distinguishing these genotypes, indicating that they can be used in the molecular DNA identity of ramie.

  17. The complete amino acid sequence of yeast phosphoglycerate kinase.

    PubMed Central

    Perkins, R E; Conroy, S C; Dunbar, B; Fothergill, L A; Tuite, M F; Dobson, M J; Kingsman, S M; Kingsman, A J

    1983-01-01

    The complete amino acid sequence of yeast phosphoglycerate kinase, comprising 415 residues, was determined. The sequence of residues 1-173 was deduced mainly from nucleotide sequence analysis of a series of overlapping fragments derived from the relevant portion of a 2.95-kilobase endonuclease-HindIII-digest fragment containing the yeast phosphoglycerate kinase gene. The sequence of residues 174-415 was deduced mainly from amino acid sequence analysis of three CNBr-cleavage fragments, and from peptides derived from these fragments after digestion by a number of proteolytic enzymes. Cleavage at the two tryptophan residues with o-iodosobenzoic acid was also used to isolate fragments suitable for amino acid sequence analysis. Determination of the complete sequence now allows a detailed interpretation of the existing high-resolution X-ray-crystallographic structure. The sequence -Ile-Ile-Gly-Gly-Gly- occurs twice in distant parts of the linear sequence (residues 232-236 and 367-371). Both these regions contribute to the nucleoside phosphate-binding site. A comparison of the sequence of yeast phosphoglycerate kinase reported here with the sequences of phosphoglycerate kinase from horse muscle and human erythrocytes shows that the yeast enzyme is 64% identical with the mammalian enzymes. The yeast has strikingly fewer methionine, cysteine and tryptophan residues. PMID:6347186

  18. Chip-based sequencing nucleic acids

    DOEpatents

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  19. Amino acid sequences of bacterial cytochromes c' and c-556.

    PubMed Central

    Ambler, R P; Bartsch, R G; Daniel, M; Kamen, M D; McLellan, L; Meyer, T E; Van Beeumen, J

    1981-01-01

    The cytochrome c' are electron transport proteins widely distributed in photosynthetic and aerobic bacteria. We report the amino acid sequences of the proteins from 12 different bacterial species, and we show by sequences that the cytochromes c-556 from 2 different bacteria are structurally related to the cytochromes c'. Unlike the mitochondrial cytochromes c, the heme binding site in the cytochromes c' and c-556 is near the COOH terminus. The cytochromes c-556 probably have a methionine sixth heme ligand located near the NH2 terminus, whereas the cytochromes c' may be pentacoordinate. Quantitative comparison of cytochrome c' and c-556 sequences indicates a relatively low 28% average identity. PMID:6273892

  20. High sequence homology between protein tyrosine acid phosphatase from boar seminal vesicles and human prostatic acid phosphatase.

    PubMed

    Wysocki, Paweł; Płucienniczak, Grazyna; Strzezek, Jerzy

    2009-01-01

    Boar seminal vesicle protein tyrosine acid phosphatase (PTAP) and human prostatic acid phosphatase (PAP) show high affinity for protein phosphotyrosine residues. The physico-chemical and kinetic properties of the boar and human enzymes are different. The main objective of this study was to establish the nucleotide sequence of cDNA encoding boar PTAP and compare it with that of human PAP cDNA. Also, the amino-acid sequence of boar PTAP was compared with the sequence of human PAP. PTAP was isolated from boar seminal vesicle fluid and sequenced. cDNA to boar seminal vesicle RNA was synthesized, amplified by PCR, cloned in E. coli and sequenced. The obtained N-terminal amino-acid sequence of boar PTAP showed 92% identity with the N-terminal amino-acid sequence of human PAP. The determined sequence of a 354 bp nucleotide fragment (GenBank accession number: GQ184596) showed 90% identity with the corresponding sequence of human PAP. On the basis of this sequence a 118 amino acid fragment of boar PTAP was predicted. This fragment showed 89% identity with the corresponding fragment of human PAP and had a similar hydropathy profile. The compared sequences differ in terms of their isoelectric points and amino-acid composition. This may explain the differences in substrate specificity and inhibitor resistance of boar PTAP and human PAP.

  1. How close is close: 16S rRNA sequence identity may not be sufficient to guarantee species identity

    NASA Technical Reports Server (NTRS)

    Fox, G. E.; Wisotzkey, J. D.; Jurtshuk, P. Jr

    1992-01-01

    16S rRNA (genes coding for rRNA) sequence comparisons were conducted with the following three psychrophilic strains: Bacillus globisporus W25T (T = type strain) and Bacillus psychrophilus W16AT, and W5. These strains exhibited more than 99.5% sequence identity and within experimental uncertainty could be regarded as identical. Their close taxonomic relationship was further documented by phenotypic similarities. In contrast, previously published DNA-DNA hybridization results have convincingly established that these strains do not belong to the same species if current standards are used. These results emphasize the important point that effective identity of 16S rRNA sequences is not necessarily a sufficient criterion to guarantee species identity. Thus, although 16S rRNA sequences can be used routinely to distinguish and establish relationships between genera and well-resolved species, very recently diverged species may not be recognizable.

  2. How close is close: 16S rRNA sequence identity may not be sufficient to guarantee species identity

    NASA Technical Reports Server (NTRS)

    Fox, G. E.; Wisotzkey, J. D.; Jurtshuk, P. Jr

    1992-01-01

    16S rRNA (genes coding for rRNA) sequence comparisons were conducted with the following three psychrophilic strains: Bacillus globisporus W25T (T = type strain) and Bacillus psychrophilus W16AT, and W5. These strains exhibited more than 99.5% sequence identity and within experimental uncertainty could be regarded as identical. Their close taxonomic relationship was further documented by phenotypic similarities. In contrast, previously published DNA-DNA hybridization results have convincingly established that these strains do not belong to the same species if current standards are used. These results emphasize the important point that effective identity of 16S rRNA sequences is not necessarily a sufficient criterion to guarantee species identity. Thus, although 16S rRNA sequences can be used routinely to distinguish and establish relationships between genera and well-resolved species, very recently diverged species may not be recognizable.

  3. Parameters of the proteome evolution from the distribution of sequence identities of paralogous proteins

    NASA Astrophysics Data System (ADS)

    Yan, Koon-Kiu; Axelsen, Jacob; Maslov, Sergei

    2006-03-01

    The evolution of the full repertoire of proteins encoded in a given genome is driven by gene duplications, deletions and modifications of amino-acid sequences of already existing proteins. The information about relative rates and other intrinsic parameters of these three basic processes is contained in the distribution of sequence identities of pairs of paralogous proteins. We introduced a simple mathematical framework that allows one to extract some of this hidden information. It was then applied to the proteome-wide set of paralogous proteins in H. pylori, E. coli, S. cerevisiae, C. elegans, D. melanogaster and H. sapiens. We estimated the stationary per-gene deletion and duplication rates, the distribution of amino-acid substitution rate of these organisms. The validity of our mathematical framework was further confirmed by numerical simulations of a simple evolutionary model of a fixed-size proteome.

  4. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  5. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  6. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  7. Structural gene and complete amino acid sequence of Vibrio alginolyticus collagenase.

    PubMed Central

    Takeuchi, H; Shibano, Y; Morihara, K; Fukushima, J; Inami, S; Keil, B; Gilles, A M; Kawamoto, S; Okuda, K

    1992-01-01

    The DNA encoding the collagenase of Vibrio alginolyticus was cloned, and its complete nucleotide sequence was determined. When the cloned gene was ligated to pUC18, the Escherichia coli expression vector, bacteria carrying the gene exhibited both collagenase antigen and collagenase activity. The open reading frame from the ATG initiation codon was 2442 bp in length for the collagenase structural gene. The amino acid sequence, deduced from the nucleotide sequence, revealed that the mature collagenase consists of 739 amino acids with an Mr of 81875. The amino acid sequences of 20 polypeptide fragments were completely identical with the deduced amino acid sequences of the collagenase gene. The amino acid composition predicted from the DNA sequence was similar to the chemically determined composition of purified collagenase reported previously. The analyses of both the DNA and amino acid sequences of the collagenase gene were rigorously performed, but we could not detect any significant sequence similarity to other collagenases. Images Fig. 2. PMID:1311172

  8. Nucleic acid (cDNA) and amino acid sequences of alpha-type gliadins from wheat (Triticum aestivum).

    PubMed Central

    Kasarda, D D; Okita, T W; Bernardin, J E; Baecker, P A; Nimmo, C C; Lew, E J; Dietler, M D; Greene, F C

    1984-01-01

    The complete amino acid sequence for an alpha-type gliadin protein of wheat (Triticum aestivum Linnaeus) endosperm has been derived from a cloned cDNA sequence. An additional cDNA clone that corresponds to about 75% of a similar alpha-type gliadin has been sequenced and shows some important differences. About 97% of the composite sequence of A-gliadin (an alpha-type gliadin fraction) has also been obtained by direct amino acid sequencing. This sequence shows a high degree of similarity with amino acid sequences derived from both cDNA clones and is virtually identical to one of them. On the basis of sequence information, after loss of the signal sequence, the mature alpha-type gliadins may be divided into five different domains, two of which may have evolved from an ancestral gliadin gene, whereas the remaining three contain repeating sequences that may have developed independently. Images PMID:6589619

  9. The amino acid sequence of iguana (Iguana iguana) pancreatic ribonuclease.

    PubMed

    Zhao, W; Beintema, J J; Hofsteenge, J

    1994-01-15

    The pyrimidine-specific ribonuclease superfamily constitutes a group of homologous proteins so far found only in higher vertebrates. Four separate families are found in mammals, which have resulted from gene duplications in mammalian ancestors. To learn more about the evolutionary history of this superfamily, the primary structure and other characteristics of the pancreatic enzyme from iguana (Iguana iguana), a herbivorous lizard species belonging to the reptiles, have been determined. The polypeptide chain consists of 119 amino acid residues. The positions of insertions and deletions in the sequence are identical to those in the enzyme from snapping turtle. However, the two enzymes differ at 54% of the amino acid positions. Iguana ribonuclease contains no carbohydrate, although the enzyme possesses three recognition sites for carbohydrate attachment, and has a high number of acidic residues in a localized part of the sequence.

  10. Amino acid sequence of porcine spleen cathepsin D.

    PubMed Central

    Shewale, J G; Tang, J

    1984-01-01

    The amino acid sequence of porcine spleen cathepsin D heavy chain has been determined and, hence, the complete structure of this enzyme is now known. The sequence of heavy chain was constructed by aligning the structures of peptides generated by cyanogen bromide, trypsin, and endo-proteinase Lys C cleavages. The structure of the light chain has been published previously. The cathepsin D molecule contains 339 amino acid residues in two polypeptide chains: a 97-residue light chain and a 242-residue heavy chain, with a combined Mr of 36,779 (without carbohydrate). There are two carbohydrate units linked to asparagine residues 70 and 192. The disulfide bond arrangement in cathepsin D is probably similar to that of pepsin, because the positions of six half-cystine residues are conserved. The active site aspartyl residues, corresponding to aspartic acid-32 and -215 of pepsin, are located at residues 33 and 224 in the cathepsin D molecule. The amino acid sequence around these aspartyl residues is strongly conserved. Cathepsin D shows a strong homology with other acid proteases. When the sequence of cathepsin D, renin, and pepsin are aligned, 32.7% of the residues are identical. The homology is observed throughout the length of the molecules, indicating that three-dimensional structures of all three molecules are similar. PMID:6587385

  11. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid sequence...

  12. Identities and relations associated with Lucas and some special sequences

    NASA Astrophysics Data System (ADS)

    Ozdemir, Gulsah; Simsek, Yilmaz

    2017-07-01

    In this article, by using generating functions method we derive some new identities and relations related to the Fibonacci numbers, Lucas numbers, Bernoulli numbers, Euler numbers. We also give a relation between Lucas numbers of order k and the numbers y3(n, k; λ; a, b) which were defined by the second author [7].

  13. Amino acid sequence and comparative antigenicity of chicken metallothionein.

    PubMed Central

    McCormick, C C; Fullmer, C S; Garvey, J S

    1988-01-01

    The complete amino acid sequence of metallothionein (MT) from chicken liver is reported. The primary structure was determined by automated sequence analysis of peptides produced by limited acid hydrolysis and by trypsin digestion. The comparative antigenicity of chicken MT was determined by radioimmunoassay using rabbit anti-rat MT polyclonal antibody. Chicken MT consists of 63 amino acids as compared to 61 found in MTs from mammals. One insertion (and two substitutions) occurs in the amino-terminal region, a region considered invariant among mammalian MTs. Eighteen of the 20 cysteines in chicken MT were aligned with cysteines from other mammalian sequences. Two cysteines near the carboxyl terminus are shifted by one residue due to the insertion of proline in that region. Overall, the chicken protein showed approximately equal to 68% sequence identity in a comparison with various mammalian MTs. The affinity of the polyclonal antibody for chicken MT was decreased by 2 orders of magnitude in comparison to that of a mammalian MT (rat MT isoforms). This reduced affinity is attributed to major substitutions in chicken MT in the regions of the principal determinants of mammalian MTs. Theoretical analysis of the primary structure predicted the secondary structure to consist of reverse turns and random coils with no stable beta or helix conformations. There is no evidence that chicken MT differs functionally from mammalian MTs. PMID:2448773

  14. Phenolic acid esterases, coding sequences and methods

    DOEpatents

    Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.

    2002-01-01

    Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.

  15. Comparative genomic analysis of equilibrative nucleoside transporters suggests conserved protein structure despite limited sequence identity.

    PubMed

    Sankar, Narendra; Machado, Jerry; Abdulla, Parween; Hilliker, Arthur J; Coe, Imogen R

    2002-10-15

    Equilibrative nucleoside transporters (ENTs) are a recently characterized and poorly understood group of membrane proteins that are important in the uptake of endogenous nucleosides required for nucleic acid and nucleoside triphosphate synthesis. Despite their central importance in cellular metabolism and nucleoside analog chemotherapy, no human ENT gene has been described and nothing is known about gene structure and function. To gain insight into the ENT gene family, we used experimental and in silico comparative genomic approaches to identify ENT genes in three evolutionarily diverse organisms with completely (or almost completely) sequenced genomes, Homo sapiens, Caenorhabditis elegans and Drosophila melanogaster. We describe the chromosomal location, the predicted ENT gene structure and putative structural topologies of predicted ENT proteins derived from the open reading frames. Despite variations in genomic layout and limited ortholog protein sequence identity (< or =27.45%), predicted topologies of ENT proteins are strikingly similar, suggesting an evolutionary conservation of a prototypic structure. In addition, a similar distribution of protein domains on exons is apparent in all three taxa. These data demonstrate that comparative sequence analyses should be combined with other approaches (such as genomic and proteomic analyses) to fully understand structure, function and evolution of protein families.

  16. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-07-21

    A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

  17. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

  18. Methods for analyzing nucleic acid sequences

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support Fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  19. Vocal generalization depends on gesture identity and sequence.

    PubMed

    Hoffmann, Lukas A; Sober, Samuel J

    2014-04-16

    Generalization, the brain's ability to transfer motor learning from one context to another, occurs in a wide range of complex behaviors. However, the rules of generalization in vocal behavior are poorly understood, and it is unknown how vocal learning generalizes across an animal's entire repertoire of natural vocalizations and sequences. Here, we asked whether generalization occurs in a nonhuman vocal learner and quantified its properties. We hypothesized that adaptive error correction of a vocal gesture produced in one sequence would generalize to the same gesture produced in other sequences. To test our hypothesis, we manipulated the fundamental frequency (pitch) of auditory feedback in Bengalese finches (Lonchura striata var. domestica) to create sensory errors during vocal gestures (song syllables) produced in particular sequences. As hypothesized, error-corrective learning on pitch-shifted vocal gestures generalized to the same gestures produced in other sequential contexts. Surprisingly, generalization magnitude depended strongly on sequential distance from the pitch-shifted syllables, with greater adaptation for gestures produced near to the pitch-shifted syllable. A further unexpected result was that nonshifted syllables changed their pitch in the direction opposite from the shifted syllables. This apparently antiadaptive pattern of generalization could not be explained by correlations between generalization and the acoustic similarity to the pitch-shifted syllable. These findings therefore suggest that generalization depends on the type of vocal gesture and its sequential context relative to other gestures and may reflect an advantageous strategy for vocal learning and maintenance.

  20. Vocal Generalization Depends on Gesture Identity and Sequence

    PubMed Central

    Sober, Samuel J.

    2014-01-01

    Generalization, the brain's ability to transfer motor learning from one context to another, occurs in a wide range of complex behaviors. However, the rules of generalization in vocal behavior are poorly understood, and it is unknown how vocal learning generalizes across an animal's entire repertoire of natural vocalizations and sequences. Here, we asked whether generalization occurs in a nonhuman vocal learner and quantified its properties. We hypothesized that adaptive error correction of a vocal gesture produced in one sequence would generalize to the same gesture produced in other sequences. To test our hypothesis, we manipulated the fundamental frequency (pitch) of auditory feedback in Bengalese finches (Lonchura striata var. domestica) to create sensory errors during vocal gestures (song syllables) produced in particular sequences. As hypothesized, error-corrective learning on pitch-shifted vocal gestures generalized to the same gestures produced in other sequential contexts. Surprisingly, generalization magnitude depended strongly on sequential distance from the pitch-shifted syllables, with greater adaptation for gestures produced near to the pitch-shifted syllable. A further unexpected result was that nonshifted syllables changed their pitch in the direction opposite from the shifted syllables. This apparently antiadaptive pattern of generalization could not be explained by correlations between generalization and the acoustic similarity to the pitch-shifted syllable. These findings therefore suggest that generalization depends on the type of vocal gesture and its sequential context relative to other gestures and may reflect an advantageous strategy for vocal learning and maintenance. PMID:24741046

  1. Complete amino acid sequence of three reptile lysozymes.

    PubMed

    Ponkham, Pornpimol; Daduang, Sakda; Kitimasak, Wachira; Krittanai, Chartchai; Chokchaichamnankit, Daranee; Srisomsap, Chantragan; Svasti, Jisnuson; Kawamura, Shunsuke; Araki, Tomohiro; Thammasirirak, Sompong

    2010-01-01

    To study the structure and function of reptile lysozymes, we have reported their purification, and in this study we have established the amino acid sequence of three egg white lysozymes in soft-shelled turtle eggs (SSTL A and SSTL B from Trionyx sinensis, ASTL from Amyda cartilaginea) by using the rapid peptide mapping method. The established amino acid sequence of SSTL A, SSTL B, and ASTL showed substitutions of 43, 42, and 44 residues respectively when compared with the HEWL (hen egg white lysozyme) sequence. In these reptile lysozymes, SSTL A had one substitution compared with SSTL B (Gly126Asp) and had an N-terminal extra Gly and 11 substitutions compared with ASTL. SSTL B had an N-terminal extra Gly and 10 residues different from ASTL. The sequence of SSTL B was identical to soft-shelled turtle lysozyme from STL (Trionyx sinensis japonicus). The Ile residue at position 93 of ASTL is the first report in all C-type lysozymes. Furthermore, amino acid substitutions (Phe34His, Arg45Tyr, Thr47Arg, and Arg114Tyr) were also found at subsites E and F when compared with HEWL. The time course using N-acetylglucosamine pentamer as a substrate exhibited a reduction of the rate constant of glycosidic cleavage and increase of binding free energy for subsites E and F, which proved the contribution for amino acids mentioned above for substrate binding at subsites E and F. Interestingly, the variable binding free energy values occurred on ASTL, may be contributed from substitutions at outside of subsites E and F.

  2. New complete mitochondrial DNA sequence of the lancelet Branchiostoma lanceolatum (Cephalochordata) and the identity of this species' sequences.

    PubMed

    Nohara, Masahiro; Nishida, Mutsumi; Nishikawa, Teruaki

    2005-06-01

    Three mitochondrial (mt) genes were sequenced for two Atlantic lancelet species, Branchiostoma lanceolatum and B. floridae, to examine a serious discrepancy among previously published results of molecular studies: substantial sequence difference in a nuclear gene vs. virtual identity in the mt genome sequence. The results revealed that three mt genes of B. lanceolatum, collected from Helgoland in the North Sea and Naples in the Mediterranean, were quite diverged from those of B. floridae, collected from Tampa Bay, Florida. Therefore, the previously recognized identity in the mt genome between the two species is attributable to misidentification of materials used. To correct this misleading information, the complete mtDNA sequence of B. lanceolatum was determined for an individual from Helgoland.

  3. Nucleotide and deduced amino acid sequences of a new subtilisin from an alkaliphilic Bacillus isolate.

    PubMed

    Saeki, Katsuhisa; Magallones, Marietta V; Takimura, Yasushi; Hatada, Yuji; Kobayashi, Tohru; Kawai, Shuji; Ito, Susumu

    2003-10-01

    The gene for a new subtilisin from the alkaliphilic Bacillus sp. KSM-LD1 was cloned and sequenced. The open reading frame of the gene encoded a 97 amino-acid prepro-peptide plus a 307 amino-acid mature enzyme that contained a possible catalytic triad of residues, Asp32, His66, and Ser224. The deduced amino acid sequence of the mature enzyme (LD1) showed approximately 65% identity to those of subtilisins SprC and SprD from alkaliphilic Bacillus sp. LG12. The amino acid sequence identities of LD1 to those of previously reported true subtilisins and high-alkaline proteases were below 60%. LD1 was characteristically stable during incubation with surfactants and chemical oxidants. Interestingly, an oxidizable Met residue is located next to the catalytic Ser224 of the enzyme as in the cases of the oxidation-susceptible subtilisins reported to date.

  4. Amino acid sequence of a mouse immunoglobulin mu chain.

    PubMed Central

    Kehry, M; Sibley, C; Fuhrman, J; Schilling, J; Hood, L E

    1979-01-01

    The complete amino acid sequence of the mouse mu chain from the BALB/c myeloma tumor MOPC 104E is reported. The C mu region contains four consecutive homology regions of approximately 110 residues and a COOH-terminal region of 19 residues. A comparison of this mu chain from mouse with a complete mu sequence from human (Ou) and a partial mu chain sequence from dog (Moo) reveals a striking gradient of increasing homology from the NH2-terminal to the COOH-terminal portion of these mu chains, with the former being the least and the latter the most highly conserved. Four of the five sites of carbohydrate attachment appear to be at identical residue positions when the constant regions of the mouse and human mu chains are compared. The mu chain of MOPC 104E has a carbohydrate moiety attached in the second hypervariable region. This is particularly interesting in view of the fact that MOPC 104E binds alpha-(1 leads to 3)-dextran, a simple carbohydrate. The structural and functional constraints imposed by these comparative sequence analyses are discussed. PMID:111247

  5. Porcine proinsulin: characterization and amino acid sequence.

    PubMed

    Chance, R E; Ellis, R M; Bromer, W W

    1968-07-12

    Proinsulin in nearly homogeneous form has been isolated from a preparation of porcine insulin. A molecular weight close to 9100 was calculated from the amino acid composition and from sedimentation-equilibrium studies. Through the action of trypsin this single-chain protein is transformed to desalanine insulin by cleavage of a polypeptide chain connecting the carboxy-terminus of the B chain to the amino-terminus of the A chain of insulin. The amino acid sequence of this connecting peptide was found to be Arg-Arg-Glu-Ala-Gln-Asn-Pro-Gln-Ala-Gly-Ala-Val-Glu-Leu-Gly-Gly-Gly-Leu-Gly-Gly-Leu-Gln-Ala-Leu-Ala-Leu-Glu-Gly-Pro-Pro-Gln-Lys-Arg.

  6. Identical ribosomal DNA sequence data from Pfiesteria piscicida (Dinophyceae) isolates with different toxicity phenotypes.

    PubMed

    Tengs, Torstein; Bowers, Holly A; Glasgow, Howard B; Burkholder, JoAnn M; Oldach, David W

    2003-09-01

    Complete small subunit ribosomal RNA, internal transcribed spacer 1 and 2, 5.8S, and partial large subunit ribosomal RNA gene sequences were generated from multiple isolates of Pfiesteria piscicida. Sequences were derived from isolates that have been shown to be ichthyotoxic as well as isolates that have no history of toxic behavior. All of the sequences generated were identical for the different cultures, and we therefore conclude that differences in toxicity seen between isolates of P. piscicida are linked to factors other than genetic strain variation detectable by ribosomal gene sequence analyses.

  7. Detection of nucleic acid sequences by invader-directed cleavage

    DOEpatents

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.

  8. Proviruses with identical sequences comprise a large fraction of the replication-competent HIV reservoir.

    PubMed

    Bui, John K; Sobolewski, Michele D; Keele, Brandon F; Spindler, Jonathan; Musick, Andrew; Wiegand, Ann; Luke, Brian T; Shao, Wei; Hughes, Stephen H; Coffin, John M; Kearney, Mary F; Mellors, John W

    2017-03-01

    The major obstacle to curing HIV infection is the persistence of cells with intact proviruses that can produce replication-competent virus. This HIV reservoir is believed to exist primarily in CD4+ T-cells and is stable despite years of suppressive antiretroviral therapy. A potential mechanism for HIV persistence is clonal expansion of infected cells, but how often such clones carry replication-competent proviruses has been controversial. Here, we used single-genome sequencing to probe for identical HIV sequence matches among viruses recovered in different viral outgrowth cultures and between the sequences of outgrowth viruses and proviral or intracellular HIV RNA sequences in uncultured blood mononuclear cells from eight donors on suppressive ART with diverse proviral populations. All eight donors had viral outgrowth virus that was fully susceptible to their current ART drug regimen. Six of eight donors studied had identical near full-length HIV RNA sequences recovered from different viral outgrowth cultures, and one of the two remaining donors had identical partial viral sequence matches between outgrowth virus and intracellular HIV RNA. These findings provide evidence that clonal expansion of HIV-infected cells is an important mechanism of reservoir persistence that should be targeted to cure HIV infection.

  9. Proviruses with identical sequences comprise a large fraction of the replication-competent HIV reservoir

    PubMed Central

    Sobolewski, Michele D.; Keele, Brandon F.; Spindler, Jonathan; Wiegand, Ann; Shao, Wei; Hughes, Stephen H.; Coffin, John M.; Kearney, Mary F.; Mellors, John W.

    2017-01-01

    The major obstacle to curing HIV infection is the persistence of cells with intact proviruses that can produce replication-competent virus. This HIV reservoir is believed to exist primarily in CD4+ T-cells and is stable despite years of suppressive antiretroviral therapy. A potential mechanism for HIV persistence is clonal expansion of infected cells, but how often such clones carry replication-competent proviruses has been controversial. Here, we used single-genome sequencing to probe for identical HIV sequence matches among viruses recovered in different viral outgrowth cultures and between the sequences of outgrowth viruses and proviral or intracellular HIV RNA sequences in uncultured blood mononuclear cells from eight donors on suppressive ART with diverse proviral populations. All eight donors had viral outgrowth virus that was fully susceptible to their current ART drug regimen. Six of eight donors studied had identical near full-length HIV RNA sequences recovered from different viral outgrowth cultures, and one of the two remaining donors had identical partial viral sequence matches between outgrowth virus and intracellular HIV RNA. These findings provide evidence that clonal expansion of HIV-infected cells is an important mechanism of reservoir persistence that should be targeted to cure HIV infection. PMID:28328934

  10. Identical Genotype B3 Sequences from Measles Patients in 4 Countries, 2005

    PubMed Central

    Lowe, Luis; Rota, Paul; Bellini, William; Redd, Susan; Dayan, Gustavo; van Binnendijk, Rob; Hahné, Susan; Tipples, Graham; Macey, Jeannette; Espinoza, Rita; Posey, Drew; Plummer, Andrew; Bateman, John; Gudiño, José; Cruz-Ramirez, Edith; Lopez-Martinez, Irma; Anaya-Lopez, Luis; Akwar, Teneg Holy; Giffin, Scott; Carrión, Verónica; Bispo de Filippis, Ana Maria; Vicari, Andrea; Tan, Christina; Wolf, Bruce; Wytovich, Katherine; Borus, Peter; Mbugua, Francis; Chege, Paul; Kombich, Janeth; Akoua-Koffi, Chantal; Smit, Sheilagh; Bukenya, Henry; Bwogi, Josephine; Baliraine, Frederick Ndhoga; Kremer, Jacques; Muller, Claude; Santibanez, Sabine

    2006-01-01

    Surveillance of measles virus detected an epidemiologic link between a refugee from Kenya and a Dutch tourist in New Jersey, USA. Identical genotype B3 sequences from patients with contemporaneous cases in the United States, Canada, and Mexico in November and December 2005 indicate that Kenya was likely to have been the common source of virus. PMID:17283637

  11. Identical genotype B3 sequences from measles patients in 4 countries, 2005.

    PubMed

    Rota, Jennifer; Lowe, Luis; Rota, Paul; Bellini, William; Redd, Susan; Dayan, Gustavo; van Binnendijk, Rob; Hahné, Susan; Tipples, Graham; Macey, Jeannette; Espinoza, Rita; Posey, Drew; Plummer, Andrew; Bateman, John; Gudiño, José; Cruz-Ramirez, Edith; Lopez-Martinez, Irma; Anaya-Lopez, Luis; Holy Akwar, Teneg; Giffin, Scott; Carrión, Verónica; de Filippis, Ana Maria Bispo; Vicari, Andrea; Tan, Christina; Wolf, Bruce; Wytovich, Katherine; Borus, Peter; Mbugua, Francis; Chege, Paul; Kombich, Janeth; Akoua-Koffi, Chantal; Smit, Sheilagh; Bukenya, Henry; Bwogi, Josephine; Baliraine, Frederick Ndhoga; Kremer, Jacques; Muller, Claude; Santibanez, Sabine

    2006-11-01

    Surveillance of measles virus detected an epidemiologic link between a refugee from Kenya and a Dutch tourist in New Jersey, USA. Identical genotype B3 sequences from patients with contemporaneous cases in the United States, Canada, and Mexico in November and December 2005 indicate that Kenya was likely to have been the common source of virus.

  12. Sequences of Citrus tristeza virus separated in time and space are essentially identical.

    PubMed

    Albiach-Martí, M R; Mawassi, M; Gowda, S; Satyanarayana, T; Hilf, M E; Shanker, S; Almira, E C; Vives, M C; López, C; Guerri, J; Flores, R; Moreno, P; Garnsey, S M; Dawson, W O

    2000-08-01

    The first Citrus tristeza virus (CTV) genomes completely sequenced (19.3-kb positive-sense RNA), from four biologically distinct isolates, are unexpectedly divergent in nucleotide sequence (up to 60% divergence). Understanding of whether these large sequence differences resulted from recent evolution is important for the design of disease management strategies, particularly the use of genetically engineered mild (essentially symptomless)-strain cross protection and RNA-mediated transgenic resistance. The complete sequence of a mild isolate (T30) which has been endemic in Florida for about a century was found to be nearly identical to the genomic sequence of a mild isolate (T385) from Spain. Moreover, samples of sequences of other isolates from distinct geographic locations, maintained in different citrus hosts and also separated in time (B252 from Taiwan, B272 from Colombia, and B354 from California), were nearly identical to the T30 sequence. The sequence differences between these isolates were within or near the range of variability of the T30 population. A possible explanation for these results is that the parents of isolates T30, T385, B252, B272, and B354 have a common origin, probably Asia, and have changed little since they were dispersed throughout the world by the movement of citrus. Considering that the nucleotide divergence among the other known CTV genomes is much greater than those expected for strains of the same virus, the remarkable similarity of these five isolates indicates a high degree of evolutionary stasis in some CTV populations.

  13. Hybridization and sequencing of nucleic acids using base pair mismatches

    DOEpatents

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  14. Transitive homology-guided structural studies lead to discovery of Cro proteins with 40% sequence identity but different folds.

    PubMed

    Roessler, Christian G; Hall, Branwen M; Anderson, William J; Ingram, Wendy M; Roberts, Sue A; Montfort, William R; Cordes, Matthew H J

    2008-02-19

    Proteins that share common ancestry may differ in structure and function because of divergent evolution of their amino acid sequences. For a typical diverse protein superfamily, the properties of a few scattered members are known from experiment. A satisfying picture of functional and structural evolution in relation to sequence changes, however, may require characterization of a larger, well chosen subset. Here, we employ a "stepping-stone" method, based on transitive homology, to target sequences intermediate between two related proteins with known divergent properties. We apply the approach to the question of how new protein folds can evolve from preexisting folds and, in particular, to an evolutionary change in secondary structure and oligomeric state in the Cro family of bacteriophage transcription factors, initially identified by sequence-structure comparison of distant homologs from phages P22 and lambda. We report crystal structures of two Cro proteins, Xfaso 1 and Pfl 6, with sequences intermediate between those of P22 and lambda. The domains show 40% sequence identity but differ by switching of alpha-helix to beta-sheet in a C-terminal region spanning approximately 25 residues. Sedimentation analysis also suggests a correlation between helix-to-sheet conversion and strengthened dimerization.

  15. Purification, characterization and partial amino acid sequence of glycogen synthase from Saccharomyces cerevisiae.

    PubMed Central

    Carabaza, A; Arino, J; Fox, J W; Villar-Palasi, C; Guinovart, J J

    1990-01-01

    Glycogen synthase from Saccharomyces cerevisiae was purified to homogeneity. The enzyme showed a subunit molecular mass of 80 kDa. The holoenzyme appears to be a tetramer. Antibodies developed against purified yeast glycogen synthase inactivated the enzyme in yeast extracts and allowed the detection of the protein in Western blots. Amino acid analysis showed that the enzyme is very rich in glutamate and/or glutamine residues. The N-terminal sequence (11 amino acid residues) was determined. In addition, selected tryptic-digest peptides were purified by reverse-phase h.p.l.c. and submitted to gas-phase sequencing. Up to eight sequences (79 amino acid residues) could be aligned with the human muscle enzyme sequence. Levels of identity range between 37 and 100%, indicating that, although human and yeast glycogen synthases probably share some conserved regions, significant differences in their primary structure should be expected. Images Fig. 1. Fig. 2. Fig. 3. PMID:2114092

  16. Large distribution and high sequence identity of a Copia-type retrotransposon in angiosperm families.

    PubMed

    Dias, Elaine Silva; Hatt, Clémence; Hamon, Serge; Hamon, Perla; Rigoreau, Michel; Crouzillat, Dominique; Carareto, Claudia Marcia Aparecida; de Kochko, Alexandre; Guyot, Romain

    2015-09-01

    Retrotransposons are the main component of plant genomes. Recent studies have revealed the complexity of their evolutionary dynamics. Here, we have identified Copia25 in Coffea canephora, a new plant retrotransposon belonging to the Ty1-Copia superfamily. In the Coffea genomes analyzed, Copia25 is present in relatively low copy numbers and transcribed. Similarity sequence searches and PCR analyses show that this retrotransposon with LTRs (Long Terminal Repeats) is widely distributed among the Rubiaceae family and that it is also present in other distantly related species belonging to Asterids, Rosids and monocots. A particular situation is the high sequence identity found between the Copia25 sequences of Musa, a monocot, and Ixora, a dicot species (Rubiaceae). Our results reveal the complexity of the evolutionary dynamics of the ancient element Copia25 in angiosperm, involving several processes including sequence conservation, rapid turnover, stochastic losses and horizontal transfer.

  17. Quantification of the variation in percentage identity for protein sequence alignments

    PubMed Central

    Raghava, GPS; Barton, Geoffrey J

    2006-01-01

    Background Percentage Identity (PID) is frequently quoted in discussion of sequence alignments since it appears simple and easy to understand. However, although there are several different ways to calculate percentage identity and each may yield a different result for the same alignment, the method of calculation is rarely reported. Accordingly, quantification of the variation in PID caused by the different calculations would help in interpreting PID values in the literature. In this study, the variation in PID was quantified systematically on a reference set of 1028 alignments generated by comparison of the protein three-dimensional structures. Since the alignment algorithm may also affect the range of PID, this study also considered the effect of algorithm, and the combination of algorithm and PID method. Results The maximum variation in PID due to the calculation method was 11.5% while the effect of alignment algorithm on PID was up to 14.6% across three popular alignment methods. The combined effect of alignment algorithm and PID calculation gave a variation of up to 22% on the test data, with an average of 5.3% ± 2.8% for sequence pairs with < 30% identity. In order to see which PID method was most highly correlated with structural similarity, four different PID calculations were compared to similarity scores (Sc) from the comparison of the corresponding protein three-dimensional structures. The highest correlation coefficient for a PID calculation was 0.80. In contrast, the more sophisticated Z-score calculated by reference to randomized sequences gave a correlation coefficient of 0.84. Conclusion Although it is well known amongst expert sequence analysts that PID is a poor score for discriminating between protein sequences, the apparent simplicity of the percentage identity score encourages its widespread use in establishing cutoffs for structural similarity. This paper illustrates that not only is PID a poor measure of sequence similarity when compared to

  18. Mojave rattlesnakes (Crotalus scutulatus scutulatus) lacking the acidic subunit DNA sequence lack Mojave toxin in their venom.

    PubMed

    Wooldridge, B J; Pineda, G; Banuelas-Ornelas, J J; Dagda, R K; Gasanov, S E; Rael, E D; Lieb, C S

    2001-09-01

    The venom composition of Mojave rattlesnakes (Crotalus scutulatus scutulatus) differs in that some individuals have Mojave toxin and others do not. In order to understand the genetic basis for this difference, genomic DNA samples from Mojave rattlesnakes collected in Arizona, New Mexico, and Texas were analyzed for the presence of DNA sequences that relate to the acidic (Mta) and basic (Mtb) subunits of this toxin. DNA samples were subjected to PCR to amplify nucleotide sequences from second to fourth exons of the acidic and basic subunits. These nucleotide sequences were cloned and sequenced. The nucleotide sequences generated aligned exactly to previously published nucleotide sequences of Mojave toxin. All DNA samples analyzed generated product using the basic subunit primers, and aligned identically to the Mtb nucleotide sequence. However, only 11 out of the 14 samples generated a product with the acidic subunit primers. These 11 sequences aligned identically to the Mta nucleotide sequence. The venom from the three snakes whose DNA did not amplify with the acidic subunit primers were not recognized by antibodies to Mojave toxin. This suggests that snakes with venom lacking Mojave toxin also lack the productive nucleotide sequence for the acidic subunit in their DNA.

  19. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2002-01-01

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  20. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2006-07-04

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  1. Kit for detecting nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2001-01-01

    A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the

  2. In silico comparative analysis of DNA and amino acid sequences for prion protein gene.

    PubMed

    Kim, Y; Lee, J; Lee, C

    2008-01-01

    Genetic variability might contribute to species specificity of prion diseases in various organisms. In this study, structures of the prion protein gene (PRNP) and its amino acids were compared among species of which sequence data were available. Comparisons of PRNP DNA sequences among 12 species including human, chimpanzee, monkey, bovine, ovine, dog, mouse, rat, wallaby, opossum, chicken and zebrafish allowed us to identify candidate regulatory regions in intron 1 and 3'-untranslated region (UTR) in addition to the coding region. Highly conserved putative binding sites for transcription factors, such as heat shock factor 2 (HSF2) and myocite enhancer factor 2 (MEF2), were discovered in the intron 1. In 3'-UTR, the functional sequence (ATTAAA) for nucleus-specific polyadenylation was found in all the analysed species. The functional sequence (TTTTTAT) for maturation-specific polyadenylation was identically observed only in ovine, and one or two nucleotide mismatches in the other species. A comparison of the amino acid sequences in 53 species revealed a large sequence identity. Especially the octapeptide repeat region was observed in all the species but frog and zebrafish. Functional changes and susceptibility to prion diseases with various isoforms of prion protein could be caused by numeric variability and conformational changes discovered in the repeat sequences.

  3. Detecting Identity by Descent and Estimating Genotype Error Rates in Sequence Data

    PubMed Central

    Browning, Brian L.; Browning, Sharon R.

    2013-01-01

    Existing methods for identity by descent (IBD) segment detection were designed for SNP array data, not sequence data. Sequence data have a much higher density of genetic variants and a different allele frequency distribution, and can have higher genotype error rates. Consequently, best practices for IBD detection in SNP array data do not necessarily carry over to sequence data. We present a method, IBDseq, for detecting IBD segments in sequence data and a method, SEQERR, for estimating genotype error rates at low-frequency variants by using detected IBD. The IBDseq method estimates probabilities of genotypes observed with error for each pair of individuals under IBD and non-IBD models. The ratio of estimated probabilities under the two models gives a LOD score for IBD. We evaluate several IBD detection methods that are fast enough for application to sequence data (IBDseq, Beagle Refined IBD, PLINK, and GERMLINE) under multiple parameter settings, and we show that IBDseq achieves high power and accuracy for IBD detection in sequence data. The SEQERR method estimates genotype error rates by comparing observed and expected rates of pairs of homozygote and heterozygote genotypes at low-frequency variants in IBD segments. We demonstrate the accuracy of SEQERR in simulated data, and we apply the method to estimate genotype error rates in sequence data from the UK10K and 1000 Genomes projects. PMID:24207118

  4. The amino acid sequence of wood duck lysozyme.

    PubMed

    Araki, T; Torikata, T

    1999-01-01

    The amino acid sequence of wood duck (Aix sponsa) lysozyme was analyzed. Carboxymethylated lysozyme was digested with trypsin and the resulting peptides were sequenced. The established amino acid sequence had the highest similarity to duck III lysozyme with four amino acid substitutions, and had eighteen amino acid substitutions from chicken lysozyme. The valine at position 75 was newly detected in chicken-type lysozymes. In the active site, Tyr34 and Glu57 were found at subsites F and D, respectively, when compared with chicken lysozyme.

  5. Folding pathways of proteins with increasing degree of sequence identities but different structure and function.

    PubMed

    Giri, Rajanish; Morrone, Angela; Travaglini-Allocatelli, Carlo; Jemth, Per; Brunori, Maurizio; Gianni, Stefano

    2012-10-30

    Much experimental work has been devoted in comparing the folding behavior of proteins sharing the same fold but different sequence. The recent design of proteins displaying very high sequence identities but different 3D structure allows the unique opportunity to address the protein-folding problem from a complementary perspective. Here we explored by Φ-value analysis the pathways of folding of three different heteromorphic pairs, displaying increasingly high-sequence identity (namely, 30%, 77%, and 88%), but different structures called G(A) (a 3-α helix fold) and G(B) (an α/β fold). The analysis, based on 132 site-directed mutants, is fully consistent with the idea that protein topology is committed very early along the pathway of folding. Furthermore, data reveals that when folding approaches a perfect two-state scenario, as in the case of the G(A) domains, the structural features of the transition state appear very robust to changes in sequence composition. On the other hand, when folding is more complex and multistate, as for the G(B)s, there are alternative nuclei or accessible pathways that can be alternatively stabilized by altering the primary structure. The implications of our results in the light of previous work on the folding of different members belonging to the same protein family are discussed.

  6. Nearly identical bacteriophage structural gene sequences are widely distributed in both marine and freshwater environments.

    PubMed

    Short, Cindy M; Suttle, Curtis A

    2005-01-01

    Primers were designed to amplify a 592-bp region within a conserved structural gene (g20) found in some cyanophages. The goal was to use this gene as a proxy to infer genetic richness in natural cyanophage communities and to determine if sequences were more similar in similar environments. Gene products were amplified from samples from the Gulf of Mexico, the Arctic, Southern, and Northeast and Southeast Pacific Oceans, an Arctic cyanobacterial mat, a catfish production pond, lakes in Canada and Germany, and a depth of ca. 3,246 m in the Chuckchi Sea. Amplicons were separated by denaturing gradient gel electrophoresis, and selected bands were sequenced. Phylogenetic analysis revealed four previously unknown groups of g20 clusters, two of which were entirely found in freshwater. Also, sequences with >99% identities were recovered from environments that differed greatly in temperature and salinity. For example, nearly identical sequences were recovered from the Gulf of Mexico, the Southern Pacific Ocean, an Arctic freshwater cyanobacterial mat, and Lake Constance, Germany. These results imply that closely related hosts and the viruses infecting them are distributed widely across environments or that horizontal gene exchange occurs among phage communities from very different environments. Moreover, the amplification of g20 products from deep in the cyanobacterium-sparse Chuckchi Sea suggests that this primer set targets bacteriophages other than those infecting cyanobacteria.

  7. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    States, David J.

    2004-07-28

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  8. Isolation and amino-acid sequence determination of monkey insulin and proinsulin.

    PubMed

    Naithani, V K; Steffens, G J; Tager, H S; Buse, G; Rubenstein, A H; Steiner, D F

    1984-05-01

    Insulin has been isolated and purified from rhesus monkey pancreas by means of acid-ethanol extraction, gel filtration and ion exchange chromatography. The complete amino-acid sequence of the hormone has been determined by amino-acid analysis of the oxidized A- and B-chains, by end group determination, by the identification of the C-terminal residues (AsnA21 and ThrB30) by carboxypeptidase A digestion and by Edman degradation of the S-carboxymethylated A- and B-chains. The 51-residue monkey insulin was shown to be identical to human insulin. From the known insulin and C-peptide sequence the primary sequence of monkey proinsulin has been proposed.

  9. Solid phase sequencing of double-stranded nucleic acids

    DOEpatents

    Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

    2002-01-01

    This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probe comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.

  10. Sequences encoding identical peptides for the analysis and manipulation of coding DNA

    PubMed Central

    Sánchez, Joaquín

    2013-01-01

    The use of sequences encoding identical peptides (SEIP) for the in silico analysis of coding DNA from different species has not been reported; the study of such sequences could directly reveal properties of coding DNA that are independent of peptide sequences. For practical purposes SEIP might also be manipulated for e.g. heterologous protein expression. We extracted 1,551 SEIP from human and E. coli and 2,631 SEIP from human and D. melanogaster. We then analyzed codon usage and intercodon dinucleotide tendencies and found differences in both, with more conspicuous disparities between human and E. coli than between human and D. melanogaster. We also briefly manipulated SEIP to find out if they could be used to create new coding sequences. We hence attempted replacement of human by E. coli codons via dicodon exchange but found that full replacement was not possible, this indicated robust species-specific dicodon tendencies. To test another form of codon replacement we isolated SEIP from human and the jellyfish green fluorescent protein (GFP) and we then re-constructed the GFP coding DNA with human tetra-peptide-coding sequences. Results provide proof-of-principle that SEIP may be used to reveal differences in the properties of coding DNA and to reconstruct in pieces a protein coding DNA with sequences from a different organism, the latter might be exploited in heterologous protein expression. PMID:23861567

  11. Sequences encoding identical peptides for the analysis and manipulation of coding DNA.

    PubMed

    Sánchez, Joaquín

    2013-01-01

    The use of sequences encoding identical peptides (SEIP) for the in silico analysis of coding DNA from different species has not been reported; the study of such sequences could directly reveal properties of coding DNA that are independent of peptide sequences. For practical purposes SEIP might also be manipulated for e.g. heterologous protein expression. We extracted 1,551 SEIP from human and E. coli and 2,631 SEIP from human and D. melanogaster. We then analyzed codon usage and intercodon dinucleotide tendencies and found differences in both, with more conspicuous disparities between human and E. coli than between human and D. melanogaster. We also briefly manipulated SEIP to find out if they could be used to create new coding sequences. We hence attempted replacement of human by E. coli codons via dicodon exchange but found that full replacement was not possible, this indicated robust species-specific dicodon tendencies. To test another form of codon replacement we isolated SEIP from human and the jellyfish green fluorescent protein (GFP) and we then re-constructed the GFP coding DNA with human tetra-peptide-coding sequences. Results provide proof-of-principle that SEIP may be used to reveal differences in the properties of coding DNA and to reconstruct in pieces a protein coding DNA with sequences from a different organism, the latter might be exploited in heterologous protein expression.

  12. Identification of tropomyosins as major allergens in antarctic krill and mantis shrimp and their amino acid sequence characteristics.

    PubMed

    Motoyama, Kanna; Suma, Yota; Ishizaki, Shoichiro; Nagashima, Yuji; Lu, Ying; Ushio, Hideki; Shiomi, Kazuo

    2008-01-01

    Tropomyosin represents a major allergen of decapod crustaceans such as shrimps and crabs, and its highly conserved amino acid sequence (>90% identity) is a molecular basis of the immunoglobulin E (IgE) cross-reactivity among decapods. At present, however, little information is available about allergens in edible crustaceans other than decapods. In this study, the major allergen in two species of edible crustaceans, Antarctic krill Euphausia superba and mantis shrimp Oratosquilla oratoria that are taxonomically distinct from decapods, was demonstrated to be tropomyosin by IgE-immunoblotting using patient sera. The cross-reactivity of the tropomyosins from both species with decapod tropomyosins was also confirmed by inhibition IgE immunoblotting. Sequences of the tropomyosins from both species were determined by complementary deoxyribonucleic acid cloning. The mantis shrimp tropomyosin has high sequence identity (>90% identity) with decapod tropomyosins, especially with fast-type tropomyosins. On the other hand, the Antarctic krill tropomyosin is characterized by diverse alterations in region 13-42, the amino acid sequence of which is highly conserved for decapod tropomyosins, and hence, it shares somewhat lower sequence identity (82.4-89.8% identity) with decapod tropomyosins than the mantis shrimp tropomyosin. Quantification by enzyme-linked immunosorbent assay revealed that Antarctic krill contains tropomyosin at almost the same level as decapods, suggesting that its allergenicity is equivalent to decapods. However, mantis shrimp was assumed to be substantially not allergenic because of the extremely low content of tropomyosin.

  13. Soil amino acid composition across a boreal forest successional sequence

    Treesearch

    Nancy R. Werdin-Pfisterer; Knut Kielland; Richard D. Boone

    2009-01-01

    Soil amino acids are important sources of organic nitrogen for plant nutrition, yet few studies have examined which amino acids are most prevalent in the soil. In this study, we examined the composition, concentration, and seasonal patterns of soil amino acids across a primary successional sequence encompassing a natural gradient of plant productivity and soil...

  14. Amino acid sequence around the active-site serine residue in the acyltransferase domain of goat mammary fatty acid synthetase.

    PubMed Central

    Mikkelsen, J; Højrup, P; Rasmussen, M M; Roepstorff, P; Knudsen, J

    1985-01-01

    Goat mammary fatty acid synthetase was labelled in the acyltransferase domain by formation of O-ester intermediates by incubation with [1-14C]acetyl-CoA and [2-14C]malonyl-CoA. Tryptic-digest and CNBr-cleavage peptides were isolated and purified by high-performance reverse-phase and ion-exchange liquid chromatography. The sequences of the malonyl- and acetyl-labelled peptides were shown to be identical. The results confirm the hypothesis that both acetyl and malonyl groups are transferred to the mammalian fatty acid synthetase complex by the same transferase. The sequence is compared with those of other fatty acid synthetase transferases. PMID:3922356

  15. The complete amino acid sequence of growth hormone of an elasmobranch, the blue shark (Prionace glauca).

    PubMed

    Yamaguchi, K; Yasuda, A; Lewis, U J; Yokoo, Y; Kawauchi, H

    1989-02-01

    The complete amino acid sequence of growth hormone (GH) from a phylogenetically ancient fish, the blue shark (Prionace glauca), was determined. The shark GH isolated from pituitary glands by U. J. Lewis, R. N. P. Singh, B. K. Seavey, R. Lasker, and G. E. Pickford (1972, Fish. Bull. 70, 933-939) was purified by reversed-phase high-performance liquid chromatography. The hormone was reduced, carboxymethylated, and subsequently cleaved in turn with cyanogen bromide and Staphylococcus aureus protease. The intact protein was also cleaved with lysyl endopeptidase and o-iodosobenzoic acid. The resulting peptide fragments were separated by rpHPLC and submitted to sequence analysis by automated and manual Edman methods. The shark GH consists of 183 amino acid residues with a calculated molecular weight of 21,081. Sequence comparisons revealed that the elasmobranch GH is considerably more similar to tetrapod GHs (e.g., 68% identity with sea turtle GH, 63% with chicken GH, and 58% with ovine GH) than teleostean GHs (e.g., 38% identities with salmon GH and 42% with bonito GH) except for eel GH (61% identity), and substantiates the earlier finding derived from the immunochemical and biological studies (Hayashida and Lewis, 1978) that the primitive fish are less diverged from the main line of vertebrate evolution leading to the tetrapod than are the modern bony fish.

  16. Complete cDNA and derived amino acid sequence of human factor V

    SciTech Connect

    Jenny, R.J.; Pittman, D.D.; Toole, J.J.; Kriz, R.W.; Aldape, R.A.; Hewick, R.M.; Kaufman, R.J.; Mann, K.G.

    1987-07-01

    cDNA clones encoding human factor V have been isolated from an oligo(dT)-primed human fetal liver cDNA library prepared with vector Charon 21A. The cDNA sequence of factor V from three overlapping clones includes a 6672-base-pair (bp) coding region, a 90-bp 5' untranslated region, and a 163-bp 3' untranslated region within which is a poly(A)tail. The deduced amino acid sequence consists of 2224 amino acids inclusive of a 28-amino acid leader peptide. Direct comparison with human factor VIII reveals considerable homology between proteins in amino acid sequence and domain structure: a triplicated A domain and duplicated C domain show approx. 40% identity with the corresponding domains in factor VIII. As in factor VIII, the A domains of factor V share approx. 40% amino acid-sequence homology with the three highly conserved domains in ceruloplasmin. The B domain of factor V contains 35 tandem and approx. 9 additional semiconserved repeats of nine amino acids of the form Asp-Leu-Ser-Gln-Thr-Thr/Asn-Leu-Ser-Pro and 2 additional semiconserved repeats of 17 amino acids. Factor V contains 37 potential N-linked glycosylation sites, 25 of which are in the B domain, and a total of 19 cysteine residues.

  17. Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC

    NASA Astrophysics Data System (ADS)

    Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.

    2000-02-01

    Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.

  18. A protective protein antigen of Rickettsia rickettsii has tandemly repeated, near-identical sequences.

    PubMed Central

    Anderson, B E; McDonald, G A; Jones, D C; Regnery, R L

    1990-01-01

    The nucleotide sequence of a Rickettsia rickettsii gene that encodes a high-molecular-mass surface antigen (190 kilodaltons), which elicits protective immunity, was determined. The 6,747-nucleotide gene coded for a 2,249-amino-acid protein with a calculated molecular weight of 224,321. A 3.8-kilobase PstI fragment proximal to the 5' end of the gene was found to consist of 13 highly related tandem repeats which constituted over 40% of the coding region. The repeated sequences could be divided into either a 225-nucleotide, 75-amino-acid unit (type I) or a 216-nucleotide, 72-amino-acid unit (type II), with extensive homology between the two types of repeating units. The deduced amino acid sequence for these repeat units, overall, was slightly hydrophobic with short hydrophilic domains. The carboxy-terminal (nonrepetitive) portion of the deduced protein sequence was hydrophilic, with potential surface-exposed epitopes. The full-length reading frame was reconstructed in Escherichia coli, and transient expression of the 190-kilodalton antigen was demonstrated; however, the protein appeared to be severely degraded by proteases and was apparently toxic to E. coli. The conservation of this unique repetitive gene structure, coupled with results from previous reports showing the protective properties of the 190-kilodalton antigen, suggests that this protein plays an important role in the pathogenesis of and immunity to Rocky Mountain spotted fever. Images PMID:2117568

  19. Amino-Acid Sequence of NADP-Specific Glutamate Dehydrogenase of Neurospora crassa

    PubMed Central

    Wootton, John C.; Chambers, Geoffrey K.; Holder, Anthony A.; Baron, Andrew J.; Taylor, John G.; Fincham, John R. S.; Blumenthal, Kenneth M.; Moon, Kenneth; Smith, Emil L.

    1974-01-01

    A tentative primary structure of the NADP-specific glutamate dehydrogenase [L-glutamate: NADP oxidoreductase (deaminating), EC 1.4.1.4] from Neurospora crassa has been determined. The proposed sequence contains 452 amino-acid residues in each of the identical subunits of the hexameric enzyme. Comparison of the sequence with that of the bovine liver enzyme reveals considerable homology in the amino-terminal portion of the chain, including the vicinity of the reactive lysine, with only shorter stretches of homology within the carboxyl-terminal regions. The significance of this distribution of homologous regions is discussed. PMID:4155068

  20. Bovine testis acylphosphatase: purification and amino acid sequence.

    PubMed

    Pazzagli, L; Cappugi, G; Camici, G; Manao, G; Ramponi, G

    1993-10-01

    Two acylphosphatase molecular forms have been isolated from bovine testis. Their amino acid sequence was determined. One (ACY1) consists of 98 amino acid residues, while the other one (ACY2) consists of 100 amino acid residues. Both molecular forms are N-acetylated and differ only in the amino terminus. ACY2 has an additional Ser-Met tail with respect to ACY1. Both ACY1 and ACY2 are organ-common type isoenzymes and thus differ for about half of the amino acid positions from the previously sequenced bovine muscle isoenzyme.

  1. The amino acid sequence of Lady Amherst's pheasant (Chrysolophus amherstiae) and golden pheasant (Chrysolophus pictus) egg-white lysozymes.

    PubMed

    Araki, T; Kuramoto, M; Torikata, T

    1990-09-01

    The amino acids of Lady Amherst's pheasant and golden pheasant egg-white lysozymes have been sequenced. The carboxymethylated lysozymes were digested with trypsin followed by sequencing of the tryptic peptides. Lady Amherst's pheasant lysozyme proved to consist of 129 amino acid residues, and a relative molecular mass of 14,423 Da was calculated. This lysozyme had 6 amino acids substitutions when compared with hen egg-white lysozyme: Phe3 to Tyr, His15 to Leu, Gln41 to His, Asn77 to His, Gln 121 to Asn, and a newly found substitution of Ile124 to Thr. The amino acid sequence of golden pheasant lysozyme was identical to that of Lady Amherst's phesant lysozyme. The phylogenetic tree constructured by the comparison of amino acid sequences of phasianoid birds lysozymes revealed a minimum genetic distance between these pheasants and the turkey-peafowl group.

  2. Multiple site-selective insertions of non-canonical amino acids into sequence-repetitive polypeptides

    PubMed Central

    Wu, I-Lin; Patterson, Melissa A.; Carpenter Desai, Holly E.; Mehl, Ryan A.; Giorgi, Gianluca

    2013-01-01

    A simple and efficient method is described for introduction of non-canonical amino acids at multiple, structurally defined sites within recombinant polypeptide sequences. E. coli MRA30, a bacterial host strain with attenuated activity for release factor 1 (RF1), is assessed for its ability to support the incorporation of a diverse range of non-canonical amino acids in response to multiple encoded amber (TAG) codons within genetic templates derived from superfolder GFP and an elastin-mimetic protein polymer. Suppression efficiency and isolated protein yield were observed to depend on the identity of the orthogonal aminoacyl-tRNA synthetase/tRNACUA pair and the non-canonical amino acid substrate. This approach afforded elastin-mimetic protein polymers containing non-canonical amino acid derivatives at up to twenty-two positions within the repeat sequence with high levels of substitution. The identity and position of the variant residues was confirmed by mass spectrometric analysis of the full-length polypeptides and proteolytic cleavage fragments resulting from thermolysin digestion. The accumulated data suggest that this multi-site suppression approach permits the preparation of protein-based materials in which novel chemical functionality can be introduced at precisely defined positions within the polypeptide sequence. PMID:23625817

  3. SUBGROUPS OF AMINO ACID SEQUENCES IN THE VARIABLE REGIONS OF IMMUNOGLOBULIN HEAVY CHAINS*

    PubMed Central

    Cunningham, Bruce A.; Pflumm, Mollie N.; User, Urs Rutisha; Edelman, Gerald M.

    1969-01-01

    The amino acid sequence of the first 133 residues of the heavy (γ) chain from a human γG immunoglobulin (He) has been determined. This γ-chain is identical in Gm type to that of protein Eu, the complete sequence of which has been reported. Comparison of the two sequences substantiates the previous suggestion that there are subgroups of variable regions of heavy chains. The variable region of Eu has been assigned to subgroup I and that of He to subgroup II; on the other hand, the constant regions of the two proteins appear to be identical. Comparison of the sequence of the heavy chain of He with the heavy chain sequences determined in other laboratories suggests that the variable region of subgroup II is at least 118 residues long. The nature and distribution of amino acid variations in this heavy chain subgroup resemble those observed in light chain subgroups. These studies provide evidence that the translocation hypothesis applies to heavy as well as to light chains, viz., genes for variable regions (V) are somatically translocated to genes for constant regions (C) to form complete VC structural genes. Images PMID:5264153

  4. Subdomain interactions foster the design of two protein pairs with ∼80% sequence identity but different folds.

    PubMed

    Porter, Lauren L; He, Yanan; Chen, Yihong; Orban, John; Bryan, Philip N

    2015-01-06

    Metamorphic proteins, including proteins with high levels of sequence identity but different folds, are exceptions to the long-standing rule-of-thumb that proteins with as little as 30% sequence identity adopt the same fold. Which topologies can be bridged by these highly identical sequences remains an open question. Here we bridge two 3-α-helix bundle proteins with two radically different folds. Using a straightforward approach, we engineered the sequences of one subdomain within maltose binding protein (MBP, α/β/α-sandwich) and another within outer surface protein A (OspA, β-sheet) to have high sequence identity (80 and 77%, respectively) with engineered variants of protein G (GA, 3-α-helix bundle). Circular dichroism and nuclear magnetic resonance spectra of all engineered variants demonstrate that they maintain their native conformations despite substantial sequence modification. Furthermore, the MBP variant (80% identical to GA) remained active. Thermodynamic analysis of numerous GA and MBP variants suggests that the key to our approach involved stabilizing the modified MBP and OspA subdomains via external interactions with neighboring substructures, indicating that subdomain interactions can stabilize alternative folds over a broad range of sequence variation. These findings suggest that it is possible to bridge one fold with many other topologies, which has implications for protein folding, evolution, and misfolding diseases.

  5. A SNP panel for identity and kinship testing using massive parallel sequencing.

    PubMed

    Grandell, Ida; Samara, Raed; Tillmar, Andreas O

    2016-07-01

    Within forensic genetics, there is still a need for supplementary DNA marker typing in order to increase the power to solve cases for both identity testing and complex kinship issues. One major disadvantage with current capillary electrophoresis (CE) methods is the limitation in DNA marker multiplex capability. By utilizing massive parallel sequencing (MPS) technology, this capability can, however, be increased. We have designed a customized GeneRead DNASeq SNP panel (Qiagen) of 140 previously published autosomal forensically relevant identity SNPs for analysis using MPS. One single amplification step was followed by library preparation using the GeneRead Library Prep workflow (Qiagen). The sequencing was performed on a MiSeq System (Illumina), and the bioinformatic analyses were done using the software Biomedical Genomics Workbench (CLC Bio, Qiagen). Forty-nine individuals from a Swedish population were genotyped in order to establish genotype frequencies and to evaluate the performance of the assay. The analyses showed to have a balanced coverage among the included loci, and the heterozygous balance showed to have less than 0.5 % outliers. Analyses of dilution series of the 2800M Control DNA gave reproducible results down to 0.2 ng DNA input. In addition, typing of FTA samples and bone samples was performed with promising results. Further studies and optimizations are, however, required for a more detailed evaluation of the performance of degraded and PCR-inhibited forensic samples. In summary, the assay offers a straightforward sample-to-genotype workflow and could be useful to gain information in forensic casework, for both identity testing and in order to solve complex kinship issues.

  6. Amino Acid Sequence of Human Cholinesterase

    DTIC Science & Technology

    1985-10-01

    liquid chromatography (HPLC). Activity testing of the aged, DFP-labeled cholinesterase showed that 99.8% of the active sites had been labeled, since...acids were quantitated by ninhydrin at the AAA Labs, or by derivatization with phenylisothiocyanate at the University of Michigan. The latter method

  7. Nucleotide sequence of the luxC gene encoding fatty acid reductase of the lux operon from Photobacterium leiognathi.

    PubMed

    Lin, J W; Chao, Y F; Weng, S F

    1993-02-26

    The nucleotide sequence of the luxC gene (EMBL Accession No. 65156) encoding fatty acid reductase (FAR) of the lux operon from Photobacterium leiognathi PL741 was determined and the encoded amino acid sequence deduced. The fatty acid reductase is a component of the fatty acid reductase complex. The complex is responsible for converting fatty acid to aldehyde which serves as the substrate in the luciferase-catalyzed bioluminescent reaction. The protein comprises 478 amino acid residues and has a calculated M(r) of 53,858. Alignment and comparison of the fatty acid reductase of P. leiognathi with that of Vibrio harveyi B392 and Vibrio fischeri ATCC 7744 shows that there is 70% and 59% amino acid residues identity, respectively.

  8. The complete amino acid sequence of chicken skeletal-muscle enolase.

    PubMed Central

    Russell, G A; Dunbar, B; Fothergill-Gilmore, L A

    1986-01-01

    The complete amino acid sequence of chicken skeletal-muscle enolase, comprising 433 residues, was determined. The sequence was deduced by automated sequencing of hydroxylamine-cleavage, CNBr-cleavage, o-iodosobenzoic acid-cleavage, clostripain-digest and staphylococcal-proteinase-digest fragments. The presence of several acid-labile peptide bonds and the tenacious aggregation of most CNBr-cleavage fragments meant that a commonly used sequencing strategy involving initial CNBr cleavage was unproductive. Cleavage at the single Asn-Gly peptide bond with hydroxylamine proved to be particularly useful. Comparison of the sequence of chicken enolase with the two yeast enolase isoenzyme sequences shows that the enzyme is strongly conserved, with 60% of the residues identical. The histidine and arginine residues implicated as being important for the activity of yeast enolase are conserved in the chicken enzyme. Secondary-structure predictions are analysed in an accompanying paper [Sawyer, Fothergill-Gilmore & Russell (1986) Biochem. J. 236, 127-130]. PMID:3539098

  9. Cystatin. Amino acid sequence and possible secondary structure.

    PubMed Central

    Schwabe, C; Anastasi, A; Crow, H; McDonald, J K; Barrett, A J

    1984-01-01

    The amino acid sequence of cystatin, the protein from chicken egg-white that is a tight-binding inhibitor of many cysteine proteinases, is reported. Cystatin is composed of 116 amino acid residues, and the Mr is calculated to be 13 143. No striking similarity to any other known sequence has been detected. The results of computer analysis of the sequence and c.d. spectrometry indicate that the secondary structure includes relatively little alpha-helix (about 20%) and that the remainder is mainly beta-structure. PMID:6712597

  10. Mouse Vk gene classification by nucleic acid sequence similarity.

    PubMed

    Strohal, R; Helmberg, A; Kroemer, G; Kofler, R

    1989-01-01

    Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.

  11. Amino acid sequence of toxin III from Anemonia sulcata.

    PubMed

    Bĕress, L; Wunderer, G; Wachter, E

    1977-08-01

    Toxin III, the smallest toxin component of the poison of the sea anemone Anemonia sulcata, is a polypeptide with 27 amino acids. Its structure is stabilized by three disulfide bridges. The amino acid sequence was determined by solid-phase Edman degradation of the aminoethylated derivative. The peptide was coupled to the carrier, porous glass, by thiourea bridges between the alpha-amino group of arginine-1 and the epsilon-amino group of lysine-26 and the isothiocyanate groups of the carrier. Another fraction of the polypeptide was bound by an acid-amide condensation of the C-terminal valine-27 with the aminopropyl group of the carrier. The sequence of toxin III has no regions homologous to the 47-residue toxin II. Comparison with the known partial sequence of toxin I, which contains 46 amino acids (Wunderer, G. & Eulitz, M., in preparation) also fails to reveal homologies.

  12. Influence of sequence identity and unique breakpoints on the frequency of intersubtype HIV-1 recombination

    PubMed Central

    Baird, Heather A; Gao, Yong; Galetto, Román; Lalonde, Matthew; Anthony, Reshma M; Giacomoni, Véronique; Abreha, Measho; Destefano, Jeffrey J; Negroni, Matteo; Arts, Eric J

    2006-01-01

    Background HIV-1 recombination between different subtypes has a major impact on the global epidemic. The generation of these intersubtype recombinants follows a defined set of events starting with dual infection of a host cell, heterodiploid virus production, strand transfers during reverse transcription, and then selection. In this study, recombination frequencies were measured in the C1-C4 regions of the envelope gene in the presence (using a multiple cycle infection system) and absence (in vitro reverse transcription and single cycle infection systems) of selection for replication-competent virus. Ugandan subtypes A and D HIV-1 env sequences (115-A, 120-A, 89-D, 122-D, 126-D) were employed in all three assay systems. These subtypes co-circulate in East Africa and frequently recombine in this human population. Results Increased sequence identity between viruses or RNA templates resulted in increased recombination frequencies, with the exception of the 115-A virus or RNA template. Analyses of the recombination breakpoints and mechanistic studies revealed that the presence of a recombination hotspot in the C3/V4 env region, unique to 115-A as donor RNA, could account for the higher recombination frequencies with the 115-A virus/template. Single-cycle infections supported proportionally less recombination than the in vitro reverse transcription assay but both systems still had significantly higher recombination frequencies than observed in the multiple-cycle virus replication system. In the multiple cycle assay, increased replicative fitness of one HIV-1 over the other in a dual infection dramatically decreased recombination frequencies. Conclusion Sequence variation at specific sites between HIV-1 isolates can introduce unique recombination hotspots, which increase recombination frequencies and skew the general observation that decreased HIV-1 sequence identity reduces recombination rates. These findings also suggest that the majority of intra- or intersubtype A

  13. The amino acid sequence of cytochromes c-551 from three species of Pseudomonas

    PubMed Central

    Ambler, R. P.; Wynn, Margaret

    1973-01-01

    The amino acid sequences of the cytochromes c-551 from three species of Pseudomonas have been determined. Each resembles the protein from Pseudomonas strain P6009 (now known to be Pseudomonas aeruginosa, not Pseudomonas fluorescens) in containing 82 amino acids in a single peptide chain, with a haem group covalently attached to cysteine residues 12 and 15. In all four sequences 43 residues are identical. Although by bacteriological criteria the organisms are closely related, the differences between pairs of sequences range from 22% to 39%. These values should be compared with the differences in the sequence of mitochondrial cytochrome c between mammals and amphibians (about 18%) or between mammals and insects (about 33%). Detailed evidence for the amino acid sequences of the proteins has been deposited as Supplementary Publication SUP 50015 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1973), 131, 5. PMID:4352718

  14. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences

    PubMed Central

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D.; Adir, Noam

    2016-01-01

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel. PMID:27307442

  15. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

    PubMed

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D; Adir, Noam

    2016-06-28

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel.

  16. Amino acid sequences of proteins from Leptospira serovar pomona.

    PubMed

    Alves, S F; Lefebvre, R B; Probert, W

    2000-01-01

    This report describes a partial amino acid sequences from three putative outer envelope proteins from Leptospira serovar pomona. In order to obtain internal fragments for protein sequencing, enzymatic and chemical digestion was performed. The enzyme clostripain was used to digest the proteins 32 and 45 kDa. In situ digestion of 40 kDa molecular weight protein was accomplished using cyanogen bromide. The 32 kDa protein generated two fragments, one of 21 kDa and another of 10 kDa that yielded five residues. A fragment of 24 kDa that yielded nineteen residues of amino acids was obtained from 45 kDa protein. A fragment with a molecular weight of 20 kDa, yielding a twenty amino acids sequence from the 40 kDa protein.

  17. Multiple recombination events maintain sequence identity among members of the nitrogenase multigene family in Rhizobium etli.

    PubMed Central

    Rodríguez, C; Romero, D

    1998-01-01

    A distinctive characteristic of the Rhizobium genome is the frequent finding of reiterated sequences, which often constitute multigene families. Interestingly, these families usually maintain a high degree of nucleotide sequence identity. It is commonly assumed that apparent gene conversion between reiterated elements might lead to concerted variation among members of a multigene family. However, the operation of this mechanism has not yet been demonstrated in the Rhizobiaceae. In this work, we employed different genetic constructions to address the role of apparent gene conversion as a homogenizing mechanism between members of the plasmid-located nitrogenase multigene family in Rhizobium etli. Our results show that a 28-bp insertion into one of the nitrogenase reiterations can be corrected by multiple recombination events, including apparent gene conversion. The correction process was dependent on the presence of both a wild-type recA gene and wild-type copies of the nitrogenase reiterations. Frequencies of apparent gene conversion to the wild-type nitrogenase reiterations were the same when the insertion to be corrected was located either in cis or in trans, indicating that this event frequently occurs through intermolecular interactions. Interestingly, a high frequency of multiple crossovers was observed, suggesting that these large plasmid molecules are engaging repeatedly in recombination events, in a situation akin to phage recombination or recombination among small, high-copy number plasmids. PMID:9611191

  18. Distinctive properties of identical twins' TCR repertoires revealed by high-throughput sequencing.

    PubMed

    Zvyagin, Ivan V; Pogorelyy, Mikhail V; Ivanova, Marina E; Komech, Ekaterina A; Shugay, Mikhail; Bolotin, Dmitry A; Shelenkov, Andrey A; Kurnosov, Alexey A; Staroverov, Dmitriy B; Chudakov, Dmitriy M; Lebedev, Yuri B; Mamedov, Ilgar Z

    2014-04-22

    Adaptive immunity in humans is provided by hypervariable Ig-like molecules on the surface of B and T cells. The final set of these molecules in each organism is formed under the influence of two forces: individual genetic traits and the environment, which includes the diverse spectra of alien and self-antigens. Here we assess the impact of individual genetic factors on the formation of the adaptive immunity by analyzing the T-cell receptor (TCR) repertoires of three pairs of monozygous twins by next-generation sequencing. Surprisingly, we found that an overlap between the TCR repertoires of monozygous twins is similar to an overlap between the TCR repertoires of nonrelated individuals. However, the number of identical complementary determining region 3 sequences in two individuals is significantly increased for twin pairs in the fraction of highly abundant TCR molecules, which is enriched by the antigen-experienced T cells. We found that the initial recruitment of particular TCR V genes for recombination and subsequent selection in the thymus is strictly determined by individual genetic factors. J genes of TCRs are selected randomly for recombination; however, the subsequent selection in the thymus gives preference to some α but not β J segments. These findings provide a deeper insight into the mechanism of TCR repertoire generation.

  19. Extensive amino acid sequence homologies between animal lectins

    SciTech Connect

    Paroutaud, P.; Levi, G.; Teichberg, V.I.; Strosberg, A.D.

    1987-09-01

    The authors have established the amino acid sequence of the ..beta..-D-galactoside binding lectin from the electric eel and the sequences of several peptides from a similar lectin isolated from human placenta. These sequences were compared with the published sequences of peptides derived from the ..beta..-D-galactoside binding lectin from human lung and with sequences deduced from cDNAs assigned to the ..beta..-D-galactoside binding lectins from chicken embryo skin and human hepatomas. Significant homologies were observed. One of the highly conserved regions that contains a tryptophan residue and two glutamic acid resides is probably part of the ..beta..-D-galactoside binding site, which, on the basis of spectroscopic studies of the electric eel lectin, is expected to contain such residues. The similarity of the hydropathy profiles and the predicted secondary structure of the lectins from chicken skin and electric eel, in spite of differences in their amino acid sequences, strongly suggests that these proteins have maintained structural homologies during evolution and together with the other ..beta..-D-galactoside binding lectins were derived form a common ancestor gene.

  20. Amino acid sequence similarity between rabies virus glycoprotein and snake venom curaremimetic neurotoxins.

    PubMed

    Lentz, T L; Wilson, P T; Hawrot, E; Speicher, D W

    1984-11-16

    Evidence was presented earlier that a host-cell receptor for the highly neurotropic rabies virus might be the acetylcholine receptor. The amino acid sequence of the glycoprotein of rabies virus was compared by computer analysis with that of snake venom curaremimetic neurotoxins, potent ligands of the acetylcholine receptor. A statistically significant sequence relation was found between a segment of the rabies glycoprotein and the entire sequence of long neurotoxins. The greatest identity occurs with residues considered most important in neurotoxicity, including those interacting with the acetylcholine binding site of the acetylcholine receptor. Because of the similarity between the glycoprotein and the receptor-binding region of the neurotoxins, this region of the viral glycoprotein may function as a recognition site for the acetylcholine receptor. Direct binding of the rabies virus glycoprotein to the acetylcholine receptor could contribute to the neurotropism of this virus.

  1. Partial amino acid sequence of human pancreatic stone protein, a novel pancreatic secretory protein.

    PubMed Central

    Montalto, G; Bonicel, J; Multigner, L; Rovery, M; Sarles, H; De Caro, A

    1986-01-01

    Pancreatic stone protein (PSP) is the major organic component of human pancreatic stones. With the use of monoclonal antibody immunoadsorbents, five immunoreactive forms (PSP-S) with close Mr values (14,000-19,000) were isolated from normal pancreatic juice. By CM-Trisacryl M chromatography the lowest-Mr form (PSP-S1) was separated from the others and some of its molecular characteristics were investigated. The Mr of the PSP-S1 polypeptide chain calculated from the amino acid composition was about 16,100. The N-terminal sequences (40 residues) of PSP and PSP-S1 are identical, which suggests that the peptide backbone is the same for both of these polypeptides. The PSP-S1 sequence was determined up to residue 65 and was found to be different from all other known protein sequences. Images Fig. 1. PMID:3541906

  2. New Functional Identity for the DNA Uptake Sequence in Transformation and Its Presence in Transcriptional Terminators▿

    PubMed Central

    Ambur, O. Herman; Frye, Stephan A.; Tønjum, Tone

    2007-01-01

    The frequently occurring DNA uptake sequence (DUS), recognized as a 10-bp repeat, is required for efficient genetic transformation in the human pathogens Neisseria meningitidis and Neisseria gonorrhoeae. Genome scanning for DUS occurrences in three different species of Neisseria demonstrated that 76% of the nearly 2,000 neisserial DUS were found to have two semiconserved base pairs extending from the 5′ end of DUS to constitute a 12-mer repeat. Plasmids containing sequential variants of the neisserial DUS were tested for their ability to transform N. meningitidis and N. gonorrhoeae, and the 12-mer was found to outperform the 10-mer DUS in transformation efficiency. Assessment of meningococcal uptake of DNA confirmed the enhanced performance of the 12-mer compared to the 10-mer DUS. An inverted repeat DUS was not more efficient in transformation than DNA species containing a single or direct repeat DUS. Genome-wide analysis revealed that half of the nearly 1,500 12-mer DUS are arranged as inverted repeats predicted to be involved in rho-independent transcriptional termination or attenuation. The distribution of the uptake signal sequence required for transformation in the Pasteurellaceae was also biased towards transcriptional terminators, although to a lesser extent. In addition to assessing the intergenic location of DUS, we propose that the 10-mer identity of DUS should be extended and recognized as a 12-mer DUS. The dual role of DUS in transformation and as a structural component on RNA affecting transcription makes this a relevant model system for assessing significant roles of repeat sequences in biology. PMID:17194793

  3. Complete nucleotide and derived amino acid sequence of cDNA encoding the mitochondrial uncoupling protein of rat brown adipose tissue: lack of a mitochondrial targeting presequence.

    PubMed Central

    Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B

    1986-01-01

    A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. Images PMID:3012461

  4. The NH2-terminal amino acid sequence of the immunochemically partial identical major allergens of Alder (Alnus glutinosa) Aln g I, birch (Betula verrucosa) Bet v I, hornbeam (Carpinus betulus) Car b I and oak (Quercus alba) Que a I pollens.

    PubMed

    Ipsen, H; Hansen, O C

    1991-11-01

    The tree pollen major allergens (Aln g I), Bet v I, Car b I and Que a I, were purified to near homogeneity. Rabbit antibodies raised towards the major allergen Bet v I react with the major allergens of alder, hazel, hornbeam and oak, although with distinct variation in affinity. Immunochemically the major allergens seem to resemble one another more within than between the botanically established families of the fagale order. The physico-chemical parameters (molecular weight, pI values, amino acid composition and NH2-terminal amino acid sequence) of the major allergens exhibit a higher degree of consistency than do the immunochemical parameters, indicating that nuances in the allergens' secondary and tertiary structures are likely to govern the immunochemical differences.

  5. Amino acid sequence of band-3 protein from rainbow trout erythrocytes derived from cDNA.

    PubMed Central

    Hübner, S; Michel, F; Rudloff, V; Appelhans, H

    1992-01-01

    In this report we present the first complete band-3 cDNA sequence of a poikilothermic lower vertebrate. The primary structure of the anion-exchange protein band 3 (AE1) from rainbow trout erythrocytes was determined by nucleotide sequencing of cDNA clones. The overlapping clones have a total length of 3827 bp with a 5'-terminal untranslated region of 150 bp, a 2754 bp open reading frame and a 3'-untranslated region of 924 bp. Band-3 protein from trout erythrocytes consists of 918 amino acid residues with a calculated molecular mass of 101 827 Da. Comparison of its amino acid sequence revealed a 60-65% identity within the transmembrane spanning sequence of band-3 proteins published so far. An additional insertion of 24 amino acid residues within the membrane-associated domain of trout band-3 protein was identified, which until now was thought to be a general feature only of mammalian band-3-related proteins. PMID:1637296

  6. A molecular footprint of limb loss: sequence variation of the autopodial identity gene Hoxa-13.

    PubMed

    Kohlsdorf, Tiana; Cummings, Michael P; Lynch, Vincent J; Stopper, Geffrey F; Takahashi, Kazuhiko; Wagner, Günter P

    2008-12-01

    The homeobox gene Hoxa-13 codes for a transcription factor involved in multiple functions, including body axis and hand/foot development in tetrapods. In this study we investigate whether the loss of one function (e.g., limb loss in snakes) left a molecular footprint in exon 1 of Hoxa-13 that could be associated with the release of functional constraints caused by limb loss. Fragments of the Hoxa-13 exon 1 were sequenced from 13 species and analyzed, with additional published sequences of the same region, using relative rates and likelihood-ratio tests. Five amino acid sites in exon 1 of Hoxa-13 were detected as evolving under positive selection in the stem lineage of snakes. To further investigate whether there is an association between limb loss and sequence variation in Hoxa-13, we used the random forest method on an alignment that included shark, basal fish lineages, and "eu-tetrapods" such as mammals, turtle, alligator, and birds. The random forest method approaches the problem as one of classification, where we seek to predict the presence or absence of autopodium based on amino acid variation in Hoxa-13 sequences. Different alignments tested were associated with similar error rates (18.42%). The random forest method suggested that phenotypic states (autopodium present and absent) can often be correctly predicted based on Hoxa-13 sequences. Basal, nontetrapod gnat-hostomes that never had an autopodium were consistently classified as limbless together with the snakes, while eu-tetrapods without any history of limb loss in their phylogeny were also consistently classified as having a limb. Misclassifications affected mostly lizards, which, as a group, have a history of limb loss and limb re-evolution, and the urodele and caecilian in our sample. We conclude that a molecular footprint can be detected in Hoxa-13 that is associated with the lack of an autopodium; groups with classification ambiguity (lizards) are characterized by a history of repeated limb loss

  7. Amino acid sequence of a trypsin inhibitor from a Spirometra (Spirometra erinaceieuropaei).

    PubMed

    Sanda, A; Uchida, A; Itagaki, T; Kobayashi, H; Inokuchi, N; Koyama, T; Iwama, M; Ohgi, K; Irie, M

    2001-12-01

    A trypsin inhibitor that is highly homologous with bovine pancreatic trypsin inhibitor (BPTI) was co-purified along with RNase from Spirometra (Spirometra erinaceieuropaei). The amino acid sequence of this inhibitor (SETI) and the nucleotide sequence of the cDNA encoding this protein were determined by protein chemistry and gene technology. SETI contains 68 amino acid residues and has a molecular mass of 7,798 Da. SETI has 31 amino acid residues that are identical with BPTI's sequence, including 6 half-cystine and 5 aromatic amino acid residues. The active site Lys residue in BPTI is replaced by an Arg residue in SETI. SETI is an effective inhibitor of trypsin and moderately inhibits a-chymotrypsin, but less inhibits elastase or subtilisin. SETI was expressed by E. coli containing a PelB vector carrying the SETI encoding cDNA; an expression yield of 0.68 mg/l was obtained. The phylogenetic relationship of SETI and the other BPTI-like trypsin inhibitors was analyzed using most likelihood inference methods.

  8. Deduced amino acid sequence of human pulmonary surfactant proteolipid: SPL(pVal)

    SciTech Connect

    Whitsett, J.A.; Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.; Clark, J.; Pilot-Matias, T.; Meuth, J.; Fox, J.L.

    1987-05-01

    Hydrophobic, proteolipid-like protein of Mr 6500 was isolated from ether/ethanol extracts of human, canine and bovine pulmonary surfactant. Amino acid composition of the protein demonstrated a remarkable abundance of hydrophobic residues, particularly valine and leucine. The N-terminal amino acid sequence of the human protein was determined: N-Leu-Ile-Pro-Cys-Cys-Pro-Val-Asn-Leu-Lys-Arg-Leu-Leu-Ile-Val4... An oligonucleotide probe was used to screen an adult human lung cDNA library and resulted in detection of cDNA clones with predicted amino acid sequence with close identity to the N-terminal amino acid sequence of the human peptide. SPL(pVal) was found within the reading frame of a larger peptide. SPL(pVal) results from proteolytic processing of a larger preprotein. Northern blot analysis detected in a single 1.0 kilobase SPL(pVal) RNA which was less abundant in fetal than in adult lung. Mixtures of purified canine and bovine SPL(pVal) and synthetic phospholipids display properties of rapid adsorption and surface tension lowering activity characteristic of surfactant. Human SPL(pVal) is a pulmonary surfactant proteolipid which may therefore be useful in combination with phospholipids and/or other surfactant proteins for the treatment of surfactant deficiency such as hyaline membrane disease in newborn infants.

  9. Romidepsin-induced HIV-1 viremia during effective antiretroviral therapy contains identical viral sequences with few deleterious mutations

    PubMed Central

    Winckelmann, Anni; Barton, Kirston; Hiener, Bonnie; Schlub, Timothy E.; Shao, Wei; Rasmussen, Thomas A.; Østergaard, Lars; Søgaard, Ole S.; Tolstrup, Martin; Palmer, Sarah

    2017-01-01

    Objective: To investigate the origin of the HIV-1 viremia induced by the latency-reversing agent romidepsin. Design: Six individuals on suppressive antiretroviral therapy received romidepsin administered intravenously once weekly for 3 consecutive weeks. CD4+ T cells were obtained at baseline, following the second and third romidepsin infusion, and 10 weeks after the final romidepsin treatment. Plasma samples were collected 24 and 72 h after romidepsin infusions. Methods: Single-genome sequencing of the env and p24-RT region was used to genetically characterize the virus from proviral DNA, the transcribed cell-associated RNA and the plasma RNA pool. Results: In three of six participants with available plasma samples we identified plasma HIV-1 RNA sequences that were identical to DNA and/or cell-associated RNA sequences from peripheral blood CD4+ T cells. In two participants, plasma RNA sequences contained expansions of identical sequences, corresponding to 62 and 100% of the total sequences, respectively. Plasma HIV-1 RNA had very low amounts of defective viruses compared to cell-associated RNA (odds ratio 20.85, P < 0.001) and to DNA (odds ratio 7.07, P = 0.011) during romidepsin therapy. Conclusions: Romidepsin induced transcription from proviruses in peripheral blood cells, which contributed to viremia in patients on suppressive therapy. The intermingling of these cell-associated HIV-1 RNA with DNA sequences indicates transcription from a diverse range of proviruses, but the expansions of identical viral plasma sequences with few defects indicate that the romidepsin-induced viremia arises from intact proviruses with highly similar or identical genetic backgrounds. PMID:28272134

  10. Amino acid sequences and structures of chicken and turkey beta 2-microglobulin.

    PubMed

    Welinder, K G; Jespersen, H M; Walther-Rasmussen, J; Skjødt, K

    1991-01-01

    The complete amino acid sequences of chicken and turkey beta 2-microglobulins have been determined by analyses of tryptic, V8-proteolytic and cyanogen bromide fragments, and by N-terminal sequencing. Mass spectrometric analysis of chicken beta 2-microglobulin supports the sequence-derived Mr of 11,048. The higher apparent Mr obtained for the avian beta 2-microglobulins as compared to human beta 2-microglobulin by SDS-PAGE is not understood. Chicken and turkey beta 2-microglobulin consist of 98 residues and deviate at seven positions: 60, 66, 74-76, 78 and 82. The chicken and turkey sequences are identical to human beta 2-microglobulin at 46 and 47 positions, respectively, and to bovine beta 2-microglobulin at 47 positions, i.e. there is about 47% identity between avian and mammalian beta 2-microglobulins. The known X-ray crystallographic structures of bovine beta 2-microglobulin and human HLA-A2 complex suggest that the seven chicken to turkey differences are exposed to solvent in the avian MHC class I complex. The key residues of beta 2-microglobulin involved in alpha chain contacts within the MHC class I molecule are highly conserved between chicken and man. This explains that heterologous human beta 2-microglobulin can substitute the chicken beta 2-microglobulin in exchange studies with B-F (chicken MHC class I molecule), and suggests that the MHC class I structure is conserved over long evolutionary distances.

  11. Molecular cloning and amino acid sequence of human plakoglobin, the common junctional plaque protein

    SciTech Connect

    Franke, W.W.; Goldschmidt, M.D.; Zimbelmann, R.; Mueller, H.M.; Schiller, D.L.; Cowin, P. )

    1989-06-01

    Plakoglobin is a major cytoplasmic protein that occurs in a soluble and a membrane-associated form and is the only known constituent common to the submembranous plaques of both kinds of adhering junctions, the desmosomes and the intermediate junctions. Using a partial cDNA clone for bovine plakoglobin, the authors isolated cDNAs encoding human plakoglobin, determined its nucleotide sequence, and deduced the complete amino acid sequence. The polypeptide encoded by the cDNA was synthesized by in vitro transcription and translation and identified by its comigration with authentic plakoglobin in two-dimensional gel electrophoresis. The identity was further confirmed by comparison of the deduced sequence with the directly determined amino acid sequence of two fragments from bovine plakoglobin. Analysis of the plakoglobin sequence showed the protein to be unrelated to any other known proteins, highly conserved between human and bovine tissues, and characterized by numerous changes between hydrophilic and hydrophobic sections. Only one kind of plakoglobin mRNA was found in most tissues, but an additional mRNA was detected in certain human tumor cell lines. This longer mRNA may be represented by a second type of plakoglobin cDNA, which contains an insertion of 297 nucleotides in the 3{prime} noncoding region.

  12. Active site amino acid sequence of human factor D.

    PubMed

    Davis, A E

    1980-08-01

    Factor D was isolated from human plasma by chromatography on CM-Sephadex C50, Sephadex G-75, and hydroxylapatite. Digestion of reduced, S-carboxymethylated factor D with cyanogen bromide resulted in three peptides which were isolated by chromatography on Sephadex G-75 (superfine) equilibrated in 20% formic acid. NH2-Terminal sequences were determined by automated Edman degradation with a Beckman 890C sequencer using a 0.1 M Quadrol program. The smallest peptide (CNBr III) consisted of the NH2-terminal 14 amino acids. The other two peptides had molecular weights of 17,000 (CNBr I) and 7000 (CNBr II). Overlap of the NH2-terminal sequence of factor D with the NH2-terminal sequence of CNBr I established the order of the peptides. The NH2-terminal 53 residues of factor D are somewhat more homologous with the group-specific protease of rat intestine than with other serine proteases. The NH2-terminal sequence of CNBr II revealed the active site serine of factor D. The typical serine protease active site sequence (Gly-Asp-Ser-Gly-Gly-Pro was found at residues 12-17. The region surrounding the active site serine does not appear to be more highly homologous with any one of the other serine proteases. The structural data obtained point out the similarities between factor D and the other proteases. However, complete definition of the degree of relationship between factor D and other proteases will require determination of the remainder of the primary structure.

  13. Sequence analysis proves clonal identity in five patients with typical and blastoid mantle cell lymphoma.

    PubMed

    Yin, C Cameron; Medeiros, L Jeffrey; Cromwell, Candy C; Mehta, Ashwini P; Lin, Pei; Luthra, Rajyalakshmi; Abruzzo, Lynne V

    2007-01-01

    Mantle cell lymphoma (MCL) is typically composed of small irregular lymphoid cells. Blastoid variants, composed of lymphoblast-like (classic type) or large (pleomorphic type) cells, arise de novo or in patients with typical MCL. Although it has been assumed that blastoid variant represents histologic transformation of typical MCL, the clonal relationship between the two tumors has rarely been assessed at the molecular level. We identified five patients with typical MCL who subsequently developed the blastoid variant. There were two men and three women with a median age of 65 years (range, 34-70) at diagnosis of typical MCL involving lymph nodes. The median interval between typical and blastoid MCL was 36 months (range, 11-103). Subsequent blastoid variant MCL involved soft tissue (two), lymph node (one), ileum (one), or rectum (one). All typical and blastoid neoplasms were positive for CD20, cyclin D1, and monotypic surface immunoglobulin light chain, and all typical cases were positive for CD5. Two blastoid neoplasms lost CD5 expression, one of which aberrantly expressed CD10. Immunostaining for Ki-67 showed a median proliferative fraction of 20% in typical and 70% in blastoid neoplasms. Sequence analysis of the VDJ regions of the rearranged IgH allele proved clonal identity in each set of paired samples in all five patients. These results support the concept that blastoid MCL arising in patients with typical MCL represents histologic transformation of the original neoplastic clone.

  14. Cloning and sequencing of the Bet v 1-homologous allergen Fra a 1 in strawberry (Fragaria ananassa) shows the presence of an intron and little variability in amino acid sequence.

    PubMed

    Musidlowska-Persson, Anna; Alm, Rikard; Emanuelsson, Cecilia

    2007-02-01

    The Fra a 1 allergen in strawberry (Fragaria ananassa) is homologous to the major birch pollen allergen Bet v 1, which has numerous isoforms differing in terms of amino acid sequence and immunological impact. To map the extent of sequence differences in the Fra a 1 allergen, PCR cloning and sequencing was applied. Several genomic sequences of Fra a 1, with a length of either 584, 591 or 594 nucleotides, were obtained from three different strawberry varieties. All contained one intron, with the length of either 101 or 110 nucleotides. By sequencing 30 different clones, eight different DNA sequences were obtained, giving in total five potential Fra a 1 protein isoforms, with high sequence similarity (>97% sequence identity) and only seven positions of amino acid variability, which were largely confirmed by mass spectrometry of expressed proteins. We conclude that the sequence variability in the strawberry allergen Fra a 1 is small, within and between strawberry varieties, and that multiple spots, previously detected in 2DE, are presumably due to differences in post-translational modification rather than differences in amino acid sequence. The most abundant Fra a 1 isoform sequence, recombinantly expressed in Escherichia coli after removal of the intron, was recognized by IgE from strawberry allergic patients. It cross-reacted with antibodies to Bet v 1 and the homologous apple allergen Mal d 1 (61 and 78% sequence identity, respectively), and will be used in further analyses of variation in Fra a 1-expression.

  15. Amino acid sequence homology between rat and human C-reactive protein.

    PubMed Central

    Taylor, J A; Bruton, C J; Anderson, J K; Mole, J E; De Beer, F C; Baltz, M L; Pepys, M B

    1984-01-01

    The rat serum protein that undergoes Ca2+-dependent binding to pneumococcal C-polysaccharide and to phosphocholine residues, and that is evidently a member of the pentraxin family of proteins by virtue of its appearance under the electron microscope, has been variously designated as rat C-reactive protein (CRP) [de Beer, Baltz, Munn, Feinstein, Taylor, Bruton, Clamp & Pepys (1982) Immunology 45, 55-70], 'phosphoryl choline-binding protein' [Nagpurkar & Mookerjea (1981) J. Biol. Chem. 256, 7440-7448] and rat serum amyloid P component (SAP) [Pontet, D'Asnieres, Gache, Escaig & Engler (1981) Biochim. Biophys. Acta 671, 202-210]. The partial amino acid sequence (45 residues) towards the C-terminus of this protein was determined, and it showed 71.7% identity with the known sequence of human CRP but only 54.3% identity with human SAP. Since human CRP and SAP are themselves approximately 50% homologous, the level of identity between the rat protein and human SAP is evidence only of membership of the pentraxin family. In contrast, the much greater resemblance to human CRP confirms that the rat C-polysaccharide-binding/phosphocholine-binding protein is in fact rat CRP. PMID:6477504

  16. Amino acid sequence of bovine gamma E (IVa) lens crystallin.

    PubMed Central

    Kilby, G. W.; Sheil, M. M.; Shaw, D.; Harding, J. J.; Truscott, R. J.

    1997-01-01

    When electrospray ionization mass spectrometry (ESMS) was used to analyze purified bovine gamma E (gamma IVa)-crystallin, it yielded a relative molecular mass (M(r)) of 20.955 +/- 5. This mass is significantly different from that calculated from the published sequence (M(r) 20.894) (White HE et al., 1989, J Mol Biol 207:217-235). Further, ES-MS analysis of the protein after it had been reduced and carboxymethylated indicated the presence of five cysteine residues, whereas the published sequence contains six (Kilby GW et al., 1995, Eur Mass Spectrom 1:203-208). The entire protein sequence of gamma E crystallin has therefore been studied via a combination of ES-MS, ES-MS/MS, and Edman amino acid sequencing. The corrected sequence gives an M(r) of 20.955.3, which matches that obtained by ES-MS analysis of the purified native protein. The corrected sequence is also in agreement with a recent cDNA sequence obtained for a bovine gamma-crystallin by R. Hay (pers. comm.). PMID:9098901

  17. Amino acid sequence of bovine gamma E (IVa) lens crystallin.

    PubMed

    Kilby, G W; Sheil, M M; Shaw, D; Harding, J J; Truscott, R J

    1997-04-01

    When electrospray ionization mass spectrometry (ESMS) was used to analyze purified bovine gamma E (gamma IVa)-crystallin, it yielded a relative molecular mass (M(r)) of 20.955 +/- 5. This mass is significantly different from that calculated from the published sequence (M(r) 20.894) (White HE et al., 1989, J Mol Biol 207:217-235). Further, ES-MS analysis of the protein after it had been reduced and carboxymethylated indicated the presence of five cysteine residues, whereas the published sequence contains six (Kilby GW et al., 1995, Eur Mass Spectrom 1:203-208). The entire protein sequence of gamma E crystallin has therefore been studied via a combination of ES-MS, ES-MS/MS, and Edman amino acid sequencing. The corrected sequence gives an M(r) of 20.955.3, which matches that obtained by ES-MS analysis of the purified native protein. The corrected sequence is also in agreement with a recent cDNA sequence obtained for a bovine gamma-crystallin by R. Hay (pers. comm.).

  18. Amino acid sequence of bovine heart coupling factor 6.

    PubMed Central

    Fang, J K; Jacobs, J W; Kanner, B I; Racker, E; Bradshaw, R A

    1984-01-01

    The amino acid sequence of bovine heart mitochondrial coupling factor 6 (F6) has been determined by automated Edman degradation of the whole protein and derived peptides. Preparations based on heat precipitation and ethanol extraction showed allotypic variation at three positions while material further purified by HPLC yielded only one sequence that also differed by a Phe-Thr replacement at residue 62. The mature protein contains 76 amino acids with a calculated molecular weight of 9006 and a pI of approximately equal to 5, in good agreement with experimentally measured values. The charged amino acids are mainly clustered at the termini and in one section in the middle; these three polar segments are separated by two segments relatively rich in nonpolar residues. Chou-Fasman analysis suggests three stretches of alpha-helix coinciding (or within) the high-charge-density sequences with a single beta-turn at the first polar-nonpolar junction. Comparison of the F6 sequence with those of other proteins did not reveal any homologous structures. PMID:6149548

  19. Analysis of protein function and its prediction from amino acid sequence.

    PubMed

    Clark, Wyatt T; Radivojac, Predrag

    2011-07-01

    Understanding protein function is one of the keys to understanding life at the molecular level. It is also important in the context of human disease because many conditions arise as a consequence of alterations of protein function. The recent availability of relatively inexpensive sequencing technology has resulted in thousands of complete or partially sequenced genomes with millions of functionally uncharacterized proteins. Such a large volume of data, combined with the lack of high-throughput experimental assays to functionally annotate proteins, attributes to the growing importance of automated function prediction. Here, we study proteins annotated by Gene Ontology (GO) terms and estimate the accuracy of functional transfer from protein sequence only. We find that the transfer of GO terms by pairwise sequence alignments is only moderately accurate, showing a surprisingly small influence of sequence identity (SID) in a broad range (30-100%). We developed and evaluated a new predictor of protein function, functional annotator (FANN), from amino acid sequence. The predictor exploits a multioutput neural network framework which is well suited to simultaneously modeling dependencies between functional terms. Experiments provide evidence that FANN-GO (predictor of GO terms; available from http://www.informatics.indiana.edu/predrag) outperforms standard methods such as transfer by global or local SID as well as GOtcha, a method that incorporates the structure of GO.

  20. Amino acid substitutions in genetic variants of human serum albumin and in sequences inferred from molecular cloning

    SciTech Connect

    Takahashi, N.; Takahashi, Y.; Blumberg, B.S.; Putnam, F.W.

    1987-07-01

    The structural changes in four genetic variants of human serum albumin were analyzed by tandem high-pressure liquid chromatography (HPLC) of the tryptic peptides, HPLC mapping and isoelectric focusing of the CNBr fragments, and amino acid sequence analysis of the purified peptides. Lysine-372 of normal (common) albumin A was changed to glutamic acid both in albumin Naskapi, a widespread polymorphic variant of North American Indians, and in albumin Mersin found in Eti Turks. The two variants also exhibited anomalous migration in NaDodSO/sub 4//PAGE, which is attributed to a conformational change. The identity of albumins Naskapi and Mersin may have originated through descent from a common mid-Asiatic founder of the two migrating ethnic groups, or it may represent identical but independent mutations of the albumin gene. In albumin Adana, from Eti Turks, the substitution site was not identified but was localized to the region from positions 447 through 548. The substitution of aspartic acid-550 by glycine was found in albumin Mexico-2 from four individuals of the Pima tribe. Although only single-point substitutions have been found in these and in certain other genetic variants of human albumin, five differences exist in the amino acid sequences inferred from cDNA sequences by workers in three other laboratories. However, our results on albumin A and on 14 different genetic variants accord with the amino acid sequence of albumin deduced from the genomic sequence. The apparent amino acid substitutions inferred from comparison of individual cDNA sequences probably reflect artifacts in cloning or in cDNA sequence analysis rather than polymorphism of the coding sections of the albumin gene.

  1. Constrained Multistate Sequence Design for Nucleic Acid Reaction Pathway Engineering.

    PubMed

    Wolfe, Brian R; Porubsky, Nicholas J; Zadeh, Joseph N; Dirks, Robert M; Pierce, Niles A

    2017-03-01

    We describe a framework for designing the sequences of multiple nucleic acid strands intended to hybridize in solution via a prescribed reaction pathway. Sequence design is formulated as a multistate optimization problem using a set of target test tubes to represent reactant, intermediate, and product states of the system, as well as to model crosstalk between components. Each target test tube contains a set of desired "on-target" complexes, each with a target secondary structure and target concentration, and a set of undesired "off-target" complexes, each with vanishing target concentration. Optimization of the equilibrium ensemble properties of the target test tubes implements both a positive design paradigm, explicitly designing for on-pathway elementary steps, and a negative design paradigm, explicitly designing against off-pathway crosstalk. Sequence design is performed subject to diverse user-specified sequence constraints including composition constraints, complementarity constraints, pattern prevention constraints, and biological constraints. Constrained multistate sequence design facilitates nucleic acid reaction pathway engineering for diverse applications in molecular programming and synthetic biology. Design jobs can be run online via the NUPACK web application.

  2. The amino acid sequence of the alpha chain of HB 2 completes the primary structure of the hemoglobins of the Antarctic fish Notothenia coriiceps neglecta.

    PubMed

    D'Avino, R; Camardella, L; Carratore, V; di Prisco, G

    1990-01-01

    1. The blood of Notothenia coriiceps neglecta (a cold-adapted notothenioid fish, widely distributed in Antarctic waters, and characterized by a relatively low content of erythrocytes and hemoglobin), contains two hemoglobin components, Hb 1 and Hb 2; the amino acid sequences of the beta chain of Hb 1 and Hb 2 are identical. 2. The amino acid sequence of the alpha chain of Hb 2 has been established, thus completing the elucidation of the primary structure of the two hemoglobins.

  3. Sequences Of Amino Acids For Human Serum Albumin

    NASA Technical Reports Server (NTRS)

    Carter, Daniel C.

    1992-01-01

    Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.

  4. NGSCheckMate: software for validating sample identity in next-generation sequencing studies within and across data types.

    PubMed

    Lee, Sejoon; Lee, Soohyun; Ouellette, Scott; Park, Woong-Yang; Lee, Eunjung A; Park, Peter J

    2017-03-23

    In many next-generation sequencing (NGS) studies, multiple samples or data types are profiled for each individual. An important quality control (QC) step in these studies is to ensure that datasets from the same subject are properly paired. Given the heterogeneity of data types, file types and sequencing depths in a multi-dimensional study, a robust program that provides a standardized metric for genotype comparisons would be useful. Here, we describe NGSCheckMate, a user-friendly software package for verifying sample identities from FASTQ, BAM or VCF files. This tool uses a model-based method to compare allele read fractions at known single-nucleotide polymorphisms, considering depth-dependent behavior of similarity metrics for identical and unrelated samples. Our evaluation shows that NGSCheckMate is effective for a variety of data types, including exome sequencing, whole-genome sequencing, RNA-seq, ChIP-seq, targeted sequencing and single-cell whole-genome sequencing, with a minimal requirement for sequencing depth (>0.5X). An alignment-free module can be run directly on FASTQ files for a quick initial check. We recommend using this software as a QC step in NGS studies.

  5. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  6. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  7. Nanopore-based sequencing and detection of nucleic acids.

    PubMed

    Ying, Yi-Lun; Zhang, Junji; Gao, Rui; Long, Yi-Tao

    2013-12-09

    Nanopore-based techniques, which mimic the functions of natural ion channels, have attracted increasing attention as unique methods for single-molecule detection. The technology allows the real-time, selective, high-throughput analysis of nucleic acids through both biological and solid-state nanopores. In this Minireview, the background and latest progress in nanopore-based sequencing and detection of nucleic acids are summarized, and light is shed on a novel platform for nanopore-based detection. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  8. Amino acid sequence of tyrosinase from Neurospora crassa.

    PubMed Central

    Lerch, K

    1978-01-01

    The amino-acid sequence of tyrosinase from Neurospora crassa (monophenol,dihydroxyphenylalanine:oxygen oxidoreductase, EC 1.14.18.1) is reported. This copper-containing oxidase consists of a single polypeptide chain of 407 amino acids. The primary structure was determined by automated and manual sequence analysis on fragments produced by cleavage with cyanogen bromide and on peptides obtained by digestion with trypsin, pepsin, thermolysin, or chymotrypsin. The amino terminus of the protein is acetylated and the single cysteinyl residue 96 is covalently linked via a thioether bridge to histidyl residue 94. The formation and the possible role of this unusual structure in Neurospora tyrosinase is discussed. Dye-sensitized photooxidation of apotyrosinase and active-site-directed inactivation of the native enzyme indicate the possible involvement of histidyl residues 188, 192, 289, and 305 or 306 as ligands to the active-site copper as well as in the catalytic mechanism of this monooxygenase. PMID:151279

  9. Amino-acid sequence of toxin I from Anemonia sulcata.

    PubMed

    Wunderer, G; Eulitz, M

    1978-08-15

    Toxin I from Anemonia sulcata, a major component of the sea anemone venom, consists of 46 amino acid residues which are linked by three disulfide bridges. The [14C]carboxymethylated polypeptide was sequenced to position 29 by automated Edman degradation. The remaining sequence was determined from cyanogen bromide peptides and from tryptic peptides of the citraconylated [14C]carboxymethylated toxin. Toxin I is homologous to toxin II from Anemonia sulcata and to anthopleurin A, a toxin from the sea anemone Anthopleura xanthogrammica. These toxins constitute a new class of polypeptide toxins. No significant homologies exist with toxin III from Anemonia sulcata nor with known sequences of neurotoxins or cardiotoxins of various origin.

  10. Purification, characterization, gene cloning and nucleotide sequencing of D: -stereospecific amino acid amidase from soil bacterium: Delftia acidovorans.

    PubMed

    Hongpattarakere, Tipparat; Komeda, Hidenobu; Asano, Yasuhisa

    2005-12-01

    The D-amino acid amidase-producing bacterium was isolated from soil samples using an enrichment culture technique in medium broth containing D-phenylalanine amide as a sole source of nitrogen. The strain exhibiting the strongest activity was identified as Delftia acidovorans strain 16. This strain produced intracellular D-amino acid amidase constitutively. The enzyme was purified about 380-fold to homogeneity and its molecular mass was estimated to be about 50 kDa, on sodium dodecyl sulfate polyacrylamide gel electrophoresis. The enzyme was active preferentially toward D-amino acid amides rather than their L-counterparts. It exhibited strong amino acid amidase activity toward aromatic amino acid amides including D-phenylalanine amide, D-tryptophan amide and D-tyrosine amide, yet it was not specifically active toward low-molecular-weight D-amino acid amides such as D-alanine amide, L-alanine amide and L-serine amide. Moreover, it was not specifically active toward oligopeptides. The enzyme showed maximum activity at 40 degrees C and pH 8.5 and appeared to be very stable, with 92.5% remaining activity after the reaction was performed at 45 degrees C for 30 min. However, it was mostly inactivated in the presence of phenylmethanesulfonyl fluoride or Cd2+, Ag+, Zn2+, Hg2+ and As3+ . The NH2 terminal and internal amino acid sequences of the enzyme were determined; and the gene was cloned and sequenced. The enzyme gene damA encodes a 466-amino-acid protein (molecular mass 49,860.46 Da); and the deduced amino acid sequence exhibits homology to the D-amino acid amidase from Variovorax paradoxus (67.9% identity), the amidotransferase A subunit from Burkholderia fungorum (50% identity) and other enantioselective amidases.

  11. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy has extensively been used in physical surface sciences to study quantum tunneling to measure electronic local density of states of nanomaterials and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecule of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique ``electronic fingerprints'' for all nucleotides on DNA and RNA using Q-Seq along their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  12. The complementary deoxyribonucleic acid sequence of guinea pig endometrial prorelaxin.

    PubMed

    Lee, Y A; Bryant-Greenwood, G D; Mandel, M; Greenwood, F C

    1992-03-01

    The nucleotide sequence of the relaxin gene transcript in the endometrium of the late pregnant guinea pig has been determined. The strategy used was a combination of polymerase chain reaction (PCR) with primers designed from the mRNA sequence of porcine preprorelaxin, rapid amplification of cDNA ends-PCR, and blunt end cloning in M13 mp18. With heterologous primers, a 226-basepair (bp) segment of the guinea pig relaxin gene sequence was obtained and was used to design a guinea pig-specific primer for use with the rapid amplification of cDNA ends-PCR method. The latter allowed completion of the sequence of 336 bp, with a 96-bp overlap. The sequence obtained shows greater homology at both the nucleotide and amino acid levels with porcine and human relaxins H1 and H2 than with rat relaxin, supporting the thesis that the guinea pig is not a rodent. The transcription of the guinea pig endometrial relaxin gene during pregnancy was confirmed by Northern analysis of guinea pig endometrial tissues with a species-specific cDNA probe. The endometrial relaxin gene is transcribed during pregnancy, but not in lactation, consistent with the observed immunostaining for relaxin.

  13. Cloning and sequencing of Escherichia coli mutR shows its identity to topB, encoding topoisomerase III.

    PubMed Central

    Schofield, M A; Agbunag, R; Michaels, M L; Miller, J H

    1992-01-01

    We have cloned and sequenced the mutR gene from Escherichia coli, which results in an increased frequency of spontaneous deletions, by using a strain carrying a Tn10 derivative inserted into mutR. The analysis of 1,286 bp of mutR sequence shows that this gene is identical to the topB gene, which encodes topoisomerase III. The increased deletion formation is the first reported phenotype for cells lacking topoisomerase III, and this suggests that topoisomerase III is involved in reactions that normally reduce the levels of spontaneous deletions. Images PMID:1321123

  14. Identity-by-descent filtering of exome sequence data identifies PIGV mutations in hyperphosphatasia mental retardation syndrome.

    PubMed

    Krawitz, Peter M; Schweiger, Michal R; Rödelsperger, Christian; Marcelis, Carlo; Kölsch, Uwe; Meisel, Christian; Stephani, Friederike; Kinoshita, Taroh; Murakami, Yoshiko; Bauer, Sebastian; Isau, Melanie; Fischer, Axel; Dahl, Andreas; Kerick, Martin; Hecht, Jochen; Köhler, Sebastian; Jäger, Marten; Grünhagen, Johannes; de Condor, Birgit Jonske; Doelken, Sandra; Brunner, Han G; Meinecke, Peter; Passarge, Eberhard; Thompson, Miles D; Cole, David E; Horn, Denise; Roscioli, Tony; Mundlos, Stefan; Robinson, Peter N

    2010-10-01

    Hyperphosphatasia mental retardation (HPMR) syndrome is an autosomal recessive form of mental retardation with distinct facial features and elevated serum alkaline phosphatase. We performed whole-exome sequencing in three siblings of a nonconsanguineous union with HPMR and performed computational inference of regions identical by descent in all siblings to establish PIGV, encoding a member of the GPI-anchor biosynthesis pathway, as the gene mutated in HPMR. We identified homozygous or compound heterozygous mutations in PIGV in three additional families.

  15. Amino Acid Sequence of Mung Bean Trypsin Inhibitor and Its Modified Forms Appearing during Germination.

    PubMed

    Wilson, K A; Chen, J C

    1983-02-01

    The amino acid sequence of the major trypsin inhibitor, F, of ungerminated mung beans (Vigna radiata [L.] Wilczek) was determined by a combination of automatic solid phase and manual sequencing techniques. F is a typical Bowman-Birk-type proteinase inhibitor with 80 amino acid residues and exhibits a high degree of identity with the other sequenced members of the Bowman-Birk family of inhibitors. Thin layer peptide maps of mung bean inhibitors E and C (which appear during germination) indicate that both are derived from inhibitor F by limited specific proteolysis. Loss of the carboxyl-terminal residues 77 to 80 from F produces inhibitor E, while the loss of an additional two carboxyl-terminal residues, the loss of the amino-terminal residues 1 to 8, and an internal cleavage at Ala(35)-Asp(36) produces inhibitor C from E. Another inhibitor species, E', was isolated from ungerminated seeds. It differs from F in the loss of residues 1 to 6. The majority of the proteolytic cleavages noted in the F-E-C-E' system are at peptide bonds involving aspartyl residues.

  16. Molecular cloning and amino acid sequence of human 5-lipoxygenase

    SciTech Connect

    Matsumoto, T.; Funk, C.D.; Radmark, O.; Hoeoeg, J.O.; Joernvall, H.; Samuelsson, B.

    1988-01-01

    5-Lipoxygenase (EC 1.13.11.34), a Ca/sup 2 +/- and ATP-requiring enzyme, catalyzes the first two steps in the biosynthesis of the peptidoleukotrienes and the chemotactic factor leukotriene B/sub 4/. A cDNA clone corresponding to 5-lipoxygenase was isolated from a human lung lambda gt11 expression library by immunoscreening with a polyclonal antibody. Additional clones from a human placenta lambda gt11 cDNA library were obtained by plaque hybridization with the /sup 32/P-labeled lung cDNA clone. Sequence data obtained from several overlapping clones indicate that the composite DNAs contain the complete coding region for the enzyme. From the deduced primary structure, 5-lipoxygenase encodes a 673 amino acid protein with a calculated molecular weight of 77,839. Direct analysis of the native protein and its proteolytic fragments confirmed the deduced composition, the amino-terminal amino acid sequence, and the structure of many internal segments. 5-Lipoxygenase has no apparent sequence homology with leukotriene A/sub 4/ hydrolase or Ca/sup 2 +/-binding proteins. RNA blot analysis indicated substantial amounts of an mRNA species of approx. = 2700 nucleotides in leukocytes, lung, and placenta.

  17. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    DOEpatents

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  18. The amino acid sequence of chymopapain from Carica papaya.

    PubMed Central

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-01-01

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  19. The amino acid sequence of rabbit cardiac troponin I.

    PubMed Central

    Grand, R J; Wilkinson, J M

    1976-01-01

    The complete amino acid sequence of troponin I from rabbit cardiac muscle was determined by the isolation of four unique CNBr fragments, together with overlapping tryptic peptides containing radioactive methionine residues. Overlap data for residues 35-36, 93-94 and 140-145 are incomplete, the sequence at these positions being based on homology with the sequence of the fast-skeletal-muscle protein. Cardiac troponin I is a single polypeptide chain of 206 residues with mol.wt. 23550 and an extinction coefficient, E 1%,1cm/280, of 4.37. The protein has a net positive charge of 14 and is thus somewhat more basic than troponin I from fast-skeletal muscle. Comparison of the sequences of troponin I from cardiac and fast skeletal muscle show that the cardiac protein has 26 extra residues at the N-terminus which account for the larger size of the protein. In the remainder of sequence there is a considerable degree of homology, this being greater in the C-terminal two-thirds of the molecule. The region in the cardiac protein corresponding to the peptide with inhibitory activity from the fast-skeletal-muscle protein is very similar and it seems unlikely that this is the cause of the difference in inhibitory activity between the two proteins. The region responsible for binding troponin C, however, possesses a lower degree of homology. Detailed evidence on which the sequence is based has been deposited as Supplementary Publication SUP 50072 (20 pages), at the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7QB, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1976) 153, 5. PMID:1008822

  20. Bacteriorhodopsin: partial sequence of mRNA provides amino acid sequence in the precursor region.

    PubMed Central

    Chang, S H; Majumdar, A; Dunn, R; Makabe, O; RajBhandary, U L; Khorana, H G; Ohtsuka, E; Tanaka, T; Taniyama, Y O; Ikehara, M

    1981-01-01

    mRNA for bacteriorhodopsin from Halobacterium halobium has been partially purified. By using this mRNA as template in the presence of reverse transcriptase RNA-dependent DNA nucleotidyltransferase and a 5'-[32P] synthetic oligodeoxyribonucleotide corresponding to amino acids 9-12 of bacteriorhodopsin as primer, we have isolated the major 5'-[32P]cDNA product, approximately 80 nucleotides long, and determined its sequence. Based on the cDNA sequence, the 5'-proximal sequence of bacteriorhodopsin mRNA is G-C-A-U-G-U-U-G-G-A-G-U-U-A-U-U-G-C-C-A-A-C-A-G-C-A-G-U-G-G-A-G-G-G-G-G-U-A-U-C -G-C-A-G-G-C-C-C-A-G-A-U-C-A-C-C-G-G-A-C-G-U-C-C-G. This includes the expected sequence for amino acids 1-8 and shows that bacteriorhodopsin is synthesized as a precursor that is at least 13 amino acids longer (Met-Leu-Glu-Leu-Leu-Pro-Thr-Ala-Val-Glu-Gly-Val-Ser) at the NH2 terminus. Agarose/urea gel electrophoresis of the partially purified mRNA showed several bands; of these, a major one hybridized with 5'-[32P]cDNA. These results suggest that the bacteriorhodopsin mRNA in the partially purified preparation is homogeneous in size and that it constitutes a substantial portion of the RNA preparation subjected to electrophoresis. Images PMID:6943548

  1. Relationship between peptide amino acid sequence and membrane curvature generation

    NASA Astrophysics Data System (ADS)

    Schmidt, Nathan; Kuo, David; Hwee Lai, Ghee; Mishra, Abhijit; Wong, Gerard

    2012-02-01

    Amphipathic peptides and amphipathic domains in proteins can perturb and restructure biological membranes. For example, it is believed that the cationic, amphipathic motif found in membrane active antimicrobial peptides (AMPs) is responsible for their membrane disruption mechanisms of action. And ApoA-I, the main apolipoprotein in high density lipoprotein contains a series of amphipathic α-helical repeats which are responsible for its lipid associating properties. We use small angle x-ray scattering (SAXS) to investigate the interaction of model cell membranes with prototypical AMPs and consensus peptides derived from the helical structural motif of ApoA-I. The relationship between peptide sequence and the peptide-induced changes in membrane curvature and topology is examined. By comparing the membrane rearrangement and corresponding phase behavior induced by these two distinct classes of membrane restructuring peptides we will discuss the role of amino acid sequence on membrane curvature generation.

  2. Unconventional amino acid sequence of the sun anemone (Stoichactis helianthus) polypeptide neurotoxin

    SciTech Connect

    Kem, W.; Dunn, B.; Parten, B.; Pennington, M.; Price, D.

    1986-05-01

    A 5000 dalton polypeptide neurotoxin (Sh-NI) purified by G50 Sephadex, P-cellulose, and SP-Sephadex chromatography was homogeneous by isoelectric focusing. Sh-NI was highly toxic to crayfish (LD/sub 50/ 0.6 ..mu..g/kg) but without effect upon mice at 15,000 ..mu..g/kg (i.p. injection). The reduced, /sup 3/H-carboxymethylated toxin and its fragments were subjected to automatic Edman degradation and the resulting PTH-amino acids were identified by HPLC, back hydrolysis, and scintillation counting. Peptides resulting from proteolytic (clostripain, staphylococcal protease) and chemical (tryptophan) cleavage were sequenced. The sequence is: AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK. This sequence differs considerably from the homologous Anemonia and Anthopleura toxins; many of the identical residues (6 half-cystines, G9, P10, R13, G19, G29, W30) are probably critical for folding rather than receptor recognition. However, the Sh-NI sequence closely resembles Radioanthus macrodactylus neurotoxin III and r. paumotensis II. The authors propose that Sh-NI and related Radioanthus toxins act upon a different site on the sodium channel.

  3. Ultrasensitive nucleic acid sequence detection by single-molecule electrophoresis

    SciTech Connect

    Castro, A; Shera, E.B.

    1996-09-01

    This is the final report of a one-year laboratory-directed research and development project at Los Alamos National Laboratory. There has been considerable interest in the development of very sensitive clinical diagnostic techniques over the last few years. Many pathogenic agents are often present in extremely small concentrations in clinical samples, especially at the initial stages of infection, making their detection very difficult. This project sought to develop a new technique for the detection and accurate quantification of specific bacterial and viral nucleic acid sequences in clinical samples. The scheme involved the use of novel hybridization probes for the detection of nucleic acids combined with our recently developed technique of single-molecule electrophoresis. This project is directly relevant to the DOE`s Defense Programs strategic directions in the area of biological warfare counter-proliferation.

  4. A novel phytase with sequence similarity to purple acid phosphatases is expressed in cotyledons of germinating soybean seedlings.

    PubMed

    Hegeman, C E; Grabau, E A

    2001-08-01

    Phytic acid (myo-inositol hexakisphosphate) is the major storage form of phosphorus in plant seeds. During germination, stored reserves are used as a source of nutrients by the plant seedling. Phytic acid is degraded by the activity of phytases to yield inositol and free phosphate. Due to the lack of phytases in the non-ruminant digestive tract, monogastric animals cannot utilize dietary phytic acid and it is excreted into manure. High phytic acid content in manure results in elevated phosphorus levels in soil and water and accompanying environmental concerns. The use of phytases to degrade seed phytic acid has potential for reducing the negative environmental impact of livestock production. A phytase was purified to electrophoretic homogeneity from cotyledons of germinated soybeans (Glycine max L. Merr.). Peptide sequence data generated from the purified enzyme facilitated the cloning of the phytase sequence (GmPhy) employing a polymerase chain reaction strategy. The introduction of GmPhy into soybean tissue culture resulted in increased phytase activity in transformed cells, which confirmed the identity of the phytase gene. It is surprising that the soybean phytase was unrelated to previously characterized microbial or maize (Zea mays) phytases, which were classified as histidine acid phosphatases. The soybean phytase sequence exhibited a high degree of similarity to purple acid phosphatases, a class of metallophosphoesterases.

  5. Processing and amino acid sequence analysis of the mouse mammary tumor virus env gene product.

    PubMed Central

    Arthur, L O; Copeland, T D; Oroszlan, S; Schochetman, G

    1982-01-01

    The envelope proteins of mouse mammary tumor virus (MMTV) are synthesized from a subgenomic 24S mRNA as a 75,000-dalton glycosylated precursor polyprotein which is eventually processed to the mature glycoproteins gp52 and gp36. In vivo synthesis of this env precursor in the presence of the core glycosylation inhibitor tunicamycin yielded a precursor of approximately 61,000 daltons (P61env). However, a 67,000-dalton protein (P67env) was obtained from cell-free translation with the MMTV 24S mRNA as the template. To determine whether the portion of the protein cleaved from P67env to give P61env was removed from the NH2-terminal end of P67env and as such would represent a leader sequence, the NH2-terminal amino acid sequence of the terminal peptide gp52 was determined. Glutamic acid, and not methionine, was found to be the amino-terminal residue of gp52, indicating that the cleaved portion was derived from the NH2-terminal end of P67env. The NH2-terminal amino acid sequences of gp52's from endogenous and exogenous C3H MMTVs were determined though 46 residues and found to be identical. However, amino acid composition and type-specific gp52 radioimmunoassays from MMTVs grown in heterologous cells indicated primary structure differences between gp52's of the two viruses. The nucleic acid sequence of cloned MMTV DNA fragments (J. Majors and H. E. Varmus, personal communication) in conjunction with the NH2-terminal sequence of gp52 allowed localization of the env gene in the MMTV genome. Nucleotides coding for the NH2 terminus of gp52 begin approximately 0.8 kilobase to the 3' side of the single EcoRI cleavage site. Localization of the env gene at that point agrees with the proposed gene order -gag-pol-env- and also allows sufficient coding potential for the glycoprotein precursor without extending into the long terminal repeat. Images PMID:6281457

  6. Prediction of protein antigenic determinants from amino acid sequences

    SciTech Connect

    Hopp, T.P.; Woods, K.R.

    1981-06-01

    A method is presented for locating protein antigenic determinants by analyzing amino acid sequences in order to find the point of greatest local hydrophilicity. This is accomplished by assigning each amino acid a numerical value (hydrophilicity value) and then repetitively averaging these values along the peptide chain. The point of highest local average hydrophilicity is invariably located in, or immediately adjacent to, an antigenic determinant. It was found that the prediction success rate depended on averaging group length, with hexapeptide averages yielding optimal results. The method was developed using 12 proteins for which extensive immunochemical analysis has been carried out and subsequently was used to predict antigenic determinants for the following proteins: hepatitis B surface antigen, influenza hemagglutinis, fowl plague virus hemagglutinin, human histocompatibility antigen HLA-B7, human interferons, Escherichia coli and cholera enterotoxins, ragweed allergens Ra3 and Ra5, and streptococcal M protein. The hepatitis B surface antigen sequence was synthesized by chemical means and was shown to have antigenic activity by radioimmunoassay.

  7. The ABRF Edman Sequencing Research Group 2008 Study: Investigation into Homopolymeric Amino Acid N-Terminal Sequence Tags and Their Effects on Automated Edman Degradation

    PubMed Central

    Thoma, R. S.; Smith, J. S.; Sandoval, W.; Leone, J. W.; Hunziker, P.; Hampton, B.; Linse, K. D.; Denslow, N. D.

    2009-01-01

    The Edman Sequence Research Group (ESRG) of the Association of Biomolecular Resource designs and executes interlaboratory studies investigating the use of automated Edman degradation for protein and peptide analysis. In 2008, the ESRG enlisted the help of core sequencing facilities to investigate the effects of a repeating amino acid tag at the N-terminus of a protein. Commonly, to facilitate protein purification, an affinity tag containing a polyhistidine sequence is conjugated to the N-terminus of the protein. After expression, polyhistidine-tagged protein is readily purified via chelation with an immobilized metal affinity resin. The addition of the polyhistidine tag presents unique challenges for the determination of protein identity using Edman degradation chemistry. Participating laboratories were asked to sequence one protein engineered in three configurations: with an N-terminal polyhistidine tag; with an N-terminal polyalanine tag; or with no tag. Study participants were asked to return a data file containing the uncorrected amino acid picomole yields for the first 17 cycles. Initial and repetitive yield (R.Y.) information and the amount of lag were evaluated. Information about instrumentation and sample treatment was also collected as part of the study. For this study, the majority of participating laboratories successfully called the amino acid sequence for 17 cycles for all three test proteins. In general, laboratories found it more difficult to call the sequence containing the polyhistidine tag. Lag was observed earlier and more consistently with the polyhistidine-tagged protein than the polyalanine-tagged protein. Histidine yields were significantly less than the alanine yields in the tag portion of each analysis. The polyhistidine and polyalanine protein-R.Y. calculations were found to be equivalent. These calculations showed that the nontagged portion from each protein was equivalent. The terminal histidines from the tagged portion of the protein

  8. Selection of target sequences as well as sequence identity determine the outcome of RNAi approach for resistance against cotton leaf curl geminivirus complex

    PubMed Central

    2011-01-01

    Cotton leaf curl disease is caused by a geminivirus complex that involves multiple distinct begomoviruses and a disease-specific DNA satellite, cotton leaf curl Multan betasatellite (CLCuMB), which is essential to induce disease symptoms. Here we have investigated the use of RNA interference (RNAi) for obtaining resistance against one of the viruses, Cotton leaf curl Multan virus (CLCuMV), associated with the disease. Three hairpin RNAi constructs were produced containing either complementary-sense genes essential for replication/pathogenicity or non-coding regulatory sequences of CLCuMV. In transient assays all three RNAi constructs significantly reduced the replication of the virus in inoculated tissues. However, only one of the constructs, that targeting the overlapping genes involved in virus replication and pathogenicity (the replication-associated protein (Rep), the transcriptional activator protein and the replication enhancer protein) was able to prevent systemic movement of the virus, although the other constructs significantly reduced the levels of virus in systemic tissues. In the presence of CLCuMB, however, a small number of plants co-inoculated with even the most efficient RNAi construct developed symptoms of virus infection, suggesting that the betasatellite may compromise resistance. Further analyses, using Rep gene sequences of distinct begomoviruses expressed from a PVX vector as the target, are consistent with the idea that the success of the RNAi approach depends on sequence identity to the target virus. The results show that selection of both the target sequence, as well as the levels of identity between the construct and target sequence, determine the outcome of RNAi-based resistance against geminivirus complexes. PMID:21410988

  9. Developmental variation and amino acid sequences of cytochromes c of the fruit fly Drosophila melanogaster and the flesh fly Boettcherisca peregrina.

    PubMed

    Inoue, S; Inoue, H; Hiroyoshi, T; Matsubara, H; Yamanaka, T

    1986-10-01

    The amino acid sequences of cytochromes c purified from the fruit fly Drosophila melanogaster and the flesh fly Boettcherisca peregrina were determined. In contrast with the case of the housefly, isocytochromes c were not detected in these flies at any developmental stage. The sequence of fruit fly cytochrome c differed from that reported previously but was identical with that predicted from the nucleotide sequence of the fruit fly cytochrome c gene (DC4) (Limbach, K.J. & Wu, R. (1985) Nucl. Acids Res. 13, 631-644). Isocytochrome c of the fruit fly, reported to be encoded by the DC3 gene, was not detected as a functional cytochrome c molecule.

  10. Cloning and sequencing of the medium-chain S-acyl fatty acid synthetase thioester hydrolase cDNA from rat mammary gland.

    PubMed Central

    Naggert, J; Williams, B; Cashman, D P; Smith, S

    1987-01-01

    cDNA clones coding for the medium-chain S-acyl fatty acid synthetase thioester hydrolase (thioesterase II) from rat mammary gland were identified in a bacteriophage lambda gt11 library and their nucleotide sequences were determined. The predicted coding region spans 263 amino acid residues and includes a sequence identical with that of a peptide derived from the enzyme active site. The rat thioesterase II cDNA sequence exhibits homology with that of a thioesterase found in duck uropygial glands. Images Fig. 3. PMID:3632637

  11. Protein identities from 'Graphocephala atropunctata' expressed sequence tags: Expanding leafhopper vector biology

    USDA-ARS?s Scientific Manuscript database

    Heat shock proteins and 44 protein sequences from the blue-green sharpshooter, BGSS, were produced and identified. The sequences were submitted and published under accession numbers: DQ445499-DQ445542, in the National Center for Biotechnology Information, NCBI, Public Database. The blue-green sharps...

  12. Amino Acid Sequence of a Novel Calmodulin from the Unicellular Alga Chlamydomonas1

    PubMed Central

    Lukas, Thomas J.; Wiggins, Michael E.; Watterson, D. Martin

    1985-01-01

    An amino acid sequence for a Chlamydomonas calmodulin has been elucidated with emphasis on the characterization of differences that are unique to Chlamydomonas and Dictyostelium calmodulin. While the concentration of calmodulin required for half-maximal activation of plant NAD kinase varies among vertebrate, higher plant, algal, and slime mold calmodulins, only calmodulins from the unicellular alga Chlamydomonas and the slime mold Dictyostelium show increased maximal activation of NAD kinase (Roberts, Burgess, Watterson 1984 Plant Physiol 75: 796-798; Marshak, Clarke, Roberts, Watterson 1984 Biochemistry 23: 2891-2899). The same preparations of calmodulin do not show major differences in phosphodiesterase or myosin light chain kinase activator activity. We report here that a Chlamydomonas calmodulin has four primary structural features similar to Dictyostelium that are not found in other calmodulins characterized to date: an altered carboxy terminus including a novel 11-residue extension for Chlamydomonas calmodulin, unique residues at positions 81 and 118, and an unmethylated lysine at position 115. The only amino acid sequence identity unique to Chlamydomonas and Dictyostelium calmodulin is the presence of a lysine at position 115 instead of a trimethyllysine. These studies indicate that the methylation state of lysine 115 may be important in the maximal NAD kinase activator activity of calmodulin and support the concept that calmodulin has multiple functional domains in addition to multiple structural domains. PMID:16664269

  13. The amino acid sequences and activities of synergistic hemolysins from Staphylococcus cohnii.

    PubMed

    Mak, Pawel; Maszewska, Agnieszka; Rozalska, Malgorzata

    2008-10-01

    Staphylococcus cohnii ssp. cohnii and S. cohnii ssp. urealyticus are a coagulase-negative staphylococci considered for a long time as unable to cause infections. This situation changed recently and pathogenic strains of these bacteria were isolated from hospital environments, patients and medical staff. Most of the isolated strains were resistant to many antibiotics. The present work describes isolation and characterization of several synergistic peptide hemolysins produced by these bacteria and acting as virulence factors responsible for hemolytic and cytotoxic activities. Amino acid sequences of respective hemolysins from S. cohnii ssp. cohnii (named as H1C, H2C and H3C) and S. cohnii ssp. urealyticus (H1U, H2U and H3U) were identical. Peptides H1 and H3 possessed significant amino acid homology to three synergistic hemolysins secreted by Staphylococcus lugdunensis and to putative antibacterial peptide produced by Staphylococcus saprophyticus ssp. saprophyticus. On the other hand, hemolysin H2 had a unique sequence. All isolated peptides lysed red cells from different mammalian species and exerted a cytotoxic effect on human fibroblasts.

  14. A novel regucalcin gene promoter region-related protein: comparison of nucleotide and amino acid sequences in vertebrate species.

    PubMed

    Sawada, Natsumi; Yamaguchi, Masayoshi

    2005-01-01

    The molecular cloning and sequencing of the cDNA coding for a novel regucalcin gene promoter region-related protein (RGPR-p117) from bovine, rabbit and chicken livers was investigated using rapid amplification of cDNA endo (RACE) method. Their nucleotide and amino acid sequences were compared with human, rat and mouse sequences published previously. RGPR-p117 of bovine, rabbit and chicken livers consisted of 1052, 1045, and 929 amino acid residues with calculated molecular mass of 117, 114, and 103 kDa, and estimated pI of 5.64, 5.84, and 5.59, respectively. Comparison analysis revealed that the nucleotide sequences of RGPR-p117 from mammalian species were highly-conserved in their coding region, and the homologies were at least 72.9%. The RGPR-p117 proteins in mammalian species consisted of 1045-1060 amino acids, and had 63.1-90.2% identity. Meanwhile, the nucleotide and amino acid sequences of chicken RGPR-p117 had at least 36.4 and 43.7% identities, respectively. Phylogenetic analysis showed that RGPR-p117 in six vertebrates appears to form a single cluster. Mammalian RGPR-p117 conserved a leucine zipper motif. Moreover, the analysis for subcellular localization of RGPR-p117 from six vertebrates showed the probability of nuclear localization >52.2%; the nuclear localization in rat and mouse was 78.3%. This study demonstrates a great conservation of RGPR-p117 genes throughout evolution.

  15. Amino acid sequences of alpha-helical segments from S-carbosymethylkerateine-A. Complete sequence of a type-I segment.

    PubMed Central

    Gough, K H; Inglis, A S; Crewther, W G

    1978-01-01

    The amino acid sequence of a type-I helical segment from the low-sulphur protein (S-carboxymethylkerateine-A) of wool was determined by combining automatic and manual-sequencing data. Whereas in the type-II helical segment most of the cationic groups occur in pairs, 11 of the 22 anionic residues in the sequence of the type-I segment were situated next to a second anionic residue. This suggests possible interactions between type-I and type-II helical segments in alpha-keratin. As observed with the sequence of a type-II helical segment a model constructed on 3.6 residues per turn of helix shows a line of hydrophobic residues along the helix, thereby supporting the physicochemical evidence that the molecule is predominantly helical and forms part of a coiled-coil structure. Examination of the sequence data by predictive methods indicates the possibilty of extensive sections of alpha-helix interspersed with discontinuities. The molecule contains a number of regions with peptide sequences identical with those found by other workers after enzymic digestion of fractions from oxidized wool. Images Fig. 1. PMID:697725

  16. Isolation and amino acid sequences of squirrel monkey (Saimiri sciurea) insulin and glucagon

    SciTech Connect

    Yu, Jinghua ); Eng, J.; Yalow, R.S. City Univ. of New York, NY )

    1990-12-01

    It was reported two decades ago that insulin was not detectable in the glucose-stimulated state in Saimiri sciurea, the New World squirrel monkey, by a radioimmunoassay system developed with guinea pig anti-pork insulin antibody and labeled park insulin. With the same system, reasonable levels were observed in rhesus monkeys and chimpanzees. This suggested that New World monkeys, like the New World hystricomorph rodents such as the guinea pig and the coypu, might have insulins whose sequences differ markedly from those of Old World mammals. In this report the authors describe the purification and amino acid sequences of squirrel monkey insulin and glucagon. They demonstrate that the substitutions at B29, B27, A2, A4, and A17 of squirrel monkey insulin are identical with those previously found in another New World primate, the owl monkey (Aotus trivirgatus). The immunologic cross-reactivity of this insulin in their immunoassay system is only a few percent of that of human insulin. It appears that the peptides of the New World monkeys have diverged less from those of the Old World mammals than have those of the New World hystricomorph rodents. The striking improvements in peptide purification and sequencing have the potential for adding new information concerning the evolutionary divergence of species.

  17. Sequence evolution and sex-specific expression patterns of the C class floral identity gene, SpAGAMOUS, in dioecious Spinacia oleracea L.

    PubMed

    Sather, D Noah; York, Amber; Pobursky, Kevin J; Golenberg, Edward M

    2005-10-01

    Development in dioecious cultivated spinach, Spinacia oleracea, is distinguished by the absence of alternative reproductive organ primordia in male and female flowers. Given the highly derived floral developmental program in spinach, we wished to characterize a spinach C class floral identity gene and to determine the patterns of sequence evolution as well as compare the spatial and temporal expression patterns with those of AGAMOUS. The isolated cDNA sequence clusters phylogenetically within the AGAMOUS/FARINELLI C class clade. In comparison with the SLM1 sequence from the related Silene latifolia, amino acid replacements are highly conservative and non-randomly distributed, being predominantly found in hinge regions or on exposed surfaces of helices. The spinach gene (SpAGAMOUS) appears to be exclusively expressed in reproductive tissues and not in vegetative organs. Initial expression of SpAGAMOUS is similar in male and female floral primordia. However, upon initiation of the first whorl organs, SpAGAMOUS becomes restricted to meristemic regions from which the reproductive primordia will develop. This results in an early gender-specific pattern. Thus, the spinach C class gene is differentially expressed prior to reproductive organ development and is, at least, correlated with, if not directly involved in, the sexual dimorphism in spinach.

  18. Cloning and nucleotide sequencing of a novel 7 beta-(4-carboxybutanamido)cephalosporanic acid acylase gene of Bacillus laterosporus and its expression in Escherichia coli and Bacillus subtilis.

    PubMed

    Aramori, I; Fukagawa, M; Tsumura, M; Iwami, M; Ono, H; Kojo, H; Kohsaka, M; Ueda, Y; Imanaka, H

    1991-12-01

    A strain of Bacillus species which produced an enzyme named glutaryl 7-ACA acylase which converts 7 beta-(4-carboxybutanamido)cephalosporanic acid (glutaryl 7-ACA) to 7-amino cephalosporanic acid (7-ACA) was isolated from soil. The gene for the glutaryl 7-ACA acylase was cloned with pHSG298 in Escherichia coli JM109, and the nucleotide sequence was determined by the M13 dideoxy chain termination method. The DNA sequence revealed only one large open reading frame composed of 1,902 bp corresponding to 634 amino acid residues. The deduced amino acid sequence contained a potential signal sequence in its amino-terminal region. Expression of the gene for glutaryl 7-ACA acylase was performed in both E. coli and Bacillus subtilis. The enzyme preparations purified from either recombinant strain of E. coli or B. subtilis were shown to be identical with each other as regards the profile of sodium dodecyl sulfate-polyacrylamide gel electrophoresis and were composed of a single peptide with the molecular size of 70 kDa. Determination of the amino-terminal sequence of the two enzyme preparations revealed that both amino-terminal sequences (the first nine amino acids) were identical and completely coincided with residues 28 to 36 of the open reading frame. Extracellular excretion of the enzyme was observed in a recombinant strain of B. subtilis.

  19. Cloning and nucleotide sequencing of a novel 7 beta-(4-carboxybutanamido)cephalosporanic acid acylase gene of Bacillus laterosporus and its expression in Escherichia coli and Bacillus subtilis.

    PubMed Central

    Aramori, I; Fukagawa, M; Tsumura, M; Iwami, M; Ono, H; Kojo, H; Kohsaka, M; Ueda, Y; Imanaka, H

    1991-01-01

    A strain of Bacillus species which produced an enzyme named glutaryl 7-ACA acylase which converts 7 beta-(4-carboxybutanamido)cephalosporanic acid (glutaryl 7-ACA) to 7-amino cephalosporanic acid (7-ACA) was isolated from soil. The gene for the glutaryl 7-ACA acylase was cloned with pHSG298 in Escherichia coli JM109, and the nucleotide sequence was determined by the M13 dideoxy chain termination method. The DNA sequence revealed only one large open reading frame composed of 1,902 bp corresponding to 634 amino acid residues. The deduced amino acid sequence contained a potential signal sequence in its amino-terminal region. Expression of the gene for glutaryl 7-ACA acylase was performed in both E. coli and Bacillus subtilis. The enzyme preparations purified from either recombinant strain of E. coli or B. subtilis were shown to be identical with each other as regards the profile of sodium dodecyl sulfate-polyacrylamide gel electrophoresis and were composed of a single peptide with the molecular size of 70 kDa. Determination of the amino-terminal sequence of the two enzyme preparations revealed that both amino-terminal sequences (the first nine amino acids) were identical and completely coincided with residues 28 to 36 of the open reading frame. Extracellular excretion of the enzyme was observed in a recombinant strain of B. subtilis. Images FIG. 2 FIG. 5 FIG. 6 PMID:1744041

  20. The evolution of proteins from random amino acid sequences: II. Evidence from the statistical distributions of the lengths of modern protein sequences.

    PubMed

    White, S H

    1994-04-01

    This paper continues an examination of the hypothesis that modern proteins evolved from random heteropeptide sequences. In support of the hypothesis, White and Jacobs (1993, J Mol Evol 36:79-95) have shown that any sequence chosen randomly from a large collection of nonhomologous proteins has a 90% or better chance of having a lengthwise distribution of amino acids that is indistinguishable from the random expectation regardless of amino acid type. The goal of the present study was to investigate the possibility that the random-origin hypothesis could explain the lengths of modern protein sequences without invoking specific mechanisms such as gene duplication or exon splicing. The sets of sequences examined were taken from the 1989 PIR database and consisted of 1,792 "super-family" proteins selected to have little sequence identity, 623 E. coli sequences, and 398 human sequences. The length distributions of the proteins could be described with high significance by either of two closely related probability density functions: The gamma distribution with parameter 2 or the distribution for the sum of two exponential random independent variables. A simple theory for the distributions was developed which assumes that (1) protoprotein sequences had exponentially distributed random independent lengths, (2) the length dependence of protein stability determined which of these protoproteins could fold into compact primitive proteins and thereby attain the potential for biochemical activity, (3) the useful protein sequences were preserved by the primitive genome, and (4) the resulting distribution of sequence lengths is reflected by modern proteins. The theory successfully predicts the two observed distributions which can be distinguished by the functional form of the dependence of protein stability on length. The theory leads to three interesting conclusions. First, it predicts that a tetra-nucleotide was the signal for primitive translation termination. This prediction is

  1. Reticuloendotheliosis Virus Nucleic Acid Sequences in Cellular DNA

    PubMed Central

    Kang, Chil-Yong; Temin, Howard M.

    1974-01-01

    Reticuloendotheliosis virus 60S RNA labeled with 125I, or reticuloendotheliosis virus complementary DNA labeled with 3H, were hybridized to DNAs from infected chicken and pheasant cells. Most of the sequences of the viral RNA were found in the infected cell DNAs. The reticuloendotheliosis viruses, therefore, replicate through a DNA intermediate. The same labeled nucleic acids were hybridized to DNA of uninfected chicken, pheasant, quail, turkey, and duck. About 10% of the sequences of reticuloendotheliosis virus RNA were present in the DNA of uninfected chicken, pheasant, quail, and turkey. None were detected in DNA of duck. The specificity of the hybridization was shown by competition between unlabeled and 125I-labeled viral RNAs and by determination of melting temperatures. In contrast, 125I-labeled RNA of Rous-associated virus-O, an avian leukosis-sarcoma virus, hybridized 55% to DNA of uninfected chicken, 20% to DNA of uninfected pheasant, 15% to DNA of uninfected quail, 10% to DNA of uninfected turkey, and less than 1% to DNA of uninfected duck. PMID:4372393

  2. Nucleic acid (cDNA) and amino acid sequences of the maize endosperm protein glutelin-2.

    PubMed Central

    Prat, S; Cortadas, J; Puigdomènech, P; Palau, J

    1985-01-01

    The cDNA coding for a glutelin-2 protein from maize endosperm has been cloned and the complete amino acid sequence of the protein derived for the first time. An immature maize endosperm cDNA bank was screened for the expression of a beta-lactamase:glutelin-2 (G2) fusion polypeptide by using antibodies against the purified 28 kd G2 protein. A clone corresponding to the 28 kd G2 protein was sequenced and the primary structure of this protein was derived. Five regions can be defined in the protein sequence: an 11 residue N-terminal part, a repeated region formed by eight units of the sequence Pro-Pro-Pro-Val-His-Leu, an alternating Pro-X stretch 21 residues long, a Cys rich domain and a C-terminal part rich in Gln. The protein sequence is preceded by 19 residues which have the characteristics of the signal peptide found in secreted proteins. Unlike zeins, the main maize storage proteins, 28 kd glutelin-2 has several homologous sequences in common with other cereal storage proteins. Images PMID:3839076

  3. Predicting protein amidation sites by orchestrating amino acid sequence features

    NASA Astrophysics Data System (ADS)

    Zhao, Shuqiu; Yu, Hua; Gong, Xiujun

    2017-08-01

    Amidation is the fourth major category of post-translational modifications, which plays an important role in physiological and pathological processes. Identifying amidation sites can help us understanding the amidation and recognizing the original reason of many kinds of diseases. But the traditional experimental methods for predicting amidation sites are often time-consuming and expensive. In this study, we propose a computational method for predicting amidation sites by orchestrating amino acid sequence features. Three kinds of feature extraction methods are used to build a feature vector enabling to capture not only the physicochemical properties but also position related information of the amino acids. An extremely randomized trees algorithm is applied to choose the optimal features to remove redundancy and dependence among components of the feature vector by a supervised fashion. Finally the support vector machine classifier is used to label the amidation sites. When tested on an independent data set, it shows that the proposed method performs better than all the previous ones with the prediction accuracy of 0.962 at the Matthew's correlation coefficient of 0.89 and area under curve of 0.964.

  4. Sequence Comparison and Phylogeny of Nucleotide Sequence of Coat Protein and Nucleic Acid Binding Protein of a Distinct Isolate of Shallot virus X from India.

    PubMed

    Majumder, S; Baranwal, V K

    2011-06-01

    Shallot virus X (ShVX), a type species in the genus Allexivirus of the family Alfaflexiviridae has been associated with shallot plants in India and other shallot growing countries like Russia, Germany, Netherland, and New Zealand. Coat protein (CP) and nucleic acid binding protein (NB) region of the virus was obtained by reverse transcriptase polymerase chain reaction from scales leaves of shallot bulbs. The partial cDNA contained two open reading frames encoding proteins of molecular weights of 28.66 and 14.18 kDa belonging to Flexi_CP super-family and viral NB super-family, respectively. The percent identity and phylogenetic analysis of amino acid sequences of CP and NB region of the virus associated with shallot indicated that it was a distinct isolate of ShVX.

  5. Complete amino acid sequence of chicken liver acyl carrier protein derived from the fatty acid synthase.

    PubMed

    Huang, W Y; Stoops, J K; Wakil, S J

    1989-04-01

    The acyl carrier protein domain of the chicken liver fatty acid synthase has been isolated after tryptic treatment of the synthase. The isolated domain functions as an acceptor of acetyl and malonyl moieties in the synthase-catalyzed transfer of these groups from their coenzyme A esters and therefore indicates that the acyl carrier protein domain exists in the complex as a discrete entity. The amino acid sequence of the acyl carrier protein was derived from analyses of peptide fragments produced by cyanogen bromide cleavage and trypsin and Staphylococcus aureus V8 protease digestions of the molecule. The isolated acyl carrier protein domain consists of 89 amino acid residues and has a calculated molecular weight of 10,127. The protein contains the phosphopantetheine group attached to the serine residue at position 38. The isolated acyl carrier protein peptide shows some sequence homology with the acyl carrier protein of Escherichia coli, particularly in the vicinity of the site of phosphopantetheine attachment, and shows extensive sequence homology with the acyl carrier protein from the uropygial gland of goose.

  6. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...

  7. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...

  8. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2013-07-01 2013-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...

  9. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2012-07-01 2012-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...

  10. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2014-07-01 2014-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...

  11. Protein identities - Graphocephala atropunctata expressed sequenced tags: expanding leafhopper vector biology

    USDA-ARS?s Scientific Manuscript database

    A small heat shock protein was isolated and sequenced from the Blue-green sharpshooter, BGSS, Graphocephala atropunctata (Signoret) (Hemiptera: Cicadellidae). The BGSS has been the native vector of Pierce’s disease in vineyards in California for nearly a century. The importance of this vector spec...

  12. The Effect of Expression Sequence and Expressor Identity on Judgments of the Intensity of Facial Expression.

    ERIC Educational Resources Information Center

    Thayer, Stephen

    Three hundred female college students participated in a study to determine whether viewing a prior sequence of photographs of emotional-facial expressions would influence their impressions of later expressions. Each subject was tested individually and shown one series of five separate photographs of varying facial expressions, one photograph at a…

  13. Fostering Research Identities in Two-Course Writing Sequences: A Curricular Perspective

    ERIC Educational Resources Information Center

    Sura, Thomas

    2015-01-01

    Two-course writing sequences are valuable because they extend the time that students spend focused on developing as writers and researchers, yet they cannot rely on a "more is better" argument to justify their ongoing implementation, especially when general education curricula are shrinking and one course often looks much the same as the…

  14. The amino acid sequence and reactive site of a single-headed trypsin inhibitor from wheat endosperm.

    PubMed

    Poerio, E; Caporale, C; Carrano, L; Caruso, C; Vacca, F; Buonocore, V

    1994-02-01

    The sequence of a trypsin inhibitor, isolated from wheat endosperm, is reported. The primary structure was obtained by automatic sequence analysis of the S-alkylated protein and of purified peptides derived from chemical cleavage by cyanogen bromide and digestion with Staphylococcus aureus V8 protease. This protein, named wheat trypsin inhibitor (WTI), which is comprised of a total of 71 amino acid residues, has 12 cysteines, all involved in disulfide bridges. The primary site of interaction (reactive site) with bovine trypsin has been identified as the dipeptide arginyl-methionyl at positions 19 and 20. WTI has a high degree of sequence identity with a number of serine proteinase inhibitors isolated from both cereal and leguminous plants. On the basis of the findings presented, this protein has been classified as a single-headed trypsin inhibitor of Bowman-Birk type.

  15. Human liver apolipoprotein B-100 cDNA: complete nucleic acid and derived amino acid sequence.

    PubMed Central

    Law, S W; Grant, S M; Higuchi, K; Hospattankar, A; Lackner, K; Lee, N; Brewer, H B

    1986-01-01

    Human apolipoprotein B-100 (apoB-100), the ligand on low density lipoproteins that interacts with the low density lipoprotein receptor and initiates receptor-mediated endocytosis and low density lipoprotein catabolism, has been cloned, and the complete nucleic acid and derived amino acid sequences have been determined. ApoB-100 cDNAs were isolated from normal human liver cDNA libraries utilizing immunoscreening as well as filter hybridization with radiolabeled apoB-100 oligodeoxynucleotides. The apoB-100 mRNA is 14.1 kilobases long encoding a mature apoB-100 protein of 4536 amino acids with a calculated amino acid molecular weight of 512,723. ApoB-100 contains 20 potential glycosylation sites, and 12 of a total of 25 cysteine residues are located in the amino-terminal region of the apolipoprotein providing a potential globular structure of the amino terminus of the protein. ApoB-100 contains relatively few regions of amphipathic helices, but compared to other human apolipoproteins it is enriched in beta-structure. The delineation of the entire human apoB-100 sequence will now permit a detailed analysis of the conformation of the protein, the low density lipoprotein receptor binding domain(s), and the structural relationship between apoB-100 and apoB-48 and will provide the basis for the study of genetic defects in apoB-100 in patients with dyslipoproteinemias. PMID:3464946

  16. Computer selection of oligonucleotide probes from amino acid sequences for use in gene library screening.

    PubMed

    Yang, J H; Ye, J H; Wallace, D C

    1984-01-11

    We present a computer program, FINPROBE, which utilizes known amino acid sequence data to deduce minimum redundancy oligonucleotide probes for use in screening cDNA or genomic libraries or in primer extension. The user enters the amino acid sequence of interest, the desired probe length, the number of probes sought, and the constraints on oligonucleotide synthesis. The computer generates a table of possible probes listed in increasing order of redundancy and provides the location of each probe in the protein and mRNA coding sequence. Activation of a next function provides the amino acid and mRNA sequences of each probe of interest as well as the complementary sequence and the minimum dissociation temperature of the probe. A final routine prints out the amino acid sequence of the protein in parallel with the mRNA sequence listing all possible codons for each amino acid.

  17. Noggin and retinoic acid transform the identity of avian facial prominences.

    PubMed

    Lee, S H; Fu, K K; Hui, J N; Richman, J M

    The signals that determine body part identity in vertebrate embryos are largely unknown, with some exceptions such as those for teeth and digits. The vertebrate face is derived from small buds of tissue, facial prominences, that surround the embryonic oral cavity. In chicken embryos, the skeleton of the upper beak is derived from the frontonasal mass and maxillary prominences. Here we show that bone morphogenetic proteins (Bmps) and the vitamin A derivative, retinoic acid (RA), are used to specify the identity of the frontonasal mass and maxillary prominences. Implanting two beads adjacent to the stage-15 presumptive maxillary field, one soaked in the Bmp antagonist Noggin and one soaked in RA, induces a duplicate set of frontonasal mass skeletal elements in place of maxillary bones. We also show that the duplicated beak is due to transformation of the maxillary prominence into a second frontonasal mass and not due to ectopic migration of cells or splitting of the normal frontonasal mass. Thus the levels of Bmp and RA determine whether specific regions of the face form maxillary or frontonasal mass derivatives.

  18. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...

  19. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...

  20. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...

  1. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...

  2. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...

  3. Identity between rat htf and human xbp-1 genes: determination of gene structure, target sequence, and transcription promotion function for HTF.

    PubMed

    Kokura, K; Kishimoto, T; Tamura, T

    2000-01-11

    Hepatocarcinogenesis-related transcription factor (HTF) was originally isolated from rats in which the expression was enhanced in hepatocellular carcinomas. Rat HTF (rHTF) is structurally similar to human X-box-binding protein-1 (hXBP-1), and both factors are unique in respective genomes. A previous study showed that hXBP-1 mRNA is detectable ubiquitously but is enriched in the human liver as rHTF. In this study, we demonstrated the analogous exon-intron organization and significant sequence homology for rhtf and hxbp-1 genes. Alignment of amino acid sequences of rHTF and hXBP-1 revealed that all the characteristic motifs in rHTF were conserved in hXBP-1. Moreover, Southern blotting patterns provided with the rHTF and hXBP-1 probes were basically the same. These two genes were thus thought to belong to the same evolutional lineage. We determined the consensus binding sequence (CRCGTCA) for rHTF by CASTing, and it was found to be nearly the same as that for hXBP-1. Transactivation ability of rHTF was also demonstrated. The rhtf gene generates two types of mRNAs (2.0 kb and 2.5 kb), both of which encode identical rHTF protein. These transcripts had distinct transcription initiation sites. The 2.0 kb promoter, that was revealed by the transient luciferase assay, contained GC-box and CAAT-box. Sequences around the transcription initiation site for the 2.0 kb transcript were similar in rhtf and hxbp-1 genes. Our observations suggest that HTF is a rat homolog of hXBP-1.

  4. Average nucleotide identity of genome sequences supports the description of Rhizobium lentis sp. nov., Rhizobium bangladeshense sp. nov. and Rhizobium binae sp. nov. from lentil (Lens culinaris) nodules.

    PubMed

    Rashid, M Harun-or; Young, J Peter W; Everall, Isobel; Clercx, Pia; Willems, Anne; Santhosh Braun, Markus; Wink, Michael

    2015-09-01

    Rhizobial strains isolated from effective root nodules of field-grown lentil (Lens culinaris) from different parts of Bangladesh were previously analysed using sequences of the 16S rRNA gene, three housekeeping genes (recA, atpD and glnII) and three nodulation genes (nodA, nodC and nodD), DNA fingerprinting and phenotypic characterization. Analysis of housekeeping gene sequences and DNA fingerprints indicated that the strains belonged to three novel clades in the genus Rhizobium. In present study, a representative strain from each clade was further characterized by determination of cellular fatty acid compositions, carbon substrate utilization patterns and DNA-DNA hybridization and average nucleotide identity (ANI) analyses from whole-genome sequences. DNA-DNA hybridization showed 50-62% relatedness to their closest relatives (the type strains of Rhizobium etli and Rhizobium phaseoli) and 50-60% relatedness to each other. These results were further supported by ANI values, based on genome sequencing, which were 87-92% with their close relatives and 88-89% with each other. On the basis of these results, three novel species, Rhizobium lentis sp. nov. (type strain BLR27(T) = LMG 28441(T) = DSM 29286(T)), Rhizobium bangladeshense sp. nov. (type strain BLR175(T) = LMG 28442(T) = DSM 29287(T)) and Rhizobium binae sp. nov. (type strain BLR195(T) = LMG 28443(T) = DSM 29288(T)), are proposed. These species share common nodulation genes (nodA, nodC and nodD) that are similar to those of the symbiovar viciae.

  5. Triose phosphate isomerase from the coelacanth. An approach to the rapid determination of an amino acid sequence with small amounts of material.

    PubMed

    Kolb, E; Harris, J I; Bridgen, J

    1974-02-01

    The preparation and purification of cyanogen bromide fragments from [(14)C]carboxymethylated coelacanth triose phosphate isomerase is presented. The automated sequencing of these fragments, the lysine-blocked tryptic peptides derived from them, and also of the intact protein, is described. Combination with results from manual sequence analysis has given the 247-residue amino acid sequence of coelacanth triose phosphate isomerase in 4 months, by using 100mg of enzyme. (Two small adjacent peptides were placed by homology with the rabbit enzyme.) Comparison of this sequence with that of the rabbit muscle enzyme shows that 207 (84%) of the residues are identical. This slow rate of evolutionary change (corresponding to two amino acid substitutions per 100 residues per 100 million years) is similar to that found for glyceraldehyde 3-phosphate dehydrogenase. The reliability of sequence information obtained by automated methods is discussed.

  6. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Foley, B.; Korber, B.; Mellors, J.W.; Jeang, K.T.; Wain-Hobson, S.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nuclear Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  7. The amino acid sequence of Ole e I, the major allergen from olive tree (Olea europaea) pollen.

    PubMed

    Villalba, M; Batanero, E; López-Otín, C; Sánchez, L M; Monsalve, R I; González de la Peña, M A; Lahoz, C; Rodríguez, R

    1993-09-15

    The complete primary structure of the major allergen from Olea europaea (olive tree) pollen, Ole e I (IUIS nomenclature), has been determined. The amino acid sequence was established by automated Edman degradation of the reduced and alkylated molecule as well as of selected fragments obtained by proteolytic digestions. Ole e I contains a single polypeptide chain of 145 amino acid residues with a calculated molecular mass of 16331 Da. No free sulfhydryl groups have been detected in the native protein. The molecule contains a putative glycosylation site. A high degree of microheterogeneity has been observed, mainly centered in the first 33% of the molecule. Comparison of Ole e I sequence with protein sequence databases showed no similarity with other known allergens. However, it has a 36% and 38% sequence identity with the putative polypeptide structures, deduced, respectively, from nucleotide sequences of genes isolated from tomato anthers and corn pollen, which have been suggested to be involved in the growing of the pollen tube. Therefore, the olive tree allergen may be a constitutive protein of the pollen involved in reproductive functions.

  8. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza

    PubMed Central

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  9. Prevalence and Sequence-Based Identity of Rumen Fluke in Cattle and Deer in New Caledonia.

    PubMed

    Cauquil, Laura; Hüe, Thomas; Hurlin, Jean-Claude; Mitchell, Gillian; Searle, Kate; Skuce, Philip; Zadoks, Ruth

    2016-01-01

    An abattoir survey was performed in the French Melanesian archipelago of New Caledonia to determine the prevalence of paramphistomes in cattle and deer and to generate material for molecular typing at species and subspecies level. Prevalence in adult cattle was high at animal level (70% of 387 adult cattle) and batch level (81%). Prevalence was lower in calves at both levels (33% of 484 calves, 51% at batch level). Animals from 2 of 7 deer farms were positive for rumen fluke, with animal-level prevalence of 41.4% (29/70) and 47.1% (33/70), respectively. Using ITS-2 sequencing, 3 species of paramphistomes were identified, i.e. Calicophoron calicophorum, Fischoederius elongatus and Orthocoelium streptocoelium. All three species were detected in cattle as well as deer, suggesting the possibility of rumen fluke transmission between the two host species. Based on heterogeneity in ITS-2 sequences, the C. calicophorum population comprises two clades, both of which occur in cattle as well as deer. The results suggest two distinct routes of rumen fluke introduction into this area. This approach has wider applicability for investigations of the origin of rumen fluke infections and for the possibility of parasite transmission at the livestock-wildlife interface.

  10. Prevalence and Sequence-Based Identity of Rumen Fluke in Cattle and Deer in New Caledonia

    PubMed Central

    Cauquil, Laura; Hüe, Thomas; Hurlin, Jean-Claude; Mitchell, Gillian; Searle, Kate; Skuce, Philip; Zadoks, Ruth

    2016-01-01

    An abattoir survey was performed in the French Melanesian archipelago of New Caledonia to determine the prevalence of paramphistomes in cattle and deer and to generate material for molecular typing at species and subspecies level. Prevalence in adult cattle was high at animal level (70% of 387 adult cattle) and batch level (81%). Prevalence was lower in calves at both levels (33% of 484 calves, 51% at batch level). Animals from 2 of 7 deer farms were positive for rumen fluke, with animal-level prevalence of 41.4% (29/70) and 47.1% (33/70), respectively. Using ITS-2 sequencing, 3 species of paramphistomes were identified, i.e. Calicophoron calicophorum, Fischoederius elongatus and Orthocoelium streptocoelium. All three species were detected in cattle as well as deer, suggesting the possibility of rumen fluke transmission between the two host species. Based on heterogeneity in ITS-2 sequences, the C. calicophorum population comprises two clades, both of which occur in cattle as well as deer. The results suggest two distinct routes of rumen fluke introduction into this area. This approach has wider applicability for investigations of the origin of rumen fluke infections and for the possibility of parasite transmission at the livestock-wildlife interface. PMID:27043709

  11. Human retroviruses and aids, 1992. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Korber, B.; Berzofsky, J.A.; Pavlakis, G.N.; Smith, R.F.

    1992-10-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) HIV and SIV Nucleotide Sequences; (H) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions below of the parts of the compendium, the user should read the individual introductions for each part.

  12. Different thresholds of bond percolation in scale-free networks with identical degree sequence.

    PubMed

    Zhang, Zhongzhi; Zhou, Shuigeng; Zou, Tao; Chen, Lichao; Guan, Jihong

    2009-03-01

    Generally, the threshold of percolation in complex networks depends on the underlying structural characterization. However, what topological property plays a predominant role is still unknown, despite the speculation of some authors that degree distribution is a key ingredient. The purpose of this paper is to show that power-law degree distribution itself is not sufficient to characterize the threshold of bond percolation in scale-free networks. To achieve this goal, we first propose a family of scale-free networks with the same degree sequence and obtain by analytical or numerical means several topological features of the networks. Then, by making use of the renormalization-group technique we determine the threshold of bond percolation in our networks. We find an existence of nonzero thresholds and demonstrate that these thresholds can be quite different, which implies that power-law degree distribution does not suffice to characterize the percolation threshold in scale-free networks.

  13. Cloning, sequence analysis, and expression in Escherichia coli of the gene encoding an alpha-amino acid ester hydrolase from Acetobacter turbidans.

    PubMed

    Polderman-Tijmes, Jolanda J; Jekel, Peter A; de Vries, Erik J; van Merode, Annet E J; Floris, René; van der Laan, Jan-Metske; Sonke, Theo; Janssen, Dick B

    2002-01-01

    The alpha-amino acid ester hydrolase from Acetobacter turbidans ATCC 9325 is capable of hydrolyzing and synthesizing beta-lactam antibiotics, such as cephalexin and ampicillin. N-terminal amino acid sequencing of the purified alpha-amino acid ester hydrolase allowed cloning and genetic characterization of the corresponding gene from an A. turbidans genomic library. The gene, designated aehA, encodes a polypeptide with a molecular weight of 72,000. Comparison of the determined N-terminal sequence and the deduced amino acid sequence indicated the presence of an N-terminal leader sequence of 40 amino acids. The aehA gene was subcloned in the pET9 expression plasmid and expressed in Escherichia coli. The recombinant protein was purified and found to be dimeric with subunits of 70 kDa. A sequence similarity search revealed 26% identity with a glutaryl 7-ACA acylase precursor from Bacillus laterosporus, but no homology was found with other known penicillin or cephalosporin acylases. There was some similarity to serine proteases, including the conservation of the active site motif, GXSYXG. Together with database searches, this suggested that the alpha-amino acid ester hydrolase is a beta-lactam antibiotic acylase that belongs to a class of hydrolases that is different from the Ntn hydrolase superfamily to which the well-characterized penicillin acylase from E. coli belongs. The alpha-amino acid ester hydrolase of A. turbidans represents a subclass of this new class of beta-lactam antibiotic acylases.

  14. Completion of the amino acid sequence of the alpha 1 chain from type I calf skin collagen. Amino acid sequence of alpha 1(I)B8.

    PubMed Central

    Glanville, R W; Breitkreutz, D; Meitinger, M; Fietzek, P P

    1983-01-01

    The complete amino acid sequence of the 279-residue CNBr peptide CB8 from the alpha 1 chain of type I calf skin collagen is presented. It was determined by sequencing overlapping fragments of CB8 produced by Staphylococcus aureus V8 proteinase, trypsin, Endoproteinase Arg-C and hydroxylamine. Tryptic cleavages were also made specific for lysine by blocking arginine residues with cyclohexane-1,2-dione. This completes the amino acid sequence analysis of the 1054-residues-long alpha (I) chain of calf skin collagen. PMID:6354180

  15. Gastropod arginine kinases from Cellana grata and Aplysia kurodai. Isolation and cDNA-derived amino acid sequences.

    PubMed

    Suzuki, T; Inoue, N; Higashi, T; Mizobuchi, R; Sugimura, N; Yokouchi, K; Furukohri, T

    2000-12-01

    Arginine kinase (AK) was isolated from the radular muscle of the gastropod molluscs Cellana grata (subclass Prosobranchia) and Aplysia kurodai (subclass Opisthobranchia), respectively, by ammonium sulfate fractionation, Sephadex G-75 gel filtration and DEAE-ion exchange chromatography. The denatured relative molecular mass values were estimated to be 40 kDa by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The isolated enzyme from Aplysia gave a Km value of 0.6 mM for arginine and a Vmax value of 13 micromole Pi min(-1) mg protein(-1) for the forward reaction. These values are comparable to other molluscan AKs. The cDNAs encoding Cellana and Aplysia AKs were amplified by polymerase chain reaction, and the nucleotide sequences of 1,608 and 1,239 bp, respectively, were determined. The open reading frame for Cellana AK is 1044 nucleotides in length and encodes a protein with 347 amino acid residues, and that for A. kurodai is 1077 nucleotides and 354 residues. The cDNA-derived amino acid sequences were validated by chemical sequencing of internal lysyl endopeptidase peptides. The amino acid sequences of Cellana and Aplysia AKs showed the highest percent identity (66-73%) with those of the abalone Nordotis and turbanshell Battilus belonging to the same class Gastropoda. These AK sequences still have a strong homology (63-71%) with that of the chiton Liolophura (class Polyplacophora), which is believed to be one of the most primitive molluscs. On the other hand, these AK sequences are less homologous (55-57%) with that of the clam Pseudocardium (class Bivalvia), suggesting that the biological position of the class Polyplacophora should be reconsidered.

  16. An Integrated Sequence-Structure Database incorporating matching mRNA sequence, amino acid sequence and protein three-dimensional structure data.

    PubMed Central

    Adzhubei, I A; Adzhubei, A A; Neidle, S

    1998-01-01

    We have constructed a non-homologous database, termed the Integrated Sequence-Structure Database (ISSD) which comprises the coding sequences of genes, amino acid sequences of the corresponding proteins, their secondary structure and straight phi,psi angles assignments, and polypeptide backbone coordinates. Each protein entry in the database holds the alignment of nucleotide sequence, amino acid sequence and the PDB three-dimensional structure data. The nucleotide and amino acid sequences for each entry are selected on the basis of exact matches of the source organism and cell environment. The current version 1.0 of ISSD is available on the WWW at http://www.protein.bio.msu.su/issd/ and includes 107 non-homologous mammalian proteins, of which 80 are human proteins. The database has been used by us for the analysis of synonymous codon usage patterns in mRNA sequences showing their correlation with the three-dimensional structure features in the encoded proteins. Possible ISSD applications include optimisation of protein expression, improvement of the protein structure prediction accuracy, and analysis of evolutionary aspects of the nucleotide sequence-protein structure relationship. PMID:9399866

  17. A naturally occurring repeat protein with high internal sequence identity defines a new class of TPR-like proteins

    PubMed Central

    Marold, Jacob D.; Kavran, Jennifer M.; Bowman, Gregory D.; Barrick, Doug

    2016-01-01

    SUMMARY Linear repeat proteins often have high structural similarity and low (~25%) pairwise sequence identities (PSI) among modules. We identified a unique P. anserina (Pa) sequence with tetratricopeptide repeat (TPR) homology, which contains longer (42 residue) repeats (42PRs) with an average PSI >91%. We determined the crystal structure of five tandem Pa 42PRs to 1.6Å, and examined the stability and solution properties of constructs containing three to six Pa 42PRs. Compared to 34-residue TPRs (34PRs), Pa 42PRs have a one-turn extension of each helix, and bury more surface area. Unfolding transitions shift to higher denaturant concentration and become sharper as repeats are added. Fitted Ising models show Pa 42PRs to be more cooperative than consensus 34PRs, with increased magnitudes of intrinsic and interfacial free energies. These results demonstrate the tolerance of the TPR motif to length variation, and provide a basis to understand the effects of helix length on intrinsic/interfacial stability. PMID:26439765

  18. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    PubMed

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor.

  19. Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2000-01-01

    A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.

  20. Molecular cloning, nucleotide sequence, and abscisic acid induction of a suberization-associated highly anionic peroxidase.

    PubMed

    Roberts, E; Kolattukudy, P E

    1989-06-01

    A highly anionic peroxidase induced in suberizing cells was suggested to be the key enzyme involved in polymerization of phenolic monomers to generate the aromatic matrix of suberin. The enzyme encoded by a potato cDNA was found to be highly homologous to the anionic peroxidase induced in suberizing tomato fruit. A tomato genomic library was screened using the potato anionic peroxidase cDNA and one genomic clone was isolated that contained two tandemly oriented anionic peroxidase genes. These genes were sequenced and were 96% and 87% identical to the mRNA for potato anionic peroxidase. Both genes consist of three exons with the relative positions of their two introns being conserved between the two genes. Primer extension analysis showed that only one of the genes is expressed in the periderm of 3 day wound-healed tomato fruits. Southern blot analyses suggested that there are two copies each of the two highly homologous genes per haploid genome in both potato and tomato. Abscisic acid (ABA) induced the accumulation of the anionic peroxidase transcripts in potato and tomato callus tissues. Northern blots showed that peroxidase mRNA was detectable at 2 days and was maximal at 8 days after transfer of potato callus to solid agar media containing 10(-4) M ABA. The transcripts induced by ABA in both potato and tomato callus were identical in size to those induced in wound-healing potato tuber and tomato fruit. The anionic peroxidase peptide was detected in extracts of potato callus grown on the ABA-containing media by western blot analysis. The results support the suggestion that stimulation of suberization by ABA involves the induction of the highly anionic peroxidase.

  1. Complete amino acid sequence of luffin-a, a ribosome-inactivating protein from the seeds of sponge gourd (Luffa cylindrica).

    PubMed

    Islam, M R; Nishida, H; Funatsu, G

    1990-11-01

    The complete amino acid sequence of luffin-a has been determined. Twenty-two peptides were isolated from the tryptic digest of luffin-a and sequenced employing the DABITC/PITC double coupling method. Overlaping of these peptides was achieved by analyzing the chymotryptic peptides or CNBr-fragments of luffin-a and their S. aureus V8 protease peptides. Luffin-a consists of 248 amino acid residues and its relative molecular mass is calculated to be 27,021 Da, excluding the attached sugar chains reasoned to be present at each Asn residue of positions 28, 33, 77, 84, 206, and 227. A comparison with the sequence of ricin A-chain showed 33% sequence identity indicating that these proteins are homologous.

  2. Amino acid sequence and disulfide bridges of affinity purified Kunitz-type chymotrypsin inhibitor from winged bean seed (Psophocarpus tetragonolobus (L.) DC).

    PubMed

    Kortt, A A; Burns, J E; Strike, P M

    1990-11-01

    The primary sequence of the affinity purified chymotrypsin inhibitor, WBCI, isolated from the albumin fraction of Psophocarpus tetragonolobus (L.) DC cv. UPS-122 seed was determined. The inhibitor consisted of a single polypeptide chain of 183 amino acids (Mr 20285) and the four half-cystine residues in the molecule formed two intramolecular disulfide bridges equivalent to those in other Kunitz-type seed inhibitors. The sequence of this chymotrypsin inhibitor was identical to that of chymotrypsin inhibitor-3 from cultivar UPS-31 and it showed about 50% sequence similarity to the winged bean acidic (WBTI-2, pI 5.1) and basic (WBTI-1, pI 8.9) trypsin inhibitors. Sequence similarities to other Kunitz-type seed inhibitors are discussed.

  3. Amino acid sequence and some properties of phytolacain G, a cysteine protease from growing fruit of pokeweed, Phytolacca americana.

    PubMed

    Uchikoba, T; Arima, K; Yonezawa, H; Shimada, M; Kaneda, M

    2000-10-18

    A protease, phytolacain G, has been found to appear on CM-Sepharose ion-exchange chromatography of greenish small-size fruits of pokeweed, Phytolacca americana L, from ca. 2 weeks after flowering, and increases during fruit enlargement. Reddish ripe fruit of the pokeweed contained both phytolacain G and R. The molecular mass of phytolacain G was estimated to be 25.5 kDa by SDS-PAGE. Its amino acid sequence was reconstructed by automated sequence analysis of the peptides obtained after cleavage with Achromobacter protease I, chymotrypsin, and cyanogen bromide. The enzyme is composed of 216 amino acid residues, of which it shares 152 identical amino acid residues (70%) with phytolacain R, 126 (58%) with melain G, 108 (50%) with papain, 106 (49%) with actinidain, and 96 (44%) with stem bromelain. The amino acid residues forming the substrate binding S(2) pocket of papain, Tyr67, Pro68, Trp69, Val133, and Phe207, were predicted to be replaced by Trp, Met, His, Ala, and Ser in phytolacain G, respectively. As a consequence of these substitutions, the S(2) pocket is expected to be less hydrophobic in phytolacain G than in papain.

  4. N-terminal amino acid sequences of D-serine deaminases of wild-type and operator-constitutive strains of Escherichia coli K-12.

    PubMed Central

    Heincz, M C; McFall, E

    1975-01-01

    The N-terminal amino acid sequences of the D-serine deaminases from strains of Escherichia coli K-12 that harbor wild-type and high-level constitutive catabolite-insensitive operator-initiator regions are identical: Met-Ser-GluNH2-Ser-Gly-Arg-His-Cys. This result indicates that the operator-initiator region is probably distinct from the D-serine deaminase structural gene. Images PMID:1099073

  5. ABRF ESRG 2005 Study: Identification of Seven Modified Amino Acids by Edman Sequencing

    PubMed Central

    Brune, D.; Denslow, N.D.; Kobayashi, R.; Lane, W.S.; Leone, J.W.; Madden, B.J.; Neveu, J. M.; Pohl, J.

    2006-01-01

    Identification of modified amino acids can be a challenging part for Edman degradation sequence analysis, largely because they are not included among the commonly used phenylthiohydantion amino acid standards. Yet many can have unique retention times and can be assigned by an experienced researcher or through the use of a guide showing their typical chromatography characteristics. The Edman Sequencing Research Group (ESRG) 2005 study is a continuation of the 2004 study, in which the participating laboratories were provided a synthetic peptide and asked to identify the modified amino acids present in the sequence. The study sample provided an opportunity to sequence a peptide containing a variety of modified amino acids and note their retention times relative to the common amino acids. It also allowed the ESRG to compile the chromatographic properties and intensities from multiple instruments and tabulate an average elution position for these modified amino acids on commonly used instruments. Participating laboratories were given 2000 pmoles of a synthetic peptide, 18 amino acids long, containing the following modified amino acids: dimethyl- and trimethyl-lysine, 3-methyl-histidine, N-carbamyl-lysine, cystine, N-methyl-alanine, and isoaspartic acid. The modified amino acids were interspersed with standard amino acids to help in the assessment of initial and repetitive yields. In addition to filling in an assignment sheet, which included retention times and peak areas, participants were asked to provide specific details about the parameters used for the sequencing run. References for some of the modified amino acid elution characteristics were provided and the participants had the option of viewing a list of the modified amino acids present in the peptide at the ESRG Web site. The ABRF ESRG 2005 sample is the seventeenth in a series of studies designed to aid laboratories in evaluating their abilities to obtain and interpret amino acid sequence data. PMID:17122064

  6. Field-Based Stable Isotope Probing Reveals the Identities of Benzoic Acid-Metabolizing Microorganisms and Their In Situ Growth in Agricultural Soil▿

    PubMed Central

    Pumphrey, Graham M.; Madsen, Eugene L.

    2008-01-01

    We used a combination of stable isotope probing (SIP), gas chromatography-mass spectrometry-based respiration, isolation/cultivation, and quantitative PCR procedures to discover the identity and in situ growth of soil microorganisms that metabolize benzoic acid. We added [13C]benzoic acid or [12C]benzoic acid (100 μg) once, four times, or five times at 2-day intervals to agricultural field plots. After monitoring 13CO2 evolution from the benzoic acid-dosed soil, field soils were harvested and used for nucleic acid extraction and for cultivation of benzoate-degrading bacteria. Exposure of soil to benzoate increased the number of culturable benzoate degraders compared to unamended soil, and exposure to benzoate shifted the dominant culturable benzoate degraders from Pseudomonas species to Burkholderia species. Isopycnic separation of heavy [13C]DNA from the unlabeled fraction allowed terminal restriction fragment length polymorphism (T-RFLP) analyses to confirm that distinct 16S rRNA genes were localized in the heavy fraction. Phylogenetic analysis of sequenced 16S rRNA genes revealed a predominance (15 of 58 clones) of Burkholderia species in the heavy fraction. Burkholderia sp. strain EBA09 shared 99.5% 16S rRNA sequence similarity with a group of clones representing the dominant RFLP pattern, and the T-RFLP fragment for strain EBA09 and a clone from that cluster matched the fragment enriched in the [13C]DNA fraction. Growth of the population represented by EBA09 during the field-dosing experiment was demonstrated by using most-probable-number-PCR and primers targeting EBA09 and the closely related species Burkholderia hospita. Thus, the target population identified by SIP not only actively metabolized benzoic acid but reproduced in the field upon the addition of the substrate. PMID:18469130

  7. Feature selection from short amino acid sequences in phosphorylation prediction problem

    NASA Astrophysics Data System (ADS)

    Wecławski, Jakub; Jankowski, Stanisław; Szymański, Zbigniew

    The paper describes solution of feature selection from amino acid sequences in phosphorylation prediction problem. We show that even for short sequences the variable selection leads to better classification performance. Moreover, the final simplicity of models allows for better data understanding and can be used by an expert for further analysis. The feature selection process is divided into two parts: i) the classification tree is used for finding the most relevant positions in amino acid sequences, ii) then the contrast pattern kernel is applied for pattern selection. This work summarizes the research made on classification of short amino acid sequences. The results of the research allowed us to propose a general scheme of amino acid sequence analysis.

  8. Nucleic-acid characterization of the identity and activity of subsurface microorganisms

    NASA Astrophysics Data System (ADS)

    Madsen, E. L.

    Nucleic-acid approaches to characterizing naturally occurring microorganisms in their habitats have risen to prominence during the last decade. Extraction of deoxyribonucleic-acid (DNA) and ribonucleic-acid (RNA) biomarkers directly from environmental samples provides a new means of gathering information in microbial ecology. This review article defines: (1) the subsurface habitat; (2) what nucleic-acid procedures are; and (3) the types of information nucleic-acid procedures can and cannot reveal. Recent literature examining microbial nucleic acids in the terrestrial subsurface is tabulated and reviewed. The majority of effort to date has focused upon insights into the identity and phylogeny of subsurface microorganisms afforded by analysis of their 16S rRNA genes. Given the power of nucleic-acid-based procedures and their limited application to subsurface habitats to date, many future opportunities await exploration. Au cours des derniers dix ans, les approches basées sur les acides nucléiques sont apparues et devenues essentielles pour caractériser dans leurs habitats les microorganismes existant à l'état naturel. L'extraction directe de l'ADN et de l'ARN, qui sont des biomarqueurs, d'échantillons environnementaux a fourni un nouveau moyen d'obtenir des informations sur l'écologie microbienne. Cet article synthétique définit 1) l'habitat souterrain, 2) ce que sont les procédures basées sur les acides nucléiques, 3) quel type d'informations ces procéedures peuvent et ne peuvent pas révéler. Les travaux récemment publiés concernatn les acides nucléiques microbiens dans le milieu souterrain terrestre sont catalogués et passés en revue. La majorité des efforts pour obtenir es données s'est concentrée sur l'identité et la phylogénie des microorganismes souterrains fournies par l'analyse de leurs gènes 16S rRNA. Étant donné la puissance des procédures basées sur les acides nucléiques et leur application limitée aux habitats souterrains

  9. Identification of Nucleic Acid High Affinity Binding Sequences of Proteins by SELEX.

    PubMed

    Bouvet, Philippe

    2015-01-01

    A technique is described for the identification of nucleic acid sequences bound with high affinity by proteins or by other molecules suitable for a partitioning assay. Here, a histidine-tagged protein is allowed to interact with a pool of nucleic acids and the protein-nucleic acid complexes formed are retained on a Ni-NTA matrix. Nucleic acids with a low level of recognition by the protein are washed away. The pool of recovered nucleic acids is amplified by the polymerase chain reaction and is submitted to further rounds of selection. Each round of selection increases the proportion of sequences that are avidly bound by the protein of interest. The cloning and sequencing of these sequences finally completes their identification.

  10. Identification of nucleic acid high-affinity binding sequences of proteins by SELEX.

    PubMed

    Bouvet, Philippe

    2009-01-01

    A technique is described for the identification of nucleic acid sequences bound with high affinity by proteins or by other molecules suitable for a partitioning assay. Here, a histidine-tagged protein is allowed to interact with a pool of nucleic acids and the protein-nucleic acid complexes formed are retained on a Ni-NTA matrix. Nucleic acids with a low level of recognition by the protein are washed away. The pool of recovered nucleic acids is amplified by the polymerase chain reaction and is submitted to further rounds of selection. Each round of selection increases the proportion of sequences that are avidly bound by the protein of interest. The cloning and sequencing of these sequences finally completes their identification.

  11. Trichomonas vaginalis acidic phospholipase A2: isolation and partial amino acid sequence.

    PubMed

    Escobedo-Guajardo, Brenda L; González-Salazar, Francisco; Palacios-Corona, Rebeca; Torres de la Cruz, Víctor M; Morales-Vallarta, Mario; Mata-Cárdenas, Benito D; Garza-González, Jesús N; Rivera-Silva, Gerardo; Vargas-Villarreal, Javier

    2013-12-01

    Sexually transmitted diseases are a major cause of acute disease worldwide, and trichomoniasis is the most common and curable disease, generating more than 170 million cases annually worldwide. Trichomonas vaginalis is the causal agent of trichomoniasis and has the ability to destroy in vitro cell monolayers of the vaginal mucosa, where the phospholipases A2 (PLA2) have been reported as potential virulence factors. These enzymes have been partially characterized from the subcellular fraction S30 of pathogenic T. vaginalis strains. The main objective of this study was to purify a phospholipase A2 from T. vaginalis, make a partial characterization, obtain a partial amino acid sequence, and determine its enzymatic participation as hemolytic factor causing lysis of erythrocytes. Trichomonas S30, RF30 and UFF30 sub-fractions from GT-15 strain have the capacity to hydrolyze [2-(14)C-PA]-PC at pH 6.0. Proteins from the UFF30 sub-fraction were separated by affinity chromatography into two eluted fractions with detectable PLA A2 activity. The EDTA-eluted fraction was analyzed by HPLC using on-line HPLC-tandem mass spectrometry and two protein peaks were observed at 8.2 and 13 kDa. Peptide sequences were identified from the proteins present in the eluted EDTA UFF30 fraction; bioinformatic analysis using Protein Link Global Server charged with T. vaginalis protein database suggests that eluted peptides correspond a putative ubiquitin protein in the 8.2 kDa fraction and a phospholipase preserved in the 13 kDa fraction. The EDTA-eluted fraction hydrolyzed [2-(14)C-PA]-PC lyses erythrocytes from Sprague-Dawley in a time and dose-dependent manner. The acidic hemolytic activity decreased by 84% with the addition of 100 μM of Rosenthal's inhibitor.

  12. Nucleotide sequence of a lysine transfer ribonucleic Acid from bakers' yeast.

    PubMed

    Madison, J T; Boguslawski, S J; Teetor, G H

    1972-05-12

    The nucleotide sequence of one of the two major lysine transfer RNA's from bakers' yeast has been determined. Its structure is compared to that of a lysine tRNA from a haploid yeast. A total of 21 nucleotides differ in the two molecules. Only the T-psi-C-G (thymidine-pseudouridine-cytidine-guanosine) loop and its supporting stem are identical.

  13. Quantitative analysis of polycomb response elements (PREs) at identical genomic locations distinguishes contributions of PRE sequence and genomic environment

    PubMed Central

    2011-01-01

    Background Polycomb/Trithorax response elements (PREs) are cis-regulatory elements essential for the regulation of several hundred developmentally important genes. However, the precise sequence requirements for PRE function are not fully understood, and it is also unclear whether these elements all function in a similar manner. Drosophila PRE reporter assays typically rely on random integration by P-element insertion, but PREs are extremely sensitive to genomic position. Results We adapted the ΦC31 site-specific integration tool to enable systematic quantitative comparison of PREs and sequence variants at identical genomic locations. In this adaptation, a miniwhite (mw) reporter in combination with eye-pigment analysis gives a quantitative readout of PRE function. We compared the Hox PRE Frontabdominal-7 (Fab-7) with a PRE from the vestigial (vg) gene at four landing sites. The analysis revealed that the Fab-7 and vg PREs have fundamentally different properties, both in terms of their interaction with the genomic environment at each site and their inherent silencing abilities. Furthermore, we used the ΦC31 tool to examine the effect of deletions and mutations in the vg PRE, identifying a 106 bp region containing a previously predicted motif (GTGT) that is essential for silencing. Conclusions This analysis showed that different PREs have quantifiably different properties, and that changes in as few as four base pairs have profound effects on PRE function, thus illustrating the power and sensitivity of ΦC31 site-specific integration as a tool for the rapid and quantitative dissection of elements of PRE design. PMID:21410956

  14. Molecular characterization of two Pepino mosaic virus variants from imported tomato seed reveals high levels of sequence identity between Chilean and US isolates.

    PubMed

    Ling, Kai-Shu

    2007-01-01

    Pepino mosaic virus (PepMV), a member of the genus Potexvirus, was first described in South America on pepino (Solanum muricatum A.). Only in recent years, it was reported to infect greenhouse-grown tomatoes. Genome nucleotide sequences from several European isolates showed extensive sequence identity (>99%). Recent genome nucleotide sequences from two US isolates (US1 and US2) however showed much greater sequence divergence from that of the European PepMV isolates. My interest in characterizing virus isolates from South America was due to an active commercial tomato seed production in Chile. Through genome sequence comparison and phylogenetic analyses, we may be able to understand the source of virus infection and control this devastating disease from further spreading into new tomato growing regions of the world. Complete genome nucleotide sequences from two PepMV variants (designated as Ch1 and Ch2) were determined from a virus isolate obtained from a commercial tomato seed lot produced in Chile. Using RT-PCR-based genome walking strategy, complete genome sequences from these two variants were determined. Excluding poly (A) tails, the genomes of PepMV Ch1 and Ch2 were 6414 and 6412 nucleotides (nt), respectively. Pairwise comparisons of PepMV Ch1 and Ch2 genomes with other PepMV isolates showed that the highest nucleotide sequence identity was with two US isolates, 98.7% between PepMV Ch1 and US1, and 90.7% between Ch2 and US2. Similar to PepMV US1 and US2, the two Chilean variants were the most divergent from one another (78% nt identity). These two Chilean PepMV variants also shared only 78-86% nucleotide sequence identity to that of five European isolates. The high level of nucleotide sequence identity between Chilean and US isolates suggests a common origin. Phylogenetic analyses with various gene products generated three distinct sequence clusters (or strains): US1 and Ch1 in the first group, US2 and Ch2 in the second, and the European tomato isolates in

  15. Sequence-Specific Covalent Capture Coupled with High-Contrast Nanopore Detection of a Disease-Derived Nucleic Acid Sequence.

    PubMed

    Nejad, Maryam Imani; Shi, Ruicheng; Zhang, Xinyue; Gu, Li-Qun; Gates, Kent S

    2017-07-18

    Hybridization-based methods for the detection of nucleic acid sequences are important in research and medicine. Short probes provide sequence specificity, but do not always provide a durable signal. Sequence-specific covalent crosslink formation can anchor probes to target DNA and might also provide an additional layer of target selectivity. Here, we developed a new crosslinking reaction for the covalent capture of specific nucleic acid sequences. This process involved reaction of an abasic (Ap) site in a probe strand with an adenine residue in the target strand and was used for the detection of a disease-relevant T→A mutation at position 1799 of the human BRAF kinase gene sequence. Ap-containing probes were easily prepared and displayed excellent specificity for the mutant sequence under isothermal assay conditions. It was further shown that nanopore technology provides a high contrast-in essence, digital-signal that enables sensitive, single-molecule sensing of the cross-linked duplexes. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  16. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.

  17. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-03-24

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.

  18. PrDOS: prediction of disordered protein regions from amino acid sequence.

    PubMed

    Ishida, Takashi; Kinoshita, Kengo

    2007-07-01

    PrDOS is a server that predicts the disordered regions of a protein from its amino acid sequence (http://prdos.hgc.jp). The server accepts a single protein amino acid sequence, in either plain text or FASTA format. The prediction system is composed of two predictors: a predictor based on local amino acid sequence information and one based on template proteins. The server combines the results of the two predictors and returns a two-state prediction (order/disorder) and a disorder probability for each residue. The prediction results are sent by e-mail, and the server also provides a web-interface to check the results.

  19. The amino acid sequence of protein CM-3 from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J

    1985-01-01

    Protein CM-3 from Dendroaspis polylepis polylepis venom was purified by gel filtration and ion exchange chromatography. It comprises 65 amino acids including eight half-cystines. The complete amino acid sequence of protein CM-3 has been elucidated. The sequence (residues 1-50) resembles that of the N-terminal sequence of the subunits of a synergistic type protein and residues 51-65 that of the C-terminal sequence of an angusticeps type protein. Mixtures of protein CM-3 and angusticeps type proteins showed no apparent synergistic effect, in that their toxicity in combination was no greater than the sum of their individual toxicities.

  20. The amino acid sequences of the Fd fragments of two human γ heavy chains

    PubMed Central

    Press, E. M.; Hogg, N. M.

    1970-01-01

    The amino acid sequences of the Fd fragments of two human pathological immunoglobulins of the immunoglobulin G1 class are reported. Comparison of the two sequences shows that the heavy-chain variable regions are similar in length to those of the light chains. The existence of heavy chain variable region subgroups is also deduced, from a comparison of these two sequences with those of another γ 1 chain, Eu, a μ chain, Ou, and the partial sequence of a fourth γ 1 chain, Ste. Carbohydrate has been found to be linked to an aspartic acid residue in the variable region of one of the γ 1 chains, Cor. PMID:5449120

  1. Cloning, sequence, and developmental expression of a type 5, tartrate-resistant, acid phosphatase of rat bone.

    PubMed

    Ek-Rylander, B; Bill, P; Norgård, M; Nilsson, S; Andersson, G

    1991-12-25

    Tartrate-resistant acid phosphatase (TRAP) is a characteristic constituent of osteoclasts and some mononuclear preosteoclasts and, therefore, used as a histochemical and biochemical marker for osteoclasts and bone resorption. We now report the isolation of a 1397-base pair (bp) full-length TRAP/tartrate-resistant acid ATPase (TrATPase) cDNA clone from a neonatal rat calvaria lambda gt11 cDNA library. The cDNA clone consists of a 92-bp untranslated 5'-flank, an open reading frame of 981 bp and a 324-bp untranslated 3'-poly(A)-containing region. The deduced protein sequence of 327 amino acids contains a putative cleavable signal sequence of 21 amino acids. The mature polypeptide of 306 amino acids has a calculated Mr of 34,350 Da and a pI of 9.18, and it contains two potential N-glycosylation sites and the lysosomal targeting sequence DKRFQ. At the protein level, the sequence displays 89-94% homology to TRAP enzymes from human placenta, beef spleen, and uteroferrin and identity to the N terminus of purified rat bone TRAP/TrATPase. An N-terminal amino acid segment is strikingly homologous to the corresponding region in lysosomal and prostatic acid phosphatases. The cDNA recognized a 1.5-kilobase mRNA in long bones and calvaria, and in vitro translation using, as template, mRNA transcribed from the full-length insert yielded an immunoprecipitated product of 34 kDa. In neonatal rats, TRAP/TrATPase mRNA was highly expressed in skeletal tissues, with much lower (less than 10%) levels detected in spleen, thymus, liver, skin, brain, kidney, brain, lung, and heart. In situ hybridization demonstrated specific labeling of osteoclasts at endostal surfaces and bone trabeculae of long bones. Thus, despite the apparent similarity of this osteoclastic TRAP/TrATPase with type 5, tartrate-resistant and purple, acid phosphatases expressed in other mammalian tissues, this gene appears to be preferentially expressed at skeletal sites.

  2. An Alignment-Free Algorithm in Comparing the Similarity of Protein Sequences Based on Pseudo-Markov Transition Probabilities among Amino Acids

    PubMed Central

    Li, Yushuang; Yang, Jiasheng; Zhang, Yi

    2016-01-01

    In this paper, we have proposed a novel alignment-free method for comparing the similarity of protein sequences. We first encode a protein sequence into a 440 dimensional feature vector consisting of a 400 dimensional Pseudo-Markov transition probability vector among the 20 amino acids, a 20 dimensional content ratio vector, and a 20 dimensional position ratio vector of the amino acids in the sequence. By evaluating the Euclidean distances among the representing vectors, we compare the similarity of protein sequences. We then apply this method into the ND5 dataset consisting of the ND5 protein sequences of 9 species, and the F10 and G11 datasets representing two of the xylanases containing glycoside hydrolase families, i.e., families 10 and 11. As a result, our method achieves a correlation coefficient of 0.962 with the canonical protein sequence aligner ClustalW in the ND5 dataset, much higher than those of other 5 popular alignment-free methods. In addition, we successfully separate the xylanases sequences in the F10 family and the G11 family and illustrate that the F10 family is more heat stable than the G11 family, consistent with a few previous studies. Moreover, we prove mathematically an identity equation involving the Pseudo-Markov transition probability vector and the amino acids content ratio vector. PMID:27918587

  3. Lactic acid bacterial diversity in the traditional mexican fermented dough pozol as determined by 16S rDNA sequence analysis.

    PubMed

    Escalante, A; Wacher, C; Farrés, A

    2001-02-28

    The lactic acid bacteria diversity of pozol, a Mexican fermented maize dough, was studied using a total DNA extraction and purification procedure and PCR amplification of 16S rDNA for gram-positive and related bacterial groups. Thirty-six clones were obtained and sequenced to 650 nucleotides. These partial sequences were identified by submission to the non-redundant nucleotide database of NCBI. The identified sequences were aligned with reference sequences of the closest related organisms. This analysis indicated that only 14 sequences were unique clones and these were identified as Lactococcus lactis, Streptococcus suis, Lactobacillus plantarum, Lact. casei, Lact. alimentarium, and Lact. delbruekii and Clostridium sp. Two non-ribosomal sequences were also detected. Unlike other environments analyzed with this molecular approach where many unidentified microorganisms are found, the identity of most sequences could be established as lactic acid bacteria, indicating that this is the main group among the gram-positive bacteria in pozol. Use of this molecular method permitted detection of lactic acid bacteria different from those previously isolated and identified by culture techniques

  4. An Alignment-Free Algorithm in Comparing the Similarity of Protein Sequences Based on Pseudo-Markov Transition Probabilities among Amino Acids.

    PubMed

    Li, Yushuang; Song, Tian; Yang, Jiasheng; Zhang, Yi; Yang, Jialiang

    2016-01-01

    In this paper, we have proposed a novel alignment-free method for comparing the similarity of protein sequences. We first encode a protein sequence into a 440 dimensional feature vector consisting of a 400 dimensional Pseudo-Markov transition probability vector among the 20 amino acids, a 20 dimensional content ratio vector, and a 20 dimensional position ratio vector of the amino acids in the sequence. By evaluating the Euclidean distances among the representing vectors, we compare the similarity of protein sequences. We then apply this method into the ND5 dataset consisting of the ND5 protein sequences of 9 species, and the F10 and G11 datasets representing two of the xylanases containing glycoside hydrolase families, i.e., families 10 and 11. As a result, our method achieves a correlation coefficient of 0.962 with the canonical protein sequence aligner ClustalW in the ND5 dataset, much higher than those of other 5 popular alignment-free methods. In addition, we successfully separate the xylanases sequences in the F10 family and the G11 family and illustrate that the F10 family is more heat stable than the G11 family, consistent with a few previous studies. Moreover, we prove mathematically an identity equation involving the Pseudo-Markov transition probability vector and the amino acids content ratio vector.

  5. Amino acid sequence and posttranslational modifications of human factor VII sub a from plasma and transfected baby hamster kidney cells

    SciTech Connect

    Thim, L.; Bjoern, S.; Christensen, M.; Nicolaisen, E.M.; Lund-Hansen, T.; Pedersen, A.H.; Hedner, U. )

    1988-10-04

    Blood coagulation factor VII is a vitamin K dependent glycoprotein which in its activated form, factor VII{sub a}, participates in the coagulation process by activating factor X and/or factor IX in the presence of Ca{sup 2+} and tissue factor. Three types of potential posttranslational modifications exist in the human factor VII{sub a} molecule, namely, 10 {gamma}-carboxylated, N-terminally located glutamic acid residues, 1 {beta}-hydroxylated aspartic acid residue, and 2 N-glycosylated asparagine residues. In the present study, the amino acid sequence and posttranslational modifications of recombinant factor VII{sub a} as purified from the culture medium of a transfected baby hamster kidney cell line have been compared to human plasma factor VII{sub a}. By use of HPLC, amino acid analysis, peptide mapping, and automated Edman degradation, the protein backbone of recombinant factor VII{sub a} was found to be identical with human factor VII{sub a}. Asparagine residues 145 and 322 were found to be fully N-glycosylated in human plasma factor VII{sub a}. In the recombinant factor VII{sub a}, asparagine residue 322 was fully glycosylated whereas asparagine residue 145 was only partially (approximately 66%) glycosylated. Besides minor differences in the sialic acid and fucose contents, the overall carbohydrate compositions were nearly identical in recombinant factor VII{sub a} and human plasma factor VII{sub a}. These results show that factor VII{sub a} as produced in the transfected baby hamster kidney cells is very similar to human plasma factor VII{sub a} and that this cell line thus might represent an alternative source for human factor VII{sub a}.

  6. The Chinese hamster Alu-equivalent sequence: a conserved highly repetitious, interspersed deoxyribonucleic acid sequence in mammals has a structure suggestive of a transposable element.

    PubMed Central

    Haynes, S R; Toomey, T P; Leinwand, L; Jelinek, W R

    1981-01-01

    A consensus sequence has been determined for a major interspersed deoxyribonucleic acid repeat in the genome of Chinese hamster ovary cells (CHO cells). This sequence is extensively homologous to (i) the human Alu sequence (P. L. Deininger et al., J. Mol. Biol., in press), (ii) the mouse B1 interspersed repetitious sequence (Krayev et al., Nucleic Acids Res. 8:1201-1215, 1980) (iii) an interspersed repetitious sequence from African green monkey deoxyribonucleic acid (Dhruva et al., Proc. Natl. Acad. Sci. U.S.A. 77:4514-4518, 1980) and (iv) the CHO and mouse 4.5S ribonucleic acid (this report; F. Harada and N. Kato, Nucleic Acids Res. 8:1273-1285, 1980). Because the CHO consensus sequence shows significant homology to the human Alu sequence it is termed the CHO Alu-equivalent sequence. A conserved structure surrounding CHO Alu-equivalent family members can be recognized. It is similar to that surrounding the human Alu and the mouse B1 sequences, and is represented as follows: direct repeat-CHO-Alu-A-rich sequence-direct repeat. A composite interspersed repetitious sequence has been identified. Its structure is represented as follows: direct repeat-residue 47 to 107 of CHO-Alu-non-Alu repetitious sequence-A-rich sequence-direct repeat. Because the Alu flanking sequences resemble those that flank known transposable elements, we think it likely that the Alu sequence dispersed throughout the mammalian genome by transposition. Images PMID:9279371

  7. Nucleotide sequence and spatial expression pattern of a drought- and abscisic Acid-induced gene of tomato.

    PubMed

    Plant, A L; Cohen, A; Moses, M S; Bray, E A

    1991-11-01

    The nucleotide sequence of le16, a tomato (Lycopersicon esculentum Mill.) gene induced by drought stress and regulated by abscisic acid specifically in aerial vegetative tissue, is presented. The single open reading frame contained within the gene has the capacity to encode a polypeptide of 12.7 kilodaltons and is interrupted by a small intron. The predicted polypeptide is rich in leucine, glycine, and alanine and has an isoelectric point of 8.7. The amino terminus is hydrophobic and characteristic of signal sequences that target polypeptides for export from the cytoplasm. There is homology (47.2% identity) between the amino terminus of the LE 16 polypeptide and the corresponding amino terminal domain of the maize phospholipid transfer protein. le16 was expressed in drought-stressed leaf, petiole, and stem tissue and to a much lower extent in the pericarp of mature green tomato fruit and developing seeds. No expression was detected in the pericarp of red fruit or in drought-stressed roots. Expression of le16 was also induced in leaf tissue by a variety of other abiotic stresses including polyethylene glycol-mediated water deficit, salinity, cold stress, and heat stress. None of these stresses or direct applications of abscisic acid induced the expression of le16 in the roots of the same plants. The unique expression characteristics of this gene indicates that novel regulatory mechanisms, in addition to endogenous abscisic acid, are involved in controlling gene expression.

  8. The amino acid sequence of goat beta-lactoglobulin.

    PubMed

    Préaux, G; Braunitzer, G; Schrank, B; Stangl, A

    1979-11-01

    The isolation of beta-lactoglobulin from milk of the goat is described. The purified protein was checked for purity and has been characterized by its gross composition and end groups. The native or the modified protein was then degraded by tryptic and cyanogen bromide cleavage. The cleavage products were isolated and sequenced in the sequenator using a Quadrol and propyne program. These data provide the complete sequence of beta-lactoglobulin of the goat. The results are discussed and compared particularly with bovine beta-lactoglobulin components AB. Some biological aspects are described.

  9. Layered materials with coexisting acidic and basic sites for catalytic one-pot reaction sequences.

    PubMed

    Motokura, Ken; Tada, Mizuki; Iwasawa, Yasuhiro

    2009-06-17

    Acidic montmorillonite-immobilized primary amines (H-mont-NH(2)) were found to be excellent acid-base bifunctional catalysts for one-pot reaction sequences, which are the first materials with coexisting acid and base sites active for acid-base tamdem reactions. For example, tandem deacetalization-Knoevenagel condensation proceeded successfully with the H-mont-NH(2), affording the corresponding condensation product in a quantitative yield. The acidity of the H-mont-NH(2) was strongly influenced by the preparation solvent, and the base-catalyzed reactions were enhanced by interlayer acid sites.

  10. Computer Simulation of the Determination of Amino Acid Sequences in Polypeptides

    ERIC Educational Resources Information Center

    Daubert, Stephen D.; Sontum, Stephen F.

    1977-01-01

    Describes a computer program that generates a random string of amino acids and guides the student in determining the correct sequence of a given protein by using experimental analytic data for that protein. (MLH)

  11. Computer Simulation of the Determination of Amino Acid Sequences in Polypeptides

    ERIC Educational Resources Information Center

    Daubert, Stephen D.; Sontum, Stephen F.

    1977-01-01

    Describes a computer program that generates a random string of amino acids and guides the student in determining the correct sequence of a given protein by using experimental analytic data for that protein. (MLH)

  12. Synthesis of gamma,delta-unsaturated glycolic acids via sequenced brook and Ireland--claisen rearrangements.

    PubMed

    Schmitt, Daniel C; Johnson, Jeffrey S

    2010-03-05

    Organozinc, -magnesium, and -lithium nucleophiles initiate a Brook/Ireland-Claisen rearrangement sequence of allylic silyl glyoxylates resulting in the formation of gamma,delta-unsaturated alpha-silyloxy acids.

  13. Reconstruction of cyclooxygenase evolution in animals suggests variable, lineage-specific duplications, and homologs with low sequence identity.

    PubMed

    Havird, Justin C; Kocot, Kevin M; Brannock, Pamela M; Cannon, Johanna T; Waits, Damien S; Weese, David A; Santos, Scott R; Halanych, Kenneth M

    2015-04-01

    Cyclooxygenase (COX) enzymatically converts arachidonic acid into prostaglandin G/H in animals and has importance during pregnancy, digestion, and other physiological functions in mammals. COX genes have mainly been described from vertebrates, where gene duplications are common, but few studies have examined COX in invertebrates. Given the increasing ease in generating genomic data, as well as recent, although incomplete descriptions of potential COX sequences in Mollusca, Crustacea, and Insecta, assessing COX evolution across Metazoa is now possible. Here, we recover 40 putative COX orthologs by searching publicly available genomic resources as well as ~250 novel invertebrate transcriptomic datasets. Results suggest the common ancestor of Cnidaria and Bilateria possessed a COX homolog similar to those of vertebrates, although such homologs were not found in poriferan and ctenophore genomes. COX was found in most crustaceans and the majority of molluscs examined, but only specific taxa/lineages within Cnidaria and Annelida. For example, all octocorallians appear to have COX, while no COX homologs were found in hexacorallian datasets. Most species examined had a single homolog, although species-specific COX duplications were found in members of Annelida, Mollusca, and Cnidaria. Additionally, COX genes were not found in Hemichordata, Echinodermata, or Platyhelminthes, and the few previously described COX genes in Insecta lacked appreciable sequence homology (although structural analyses suggest these may still be functional COX enzymes). This analysis provides a benchmark for identifying COX homologs in future genomic and transcriptomic datasets, and identifies lineages for future studies of COX.

  14. Genome sequence of the acid-tolerant strain Rhizobium sp. LPU83.

    PubMed

    Wibberg, Daniel; Tejerizo, Gonzalo Torres; Del Papa, María Florencia; Martini, Carla; Pühler, Alfred; Lagares, Antonio; Schlüter, Andreas; Pistorio, Mariano

    2014-04-20

    Rhizobia are important members of the soil microbiome since they enter into nitrogen-fixing symbiosis with different legume host plants. Rhizobium sp. LPU83 is an acid-tolerant Rhizobium strain featuring a broad-host-range. However, it is ineffective in nitrogen fixation. Here, the improved draft genome sequence of this strain is reported. Genome sequence information provides the basis for analysis of its acid tolerance, symbiotic properties and taxonomic classification.

  15. The amino acid sequence of monal pheasant lysozyme and its activity.

    PubMed

    Araki, T; Matsumoto, T; Torikata, T

    1998-10-01

    The amino acid sequence of monal pheasant lysozyme and its activity were analyzed. Carboxymethylated lysozyme was digested with trypsin and the resulting peptides were sequenced. The established amino acid sequence had one amino acid substitution at position 102 (Arg to Gly) comparing with Indian peafowl lysozyme and four amino acid substitutions at positions 3 (Phe to Tyr), 15 (His to Leu), 41 (Gln to His), and 121 (Gln to His) with chicken lysozyme. Analysis of the time-courses of reaction using N-acetylglucosamine pentamer as a substrate showed a difference of binding free energy change (-0.4 kcal/mol) at subsites A between monal pheasant and Indian peafowl lysozyme. This was assumed to be caused by the amino acid substitution at subsite A with loss of a positive charge at position 102 (Arg102 to Gly).

  16. Single-chain structure of human ceruloplasmin: the complete amino acid sequence of the whole molecule.

    PubMed Central

    Takahashi, N; Ortel, T L; Putnam, F W

    1984-01-01

    We have determined the amino acid sequence of the amino-terminal 67,000-dalton (67-kDa) fragment of human ceruloplasmin and have established overlapping sequences between the 67-kDa and 50-kDa fragments and between the 50-kDa and 19-kDa fragments. The 67-kDa fragment contains 480 amino acid residues and three glucosamine oligosaccharides. These results together with our previous sequence data for the 50-kDa and 19-kDa fragments complete the amino acid sequence of human ceruloplasmin. The polypeptide chain has a total of 1,046 amino acid residues (Mr 120,085) and has attachment sites for four glucosamine oligosaccharides; together these account for the total molecular mass of human ceruloplasmin (132 kDa). The sequence analysis of the peptides overlapping the fragments showed that one additional amino acid, arginine, is present between the 67-kDa and 50-kDa fragments, and another, lysine, is between the 50-kDa and 19-kDa fragments. Only two apparent sites of amino acid interchange have been identified in the polypeptide chain. Both involve a single-point interchange of glycine and lysine that would result in a difference in charge. The results of the complete sequence analysis verified that human ceruloplasmin is composed of a single polypeptide chain and that the subunit-like fragments are produced by proteolytic cleavage during purification (and possibly also in vivo). PMID:6582496

  17. Myoglobin of the shark Heterodontus portusjacksoni: isolation and amino acid sequence.

    PubMed

    Fisher, W K; Thompson, E O

    1979-06-01

    Myoglobin isolated from red muscle of the shark H. portusjacksoni was purified by ion-exchange chromatography on sulfopropyl-Sephadex and gel-filtration. Amino acid analysis and sequence determination showed 148 amino acid residues. The amino terminal residue is acetylated as shown by mass spectrographic analysis of N-terminal peptides. There is a deletion of four residues at the amino terminal end as well as one residue in the CD interhelical area relative to other myoglobins. The complete amino acid sequence has been determined following digestion with trypsin, chymotrypsin, pepsin and staphylococcal protease. Sequences of the purified peptides were determined by the dansyl-Edman procedure. The amino acid sequence showed approximately 85 differences from mammalian, monotreme and bird myoglobins. The date of divergence of the shark H. portusjacksoni from these other orders was estimated at 450 +/- 16 million years, based on the number of amino acid differences between species and allowing for multiple mutations during the evolutionary period. This estimate agrees well with similar estimates made using alpha- and beta-globin sequences, in contrast to widely differing estimates of dates of divergence for monotremes using the same three globin chains. Compared with myoglobins from species previously studied, there are many more differences in amino acid sequences, and in many positions residues are found that are more characteristic of alpha- and beta-globins, suggesting a conservation of residues over a long period of evolutionary time. There are fewer stabilizing hydrogen bonds and salt-linkages than in other myoglobins.

  18. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria.

    PubMed

    Geissler, Andreas J; Behr, Jürgen; Vogel, Rudi F

    2016-10-06

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. Copyright © 2016 Geissler et al.

  19. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    PubMed Central

    Geissler, Andreas J.; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. PMID:27795248

  20. Amino acid sequence of fibrolase, a direct-acting fibrinolytic enzyme from Agkistrodon contortrix contortrix venom.

    PubMed Central

    Randolph, A.; Chamberlain, S. H.; Chu, H. L.; Retzios, A. D.; Markland, F. S.; Masiarz, F. R.

    1992-01-01

    The complete amino acid sequence of fibrolase, a fibrinolytic enzyme from southern copperhead (Agkistrodon contortrix contortrix) venom, has been determined. This is the first report of the sequence of a direct-acting, nonhemorrhagic fibrinolytic enzyme found in snake venom. The majority of the sequence was established by automated Edman degradation of overlapping peptides generated by a variety of selective cleavage procedures. The amino-terminus is blocked by a cyclized glutamine (pyroglutamic acid) residue, and the sequence of this region of the molecule was determined by mass spectrometry. Fibrolase is composed of 203 residues in a single polypeptide chain with a molecular weight of 22,891, as determined by the sequence. Its sequence is homologous to the sequence of the hemorrhagic toxin Ht-d of Crotalus atrox venom and with the sequences of two metalloproteinases from Trimeresurus flavoviridis venom. Microheterogeneity in the sequence was found at both the amino-terminus and at residues 189 and 192. All six cysteine residues in fibrolase are involved in disulfide bonds. A disulfide bond between cysteine-118 and cysteine-198 has been established and bonds between cysteines-158/165 and between cysteines-160/192 are inferred from the homology to Ht-d. Secondary structure prediction reveals a very low percentage of alpha-helix (4%), but much greater beta-structure (39.5%). Analysis of the sequence reveals the absence of asparagine-linked glycosylation sites defined by the consensus sequence: asparagine-X-serine/threonine. PMID:1304358

  1. PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.

    PubMed

    Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong; Warnow, Tandy

    2015-05-01

    We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate--slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory.

  2. Bacterial Regulon Evolution: Distinct Responses and Roles for the Identical OmpR Proteins of Salmonella Typhimurium and Escherichia coli in the Acid Stress Response

    PubMed Central

    Quinn, Heather J.; Cameron, Andrew D. S.; Dorman, Charles J.

    2014-01-01

    The evolution of new gene networks is a primary source of genetic innovation that allows bacteria to explore and exploit new niches, including pathogenic interactions with host organisms. For example, the archetypal DNA binding protein, OmpR, is identical between Salmonella Typhimurium serovar Typhimurium and Escherichia coli, but regulatory specialization has resulted in different environmental triggers of OmpR expression and largely divergent OmpR regulons. Specifically, ompR mRNA and OmpR protein levels are elevated by acid pH in S. Typhimurium but not in E. coli. This differential expression pattern is due to differences in the promoter regions of the ompR genes and the E. coli ompR orthologue can be made acid-inducible by introduction of the appropriate sequences from S. Typhimurium. The OmpR regulon in S. Typhimurium overlaps that of E. coli at only 15 genes and includes many horizontally acquired genes (including virulence genes) that E. coli does not have. We found that OmpR binds to its genomic targets in higher abundance when the DNA is relaxed, something that occurs in S. Typhimurium as a result of acid stress and which is a requirement for optimal expression of its virulence genes. The genomic targets of OmpR do not share a strong nucleotide sequence consensus: we propose that the ability of OmpR to recruit additional genes to its regulon arises from its modest requirements for specificity in its DNA targets with its preference for relaxed DNA allowing it to cooperate with DNA-topology-based allostery to modulate transcription in response to acid stress. PMID:24603618

  3. Draft Genome Sequence of Gephyronic Acid Producer Cystobacter violaceus Strain Cb vi76

    PubMed Central

    Stevens, D. Cole; Young, Jeanette; Carmichael, Rory; Tan, John

    2014-01-01

    A draft genome sequence of Cystobacter violaceus strain Cb vi76, which produces the eukaryotic protein synthesis inhibitor gephyronic acid, has been obtained. The genome contains numerous predicted secondary metabolite clusters, including the gephyronic acid biosynthetic pathway. This genome will contribute to the investigation of secondary metabolism in other Cystobacter strains. PMID:25502681

  4. SETG: Nucleic Acid Extraction and Sequencing for In Situ Life Detection on Mars

    NASA Astrophysics Data System (ADS)

    Mojarro, A.; Hachey, J.; Tani, J.; Smith, A.; Bhattaru, S. A.; Pontefract, A.; Doebler, R.; Brown, M.; Ruvkun, G.; Zuber, M. T.; Carr, C. E.

    2016-10-01

    We are developing an integrated nucleic acid extraction and sequencing instrument: the Search for Extra-Terrestrial Genomes (SETG) for in situ life detection on Mars. Our goals are to identify related or unrelated nucleic acid-based life on Mars.

  5. Draft Genome Sequence of Cyanobacterium sp. Strain IPPAS B-1200 with a Unique Fatty Acid Composition

    PubMed Central

    Starikov, Alexander Y.; Usserbaeva, Aizhan A.; Sinetova, Maria A.; Sarsekeyeva, Fariza K.; Zayadan, Bolatkhan K.; Ustinova, Vera V.; Kupriyanova, Elena V.; Los, Dmitry A.

    2016-01-01

    Here, we report the draft genome of Cyanobacterium sp. IPPAS strain B-1200, isolated from Lake Balkhash, Kazakhstan, and characterized by the unique fatty acid composition of its membrane lipids, which are enriched with myristic and myristoleic acids. The approximate genome size is 3.4 Mb, and the predicted number of coding sequences is 3,119. PMID:27856596

  6. Molecular genetics of herpes simplex virus: Demonstration of regions of obligatory and nonobligatory identity within diploid regions of the genome by sequence replacement and insertion

    PubMed Central

    Knipe, David M.; Ruyechan, William T.; Roizman, Bernard; Halliburton, Ian W.

    1978-01-01

    The DNAs of herpes simplex virus (HSV) 1 and 2 consist of two components, L and S, each composed of unique sequences bracketed by inverted repeats. In this study we have probed the structure of the reiterated regions of the S component in marker rescue experiments involving transfection of cells with mixtures of intact HSV-1 mutant viral DNA and individual DNA fragments generated by restriction endonuclease digestion of wild-type HSV-1 or HSV-2 DNAs. The results were as follows: (i) HSV is diploid for the wild-type sequences that rescue two temperature-sensitive (ts) mutants. DNA fragments from both reiterated regions of the S component of HSV-1(F) DNA can rescue tsLB2 and tsD mutants. (ii) Identity of the entire reiterated sequence at both ends of S is not obligatory because only one end of the S component of wild phenotype virus HSV-1(1061) rescues tsD even though both ends rescue tsLB2. (iii) Genes in both reiterated sequences can be expressed. We produced, by marker rescue experiments, recombinants with heterotypic ends of the S component, and these specified corresponding polypeptides characteristic of both HSV-1 and HSV-2. (iv) The reiterated sequences of the S component may contain a region of obligatory identity. Thus, several recombinant clones produced by rescue with HSV-2 DNA contained identical HSV-2 DNA insertions within both reiterated regions of the HSV-1 S component. Consistent with this conclusion, the termini of the S component in the heterodiploids described in iii were identical by restriction enzyme analysis. (v) The observation that HSV DNA can be expanded by at least 5 × 106 by means of insertion in the S component suggests that it can be a vehicle for exogenous DNA. Images PMID:211508

  7. Parvalbumins from coelacanth muscle. III. Amino acid sequence of the major component.

    PubMed

    Jauregui-Adell, J; Pechere, J F

    1978-09-26

    The primary structure of the major parvalbumin (pI = 4.52) from coelacanth muscle (Latimeria chalumnae) has been determined. Sequence analysis of the tryptic peptides, in some cases obtained with beta-trypsin, accounts for the total amino acid content of the protein. Chymotryptic peptides provide appropriate sequence overlaps, to complete the localization of the tryptic peptides. Examination of the amino acid sequence of this protein shows the typical structure of a beta-parvalbumin. Its position in the dendrogram of related calcium-binding proteins corresponds to that usually accepted for crossopterygians.

  8. Sequencing and computational analysis of complete genome sequences of Citrus yellow mosaic badna virus from acid lime and pummelo.

    PubMed

    Borah, Basanta K; Johnson, A M Anthony; Sai Gopal, D V R; Dasgupta, Indranil

    2009-08-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus, is the causative agent of Citrus mosaic disease in India. Although the virus has been detected in several citrus species, only two full-length genomes, one each from Sweet orange and Rangpur lime, are available in publicly accessible databases. In order to obtain a better understanding of the genetic variability of the virus in other citrus mosaic-affected citrus species, we performed the cloning and sequence analysis of complete genomes of CMBV from two additional citrus species, Acid lime and Pummelo. We show that CMBV genomes from the two hosts share high homology with previously reported CMBV sequences and hence conclude that the new isolates represent variants of the virus present in these species. Based on in silico sequence analysis, we predict the possible function of the protein encoded by one of the five ORFs.

  9. Analysis of cloned cDNA and genomic sequences for phytochrome: complete amino acid sequences for two gene products expressed in etiolated Avena.

    PubMed Central

    Hershey, H P; Barker, R F; Idler, K B; Lissemore, J L; Quail, P H

    1985-01-01

    Cloned cDNA and genomic sequences have been analyzed to deduce the amino acid sequence of phytochrome from etiolated Avena. Restriction endonuclease site polymorphism between clones indicates that at least four phytochrome genes are expressed in this tissue. Sequence analysis of two complete and one partial coding region shows approximately 98% homology at both the nucleotide and amino acid levels, with the majority of amino acid changes being conservative. High sequence homology is also found in the 5'-untranslated region but significant divergence occurs in the 3'-untranslated region. The phytochrome polypeptides are 1128 amino acid residues long corresponding to a molecular mass of 125 kdaltons. The known protein sequence at the chromophore attachment site occurs only once in the polypeptide, establishing that phytochrome has a single chromophore per monomer covalently linked to Cys-321. Computer analyses of the amino acid sequences have provided predictions regarding a number of structural features of the phytochrome molecule. PMID:3001642

  10. SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues

    PubMed Central

    Sun, Jun; Liu, Rong

    2015-01-01

    Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder) by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder. PMID:26176857

  11. Amino acid sequence of winged bean (Psophocarpus tetragonolobus (L.) DC.) chymotrypsin inhibitor, WCI-3.

    PubMed

    Shibata, H; Hara, S; Ikenaka, T

    1988-10-01

    The complete amino acid sequence of winged bean chymotrypsin inhibitor 3 (WCI-3) was determined by the conventional methods. WCI-3 consisted of 183 amino acid residues, but was heterogeneous in the carboxyl terminal region owing to the loss of one to four carboxyl terminal amino acid residues. The sequence of WCI-3 was highly homologous with those of soybean trypsin inhibitor Tia, winged bean trypsin inhibitor WTI-1, and Erythrina latissima trypsin inhibitor DE-3. One of the reactive site peptide bonds of WCI-3 was identified as Leu(65)-Ser(66), which was located at the same position as those of the other Kunitz-family leguminous proteinase inhibitors.

  12. Amino acid sequence of anionic peroxidase from the windmill palm tree Trachycarpus fortunei.

    PubMed

    Baker, Margaret R; Zhao, Hongwei; Sakharov, Ivan Yu; Li, Qing X

    2014-12-10

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications.

  13. Amino Acid Sequence of Anionic Peroxidase from the Windmill Palm Tree Trachycarpus fortunei

    PubMed Central

    2015-01-01

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications. PMID:25383699

  14. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations.

    PubMed

    Abascal, Federico; Zardoya, Rafael; Telford, Maximilian J

    2010-07-01

    We present TranslatorX, a web server designed to align protein-coding nucleotide sequences based on their corresponding amino acid translations. Many comparisons between biological sequences (nucleic acids and proteins) involve the construction of multiple alignments. Alignments represent a statement regarding the homology between individual nucleotides or amino acids within homologous genes. As protein-coding DNA sequences evolve as triplets of nucleotides (codons) and it is known that sequence similarity degrades more rapidly at the DNA than at the amino acid level, alignments are generally more accurate when based on amino acids than on their corresponding nucleotides. TranslatorX novelties include: (i) use of all documented genetic codes and the possibility of assigning different genetic codes for each sequence; (ii) a battery of different multiple alignment programs; (iii) translation of ambiguous codons when possible; (iv) an innovative criterion to clean nucleotide alignments with GBlocks based on protein information; and (v) a rich output, including Jalview-powered graphical visualization of the alignments, codon-based alignments coloured according to the corresponding amino acids, measures of compositional bias and first, second and third codon position specific alignments. The TranslatorX server is freely available at http://translatorx.co.uk.

  15. RNAblueprint: flexible multiple target nucleic acid sequence design.

    PubMed

    Hammer, Stefan; Tschiatschek, Birgit; Flamm, Christoph; Hofacker, Ivo L; Findeiß, Sven

    2017-09-15

    Realizing the value of synthetic biology in biotechnology and medicine requires the design of molecules with specialized functions. Due to its close structure to function relationship, and the availability of good structure prediction methods and energy models, RNA is perfectly suited to be synthetically engineered with predefined properties. However, currently available RNA design tools cannot be easily adapted to accommodate new design specifications. Furthermore, complicated sampling and optimization methods are often developed to suit a specific RNA design goal, adding to their inflexibility. We developed a C ++  library implementing a graph coloring approach to stochastically sample sequences compatible with structural and sequence constraints from the typically very large solution space. The approach allows to specify and explore the solution space in a well defined way. Our library also guarantees uniform sampling, which makes optimization runs performant by not only avoiding re-evaluation of already found solutions, but also by raising the probability of finding better solutions for long optimization runs. We show that our software can be combined with any other software package to allow diverse RNA design applications. Scripting interfaces allow the easy adaption of existing code to accommodate new scenarios, making the whole design process very flexible. We implemented example design approaches written in Python to demonstrate these advantages. RNAblueprint , Python implementations and benchmark datasets are available at github: https://github.com/ViennaRNA . s.hammer@univie.ac.at, ivo@tbi.univie.ac.at or sven@tbi.univie.ac.at. Supplementary data are available at Bioinformatics online.

  16. Amino acid sequence of homologous rat atrial peptides: natriuretic activity of native and synthetic forms.

    PubMed Central

    Seidah, N G; Lazure, C; Chrétien, M; Thibault, G; Garcia, R; Cantin, M; Genest, J; Nutt, R F; Brady, S F; Lyle, T A

    1984-01-01

    A substance called atrial natriuretic factor (ANF), localized in secretory granules of atrial cardiocytes, was isolated as four homologous natriuretic peptides from homogenates of rat atria. The complete sequence of the longest form showed that it is composed of 33 amino acids. The three other shorter forms (2-33, 3-33, and 8-33) represent amino-terminally truncated versions of the 33 amino acid parent molecule as shown by analysis of sequence, amino acid composition, or both. The proposed primary structure agrees entirely with the amino acid composition and reveals no significant sequence homology with any known protein or segment of protein. The short form ANF-(8-33) was synthesized by a multi-fragment condensation approach and the synthetic product was shown to exhibit specific activity comparable to that of the natural ANF-(3-33). PMID:6232612

  17. Shark myelin basic protein: amino acid sequence, secondary structure, and self-association.

    PubMed

    Milne, T J; Atkins, A R; Warren, J A; Auton, W P; Smith, R

    1990-09-01

    Myelin basic protein (MBP) from the Whaler shark (Carcharhinus obscurus) has been purified from acid extracts of a chloroform/methanol pellet from whole brains. The amino acid sequence of the majority of the protein has been determined and compared with the sequences of other MBPs. The shark protein has only 44% homology with the bovine protein, but, in common with other MBPs, it has basic residues distributed throughout the sequence and no extensive segments that are predicted to have an ordered secondary structure in solution. Shark MBP lacks the triproline sequence previously postulated to form a hairpin bend in the molecule. The region containing the putative consensus sequence for encephalitogenicity in the guinea pig contains several substitutions, thus accounting for the lack of activity of the shark protein. Studies of the secondary structure and self-association have shown that shark MBP possesses solution properties similar to those of the bovine protein, despite the extensive differences in primary structure.

  18. Nucleotide sequence determination of bacteriophage T4 glycine transfer ribonucleic acid

    PubMed Central

    Stahl, Stephen; Paddock, Gary V.; Abelson, John

    1974-01-01

    The nucleotide sequence of a T4 tRNA with an anticodon for glycine has been determined using 32P-labeled material from T4-infected cultures of Escherichia coli. The sequence is: pGCGGAUAUCGUAUAAUGmGDAUUACCUCAGACUUCCAAψCUGAUGAUGUGAGTψCGAUUCUCAUUAUCCGCUCCA-OH. The 74 nucleotide sequence can be arranged in the classic cloverleaf pattern for tRNAs. The anticodon of T4 tRNAGly is UCC with a possible modification of the U. The tRNA molecule would thus be expected to recognize the glycine codons GGG and GGA. Comparative analysis of tRNAsGly from T2 and T6 indicate that their sequences are identical with that from T4. Images PMID:10793690

  19. Using Conductivity Measurements to Determine the Identities and Concentrations of Unknown Acids: An Inquiry Laboratory Experiment

    ERIC Educational Resources Information Center

    Smith, K. Christopher; Garza, Ariana

    2015-01-01

    This paper describes a student designed experiment using titrations involving conductivity measurements to identify unknown acids as being either HCl or H[subscript 2]SO[subscript 4], and to determine the concentrations of the acids, thereby improving the utility of standard acid-base titrations. Using an inquiry context, students gain experience…

  20. Using Conductivity Measurements to Determine the Identities and Concentrations of Unknown Acids: An Inquiry Laboratory Experiment

    ERIC Educational Resources Information Center

    Smith, K. Christopher; Garza, Ariana

    2015-01-01

    This paper describes a student designed experiment using titrations involving conductivity measurements to identify unknown acids as being either HCl or H[subscript 2]SO[subscript 4], and to determine the concentrations of the acids, thereby improving the utility of standard acid-base titrations. Using an inquiry context, students gain experience…

  1. An analysis of amino acid sequences surrounding archaeal glycoprotein sequons.

    PubMed

    Abu-Qarn, Mehtap; Eichler, Jerry

    2007-05-01

    Despite having provided the first example of a prokaryal glycoprotein, little is known of the rules governing the N-glycosylation process in Archaea. As in Eukarya and Bacteria, archaeal N-glycosylation takes place at the Asn residues of Asn-X-Ser/Thr sequons. Since not all sequons are utilized, it is clear that other factors, including the context in which a sequon exists, affect glycosylation efficiency. As yet, the contribution to N-glycosylation made by sequon-bordering residues and other related factors in Archaea remains unaddressed. In the following, the surroundings of Asn residues confirmed by experiment as modified were analyzed in an attempt to define sequence rules and requirements for archaeal N-glycosylation.

  2. Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification

    PubMed Central

    Sinclair, Robert M.; Ravantti, Janne J.

    2017-01-01

    ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids

  3. Classification of mouse VK groups based on the partial amino acid sequence to the first invariant tryptophan: impact of 14 new sequences from IgG myeloma proteins.

    PubMed

    Potter, M; Newell, J B; Rudikoff, S; Haber, E

    1982-12-01

    Fourteen new VK sequences derived from BALB/c IgG myeloma proteins were determined to the first invariant tryptophan (Trp 35). These partial sequences were compared with 65 other published VK sequences using a computer program. The 79 sequences were organized according to the length of the sequence from the amino terminus to the first invariant tryptophan (Trp 35), into seven groups (33, 34, 35, 36, 39, 40 and 41aa). A distance matrix of all 79 sequences was then computed, i.e. the number of amino acid substitutions necessary to convert one sequence to another was determined. From these data a dendrogram was constructed. Most of the VK sequences fell into clusters or closely related groups. The definition of a sequence group is arbitrary but facilitates the classification of VK proteins. We used 12 substitutions as the basis for defining a sequence group based on the known number of substitutions that are found in the VK21 proteins. By this criterion there were 18 groups in the Trp 35 dendrogram. Twelve of the 14 new sequences fell into one of these sequence groups; two formed new sequence groups. Collective amino acid sequencing is still encountering new VK structures indicating more sequences will be required to attain an accurate estimate of the total number of VK groups. Updated dendrograms can be quickly generated to include newly generated sequences.

  4. Molecular cloning and sequencing of the human erythrocyte 2,3-bisphosphoglycerate mutase cDNA: revised amino acid sequence.

    PubMed Central

    Joulin, V; Peduzzi, J; Roméo, P H; Rosa, R; Valentin, C; Dubart, A; Lapeyre, B; Blouquit, Y; Garel, M C; Goossens, M

    1986-01-01

    The human erythrocyte 2,3-bisphosphoglycerate mutase (BPGM) is a multifunctional enzyme which controls the metabolism of 2,3-diphosphoglycerate, the main allosteric effector of haemoglobin. Several cDNA banks were constructed from reticulocyte mRNA, either by conventional cloning methods in pBR322 and screening with specific mixed oligonucleotide probes, or in the expression vector lambda gt 11. The largest cDNA isolated contained 1673 bases [plus the poly(A) tail], which is slightly smaller than the size of the intact mRNA as estimated by Northern blot analysis (approximately 1800 bases). This cDNA encodes for a protein of 258 residues; the protein yielded 34 tryptic peptides which were subsequently isolated by h.p.l.c. Our nucleotide sequence data were entirely confirmed by the amino acid composition of these tryptic peptides and reveal several major differences from the published sequence; the revised amino acid sequence of human BPGM is presented. These findings represent the first step in the study of the expression and regulation of this enzyme as a specific marker of the erythroid cell line. Images Fig. 5. PMID:3023066

  5. Plant mitochondrial nucleic acid sequences as a tool for phylogenetic analysis.

    PubMed Central

    Hiesel, R; von Haeseler, A; Brennicke, A

    1994-01-01

    To evaluate the potential of mitochondrial nucleic acid sequences as a phylogenetic tool, we have analyzed cytochrome oxidase subunit III (coxIII) coding sequences in representatives of the major groups of land plants. The phylogenetic tree derived from these mitochondrial sequences confirms the monophyletic origin of land plant mitochondria with the general order and descent of land plants deduced by other molecular, physiological, and morphological traits. The mitochondrial sequences strongly suggest a close phylogenetic relationship between Bryophyta and Lycopodiatae, whereas Psilophytatae cluster with the other vascular plants. In addition to the high sequence similarity, both Hepaticophytina and Lycopodiatae contain a related intron in the coxIII gene that, to our knowledge, is not found in any other plant species. The slowly evolving mitochondrial sequences of plants are shown to provide a useful phylogenetic tool to evaluate distant evolutionary relationships within this kingdom. PMID:7507251

  6. Evolution of phosphagen kinase V. cDNA-derived amino acid sequences of two molluscan arginine kinases from the chiton Liolophura japonica and the turbanshell Battilus cornutus.

    PubMed

    Suzuki, T; Ban, T; Furukohri, T

    1997-06-20

    The cDNAs of arginine kinases from the chiton Liolophura japonica (Polyplacophora) and the turbanshell Battilus cornutus (Gastropoda) were amplified by polymerase chain reaction (PCR), and the complete nucleotide sequences of 1669 and 1624 bp, respectively, were determined. The open reading frame for Liolophura arginine kinase is 1050 nucleotides in length and encodes a protein with 349 amino acid residues, and that for Battilus is 1077 nucleotides and 358 residues. The validity of the cDNA-derived amino acid sequence was supported by chemical sequencing of internal tryptic peptides. The molecular masses were calculated to be 39,057 and 39,795 Da, respectively. The amino acid sequence of Liolophura arginine kinase showed 65-68% identity with those of Battilus and Nordotis (abalone) arginine kinases, and the homology between Battilus and Nordotis was 79%. Molluscan arginine kinases also show lower, but significant homology (38-43%) with rabbit creatine kinase. The sequences of arginine kinases could be used as a molecular clock to elucidate the phylogeny of Mollusca, one of the most diverse animal phyla.

  7. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1997-04-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.

  8. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1997-01-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.

  9. Single-cell sequencing unveils the lifestyle and CRISPR-based population history of Hydrotalea sp. in acid mine drainage.

    PubMed

    Medeiros, J D; Leite, L R; Pylro, V S; Oliveira, F S; Almeida, V M; Fernandes, G R; Salim, A C M; Araújo, F M G; Volpini, A C; Oliveira, G; Cuadros-Orellana, S

    2017-08-12

    Acid mine drainage (AMD) is characterized by an acid and metal-rich run-off that originates from mining systems. Despite having been studied for many decades, much remains unknown about the microbial community dynamics in AMD sites, especially during their early development, when the acidity is moderate. Here, we describe draft genome assemblies from single cells retrieved from an early-stage AMD sample. These cells belong to the genus Hydrotalea and are closely related to Hydrotalea flava. The phylogeny and average nucleotide identity analysis suggest that all single amplified genomes (SAGs) form two clades that may represent different strains. These cells have the genomic potential for denitrification, copper and other metal resistance. Two coexisting CRISPR-Cas loci were recovered across SAGs, and we observed heterogeneity in the population with regard to the spacer sequences, together with the loss of trailer-end spacers. Our results suggest that the genomes of Hydrotalea sp. strains studied here are adjusting to a quickly changing selective pressure at the microhabitat scale, and an important form of this selective pressure is infection by foreign DNA. © 2017 John Wiley & Sons Ltd.

  10. Cloning and sequencing of the cDNA for S-acyl fatty acid synthase thioesterase from the uropygial gland of mallard duck.

    PubMed

    Poulose, A J; Rogers, L; Cheesbrough, T M; Kolattukudy, P E

    1985-12-15

    In vitro translation of poly(A)+ RNA from the uropygial glands of mallard ducks (Anas platyrhynchos) generated a 29-kDa protein which cross-reacted with rabbit antibodies prepared against S-acyl fatty acid synthase thioesterase (Kolattukudy, P. E., Rogers, L., and Flurkey, W. (1985) J. Biol. Chem., 260, 10789-10793). A poly(A)+ RNA fraction enriched in this thioesterase mRNA, isolated by sucrose density gradient centrifugation, was used to prepare cDNA which was cloned in Escherichia coli using the plasmid pUC9. Using hybrid-selected translation and colony hybridization, 17 clones were selected which contained the cDNA for S-acyl fatty acid synthase thioesterase. Northern blot analysis showed that the mature mRNA for this thioesterase contained 1350 nucleotides whereas the cloned cDNA inserts contained 1150-1200 base pairs. Five of the 6 clones tested for 5'-sequence had identical sequences, and the three tested for 3'-end showed the same sequence with poly(A) tails. Two clones, pTE1 and pTE3, representing nearly the full length of mRNA, were selected for sequencing. Maxam-Gilbert and Sanger dideoxy chain termination methods were used on the cloned cDNA and on restriction fragments subcloned in M13 in order to determine the complete nucleotide sequence of the cloned cDNA. The nucleotide sequence showed an open reading frame coding for a peptide of 28.8 kDa. Two peptides isolated from the tryptic digest of the thioesterase purified from the gland showed amino acid sequences which matched with two segments of the sequence deduced from the nucleotide sequence. Another segment containing a serine residue showed an amino acid sequence homologous to the active serine-containing segment of the thioesterase domain of fatty acid synthase. Thus, the clones represent cDNA for S-acyl fatty acid synthase thioesterase. The present results constitute the first case of a complete sequence of a thioesterase.

  11. Ligation with nucleic acid sequence-based amplification.

    PubMed

    Ong, Carmichael; Tai, Warren; Sarma, Aartik; Opal, Steven M; Artenstein, Andrew W; Tripathi, Anubhav

    2012-01-01

    This work presents a novel method for detecting nucleic acid targets using a ligation step along with an isothermal, exponential amplification step. We use an engineered ssDNA with two variable regions on the ends, allowing us to design the probe for optimal reaction kinetics and primer binding. This two-part probe is ligated by T4 DNA Ligase only when both parts bind adjacently to the target. The assay demonstrates that the expected 72-nt RNA product appears only when the synthetic target, T4 ligase, and both probe fragments are present during the ligation step. An extraneous 38-nt RNA product also appears due to linear amplification of unligated probe (P3), but its presence does not cause a false-positive result. In addition, 40 mmol/L KCl in the final amplification mix was found to be optimal. It was also found that increasing P5 in excess of P3 helped with ligation and reduced the extraneous 38-nt RNA product. The assay was also tested with a single nucleotide polymorphism target, changing one base at the ligation site. The assay was able to yield a negative signal despite only a single-base change. Finally, using P3 and P5 with longer binding sites results in increased overall sensitivity of the reaction, showing that increasing ligation efficiency can improve the assay overall. We believe that this method can be used effectively for a number of diagnostic assays.

  12. Computational simulations of protein folding to engineer amino acid sequences to encourage desired supersecondary structure formation.

    PubMed

    Gerstman, Bernard S; Chapagain, Prem P

    2013-01-01

    The dynamics of protein folding are complicated because of the various types of amino acid interactions that create secondary, supersecondary, and tertiary interactions. Computational modeling can be used to simulate the biophysical and biochemical interactions that determine protein folding. Effective folding to a desired protein configuration requires a compromise between speed, stability, and specificity. If the primary sequence of amino acids emphasizes one of these characteristics, the others might suffer and the folding process may not be optimized. We provide an example of a model peptide whose primary sequence produces a highly stable supersecondary two-helix bundle structure, but at the expense of lower speed and specificity of the folding process. We show how computational simulations can be used to discover the configuration of the kinetic trap that causes the degradation in the speed and specificity of folding. We also show how amino acid sequences can be engineered by specific substitutions to optimize the folding to the desired supersecondary structure.

  13. Thin-film technology for direct visual detection of nucleic acid sequences: applications in clinical research.

    PubMed

    Jenison, Robert D; Bucala, Richard; Maul, Diana; Ward, David C

    2006-01-01

    Certain optical conditions permit the unaided eye to detect thickness changes on surfaces on the order of 20 A, which are of similar dimensions to monomolecular interactions between proteins or hybridization of complementary nucleic acid sequences. Such detection exploits specific interference of reflected white light, wherein thickness changes are perceived as surface color changes. This technology, termed thin-film detection, allows for the visualization of subattomole amounts of nucleic acid targets, even in complex clinical samples. Thin-film technology has been applied to a broad range of clinically relevant indications, including the detection of pathogenic bacterial and viral nucleic acid sequences and the discrimination of sequence variations in human genes causally related to susceptibility or severity of disease.

  14. The human Ig-[beta] cDNA sequence, a homologue of murine B29, is identical in B cell and plasma cell lines producing all the human Ig isotypes

    SciTech Connect

    Hashimoto, Shiori; Gregersen, P.K.; Chiorazzi, N. Cornell Univ., New York, NY )

    1993-01-15

    The B cell Ag receptor complex consists of at least two disulfide-linked, heterodimeric structures: the clonally restricted membrane Ig (mIg) molecule and the nonpolymorphic Ig-[alpha]:Ig-[beta] protein dimer. The latter molecule is encoded by two separate genes, mb-1 and B29. The DNA sequences of murine and human mb-1 and murine B29 have been determined previously. This study describes the sequence of the full-length human cDNA homologue of the murine Ig-[beta]/B29 message. The human sequence codes for a protein that displays the typical subunit features of a transmembrane member of the Ig superfamily. The transmembrane and intracytoplasmic domains exhibit striking nucleotide and amino acid sequence similarity between the two species. These regions show almost complete conservation of areas presumed to be involved in noncovalent interactions with other members of the receptor complex and with intracellular kinases and cytoskeletal components. The only sequence dissimilarity seen in these presumed critical areas involves the Y-E-G-L-N motif, a potential target for tyrosine phosphorylation. In contrast, the extracellular portion is much more divergent. Inasmuch as similar patterns of species diversity have been reported for Ig-[alpha], the Ig-[alpha] and Ig-[beta] molecules may have coevolved to maintain species-specific extracellular interactions between one another and with mIg. Similar to the Ig-[alpha] molecule, the Ig-[beta] sequence is identical in B lineage cells expressing all five Ig isotypes. However, in contrast to the Ig-[alpha] molecule, the Ig-[beta] sequence is expressed at apparently similar levels in terminally differentiated, mIg[sup [minus

  15. Amino acid sequences of two trypsin inhibitors from winged bean seeds (Psophocarpus tetragonolobus (L)DC.).

    PubMed

    Yamamoto, M; Hara, S; Ikenaka, T

    1983-09-01

    The trypsin inhibitor (WTI-1) purified from winged bean seeds is a Kunitz type protease inhibitor having a molecular weight of 19,200. WTI-1 inhibits bovine trypsin stoichiometrically, but not bovine alpha-chymotrypsin. The approximate Ki value for the trypsin-inhibitor complex is 2.5 X 10(-9) M. The complete amino acid sequence of WTI-1 was determined by conventional methods. Comparison of the sequence with that of soybean trypsin inhibitor (STI) indicated that the sequence of WTI-1 had 50% homology with that of STI. WTI-1 was separated into 2 homologous inhibitors, WTI-1A and WTI-1B, by isoelectric focusing. The isoelectric points of WTI-1A and WTI-1B were 8.5 and 9.4, respectively, and their sequences were presumed from their amino acid compositions.

  16. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences

    NASA Technical Reports Server (NTRS)

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.

  17. RNA internal standard synthesis by nucleic acid sequence-based amplification for competitive quantitative amplification reactions.

    PubMed

    Lo, Wan-Yu; Baeumner, Antje J

    2007-02-15

    Nucleic acid sequence-based amplification (NASBA) reactions have been demonstrated to successfully synthesize new sequences based on deletion and insertion reactions. Two RNA internal standards were synthesized for use in competitive amplification reactions in which quantitative analysis can be achieved by coamplifying the internal standard with the wild type sample. The sequences were created in two consecutive NASBA reactions using the E. coli clpB mRNA sequence as model analyte. The primer sequences of the wild type sequence were maintained, and a 20-nt-long segment inside the amplicon region was exchanged for a new segment of similar GC content and melting temperature. The new RNA sequence was thus amplifiable using the wild type primers and detectable via a new inserted sequence. In the first reaction, the forwarding primer and an additional 20-nt-long sequence was deleted and replaced by a new 20-nt-long sequence. In the second reaction, a forwarding primer containing as 5' overhang sequence the wild type primer sequence was used. The presence of pure internal standard was verified using electrochemiluminescence and RNA lateral-flow biosensor analysis. Additional sequence deletion in order to shorten the internal standard amplicons and thus generate higher detection signals was found not to be required. Finally, a competitive NASBA reaction between one internal standard and the wild type sequence was carried out proving its functionality. This new rapid construction method via NASBA provides advantages over the traditional techniques since it requires no traditional cloning procedures, no thermocyclers, and can be completed in less than 4 h.

  18. Purification to homogeneity and partial amino acid sequence of a fragment which includes the methyl acceptor site of the human DNA repair protein for O6-methylguanine.

    PubMed

    Major, G N; Gardner, E J; Carne, A F; Lawley, P D

    1990-03-25

    DNA repair by O6-methylguanine-DNA methyltransferase (O6-MT) is accomplished by removal by the enzyme of the methyl group from premutagenic O6-methylguanine-DNA, thereby restoring native guanine in DNA. The methyl group is transferred to an acceptor site cysteine thiol group in the enzyme, which causes the irreversible inactivation of O6-MT. We detected a variety of different forms of the methylated, inactivated enzyme in crude extracts of human spleen of molecular weights higher and lower than the usually observed 21-24kDa for the human O6-MT. Several apparent fragments of the methylated form of the protein were purified to homogeneity following reaction of partially-purified extract enzyme with O6-[3H-CH3]methylguanine-DNA substrate. One of these fragments yielded amino acid sequence information spanning fifteen residues, which was identified as probably belonging to human methyltransferase by virtue of both its significant sequence homology to three procaryote forms of O6-MT encoded by the ada, ogt (both from E. coli) and dat (B. subtilis) genes, and sequence position of the radiolabelled methyl group which matched the position of the conserved procaryote methyl acceptor site cysteine residue. Statistical prediction of secondary structure indicated good homologies between the human fragment and corresponding regions of the constitutive form of O6-MT in procaryotes (ogt and dat gene products), but not with the inducible ada protein, indicating the possibility that we had obtained partial amino acid sequence for a non-inducible form of the human enzyme. The identity of the fragment sequence as belonging to human methyltransferase was more recently confirmed by comparison with cDNA-derived amino acid sequence from the cloned human O6-MT gene from HeLa cells (1). The two sequences compared well, with only three out of fifteen amino acids being different (and two of them by only one nucleotide in each codon).

  19. Conversion of amino-acid sequence in proteins to classical music: search for auditory patterns

    PubMed Central

    2007-01-01

    We have converted genome-encoded protein sequences into musical notes to reveal auditory patterns without compromising musicality. We derived a reduced range of 13 base notes by pairing similar amino acids and distinguishing them using variations of three-note chords and codon distribution to dictate rhythm. The conversion will help make genomic coding sequences more approachable for the general public, young children, and vision-impaired scientists. PMID:17477882

  20. Extremely Acidophilic Protists from Acid Mine Drainage Host Rickettsiales-Lineage Endosymbionts That Have Intervening Sequences in Their 16S rRNA Genes

    PubMed Central

    Baker, Brett J.; Hugenholtz, Philip; Dawson, Scott C.; Banfield, Jillian F.

    2003-01-01

    During a molecular phylogenetic survey of extremely acidic (pH < 1), metal-rich acid mine drainage habitats in the Richmond Mine at Iron Mountain, Calif., we detected 16S rRNA gene sequences of a novel bacterial group belonging to the order Rickettsiales in the Alphaproteobacteria. The closest known relatives of this group (92% 16S rRNA gene sequence identity) are endosymbionts of the protist Acanthamoeba. Oligonucleotide 16S rRNA probes were designed and used to observe members of this group within acidophilic protists. To improve visualization of eukaryotic populations in the acid mine drainage samples, broad-specificity probes for eukaryotes were redesigned and combined to highlight this component of the acid mine drainage community. Approximately 4% of protists in the acid mine drainage samples contained endosymbionts. Measurements of internal pH of the protists showed that their cytosol is close to neutral, indicating that the endosymbionts may be neutrophilic. The endosymbionts had a conserved 273-nucleotide intervening sequence (IVS) in variable region V1 of their 16S rRNA genes. The IVS does not match any sequence in current databases, but the predicted secondary structure forms well-defined stem loops. IVSs are uncommon in rRNA genes and appear to be confined to bacteria living in close association with eukaryotes. Based on the phylogenetic novelty of the endosymbiont sequences and initial culture-independent characterization, we propose the name “Candidatus Captivus acidiprotistae.” To our knowledge, this is the first report of an endosymbiotic relationship in an extremely acidic habitat. PMID:12957940

  1. Nucleotide and deduced amino acid sequences of Torpedo californica acetylcholine receptor gamma subunit.

    PubMed Central

    Claudio, T; Ballivet, M; Patrick, J; Heinemann, S

    1983-01-01

    The nucleotide sequence has been determined of a cDNA clone that codes for the 60,000-dalton gamma subunit of Torpedo californica acetylcholine receptor. The length of the cDNA clone is 2,010 base pairs. The 5' and 3' untranslated regions have respective lengths of 31 and 461 base pairs. Data suggest that the putative polyadenylylation consensus sequence A-A-T-A-A-A may not be required for polyadenylylation of the mRNA corresponding to the cDNA clone described in this study. From the DNA sequence data, the amino acid sequence of the gamma subunit was deduced. The subunit is composed of 489 amino acids giving a molecular mass of 56,600 daltons. The deduced amino acid sequence data also indicate the presence of a 17-amino acid extension or signal peptide on this subunit. From these data, structural predictions for the gamma subunit are made such as potential membrane-spanning regions, possible asparagine-linked glycosylation sites, and the assignment of regions of the protein to the extracellular, internal, and cytoplasmic domains of the lipid bilayer. Images PMID:6573658

  2. Diagnostics based on nucleic acid sequence variant profiling: PCR, hybridization, and NGS approaches.

    PubMed

    Khodakov, Dmitriy; Wang, Chunyan; Zhang, David Yu

    2016-10-01

    Nucleic acid sequence variations have been implicated in many diseases, and reliable detection and quantitation of DNA/RNA biomarkers can inform effective therapeutic action, enabling precision medicine. Nucleic acid analysis technologies being translated into the clinic can broadly be classified into hybridization, PCR, and sequencing, as well as their combinations. Here we review the molecular mechanisms of popular commercial assays, and their progress in translation into in vitro diagnostics. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  3. Ab initio detection of fuzzy amino acid tandem repeats in protein sequences

    PubMed Central

    2012-01-01

    Background Tandem repetitions within protein amino acid sequences often correspond to regular secondary structures and form multi-repeat 3D assemblies of varied size and function. Developing internal repetitions is one of the evolutionary mechanisms that proteins employ to adapt their structure and function under evolutionary pressure. While there is keen interest in understanding such phenomena, detection of repeating structures based only on sequence analysis is considered an arduous task, since structure and function is often preserved even under considerable sequence divergence (fuzzy tandem repeats). Results In this paper we present PTRStalker, a new algorithm for ab-initio detection of fuzzy tandem repeats in protein amino acid sequences. In the reported results we show that by feeding PTRStalker with amino acid sequences from the UniProtKB/Swiss-Prot database we detect novel tandemly repeated structures not captured by other state-of-the-art tools. Experiments with membrane proteins indicate that PTRStalker can detect global symmetries in the primary structure which are then reflected in the tertiary structure. Conclusions PTRStalker is able to detect fuzzy tandem repeating structures in protein sequences, with performance beyond the current state-of-the art. Such a tool may be a valuable support to investigating protein structural properties when tertiary X-ray data is not available. PMID:22536906

  4. Human debrisoquine 4-hydroxylase (P450IID1): cDNA and deduced amino acid sequence and assignment of the CYP2D locus to chromosome 22.

    PubMed

    Gonzalez, F J; Vilbois, F; Hardwick, J P; McBride, O W; Nebert, D W; Gelboin, H V; Meyer, U A

    1988-02-01

    The enzyme P450db1 (db1) is responsible for the common human defect in drug oxidation known as the "debrisoquine/sparteine polymorphism." Polyclonal antibody against the rat db1 protein was used to screen a human liver lambda gt11 library for the db1 cDNA clone. A cDNA containing the full protein coding sequence was isolated; the deduced NH2-terminal sequence of this cDNA was identical to that derived from direct sequencing of the purified human db1 protein. Comparison of the human db1 with rat db1 revealed 71 and 73% similarities of nucleotides and amino acids, respectively. By use of human-rodent somatic cell hybrids the db1 gene was localized to human chromosome 22 (CYP2D locus).

  5. The amino acid sequence around the active-site cysteine and histidine residues of stem bromelain

    PubMed Central

    Husain, S. S.; Lowe, G.

    1970-01-01

    Stem bromelain that had been irreversibly inhibited with 1,3-dibromo[2-14C]-acetone was reduced with sodium borohydride and carboxymethylated with iodoacetic acid. After digestion with trypsin and α-chymotrypsin three radioactive peptides were isolated chromatographically. The amino acid sequences around the cross-linked cysteine and histidine residues were determined and showed a high degree of homology with those around the active-site cysteine and histidine residues of papain and ficin. PMID:5420046

  6. Amino acid sequences of two nonspecific lipid-transfer proteins from germinated castor bean.

    PubMed

    Takishima, K; Watanabe, S; Yamada, M; Suga, T; Mamiya, G

    1988-11-01

    The amino acid sequence of two nonspecific lipid-transfer proteins (nsLTP) B and C from germinated castor bean seeds have been determined. Both the proteins consist of 92 residues, as for nsLTP previously reported, and their calculated Mr values are 9847 and 9593 for nsLTP-B and nsLTP-C, respectively. The sequences of nsLTP-B and nsLTP-C, compared to the known sequence of nsLTP-A from the same source, are 68% and 35% similar, respectively. No variation was found at the positions of the cysteine residues, indicating that they might be involved in disulfide bridges.

  7. A classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B

    1991-01-01

    The amino acid sequences of 301 glycosyl hydrolases and related enzymes have been compared. A total of 291 sequences corresponding to 39 EC entries could be classified into 35 families. Only ten sequences (less than 5% of the sample) could not be assigned to any family. With the sequences available for this analysis, 18 families were found to be monospecific (containing only one EC number) and 17 were found to be polyspecific (containing at least two EC numbers). Implications on the folding characteristics and mechanism of action of these enzymes and on the evolution of carbohydrate metabolism are discussed. With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised. PMID:1747104

  8. Synthetic oligonucleotide probes deduced from amino acid sequence data. Theoretical and practical considerations.

    PubMed

    Lathe, R

    1985-05-05

    Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.

  9. AcalPred: a sequence-based tool for discriminating between acidic and alkaline enzymes.

    PubMed

    Lin, Hao; Chen, Wei; Ding, Hui

    2013-01-01

    The structure and activity of enzymes are influenced by pH value of their surroundings. Although many enzymes work well in the pH range from 6 to 8, some specific enzymes have good efficiencies only in acidic (pH<5) or alkaline (pH>9) solution. Studies have demonstrated that the activities of enzymes correlate with their primary sequences. It is crucial to judge enzyme adaptation to acidic or alkaline environment from its amino acid sequence in molecular mechanism clarification and the design of high efficient enzymes. In this study, we developed a sequence-based method to discriminate acidic enzymes from alkaline enzymes. The analysis of variance was used to choose the optimized discriminating features derived from g-gap dipeptide compositions. And support vector machine was utilized to establish the prediction model. In the rigorous jackknife cross-validation, the overall accuracy of 96.7% was achieved. The method can correctly predict 96.3% acidic and 97.1% alkaline enzymes. Through the comparison between the proposed method and previous methods, it is demonstrated that the proposed method is more accurate. On the basis of this proposed method, we have built an online web-server called AcalPred which can be freely accessed from the website (http://lin.uestc.edu.cn/server/AcalPred). We believe that the AcalPred will become a powerful tool to study enzyme adaptation to acidic or alkaline environment.

  10. Complete amino acid sequence of the N-terminal extension of calf skin type III procollagen.

    PubMed Central

    Brandt, A; Glanville, R W; Hörlein, D; Bruckner, P; Timpl, R; Fietzek, P P; Kühn, K

    1984-01-01

    The N-terminal extension peptide of type III procollagen, isolated from foetal-calf skin, contains 130 amino acid residues. To determine its amino acid sequence, the peptide was reduced and carboxymethylated or aminoethylated and fragmented with trypsin, Staphylococcus aureus V8 proteinase and bacterial collagenase. Pyroglutamate aminopeptidase was used to deblock the N-terminal collagenase fragment to enable amino acid sequencing. The type III collagen extension peptide is homologous to that of the alpha 1 chain of type I procollagen with respect to a three-domain structure. The N-terminal 79 amino acids, which contain ten of the 12 cysteine residues, form a compact globular domain. The next 39 amino acids are in a collagenase triplet sequence (Gly- Xaa - Yaa )n with a high hydroxyproline content. Finally, another short non-collagenous domain of 12 amino acids ends at the cleavage site for procollagen aminopeptidase, which cleaves a proline-glutamine bond. In contrast with type I procollagen, the type III procollagen extension peptides contain interchain disulphide bridges located at the C-terminus of the triple-helical domain. PMID:6331392

  11. Detection of multiple, novel reverse transcriptase coding sequences in human nucleic acids: relation to primate retroviruses

    SciTech Connect

    Shih, A.; Misra, R.; Rush, M.G.

    1989-01-01

    A variety of chemically synthesized oligonucleotides designed on the basis of amino acid and/or nucleotide sequence data were used to detect a large number of novel reverse transcriptase coding sequences in human and mouse DNAs. Procedures involving Southern blotting, library screening, and the polymerase chain reaction were all used to detect such sequences; the polymerase chain reaction was the most rapid and productive approach. In the polymerase chain reaction, oligonucleotide mixtures based on consensus sequence homologies to reverse transcriptase coding sequences and unique oligonucleotides containing perfect homology to the coding sequences of human T-cell leukemia virus types I and II were both effective in amplifying reverse transcriptase-related DNA. It is shown that human DNA contains a wide spectrum of retrovirus-related reverse transcriptase coding sequences, including some that are clearly related to human T-cell leukemia virus types I and II, some that are related to the L-1 family of long interspersed nucleotide sequences, and others that are related to previously described human endogenous proviral DNAs. In addition, human T-cell leukemia virus type I-related sequences appear to be transcribed in both normal human T cells and in a cell line derived from a human teratocarcinoma.

  12. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence editors...

  13. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence editors...

  14. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence editors...

  15. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence editors...

  16. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence editors...

  17. Isolation and amino acid sequences of opossum vasoactive intestinal polypeptide and cholecystokinin octapeptide.

    PubMed Central

    Eng, J; Yu, J; Rattan, S; Yalow, R S

    1992-01-01

    Evolutionary history suggests that the marsupials entered South America from North America about 75 million years ago and subsequently dispersed into Australia before the separation between South America and Antarctica-Australia. A question of interest is whether marsupial peptides resemble the corresponding peptides of Old or New World mammals. Previous studies had shown that "little" gastrin of the North American marsupial, the opossum, is identical in length to that of the New World mammals, the guinea pig and chinchilla. In this report, we demonstrate that opossum cholecystokinin octapeptide, like that of the Australian marsupials, the Eastern quoll and the Tamar wallaby, is identical to the cholecystokinin octapeptide of Old World mammals and differs from that of the guinea pig and chinchilla. However, opossum vasoactive intestinal polypeptide differs from the usual Old World mammalian vasoactive intestinal polypeptide in five sites: [sequence; see text]. PMID:1542675

  18. Complete amino acid sequence of branched-chain amino acid aminotransferase (transaminase B) of Salmonella typhimurium, identification of the coenzyme-binding site and sequence comparison analysis

    SciTech Connect

    Feild, M.J.

    1988-01-01

    The complete amino acid sequence of the subunit of branched-chain amino acid aminotransferase of Salmonella typhimurium was determined by automated Edman degradation of peptide fragments generated by chemical and enzymatic digestion of S-carboxymethylated and S-pyridylethylated transaminase B. Peptide fragments of transaminase B were generated by treatment of the enzyme with trypsin, Staphylococcus aureus V8 protease, endoproteinase Lys-C, and cyanogen bromide. Protocols were developed for separation of the peptide fragments by reverse-phase high performance liquid chromatography (HPLC), ion-exchange HPLC, and SDS-urea gel electrophoresis. The enzyme subunit contains 308 amino acid residues and has a molecular weight of 33,920 daltons. The coenzyme-binding site was determined by treatment of the enzyme, containing bound pyridoxal 5-phosphate, with tritiated sodium borohydride prior to trypsin digestion. Monitoring radioactivity incorporation and peptide map comparisons with an apoenzyme tryptic digest, allowed identification of the pyridoxylated-peptide which was isolated by reverse-phase HPLC and sequenced. The coenzyme-binding site is a lysyl residue at position 159. Some peptides were further characterized by fast atom bombardment mass spectrometry.

  19. The primary structure of E. coli RNA polymerase, Nucleotide sequence of the rpoC gene and amino acid sequence of the beta'-subunit.

    PubMed

    Ovchinnikov YuA; Monastyrskaya, G S; Gubanov, V V; Guryev, S O; Salomatina, I S; Shuvaeva, T M; Lipkin, V M; Sverdlov, E D

    1982-07-10

    The primary structure of the E. coli rpoC gene (5321 base pairs) coding the beta'-subunit of RNA polymerase as well as its adjacent segment have been determined. The structure analysis of the peptides obtained by cleavage of the protein with cyanogen bromide and trypsin has confirmed the amino acid sequence of the beta'-subunit deduced from the nucleotide sequence analysis. The beta'-subunit of E. coli RNA polymerase contains 1407 amino acid residues. Its translation is initiated by codon GUG and terminated by codon TAA. It has been detected that the sequence following the terminating codon is strikingly homologous to known sequences of rho-independent terminators.

  20. Code optimization of the subroutine to remove near identical matches in the sequence database homology search tool PSI-BLAST.

    PubMed

    Aspnäs, Mats; Mattila, Kimmo; Osowski, Kristoffer; Westerholm, Jan

    2010-06-01

    A central task in protein sequence characterization is the use of a sequence database homology search tool to find similar protein sequences in other individuals or species. PSI-BLAST is a widely used module of the BLAST package that calculates a position-specific score matrix from the best matching sequences and performs iterated searches using a method to avoid many similar sequences for the score. For some queries and parameter settings, PSI-BLAST may find many similar high-scoring matches, and therefore up to 80% of the total run time may be spent in this procedure. In this article, we present code optimizations that improve the cache utilization and the overall performance of this procedure. Measurements show that, for queries where the number of similar matches is high, the optimized PSI-BLAST program may be as much as 2.9 times faster than the original program.

  1. Sequence variation divides Equine rhinitis B virus into three distinct phylogenetic groups that correlate with serotype and acid stability.

    PubMed

    Black, Wesley D; Hartley, Carol A; Ficorilli, Nino P; Studdert, Michael J

    2005-08-01

    Equine rhinitis B virus (ERBV), genus Erbovirus, family Picornaviridae, occurs as two serotypes, ERBV1 and ERBV2, and the few isolates previously tested were acid labile. Of 24 ERBV1 isolates tested in the studies reported here, 19 were acid labile and five were acid stable. The two available ERBV2 isolates, as expected, were acid labile. Nucleotide sequences of the P1 region encoding the capsid proteins VP1, VP2, VP3 and VP4 were determined for five acid-labile and three acid-stable ERBV1 isolates and one acid-labile ERBV2 isolate. The sequences were aligned with the published sequences of the prototype acid-labile ERBV1.1436/71 and the prototype ERBV2.313/75. The three acid-stable ERBV1 were closely related in a phylogenetic group that was distinct from the group of six acid-labile ERBV1, which were also closely related to each other. The two acid-labile ERBV2 formed a third distinct group. One acid-labile ERBV1 had a chimeric acid-labile/acid-stable ERBV1 P1 sequence, presumably because of a recombination event within VP2 and this was supported by SimPlot analysis. ERBV1 rabbit antiserum neutralized acid-stable and acid-labile ERBV1 isolates similarly. Accordingly, three distinct phylogenetic groups of erboviruses exist that are consistent with serotype and acid stability phenotypes.

  2. Complete amino acid sequence of BSP-A3 from bovine seminal plasma. Homology to PDC-109 and to the collagen-binding domain of fibronectin.

    PubMed Central

    Seidah, N G; Manjunath, P; Rochemont, J; Sairam, M R; Chrétien, M

    1987-01-01

    Bovine seminal plasma was shown to contain three similar proteins, called BSP-A1, BSP-A2 and BSP-A3. Both BSP-A1 and BSP-A2 were shown to be molecular variants of a recently characterized peptide called PDC-109. They seem to differ only in their degree of glycosylation and otherwise seem to possess an identical amino acid composition. The work in the present paper deals with the complete characterization of the third member of this series, namely BSP-A3. The complete amino acid sequence revealed that it is composed of 115 amino acids and predicts a Mr of 13,403. An analysis of the primary structure of BSP-A3 revealed a high degree of internal homology, with two homologous domains composed of 39 (residues 28-66) and 43 (residues 73-115) amino acids. An exhaustive computer-bank search for the similarity of this sequence to any known protein, or segment thereof, revealed two significant homologies. The first is between PDC-109 and BSP-A3, which is so high that we can confidently predict that both proteins evolved from a single ancestral gene. The collagen-binding domain of bovine fibronectin (type II sequence) was also found to be highly homologous to both BSP-A3 and PDC-109. PMID:3606570

  3. Draft Genome Sequence of Sorghum Grain Mold Fungus Epicoccum sorghinum, a Producer of Tenuazonic Acid

    PubMed Central

    Oliveira, Rodrigo C.; Davenport, Karen W.; Hovde, Blake; Silva, Danielle; Chain, Patrick S. G.; Correa, Benedito

    2017-01-01

    ABSTRACT The facultative plant pathogen Epicoccum sorghinum is associated with grain mold of sorghum and produces the mycotoxin tenuazonic acid. This fungus can have serious economic impact on sorghum production. Here, we report the draft genome sequence of E. sorghinum (USPMTOX48). PMID:28126937

  4. Snake venom. The amino acid sequence of protein A from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1980-12-01

    Protein A from Dendroaspis polylepis polylepis venom comprises 81 amino acids, including ten half-cystine residues. The complete primary structures of protein A and its variant A' were elucidated. The sequences of proteins A and A', which differ in a single position, show no homology with various neurotoxins and non-neurotoxic proteins and represent a new type of elapid venom protein.

  5. Draft Genome Sequence of Bacillus coagulans NL01, a Wonderful l-Lactic Acid Producer

    PubMed Central

    Zheng, Zhaojuan; Jiang, Ting; Lin, Xi; Zhou, Jie

    2015-01-01

    Here, we report the draft genome sequence of Bacillus coagulans NL01, which could produce high optically pure l-lactic acid using xylose as a sole carbon source. The draft genome is 3,505,081 bp, with 144 contigs. About 3,903 protein-coding genes and 92 rRNAs are predicted from this assembly. PMID:26089419

  6. The Corynebacterium xerosis composite transposon Tn5432 consists of two identical insertion sequences, designated IS1249, flanking the erythromycin resistance gene ermCX.

    PubMed

    Tauch, A; Kassing, F; Kalinowski, J; Pühler, A

    1995-09-01

    Analysis of the 50-kb R-plasmid pTP10 from the clinical isolate Corynebacterium xerosis M82B revealed that the erythromycin resistance gene, ermCX, is located on a 4524-bp composite transposable element, Tn5432. The ends of Tn5432 are identical, direct repeats of an insertion sequence, designated IS1249, encoding a putative transposase of the IS256 family. IS1249 consists of 1385 bp with 45/42 imperfect terminal inverted repeats. The nucleotide sequence of the 1754-bp Tn5432 central region is 99% identical to the previously sequenced erythromycin resistance region of the Corynebacterium diphtheriae plasmid pNG2. It encodes the erythromycin resistance gene, ermCX, and an ORF homologous to the amino-terminal end of the transposase of IS31831 from Corynebacterium glutamicum. Transposons with regions flanking the insertion sites were recovered from the C. glutamicum chromosome by a plasmid rescue technique. Insertion of Tn5432 created 8-bp target site duplications. A Tn5432-induced isoleucine/valine-auxotrophic mutant was found to carry the transposon in the 5' region of the ilvBNC cluster; in pTP10 the transposon is inserted in a region similar to replication and partitioning functions of the Enterococcus faecalis plasmid pAD1 and the Agrobacterium tumefaciens plasmid pTAR.

  7. Amino acid sequence of myoglobin from white-tailed deer (Odocoileus virginianus).

    PubMed

    Joseph, Poulson; Suman, Surendranath P; Li, Shuting; Fontaine, Michele; Steinke, Laurey

    2012-10-01

    Our objective was to determine the primary structure of white-tailed deer myoglobin (Mb). White-tailed deer Mb was isolated from cardiac muscles employing ammonium sulfate precipitation and gel-filtration chromatography. The amino acid sequence was determined by Edman degradation. Sequence analyses of intact Mb as well as tryptic- and cyanogen bromide-peptides yielded the complete primary structure of white-tailed deer Mb, which shared 100% similarity with red deer Mb. White-tailed deer Mb consists of 153 amino acid residues and shares more than 96% sequence similarity with myoglobins from meat-producing ruminants, such as cattle, buffalo, sheep, and goat. Similar to sheep and goat myoglobins, white-tailed deer Mb contains 12 histidine residues. Proximal (position 93) and distal (position 64) histidine residues responsible for maintaining the stability of heme are conserved in white-tailed deer Mb.

  8. Amino acid sequences of heterotrophic and photosynthetic ferredoxins from the tomato plant (Lycopersicon esculentum Mill.).

    PubMed

    Kamide, K; Sakai, H; Aoki, K; Sanada, Y; Wada, K; Green, L S; Yee, B C; Buchanan, B B

    1995-11-01

    Several forms (isoproteins) of ferredoxin in roots, leaves, and green and red pericarps in tomato plants (Lycopersicon esculentum Mill.) were earlier identified on the basis of N-terminal amino acid sequence and chromatographic behavior (Green et al. 1991). In the present study, a large scale preparation made possible determination of the full length amino acid sequence of the two ferredoxins from leaves. The ferredoxins characteristic of fruit and root were sequenced from the amino terminus to the 30th residue or beyond. The leaf ferredoxins were confirmed to be expressed in pericarp of both green and red fruit. The ferredoxins characteristic of fruit and root appeared to be restricted to those tissue. The results extend earlier findings in demonstrating that ferredoxin occurs in the major organs of the tomato plant where it appears to function irrespective of photosynthetic competence.

  9. Complete complementary DNA-derived amino acid sequence of canine cardiac phospholamban.

    PubMed Central

    Fujii, J; Ueno, A; Kitano, K; Tanaka, S; Kadoma, M; Tada, M

    1987-01-01

    Complementary DNA (cDNA) clones specific for phospholamban of sarcoplasmic reticulum membranes have been isolated from a canine cardiac cDNA library. The amino acid sequence deduced from the cDNA sequence indicates that phospholamban consists of 52 amino acid residues and lacks an amino-terminal signal sequence. The protein has an inferred mol wt 6,080 that is in agreement with its apparent monomeric mol wt 6,000, estimated previously by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. Phospholamban contains two distinct domains, a hydrophilic region at the amino terminus (domain I) and a hydrophobic region at the carboxy terminus (domain II). We propose that domain I is localized at the cytoplasmic surface and offers phosphorylatable sites whereas domain II is anchored into the sarcoplasmic reticulum membrane. PMID:3793929

  10. Folic acid blinded trial in identical twins with fragile X syndrome.

    PubMed Central

    Rosenblatt, D S; Duschenes, E A; Hellstrom, F V; Golick, M S; Vekemans, M J; Zeesman, S F; Andermann, E

    1985-01-01

    Monozygous twin 14-year-old mentally retarded boys with the fragile X syndrome were treated either with 10 mg folic acid by mouth daily or with a placebo for three test periods of 3-month duration each in a blind study. For each twin, tests of cognitive functioning, reading, spelling, and math skills, and linguistic and perceptual skills were compared. Although there was considerable variation in performance on these tests during the two baseline periods, there were no observable beneficial effects of therapy. The routine use of folic acid in patients with established mental retardation and the fragile X syndrome is not indicated. PMID:3890529

  11. Nucleotide sequence and the encoded amino acids of human apolipoprotein A-I mRNA.

    PubMed Central

    Law, S W; Brewer, H B

    1984-01-01

    The cDNA clones encoding the precursor form of human liver apolipoprotein A-I (apoA-I), preproapoA-I, have been isolated from a cDNA library. A 17-base synthetic oligonucleotide based on residues 108-113 of apoA-I and a 26-base primer-extended, dideoxynucleotide-terminated cDNA were used as hybridization probes to select for recombinant plasmids bearing the apoA-I sequence. The complete nucleic acid sequence of human liver preproapoA-I has been determined by analysis of the cloned cDNA. The sequence is composed of 801 nucleotides encoding 267 amino acid residues. PreproapoA-I contains an 18-amino-acid prepeptide and a 6-amino-acid propeptide connected to the amino terminus of the 243-amino acid mature apoA-I. Southern blotting analysis of chromosomal DNA obtained from peripheral blood indicated the apoA-I gene is contained in a 2.1-kilobase-pair Pst I fragment and there is no gross difference in structural organization between the normal apoA-I gene and the Tangier disease apoA-I gene. Images PMID:6198645

  12. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids.

    PubMed

    Das, Jayanta Kumar; Das, Provas; Ray, Korak Kumar; Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as 'FPKATD' and 'Y/FTNEKL' without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids' pattern in different proteins.

  13. Homology of the NH2-terminal amino acid sequences of the heavy and light chains of human monoclonal lupus autoantibodies containing the dominant 16/6 idiotype.

    PubMed Central

    Atkinson, P M; Lampman, G W; Furie, B C; Naparstek, Y; Schwartz, R S; Stollar, B D; Furie, B

    1985-01-01

    The NH2-terminal amino acid sequences have been determined by automated Edman degradation for the heavy and light chains of five monoclonal IgM anti-DNA autoantibodies that were produced by human-human hybridomas derived from lymphocytes of two patients with systemic lupus erythematosus. Four of the antibodies were closely related to the idiotype system 16/6, whereas the fifth antibody was unrelated idiotypically. The light chains of the 16/6 idiotype-positive autoantibodies (HF2-1/13b, HF2-1/17, HF2-18/2, and HF3-16/6) had identical amino acid sequences from residues 1 to 40. Their framework structures were characteristic of VKI light chains. The light chain of the 16/6 idiotype-negative autoantibody HF6-21/28 was characteristic of the VKII subgroup. The heavy chains of the 16/6 idiotype-positive autoantibodies had nearly identical amino acid sequences from residues 1 to 40. The framework structures were characteristic of the VHIII subgroup. In contrast, the GM4672 fusion partner of the hybridoma produced small quantities of an IgG with a VHI heavy chain and a VKI light chain. The heavy chains of the lupus autoantibodies and the light chains of those autoantibodies that were idiotypically related to the 16/6 system had marked sequence homology with WEA, a Waldenstrom IgM that binds to Klebsiella polysaccharides and expresses the 16/6 idiotype. These results indicate a striking homology in the amino termini of the heavy and light chains of the lupus autoantibodies studied and suggest that the V regions of the heavy and light chains of the 16/6 idiotype-positive DNA-binding lupus auto-antibodies are each encoded by a single germ line gene. PMID:3921567

  14. Software scripts for quality checking of high-throughput nucleic acid sequencers.

    PubMed

    Lazo, G R; Tong, J; Miller, R; Hsia, C; Rausch, C; Kang, Y; Anderson, O D

    2001-06-01

    We have developed a graphical interface to allow the researcher to view and assess the quality of sequencing results using a series of program scripts developed to process data generated by automated sequencers. The scripts are written in Perl programming language and are executable under the cgibin directory of a Web server environment. The scripts direct nucleic acid sequencing trace file data output from automated sequencers to be analyzed by the phred molecular biology program and are displayed as graphical hypertext mark-up language (HTML) pages. The scripts are mainly designed to handle 96-well microtiter dish samples, but the scripts are also able to read data from 384-well microtiter dishes 96 samples at a time. The scripts may be customized for different laboratory environments and computer configurations. Web links to the sources and discussion page are provided.

  15. Further characterization and amino acid sequence of m-type thioredoxins from spinach chloroplasts.

    PubMed

    Maeda, K; Tsugita, A; Dalzoppo, D; Vilbois, F; Schürmann, P

    1986-01-02

    The complete primary structure of m-type thioredoxin from spinach chloroplasts has been sequenced by conventional sequencing including fragmentation, Edman degradation and carboxypeptidase digestion. As already reported [Tsugita, A., Maeda, K. & Schürmann, P. (1983) Biochem. Biophys. Res. Commun. 115, 1-7] these thioredoxins contain the same active-site sequence as thioredoxins from other sources. Based on the amino acid sequence thioredoxin mc contains 103 residues, has a relative molecular mass of 11425 and a molar absorption coefficient at 280 nm of 19 300 M-1 cm-1. The spinach thioredoxin mc has an overall homology of 44% with the thioredoxin from Escherichia coli mainly due to differences in the N-terminal and C-terminal regions.

  16. Lactobacillus kefiri shows inter-strain variations in the amino acid sequence of the S-layer proteins.

    PubMed

    Malamud, Mariano; Carasi, Paula; Bronsoms, Sílvia; Trejo, Sebastián A; Serradell, María de Los Angeles

    2017-04-01

    The S-layer is a proteinaceous envelope constituted by subunits that self-assemble to form a two-dimensional lattice that covers the surface of different species of Bacteria and Archaea, and it could be involved in cell recognition of microbes among other several distinct functions. In this work, both proteomic and genomic approaches were used to gain knowledge about the sequences of the S-layer protein (SLPs) encoding genes expressed by six aggregative and sixteen non-aggregative strains of potentially probiotic Lactobacillus kefiri. Peptide mass fingerprint (PMF) analysis confirmed the identity of SLPs extracted from L. kefiri, and based on the homology with phylogenetically related species, primers located outside and inside the SLP-genes were employed to amplify genomic DNA. The O-glycosylation site SASSAS was found in all L. kefiri SLPs. Ten strains were selected for sequencing of the complete genes. The total length of the mature proteins varies from 492 to 576 amino acids, and all SLPs have a calculated pI between 9.37 and 9.60. The N-terminal region is relatively conserved and shows a high percentage of positively charged amino acids. Major differences among strains are found in the C-terminal region. Different groups could be distinguished regarding the mature SLPs and the similarities observed in the PMF spectra. Interestingly, SLPs of the aggregative strains are 100% homologous, although these strains were isolated from different kefir grains. This knowledge provides relevant data for better understanding of the mechanisms involved in SLPs functionality and could contribute to the development of products of biotechnological interest from potentially probiotic bacteria.

  17. Protein sequence alignment with family-specific amino acid similarity matrices

    PubMed Central

    2011-01-01

    Background Alignment of amino acid sequences by means of dynamic programming is a cornerstone sequence comparison method. The quality of alignments produced by dynamic programming critically depends on the choice of the alignment scoring function. Therefore, for a specific alignment problem one needs a way of selecting the best performing scoring function. This work is focused on the issue of finding optimized protein family- and fold-specific scoring functions for global similarity matrix-based sequence alignment. Findings I utilize a comprehensive set of reference alignments obtained from structural superposition of homologous and analogous proteins to design a quantitative statistical framework for evaluating the performance of alignment scoring functions in global pairwise sequence alignment. This framework is applied to study how existing general-purpose amino acid similarity matrices perform on individual protein families and structural folds, and to compare them to family-specific and fold-specific matrices derived in this work. I describe an adaptive alignment procedure that automatically selects an appropriate similarity matrix and optimized gap penalties based on the properties of the sequences being aligned. Conclusions The results of this work indicate that using family-specific similarity matrices significantly improves the quality of the alignment of homologous sequences over the traditional sequence alignment based on a single general-purpose similarity matrix. However, using fold-specific similarity matrices can only marginally improve sequence alignment of proteins that share the same structural fold but do not share a common evolutionary origin. The family-specific matrices derived in this work and the optimized gap penalties are available at http://taurus.crc.albany.edu/fsm. PMID:21846354

  18. Common recognition principles across diverse sequence and structural families of sialic acid binding proteins.

    PubMed

    Bhagavat, Raghu; Chandra, Nagasuma

    2014-01-01

    Sialic acids form a large family of 9-carbon monosaccharides and are integral components of glycoconjugates. They are known to bind to a wide range of receptors belonging to diverse sequence families and fold classes and are key mediators in a plethora of cellular processes. Thus, it is of great interest to understand the features that give rise to such a recognition capability. Structural analyses using a non-redundant data set of known sialic acid binding proteins was carried out, which included exhaustive binding site comparisons and site alignments using in-house algorithms, followed by clustering and tree computation, which has led to derivation of sialic acid recognition principles. Although the proteins in the data set belong to several sequence and structure families, their binding sites could be grouped into only six types. Structural comparison of the binding sites indicates that all sites contain one or more different combinations of key structural features over a common scaffold. The six binding site types thus serve as structural motifs for recognizing sialic acid. Scanning the motifs against a non-redundant set of binding sites from PDB indicated the motifs to be specific for sialic acid recognition. Knowledge of determinants obtained from this study will be useful for detecting function in unknown proteins. As an example analysis, a genome-wide scan for the motifs in structures of Mycobacterium tuberculosis proteome identified 17 hits that contain combinations of the features, suggesting a possible function of sialic acid binding by these proteins.

  19. Experimental and computational investigation of acetic acid deoxygenation over oxophilic molybdenum carbide: Surface chemistry and active site identity

    SciTech Connect

    Schaidle, Joshua A.; Blackburn, Jeffrey; Farberow, Carrie A.; Nash, Connor; Steirer, K. Xerxes; Clark, Jared; Robichaud, David J.; Ruddy, Daniel A.

    2016-01-21

    Ex situ catalytic fast pyrolysis (CFP) is a promising route for producing fungible biofuels; however, this process requires bifunctional catalysts that favor C–O bond cleavage, activate hydrogen at near atmospheric pressure and high temperature (350–500 °C), and are stable under high-steam, low hydrogen-to-carbon environments. Recently, early transition-metal carbides have been reported to selectively cleave C–O bonds of alcohols, aldehydes, and oxygenated aromatics, yet there is limited understanding of the metal carbide surface chemistry under reaction conditions and the identity of the active sites for deoxygenation. In this study, we evaluated molybdenum carbide (Mo2C) for the deoxygenation of acetic acid, an abundant component of biomass pyrolysis vapors, under ex situ CFP conditions, and we probed the Mo2C surface chemistry, identity of the active sites, and deoxygenation pathways using in situ diffuse reflectance infrared Fourier transform spectroscopy (DRIFTS), X-ray photoelectron spectroscopy (XPS), and density functional theory (DFT) calculations.

  20. Experimental and computational investigation of acetic acid deoxygenation over oxophilic molybdenum carbide: Surface chemistry and active site identity

    DOE PAGES

    Schaidle, Joshua A.; Blackburn, Jeffrey; Farberow, Carrie A.; ...

    2016-01-21

    Ex situ catalytic fast pyrolysis (CFP) is a promising route for producing fungible biofuels; however, this process requires bifunctional catalysts that favor C–O bond cleavage, activate hydrogen at near atmospheric pressure and high temperature (350–500 °C), and are stable under high-steam, low hydrogen-to-carbon environments. Recently, early transition-metal carbides have been reported to selectively cleave C–O bonds of alcohols, aldehydes, and oxygenated aromatics, yet there is limited understanding of the metal carbide surface chemistry under reaction conditions and the identity of the active sites for deoxygenation. In this study, we evaluated molybdenum carbide (Mo2C) for the deoxygenation of acetic acid, anmore » abundant component of biomass pyrolysis vapors, under ex situ CFP conditions, and we probed the Mo2C surface chemistry, identity of the active sites, and deoxygenation pathways using in situ diffuse reflectance infrared Fourier transform spectroscopy (DRIFTS), X-ray photoelectron spectroscopy (XPS), and density functional theory (DFT) calculations.« less

  1. Myoglobins of cartilaginous fishes III. Amino acid sequence of myoglobin of the shark Galeorhinus australis.

    PubMed

    Fisher, W K; Koureas, D D; Thompson, E O

    1981-01-01

    Myoglobin isolated from the red muscle of the school shark Galeorhinus australis was purified by gel filtration and ion-exchange chromatography. The amino acid sequence was determined following digestion with trypsin and purification of the peptides by paper ionophoresis and chromatography. Sequences of purified peptides were determined by the dansyl-Edman procedure and the peptides aligned by homology with the sequence of the myoglobin of the gummy shark Mustelus antarcticus. The two myoglobin sequences showed a marked similarity (16 differences), but both sequences showed approximately the same number of differences (68) from myoglobin of the Port Jackson shark Heterodontus portusjacksoni. There are 19 residues unique to three shark myoglobin sequences. As found with other fish myoglobins there are 148 residues with deletions of four residues at the amino terminal end as well as one residue in the CD region. The amino terminal residue is acetylated. The distal E7 histidine residue was found to be replaced by glutamine, as only previously reported for the myoglobin sequence of gummy shark.

  2. N-terminal amino acid sequence of proalbumin from inbred buffalo rats.

    PubMed

    Millership, A; Edwards, K; Chelladurai, M; Dryburgh, H; Inglis, A S; Urban, J; Schreiber, G

    1980-03-01

    The sequence of radioactively labelled amino acids at the N-terminus of proalbumin was determined by automated Edman-degradation. [3H] Valine, [3H]phenylalanine or [14C]arginine was incorporated into protein in vivo for a time period of 10 min after injection. Since albumin remains unlabelled during this time period (Urban et al., 1976), separation of proalbumin and albumin was not required for this work. Hence, compared to previous methods, a shorter purification procedure could be used which increased the yield of anti-albumin-precipitable protein and reduced the risk of proteolysis. Microsomes were prepared from livers removed 10 min after injection of the radioactively labelled amino acids. A buffer extract of the acetone-dried powder from these microsomes was chromatographed on DEAE-cellulose. All protein obtained after chromatography which could be precipitated with antiserum to serum albumin was isolated by immunoprecipitation and subsequent separation of the antigen-antibody complex. The sequence of radioactive amino acids in this antigen preparation suggests that about 20-25% of proalbumin possessed at the N-terminus the pentapeptide sequence X-Val-Phe-Arg-Arg- whereas 75-80% contained the hexapeptide sequence Arg-X-Val-Phe-Arg-Arg-.

  3. Haemoglobins of the shark, Heterodontus portusjacksoni II. Amino acid sequence of the alpha-chain.

    PubMed

    Nash, A R; Fisher, W K; Thompson, E O

    1976-03-01

    The amino acid sequence of the alpha-chain of the principal haemoglobin from the shark, H. portusjacksoni has been determined. The chain has 148 residues and is acetylated at the amino terminal. The soluble peptides obtained by tryptic and chymotryptic digestion of the protein or its cyanogen bromide fragments were isolated by gel filtration, paper ionophoresis and paper chromatography. The amino acid sequences were determined by the dansyl-Edman procedure. The insoluble "core" peptide from the tryptic digestion contained 34 residues and required cleavage by several prosteases before the sequence was established. Compared with human alpha-chain there are 88 amino acid differences including the additional seven residues which appear on the amino terminal of the shark chain. There is also one deletion and one insertion. The chain contains no tryptophan but has four cysteinyl residues which is the highest number of such residues recorded for a vertebrate globin. In the alpha1beta1 contact sites there are four changes in the oxyhaemoglobin form and six deoxy form. Nine of the 16, alpha1beta1 contact sites show variation while three of the haem contact sites have changed in comparison to the residues known to be involved in these interactions in horse haemoglobin alpha-chain. Use of the sequence data to estimate a time of divergence of the shark from the main vertebrate line yielded the value of 410 +/- 46 million years. The data, in general, support the palaeontological view that bony fishes arose before the elasmobranchs.

  4. Immunoreactivity of polyclonal antibodies generated against the carboxy terminus of the predicted amino acid sequence of the Huntington disease gene

    SciTech Connect

    Alkatib, G.; Graham, R.; Pelmear-Telenius, A.

    1994-09-01

    A cDNA fragment spanning the 3{prime}-end of the Huntington disease gene (from 8052 to 9252) was cloned into a prokaryotic expression vector containing the E. Coli lac promoter and a portion of the coding sequence for {beta}-galactosidase. The truncated {beta}-galactosidase gene was cleaved with BamHl and fused in frame to the BamHl fragment of the Huntington disease gene 3{prime}-end. Expression analysis of proteins made in E. Coli revealed that 20-30% of the total cellular proteins was represented by the {beta}-galactosidase-huntingtin fusion protein. The identity of the Huntington disease protein amino acid sequences was confirmed by protein sequence analysis. Affinity chromatography was used to purify large quantities of the fusion protein from bacterial cell lysates. Affinity-purified proteins were used to immunize New Zealand white rabbits for antibody production. The generated polyclonal antibodies were used to immunoprecipitate the Huntington disease gene product expressed in a neuroblastoma cell line. In this cell line the antibodies precipitated two protein bands of apparent gel migrations of 200 and 150 kd which together, correspond to the calculated molecular weight of the Huntington disease gene product (350 kd). Immunoblotting experiments revealed the presence of a large precursor protein in the range of 350-750 kd which is in agreement with the predicted molecular weight of the protein without post-translational modifications. These results indicate that the huntingtin protein is cleaved into two subunits in this neuroblastoma cell line and implicate that cleavage of a large precursor protein may contribute to its biological activity. Experiments are ongoing to determine the precursor-product relationship and to examine the synthesis of the huntingtin protein in freshly isolated rat brains, and to determine cellular and subcellular distribution of the gene product.

  5. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Patel, Kamlesh D [Ken; SNL,

    2016-07-12

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology " at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  6. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    SciTech Connect

    Patel, Kamlesh D; SNL,

    2012-06-01

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology " at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  7. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids

    PubMed Central

    Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as ‘FPKATD’ and ‘Y/FTNEKL’ without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids’ pattern in different proteins. PMID:27930687

  8. Complete Amino Acid Sequence of a Copper/Zinc-Superoxide Dismutase from Ginger Rhizome.

    PubMed

    Nishiyama, Yuki; Fukamizo, Tamo; Yoneda, Kazunari; Araki, Tomohiro

    2017-04-01

    Superoxide dismutase (SOD) is an antioxidant enzyme protecting cells from oxidative stress. Ginger (Zingiber officinale) is known for its antioxidant properties, however, there are no data on SODs from ginger rhizomes. In this study, we purified SOD from the rhizome of Z. officinale (Zo-SOD) and determined its complete amino acid sequence using N terminal sequencing, amino acid analysis, and de novo sequencing by tandem mass spectrometry. Zo-SOD consists of 151 amino acids with two signature Cu/Zn-SOD motifs and has high similarity to other plant Cu/Zn-SODs. Multiple sequence alignment showed that Cu/Zn-binding residues and cysteines forming a disulfide bond, which are highly conserved in Cu/Zn-SODs, are also present in Zo-SOD. Phylogenetic analysis revealed that plant Cu/Zn-SODs clustered into distinct chloroplastic, cytoplasmic, and intermediate groups. Among them, only chloroplastic enzymes carried amino acid substitutions in the region functionally important for enzymatic activity, suggesting that chloroplastic SODs may have a function distinct from those of SODs localized in other subcellular compartments. The nucleotide sequence of the Zo-SOD coding region was obtained by reverse-translation, and the gene was synthesized, cloned, and expressed. The recombinant Zo-SOD demonstrated pH stability in the range of 5-10, which is similar to other reported Cu/Zn-SODs, and thermal stability in the range of 10-60 °C, which is higher than that for most plant Cu/Zn-SODs but lower compared to the enzyme from a Z. officinale relative Curcuma aromatica.

  9. Studies on adenosine triphosphate transphosphorylases. Amino acid sequence of rabbit muscle ATP-AMP transphosphorylase.

    PubMed

    Kuby, S A; Palmieri, R H; Frischat, A; Fischer, A H; Wu, L H; Maland, L; Manship, M

    1984-05-22

    The total amino acid sequence of rabbit muscle adenylate kinase has been determined, and the single polypeptide chain of 194 amino acid residues starts with N-acetylmethionine and ends with leucyllysine at its carboxyl terminus, in agreement with the earlier data on its amino acid composition [Mahowald, T. A., Noltmann, E. A., & Kuby, S. A. (1962) J. Biol. Chem. 237, 1138-1145] and its carboxyl-terminus sequence [Olson, O. E., & Kuby, S. A. (1964) J. Biol. Chem. 239, 460-467]. Elucidation of the primary structure was based on tryptic and chymotryptic cleavages of the performic acid oxidized protein, cyanogen bromide cleavages of the 14C-labeled S-carboxymethylated protein at its five methionine sites (followed by maleylation of peptide fragments), and tryptic cleavages at its 12 arginine sites of the maleylated 14C-labeled S-carboxymethylated protein. Calf muscle myokinase, whose sequence has also been established, differs primarily from the rabbit muscle myokinase's sequence in the following: His-30 is replaced by Gln-30; Lys-56 is replaced by Met-56; Ala-84 and Asp 85 are replaced by Val-84 and Asn-85. A comparison of the four muscle-type adenylate kinases, whose covalent structures have now been determined, viz., rabbit, calf, porcine, and human [for the latter two sequences see Heil, A., Müller, G., Noda, L., Pinder, T., Schirmer, H., Schirmer, I., & Von Zabern, I. (1974) Eur. J. Biochem. 43, 131-144, and Von Zabern, I., Wittmann-Liebold, B., Untucht-Grau, R., Schirmer, R. H., & Pai, E. F. (1976) Eur. J. Biochem. 68, 281-290], demonstrates an extraordinary degree of homology.(ABSTRACT TRUNCATED AT 250 WORDS)

  10. Role of the two-component leader sequence and mature amino acid sequences in extracellular export of endoglucanase EGL from Pseudomonas solanacearum.

    PubMed Central

    Huang, J Z; Schell, M A

    1992-01-01

    The egl gene of Pseudomonas solanacearum encodes a 43-kDa extracellular endoglucanase (mEGL) involved in wilt disease caused by this phytopathogen. Egl is initially translated with a 45-residue, two-part leader sequence. The first 19 residues are apparently removed by signal peptidase II during export of Egl across the inner membrane (IM); the remaining residues of the leader sequence (modified with palmitate) are removed during export across the outer membrane (OM). Localization of Egl-PhoA fusion proteins showed that the first 26 residues of the Egl leader sequence are required and sufficient to direct lipid modification, processing, and export of Egl or PhoA across the IM but not the OM. Fusions of the complete 45-residue leader sequence or of the leader and increasing portions of mEgl sequences to PhoA did not cause its export across the OM. In-frame deletion of portions of mEGL-coding sequences blocked export of the truncated polypeptides across the OM without affecting export across the IM. These results indicate that the first part of the leader sequence functions independently to direct export of Egl across the IM while the second part and sequences and structures in mEGL are involved in export across the OM. Computer analysis of the mEgl amino acid sequence obtained from its nucleotide sequence identified a region of mEGL similar in amino acid sequence to regions in other prokaryotic endoglucanases. Images PMID:1735723

  11. The amino acid sequence of Neurospora NADP-specific glutamate dehydrogenase. Peptic and chymotryptic peptides and the complete sequence.

    PubMed Central

    Holder, A A; Wootton, J C; Baron, A J; Chambers, G K; Fincham, J R

    1975-01-01

    Peptic and chymotryptic peptides were isolated form the NADP-specific glutamate dehydrogenase of Neurospora crassa and substantially sequenced. Out of 452 residues in the polypeptide chain, 265 were recovered in the peptic and 427 in the chymotryptic peptides. Together with the tryptic peptides [Wootton, J. C., Taylor, J. G., Jackson, A. A., Chambers, G. K. & Fincham, J. R. S. (1975) Biochem. J. 149, 749-755], these establish the complete sequence of the chain, including the acid and amide assignments, except for seven places where overlaps are inadequate. These remaining alignments are deduced from information on the CNBr fragments obtained in another laboratory [Blumenthal, K. M., Moon, K. & Smith, E. L. (1975), J. Biol. Chem. 250, 3644-3654]. Further information has been deposited as Supplementary Publication SUP 50054 (17 pages) with the British Library (Lending Division), Boston Spa, Wetherby, W. Yorkshire LS23 7BQ, U.K., from whom copies may be obtained under the terms given in Biochem. J. (1975) 145, 5. PMID:1002

  12. The complete amino acid sequence of a trypsin inhibitor from Bauhinia variegata var. candida seeds.

    PubMed

    Di Ciero, L; Oliva, M L; Torquato, R; Köhler, P; Weder, J K; Camillo Novello, J; Sampaio, C A; Oliveira, B; Marangoni, S

    1998-11-01

    Trypsin inhibitors of two varieties of Bauhinia variegata seeds have been isolated and characterized. Bauhinia variegata candida trypsin inhibitor (BvcTI) and B. variegata lilac trypsin inhibitor (BvlTI) are proteins with Mr of about 20,000 without free sulfhydryl groups. Amino acid analysis shows a high content of aspartic acid, glutamic acid, serine, and glycine, and a low content of histidine, tyrosine, methionine, and lysine in both inhibitors. Isoelectric focusing for both varieties detected three isoforms (pI 4.85, 5.00, and 5.15), which were resolved by HPLC procedure. The trypsin inhibitors show Ki values of 6.9 and 1.2 nM for BvcTI and BvlTI, respectively. The N-terminal sequences of the three trypsin inhibitor isoforms from both varieties of Bauhinia variegata and the complete amino acid sequence of B. variegata var. candida L. trypsin inhibitor isoform 3 (BvcTI-3) are presented. The sequences have been determined by automated Edman degradation of the reduced and carboxymethylated proteins of the peptides resulting from Staphylococcus aureus protease and trypsin digestion. BvcTI-3 is composed of 167 residues and has a calculated molecular mass of 18,529. Homology studies with other trypsin inhibitors show that BvcTI-3 belongs to the Kunitz family. The putative active site encompasses Arg (63)-Ile (64).

  13. Complete nucleic acid sequence of Penaeus stylirostris densovirus (PstDNV) from India.

    PubMed

    Rai, Praveen; Safeena, Muhammed P; Karunasagar, Iddya; Karunasagar, Indrani

    2011-06-01

    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) of shrimp, recently been classified as Penaeus stylirostris densovirus (PstDNV). The complete nucleic acid sequence of PstDNV from India was obtained by cloning and sequencing of different DNA fragment of the virus. The genome organisation of PstDNV revealed that there were three major coding domains: a left ORF (NS1) of 2001 bp, a mid ORF (NS2) of 1092 bp and a right ORF (VP) of 990 bp. The complete genome and amino acid sequences of three proteins viz., NS1, NS2 and VP were compared with the genomes of the virus reported from Hawaii, China and Mexico and with partial sequence available from isolates from different regions. The phylogenetic analysis of shrimp, insect and vertebrate parvovirus sequences showed that the Indian PstDNV isolate is phylogenetically more closely related to one of the three isolates from Taiwan (AY355307), and two isolates (AY362547 and AY102034) from Thailand.

  14. Determination of the complete amino acid sequence for the coat protein of brome mosaic virus by time-of-flight mass spectrometry. Evidence for mutations associated with change of propagation host.

    PubMed

    She, Y M; Haber, S; Seifers, D L; Loboda, A; Chernushevich, I; Perreault, H; Ens, W; Standing, K G

    2001-06-08

    Time-of-flight mass spectrometry (TOFMS) has been applied to determine the complete coat protein amino acid sequences of a number of distinct brome mosaic virus (BMV) isolates. Ionization was carried out by both electrospray ionization and matrix-assisted laser desorption/ionization (MALDI). After determining overall coat protein masses, the proteins were digested with trypsin or Lys-C proteinases, and the digestion products were analyzed in a MALDI QqTOF mass spectrometer. The N terminus of the coat protein was found to be acetylated in each BMV isolate analyzed. In one isolate (BMV-Valverde), the amino acid sequence was identical to that predicted from the cDNA sequence of the "type" isolate, but deviations from the predicted amino acid sequence were observed for all the other isolates analyzed. When isolates were propagated in different host taxa, modified coat protein sequences were observed in some cases, along with the original sequence. Sequencing by TOFMS may therefore provide a basis for monitoring the effects of host passaging on a virus at the molecular level. Such TOFMS-based analyses assess the complete profiles of coat protein sequences actually present in infected tissues. They are therefore not subject to the selection biases inherent in deducing such sequences from reverse-transcribed viral RNA and cloning the resulting cDNA.

  15. Molecular cytogenetics by polymerase catalyzed amplification or in situ labelling of specific nucleic acid sequences

    SciTech Connect

    Bolund, L.; Brandt, C.; Hindkjaer, J.; Koch, J.; Koelvraa, S.; Pedersen, S. )

    1993-01-01

    The Polymerase Chain Reaction (PCR) can be performed on isolated cells or chromosomes and the product can be analyzed by DNA technology or by FISH to test metaphases. The authors have good experiences analyzing aberrant chromosomes by FACS sorting, PCR with degenerated primers and painting of test metaphases with the PCR product. They also utilize polymerases for PRimed IN Situ labelling (PRINS) of specific nucleic acid sequences. In PRINS oligonucleotides are hybridized to their target sequences and labeled nucleotides are incorporated at the site of hybridization with the oligonucleotide as primer. PRINS may eventually allow the study of individual genes, gene expression and even somatic mutations (in mRNA) in single cells.

  16. DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

    NASA Astrophysics Data System (ADS)

    Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

    1984-08-01

    A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.

  17. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

    PubMed Central

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-01-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed). PMID:22638583

  18. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion.

    PubMed

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-07-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed).

  19. Towards a taxonomic coherence between average nucleotide identity and 16S rRNA gene sequence similarity for species demarcation of prokaryotes.

    PubMed

    Kim, Mincheol; Oh, Hyun-Seok; Park, Sang-Cheol; Chun, Jongsik

    2014-02-01

    Among available genome relatedness indices, average nucleotide identity (ANI) is one of the most robust measurements of genomic relatedness between strains, and has great potential in the taxonomy of bacteria and archaea as a substitute for the labour-intensive DNA-DNA hybridization (DDH) technique. An ANI threshold range (95-96%) for species demarcation had previously been suggested based on comparative investigation between DDH and ANI values, albeit with rather limited datasets. Furthermore, its generality was not tested on all lineages of prokaryotes. Here, we investigated the overall distribution of ANI values generated by pairwise comparison of 6787 genomes of prokaryotes belonging to 22 phyla to see whether the suggested range can be applied to all species. There was an apparent distinction in the overall ANI distribution between intra- and interspecies relationships at around 95-96% ANI. We went on to determine which level of 16S rRNA gene sequence similarity corresponds to the currently accepted ANI threshold for species demarcation using over one million comparisons. A twofold cross-validation statistical test revealed that 98.65% 16S rRNA gene sequence similarity can be used as the threshold for differentiating two species, which is consistent with previous suggestions (98.2-99.0%) derived from comparative studies between DDH and 16S rRNA gene sequence similarity. Our findings should be useful in accelerating the use of genomic sequence data in the taxonomy of bacteria and archaea.

  20. Cloning and Sequence Analysis of cDNAs Encoding Two Acidic PLA(2) from venom of Ophiophagus hannah(King Cobra), Guangxi Species.

    PubMed

    Wang, Qiu-Yan; Shu, Yu-Yan; Zhuang, Mao-Xing; Lin, Zheng-Jiong

    2001-01-01

    Total RNA was extracted from venom glands of Ophiophagus hannah, Guangxi species. The cDNAs encoding PLA(2) were amplified by RT-PCR and cloned into the PUCm-T vector. The positive clones encoding two acidic PLA(2) (APLA(2)-1 and APLA(2)-2) were selected and bidirectionally sequenced. Their complete amino acid sequences were deduced and found to be identical to the known amino acid sequences. Their isoelectric points calculated by computer agreed with the values determined with their protein. Homology analysis indicated that the mature peptide of APLA(2)-1 had high homology with PLA(2) from venoms of Ophiophagus hannah, Fujian and Taiwan species, but APLA(2)-2 had lower homology. The most striking difference between APLA(2)-2 and other PLA(2) from Ophiophagus hannah venoms is the missing of a extra "pancreatic loop" at residues 62--66 in APLA(2)-2, and it may be related to their species evolution and biological activity.

  1. Nucleotide and deduced amino acid sequences of rat myosin binding protein H (MyBP-H).

    PubMed

    Jung, J; Oh, J; Lee, K

    1998-12-01

    The complete nucleotide sequence of the cDNA clone encoding rat skeletal muscle myosin-binding protein H (MyBP-H) was determined and amino acid sequence was deduced from the nucleotide sequence (GenBank accession number AF077338). The full-length cDNA of 1782 base pairs(bp) contains a single open reading frame of 1454 bp encoding a rat MyBP-H protein of the predicted molecular mass 52.7 kDa and includes the common consensus 'CA__TG' protein binding motif. The cDNA sequence of rat MyBP-H show 92%, 84% and 41% homology with those of mouse, human and chicken, respectively. The protein contains tandem internal motifs array (-FN III-Ig C2-FN III-Ig C2-) in the C-terminal region which resembles to the immunoglobulin superfamily C2 and fibronectin type III motifs. The amino acid sequence of the C-terminal Ig C2 was highly conserved among MyBPs family and other thick filament binding proteins, suggesting that the C-terminal Ig C2 might play an important role in its function. All proteins belonging to MyBP-H member contains 'RKPS' sequence which is assumed to be cAMP- and cGMP-dependent protein kinase A phosphorylation site. Computer analysis of the primary sequence of rat MyBP-H predicted 11 protein kinase C (PKC) phosphorylation site, 7 casein kinase II (CK2) phosphorylation site and 4 N-myristoylation site.

  2. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

  3. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.

  4. Analysis of the complete sequences of two biologically distinct Zucchini yellow mosaic virus isolates further evidences the involvement of a single amino acid in the virus pathogenicity.

    PubMed

    Nováková, S; Svoboda, J; Glasa, M

    2014-01-01

    The complete genome sequences of two Slovak Zucchini yellow mosaic virus isolates (ZYMV-H and ZYMV-SE04T) were determined. These isolates differ significantly in their pathogenicity, producing either severe or very mild symptoms on susceptible cucurbit hosts. The viral genome of both isolates consisted of 9593 nucleotides in size, and contained an open reading frame encoding a single polyprotein of 3080 amino acids. Despite their different biological properties, an extremely high nucleotide identity could be noted (99.8%), resulting in differences of only 5 aa, located in the HC-Pro, P3, and NIb, respectively. In silico analysis including 5 additional fully-sequenced and phylogenetically closely-related isolates known to induce different symptoms in cucurbits was performed. This suggested that the key single mutation responsible for virus pathogenicity is likely located in the N-terminal part of P3, adjacent to the PIPO.

  5. Sequence-specific thermodynamic properties of nucleic acids influence both transcriptional pausing and backtracking in yeast

    PubMed Central

    2017-01-01

    RNA Polymerase II pauses and backtracks during transcription, with many consequences for gene expression and cellular physiology. Here, we show that the energy required to melt double-stranded nucleic acids in the transcription bubble predicts pausing in Saccharomyces cerevisiae far more accurately than nucleosome roadblocks do. In addition, the same energy difference also determines when the RNA polymerase backtracks instead of continuing to move forward. This data-driven model corroborates—in a genome wide and quantitative manner—previous evidence that sequence-dependent thermodynamic features of nucleic acids influence both transcriptional pausing and backtracking. PMID:28301878

  6. Respiratory syncytial virus fusion glycoprotein: nucleotide sequence of mRNA, identification of cleavage activation site and amino acid sequence of N-terminus of F1 subunit.

    PubMed Central

    Elango, N; Satake, M; Coligan, J E; Norrby, E; Camargo, E; Venkatesan, S

    1985-01-01

    The amino acid sequence of respiratory syncytial virus fusion protein (Fo) was deduced from the sequence of a partial cDNA clone of mRNA and from the 5' mRNA sequence obtained by primer extension and dideoxysequencing. The encoded protein of 574 amino acids is extremely hydrophobic and has a molecular weight of 63371 daltons. The site of proteolytic cleavage within this protein was accurately mapped by determining a partial amino acid sequence of the N-terminus of the larger subunit (F1) purified by radioimmunoprecipitation using monoclonal antibodies. Alignment of the N-terminus of the F1 subunit within the deduced amino acid sequence of Fo permitted us to identify a sequence of lys-lys-arg-lys-arg-arg at the C-terminus of the smaller N-terminal F2 subunit that appears to represent the cleavage/activation domain. Five potential sites of glycosylation, four within the F2 subunit, were also identified. Three extremely hydrophobic domains are present in the protein; a) the N-terminal signal sequence, b) the N-terminus of the F1 subunit that is analogous to the N-terminus of the paramyxovirus F1 subunit and the HA2 subunit of influenza virus hemagglutinin, and c) the putative membrane anchorage domain near the C-terminus of F1. Images PMID:2987829

  7. The Complete Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis ssp. lactis IL1403

    PubMed Central

    Bolotin, Alexander; Wincker, Patrick; Mauger, Stéphane; Jaillon, Olivier; Malarme, Karine; Weissenbach, Jean; Ehrlich, S. Dusko; Sorokin, Alexei

    2001-01-01

    Lactococcus lactis is a nonpathogenic AT-rich gram-positive bacterium closely related to the genus Streptococcus and is the most commonly used cheese starter. It is also the best-characterized lactic acid bacterium. We sequenced the genome of the laboratory strain IL1403, using a novel two-step strategy that comprises diagnostic sequencing of the entire genome and a shotgun polishing step. The genome contains 2,365,589 base pairs and encodes 2310 proteins, including 293 protein-coding genes belonging to six prophages and 43 insertion sequence (IS) elements. Nonrandom distribution of IS elements indicates that the chromosome of the sequenced strain may be a product of recent recombination between two closely related genomes. A complete set of late competence genes is present, indicating the ability of L. lactis to undergo DNA transformation. Genomic sequence revealed new possibilities for fermentation pathways and for aerobic respiration. It also indicated a horizontal transfer of genetic information from Lactococcus to gram-negative enteric bacteria of Salmonella-Escherichia group. [The sequence data described in this paper has been submitted to the GenBank data library under accession no. AE005176.] PMID:11337471

  8. Stereochemical Sequence Ion Selectivity: Proline versus Pipecolic-acid-containing Protonated Peptides

    NASA Astrophysics Data System (ADS)

    Abutokaikah, Maha T.; Guan, Shanshan; Bythell, Benjamin J.

    2017-01-01

    Substitution of proline by pipecolic acid, the six-membered ring congener of proline, results in vastly different tandem mass spectra. The well-known proline effect is eliminated and amide bond cleavage C-terminal to pipecolic acid dominates instead. Why do these two ostensibly similar residues produce dramatically differing spectra? Recent evidence indicates that the proton affinities of these residues are similar, so are unlikely to explain the result [Raulfs et al., J. Am. Soc. Mass Spectrom. 25, 1705-1715 (2014)]. An additional hypothesis based on increased flexibility was also advocated. Here, we provide a computational investigation of the "pipecolic acid effect," to test this and other hypotheses to determine if theory can shed additional light on this fascinating result. Our calculations provide evidence for both the increased flexibility of pipecolic-acid-containing peptides, and structural changes in the transition structures necessary to produce the sequence ions. The most striking computational finding is inversion of the stereochemistry of the transition structures leading to "proline effect"-type amide bond fragmentation between the proline/pipecolic acid-congeners: R (proline) to S (pipecolic acid). Additionally, our calculations predict substantial stabilization of the amide bond cleavage barriers for the pipecolic acid congeners by reduction in deleterious steric interactions and provide evidence for the importance of experimental energy regime in rationalizing the spectra.

  9. Sequence-specific nucleic acid detection from binary pore conductance measurement

    PubMed Central

    Esfandiari, Leyla; Monbouquette, Harold G.; Schmidt, Jacob J.

    2012-01-01

    We describe a platform for sequence-specific nucleic acid (NA) detection utilizing a micropipette tapered to a 2 μm diameter pore and 3 μm diameter polystyrene beads to which uncharged peptide nucleic acid (PNA) probe molecules have been conjugated. As the target NAs hybridize to the complementary PNA-beads, the beads acquire negative charge and become electrophoretically mobile. An applied electric field guides these NA-PNA-beads toward the pipette tip, which they obstruct, leading to an indefinite, electrically detectable, partial blockade of the pore. In the presence of non-complementary NA, even to the level of single base mismatch, permanent pore blockade is not seen. We show application of this platform to detection of the anthrax lethal factor sequence. PMID:22931376

  10. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  11. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  12. Amino acid sequence of atrial natriuretic peptides in human coronary sinus plasma.

    PubMed

    Yandle, T; Crozier, I; Nicholls, G; Espiner, E; Carne, A; Brennan, S

    1987-07-31

    Two atrial natriuretic peptides were purified from pooled human coronary sinus plasma by Sep-Pak extraction, immunoaffinity chromatography and reverse phase HPLC. The amino acid sequences of the two peptides were homologous with 99-126 human atrial natriuretic peptide (hANP) and 106-126 hANP, the latter being most probably linked to 99-105 ANP by the disulphide bond. The molar ratio of the peptides in plasma, as assessed by radioimmunoassay was 10:3.

  13. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein

    PubMed Central

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer. PMID:24147211

  14. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein.

    PubMed

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  15. Coding and 3' non-coding nucleotide sequence of chalcone synthase mRNA and assignment of amino acid sequence of the enzyme

    PubMed Central

    Reimold, Ursula; Kröger, Manfred; Kreuzaler, Fritz; Hahlbrock, Klaus

    1983-01-01

    The nucleotide sequence of an almost complete cDNA copy of chalcone synthase mRNA from cultured parsley cells (Petroselinum hortense) has been determined. The cDNA copy comprised the complete coding sequence for chalcone synthase, a short A-rich stretch of the 5' non-coding region and the complete 3' non-coding region including a poly(A) tail. The amino acid sequence deduced from the nucleotide sequence of the cDNA is consistent with a partial N-terminal sequence analysis, the total amino acid composition, the cyanogen bromide cleavage pattern, and the apparent mol. wt. of the subunit of the purified enzyme. PMID:16453477

  16. Novel Numerical Characterization of Protein Sequences Based on Individual Amino Acid and Its Application

    PubMed Central

    Zhang, Yan-ping; Sheng, Ya-jun; He, Ping-an; Ruan, Ji-shuo

    2015-01-01

    The hydrophobicity and hydrophilicity of amino acids play a very important role in protein folding and its interaction with the environment and other molecules, as well as its catalytic mechanism. Based on the two physicochemical indexes, a 2D graphical representation of protein sequences is introduced; meanwhile, a new numerical characteristic has been proposed to compute the distance of different sequences for analysis of sequence similarity/dissimilarity on the basis of this graphical representation. Furthermore, we apply the new distance in the similarities/dissimilarities of ND5 proteins of nine species and predict the four major classes based on the dataset containing 639 domains. The results show that the method is simple and effective. PMID:25705698

  17. Low-Pass Genome-Wide Sequencing and Variant Inference Using Identity-by-Descent in an Isolated Human Population

    PubMed Central

    Gusev, A.; Shah, M. J.; Kenny, E. E.; Ramachandran, A.; Lowe, J. K.; Salit, J.; Lee, C. C.; Levandowsky, E. C.; Weaver, T. N.; Doan, Q. C.; Peckham, H. E.; McLaughlin, S. F.; Lyons, M. R.; Sheth, V. N.; Stoffel, M.; De La Vega, F. M.; Friedman, J. M.; Breslow, J. L.

    2012-01-01

    Whole-genome sequencing in an isolated population with few founders directly ascertains variants from the population bottleneck that may be rare elsewhere. In such populations, shared haplotypes allow imputation of variants in unsequenced samples without resorting to complex statistical methods as in studies of outbred cohorts. We focus on an isolated population cohort from the Pacific Island of Kosrae, Micronesia, where we previously collected SNP array and rich phenotype data for the majority of the population. We report identification of long regions with haplotypes co-inherited between pairs of individuals and methodology to leverage such shared genetic content for imputation. Our estimates show that sequencing as few as 40 personal genomes allows for inference in up to 60% of the 3000-person cohort at the average locus. We ascertained a pilot data set of whole-genome sequences from seven Kosraean individuals, with average 5× coverage. This assay identified 5,735,306 unique sites of which 1,212,831 were previously unknown. Additionally, these variants are unusually enriched for alleles that are rare in other populations when compared to geographic neighbors (published Korean genome SJK). We used the presence of shared haplotypes between the seven Kosraen individuals to estimate expected imputation accuracy of known and novel homozygous variants at 99.6% and 97.3%, respectively. This study presents whole-genome analysis of a homogenous isolate population with emphasis on optimal rare variant inference. PMID:22135348

  18. Complete amino acid sequences of three proteinase inhibitors from white sword bean (Canavalia gladiata).

    PubMed

    Park, S S; Sumi, T; Ohba, H; Nakamura, O; Kimura, M

    2000-10-01

    Three major serine proteinase inhibitors (SBI-1, -2, and -3) were purified from the seeds of white sword bean (Canavalia gladiata) by FPLC and reversed-phase HPLC. The sequences of these inhibitors were established by automatic Edman degradation and TOF-mass spectrometry. SBI-1, -2, and -3 consisted of 72, 73, and 75 amino acid residues, with molecular masses of 7806.5, 7919.8, and 8163.4, respectively. The sequences of SBI-1 and -2 coincided with those of CLT I and II [Terada et al. (1994) Biosci. Biotech. Biochem., 58, 376-379] except only N- or C-terminal amino acid residues. Analysis of the amino acid sequences showed that the active sites of the inhibitors contained a Lys21-Ser22 against trypsin and Leu48-Ser49 against chymotrypsin, respectively. Further, it became apparent that about seven disulfide bonds were present. These results suggest that sword bean inhibitors are members of the Bowman-Birk proteinase inhibitor family.

  19. Random Amino Acid Mutations and Protein Misfolding Lead to Shannon Limit in Sequence-Structure Communication

    PubMed Central

    Lisewski, Andreas Martin

    2008-01-01

    The transmission of genomic information from coding sequence to protein structure during protein synthesis is subject to stochastic errors. To analyze transmission limits in the presence of spurious errors, Shannon's noisy channel theorem is applied to a communication channel between amino acid sequences and their structures established from a large-scale statistical analysis of protein atomic coordinates. While Shannon's theorem confirms that in close to native conformations information is transmitted with limited error probability, additional random errors in sequence (amino acid substitutions) and in structure (structural defects) trigger a decrease in communication capacity toward a Shannon limit at 0.010 bits per amino acid symbol at which communication breaks down. In several controls, simulated error rates above a critical threshold and models of unfolded structures always produce capacities below this limiting value. Thus an essential biological system can be realistically modeled as a digital communication channel that is (a) sensitive to random errors and (b) restricted by a Shannon error limit. This forms a novel basis for predictions consistent with observed rates of defective ribosomal products during protein synthesis, and with the estimated excess of mutual information in protein contact potentials. PMID:18769673

  20. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    PubMed

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment.

  1. The amino acid sequence of the aspartate aminotransferase from baker's yeast (Saccharomyces cerevisiae).

    PubMed Central

    Cronin, V B; Maras, B; Barra, D; Doonan, S

    1991-01-01

    1. The single (cytosolic) aspartate aminotransferase was purified in high yield from baker's yeast (Saccharomyces cerevisiae). 2. Amino-acid-sequence analysis was carried out by digestion of the protein with trypsin and with CNBr; some of the peptides produced were further subdigested with Staphylococcus aureus V8 proteinase or with pepsin. Peptides were sequenced by the dansyl-Edman method and/or by automated gas-phase methods. The amino acid sequence obtained was complete except for a probable gap of two residues as indicated by comparison with the structures of counterpart proteins in other species. 3. The N-terminus of the enzyme is blocked. Fast-atom-bombardment m.s. was used to identify the blocking group as an acetyl one. 4. Alignment of the sequence of the enzyme with those of vertebrate cytosolic and mitochondrial aspartate aminotransferases and with the enzyme from Escherichia coli showed that about 25% of residues are conserved between these distantly related forms. 5. Experimental details and confirmatory data for the results presented here are given in a Supplementary Publication (SUP 50164, 25 pages) that has been deposited at the British Library Document Supply Centre, Boston Spa. Wetherby, West Yorkshire LS23 7 BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1991) 273, 5. PMID:1859361

  2. Analysis of amino acid sequence variations and immunoglobulin E-binding epitopes of German cockroach tropomyosin.

    PubMed

    Jeong, Kyoung Yong; Lee, Jongweon; Lee, In-Yong; Ree, Han-Il; Hong, Chein-Soo; Yong, Tai-Soon

    2004-09-01

    The allergenicities of tropomyosins from different organisms have been reported to vary. The cDNA encoding German cockroach tropomyosin (Bla g 7) was isolated, expressed, and characterized previously. In the present study, the amino acid sequence variations in German cockroach tropomyosin were analyzed in order to investigate its influence on allergenicity. We also undertook the identification of immunodominant peptides containing immunoglobulin E (IgE) epitopes which may facilitate the development of diagnostic and immunotherapeutic strategies based on the recombinant proteins. Two-dimensional gel electrophoresis and immunoblot analysis with mouse anti-recombinant German cockroach tropomyosin serum was performed to investigate the isoforms at the protein level. Reverse transcriptase PCR (RT-PCR) was applied to examine the sequence diversity. Eleven different variants of the deduced amino acid sequences were identified by RT-PCR. German cockroach tropomyosin has only minor sequence variations that did not seem to affect its allergenicity significantly. These results support the molecular basis underlying the cross-reactivities of arthropod tropomyosins. Recombinant fragments were also generated by PCR, and IgE-binding epitopes were assessed by enzyme-linked immunosorbent assay. Sera from seven patients revealed heterogeneous IgE-binding responses. This study demonstrates multiple IgE-binding epitope regions in a single molecule, suggesting that full-length tropomyosin should be used for the development of diagnostic and therapeutic reagents.

  3. [MOLECULAR EVOLUTION OF ION CHANNELS: AMINO ACID SEQUENCES AND 3D STRUCTURES].

    PubMed

    Korkosh, V S; Zhorov, B S; Tikhonov, D B

    2016-01-01

    An integral part of modern evolutionary biology is comparative analysis of structure and function of macromolecules such as proteins. The first and critical step to understand evolution of homologous proteins is their amino acid sequence alignment. However, standard algorithms fop not provide unambiguous sequence alignments for proteins of poor homology. More reliable results can be obtained by comparing experimental 3D structures obtained at atomic resolution, for instance, with the aid of X-ray structural analysis. If such structures are lacking, homology modeling is used, which may take into account indirect experimental data on functional roles of individual amino-acid residues. An important problem is that the sequence alignment, which reflects genetic modifications, does not necessarily correspond to the functional homology. The latter depends on three-dimensional structures which are critical for natural selection. Since alignment techniques relying only on the analysis of primary structures carry no information on the functional properties of proteins, including 3D structures into consideration is very important. Here we consider several examples involving ion channels and demonstrate that alignment of their three-dimensional structures can significantly improve sequence alignments obtained by traditional methods.

  4. A proposal for a coherent mammalian histone H1 nomenclature correlated with amino acid sequences.

    PubMed

    Parseghian, M H; Henschen, A H; Krieglstein, K G; Hamkalo, B A

    1994-04-01

    Bio-Rex 70 chromatography was combined with reverse-phase (RP) HPLC to fractionate histone H1 zero and 4 histone H1 subtypes from human placental nuclei as previously described (Parseghian MH et al., 1993, Chromosome Res 1:127-139). After proteolytic digestion of the subtypes with Staphylococcus aureus V8 protease, peptides were fractionated by RP-HPLC and partially sequenced by Edman degradation in order to correlate them with human spleen subtypes (Ohe Y, Hayashi H, Iwai K, 1986, J Biochem (Tokyo) 100:359-368; 1989, J Biochem (Tokyo) 106:844-857). Based on comparisons with the sequence data available from other mammalian species, subtypes were grouped. These groupings were used to construct a coherent nomenclature for mammalian somatic H1s. Homologous subtypes possess characteristic patterns of growth-related and cAMP-dependent phosphorylation sites. The groupings defined by amino acid sequence also were used to correlate the elution profiles and electrophoretic mobilities of subtypes derived from different species. Previous attempts at establishing an H1 nomenclature by chromatographic or electrophoretic fractionations has resulted in several misidentifications. We present here, for the first time, a nomenclature for somatic H1s based on amino acid sequences that are analogous to those for H1 zero and H1t. The groupings defined should be useful in correlating the many observations regarding H1 subtypes in the literature.

  5. Complete amino acid sequence of a histidine-rich proteolytic fragment of human ceruloplasmin.

    PubMed

    Kingston, I B; Kingston, B L; Putnam, F W

    1979-04-01

    The complete amino acid sequence has been determined for a fragment of human ceruloplasmin [ferroxidase; iron(II):oxygen oxidoreductase, EC 1.16.3.1]. The fragment (designated Cp F5) contains 159 amino acid residues and has a molecular weight of 18,650; it lacks carbohydrate, is rich in histidine, and contains one free cysteine that may be part of a copper-binding site. This fragment is present in most commercial preparations of ceruloplasmin, probably owing to proteolytic degradation, but can also be obtained by limited cleavage of single-chain ceruloplasmin with plasmin. Cp F5 probably is an intact domain attached to the COOH-terminal end of single-chain ceruloplasmin via a labile interdomain peptide bond. A model of the secondary structure predicted by empirical methods suggests that almost one-third of the amino acid residues are distributed in alpha helices, about a third in beta-sheet structure, and the remainder in beta turns and unidentified structures. Computer analysis of the amino acid sequence has not demonstrated a statistically significant relationship between this ceruloplasmin fragment and any other protein, but there is some evidence for an internal duplication.

  6. Next generation sequencing of SNPs using the HID-Ion AmpliSeq™ Identity Panel on the Ion Torrent PGM™ platform.

    PubMed

    Guo, Fei; Zhou, Yishu; Song, He; Zhao, Jinling; Shen, Hongying; Zhao, Bin; Liu, Feng; Jiang, Xianhua

    2016-11-01

    The HID-Ion AmpliSeq™ Identity Panel (the HID Identity Panel) is designed to detect 124-plex single nucleotide polymorphisms (SNPs) with next generation sequencing (NGS) technology on the Ion Torrent PGM™ platform, including 90 individual identification SNPs (IISNPs) on autosomal chromosomes and 34 lineage informative SNPs (LISNPs) on Y chromosome. In this study, we evaluated performance for the HID Identity Panel to provide a reference for NGS-SNP application, focusing on locus strand balance, locus coverage balance, heterozygote balance, and background signals. Besides, several experiments were carried out to find out improvements and limitations of this panel, including studies of species specificity, repeatability and concordance, sensitivity, mixtures, case-type samples and degraded samples, population genetics and pedigrees following the Scientific Working Group on DNA Analysis Methods (SWGDAM) guidelines. In addition, Southern and Northern Chinese Han were investigated to assess applicability of this panel. Results showed this panel led to cross-reactivity with primates to some extent but rarely with non-primate animals. Repeatable and concordant genotypes could be obtained in triplicate with one exception at rs7520386. Full profiles could be obtained from 100pg input DNA, but the optimal input DNA would be 1ng-200pg with 21 initial PCR cycles. A sample with ≥20% minor contributor could be considered as a mixture by the number of homozygotes, and full profiles belonging to minor contributors could be detected between 9:1 and 1:9 mixtures with known reference profiles. Also, this assay could be used for case-type samples and degraded samples. For autosomal SNPs (A-SNPs), FST across all 90loci was not significantly different between Southern and Northern Chinese Han or between male and female samples. All A-SNP loci were independent in Chinese Han population. Except for 18loci with He <0.4, most of the A-SNPs in the HID Identity Panel presented high

  7. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    PubMed Central

    Rhee, Mun Su; Moritz, Brélan E.; Xie, Gary; Glavina del Rio, T.; Dalin, E.; Tice, H.; Bruce, D.; Goodwin, L.; Chertkov, O.; Brettin, T.; Han, C.; Detter, C.; Pitluck, S.; Land, Miriam L.; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, K. T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed. PMID:22675583

  8. Measuring nanometer distances in nucleic acids using a sequence-independent nitroxide probe

    PubMed Central

    Qin, Peter Z; Haworth, Ian S; Cai, Qi; Kusnetzow, Ana K; Grant, Gian Paola G; Price, Eric A; Sowa, Glenna Z; Popova, Anna; Herreros, Bruno; He, Honghang

    2008-01-01

    This protocol describes the procedures for measuring nanometer distances in nucleic acids using a nitroxide probe that can be attached to any nucleotide within a given sequence. Two nitroxides are attached to phosphorothioates that are chemically substituted at specific sites of DNA or RNA. Inter-nitroxide distances are measured using a four-pulse double electron–electron resonance technique, and the measured distances are correlated to the parent structures using a Web-accessible computer program. Four to five days are needed for sample labeling, purification and distance measurement. The procedures described herein provide a method for probing global structures and studying conformational changes of nucleic acids and protein/nucleic acid complexes. PMID:17947978

  9. BeadCons: detection of nucleic acid sequences by flow cytometry.

    PubMed

    Horejsh, Douglas; Martini, Federico; Capobianchi, Maria Rosaria

    2005-11-01

    Molecular beacons are single-stranded nucleic acid structures with a terminal fluorophore and a distal, terminal quencher. These molecules are typically used in real-time PCR assays, but have also been conjugated with solid matrices. This unit describes protocols related to molecular beacon-conjugated beads (BeadCons), whose specific hybridization with complementary target sequences can be resolved by cytometry. Assay sensitivity is achieved through the concentration of fluorescence signal on discrete particles. By using molecular beacons with different fluorophores and microspheres of different sizes, it is possible to construct a fluid array system with each bead corresponding to a specific target nucleic acid. Methods are presented for the design, construction, and use of BeadCons for the specific, multiplexed detection of unlabeled nucleic acids in solution. The use of bead-based detection methods will likely lead to the design of new multiplex molecular diagnostic tools.

  10. Complete amino acid sequence of globin chains and biological activity of fragmented crocodile hemoglobin (Crocodylus siamensis).

    PubMed

    Srihongthong, Saowaluck; Pakdeesuwan, Anawat; Daduang, Sakda; Araki, Tomohiro; Dhiravisit, Apisak; Thammasirirak, Sompong

    2012-08-01

    Hemoglobin, α-chain, β-chain and fragmented hemoglobin of Crocodylus siamensis demonstrated both antibacterial and antioxidant activities. Antibacterial and antioxidant properties of the hemoglobin did not depend on the heme structure but could result from the compositions of amino acid residues and structures present in their primary structure. Furthermore, thirteen purified active peptides were obtained by RP-HPLC analyses, corresponding to fragments in the α-globin chain and the β-globin chain which are mostly located at the N-terminal and C-terminal parts. These active peptides operate on the bacterial cell membrane. The globin chains of Crocodylus siamensis showed similar amino acids to the sequences of Crocodylus niloticus. The novel amino acid substitutions of α-chain and β-chain are not associated with the heme binding site or the bicarbonate ion binding site, but could be important through their interactions with membranes of bacteria.

  11. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1.

    PubMed

    Rhee, Mun Su; Moritz, Brélan E; Xie, Gary; Glavina Del Rio, T; Dalin, E; Tice, H; Bruce, D; Goodwin, L; Chertkov, O; Brettin, T; Han, C; Detter, C; Pitluck, S; Land, Miriam L; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O; Shanmugam, K T

    2011-12-31

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  12. Morphological Identities of Two Different Marine Stramenopile Environmental Sequence Clades: Bicosoeca kenaiensis (Hilliard, 1971) and Cantina marsupialis (Larsen and Patterson, 1990) gen. nov., comb. nov.

    PubMed

    Yubuki, Naoji; Pánek, Tomáš; Yabuki, Akinori; Čepička, Ivan; Takishita, Kiyotaka; Inagaki, Yuji; Leander, Brian S

    2015-01-01

    Although environmental DNA surveys improve our understanding of biodiversity, interpretation of unidentified lineages is limited by the absence of associated morphological traits and living cultures. Unidentified lineages of marine stramenopiles are called "MAST clades". Twenty-five MAST clades have been recognized: MAST-1 through MAST-25; seven of these have been subsequently discarded because the sequences representing those clades were found to either (1) be chimeric or (2) affiliate within previously described taxonomic groups. Eighteen MAST clades remain without a cellular identity. Moreover, the discarded "MAST-13" has been used in different studies to refer to two different environmental sequence clades. After establishing four cultures representing two different species of heterotrophic stramenopiles and then characterizing their morphology and molecular phylogenetic positions, we determined that the two different species represented the two different MAST-13 clades: (1) a lorica-bearing Bicosoeca kenaiensis and (2) a microaerophilic flagellate previously named "Cafeteria marsupialis". Both species were previously described with only light microscopy; no cultures, ultrastructural data or DNA sequences were available from these species prior to this study. The molecular phylogenetic position of three different "C. marsupialis" isolates was not closely related to the type species of Cafeteria; therefore, we established a new genus for these isolates, Cantina gen. nov.

  13. Evidence of Divergent Amino Acid Usage in Comparative Analyses of R5- and X4-Associated HIV-1 Vpr Sequences

    PubMed Central

    Antell, Gregory C.; Zhong, Wen; Kercher, Katherine; Passic, Shendra; Williams, Jean; Liu, Yucheng; James, Tony; Jacobson, Jeffrey M.; Szep, Zsofia

    2017-01-01

    Vpr is an HIV-1 accessory protein that plays numerous roles during viral replication, and some of which are cell type dependent. To test the hypothesis that HIV-1 tropism extends beyond the envelope into the vpr gene, studies were performed to identify the associations between coreceptor usage and Vpr variation in HIV-1-infected patients. Colinear HIV-1 Env-V3 and Vpr amino acid sequences were obtained from the LANL HIV-1 sequence database and from well-suppressed patients in the Drexel/Temple Medicine CNS AIDS Research and Eradication Study (CARES) Cohort. Genotypic classification of Env-V3 sequences as X4 (CXCR4-utilizing) or R5 (CCR5-utilizing) was used to group colinear Vpr sequences. To reveal the sequences associated with a specific coreceptor usage genotype, Vpr amino acid sequences were assessed for amino acid diversity and Jensen-Shannon divergence between the two groups. Five amino acid alphabets were used to comprehensively examine the impact of amino acid substitutions involving side chains with similar physiochemical properties. Positions 36, 37, 41, 89, and 96 of Vpr were characterized by statistically significant divergence across multiple alphabets when X4 and R5 sequence groups were compared. In addition, consensus amino acid switches were found at positions 37 and 41 in comparisons of the R5 and X4 sequence populations. These results suggest an evolutionary link between Vpr and gp120 in HIV-1-infected patients. PMID:28620613

  14. Common methods for fecal sample storage in field studies yield consistent signatures of individual identity in microbiome sequencing data

    PubMed Central

    Blekhman, Ran; Tang, Karen; Archie, Elizabeth A.; Barreiro, Luis B.; Johnson, Zachary P.; Wilson, Mark E.; Kohn, Jordan; Yuan, Michael L.; Gesquiere, Laurence; Grieneisen, Laura E.; Tung, Jenny

    2016-01-01

    Field studies of wild vertebrates are frequently associated with extensive collections of banked fecal samples—unique resources for understanding ecological, behavioral, and phylogenetic effects on the gut microbiome. However, we do not understand whether sample storage methods confound the ability to investigate interindividual variation in gut microbiome profiles. Here, we extend previous work on storage methods for gut microbiome samples by comparing immediate freezing, the gold standard of preservation, to three methods commonly used in vertebrate field studies: lyophilization, storage in ethanol, and storage in RNAlater. We found that the signature of individual identity consistently outweighed storage effects: alpha diversity and beta diversity measures were significantly correlated across methods, and while samples often clustered by donor, they never clustered by storage method. Provided that all analyzed samples are stored the same way, banked fecal samples therefore appear highly suitable for investigating variation in gut microbiota. Our results open the door to a much-expanded perspective on variation in the gut microbiome across species and ecological contexts. PMID:27528013

  15. Common methods for fecal sample storage in field studies yield consistent signatures of individual identity in microbiome sequencing data.

    PubMed

    Blekhman, Ran; Tang, Karen; Archie, Elizabeth A; Barreiro, Luis B; Johnson, Zachary P; Wilson, Mark E; Kohn, Jordan; Yuan, Michael L; Gesquiere, Laurence; Grieneisen, Laura E; Tung, Jenny

    2016-08-16

    Field studies of wild vertebrates are frequently associated with extensive collections of banked fecal samples-unique resources for understanding ecological, behavioral, and phylogenetic effects on the gut microbiome. However, we do not understand whether sample storage methods confound the ability to investigate interindividual variation in gut microbiome profiles. Here, we extend previous work on storage methods for gut microbiome samples by comparing immediate freezing, the gold standard of preservation, to three methods commonly used in vertebrate field studies: lyophilization, storage in ethanol, and storage in RNAlater. We found that the signature of individual identity consistently outweighed storage effects: alpha diversity and beta diversity measures were significantly correlated across methods, and while samples often clustered by donor, they never clustered by storage method. Provided that all analyzed samples are stored the same way, banked fecal samples therefore appear highly suitable for investigating variation in gut microbiota. Our results open the door to a much-expanded perspective on variation in the gut microbiome across species and ecological contexts.

  16. Position-dependent effects of locked nucleic acid (LNA) on DNA sequencing and PCR primers

    PubMed Central

    Levin, Joshua D.; Fiala, Dean; Samala, Meinrado F.; Kahn, Jason D.; Peterson, Raymond J.

    2006-01-01

    Genomes are becoming heavily annotated with important features. Analysis of these features often employs oligonucleotides that hybridize at defined locations. When the defined location lies in a poor sequence context, traditional design strategies may fail. Locked Nucleic Acid (LNA) can enhance oligonucleotide affinity and specificity. Though LNA has been used in many applications, formal design rules are still being defined. To further this effort we have investigated the effect of LNA on the performance of sequencing and PCR primers in AT-rich regions, where short primers yield poor sequencing reads or PCR yields. LNA was used in three positional patterns: near the 5′ end (LNA-5′), near the 3′ end (LNA-3′) and distributed throughout (LNA-Even). Quantitative measures of sequencing read length (Phred Q30 count) and real-time PCR signal (cycle threshold, CT) were characterized using two-way ANOVA. LNA-5′ increased the average Phred Q30 score by 60% and it was never observed to decrease performance. LNA-5′ generated cycle thresholds in quantitative PCR that were comparable to high-yielding conventional primers. In contrast, LNA-3′ and LNA-Even did not improve read lengths or CT. ANOVA demonstrated the statistical significance of these results and identified significant interaction between the positional design rule and primer sequence. PMID:17071964

  17. Amino-acid sequence data of beta-tubulin from Physarum polycephalum myxamoebae.

    PubMed

    Singhofer-Wowra, M; Clayton, L; Dawson, P; Gull, K; Little, M

    1986-12-15

    Starting with 7.7 mg of a beta-tubulin isolated from myxamoebae of the slime mould Physarum polycephalum, 90% of the sequence has been determined by the Edman degradation of peptides generated by cyanogen bromide, trypsin and Staphylococcus aureus protease. Differences to other beta-tubulins are mainly conservative and spread evenly throughout the chain except for a high concentration at the C-terminus. The Physarum beta-tubulin shows most homology to Chlamydomonas beta-tubulin (90.5%) and least homology to yeast beta-tubulin (S. cerevisiae, 73.4%). Two tryptic peptides were isolated in approximately equal quantities which were identical except in one position (S/ALTVPELTQRMFDA) showing that at least two beta-tubulins are present in myxamoebae. However, since this was the only heterogeneity found, these beta-tubulins are probably very similar.

  18. Microencapsulated sorbic acid and nature-identical compounds reduced Salmonella Hadar and Salmonella Enteritidis colonization in experimentally infected chickens.

    PubMed

    Grilli, E; Tugnoli, B; Formigoni, A; Massi, P; Fantinati, P; Tosi, G; Piva, A

    2011-08-01

    The reduction of Salmonella prevalence in broilers is a priority in European Union agricultural policies because treatment with antibiotics is forbidden by Regulation (EC) 2160/2003. Two trials were conducted to evaluate the efficacy of a microencapsulated blend of sorbic acid and nature-identical compounds (i.e., chemically synthesized botanicals; SAB) on the reduction of the cecal prevalence and contents of Salmonella enterica serovars Hadar and Enteritidis in experimentally infected chickens. In the first trial, 125 one-day-old Lohmann specific-pathogen-free chickens were assigned to one of the following treatments: negative control (not challenged and not treated), positive control (challenged and not treated), SAB0.3, SAB1, or SAB5 (challenged and treated with the microencapsulated blend included in the feed at 0.03, 0.1, or 0.5%, respectively). At 30 d of age, birds were infected with 10(6) cfu of Salmonella Hadar, and after 5, 10, or 20 d postinfection, 5, 10, and 10 birds per treatment, respectively, were killed and the cecal contents and liver and spleen samples were analyzed for Salmonella Hadar. In the second trial, 100 one-day-old Ross 708 chickens were assigned to 1 of 5 treatments: control (not treated), SAB0.3, SAB1, SAB2, or SAB5 (treated with the blend included in the feed at 0.03, 0.1, 0.2, or 0.5%, respectively). At 7 d of age, the birds were challenged with 10(5) cfu of Salmonella Enteritidis, and after 7, 14, or 24 d after challenge, 5, 5, and 10 birds per treatment, respectively, were killed and cecal contents were analyzed for Salmonella Enteritidis. Results showed that in the early stage of infection Salmonella prevalence was high in both studies, whereas at the end of the observation periods, the blends at 0.03, 0.1, and 0.5 in the challenge with Salmonella Hadar and at 0.2 and 0.5% in the challenge with Salmonella Enteritidis significantly reduced (by 2 log(10) cfu) the cecal content of Salmonella. This study showed that intestinal

  19. A 25-Amino Acid Sequence of the Arabidopsis TGD2 Protein Is Sufficient for Specific Binding of Phosphatidic Acid*

    PubMed Central

    Lu, Binbin; Benning, Christoph

    2009-01-01

    Genetic analysis suggests that the TGD2 protein of Arabidopsis is required for the biosynthesis of endoplasmic reticulum derived thylakoid lipids. TGD2 is proposed to be the substrate-binding protein of a presumed lipid transporter consisting of the TGD1 (permease) and TGD3 (ATPase) proteins. The TGD1, -2, and -3 proteins are localized in the inner chloroplast envelope membrane. TGD2 appears to be anchored with an N-terminal membrane-spanning domain into the inner envelope membrane, whereas the C-terminal domain faces the intermembrane space. It was previously shown that the C-terminal domain of TGD2 binds phosphatidic acid (PtdOH). To investigate the PtdOH binding site of TGD2 in detail, the C-terminal domain of the TGD2 sequence lacking the transit peptide and transmembrane sequences was fused to the C terminus of the Discosoma sp. red fluorescent protein (DR). This greatly improved the solubility of the resulting DR-TGD2C fusion protein following production in Escherichia coli. The DR-TGD2C protein bound PtdOH with high specificity, as demonstrated by membrane lipid-protein overlay and liposome association assays. Internal deletion and truncation mutagenesis identified a previously undescribed minimal 25-amino acid fragment in the C-terminal domain of TGD2 that is sufficient for PtdOH binding. Binding characteristics of this 25-mer were distinctly different from those of TGD2C, suggesting that additional sequences of TGD2 providing the proper context for this 25-mer are needed for wild type-like PtdOH binding. PMID:19416982

  20. The Path of Carbon in Photosynthesis IV. The Identity and Sequence of the Intermediates in Sucrose Synthesis

    DOE R&D Accomplishments Database

    Calvin, M.; Benson, A.

    1948-12-14

    The synthesis of sucrose from C{sup 14}0{sub 2} by green algae has been investigated and the intermediates separated by the method of paper chromatography. It is shown that sucrose is the first free sugar appearing during photosynthesis. It is apparently formed by condensation of the glucose-I-phosphate and a fructose phosphate. A series of radioautographs of paper chromatograms of extracts from plants which have photosynthesized for different periods of time has been prepared. The results indicate that 2-phosphoglyceric acid is the first product synthesized from C0{sub 2} during photosynthesis.

  1. Coding sequences and levels of expression of Hsc70t are identical in mice with different Orch-1 alleles

    SciTech Connect

    Snoek, M.; Vugt, H. van; Olavesen, M.G.; Milner, C.M.; Campbell, R.D.; Teuscher, C.

    1994-12-31

    Experimental allergic orchitis (EAO) is an autoimmune disease of the testis that is controlled by multple genes. The use of recombinant mouse strains has defined the map position of the H-2-associated locus controlling disease susceptibility, Orch-1, within the H-2S/H-2D interval. Over the last few years the definition of the structural organization of the C4-H-2D segment and identification of the recombination sites of the various intra-H-2 recombinations has reduced the map position of Orch-1 to the Hsp70.1-G7 interval. Three Hsp70 genes, Hsp70.1, Hsp70.3, and Hsc70t, and the genes G7b and G7a are located in this segment of DNA. In order to investigate whether Hsc70t is a suitable candidate for Orch-1 we have compared the sequence of the gene from a susceptible and a resistant haplotype.

  2. Revealing the challenges of low template DNA analysis with the prototype Ion AmpliSeq™ Identity panel v2.3 on the PGM™ Sequencer.

    PubMed

    Elena, Salata; Alessandro, Agostino; Ignazio, Ciuna; Sharon, Wootton; Luigi, Ripani; Andrea, Berti

    2016-05-01

    Forensic scientists frequently have to deal with the analysis of challenging sources of DNA such as degraded and low template DNA (LtDNA). The capacity to genotype difficult biological traces has been facilitated by emerging technologies. Massive parallel sequencing (MPS) on microchip among other technologies promises high sensitivity and discrimination power. In this study we evaluated the combined use of the Quantifiler(®) Trio DNA Quantification Kit with the prototype Ion AmpliSeq™ Identity panel v2.3 and PGM™ platform in LtDNA samples. Coverage, allele balance, allele drop-out/in, consistency and variance were assessed. Overall, the results showed a great level of performance and consistency in terms of genotyping capability even under the most challenging conditions, making it possible to obtain consistent SNP profiles with 31 pg of DNA and partial informative profiles with as little as 5 pg or with severely degraded DNA. In addition, we demonstrated that the stochastic effects observed in some samples are due to the amplification of the library rather than sequencing. Based on our data, we proposed general recommendations for the analysis of casework samples starting from the use of quantification data, which proved to be critical in deciding whether to process the samples via STR (short tandem repeat) analysis or SNP MPS. In our experience, the use of the prototype Ion AmpliSeq™ Identity panel v2.3 has revealed a new applicable solution for processing LtDNAs. This approach provides users with an additional tool for analysis of traces that either would not give informative results with conventional STR-based techniques.

  3. Amino acid sequence and carbohydrate-binding analysis of the N-acetyl-D-galactosamine-specific C-type lectin, CEL-I, from the Holothuroidea, Cucumaria echinata.

    PubMed

    Hatakeyama, Tomomitsu; Matsuo, Noriaki; Shiba, Kouhei; Nishinohara, Shoichi; Yamasaki, Nobuyuki; Sugawara, Hajime; Aoyagi, Haruhiko

    2002-01-01

    CEL-I is one of the Ca2+-dependent lectins that has been isolated from the sea cucumber, Cucumaria echinata. This protein is composed of two identical subunits held by a single disulfide bond. The complete amino acid sequence of CEL-I was determined by sequencing the peptides produced by proteolytic fragmentation of S-pyridylethylated CEL-I. A subunit of CEL-I is composed of 140 amino acid residues. Two intrachain (Cys3-Cys14 and Cys31-Cys135) and one interchain (Cys36) disulfide bonds were also identified from an analysis of the cystine-containing peptides obtained from the intact protein. The similarity between the sequence of CEL-I and that of other C-type lectins was low, while the C-terminal region, including the putative Ca2+ and carbohydrate-binding sites, was relatively well conserved. When the carbohydrate-binding activity was examined by a solid-phase microplate assay, CEL-I showed much higher affinity for N-acetyl-D-galactosamine than for other galactose-related carbohydrates. The association constant of CEL-I for p-nitrophenyl N-acetyl-beta-D-galactosaminide (NP-GalNAc) was determined to be 2.3 x 10(4) M(-1), and the maximum number of bound NP-GalNAc was estimated to be 1.6 by an equilibrium dialysis experiment.

  4. Enzymatic properties and nucleotide and amino acid sequences of a thermostable beta-agarase from a novel species of deep-sea Microbulbifer.

    PubMed

    Ohta, Y; Hatada, Y; Nogi, Y; Miyazaki, M; Li, Z; Akita, M; Hidaka, Y; Goda, S; Ito, S; Horikoshi, K

    2004-05-01

    An agar-degrading bacterium, strain JAMB-A7, was isolated from the sediment in Sagami Bay, Japan, at a depth of 1,174 m and identified as a novel species of the genus Microbulbifer. The gene for a novel beta-agarase from the isolate was cloned and sequenced. It encodes a protein of 441 amino acids with a calculated molecular mass of 48,989 Da. The deduced amino acid sequence showed similarity to those of known beta-agarases in glycoside hydrolase family 16, with only 34-55% identity. A sequence similar to a carbohydrate-binding module was found in the C-terminal region of the enzyme. The recombinant agarase was hyper-produced extracellularly using Bacillus subtilis as the host, and the enzyme purified to homogeneity had a specific activity of 398 U (mg protein)(-1) at pH 7.0 and 50 degrees C. It was thermostable, with a half-life of 502 min at 50 degrees C. The optimal pH and temperature for activity were around 7 and 50 degrees C, respectively. The pattern of agarose hydrolysis showed that the enzyme was an endo-type beta-agarase, and the final main product was neoagarotetraose. The activity was not inhibited by NaCl, EDTA, and various surfactants at high concentrations. In particular, sodium dodecyl sulfate had no inhibitory effect up to 2%.

  5. The characterization of Mycoplasma synoviae EF-Tu protein and proteins involved in hemadherence and their N-terminal amino acid sequences.

    PubMed

    Bencina, D; Narat, M; Dovc, P; Drobnic-Valic, M; Habe, F; Kleven, S H

    1999-04-01

    An abundant cytoplasmic 43-kDa protein from Mycoplasma synoviae, a major pathogen from poultry, was identified as elongation factor Tu. The N-terminal amino acid sequence (AKLDFDRSKEHVNVGTIGHV) has 90% identity with the sequence of the Mycoplasma hominis elongation factor Tu protein. Monoclonal antibodies reacting with the M. synoviae elongation factor Tu protein also reacted with 43-kDa proteins from the avian Mycoplasma species Mycoplasma gallinarum, Mycoplasma gallinaceum, Mycoplasma pullorum, Mycoplasma cloacale, Mycoplasma iners and Mycoplasma meleagridis, but not with the proteins from Mycoplasma gallisepticum, Mycoplasma imitans or Mycoplasma iowae. In addition, two groups of phase variable integral membrane proteins, pMSA and pMSB, associated with hemadherence and pathogenicity of M. synoviae strains AAY-4 and ULB925 were identified. The cleavage of a larger hemagglutinating protein encoded by a gene homologous to the vlhA gene of M. synoviae generates pMSB1 and pMSA1 proteins defined by mAb 125 and by hemagglutination inhibiting mAb 3E10, respectively. The N-terminal amino acid sequences of pMSA proteins (SENKLI ... and SENETQ ...) probably indicate the cleavage site of the M. synoviae strain ULB 925 hemagglutinin.

  6. Nucleotide sequence of the Klebsiella pneumoniae nifD gene and predicted amino acid sequence of the alpha-subunit of nitrogenase MoFe protein.

    PubMed Central

    Ioannidis, I; Buck, M

    1987-01-01

    The nucleotide sequence of the Klebsiella pneumoniae nifD gene is presented and together with the accompanying paper [Holland, Zilberstein, Zamir & Sussman (1987) Biochem. J. 247, 277-285] completes the sequence of the nifHDK genes encoding the nitrogenase polypeptides. The K. pneumoniae nifD gene encodes the 483-amino acid-residue nitrogenase alpha-subunit polypeptide of Mr 54156. The alpha-subunit has five strongly conserved cysteine residues at positions 63, 89, 155, 184 and 275, some occurring in a region showing both primary sequence and potential structural homology to the K. pneumoniae nitrogenase beta-subunit. A comparison with six other alpha-subunit amino acid sequences has been made, which indicates a number of potentially important domains within alpha-subunits. PMID:3322262

  7. The primary structure of E. coli RNA polymerase, Nucleotide sequence of the rpoC gene and amino acid sequence of the beta'-subunit.

    PubMed Central

    Ovchinnikov YuA; Monastyrskaya, G S; Gubanov, V V; Guryev, S O; Salomatina, I S; Shuvaeva, T M; Lipkin, V M; Sverdlov, E D

    1982-01-01

    The primary structure of the E. coli rpoC gene (5321 base pairs) coding the beta'-subunit of RNA polymerase as well as its adjacent segment have been determined. The structure analysis of the peptides obtained by cleavage of the protein with cyanogen bromide and trypsin has confirmed the amino acid sequence of the beta'-subunit deduced from the nucleotide sequence analysis. The beta'-subunit of E. coli RNA polymerase contains 1407 amino acid residues. Its translation is initiated by codon GUG and terminated by codon TAA. It has been detected that the sequence following the terminating codon is strikingly homologous to known sequences of rho-independent terminators. PMID:6287430

  8. Complete amino acid sequence of the A chain of human complement-classical-pathway enzyme C1r.

    PubMed Central

    Arlaud, G J; Willis, A C; Gagnon, J

    1987-01-01

    The amino acid sequence of human C1r A chain was determined, from sequence analysis performed on fragments obtained from C1r autolytic cleavage, cleavage of methionyl bonds, tryptic cleavages at arginine and lysine residues, and cleavages by staphylococcal proteinase. The polypeptide chain has an N-terminal serine residue and contains 446 amino acid residues (Mr 51,200). The sequence data allow chemical characterization of fragments alpha (positions 1-211), beta (positions 212-279) and gamma (positions 280-446) yielded from C1r autolytic cleavage, and identification of the two major cleavage sites generating these fragments. Position 150 of C1r A chain is occupied by a modified amino acid residue that, upon acid hydrolysis, yields erythro-beta-hydroxyaspartic acid, and that is located in a sequence homologous to the beta-hydroxyaspartic acid-containing regions of Factor IX, Factor X, protein C and protein Z. Sequence comparison reveals internal homology between two segments (positions 10-78 and 186-257). Two carbohydrate moieties are attached to the polypeptide chain, both via asparagine residues at positions 108 and 204. Combined with the previously determined sequence of C1r B chain [Arlaud & Gagnon (1983) Biochemistry 22, 1758-1764], these data give the complete sequence of human C1r. PMID:3036070

  9. Proteus mirabilis fimbriae: N-terminal amino acid sequence of a major fimbrial subunit and nucleotide sequences of the genes from two strains.

    PubMed

    Bahrani, F K; Cook, S; Hull, R A; Massad, G; Mobley, H L

    1993-03-01

    Proteus mirabilis, a common cause of urinary tract infection in hospitalized and catheterized patients, produces mannose-resistant/klebsiella-like (MR/K) and mannose-resistant/proteus-like (MR/P) hemagglutinins. The gene encoding the major structural subunit of a fimbria, possibly MR/K, was identified in two strains. A degenerate oligonucleotide probe based on the N terminus of the Proteus uroepithelial cell adhesin and antiserum raised against the denatured polypeptide were used to screen a cosmid gene bank of strain HU1069. A cosmid clone that reacted with the probe and antiserum was identified, and a fimbria-like open reading frame was determined by nucleotide sequencing. The predicted N-terminal amino acid sequence of the processed polypeptide, ENETPAPKVSSTKGEIQLKG (residues 23 to 42), did not match the uroepithelial cell adhesin N terminus but, rather, matched exactly the N-terminal amino acid sequence of a polypeptide with an apparent molecular size of 19.5 kDa isolated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis of a fimbrial preparation from strain HI4320 expressing MR/K hemagglutinin. By using an oligonucleotide from the HU1069 open reading frame, the fimbrial gene was isolated and sequenced from a cosmid gene bank clone of strain HI4320. A 552-bp open reading frame predicts a 184-amino-acid polypeptide including a 22-amino-acid hydrophobic leader sequence. The unprocessed polypeptide is predicted to be 18,921 Da; the processed polypeptide is predicted to be 16,749 Da. The predicted amino acid sequence of the polypeptide encoded by the gene, designated pmfA, displayed 36% exact matches with the mannose-resistant fimbrial subunit encoded by smfA of Serratia marcescens but only 15% exact matches with the predicted sequence encoded by mrkA of Klebsiella pneumoniae.

  10. Bacteria obtained from a sequencing batch reactor that are capable of growth on dehydroabietic acid.

    PubMed Central

    Mohn, W W

    1995-01-01

    Eleven isolates capable of growth on the resin acid dehydroabietic acid (DhA) were obtained from a sequencing batch reactor designed to treat a high-strength process stream from a paper mill. The isolates belonged to two groups, represented by strains DhA-33 and DhA-35, which were characterized. In the bioreactor, bacteria like DhA-35 were more abundant than those like DhA-33. The population in the bioreactor of organisms capable of growth on DhA was estimated to be 1.1 x 10(6) propagules per ml, based on a most-probable-number determination. Analysis of small-subunit rRNA partial sequences indicated that DhA-33 was most closely related to Sphingomonas yanoikuyae (Sab = 0.875) and that DhA-35 was most closely related to Zoogloea ramigera (Sab = 0.849). Both isolates additionally grew on other abietanes, i.e., abietic and palustric acids, but not on the pimaranes, pimaric and isopimaric acids. For DhA-33 and DhA-35 with DhA as the sole organic substrate, doubling times were 2.7 and 2.2 h, respectively, and growth yields were 0.30 and 0.25 g of protein per g of DhA, respectively. Glucose as a cosubstrate stimulated growth of DhA-33 on DhA and stimulated DhA degradation by the culture. Pyruvate as a cosubstrate did not stimulate growth of DhA-35 on DhA and reduced the specific rate of DhA degradation of the culture. DhA induced DhA and abietic acid degradation activities in both strains, and these activities were heat labile. Cell suspensions of both strains consumed DhA at a rate of 6 mumol mg of protein-1 h-1.(ABSTRACT TRUNCATED AT 250 WORDS) PMID:7793937

  11. Bacteria obtained from a sequencing batch reactor that are capable of growth on dehydroabietic acid.

    PubMed

    Mohn, W W

    1995-06-01

    Eleven isolates capable of growth on the resin acid dehydroabietic acid (DhA) were obtained from a sequencing batch reactor designed to treat a high-strength process stream from a paper mill. The isolates belonged to two groups, represented by strains DhA-33 and DhA-35, which were characterized. In the bioreactor, bacteria like DhA-35 were more abundant than those like DhA-33. The population in the bioreactor of organisms capable of growth on DhA was estimated to be 1.1 x 10(6) propagules per ml, based on a most-probable-number determination. Analysis of small-subunit rRNA partial sequences indicated that DhA-33 was most closely related to Sphingomonas yanoikuyae (Sab = 0.875) and that DhA-35 was most closely related to Zoogloea ramigera (Sab = 0.849). Both isolates additionally grew on other abietanes, i.e., abietic and palustric acids, but not on the pimaranes, pimaric and isopimaric acids. For DhA-33 and DhA-35 with DhA as the sole organic substrate, doubling times were 2.7 and 2.2 h, respectively, and growth yields were 0.30 and 0.25 g of protein per g of DhA, respectively. Glucose as a cosubstrate stimulated growth of DhA-33 on DhA and stimulated DhA degradation by the culture. Pyruvate as a cosubstrate did not stimulate growth of DhA-35 on DhA and reduced the specific rate of DhA degradation of the culture. DhA induced DhA and abietic acid degradation activities in both strains, and these activities were heat labile. Cell suspensions of both strains consumed DhA at a rate of 6 mumol mg of protein-1 h-1.(ABSTRACT TRUNCATED AT 250 WORDS)

  12. Nucleotide sequences of the Pseudomonas savastanoi indoleacetic acid genes show homology with Agrobacterium tumefaciens T-DNA

    PubMed Central

    Yamada, Tetsuji; Palm, Curtis J.; Brooks, Bob; Kosuge, Tsune

    1985-01-01

    We report the nucleotide sequences of iaaM and iaaH, the genetic determinants for, respectively, tryptophan 2-monooxygenase and indoleacetamide hydrolase, the enzymes that catalyze the conversion of L-tryptophan to indoleacetic acid in the tumor-forming bacterium Pseudomonas syringae pv. savastanoi. The sequence analysis indicates that the iaaM locus contains an open reading frame encoding 557 amino acids that would comprise a protein with a molecular weight of 61,783; the iaaH locus contains an open reading frame of 455 amino acids that would comprise a protein with a molecular weight of 48,515. Significant amino acid sequence homology was found between the predicted sequence of the tryptophan monooxygenase of P. savastanoi and the deduced product of the T-DNA tms-1 gene of the octopine-type plasmid pTiA6NC from Agrobacterium tumefaciens. Strong homology was found in the 25 amino acid sequence in the putative FAD-binding region of tryptophan monooxygenase. Homology was also found in the amino acid sequences representing the central regions of the putative products of iaaH and tms-2 T-DNA. The results suggest a strong similarity in the pathways for indoleacetic acid synthesis encoded by genes in P. savastanoi and in A. tumefaciens T-DNA. Images PMID:16593610

  13. Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

    PubMed Central

    Chen, Ke; Kurgan, Lukasz A; Ruan, Jishou

    2007-01-01

    Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D) protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP); the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM) and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are characterized by accuracies below 70

  14. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  15. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3

    PubMed Central

    Xiao, Jingfa; Hao, Lirui; Crowley, David E.; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  16. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    PubMed

    Wang, Xiaoyu; Chen, Meili; Xiao, Jingfa; Hao, Lirui; Crowley, David E; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.

  17. Gene sequence and predicted amino acid sequence of the motA protein, a membrane-associated protein required for flagellar rotation in Escherichia coli.

    PubMed Central

    Dean, G E; Macnab, R M; Stader, J; Matsumura, P; Burks, C

    1984-01-01

    The motA and motB gene products of Escherichia coli are integral membrane proteins necessary for flagellar rotation. We determined the DNA sequence of the region containing the motA gene and its promoter. Within this sequence, there is an open reading frame of 885 nucleotides, which with high probability (98% confidence level) meets criteria for a coding sequence. The 295-residue amino acid translation product had a molecular weight of 31,974, in good agreement with the value determined experimentally by gel electrophoresis. The amino acid sequence, which was quite hydrophobic, was subjected to a theoretical analysis designed to predict membrane-spanning alpha-helical segments of integral membrane proteins; four such hydrophobic helices were predicted by this treatment. Additional amphipathic helices may also be present. A remarkable feature of the sequence is the existence of two segments of high uncompensated charge density, one positive and the other negative. Possible organization of the protein in the membrane is discussed. Asymmetry in the amino acid composition of translated DNA sequences was used to distinguish between two possible initiation codons. The use of this method as a criterion for authentication of coding regions is described briefly in an Appendix. PMID:6090403

  18. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    NASA Astrophysics Data System (ADS)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  19. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, H.U.G.; Gray, J.W.

    1995-06-27

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.

  20. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, Heinz-Ulrich G.; Gray, Joe W.

    1995-01-01

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers ard probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.

  1. KM+, a mannose-binding lectin from Artocarpus integrifolia: amino acid sequence, predicted tertiary structure, carbohydrate recognition, and analysis of the beta-prism fold.

    PubMed Central

    Rosa, J. C.; De Oliveira, P. S.; Garratt, R.; Beltramini, L.; Resing, K.; Roque-Barreira, M. C.; Greene, L. J.

    1999-01-01

    The complete amino acid sequence of the lectin KM+ from Artocarpus integrifolia (jackfruit), which contains 149 residues/mol, is reported and compared to those of other members of the Moraceae family, particularly that of jacalin, also from jackfruit, with which it shares 52% sequence identity. KM+ presents an acetyl-blocked N-terminus and is not posttranslationally modified by proteolytic cleavage as is the case for jacalin. Rather, it possesses a short, glycine-rich linker that unites the regions homologous to the alpha- and beta-chains of jacalin. The results of homology modeling implicate the linker sequence in sterically impeding rotation of the side chain of Asp141 within the binding site pocket. As a consequence, the aspartic acid is locked into a conformation adequate only for the recognition of equatorial hydroxyl groups on the C4 epimeric center (alpha-D-mannose, alpha-D-glucose, and their derivatives). In contrast, the internal cleavage of the jacalin chain permits free rotation of the homologous aspartic acid, rendering it capable of accepting hydrogen bonds from both possible hydroxyl configurations on C4. We suggest that, together with direct recognition of epimeric hydroxyls and the steric exclusion of disfavored ligands, conformational restriction of the lectin should be considered to be a new mechanism by which selectivity may be built into carbohydrate binding sites. Jacalin and KM+ adopt the beta-prism fold already observed in two unrelated protein families. Despite presenting little or no sequence similarity, an analysis of the beta-prism reveals a canonical feature repeatedly present in all such structures, which is based on six largely hydrophobic residues within a beta-hairpin containing two classic-type beta-bulges. We suggest the term beta-prism motif to describe this feature. PMID:10210179

  2. Structural similarity between native proteins and chimera constructs obtained by inverting the amino Acid sequence.

    PubMed

    Carugo, Oliviero

    2010-12-01

    The analysis of the symmetry of protein three-dimensional structures can be extremely useful in order to understand and classify the protein structural universe. The structures of proteins with back-traced amino acid sequence were modeled and compared to the structures of their native counterparts. Only in a very limited set of cases, the two objects showed a significant level of similarity. These extremely symmetric examples can be of any structural class and of any dimension. The lack of biunique "N to C" and "C to N" symmetry at the structural level mirrors that at the sequence level and we propose to design as a dlof symmetry the cases in which a protein structure is similar to its back-traced variant.

  3. Microbial community dynamics in bioaugmented sequencing batch reactors for bromoamine acid removal.

    PubMed

    Qu, Yuanyuan; Zhou, Jiti; Wang, Jing; Fu, Xiang; Xing, Linlin

    2005-05-01

    Sphingomonas xenophaga QYY with the ability to degrade bromoamine acid (BAA) was previously isolated from sludge samples. The enhancement of BAA removal by strain QYY in sequencing batch reactors (SBRs) was investigated in this study. The results showed that augmented SBRs exhibited stronger abilities to degrade BAA than the non-augmented control one. In order to estimate the relationship between community dynamics and function of augmented SBRs, a combined method based on fingerprints (ribosomal intergenic spacer analysis, RISA) and 16S rRNA gene sequencing was used. The results indicated that the microbial community dynamics were substantially changed, and the introduced strain QYY was persistent in the augmented systems. This study suggests that it is feasible and potentially useful to enhance BAA removal using BAA-degrading bacteria, such as S. xenophaga QYY.

  4. Complete amino acid sequence of ananain and a comparison with stem bromelain and other plant cysteine proteases.

    PubMed Central

    Lee, K L; Albee, K L; Bernasconi, R J; Edmunds, T

    1997-01-01

    The amino acid sequences of ananain (EC3.4.22.31) and stem bromelain (3.4.22.32), two cysteine proteases from pineapple stem, are similar yet ananain and stem bromelain possess distinct specificities towards synthetic peptide substrates and different reactivities towards the cysteine protease inhibitors E-64 and chicken egg white cystatin. We present here the complete amino acid sequence of ananain and compare it with the reported sequences of pineapple stem bromelain, papain and chymopapain from papaya and actinidin from kiwifruit. Ananain is comprised of 216 residues with a theoretical mass of 23464 Da. This primary structure includes a sequence insert between residues 170 and 174 not present in stem bromelain or papain and a hydrophobic series of amino acids adjacent to His-157. It is possible that these sequence differences contribute to the different substrate and inhibitor specificities exhibited by ananain and stem bromelain. PMID:9355753

  5. [Measurement of the amino acid sequence for the fusion protein FP3 with LC-MS/MS].

    PubMed

    Li, Xiang; Gao, Xiang-Dong; Tao, Lei; Pei, De-Ning; Guo, Ying; Rao, Chun-Ming; Wang, Jun-Zhi

    2012-02-01

    The amino acid sequence of the fusion protein FP3 was measured by two types of LC-MS/MS and its primary structure was confirmed. After reduction and alkylation, the protein was digested with trypsin and glycosyl groups in glycopeptide were removed by PNGase F. The mixed peptides were separated by LC, then Q-TOF and Ion trap tandem mass spectrometry were used to measure b, y fragment ions of each peptide to analyze the amino acid sequence of fusion protein FP3. Seventy-six percent of full amino acid sequence of the fusion protein FP3 was measured by LC-ESI-Q-TOF with the remaining 24% completed by LC-ESI-Trap. As LC-MS and tandem mass spectrometry are rapid, sensitive, accurate to measure the protein amino acid sequence, they are important approach to structure analysis and identification of recombinant protein.

  6. NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents

    PubMed Central

    Liu, Sophia S.; Hockenberry, Adam J.; Lancichinetti, Andrea; Jewett, Michael C.

    2016-01-01

    The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems. PMID:27835644

  7. Morphological tranformation of calcite crystal growth by prismatic "acidic" polypeptide sequences.

    SciTech Connect

    Kim, I; Giocondi, J L; Orme, C A; Collino, J; Evans, J S

    2007-02-13

    Many of the interesting mechanical and materials properties of the mollusk shell are thought to stem from the prismatic calcite crystal assemblies within this composite structure. It is now evident that proteins play a major role in the formation of these assemblies. Recently, a superfamily of 7 conserved prismatic layer-specific mollusk shell proteins, Asprich, were sequenced, and the 42 AA C-terminal sequence region of this protein superfamily was found to introduce surface voids or porosities on calcite crystals in vitro. Using AFM imaging techniques, we further investigate the effect that this 42 AA domain (Fragment-2) and its constituent subdomains, DEAD-17 and Acidic-2, have on the morphology and growth kinetics of calcite dislocation hillocks. We find that Fragment-2 adsorbs on terrace surfaces and pins acute steps, accelerates then decelerates the growth of obtuse steps, forms clusters and voids on terrace surfaces, and transforms calcite hillock morphology from a rhombohedral form to a rounded one. These results mirror yet are distinct from some of the earlier findings obtained for nacreous polypeptides. The subdomains Acidic-2 and DEAD-17 were found to accelerate then decelerate obtuse steps and induce oval rather than rounded hillock morphologies. Unlike DEAD-17, Acidic-2 does form clusters on terrace surfaces and exhibits stronger obtuse velocity inhibition effects than either DEAD-17 or Fragment-2. Interestingly, a 1:1 mixture of both subdomains induces an irregular polygonal morphology to hillocks, and exhibits the highest degree of acute step pinning and obtuse step velocity inhibition. This suggests that there is some interplay between subdomains within an intra (Fragment-2) or intermolecular (1:1 mixture) context, and sequence interplay phenomena may be employed by biomineralization proteins to exert net effects on crystal growth and morphology.

  8. Purification and N-terminal amino acid sequence of dextranicin 24, a bacteriocin of Leuconostoc sp.

    PubMed

    Revol-Junelles, A M; Lefebvre, G

    1996-08-01

    Leuconostoc mesenteroides subsp. dextranicum strain J24 synthesized a bacteriocin named Dextranicin 24 (Dex-24), which inhibited only other Leuconostoc sp. strains. It was purified by a two-step procedure from the fraction of the bacteriocin bound to the producer cells at the end of the growth: desorption form the cells at acidic pH, followed by reserve phase HPLC. The N-terminal sequence of Dex-24 was the following: NH2(-) K G V L G W L S M A S S A L T G P Q Q . . .

  9. Sequence selective recognition of double-stranded RNA using triple helix-forming peptide nucleic acids.

    PubMed

    Zengeya, Thomas; Gupta, Pankaj; Rozners, Eriks

    2014-01-01

    Noncoding RNAs are attractive targets for molecular recognition because of the central role they play in gene expression. Since most noncoding RNAs are in a double-helical conformation, recognition of such structures is a formidable problem. Herein, we describe a method for sequence-selective recognition of biologically relevant double-helical RNA (illustrated on ribosomal A-site RNA) using peptide nucleic acids (PNA) that form a triple helix in the major grove of RNA under physiologically relevant conditions. Protocols for PNA preparation and binding studies using isothermal titration calorimetry are described in detail.

  10. Fast computational methods for predicting protein structure from primary amino acid sequence

    DOEpatents

    Agarwal, Pratul Kumar

    2011-07-19

    The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.

  11. Hemoglobin from the antarctic fish Notothenia coriiceps neglecta. Amino acid sequence of the beta chain.

    PubMed

    D'Avino, R; Caruso, C; Schinina, M E; Rutigliano, B; Romano, M; Camardella, L; Bossa, F; Barra, D; di Prisco, G

    1990-01-01

    1. Notothenia coriiceps neglecta is a cold-adapted notothenioid teleost, widely distributed in the Antarctic waters. 2. In comparison with fishes from temperate waters, the blood of this teleost contains a reduced number of erythrocytes and concentration of hemoglobin; the erythrocytes contain two hemoglobins, Hb1 and Hb2, respectively accounting for approximately 90, and 5% of the total. 3. The two components differ by the alpha chain; the amino acid sequence of the beta chain in common to the two hemoglobins has been established, thus completing the elucidation of the primary structure of the major component Hb 1.

  12. Amino acid sequence of two neurotoxins from the venom of the Egyptian black snake (Walterinnesia aegyptia).

    PubMed

    Samejima, Y; Aoki-Tomomatsu, Y; Yanagisawa, M; Mebs, D

    1997-02-01

    The venom of the Egyptian black snake Walterinnesia aegyptia contains at least three toxins, which act postsynaptically to block the neuromuscular transmission of isolated rat phrenic nerve-diaphragm and chicken biventer cervicis muscle. The complete amino acid sequence of the two toxins, W-III and W-IV, consisting of 62 amino acid residues, was elucidated by Edman degradation of fragments obtained after Staphylococcus aureus protease and prolylpeptidase digestion. Although the toxins exhibit close structural homology to other short-chain postsynaptic neurotoxins from Elapidae venoms, toxin IV is unique by having a free SH-group (cysteine) at position 16. In position 35 of W-III, which is located at the tip of the central loop, threonine is replaced by lysine, which may alter the interaction of the toxin with the acetylcholine receptor, since the toxin is seven times less lethal than toxin W-IV.

  13. Complete genome sequence of Lactococcus lactis IO-1, a lactic acid bacterium that utilizes xylose and produces high levels of L-lactic acid.

    PubMed

    Kato, Hiroaki; Shiwa, Yuh; Oshima, Kenshiro; Machii, Miki; Araya-Kojima, Tomoko; Zendo, Takeshi; Shimizu-Kadota, Mariko; Hattori, Masahira; Sonomoto, Kenji; Yoshikawa, Hirofumi

    2012-04-01

    We report the complete genome sequence of Lactococcus lactis IO-1 (= JCM7638). It is a nondairy lactic acid bacterium, produces nisin Z, ferments xylose, and produces predominantly L-lactic acid at high xylose concentrations. From ortholog analysis with other five L. lactis strains, IO-1 was identified as L. lactis subsp. lactis.

  14. Complete genome sequence of Bacillus amyloliquefaciens LL3, which exhibits glutamic acid-independent production of poly-γ-glutamic acid.

    PubMed

    Geng, Weitao; Cao, Mingfeng; Song, Cunjiang; Xie, Hui; Liu, Li; Yang, Chao; Feng, Jun; Zhang, Wei; Jin, Yinghong; Du, Yang; Wang, Shufang

    2011-07-01

    Bacillus amyloliquefaciens is one of most prevalent Gram-positive aerobic spore-forming bacteria with the ability to synthesize polysaccharides and polypeptides. Here, we report the complete genome sequence of B. amyloliquefaciens LL3, which was isolated from fermented food and presents the glutamic acid-independent production of poly-γ-glutamic acid.

  15. Formation Sequences of Iron Minerals in the Acidic Alteration Products and Variation of Hydrothermal Fluid Conditions

    NASA Astrophysics Data System (ADS)

    Isobe, H.; Yoshizawa, M.

    2008-12-01

    Iron minerals have important role in environmental issues not only on the Earth but also other terrestrial planets. Iron mineral species related to alteration products of primary minerals with surface or subsurface fluids are characterized by temperature, acidity and redox conditions of the fluids. We can see various iron- bearing alteration products in alteration products around fumaroles in geothermal/volcanic areas. In this study, zonal structures of iron minerals in alteration products of the geothermal area are observed to elucidate temporal and spatial variation of hydrothermal fluids. Alteration of the pyroxene-amphibole andesite of Garan-dake volcano, Oita, Japan occurs by the acidic hydrothermal fluid to form cristobalite leaching out elements other than Si. Hand specimens with unaltered or weakly altered core and cristobalite crust show various sequences of layers. XRD analysis revealed that the alteration degree is represented by abundance of cristobalite. Intermediately altered layers are characterized by occurrence including alunite, pyrite, kaolinite, goethite and hematite. A specimen with reddish brown core surrounded by cristobalite-rich white crust has brown colored layers at the boundary of core and the crust. Reddish core is characterized by occurrence of crystalline hematite by XRD. Another hand specimen has light gray core, which represents reduced conditions, and white cristobalite crust with light brown and reddish brown layers of ferric iron minerals between the core and the crust. On the other hand, hornblende crystals, typical ferrous iron-bearing mineral of the host rock, are well preserved in some samples with strongly decolorized cristobalite-rich groundmass. Hydrothermal alteration experiments of iron-rich basaltic material shows iron mineral species depend on acidity and temperature of the fluid. Oxidation states of the iron-bearing mineral species are strongly influenced by the acidity and redox conditions. Variations of alteration

  16. Design, synthesis, and characterization of a protein sequencing reagent yielding amino acid derivatives with enhanced detectability by mass spectrometry.

    PubMed Central

    Aebersold, R.; Bures, E. J.; Namchuk, M.; Goghari, M. H.; Shushan, B.; Covey, T. C.

    1992-01-01

    We report the design, chemical synthesis, and structural and functional characterization of a novel reagent for protein sequence analysis by the Edman degradation, yielding amino acid derivatives rapidly detectable at high sensitivity by ion-evaporation mass spectrometry. We demonstrate that the reagent 3-[4'(ethylene-N,N,N-trimethylamino)phenyl]-2-isothiocyanate is chemically stable and shows coupling and cyclization/cleavage yields comparable to phenylisothiocyanate, the standard reagent in chemical sequence analysis, under conditions typically encountered in manual or automated sequence analysis. Amino acid derivatives generated with this reagent were detectable by ion-evaporation mass spectrometry at the subfemtomole sensitivity level at a pace of one sample per minute. Furthermore, derivatives were identified by their mass, thus permitting the rapid and highly sensitive determination of the molecular nature of modified amino acids. Derivatives of amino acids with acidic, basic, polar, or hydrophobic side chains were reproducibly detectable at comparable sensitivities. The polar nature of the reagent required covalent immobilization of polypeptides prior to automated sequence analysis. This reagent, used in automated sequence analysis, has the potential for overcoming the limitations in sensitivity, speed, and the ability to characterize modified amino acid residues inherent in the chemical sequencing methods that are currently used. PMID:1304351

  17. 3-d structure-based amino acid sequence alignment of esterases, lipases and related proteins

    SciTech Connect

    Gentry, M.K.; Doctor, B.P.; Cygler, M.; Schrag, J.D.; Sussman, J.L.

    1993-05-13

    Acetylcholinesterase and butyrylcholinesterase, enzymes with potential as pretreatment drugs for organophosphate toxicity, are members of a larger family of homologous proteins that includes carboxylesterases, cholesterol esterases, lipases, and several nonhydrolytic proteins. A computer-generated alignment of 18 of the proteins, the acetylcholinesases, butyrylcholinesterases, carboxylesterases, some esterases, and the nonenzymatic proteins has been previously presented. More recently, the three-dimensional structures of two enzymes enzymes in this group, acetylcholinesterase from Torpedo californica and lipase from Geotrichum candidum, have been determined. Based on the x-ray structures and the superposition of these two enzymes, it was possible to obtain an improved amino acid sequence alignment of 32 members of this family of proteins. Examination of this alignment reveals that 24 amino acids are invariant in all of the hydrolytic proteins, and an additional 49 are well conserved. Conserved amino acids include those of the active site, the disulfide bridges, the salt bridges, in the core of the proteins, and at the edges of secondary structural elements. Comparison of the three-dimensional structures makes it possible to find a well-defined structural basis for the conservation of many of these amino acids.

  18. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production.

    PubMed

    Coulson, Thomas J D; Patten, Cheryl L

    2015-08-06

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences.

  19. Draft Genome Sequence of Bacillus subtilis subsp. natto Strain CGMCC 2108, a High Producer of Poly-γ-Glutamic Acid

    PubMed Central

    Tan, Siyuan; Su, Anping; Zhang, Chen; Ren, Yuanyuan

    2016-01-01

    Here, we report the 4.1-Mb draft genome sequence of Bacillus subtilis subsp. natto strain CGMCC 2108, a high producer of poly-γ-glutamic acid (γ-PGA). This sequence will provide further help for the biosynthesis of γ-PGA and will greatly facilitate research efforts in metabolic engineering of B. subtilis subsp. natto strain CGMCC 2108. PMID:27231363

  20. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production

    PubMed Central

    Coulson, Thomas J. D.

    2015-01-01

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences. PMID:26251488

  1. Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis subsp. lactis TOMSC161, Isolated from a Nonscalded Curd Pressed Cheese

    PubMed Central

    Velly, H.; Abraham, A.-L.; Loux, V.; Delacroix-Buchet, A.; Fonseca, F.; Bouix, M.

    2014-01-01

    Lactococcus lactis is a lactic acid bacterium used in the production of many fermented foods, such as dairy products. Here, we report the genome sequence of L. lactis subsp. lactis TOMSC161, isolated from nonscalded curd pressed cheese. This genome sequence provides information in relation to dairy environment adaptation. PMID:25377704

  2. ANTICALIgN: visualizing, editing and analyzing combined nucleotide and amino acid sequence alignments for combinatorial protein engineering.

    PubMed

    Jarasch, Alexander; Kopp, Melanie; Eggenstein, Evelyn; Richter, Antonia; Gebauer, Michaela; Skerra, Arne

    2016-07-01

    ANTIC ALIGN: is an interactive software developed to simultaneously visualize, analyze and modify alignments of DNA and/or protein sequences that arise during combinatorial protein engineering, design and selection. ANTIC ALIGN: combines powerful functions known from currently available sequence analysis tools with unique features for protein engineering, in particular the possibility to display and manipulate nucleotide sequences and their translated amino acid sequences at the same time. ANTIC ALIGN: offers both template-based multiple sequence alignment (MSA), using the unmutated protein as reference, and conventional global alignment, to compare sequences that share an evolutionary relationship. The application of similarity-based clustering algorithms facilitates the identification of duplicates or of conserved sequence features among a set of selected clones. Imported nucleotide sequences from DNA sequence analysis are automatically translated into the corresponding amino acid sequences and displayed, offering numerous options for selecting reading frames, highlighting of sequence features and graphical layout of the MSA. The MSA complexity can be reduced by hiding the conserved nucleotide and/or amino acid residues, thus putting emphasis on the relevant mutated positions. ANTIC ALIGN: is also able to handle suppressed stop codons or even to incorporate non-natural amino acids into a coding sequence. We demonstrate crucial functions of ANTIC ALIGN: in an example of Anticalins selected from a lipocalin random library against the fibronectin extradomain B (ED-B), an established marker of tumor vasculature. Apart from engineered protein scaffolds, ANTIC ALIGN: provides a powerful tool in the area of antibody engineering and for directed enzyme evolution. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  3. Multiple Amino Acid Sequence Alignment Nitrogenase Component 1: Insights into Phylogenetics and Structure-Function Relationships

    PubMed Central

    Howard, James B.; Kechris, Katerina J.; Rees, Douglas C.; Glazer, Alexander N.

    2013-01-01

    Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as “core” for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf) yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification provides the bases

  4. Multiple amino acid sequence alignment nitrogenase component 1: insights into phylogenetics and structure-function relationships.

    PubMed

    Howard, James B; Kechris, Katerina J; Rees, Douglas C; Glazer, Alexander N

    2013-01-01

    Amino acid residues critical for a protein's structure-function are retained by natural selection and these residues are identified by the level of variance in co-aligned homologous protein sequences. The relevant residues in the nitrogen fixation Component 1 α- and β-subunits were identified by the alignment of 95 protein sequences. Proteins were included from species encompassing multiple microbial phyla and diverse ecological niches as well as the nitrogen fixation genotypes, anf, nif, and vnf, which encode proteins associated with cofactors differing at one metal site. After adjusting for differences in sequence length, insertions, and deletions, the remaining >85% of the sequence co-aligned the subunits from the three genotypes. Six Groups, designated Anf, Vnf , and Nif I-IV, were assigned based upon genetic origin, sequence adjustments, and conserved residues. Both subunits subdivided into the same groups. Invariant and single variant residues were identified and were defined as "core" for nitrogenase function. Three species in Group Nif-III, Candidatus Desulforudis audaxviator, Desulfotomaculum kuznetsovii, and Thermodesulfatator indicus, were found to have a seleno-cysteine that replaces one cysteinyl ligand of the 8Fe:7S, P-cluster. Subsets of invariant residues, limited to individual groups, were identified; these unique residues help identify the gene of origin (anf, nif, or vnf) yet should not be considered diagnostic of the metal content of associated cofactors. Fourteen of the 19 residues that compose the cofactor pocket are invariant or single variant; the other five residues are highly variable but do not correlate with the putative metal content of the cofactor. The variable residues are clustered on one side of the cofactor, away from other functional centers in the three dimensional structure. Many of the invariant and single variant residues were not previously recognized as potentially critical and their identification provides the bases for

  5. Identification of cytokine-induced neutrophil chemoattractants (CINC), rat GRO/CINC-2 alpha and CINC-2 beta, produced by granulation tissue in culture: purification, complete amino acid sequences and characterization.

    PubMed Central

    Nakagawa, H; Komorita, N; Shibata, F; Ikesue, A; Konishi, K; Fujioka, M; Kato, H

    1994-01-01

    Four basic neutrophil chemotactic factors (chemokines) have been purified from conditioned medium of granulation tissue obtained from carrageenin-induced inflammation in the rat. On the basis of their N-terminal amino acid sequences, one of the chemokines was identical with rat GRO/cytokine-induced neutrophil chemoattractant (CINC) which we reported previously, and another was identical with rat macrophage inflammatory protein-2 (MIP-2). Two other chemokines were novel chemoattractants related to MIP-2. The novel chemokines are referred to as rat GRO/CINC-2 alpha and CINC-2 beta, and consequently CINC and rat MIP-2 are renamed rat GRO/CINC-1 and CINC-3 respectively. The complete amino acid sequences of purified CINC-2 alpha and CINC-3 were determined by analysis of the fragments isolated from proteinase V8-treated CINCs. The cDNA for CINC-2 beta was cloned by reverse transcription/PCR amplification using specific primers starting with total RNA extracted from lipopolysaccharide-stimulated rat macrophages. A comparison of the amino acid sequence encoded by the cDNA with the N-terminal amino acid sequence of purified CINC-2 beta revealed that mature CINC-2 beta is a 68-residue chemoattractant produced by cleavage of a 32-residue signal peptide. The difference in amino acid sequences between CINC-2 alpha and CINC-2 beta consisted of only three C-terminal residues. Rat GRO/CINC-2 alpha is a major chemokine, and the four purified chemokines have similar chemotactic activity, suggesting that they contribute to neutrophil infiltration into inflammatory sites in rats. Images Figure 2 PMID:8043001

  6. In the TTF-1 homeodomain the contribution of several amino acids to DNA recognition depends on the bound sequence.

    PubMed Central

    Fabbro, D; Tell, G; Leonardi, A; Pellizzari, L; Pucillo, C; Lonigro, R; Formisano, S; Damante, G

    1996-01-01

    The thyroid transcription factor-1 homeodomain (TTF-1HD) shows a peculiar DNA binding specificity, preferentially recognizing sequences containing the 5'-CAAG-3' core motif. Most other homeodomains instead recognize sites containing the 5'-TAAT-3' core motif. Here, we show that TTF-1HD efficiently recognizes another sequence, called D1, devoid of the 5'-CAAG-3' core motif. Different experimental approaches indicate that TTF-1HD contacts the D1 sequence in a manner which is different to that used to interact with sequences containing the 5'-CAAG-3' core motif. The binding activities that mutants of TTF-1HD display with the D1 sequence or with the sequence containing the 5'-CAAG-3' core motif indicate that the role of several DNA-contacting amino acids is different. In particular, during recognition of the D1 sequence, backbone-interacting amino acids not relevant in binding to sequences containing the 5'-CAAG-3' core motif play an important role. In the TTF-1HD, therefore, the contribution of several amino acids to DNA recognition depends on the bound sequence. These data indicate that although a common bonding network exists in all of the HD/DNA complexes, peculiarities important for DNA recognition may occur in single cases. PMID:8811078

  7. Complete amino acid sequence of a Lolium perenne (perennial rye grass) pollen allergen, Lol p II.

    PubMed

    Ansari, A A; Shenbagamurthi, P; Marsh, D G

    1989-07-05

    The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p II was determined by automated Edman degradation of the protein and selected fragments. Cleavage of the protein by enzymatic and chemical techniques established an unambiguous sequence for the protein. Lol p II contains 97 amino acid residues, with a calculated molecular weight of 10,882. The protein lacks cysteine and glutamine and shows no evidence of glycosylation. Theoretical predictions by Fraga's (Fraga, S. (1982) Can. J. Chem. 60, 2606-2610) and Hopp and Woods' (Hopp, T. P., and Woods, K. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 3824-3828) methods indicate the presence of four hydrophilic regions, which may contribute to sequential or parts of conformational B-cell epitopes. Analysis of amphipathic regions by Berzofsky's method indicates the presence of a highly amphipathic region, which may contain, or contribute to, an Ia/T-cell epitope. This latter segment of Lol p II was found to be highly homologous with an antibody-binding segment of the major rye allergen Lol p I and may explain why immune responsiveness to both the allergens is associated with HLA-DR3.

  8. The Sequence-Specific Cellular Uptake of Spherical Nucleic Acid Nanoparticle Conjugates

    PubMed Central

    Narayan, Suguna P.; Choi, Chung Hang J.; Hao, Liangliang; Calabrese, Colin M.; Auyeung, Evelyn; Zhang, Chuan; Goor, Olga J.G.M.

    2015-01-01

    We investigated the sequence-dependent cellular uptake of spherical nucleic acid nanoparticle conjugates (SNAs). This process occurs by interaction with class A scavenger receptors (SR-A) and caveolae-mediated endocytosis. It is known that linear poly(guanine) (poly G) is a natural ligand for SR-A, and it has been proposed that interaction of poly G with SR-A is dependent on the formation of G-quadruplexes. Since G-rich oligonucleotides are known to interact strongly with SR-A, we hypothesized that SNAs with higher G contents would be able to enter cells in larger amounts than SNAs composed of other nucleotides, and as such we measured cellular internalization of SNAs as a function of constituent oligonucleotide sequence. Indeed, SNAs with enriched G content show the highest cellular uptake. Using this hypothesis, we chemically conjugated a small molecule (camptothecin) with SNAs to create drug-SNA conjugates and observed that poly G SNAs deliver the most camptothecin to cells and have the highest cytotoxicity in cancer cells. Our data elucidate important design considerations for enhancing the intracellular delivery of spherical nucleic acids. PMID:26097111

  9. Partial amino acid sequences around sulfhydryl groups of soybean beta-amylase.

    PubMed

    Nomura, K; Mikami, B; Morita, Y

    1987-08-01

    Sulfhydryl (SH) groups of soybean beta-amylase were modified with 5-(iodoaceto-amidoethyl)aminonaphthalene-1-sulfonate (IAEDANS) and the SH-containing peptides exhibiting fluorescence were purified after chymotryptic digestion of the modified enzyme. The sequence analysis of the peptides derived from the modification of all SH groups in the denatured enzyme revealed the existence of six SH groups, in contrast to five reported previously. One of them was found to have extremely low reactivity toward SH-reagents without reduction. In the native state, IAEDANS reacted with 2 mol of SH groups per mol of the enzyme (SH1 and SH2) accompanied with inactivation of the enzyme owing to the modification of SH2 located near the active site of this enzyme. The selective modification of SH2 with IAEDANS was attained after the blocking of SH1 with 5,5'-dithiobis-(2-nitrobenzoic acid). The amino acid sequences of the peptides containing SH1 and SH2 were determined to be Cys-Ala-Asn-Pro-Gln and His-Gln-Cys-Gly-Gly-Asn-Val-Gly-Asp-Ile-Val-Asn-Ile-Pro-Ile-Pro-Gln-Trp, respectively.

  10. Molecular cloning, encoding sequence, and expression of vaccinia virus nucleic acid-dependent nucleoside triphosphatase gene.

    PubMed Central

    Rodriguez, J F; Kahn, J S; Esteban, M

    1986-01-01

    A rabbit poxvirus genomic library contained within the expression vector lambda gt11 was screened with polyclonal antiserum prepared against vaccinia virus nucleic acid-dependent nucleoside triphosphatase (NTPase)-I enzyme. Five positive phage clones containing from 0.72- to 2.5-kilobase-pair (kbp) inserts expressed a beta-galactosidase fusion protein that was reactive by immunoblotting with the NTPase-I antibody. Hybridization analysis allowed the location of this gene within the vaccinia HindIIID restriction fragment. From the known nucleotide sequence of the 16-kbp vaccinia HindIIID fragment, we identified a region that contains a 1896-base open reading frame coding for a 631-amino acid protein. Analysis of the complete sequence revealed a highly basic protein, with hydrophilic COOH and NH2 termini, various hydrophobic domains, and no significant homology to other known proteins. Translational studies demonstrate that NTPase-I belongs to a late class of viral genes. This protein is highly conserved among Orthopoxviruses. Images PMID:3025846

  11. NASP: a parallel program for identifying evolutionarily conserved nucleic acid secondary structures from nucleotide sequence alignments.

    PubMed

    Semegni, J Y; Wamalwa, M; Gaujoux, R; Harkins, G W; Gray, A; Martin, D P

    2011-09-01

    Many natural nucleic acid sequences have evolutionarily conserved secondary structures with diverse biological functions. A reliable computational tool for identifying such structures would be very useful in guiding experimental analyses of their biological functions. NASP (Nucleic Acid Structure Predictor) is a program that takes into account thermodynamic stability, Boltzmann base pair probabilities, alignment uncertainty, covarying sites and evolutionary conservation to identify biologically relevant secondary structures within multiple sequence alignments. Unique to NASP is the consideration of all this information together with a recursive permutation-based approach to progressively identify and list the most conserved probable secondary structures that are likely to have the greatest biological relevance. By focusing on identifying only evolutionarily conserved structures, NASP forgoes the prediction of complete nucleotide folds but outperforms various other secondary structure prediction methods in its ability to selectively identify actual base pairings. Downloable and web-based versions of NASP are freely available at http://web.cbio.uct.ac.za/~yves/nasp_portal.php yves@cbio.uct.ac.za Supplementary data are available at Bioinformatics online.

  12. Draft Genome Sequences of Gluconobacter cerinus CECT 9110 and Gluconobacter japonicus CECT 8443, Acetic Acid Bacteria Isolated from Grape Must

    PubMed Central

    Sainz, Florencia

    2016-01-01

    We report here the draft genome sequences of Gluconobacter cerinus strain CECT9110 and Gluconobacter japonicus CECT8443, acetic acid bacteria isolated from grape must. Gluconobacter species are well known for their ability to oxidize sugar alcohols into the corresponding acids. Our objective was to select strains to oxidize effectively d-glucose. PMID:27365351

  13. Genome Sequence of Lactobacillus rhamnosus Strain CASL, an Efficient l-Lactic Acid Producer from Cheap Substrate Cassava

    PubMed Central

    Yu, Bo; Su, Fei; Wang, Limin; Zhao, Bo; Qin, Jiayang; Ma, Cuiqing; Xu, Ping; Ma, Yanhe

    2011-01-01

    Lactobacillus rhamnosus is a type of probiotic bacteria with industrial potential for l-lactic acid production. We announce the draft genome sequence of L. rhamnosus CASL (2,855,156 bp with a G+C content of 46.6%), which is an efficient producer of l-lactic acid from cheap, nonfood substrate cassava with a high production titer. PMID:22123765

  14. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-03-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus.

  15. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed Central

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-01-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus. PMID:3355530

  16. Clostridium sticklandii, a specialist in amino acid degradation:revisiting its metabolism through its genome sequence

    PubMed Central

    2010-01-01

    Background Clostridium sticklandii belongs to a cluster of non-pathogenic proteolytic clostridia which utilize amino acids as carbon and energy sources. Isolated by T.C. Stadtman in 1954, it has been generally regarded as a "gold mine" for novel biochemical reactions and is used as a model organism for studying metabolic aspects such as the Stickland reaction, coenzyme-B12- and selenium-dependent reactions of amino acids. With the goal of revisiting its carbon, nitrogen, and energy metabolism, and comparing studies with other clostridia, its genome has been sequenced and analyzed. Results C. sticklandii is one of the best biochemically studied proteolytic clostridial species. Useful additional information has been obtained from the sequencing and annotation of its genome, which is presented in this paper. Besides, experimental procedures reveal that C. sticklandii degrades amino acids in a preferential and sequential way. The organism prefers threonine, arginine, serine, cysteine, proline, and glycine, whereas glutamate, aspartate and alanine are excreted. Energy conservation is primarily obtained by substrate-level phosphorylation in fermentative pathways. The reactions catalyzed by different ferredoxin oxidoreductases and the exergonic NADH-dependent reduction of crotonyl-CoA point to a possible chemiosmotic energy conservation via the Rnf complex. C. sticklandii possesses both the F-type and V-type ATPases. The discovery of an as yet unrecognized selenoprotein in the D-proline reductase operon suggests a more detailed mechanism for NADH-dependent D-proline reduction. A rather unusual metabolic feature is the presence of genes for all the enzymes involved in two different CO2-fixation pathways: C. sticklandii harbours both the glycine synthase/glycine reductase and the Wood-Ljungdahl pathways. This unusual pathway combination has retrospectively been observed in only four other sequenced microorganisms. Conclusions Analysis of the C. sticklandii genome and

  17. Purification, characterization, and complete amino acid sequence of a thioredoxin from a green alga, Chlamydomonas reinhardtii.

    PubMed

    Decottignies, P; Schmitter, J M; Jacquot, J P; Dutka, S; Picaud, A; Gadal, P

    1990-07-01

    Two thioredoxins (named Ch1 and Ch2 in reference to their elution pattern on an anion-exchange column) have been purified to homogeneity from the green alga, Chlamydomonas reinhardtii. In this paper, we described the properties and the sequence of the most abundant form, Ch2. Its activity in various enzymatic assays has been compared with those of Escherichia coli and spinach thioredoxins. C. reinhardtii thioredoxin Ch2 can serve as a substrate for E. coli thioredoxin reductase with a lower efficiency when compared to the homologous system. In the presence of dithiothreitol (DTT), the protein is able to catalyze the reduction of porcine insulin. Thioredoxin Ch2 is as efficient as its spinach counterpart in the DTT or light activation of corn NADP-malate dehydrogenase, but it only activates spinach fructose-1, 6-bisphosphatase at very high concentrations. The complete primary structure of the C. reinhardtii thioredoxin Ch2 was determined by automated Edman degradation of the intact protein and of peptides derived from trypsin, chymotrypsin, clostripain, and SV8 protease digestions. It consists of a polypeptide of 106 amino acids (MW 11,808) and contains the well-conserved active site sequence Trp-Cys-Gly-Pro-Cys. The sequence of the algal thioredoxin Ch2 has been compared to that of thioredoxins from other sources and has the greatest similarity (67%) with the thioredoxin from Anabaena 7119.

  18. Complete Genome Sequence of the Prototype Lactic Acid Bacterium Lactococcus lactis subsp. cremoris MG1363▿

    PubMed Central

    Wegmann, Udo; O'Connell-Motherway, Mary; Zomer, Aldert; Buist, Girbe; Shearman, Claire; Canchaya, Carlos; Ventura, Marco; Goesmann, Alexander; Gasson, Michael J.; Kuipers, Oscar P.; van Sinderen, Douwe; Kok, Jan

    2007-01-01

    Lactococcus lactis is of great importance for the nutrition of hundreds of millions of people worldwide. This paper describes the genome sequence of Lactococcus lactis subsp. cremoris MG1363, the lactococcal strain most intensively studied throughout the world. The 2,529,478-bp genome contains 81 pseudogenes and encodes 2,436 proteins. Of the 530 unique proteins, 47 belong to the COG (clusters of orthologous groups) functional category “carbohydrate metabolism and transport,” by far the largest category of novel proteins in comparison with L. lactis subsp. lactis IL1403. Nearly one-fifth of the 71 insertion elements are concentrated in a specific 56-kb region. This integration hot-spot region carries genes that are typically associated with lactococcal plasmids and a repeat sequence specifically found on plasmids and in the “lateral gene transfer hot spot” in the genome of Streptococcus thermophilus. Although the parent of L. lactis MG1363 was used to demonstrate lysogeny in Lactococcus, L. lactis MG1363 carries four remnant/satellite phages and two apparently complete prophages. The availability of the L. lactis MG1363 genome sequence will reinforce its status as the prototype among lactic acid bacteria through facilitation of further applied and fundamental research. PMID:17307855

  19. Purification, amino acid sequence and characterisation of kangaroo IGF-I.

    PubMed

    Yandell, C A; Francis, G L; Wheldrake, J F; Upton, Z

    1998-01-01

    Insulin-like growth factor-I (IGF-I) and IGF-II have been purified to homogeneity from kangaroo (Macropus fuliginosus) serum, thus this represents the first report of the purification, sequencing and characterisation of marsupial IGFs. N-Terminal protein sequencing reveals that there are six amino acid differences between kangaroo and human IGF-I. Kangaroo IGF-II has been partially sequenced and no differences were found between human and kangaroo IGF-II in the 53 residues identified. Thus the IGFs appear to be remarkably structurally conserved during mammalian radiation. In addition, in vitro characterisation of kangaroo IGF-I demonstrated that the functional properties of human, kangaroo and chicken IGF-I are very similar. In an assay measuring the ability of the proteins to stimulate protein synthesis in rat L6 myoblasts, all IGF-I proteins were found to be equally potent. The ability of all three proteins to compete for binding with radiolabelled human IGF-I to type-1 IGF receptors in L6 myoblasts and in Sminthopsis crassicaudata transformed lung fibroblasts, a marsupial cell line, was comparable. Furthermore, kangaroo and human IGF-I react equally in a human IGF-I RIA using a human reference standard, radiolabelled human IGF-I and a polyclonal antibody raised against recombinant human IGF-I. This study indicates that not only is the primary structure of eutherian and metatherian IGF-I conserved, but also the proteins appear to be functionally similar.

  20. Amino acid sequence of neurotoxin III of the scorpion Androctonus austrialis Hector.

    PubMed

    Kopeyan, C; Martinez, G; Rochat, H

    1979-03-01

    The amino acid sequence of neurotoxin III, purified from the venom of the North African scorpion Androctonus australis Hector, has been determined by Edman degradation using a liquid-phase sequencer. Carboxypeptidase A hydrolyses confirmed not only the sequence of the five last residues but also the presence of a free alpha-carboxylic group at the C-terminus. Edman degradation was conducted on one hand with the Quadrol [N,N,N',N'-tetrakis(2-hydroxypropyl)ethylene diamine] program and S-alkylated protein before or after coupling with sulfophenylisothiocynate (the first 34 residues were thus identified), on the other hand on tryptic and chymotryptic peptides with a dimethylbenzylamine program (residues 1--23 and 31--34 were confirmed, the positions of residues 35-64 were established). Neurotoxin III was found to belong to the same group of scorpion toxins active on mammals as neurotoxin I purified from the same venom (50 homologous positions exist in the two proteins).

  1. The complete amino acid sequence of ubiquitin, an adenylate cyclase stimulating polypeptide probably universal in living cells.

    PubMed

    Schlesinger, D H; Goldstein, G; Niall, H D

    1975-05-20

    The complete amino acid sequence was determined for bovine ubiquitin, and adenylate cyclase stimulating polypeptide, which is probably represented universally in living cells. Ubiquitin has a molecular weight of 8451 and consists of a single polypeptide chain containing 74 amino acid residues. It contains four arginine residues but no cysteine or trytophan residues. The first 61 amino acid residues were obtained by automated Edman degradations. Tryptic digestion of maleated ubiquitin yielded four peptide fragments that were resolved by molecular sieve chromatography and coded in order of decreasing chain length (MT-1, MT-2, MT-3, and MT-4). The automated sequenator determinations on native ubiquintin provided overlapping sequence data for three of these fragments that gave an order of MT-1, MT-3, and then MT-2; Peptide MT-4, a dipeptide, was therefore assigned to the C terminus, and the placement of peptide MT-2 was corroborated by analysis of data from carboxypeptidase digestions of maleated ubiquitin. Peptide MT-2 was domaleated and sequenced by manual Edman degradations through a single lysine residue. It was cleaved at this residue with trypsin, and the two resultant peptides were separated by ion-exchange chromatography. Manual sequencing of the C-terminal demaleated tryptic peptide of MT-2 completed the sequence of MT-2 and that of native ubiquitin. The sequence of ubiquitin was further confirmed and supported by amino acid and parital sequence anlysis of fragments obtained by digestion of maleated ubiquitin with chymotrypsin or staphylococcal protease.

  2. The amino acid sequence around the active-site cysteine and histidine residues, and the buried cysteine residues in ficin

    PubMed Central

    Husain, S. S.; Lowe, G.

    1970-01-01

    Ficin that had been prepared from the latex of Ficus glabrata by salt fractionation and chromatography on carboxymethylcellulose was completely and irreversibly inhibited with 1,3-dibromo[2-14C]acetone and then treated with N-(4-dimethylamino-3,5-dinitrophenyl)maleimide in 6m-guanidinium chloride. After reduction and carboxymethylation of the labelled protein, it was digested with trypsin and α-chymotrypsin. Two radioactive peptides and two coloured peptides were isolated chromatographically and their sequences determined. The radioactive peptides revealed the amino acid sequences around the active-site cysteine and histidine residues and showed a high degree of homology with the omino acid sequence around the active-site cysteine and histidine residues in papain. The coloured peptides allowed the amino acid sequence around the buried cysteine residue in ficin to be determined. PMID:5420043

  3. Purification, amino acid sequence and immunological characterization of Ole e 6, a cysteine-enriched allergen from olive tree pollen.

    PubMed

    Batanero, E; Ledesma, A; Villalba, M; Rodríguez, R

    1997-06-30

    The Ole e 6 allergen from olive tree pollen has been isolated by combining gel permeation and reverse-phase chromatographies. It is a single and highly acidic (pI 4.2) polypeptide chain protein. Its NH2-terminal amino acid sequence has been determined by Edman degradation. Total RNA from the olive tree pollen was isolated, and a specific cDNA was amplified by the polymerase chain reaction using a degenerate oligonucleotide primer designed according to the NH2-terminal sequence of the protein. The nucleotide sequencing of the cDNA rendered an open reading frame encoding a 50 amino acid polypeptide chain, in which two sets of the sequential motif Cys-X3-Cys-X3-Cys are present. No sequence similarity has been found between this protein and other previously described polypeptides.

  4. The amino acid sequence around the active-site cysteine and histidine residues, and the buried cysteine residue in ficin.

    PubMed

    Husain, S S; Lowe, G

    1970-04-01

    Ficin that had been prepared from the latex of Ficus glabrata by salt fractionation and chromatography on carboxymethylcellulose was completely and irreversibly inhibited with 1,3-dibromo[2-(14)C]acetone and then treated with N-(4-dimethylamino-3,5-dinitrophenyl)maleimide in 6m-guanidinium chloride. After reduction and carboxymethylation of the labelled protein, it was digested with trypsin and alpha-chymotrypsin. Two radioactive peptides and two coloured peptides were isolated chromatographically and their sequences determined. The radioactive peptides revealed the amino acid sequences around the active-site cysteine and histidine residues and showed a high degree of homology with the omino acid sequence around the active-site cysteine and histidine residues in papain. The coloured peptides allowed the amino acid sequence around the buried cysteine residue in ficin to be determined.

  5. The `heavy' subunit of the photosynthetic reaction centre from Rhodopseudomonas viridis: isolation of the gene, nucleotide and amino acid sequence

    PubMed Central

    Michel, H.; Weyer, K. A.; Gruenberg, H.; Lottspeich, F.

    1985-01-01

    The gene coding for the `heavy' subunit of the photosynthetic reaction centre from Rhodopseudomonas viridis was isolated in an expression vector. Expression of the heavy subunit in Escherichia coli was detected with antibodies raised against crystalline reaction centres. The entire subunit, and not a fusion protein, was expressed in E. coli. The protein coding region of the gene was sequenced and the amino acid sequence derived. Part of the amino acid sequence was confirmed by chemical sequence analysis of the protein. The heavy subunit consists of 258 amino acids and its mol. wt. is 28 345. It possesses one membrane-spanning α-helical segment, as was revealed by the concomitant X-ray structure analysis. ImagesFig. 1.Fig. 2. PMID:16453623

  6. Isolation, characterization, and amino acid sequences of auracyanins, blue copper proteins from the green photosynthetic bacterium Chloroflexus aurantiacus

    NASA Technical Reports Server (NTRS)

    McManus, J. D.; Brune, D. C.; Han, J.; Sanders-Loehr, J.; Meyer, T. E.; Cusanovich, M. A.; Tollin, G.; Blankenship, R. E.

    1992-01-01

    Three small blue copper proteins designated auracyanin A, auracyanin B-1, and auracyanin B-2 have been isolated from the thermophilic green gliding photosynthetic bacterium Chloroflexus aurantiacus. All three auracyanins are peripheral membrane proteins. Auracyanin A was described previously (Trost, J. T., McManus, J. D., Freeman, J. C., Ramakrishna, B. L., and Blankenship, R. E. (1988) Biochemistry 27, 7858-7863) and is not glycosylated. The two B forms are glycoproteins and have almost identical properties to each other, but are distinct from the A form. The sodium dodecyl sulfate-polyacrylamide gel electrophoresis apparent monomer molecular masses are 14 (A), 18 (B-2), and 22 (B-1) kDa. The amino acid sequences of the B forms are presented. All three proteins have similar absorbance, circular dichroism, and resonance Raman spectra, but the electron spin resonance signals are quite different. Laser flash photolysis kinetic analysis of the reactions of the three forms of auracyanin with lumiflavin and flavin mononucleotide semiquinones indicates that the site of electron transfer is negatively charged and has an accessibility similar to that found in other blue copper proteins. Copper analysis indicates that all three proteins contain 1 mol of copper per mol of protein. All three auracyanins exhibit a midpoint redox potential of +240 mV. Light-induced absorbance changes and electron spin resonance signals suggest that auracyanin A may play a role in photosynthetic electron transfer. Kinetic data indicate that all three proteins can donate electrons to cytochrome c-554, the electron donor to the photosynthetic reaction center.

  7. Isolation, characterization, and amino acid sequences of auracyanins, blue copper proteins from the green photosynthetic bacterium Chloroflexus aurantiacus

    NASA Technical Reports Server (NTRS)

    McManus, J. D.; Brune, D. C.; Han, J.; Sanders-Loehr, J.; Meyer, T. E.; Cusanovich, M. A.; Tollin, G.; Blankenship, R. E.

    1992-01-01

    Three small blue copper proteins designated auracyanin A, auracyanin B-1, and auracyanin B-2 have been isolated from the thermophilic green gliding photosynthetic bacterium Chloroflexus aurantiacus. All three auracyanins are peripheral membrane proteins. Auracyanin A was described previously (Trost, J. T., McManus, J. D., Freeman, J. C., Ramakrishna, B. L., and Blankenship, R. E. (1988) Biochemistry 27, 7858-7863) and is not glycosylated. The two B forms are glycoproteins and have almost identical properties to each other, but are distinct from the A form. The sodium dodecyl sulfate-polyacrylamide gel electrophoresis apparent monomer molecular masses are 14 (A), 18 (B-2), and 22 (B-1) kDa. The amino acid sequences of the B forms are presented. All three proteins have similar absorbance, circular dichroism, and resonance Raman spectra, but the electron spin resonance signals are quite different. Laser flash photolysis kinetic analysis of the reactions of the three forms of auracyanin with lumiflavin and flavin mononucleotide semiquinones indicates that the site of electron transfer is negatively charged and has an accessibility similar to that found in other blue copper proteins. Copper analysis indicates that all three proteins contain 1 mol of copper per mol of protein. All three auracyanins exhibit a midpoint redox potential of +240 mV. Light-induced absorbance changes and electron spin resonance signals suggest that auracyanin A may play a role in photosynthetic electron transfer. Kinetic data indicate that all three proteins can donate electrons to cytochrome c-554, the electron donor to the photosynthetic reaction center.

  8. cDNA and derived amino acid sequence of ethanol-inducible rabbit liver cytochrome P-450 isozyme 3a (P-450ALC).

    PubMed Central

    Khani, S C; Zaphiropoulos, P G; Fujita, V S; Porter, T D; Koop, D R; Coon, M J

    1987-01-01

    Administration of ethanol to rabbits is known to induce a unique liver microsomal cytochrome P-450, termed isozyme 3a or P-450ALC, which is responsible for the increased oxidation of ethanol and other alcohols and the activation of toxic or carcinogenic compounds such as acetaminophen and N-nitrosodimethylamine. To further characterize this cytochrome P-450 we have identified cDNA clones to isozyme 3a by immunoscreening, DNA hybridization, and hybridization-selection. The cDNA sequence determined from two overlapping clones contains an open reading frame of 1416 nucleotides, and the first 25 amino acids of this reading frame correspond to residues 21-45 of cytochrome P-450 3a. The complete polypeptide, including residues 1 to 20, contains 492 amino acids and has a molecular weight of 56,820. Cytochrome P-450 3a is approximately 55% identical in sequence to P-450 isozymes 1 and 3b and 48% identical to isozyme 2. Hybridization of clone p3a-2 to electrophoretically fractionated rabbit liver poly(A)+ RNA revealed multiple bands, but, with a probe derived from the 3' nontranslated portion of this cDNA, only a 1.9-kilobase band was observed. Treatment of rabbits with imidazole, which increases the content of isozyme 3a, resulted in a transient increase in form 3a mRNA, but this was judged to be insufficient to account for the known 4.5-fold increase in form 3a protein. Genomic DNA analysis indicated that the cytochrome P-450 3a gene does not belong to a large subfamily. Images PMID:3027695

  9. Synthesis and use of universal sequence probes in fluorogenic multi-strand hybridisation complexes for economical nucleic acid testing.

    PubMed

    French, David J; Richardson, James A; Howard, Rebecca L; Brown, Tom; Debenham, Paul G

    2015-08-01

    Analysis of nucleic acid amplification products has become the gold standard for applications such as pathogen detection and characterisation of single nucleotide polymorphisms and short tandem repeat sequences. The development of real-time PCR and melting curve analysis using fluorescent probes has simplified nucleic acid analyses. However, the cost of probe synthesis can be prohibitive when developing large panels of tests. We describe an economic two-stage method for probe synthesis, and a new method for nucleic acid sequence analysis which together considerably reduce costs. The analysis method utilises three-strand and four-strand hybridisation complexes for the detection and identification of nucleic acid target sequences by real-time PCR and fluorescence melting. Copyright © 2015 Elsevier Ltd. All rights reserved.

  10. Fragmentation Characteristics of Deprotonated N-linked Glycopeptides: Influences of Amino Acid Composition and Sequence

    NASA Astrophysics Data System (ADS)

    Nishikaze, Takashi; Kawabata, Shin-ichirou; Tanaka, Koichi

    2014-06-01

    Glycopeptide structural analysis using tandem mass spectrometry is becoming a common approach for elucidating site-specific N-glycosylation. The analysis is generally performed in positive-ion mode. Therefore, fragmentation of protonated glycopeptides has been extensively investigated; however, few studies are available on deprotonated glycopeptides, despite the usefulness of negative-ion mode analysis in detecting glycopeptide signals. Here, large sets of glycopeptides derived from well-characterized glycoproteins were investigated to understand the fragmentation behavior of deprotonated N-linked glycopeptides under low-energy collision-induced dissociation (CID) conditions. The fragment ion species were found to be significantly variable depending on their amino acid sequence and could be classified into three types: (i) glycan fragment ions, (ii) glycan-lost fragment ions and their secondary cleavage products, and (iii) fragment ions with intact glycan moiety. The CID spectra of glycopeptides having a short peptide sequence were dominated by type (i) glycan fragments (e.g., 2,4AR, 2,4AR-1, D, and E ions). These fragments define detailed structural features of the glycan moiety such as branching. For glycopeptides with medium or long peptide sequences, the major fragments were type (ii) ions (e.g., [peptide + 0,2X0-H]- and [peptide-NH3-H]-). The appearance of type (iii) ions strongly depended on the peptide sequence, and especially on the presence of Asp, Asn, and Glu. When a glycosylated Asn is located on the C-terminus, an interesting fragment having an Asn residue with intact glycan moiety, [glycan + Asn-36]-, was abundantly formed. Observed fragments are reasonably explained by a combination of existing fragmentation rules suggested for N-glycans and peptides.

  11. Purification and partial amino acid sequence of the chloroplast cytochrome b-559.

    PubMed

    Widger, W R; Cramer, W A; Hermodson, M; Meyer, D; Gullifor, M

    1984-03-25

    The hydrophobic cytochrome b-559, purified from unstacked, ethanol-washed spinach thylakoid membranes, using extraction with 2% Triton X-100 in 4 M urea and three chromatographic steps in the presence of protease inhibitors, has a dominant band on sodium dodecyl sulfate-urea gels corresponding to Mr = 10,000. The yield of this preparation is 30-50% (5-10 mg) starting with 600 mg of chlorophyll. The heme content yields a calculated molecular weight of no more than 17,500/heme, and perhaps somewhat smaller after correction for impurities. The Mr = 10,000 band is stained by the tetramethylbenzidine-H2O2 heme reagent on lithium dodecyl sulfate gels run at 0 degrees C. The Mr = 10,000 protein, further separated by high performance liquid chromatography, contains a unique NH2 terminus that is not blocked, and the amino acid sequence for the first 27 residues is NH2-Ser-Gly-Ser-Thr-Gly-Glu-Arg-Ser-Phe-Ala-Asp-Ile-Ile-Thr-Ser-Ile-Arg-Tyr-Trp -Val-Ile-X-Ser-Ile-Thr-Ile-Pro. . . COOH. Approximately 55% of the amino acids are hydrophobic, based on amino acid analysis of the Mr = 10,000 peptide, which also indicated the presence of at least one histidine. Only one cytochrome b-559 component could be identified, whose yield indicated that it arises from a single b-559 protein in chloroplasts corresponding to the in situ high potential cytochrome of the chloroplast photosystem II.

  12. An amino acid sequence motif sufficient for subnuclear localization of an arginine/serine-rich splicing factor.

    PubMed

    Hedley, M L; Amrein, H; Maniatis, T

    1995-12-05

    We have identified an amino acid sequence in the Drosophila Transformer (Tra) protein that is capable of directing a heterologous protein to nuclear speckles, regions of the nucleus previously shown to contain high concentrations of spliceosomal small nuclear RNAs and splicing factors. This sequence contains a nucleoplasmin-like bipartite nuclear localization signal (NLS) and a repeating arginine/serine (RS) dipeptide sequence adjacent to a short stretch of basic amino acids. Sequence comparisons from a number of other splicing factors that colocalize to nuclear speckles reveal the presence of one or more copies of this motif. We propose a two-step subnuclear localization mechanism for splicing factors. The first step is transport across the nuclear envelope via the nucleoplasmin-like NLS, while the second step is association with components in the speckled domain via the RS dipeptide sequence.

  13. Genome sequence of vanilla distortion mosaic virus infecting Coriandrum sativum.

    PubMed

    Adams, I P; Rai, S; Deka, M; Harju, V; Hodges, T; Hayward, G; Skelton, A; Fox, A; Boonham, N

    2014-12-01

    The 9573-nucleotide genome of a potyvirus was sequenced from a Coriandrum sativum plant from India with viral symptoms. On analysis, this virus was shown to have greater than 85 % nucleotide sequence identity to vanilla distortion mosaic virus (VDMV). Analysis of the putative coat protein sequence confirmed that this virus was in fact VDMV, with greater than 91 % amino acid sequence identity. The genome appears to encode a 3083-amino-acid polyprotein potentially cleaved into the 10 mature proteins expected in potyviruses. Phylogenetic analysis confirmed that VDMV is a distinct but ungrouped member of the genus Potyvirus.

  14. Low sequence identity but high structural and functional conservation: The case of Hsp70/Hsp90 organizing protein (Hop/Sti1) of Leishmania braziliensis.

    PubMed

    Batista, Fernanda A H; Seraphim, Thiago V; Santos, Clelton A; Gonzaga, Marisvanda R; Barbosa, Leandro R S; Ramos, Carlos H I; Borges, Júlio C

    2016-06-15

    Parasites belonging to the genus Leishmania are subjected to extensive environmental changes during their life cycle; molecular chaperones/co-chaperones act as protagonists in this scenario to maintain cellular homeostasis. Hop/Sti1 is a co-chaperone that connects the Hsp90 and Hsp70 systems, modulating their ATPase activities and affecting the fate of client proteins because it facilitates their transfer from the Hsp70 to the Hsp90 chaperone. Hop/Sti1 is one of the most prevalent co-chaperones, highlighting its importance despite the relatively low sequence identity among orthologue proteins. This multi-domain protein comprises three tetratricopeptides domains (TPR1, TPR2A and TPR2B) and two Asp/Pro-rich domains. Given the importance of Hop/Sti1 for the chaperone system and for Leishmania protozoa viability, the Leishmania braziliensis Hop (LbHop) and a truncated mutant (LbHop(TPR2AB)) were characterized. Structurally, both proteins are α-helix-rich and highly elongated monomeric proteins. Functionally, they inhibited the ATPase activity of Leishmania braziliensis Hsp90 (LbHsp90) to a similar extent, and the thermodynamic parameters of their interactions with LbHsp90 were similar, indicating that TPR2A-TPR2B forms the functional center for the LbHop interaction with LbHsp90. These results highlight the structural and functional similarity of Hop/Sti1 proteins, despite their low sequence conservation compared to the Hsp70 and Hsp90 systems, which are phylogenetic highly conserved. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. Deduced amino acid sequence of a putative sodium channel from the scyphozoan jellyfish Cyanea capillata.

    PubMed Central

    Anderson, P A; Holman, M A; Greenberg, R M

    1993-01-01

    Members of the phylum Cnidaria are the lowest extant organisms to possess a nervous system and are the first that are known to contain cells that produce action potentials carried exclusively by Na+ ions. They thus occupy an important position in the evolution of Na+ channels. A cDNA encoding a 198-kDa protein with high sequence identity to known Na+ channels was isolated from the scyphozoan jellyfish Cyanea capillata. The similarity between this and other Na+ channels is greatest in the transmembrane segments and the putative pore region and less so in the cytoplasmic loops that link the four domains of the protein. Phylogenetic analysis of the deduced protein reveals that it is closely related to known Na+ channels, particularly those of squid and Drosophila, and more distantly separated from Ca2+ channels. Scrutiny of the Cyanea channel in regions corresponding to those purported to form the tetrodotoxin receptor and selectivity filter of Na+ channels in higher animals reveals several anomalies that suggest that current models of the location of the tetrodotoxin binding site and Na+ channel selectivity filter are incomplete. Images Fig. 1 Fig. 2 Fig. 3 PMID:8394021

  16. Deduced amino acid sequence of a putative sodium channel from the scyphozoan jellyfish Cyanea capillata.

    PubMed

    Anderson, P A; Holman, M A; Greenberg, R M

    1993-08-01

    Members of the phylum Cnidaria are the lowest extant organisms to possess a nervous system and are the first that are known to contain cells that produce action potentials carried exclusively by Na+ ions. They thus occupy an important position in the evolution of Na+ channels. A cDNA encoding a 198-kDa protein with high sequence identity to known Na+ channels was isolated from the scyphozoan jellyfish Cyanea capillata. The similarity between this and other Na+ channels is greatest in the transmembrane segments and the putative pore region and less so in the cytoplasmic loops that link the four domains of the protein. Phylogenetic analysis of the deduced protein reveals that it is closely related to known Na+ channels, particularly those of squid and Drosophila, and more distantly separated from Ca2+ channels. Scrutiny of the Cyanea channel in regions corresponding to those purported to form the tetrodotoxin receptor and selectivity filter of Na+ channels in higher animals reveals several anomalies that suggest that current models of the location of the tetrodotoxin binding site and Na+ channel selectivity filter are incomplete.

  17. Sinorhizobium meliloti strains TII7 and A5 by Multilocus Sequence Typing (MLST) have chromsomes identical with Rm1021 and form an effective and ineffective symbiosis with Medicago truncatula line Jemalong A17, respectively

    USDA-ARS?s Scientific Manuscript database

    The strains TII7 and A5 formed an effective and ineffective symbiosis with Medicago truncatula Jemalong A17, respectively. Both were shown to have identical chromsomes with strains Rm1021 and RCR2011 using a Multilocus Sequence Typing method. The 2260 bp segments of DNA stretching from the 3’ end ...

  18. Haemoglobins of the shark, Heterodontus portusjacksoni. III. Amino acid sequence of the beta-chain.

    PubMed

    Fisher, W K; Nash, A R; Thompson, E O

    1977-12-01

    The amino acid sequence of the beta-chain of the principal haemoglobin from the shark H. portusjacksoni has been determined. The chain has 141 residues, the same as that of mammalian alpha-chains and less than the 146 residues of mammalian beta-chains or the 148 residues of the alpha-chain from the tetrameric shark haemoglobin. The sequence was deduced from the sequences of peptides obtained by digestion of the globin or its cyanogen bromide fragments with trypsin, chymotrypsin, pepsin and papain. The difference in length of the beta-chain is most readily accounted for by the absence of the D helix. This small helical section is normally present in myoglobins and beta-globins but absent in alpha-chains. The deduction that it is absent from shark beta-chain is based on consideration of homology. The beta-chain shows the insertion of histidine beta2 and the deletions corresponding to residues A17 and AB1 relative to alpha-and myoglobin chains. The reactive thiol group in shark haemoglobin was shown by radioactive labelling to be residue 51 in the beta-chain, immediately preceding the E helix. The amino acid sequence of shark beta-chain shows 92 differences from human beta-chain, significantly more differences than shown by chicken or frog beta-chains, in line with its earlier time of divergence. If the tertiary structure of the shark beta-chain is the same as that of the horse then there are two changes in the alpha1beta2 contact site in oxyhaemoglobin and an additional one in deoxyhaemoglobin. When both alpha- and beta-chain contacts are considered there is a total of nine changes in residues involved in the alpha1beta2 contacts. There is no Bohr effect in shark haemoglobin, and of the residues normally involved in this effect the C-terminal histidine residue of the beta-chain is present, but the aspartyl (FG1) residue to which it is salt-linked is not, being replaced by a glutamyl residue.

  19. Deaf Identity.

    ERIC Educational Resources Information Center

    Mottez, Bernard

    1990-01-01

    Explores the rarely studied concept of "Deaf Identity" through discussion of such issues as the typology of the elements of identity; destiny; other ethnic or racial group identity; awareness of deafness; and deaf culture. (CB)

  20. Amino acid sequence of the Bb fragment from complement Factor B. Sequence of the major cyanogen bromide-cleavage peptide (CB-II) and completion of the sequence of the Bb fragment.

    PubMed Central

    Christie, D L; Gagnon, J

    1983-01-01

    The amino acid sequence of peptide CB-II, the major product (mol.wt. 30 000) of CNBr cleavage of fragment Bb from human complement Factor B, is given. The sequence was obtained from peptides derived by trypsin cleavage of peptide CB-II and clostripain digestion of fragment Bb. Cleavage of two Asn-Gly bonds in peptide CB-II was also found useful. These results, along with those presented in the preceding paper [Gagnon & Christie (1983) Biochem. J. 209, 51-60], yield the complete sequence of the 505 amino acid residues of fragment Bb. The C-terminal half of the molecule shows strong homology of sequence with serine proteinases. Factor B has a catalytic chain (fragment Bb) with a molecular weight twice that of proteinases previously described, suggesting that it is a novel type of serine proteinase, probably with a different activation mechanism. PMID:6342610

  1. Alignment of 700 globin sequences: extent of amino acid substitution and its correlation with variation in volume.

    PubMed Central

    Kapp, O. H.; Moens, L.; Vanfleteren, J.; Trotman, C. N.; Suzuki, T.; Vinogradov, S. N.

    1995-01-01

    Seven-hundred globin sequences, including 146 nonvertebrate sequences, were aligned on the basis of conservation of secondary structure and the avoidance of gap penalties. Of the 182 positions needed to accommodate all the globin sequences, only 84 are common to all, including the absolutely conserved PheCD1 and HisF8. The mean number of amino acid substitutions per position ranges from 8 to 13 for all globins and 5 to 9 for internal positions. Although the total sequence volumes have a variation approximately 2-3%, the variation in volume per position ranges from approximately 13% for the internal to approximately 21% for the surface positions. Plausible correlations exist between amino acid substitution and the variation in volume per position for the 84 common and the internal but not the surface positions. The amino acid substitution matrix derived from the 84 common positions was used to evaluate sequence similarity within the globins and between the globins and phycocyanins C and colicins A, via calculation of pairwise similarity scores. The scores for globin-globin comparisons over the 84 common positions overlap the globin-phycocyanin and globin-colicin scores, with the former being intermediate. For the subset of internal positions, overlap is minimal between the three groups of scores. These results imply a continuum of amino acid sequences able to assume the common three-on-three alpha-helical structure and suggest that the determinants of the latter include sites other than those inaccessible to solvent. PMID:8535255

  2. PubDNA Finder: a web database linking full-text articles to sequences of nucleic acids.

    PubMed

    García-Remesal, Miguel; Cuevas, Alejandro; Pérez-Rey, David; Martín, Luis; Anguita, Alberto; de la Iglesia, Diana; de la Calle, Guillermo; Crespo, José; Maojo, Víctor

    2010-11-01

    PubDNA Finder is an online repository that we have created to link PubMed Central manuscripts to the sequences of nucleic acids appearing in them. It extends the search capabilities provided by PubMed Central by enabling researchers to perform advanced searches involving sequences of nucleic acids. This includes, among other features (i) searching for papers mentioning one or more specific sequences of nucleic acids and (ii) retrieving the genetic sequences appearing in different articles. These additional query capabilities are provided by a searchable index that we created by using the full text of the 176 672 papers available at PubMed Central at the time of writing and the sequences of nucleic acids appearing in them. To automatically extract the genetic sequences occurring in each paper, we used an original method we have developed. The database is updated monthly by automatically connecting to the PubMed Central FTP site to retrieve and index new manuscripts. Users can query the database via the web interface provided. PubDNA Finder can be freely accessed at http://servet.dia.fi.upm.es:8080/pubdnafinder

  3. Sequence-defined shuttles for targeted nucleic acid and protein delivery.

    PubMed

    Röder, Ruth; Wagner, Ernst

    2014-01-01

    Molecular medicine opens into a space of novel specific therapeutic agents: intracellularly active drugs such as peptides, proteins or nucleic acids, which are not able to cross cell membranes and enter the intracellular space on their own. Through the development of cell-targeted shuttles for specific delivery, this restriction in delivery has the potential to be converted into an advantage. On the one hand, due to the multiple extra- and intracellular barriers, such carrier systems need to be multifunctional. On the other hand, they must be precise and reproducibly manufactured due to pharmaceutical reasons. Here we review the design of precise sequence-defined delivery carriers, including solid-phase synthesized peptides and nonpeptidic oligomers, or nucleotide-based carriers such as aptamers and origami nanoboxes.

  4. Evolutionary connections of biological kingdoms based on protein and nucleic acid sequence evidence

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.

    1983-01-01

    Prokaryotic and eukaryotic evolutionary trees are developed from protein and nucleic-acid sequences by the methods of numerical taxonomy. Trees are presented for bacterial ferredoxins, 5S ribosomal RNA, c-type cytochromes , cytochromes c2 and c', and 5.8S ribosomal RNA; the implications for early evolution are discussed; and a composite tree showing the branching of the anaerobes, aerobes, archaebacteria, and eukaryotes is shown. Single lines are found for all oxygen-evolving photosynthetic forms and for the salt-loving and high-temperature forms of archaebacteria. It is argued that the eukaryote mitochondria, chloroplasts, and cytoplasmic host material are descended from free-living prokaryotes that formed symbiotic associations, with more than one symbiotic event involved in the evolution of each organelle.

  5. Identification of amino acid sequences in the polyomavirus capsid proteins that serve as nuclear localization signals

    NASA Technical Reports Server (NTRS)

    Chang, D.; Haynes, J. I. Jr; Brady, J. N.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)

    1993-01-01

    The molecular mechanism participating in the transport of newly synthesized proteins from the cytoplasm to the nucleus in mammalian cells is poorly understood. Recently, the nuclear localization signal sequences (NLS) of many nuclear proteins have been identified, and most have been found to be composed of a highly basic amino acid stretch. A genetic "subtractive" and a biochemical "additive" approach were used in our studies to identify the NLS's of the polyomavirus structural capsid proteins. An NLS was identified at the N-terminus (Ala1-Pro-Lys-Arg-Lys-Ser-Gly-Val-Ser-Lys-Cys11) of the major capsid protein VP1 and at the C-terminus (Glu307 -Glu-Asp-Gly-Pro-Glu-Lys-Lys-Lys-Arg-Arg-Leu318) of the VP2/VP3 minor capsid proteins.

  6. Identification of amino acid sequences in the polyomavirus capsid proteins that serve as nuclear localization signals

    NASA Technical Reports Server (NTRS)

    Chang, D.; Haynes, J. I. Jr; Brady, J. N.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)

    1993-01-01

    The molecular mechanism participating in the transport of newly synthesized proteins from the cytoplasm to the nucleus in mammalian cells is poorly understood. Recently, the nuclear localization signal sequences (NLS) of many nuclear proteins have been identified, and most have been found to be composed of a highly basic amino acid stretch. A genetic "subtractive" and a biochemical "additive" approach were used in our studies to identify the NLS's of the polyomavirus structural capsid proteins. An NLS was identified at the N-terminus (Ala1-Pro-Lys-Arg-Lys-Ser-Gly-Val-Ser-Lys-Cys11) of the major capsid protein VP1 and at the C-terminus (Glu307 -Glu-Asp-Gly-Pro-Glu-Lys-Lys-Lys-Arg-Arg-Leu318) of the VP2/VP3 minor capsid proteins.

  7. Evolutionary connections of biological kingdoms based on protein and nucleic acid sequence evidence

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.

    1983-01-01

    Prokaryotic and eukaryotic evolutionary trees are developed from protein and nucleic-acid sequences by the methods of numerical taxonomy. Trees are presented for bacterial ferredoxins, 5S ribosomal RNA, c-type cytochromes , cytochromes c2 and c', and 5.8S ribosomal RNA; the implications for early evolution are discussed; and a composite tree showing the branching of the anaerobes, aerobes, archaebacteria, and eukaryotes is shown. Single lines are found for all oxygen-evolving photosynthetic forms and for the salt-loving and high-temperature forms of archaebacteria. It is argued that the eukaryote mitochondria, chloroplasts, and cytoplasmic host material are descended from free-living prokaryotes that formed symbiotic associations, with more than one symbiotic event involved in the evolution of each organelle.

  8. Real-Time Nucleic Acid Sequence-Based Amplification Assay for Detection of Hepatitis A Virus

    PubMed Central

    Abd El Galil, Khaled H.; El Sokkary, M. A.; Kheira, S. M.; Salazar, Andre M.; Yates, Marylynn V.; Chen, Wilfred; Mulchandani, Ashok

    2005-01-01

    A nucleic acid sequence-based amplification (NASBA) assay in combination with a molecular beacon was developed for the real-time detection and quantification of hepatitis A virus (HAV). A 202-bp, highly conserved 5′ noncoding region of HAV was targeted. The sensitivity of the real-time NASBA assay was tested with 10-fold dilutions of viral RNA, and a detection limit of 1 PFU was obtained. The specificity of the assay was demonstrated by testing with other environmental pathogens and indicator microorganisms, with only HAV positively identified. When combined with immunomagnetic separation, the NASBA assay successfully detected as few as 10 PFU from seeded lake water samples. Due to its isothermal nature, its speed, and its similar sensitivity compared to the real-time RT-PCR assay, this newly reported real-time NASBA method will have broad applications for the rapid detection of HAV in contaminated food or water. PMID:16269748

  9. Enzymatic properties, crystallization, and deduced amino acid sequence of an alkaline endoglucanase from Bacillus circulans.

    PubMed

    Hakamada, Yoshihiro; Endo, Keiji; Takizawa, Shuichi; Kobayashi, Tohru; Shirai, Tsuyoshi; Yamane, Takashi; Ito, Susumu

    2002-04-15

    A high-isoelectric-point (pI), alkaline endo-1,4-beta-glucanase (Egl-257) of Bacillus circulans KSM-N257 was purified to homogeneity and crystallized. The purified enzyme hydrolyzed carboxymethyl cellulose (CMC) with optima of pH 8.5 and 55 degrees C. The molecular mass was 43 kDa, and the pI was pH 9.3. The structural gene contained a single open reading frame of 1221 bp, corresponding to 407 amino acids (aa), including a 30-aa signal peptide (377 aa and 41,680 Da for the mature enzyme). Egl-257 hydrolyzed lichenan and showed 76.3% aa identity to a lichenase from B. circulans WL-12 belonging to glycosyl hydrolase family 8 but did not hydrolyze laminarin, curdran, and xylan at all. This indicates that Egl-257 is a true endo-1,4-beta-glucanase. However, this enzyme was not active on p-nitrophenyl beta-D-cellotrioside and p-nitrophenyl beta-D-cellotetraoside. It was crystallized by the hanging-drop vapor-diffusion method with phosphate plus CdCl(2) as precipitant. Pyramid-like crystals were formed, and they diffracted X-rays beyond 2.2 A resolution. It belongs to the space group P2(1)2(1)2(1) with unit cell parameters of a=62.5 A, b=71.7 A, and c=88.6 A.

  10. Formation of specific amino acid sequences during carbodiimide-mediated condensation of amino acids in aqueous solution, and computer-simulated sequence generation

    NASA Astrophysics Data System (ADS)

    Hartmann, Jürgen; Nawroth, Thomas; Dose, Klaus

    1984-12-01

    Carbodiimide-mediated peptide synthesis in aqueous solution has been studied with respect to self-ordering of amino acids. The copolymerisation of amino acids in the presence of glutamic acid or pyroglutamic acid leads to short pyroglutamyl peptides. Without pyroglutamic acid the formation of higher polymers is favoured. The interactions of the amino acids and the peptides, however, are very complex. Therefore, the experimental results are rather difficult to explain. Some of the experimental results, however, can be explained with the aid of computer simulation programs. Regarding only the tripeptide fraction the copolymerisation of pyroGlu, Ala and Leu, as well as the simulated copolymerisation lead to pyroGlu-Ala-Leu as the main reaction product. The amino acid composition of the insoluble peptides formed during the copolymerisation of Ser, Gly, Ala, Val, Phe, Leu and Ile corresponds in part to the computer-simulated copolymerisation data.

  11. Enzyme-Free Translation of DNA into Sequence-Defined Synthetic Polymers Structurally Unrelated to Nucleic Acids

    PubMed Central

    Niu, Jia; Hili, Ryan; Liu, David R.

    2014-01-01

    The translation of DNA sequences into corresponding biopolymers enables the production, function, and evolution of the macromolecules of life. In contrast, methods to generate sequence-defined synthetic polymers with similar levels of control have remained elusive. Here we report the development of a DNA-templated translation system that enables the enzyme-free translation of DNA templates into sequence-defined synthetic polymers that have no necessary structural relationship with nucleic acids. We demonstrate the efficiency, sequence-specificity, and generality of this translation system by oligomerizing building blocks including polyethylene glycol (PEG), α-(d)-peptides, and β-peptides in a DNA-programmed manner. Sequence-defined synthetic polymers with molecular weights of 26 kDa containing 16 consecutively coupled building blocks and 90 densely functionalized β-amino acid residues were translated from DNA templates using this strategy. We integrated the DNA-templated translation system developed here into a complete cycle of translation, coding sequence replication, template regeneration, and re-translation suitable for the iterated in vitro selection of functional sequence-defined synthetic polymers unrelated in structure to nucleic acids. PMID:23511416

  12. Enzyme-free translation of DNA into sequence-defined synthetic polymers structurally unrelated to nucleic acids.

    PubMed

    Niu, Jia; Hili, Ryan; Liu, David R

    2013-04-01

    The translation of DNA sequences into corresponding biopolymers enables the production, function and evolution of the macromolecules of life. In contrast, methods to generate sequence-defined synthetic polymers with similar levels of control have remained elusive. Here, we report the development of a DNA-templated translation system that enables the enzyme-free translation of DNA templates into sequence-defined synthetic polymers that have no necessary structural relationship with nucleic acids. We demonstrate the efficiency, sequence-specificity and generality of this translation system by oligomerizing building blocks including polyethylene glycol, α-(D)-peptides, and β-peptides in a DNA-programmed manner. Sequence-defined synthetic polymers with molecular weights of 26 kDa containing 16 consecutively coupled building blocks and 90 densely functionalized β-amino acid residues were translated from DNA templates using this strategy. We integrated the DNA-templated translation system developed here into a complete cycle of translation, coding sequence replication, template regeneration and re-translation suitable for the iterated in vitro selection of functional sequence-defined synthetic polymers unrelated in structure to nucleic acids.

  13. Detection of Vibrio cholerae by Real-Time Nucleic Acid Sequence-Based Amplification▿

    PubMed Central

    Fykse, Else M.; Skogan, Gunnar; Davies, William; Olsen, Jaran Strand; Blatny, Janet M.

    2007-01-01

    A multitarget molecular beacon-based real-time nucleic acid sequence-based amplification (NASBA) assay for the specific detection of Vibrio cholerae has been developed. The genes encoding the cholera toxin (ctxA), the toxin-coregulated pilus (tcpA; colonization factor), the ctxA toxin regulator (toxR), hemolysin (hlyA), and the 60-kDa chaperonin product (groEL) were selected as target sequences for detection. The beacons for the five different genetic targets were evaluated by serial dilution of RNA from V. cholerae cells. RNase treatment of the nucleic acids eliminated all NASBA, whereas DNase treatment had no effect, showing that RNA and not DNA was amplified. The specificity of the assay was investigated by testing several isolates of V. cholerae, other Vibrio species, and Bacillus cereus, Salmonella enterica, and Escherichia coli strains. The toxR, groEL, and hlyA beacons identified all V. cholerae isolates, whereas the ctxA and tcpA beacons identified the O1 toxigenic clinical isolates. The NASBA assay detected V. cholerae at 50 CFU/ml by using the general marker groEL and tcpA that specifically indicates toxigenic strains. A correlation between cell viability and NASBA was demonstrated for the ctxA, toxR, and hlyA targets. RNA isolated from different environmental water samples spiked with V. cholerae was specifically detected by NASBA. These results indicate that NASBA can be used in the rapid detection of V. cholerae from various environmental water samples. This method has a strong potential for detecting toxigenic strains by using the tcpA and ctxA markers. The entire assay including RNA extraction and NASBA was completed within 3 h. PMID:17220262

  14. The Use of Orthologous Sequences to Predict the Impact of Amino Acid Substitutions on Protein Function

    PubMed Central

    Rine, Jasper

    2010-01-01

    Computational predictions of the functional impact of genetic variation play a critical role in human genetics research. For nonsynonymous coding variants, most prediction algorithms make use of patterns of amino acid substitutions observed among homologous proteins at a given site. In particular, substitutions observed in orthologous proteins from other species are often assumed to be tolerated in the human protein as well. We examined this assumption by evaluating a panel of nonsynonymous mutants of a prototypical human enzyme, methylenetetrahydrofolate reductase (MTHFR), in a yeast cell-based functional assay. As expected, substitutions in human MTHFR at sites that are well-conserved across distant orthologs result in an impaired enzyme, while substitutions present in recently diverged sequences (including a 9-site mutant that “resurrects” the human-macaque ancestor) result in a functional enzyme. We also interrogated 30 sites with varying degrees of conservation by creating substitutions in the human enzyme that are accepted in at least one ortholog of MTHFR. Quite surprisingly, most of these substitutions were deleterious to the human enzyme. The results suggest that selective constraints vary between phylogenetic lineages such that inclusion of distant orthologs to infer selective pressures on the human enzyme may be misleading. We propose that homologous proteins are best used to reconstruct ancestral sequences and infer amino acid conservation among only direct lineal ancestors of a particular protein. We show that such an “ancestral site preservation” measure outperforms other prediction methods, not only in our selected set for MTHFR, but also in an exhaustive set of E. coli LacI mutants. PMID:20523748

  15. Purification, amino acid sequence, and some properties of rabbit kidney lysozyme.

    PubMed

    Ito, Y; Yamada, H; Nakamura, S; Imoto, T

    1990-02-01

    The lysozyme (rabbit kidney lysozyme) from the homogenate of rabbit kidney (Japanese white) was purified by repeated cation-exchange chromatography on Bio-Rex 70. The amino acid sequence was determined by automated gas-phase Edman degradation of the peptides obtained from the digestion of reduced and S-carboxymethylated rabbit lysozyme with Achromobacter protease I (lysyl endopeptidase). The sequence thus determined was KIYERCELARTLKKLGLDGYKGVSLANWMCLAKWESSYNTRATNYNPGDKSTDYGIFQ INSRYWCNDGKTPRAVNACHIPCSDLLKDDITQAVACAKRVVSDPQGIRAWVAWRNHCQ NQDLTPYIRGCGV, indicating 25 amino acid substitutions from human lysozyme. The lytic activity of rabbit lysozyme against Micrococcus lysodeikticus at pH 7, ionic strength of 0.1, and 30 degrees C was found to be 190 and 60% of those of hen and human lysozymes, respectively. The lytic activity-pH profile of rabbit lysozyme was slightly different from those of hen and human lysozymes. While hen and human lysozymes had wide optimum activities at around pH 5.5-8.5, the optimum activity of rabbit lysozyme was at around pH 5.5-7.0. The high proline content (five residues per molecule compared with two prolines per molecule in hen or human lysozyme) is one of the interesting features of rabbit lysozyme. The transition temperatures for the unfolding of rabbit, human, and hen lysozymes in 3 M guanidine hydrochloride at pH 5.5 were 51.2, 45.5, and 45.4 degrees C, respectively, indicating that rabbit lysozyme is stabler than the other two lysozymes. The high proline content may be responsible for the increased stability of rabbit lysozyme.

  16. Phylogenetic analysis of beta-papillomaviruses as inferred from nucleotide and amino acid sequence data.

    PubMed

    Gottschling, Marc; Köhler, Anja; Stockfleth, Eggert; Nindl, Ingo

    2007-01-01

    Human papillomaviruses (HPV) of the beta-group seem to be involved in the pathogenesis of non-melanoma skin cancer. Papillomaviruses are host specific and are considered closely co-evolving with their hosts. Evolutionary incongruence between early genes and late genes has been reported among oncogenic genital alpha-papillomaviruses and considerably challenge phylogenetic reconstructions. We investigated the relationships of 29 beta-HPV (25 types plus four putative new types, subtypes, or variants) as inferred from codon aligned and amino acid sequence data of the genes E1, E2, E6, E7, L1, and L2 using likelihood, distance, and parsimony approaches. An analysis of a L1 fragment included additional nucleotide and amino acid sequences from seven non-human beta-papillomaviruses. Early genes and late genes evolution did not conflict significantly in beta-papillomaviruses based on partition homogeneity tests (p > or = 0.001). As inferred from the complete genome analyses, beta-papillomaviruses were monophyletic and segregated into four highly supported monophyletic assemblages corresponding to the species 1, 2, 3, and fused 4/5. They basically split into the species 1 and the remainder of beta-papillomaviruses, whose species 3, 4, and 5 constituted the sistergroup of species 2. beta-Papillomaviruses have been isolated from humans, apes, and monkeys, and phylogenetic analyses of the L1 fragment showed non-human papillomaviruses highly polyphyletic nesting within the HPV species. Thus, host and virus phylogenies were not congruent in beta-papillomaviruses, and multiple invasions across species borders may contribute (additionally to host-linked evolution) to their diversification.

  17. Complete nucleotide sequence of a new isolate of passion fruit woodiness virus from Western Australia.

    PubMed

    Fukumoto, Tomohiro; Nakamura, Masayuki; Wylie, Stephen J; Chiaki, Yuya; Iwai, Hisashi

    2013-08-01

    We determined the complete genome sequence of the passion fruit woodiness virus Gld-1 isolate (PWV-Gld-1) from Australia and compared it with that of PWV-MU-2, another Australian isolate of PWV. The genomes shared high sequence identity in both the complete nucleotide sequence and the ORF amino acid sequence. All of the cleavage sites of each protein were identical to those of MU-2, and the sequence identity for the individual proteins ranged from 97.2 % to 100.0 %. However, the 5' untranslated region (5'UTR) of the Gld-1 isolate shared only 46.8 % sequence identity with that of PWV-MU-2 and was 177 nucleotides shorter. Re-sequencing of the 5'UTR of MU-2 revealed that the 5' end of the original sequence includes an artifact generated by deep sequencing.

  18. Physiology of acetic acid bacteria in light of the genome sequence of Gluconobacter oxydans.

    PubMed

    Deppenmeier, Uwe; Ehrenreich, Armin

    2009-01-01

    Acetic acid bacteria are a distinct group of microorganisms within the family Acetobacteriaceae. They are characterized by their ability to incompletely oxidize a wide range of carbohydrates and alcohols. The great advantage of these reactions is that many substrates are regio- and stereoselectively oxidized. This feature is already exploited in several combined biotechnological-chemical procedures for the synthesis of sugar derivatives. Therefore, it is important to understand the basic concepts of this type of physiology to construct strains for improved or new oxidative fermentations. Based on the genome sequence of Gluconobacteroxydans, we will shed light on the central carbon metabolism, the composition of the respiratory chain and the analysis of uncharacterized oxidoreductases. In this context, the role of membrane-bound and -soluble dehydrogenases are of major importance in the process of incomplete oxidation. Other topics deal with the question of how these organisms generate energy and assimilate carbon. Furthermore, we will discuss how acetic acid bacteria thrive in their nutrient-rich environment and how they outcompete other microorganisms. Copyright (c) 2008 S. Karger AG, Basel.

  19. Lactic acid production from potato peel waste by anaerobic sequencing batch fermentation using undefined mixed culture.

    PubMed

    Liang, Shaobo; McDonald, Armando G; Coats, Erik R

    2015-11-01

    Lactic acid (LA) is a necessary industrial feedstock for producing the bioplastic, polylactic acid (PLA), which is currently produced by pure culture fermentation of food carbohydrates. This work presents an alternative to produce LA from potato peel waste (PPW) by anaerobic fermentation in a sequencing batch reactor (SBR) inoculated with undefined mixed culture from a municipal wastewater treatment plant. A statistical design of experiments approach was employed using set of 0.8L SBRs using gelatinized PPW at a solids content range from 30 to 50 g L(-1), solids retention time of 2-4 days for yield and productivity optimization. The maximum LA production yield of 0.25 g g(-1) PPW and highest productivity of 125 mg g(-1) d(-1) were achieved. A scale-up SBR trial using neat gelatinized PPW (at 80 g L(-1) solids content) at the 3 L scale was employed and the highest LA yield of 0.14 g g(-1) PPW and a productivity of 138 mg g(-1) d(-1) were achieved with a 1 d SRT. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. The Effect of Protic Acid Identity on the Structures of Complexes with Vinyl Chloride: Fourier Transform Microwave Spectroscopy and Molecular Structure of the Vinyl Chloride-Acetylene Complex

    NASA Astrophysics Data System (ADS)

    Leung, Helen O.; Marshall, Mark D.; Feng, Fan

    2013-06-01

    In all previous examples of complexes formed between protic acids and haloethylenes, we have observed similar modes of binding regardless of the specific identity of the acid, HF, HCl, or HCCH. Although details of the structures, such as hydrogen bond length and amount of deviation from linearity, do reflect the strength of the interaction and show clear correlations with the gas-phase acidity, the complexes of a given haloethylene with any of the acids have identical structural motifs. As part of a systematic study of the effects of chlorine substitution on intermolecular interactions of haloethylenes, we have studied the complexes of vinyl chloride with both HF and HCCH. The HF complex, reported last year, has a geometry with HF interacting across the double bond of vinyl chloride and forming a secondary interaction with the hydrogen cis to the chlorine atom. We have obtained the broadband, chirped-pulse and narrow band, Balle-Flygare Fourier transform microwave spectra of the vinyl chloride-HCCH complex. The spectra indicate that HCCH locates at one end of the vinyl chloride with the secondary interaction occurring with the geminal hydrogen atom.

  1. Spermatogenesis of the lizard Lacerta vivipara: histological studies and amino acid sequence of a protamine lacertine 1.

    PubMed

    Martinage, A; Depeiges, A; Wouters, D; Morel, L; Sautière, P

    1996-06-01

    The lizard Lacerta vivipara is a seasonal breeder with a well characterized reproductive cycle. An histological study of the lizard testis has been performed at different stages of spermatogenesis and the nuclear basic proteins content was assessed by electrophoretical analysis. Two protamines, lacertines 1 and 2, are present in spermatozoa in April and May. We have isolated lacertine1 and characterized a protamine with a mass of 4,963.7 Da. Amino acid sequence of this protamine (41 residues) was established from data provided by automated Edman degradation. It is characterized by a basic amino acid stretch in the N- and C-terminal regions and by a central part which only consists of 3 different intermingled amino acids. This protamine presents 62% homology with scylliorhinine Z3 from dog-fish Scylliorhinus caniculus and 58% homology with quail protamine. The reported lizard protamine sequence is the first reptilian protamine sequence available so far.

  2. Complete amino acid sequence of luffin-b, a ribosome-inactivating protein from sponge gourd (Luffa cylindrica) seeds.

    PubMed

    Islam, M R; Hirayama, H; Funatsu, G

    1991-01-01

    The complete amino acid sequence of luffin-b has been determined. All the twenty-seven tryptic peptides were isolated by reverse-phase HPLC from the tryptic digests of intact luffin-b and one of its CNBr fragments (CB4), and sequenced using the DABITC/PITC double coupling method. The overlap of these peptides was achieved by analyzing the CNBr fragments and their chymotryptic peptides. Luffin-b consists of 250 amino acid residues with a relative molecular mass of 27,275 Da. Investigation for glycosylation sites indicated that Asn at positions 2, 78, and 85 might carry sugars. Sequence comparison with luffin-a showed that amino acid substitution occurred in 55 positions. Luffin-b contains three glycosylation sites instead of the six sites in luffin-a, of which two were found to be conserved.

  3. Robust sequence alignment using evolutionary rates coupled with an amino acid substitution matrix.

    PubMed

    Ndhlovu, Andrew; Hazelhurst, Scott; Durand, Pierre M

    2015-08-14

    Selective pressures at the DNA level shape genes into profiles consisting of patterns of rapidly evolving sites and sites withstanding change. These profiles remain detectable even when protein sequences become extensively diverged. A common task in molecular biology is to infer functional, structural or evolutionary relationships by querying a database using an algorithm. However, problems arise when sequence similarity is low. This study presents an algorithm that uses the evolutionary rate at codon sites, the dN/dS (ω) parameter, coupled to a substitution matrix as an alignment metric for detecting distantly related proteins. The algorithm, called BLOSUM-FIRE couples a newer and improved version of the original FIRE (Functional Inference using Rates of Evolution) algorithm with an amino acid substitution matrix in a dynamic scoring function. The enigmatic hepatitis B virus X protein was used as a test case for BLOSUM-FIRE and its associated database EvoDB. The evolutionary rate based approach was coupled with a conventional BLOSUM substitution matrix. The two approaches are combined in a dynamic scoring function, which uses the selective pressure to score aligned residues. The dynamic scoring function is based on a coupled additive approach that scores aligned sites based on the level of conservation inferred from the ω values. Evaluation of the accuracy of this new implementation, BLOSUM-FIRE, using MAFFT alignment as reference alignments has shown that it is more accurate than its predecessor FIRE. Comparison of the alignment quality with widely used algorithms (MUSCLE, T-COFFEE, and CLUSTAL Omega) revealed that the BLOSUM-FIRE algorithm performs as well as conventional algorithms. Its main strength lies in that it provides greater potential for aligning divergent sequences and addresses the problem of low specificity inherent in the original FIRE algorithm. The utility of this algorithm is demonstrated using the Hepatitis B virus X (HBx) protein, a protein

  4. The amino acid sequence of the cytochrome c-554(547) from the chemolithotrophic bacterium Thiobacillus neapolitanus.

    PubMed Central

    Ambler, R P; Meyer, T E; Trudinger, P A; Kamen, M D

    1985-01-01

    An amino acid sequence is proposed for the cytochrome c-554(547) from the bacterium Thiobacillus neapolitanus N.C.I.B. 8539). It consists of a polypeptide chain of 91 residues, with a pair of haem-attachment cysteine residues at positions 15 and 18. There is similarity in sequence with each of the halves of the sequence of the dihaem cytochromes c4 and with a cytochrome c-554(548) from a halophilic strain of Paracoccus. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50127 (11 pages) at the British Library (Lending Division), Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1985) 225, 5. PMID:2988504

  5. Human Retroviruses and AIDS. A compilation and analysis of nucleic acid and amino acid sequences: I--II; III--V

    SciTech Connect

    Myers, G.; Korber, B.; Wain-Hobson, S.; Smith, R.F.; Pavlakis, G.N.

    1993-12-31

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.

  6. Origins of structural diversity within sequentially identical hexapeptides.

    PubMed Central

    Cohen, B. I.; Presnell, S. R.; Cohen, F. E.

    1993-01-01

    Efforts to predict protein secondary structure have been hampered by the apparent structural plasticity of local amino acid sequences. Kabsch and Sander (1984, Proc. Natl. Acad. Sci. USA 81, 1075-1078) articulated this problem by demonstrating that identical pentapeptide sequences can adopt distinct structures in different proteins. With the increased size of the protein structure database and the availability of new methods to characterize structural environments, we revisit this observation of structural plasticity. Within a set of proteins with less than 50% sequence identity, 59 pairs of identical hexapeptide sequences were identified. These local structures were compared and their surrounding structural environments examined. Within a protein structural class (alpha/alpha, beta/beta, alpha/beta, alpha + beta), the structural similarity of sequentially identical hexapeptides usually is preserved. This study finds eight pairs of identical hexapeptide sequences that adopt beta-strand structure in one protein and alpha-helical structure in the other. In none of the eight cases do the members of these sequences pairs come from proteins within the same folding class. These results have implications for class dependent secondary structure prediction algorithms. PMID:8298461

  7. Nucleic acid sequence of an internal image-bearing monoclonal anti-idiotype and its comparison to the sequence of the external antigen.

    PubMed Central

    Bruck, C; Co, M S; Slaoui, M; Gaulton, G N; Smith, T; Fields, B N; Mullins, J I; Greene, M I

    1986-01-01

    The monoclonal anti-idiotypic antibody (mAb2) 87.92.6 directed against the 9B.G5 antibody specific for the virus neutralizing epitope on the mammalian reovirus type 3 hemagglutinin was previously demonstrated to express an internal image of the receptor binding epitope of the reovirus type 3. Furthermore, this mAb2 has autoimmune reactivity to the cell surface receptor of the reovirus. The nucleotide and deduced amino acid sequences of the 87.92.6 mAb2 heavy and light chains are described in this report. The sequence analysis reveals that the same heavy chain variable and joining (VH and JH) gene segments are used by the 87.92.6 anti-idiotypic mAb2 and by the dominant idiotypes of the BALB/c anti-GAT (cGAT) and anti-NP (NPa) responses. [GAT; random polymer that is 60% glutamic acid, 30% alanine, and 10% tyrosine. NP; (4-hydroxy-3-nitrophenyl)-acetyl.] Despite extensive homology at the level of the heavy chain variable regions, the NPa positive BALB/c anti-NP monoclonal antibody 17.2.25 binds neither 9B.G5 nor the cellular receptor for the hemagglutinin. Amino acid sequence comparison between the viral hemagglutinin and the 87.92.6 mAb2 light chain "internal image," reveals an area of significant homology indicating that antigen mimicry by antibodies may be achieved by sharing primary structure. PMID:2428036

  8. Draft Genome Sequence of Bacillus subtilis subsp. natto Strain CGMCC 2108, a High Producer of Poly-γ-Glutamic Acid.

    PubMed

    Tan, Siyuan; Meng, Yonghong; Su, Anping; Zhang, Chen; Ren, Yuanyuan

    2016-05-26

    Here, we report the 4.1-Mb draft genome sequence of Bacillus subtilis subsp. natto strain CGMCC 2108, a high producer of poly-γ-glutamic acid (γ-PGA). This sequence will provide further help for the biosynthesis of γ-PGA and will greatly facilitate research efforts in metabolic engineering of B. subtilis subsp. natto strain CGMCC 2108. Copyright © 2016 Tan et al.

  9. Draft Genome Sequence of Escherichia coli O157:H7 ATCC 35150 and a Nalidixic Acid-Resistant Mutant Derivative

    PubMed Central

    Markell, James A.; Koziol, Adam G.

    2015-01-01

    Shiga toxin-producing Escherichia coli strains, occasionally isolated from food, are of public health importance. Here, we report on the 5.30-Mbp draft genome sequence of E. coli O157:H7 EDL931 (strain ATCC 35150) and the 5.32-Mbp draft genome sequence of a nalidixic acid-resistant mutant derivative used as a distinguishable control strain in food-testing laboratories. PMID:26205873

  10. Microwave-assisted acid and base hydrolysis of intact proteins containing disulfide bonds for protein sequence analysis by mass spectrometry.

    PubMed

    Reiz, Bela; Li, Liang

    2010-09-01

    Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein.

  11. Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

    NASA Astrophysics Data System (ADS)

    McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

    2016-05-01

    Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.

  12. A Monte Carlo sampling method of amino acid sequences adaptable to given main-chain atoms in the proteins.

    PubMed

    Ogata, Koji; Soejima, Kenji; Higo, Junichi

    2006-10-01

    We have developed a computational method of protein design to detect amino acid sequences that are adaptable to given main-chain coordinates of a protein. In this method, the selection of amino acid types employs a Metropolis Monte Carlo method with a scoring function in conjunction with the approximation of free energies computed from 3D structures. To compute the scoring function, a side-chain prediction using another Metropolis Monte Carlo method was performed to select structurally suitable side-chain conformations from a side-chain library. In total, two layers of Monte Carlo procedures were performed, first to select amino acid types (1st layer Monte Carlo) and then to predict side-chain conformations (2nd layers Monte Carlo). We applied this method to sequence design for the entire sequence on the SH3 domain, Protein G, and BPTI. The predicted sequences were similar to those of the wild-type proteins. We compared the results of the predictions with and without the 2nd layer Monte Carlo method. The results revealed that the two-layer Monte Carlo method produced better sequence similarity to the wild-type proteins than the one-layer method. Finally, we applied this method to neuraminidase of influenza virus. The results were consistent with the sequences identified from the isolated viruses.

  13. Synthesis of Bisdesmosidic Oleanolic Acid Saponins via a Glycosylation-Deprotection Sequence under Continuous Microfluidic/Batch Conditions.

    PubMed

    Konishi, Naruki; Shirahata, Tatsuya; Yokoyama, Masaki; Katsumi, Tatsuya; Ito, Yoshikazu; Hirata, Nozomu; Nishino, Takashi; Makino, Kazuishi; Sato, Noriko; Nagai, Takayuki; Kiyohara, Hiroaki; Yamada, Haruki; Kaji, Eisuke; Kobayashi, Yoshinori

    2017-07-07

    We report the first synthesis of a series of bisdesmosidic oleanolic acid saponins using microflow reactor Comet X-01 via a continuous flow glycosylation-batch deprotection sequence. The main results of this study can be summarized as follows: (1) The microfluidic glycosylation of oleanolic acid at C-28 was achieved in quantitative yield and was applied to the synthesis of six C-28-monoglycosidic saponins. (2) The microfluidic glycosylation of oleanolic acid at C-3 was achieved in good yield without orthoester byproduct formation and was applied to the synthesis of three bisdesmosidic saponins. (3) The continuous synthesis of saponins via a microfluidic glycosylation-batch deprotection sequence was achieved in four steps involving two purifications. Thus, the continuous microfluidic glycosylation-deprotection process is expected to be suitable for the preparation of a library of bisdesmosidic oleanolic acid saponins for in vivo pharmacological studies.

  14. Exome sequencing identifies de novo gain of function missense mutation in KCND2 in identical twins with autism and seizures that slows potassium channel inactivation.

    PubMed

    Lee, Hane; Lin, Meng-chin A; Kornblum, Harley I; Papazian, Diane M; Nelson, Stanley F

    2014-07-01

    Numerous studies and case reports show comorbidity of autism and epilepsy, suggesting some common molecular underpinnings of the two phenotypes. However, the relationship between the two, on the molecular level, remains unclear. Here, whole exome sequencing was performed on a family with identical twins affected with autism and severe, intractable seizures. A de novo variant was identified in the KCND2 gene, which encodes the Kv4.2 potassium channel. Kv4.2 is a major pore-forming subunit in somatodendritic subthreshold A-type potassium current (ISA) channels. The de novo mutation p.Val404Met is novel and occurs at a highly conserved residue within the C-terminal end of the transmembrane helix S6 region of the ion permeation pathway. Functional analysis revealed the likely pathogenicity of the variant in that the p.Val404Met mutant construct showed significantly slowed inactivation, either by itself or after equimolar coexpression with the wild-type Kv4.2 channel construct consistent with a dominant effect. Further, the effect of the mutation on closed-state inactivation was evident in the presence of auxiliary subunits that associate with Kv4 subunits to form ISA channels in vivo. Discovery of a functionally relevant novel de novo variant, coupled with physiological evidence that the mutant protein disrupts potassium current inactivation, strongly supports KCND2 as the causal gene for epilepsy in this family. Interaction of KCND2 with other genes implicated in autism and the role of KCND2 in synaptic plasticity provide suggestive evidence of an etiological role in autism.

  15. Effects of Acidic Peptide Size and Sequence on Trivalent Praseodymium Adduction and Electron Transfer Dissociation Mass Spectrometry.

    PubMed

    Commodore, Juliette J; Cassady, Carolyn J

    2017-02-07

    Using the lanthanide ion praseodymium, Pr(III), metallated ion formation and electron transfer dissociation (ETD) were studied for 25 biological and model acidic peptides. For chain lengths of seven or more residues, even highly acidic peptides that can be difficult to protonate by electrospray ionization will metallate and undergo abundant ETD fragmentation. Peptides composed of predominantly acidic residues form only the deprotonated ion, [M + Pr - H](2+) ; this ion yields near complete ETD sequence coverage for larger peptides. Peptides with a mixture of acidic and neutral residues, generate [M + Pr](3+) , which cleaves between every residue for many peptides. Acidic peptides that contain at least one residue with a basic side chain also produce the protonated ion, [M + Pr + H](4+) ; this ion undergoes the most extensive sequence coverage by ETD. Primarily metallated and non-metallated c- and z-ions form for all peptides investigated. Metal adducted product ions are only present when at least half of the peptide sequence can be incorporated into the ion; this suggests that the metal ion simultaneously attaches to more than one acidic site. The only site consistently lacking dissociation is at the N-terminal side of a proline residue. Increasing peptide chain length generates more backbone cleavage for metal-peptide complexes with the same charge state. For acidic peptides with the same length, increasing the precursor ion charge state from 2+ to 3+ also leads to more cleavage. The results of this study indicate that highly acidic peptides can be sequenced by ETD of complexes formed with Pr(III).

  16. Method for the detection of specific nucleic acid sequences by polymerase nucleotide incorporation

    DOEpatents

    Castro, Alonso

    2004-06-01

    A method for rapid and efficient detection of a target DNA or RNA sequence is provided. A primer having a 3'-hydroxyl group at one end and having a sequence of nucleotides sufficiently homologous with an identifying sequence of nucleotides in the target DNA is selected. The primer is hybridized to the identifying sequence of nucleotides on the DNA or RNA sequence and a reporter molecule is synthesized on the target sequence by progressively binding complementary nucleotides to the primer, where the complementary nucleotides include nucleotides labeled with a fluorophore. Fluorescence emitted by fluorophores on single reporter molecules is detected to identify the target DNA or RNA sequence.

  17. Characterization, Genome Sequence, and Analysis of Escherichia Phage CICC 80001, a Bacteriophage Infecting an Efficient L-Aspartic Acid Producing Escherichia coli.

    PubMed

    Xu, Youqiang; Ma, Yuyue; Yao, Su; Jiang, Zengyan; Pei, Jiangsen; Cheng, Chi

    2016-03-01

    Escherichia phage CICC 80001 was isolated from the bacteriophage contaminated medium of an Escherichia coli strain HY-05C (CICC 11022S) which could produce L-aspartic acid. The phage had a head diameter of 45-50 nm and a tail of about 10 nm. The one-step growth curve showed a latent period of 10 min and a rise period of about 20 min. The average burst size was about 198 phage particles per infected cell. Tests were conducted on the plaques, multiplicity of infection, and host range. The genome of CICC 80001 was sequenced with a length of 38,810 bp, and annotated. The key proteins leading to host-cell lysis were phylogenetically analyzed. One protein belonged to class II holin, and the other two belonged to the endopeptidase family and N-acetylmuramoyl-L-alanine amidase family, respectively. The genome showed the sequence identity of 82.7% with that of Enterobacteria phage T7, and carried ten unique open reading frames. The bacteriophage resistant E. coli strain designated CICC 11021S was breeding and its L-aspartase activity was 84.4% of that of CICC 11022S.

  18. Partial amino acid sequence of potato solanidine UDP-glucose glucosyltransferase purified by new anion-exchange and size exclusion media.

    PubMed

    Stapleton, A; Allen, P V; Tao, H P; Belknap, W R; Friedman, M

    1992-04-01

    Solanidine UDP-glucose glucosyltransferase (SGT) is involved in the biosynthesis of steroidal glycoalkaloids in potatoes. This enzyme is present at an extremely low level, is inherently unstable, and copurifies with the major storage protein patatin during isolation. We describe an improved method for isolating SGT from greening potato peel using two new chromatographic supports, Macro-Prep 50 Q anion-exchange and Superdex 75HR size exclusion media, under medium-pressure conditions at room temperature. The enzyme preparation was further resolved by SDS-PAGE and the proteins transferred to PVDF membrane (Immobilon-P). Two protein bands corresponding to active forms of SGT (36 and 37 kDa) were excised and cleaved with cyanogen bromide in trifluoroacetic acid. The resultant peptide mixtures were then separated by Tricine-SDS-PAGE and transferred to a PVDF membrane (Pro-Blott). The two major peptide bands observed in both digests (17 and 19 kDa) were sequenced. Identical N-terminal sequences were obtained from the 19-kDa peptides from both digests.

  19. Characterization of the 33-kilodalton major allergen of Penicillium citrinum by using MoAbs and N-terminal amino acid sequencing.

    PubMed

    Shen, H D; Lin, W L; Liaw, S F; Tam, M F; Han, S H

    1997-01-01

    The 33 kD component has been identified as a major allergen of Penicillium citrinum, the most prevalent Penicillium species in the Taipei area of Taiwan. This study analyses the isoforms, antigenic cross-reactivity and the N-terminal amino acid sequence of the 33 kD allergen of P. citrinum. The composition of isoforms and antigenic cross-reactivity was analysed by SDS-PAGE and 2D-immunoblotting using MoAbs generated. The N-terminal sequence was analysed by using an automatic gas/liquid phase sequencer. Two MoAbs (55A and 34H) against the 33 kD allergen were generated in the present study. In addition to the 33 kD component, MoAb 34H also showed immunoblot reactivity to other components in the crude extract of P. citrinum. Analysed by 2D-immunoblotting, at least six different isoforms of the 33 kD component with pI values ranging from 6.75 to greater than 7.0 were shown to be reactive to both MoAbs and IgE antibodies in serum of an asthmatic patient. Different immunoblot patterns were observed when both MoAbs were reacted with four different strains of P. citrinum used in the present study. Among another six different Penicillium and four different Aspergillus species tested, only an immunoblot reactivity of MoAb 55A to the 33 kD component of P. brevicompactum was observed. In 2D-immunoblotting, components of P. brevicompactum with an MW of about 33 kD and pI values similar to those of the 33 kD component of P. citrinum reacted with MoAb 55A and IgE antibodies in serum of the asthmatic patient. The N-terminal amino acid sequence of the 33 kD component of P. citrinum was determined to be ANVVQSNVP which was identical to the first 9 N-terminal amino acids of a heat-labile alkaline serine proteinase from P. citrinum. Results obtained in the present study suggest that the 33 kD major allergen of P. citrinum may be an alkaline serine proteinase.

  20. Genome Sequence of Sphingomonas wittichii DP58, the First Reported Phenazine-1-Carboxylic Acid-Degrading Strain

    PubMed Central

    Ma, Zhiwei; Shen, Xuemei; Wang, Wei; Peng, Huasong; Xu, Ping; Zhang, Xuehong

    2012-01-01

    Sphingomonas wittichii DP58 (CCTCC M 2012027), the first reported phenazine-1-carboxylic acid (PCA)-degrading strain, was isolated from pimiento rhizosphere soils. Here we present a 5.6-Mb assembly of its genome. This sequence would contribute to the elucidation of the molecular mechanism of PCA degradation to improve the antifungal's effectiveness or remove superfluous PCA. PMID:22689229