Science.gov

Sample records for acid sequence homologies

  1. Extensive amino acid sequence homologies between animal lectins

    SciTech Connect

    Paroutaud, P.; Levi, G.; Teichberg, V.I.; Strosberg, A.D.

    1987-09-01

    The authors have established the amino acid sequence of the ..beta..-D-galactoside binding lectin from the electric eel and the sequences of several peptides from a similar lectin isolated from human placenta. These sequences were compared with the published sequences of peptides derived from the ..beta..-D-galactoside binding lectin from human lung and with sequences deduced from cDNAs assigned to the ..beta..-D-galactoside binding lectins from chicken embryo skin and human hepatomas. Significant homologies were observed. One of the highly conserved regions that contains a tryptophan residue and two glutamic acid resides is probably part of the ..beta..-D-galactoside binding site, which, on the basis of spectroscopic studies of the electric eel lectin, is expected to contain such residues. The similarity of the hydropathy profiles and the predicted secondary structure of the lectins from chicken skin and electric eel, in spite of differences in their amino acid sequences, strongly suggests that these proteins have maintained structural homologies during evolution and together with the other ..beta..-D-galactoside binding lectins were derived form a common ancestor gene.

  2. Amino acid sequence of homologous rat atrial peptides: natriuretic activity of native and synthetic forms.

    PubMed Central

    Seidah, N G; Lazure, C; Chrétien, M; Thibault, G; Garcia, R; Cantin, M; Genest, J; Nutt, R F; Brady, S F; Lyle, T A

    1984-01-01

    A substance called atrial natriuretic factor (ANF), localized in secretory granules of atrial cardiocytes, was isolated as four homologous natriuretic peptides from homogenates of rat atria. The complete sequence of the longest form showed that it is composed of 33 amino acids. The three other shorter forms (2-33, 3-33, and 8-33) represent amino-terminally truncated versions of the 33 amino acid parent molecule as shown by analysis of sequence, amino acid composition, or both. The proposed primary structure agrees entirely with the amino acid composition and reveals no significant sequence homology with any known protein or segment of protein. The short form ANF-(8-33) was synthesized by a multi-fragment condensation approach and the synthetic product was shown to exhibit specific activity comparable to that of the natural ANF-(3-33). PMID:6232612

  3. Nucleotide sequences of the Pseudomonas savastanoi indoleacetic acid genes show homology with Agrobacterium tumefaciens T-DNA

    PubMed Central

    Yamada, Tetsuji; Palm, Curtis J.; Brooks, Bob; Kosuge, Tsune

    1985-01-01

    We report the nucleotide sequences of iaaM and iaaH, the genetic determinants for, respectively, tryptophan 2-monooxygenase and indoleacetamide hydrolase, the enzymes that catalyze the conversion of L-tryptophan to indoleacetic acid in the tumor-forming bacterium Pseudomonas syringae pv. savastanoi. The sequence analysis indicates that the iaaM locus contains an open reading frame encoding 557 amino acids that would comprise a protein with a molecular weight of 61,783; the iaaH locus contains an open reading frame of 455 amino acids that would comprise a protein with a molecular weight of 48,515. Significant amino acid sequence homology was found between the predicted sequence of the tryptophan monooxygenase of P. savastanoi and the deduced product of the T-DNA tms-1 gene of the octopine-type plasmid pTiA6NC from Agrobacterium tumefaciens. Strong homology was found in the 25 amino acid sequence in the putative FAD-binding region of tryptophan monooxygenase. Homology was also found in the amino acid sequences representing the central regions of the putative products of iaaH and tms-2 T-DNA. The results suggest a strong similarity in the pathways for indoleacetic acid synthesis encoded by genes in P. savastanoi and in A. tumefaciens T-DNA. Images PMID:16593610

  4. Amino acid sequence homology between rat and human C-reactive protein.

    PubMed Central

    Taylor, J A; Bruton, C J; Anderson, J K; Mole, J E; De Beer, F C; Baltz, M L; Pepys, M B

    1984-01-01

    The rat serum protein that undergoes Ca2+-dependent binding to pneumococcal C-polysaccharide and to phosphocholine residues, and that is evidently a member of the pentraxin family of proteins by virtue of its appearance under the electron microscope, has been variously designated as rat C-reactive protein (CRP) [de Beer, Baltz, Munn, Feinstein, Taylor, Bruton, Clamp & Pepys (1982) Immunology 45, 55-70], 'phosphoryl choline-binding protein' [Nagpurkar & Mookerjea (1981) J. Biol. Chem. 256, 7440-7448] and rat serum amyloid P component (SAP) [Pontet, D'Asnieres, Gache, Escaig & Engler (1981) Biochim. Biophys. Acta 671, 202-210]. The partial amino acid sequence (45 residues) towards the C-terminus of this protein was determined, and it showed 71.7% identity with the known sequence of human CRP but only 54.3% identity with human SAP. Since human CRP and SAP are themselves approximately 50% homologous, the level of identity between the rat protein and human SAP is evidence only of membership of the pentraxin family. In contrast, the much greater resemblance to human CRP confirms that the rat C-polysaccharide-binding/phosphocholine-binding protein is in fact rat CRP. PMID:6477504

  5. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein

    PubMed Central

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer. PMID:24147211

  6. Amino Acid Sequences Mediating Vascular Cell Adhesion Molecule 1 Binding to Integrin Alpha 4: Homologous DSP Sequence Found for JC Polyoma VP1 Coat Protein.

    PubMed

    Meyer, Michael Andrew

    2013-01-01

    The JC polyoma viral coat protein VP1 was analyzed for amino acid sequences homologies to the IDSP sequence which mediates binding of VLA-4 (integrin alpha 4) to vascular cell adhesion molecule 1. Although the full sequence was not found, a DSP sequence was located near the critical arginine residue linked to infectivity of the virus and binding to sialic acid containing molecules such as integrins (3). For the JC polyoma virus, a DSP sequence was found at residues 70, 71 and 72 with homology also noted for the mouse polyoma virus and SV40 virus. Three dimensional modeling of the VP1 molecule suggests that the DSP loop has an accessible site for interaction from the external side of the assembled viral capsid pentamer.

  7. Establishing homologies in protein sequences

    NASA Technical Reports Server (NTRS)

    Dayhoff, M. O.; Barker, W. C.; Hunt, L. T.

    1983-01-01

    Computer-based statistical techniques used to determine homologies between proteins occurring in different species are reviewed. The technique is based on comparison of two protein sequences, either by relating all segments of a given length in one sequence to all segments of the second or by finding the best alignment of the two sequences. Approaches discussed include selection using printed tabulations, identification of very similar sequences, and computer searches of a database. The use of the SEARCH, RELATE, and ALIGN programs (Dayhoff, 1979) is explained; sample data are presented in graphs, diagrams, and tables and the construction of scoring matrices is considered.

  8. Complete amino acid sequence of BSP-A3 from bovine seminal plasma. Homology to PDC-109 and to the collagen-binding domain of fibronectin.

    PubMed Central

    Seidah, N G; Manjunath, P; Rochemont, J; Sairam, M R; Chrétien, M

    1987-01-01

    Bovine seminal plasma was shown to contain three similar proteins, called BSP-A1, BSP-A2 and BSP-A3. Both BSP-A1 and BSP-A2 were shown to be molecular variants of a recently characterized peptide called PDC-109. They seem to differ only in their degree of glycosylation and otherwise seem to possess an identical amino acid composition. The work in the present paper deals with the complete characterization of the third member of this series, namely BSP-A3. The complete amino acid sequence revealed that it is composed of 115 amino acids and predicts a Mr of 13,403. An analysis of the primary structure of BSP-A3 revealed a high degree of internal homology, with two homologous domains composed of 39 (residues 28-66) and 43 (residues 73-115) amino acids. An exhaustive computer-bank search for the similarity of this sequence to any known protein, or segment thereof, revealed two significant homologies. The first is between PDC-109 and BSP-A3, which is so high that we can confidently predict that both proteins evolved from a single ancestral gene. The collagen-binding domain of bovine fibronectin (type II sequence) was also found to be highly homologous to both BSP-A3 and PDC-109. PMID:3606570

  9. Cloning and sequencing of the Bet v 1-homologous allergen Fra a 1 in strawberry (Fragaria ananassa) shows the presence of an intron and little variability in amino acid sequence.

    PubMed

    Musidlowska-Persson, Anna; Alm, Rikard; Emanuelsson, Cecilia

    2007-02-01

    The Fra a 1 allergen in strawberry (Fragaria ananassa) is homologous to the major birch pollen allergen Bet v 1, which has numerous isoforms differing in terms of amino acid sequence and immunological impact. To map the extent of sequence differences in the Fra a 1 allergen, PCR cloning and sequencing was applied. Several genomic sequences of Fra a 1, with a length of either 584, 591 or 594 nucleotides, were obtained from three different strawberry varieties. All contained one intron, with the length of either 101 or 110 nucleotides. By sequencing 30 different clones, eight different DNA sequences were obtained, giving in total five potential Fra a 1 protein isoforms, with high sequence similarity (>97% sequence identity) and only seven positions of amino acid variability, which were largely confirmed by mass spectrometry of expressed proteins. We conclude that the sequence variability in the strawberry allergen Fra a 1 is small, within and between strawberry varieties, and that multiple spots, previously detected in 2DE, are presumably due to differences in post-translational modification rather than differences in amino acid sequence. The most abundant Fra a 1 isoform sequence, recombinantly expressed in Escherichia coli after removal of the intron, was recognized by IgE from strawberry allergic patients. It cross-reacted with antibodies to Bet v 1 and the homologous apple allergen Mal d 1 (61 and 78% sequence identity, respectively), and will be used in further analyses of variation in Fra a 1-expression.

  10. [Effects of human and rat interferons-alpha on the behavior of rats of different ages. Comparative study of the homology of amino acid sequences].

    PubMed

    Loseva, E V; Loginova, N A; Nekliudov, V V; Mats, V N; Kurskaia, O V; Pasikova, N V

    2009-01-01

    Effects of chronic intranasal administration of human and rat interferons alpha on feeding and defensive behavior of rats were studied. Natural leukocyte human interferon "Lokferon" (a mixture of alpha interferon subtypes) and recombinant rat interferon alpha of the first subtype were used in the dose of 350 ME per rat daily. In addition, using the databases NCBI and EBI, we quantitatively estimated homology of amino-acid sequences between different subtypes of human and rat interferons. Both human (mostly in young rats) and rat interferons (mostly in old rats) increased rat feeding behavior after food conditioning to an audio tone. In old (but not in young) rats, both human and rat interferons worsened the ability of time interval assessment. In young (but not old) rats, both interferon kinds improved avoidance conditioning. The degree of homology between different human and rat interferons varied from 72% to 77%. Thus, generally, the effects of rat and human alpha interferons (350 ME) on rat conditioning were similar. This may be due to high degree of homology of amino-acid sequences between the two interferons.

  11. DNA Sequence Alignment during Homologous Recombination.

    PubMed

    Greene, Eric C

    2016-05-27

    Homologous recombination allows for the regulated exchange of genetic information between two different DNA molecules of identical or nearly identical sequence composition, and is a major pathway for the repair of double-stranded DNA breaks. A key facet of homologous recombination is the ability of recombination proteins to perfectly align the damaged DNA with homologous sequence located elsewhere in the genome. This reaction is referred to as the homology search and is akin to the target searches conducted by many different DNA-binding proteins. Here I briefly highlight early investigations into the homology search mechanism, and then describe more recent research. Based on these studies, I summarize a model that includes a combination of intersegmental transfer, short-distance one-dimensional sliding, and length-specific microhomology recognition to efficiently align DNA sequences during the homology search. I also suggest some future directions to help further our understanding of the homology search. Where appropriate, I direct the reader to other recent reviews describing various issues related to homologous recombination.

  12. Homology of the NH2-terminal amino acid sequences of the heavy and light chains of human monoclonal lupus autoantibodies containing the dominant 16/6 idiotype.

    PubMed Central

    Atkinson, P M; Lampman, G W; Furie, B C; Naparstek, Y; Schwartz, R S; Stollar, B D; Furie, B

    1985-01-01

    The NH2-terminal amino acid sequences have been determined by automated Edman degradation for the heavy and light chains of five monoclonal IgM anti-DNA autoantibodies that were produced by human-human hybridomas derived from lymphocytes of two patients with systemic lupus erythematosus. Four of the antibodies were closely related to the idiotype system 16/6, whereas the fifth antibody was unrelated idiotypically. The light chains of the 16/6 idiotype-positive autoantibodies (HF2-1/13b, HF2-1/17, HF2-18/2, and HF3-16/6) had identical amino acid sequences from residues 1 to 40. Their framework structures were characteristic of VKI light chains. The light chain of the 16/6 idiotype-negative autoantibody HF6-21/28 was characteristic of the VKII subgroup. The heavy chains of the 16/6 idiotype-positive autoantibodies had nearly identical amino acid sequences from residues 1 to 40. The framework structures were characteristic of the VHIII subgroup. In contrast, the GM4672 fusion partner of the hybridoma produced small quantities of an IgG with a VHI heavy chain and a VKI light chain. The heavy chains of the lupus autoantibodies and the light chains of those autoantibodies that were idiotypically related to the 16/6 system had marked sequence homology with WEA, a Waldenstrom IgM that binds to Klebsiella polysaccharides and expresses the 16/6 idiotype. These results indicate a striking homology in the amino termini of the heavy and light chains of the lupus autoantibodies studied and suggest that the V regions of the heavy and light chains of the 16/6 idiotype-positive DNA-binding lupus auto-antibodies are each encoded by a single germ line gene. PMID:3921567

  13. Text mining of DNA sequence homology searches.

    PubMed

    McCallum, John; Ganesh, Siva

    2003-01-01

    Primary tasks in analysis and annotation of expressed sequence tag (EST) datasets are to identify similarity among sequences by unsupervised clustering and assign putative function based on BLAST homology searches. We investigated the usefulness of text mining as a simple approach for further higher-level clustering of EST datasets using IBM Intelligent Miner for Text v2.3 tools. Agglomerative and k-means clustering tools were used to cluster BLASTx homology search documents from two onion EST datasets and optimised by pre-processing and pruning. Subjective evaluation confirmed that these tools provided biologically useful and complementary views of the two libraries, provided new insights into their composition and revealed clusters previously identified by human experts. We compared BLASTx textual clusters for two gene families with their DNA sequence-based clusters and confirmed that these shared similar morphology.

  14. Sequence context-specific profiles for homology searching

    PubMed Central

    Biegert, A.; Söding, J.

    2009-01-01

    Sequence alignment and database searching are essential tools in biology because a protein's function can often be inferred from homologous proteins. Standard sequence comparison methods use substitution matrices to find the alignment with the best sum of similarity scores between aligned residues. These similarity scores do not take the local sequence context into account. Here, we present an approach that derives context-specific amino acid similarities from short windows centered on each query sequence residue. Our results demonstrate that the sequence context contains much more information about the expected mutations than just the residue itself. By employing our context-specific similarities (CS-BLAST) in combination with NCBI BLAST, we increase the sensitivity more than 2-fold on a difficult benchmark set, without loss of speed. Alignment quality is likewise improved significantly. Furthermore, we demonstrate considerable improvements when applying this paradigm to sequence profiles: Two iterations of CSI-BLAST, our context-specific version of PSI-BLAST, are more sensitive than 5 iterations of PSI-BLAST. The paradigm for biological sequence comparison presented here is very general. It can replace substitution matrices in sequence- and profile-based alignment and search methods for both protein and nucleotide sequences. PMID:19234132

  15. Rat androgen-binding protein: evidence for identical subunits and amino acid sequence homology with human sex hormone-binding globulin.

    PubMed

    Joseph, D R; Hall, S H; French, F S

    1987-01-01

    The cDNA for rat androgen-binding protein (ABP) was previously isolated from a bacteriophage lambda gt11 rat testis cDNA library and its identity was confirmed by epitope selection. Hybrid-arrested translation studies have now demonstrated the identity of the isolates. The nucleotide sequence of a near full-length cDNA encodes a 403-amino acid precursor (Mr = 44,539), which agrees in size with the cell-free translation product (Mr = 45,000) of ABP mRNA. Putative sites of N-glycosylation and signal peptide cleavage were identified. Comparison of the predicted amino acid sequence of rat ABP with the amino-terminal amino acid sequence of human sex hormone-binding globulin revealed that 17 of 25 residues are identical. On the basis of the predicted amino acid sequence the molecular weight of the primary translation product, lacking the signal peptide, was 41,183. Hybridization analyses indicated that the two subunits of ABP are coded for by a single gene and a single mRNA species. Our results suggest that ABP consists of two subunits with identical primary sequences and that differences in post-translational processing result in the production of 47,000 and 41,000 molecular weight monomers.

  16. Should nucleotide sequence analyzing computer algorithms always extend homologies by extending homologies?

    PubMed

    Burnett, L; Basten, A; Hensley, W J

    1986-01-10

    Most computer algorithms used for comparing or aligning nucleotide sequences rely on the premise that the best way to extend a homology between the two sequences is to select a match rather than a mismatch. We have tested this assumption and found that it is not always valid.

  17. FAB overlapping: a strategy for sequencing homologous proteins

    NASA Astrophysics Data System (ADS)

    Ferranti, P.; Malorni, A.; Marino, G.; Pucci, P.; di Luccia, A.; Ferrara, L.

    1991-12-01

    Extensive similarity has been shown to exist between the primary structures of closely related proteins from different species, the only differences being restricted to a few amino acid variations. A new mass spectrometric procedure, which has been called FAB-overlapping, has been developed for sequencing highly homologous proteins based on the detection of these small differences as compared with a known protein used as a reference. Several complementary peptide maps are constructed using fast atom bombardment mass spectrometry (FAB-MS) analysis of different proteolytic digests of the unknown protein and the mass values are related to those expected on the basis of the sequence of the reference protein. The mass signals exhibiting unusual mass values identify those regions where variations have taken place; fine location of the mutations can be obtained by coupling simple protein chemistry methodologies with FAB-MS. Using the FAB-overlapping procedure, it was possible to determine the sequence of [alpha]1, [alpha]3 and [beta] globins from water buffalo (Bubalus bubalis hemoglobins (phenotype AA). Two amino acid substitutions were detected in the buffalo [beta] chain (Lys16 --> His and Asn118 --> His) whereas the [alpha]1 chains were found the [alpha]1 and [alpha]3 chains were found to contain four amino acid replacements, three of which were identical (Glu23 --> Asp, Glu71 --> Gly, Phe117 --> Cys), and the insertion of an alanine residue in position 124. The only differences between [alpha]1 and [alpha]3 globins were identified in the C -terminal region; [alpha]1 contains a Phe residue at position 130 whereas [alpha]3 shows serine at position 132.

  18. Biochemical characterization of NfsA, the Escherichia coli major nitroreductase exhibiting a high amino acid sequence homology to Frp, a Vibrio harveyi flavin oxidoreductase.

    PubMed Central

    Zenno, S; Koike, H; Kumar, A N; Jayaraman, R; Tanokura, M; Saigo, K

    1996-01-01

    We identified the nfsA gene, encoding the major oxygen-insensitive nitroreductase in Escherichia coli, and determined its position on the E. coli map to be 19 min. We also purified its gene product, NfsA, to homogeneity. It was suggested that NfsA is a nonglobular protein with a molecular weight of 26,799 and is associated tightly with a flavin mononucleotide. Its amino acid sequence is highly similar to that of Frp, a flavin oxidoreductase from Vibrio harveyi (B. Lei, M. Liu, S. Huang, and S.-C. Tu, J. Bacteriol. 176:3552-3558, 1994), an observation supporting the notion that E. coli nitroreductase and luminescent-bacterium flavin reductase families are intimately related in evolution. Although no appreciable sequence similarity was detected between two E. coli nitroreductases, NfsA and NfsB, NfsA exhibited a low level of the flavin reductase activity and a broad electron acceptor specificity similar to those of NfsB. NfsA reduced nitrofurazone by a ping-pong Bi-Bi mechanism possibly to generate a two-electron transfer product. PMID:8755878

  19. Sequence of the cDNA encoding an actin homolog in the crayfish Procambarus clarkii.

    PubMed

    Kang, W K; Naya, Y

    1993-11-15

    A cDNA library was constructed by using mRNAs purified from crayfish (Procambarus clarkii) muscle. Using a homology search of the nucleotide (nt) sequences, a clone of the library was found to encode a protein homologous to actin (Act). The insert fragment of this cDNA clone was 1072 nt in length. The amino acid sequence deduced from the nt sequence showed significant similarity to Act of various organisms as follows: 88.1% to Drosophila melanogaster, 88.2% to silk worm, 87.3% to brine shrimp, 86.3% to rat, and 86.3% to human (% identity).

  20. Homology and the optimization of DNA sequence data

    NASA Technical Reports Server (NTRS)

    Wheeler, W.

    2001-01-01

    Three methods of nucleotide character analysis are discussed. Their implications for molecular sequence homology and phylogenetic analysis are compared. The criterion of inter-data set congruence, both character based and topological, are applied to two data sets to elucidate and potentially discriminate among these parsimony-based ideas. c2001 The Willi Hennig Society.

  1. Homology and the optimization of DNA sequence data.

    PubMed

    Wheeler, W

    2001-03-01

    Three methods of nucleotide character analysis are discussed. Their implications for molecular sequence homology and phylogenetic analysis are compared. The criterion of inter-data set congruence, both character based and topological, are applied to two data sets to elucidate and potentially discriminate among these parsimony-based ideas.

  2. Isolation and identification by sequence homology of a putative cytosine methyltransferase from Arabidopsis thaliana.

    PubMed Central

    Finnegan, E J; Dennis, E S

    1993-01-01

    A plant cytosine methyltransferase cDNA was isolated using degenerate oligonucleotides, based on homology between prokaryote and mouse methyltransferases, and PCR to amplify a short fragment of a methyltransferase gene. A fragment of the predicted size was amplified from genomic DNA from Arabidopsis thaliana. Overlapping cDNA clones, some with homology to the PCR amplified fragment, were identified and sequenced. The assembled nucleic acid sequence is 4720 bp and encodes a protein of 1534 amino acids which has significant homology to prokaryote and mammalian cytosine methyltransferases. Like mammalian methylases, this enzyme has a C terminal methyltransferase domain linked to a second larger domain. The Arabidopsis methylase has eight of the ten conserved sequence motifs found in prokaryote cytosine-5 methyltransferases and shows 50% homology to the murine enzyme in the methyltransferase domain. The amino terminal domain is only 24% homologous to the murine enzyme and lacks the zinc binding region that has been found in methyltransferases from both mouse and man. In contrast to mouse where a single methyltransferase gene has been identified, a small multigene family with homology to the region amplified in PCR has been identified in Arabidopsis thaliana. Images PMID:8389441

  3. Heterozygous genome assembly via binary classification of homologous sequence

    PubMed Central

    2015-01-01

    Background Genome assemblers to date have predominantly targeted haploid reference reconstruction from homozygous data. When applied to diploid genome assembly, these assemblers perform poorly, owing to the violation of assumptions during both the contigging and scaffolding phases. Effective tools to overcome these problems are in growing demand. Increasing parameter stringency during contigging is an effective solution to obtaining haplotype-specific contigs; however, effective algorithms for scaffolding such contigs are lacking. Methods We present a stand-alone scaffolding algorithm, ScaffoldScaffolder, designed specifically for scaffolding diploid genomes. The algorithm identifies homologous sequences as found in "bubble" structures in scaffold graphs. Machine learning classification is used to then classify sequences in partial bubbles as homologous or non-homologous sequences prior to reconstructing haplotype-specific scaffolds. We define four new metrics for assessing diploid scaffolding accuracy: contig sequencing depth, contig homogeneity, phase group homogeneity, and heterogeneity between phase groups. Results We demonstrate the viability of using bubbles to identify heterozygous homologous contigs, which we term homolotigs. We show that machine learning classification trained on these homolotig pairs can be used effectively for identifying homologous sequences elsewhere in the data with high precision (assuming error-free reads). Conclusion More work is required to comparatively analyze this approach on real data with various parameters and classifiers against other diploid genome assembly methods. However, the initial results of ScaffoldScaffolder supply validity to the idea of employing machine learning in the difficult task of diploid genome assembly. Software is available at http://bioresearch.byu.edu/scaffoldscaffolder. PMID:25952609

  4. An expert system for processing sequence homology data

    SciTech Connect

    Sonnhammer, E.L.L.; Durbin, R.

    1994-12-31

    When confronted with the task of finding homology to large numbers of sequences, database searching tools such as Blast and Fasta generate prohibitively large amounts of information. An automatic way of making most of the decisions a trained sequence analyst would make was developed by means of a rule-based expert system combined with an algorithm to avoid non-informative biased residue composition matches. The results found relevant by the system are presented in a very concise and clear way, so that the homology can be assessed with minimum effort. The expert system, HSPcrunch, was implemented to process the output of the programs in the BLAST suite. HSPcrunch embodies rules on detecting distant similarities when pairs of weak matches are consistent with a larger gaped alignment, i.e. when Blast has broken a longer gaped alignment up into smaller ungaped ones. This way, more distant similarities can be detected with no or little side-effects of more spurious matches. The rules for how small the gaps must be to be considered significant have been derived empirically. Currently a set of rules are used that operate on two different scoring levels, one for very weak matches that have very small gaps and one for medium weak matches that have slightly larger gaps. This set of rules proved to be robust for most cases and gives high fidelity separation between real homologies and spurious matches, One of the most important rules for reducing the amount of output is to limit the number of overlapping matches to the same region of the query sequence. This way, a region with many high-scoring matches will not dominate the output and hide weaker but relevant matches to other regions. This is particularly valuable for multi-domain queries.

  5. Chloroplast DNA Sequence Homologies among Vascular Plants 1

    PubMed Central

    Lamppa, Gayle K.; Bendich, Arnold J.

    1979-01-01

    The extent of sequence conservation in the chloroplast genome of higher plants has been investigated. Supercoiled chloroplast DNA, prepared from pea seedlings, was labeled in vitro and used as a probe in reassociation experiments with a high concentration of total DNAs extracted from several angiosperms, gymnosperms, and lower vascular plants. In each case the probe reassociation was accelerated, demonstrating that some chloroplast sequences have been highly conserved throughout the evolution of vascular plants. Only among the flowering plants were distinct levels of cross-reaction with the pea chloroplast probe evident; broad bean and barley exhibited the highest and lowest levels, respectively. With the hydroxylapatite assay these levels decreased with a decrease in probe fragment length (from 1,860 to 735 bases), indicating that many conserved sequences in the chloroplast genome are separated by divergent sequences on a rather fine scale. Despite differences observed in levels of homology with the hydroxylapatite assay, S1 nuclease analysis of heteroduplexes showed that outside of the pea family the extent of sequence relatedness between the probe and various heterologous DNAs is approximately the same: 30%. In our interpretation, the fundamental changes in the chloroplast genome during angiosperm evolution involved the rearrangement of this 30% with respect to the more rapidly changing sequences of the genome. These rearrangements may have been more extensive in dicotyledons than in monocotyledons. We have estimated the amount of conserved and divergent DNA interspersed between one another. From the reassociation experiments, determinations were made of the percentage of chloroplast DNA in total DNA extracts from different higher plants; this value remained relatively constant when compared with the large variation in the diploid genome size of the plants. PMID:16660786

  6. Los Alamos sequence analysis package for nucleic acids and proteins.

    PubMed Central

    Kanehisa, M I

    1982-01-01

    An interactive system for computer analysis of nucleic acid and protein sequences has been developed for the Los Alamos DNA Sequence Database. It provides a convenient way to search or verify various sequence features, e.g., restriction enzyme sites, protein coding frames, and properties of coded proteins. Further, the comprehensive analysis package on a large-scale database can be used for comparative studies on sequence and structural homologies in order to find unnoted information stored in nucleic acid sequences. PMID:6174934

  7. Prediction of Functional Class of Proteins and Peptides Irrespective of Sequence Homology by Support Vector Machines

    PubMed Central

    Tang, Zhi Qun; Lin, Hong Huang; Zhang, Hai Lei; Han, Lian Yi; Chen, Xin; Chen, Yu Zong

    2007-01-01

    Various computational methods have been used for the prediction of protein and peptide function based on their sequences. A particular challenge is to derive functional properties from sequences that show low or no homology to proteins of known function. Recently, a machine learning method, support vector machines (SVM), have been explored for predicting functional class of proteins and peptides from amino acid sequence derived properties independent of sequence similarity, which have shown promising potential for a wide spectrum of protein and peptide classes including some of the low- and non-homologous proteins. This method can thus be explored as a potential tool to complement alignment-based, clustering-based, and structure-based methods for predicting protein function. This article reviews the strategies, current progresses, and underlying difficulties in using SVM for predicting the functional class of proteins. The relevant software and web-servers are described. The reported prediction performances in the application of these methods are also presented. PMID:20066123

  8. Sequence analysis and characterization of a 40-kilodalton Borrelia hermsii glycerophosphodiester phosphodiesterase homolog.

    PubMed Central

    Shang, E S; Skare, J T; Erdjument-Bromage, H; Blanco, D R; Tempst, P; Miller, J N; Lovett, M A

    1997-01-01

    We report the purification, molecular cloning, and characterization of a 40-kDa glycerophosphodiester phosphodiesterase homolog from Borrelia hermsii. The 40-kDa protein was solubilized from whole organisms with 0.1% Triton X-100, phase partitioned into the Triton X-114 detergent phase, and purified by fast-performance liquid chromatography (FPLC). The gene encoding the 40-kDa protein was cloned from a B. hermsii chromosomal DNA lambda EXlox expression library and identified by using affinity antibodies generated against the purified native protein. The deduced amino acid sequence included a 20-amino-acid signal peptide encoding a putative leader peptidase II cleavage site, indicating that the 40-kDa protein was a lipoprotein. Based on significant homology (31 to 52% identity) of the 40-kDa protein to glycerophosphodiester phosphodiesterases of Escherichia coli (GlpQ), Bacillus subtilis (GlpQ), and Haemophilus influenzae (Hpd; protein D), we have designated this B. hermsii 40-kDa lipoprotein a glycerophosphodiester phosphodiesterase (Gpd) homolog, the first B. hermsii lipoprotein to have a putative functional assignment. A nonlipidated form of the Gpd homolog was overproduced as a fusion protein in E. coli BL21(DE3)(pLysE) and was used to immunize rabbits to generate specific antiserum. Immunoblot analysis with anti-Gpd serum recognized recombinant H. influenzae protein D, and conversely, antiserum to H. influenzae protein D recognized recombinant B. hermsii Gpd (rGpd), indicating antigenic conservation between these proteins. Antiserum to rGpd also identified native Gpd as a constituent of purified outer membrane vesicles prepared from B. hermsii. Screening of other pathogenic spirochetes with anti-rGpd serum revealed the presence of antigenically related proteins in Borrelia burgdorferi, Treponema pallidum, and Leptospira kirschneri. Further sequence analysis both upstream and downstream of the Gpd homolog showed additional homologs of glycerol metabolism

  9. Sequence analysis and characterization of a 40-kilodalton Borrelia hermsii glycerophosphodiester phosphodiesterase homolog.

    PubMed

    Shang, E S; Skare, J T; Erdjument-Bromage, H; Blanco, D R; Tempst, P; Miller, J N; Lovett, M A

    1997-04-01

    We report the purification, molecular cloning, and characterization of a 40-kDa glycerophosphodiester phosphodiesterase homolog from Borrelia hermsii. The 40-kDa protein was solubilized from whole organisms with 0.1% Triton X-100, phase partitioned into the Triton X-114 detergent phase, and purified by fast-performance liquid chromatography (FPLC). The gene encoding the 40-kDa protein was cloned from a B. hermsii chromosomal DNA lambda EXlox expression library and identified by using affinity antibodies generated against the purified native protein. The deduced amino acid sequence included a 20-amino-acid signal peptide encoding a putative leader peptidase II cleavage site, indicating that the 40-kDa protein was a lipoprotein. Based on significant homology (31 to 52% identity) of the 40-kDa protein to glycerophosphodiester phosphodiesterases of Escherichia coli (GlpQ), Bacillus subtilis (GlpQ), and Haemophilus influenzae (Hpd; protein D), we have designated this B. hermsii 40-kDa lipoprotein a glycerophosphodiester phosphodiesterase (Gpd) homolog, the first B. hermsii lipoprotein to have a putative functional assignment. A nonlipidated form of the Gpd homolog was overproduced as a fusion protein in E. coli BL21(DE3)(pLysE) and was used to immunize rabbits to generate specific antiserum. Immunoblot analysis with anti-Gpd serum recognized recombinant H. influenzae protein D, and conversely, antiserum to H. influenzae protein D recognized recombinant B. hermsii Gpd (rGpd), indicating antigenic conservation between these proteins. Antiserum to rGpd also identified native Gpd as a constituent of purified outer membrane vesicles prepared from B. hermsii. Screening of other pathogenic spirochetes with anti-rGpd serum revealed the presence of antigenically related proteins in Borrelia burgdorferi, Treponema pallidum, and Leptospira kirschneri. Further sequence analysis both upstream and downstream of the Gpd homolog showed additional homologs of glycerol metabolism

  10. Phylogenetic analysis of sequences from diverse bacteria with homology to the Escherichia coli rho gene.

    PubMed Central

    Opperman, T; Richardson, J P

    1994-01-01

    Genes from Pseudomonas fluorescens, Chromatium vinosum, Micrococcus luteus, Deinococcus radiodurans, and Thermotoga maritima with homology to the Escherichia coli rho gene were cloned and sequenced, and their sequences were compared with other available sequences. The species for all of the compared sequences are members of five bacterial phyla, including Thermotogales, the most deeply diverged phylum. This suggests that a rho-like gene is ubiquitous in the Bacteria and was present in their common ancestor. The comparative analysis revealed that the Rho homologs are highly conserved, exhibiting a minimum identity of 50% of their amino acid residues in pairwise comparisons. The ATP-binding domain had a particularly high degree of conservation, consisting of some blocks with sequences of residues that are very similar to segments of the alpha and beta subunits of F1-ATPase and of other blocks with sequences that are unique to Rho. The RNA-binding domain is more diverged than the ATP-binding domain. However, one of its most highly conserved segments includes a RNP1-like sequence, which is known to be involved in RNA binding. Overall, the degree of similarity is lowest in the first 50 residues (the first half of the RNA-binding domain), in the putative connector region between the RNA-binding and the ATP-binding domains, and in the last 50 residues of the polypeptide. Since functionally defective mutants for E. coli Rho exist in all three of these segments, they represent important parts of Rho that have undergone adaptive evolution. PMID:8051015

  11. Using Amino Acid Physicochemical Distance Transformation for Fast Protein Remote Homology Detection

    PubMed Central

    Liu, Bin; Wang, Xiaolong; Chen, Qingcai; Dong, Qiwen; Lan, Xun

    2012-01-01

    Protein remote homology detection is one of the most important problems in bioinformatics. Discriminative methods such as support vector machines (SVM) have shown superior performance. However, the performance of SVM-based methods depends on the vector representations of the protein sequences. Prior works have demonstrated that sequence-order effects are relevant for discrimination, but little work has explored how to incorporate the sequence-order information along with the amino acid physicochemical properties into the prediction. In order to incorporate the sequence-order effects into the protein remote homology detection, the physicochemical distance transformation (PDT) method is proposed. Each protein sequence is converted into a series of numbers by using the physicochemical property scores in the amino acid index (AAIndex), and then the sequence is converted into a fixed length vector by PDT. The sequence-order information can be efficiently included into the feature vector with little computational cost by this approach. Finally, the feature vectors are input into a support vector machine classifier to detect the protein remote homologies. Our experiments on a well-known benchmark show the proposed method SVM-PDT achieves superior or comparable performance with current state-of-the-art methods and its computational cost is considerably superior to those of other methods. When the evolutionary information extracted from the frequency profiles is combined with the PDT method, the profile-based PDT approach can improve the performance by 3.4% and 11.4% in terms of ROC score and ROC50 score respectively. The local sequence-order information of the protein can be efficiently captured by the proposed PDT and the physicochemical properties extracted from the amino acid index are incorporated into the prediction. The physicochemical distance transformation provides a general framework, which would be a valuable tool for protein-level study. PMID:23029559

  12. Adhesive proteins of stalked and acorn barnacles display homology with low sequence similarities.

    PubMed

    Jonker, Jaimie-Leigh; Abram, Florence; Pires, Elisabete; Varela Coelho, Ana; Grunwald, Ingo; Power, Anne Marie

    2014-01-01

    Barnacle adhesion underwater is an important phenomenon to understand for the prevention of biofouling and potential biotechnological innovations, yet so far, identifying what makes barnacle glue proteins 'sticky' has proved elusive. Examination of a broad range of species within the barnacles may be instructive to identify conserved adhesive domains. We add to extensive information from the acorn barnacles (order Sessilia) by providing the first protein analysis of a stalked barnacle adhesive, Lepas anatifera (order Lepadiformes). It was possible to separate the L. anatifera adhesive into at least 10 protein bands using SDS-PAGE. Intense bands were present at approximately 30, 70, 90 and 110 kilodaltons (kDa). Mass spectrometry for protein identification was followed by de novo sequencing which detected 52 peptides of 7-16 amino acids in length. None of the peptides matched published or unpublished transcriptome sequences, but some amino acid sequence similarity was apparent between L. anatifera and closely-related Dosima fascicularis. Antibodies against two acorn barnacle proteins (ab-cp-52k and ab-cp-68k) showed cross-reactivity in the adhesive glands of L. anatifera. We also analysed the similarity of adhesive proteins across several barnacle taxa, including Pollicipes pollicipes (a stalked barnacle in the order Scalpelliformes). Sequence alignment of published expressed sequence tags clearly indicated that P. pollicipes possesses homologues for the 19 kDa and 100 kDa proteins in acorn barnacles. Homology aside, sequence similarity in amino acid and gene sequences tended to decline as taxonomic distance increased, with minimum similarities of 18-26%, depending on the gene. The results indicate that some adhesive proteins (e.g. 100 kDa) are more conserved within barnacles than others (20 kDa).

  13. Adhesive Proteins of Stalked and Acorn Barnacles Display Homology with Low Sequence Similarities

    PubMed Central

    Jonker, Jaimie-Leigh; Abram, Florence; Pires, Elisabete; Varela Coelho, Ana; Grunwald, Ingo; Power, Anne Marie

    2014-01-01

    Barnacle adhesion underwater is an important phenomenon to understand for the prevention of biofouling and potential biotechnological innovations, yet so far, identifying what makes barnacle glue proteins ‘sticky’ has proved elusive. Examination of a broad range of species within the barnacles may be instructive to identify conserved adhesive domains. We add to extensive information from the acorn barnacles (order Sessilia) by providing the first protein analysis of a stalked barnacle adhesive, Lepas anatifera (order Lepadiformes). It was possible to separate the L. anatifera adhesive into at least 10 protein bands using SDS-PAGE. Intense bands were present at approximately 30, 70, 90 and 110 kilodaltons (kDa). Mass spectrometry for protein identification was followed by de novo sequencing which detected 52 peptides of 7–16 amino acids in length. None of the peptides matched published or unpublished transcriptome sequences, but some amino acid sequence similarity was apparent between L. anatifera and closely-related Dosima fascicularis. Antibodies against two acorn barnacle proteins (ab-cp-52k and ab-cp-68k) showed cross-reactivity in the adhesive glands of L. anatifera. We also analysed the similarity of adhesive proteins across several barnacle taxa, including Pollicipes pollicipes (a stalked barnacle in the order Scalpelliformes). Sequence alignment of published expressed sequence tags clearly indicated that P. pollicipes possesses homologues for the 19 kDa and 100 kDa proteins in acorn barnacles. Homology aside, sequence similarity in amino acid and gene sequences tended to decline as taxonomic distance increased, with minimum similarities of 18–26%, depending on the gene. The results indicate that some adhesive proteins (e.g. 100 kDa) are more conserved within barnacles than others (20 kDa). PMID:25295513

  14. DNA sequence, structure, and tyrosine kinase activity of the Drosophila melanogaster abelson proto-oncogene homolog

    SciTech Connect

    Henkemeyer, M.J.; Bennett, R.L.; Gertler, F.B.; Hoffmann, F.M.

    1988-02-01

    The authors report their molecular characterization of the Drosophila melanogaster Abelson gene (abl), a gene in which recessive loss-of-function mutations result in lethality at the pupal stage of development. This essential gene consists of 10 exons extending over 26 kilobase pairs of genomic DNA. The DNA sequence encodes a protein of 1,520 amino acids with strong sequence similarity to the human c-abl proto-oncogene beginning in the type 1b 5' exon and extending through the region essential for tyrosine kinase activity. When the tyrosine kinase homologous region was expressed in Escherichia coli, phosphorylation of proteins on tyrosine residues was observed with an antiphosphotyrosine antibody. These results show that the abl gene is highly conserved through evolution and encodes a functional tyrosine protein kinase required for Drosophila development.

  15. Chemical property based sequence characterization of PpcA and its homolog proteins PpcB-E: A mathematical approach

    PubMed Central

    Pal Choudhury, Pabitra

    2017-01-01

    Periplasmic c7 type cytochrome A (PpcA) protein is determined in Geobacter sulfurreducens along with its other four homologs (PpcB-E). From the crystal structure viewpoint the observation emerges that PpcA protein can bind with Deoxycholate (DXCA), while its other homologs do not. But it is yet to be established with certainty the reason behind this from primary protein sequence information. This study is primarily based on primary protein sequence analysis through the chemical basis of embedded amino acids. Firstly, we look for the chemical group specific score of amino acids. Along with this, we have developed a new methodology for the phylogenetic analysis based on chemical group dissimilarities of amino acids. This new methodology is applied to the cytochrome c7 family members and pinpoint how a particular sequence is differing with others. Secondly, we build a graph theoretic model on using amino acid sequences which is also applied to the cytochrome c7 family members and some unique characteristics and their domains are highlighted. Thirdly, we search for unique patterns as subsequences which are common among the group or specific individual member. In all the cases, we are able to show some distinct features of PpcA that emerges PpcA as an outstanding protein compared to its other homologs, resulting towards its binding with deoxycholate. Similarly, some notable features for the structurally dissimilar protein PpcD compared to the other homologs are also brought out. Further, the five members of cytochrome family being homolog proteins, they must have some common significant features which are also enumerated in this study. PMID:28362850

  16. Reconstruction of cyclooxygenase evolution in animals suggests variable, lineage-specific duplications, and homologs with low sequence identity.

    PubMed

    Havird, Justin C; Kocot, Kevin M; Brannock, Pamela M; Cannon, Johanna T; Waits, Damien S; Weese, David A; Santos, Scott R; Halanych, Kenneth M

    2015-04-01

    Cyclooxygenase (COX) enzymatically converts arachidonic acid into prostaglandin G/H in animals and has importance during pregnancy, digestion, and other physiological functions in mammals. COX genes have mainly been described from vertebrates, where gene duplications are common, but few studies have examined COX in invertebrates. Given the increasing ease in generating genomic data, as well as recent, although incomplete descriptions of potential COX sequences in Mollusca, Crustacea, and Insecta, assessing COX evolution across Metazoa is now possible. Here, we recover 40 putative COX orthologs by searching publicly available genomic resources as well as ~250 novel invertebrate transcriptomic datasets. Results suggest the common ancestor of Cnidaria and Bilateria possessed a COX homolog similar to those of vertebrates, although such homologs were not found in poriferan and ctenophore genomes. COX was found in most crustaceans and the majority of molluscs examined, but only specific taxa/lineages within Cnidaria and Annelida. For example, all octocorallians appear to have COX, while no COX homologs were found in hexacorallian datasets. Most species examined had a single homolog, although species-specific COX duplications were found in members of Annelida, Mollusca, and Cnidaria. Additionally, COX genes were not found in Hemichordata, Echinodermata, or Platyhelminthes, and the few previously described COX genes in Insecta lacked appreciable sequence homology (although structural analyses suggest these may still be functional COX enzymes). This analysis provides a benchmark for identifying COX homologs in future genomic and transcriptomic datasets, and identifies lineages for future studies of COX.

  17. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  18. Nasal pungency and odor of homologous aldehydes and carboxylic acids.

    PubMed

    Cometto-Muñiz, J E; Cain, W S; Abraham, M H

    1998-01-01

    Airborne substances can stimulate both the olfactory and the trigeminal nerve in the nose, giving rise to odor and pungent (irritant) sensations, respectively. Nose, eye, and throat irritation constitute common adverse effects in indoor environments. We measured odor and nasal pungency thresholds for homologous aliphatic aldehydes (butanal through octanal) and carboxylic acids (formic, acetic, butanoic, hexanoic, and octanoic). Nasal pungency was measured in subjects lacking olfaction (i.e., anosmics) to avoid odor biases. Similar to other homologous series, odor and pungency thresholds declined (i.e., sensory potency increased) with increasing carbon chain length. A previously derived quantitative structure-activity relationship (QSAR) based on solvation energies predicted all nasal pungency thresholds, except for acetic acid, implying that a key step in the mechanism for threshold pungency involves transfer of the inhaled substance from the vapor phase to the receptive biological phase. In contrast, acetic acid - with a pungency threshold lower than predicted - is likely to produce threshold pungency through direct chemical reaction with the mucosa. Both in the series studied here and in those studied previously, we reach a member at longer chain-lengths beyond which pungency fades. The evidence suggests a biological cut-off, presumably based upon molecular size, across the various series.

  19. QGRS-H Predictor: a web server for predicting homologous quadruplex forming G-rich sequence motifs in nucleotide sequences

    PubMed Central

    Menendez, Camille; Frees, Scott; Bagga, Paramjeet S.

    2012-01-01

    Naturally occurring G-quadruplex structural motifs, formed by guanine-rich nucleic acids, have been reported in telomeric, promoter and transcribed regions of mammalian genomes. G-quadruplex structures have received significant attention because of growing evidence for their role in important biological processes, human disease and as therapeutic targets. Lately, there has been much interest in the potential roles of RNA G-quadruplexes as cis-regulatory elements of post-transcriptional gene expression. Large-scale computational genomics studies on G-quadruplexes have difficulty validating their predictions without laborious testing in ‘wet’ labs. We have developed a bioinformatics tool, QGRS-H Predictor that can map and analyze conserved putative Quadruplex forming 'G'-Rich Sequences (QGRS) in mRNAs, ncRNAs and other nucleotide sequences, e.g. promoter, telomeric and gene flanking regions. Identifying conserved regulatory motifs helps validate computations and enhances accuracy of predictions. The QGRS-H Predictor is particularly useful for mapping homologous G-quadruplex forming sequences as cis-regulatory elements in the context of 5′- and 3′-untranslated regions, and CDS sections of aligned mRNA sequences. QGRS-H Predictor features highly interactive graphic representation of the data. It is a unique and user-friendly application that provides many options for defining and studying G-quadruplexes. The QGRS-H Predictor can be freely accessed at: http://quadruplex.ramapo.edu/qgrs/app/start. PMID:22576365

  20. Studying RNA homology and conservation with Infernal: from single sequences to RNA families

    PubMed Central

    Barquist, Lars; Burge, Sarah W.; Gardner, Paul P.

    2016-01-01

    Emerging high-throughput technologies have led to a deluge of putative non-coding RNA (ncRNA) sequences identified in a wide variety of organisms. Systematic characterization of these transcripts will be a tremendous challenge. Homology detection is critical to making maximal use of functional information gathered about ncRNAs: identifying homologous sequence allows us to transfer information gathered in one organism to another quickly and with a high degree of confidence. ncRNA presents a challenge for homology detection, as the primary sequence is often poorly conserved and de novo secondary structure prediction and search remains difficult. This protocol introduces methods developed by the Rfam database for identifying “families” of homologous ncRNAs starting from single “seed” sequences using manually curated sequence alignments to build powerful statistical models of sequence and structure conservation known as covariance models (CMs), implemented in the Infernal software package. We provide a step-by-step iterative protocol for identifying ncRNA homologs, then constructing an alignment and corresponding CM. We also work through an example for the bacterial small RNA MicA, discovering a previously unreported family of divergent MicA homologs in genus Xenorhabdus in the process. PMID:27322404

  1. Studying RNA Homology and Conservation with Infernal: From Single Sequences to RNA Families.

    PubMed

    Barquist, Lars; Burge, Sarah W; Gardner, Paul P

    2016-06-20

    Emerging high-throughput technologies have led to a deluge of putative non-coding RNA (ncRNA) sequences identified in a wide variety of organisms. Systematic characterization of these transcripts will be a tremendous challenge. Homology detection is critical to making maximal use of functional information gathered about ncRNAs: identifying homologous sequence allows us to transfer information gathered in one organism to another quickly and with a high degree of confidence. ncRNA presents a challenge for homology detection, as the primary sequence is often poorly conserved and de novo secondary structure prediction and search remain difficult. This unit introduces methods developed by the Rfam database for identifying "families" of homologous ncRNAs starting from single "seed" sequences, using manually curated sequence alignments to build powerful statistical models of sequence and structure conservation known as covariance models (CMs), implemented in the Infernal software package. We provide a step-by-step iterative protocol for identifying ncRNA homologs and then constructing an alignment and corresponding CM. We also work through an example for the bacterial small RNA MicA, discovering a previously unreported family of divergent MicA homologs in genus Xenorhabdus in the process. © 2016 by John Wiley & Sons, Inc.

  2. Hydrophobic-cluster analysis of plant protein sequences. A domain homology between storage and lipid-transfer proteins.

    PubMed Central

    Henrissat, B; Popineau, Y; Kader, J C

    1988-01-01

    Hydrophobic-cluster analysis was used to characterize a conserved domain located near the C-terminal amino acid sequence of wheat (Triticum aestivum) storage proteins. This domain was transformed into a linear template for a global search for similarities in over 5200 protein sequences. In addition to proteins that had already been found to exhibit homology to wheat storage proteins, a previously unreported homology was found with non-specific lipid-transfer proteins from castor bean (Ricinus communis) and from spinach (Spinacia oleracea) leaf. Hydrophobic-cluster analysis of various members of the present protein group clearly shows a typical domain structure where (i) variable and conserved domains are located along the sequence at precise positions, (ii) the conserved domains probably reflect a common ancestor, and (iii) the unique properties of a given protein (chain cut into subunits, repetitive domains, trypsin-inhibitor active site) are associated with the variable domains. PMID:3214430

  3. Induction of homologous recombination between sequence repeats by the activation induced cytidine deaminase (AID) protein.

    PubMed

    Buerstedde, Jean-Marie; Lowndes, Noel; Schatz, David G

    2014-07-08

    The activation induced cytidine deaminase (AID) protein is known to initiate somatic hypermutation, gene conversion or switch recombination by cytidine deamination within the immunoglobulin loci. Using chromosomally integrated fluorescence reporter transgenes, we demonstrate a new recombinogenic activity of AID leading to intra- and intergenic deletions via homologous recombination of sequence repeats. Repeat recombination occurs at high frequencies even when the homologous sequences are hundreds of bases away from the positions of AID-mediated cytidine deamination, suggesting DNA end resection before strand invasion. Analysis of recombinants between homeologous repeats yielded evidence for heteroduplex formation and preferential migration of the Holliday junctions to the boundaries of sequence homology. These findings broaden the target and off-target mutagenic potential of AID and establish a novel system to study induced homologous recombination in vertebrate cells.DOI: http://dx.doi.org/10.7554/eLife.03110.001.

  4. HorA web server to infer homology between proteins using sequence and structural similarity

    PubMed Central

    Kim, Bong-Hyun; Cheng, Hua; Grishin, Nick V.

    2009-01-01

    The biological properties of proteins are often gleaned through comparative analysis of evolutionary relatives. Although protein structure similarity search methods detect more distant homologs than purely sequence-based methods, structural resemblance can result from either homology (common ancestry) or analogy (similarity without common ancestry). While many existing web servers detect structural neighbors, they do not explicitly address the question of homology versus analogy. Here, we present a web server named HorA (Homology or Analogy) that identifies likely homologs for a query protein structure. Unlike other servers, HorA combines sequence information from state-of-the-art profile methods with structure information from spatial similarity measures using an advanced computational technique. HorA aims to identify biologically meaningful connections rather than purely 3D-geometric similarities. The HorA method finds ∼90% of remote homologs defined in the manually curated database SCOP. HorA will be especially useful for finding remote homologs that might be overlooked by other sequence or structural similarity search servers. The HorA server is available at http://prodata.swmed.edu/horaserver. PMID:19417074

  5. Homology Requirements for Targeting Heterologous Sequences during P-Induced Gap Repair in Drosophila Melanogaster

    PubMed Central

    Dray, T.; Gloor, G. B.

    1997-01-01

    The effect of homology on gene targeting was studied in the context of P-element-induced double-strand breaks at the white locus of Drosophila melanogaster. Double-strand breaks were made by excision of P-w(hd), a P-element insertion in the white gene. A nested set of repair templates was generated that contained the 8 kilobase (kb) yellow gene embedded within varying amounts of white gene sequence. Repair with unlimited homology was also analyzed. Flies were scored phenotypically for conversion of the yellow gene to the white locus. Targeting of the yellow gene was abolished when all of the 3' homology was removed. Increases in template homology up to 51 base pairs (bp) did not significantly promote targeting. Maximum conversion was observed with a construct containing 493 bp of homology, without a significant increase in frequency when homology extended to the tips of the chromosome. These results demonstrate that the homology requirements for targeting a large heterologous insertion are quite different than those for a point mutation. Furthermore, heterologous insertions strongly affect the homology requirements for the conversion of distal point mutations. Several aberrant conversion tracts, which arose from templates that contained reduced homology, also were examined and characterized. PMID:9335605

  6. Sequence homology of polymorphic AFLP markers in garlic (Allium sativum L.).

    PubMed

    Ipek, Meryem; Ipek, Ahmet; Simon, Philipp W

    2006-10-01

    Linkage mapping and genetic diversity studies with DNA markers in plant species assume that comigrating bands are identical, or at least that they have homologous sequences. To test this assumption in a plant with a large genome, sequence identities of 7 polymorphic amplified fragment length polymorphism (AFLP) markers of garlic, previously used to estimate similarity in genetic diversity studies, were characterized. Among 37 diverse garlic clones, 87 bands from these 7 polymorphisms were excised, amplicons were cloned, and 2 to 6 colonies were sequenced from each band, to yield a total of 191 DNA amplicons. Of these 87 bands, 83 bands (95.4%) contained AFLP amplicons that were identical or highly homologous to the typical marker of that band; only 4 bands contained amplicons with little homology to the same-sized amplicons of other garlic clones. Of these 83 bands, 64 (73.6%) contained only highly homologous amplicons (>90% sequence identity), whereas 19 (21.8%) contained both homologous and nonhomologous amplicons, with sequence identities less than 60%. Of the 37 nonhomologous amplicons identified, 25 (67.5%) differed in length from other amplicons in the band. Sequence conservation of AFLP amplicons followed patterns similar to phylogenetic relationships among garlic clones, making them useful for developing simple PCR-based markers in genetic mapping and diversity assessment.

  7. Distant homology detection using a LEngth and STructure-based sequence Alignment Tool (LESTAT).

    PubMed

    Lee, Marianne M; Bundschuh, Ralf; Chan, Michael K

    2008-05-15

    A new machine learning algorithm, LESTAT (LEngth and STructure-based sequence Alignment Tool) has been developed for detecting protein homologs having low-sequence identity. LESTAT is an iterative profile-based method that runs without reliance on a predefined library and incorporates several novel features that enhance its ability to identify remote sequences. To overcome the inherent bias associated with a single starting model, LESTAT utilizes three structural homologs to create a profile consisting of structurally conserved positions and block separation distances. Subsequent profiles are refined iteratively using sequence information obtained from previous cycles. Additionally, the refinement process incorporates a "lock-in" feature to retain the high-scoring sequences involved in previous alignments for subsequent model building and an enhancement factor to complement the weighting scheme used to build the position specific scoring matrix. A comparison of the performance of LESTAT against PSI-BLAST for seven systems reveals that LESTAT exhibits increased sensitivity and specificity over PSI-BLAST in six of these systems, based on the number of true homologs detected and the number of families these homologs covered. Notably, many of the hits identified are unique to each method, presumably resulting from the distinct differences in the two approaches. Taken together, these findings suggest that LESTAT is a useful complementary method to PSI-BLAST in the detection of distant homologs.

  8. Sequence analysis of carcinoembryonic antigen: identification of glycosylation sites and homology with the immunoglobulin supergene family.

    PubMed Central

    Paxton, R J; Mooser, G; Pande, H; Lee, T D; Shively, J E

    1987-01-01

    A direct method for the determination of N-linked glycosylation sites in highly glycosylated proteins is described. Carcinoembryonic antigen (CEA) and a nonspecific crossreacting antigen (NCA) were chemically deglycosylated, and peptide maps were prepared by reverse-phase HPLC. The peptides were sequenced on a gas-phase microsequencer, and glycosylation sites were identified as the phenylthiohydantoin derivative of N-acetylglucosaminylasparagine. The sequences were confirmed by fast atom bombardment mass spectrometry. Highly homologous, extended amino-terminal sequences were determined for CEA and two NCAs, NCA-95 and NCA-55. Cysteine-containing sequences for CEA and NCA-95 show up to 95% sequence homology, and the CEA sequences also show internal sequence homologies. A comparison of the CEA sequences with known protein sequences suggests that CEA may be a member of the immunoglobulin supergene family. The protein sequence data have been used to identify a genomic DNA clone for one of the NCA antigens [Thompson, J., Pande, H., Paxton, R. J., Shively, L., Padma, A., Simmer, R. L., Todd, C. W., Riggs, A. D. & Shively, J. E. (1987) Proc. Natl. Acad. Sci. USA, in press] and a cDNA clone for CEA [Zimmermann, W., Ortlieb, B., Friedrich, R. & von Kleist, S. (1987) Proc. Natl. Acad. Sci. USA, in press]. Images PMID:3469650

  9. Human beta-hexosaminidase alpha chain: coding sequence and homology with the beta chain.

    PubMed Central

    Myerowitz, R; Piekarz, R; Neufeld, E F; Shows, T B; Suzuki, K

    1985-01-01

    We have isolated a cDNA clone, p beta H alpha-5, from an adult human liver library that contains the entire coding sequence of the alpha chain of beta-hexosaminidase. The cDNA insert of p beta H alpha-5 is 1944 base pairs long and contains a 168-base-pair 5' untranslated region, a 186-base-pair 3' untranslated region, and an open reading frame of 1587 base pairs corresponding to 529 amino acids (Mr, 60,697). The first 17-22 amino acids satisfy the requirements of a signal sequence. A striking sequence homology with a published partial amino acid sequence for the beta chain [O'Dowd, B. F., Quan, F., Willard, H. F., Lamhonwah, A. M., Korneluk, R. G., Lowden, J. A., Gravel, R. A. & Mahuran, D. J. (1985) Proc. Natl. Acad. Sci. USA 82, 1184-1188] suggests that both chains may have evolved from a common ancestor. A shorter alpha-chain cDNA was found to hybridize to the long arm of chromosome 15, the known location for the alpha-chain gene. In addition, we isolated another alpha-chain cDNA clone, p beta H alpha-4, from a simian virus 40-transformed human fibroblast library that contained an extra 453-base-pair piece at its 3' end. A probe consisting of this additional sequence hybridized exclusively to a single mRNA species (2.6 kilobases) in mRNA preparations from cultured human fibroblasts. In contrast, p beta H alpha-5 hybridized to both a 2.1-kilobase major and a 2.6-kilobase minor mRNA species in these same mRNA preparations, indicating the presence of two distinct alpha-chain mRNA species differing at the 3' end. Fibroblasts from an Ashkenazi Jewish patient with classic Tay-Sachs disease were deficient in both species of mRNA, confirming their genetic relationship. Images PMID:2933746

  10. Homologous recombination drives both sequence diversity and gene content variation in Neisseria meningitidis.

    PubMed

    Kong, Ying; Ma, Jennifer H; Warren, Keisha; Tsang, Raymond S W; Low, Donald E; Jamieson, Frances B; Alexander, David C; Hao, Weilong

    2013-01-01

    The study of genetic and phenotypic variation is fundamental for understanding the dynamics of bacterial genome evolution and untangling the evolution and epidemiology of bacterial pathogens. Neisseria meningitidis (Nm) is among the most intriguing bacterial pathogens in genomic studies due to its dynamic population structure and complex forms of pathogenicity. Extensive genomic variation within identical clonal complexes (CCs) in Nm has been recently reported and suggested to be the result of homologous recombination, but the extent to which recombination contributes to genomic variation within identical CCs has remained unclear. In this study, we sequenced two Nm strains of identical serogroup (C) and multi-locus sequence type (ST60), and conducted a systematic analysis with an additional 34 Nm genomes. Our results revealed that all gene content variation between the two ST60 genomes was introduced by homologous recombination at the conserved flanking genes, and 94.25% or more of sequence divergence was caused by homologous recombination. Recombination was found in genes associated with virulence factors, antigenic outer membrane proteins, and vaccine targets, suggesting an important role of homologous recombination in rapidly altering the pathogenicity and antigenicity of Nm. Recombination was also evident in genes of the restriction and modification systems, which may undermine barriers to DNA exchange. In conclusion, homologous recombination can drive both gene content variation and sequence divergence in Nm. These findings shed new light on the understanding of the rapid pathoadaptive evolution of Nm and other recombinogenic bacterial pathogens.

  11. High speed nucleic acid sequencing

    SciTech Connect

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.

  12. Nucleotide sequence and intergeminiviral homologies of the DNA-A of papaya leaf curl geminivirus from India.

    PubMed

    Saxena, S; Hallan, V; Singh, B P; Sane, P V

    1998-06-01

    Coat protein gene, rep protein gene and intergenic region of the genome of a whitefly transmitted geminivirus (WTG) causing severe leaf curl in papaya plants were PCR amplified, cloned and sequenced. Comparison of the amino acid sequence of the putative coat protein product of papaya leaf curl virus (PLCV) with some other mono and bipartite WTGs revealed a maximum of 89.8% homology with Indian cassava mosaic virus. The genomic organization of PLCV-India is similar to other WTGs with bipartite genomes. Comparison of the coat protein N-terminal 70 amino acid sequence (and other biological features) of PLCV with other geminiviruses shows that PLCV is a distinct geminivirus from India and is related to WTGs from the old world.

  13. Transitive Homology-Guided Structural Studies Lead to Discovery of Cro Proteins With 40% Sequence Identify But Different Folds

    SciTech Connect

    Roessler, C.G.; Hall, B.M.; Anderson, W.J.; Ingram, W.M.; Roberts, S.A.; Montfort, W.R.; Cordes, M.H.J.

    2009-05-27

    Proteins that share common ancestry may differ in structure and function because of divergent evolution of their amino acid sequences. For a typical diverse protein superfamily, the properties of a few scattered members are known from experiment. A satisfying picture of functional and structural evolution in relation to sequence changes, however, may require characterization of a larger, well chosen subset. Here, we employ a 'stepping-stone' method, based on transitive homology, to target sequences intermediate between two related proteins with known divergent properties. We apply the approach to the question of how new protein folds can evolve from preexisting folds and, in particular, to an evolutionary change in secondary structure and oligomeric state in the Cro family of bacteriophage transcription factors, initially identified by sequence-structure comparison of distant homologs from phages P22 and {lambda}. We report crystal structures of two Cro proteins, Xfaso 1 and Pfl 6, with sequences intermediate between those of P22 and {lambda}. The domains show 40% sequence identity but differ by switching of {alpha}-helix to {beta}-sheet in a C-terminal region spanning {approx}25 residues. Sedimentation analysis also suggests a correlation between helix-to-sheet conversion and strengthened dimerization.

  14. A work stealing based approach for enabling scalable optimal sequence homology detection

    SciTech Connect

    Daily, Jeffrey A.; Kalyanaraman, Anantharaman; Krishnamoorthy, Sriram; Vishnu, Abhinav

    2015-05-01

    Sequence homology detection is central to a number of bioinformatics applications including genome sequencing and protein family characterization. Given millions of sequences, the goal is to identify all pairs of sequences that are highly similar (or “homologous”) on the basis of alignment criteria. While there are optimal alignment algorithms to compute pairwise homology, their deployment for large-scale is currently not feasible; instead, heuristic methods are used at the expense of quality. Here, we present the design and evaluation of a parallel implementation for conducting optimal homology detection on distributed memory supercomputers. Our approach uses a combination of techniques from asynchronous load balancing (viz. work stealing, dynamic task counters), data replication, and exact-matching filters to achieve homology detection at scale. Results for 2.56M sequences on up to 8K cores show parallel efficiencies of ~ 75-100%, a time-to-solution of 33s, and a rate of ~ 2.0M alignments per second.

  15. The sequences of heat shock protein 40 (DnaJ) homologs provide evidence for a close evolutionary relationship between the Deinococcus-thermus group and cyanobacteria.

    PubMed

    Bustard, K; Gupta, R S

    1997-08-01

    The genes encoding for heat shock protein 40 (Hsp40 or DnaJ) homologs were cloned and sequenced from the archaebacterium Halobacterium cutirubrum and the eubacterium Deinococcus proteolyticus to add to sequences from the gene banks. These genes were identified downstream of the Hsp70 (or DnaK) genes in genomic fragments spanning this region and, as in other prokaryotic species, Hsp70-Hsp40 genes are likely part of the same operon. The Hsp40 homolog from D. proteolyticus was found to be lacking a central 204 base pair region present in H. cutirubrum that encodes for the four cysteine-rich domains of the repeat consensus sequence CxxCxGxG (where x is any amino acid), present in most Hsp40 homologs. The available sequences from various archaebacteria, eubacteria, and eukaryotes show that the same deletion is also present in the homologs from Thermus aquaticus and two cyanobacteria, but in no other species tested. This unique deletion and the clustering of homologs from the Deinococcus-Thermus group and cyanobacterial species in the Hsp40 phylogenetic trees suggest a close evolutionary relationship between these groups as was also shown recently for Hsp70 sequences (R.S. Gupta et al., J Bacteriol 179:345-357, 1997). Sequence comparisons indicate that the Hsp40 homologs are not as conserved as the Hsp70 sequences. Phylogenetic analysis provides no reliable information concerning evolutionary relationship between prokaryotes and eukaryotes and their usefulness in this regard is limited. However, in phylogenetic trees based on Hsp40 sequences, the two archaebacterial homologs showed a polyphyletic branching within Gram-positive bacteria, similar to that seen with Hsp70 sequences.

  16. Sequence divergence and chromosomal rearrangements during the evolution of human pseudoautosomal genes and their mouse homologs

    SciTech Connect

    Ellison, J.; Li, X.; Francke, U.

    1994-09-01

    The pseudoautosomal region (PAR) is an area of sequence identity between the X and Y chromosomes and is important for mediating X-Y pairing during male meiosis. Of the seven genes assigned to the human PAR, none of the mouse homologs have been isolated by a cross-hybridization strategy. Two of these homologs, Csfgmra and II3ra, have been isolated using a functional assay for the gene products. These genes are quite different in sequence from their human homologs, showing only 60-70% sequence similarity. The Csfgmra gene has been found to further differ from its human homolog in being isolated not on the sex chromosomes, but on a mouse autosome (chromosome 19). Using a mouse-hamster somatic cell hybrid mapping panel, we have mapped the II3ra gene to yet another mouse autosome, chromosome 14. Attempts to clone the mouse homolog of the ANT3 locus resulted in the isolation of two related genes, Ant1 and Ant2, but failed to yield the Ant3 gene. Southern blot analysis of the ANT/Ant genes showed the Ant1 and Ant2 sequences to be well-conserved among all of a dozen mammals tested. In contrast, the ANT3 gene only showed hybridization to non-rodent mammals, suggesting it is either greatly divergent or has been deleted in the rodent lineage. Similar experiments with other human pseudoautosomal probes likewise showed a lack of hybridization to rodent sequences. The results show a definite trend of extensive divergence of pseudoautosomal sequences in addition to chromosomal rearrangements involving X;autosome translocations and perhaps gene deletions. Such observations have interesting implications regarding the evolution of this important region of the sex chromosomes.

  17. Homology-Dependent Silencing by an Exogenous Sequence in the Drosophila Germline

    PubMed Central

    Pöyhönen, Maria; de Vanssay, Augustin; Delmarre, Valérie; Hermant, Catherine; Todeschini, Anne Laure; Teysset, Laure; Ronsseray, Stéphane

    2012-01-01

    The study of P transposable element repression in Drosophila melanogaster led to the discovery of the trans-silencing effect (TSE), a homology-dependent repression mechanism by which a P-transgene inserted in subtelomeric heterochromatin (Telomeric Associated Sequences) represses in trans, in the female germline, a homologous P-lacZ transgene inserted in euchromatin. TSE shows variegation in ovaries and displays a maternal effect as well as epigenetic transmission through meiosis. In addition, TSE is highly sensitive to mutations affecting heterochromatin components (including HP1) and the Piwi-interacting RNA silencing pathway (piRNA), a homology-dependent silencing mechanism that functions in the germline. TSE appears thus to involve the piRNA-based silencing proposed to play a major role in P repression. Under this hypothesis, TSE may also be established when homology between the telomeric and target loci involves sequences other than P elements, including sequences exogenous to the D. melanogaster genome. We have tested whether TSE can be induced via lacZ sequence homology. We generated a piggyBac-otu-lacZ transgene in which lacZ is under the control of the germline ovarian tumor promoter, resulting in strong expression in nurse cells and the oocyte. We show that all piggyBac-otu-lacZ transgene insertions are strongly repressed by maternally inherited telomeric P-lacZ transgenes. This repression shows variegation between egg chambers when it is incomplete and presents a maternal effect, two of the signatures of TSE. Finally, this repression is sensitive to mutations affecting aubergine, a key player of the piRNA pathway. These data show that TSE can occur when silencer and target loci share solely a sequence exogenous to the D. melanogaster genome. This functionally supports the hypothesis that TSE represents a general repression mechanism which can be co-opted by new transposable elements to regulate their activity after a transfer to the D. melanogaster

  18. Solid phase sequencing of double-stranded nucleic acids

    DOEpatents

    Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

    2002-01-01

    This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probe comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.

  19. PyMod: sequence similarity searches, multiple sequence-structure alignments, and homology modeling within PyMOL

    PubMed Central

    2012-01-01

    Background In recent years, an exponential growing number of tools for protein sequence analysis, editing and modeling tasks have been put at the disposal of the scientific community. Despite the vast majority of these tools have been released as open source software, their deep learning curves often discourages even the most experienced users. Results A simple and intuitive interface, PyMod, between the popular molecular graphics system PyMOL and several other tools (i.e., [PSI-]BLAST, ClustalW, MUSCLE, CEalign and MODELLER) has been developed, to show how the integration of the individual steps required for homology modeling and sequence/structure analysis within the PyMOL framework can hugely simplify these tasks. Sequence similarity searches, multiple sequence and structural alignments generation and editing, and even the possibility to merge sequence and structure alignments have been implemented in PyMod, with the aim of creating a simple, yet powerful tool for sequence and structure analysis and building of homology models. Conclusions PyMod represents a new tool for the analysis and the manipulation of protein sequences and structures. The ease of use, integration with many sequence retrieving and alignment tools and PyMOL, one of the most used molecular visualization system, are the key features of this tool. Source code, installation instructions, video tutorials and a user's guide are freely available at the URL http://schubert.bio.uniroma1.it/pymod/index.html PMID:22536966

  20. Molecular cloning, sequence analysis and homology modeling of the first caudata amphibian antifreeze-like protein in axolotl (Ambystoma mexicanum).

    PubMed

    Zhang, Songyan; Gao, Jiuxiang; Lu, Yiling; Cai, Shasha; Qiao, Xue; Wang, Yipeng; Yu, Haining

    2013-08-01

    Antifreeze proteins (AFPs) refer to a class of polypeptides that are produced by certain vertebrates, plants, fungi, and bacteria and which permit their survival in subzero environments. In this study, we report the molecular cloning, sequence analysis and three-dimensional structure of the axolotl antifreeze-like protein (AFLP) by homology modeling of the first caudate amphibian AFLP. We constructed a full-length spleen cDNA library of axolotl (Ambystoma mexicanum). An EST having highest similarity (∼42%) with freeze-responsive liver protein Li16 from Rana sylvatica was identified, and the full-length cDNA was subsequently obtained by RACE-PCR. The axolotl antifreeze-like protein sequence represents an open reading frame for a putative signal peptide and the mature protein composed of 93 amino acids. The calculated molecular mass and the theoretical isoelectric point (pl) of this mature protein were 10128.6 Da and 8.97, respectively. The molecular characterization of this gene and its deduced protein were further performed by detailed bioinformatics analysis. The three-dimensional structure of current AFLP was predicted by homology modeling, and the conserved residues required for functionality were identified. The homology model constructed could be of use for effective drug design. This is the first report of an antifreeze-like protein identified from a caudate amphibian.

  1. WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences

    PubMed Central

    Pavesi, Giulio; Zambelli, Federico; Pesole, Graziano

    2007-01-01

    Background This work addresses the problem of detecting conserved transcription factor binding sites and in general regulatory regions through the analysis of sequences from homologous genes, an approach that is becoming more and more widely used given the ever increasing amount of genomic data available. Results We present an algorithm that identifies conserved transcription factor binding sites in a given sequence by comparing it to one or more homologs, adapting a framework we previously introduced for the discovery of sites in sequences from co-regulated genes. Differently from the most commonly used methods, the approach we present does not need or compute an alignment of the sequences investigated, nor resorts to descriptors of the binding specificity of known transcription factors. The main novel idea we introduce is a relative measure of conservation, assuming that true functional elements should present a higher level of conservation with respect to the rest of the sequence surrounding them. We present tests where we applied the algorithm to the identification of conserved annotated sites in homologous promoters, as well as in distal regions like enhancers. Conclusion Results of the tests show how the algorithm can provide fast and reliable predictions of conserved transcription factor binding sites regulating the transcription of a gene, with better performances than other available methods for the same task. We also show examples on how the algorithm can be successfully employed when promoter annotations of the genes investigated are missing, or when regulatory sites and regions are located far away from the genes. PMID:17286865

  2. EUGENE'HOM: A generic similarity-based gene finder using multiple homologous sequences.

    PubMed

    Foissac, Sylvain; Bardou, Philippe; Moisan, Annick; Cros, Marie-Josée; Schiex, Thomas

    2003-07-01

    EUGENE'HOM is a gene prediction software for eukaryotic organisms based on comparative analysis. EUGENE'HOM is able to take into account multiple homologous sequences from more or less closely related organisms. It integrates the results of TBLASTX analysis, splice site and start codon prediction and a robust coding/non-coding probabilistic model which allows EUGENE'HOM to handle sequences from a variety of organisms. The current target of EUGENE'HOM is plant sequences. The EUGENE'HOM web site is available at http://genopole.toulouse.inra.fr/bioinfo/eugene/EuGeneHom/cgi-bin/EuGeneHom.pl.

  3. EUGÈNE'HOM: a generic similarity-based gene finder using multiple homologous sequences

    PubMed Central

    Foissac, Sylvain; Bardou, Philippe; Moisan, Annick; Cros, Marie-Josée; Schiex, Thomas

    2003-01-01

    EUGÈNE'HOM is a gene prediction software for eukaryotic organisms based on comparative analysis. EUGÈNE'HOM is able to take into account multiple homologous sequences from more or less closely related organisms. It integrates the results of TBLASTX analysis, splice site and start codon prediction and a robust coding/non-coding probabilistic model which allows EUGÈNE'HOM to handle sequences from a variety of organisms. The current target of EUGÈNE'HOM is plant sequences. The EUGÈNE'HOM web site is available at http://genopole.toulouse.inra.fr/bioinfo/eugene/EuGeneHom/cgi-bin/EuGeneHom.pl. PMID:12824408

  4. Recombination sequences in plant mitochondrial genomes: diversity and homologies to known mitochondrial genes.

    PubMed Central

    Stern, D B; Palmer, J D

    1984-01-01

    Several plant mitochondrial genomes contain repeated sequences that are postulated to be sites of homologous intragenomic recombination (1-3). In this report, we have used filter hybridizations to investigate sequence relationships between the cloned mitochondrial DNA (mtDNA) recombination repeats from turnip, spinach and maize and total mtDNA isolated from thirteen species of angiosperms. We find that strong sequence homologies exist between the spinach and turnip recombination repeats and essentially all other mitochondrial genomes tested, whereas a major maize recombination repeat does not hybridize to any other mtDNA. The sequences homologous to the turnip repeat do not appear to function in recombination in any other genome, whereas the spinach repeat hybridizes to reiterated sequences within the mitochondrial genomes of wheat and two species of pokeweed that do appear to be sites of recombination. Thus, although intragenomic recombination is a widespread phenomenon in plant mitochondria, it appears that different sequences either serve as substrates for this function in different species, or else surround a relatively short common recombination site which does not cross-hybridize under our experimental conditions. Identified gene sequences from maize mtDNA were used in heterologous hybridizations to show that the repeated sequences implicated in recombination in turnip and spinach/pokeweed/wheat mitochondria include, or are closely linked to genes for subunit II of cytochrome c oxidase and 26S rRNA, respectively. Together with previous studies indicating that the 18S rRNA gene in wheat mtDNA is contained within a recombination repeat (3), these results imply an unexpectedly frequent association between recombination repeats and plant mitochondrial genes. Images PMID:6473104

  5. Detection of sequences homologous to human retroviral DNA in multiple sclerosis by gene amplification

    SciTech Connect

    Greenberg, S.J.; Ehrlich, G.D.; Abbott, M.A.; Hurwitz, B.J.; Waldmann, T.A.; Poiesz, B.J. )

    1989-04-01

    Twenty-one patients with multiple sclerosis, chronic progressive type, were examined for DNA sequences homologous to a human retrovirus. Genomic DNA from peripheral blood mononuclear cells was analyzed for the presence of homologous sequences to the human T-cell leukemia/lymphoma virus type I (HTLV-I) long terminal repeat, 3{prime} gag, pol, and env domains by the enzymatic in vitro gene amplification technique, polymerase chain reaction. Positive identification of homologous pol sequences was made in the amplified DNA from six of these patients (29%). Three of these six patients (14%) also tested positive for the env region, but not for the other regions tested. In contrast, none of the samples from 35 normal individuals studied was positive when amplified and tested with the same primers and probes. Comparison of patterns obtained from controls and from patients with adult T-cell leukemia or tropical spastic paraparesis suggests that the DNA sequences identified are exogenous to the human genome and may correspond to a human retroviral species. The data support the detection of a human retroviral agent in some patients with multiple sclerosis.

  6. Sequence basis of Barnacle Cement Nanostructure is Defined by Proteins with Silk Homology

    NASA Astrophysics Data System (ADS)

    So, Christopher R.; Fears, Kenan P.; Leary, Dagmar H.; Scancella, Jenifer M.; Wang, Zheng; Liu, Jinny L.; Orihuela, Beatriz; Rittschof, Dan; Spillmann, Christopher M.; Wahl, Kathryn J.

    2016-11-01

    Barnacles adhere by producing a mixture of cement proteins (CPs) that organize into a permanently bonded layer displayed as nanoscale fibers. These cement proteins share no homology with any other marine adhesives, and a common sequence-basis that defines how nanostructures function as adhesives remains undiscovered. Here we demonstrate that a significant unidentified portion of acorn barnacle cement is comprised of low complexity proteins; they are organized into repetitive sequence blocks and found to maintain homology to silk motifs. Proteomic analysis of aggregate bands from PAGE gels reveal an abundance of Gly/Ala/Ser/Thr repeats exemplified by a prominent, previously unidentified, 43 kDa protein in the solubilized adhesive. Low complexity regions found throughout the cement proteome, as well as multiple lysyl oxidases and peroxidases, establish homology with silk-associated materials such as fibroin, silk gum sericin, and pyriform spidroins from spider silk. Distinct primary structures defined by homologous domains shed light on how barnacles use low complexity in nanofibers to enable adhesion, and serves as a starting point for unraveling the molecular architecture of a robust and unique class of adhesive nanostructures.

  7. Sequence basis of Barnacle Cement Nanostructure is Defined by Proteins with Silk Homology

    PubMed Central

    So, Christopher R.; Fears, Kenan P.; Leary, Dagmar H.; Scancella, Jenifer M.; Wang, Zheng; Liu, Jinny L.; Orihuela, Beatriz; Rittschof, Dan; Spillmann, Christopher M.; Wahl, Kathryn J.

    2016-01-01

    Barnacles adhere by producing a mixture of cement proteins (CPs) that organize into a permanently bonded layer displayed as nanoscale fibers. These cement proteins share no homology with any other marine adhesives, and a common sequence-basis that defines how nanostructures function as adhesives remains undiscovered. Here we demonstrate that a significant unidentified portion of acorn barnacle cement is comprised of low complexity proteins; they are organized into repetitive sequence blocks and found to maintain homology to silk motifs. Proteomic analysis of aggregate bands from PAGE gels reveal an abundance of Gly/Ala/Ser/Thr repeats exemplified by a prominent, previously unidentified, 43 kDa protein in the solubilized adhesive. Low complexity regions found throughout the cement proteome, as well as multiple lysyl oxidases and peroxidases, establish homology with silk-associated materials such as fibroin, silk gum sericin, and pyriform spidroins from spider silk. Distinct primary structures defined by homologous domains shed light on how barnacles use low complexity in nanofibers to enable adhesion, and serves as a starting point for unraveling the molecular architecture of a robust and unique class of adhesive nanostructures. PMID:27824121

  8. Sequence basis of Barnacle Cement Nanostructure is Defined by Proteins with Silk Homology.

    PubMed

    So, Christopher R; Fears, Kenan P; Leary, Dagmar H; Scancella, Jenifer M; Wang, Zheng; Liu, Jinny L; Orihuela, Beatriz; Rittschof, Dan; Spillmann, Christopher M; Wahl, Kathryn J

    2016-11-08

    Barnacles adhere by producing a mixture of cement proteins (CPs) that organize into a permanently bonded layer displayed as nanoscale fibers. These cement proteins share no homology with any other marine adhesives, and a common sequence-basis that defines how nanostructures function as adhesives remains undiscovered. Here we demonstrate that a significant unidentified portion of acorn barnacle cement is comprised of low complexity proteins; they are organized into repetitive sequence blocks and found to maintain homology to silk motifs. Proteomic analysis of aggregate bands from PAGE gels reveal an abundance of Gly/Ala/Ser/Thr repeats exemplified by a prominent, previously unidentified, 43 kDa protein in the solubilized adhesive. Low complexity regions found throughout the cement proteome, as well as multiple lysyl oxidases and peroxidases, establish homology with silk-associated materials such as fibroin, silk gum sericin, and pyriform spidroins from spider silk. Distinct primary structures defined by homologous domains shed light on how barnacles use low complexity in nanofibers to enable adhesion, and serves as a starting point for unraveling the molecular architecture of a robust and unique class of adhesive nanostructures.

  9. Amino acid sequence of porcine spleen cathepsin D.

    PubMed Central

    Shewale, J G; Tang, J

    1984-01-01

    The amino acid sequence of porcine spleen cathepsin D heavy chain has been determined and, hence, the complete structure of this enzyme is now known. The sequence of heavy chain was constructed by aligning the structures of peptides generated by cyanogen bromide, trypsin, and endo-proteinase Lys C cleavages. The structure of the light chain has been published previously. The cathepsin D molecule contains 339 amino acid residues in two polypeptide chains: a 97-residue light chain and a 242-residue heavy chain, with a combined Mr of 36,779 (without carbohydrate). There are two carbohydrate units linked to asparagine residues 70 and 192. The disulfide bond arrangement in cathepsin D is probably similar to that of pepsin, because the positions of six half-cystine residues are conserved. The active site aspartyl residues, corresponding to aspartic acid-32 and -215 of pepsin, are located at residues 33 and 224 in the cathepsin D molecule. The amino acid sequence around these aspartyl residues is strongly conserved. Cathepsin D shows a strong homology with other acid proteases. When the sequence of cathepsin D, renin, and pepsin are aligned, 32.7% of the residues are identical. The homology is observed throughout the length of the molecules, indicating that three-dimensional structures of all three molecules are similar. PMID:6587385

  10. Structure- and Sequence-Based Function Prediction for Non-Homologous Proteins

    PubMed Central

    Sael, Lee; Chitale, Meghana; Kihara, Daisuke

    2012-01-01

    The structural genomics projects have been accumulating an increasing number of protein structures, many of which remain functionally unknown. In parallel effort to experimental methods, computational methods are expected to make a significant contribution for functional elucidation of such proteins. However, conventional computational methods that transfer functions from homologous proteins do not help much for these uncharacterized protein structures because they do not have apparent structural or sequence similarity with the known proteins. Here, we briefly review two avenues of computational function prediction methods, i.e. structure-based methods and sequence-based methods. The focus is on our recently developments of local structure-based methods and sequence-based methods, which can effectively extract function information from distantly related proteins. Two structure-based methods, Pocket-Surfer and Patch-Surfer, identify similar known ligand binding sites for pocket regions in a query protein without using global protein fold similarity information. Two sequence-based methods, PFP and ESG, make use of weakly similar sequences that are conventionally discarded in homology based function annotation. Combined together with experimental methods we hope that computational methods will make leading contribution in functional elucidation of the protein structures. PMID:22270458

  11. Identification of novel DNA repair proteins via primary sequence, secondary structure, and homology

    PubMed Central

    Brown, JB; Akutsu, Tatsuya

    2009-01-01

    Background DNA repair is the general term for the collection of critical mechanisms which repair many forms of DNA damage such as methylation or ionizing radiation. DNA repair has mainly been studied in experimental and clinical situations, and relatively few information-based approaches to new extracting DNA repair knowledge exist. As a first step, automatic detection of DNA repair proteins in genomes via informatics techniques is desirable; however, there are many forms of DNA repair and it is not a straightforward process to identify and classify repair proteins with a single optimal method. We perform a study of the ability of homology and machine learning-based methods to identify and classify DNA repair proteins, as well as scan vertebrate genomes for the presence of novel repair proteins. Combinations of primary sequence polypeptide frequency, secondary structure, and homology information are used as feature information for input to a Support Vector Machine (SVM). Results We identify that SVM techniques are capable of identifying portions of DNA repair protein datasets without admitting false positives; at low levels of false positive tolerance, homology can also identify and classify proteins with good performance. Secondary structure information provides improved performance compared to using primary structure alone. Furthermore, we observe that machine learning methods incorporating homology information perform best when data is filtered by some clustering technique. Analysis by applying these methodologies to the scanning of multiple vertebrate genomes confirms a positive correlation between the size of a genome and the number of DNA repair protein transcripts it is likely to contain, and simultaneously suggests that all organisms have a non-zero minimum number of repair genes. In addition, the scan result clusters several organisms' repair abilities in an evolutionarily consistent fashion. Analysis also identifies several functionally unconfirmed

  12. Nucleotide Sequence of the Envelope Gene of Gardner-Arnstein Feline Leukemia Virus B Reveals Unique Sequence Homologies with a Murine Mink Cell Focus-Forming Virus †

    PubMed Central

    Elder, John H.; Mullins, James I.

    1983-01-01

    The nucleotide sequence of the envelope gene and the adjacent 3′ long terminal repeat (LTR) of Gardner-Arnstein feline leukemia virus of subgroup B (GA-FeLV-B) has been determined. Comparison of the derived amino acid sequence of the gp70-p15E polyprotein to those of several previously reported murine retroviruses revealed striking homologies between GA-FeLV-B gp70 and the gp70 of a Moloney virus-derived mink cell focus-forming virus. These homologies were located within the substituted (presumably xenotropic) portion of the mink cell focus-forming virus envelope gene and comprised amino acid sequences not present in three ecotropic virus gp70s. In addition, areas of insertions and deletions, in general, were the same between GA-FeLV-B and Moloney mink cell focus-forming virus, although the sizes of the insertions and deletions differed. Homologies between GA-FeLV-B and mink cell focus-forming virus gp70s is functionally significant in that they both possess expanded host ranges, a property dictated by gp70. The amino acid sequence of FeLV-B contains 12 Asn-X-Ser/Thr sequences, indicating 12 possible sites of N-linked glycosylation as compared with 7 or 8 for its murine counterparts. Comparison of the 3′ LTR of GA-FeLV-B to AKR and Moloney virus LTRs revealed extensive conservation in several regions including the “CCAAT” and Goldberg-Hogness (TATA) boxes thought to be involved in promotion of transcription and in the repeat region of the LTR. The inverted repeats that flanked the LTR of GA-FeLV-B were identical to the murine inverted repeats, but were one base longer than the latter. The region of U3 corresponding to the approximately 75-nucleotide “enhancer sequence” is present in GA-FeLV-B, but contains deletions relative to AKR and Moloney virus and is not repeated. An interesting pallindrome in the repeat region immediately 3′ to the U3 region was noted in all the LTRs, but was particularly pronounced in GA-FeLV-B. Possible roles for this

  13. Active site amino acid sequence of human factor D.

    PubMed

    Davis, A E

    1980-08-01

    Factor D was isolated from human plasma by chromatography on CM-Sephadex C50, Sephadex G-75, and hydroxylapatite. Digestion of reduced, S-carboxymethylated factor D with cyanogen bromide resulted in three peptides which were isolated by chromatography on Sephadex G-75 (superfine) equilibrated in 20% formic acid. NH2-Terminal sequences were determined by automated Edman degradation with a Beckman 890C sequencer using a 0.1 M Quadrol program. The smallest peptide (CNBr III) consisted of the NH2-terminal 14 amino acids. The other two peptides had molecular weights of 17,000 (CNBr I) and 7000 (CNBr II). Overlap of the NH2-terminal sequence of factor D with the NH2-terminal sequence of CNBr I established the order of the peptides. The NH2-terminal 53 residues of factor D are somewhat more homologous with the group-specific protease of rat intestine than with other serine proteases. The NH2-terminal sequence of CNBr II revealed the active site serine of factor D. The typical serine protease active site sequence (Gly-Asp-Ser-Gly-Gly-Pro was found at residues 12-17. The region surrounding the active site serine does not appear to be more highly homologous with any one of the other serine proteases. The structural data obtained point out the similarities between factor D and the other proteases. However, complete definition of the degree of relationship between factor D and other proteases will require determination of the remainder of the primary structure.

  14. Meiotic recombination at the Lmp2 hotspot tolerates minor sequence divergence between homologous chromosomes

    SciTech Connect

    Yoshino, Masayasu; Sagai, Tomoko; Shiroishi, Toshihiko

    1996-06-01

    Recombination is widely considered to linearly depend on the length of the homologous sequences. An 11% mismatch decreases the rate of phage-plasmid recombination 240-fold. Two single nucleotide mismatches, which reduce the longest uninterrupted stretch of similarity from 232 base pairs (bp) to 134 bp, reduce gene conversion in mouse L cells 20-fold. The efficiency of gene targeting through homologous recombination in mouse embryonic stem cells can be increased by using an isogenic, rather than a non-isogenic, DNA construct. In this study we asked whether a high degree of sequence identity between homologous mouse chromosomes enhances meiotic recombination at a hotspot. Sites of meiotic recombination in the mouse major histocompatibility complex (MHC) class II region are not randomly distributed but are almost all clustered within short segments known as recombinational hotspots. The wm7 MHC haplotype, derived from Japanese wild mice Mus musculus molossinus, enhances meiotic recombination at a hotspot near the Lmp2 gene. Heterozygotes between the wm7 haplotype and the b or k haplotypes have yielded a high frequency of recombination (2.1%) in 1.3 kilobase kb segment of this hotspot. 20 refs., 2 figs.

  15. Nucleotide sequence of the leukotoxin gene from Actinobacillus actinomycetemcomitans: homology to the alpha-hemolysin/leukotoxin gene family.

    PubMed Central

    Kraig, E; Dailey, T; Kolodrubetz, D

    1990-01-01

    The leukotoxin produced by Actinobacillus actinomycetemcomitans has been implicated in the etiology of localized juvenile periodontitis. To initiate a genetic analysis into the role of this protein in disease, we have cloned its gene, lktA. We now present the complete nucleotide sequence of the lktA gene from A. actinomycetemcomitans. When the deduced amino acid sequence of the leukotoxin protein was compared with those of other proteins, it was found to be homologous to the leukotoxin from Pasteurella haemolytica and to the alpha-hemolysins from Escherichia coli and Actinobacillus pleuropneumoniae. Each alignment showed at least 42% identity. As in the other organisms, the lktA gene of A. actinomycetemcomitans was linked to another gene, lktC, which is thought to be involved in the activation of the leukotoxin. The predicted LktC protein was related to the leukotoxin/hemolysin C proteins from the other bacteria, since they shared a minimum of 49% amino acid identity. Surprisingly, although actinobacillus species are more closely related to pasteurellae than to members of the family Enterobacteriaciae, LktA and LktC from A. actinomycetemcomitans shared significantly greater sequence identity with the E. coli alpha-hemolysin proteins than with the P. haemolytica leukotoxin proteins. Despite the overall homology to the other leukotoxin/hemolysin proteins, the LktA protein from A. actinomycetemcomitans has several unique properties. Most strikingly, it is a very basic protein with a calculated pI of 9.7; the other toxins have estimated pIs around 6.2. The unusual features of the A. actinomycetemcomitans protein are discussed in light of the different species and target-cell specificities of the hemolysins and the leukotoxins. Images PMID:2318535

  16. Detecting Remote Sequence Homology in Disordered Proteins: Discovery of Conserved Motifs in the N-Termini of Mononegavirales phosphoproteins

    PubMed Central

    Karlin, David; Belshaw, Robert

    2012-01-01

    Paramyxovirinae are a large group of viruses that includes measles virus and parainfluenza viruses. The viral Phosphoprotein (P) plays a central role in viral replication. It is composed of a highly variable, disordered N-terminus and a conserved C-terminus. A second viral protein alternatively expressed, the V protein, also contains the N-terminus of P, fused to a zinc finger. We suspected that, despite their high variability, the N-termini of P/V might all be homologous; however, using standard approaches, we could previously identify sequence conservation only in some Paramyxovirinae. We now compared the N-termini using sensitive sequence similarity search programs, able to detect residual similarities unnoticeable by conventional approaches. We discovered that all Paramyxovirinae share a short sequence motif in their first 40 amino acids, which we called soyuz1. Despite its short length (11–16aa), several arguments allow us to conclude that soyuz1 probably evolved by homologous descent, unlike linear motifs. Conservation across such evolutionary distances suggests that soyuz1 plays a crucial role and experimental data suggest that it binds the viral nucleoprotein to prevent its illegitimate self-assembly. In some Paramyxovirinae, the N-terminus of P/V contains a second motif, soyuz2, which might play a role in blocking interferon signaling. Finally, we discovered that the P of related Mononegavirales contain similarly overlooked motifs in their N-termini, and that their C-termini share a previously unnoticed structural similarity suggesting a common origin. Our results suggest several testable hypotheses regarding the replication of Mononegavirales and suggest that disordered regions with little overall sequence similarity, common in viral and eukaryotic proteins, might contain currently overlooked motifs (intermediate in length between linear motifs and disordered domains) that could be detected simply by comparing orthologous proteins. PMID:22403617

  17. Mining Novel Allergens from Coconut Pollen Employing Manual De Novo Sequencing and Homology-Driven Proteomics.

    PubMed

    Saha, Bodhisattwa; Sircar, Gaurab; Pandey, Naren; Gupta Bhattacharya, Swati

    2015-11-06

    Coconut pollen, one of the major palm pollen grains is an important constituent among vectors of inhalant allergens in India and a major sensitizer for respiratory allergy in susceptible patients. To gain insight into its allergenic components, pollen proteins were analyzed by two-dimensional electrophoresis, immunoblotted with coconut pollen sensitive patient sera, followed by mass spectrometry of IgE reactive proteins. Coconut being largely unsequenced, a proteomic workflow has been devised that combines the conventional database-dependent analysis of tandem mass spectral data and manual de novo sequencing followed by a homology-based search for identifying the allergenic proteins. N-terminal acetylation helped to distinguish "b" ions from others, facilitating reliable sequencing. This led to the identification of 12 allergenic proteins. Cluster analysis with individual patient sera recognized vicilin-like protein as a major allergen, which was purified to assess its in vitro allergenicity and then partially sequenced. Other IgE-sensitive spots showed significant homology with well-known allergenic proteins such as 11S globulin, enolase, and isoflavone reductase along with a few which are reported as novel allergens. The allergens identified can be used as potential candidates to develop hypoallergenic vaccines, to design specific immunotherapy trials, and to enrich the repertoire of existing IgE reactive proteins.

  18. Efficient system of homologous RNA recombination in brome mosaic virus: sequence and structure requirements and accuracy of crossovers.

    PubMed Central

    Nagy, P D; Bujarski, J J

    1995-01-01

    Brome mosaic virus (BMV), a tripartite positive-stranded RNA virus of plants engineered to support intersegment RNA recombination, was used for the determination of sequence and structural requirements of homologous crossovers. A 60-nucleotide (nt) sequence, common between wild-type RNA2 and mutant RNA3, supported efficient repair (90%) of a modified 3' noncoding region in the RNA3 segment by homologous recombination with wild-type RNA2 3' noncoding sequences. Deletions within this sequence in RNA3 demonstrated that a nucleotide identity as short as 15 nt can support efficient homologous recombination events, while shorter (5-nt) sequence identity resulted in reduced recombination frequency (5%) within this region. Three or more mismatches within a downstream portion of the common 60-nt RNA3 sequence affected both the incidence of recombination and the distribution of crossover sites, suggesting that besides the length, the extent of sequence identity between two recombining BMV RNAs is an important factor in homologous recombination. Site-directed mutagenesis of the common sequence in RNA3 did not reveal a clear correlation between the stability of predicted secondary structures and recombination activity. This indicates that homologous recombination does not require similar secondary structures between two recombining RNAs at the sites of crossovers. Nearly 20% of homologous recombinants were imprecise (aberrant), containing either nucleotide mismatches, small deletions, or small insertions within the region of crossovers. This implies that homologous RNA recombination is not as accurate as proposed previously. Our results provide experimental evidence that the requirements and thus the mechanism of homologous recombination in BMV differ from those of previously described heteroduplex-mediated nonhomologous recombination (P. D. Nagy and J. J. Bujarski, Proc. Natl. Acad. Sci. USA 90:6390-6394, 1993). PMID:7983703

  19. Structural homologies and functional similarities between mammalian origins of replication and amplification promoting sequences.

    PubMed

    Stolzenburg, F; Gerwig, R; Dinkl, E; Grummt, F

    1994-06-01

    MuNTS2, a 423 bp sequence isolated from the non-transcribed spacer of murine rDNA stimulates the amplification of cis-linked plasmid DNA in mouse cells under selective conditions. Here we demonstrate that a 180 bp subdomain of muNTS2 is highly homologous (approximately 70%) to three domains of the first well-characterized origin of replication of mammalian chromosomes, i.e. the origin of bidirectional replication (OBR) of the dihydrofolate reductase (DHFR) locus in Chinese hamster ovary (CHO) cells. When subcloned, the 180 bp homology region of muNTS2 was revealed to be essential for the amplification promoting activity of muNTS2. Fragments of the initiation zone of DNA replication from the DHFR locus of hamster cells containing the domains of homology to the mouse muNTS2 element proved also to promote DNA amplification. Thus, the screening system for amplification promoting elements turned out to detect an origin of bidirectional replication.

  20. Intrachromosomal recombination between well-separated, homologous sequences in mammalian cells.

    PubMed

    Baker, M D; Read, L R; Ng, P; Beatty, B G

    1999-06-01

    In the present study, we investigated intrachromosomal homologous recombination in a murine hybridoma in which the recipient for recombination, the haploid, endogenous chromosomal immunoglobulin mu-gene bearing a mutation in the constant (Cmu) region, was separated from the integrated single copy wild-type donor Cmu region by approximately 1 Mb along the hybridoma chromosome. Homologous recombination between the donor and recipient Cmu region occurred with high frequency, correcting the mutant chromosomal mu-gene in the hybridoma. This enabled recombinant hybridomas to synthesize normal IgM and to be detected as plaque-forming cells (PFC). Characterization of the recombinants revealed that they could be placed into three distinct classes. The generation of the class I recombinants was consistent with a simple unequal sister chromatid exchange (USCE) between the donor and recipient Cmu region, as they contained the three Cmu-bearing fragments expected from this recombination, the original donor Cmu region along with both products of the single reciprocal crossover. However, a simple mechanism of homologous recombination was not sufficient in explaining the more complex Cmu region structures characterizing the class II and class III recombinants. To explain these recombinants, a model is proposed in which unequal pairing between the donor and recipient Cmu regions located on sister chromatids resulted in two crossover events. One crossover resulted in the deletion of sequences from one chromatid forming a DNA circle, which then integrated into the sister chromatid by a second reciprocal crossover.

  1. Homologation of α-aryl amino acids through quinone-catalyzed decarboxylation/Mukaiyama-Mannich addition.

    PubMed

    Haugeberg, Benjamin J; Phan, Johnny H; Liu, Xinyun; O'Connor, Thomas J; Clift, Michael D

    2017-03-09

    A new method for amino acid homologation by way of formal C-C bond functionalization is reported. This method utilizes a 2-step/1-pot protocol to convert α-amino acids to their corresponding N-protected β-amino esters through quinone-catalyzed oxidative decarboxylation/in situ Mukaiyama-Mannich addition. The scope and limitations of this chemistry are presented. This methodology provides an alternative to the classical Arndt-Eistert homologation for accessing β-amino acid derivatives. The resulting N-protected amine products can be easily deprotected to afford the corresponding free amines.

  2. Chip-based sequencing nucleic acids

    DOEpatents

    Beer, Neil Reginald

    2014-08-26

    A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.

  3. Amino-Acid Sequence of NADP-Specific Glutamate Dehydrogenase of Neurospora crassa

    PubMed Central

    Wootton, John C.; Chambers, Geoffrey K.; Holder, Anthony A.; Baron, Andrew J.; Taylor, John G.; Fincham, John R. S.; Blumenthal, Kenneth M.; Moon, Kenneth; Smith, Emil L.

    1974-01-01

    A tentative primary structure of the NADP-specific glutamate dehydrogenase [L-glutamate: NADP oxidoreductase (deaminating), EC 1.4.1.4] from Neurospora crassa has been determined. The proposed sequence contains 452 amino-acid residues in each of the identical subunits of the hexameric enzyme. Comparison of the sequence with that of the bovine liver enzyme reveals considerable homology in the amino-terminal portion of the chain, including the vicinity of the reactive lysine, with only shorter stretches of homology within the carboxyl-terminal regions. The significance of this distribution of homologous regions is discussed. PMID:4155068

  4. The amino acid sequence of iguana (Iguana iguana) pancreatic ribonuclease.

    PubMed

    Zhao, W; Beintema, J J; Hofsteenge, J

    1994-01-15

    The pyrimidine-specific ribonuclease superfamily constitutes a group of homologous proteins so far found only in higher vertebrates. Four separate families are found in mammals, which have resulted from gene duplications in mammalian ancestors. To learn more about the evolutionary history of this superfamily, the primary structure and other characteristics of the pancreatic enzyme from iguana (Iguana iguana), a herbivorous lizard species belonging to the reptiles, have been determined. The polypeptide chain consists of 119 amino acid residues. The positions of insertions and deletions in the sequence are identical to those in the enzyme from snapping turtle. However, the two enzymes differ at 54% of the amino acid positions. Iguana ribonuclease contains no carbohydrate, although the enzyme possesses three recognition sites for carbohydrate attachment, and has a high number of acidic residues in a localized part of the sequence.

  5. Neural network and SVM classifiers accurately predict lipid binding proteins, irrespective of sequence homology.

    PubMed

    Bakhtiarizadeh, Mohammad Reza; Moradi-Shahrbabak, Mohammad; Ebrahimi, Mansour; Ebrahimie, Esmaeil

    2014-09-07

    Due to the central roles of lipid binding proteins (LBPs) in many biological processes, sequence based identification of LBPs is of great interest. The major challenge is that LBPs are diverse in sequence, structure, and function which results in low accuracy of sequence homology based methods. Therefore, there is a need for developing alternative functional prediction methods irrespective of sequence similarity. To identify LBPs from non-LBPs, the performances of support vector machine (SVM) and neural network were compared in this study. Comprehensive protein features and various techniques were employed to create datasets. Five-fold cross-validation (CV) and independent evaluation (IE) tests were used to assess the validity of the two methods. The results indicated that SVM outperforms neural network. SVM achieved 89.28% (CV) and 89.55% (IE) overall accuracy in identification of LBPs from non-LBPs and 92.06% (CV) and 92.90% (IE) (in average) for classification of different LBPs classes. Increasing the number and the range of extracted protein features as well as optimization of the SVM parameters significantly increased the efficiency of LBPs class prediction in comparison to the only previous report in this field. Altogether, the results showed that the SVM algorithm can be run on broad, computationally calculated protein features and offers a promising tool in detection of LBPs classes. The proposed approach has the potential to integrate and improve the common sequence alignment based methods.

  6. GPU-Acceleration of Sequence Homology Searches with Database Subsequence Clustering

    PubMed Central

    Suzuki, Shuji; Kakuta, Masanori; Ishida, Takashi; Akiyama, Yutaka

    2016-01-01

    Sequence homology searches are used in various fields and require large amounts of computation time, especially for metagenomic analysis, owing to the large number of queries and the database size. To accelerate computing analyses, graphics processing units (GPUs) are widely used as a low-cost, high-performance computing platform. Therefore, we mapped the time-consuming steps involved in GHOSTZ, which is a state-of-the-art homology search algorithm for protein sequences, onto a GPU and implemented it as GHOSTZ-GPU. In addition, we optimized memory access for GPU calculations and for communication between the CPU and GPU. As per results of the evaluation test involving metagenomic data, GHOSTZ-GPU with 12 CPU threads and 1 GPU was approximately 3.0- to 4.1-fold faster than GHOSTZ with 12 CPU threads. Moreover, GHOSTZ-GPU with 12 CPU threads and 3 GPUs was approximately 5.8- to 7.7-fold faster than GHOSTZ with 12 CPU threads. PMID:27482905

  7. GPU-Acceleration of Sequence Homology Searches with Database Subsequence Clustering.

    PubMed

    Suzuki, Shuji; Kakuta, Masanori; Ishida, Takashi; Akiyama, Yutaka

    2016-01-01

    Sequence homology searches are used in various fields and require large amounts of computation time, especially for metagenomic analysis, owing to the large number of queries and the database size. To accelerate computing analyses, graphics processing units (GPUs) are widely used as a low-cost, high-performance computing platform. Therefore, we mapped the time-consuming steps involved in GHOSTZ, which is a state-of-the-art homology search algorithm for protein sequences, onto a GPU and implemented it as GHOSTZ-GPU. In addition, we optimized memory access for GPU calculations and for communication between the CPU and GPU. As per results of the evaluation test involving metagenomic data, GHOSTZ-GPU with 12 CPU threads and 1 GPU was approximately 3.0- to 4.1-fold faster than GHOSTZ with 12 CPU threads. Moreover, GHOSTZ-GPU with 12 CPU threads and 3 GPUs was approximately 5.8- to 7.7-fold faster than GHOSTZ with 12 CPU threads.

  8. Distinguishing Proteins From Arbitrary Amino Acid Sequences

    PubMed Central

    Yau, Stephen S.-T.; Mao, Wei-Guang; Benson, Max; He, Rong Lucy

    2015-01-01

    What kinds of amino acid sequences could possibly be protein sequences? From all existing databases that we can find, known proteins are only a small fraction of all possible combinations of amino acids. Beginning with Sanger's first detailed determination of a protein sequence in 1952, previous studies have focused on describing the structure of existing protein sequences in order to construct the protein universe. No one, however, has developed a criteria for determining whether an arbitrary amino acid sequence can be a protein. Here we show that when the collection of arbitrary amino acid sequences is viewed in an appropriate geometric context, the protein sequences cluster together. This leads to a new computational test, described here, that has proved to be remarkably accurate at determining whether an arbitrary amino acid sequence can be a protein. Even more, if the results of this test indicate that the sequence can be a protein, and it is indeed a protein sequence, then its identity as a protein sequence is uniquely defined. We anticipate our computational test will be useful for those who are attempting to complete the job of discovering all proteins, or constructing the protein universe. PMID:25609314

  9. The amino acid sequence of rabbit cardiac troponin I.

    PubMed Central

    Grand, R J; Wilkinson, J M

    1976-01-01

    The complete amino acid sequence of troponin I from rabbit cardiac muscle was determined by the isolation of four unique CNBr fragments, together with overlapping tryptic peptides containing radioactive methionine residues. Overlap data for residues 35-36, 93-94 and 140-145 are incomplete, the sequence at these positions being based on homology with the sequence of the fast-skeletal-muscle protein. Cardiac troponin I is a single polypeptide chain of 206 residues with mol.wt. 23550 and an extinction coefficient, E 1%,1cm/280, of 4.37. The protein has a net positive charge of 14 and is thus somewhat more basic than troponin I from fast-skeletal muscle. Comparison of the sequences of troponin I from cardiac and fast skeletal muscle show that the cardiac protein has 26 extra residues at the N-terminus which account for the larger size of the protein. In the remainder of sequence there is a considerable degree of homology, this being greater in the C-terminal two-thirds of the molecule. The region in the cardiac protein corresponding to the peptide with inhibitory activity from the fast-skeletal-muscle protein is very similar and it seems unlikely that this is the cause of the difference in inhibitory activity between the two proteins. The region responsible for binding troponin C, however, possesses a lower degree of homology. Detailed evidence on which the sequence is based has been deposited as Supplementary Publication SUP 50072 (20 pages), at the British Library Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7QB, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1976) 153, 5. PMID:1008822

  10. The complete amino acid sequence of prochymosin.

    PubMed Central

    Foltmann, B; Pedersen, V B; Jacobsen, H; Kauffman, D; Wybrandt, G

    1977-01-01

    The total sequence of 365 amino acid residues in bovine prochymosin is presented. Alignment with the amino acid sequence of porcine pepsinogen shows that 204 amino acid residues are common to the two zymogens. Further comparison and alignment with the amino acid sequence of penicillopepsin shows that 66 residues are located at identical positions in all three proteases. The three enzymes belong to a large group of proteases with two aspartate residues in the active center. This group forms a family derived from one common ancestor. PMID:329280

  11. Cellular RNA homologous to the Abelson murine leukemia virus transforming gene: expression and relationship to the viral sequence.

    PubMed Central

    Wang, J Y; Baltimore, D

    1983-01-01

    To examine the expression of the cellular homolog of the Abelson murine leukemia virus transforming gene (the v-abl sequence), a DNA probe representing the v-abl sequence was prepared. The probe detected two cytoplasmic polyadenylic acid-containing c-abl RNAs of about 6.5 and 5.5 kilobases in a variety of rodent cells, and slightly larger RNAs were detected in human cells. These two RNA species were found in all normal tissues or cell lines examined, but at differing concentrations: liver cells had the least, fibroblastic cell lines had the most. By using a probe able to detect the cellular but not the viral gene, the two RNAs were shown to be present in Abelson murine leukemia virus-transformed cells at levels found either in their untransformed counterparts or in similar cell types transformed by other means. The target cells of the virus have a somewhat elevated level of the two RNAs although expression of the c-abl gene is not restricted to these cells. The v-abl sequence lacks 0.35 and 0.85 kilobases of the c-abl RNA on the 5' and 3' ends, respectively. Thus, the Abelson murine leukemia virus transforming gene is an internal fragment of the transcript of a normal cellular gene. Images PMID:6306446

  12. SGP-1: Prediction and Validation of Homologous Genes Based on Sequence Alignments

    PubMed Central

    Wiehe, Thomas; Gebauer-Jung, Steffi; Mitchell-Olds, Thomas; Guigó, Roderic

    2001-01-01

    Conventional methods of gene prediction rely on the recognition of DNA-sequence signals, the coding potential or the comparison of a genomic sequence with a cDNA, EST, or protein database. Reasons for limited accuracy in many circumstances are species-specific training and the incompleteness of reference databases. Lately, comparative genome analysis has attracted increasing attention. Several analysis tools that are based on human/mouse comparisons are already available. Here, we present a program for the prediction of protein-coding genes, termed SGP-1 (Syntenic Gene Prediction), which is based on the similarity of homologous genomic sequences. In contrast to most existing tools, the accuracy of SGP-1 depends little on species-specific properties such as codon usage or the nucleotide distribution. SGP-1 may therefore be applied to nonstandard model organisms in vertebrates as well as in plants, without the need for extensive parameter training. In addition to predicting genes in large-scale genomic sequences, the program may be useful to validate gene structure annotations from databases. To this end, SGP-1 output also contains comparisons between predicted and annotated gene structures in HTML format. The program can be accessed via a Web server at http://soft.ice.mpg.de/sgp-1. The source code, written in ANSI C, is available on request from the authors. PMID:11544202

  13. Germination behavior, biochemical features and sequence analysis of the RACK1/arcA homolog from Phaseolus vulgaris

    PubMed Central

    Islas-Flores, Tania; Guillén, Gabriel; Islas-Flores, Ignacio; Román-Roque, Carolina San; Sánchez, Federico; Loza-Tavera, Herminia; Bearer, Elaine L.; Villanueva, Marco A.

    2010-01-01

    Partial peptide sequence of a 36 kDa protein from common bean embryo axes showed 100% identity with a reported β-subunit of a heterotrimeric G protein from soybean. Analysis of the full sequence showed 96.6% identity with the reported soybean Gβ -subunit, 86% with RACK1B and C from Arabidopsis and 66% with human and mouse RACK1, at the amino acid level. In addition, it showed 85.5, 85 and 83% identities with arcA from Solanum lycopersicum, Arabidopsis (RACK1A) and Nicotiana tabacum, respectively. The amino acid sequence displayed seven WD40 domains and two sites for activated protein kinase C binding. The protein showed a constant expression level but the mRNA had a maximum at 32 h post-imbibition. Western immunoblotting showed the protein in vegetative plant tissues, and in both microsomal and soluble fractions from embryo axes. Synthetic auxin treatment during germination delayed the peak of RACK1 mRNA expression to 48 h but did not affect the protein expression level while the polar auxin transport inhibitor, naphtylphtalamic acid had no effect on either mRNA or protein expression levels. Southern blot and genomic DNA amplification revealed a small gene family with at least one member without introns in the genome. Thus, the RACK1/arcA homolog from common bean has the following features: (1) it is highly conserved; (2) it is both soluble and insoluble within the embryo axis; (3) it is encoded by a small gene family; (4) its mRNA has a peak of expression at the time point of germination stop and (5) its expression is only slightly affected by auxin but unaffected by an auxin transport blocker. PMID:19832940

  14. Analysis of cloned cDNA and genomic sequences for phytochrome: complete amino acid sequences for two gene products expressed in etiolated Avena.

    PubMed Central

    Hershey, H P; Barker, R F; Idler, K B; Lissemore, J L; Quail, P H

    1985-01-01

    Cloned cDNA and genomic sequences have been analyzed to deduce the amino acid sequence of phytochrome from etiolated Avena. Restriction endonuclease site polymorphism between clones indicates that at least four phytochrome genes are expressed in this tissue. Sequence analysis of two complete and one partial coding region shows approximately 98% homology at both the nucleotide and amino acid levels, with the majority of amino acid changes being conservative. High sequence homology is also found in the 5'-untranslated region but significant divergence occurs in the 3'-untranslated region. The phytochrome polypeptides are 1128 amino acid residues long corresponding to a molecular mass of 125 kdaltons. The known protein sequence at the chromophore attachment site occurs only once in the polypeptide, establishing that phytochrome has a single chromophore per monomer covalently linked to Cys-321. Computer analyses of the amino acid sequences have provided predictions regarding a number of structural features of the phytochrome molecule. PMID:3001642

  15. GHOSTX: an improved sequence homology search algorithm using a query suffix array and a database suffix array.

    PubMed

    Suzuki, Shuji; Kakuta, Masanori; Ishida, Takashi; Akiyama, Yutaka

    2014-01-01

    DNA sequences are translated into protein coding sequences and then further assigned to protein families in metagenomic analyses, because of the need for sensitivity. However, huge amounts of sequence data create the problem that even general homology search analyses using BLASTX become difficult in terms of computational cost. We designed a new homology search algorithm that finds seed sequences based on the suffix arrays of a query and a database, and have implemented it as GHOSTX. GHOSTX achieved approximately 131-165 times acceleration over a BLASTX search at similar levels of sensitivity. GHOSTX is distributed under the BSD 2-clause license and is available for download at http://www.bi.cs.titech.ac.jp/ghostx/. Currently, sequencing technology continues to improve, and sequencers are increasingly producing larger and larger quantities of data. This explosion of sequence data makes computational analysis with contemporary tools more difficult. We offer this tool as a potential solution to this problem.

  16. Identification of fungi based on the nucleotide sequence homology of their internal transcribed spacer 1 (ITS1) region.

    PubMed

    Narutaki, Shoji; Takatori, Kosuke; Nishimura, Hidekatsu; Terashima, Hiroshi; Sasaki, Tsuguo

    2002-01-01

    In this study, we examined the identification of fungi based on the sequence homology of the internal transcribed spacer 1 (ITS1) region. A newly designed primer pair could amplify the target region of all 42 strains tested. The PCR products were sequenced and the sequence homologies were searched by BLAST. It was demonstrated that this method is a reliable identification method at the genus or species level. At present, available databases are still insufficient to identify some fungi, but with the accumulation of further data in the ITS1 database, this method will be available for the identification of fungi.

  17. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  18. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  19. Cytochrome oxidase subunit III from Arbacia lixula: detection of functional constraints by comparison with homologous sequences.

    PubMed

    De Giorgi, C; Martiradonna, A; Saccone, C

    1993-01-01

    In this paper we report the comparison of the sequences of the cytochrome oxidase subunit III from three different sea urchin species. Both nucleotide and amino acid sequences have been analyzed. The nucleotide sequence analysis reveals that the sea urchin sequences obey some rules already found in mammals. The base substitution analysis carried out on the sequences of the three species pairs, shows that the evolutionary dynamics of the first and the second codon positions are so slow that do not allow a quantitative measurement of their genetic distances, thus demonstrating that also in these species the COIII gene is strongly conserved during evolution. Changes occurring at the third codon positions indicate that the three species evolved from a common ancestor under different directional mutational pressure. The multi-alignment of the sea urchin proteins indicates the existence of the amino acid sequence motif N R T that represents a possible glycosylation site. Another glycosylation site has been detected in the mammalian cytochrome oxidase subunit III, in a position slightly different. Such an analysis revealed, for the first time, a new functional aspect of this sequence.

  20. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin.

    PubMed

    Theerasilp, S; Hitotsuya, H; Nakajo, S; Nakaya, K; Nakamura, Y; Kurihara, Y

    1989-04-25

    The taste-modifying protein, miraculin, has the unusual property of modifying sour taste into sweet taste. The complete amino acid sequence of miraculin purified from miracle fruits by a newly developed method (Theerasilp, S., and Kurihara, Y. (1988) J. Biol. Chem. 263, 11536-11539) was determined by an automatic Edman degradation method. Miraculin was a single polypeptide with 191 amino acid residues. The calculated molecular weight based on the amino acid sequence and the carbohydrate content (13.9%) was 24,600. Asn-42 and Asn-186 were linked N-glycosidically to carbohydrate chains. High homology was found between the amino acid sequences of miraculin and soybean trypsin inhibitor.

  1. The sequence organization of Yp/proximal Xq homologous regions of the human sex chromosomes is highly conserved

    SciTech Connect

    Sargent, C.A.; Briggs, H.; Chalmers, I.J.

    1996-03-01

    Detailed deletion analysis of patients with breakpoints in Yp has allowed the definition of two distinct intervals on the Y chromosome short arm outside the pseudoautosomal region that are homologous to Xq21.3. Detailed YAC contigs have been developed over these regions on both the X and Y chromosomes, and the relative order of markers has been compared to assess whether rearrangements on either sex chromosome have occurred since the transposition events creating these patterns of homology. On the X chromosome, the region forms almost one contiguous block of homology, whereas on the Y chromosome, there has been one major rearrangement leading to the two separate Yp-Xq21 blocks of homology. The rearrangement breakpoint has been mapped. Within these separate X-Y homologous blocks on Yp, the order of loci homologous to X has been conserved to a high degree between the sex chromosomes. With the exception of the amelogenin gene (proximal Yp block), all the X-Y homologous sequences in the two Yp blocks have homologues in Xq21.3, with the former having its X counterpart in Xp22.2. This suggests an independent evolutionary event leading to the formation of the amelogenin X-Y homology. 45 refs., 4 figs., 1 tab.

  2. Sequences homologous to ZFY, a candidate human sex-determining gene, are autosomal in marsupials.

    PubMed

    Sinclair, A H; Foster, J W; Spencer, J A; Page, D C; Palmer, M; Goodfellow, P N; Graves, J A

    Sexual differentiation in placental mammals results from the action of a testis-determining gene encoded by the Y chromosome. This gene causes the indifferent gonad to develop as a testis, thereby initiating a hormonal cascade which produces a male phenotype. Recently, a candidate for the testis-determining gene (ZFY, Y-borne zinc-finger protein) has been cloned. The ZFY probe detects a male-specific (Y-linked) sequence in DNA from a range of eutherian mammals, as well as an X-linked sequence (ZFX) which maps to the human X chromosome. In marsupials it is also the Y chromosome that seems to determine the fate of the gonad, but not all sexual dimorphisms. Using the ZFY probe we find, surprisingly, that the ZFY homologous sequences are not on either the X or the Y chromosome in marsupials, but map to the autosomes. This implies ZFY is not the primary sex-determining gene in marsupials. Either the genetic pathways of sex determination in marsupials and eutherians differ, or they are identical and ZFY is not the primary signal in human sex determination.

  3. Determination and augmentation of RNA sequence specificity of the Nova K-homology domains.

    PubMed

    Musunuru, Kiran; Darnell, Robert B

    2004-01-01

    The Nova onconeural antigens are implicated in the pathogenesis of paraneoplastic opsoclonus-myoclonus-ataxia (POMA). The Nova antigens are neuron-specific RNA-binding proteins harboring three repeats of the K-homology (KH) motif; they have been implicated in the regulation of alternative splicing of a host of genes involved in inhibitory synaptic transmission. Although the third Nova KH domain (KH3) has been extensively characterized using biochemical and crystallographic techniques, the roles of the KH1 and KH2 domains remain unclear. Furthermore, the specificity determinants that distinguish the Nova KH domains from those of the closely related hnRNP E and hnRNP K proteins are undefined. We demonstrate through the use of RNA selection and biochemical analysis that the sequence specificity of the Nova KH1/2 domains is similar to that of Nova KH3. We also show that the mutagenesis of a Nova KH domain to render it similar to the KH domains of the heterogeneous nuclear ribonucleoprotein E (hnRNP E) and hnRNP K allow it to recognize longer RNA sequences. These data yield important insights into KH domain function and suggest a strategy by which to engineer KH domains with novel sequence preferences.

  4. Amino acid sequence of a mouse immunoglobulin mu chain.

    PubMed Central

    Kehry, M; Sibley, C; Fuhrman, J; Schilling, J; Hood, L E

    1979-01-01

    The complete amino acid sequence of the mouse mu chain from the BALB/c myeloma tumor MOPC 104E is reported. The C mu region contains four consecutive homology regions of approximately 110 residues and a COOH-terminal region of 19 residues. A comparison of this mu chain from mouse with a complete mu sequence from human (Ou) and a partial mu chain sequence from dog (Moo) reveals a striking gradient of increasing homology from the NH2-terminal to the COOH-terminal portion of these mu chains, with the former being the least and the latter the most highly conserved. Four of the five sites of carbohydrate attachment appear to be at identical residue positions when the constant regions of the mouse and human mu chains are compared. The mu chain of MOPC 104E has a carbohydrate moiety attached in the second hypervariable region. This is particularly interesting in view of the fact that MOPC 104E binds alpha-(1 leads to 3)-dextran, a simple carbohydrate. The structural and functional constraints imposed by these comparative sequence analyses are discussed. PMID:111247

  5. A simple and rapid method for the preparation of homologous DNA oligonucleotide hybridization probes from heterologous gene sequences and probes.

    PubMed

    Maxwell, E S; Sarge, K D

    1988-11-30

    We describe a simple and rapid method for the preparation of homologous DNA oligonucleotide probes for hybridization analysis and/or cDNA/genomic library screening. With this method, a synthetic DNA oligonucleotide derived from a known heterologous DNA/RNA/protein sequence is annealed to an RNA preparation containing the gene transcript of interest. Any unpaired 3'-terminal oligonucleotides of the heterologous DNA primer are then removed using the 3' exonuclease activity of the DNA Polymerase I Klenow fragment before primer extension/dideoxynucleotide sequencing of the annealed RNA species with AMV reverse transcriptase. From the determined RNA sequence, a completely homologous DNA oligonucleotide probe is then prepared. This approach has been used to prepare a homologous DNA oligonucleotide probe for the successful library screening of the yeast hybRNA gene starting with a heterologous mouse hybRNA DNA oligonucleotide probe.

  6. Non-homologous sex chromosomes of birds and snakes share repetitive sequences.

    PubMed

    O'Meally, Denis; Patel, Hardip R; Stiglec, Rami; Sarre, Stephen D; Georges, Arthur; Marshall Graves, Jennifer A; Ezaz, Tariq

    2010-11-01

    Snake sex chromosomes provided Susumo Ohno with the material on which he based his theory of how sex chromosomes differentiate from autosomal pairs. Like birds, snakes have a ZZ male/ZW female sex chromosome system, in which the snake Z is a macrochromosome much the same size as the bird Z. However, the gene content shows clearly that the snake and bird Z chromosomes are completely non-homologous. The molecular aspect of W chromosome degeneration in snakes remains largely unexplored. We used comparative genomic hybridization to identify the female-specific region of the W chromosome in representative species of Australian snakes. Using this approach, we show that an increasingly complex suite of repeats accompanies the evolution of W chromosome heteromorphy. In particular, we found that while the python Liasis fuscus exhibits no sex-specific repeats and indeed, no cytologically recognizable sex-specific region, the colubrid Stegonotus cucullatus shows a large domain on the short arm of the W chromosome that consists of female-specific repeats, and the large W of Notechis scutatus is composed almost entirely of repetitive sequences, including Bkm and 18S rDNA-related elements. FISH mapping of both simple and complex probes shows patterns of repeat amplification concordant with the size of the female-specific region in each species examined. Mapping of intronic sequences of genes that are sex-linked in both birds (DMRT1) and snakes (CTNNB1) reveals massive amplification in discrete domains on the W chromosome of the elapid N. scutatus. Using chicken W chromosome paint, we demonstrate that repetitive sequences are shared between the sex chromosomes of birds and derived snakes. This could be explained by ancestral but as yet undetected shared synteny of bird and snake sex chromosomes or may indicate functional homology of the repeats and suggests that degeneration is a convergent property of sex chromosome evolution. We also establish that synteny of snake Z

  7. Partial primary structure of human pregnancy zone protein: extensive sequence homology with human alpha 2-macroglobulin.

    PubMed Central

    Sottrup-Jensen, L; Folkersen, J; Kristensen, T; Tack, B F

    1984-01-01

    Human pregnancy zone protein (PZP) is a major pregnancy-associated protein. Its quaternary structure (two covalently bound 180-kDa subunits, which are further non-covalently assembled into a tetramer of 720 kDa) is similar to that of human alpha 2-macroglobulin (alpha 2M). Here we show, from the results of complete or partial sequence determination of a random selection of 38 tryptic peptides covering 685 residues of the subunit of PZP, that PZP and alpha 2M indeed are extensively homologous. In the stretches of PZP sequenced so far, the degree of identically placed residues in the two proteins is 68%, indicating a close evolutionary relationship between PZP and alpha 2M. Although the function of PZP in pregnancy is largely unknown, its close structural relationship to alpha 2M suggests analogous proteinase binding properties and a potential for being taken up in cells by receptor-mediated endocytosis. In this regard our studies indicate a bait region in PZP significantly different from that present in alpha 2M. PZP could be the human equivalent of the acute-phase alpha-macroglobulins (e.g., rat alpha 2M and rabbit alpha 1M) described earlier. PMID:6209714

  8. Amino acid sequence of bovine heart coupling factor 6.

    PubMed Central

    Fang, J K; Jacobs, J W; Kanner, B I; Racker, E; Bradshaw, R A

    1984-01-01

    The amino acid sequence of bovine heart mitochondrial coupling factor 6 (F6) has been determined by automated Edman degradation of the whole protein and derived peptides. Preparations based on heat precipitation and ethanol extraction showed allotypic variation at three positions while material further purified by HPLC yielded only one sequence that also differed by a Phe-Thr replacement at residue 62. The mature protein contains 76 amino acids with a calculated molecular weight of 9006 and a pI of approximately equal to 5, in good agreement with experimentally measured values. The charged amino acids are mainly clustered at the termini and in one section in the middle; these three polar segments are separated by two segments relatively rich in nonpolar residues. Chou-Fasman analysis suggests three stretches of alpha-helix coinciding (or within) the high-charge-density sequences with a single beta-turn at the first polar-nonpolar junction. Comparison of the F6 sequence with those of other proteins did not reveal any homologous structures. PMID:6149548

  9. The Chinese hamster Alu-equivalent sequence: a conserved highly repetitious, interspersed deoxyribonucleic acid sequence in mammals has a structure suggestive of a transposable element.

    PubMed Central

    Haynes, S R; Toomey, T P; Leinwand, L; Jelinek, W R

    1981-01-01

    A consensus sequence has been determined for a major interspersed deoxyribonucleic acid repeat in the genome of Chinese hamster ovary cells (CHO cells). This sequence is extensively homologous to (i) the human Alu sequence (P. L. Deininger et al., J. Mol. Biol., in press), (ii) the mouse B1 interspersed repetitious sequence (Krayev et al., Nucleic Acids Res. 8:1201-1215, 1980) (iii) an interspersed repetitious sequence from African green monkey deoxyribonucleic acid (Dhruva et al., Proc. Natl. Acad. Sci. U.S.A. 77:4514-4518, 1980) and (iv) the CHO and mouse 4.5S ribonucleic acid (this report; F. Harada and N. Kato, Nucleic Acids Res. 8:1273-1285, 1980). Because the CHO consensus sequence shows significant homology to the human Alu sequence it is termed the CHO Alu-equivalent sequence. A conserved structure surrounding CHO Alu-equivalent family members can be recognized. It is similar to that surrounding the human Alu and the mouse B1 sequences, and is represented as follows: direct repeat-CHO-Alu-A-rich sequence-direct repeat. A composite interspersed repetitious sequence has been identified. Its structure is represented as follows: direct repeat-residue 47 to 107 of CHO-Alu-non-Alu repetitious sequence-A-rich sequence-direct repeat. Because the Alu flanking sequences resemble those that flank known transposable elements, we think it likely that the Alu sequence dispersed throughout the mammalian genome by transposition. Images PMID:9279371

  10. Sequence-Divergent Chordopoxvirus Homologs of the O3 Protein Maintain Functional Interactions with Components of the Vaccinia Virus Entry-Fusion Complex

    PubMed Central

    Satheshkumar, P. S.

    2012-01-01

    Composed of 35 amino acids, O3 is the smallest characterized protein encoded by vaccinia virus (VACV) and is an integral component of the entry-fusion complex (EFC). O3 is conserved with 100% identity in all orthopoxviruses except for monkeypox viruses, whose O3 homologs have 2 to 3 amino acid substitutions. Since O3 is part of the EFC, high conservation could suggest an immutable requirement for interaction with multiple proteins. Chordopoxviruses of other genera also encode small proteins with a characteristic predicted N-terminal α-helical hydrophobic domain followed by basic amino acids and proline in the same relative genome location as that of VACV O3. However, the statistical significance of their similarity to VACV O3 is low due to the large contribution of the transmembrane domain, their small size, and their sequence diversity. Nevertheless, trans-complementation experiments demonstrated the ability of a representative O3-like protein from each chordopoxvirus genus to rescue the infectivity of a VACV mutant that was unable to express endogenous O3. Moreover, recombinant viruses expressing O3 homologs in place of O3 replicated and formed plaques as well or nearly as well as wild-type VACV. The O3 homologs expressed by the recombinant VACVs were incorporated into the membranes of mature virions and, with one exception, remained stably associated with the detergent-extracted and affinity-purified EFC. The ability of the sequence-divergent O3 homologs to coordinate function with VACV entry proteins suggests the conservation of structural motifs. Analysis of chimeras formed by swapping domains of O3 with those of other proteins indicated that the N-terminal transmembrane segment was responsible for EFC interactions and for the complementation of infectivity. PMID:22114343

  11. An Integrated Sequence-Structure Database incorporating matching mRNA sequence, amino acid sequence and protein three-dimensional structure data.

    PubMed Central

    Adzhubei, I A; Adzhubei, A A; Neidle, S

    1998-01-01

    We have constructed a non-homologous database, termed the Integrated Sequence-Structure Database (ISSD) which comprises the coding sequences of genes, amino acid sequences of the corresponding proteins, their secondary structure and straight phi,psi angles assignments, and polypeptide backbone coordinates. Each protein entry in the database holds the alignment of nucleotide sequence, amino acid sequence and the PDB three-dimensional structure data. The nucleotide and amino acid sequences for each entry are selected on the basis of exact matches of the source organism and cell environment. The current version 1.0 of ISSD is available on the WWW at http://www.protein.bio.msu.su/issd/ and includes 107 non-homologous mammalian proteins, of which 80 are human proteins. The database has been used by us for the analysis of synonymous codon usage patterns in mRNA sequences showing their correlation with the three-dimensional structure features in the encoded proteins. Possible ISSD applications include optimisation of protein expression, improvement of the protein structure prediction accuracy, and analysis of evolutionary aspects of the nucleotide sequence-protein structure relationship. PMID:9399866

  12. Functional homology between the yeast regulatory proteins GAL4 and LAC9: LAC9-mediated transcriptional activation in Kluyveromyces lactis involves protein binding to a regulatory sequence homologous to the GAL4 protein-binding site.

    PubMed Central

    Breunig, K D; Kuger, P

    1987-01-01

    As shown previously, the beta-galactosidase gene of Kluyveromyces lactis is transcriptionally regulated via an upstream activation site (UASL) which contains a sequence homologous to the GAL4 protein-binding site in Saccharomyces cerevisiae (M. Ruzzi, K.D. Breunig, A.G. Ficca, and C.P. Hollenberg, Mol. Cell. Biol. 7:991-997, 1987). Here we demonstrate that the region of homology specifically binds a K. lactis regulatory protein. The binding activity was detectable in protein extracts from wild-type cells enriched for DNA-binding proteins by heparin affinity chromatography. These extracts could be used directly for DNase I and exonuclease III protection experiments. A lac9 deletion strain, which fails to induce the beta-galactosidase gene, did not contain the binding factor. The homology of LAC9 protein with GAL4 (J.M. Salmeron and S. A. Johnston, Nucleic Acids Res. 14:7767-7781, 1986) strongly suggests that LAC9 protein binds directly to UASL and plays a role similar to that of GAL4 in regulating transcription. Images PMID:2830492

  13. VITAL NMR: Using Chemical Shift Derived Secondary Structure Information for a Limited Set of Amino Acids to Assess Homology Model Accuracy

    SciTech Connect

    Brothers, Michael C; Nesbitt, Anna E; Hallock, Michael J; Rupasinghe, Sanjeewa; Tang, Ming; Harris, Jason B; Baudry, Jerome Y; Schuler, Mary A; Rienstra, Chad M

    2011-01-01

    Homology modeling is a powerful tool for predicting protein structures, whose success depends on obtaining a reasonable alignment between a given structural template and the protein sequence being analyzed. In order to leverage greater predictive power for proteins with few structural templates, we have developed a method to rank homology models based upon their compliance to secondary structure derived from experimental solid-state NMR (SSNMR) data. Such data is obtainable in a rapid manner by simple SSNMR experiments (e.g., (13)C-(13)C 2D correlation spectra). To test our homology model scoring procedure for various amino acid labeling schemes, we generated a library of 7,474 homology models for 22 protein targets culled from the TALOS+/SPARTA+ training set of protein structures. Using subsets of amino acids that are plausibly assigned by SSNMR, we discovered that pairs of the residues Val, Ile, Thr, Ala and Leu (VITAL) emulate an ideal dataset where all residues are site specifically assigned. Scoring the models with a predicted VITAL site-specific dataset and calculating secondary structure with the Chemical Shift Index resulted in a Pearson correlation coefficient (-0.75) commensurate to the control (-0.77), where secondary structure was scored site specifically for all amino acids (ALL 20) using STRIDE. This method promises to accelerate structure procurement by SSNMR for proteins with unknown folds through guiding the selection of remotely homologous protein templates and assessing model quality.

  14. Snake venom. The amino acid sequence of protein A from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1980-12-01

    Protein A from Dendroaspis polylepis polylepis venom comprises 81 amino acids, including ten half-cystine residues. The complete primary structures of protein A and its variant A' were elucidated. The sequences of proteins A and A', which differ in a single position, show no homology with various neurotoxins and non-neurotoxic proteins and represent a new type of elapid venom protein.

  15. The chromosomal arrangement of human alpha-like globin genes: sequence homology and alpha-globin gene deletions.

    PubMed

    Lauer, J; Shen, C K; Maniatis, T

    1980-05-01

    We report the isolation of a cluster of four alpha-like globin genes from a bacteriophage lambda library of human DNA (Lawn et al., 1978). Analysis of the cloned DNA confirms the linkage arrangement of the two adult alpha-globin genes (alpha 1 and alpha 2) previously derived from genomic blotting experiments (Orkin, 1978) and identifies two additional closely linked alpha-like genes. The nucleotide sequence of a portion of each of these alpha-like genes was determined. One of these sequences is tentatively identified as an embryonic zeta-globin gene (zeta 1) by comparison with structural data derived from purified zeta-globin protein (J. Clegg, personal communication), while the other sequence cannot be matched with any known alpha-like polypeptide sequence (we designate this sequence phi alpha 1). Localization of the four alpha-like sequences on a restriction map of the gene cluster indicates that the genes have the same transcriptional orientation and are arranged in the order 5'-zeta 1-phi alpha 1-alpha 2-alpha 1-3'. Genomic blotting experiments identified a second, nonallelic zeta-like globin gene (phi 2) located 10-12 kb 5' to the cloned zeta-globin gene. Comparison of the locations of restriction sites within alpha 1 and alpha 2 and heteroduplex studies reveal extensive sequence homology within and flanking the two genes. The homologous sequences, which are interrupted by two blocks of nonhomology, span a region of approximately 4 kb. This extensive sequence homology between two genes which are thought to be the products of an ancient duplication event suggests the existence of a mechanism for sequence matching during evolution. One consequence of this arrangement of homologous sequences is the occurrence of two types of deletions in recombinant phage DNA during propagation in E. coli. The locations and sizes of the two types of deletions are indistinguishable from those of the two types of deletions associated with alpha-thalassemia 2 (Embury et al., 1979

  16. Aza-amino acid scanning of chromobox homolog 7 (CBX7) ligands.

    PubMed

    Traoré, Mariam; Gignac, Michael; Doan, Ngoc-Duc; Hof, Fraser; Lubell, William D

    2017-02-21

    An aza-amino acid scan of peptide inhibitors of the chromobox homolog 7 (CBX7) was performed to study the conformational requirements for affinity to the methyllysine reader protein. Twelve azapeptide analogues were prepared using three different approaches employing respectively N-(Fmoc)aza-amino acid chlorides and submonomer azapeptide synthesis to install systematically aza-residues at the first four residues of the peptide, as well as to provide aza-lysine residues possessing saturated and unsaturated side chains. The aza-peptide ligands were evaluated in a chromobox homolog 7 binding assay, providing useful insight into structural requirements for affinity. Copyright © 2017 European Peptide Society and John Wiley & Sons, Ltd.

  17. Minimum length of direct repeat sequences required for efficient homologous recombination induced by zinc finger nuclease in yeast.

    PubMed

    Ren, ChongHua; Yan, Qiang; Zhang, ZhiYing

    2014-10-01

    Zinc finger nuclease (ZFN) technology is a powerful molecular tool for targeted genome modifications and genetic engineering. However, screening for specific ZFs and validation of ZFN activity are labor intensive and time consuming. We previously designed a yeast-based ZFN screening and validation system by inserting a ZFN binding site flanked by a 164 bp direct repeat sequence into the middle of a Gal4 transcription factor, disrupting the open reading frame of the yeast Gal4 gene. Expression of the ZFN causes a double stranded break at its binding site, which promotes the cellular DNA repair system to restore expression of a functional Gal transcriptional factor via homologous recombination. Expression of Gal4 transcription factor leads to activation of three reporter genes in an AH109 yeast two-hybrid strain. However, the 164 bp direct repeat appears to generate spontaneous homologous recombination frequently, resulting in many false positive ZFNs. To overcome this, a series of DNA fragments of various lengths from 10 to 150 bp with 10 bp increase each and 164 bp direct repeats flanking the ZFN binding site were designed and constructed. The results demonstrated that the minimum length required for ZFN-induced homologous recombination was 30 bp, which almost eliminated spontaneous recombination. Using the 30 bp direct repeat sequence, ZFN could efficiently induce homologous recombination, while false positive ZFNs resulting from spontaneous homologous recombination were minimized. Thus, this study provided a simple, fast and sensitive ZFN screening and activity validation system in yeast.

  18. The complementary deoxyribonucleic acid sequence of guinea pig endometrial prorelaxin.

    PubMed

    Lee, Y A; Bryant-Greenwood, G D; Mandel, M; Greenwood, F C

    1992-03-01

    The nucleotide sequence of the relaxin gene transcript in the endometrium of the late pregnant guinea pig has been determined. The strategy used was a combination of polymerase chain reaction (PCR) with primers designed from the mRNA sequence of porcine preprorelaxin, rapid amplification of cDNA ends-PCR, and blunt end cloning in M13 mp18. With heterologous primers, a 226-basepair (bp) segment of the guinea pig relaxin gene sequence was obtained and was used to design a guinea pig-specific primer for use with the rapid amplification of cDNA ends-PCR method. The latter allowed completion of the sequence of 336 bp, with a 96-bp overlap. The sequence obtained shows greater homology at both the nucleotide and amino acid levels with porcine and human relaxins H1 and H2 than with rat relaxin, supporting the thesis that the guinea pig is not a rodent. The transcription of the guinea pig endometrial relaxin gene during pregnancy was confirmed by Northern analysis of guinea pig endometrial tissues with a species-specific cDNA probe. The endometrial relaxin gene is transcribed during pregnancy, but not in lactation, consistent with the observed immunostaining for relaxin.

  19. External and semi-internal controls for PCR amplification of homologous sequences in mixed templates.

    PubMed

    Kalle, Elena; Gulevich, Alexander; Rensing, Christopher

    2013-11-01

    In a mixed template, the presence of homologous target DNA sequences creates environments that almost inevitably give rise to artifacts and biases during PCR. Heteroduplexes, chimeras, and skewed template-to-product ratios are the exclusive attributes of mixed template PCR and never occur in a single template assay. Yet, multi-template PCR has been used without appropriate attention to quality control and assay validation, in spite of the fact that such practice diminishes the reliability of results. External and internal amplification controls became obligatory elements of good laboratory practice in different PCR assays. We propose the inclusion of an analogous approach as a quality control system for multi-template PCR applications. The amplification controls must take into account the characteristics of multi-template PCR and be able to effectively monitor particular assay performance. This study demonstrated the efficiency of a model mixed template as an adequate external amplification control for a particular PCR application. The conditions of multi-template PCR do not allow implementation of a classic internal control; therefore we developed a convenient semi-internal control as an acceptable alternative. In order to evaluate the effects of inhibitors, a model multi-template mix was amplified in a mixture with DNAse-treated sample. Semi-internal control allowed establishment of intervals for robust PCR performance for different samples, thus enabling correct comparison of the samples. The complexity of the external and semi-internal amplification controls must be comparable with the assumed complexity of the samples. We also emphasize that amplification controls should be applied in multi-template PCR regardless of the post-assay method used to analyze products.

  20. Detection and mapping of homologous, repeated and amplified DNA sequences by DNA renaturation in agarose gels.

    PubMed Central

    Roninson, I B

    1983-01-01

    A new molecular hybridization approach to the analysis of complex genomes has been developed. Tracer and driver DNAs were digested with the same restriction enzyme(s), and tracer DNA was labeled with 32P using T4 DNA polymerase. Tracer DNA was mixed with an excess amount of driver, and the mixture was electrophoresed in an agarose gel. Following electrophoresis, DNA was alkali-denatured in situ and allowed to reanneal in the gel, so that tracer DNA fragments could hybridize to the driver only when homologous driver DNA sequences were present at the same place in the gel, i.e. within a restriction fragment of the same size. After reannealing, unhybridized single-stranded DNA was digested in situ with S1 nuclease. The hybridized tracer DNA was detected by autoradiography. The general applicability of this technique was demonstrated in the following experiments. The common EcoRI restriction fragments were identified in the genomes of E. coli and four other species of bacteria. Two of these fragments are conserved in all Enterobacteriaceae. In other experiments, repeated EcoRI fragments of eukaryotic DNA were visualized as bands of various intensity after reassociation of a total genomic restriction digest in the gel. The situation of gene amplification was modeled by the addition of varying amounts of lambda phage DNA to eukaryotic DNA prior to restriction enzyme digestion. Restriction fragments of lambda DNA were detectable at a ratio of 15 copies per chicken genome and 30 copies per human genome. This approach was used to detect amplified DNA fragments in methotrexate (MTX)-resistant mouse cells and to identify commonly amplified fragments in two independently derived MTX-resistant lines. Images PMID:6310499

  1. Molecular cloning and amino acid sequence of human 5-lipoxygenase

    SciTech Connect

    Matsumoto, T.; Funk, C.D.; Radmark, O.; Hoeoeg, J.O.; Joernvall, H.; Samuelsson, B.

    1988-01-01

    5-Lipoxygenase (EC 1.13.11.34), a Ca/sup 2 +/- and ATP-requiring enzyme, catalyzes the first two steps in the biosynthesis of the peptidoleukotrienes and the chemotactic factor leukotriene B/sub 4/. A cDNA clone corresponding to 5-lipoxygenase was isolated from a human lung lambda gt11 expression library by immunoscreening with a polyclonal antibody. Additional clones from a human placenta lambda gt11 cDNA library were obtained by plaque hybridization with the /sup 32/P-labeled lung cDNA clone. Sequence data obtained from several overlapping clones indicate that the composite DNAs contain the complete coding region for the enzyme. From the deduced primary structure, 5-lipoxygenase encodes a 673 amino acid protein with a calculated molecular weight of 77,839. Direct analysis of the native protein and its proteolytic fragments confirmed the deduced composition, the amino-terminal amino acid sequence, and the structure of many internal segments. 5-Lipoxygenase has no apparent sequence homology with leukotriene A/sub 4/ hydrolase or Ca/sup 2 +/-binding proteins. RNA blot analysis indicated substantial amounts of an mRNA species of approx. = 2700 nucleotides in leukocytes, lung, and placenta.

  2. Molecular cloning and sequence analysis of the Sta58 major antigen gene of Rickettsia tsutsugamushi: sequence homology and antigenic comparison of Sta58 to the 60-kilodalton family of stress proteins.

    PubMed Central

    Stover, C K; Marana, D P; Dasch, G A; Oaks, E V

    1990-01-01

    The scrub typhus 58-kilodalton (kDa) antigen (Sta58) of Rickettsia tsutsugamushi is a major protein antigen often recognized by humans infected with scrub typhus rickettsiae. A 2.9-kilobase HindIII fragment containing a complete sta58 gene was cloned in Escherichia coli and found to express the entire Sta58 antigen and a smaller protein with an apparent molecular mass of 11 kDa (Stp11). DNA sequence analysis of the 2.9-kilobase HindIII fragment revealed two adjacent open reading frames encoding proteins of 11 (Stp11) and 60 (Sta58) kDa. Comparisons of deduced amino acid sequences disclosed a high degree of homology between the R. tsutsugamushi proteins Stp11 and Sta58 and the E. coli proteins GroES and GroEL, respectively, and the family of primordial heat shock proteins designated Hsp10 Hsp60. Although the sequence homology between the Sta58 antigen and the Hsp60 protein family is striking, the Sta58 protein appeared to be antigenically distinct among a sample of other bacterial Hsp60 homologs, including the typhus group of rickettsiae. The antigenic uniqueness of the Sta58 antigen indicates that this protein may be a potentially protective antigen and a useful diagnostic reagent for scrub typhus fever. Images PMID:2108930

  3. Phenolic acid esterases, coding sequences and methods

    DOEpatents

    Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.

    2002-01-01

    Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.

  4. Sea bass (Dicentrarchus labrax) invariant chain and class II major histocompatibility complex: sequencing and structural analysis using 3D homology modelling.

    PubMed

    Silva, Daniela S P; Reis, Marta I R; Nascimento, Diana S; do Vale, Ana; Pereira, Pedro J B; dos Santos, Nuno M S

    2007-07-01

    The present manuscript reports for the first time the sequencing and characterisation of sea bass (sb) MHCII alpha and beta chains and Ii chain cDNAs as well as their expression analysis under resting state. 3D homology modelling, using crystal structures from mammalian orthologues, has been used to illustrate and support putative structural homologies of the sea bass counterparts. The sbIi cDNA consists of 96 bp of 5'-UTR, a 843 bp open reading frame (ORF) and 899 bp of 3'-UTR including a canonical polyadenylation signal 16 nucleotides before the polyadenylation tail. The ORF was translated into a 280 amino acid sequence, in which all characteristic domains found in the Ii p41 human form could be identified, including the cytoplasmic N-terminus domain, the transmembrane (TM) region, the CLIP domain, the trimerization domain and the thyroglobulin (Tg) type I domain. The trimerization and Tg domains of sbIi were successfully modelled using the human counterparts as templates. Four different sequences of each class II alpha and beta MHCII were obtained from a single fish, apparently not derived from a single locus. All the characteristic features of the MHCII chain structure could be identified in the predicted ORF of sea bass alpha and beta sequences, consisting of leader peptide (LP), alpha1/beta1 and alpha2/beta2 domains, connecting peptide and TM and cytoplasmic regions. Furthermore, independently of the HLA-DR crystal structure used as template in homology modelling, a similar predicted 3D structure and trimeric quaternary architecture was obtained for sbMHC, with major deviations occurring only within the sea bass MHCII alpha1 domain.

  5. Code optimization of the subroutine to remove near identical matches in the sequence database homology search tool PSI-BLAST.

    PubMed

    Aspnäs, Mats; Mattila, Kimmo; Osowski, Kristoffer; Westerholm, Jan

    2010-06-01

    A central task in protein sequence characterization is the use of a sequence database homology search tool to find similar protein sequences in other individuals or species. PSI-BLAST is a widely used module of the BLAST package that calculates a position-specific score matrix from the best matching sequences and performs iterated searches using a method to avoid many similar sequences for the score. For some queries and parameter settings, PSI-BLAST may find many similar high-scoring matches, and therefore up to 80% of the total run time may be spent in this procedure. In this article, we present code optimizations that improve the cache utilization and the overall performance of this procedure. Measurements show that, for queries where the number of similar matches is high, the optimized PSI-BLAST program may be as much as 2.9 times faster than the original program.

  6. The detection of inherent homologous recombination between repeat sequences in H. pylori 26695 by the PCR-based method.

    PubMed

    Fu, Yajuan; Zepeda-Gurrola, Reyna Cristina; Aguilar-Gutiérrez, Germán Rubén; Lara-Ramírez, Edgar E; De Luna-Santillana, Erick J; Rodríguez-Luna, Isabel Cristina; Sánchez-Varela, Alejandro; Carreño-López, Ricardo; Moreno-Medina, Víctor Ricardo; Rodríguez-Pérez, Mario A; López-Vidal, Yolanda; Guo, Xianwu

    2014-02-01

    Helicobacter pylori infects more than half of the world's population, making it the most widespread infection of bacteria. It has high genetic diversity and has been considered as one of the most variable bacterial species. In the present study, a PCR-based method was used to detect the presence and the relative frequency of homologous recombination between repeat sequences (>500 bp) in H. pylori 26695. All the recombinant structures have been confirmed by sequencing. The inversion generated between inverted repeats showed distinct features from the recombination for duplication or deletion between direct repeats. Meanwhile, we gave the mathematic reasoning of a general formula for the calculation of relative recombination frequency and indicated the conditions for its application. This formula could be extensively applied to detect the frequency of homologous recombination, site-specific recombination, and other types of predictable recombination. Our results should be helpful for better understanding the genome evolution and adaptation of bacteria.

  7. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-07-21

    A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

  8. Method for identifying and quantifying nucleic acid sequence aberrations

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

  9. [MOLECULAR EVOLUTION OF ION CHANNELS: AMINO ACID SEQUENCES AND 3D STRUCTURES].

    PubMed

    Korkosh, V S; Zhorov, B S; Tikhonov, D B

    2016-01-01

    An integral part of modern evolutionary biology is comparative analysis of structure and function of macromolecules such as proteins. The first and critical step to understand evolution of homologous proteins is their amino acid sequence alignment. However, standard algorithms fop not provide unambiguous sequence alignments for proteins of poor homology. More reliable results can be obtained by comparing experimental 3D structures obtained at atomic resolution, for instance, with the aid of X-ray structural analysis. If such structures are lacking, homology modeling is used, which may take into account indirect experimental data on functional roles of individual amino-acid residues. An important problem is that the sequence alignment, which reflects genetic modifications, does not necessarily correspond to the functional homology. The latter depends on three-dimensional structures which are critical for natural selection. Since alignment techniques relying only on the analysis of primary structures carry no information on the functional properties of proteins, including 3D structures into consideration is very important. Here we consider several examples involving ion channels and demonstrate that alignment of their three-dimensional structures can significantly improve sequence alignments obtained by traditional methods.

  10. Nucleotide sequence of the 3'-noncoding region of alfalfa mosaic virus RNA 4 and its homology with the genomic RNAs.

    PubMed Central

    Koper-Zwarthoff, E C; Brederode, F T; Walstra, P; Bol, J F

    1979-01-01

    A 226-nucleotide fragment was derived from alfalfa mosaic virus RNA 4 (ALMV RNA 4), the subgenomic messenger for viral coat protein, and its sequence was deduced by in vitro labeling with polynucleotide kinase and application of RNA sequencing techniques. The fragment contains the 3'-terminal 45 nucleotides of the coat protein cistron and the complete 3'-noncoding region of 182 nucleotides. The total length of RNA 4 was calculated to be 881 nucleotides. AlMV RNAs 1, 2 and 3 were elongated with a 3'-terminal poly(A) stretch and subjected to sequence analysis by using a specific primer, reverse transcriptase and chain terminators. This revealed and extensive homology between the 3'-terminal 140 to 150 nucleotides of all four ALMV RNAs. Despite a number of base substitutions, the secondary structure of the homologous region is highly conserved. The observed homology indicates that, as with RNA 4, the sites with a high affinity for the viral coat protein are located at the 3'-termini of the genomic RNAs. Images PMID:537914

  11. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations.

    PubMed

    Abascal, Federico; Zardoya, Rafael; Telford, Maximilian J

    2010-07-01

    We present TranslatorX, a web server designed to align protein-coding nucleotide sequences based on their corresponding amino acid translations. Many comparisons between biological sequences (nucleic acids and proteins) involve the construction of multiple alignments. Alignments represent a statement regarding the homology between individual nucleotides or amino acids within homologous genes. As protein-coding DNA sequences evolve as triplets of nucleotides (codons) and it is known that sequence similarity degrades more rapidly at the DNA than at the amino acid level, alignments are generally more accurate when based on amino acids than on their corresponding nucleotides. TranslatorX novelties include: (i) use of all documented genetic codes and the possibility of assigning different genetic codes for each sequence; (ii) a battery of different multiple alignment programs; (iii) translation of ambiguous codons when possible; (iv) an innovative criterion to clean nucleotide alignments with GBlocks based on protein information; and (v) a rich output, including Jalview-powered graphical visualization of the alignments, codon-based alignments coloured according to the corresponding amino acids, measures of compositional bias and first, second and third codon position specific alignments. The TranslatorX server is freely available at http://translatorx.co.uk.

  12. Methods for analyzing nucleic acid sequences

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2011-05-17

    The present invention is directed to a method of sequencing a target nucleic acid. The method provides a complex comprising a polymerase enzyme, a target nucleic acid molecule, and a primer, wherein the complex is immobilized on a support Fluorescent label is attached to a terminal phosphate group of the nucleotide or nucleotide analog. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The time duration of the signal from labeled nucleotides or nucleotide analogs that become incorporated is distinguished from freely diffusing labels by a longer retention in the observation volume for the nucleotides or nucleotide analogs that become incorporated than for the freely diffusing labels.

  13. Complete cDNA and derived amino acid sequence of human factor V

    SciTech Connect

    Jenny, R.J.; Pittman, D.D.; Toole, J.J.; Kriz, R.W.; Aldape, R.A.; Hewick, R.M.; Kaufman, R.J.; Mann, K.G.

    1987-07-01

    cDNA clones encoding human factor V have been isolated from an oligo(dT)-primed human fetal liver cDNA library prepared with vector Charon 21A. The cDNA sequence of factor V from three overlapping clones includes a 6672-base-pair (bp) coding region, a 90-bp 5' untranslated region, and a 163-bp 3' untranslated region within which is a poly(A)tail. The deduced amino acid sequence consists of 2224 amino acids inclusive of a 28-amino acid leader peptide. Direct comparison with human factor VIII reveals considerable homology between proteins in amino acid sequence and domain structure: a triplicated A domain and duplicated C domain show approx. 40% identity with the corresponding domains in factor VIII. As in factor VIII, the A domains of factor V share approx. 40% amino acid-sequence homology with the three highly conserved domains in ceruloplasmin. The B domain of factor V contains 35 tandem and approx. 9 additional semiconserved repeats of nine amino acids of the form Asp-Leu-Ser-Gln-Thr-Thr/Asn-Leu-Ser-Pro and 2 additional semiconserved repeats of 17 amino acids. Factor V contains 37 potential N-linked glycosylation sites, 25 of which are in the B domain, and a total of 19 cysteine residues.

  14. Sequences homologous to retrovirus-like genes of the mouse are present in multiple copies in the Syrian hamster genome.

    PubMed Central

    Lueders, K K; Kuff, E L

    1981-01-01

    The genome of M. musculus contains many copies of DNA sequences homologous to the 35S RNA of intracisternal type-A particles (IAP) (1,2). A major class of IAP genes has been identified and isolated from a mouse library in Charon 4A (3). Cloned mouse IAP genes were used as probes to study homologous sequences in the DNA of other species. Sequences related to mouse IAP genes were detected in the DNAs from a variety of animal cells. DNAs from rat, gerbil, and hamster cells all gave strong reactions which could be localized to discrete restriction fragments on genomic blots. The reaction of Syrian hamster DNA was particularly strong. Fragments derived from different parts of the IAP gene all reacted with Syrian hamster DNA, and the reactive restriction fragments in the Syrian hamster DNA could be ordered with reference to the known restriction map of the IAP genes. The data suggest that sequences related to mouse IAP genes make up a 7 Kb unit in the Syrian hamster genome. Since the majority of the hamster sequences are quite divergent from those in the mouse, the ease with which they are detected suggests that they must be reiterated in the hamster genome. Images PMID:7198224

  15. CBH1 homologs and variant CBH1 cellulases

    DOEpatents

    Goedegebuur, Frits; Gualfetti, Peter; Mitchinson, Colin; Neefe, Paulien

    2008-11-18

    Disclosed are a number of homologs and variants of Hypocrea jecorina Cel7A (formerly Trichoderma reesei cellobiohydrolase I or CBH1), nucleic acids encoding the same and methods for producing the same. The homologs and variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted and/or deleted.

  16. CBH1 homologs and variant CBH1 cellulases

    DOEpatents

    Goedegebuur, Frits; Gualfetti, Peter; Mitchinson, Colin; Neefe, Paulien

    2011-05-31

    Disclosed are a number of homologs and variants of Hypocrea jecorina Cel7A (formerly Trichoderma reesei cellobiohydrolase I or CBH1), nucleic acids encoding the same and methods for producing the same. The homologs and variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted and/or deleted.

  17. CBH1 homologs and varian CBH1 cellulase

    DOEpatents

    Goedegebuur, Frits; Gualfetti, Peter; Mitchinson, Colin; Neefe, Paulien

    2014-07-01

    Disclosed are a number of homologs and variants of Hypocrea jecorina Cel7A (formerly Trichoderma reesei cellobiohydrolase I or CBH1), nucleic acids encoding the same and methods for producing the same. The homologs and variant cellulases have the amino acid sequence of a glycosyl hydrolase of family 7A wherein one or more amino acid residues are substituted and/or deleted.

  18. Nucleic acid (cDNA) and amino acid sequences of the maize endosperm protein glutelin-2.

    PubMed Central

    Prat, S; Cortadas, J; Puigdomènech, P; Palau, J

    1985-01-01

    The cDNA coding for a glutelin-2 protein from maize endosperm has been cloned and the complete amino acid sequence of the protein derived for the first time. An immature maize endosperm cDNA bank was screened for the expression of a beta-lactamase:glutelin-2 (G2) fusion polypeptide by using antibodies against the purified 28 kd G2 protein. A clone corresponding to the 28 kd G2 protein was sequenced and the primary structure of this protein was derived. Five regions can be defined in the protein sequence: an 11 residue N-terminal part, a repeated region formed by eight units of the sequence Pro-Pro-Pro-Val-His-Leu, an alternating Pro-X stretch 21 residues long, a Cys rich domain and a C-terminal part rich in Gln. The protein sequence is preceded by 19 residues which have the characteristics of the signal peptide found in secreted proteins. Unlike zeins, the main maize storage proteins, 28 kd glutelin-2 has several homologous sequences in common with other cereal storage proteins. Images PMID:3839076

  19. Homology modeling of Homo sapiens Lipoic acid Synthase: substrate docking and insights on its binding mode.

    PubMed

    Krishnamoorthy, Ezhilarasi; Hassan, Sameer; Hanna, Luke Elizabeth; Padmalayam, Indira; Rajaram, Rama; Viswanathan, Vijay

    2016-10-04

    Lipoic acid synthase (LIAS) is an iron-sulfur cluster mitochondrial enzyme which catalyzes the final step in the de novo pathway for the biosynthesis of lipoic acid, a potent antioxidant. Recently there has been significant interest in its role in metabolic diseases and its deficiency in LIAS expression has been linked to conditions such as diabetes, atherosclerosis and neonatal-onset epilepsy, suggesting a strong inverse correlation between LIAS reduction and disease status. In this study we use a bioinformatics approach to predict its structure, which would be helpful to understanding its role. A homology model for LIAS protein was generated using X - ray crystallographic structure of Thermosynechococcus elongatus BP-1 (PDB ID: 4U0P). The predicted structure has 93% of the residues in the most favour region of Ramachandran plot. The active site of LIAS protein was mapped and docked with S-Adenosyl Methionine (SAM) using GOLD software. The LIAS - SAM complex was further refined using molecular dynamics simulation within the subsite 1 and subsite 3 of the active site. To the best of our knowledge, this is the first study to report a reliable homology model of LIAS protein. This study will facilitate a better understanding mode of action of the enzyme-substrate complex for future studies in designing drugs that can target LIAS protein.

  20. Evolution and homologous recombination of the hemagglutinin-esterase gene sequences from porcine torovirus

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The objective of the present study was to gain new insights into the evolution, homologous recombination and selection pressures imposed on the porcine torovirus (PToV), by examining changes in the hemagglutinin-esterase (HE) gene. The most recent common ancestor of PToV was estimated to have emerge...

  1. Snake venoms. The amino-acid sequence of trypsin inhibitor E of Dendroaspis polylepis polylepis (Black Mamba) venom.

    PubMed

    Joubert, F J; Strydom, D J

    1978-06-01

    Trypsin inhibitor E from black mamba venom comprises 59 amino acid residues in a single polypeptide chain, cross-linked by three intrachain disulphide bridges. The complete primary structure of inhibitor E was elucidated. The sequence is homologous with trypsin inhibitors from different sources. Unique among this homologous series of proteinase inhibitors, inhibitor E has an affinity for transition metal ions, exemplified here by Cu2 and Co2+.

  2. Human meiotic recombination products revealed by sequencing a hotspot for homologous strand exchange in multiple HNPP deletion patients.

    PubMed

    Reiter, L T; Hastings, P J; Nelis, E; De Jonghe, P; Van Broeckhoven, C; Lupski, J R

    1998-05-01

    The HNPP (hereditary neuropathy with liability to pressure palsies) deletion and CMT1A (Charcot-Marie-Tooth disease type 1A) duplication are the reciprocal products of homologous recombination events between misaligned flanking CMT1A-REP repeats on chromosome 17p11. 2-p12. A 1.7-kb hotspot for homologous recombination was previously identified wherein the relative risk of an exchange event is 50 times higher than in the surrounding 98.7% identical sequence shared by the CMT1A-REPs. To refine the region of exchange further, we designed a PCR strategy to amplify the recombinant CMT1A-REP from HNPP patients as well as the proximal and distal CMT1A-REPs from control individuals. By comparing the sequences across recombinant CMT1A-REPs to that of the proximal and distal CMT1A-REPs, the exchange was mapped to a 557-bp region within the previously identified 1.7-kb hotspot in 21 of 23 unrelated HNPP deletion patients. Two patients had recombined sequences suggesting an exchange event closer to the mariner-like element previously identified near the hotspot. Five individuals also had interspersed patches of proximal or distal repeat specific DNA sequence indicating potential gene conversion during the exchange of genetic material. Our studies provide a direct observation of human meiotic recombination products. These results are consistent with the hypothesis that minimum efficient processing segments, which have been characterized in Escherichia coli, yeast, and cultured mammalian cells, may be required for efficient homologous meiotic recombination in humans.

  3. Distribution of alginate gene sequences in the Pseudomonas rRNA homology group I-Azomonas-Azotobacter lineage of superfamily B procaryotes.

    PubMed Central

    Fialho, A M; Zielinski, N A; Fett, W F; Chakrabarty, A M; Berry, A

    1990-01-01

    Chromosomal DNA from group I Pseudomonas species, Azotobacter vinelandii, Azomonas macrocytogens, Xanthomonas campestris, Serpens flexibilis, and three enteric bacteria was screened for sequences homologous to four Pseudomonas aeruginosa alginate (alg) genes (algA, pmm, algD, and algR1). All the group I Pseudomonas species tested (including alginate producers and nonproducers) contained sequences homologous to all the P. aeruginosa alg genes used as probes, with the exception of P. stutzeri, which lacked algD. Azotobacter vinelandii also contained sequences homologous to all the alg gene probes tested, while Azomonas macrocytogenes DNA showed homology to all but algD. X. campestris contained sequences homologous to pmm and algR1 but not to algA or algD. The helical bacterium S. flexibilis showed homology to the algR1 gene, suggesting that an environmentally responsive regulatory gene similar to algR1 exists in S. flexibilis. Escherichia coli showed homology to the algD and algR1 genes, while Salmonella typhimurium and Klebsiella pneumoniae failed to show homology with any of the P. aeruginosa alg genes. Since all the organisms tested are superfamily B procaryotes, these results suggest that within superfamily B, the alginate genes are distributed throughout the Pseudomonas group I-Azotobacter-Azomonas lineage, while only some alg genes have been retained in the Pseudomonas group V (Xanthomonas) and enteric lineages. Images PMID:1689562

  4. Isolation of Insertion Sequence ISRLdTAL1145-1 from a Rhizobium sp. (Leucaena diversifolia) and Distribution of Homologous Sequences Identifying Cross-Inoculation Group Relationships †

    PubMed Central

    Rice, Douglas J.; Somasegaran, Padma; MacGlashan, Kathryn; Bohlool, B. Ben

    1994-01-01

    Insertion sequence (IS) element ISRLdTAL1145-1 from Rhizobium sp. (Leucaena diversifolia) strain TAL 1145 was entrapped in the sacB gene of the positive selection vector pUCD800 by insertional inactivation. A hybridization probe prepared from the whole 2.5-kb element was used to determine the distribution of homologous sequences in a diverse collection of 135 Rhizobium and Bradyrhizobium strains. The IS probe hybridized strongly to Southern blots of genomic DNAs from 10 rhizobial strains that nodulate both Phaseolus vulgaris (beans) and Leucaena leucocephala (leguminous trees), 1 Rhizobium sp. that nodulates Leucaena spp., 9 R. meliloti (alfalfa) strains, 4 Rhizobium spp. that nodulate Sophora chrysophylla (leguminous trees), and 1 nonnodulating bacterium associated with the nodules of Pithecellobium dulce from the Leucaena cross-inoculation group, producing distinguishing IS patterns for each strain. Hybridization analysis revealed that ISRLdTAL1145-1 was strongly homologous with and closely related to a previously isolated element, ISRm USDA1024-1 from R. meliloti, while restriction enzyme analysis found structural similarities and differences between the two IS homologs. Two internal segments of these IS elements were used to construct hybridization probes of 1.2 kb and 380 bp that delineate a structural similarity and a difference, respectively, of the two IS homologs. The internal segment probes were used to analyze the structures of homologous IS elements in other strains. Five types of structural variation in homolog IS elements were found. The predominate IS structural type naturally occurring in a strain can reasonably identify the strain's cross-inoculation group relationships. Three IS structural types were found in Rhizobium species that nodulate beans and Leucaena species, one of which included the designated type IIB strain of R. tropici (CIAT 899). Weak homology to the whole IS probe, but not with the internal segments, was found with two

  5. Protegrin structure-activity relationships: using homology models of synthetic sequences to determine structural characteristics important for activity.

    PubMed

    Ostberg, Nathan; Kaznessis, Yiannis

    2005-02-01

    The protegrin family of antimicrobial peptides is among the shortest in sequence length while remaining very active against a variety of microorganisms. The major goal of this study is to characterize easily calculated molecular properties, which quantitatively show high correlation with antibacterial activity. The peptides studied have high sequence similarity but vary in activity over more than an order of magnitude. Hence, sequence analysis alone cannot be used to predict activity for these peptides. We calculate structural properties of 62 protegrin and protegrin-analogue peptides and correlate them to experimental activities against six microbe species, as well as hemolytic and cytotoxic activities. Natural protegrins structures were compared with synthetic derivatives using homology modeling, and property descriptors were calculated to determine the characteristics that confer their antimicrobial activity. A structure-activity relationship study of all these peptides provides information about the structural properties that affect activity against different microbial species.

  6. Shark myelin basic protein: amino acid sequence, secondary structure, and self-association.

    PubMed

    Milne, T J; Atkins, A R; Warren, J A; Auton, W P; Smith, R

    1990-09-01

    Myelin basic protein (MBP) from the Whaler shark (Carcharhinus obscurus) has been purified from acid extracts of a chloroform/methanol pellet from whole brains. The amino acid sequence of the majority of the protein has been determined and compared with the sequences of other MBPs. The shark protein has only 44% homology with the bovine protein, but, in common with other MBPs, it has basic residues distributed throughout the sequence and no extensive segments that are predicted to have an ordered secondary structure in solution. Shark MBP lacks the triproline sequence previously postulated to form a hairpin bend in the molecule. The region containing the putative consensus sequence for encephalitogenicity in the guinea pig contains several substitutions, thus accounting for the lack of activity of the shark protein. Studies of the secondary structure and self-association have shown that shark MBP possesses solution properties similar to those of the bovine protein, despite the extensive differences in primary structure.

  7. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid...

  8. Amino acid sequence of atrial natriuretic peptides in human coronary sinus plasma.

    PubMed

    Yandle, T; Crozier, I; Nicholls, G; Espiner, E; Carne, A; Brennan, S

    1987-07-31

    Two atrial natriuretic peptides were purified from pooled human coronary sinus plasma by Sep-Pak extraction, immunoaffinity chromatography and reverse phase HPLC. The amino acid sequences of the two peptides were homologous with 99-126 human atrial natriuretic peptide (hANP) and 106-126 hANP, the latter being most probably linked to 99-105 ANP by the disulphide bond. The molar ratio of the peptides in plasma, as assessed by radioimmunoassay was 10:3.

  9. A convenient and adaptable package of computer programs for DNA and protein sequence management, analysis and homology determination.

    PubMed Central

    Pustell, J; Kafatos, F C

    1984-01-01

    We describe the further development of a widely used package of DNA/protein sequence analysis programs (1). Important revisions have been made based on user experience, and new features, multi-user capability, and a set of large scale homology programs have been added. The programs are very user friendly, economical of time and memory, and extremely transportable. They are written in a version of FORTRAN which will compile, with a few defined changes, as FORTRAN 66, FORTRAN 77, FORTRAN IV, FORTRAN IV+, and others. They are running on a variety of microcomputers, minicomputers, and mainframes, in both single user and multi-user configurations. PMID:6320100

  10. dRHP-PseRA: detecting remote homology proteins using profile-based pseudo protein sequence and rank aggregation

    PubMed Central

    Chen, Junjie; Long, Ren; Wang, Xiao-long; Liu, Bin; Chou, Kuo-Chen

    2016-01-01

    Protein remote homology detection is an important task in computational proteomics. Some computational methods have been proposed, which detect remote homology proteins based on different features and algorithms. As noted in previous studies, their predictive results are complementary to each other. Therefore, it is intriguing to explore whether these methods can be combined into one package so as to further enhance the performance power and application convenience. In view of this, we introduced a protein representation called profile-based pseudo protein sequence to extract the evolutionary information from the relevant profiles. Based on the concept of pseudo proteins, a new predictor, called “dRHP-PseRA”, was developed by combining four state-of-the-art predictors (PSI-BLAST, HHblits, Hmmer, and Coma) via the rank aggregation approach. Cross-validation tests on a SCOP benchmark dataset have demonstrated that the new predictor has remarkably outperformed any of the existing methods for the same purpose on ROC50 scores. Accordingly, it is anticipated that dRHP-PseRA holds very high potential to become a useful high throughput tool for detecting remote homology proteins. For the convenience of most experimental scientists, a web-server for dRHP-PseRA has been established at http://bioinformatics.hitsz.edu.cn/dRHP-PseRA/. PMID:27581095

  11. Top-Down-Assisted Bottom-Up Method for Homologous Protein Sequencing: Hemoglobin from 33 Bird Species

    NASA Astrophysics Data System (ADS)

    Song, Yang; Laskay, Ünige A.; Vilcins, Inger-Marie E.; Barbour, Alan G.; Wysocki, Vicki H.

    2015-11-01

    Ticks are vectors for disease transmission because they are indiscriminant in their feeding on multiple vertebrate hosts, transmitting pathogens between their hosts. Identifying the hosts on which ticks have fed is important for disease prevention and intervention. We have previously shown that hemoglobin (Hb) remnants from a host on which a tick fed can be used to reveal the host's identity. For the present research, blood was collected from 33 bird species that are common in the U.S. as hosts for ticks but that have unknown Hb sequences. A top-down-assisted bottom-up mass spectrometry approach with a customized searching database, based on variability in known bird hemoglobin sequences, has been devised to facilitate fast and complete sequencing of hemoglobin from birds with unknown sequences. These hemoglobin sequences will be added to a hemoglobin database and used for tick host identification. The general approach has the potential to sequence any set of homologous proteins completely in a rapid manner.

  12. Leojaponic acids A and B, two new homologous terpenoids, isolated from Leonurus japonicus.

    PubMed

    Wu, Han-Kui; Mao, Yan-Jun; Sun, Shan-Shan; Xu, Zhi-Yong; Ma, Ya; Cao, Jin-Xia; Qi, He; Wu, Zhi-Fu; Li, Gang; Yang, Wei-Hua

    2016-04-01

    The present study aimed at isolation and purification of the bioactive terpenoids from the herb of Leonurus japonicus by chromatographic separations such as silica gel, sephadex LH-20 and C18 reversed phase silica gel, as well as preparative HPLC. As a result, leojaponic acids A (1, C17H24O4) and B (2, C18H26O4), two homologous terpenoids, together with (-)-loliolide (3), 1-(3-ethylphenyl) ethane-1, 2-diol (4) and dibutyl phthalate (5), were isolated from the EtOH extract of L. japonicus. All the chemical structures of the isolates were elucidated on the basis of 1D and 2D NMR analyses. Compounds 1 and 2 were new terpenoids, and Compounds 3 and 4 were isolated and identified for the first time from this plant. In addition, the α-glucosidase and tyrosinase inhibitory activity of the new compounds were evaluated.

  13. Amino-terminal sequence of p36 and associated p10: identification of the site of tyrosine phosphorylation and homology with S-100.

    PubMed Central

    Glenney, J R; Tack, B F

    1985-01-01

    p36 is a major substrate of both viral and growth factor-receptor-associated tyrosine protein kinases. p36 can be isolated as a complex consisting of a subunit of Mr 36,000 (p36) and a subunit of Mr 10,000 (p10), and it represents an abundant cellular protein. We have isolated the p36-p10 complex from bovine intestinal epithelium and analyzed the amino terminus of both subunits. Sequence analysis of the first 56 amino acids of p10 demonstrates a striking sequence homology (48% identically placed residues) with the Mr 10,000 calcium-binding proteins from bovine brain, termed S-100. Intestinal p36 could be effectively labeled on a single tyrosine in vitro with immunoprecipitated pp60v-src and [gamma-32P]ATP. Mild proteolysis of p36 with chymotrypsin resulted in the cleavage into large (Mr, 33,000) and small domains (Mr, 3000), with the latter representing the phosphorylated amino terminus. Although the amino terminus is apparently blocked, sequence analysis of a secondary tryptic peptide of the Mr 3000 fragment as well as the amino-terminal sequence of the Mr 33,000 domain and overlapping peptides clearly established the site of tyrosine phosphorylation. Images PMID:2415974

  14. STRUCTFAST: protein sequence remote homology detection and alignment using novel dynamic programming and profile-profile scoring.

    PubMed

    Debe, Derek A; Danzer, Joseph F; Goddard, William A; Poleksic, Aleksandar

    2006-09-01

    STRUCTFAST is a novel profile-profile alignment algorithm capable of detecting weak similarities between protein sequences. The increased sensitivity and accuracy of the STRUCTFAST method are achieved through several unique features. First, the algorithm utilizes a novel dynamic programming engine capable of incorporating important information from a structural family directly into the alignment process. Second, the algorithm employs a rigorous analytical formula for profile-profile scoring to overcome the limitations of ad hoc scoring functions that require adjustable parameter training. Third, the algorithm employs Convergent Island Statistics (CIS) to compute the statistical significance of alignment scores independently for each pair of sequences. STRUCTFAST routinely produces alignments that meet or exceed the quality obtained by an expert human homology modeler, as evidenced by its performance in the latest CAFASP4 and CASP6 blind prediction benchmark experiments.

  15. Detection of nucleic acid sequences by invader-directed cleavage

    DOEpatents

    Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

    1999-01-01

    The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.

  16. Sequencing and computational analysis of complete genome sequences of Citrus yellow mosaic badna virus from acid lime and pummelo.

    PubMed

    Borah, Basanta K; Johnson, A M Anthony; Sai Gopal, D V R; Dasgupta, Indranil

    2009-08-01

    Citrus yellow mosaic badna virus (CMBV), a member of the Family Caulimoviridae, Genus Badnavirus, is the causative agent of Citrus mosaic disease in India. Although the virus has been detected in several citrus species, only two full-length genomes, one each from Sweet orange and Rangpur lime, are available in publicly accessible databases. In order to obtain a better understanding of the genetic variability of the virus in other citrus mosaic-affected citrus species, we performed the cloning and sequence analysis of complete genomes of CMBV from two additional citrus species, Acid lime and Pummelo. We show that CMBV genomes from the two hosts share high homology with previously reported CMBV sequences and hence conclude that the new isolates represent variants of the virus present in these species. Based on in silico sequence analysis, we predict the possible function of the protein encoded by one of the five ORFs.

  17. Initial cloning and sequencing of hydHG, an operon homologous to ntrBC and regulating the labile hydrogenase activity in Escherichia coli K-12.

    PubMed Central

    Stoker, K; Reijnders, W N; Oltmann, L F; Stouthamer, A H

    1989-01-01

    To isolate genes from Escherichia coli which regulate the labile hydrogenase activity, a plasmid library was used to transform hydL mutants lacking the labile hydrogenase. A single type of gene, designated hydG, was isolated. This gene also partially restored the hydrogenase activity in hydF mutants (which are defective in all hydrogenase isoenzymes), although the low hydrogenase 1 and 2 levels were not induced. Therefore, hydG apparently regulates, specifically, the labile hydrogenase activity. Restoration of this latter activity in hydF mutants was accompanied by a proportional increase of the H2 uptake activity, suggesting a functional relationship. H2:fumarate oxidoreductase activity was not restored in complemented hydL mutants. These latter strains may therefore lack, in addition to the labile hydrogenase, a second component (provisionally designated component R), possibly an electron carrier coupling H2 oxidation to the anerobic respiratory chain. Sequence analysis showed an open reading frame of 1,314 base pairs for hydG. It was preceded by a ribosome-binding site but apparently lacked a promoter. Minicell experiments revealed a single polypeptide of approximately 50 kilodaltons. Comparison of the predicted amino acid sequence with a protein sequence data base revealed strong homology to NtrC from Klebsiella pneumoniae, a DNA-binding transcriptional activator. The 411 base pairs upstream from pHG40 contained a second open reading frame overlapping hydG by four bases. The deduced amino acid sequence showed considerable homology with the C-terminal part of NtrB. This sequence was therefore assumed to be part of a second gene, encoding the NtrB-like component, and was designated hydH. The labile hydrogenase activity in E. coli is apparently regulated by a multicomponent system analogous to the NtrB-NtrC system. This conclusion is in agreement with the results of Birkmann et al. (A. Birkmann, R. G. Sawers, and A. Böck, Mol. Gen. Genet. 210:535-542, 1987), who

  18. CTLA-8, cloned from an activated T cell, bearing AU-rich messenger RNA instability sequences, and homologous to a herpesvirus saimiri gene.

    PubMed

    Rouvier, E; Luciani, M F; Mattéi, M G; Denizot, F; Golstein, P

    1993-06-15

    To detect novel molecules involved in immune functions, a subtracted cDNA library between closely related murine lymphoid cells was prepared using improved technology. Differential screening of this library yielded several clones with a very restricted tissue specificity, including one that we named CTLA-8. CTLA-8 transcripts could be detected only in T cell hybridoma clones related to the one used to prepare the library. Southern blots showed that the CTLA-8 gene was single copy in mice, rats, and humans. By radioactive in situ hybridization, the CTLA-8 gene was mapped at a single site on mouse chromosome 1A and human chromosome 2q31, in a known interspecific syntenic region. The CTLA-8 cDNA sequence indicated the presence, in the 3'-untranslated region of the mRNA, of AU-rich repeats previously found in the mRNA of various cytokines, growth factors, and oncogenes. The CTLA-8 cDNA contained an open reading frame encoding a putative protein of 150 amino acids. This protein was 57% homologous to the putative protein encoded by the ORF13 gene of herpesvirus Saimiri, a T lymphotropic virus. These findings are discussed in the context of other genes of this herpesvirus homologous to known immunologically active molecules. More generally, CTLA-8 may belong to the growing set of virus-captured functionally important cellular genes related to the immune system or to cell death and cell survival.

  19. Endogenous small RNAs in grain: semi-quantification and sequence homology to human and animal genes.

    PubMed

    Ivashuta, Sergey I; Petrick, Jay S; Heisel, Sara E; Zhang, Yuanji; Guo, Liang; Reynolds, Tracey L; Rice, James F; Allen, Edwards; Roberts, James K

    2009-02-01

    Small interfering RNAs (siRNAs) and microRNAs (miRNAs) are effector molecules of RNA interference (RNAi), a highly conserved RNA-based gene suppression mechanism in plants, mammals and other eukaryotes. Endogenous RNAi-based gene suppression has been harnessed naturally and through conventional breeding to achieve desired plant phenotypes. The present study demonstrates that endogenous small RNAs, such as siRNAs and miRNAs, are abundant in soybean seeds, corn kernels, and rice grain, plant tissues that are traditionally used for food and feed. Numerous endogenous plant small RNAs were found to have perfect complementarity to human genes as well as those of other mammals. The abundance of endogenous small RNA molecules in grain from safely consumed food and feed crops such as soybean, corn, and rice and the homology of a number of these dietary small RNAs to human and animal genomes and transcriptomes establishes a history of safe consumption for dietary small RNAs.

  20. Homologous Recombination Defective Arabidopsis Mutants Exhibit Enhanced Sensitivity to Abscisic Acid

    PubMed Central

    Roy, Sujit; Das, Kali Pada

    2017-01-01

    Abscisic acid (ABA) acts as an important plant hormone in regulating various aspects of plant growth and developmental processes particularly under abiotic stress conditions. An increased ABA level in plant cells inhibits DNA replication and cell division, causing plant growth retardation. In this study, we have investigated the effects of ABA on the growth responses of some major loss-of-function mutants of DNA double-stand break (DSB) repair genes in Arabidopsis during seed germination and early stages of seedling growth for understanding the role of ABA in the induction of genome instability in plants. A comparative analysis of ABA sensitivity of wild-type Arabidopsis and the knockout mutant lines related to DSB sensors, including atatm, atatr, the non-homologous end joining (NHEJ) pathway genes, and mutants related to homologous recombination (HR) pathway genes showed relatively enhanced sensitivity of atatr and HR-related mutants to ABA treatment. The expression levels of HR-related genes were increased in wild-type Arabidopsis (Col-0) during seed germination and early stages of seedling growth. Immunoblotting experiments detected phosphorylation of histone H2AX in wild-type (Col-0) and DSB repair gene mutants after ABA treatment, indicating the activation of DNA damage response due to ABA treatment. Analyses of DSB repair kinetics using comet assay under neutral condition have revealed comparatively slower DSB repair activity in HR mutants. Overall, our results have provided comprehensive information on the possible effect of ABA on DNA repair machinery in plants and also indicated potential functional involvement of HR pathway in repairing ABA induced DNA damage in Arabidopsis. PMID:28046013

  1. Digital cloning: identification of human cDNAs homologous to novel kinases through expressed sequence tag database searching.

    PubMed

    Chen, H C; Kung, H J; Robinson, D

    1998-01-01

    Identification of novel kinases based on their sequence conservation within kinase catalytic domain has relied so far on two major approaches, low-stringency hybridization of cDNA libraries, and PCR method using degenerate primers. Both of these approaches at times are technically difficult and time-consuming. We have developed a procedure that can significantly reduce the time and effort involved in searching for novel kinases and increase the sensitivity of the analysis. This procedure exploits the computer analysis of a vast resource of human cDNA sequences represented in the expressed sequence tag (EST) database. Seventeen novel human cDNA clones showing significant homology to serine/threonine kinases, including STE-20, CDK- and YAK-related family kinases, were identified by searching EST database. Further sequence analysis of these novel kinases obtained either directly from EST clones or from PCR-RACE products confirmed their identity as protein kinases. Given the rapid accumulation of the EST database and the advent of powerful computer analysis software, this approach provides a fast, sensitive, and economical way to identify novel kinases as well as other genes from EST database.

  2. More Than 1,001 Problems with Protein Domain Databases: Transmembrane Regions, Signal Peptides and the Issue of Sequence Homology

    PubMed Central

    Wong, Wing-Cheong; Maurer-Stroh, Sebastian; Eisenhaber, Frank

    2010-01-01

    Large-scale genome sequencing gained general importance for life science because functional annotation of otherwise experimentally uncharacterized sequences is made possible by the theory of biomolecular sequence homology. Historically, the paradigm of similarity of protein sequences implying common structure, function and ancestry was generalized based on studies of globular domains. Having the same fold imposes strict conditions over the packing in the hydrophobic core requiring similarity of hydrophobic patterns. The implications of sequence similarity among non-globular protein segments have not been studied to the same extent; nevertheless, homology considerations are silently extended for them. This appears especially detrimental in the case of transmembrane helices (TMs) and signal peptides (SPs) where sequence similarity is necessarily a consequence of physical requirements rather than common ancestry. Thus, matching of SPs/TMs creates the illusion of matching hydrophobic cores. Therefore, inclusion of SPs/TMs into domain models can give rise to wrong annotations. More than 1001 domains among the 10,340 models of Pfam release 23 and 18 domains of SMART version 6 (out of 809) contain SP/TM regions. As expected, fragment-mode HMM searches generate promiscuous hits limited to solely the SP/TM part among clearly unrelated proteins. More worryingly, we show explicit examples that the scores of clearly false-positive hits, even in global-mode searches, can be elevated into the significance range just by matching the hydrophobic runs. In the PIR iProClass database v3.74 using conservative criteria, we find that at least between 2.1% and 13.6% of its annotated Pfam hits appear unjustified for a set of validated domain models. Thus, false-positive domain hits enforced by SP/TM regions can lead to dramatic annotation errors where the hit has nothing in common with the problematic domain model except the SP/TM region itself. We suggest a workflow of flagging

  3. Complete amino acid sequence of the A chain of human complement-classical-pathway enzyme C1r.

    PubMed Central

    Arlaud, G J; Willis, A C; Gagnon, J

    1987-01-01

    The amino acid sequence of human C1r A chain was determined, from sequence analysis performed on fragments obtained from C1r autolytic cleavage, cleavage of methionyl bonds, tryptic cleavages at arginine and lysine residues, and cleavages by staphylococcal proteinase. The polypeptide chain has an N-terminal serine residue and contains 446 amino acid residues (Mr 51,200). The sequence data allow chemical characterization of fragments alpha (positions 1-211), beta (positions 212-279) and gamma (positions 280-446) yielded from C1r autolytic cleavage, and identification of the two major cleavage sites generating these fragments. Position 150 of C1r A chain is occupied by a modified amino acid residue that, upon acid hydrolysis, yields erythro-beta-hydroxyaspartic acid, and that is located in a sequence homologous to the beta-hydroxyaspartic acid-containing regions of Factor IX, Factor X, protein C and protein Z. Sequence comparison reveals internal homology between two segments (positions 10-78 and 186-257). Two carbohydrate moieties are attached to the polypeptide chain, both via asparagine residues at positions 108 and 204. Combined with the previously determined sequence of C1r B chain [Arlaud & Gagnon (1983) Biochemistry 22, 1758-1764], these data give the complete sequence of human C1r. PMID:3036070

  4. Synthesis of a Homologous Series of Side Chain Extended Orthogonally-Protected Aminooxy-Containing Amino Acids

    PubMed Central

    Liu, Fa; Thomas, Joshua; Burke, Terrence R.

    2008-01-01

    Practical methodology is reported for the synthesis of a homologous series of side chain extended amino acids containing aminooxy functionality bearing orthogonal protection suitable for Fmoc peptide synthesis. These reagents may be useful for the preparation of libraries containing fragments joined by peptide linkers. PMID:19122755

  5. Comparative genomic survey, exon-intron annotation and phylogenetic analysis of NAT-homologous sequences in archaea, protists, fungi, viruses, and invertebrates

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We have previously published extensive genomic surveys [1-3], reporting NAT-homologous sequences in hundreds of sequenced bacterial, fungal and vertebrate genomes. We present here the results of our latest search of 2445 genomes, representing 1532 (70 archaeal, 1210 bacterial, 43 protist, 97 fungal,...

  6. Hybridization and sequencing of nucleic acids using base pair mismatches

    DOEpatents

    Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

    2001-01-01

    Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.

  7. Spermatogenesis of the lizard Lacerta vivipara: histological studies and amino acid sequence of a protamine lacertine 1.

    PubMed

    Martinage, A; Depeiges, A; Wouters, D; Morel, L; Sautière, P

    1996-06-01

    The lizard Lacerta vivipara is a seasonal breeder with a well characterized reproductive cycle. An histological study of the lizard testis has been performed at different stages of spermatogenesis and the nuclear basic proteins content was assessed by electrophoretical analysis. Two protamines, lacertines 1 and 2, are present in spermatozoa in April and May. We have isolated lacertine1 and characterized a protamine with a mass of 4,963.7 Da. Amino acid sequence of this protamine (41 residues) was established from data provided by automated Edman degradation. It is characterized by a basic amino acid stretch in the N- and C-terminal regions and by a central part which only consists of 3 different intermingled amino acids. This protamine presents 62% homology with scylliorhinine Z3 from dog-fish Scylliorhinus caniculus and 58% homology with quail protamine. The reported lizard protamine sequence is the first reptilian protamine sequence available so far.

  8. Multilocus Sequence Typing of Lactobacillus casei Reveals a Clonal Population Structure with Low Levels of Homologous Recombination▿ †

    PubMed Central

    Diancourt, Laure; Passet, Virginie; Chervaux, Christian; Garault, Peggy; Smokvina, Tamara; Brisse, Sylvain

    2007-01-01

    Robust genotyping methods for Lactobacillus casei are needed for strain tracking and collection management, as well as for population biology research. A collection of 52 strains initially labeled L. casei or Lactobacillus paracasei was first subjected to rplB gene sequencing together with reference strains of Lactobacillus zeae, Lactobacillus rhamnosus, and other species. Phylogenetic analysis showed that all 52 strains belonged to a single compact L. casei-L. paracasei sequence cluster, together with strain CIP107868 (= ATCC 334) but clearly distinct from L. rhamnosus and from a cluster with L. zeae and CIP103137T (= ATCC 393T). The strains were genotyped using amplified fragment length polymorphism, multilocus sequence typing based on internal portions of the seven housekeeping genes fusA, ileS, lepA, leuS, pyrG, recA, and recG, and tandem repeat variation (multilocus variable-number tandem repeats analysis [MLVA] using nine loci). Very high concordance was found between the three methods. Although amounts of nucleotide variation were low for the seven genes (π ranging from 0.0038 to 0.0109), 3 to 12 alleles were distinguished, resulting in 31 sequence types. One sequence type (ST1) was frequent (17 strains), but most others were represented by a single strain. Attempts to subtype ST1 strains by MLVA, ribotyping, clustered regularly interspaced short palindromic repeat characterization, and single nucleotide repeat variation were unsuccessful. We found clear evidence for homologous recombination during the diversification of L. casei clones, including a putative intragenic import of DNA into one strain. Nucleotides were estimated to change four times more frequently by recombination than by mutation. However, statistical congruence between individual gene trees was retained, indicating that recombination is not frequent enough to disrupt the phylogenetic signal. The developed multilocus sequence typing scheme should be useful for future studies of L. casei

  9. beta-Keratins in crocodiles reveal amino acid homology with avian keratins.

    PubMed

    Ye, Changjiang; Wu, Xiaobing; Yan, Peng; Amato, George

    2010-03-01

    The DNA sequences encoding beta-keratin have been obtained from Marsh Mugger (Crocodylus palustris) and Orinoco Crocodiles (Crocodylus intermedius). Through the deduced amino acid sequence, these proteins are rich in glycine, proline and serine. The central region of the proteins are composed of two beta-folded regions and show a high degree of identity with beta-keratins of aves and squamates. This central part is thought to be the site of polymerization to build the framework of beta-keratin filaments. It is believed that the beta-keratins in reptiles and birds share a common ancestry. Near the C-terminal, these beta-keratins contain a peptide rich in glycine-X and glycine-X-X, and the distinctive feature of the region is some 12-amino acid repeats, which are similar to the 13-amino acid repeats in chick scale keratin but absent from avian feather keratin. From our phylogenetic analysis, the beta-keratins in crocodile have a closer relationship with avian keratins than the other keratins in reptiles.

  10. Homology difference analysis of invasive mealybug species Phenacoccus solenopsis Tinsley in Southern China with COI gene sequence variability.

    PubMed

    Wu, F Z; Ma, J; Hu, X N; Zeng, L

    2015-02-01

    The mealybug species Phenacoccus solenopsis (P. solenopsis) has caused much agricultural damage since its recent invasion in China. However, the source of this invasion remains unclear. This study uses molecular methods to clarify the relationships among different population of P. solenopsis from China, USA, Pakistan, India, and Vietnam to determine the geographic origin of the introduction of this species into China. P. solenopsis samples were collected from 25 different locations in three provinces of Southern China. Samples from the USA, Pakistan, and Vietnam were also obtained. Parts of the mitochondrial genes for cytochrome oxidase I (COI) were sequenced for each sample. Homologous DNA sequences of the samples from the USA and India were downloaded from Gen Bank. Two haplotypes were found in China. The first was from most samples from the Guangdong, Guangxi, and Hainan populations in the China and Pakistan groups, and the second from a few samples from the Guangdong, Guangxi, Hainan populations in the China, Pakistan, India, and Vietnam groups. As shown in the maximum likelihood of trees constructed using the COI sequences, these samples belonged to two clades. Phylogenetic analysis suggested that most P. solenopsis mealybugs in Southern China are probably closely related to populations in Pakistan. The variation, relationship, expansion, and probable geographic origin of P. solenopsis mealybugs in Southern China are also discussed.

  11. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-03-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus.

  12. Amino acid sequence of versutoxin, a lethal neurotoxin from the venom of the funnel-web spider Atrax versutus.

    PubMed Central

    Brown, M R; Sheumack, D D; Tyler, M I; Howden, M E

    1988-01-01

    The complete amino acid sequence of versutoxin, a lethal neurotoxic polypeptide isolated from the venom of male and female funnel-web spiders of the species Atrax versutus, was determined. Sequencing was performed in a gas-phase protein sequencer by automated Edman degradation of the S-carboxymethylated toxin and fragments of it produced by reaction with CNBr. Versutoxin consisted of a single chain of 42 amino acid residues. It was found to have a high proportion of basic residues and of cystine. The primary structure showed marked homology with that of robustoxin, a novel neurotoxin recently isolated from the venom of another funnel-web-spider species, Atrax robustus. PMID:3355530

  13. On the necessity of dissecting sequence similarity scores into segment-specific contributions for inferring protein homology, function prediction and annotation

    PubMed Central

    2014-01-01

    Background Protein sequence similarities to any types of non-globular segments (coiled coils, low complexity regions, transmembrane regions, long loops, etc. where either positional sequence conservation is the result of a very simple, physically induced pattern or rather integral sequence properties are critical) are pertinent sources for mistaken homologies. Regretfully, these considerations regularly escape attention in large-scale annotation studies since, often, there is no substitute to manual handling of these cases. Quantitative criteria are required to suppress events of function annotation transfer as a result of false homology assignments. Results The sequence homology concept is based on the similarity comparison between the structural elements, the basic building blocks for conferring the overall fold of a protein. We propose to dissect the total similarity score into fold-critical and other, remaining contributions and suggest that, for a valid homology statement, the fold-relevant score contribution should at least be significant on its own. As part of the article, we provide the DissectHMMER software program for dissecting HMMER2/3 scores into segment-specific contributions. We show that DissectHMMER reproduces HMMER2/3 scores with sufficient accuracy and that it is useful in automated decisions about homology for instructive sequence examples. To generalize the dissection concept for cases without 3D structural information, we find that a dissection based on alignment quality is an appropriate surrogate. The approach was applied to a large-scale study of SMART and PFAM domains in the space of seed sequences and in the space of UniProt/SwissProt. Conclusions Sequence similarity core dissection with regard to fold-critical and other contributions systematically suppresses false hits and, additionally, recovers previously obscured homology relationships such as the one between aquaporins and formate/nitrite transporters that, so far, was only

  14. Nucleotide and derived amino acid sequences of the major porin of Comamonas acidovorans and comparison of porin primary structures.

    PubMed Central

    Gerbl-Rieger, S; Peters, J; Kellermann, J; Lottspeich, F; Baumeister, W

    1991-01-01

    The DNA sequence of the gene which codes for the major outer membrane porin (Omp32) of Comamonas acidovorans has been determined. The structural gene encodes a precursor consisting of 351 amino acid residues with a signal peptide of 19 amino acid residues. Comparisons with amino acid sequences of outer membrane proteins and porins from several other members of the class Proteobacteria and of the Chlamydia trachomatis porin and the Neurospora crassa mitochondrial porin revealed a motif of eight regions of local homology. The results of this analysis are discussed with regard to common structural features of porins. PMID:1848840

  15. Homology Modeling of Human γ-Butyric Acid Transporters and the Binding of Pro-Drugs 5-Aminolevulinic Acid and Methyl Aminolevulinic Acid Used in Photodynamic Therapy

    PubMed Central

    Baglo, Yan; Gabrielsen, Mari; Sylte, Ingebrigt; Gederaas, Odrun A.

    2013-01-01

    Photodynamic therapy (PDT) is a safe and effective method currently used in the treatment of skin cancer. In ALA-based PDT, 5-aminolevulinic acid (ALA), or ALA esters, are used as pro-drugs to induce the formation of the potent photosensitizer protoporphyrin IX (PpIX). Activation of PpIX by light causes the formation of reactive oxygen species (ROS) and toxic responses. Studies have indicated that ALA and its methyl ester (MAL) are taken up into the cells via γ-butyric acid (GABA) transporters (GATs). Uptake via GATs into peripheral sensory nerve endings may also account for one of the few adverse side effects of ALA-based PDT, namely pain. In the present study, homology models of the four human GAT subtypes were constructed using three x-ray crystal structures of the homologous leucine transporter (LeuT) as templates. Binding of the native substrate GABA and the possible substrates ALA and MAL was investigated by molecular docking of the ligands into the central putative substrate binding sites in the outward-occluded GAT models. Electrostatic potentials (ESPs) of the putative substrate translocation pathway of each subtype were calculated using the outward-open and inward-open homology models. Our results suggested that ALA is a substrate of all four GATs and that MAL is a substrate of GAT-2, GAT-3 and BGT-1. The ESP calculations indicated that differences likely exist in the entry pathway of the transporters (i.e. in outward-open conformations). Such differences may be exploited for development of inhibitors that selectively target specific GAT subtypes and the homology models may hence provide tools for design of therapeutic inhibitors that can be used to reduce ALA-induced pain. PMID:23762315

  16. Mutations of the Corynebacterium glutamicum NCgl1221 gene, encoding a mechanosensitive channel homolog, induce L-glutamic acid production.

    PubMed

    Nakamura, Jun; Hirano, Seiko; Ito, Hisao; Wachi, Masaaki

    2007-07-01

    Corynebacterium glutamicum is a biotin auxotroph that secretes L-glutamic acid in response to biotin limitation; this process is employed in industrial L-glutamic acid production. Fatty acid ester surfactants and penicillin also induce L-glutamic acid secretion, even in the presence of biotin. However, the mechanism of L-glutamic acid secretion remains unclear. It was recently reported that disruption of odhA, encoding a subunit of the 2-oxoglutarate dehydrogenase complex, resulted in L-glutamic acid secretion without induction. In this study, we analyzed odhA disruptants and found that those which exhibited constitutive L-glutamic acid secretion carried additional mutations in the NCgl1221 gene, which encodes a mechanosensitive channel homolog. These NCgl1221 gene mutations lead to constitutive L-glutamic acid secretion even in the absence of odhA disruption and also render cells resistant to an L-glutamic acid analog, 4-fluoroglutamic acid. Disruption of the NCgl1221 gene essentially abolishes L-glutamic acid secretion, causing an increase in the intracellular L-glutamic acid pool under biotin-limiting conditions, while amplification of the wild-type NCgl1221 gene increased L-glutamate secretion, although only in response to induction. These results suggest that the NCgl1221 gene encodes an L-glutamic acid exporter. We propose that treatments that induce L-glutamic acid secretion alter membrane tension and trigger a structural transformation of the NCgl1221 protein, enabling it to export L-glutamic acid.

  17. Purification of the integration host factor homolog of Rhodobacter capsulatus: cloning and sequencing of the hip gene, which encodes the beta subunit.

    PubMed Central

    Toussaint, B; Delic-Attree, I; De Sury D'Aspremont, R; David, L; Vinçon, M; Vignais, P M

    1993-01-01

    We describe a method for rapid purification of the integration host factor (IHF) homolog of Rhodobacter capsulatus that has allowed us to obtain microgram quantities of highly purified protein. R. capsulatus IHF is an alpha beta heterodimer similar to IHF of Escherichia coli. We have cloned and sequenced the hip gene, which encodes the beta subunit. The deduced amino acid sequence (10.7 kDa) has 46% identity with the beta subunit of IHF from E. coli. In gel electrophoretic mobility shift DNA binding assays, R. capsulatus IHF was able to form a stable complex in a site-specific manner with a DNA fragment isolated from the promoter of the structural hupSL operon, which contains the IHF-binding site. The mutated IHF protein isolated from the Hup- mutant IR4, which is mutated in the himA gene (coding for the alpha subunit), gave a shifted band of greater mobility, and DNase I footprinting analysis has shown that the mutated IHF interacts with the DNA fragment from the hupSL promoter region differently from the way that the wild-type IHF does. Images PMID:8407826

  18. Identification of MicroRNAs in Helicoverpa armigera and Spodoptera litura Based on Deep Sequencing and Homology Analysis

    PubMed Central

    Ge, Xie; Zhang, Yong; Jiang, Jianhao; Zhong, Yi; Yang, Xiaonan; Li, Zhiqian; Huang, Yongping; Tan, Anjiang

    2013-01-01

    The current identification of microRNAs (miRNAs) in insects is largely dependent on genome sequences. However, the lack of available genome sequences inhibits the identification of miRNAs in various insect species. In this study, we used a miRNA database of the silkworm Bombyx mori as a reference to identify miRNAs in Helicoverpa armigera and Spodoptera litura using deep sequencing and homology analysis. Because all three species belong to the Lepidoptera, the experiment produced reliable results. Our study identified 97 and 91 conserved miRNAs in H. armigera and S. litura, respectively. Using the genome of B. mori and BAC sequences of H. armigera as references, 1 novel miRNA and 8 novel miRNA candidates were identified in H. armigera, and 4 novel miRNA candidates were identified in S. litura. An evolutionary analysis revealed that most of the identified miRNAs were insect-specific, and more than 20 miRNAs were Lepidoptera-specific. The investigation of the expression patterns of miR-2a, miR-34, miR-2796-3p and miR-11 revealed their potential roles in insect development. miRNA target prediction revealed that conserved miRNA target sites exist in various genes in the 3 species. Conserved miRNA target sites for the Hsp90 gene among the 3 species were validated in the mammalian 293T cell line using a dual-luciferase reporter assay. Our study provides a new approach with which to identify miRNAs in insects lacking genome information and contributes to the functional analysis of insect miRNAs. PMID:23289012

  19. Purification and N-terminal amino acid sequence comparisons of structural proteins from retrovirus-D/Washington and Mason-Pfizer monkey virus.

    PubMed Central

    Henderson, L E; Sowder, R; Smythers, G; Benveniste, R E; Oroszlan, S

    1985-01-01

    A new D-type retrovirus originally designated SAIDS-D/Washington and here referred to as retrovirus-D/Washington (R-D/W) was recently isolated at the University of Washington Primate Center, Seattle, Wash., from a rhesus monkey with an acquired immunodeficiency syndrome and retroperitoneal fibromatosis. To better establish the relationship of this new D-type virus to the prototype D-type virus, Mason-Pfizer monkey virus (MPMV), we have purified and compared six structural proteins from each virus. The proteins purified from each D-type retrovirus include p4, p10, p12, p14, p27, and a phosphoprotein designated pp18 for MPMV and pp20 for R-D/W. Amino acid analysis and N-terminal amino acid sequence analysis show that the p4, p12, p14, and p27 proteins of R-D/W are distinct from the homologous proteins of MPMV but that these proteins from the two different viruses share a high degree of amino acid sequence homology. The p10 proteins from the two viruses have similar amino acid compositions, and both are blocked to N-terminal Edman degradation. The phosphoproteins from the two viruses each contain phosphoserine but are different from each other in amino acid composition, molecular weight, and N-terminal amino acid sequence. The data thus show that each of the R-D/W proteins examined is distinguishable from its MPMV homolog and that a major difference between these two D-type retroviruses is found in the viral phosphoproteins. The N-terminal amino acid sequences of D-type retroviral proteins were used to search for sequence homologies between D-type and other retroviral amino acid sequences. An unexpected amino acid sequence homology was found between R-D/W pp20 (a gag protein) and a 28-residue segment of the env precursor polyprotein of Rous sarcoma virus. The N-terminal amino acid sequences of the D-type major gag protein (p27) and the nucleic acid-binding protein (p14) show only limited amino acid sequence homology to functionally homologous proteins of C

  20. A novel antimicrobial protein isolated from potato (Solanum tuberosum) shares homology with an acid phosphatase.

    PubMed Central

    Feng, Jie; Yuan, Fenghua; Gao, Yin; Liang, Chenggang; Xu, Jin; Zhang, Changling; He, Liyuan

    2003-01-01

    The nucleotide and amino acids sequences for AP(1) will appear in the GenBank(R) and NCBI databases under accession number AY297449. A novel antimicrobial protein (AP(1)) was purified from leaves of the potato ( Solanum tuberosum, variety MS-42.3) with a procedure involving ammonium sulphate fractionation, molecular sieve chromatography with Sephacryl S-200 and hydrophobic chromatography with Butyl-Sepharose using a FPLC system. The inhibition spectrum investigation showed that AP(1) had good inhibition activity against five different strains of Ralstonia solanacearum from potato or other crops, and two fungal pathogens, Rhizoctonia solani and Alternaria solani from potato. The full-length cDNA encoding AP(1) has been successfully cloned by screening a cDNA expression library of potato with an anti-AP(1) antibody and RACE (rapid amplification of cDNA ends) PCR. Determination of the nucleotide sequences revealed the presence of an open reading frame encoding 343 amino acids. At the C-terminus of AP(1) there is an ATP-binding domain, and the N-terminus exhibits 58% identity with an/the acid phosphatase from Mesorhizobium loti. SDS/PAGE and Western blotting analysis suggested that the AP(1) gene can be successfully expressed in Escherichia coli and recognized by an antibody against AP(1). Also the expressed protein showed an inhibition activity the same as original AP(1) protein isolated from potato. We suggest that AP(1) most likely belongs to a new group of proteins with antimicrobial characteristics in vitro and functions in relation to phosphorylation and energy metabolism of plants. PMID:12927022

  1. PSI/TM-Coffee: a web server for fast and accurate multiple sequence alignments of regular and transmembrane proteins using homology extension on reduced databases

    PubMed Central

    Floden, Evan W.; Tommaso, Paolo D.; Chatzou, Maria; Magis, Cedrik; Notredame, Cedric; Chang, Jia-Ming

    2016-01-01

    The PSI/TM-Coffee web server performs multiple sequence alignment (MSA) of proteins by combining homology extension with a consistency based alignment approach. Homology extension is performed with Position Specific Iterative (PSI) BLAST searches against a choice of redundant and non-redundant databases. The main novelty of this server is to allow databases of reduced complexity to rapidly perform homology extension. This server also gives the possibility to use transmembrane proteins (TMPs) reference databases to allow even faster homology extension on this important category of proteins. Aside from an MSA, the server also outputs topological prediction of TMPs using the HMMTOP algorithm. Previous benchmarking of the method has shown this approach outperforms the most accurate alignment methods such as MSAProbs, Kalign, PROMALS, MAFFT, ProbCons and PRALINE™. The web server is available at http://tcoffee.crg.cat/tmcoffee. PMID:27106060

  2. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2006-07-04

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  3. Methods and compositions for efficient nucleic acid sequencing

    DOEpatents

    Drmanac, Radoje

    2002-01-01

    Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.

  4. Kit for detecting nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2001-01-01

    A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the

  5. Relationships in the Caryophyllales as suggested by phylogenetic analyses of partial chloroplast DNA ORF2280 homolog sequences.

    PubMed

    Downie, S; Katz-Downie, D; Cho, K

    1997-02-01

    Phylogenetic relationships within the angiosperm order Caryophyllales were investigated by comparative sequencing of two portions of the highly conserved inverted repeat (totaling some 1100 base pairs) coinciding with the region occupied by ORF2280 in Nicotiana, the largest gene in the plastid genomes of most land plants. Data were obtained for 33 species in 11 families within the order and for one species each of Plumbaginaceae, Polygonaceae, and Nepenthaceae. These data, when analyzed along with previously published ORF (open reading frame) sequences from Nicotiana. Spinacia. Epifagus, and Pelargonium using parsimony, neighbor-joining, and maximum likelihood methods, reveal that: (1) Amaranthus, Celosia, and Froelichia (all Amaranthaceae) do not comprise a monophyletic group; (2) Amaranthus may be nested within a paraphyletic Chenopodiaceae; (3) Sarcobatus (Chenopodiaceae) is allied with Nyctaginaceae + Phytolaccaceae (the latter family excluding Stegnosperma but including Petiveria); and (4) Caryophyllaceae (with Corrigiola basal within the clade) are sister group to Chenopodiaceae + Amaranthaceae. Basal relations within the order remain obscure. Sequence divergence values in pairwise comparisons across all Caryophyllales taxa ranged from 0.1 to 5% of nucleotides. However, despite these low values, 23 insertion and deletion events were apparent, of which five were informative phylogenetically and bolstered several of the relationships listed above. A polymerase chain reaction (PCR) survey for ORF homolog length variants in representatives from 70 additional angiosperm families revealed major deletions, of 100 to 1400 base pairs, in 19 of these families. Although the ORF is located within the mutationally retarded inverted repeat region of most angiosperm chloroplast DNAs, this gene appears particularly prone to length mutation.

  6. Complete sequence analysis of cDNA clones encoding rat whey phosphoprotein: homology to a protease inhibitor.

    PubMed

    Dandekar, A M; Robinson, E A; Appella, E; Qasba, P K

    1982-07-01

    Lactoprotein clones have been isolated from a rat mammary gland recombinant library of cDNA plasmids. Clones p-Wp 52 and p-Wp 47 were shown by hybrid selection, in vitro translation, and immunoprecipitation to represent a cloned DNA sequence encoding rat whey phosphoprotein. We report here the nucleotide sequence of the cDNA insert of p-Wp 52 and shows that it encodes the complete whey phosphoprotein sequence. The encoded sequence shows a high content of half-cystine, glutamic acid, aspartic acid, and serine but an absence of tyrosine. The half-cystines appear in unique arrangements and are repeated in two domains of the protein. The second domain has striking similarities with the second domain of the red sea turtle protease inhibitor. Clone p-Wp 52 has allowed the study of expression of whey phosphoprotein mRNA during functional differentiation of rat mammary gland and in mammary tumors. The whey phosphoprotein mRNA is detected during midpregnancy and lactation in the rat mammary gland but is barely detected in mammary tumors in which other milk protein mRNAs are expressed. The whey phosphoprotein gene in these tumors is hypermethylated, correlating with the reduced expression of this gene.

  7. Complete Unique Genome Sequence, Expression Profile, and Salivary Gland Tissue Tropism of the Herpesvirus 7 Homolog in Pigtailed Macaques

    PubMed Central

    Staheli, Jeannette P.; Dyen, Michael R.; Deutsch, Gail H.; Basom, Ryan S.; Fitzgibbon, Matthew P.; Lewis, Patrick

    2016-01-01

    ABSTRACT Human herpesvirus 6A (HHV-6A), HHV-6B, and HHV-7 are classified as roseoloviruses and are highly prevalent in the human population. Roseolovirus reactivation in an immunocompromised host can cause severe pathologies. While the pathogenic potential of HHV-7 is unclear, it can reactivate HHV-6 from latency and thus contributes to severe pathological conditions associated with HHV-6. Because of the ubiquitous nature of roseoloviruses, their roles in such interactions and the resulting pathological consequences have been difficult to study. Furthermore, the lack of a relevant animal model for HHV-7 infection has hindered a better understanding of its contribution to roseolovirus-associated diseases. Using next-generation sequencing analysis, we characterized the unique genome of an uncultured novel pigtailed macaque roseolovirus. Detailed genomic analysis revealed the presence of gene homologs to all 84 known HHV-7 open reading frames. Phylogenetic analysis confirmed that the virus is a macaque homolog of HHV-7, which we have provisionally named Macaca nemestrina herpesvirus 7 (MneHV7). Using high-throughput RNA sequencing, we observed that the salivary gland tissue samples from nine different macaques had distinct MneHV7 gene expression patterns and that the overall number of viral transcripts correlated with viral loads in parotid gland tissue and saliva. Immunohistochemistry staining confirmed that, like HHV-7, MneHV7 exhibits a natural tropism for salivary gland ductal cells. We also observed staining for MneHV7 in peripheral nerve ganglia present in salivary gland tissues, suggesting that HHV-7 may also have a tropism for the peripheral nervous system. Our data demonstrate that MneHV7-infected macaques represent a relevant animal model that may help clarify the causality between roseolovirus reactivation and diseases. IMPORTANCE Human herpesvirus 6A (HHV-6A), HHV-6B, and HHV-7 are classified as roseoloviruses. We have recently discovered that pigtailed

  8. Analysis and Annotation of Nucleic Acid Sequence

    SciTech Connect

    States, David J.

    2004-07-28

    The aims of this project were to develop improved methods for computational genome annotation and to apply these methods to improve the annotation of genomic sequence data with a specific focus on human genome sequencing. The project resulted in a substantial body of published work. Notable contributions of this project were the identification of basecalling and lane tracking as error processes in genome sequencing and contributions to improved methods for these steps in genome sequencing. This technology improved the accuracy and throughput of genome sequence analysis. Probabilistic methods for physical map construction were developed. Improved methods for sequence alignment, alternative splicing analysis, promoter identification and NF kappa B response gene prediction were also developed.

  9. Structure of REC2, a recombinational repair gene of Ustilago maydis, and its function in homologous recombination between plasmid and chromosomal sequences.

    PubMed Central

    Rubin, B P; Ferguson, D O; Holloman, W K

    1994-01-01

    Mutation in the REC2 gene of Ustilago maydis leads to defects in DNA repair, recombination, and meiosis. Analysis of the primary sequence of the Rec2 protein reveals a region with significant homology to bacterial RecA protein and to the yeast recombination proteins Dmc1, Rad51, and Rad57. This homologous region in the U. maydis Rec2 protein was found to be functionally sensitive to mutation, lending support to the hypothesis that Rec2 has a functional RecA-like domain essential for activity in recombination and repair. Homologous recombination between plasmid and chromosomal DNA sequences is reduced substantially in the rec2 mutant following transformation. The frequency can be restored to a level approaching, but not exceeding, that observed in the wild-type strain if transformation is performed with cells containing multiple copies of REC2. Images PMID:8065360

  10. Complete amino acid sequence of the N-terminal extension of calf skin type III procollagen.

    PubMed Central

    Brandt, A; Glanville, R W; Hörlein, D; Bruckner, P; Timpl, R; Fietzek, P P; Kühn, K

    1984-01-01

    The N-terminal extension peptide of type III procollagen, isolated from foetal-calf skin, contains 130 amino acid residues. To determine its amino acid sequence, the peptide was reduced and carboxymethylated or aminoethylated and fragmented with trypsin, Staphylococcus aureus V8 proteinase and bacterial collagenase. Pyroglutamate aminopeptidase was used to deblock the N-terminal collagenase fragment to enable amino acid sequencing. The type III collagen extension peptide is homologous to that of the alpha 1 chain of type I procollagen with respect to a three-domain structure. The N-terminal 79 amino acids, which contain ten of the 12 cysteine residues, form a compact globular domain. The next 39 amino acids are in a collagenase triplet sequence (Gly- Xaa - Yaa )n with a high hydroxyproline content. Finally, another short non-collagenous domain of 12 amino acids ends at the cleavage site for procollagen aminopeptidase, which cleaves a proline-glutamine bond. In contrast with type I procollagen, the type III procollagen extension peptides contain interchain disulphide bridges located at the C-terminus of the triple-helical domain. PMID:6331392

  11. Homology among acid proteases: comparison of crystal structures at 3A resolution of acid proteases from Rhizopus chinensis and Endothia parasitica.

    PubMed Central

    Subramanian, E; Swan, I D; Liu, M; Davies, D R; Jenkins, J A; Tickle, I J; Blundell, T L

    1977-01-01

    The molecular structures of two fungal acid proteases at 3 A resolution have been compared, and found to have similar secondary and tertiary folding. These enzymes are bilobal and have a pronounced cleft between the two lobes. This cleft has been identified as the active site region from inhibitor binding studies. The results of the comparison are discussed in terms of homology among the acid proteases in general. Images PMID:322132

  12. Nucleotide sequence of the Klebsiella pneumoniae nifD gene and predicted amino acid sequence of the alpha-subunit of nitrogenase MoFe protein.

    PubMed Central

    Ioannidis, I; Buck, M

    1987-01-01

    The nucleotide sequence of the Klebsiella pneumoniae nifD gene is presented and together with the accompanying paper [Holland, Zilberstein, Zamir & Sussman (1987) Biochem. J. 247, 277-285] completes the sequence of the nifHDK genes encoding the nitrogenase polypeptides. The K. pneumoniae nifD gene encodes the 483-amino acid-residue nitrogenase alpha-subunit polypeptide of Mr 54156. The alpha-subunit has five strongly conserved cysteine residues at positions 63, 89, 155, 184 and 275, some occurring in a region showing both primary sequence and potential structural homology to the K. pneumoniae nitrogenase beta-subunit. A comparison with six other alpha-subunit amino acid sequences has been made, which indicates a number of potentially important domains within alpha-subunits. PMID:3322262

  13. Evolution of lactate dehydrogenase-A homologs of barracuda fishes (genus Sphyraena) from different thermal environments: differences in kinetic properties and thermal stability are due to amino acid substitutions outside the active site.

    PubMed

    Holland, L Z; McFall-Ngai, M; Somero, G N

    1997-03-18

    Orthologous homologs of lactate dehydrogenase-A (LDH-A) (EC 1.1.1.27; NAD+:lactate oxidoreductase) of six barracuda species (genus Sphyraena) display differences in Michaelis-Menten constants (apparent Km) for substrate (pyruvate) and cofactor (NADH) that reflect evolution at different habitat temperatures. Significant increases in Km with increasing measurement temperature occur for all homologs, yet Km at normal body temperatures is similar among species because of the inverse relationship between adaptation temperature and Km. Thermal stabilities of the homologs also differ. To determine the amino acid substitutions responsible for differences in Km and thermal stability, peptide mapping of the LDH-As of all six species was first performed. Then, the amino acid sequences of the three homologs having the most similar peptide maps, those of the north temperate species, S. argentea, the subtropical species, S. lucasana, and the south temperate species, S. idiastes, were deduced from the respective cDNA sequences. At most, there were four amino acid substitutions between any pair of species, none of which occurred in the loop or substrate binding sites of the enzymes. The sequence of LDH-A from S. lucasana differs from that of S. idiastes only at position 8. The homolog of S. argentea differs from the other two sequences at positions 8, 61, 68, and 223. We used a full-length cDNA clone of LDH-A of S. lucasana to test, by site-directed mutagenesis, the importance of these sequence changes in establishing the observed differences in kinetics and thermal stability. Differences in sequence at sites 61 and/or 68 appear to account for the differences in Km between the LDH-As of S. argentea and S. lucasana. Differences at position 8 appear to account for the difference in thermal stability between the homologs of S. argentea and S. lucasana. Evolutionary adaptation of proteins to temperature thus may be achieved by minor changes in sequence at locations outside of active

  14. Sequence homologies between Mycoplasma and Chlamydia spp. lead to false-positive results in chlamydial cell cultures tested for mycoplasma contamination with a commercial PCR assay.

    PubMed

    Maass, Viola; Kern, Jan Marco; Poeckl, Matthias; Maass, Matthias

    2011-10-01

    Mycoplasma contamination is a frequent problem in chlamydial cell culture. After obtaining contradictory contamination results, we compared three commercial PCR kits for mycoplasma detection. One kit signaled contamination in mycoplasma-free Chlamydia pneumoniae cultures. Sequencing of cloned PCR products revealed primer homology with the chlamydial genome as the basis of this false-positive result.

  15. Amino acid sequence of myoglobin from emu (Dromaius novaehollandiae) skeletal muscle.

    PubMed

    Suman, S P; Joseph, P; Li, S; Beach, C M; Fontaine, M; Steinke, L

    2010-11-01

    The objective of the present study was to characterize the primary structure of emu myoglobin (Mb). Emu Mb was isolated from Iliofibularis muscle employing gel-filtration chromatography. Matrix Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry was employed to determine the exact molecular mass of emu Mb in comparison with horse Mb, and Edman degradation was utilized to characterize the amino acid sequence. The molecular mass of emu Mb was 17,380 Da and was close to those reported for ratite and poultry myoglobins. Similar to myoglobins from meat-producing livestock and birds, emu Mb has 153 amino acids. Emu Mb contains 9 histidines. Proximal and distal histidines, responsible for coordinating oxygen-binding property of Mb, are conserved in emu. Emu Mb shared more than 90% homology with ratite and chicken myoglobins, whereas it demonstrated only less than 70% sequence similarity with ruminant myoglobins.

  16. Definition of the tempo of sequence diversity across an alignment and automatic identification of sequence motifs: Application to protein homologous families and superfamilies

    PubMed Central

    May, Alex C.W.

    2002-01-01

    It is often possible to identify sequence motifs that characterize a protein family in terms of its fold and/or function from aligned protein sequences. Such motifs can be used to search for new family members. Partitioning of sequence alignments into regions of similar amino acid variability is usually done by hand. Here, I present a completely automatic method for this purpose: one that is guaranteed to produce globally optimal solutions at all levels of partition granularity. The method is used to compare the tempo of sequence diversity across reliable three-dimensional (3D) structure-based alignments of 209 protein families (HOMSTRAD) and that for 69 superfamilies (CAMPASS). (The mean alignment length for HOMSTRAD and CAMPASS are very similar.) Surprisingly, the optimal segmentation distributions for the closely related proteins and distantly related ones are found to be very similar. Also, optimal segmentation identifies an unusual protein superfamily. Finally, protein 3D structure clues from the tempo of sequence diversity across alignments are examined. The method is general, and could be applied to any area of comparative biological sequence and 3D structure analysis where the constraint of the inherent linear organization of the data imposes an ordering on the set of objects to be clustered. PMID:12441381

  17. Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC

    NASA Astrophysics Data System (ADS)

    Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.

    2000-02-01

    Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.

  18. Four novel sequences in Drosophila melanogaster homologous to the auxiliary Para sodium channel subunit TipE.

    PubMed

    Derst, Christian; Walther, Christian; Veh, Rüdiger W; Wicher, Dieter; Heinemann, Stefan H

    2006-01-20

    TipE is an auxiliary subunit of the Drosophila Para sodium channel. Here we describe four sequences, TEH1-4, homologous to TipE in the Drosophila melanogaster genome, harboring all typical structures of both TipE and the beta-Subunit family of big-conductance Ca(2+)-activated potassium channels: short cytosolic N- and C-terminal stretches, two transmembrane domains, and a large extracellular loop with two disulfide bonds. Whereas TEH1 and TEH2 lack the TipE-specific extension in the extracellular loop, both TEH3 and TEH4 possess two extracellular EGF-like domains. A CNS-specific expression was found for TEH1, while TEH2-4 were more widely expressed. The genes for TEH2-4 are localized close to the tipE gene on chromosome 3L. Coexpression of TEH subunits with Para in Xenopus oocytes showed a strong (30-fold, TEH1), medium (5- to 10-fold, TEH2 and TEH3), or no (TEH4) increase in sodium current amplitude, while TipE increased the current 20-fold. In addition, steady-state inactivation and the recovery from fast inactivation were altered by coexpression of Para with TEH1. We conclude that members of the TEH-family are auxiliary subunits for Para sodium channels and possibly other ion channels.

  19. Homologous SV40 RNA trans-splicing: a new mechanism for diversification of viral sequences and phenotypes.

    PubMed

    Eul, Joachim; Patzel, Volker

    2013-11-01

    Simian Virus 40 (SV40) is a polyomavirus found in both monkeys and humans, which causes cancer in some animal models. In humans, SV40 has been reported to be associated with cancers but causality has not been proven yet. The transforming activity of SV40 is mainly due to its 94-kD large T antigen, which binds to the retinoblastoma (pRb) and p53 tumor suppressor proteins, and thereby perturbs their functions. Here we describe a 100 kD super T antigen harboring a duplication of the pRB binding domain that was associated with unusual high cell transformation activity and that was generated by a novel mechanism involving homologous RNA trans-splicing of SV40 early transcripts in transformed rodent cells. Enhanced trans-splice activity was observed in clones carrying a single point mutation in the large T antigen 5' donor splice site (ss). This mutation impaired cis-splicing in favor of an alternative trans-splice reaction via a cryptic 5'ss within a second cis-spliced SV40 pre-mRNA molecule and enabled detectable gene expression. Next to the cryptic 5'ss we identified additional trans-splice helper functions, including putative dimerization domains and a splice enhancer sequence. Our findings suggest RNA trans-splicing as a SV40-intrinsic mechanism that supports the diversification of viral RNA and phenotypes.

  20. Structural homologies among the hemopoietins.

    PubMed Central

    Schrader, J W; Ziltener, H J; Leslie, K B

    1986-01-01

    A group of cytokines characterized by a common set of target cells--namely, the pluripotential hemopoietic stem cells or their cellular derivatives--share similarities in the amino acid sequence at their N terminus or in the putative signal peptide immediately prior to the published N terminus. Murine P-cell-stimulating factor (PSF), murine and human interleukin 2 (IL-2), murine and human granulocyte-macrophage colony-stimulating factor (GM-CSF), human erythropoietin, and human interleukin 1 beta all share alanine as the N-terminal amino acid and have some similarities in the succeeding three or four amino acids. In the case of murine PSF and GM-CSF, the six N-terminal amino acids are readily cleaved from mature molecules and are lacking from the N-terminal amino acid sequences reported initially. A sixth cytokine, colony-stimulating factor 1, has an alanine followed by a similar pattern of five amino acids at the end of the putative signal peptide. GM-CSF and IL-2 have more extensive homology, about 25% of residues being identical in three regions that comprise about 70% of the molecules. Only minor similarities of uncertain significance were found among the complete amino acid sequences of the other cytokines. Although its evolutionary origin is uncertain, the homology around the N terminus may provide a structural marker for a group of cytokines active on the pluripotential hemopoietic stem cell and its derivatives. PMID:3085095

  1. The amino acid sequence around the active-site cysteine and histidine residues, and the buried cysteine residue in ficin.

    PubMed

    Husain, S S; Lowe, G

    1970-04-01

    Ficin that had been prepared from the latex of Ficus glabrata by salt fractionation and chromatography on carboxymethylcellulose was completely and irreversibly inhibited with 1,3-dibromo[2-(14)C]acetone and then treated with N-(4-dimethylamino-3,5-dinitrophenyl)maleimide in 6m-guanidinium chloride. After reduction and carboxymethylation of the labelled protein, it was digested with trypsin and alpha-chymotrypsin. Two radioactive peptides and two coloured peptides were isolated chromatographically and their sequences determined. The radioactive peptides revealed the amino acid sequences around the active-site cysteine and histidine residues and showed a high degree of homology with the omino acid sequence around the active-site cysteine and histidine residues in papain. The coloured peptides allowed the amino acid sequence around the buried cysteine residue in ficin to be determined.

  2. Amino acid sequence of mouse submaxillary gland renin.

    PubMed Central

    Misono, K S; Chang, J J; Inagami, T

    1982-01-01

    The complete amino acid sequences of the heavy chain and light chain of mouse submaxillary gland renin have been determined. The heavy chain consists of 288 amino acid residues having a Mr of 31,036 calculated from the sequence. The light chain contains 48 amino acid residues with a Mr of 5,458. The sequence of the heavy chain was determined by automated Edman degradations of the cyanogen bromide peptides and tryptic peptides generated after citraconylation, as well as other peptides generated therefrom. The sequence of the light chain was derived from sequence analyses of the peptides generated by cyanogen bromide cleavage or by digestion with Staphylococcus aureus protease. The sequences in the active site regions in renin containing two catalytically essential aspartyl residues 32 and 215 were found identical with those in pepsin, chymosin, and penicillopepsin. Comparison of the amino acid sequence of renin with that of porcine pepsin indicated a 42% sequence identity of the heavy chain with the amino-terminal and middle regions and a 46% identity of the light chain with the carboxyl-terminal region of the porcine pepsin sequence. Residues identical in renin and pepsin are distributed throughout the length of the molecules, suggesting a similarity in their overall structures. PMID:6812055

  3. Increased fatty acid unsaturation and production of arachidonic acid by homologous over-expression of the mitochondrial malic enzyme in Mortierella alpina.

    PubMed

    Hao, Guangfei; Chen, Haiqin; Du, Kai; Huang, Xiaoyun; Song, Yuanda; Gu, Zhennan; Wang, Lei; Zhang, Hao; Chen, Wei; Chen, Yong Q

    2014-09-01

    Malic enzyme (ME) catalyses the oxidative decarboxylation of L-malate to pyruvate and provides NADPH for intracellular metabolism, such as fatty acid synthesis. Here, the mitochondrial ME (mME) gene from Mortierella alpina was homologously over-expressed. Compared with controls, fungal arachidonic acid (ARA; 20:4 n-6) content increased by 60 % without affecting the total fatty acid content. Our results suggest that enhancing mME activity may be an effective mean to increase industrial production of ARA in M. alpina.

  4. Nucleotide sequence of the hexA gene for DNA mismatch repair in Streptococcus pneumoniae and homology of hexA to mutS of Escherichia coli and Salmonella typhimurium

    SciTech Connect

    Priebe, S.D.; Hadi, S.M.; Greenberg, B.; Lacks, S.A.

    1988-01-01

    The Hex system of heteroduplex DNA base mismatch repair operates in Streptococcus pneumoniae after transformation and replication to correct donor and nascent DNA strands, respectively. A functionally similar system, called Mut, operates in Escherichia coli and Salmonella typhimurium. The nucleotide sequence of a 3.8-kilobase segment from the S. pneumoniae chromosome that includes the 2.7-kilobase hexA gene was determined. Chromosomal DNA used as donor to measure Hex phenotype was irradiated with UV light. An open reading frame that could encode a 17-kilodalton polypeptide (OrfC) was located just upstream of the gene encoding a polypeptide of 95 kilodaltons corresponding to HexA. Shine-Dalgarno sequences and putative promoters were identified upstream of each protein start site. Insertion mutations showed that only HexA functioned in mismatch repair and that the promoter for hexA transcription was located within the OrfC-coding region. The HexA polypeptide contains a consensus sequence for ATP- or GTP-binding sites in proteins. Comparison of the entire HexA protein sequence to that of MutS of S. typhimurium, showed the proteins to be homologous, inasmuch as 36% of their amino acid residues were identical. This homology indicates that the Hex and Mut systems of mismatch repair evolved from an ancestor common to the gram-positive streptococci and the gram-negative enterobacteria. It is the first direct evidence linking the two systems.

  5. High-resolution melt analysis to detect sequence variations in highly homologous gene regions: application to CYP2B6.

    PubMed

    Twist, Greyson P; Gaedigk, Roger; Leeder, J Steven; Gaedigk, Andrea

    2013-06-01

    High-resolution melt (HRM) analysis using 'release-on-demand' dyes, such as EvaGreen(®) has the potential to resolve complex genotypes in situations where genotype interpretation is complicated by the presence of pseudogenes or allelic variants in close proximity to the locus of interest. We explored the utility of HRM to genotype a SNP (785A>G, K262R, rs2279343) that is located within exon 5 of the CYP2B6 gene, which contributes to the metabolism of a number of clinically used drugs. Testing of 785A>G is challenging, but crucial for accurate genotype determination. This SNP is part of multiple known CYP2B6 haplotypes and located in a region that is identical to CYP2B7, a nonfunctional pseudogene. Because small CYP2B6-specific PCR amplicons bracketing 785A>G cannot be generated, we simultaneously amplified both genes. A panel of 235 liver tissue DNAs and five Coriell samples were assessed. Eight CYP2B6/CYP2B7 diplotype combinations were found and a novel variant 769G>A (D257N) was discovered. The frequency of 785G corresponded to those reported for Caucasians and African-Americans. Assay performance was confirmed by CYP2B6 and/or CYP2B7 sequence analysis in a subset of samples, using a preamplified CYP2B6-specific long-range-PCR amplicon as HRM template. Inclusion rather than exclusion of a homologous pseudogene allowed us to devise a sensitive, reliable and affordable assay to test this CYP2B6 SNP. This assay design may be utilized to overcome the challenges and limitations of other methods. Owing to the flexibility of HRM, this assay design can easily be adapted to other gene loci of interest.

  6. DockRank: ranking docked conformations using partner-specific sequence homology-based protein interface prediction.

    PubMed

    Xue, Li C; Jordan, Rafael A; El-Manzalawy, Yasser; Dobbs, Drena; Honavar, Vasant

    2014-02-01

    Selecting near-native conformations from the immense number of conformations generated by docking programs remains a major challenge in molecular docking. We introduce DockRank, a novel approach to scoring docked conformations based on the degree to which the interface residues of the docked conformation match a set of predicted interface residues. DockRank uses interface residues predicted by partner-specific sequence homology-based protein-protein interface predictor (PS-HomPPI), which predicts the interface residues of a query protein with a specific interaction partner. We compared the performance of DockRank with several state-of-the-art docking scoring functions using Success Rate (the percentage of cases that have at least one near-native conformation among the top m conformations) and Hit Rate (the percentage of near-native conformations that are included among the top m conformations). In cases where it is possible to obtain partner-specific (PS) interface predictions from PS-HomPPI, DockRank consistently outperforms both (i) ZRank and IRAD, two state-of-the-art energy-based scoring functions (improving Success Rate by up to 4-fold); and (ii) Variants of DockRank that use predicted interface residues obtained from several protein interface predictors that do not take into account the binding partner in making interface predictions (improving success rate by up to 39-fold). The latter result underscores the importance of using partner-specific interface residues in scoring docked conformations. We show that DockRank, when used to re-rank the conformations returned by ClusPro, improves upon the original ClusPro rankings in terms of both Success Rate and Hit Rate. DockRank is available as a server at http://einstein.cs.iastate.edu/DockRank/.

  7. The complete amino acid sequence of a trypsin inhibitor from Bauhinia variegata var. candida seeds.

    PubMed

    Di Ciero, L; Oliva, M L; Torquato, R; Köhler, P; Weder, J K; Camillo Novello, J; Sampaio, C A; Oliveira, B; Marangoni, S

    1998-11-01

    Trypsin inhibitors of two varieties of Bauhinia variegata seeds have been isolated and characterized. Bauhinia variegata candida trypsin inhibitor (BvcTI) and B. variegata lilac trypsin inhibitor (BvlTI) are proteins with Mr of about 20,000 without free sulfhydryl groups. Amino acid analysis shows a high content of aspartic acid, glutamic acid, serine, and glycine, and a low content of histidine, tyrosine, methionine, and lysine in both inhibitors. Isoelectric focusing for both varieties detected three isoforms (pI 4.85, 5.00, and 5.15), which were resolved by HPLC procedure. The trypsin inhibitors show Ki values of 6.9 and 1.2 nM for BvcTI and BvlTI, respectively. The N-terminal sequences of the three trypsin inhibitor isoforms from both varieties of Bauhinia variegata and the complete amino acid sequence of B. variegata var. candida L. trypsin inhibitor isoform 3 (BvcTI-3) are presented. The sequences have been determined by automated Edman degradation of the reduced and carboxymethylated proteins of the peptides resulting from Staphylococcus aureus protease and trypsin digestion. BvcTI-3 is composed of 167 residues and has a calculated molecular mass of 18,529. Homology studies with other trypsin inhibitors show that BvcTI-3 belongs to the Kunitz family. The putative active site encompasses Arg (63)-Ile (64).

  8. Amino Acid Sequence of Human Cholinesterase

    DTIC Science & Technology

    1985-10-01

    liquid chromatography (HPLC). Activity testing of the aged, DFP-labeled cholinesterase showed that 99.8% of the active sites had been labeled, since...acids were quantitated by ninhydrin at the AAA Labs, or by derivatization with phenylisothiocyanate at the University of Michigan. The latter method

  9. Cystatin. Amino acid sequence and possible secondary structure.

    PubMed Central

    Schwabe, C; Anastasi, A; Crow, H; McDonald, J K; Barrett, A J

    1984-01-01

    The amino acid sequence of cystatin, the protein from chicken egg-white that is a tight-binding inhibitor of many cysteine proteinases, is reported. Cystatin is composed of 116 amino acid residues, and the Mr is calculated to be 13 143. No striking similarity to any other known sequence has been detected. The results of computer analysis of the sequence and c.d. spectrometry indicate that the secondary structure includes relatively little alpha-helix (about 20%) and that the remainder is mainly beta-structure. PMID:6712597

  10. Mouse Vk gene classification by nucleic acid sequence similarity.

    PubMed

    Strohal, R; Helmberg, A; Kroemer, G; Kofler, R

    1989-01-01

    Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.

  11. Nucleic acid sequence of an internal image-bearing monoclonal anti-idiotype and its comparison to the sequence of the external antigen.

    PubMed Central

    Bruck, C; Co, M S; Slaoui, M; Gaulton, G N; Smith, T; Fields, B N; Mullins, J I; Greene, M I

    1986-01-01

    The monoclonal anti-idiotypic antibody (mAb2) 87.92.6 directed against the 9B.G5 antibody specific for the virus neutralizing epitope on the mammalian reovirus type 3 hemagglutinin was previously demonstrated to express an internal image of the receptor binding epitope of the reovirus type 3. Furthermore, this mAb2 has autoimmune reactivity to the cell surface receptor of the reovirus. The nucleotide and deduced amino acid sequences of the 87.92.6 mAb2 heavy and light chains are described in this report. The sequence analysis reveals that the same heavy chain variable and joining (VH and JH) gene segments are used by the 87.92.6 anti-idiotypic mAb2 and by the dominant idiotypes of the BALB/c anti-GAT (cGAT) and anti-NP (NPa) responses. [GAT; random polymer that is 60% glutamic acid, 30% alanine, and 10% tyrosine. NP; (4-hydroxy-3-nitrophenyl)-acetyl.] Despite extensive homology at the level of the heavy chain variable regions, the NPa positive BALB/c anti-NP monoclonal antibody 17.2.25 binds neither 9B.G5 nor the cellular receptor for the hemagglutinin. Amino acid sequence comparison between the viral hemagglutinin and the 87.92.6 mAb2 light chain "internal image," reveals an area of significant homology indicating that antigen mimicry by antibodies may be achieved by sharing primary structure. PMID:2428036

  12. Nucleotide sequence of a Proteus mirabilis DNA fragment homologous to the 60K-rnpA-rpmH-dnaA-dnaN-recF-gyrB region of Escherichia coli.

    PubMed

    Skovgaard, O

    1990-09-01

    A 6.5-kb DNA fragment from Proteus mirabilis hybridized to the Escherichia coli dnaA gene. This DNA fragment was cloned and the nucleotide (nt) sequence determined. The fragment is homologous to a region of the E. coli chromosome containing a part of the gene encoding a 60-kDa membrane-associated protein (60K), the rnpA-rpmH-dnaA-dnaN-recF genes, and the N-terminal part of the gyrB gene. The degree of homology is variable: the amino-acid (aa) sequence of a part of the 60K protein and a part of the DnaA protein is only minimally conserved, whereas the C-terminal 148 aa of DnaA are identical in the two species. The conservation of the nt sequence between the rnpA gene and the gene encoding the 60K protein suggests that this region encodes a hitherto unrecognized protein. The ORF for this protein partially overlaps the 3' end of the rnpA structural gene, and the degree of conservation suggests that this gene is important for these bacteria.

  13. Amino acid sequence of two neurotoxins from the venom of the Egyptian black snake (Walterinnesia aegyptia).

    PubMed

    Samejima, Y; Aoki-Tomomatsu, Y; Yanagisawa, M; Mebs, D

    1997-02-01

    The venom of the Egyptian black snake Walterinnesia aegyptia contains at least three toxins, which act postsynaptically to block the neuromuscular transmission of isolated rat phrenic nerve-diaphragm and chicken biventer cervicis muscle. The complete amino acid sequence of the two toxins, W-III and W-IV, consisting of 62 amino acid residues, was elucidated by Edman degradation of fragments obtained after Staphylococcus aureus protease and prolylpeptidase digestion. Although the toxins exhibit close structural homology to other short-chain postsynaptic neurotoxins from Elapidae venoms, toxin IV is unique by having a free SH-group (cysteine) at position 16. In position 35 of W-III, which is located at the tip of the central loop, threonine is replaced by lysine, which may alter the interaction of the toxin with the acetylcholine receptor, since the toxin is seven times less lethal than toxin W-IV.

  14. Studies on adenosine triphosphate transphosphorylases. Amino acid sequence of rabbit muscle ATP-AMP transphosphorylase.

    PubMed

    Kuby, S A; Palmieri, R H; Frischat, A; Fischer, A H; Wu, L H; Maland, L; Manship, M

    1984-05-22

    The total amino acid sequence of rabbit muscle adenylate kinase has been determined, and the single polypeptide chain of 194 amino acid residues starts with N-acetylmethionine and ends with leucyllysine at its carboxyl terminus, in agreement with the earlier data on its amino acid composition [Mahowald, T. A., Noltmann, E. A., & Kuby, S. A. (1962) J. Biol. Chem. 237, 1138-1145] and its carboxyl-terminus sequence [Olson, O. E., & Kuby, S. A. (1964) J. Biol. Chem. 239, 460-467]. Elucidation of the primary structure was based on tryptic and chymotryptic cleavages of the performic acid oxidized protein, cyanogen bromide cleavages of the 14C-labeled S-carboxymethylated protein at its five methionine sites (followed by maleylation of peptide fragments), and tryptic cleavages at its 12 arginine sites of the maleylated 14C-labeled S-carboxymethylated protein. Calf muscle myokinase, whose sequence has also been established, differs primarily from the rabbit muscle myokinase's sequence in the following: His-30 is replaced by Gln-30; Lys-56 is replaced by Met-56; Ala-84 and Asp 85 are replaced by Val-84 and Asn-85. A comparison of the four muscle-type adenylate kinases, whose covalent structures have now been determined, viz., rabbit, calf, porcine, and human [for the latter two sequences see Heil, A., Müller, G., Noda, L., Pinder, T., Schirmer, H., Schirmer, I., & Von Zabern, I. (1974) Eur. J. Biochem. 43, 131-144, and Von Zabern, I., Wittmann-Liebold, B., Untucht-Grau, R., Schirmer, R. H., & Pai, E. F. (1976) Eur. J. Biochem. 68, 281-290], demonstrates an extraordinary degree of homology.(ABSTRACT TRUNCATED AT 250 WORDS)

  15. Influence of Bacteriophage PBS1 and φW-14 Deoxyribonucleic Acids on Homologous Deoxyribonucleic Acid Uptake and Transformation in Competent Bacillus subtilis

    PubMed Central

    López, Paloma; Espinosa, Manuel; Piechowska, Mirosława; Shugar, David

    1980-01-01

    Both bacteriophage PBS1 deoxyribonucleic acid (DNA) (in which all the thymine residues are replaced by uracil) and phage φW-14 DNA [in which half the thymine residues are replaced by 5-(aminobutylaminomethyl)uracil or 5-putrescinylthymine] exhibit comparable competing abilities for uptake of homologous DNA in a Bacillus subtilis competent system. But, whereas PBS1 DNA leads to a decrease in transformation frequencies compatible with its competing ability for DNA uptake, φW-14 DNA decreases transformation frequencies by a factor up to eightfold higher. The effect of φW-14 DNA on transformation frequencies is visible even at a concentration level that does not decrease transforming DNA uptake. No such effect was observed with heterologous DNA containing presumably ionically bound putrescine. Low concentrations of φW-14 DNA decreased the number of double (nonlinked) transformants more than single transformants. The influence on transformation was abolished when φW-14 DNA was added 20 min after addition of transforming DNA, i.e., when the recombination process was terminated. The putrescine-containing DNA also decreased retention of trichloroacetic acid-precipitable radioactivity of homologous DNA taken up. We conclude that φW-14 DNA inhibits some intracellular process(es) at the level of recombination. In addition, there is evidence that φW-14 DNA, but not heterologous DNA with ionically bound putrescine, binds also to site(s) on the cell surface other than receptors for homologous DNA. PMID:6772635

  16. Peptide Mass Fingerprinting and N-Terminal Amino Acid Sequencing of Glycosylated Cysteine Protease of Euphorbia nivulia Buch.-Ham.

    PubMed Central

    Badgujar, Shamkant B.; Mahajan, Raghunath T.

    2013-01-01

    A new cysteine protease named Nivulian-II has been purified from the latex of Euphorbia nivulia Buch.-Ham. The apparent molecular mass of Nivulian-II is 43670.846 Da (MALDI TOF/MS). Peptide mass fingerprint analysis revealed peptide matches to Maturase K (Q52ZV1_9MAGN) of Banksia quercifolia. The N-terminal sequence (DFPPNTCCCICC) showed partial homology with those of other cysteine proteinases of biological origin. This is the first paper to characterize a Nivulian-II of E. nivulia latex with respect to amino acid sequencing. PMID:23476742

  17. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences

    PubMed Central

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D.; Adir, Noam

    2016-01-01

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel. PMID:27307442

  18. Amino acid sequence repertoire of the bacterial proteome and the occurrence of untranslatable sequences.

    PubMed

    Navon, Sharon Penias; Kornberg, Guy; Chen, Jin; Schwartzman, Tali; Tsai, Albert; Puglisi, Elisabetta Viani; Puglisi, Joseph D; Adir, Noam

    2016-06-28

    Bioinformatic analysis of Escherichia coli proteomes revealed that all possible amino acid triplet sequences occur at their expected frequencies, with four exceptions. Two of the four underrepresented sequences (URSs) were shown to interfere with translation in vivo and in vitro. Enlarging the URS by a single amino acid resulted in increased translational inhibition. Single-molecule methods revealed stalling of translation at the entrance of the peptide exit tunnel of the ribosome, adjacent to ribosomal nucleotides A2062 and U2585. Interaction with these same ribosomal residues is involved in regulation of translation by longer, naturally occurring protein sequences. The E. coli exit tunnel has evidently evolved to minimize interaction with the exit tunnel and maximize the sequence diversity of the proteome, although allowing some interactions for regulatory purposes. Bioinformatic analysis of the human proteome revealed no underrepresented triplet sequences, possibly reflecting an absence of regulation by interaction with the exit tunnel.

  19. Cloning and sequence analysis of a vasa homolog in the European sea bass (Dicentrarchus labrax): tissue distribution and mRNA expression levels during early development and sex differentiation.

    PubMed

    Blázquez, Mercedes; González, Alicia; Mylonas, Constantinos C; Piferrer, Francesc

    2011-01-15

    Vasa is a protein expressed mainly in germ cells and conserved across taxa. However, sex-related differences and environmental influences on vasa expression have not been documented. This study characterized the cDNA of a vasa homolog in the European sea bass, Dicentrarchuslabrax (sb-vasa), a gonochoristic fish with temperature influences on gonadogenesis. The 1911 bp open reading frame predicted a 637-amino acid protein with the eight conserved domains typical of Vasa proteins. Comparisons of the deduced amino acid sequence with those of other vertebrates and invertebrates revealed the highest homology (68-85%) with those of other teleosts. An updated tree with the full-length sequences for Vasa proteins in 66 species belonging to six different phyla was constructed, establishing the evolutionary relationships of Vasa amino acid sequences. European sea bass vasa was highly expressed in gonads with little or no expression in other tissues. Real time RT-PCR quantification of the temporal expression of sb-vasa from early development throughout sex differentiation showed that mRNA levels were high in unfertilized eggs, decreased during larval development and increased again during the period of germ cell proliferation. Rearing of fish at high temperature resulted in further increased sb-vasa levels, most likely reflecting temperature effects on both somatic and gonadal growth. Differences in expression were also found well before sex differentiation and persisted until the end of the first year, with higher levels present in females. These differences in expression demonstrate the implication of vasa during the initial stages of fish sex differentiation and gametogenesis and suggest that, through its helicase activity, it might be implicated in the translational regulation of mRNAs involved in the specification and differentiation of gonadal-specific cell types.

  20. Developmental rearrangement of cyanobacterial nif genes: nucleotide sequence, open reading frames, and cytochrome P-450 homology of the Anabaena sp. strain PCC 7120 nifD element.

    PubMed

    Lammers, P J; McLaughlin, S; Papin, S; Trujillo-Provencio, C; Ryncarz, A J

    1990-12-01

    An 11-kbp DNA element of unknown function interrupts the nifD gene in vegetative cells of Anabaena sp. strain PCC 7120. In developing heterocysts the nifD element excises from the chromosome via site-specific recombination between short repeat sequences that flank the element. The nucleotide sequence of the nifH-proximal half of the element was determined to elucidate the genetic potential of the element. Four open reading frames with the same relative orientation as the nifD element-encoded xisA gene were identified in the sequenced region. Each of the open reading frames was preceded by a reasonable ribosome-binding site and had biased codon utilization preferences consistent with low levels of expression. Open reading frame 3 was highly homologous with three cytochrome P-450 omega-hydroxylase proteins and showed regional homology to functionally significant domains common to the cytochrome P-450 superfamily. The sequence encoding open reading frame 2 was the most highly conserved portion of the sequenced region based on heterologous hybridization experiments with three genera of heterocystous cyanobacteria.

  1. Developmental rearrangement of cyanobacterial nif genes: nucleotide sequence, open reading frames, and cytochrome P-450 homology of the Anabaena sp. strain PCC 7120 nifD element.

    PubMed Central

    Lammers, P J; McLaughlin, S; Papin, S; Trujillo-Provencio, C; Ryncarz, A J

    1990-01-01

    An 11-kbp DNA element of unknown function interrupts the nifD gene in vegetative cells of Anabaena sp. strain PCC 7120. In developing heterocysts the nifD element excises from the chromosome via site-specific recombination between short repeat sequences that flank the element. The nucleotide sequence of the nifH-proximal half of the element was determined to elucidate the genetic potential of the element. Four open reading frames with the same relative orientation as the nifD element-encoded xisA gene were identified in the sequenced region. Each of the open reading frames was preceded by a reasonable ribosome-binding site and had biased codon utilization preferences consistent with low levels of expression. Open reading frame 3 was highly homologous with three cytochrome P-450 omega-hydroxylase proteins and showed regional homology to functionally significant domains common to the cytochrome P-450 superfamily. The sequence encoding open reading frame 2 was the most highly conserved portion of the sequenced region based on heterologous hybridization experiments with three genera of heterocystous cyanobacteria. Images PMID:2123860

  2. Evidence of mineralization activity and supramolecular assembly by the N-terminal sequence of ACCBP, a biomineralization protein that is homologous to the acetylcholine binding protein family.

    PubMed

    Amos, Fairland F; Ndao, Moise; Evans, John Spencer

    2009-12-14

    Several biomineralization proteins that exhibit intrinsic disorder also possess sequence regions that are homologous to nonmineral associated folded proteins. One such protein is the amorphous calcium carbonate binding protein (ACCBP), one of several proteins that regulate the formation of the oyster shell and exhibit 30% conserved sequence identity to the acetylcholine binding protein sequences. To gain a better understanding of the ACCBP protein, we utilized bioinformatic approaches to identify the location of disordered and folded regions within this protein. In addition, we synthesized a 50 AA polypeptide, ACCN, representing the N-terminal domain of the mature processed ACCBP protein. We then utilized this polypeptide to determine the mineralization activity and qualitative structure of the N-terminal region of ACCBP. Our bioinformatic studies indicate that ACCBP consists of a ten-stranded beta-sandwich structure that includes short disordered sequence blocks, two of which reside within the primarily helical and surface-accessible ACCN sequence. Circular dichroism studies reveal that ACCN is partially disordered in solution; however, ACCN can be induced to fold into an alpha helix in the presence of TFE. Furthermore, we confirm that the ACCN sequence is multifunctional; this sequence promotes radial calcite polycrystal growth on Kevlar threads and forms supramolecular assemblies in solution that contain amorphous-appearing deposits. We conclude that the partially disordered ACCN sequence is a putative site for mineralization activity within the ACCBP protein and that the presence of short disordered sequence regions within the ACCBP fold are essential for function.

  3. Amino acid sequences of proteins from Leptospira serovar pomona.

    PubMed

    Alves, S F; Lefebvre, R B; Probert, W

    2000-01-01

    This report describes a partial amino acid sequences from three putative outer envelope proteins from Leptospira serovar pomona. In order to obtain internal fragments for protein sequencing, enzymatic and chemical digestion was performed. The enzyme clostripain was used to digest the proteins 32 and 45 kDa. In situ digestion of 40 kDa molecular weight protein was accomplished using cyanogen bromide. The 32 kDa protein generated two fragments, one of 21 kDa and another of 10 kDa that yielded five residues. A fragment of 24 kDa that yielded nineteen residues of amino acids was obtained from 45 kDa protein. A fragment with a molecular weight of 20 kDa, yielding a twenty amino acids sequence from the 40 kDa protein.

  4. Detection of petunia vein-clearing virus: model for the detection of DNA viruses in plants with homologous endogenous pararetrovirus sequences.

    PubMed

    Harper, Glyn; Richert-Pöggeler, Katja R; Hohn, Thomas; Hull, Roger

    2003-02-01

    A number of cases of plant virus sequence integration into host plant genome have been reported. In at least two cases, endogenous pararetrovirus sequences are correlated strongly with subsequent episomal virus infection and there is circumstantial evidence that this also occurs for Petunia vein-clearing virus (PVCV). The detection of viruses is a critical component of plant health and therefore, it is important to have diagnostic procedures that differentiate between the detection of encapsidated viral DNA and homologous sequences in the host genome. PCR-based detection methods targeted at PVCV DNA have been tested and particular attention was paid to design controls that would indicate the existence of host DNA in the reaction. The use of ion-exchange chromatography for the partial purification of plant viruses from other cellular components, including chromosomal DNA, is described. The methods tested for PVCV detection are used to illustrate general principles for the specific detection of virus infections in host plants that carry homologous virus sequences in their genomes.

  5. Statistical model for self-assembly of trimesic acid molecules into homologous series of flower phases

    NASA Astrophysics Data System (ADS)

    Ibenskas, A.; Tornau, E. E.

    2012-11-01

    The statistical three-state model is proposed to describe the ordering of triangular TMA molecules into flower phases. The model is solved on a rescaled triangular lattice, assuming following intermolecular interactions: exclusion of any molecules on nearest neighbor sites, triangular trio H-bonding interactions for molecules of the same orientation on next-nearest neighbor sites, and dimeric H-bonding interactions for molecules of different (“tip-to-tip”) orientations on third-nearest neighbor sites. The model allows us to obtain the analytical solution for the ground state phase diagram with all homologous series of flower phases included, starting with the honeycomb phase (n=1) and ending with the superflower structure (n=∞). Monte Carlo simulations are used to obtain the thermodynamical properties of this model. It is found that phase transitions from disordered to any of the flower phases (except n=1) undergo via intermediate correlated triangular domains structure. The transition from the disordered phase to the intermediate phase is, most likely, of the first order, while the transition from the intermediate to the flower phase is definitely first order phase transition. The phase diagrams including low-temperature flower phases are obtained. The origin of the intermediate phase, phase separation, and metastable structures are discussed.

  6. Characterization of Group V Dubnium Homologs on DGA Extraction Chromatography Resin from Nitric and Hydrofluoric Acid Matrices

    SciTech Connect

    Despotopulos, J D; Sudowe, R

    2012-02-21

    somewhere between Nb and Pa. Much more recent studies have examined the properties of Db from HNO{sub 3}/HF matrices, and suggest Db forms complexes similar to those of Pa. Very little experimental work into the behavior of element 114 has been performed. Thermochromatography experiments of three atoms of element 114 indicate that the element 114 is at least as volatile as Hg, At, and element 112. Lead was shown to deposit on gold at temperatures about 1000 C higher than the atoms of element 114. Results indicate a substantially increased stability of element 114. No liquid phase studies of element 114 or its homologs (Pb, Sn, Ge) or pseudo-homologs (Hg, Cd) have been performed. Theoretical predictions indicate that element 114 is should have a much more stable +2 oxidation state and neutral state than Pb, which would result in element 114 being less reactive and less metallic than Pb. The relativistic effects on the 7p{sub 1/2} electrons are predicted to cause a diagonal relationship to be introduced into the periodic table. Therefore, 114{sup 2+} is expected to behave as if it were somewhere between Hg{sup 2+}, Cd{sup 2+}, and Pb{sup 2+}. In this work two commercially available extraction chromatography resins are evaluated, one for the separation of Db homologs and pseudo?homologs from each other as well as from potential interfering elements such as Group IV Rf homologs and actinides, and the other for separation of element 114 homologs. One resin, Eichrom's DGA resin, contains a N,N,N',N'-tetra-n-octyldiglycolamide extractant, which separates analytes based on both size and charge characteristics of the solvated metal species, coated on an inert support. The DGA resin was examined for Db chemical systems, and shows a high degree of selectivity for tri-, tetra-, and hexavalent metal ions in multiple acid matrices with fast kinetics. The other resin, Eichrom's Pb resin, contains a di-t-butylcyclohexano 18-crown-6 extractant with isodecanol solvent, which separates

  7. Homologous electron transport components fail to increase fatty acid hydroxylation in transgenic Arabidopsis thaliana

    PubMed Central

    Wayne, Laura L.; Browse, John

    2013-01-01

    Ricinoleic acid, a hydroxylated fatty acid (HFA) present in castor ( Ricinus communis) seeds, is an important industrial commodity used in products ranging from inks and paints to polymers and fuels. However, due to the deadly toxin ricin and allergens also present in castor, it would be advantageous to produce ricinoleic acid in a different agricultural crop. Unfortunately, repeated efforts at heterologous expression of the castor fatty acid hydroxylase (RcFAH12) in the model plant Arabidopsis thaliana have produced only 17-19% HFA in the seed triacylglycerols (TAG), whereas castor seeds accumulate up to 90% ricinoleic acid in the endosperm TAG. RcFAH12 requires an electron supply from NADH:cytochrome b5 reductase (CBR1) and cytochrome b5 (Cb5) to synthesize ricinoleic acid. Previously, our laboratory found a mutation in the Arabidopsis CBR1 gene, cbr1-1, that caused an 85% decrease in HFA levels in the RcFAH12 Arabidopsis line. These results raise the possibility that electron supply to the heterologous RcFAH12 may limit the production of HFA. Therefore, we hypothesized that by heterologously expressing RcCb5, the reductant supply to RcFAH12 would be improved and lead to increased HFA accumulation in Arabidopsis seeds. Contrary to this proposal, heterologous expression of the top three RcCb5 candidates did not increase HFA accumulation. Furthermore, coexpression of RcCBR1 and RcCb5 in RcFAH12 Arabidopsis also did not increase in HFA levels compared to the parental lines. These results demonstrate that the Arabidopsis electron transfer system is supplying sufficient reductant to RcFAH12 and that there must be other bottlenecks limiting the accumulation of HFA. PMID:24555099

  8. Homologous down-regulation of growth hormone-releasing hormone receptor messenger ribonucleic acid levels.

    PubMed

    Aleppo, G; Moskal, S F; De Grandis, P A; Kineman, R D; Frohman, L A

    1997-03-01

    Repeated stimulation of pituitary cell cultures with GH-releasing hormone (GHRH) results in diminished responsiveness, a phenomenon referred to as homologous desensitization. One component of GHRH-induced desensitization is a reduction in GHRH-binding sites, which is reflected by the decreased ability of GHRH to stimulate a rise in intracellular cAMP. In the present study, we sought to determine if homologous down-regulation of GHRH receptor number is due to a decrease in GHRH receptor synthesis. To this end, we developed and validated a quantitative RT-PCR assay system that was capable of assessing differences in GHRH-R messenger RNA (mRNA) levels in total RNA samples obtained from rat pituitary cell cultures. Treatment of pituitary cells with GHRH, for as little as 4 h, resulted in a dose-dependent decrease in GHRH-R mRNA levels. The maximum effect was observed with 0.1 and 1 nM GHRH, which reduced GHRH-R mRNA levels to 49 +/- 4% (mean +/- SEM) and 54 +/- 11% of control values, respectively (n = three separate experiments; P < 0.05). Accompanying the decline in GHRH-R mRNA levels was a rise in GH release; reaching 320 +/- 31% of control values (P < 0.01). Because of the possibility that the rise in medium GH level is the primary regulator of GHRH-R mRNA, we pretreated pituitary cultures for 4 h with GH to achieve a concentration comparable with that induced by a maximal stimulation with GHRH (8 micrograms GH/ml medium). Following pretreatment, cultures were stimulated for 15 min with GHRH and intracellular cAMP accumulation was measured by RIA. GH pretreatment did not impair the ability of GHRH to induce a rise in cAMP concentrations. However, as anticipated, GHRH pretreatment (10 nM) significantly reduced subsequent GHRH-stimulated cAMP to 46% of untreated controls. These data suggest that GHRH, but not GH, directly reduces GHRH-R mRNA levels. To determine whether this effect was mediated through cAMP, cultures were treated with forskolin, a direct stimulator of

  9. Unconventional amino acid sequence of the sun anemone (Stoichactis helianthus) polypeptide neurotoxin

    SciTech Connect

    Kem, W.; Dunn, B.; Parten, B.; Pennington, M.; Price, D.

    1986-05-01

    A 5000 dalton polypeptide neurotoxin (Sh-NI) purified by G50 Sephadex, P-cellulose, and SP-Sephadex chromatography was homogeneous by isoelectric focusing. Sh-NI was highly toxic to crayfish (LD/sub 50/ 0.6 ..mu..g/kg) but without effect upon mice at 15,000 ..mu..g/kg (i.p. injection). The reduced, /sup 3/H-carboxymethylated toxin and its fragments were subjected to automatic Edman degradation and the resulting PTH-amino acids were identified by HPLC, back hydrolysis, and scintillation counting. Peptides resulting from proteolytic (clostripain, staphylococcal protease) and chemical (tryptophan) cleavage were sequenced. The sequence is: AACKCDDEGPDIRTAPLTGTVDLGSCNAGWEKCASYYTIIADCCRKKK. This sequence differs considerably from the homologous Anemonia and Anthopleura toxins; many of the identical residues (6 half-cystines, G9, P10, R13, G19, G29, W30) are probably critical for folding rather than receptor recognition. However, the Sh-NI sequence closely resembles Radioanthus macrodactylus neurotoxin III and r. paumotensis II. The authors propose that Sh-NI and related Radioanthus toxins act upon a different site on the sodium channel.

  10. Origins of sequence selectivity in homologous genetic recombination: insights from rapid kinetic probing of RecA-mediated DNA strand exchange.

    PubMed

    Lee, Andrew M; Xiao, Jie; Singleton, Scott F

    2006-07-07

    Despite intense effort over the past 30 years, the molecular determinants of sequence selectivity in RecA-mediated homologous recombination have remained elusive. Here, we describe when and how sequence homology is recognized between DNA strands during recombination in the context of a kinetic model for RecA-mediated DNA strand exchange. We characterized the transient intermediates of the reaction using pre-steady-state kinetic analysis of strand exchange using oligonucleotide substrates containing a single fluorescent G analog. We observed that the reaction system was sensitive to heterology between the DNA substrates; however, such a "heterology effect" was not manifest when functional groups were added to or removed from the edges of the base-pairs facing the minor groove of the substrate duplex. Hence, RecA-mediated recombination must occur without the involvement of a triple helix, even as a transient intermediate in the process. The fastest detectable reaction phase was accelerated when the structure or stability of the substrate duplex was perturbed by internal mismatches or the replacement of G.C by I.C base-pairs. These findings indicate that the sequence specificity in recombination is achieved by Watson-Crick pairing in the context of base-pair dynamics inherent to the extended DNA structure bound by RecA during strand exchange.

  11. Identification of neurofibromatosis 1 (NF1) homologous loci by direct sequencing, fluorescence in situ hybridization, and PCR amplification of somatic cell hybrids

    SciTech Connect

    Purandare, S.M.; Neil, S.M.; Brothman, A. |

    1995-12-10

    Using fluorescence in situ hybridization (FISH), we have identified seven NF1-related loci, two separate loci on chromosome 2, at bands 2q21 and 2q33-q34, and one locus each on five other chromosomes at bands 14q11.2, 15q11.2, 18p11.2, 21q11.2-q21, and 22q11.2. Application of PCR using NF1 primer pairs and genomic DNA from somatic cell hybrids confirmed the above loci, identified additional loci on chromosomes 12 and 15, and showed that the various loci do not share homology beyond NF1 exon 27b. Sequenced PCR products representing segments corresponding to NF1 exons from these loci demonstrated greater than 95% sequence identity with the NF1 locus. We used sequence differences between bona fide NF1 and NF1-homologous loci to strategically design primer sets to specifically amplify 30 of 36 exons within the 5{prime} end of the NF1 gene. These developments have facilitated mutation analysis at the NF1 locus using genomic DNA as template. 41 refs., 3 figs., 3 tabs.

  12. Phylogenetic position of phylum Nemertini, inferred from 18S rRNA sequences: molecular data as a test of morphological character homology.

    PubMed

    Turbeville, J M; Field, K G; Raff, R A

    1992-03-01

    Partial 18S rRNA sequence of the nemertine Cerebratulus lacteus was obtained and compared with those of coelomate metazoans and acoelomate platyhelminths to test whether nemertines share a most recent common ancestor with the platyhelminths, as traditionally has been implied, or whether nemertines lie within a protostome coelomate clade, as suggested by more recent morphological analyses. Maximum-parsimony analysis supports the inclusion of the nemertine within a protostome-coelomate clade that falls within a more inclusive coelomate clade. Bootstrap analysis indicates strong support for a monophyletic Coelomata composed of a deuterostome and protostome-coelomate clade. Support for a monophyletic protostome Coelomata is weak. Inference by distance analysis is consistent with that of maximum parsimony. Analysis of down-weighted paired sites by maximum parsimony reveals variation in topology only within the protostome-coelomate clade. The relationships among the protostome coelomates cannot be reliably inferred from the partial sequences, suggesting that coelomate protostomes diversified rapidly. Results with evolutionary parsimony are consistent with the inclusion of the nemertine in a coelomate clade. The molecular inference corroborates recent morphological character analyses that reveal no synapomorphies of nemertines and flatworms but instead suggest that the circulatory system and rhynchocoel of nemertines are homologous to coelomic cavities of protostome coelomates, thus supporting the corresponding hypothesis that nemertines belong within a protostome-coelomate clade. The sequence data provide an independent test of morphological character homology.

  13. The amino acid sequence of a carbohydrate-containing immunoglobulin-light-chain-type amyloid-fibril protein.

    PubMed Central

    Tveteraas, T; Sletten, K; Westermark, P

    1985-01-01

    The amino acid sequence of an amyloid-fibril protein Es492 of immunoglobulin-lambda-light-chain origin (AL) was elucidated. The amyloid fibrils were obtained from the spleen of a patient who died from systemic amyloidosis. The amino acid sequence was elucidated from structural studies of peptides derived from digestion of the protein with trypsin, thermolysin, chymotrypsin and Staphylococcus aureus V8 proteinase and from cleavage of the protein with CNBr and BNPS-skatole. A heterogeneity in the length of the polypeptide was seen in the C-terminal region. The protein was by sequence homology to other lambda-chains shown to be of the V lambda II subgroup. Although an extensive homology was seen, some amino acid residues in positions 26, 31, 32, 40, 44, 93, 97, 98 and 99 have not previously been reported in these positions of V lambda II proteins. The significance of these residues in the fibril formation is unclear. The protein was found to contain carbohydrate, with glycosylation sites in two of the hypervariable regions. PMID:3936482

  14. Molecular Cloning and Sequence Analysis of the Sta58 Major Antigen Gene of Rickettsia tsutsugamushi: Sequence homology and Antigenic Comparison of Sta58 to the 60-Kilodalton Family of Stress Proteins

    DTIC Science & Technology

    1990-05-01

    on the cell envelopes of Rickettsia 29. Messing, J. 1983. New M13 vectors for cloning. Methods prowazekii, Rickettsia rickettsii , and Rickettsia ...gene of Rickettsia tsu sugamushi:Sequence homology and antigenic comparison to the 60-kilodalton family of stresproteins. 12. PERSONAL AUTHOR(S...IuwRnuiy dy "jmber FIELD GROUP S ROUP Rickettsia tsutsugamushi, antigens, molecular cloning,. FIED_ GROU__ SUB-GROUP scrub typhus, heat-shock proteins

  15. Sequence homologies between eukaryotic 5.8S rRNA and the 5' end of prokaryotic 23S rRNa: evidences for a common evolutionary origin.

    PubMed Central

    Jacq, B

    1981-01-01

    The question of the evolutionary origin of eukaryotic 5.8S rRNA was re-examined after the recent publication of the E. coli 23S rRNA sequence (26,40). A region of the 23S RNA located at its 5' end was found to be approximately 50% homologous to four different eukaryotic 5.8S rRNAs. A computer comparison analysis indicates that no other region of the E. coli ribosomal transcription unit (greater than 5 000 nucleotides in length) shares a comparable homology with 5.8S rRNA. Homology between the 5' end of e. coli 23S and four different eukaryotic 5.8S rRNAs falls within the same range as that between E. coli 5S RNA from the same four eukaryotic species. All these data strongly suggest that the 5' end of prokaryotic 23S rRNA and eukaryotic 5.8S RNA have a common evolutionary origin. Secondary structure models are proposed for the 5' region of E. coli 23S RNA. Images PMID:7024907

  16. Valproic acid increases conservative homologous recombination frequency and reactive oxygen species formation: a potential mechanism for valproic acid-induced neural tube defects.

    PubMed

    Defoort, Ericka N; Kim, Perry M; Winn, Louise M

    2006-04-01

    Valproic acid, a commonly used antiepileptic agent, is associated with a 1 to 2% incidence of neural tube defects when taken during pregnancy; however, the molecular mechanism by which this occurs has not been elucidated. Previous research suggests that valproic acid exposure leads to an increase in reactive oxygen species (ROS). DNA damage due to ROS can result in DNA double-strand breaks, which can be repaired through homologous recombination (HR), a process that is not error-free and can result in detrimental genetic changes. Because the developing embryo requires tight regulation of gene expression to develop properly, we propose that the loss or dysfunction of genes involved in embryonic development through aberrant HR may ultimately cause neural tube defects. To determine whether valproic acid induces HR, Chinese hamster ovary 3-6 cells, containing a neomycin direct repeat recombination substrate, were exposed to valproic acid for 4 or 24 h. A significant increase in HR after exposure to valproic acid (5 and 10 mM) for 24 h was observed, which seems to occur through a conservative HR mechanism. We also demonstrated that exposure to valproic acid (5 and 10 mM) significantly increased intracellular ROS levels, which were attenuated by preincubation with polyethylene glycol-conjugated (PEG)-catalase. A significant change in the ratio of 8-hydroxy-2'-deoxyguanosine/2'-de-oxyguanosine, a measure of DNA oxidation, was not observed after valproic acid exposure; however, preincubation with PEG-catalase significantly blocked the increase in HR. These data demonstrate that valproic acid increases HR frequency and provides a possible mechanism for valproic acid-induced neural tube defects.

  17. Amino acid sequence and comparative antigenicity of chicken metallothionein.

    PubMed Central

    McCormick, C C; Fullmer, C S; Garvey, J S

    1988-01-01

    The complete amino acid sequence of metallothionein (MT) from chicken liver is reported. The primary structure was determined by automated sequence analysis of peptides produced by limited acid hydrolysis and by trypsin digestion. The comparative antigenicity of chicken MT was determined by radioimmunoassay using rabbit anti-rat MT polyclonal antibody. Chicken MT consists of 63 amino acids as compared to 61 found in MTs from mammals. One insertion (and two substitutions) occurs in the amino-terminal region, a region considered invariant among mammalian MTs. Eighteen of the 20 cysteines in chicken MT were aligned with cysteines from other mammalian sequences. Two cysteines near the carboxyl terminus are shifted by one residue due to the insertion of proline in that region. Overall, the chicken protein showed approximately equal to 68% sequence identity in a comparison with various mammalian MTs. The affinity of the polyclonal antibody for chicken MT was decreased by 2 orders of magnitude in comparison to that of a mammalian MT (rat MT isoforms). This reduced affinity is attributed to major substitutions in chicken MT in the regions of the principal determinants of mammalian MTs. Theoretical analysis of the primary structure predicted the secondary structure to consist of reverse turns and random coils with no stable beta or helix conformations. There is no evidence that chicken MT differs functionally from mammalian MTs. PMID:2448773

  18. Sequences Of Amino Acids For Human Serum Albumin

    NASA Technical Reports Server (NTRS)

    Carter, Daniel C.

    1992-01-01

    Sequences of amino acids defined for use in making polypeptides one-third to one-sixth as large as parent human serum albumin molecule. Smaller, chemically stable peptides have diverse applications including service as artificial human serum and as active components of biosensors and chromatographic matrices. In applications involving production of artificial sera from new sequences, little or no concern about viral contaminants. Smaller genetically engineered polypeptides more easily expressed and produced in large quantities, making commercial isolation and production more feasible and profitable.

  19. Gastropod arginine kinases from Cellana grata and Aplysia kurodai. Isolation and cDNA-derived amino acid sequences.

    PubMed

    Suzuki, T; Inoue, N; Higashi, T; Mizobuchi, R; Sugimura, N; Yokouchi, K; Furukohri, T

    2000-12-01

    Arginine kinase (AK) was isolated from the radular muscle of the gastropod molluscs Cellana grata (subclass Prosobranchia) and Aplysia kurodai (subclass Opisthobranchia), respectively, by ammonium sulfate fractionation, Sephadex G-75 gel filtration and DEAE-ion exchange chromatography. The denatured relative molecular mass values were estimated to be 40 kDa by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The isolated enzyme from Aplysia gave a Km value of 0.6 mM for arginine and a Vmax value of 13 micromole Pi min(-1) mg protein(-1) for the forward reaction. These values are comparable to other molluscan AKs. The cDNAs encoding Cellana and Aplysia AKs were amplified by polymerase chain reaction, and the nucleotide sequences of 1,608 and 1,239 bp, respectively, were determined. The open reading frame for Cellana AK is 1044 nucleotides in length and encodes a protein with 347 amino acid residues, and that for A. kurodai is 1077 nucleotides and 354 residues. The cDNA-derived amino acid sequences were validated by chemical sequencing of internal lysyl endopeptidase peptides. The amino acid sequences of Cellana and Aplysia AKs showed the highest percent identity (66-73%) with those of the abalone Nordotis and turbanshell Battilus belonging to the same class Gastropoda. These AK sequences still have a strong homology (63-71%) with that of the chiton Liolophura (class Polyplacophora), which is believed to be one of the most primitive molluscs. On the other hand, these AK sequences are less homologous (55-57%) with that of the clam Pseudocardium (class Bivalvia), suggesting that the biological position of the class Polyplacophora should be reconsidered.

  20. Nanopores and nucleic acids: prospects for ultrarapid sequencing

    NASA Technical Reports Server (NTRS)

    Deamer, D. W.; Akeson, M.

    2000-01-01

    DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.

  1. Partitioning of Homologous Nicotinic Acid Ester Prodrugs (Nicotinates) into Dipalmitoylphosphatidylcholine (DPPC) Membrane Bilayers

    PubMed Central

    Ojogun, Vivian; Vyas, Sandhya M.; Lehmler, Hans-Joachim; Knutson, Barbara L

    2010-01-01

    The partitioning behavior of a series of perhydrocarbon nicotinic acid esters (nicotinates) between aqueous solution and dipalmitoylphosphatidylcholine (DPPC) membrane bilayers is investigated as a function of increasing alkyl chain length. The hydrocarbon nicotinates represent putative prodrugs, derivatives of the polar drug nicotinic acid, whose functionalization provides the hydrophobic character necessary for pulmonary delivery in a hydrophobic, fluorocarbon solvent, such as perfluorooctyl bromide. Independent techniques of differential scanning calorimetry and 1,6-diphenyl-1,3,5 hexatriene (DPH) fluorescence anisotropy measurements are used to analyze the thermotropic phase behavior and lipid bilayer fluidity as a function of nicotinate concentration. At increasing concentrations of nicotinates over the DPPC mole fraction range examined (XDPPC = 0.6 – 1.0), all the nicotinates (ethyl (C2H5); butyl (C4H9); hexyl (C6H13); and octyl (C8H17)) partition into the lipid bilayer at sufficient levels to eliminate the pretransition, and decrease and broaden the gel to fluid phase transition temperature. The concentration at which these effects occur is chain length-dependent; the shortest chain nicotinate, C2H5, elicits the least dramatic response. Similarly, the DPH anisotropy results demonstrate an alteration of the bilayer organization in the liposomes as a consequence of the chain length-dependent partitioning of the nicotinates into DPPC bilayers. The membrane partition coefficients (logarithm values), determined from the depressed bilayer phase transition temperatures, increase from 2.18 for C2H5 to 5.25 for C8H17. The DPPC membrane/water partitioning of the perhydrocarbon nicotinate series correlates with trends in the octanol/water partitioning of these solutes, suggesting that their incorporation into the bilayer is driven by increasing hydrophobicity. PMID:20227859

  2. Strategies for Development of Functionally Equivalent Promoters with Minimum Sequence Homology for Transgene Expression in Plants: cis-Elements in a Novel DNA Context versus Domain Swapping1

    PubMed Central

    Bhullar, Simran; Chakravarthy, Suma; Advani, Sonia; Datta, Sudipta; Pental, Deepak; Burma, Pradeep Kumar

    2003-01-01

    The cauliflower mosaic virus 35S (35S) promoter has been extensively used for the constitutive expression of transgenes in dicotyledonous plants. The repetitive use of the same promoter is known to induce transgene inactivation due to promoter homology. As a way to circumvent this problem, we tested two different strategies for the development of synthetic promoters that are functionally equivalent but have a minimum sequence homology. Such promoters can be generated by (a) introducing known cis-elements in a novel or synthetic stretch of DNA or (b) “domain swapping,” wherein domains of one promoter can be replaced with functionally equivalent domains from other heterologous promoters. We evaluated the two strategies for promoter modifications using domain A (consisting of minimal promoter and subdomain A1) of the 35S promoter as a model. A set of modified 35S promoters were developed whose strength was compared with the 35S promoter per se using β-glucuronidase as the reporter gene. Analysis of the expression of the reporter gene in transient assay system showed that domain swapping led to a significant fall in promoter activity. In contrast, promoters developed by placing cis-elements in a novel DNA context showed levels of expression comparable with that of the 35S. Two promoter constructs Mod2A1T and Mod3A1T were then designed by placing the core sequences of minimal promoter and subdomain A1 in divergent DNA sequences. Transgenics developed in tobacco (Nicotiana tabacum) with the two constructs and with 35S as control were used to assess the promoter activity in different tissues of primary transformants. Mod2A1T and Mod3A1T were found to be active in all of the tissues tested, at levels comparable with that of 35S. Further, the expression of the Mod2A1T promoter in the seedlings of the T1 generation was also similar to that of the 35S promoter. The present strategy opens up the possibility of creating a set of synthetic promoters with minimum sequence

  3. Functional homology between the sequence-specific DNA-binding proteins nuclear factor I from HeLa cells and the TGGCA protein from chicken liver.

    PubMed Central

    Leegwater, P A; van der Vliet, P C; Rupp, R A; Nowock, J; Sippel, A E

    1986-01-01

    Nuclear factor I from HeLa cells, a protein with enhancing function in adenovirus DNA replication, and the chicken TGGCA protein are specific DNA-binding proteins that were first detected by independent methods and that appeared to have similar DNA sequence specificity. To test whether they are homologous proteins from different species we have compared (i) their DNA binding properties and (ii) their function in reconstituted adenovirus DNA replication systems. Using deletion and substitution mutants derived from the DNA binding site on the adenovirus 2 inverted terminal repeat, it was found that the two proteins protect the same 24-nucleotide region of both strands against DNase I digestion and that they have identical minimal recognition sequences of 15 bp containing dyad symmetry. Like nuclear factor I, the TGGCA protein enhances the initiation reaction of adenovirus 2 DNA replication in vitro in a DNA recognition site-dependent manner. Images Fig. 1. Fig. 3. Fig. 4. Fig. 5. Fig. 6. PMID:3709517

  4. Method for the detection of specific nucleic acid sequences by polymerase nucleotide incorporation

    DOEpatents

    Castro, Alonso

    2004-06-01

    A method for rapid and efficient detection of a target DNA or RNA sequence is provided. A primer having a 3'-hydroxyl group at one end and having a sequence of nucleotides sufficiently homologous with an identifying sequence of nucleotides in the target DNA is selected. The primer is hybridized to the identifying sequence of nucleotides on the DNA or RNA sequence and a reporter molecule is synthesized on the target sequence by progressively binding complementary nucleotides to the primer, where the complementary nucleotides include nucleotides labeled with a fluorophore. Fluorescence emitted by fluorophores on single reporter molecules is detected to identify the target DNA or RNA sequence.

  5. Molecular cloning, encoding sequence, and expression of vaccinia virus nucleic acid-dependent nucleoside triphosphatase gene.

    PubMed Central

    Rodriguez, J F; Kahn, J S; Esteban, M

    1986-01-01

    A rabbit poxvirus genomic library contained within the expression vector lambda gt11 was screened with polyclonal antiserum prepared against vaccinia virus nucleic acid-dependent nucleoside triphosphatase (NTPase)-I enzyme. Five positive phage clones containing from 0.72- to 2.5-kilobase-pair (kbp) inserts expressed a beta-galactosidase fusion protein that was reactive by immunoblotting with the NTPase-I antibody. Hybridization analysis allowed the location of this gene within the vaccinia HindIIID restriction fragment. From the known nucleotide sequence of the 16-kbp vaccinia HindIIID fragment, we identified a region that contains a 1896-base open reading frame coding for a 631-amino acid protein. Analysis of the complete sequence revealed a highly basic protein, with hydrophilic COOH and NH2 termini, various hydrophobic domains, and no significant homology to other known proteins. Translational studies demonstrate that NTPase-I belongs to a late class of viral genes. This protein is highly conserved among Orthopoxviruses. Images PMID:3025846

  6. The amino acid sequences and activities of synergistic hemolysins from Staphylococcus cohnii.

    PubMed

    Mak, Pawel; Maszewska, Agnieszka; Rozalska, Malgorzata

    2008-10-01

    Staphylococcus cohnii ssp. cohnii and S. cohnii ssp. urealyticus are a coagulase-negative staphylococci considered for a long time as unable to cause infections. This situation changed recently and pathogenic strains of these bacteria were isolated from hospital environments, patients and medical staff. Most of the isolated strains were resistant to many antibiotics. The present work describes isolation and characterization of several synergistic peptide hemolysins produced by these bacteria and acting as virulence factors responsible for hemolytic and cytotoxic activities. Amino acid sequences of respective hemolysins from S. cohnii ssp. cohnii (named as H1C, H2C and H3C) and S. cohnii ssp. urealyticus (H1U, H2U and H3U) were identical. Peptides H1 and H3 possessed significant amino acid homology to three synergistic hemolysins secreted by Staphylococcus lugdunensis and to putative antibacterial peptide produced by Staphylococcus saprophyticus ssp. saprophyticus. On the other hand, hemolysin H2 had a unique sequence. All isolated peptides lysed red cells from different mammalian species and exerted a cytotoxic effect on human fibroblasts.

  7. Complete amino acid sequence of a Lolium perenne (perennial rye grass) pollen allergen, Lol p II.

    PubMed

    Ansari, A A; Shenbagamurthi, P; Marsh, D G

    1989-07-05

    The complete amino acid sequence of a Lolium perenne (rye grass) pollen allergen, Lol p II was determined by automated Edman degradation of the protein and selected fragments. Cleavage of the protein by enzymatic and chemical techniques established an unambiguous sequence for the protein. Lol p II contains 97 amino acid residues, with a calculated molecular weight of 10,882. The protein lacks cysteine and glutamine and shows no evidence of glycosylation. Theoretical predictions by Fraga's (Fraga, S. (1982) Can. J. Chem. 60, 2606-2610) and Hopp and Woods' (Hopp, T. P., and Woods, K. R. (1981) Proc. Natl. Acad. Sci. U.S.A. 78, 3824-3828) methods indicate the presence of four hydrophilic regions, which may contribute to sequential or parts of conformational B-cell epitopes. Analysis of amphipathic regions by Berzofsky's method indicates the presence of a highly amphipathic region, which may contain, or contribute to, an Ia/T-cell epitope. This latter segment of Lol p II was found to be highly homologous with an antibody-binding segment of the major rye allergen Lol p I and may explain why immune responsiveness to both the allergens is associated with HLA-DR3.

  8. Sequence homology between 4qter and 10qter loci facilitates the instability of subtelomeric KpnI repeat units implicated in facioscapulohumeral muscular dystrophy.

    PubMed Central

    Cacurri, S; Piazzo, N; Deidda, G; Vigneti, E; Galluzzi, G; Colantoni, L; Merico, B; Ricci, E; Felicetti, L

    1998-01-01

    Physical mapping and in situ hybridization experiments have shown that a duplicated locus with a structural organization similar to that of the 4q35 locus implicated in facioscapulohumeral muscular dystrophy is present in the subtelomeric portion of 10q. We performed sequence analysis of the p13E-11 probe and of the adjacent KpnI tandem-repeat unit derived from a 10qter cosmid clone and compared our results with those published, by other laboratories, for the 4q35 region. We found that the sequence homology range is 98%-100% and confirmed that the only difference that can be exploited for differentiation of the 10qter from the 4q35 alleles is the presence of an additional BlnI site within the 10qter KpnI repeat unit. In addition, we observed that the high degree of sequence homology does facilitate interchromosomal exchanges resulting in displacement of the whole set of BlnI-resistant or BlnI-sensitive KpnI repeats from one chromosome to the other. However, partial translocations escape detection if the latter simply relies on the hybridization pattern from double digestion with EcoRI/BlnI and with p13E-11 as a probe. We discovered that the restriction enzyme Tru9I cuts at both ends of the array of KpnI repeats of different chromosomal origins and allows the use of cloned KpnI sequences as a probe by eliminating other spurious fragments. This approach coupled with BlnI digestion permitted us to investigate the structural organization of BlnI-resistant and BlnI-sensitive units within translocated chromosomes of 4q35 and 10q26 origin. A priori, the possibility that partial translocations could play a role in the molecular mechanism of the disease cannot be excluded. PMID:9634507

  9. Amino acid sequence of neurotoxin III of the scorpion Androctonus austrialis Hector.

    PubMed

    Kopeyan, C; Martinez, G; Rochat, H

    1979-03-01

    The amino acid sequence of neurotoxin III, purified from the venom of the North African scorpion Androctonus australis Hector, has been determined by Edman degradation using a liquid-phase sequencer. Carboxypeptidase A hydrolyses confirmed not only the sequence of the five last residues but also the presence of a free alpha-carboxylic group at the C-terminus. Edman degradation was conducted on one hand with the Quadrol [N,N,N',N'-tetrakis(2-hydroxypropyl)ethylene diamine] program and S-alkylated protein before or after coupling with sulfophenylisothiocynate (the first 34 residues were thus identified), on the other hand on tryptic and chymotryptic peptides with a dimethylbenzylamine program (residues 1--23 and 31--34 were confirmed, the positions of residues 35-64 were established). Neurotoxin III was found to belong to the same group of scorpion toxins active on mammals as neurotoxin I purified from the same venom (50 homologous positions exist in the two proteins).

  10. Identification of two homologous mitochondrial DNA sequences, which bind strongly and specifically to a mitochondrial protein of Paracentrotus lividus.

    PubMed Central

    Roberti, M; Mustich, A; Gadaleta, M N; Cantatore, P

    1991-01-01

    Using a combination of band shift and DNasel protection experiments, two Paracentrotus lividus mitochondrial sequences, able to bind tightly and selectively to a mitochondrial protein from sea urchin embryos, have been found. The two sequences, which compete with each other for binding to the protein, are located in two genome regions which are thought to contain regulatory signals for mitochondrial replication and transcription. A computer analysis suggests that the sequence TTTTRTANNTCYYATCAYA, common to the two binding regions, is the minimal recognition signal for the binding to the protein. We discuss the hypothesis that the protein binding capacity of these two sequences is involved in the control of sea urchin mtDNA replication during developmental stages. Images PMID:1956785

  11. Defining sequence space and reaction products within the cyanuric acid hydrolase (AtzD)/barbiturase protein family.

    PubMed

    Seffernick, Jennifer L; Erickson, Jasmine S; Cameron, Stephan M; Cho, Seunghee; Dodge, Anthony G; Richman, Jack E; Sadowsky, Michael J; Wackett, Lawrence P

    2012-09-01

    Cyanuric acid hydrolases (AtzD) and barbiturases are homologous, found almost exclusively in bacteria, and comprise a rare protein family with no discernible linkage to other protein families or an X-ray structural class. There has been confusion in the literature and in genome projects regarding the reaction products, the assignment of individual sequences as either cyanuric acid hydrolases or barbiturases, and spurious connection of this family to another protein family. The present study has addressed those issues. First, the published enzyme reaction products of cyanuric acid hydrolase are incorrectly identified as biuret and carbon dioxide. The current study employed (13)C nuclear magnetic resonance (NMR) spectroscopy and mass spectrometry to show that cyanuric acid hydrolase releases carboxybiuret, which spontaneously decarboxylates to biuret. This is significant because it revealed that homologous cyanuric acid hydrolases and barbiturases catalyze completely analogous reactions. Second, enzymes that had been annotated incorrectly in genome projects have been reassigned here by bioinformatics, gene cloning, and protein characterization studies. Third, the AtzD/barbiturase family has previously been suggested to consist of members of the amidohydrolase superfamily, a large class of metallohydrolases. Bioinformatics and the lack of bound metals both argue against a connection to the amidohydrolase superfamily. Lastly, steady-state kinetic measurements and observations of protein stability suggested that the AtzD/barbiturase family might be an undistinguished protein family that has undergone some resurgence with the recent introduction of industrial s-triazine compounds such as atrazine and melamine into the environment.

  12. Molecular cloning and sequencing of a cDNA encoding the thioesterase domain of the rat fatty acid synthetase.

    PubMed

    Naggert, J; Witkowski, A; Mikkelsen, J; Smith, S

    1988-01-25

    A cloned cDNA containing the entire coding sequence for the long-chain S-acyl fatty acid synthetase thioester hydrolase (thioesterase I) component as well as the 3'-noncoding region of the fatty acid synthetase has been isolated using an expression vector and domain-specific antibodies. The coding region was assigned to the thioesterase I domain by identification of sequences coding for characterized peptide fragments, amino-terminal analysis of the isolated thioesterase I domain and the presence of the serine esterase active-site sequence motif. The thioesterase I domain is 306 amino acids long with a calculated molecular mass of 33,476 daltons; its DNA is flanked at the 5'-end by a region coding for the acyl carrier protein domain and at the 3'-end by a 1,537-base pairs-long noncoding sequence with a poly(A) tail. The thioesterase I domain exhibits a low, albeit discernible, homology with the discrete medium-chain S-acyl fatty acid synthetase thioester hydrolases (thioesterase II) from rat mammary gland and duck uropygial gland, suggesting a distant but common evolutionary ancestry for these proteins.

  13. Quantum-Sequencing: Biophysics of quantum tunneling through nucleic acids

    NASA Astrophysics Data System (ADS)

    Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant

    2014-03-01

    Tunneling microscopy and spectroscopy has extensively been used in physical surface sciences to study quantum tunneling to measure electronic local density of states of nanomaterials and to characterize adsorbed species. Quantum-Sequencing (Q-Seq) is a new method based on tunneling microscopy for electronic sequencing of single molecule of nucleic acids. A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free single-molecule sequencing method. Here, we present the unique ``electronic fingerprints'' for all nucleotides on DNA and RNA using Q-Seq along their intrinsic biophysical parameters. We have analyzed tunneling spectra for the nucleotides at different pH conditions and analyzed the HOMO, LUMO and energy gap for all of them. In addition we show a number of biophysical parameters to further characterize all nucleobases (electron and hole transition voltage and energy barriers). These results highlight the robustness of Q-Seq as a technique for next-generation sequencing.

  14. Homology modeling and identification of amino acids involved in the catalytic process of Mycobacterium tuberculosis serine acetyltransferase.

    PubMed

    Qiu, Juanjuan; Zang, Shizhu; Ma, Yufang; Owusu, Lawrence; Zhou, Lei; Jiang, Tao; Xin, Yi

    2017-03-01

    Serine acetyltransferase (CysE) belongs to the hexapeptide acetyltransferase family and is involved in the biosynthesis of L‑cysteine in microorganisms. Mycobacterium tuberculosis CysE is regarded as a potential target for anti‑tuberculosis (TB) drugs; however, the structure and active sites of M. tuberculosis CysE remain unknown. The present study aimed to predict the secondary structure and to construct a 3D model for M. tuberculosis CysE using bioinformatics analysis. To determine the essential amino acids that are associated with CysE enzymatic activity, amino acid sequences from several microorganisms were compared, and a consensus sequence was identified. Subsequently, site‑directed mutagenesis was used to generate mutant M. tuberculosis CysE proteins. Enzyme assays demonstrated that D67A, H82A and H117A mutants abolished ~75% activity of M. tuberculosis CysE. Prediction of the protein structure and identification of the active amino acids for M. tuberculosis CysE is essential for designing inhibitors, which may aid the discovery of effective anti‑TB drugs.

  15. Cloning and sequence analysis of the StsI restriction-modification gene: presence of homology to FokI restriction-modification enzymes.

    PubMed Central

    Kita, K; Suisha, M; Kotani, H; Yanase, H; Kato, N

    1992-01-01

    StsI endonuclease (R.StsI), a type IIs restriction endonuclease found in Streptococcus sanguis 54, recognizes the same sequence as FokI but cleaves at different positions. A DNA fragment that carried the genes for R.StsI and StsI methylase (M.StsI) was cloned from the chromosomal DNA of S.sanguis 54, and its nucleotide sequence was analyzed. The endonuclease gene was 1,806 bp long, corresponding to a protein of 602 amino acid residues (M(r) = 68,388), and the methylase gene was 1,959 bp long, corresponding to a protein of 653 amino acid residues (M(r) = 76,064). The assignment of the endonuclease gene was confirmed by analysis of the N-terminal amino acid sequence. Genes for the two proteins were in a tail-to-tail orientation, separated by a 131-nucleotide intercistronic region. The predicted amino acid sequences between the StsI system and the FokI system showed a 49% identity between the methylases and a 30% identity between the endonucleases. The sequence comparison of M.StsI with various methylases showed that the N-terminal half of M.StsI matches M.NIaIII, and the C-terminal half matches adenine methylases that recognize GATC and GATATC. PMID:1387204

  16. Salicylic Acid Based Small Molecule Inhibitor for the Oncogenic Src Homology-2 Domain Containing Protein Tyrosine Phosphatase-2 (SHP2)

    SciTech Connect

    Zhang, Xian; He, Yantao; Liu, Sijiu; Yu, Zhihong; Jiang, Zhong-Xing; Yang, Zhenyun; Dong, Yuanshu; Nabinger, Sarah C.; Wu, Li; Gunawan, Andrea M.; Wang, Lina; Chan, Rebecca J.; Zhang, Zhong-Yin

    2010-08-13

    The Src homology-2 domain containing protein tyrosine phosphatase-2 (SHP2) plays a pivotal role in growth factor and cytokine signaling. Gain-of-function SHP2 mutations are associated with Noonan syndrome, various kinds of leukemias, and solid tumors. Thus, there is considerable interest in SHP2 as a potential target for anticancer and antileukemia therapy. We report a salicylic acid based combinatorial library approach aimed at binding both active site and unique nearby subpockets for enhanced affinity and selectivity. Screening of the library led to the identification of a SHP2 inhibitor II-B08 (compound 9) with highly efficacious cellular activity. Compound 9 blocks growth factor stimulated ERK1/2 activation and hematopoietic progenitor proliferation, providing supporting evidence that chemical inhibition of SHP2 may be therapeutically useful for anticancer and antileukemia treatment. X-ray crystallographic analysis of the structure of SHP2 in complex with 9 reveals molecular determinants that can be exploited for the acquisition of more potent and selective SHP2 inhibitors.

  17. Improved Homology Model of the Human all-trans Retinoic Acid Metabolizing Enzyme CYP26A1.

    PubMed

    Awadalla, Mohamed K A; Alshammari, Thamir M; Eriksson, Leif A; Saenz-Méndez, Patricia

    2016-03-15

    A new CYP26A1 homology model was built based on the crystal structure of cyanobacterial CYP120A1. The model quality was examined for stereochemical accuracy, folding reliability, and absolute quality using a variety of different bioinformatics tools. Furthermore, the docking capabilities of the model were assessed by docking of the natural substrate all-trans-retinoic acid (atRA), and a group of known azole- and tetralone-based CYP26A1 inhibitors. The preferred binding pose of atRA suggests the (4S)-OH-atRA metabolite production, in agreement with recently available experimental data. The distances between the ligands and the heme group iron of the enzyme are in agreement with corresponding distances obtained for substrates and azole inhibitors for other cytochrome systems. The calculated theoretical binding energies agree with recently reported experimental data and show that the model is capable of discriminating between natural substrate, strong inhibitors (R116010 and R115866), and weak inhibitors (liarozole, fluconazole, tetralone derivatives).

  18. Purification, characterization, gene cloning and nucleotide sequencing of D: -stereospecific amino acid amidase from soil bacterium: Delftia acidovorans.

    PubMed

    Hongpattarakere, Tipparat; Komeda, Hidenobu; Asano, Yasuhisa

    2005-12-01

    The D-amino acid amidase-producing bacterium was isolated from soil samples using an enrichment culture technique in medium broth containing D-phenylalanine amide as a sole source of nitrogen. The strain exhibiting the strongest activity was identified as Delftia acidovorans strain 16. This strain produced intracellular D-amino acid amidase constitutively. The enzyme was purified about 380-fold to homogeneity and its molecular mass was estimated to be about 50 kDa, on sodium dodecyl sulfate polyacrylamide gel electrophoresis. The enzyme was active preferentially toward D-amino acid amides rather than their L-counterparts. It exhibited strong amino acid amidase activity toward aromatic amino acid amides including D-phenylalanine amide, D-tryptophan amide and D-tyrosine amide, yet it was not specifically active toward low-molecular-weight D-amino acid amides such as D-alanine amide, L-alanine amide and L-serine amide. Moreover, it was not specifically active toward oligopeptides. The enzyme showed maximum activity at 40 degrees C and pH 8.5 and appeared to be very stable, with 92.5% remaining activity after the reaction was performed at 45 degrees C for 30 min. However, it was mostly inactivated in the presence of phenylmethanesulfonyl fluoride or Cd2+, Ag+, Zn2+, Hg2+ and As3+ . The NH2 terminal and internal amino acid sequences of the enzyme were determined; and the gene was cloned and sequenced. The enzyme gene damA encodes a 466-amino-acid protein (molecular mass 49,860.46 Da); and the deduced amino acid sequence exhibits homology to the D-amino acid amidase from Variovorax paradoxus (67.9% identity), the amidotransferase A subunit from Burkholderia fungorum (50% identity) and other enantioselective amidases.

  19. Molecular association of normal alkanoic acids with their thallium(I) salts: a new homologous series of fatty acid metal soaps.

    PubMed

    Fernández-García, M; García, M V; Redondo, M I; Cheda, J A; Fernández-García, M; Westrum, E F; Fernández-Martín, F

    1997-02-01

    A new homologous series of thallium(I) hydrogen dialkanoates, fatty acid thallium soaps, from the dipropane up to the ditetradecane is reported for the first time. This association with 1:1 stoichiometry is the only one exhibited by the thallium derivatives. They have been prepared by solidification of molten mixtures with equimolar proportions of acid and corresponding neutral salt, through crystallization from an anhydrous ethanolic solution of the mixture has also been successful in getting pure compounds with largest chain lengths. Vibrational spectroscopies clearly characterize these crystalline compounds as very strong hydrogen bonding systems. Assignations of active modes in proton and carbon nuclear magnetic resonance spectrometry (NMR) (in ethanol) and infrared (IR) and Raman spectra (in solid state) are reported. According to X-ray diffraction (XRD) they have monomolecular lamellar structures with the acyl chains arranged up and down to the cation/H-bond network in a methyl-to-methyl fashion, and vertically oriented to the basal plane. The acyl chains present all-trans conformation and alternating configuration (perpendicular orthorhombic subcell), like the beta'-phases of other kinds of lipids. Lamellar thickness is reported for the six room-temperature crystalline members. The molecular compounds present polymorphism, one crystal/crystal transition at temperatures close to the peritectical melting. Phase transition thermodynamics are also given and discussed with respect to their acid and salt parents. Their incongruent melting involves nearly 90% of the total enthalpic increments of both constituents' melting processes, making these compounds potential thermal energy storage materials.

  20. Nucleic acid sequence detection using multiplexed oligonucleotide PCR

    DOEpatents

    Nolan, John P.; White, P. Scott

    2006-12-26

    Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.

  1. The amino acid sequence of chymopapain from Carica papaya.

    PubMed Central

    Watson, D C; Yaguchi, M; Lynn, K R

    1990-01-01

    Chymopapain is a polypeptide of 218 amino acid residues. It has considerable structural similarity with papain and papaya proteinase omega, including conservation of the catalytic site and of the disulphide bonding. Chymopapain is like papaya proteinase omega in carrying four extra residues between papain positions 168 and 169, but differs from both papaya proteinases in the composition of its S2 subsite, as well as in having a second thiol group, Cys-117. Some evidence for the amino acid sequence of chymopapain has been deposited as Supplementary Publication SUP 50153 (12 pages) at the British Library Document Supply Centre, Boston Spa., Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms indicated in Biochem. J. (1990) 265, 5. The information comprises Supplement Tables 1-4, which contain, in order, amino acid compositions of peptides from tryptic, peptic, CNBr and mild acid cleavages, Supplement Fig. 1, showing re-fractionation of selected peaks from Fig. 2 of the main paper. Supplement Fig. 2, showing cation-exchange chromatography of the earliest-eluted peak of Fig. 3 of the main paper, Supplement Fig. 3, showing reverse-phase h.p.l.c. of the later-eluted peak from Fig. 3 of the main paper, and Supplement Fig. 4, showing the separation of peptides after mild acid hydrolysis of CNBr-cleavage fragment CB3. PMID:2106878

  2. Triose phosphate isomerase from the coelacanth. An approach to the rapid determination of an amino acid sequence with small amounts of material.

    PubMed

    Kolb, E; Harris, J I; Bridgen, J

    1974-02-01

    The preparation and purification of cyanogen bromide fragments from [(14)C]carboxymethylated coelacanth triose phosphate isomerase is presented. The automated sequencing of these fragments, the lysine-blocked tryptic peptides derived from them, and also of the intact protein, is described. Combination with results from manual sequence analysis has given the 247-residue amino acid sequence of coelacanth triose phosphate isomerase in 4 months, by using 100mg of enzyme. (Two small adjacent peptides were placed by homology with the rabbit enzyme.) Comparison of this sequence with that of the rabbit muscle enzyme shows that 207 (84%) of the residues are identical. This slow rate of evolutionary change (corresponding to two amino acid substitutions per 100 residues per 100 million years) is similar to that found for glyceraldehyde 3-phosphate dehydrogenase. The reliability of sequence information obtained by automated methods is discussed.

  3. The amino-acid sequence of the 2S sulphur-rich proteins from seeds of Brazil nut (Bertholletia excelsa H.B.K.).

    PubMed

    Ampe, C; Van Damme, J; de Castro, L A; Sampaio, M J; Van Montagu, M; Vandekerckhove, J

    1986-09-15

    Storage proteins of the albumin solubility fraction from seeds of Bertholletia excelsa H.B.K. were separated by reversed-phase high-performance liquid chromatography and their primary structures were determined by gas-phase sequencing on intact polypeptides and on the overlapping tryptic and thermolysin peptides. The 2S storage proteins consist of two subunits linked by disulphide bridges. The large subunit (8.5 kDa) is expressed in at least six different isoforms while the small subunit (3.6 kDa) consists of only one form. These proteins are extremely rich in glutamine, glutamic acid, arginine and the sulphur-containing amino acids cysteine and methionine. One of the variants even contains a sequence of six methionine residues in a row. Comparison with known sequences of 2S proteins of other dicotyledonous plants shows limited but distinct sequence homology. In particular, the positions of the cysteine residues relative to each other appear to be completely conserved, suggesting that tertiary structure constraints imposed by disulphide bridges dominate sequence conservation. It has been proposed that the two subunits of a related protein (the Brassica napus storage protein) is cleaved from a precursor polypeptide [Crouch, M. L., Tenbarge, K. M., Simon, A. E. & Ferl, R. (1983) J. Mol. Appl. Genet. 2,273-283]. The amino acid sequence homology of the Brazil nut protein with the former suggests that a similar protein processing event could occur.

  4. Determining structure and function of steroid dehydrogenase enzymes by sequence analysis, homology modeling, and rational mutational analysis.

    PubMed

    Duax, William L; Thomas, James; Pletnev, Vladimir; Addlagatta, Anthony; Huether, Robert; Habegger, Lukas; Weeks, Charles M

    2005-12-01

    The short-chain oxidoreductase (SCOR) family of enzymes includes over 6,000 members identified in sequenced genomes. Of these enzymes, approximately 300 have been characterized functionally, and the three-dimensional crystal structures of approximately 40 have been reported. Since some SCOR enzymes are steroid dehydrogenases involved in hypertension, diabetes, breast cancer, and polycystic kidney disease, it is important to characterize the other members of the family for which the biological functions are currently unknown and to determine their three-dimensional structure and mechanism of action. Although the SCOR family appears to have only a single fully conserved residue, it was possible, using bioinformatics methods, to determine characteristic fingerprints composed of 30-40 residues that are conserved at the 70% or greater level in SCOR subgroups. These fingerprints permit reliable prediction of several important structure-function features including cofactor preference, catalytic residues, and substrate specificity. Human type 1 3beta-hydroxysteroid dehydrogenase isomerase (3beta-HSDI) has 30% sequence identity with a human UDP galactose 4-epimerase (UDPGE), a SCOR family enzyme for which an X-ray structure has been reported. Both UDPGE and 3-HSDI appear to trace their origins back to bacterial 3alpha,20beta-HSD. Combining three-dimensional structural information and sequence data on the 3alpha,20beta-HSD, UDPGE, and 3beta-HSDI subfamilies with mutational analysis, we were able to identify the residues critical to the dehydrogenase function of 3-HSDI. We also identified the residues most probably responsible for the isomerase activity of 3beta-HSDI. We test our predictions by specific mutations based on sequence analysis and our structure-based model.

  5. A case of orthologous sequences of hemocyanin subunits for an evolutionary study of horseshoe crabs: amino acid sequence comparison of immunologically identical subunits of Carcinoscorpius rotundicauda and Tachypleus tridentatus.

    PubMed

    Sugita, H; Shishikura, F

    1995-10-01

    About 83% of the amino acid sequence of hemocyanin subunit HR6 from the Southeast Asian horseshoe crab, Carcinoscorpius rotundicauda, has been determined. There is a difference of about 43% between HR6 and complete sequences of chelicerate hemocyanin subunits from the American horseshoe crab, Limulus polyphemus, and a tarantula, Eurypelma californicum. However, the immunologically identical subunits HR6 and HT6 from Tachypleus tridentatus (Japanese horseshoe crab) show 2.7% sequence difference. Based on the amino acid sequences of HR6 and HT6, the divergence between C. rotundicauda and T. tridentatus occurred about 9.6 million years ago. In the case of horseshoe crab hemocyanin subunits, it seems that the orthologous homologues in many homologous subunits between species are immunologically detectable.

  6. A new method for sex determination based on detection of SRY, STS and amelogenin gene regions with simultaneous amplification of their homologous sequences by a multiplex PCR.

    PubMed

    Morikawa, Toshio; Yamamoto, Yuji; Miyaishi, Satoru

    2011-04-01

    We have developed a new method for sex determination based on simultaneous detection of the SRY (sex-determining region Y), STS (steroid sulfatase) and amelogenin (AMELX and AMELY) gene regions and their homologous sequences. The sex of 246 blood samples was correctly determined by this method. An AMELY-deleted male sample, which would have been erroneously considered female based solely on analysis of the amelogenin locus, was successfully identified as male by the present method. The detection limit of this method was 63 pg of genomic DNA, and the male DNA component could be detected from mixed samples having a male:female ratio as low as 1:10. This method was useful for degraded DNA and possessed the human specificity. Practical application to 35 autopsy cases is described.

  7. Ultrasensitive nucleic acid sequence detection by single-molecule electrophoresis

    SciTech Connect

    Castro, A; Shera, E.B.

    1996-09-01

    This is the final report of a one-year laboratory-directed research and development project at Los Alamos National Laboratory. There has been considerable interest in the development of very sensitive clinical diagnostic techniques over the last few years. Many pathogenic agents are often present in extremely small concentrations in clinical samples, especially at the initial stages of infection, making their detection very difficult. This project sought to develop a new technique for the detection and accurate quantification of specific bacterial and viral nucleic acid sequences in clinical samples. The scheme involved the use of novel hybridization probes for the detection of nucleic acids combined with our recently developed technique of single-molecule electrophoresis. This project is directly relevant to the DOE`s Defense Programs strategic directions in the area of biological warfare counter-proliferation.

  8. Cloning and Sequence Analysis of cDNAs Encoding Two Acidic PLA(2) from venom of Ophiophagus hannah(King Cobra), Guangxi Species.

    PubMed

    Wang, Qiu-Yan; Shu, Yu-Yan; Zhuang, Mao-Xing; Lin, Zheng-Jiong

    2001-01-01

    Total RNA was extracted from venom glands of Ophiophagus hannah, Guangxi species. The cDNAs encoding PLA(2) were amplified by RT-PCR and cloned into the PUCm-T vector. The positive clones encoding two acidic PLA(2) (APLA(2)-1 and APLA(2)-2) were selected and bidirectionally sequenced. Their complete amino acid sequences were deduced and found to be identical to the known amino acid sequences. Their isoelectric points calculated by computer agreed with the values determined with their protein. Homology analysis indicated that the mature peptide of APLA(2)-1 had high homology with PLA(2) from venoms of Ophiophagus hannah, Fujian and Taiwan species, but APLA(2)-2 had lower homology. The most striking difference between APLA(2)-2 and other PLA(2) from Ophiophagus hannah venoms is the missing of a extra "pancreatic loop" at residues 62--66 in APLA(2)-2, and it may be related to their species evolution and biological activity.

  9. Alcohol homologation

    DOEpatents

    Wegman, R.W.; Moloy, K.G.

    1988-02-23

    A process is described for the homologation of an alkanol by reaction with synthesis gas in contact with a system containing rhodium atom, ruthenium atom, iodine atom and a bis(diorganophosphino) alkane to selectivity produce the next higher homologue.

  10. Alcohol homologation

    DOEpatents

    Wegman, Richard W.; Moloy, Kenneth G.

    1988-01-01

    A process for the homologation of an alkanol by reaction with synthesis gas in contact with a system containing rhodium atom, ruthenium atom, iodine atom and a bis(diorganophosphino) alkane to selectivity produce the next higher homologue.

  11. Cloning, sequence, and developmental expression of a type 5, tartrate-resistant, acid phosphatase of rat bone.

    PubMed

    Ek-Rylander, B; Bill, P; Norgård, M; Nilsson, S; Andersson, G

    1991-12-25

    Tartrate-resistant acid phosphatase (TRAP) is a characteristic constituent of osteoclasts and some mononuclear preosteoclasts and, therefore, used as a histochemical and biochemical marker for osteoclasts and bone resorption. We now report the isolation of a 1397-base pair (bp) full-length TRAP/tartrate-resistant acid ATPase (TrATPase) cDNA clone from a neonatal rat calvaria lambda gt11 cDNA library. The cDNA clone consists of a 92-bp untranslated 5'-flank, an open reading frame of 981 bp and a 324-bp untranslated 3'-poly(A)-containing region. The deduced protein sequence of 327 amino acids contains a putative cleavable signal sequence of 21 amino acids. The mature polypeptide of 306 amino acids has a calculated Mr of 34,350 Da and a pI of 9.18, and it contains two potential N-glycosylation sites and the lysosomal targeting sequence DKRFQ. At the protein level, the sequence displays 89-94% homology to TRAP enzymes from human placenta, beef spleen, and uteroferrin and identity to the N terminus of purified rat bone TRAP/TrATPase. An N-terminal amino acid segment is strikingly homologous to the corresponding region in lysosomal and prostatic acid phosphatases. The cDNA recognized a 1.5-kilobase mRNA in long bones and calvaria, and in vitro translation using, as template, mRNA transcribed from the full-length insert yielded an immunoprecipitated product of 34 kDa. In neonatal rats, TRAP/TrATPase mRNA was highly expressed in skeletal tissues, with much lower (less than 10%) levels detected in spleen, thymus, liver, skin, brain, kidney, brain, lung, and heart. In situ hybridization demonstrated specific labeling of osteoclasts at endostal surfaces and bone trabeculae of long bones. Thus, despite the apparent similarity of this osteoclastic TRAP/TrATPase with type 5, tartrate-resistant and purple, acid phosphatases expressed in other mammalian tissues, this gene appears to be preferentially expressed at skeletal sites.

  12. Homology analysis of pathogenic Yersinia species Yersinia enterocolitica, Yersinia pseudotuberculosis, and Yersinia pestis based on multilocus sequence typing.

    PubMed

    Duan, Ran; Liang, Junrong; Shi, Guoxiang; Cui, Zhigang; Hai, Rong; Wang, Peng; Xiao, Yuchun; Li, Kewei; Qiu, Haiyan; Gu, Wenpeng; Du, Xiaoli; Jing, Huaiqi; Wang, Xin

    2014-01-01

    We developed a multilocus sequence typing (MLST) scheme and used it to study the population structure and evolutionary relationships of three pathogenic Yersinia species. MLST of these three Yersinia species showed a complex of two clusters, one composed of Yersinia pseudotuberculosis and Yersinia pestis and the other composed of Yersinia enterocolitica. Within the first cluster, the predominant Y. pestis sequence type 90 (ST90) was linked to Y. pseudotuberculosis ST43 by one locus difference, and 81.25% of the ST43 strains were from serotype O:1b, supporting the hypothesis that Y. pestis descended from the O:1b serotype of Y. pseudotuberculosis. We also found that the worldwide-prevalent serotypes O:1a, O:1b, and O:3 were predominated by specific STs. The second cluster consisted of pathogenic and nonpathogenic Y. enterocolitica strains, two of which may not have identical STs. The pathogenic Y. enterocolitica strains formed a relatively conserved group; most strains clustered within ST186 and ST187. Serotypes O:3, O:8, and O:9 were separated into three distinct blocks. Nonpathogenic Y. enterocolitica STs were more heterogeneous, reflecting genetic diversity through evolution. By providing a better and effective MLST procedure for use with the Yersinia community, valuable information and insights into the genetic evolutionary differences of these pathogens were obtained.

  13. Nucleic acid (cDNA) and amino acid sequences of alpha-type gliadins from wheat (Triticum aestivum).

    PubMed Central

    Kasarda, D D; Okita, T W; Bernardin, J E; Baecker, P A; Nimmo, C C; Lew, E J; Dietler, M D; Greene, F C

    1984-01-01

    The complete amino acid sequence for an alpha-type gliadin protein of wheat (Triticum aestivum Linnaeus) endosperm has been derived from a cloned cDNA sequence. An additional cDNA clone that corresponds to about 75% of a similar alpha-type gliadin has been sequenced and shows some important differences. About 97% of the composite sequence of A-gliadin (an alpha-type gliadin fraction) has also been obtained by direct amino acid sequencing. This sequence shows a high degree of similarity with amino acid sequences derived from both cDNA clones and is virtually identical to one of them. On the basis of sequence information, after loss of the signal sequence, the mature alpha-type gliadins may be divided into five different domains, two of which may have evolved from an ancestral gliadin gene, whereas the remaining three contain repeating sequences that may have developed independently. Images PMID:6589619

  14. Molecular cloning of cDNA for the zeta isoform of the 14-3-3 protein: homologous sequences in the 3'-untranslated region of frog and human zeta isoforms.

    PubMed

    Miura, I; Nakajima, T; Ohtani, H; Kashiwagi, A; Nakamura, M

    1997-10-01

    14-3-3 proteins constitute a family of well-conserved eukaryotic proteins that possess diverse biochemical activities such as regulation of gene transcription, cell proliferation and activation of protein kinase C. At least 7 subtypes (alpha to theta) of 14-3-3 protein are known, but the zeta subtype of this protein has been cloned only in mammals. We cloned the zeta subtype of 14-3-3 protein (14-3-3 zeta) from the frog, Rana rugosa. The sequence encoded 245 amino acids that share 92% identity with rat and bovine 14-3-3 zeta s, and 92% with human phospholipase A2 (PLA2; 14-3-3 zeta). Northern blot analysis revealed a single band of about 1.8 kb in tadpoles at stage 25. The 14-3-3 zeta mRNA level was high in the brain, lung, spleen and kidney, and low in the heart and testis, as opposed to the mRNA level, which was only faintly detected in the liver, pancreas, ovary and muscle. Furthermore, high similarity in the 3'-untranslated region (3'-UTR) was observed between frog and human 14-3-3 zeta cDNA. The results suggest that 14-3-3 zeta is highly conserved throughout eukaryotic evolution, and that the homologous sequence in the 3'-UTR of 14-3-3 zeta cDNA may be conserved in frogs and humans.

  15. Homology-Based Modeling of Universal Stress Protein from Listeria innocua Up-Regulated under Acid Stress Conditions

    PubMed Central

    Tremonte, Patrizio; Succi, Mariantonietta; Coppola, Raffaele; Sorrentino, Elena; Tipaldi, Luca; Picariello, Gianluca; Pannella, Gianfranco; Fraternali, Franca

    2016-01-01

    An Universal Stress Protein (USP) expressed under acid stress condition by Listeria innocua ATCC 33090 was investigated. The USP was up-regulated not only in the stationary phase but also during the exponential growth phase. The three dimensional (3D) structure of USP was predicted using a combined proteomic and bioinformatics approach. Phylogenetic analysis showed that the USP from Listeria detected in our study was distant from the USPs of other bacteria (such as Pseudomonas spp., Escherichia coli, Salmonella spp.) and clustered in a separate and heterogeneous class including several USPs from Listeria spp. and Lactobacillus spp. An important information on the studied USP was obtained from the 3D-structure established through the homology modeling procedure. In detail, the Model_USP-691 suggested that the investigated USP had a homo-tetrameric quaternary structure. Each monomer presented an architecture analogous to the Rossmann-like α/β-fold with five parallel β-strands, and four α-helices. The analysis of monomer-monomer interfaces and quality of the structure alignments confirmed the model reliability. In fact, the structurally and sequentially conserved hydrophobic residues of the β-strand 5 (in particular the residues V146 and V148) were involved in the inter-chains contact. Moreover, the highly conserved residues I139 and H141 in the region α4 were involved in the dimer association and functioned as hot spots into monomer–monomer interface assembly. The hypothetical assembly of dimers was also supported by the large interface area and by the negative value of solvation free energy gain upon interface interaction. Finally, the structurally conserved ATP-binding motif G-2X-G-9X-G(S/T-N) suggested for a putative role of ATP in stabilizing the tetrameric assembly of the USP. Therefore, the results obtained from a multiple approach, consisting in the application of kinetic, proteomic, phylogenetic and modeling analyses, suggest that Listeria USP could

  16. Structural gene and complete amino acid sequence of Vibrio alginolyticus collagenase.

    PubMed Central

    Takeuchi, H; Shibano, Y; Morihara, K; Fukushima, J; Inami, S; Keil, B; Gilles, A M; Kawamoto, S; Okuda, K

    1992-01-01

    The DNA encoding the collagenase of Vibrio alginolyticus was cloned, and its complete nucleotide sequence was determined. When the cloned gene was ligated to pUC18, the Escherichia coli expression vector, bacteria carrying the gene exhibited both collagenase antigen and collagenase activity. The open reading frame from the ATG initiation codon was 2442 bp in length for the collagenase structural gene. The amino acid sequence, deduced from the nucleotide sequence, revealed that the mature collagenase consists of 739 amino acids with an Mr of 81875. The amino acid sequences of 20 polypeptide fragments were completely identical with the deduced amino acid sequences of the collagenase gene. The amino acid composition predicted from the DNA sequence was similar to the chemically determined composition of purified collagenase reported previously. The analyses of both the DNA and amino acid sequences of the collagenase gene were rigorously performed, but we could not detect any significant sequence similarity to other collagenases. Images Fig. 2. PMID:1311172

  17. Evolution of phosphagen kinase V. cDNA-derived amino acid sequences of two molluscan arginine kinases from the chiton Liolophura japonica and the turbanshell Battilus cornutus.

    PubMed

    Suzuki, T; Ban, T; Furukohri, T

    1997-06-20

    The cDNAs of arginine kinases from the chiton Liolophura japonica (Polyplacophora) and the turbanshell Battilus cornutus (Gastropoda) were amplified by polymerase chain reaction (PCR), and the complete nucleotide sequences of 1669 and 1624 bp, respectively, were determined. The open reading frame for Liolophura arginine kinase is 1050 nucleotides in length and encodes a protein with 349 amino acid residues, and that for Battilus is 1077 nucleotides and 358 residues. The validity of the cDNA-derived amino acid sequence was supported by chemical sequencing of internal tryptic peptides. The molecular masses were calculated to be 39,057 and 39,795 Da, respectively. The amino acid sequence of Liolophura arginine kinase showed 65-68% identity with those of Battilus and Nordotis (abalone) arginine kinases, and the homology between Battilus and Nordotis was 79%. Molluscan arginine kinases also show lower, but significant homology (38-43%) with rabbit creatine kinase. The sequences of arginine kinases could be used as a molecular clock to elucidate the phylogeny of Mollusca, one of the most diverse animal phyla.

  18. Cloning, sequence analysis, and expression in Escherichia coli of the gene encoding an alpha-amino acid ester hydrolase from Acetobacter turbidans.

    PubMed

    Polderman-Tijmes, Jolanda J; Jekel, Peter A; de Vries, Erik J; van Merode, Annet E J; Floris, René; van der Laan, Jan-Metske; Sonke, Theo; Janssen, Dick B

    2002-01-01

    The alpha-amino acid ester hydrolase from Acetobacter turbidans ATCC 9325 is capable of hydrolyzing and synthesizing beta-lactam antibiotics, such as cephalexin and ampicillin. N-terminal amino acid sequencing of the purified alpha-amino acid ester hydrolase allowed cloning and genetic characterization of the corresponding gene from an A. turbidans genomic library. The gene, designated aehA, encodes a polypeptide with a molecular weight of 72,000. Comparison of the determined N-terminal sequence and the deduced amino acid sequence indicated the presence of an N-terminal leader sequence of 40 amino acids. The aehA gene was subcloned in the pET9 expression plasmid and expressed in Escherichia coli. The recombinant protein was purified and found to be dimeric with subunits of 70 kDa. A sequence similarity search revealed 26% identity with a glutaryl 7-ACA acylase precursor from Bacillus laterosporus, but no homology was found with other known penicillin or cephalosporin acylases. There was some similarity to serine proteases, including the conservation of the active site motif, GXSYXG. Together with database searches, this suggested that the alpha-amino acid ester hydrolase is a beta-lactam antibiotic acylase that belongs to a class of hydrolases that is different from the Ntn hydrolase superfamily to which the well-characterized penicillin acylase from E. coli belongs. The alpha-amino acid ester hydrolase of A. turbidans represents a subclass of this new class of beta-lactam antibiotic acylases.

  19. Molecular cloning, nucleotide sequence, and abscisic acid induction of a suberization-associated highly anionic peroxidase.

    PubMed

    Roberts, E; Kolattukudy, P E

    1989-06-01

    A highly anionic peroxidase induced in suberizing cells was suggested to be the key enzyme involved in polymerization of phenolic monomers to generate the aromatic matrix of suberin. The enzyme encoded by a potato cDNA was found to be highly homologous to the anionic peroxidase induced in suberizing tomato fruit. A tomato genomic library was screened using the potato anionic peroxidase cDNA and one genomic clone was isolated that contained two tandemly oriented anionic peroxidase genes. These genes were sequenced and were 96% and 87% identical to the mRNA for potato anionic peroxidase. Both genes consist of three exons with the relative positions of their two introns being conserved between the two genes. Primer extension analysis showed that only one of the genes is expressed in the periderm of 3 day wound-healed tomato fruits. Southern blot analyses suggested that there are two copies each of the two highly homologous genes per haploid genome in both potato and tomato. Abscisic acid (ABA) induced the accumulation of the anionic peroxidase transcripts in potato and tomato callus tissues. Northern blots showed that peroxidase mRNA was detectable at 2 days and was maximal at 8 days after transfer of potato callus to solid agar media containing 10(-4) M ABA. The transcripts induced by ABA in both potato and tomato callus were identical in size to those induced in wound-healing potato tuber and tomato fruit. The anionic peroxidase peptide was detected in extracts of potato callus grown on the ABA-containing media by western blot analysis. The results support the suggestion that stimulation of suberization by ABA involves the induction of the highly anionic peroxidase.

  20. Specific transcription of an adenoviral gene that possesses no TATA sequence homology in extracts of HeLa cells.

    PubMed

    Leong, K; Flint, S J

    1984-09-25

    Transcription of the adenovirus type 2 (Ad2) IVa2 gene, which contains no TATA-like sequence in the region immediately upstream of the IVa2 cap sites (Baker, C. C., and Ziff, E. B. (1981) J. Mol. Biol. 149, 189-221), has been examined in extracts of HeLa cells (Manley, J. L., Fire, A., Cano, A., Sharp, P. A., and Gefter, M.L. (1980) Proc. Natl. Acad. Sci. U.S.A. 77, 3855-3859). Run-off transcripts of the predicted length of those initiated at the IVa2 cap sites were synthesized from different Ad2 DNA templates, each of which also contained the major late transcriptional control region. Mapping of the 5' ends of the RNA made from one template by a nuclease protection assay established the fidelity of initiation of IVa2 transcription in vitro. The efficiency of IVa2 expression in whole HeLa extracts was influenced quite dramatically by monovalent and divalent metal ion concentrations and the concentration of extract protein present in the reaction mixture. Under certain conditions, IVa2 run-off transcripts were made almost as efficiently as those from the Ad2 major late transcriptional control region. However, conditions promoting optimal IVa2 transcription in vitro did not favor recognition of the major late transcriptional control region, and vice versa: the synthesis of IVa2 and major late run-off transcripts responded differently to all parameters tested.

  1. Amino acid sequence of myoglobin from the chiton Liolophura japonica and a phylogenetic tree for molluscan globins.

    PubMed

    Suzuki, T; Furukohri, T; Okamoto, S

    1993-02-01

    Myoglobin was isolated from the radular muscle of the chiton Liolophura japonica, a primitive archigastropodic mollusc. Liolophura contains three monomeric myoglobins (I, II, and III), and the complete amino acid sequence of myoglobin I has been determined. It is composed of 145 amino acid residues, and the molecular mass was calculated to be 16,070 D. The E7 distal histidine, which is replaced by valine or glutamine in several molluscan globins, is conserved in Liolophura myoglobin. The autoxidation rate at physiological conditions indicated that Liolophura oxymyoglobin is fairly stable when compared with other molluscan myoglobins. The amino acid sequence of Liolophura myoglobin shows low homology (11-21%) with molluscan dimeric myoglobins and hemoglobins, but shows higher homology (26-29%) with monomeric myoglobins from the gastropodic molluscs Aplysia, Dolabella, and Bursatella. A phylogenetic tree was constructed from 19 molluscan globin sequences. The tree separated them into two distinct clusters, a cluster for muscle myoglobins and a cluster for erythrocyte or gill hemoglobins. The myoglobin cluster is divided further into two subclusters, corresponding to monomeric and dimeric myoglobins, respectively. Liolophura myoglobin was placed on the branch of monomeric myoglobin lineage, showing that it diverged earlier from other monomeric myoglobins. The hemoglobin cluster is also divided into two subclusters. One cluster contains homodimeric, heterodimeric, tetrameric, and didomain chains of erythrocyte hemoglobins of the blood clams Anadara, Scapharca, and Barbatia. Of special interest is the other subcluster. It consists of three hemoglobin chains derived from the bacterial symbiontharboring clams Calyptogena and Lucina, in which hemoglobins are supposed to play an important role in maintaining the symbiosis with sulfide bacteria.

  2. Amino acid sequence of a neurotoxic phospholipase A2 enzyme from common death adder (Acanthophis antracticus) venom.

    PubMed

    van der Weyden, L; Hains, P; Broady, K; Shaw, D; Milburn, P

    2001-02-01

    The amino acid sequence of the first neurotoxic phospholipase A2, acanthoxin A1, purified from the venom of the Common death adder (Acanthophis antarcticus) was determined. Acanthoxin A1 shows high homology with other Australian elapid PLA2 neurotoxins, in particular Acanthin-I and -II, also from Death adder, Pseudexin A from the Red-bellied black snake (Pseudechis porphyriacus), and Pa-12a and Pa-9c from the King brown snake (Pseudechis australis). Acanthoxin A1 is a single-chain 118 amino acid residue PLA2, including 14 half cystine residues and the essential residues forming the ubiquitous calcium binding pocket and catalytic site. Critical analysis of the residues hypothesized to be important for neurotoxicity is presented.

  3. Sequence analysis of a gene cluster involved in metabolism of 2,4,5-trichlorophenoxyacetic acid by Burkholderia cepacia AC1100.

    PubMed Central

    Daubaras, D L; Hershberger, C D; Kitano, K; Chakrabarty, A M

    1995-01-01

    Burkholderia cepacia AC1100 utilizes 2,4,5-trichlorophenoxyacetic acid (2,4,5-T) as a sole source of carbon and energy. PT88 is a chromosomal deletion mutant of B. cepacia AC1100 and is unable to grow on 2,4,5-T. The nucleotide sequence of a 5.5-kb chromosomal fragment from B. cepacia AC1100 which complemented PT88 for growth on 2,4,5-T was determined. The sequence revealed the presence of six open reading frames, designated ORF1 to ORF6. Five polypeptides were produced when this DNA region was under control of the T7 promoter in Escherichia coli; however, no polypeptide was produced from the fourth open reading frame, ORF4. Homology searches of protein sequence databases were performed to determine if the proteins involved in 2,4,5-T metabolism were similar to other biodegradative enzymes. In addition, complementation studies were used to determine which genes were essential for the metabolism of 2,4,5-T. The first gene of the cluster, ORF1, encoded a 37-kDa polypeptide which was essential for complementation of PT88 and showed significant homology to putative trans-chlorodienelactone isomerases. The next gene, ORF2, was necessary for complementation and encoded a 47-kDa protein which showed homology to glutathione reductases. ORF3 was not essential for complementation; however, both the 23-kDa protein encoded by ORF3 and the predicted amino acid sequence of ORF4 showed homology to glutathione S-transferases. ORF5, which encoded an 11-kDa polypeptide, was essential for growth on 2,4,5-T, but the amino acid sequence did not show homology to those of any known proteins. The last gene of the cluster, ORF6, was necessary for complementation of PT88, and the 32-kDa protein encoded by this gene showed homology to catechol and chlorocatechol-1,2-dioxygenases. PMID:7538273

  4. Complete amino acid sequence of an acidic, cardiotoxic phospholipase A2 from the venom of Ophiophagus hannah (King Cobra): a novel cobra venom enzyme with "pancreatic loop".

    PubMed

    Huang, M Z; Gopalakrishnakone, P; Chung, M C; Kini, R M

    1997-02-15

    A phospholipase A2 (OHV A-PLA2) from the venom of Ophiophagus hannah (King cobra) is an acidic protein exhibiting cardiotoxicity, myotoxicity, and antiplatelet activity. The complete amino acid sequence of OHV A-PLA2 has been determined using a combination of Edman degradation and mass spectrometric techniques. OHV A-PLA2 is composed of a single chain of 124 amino acid residues with 14 cysteines and a calculated molecular weight of 13719 Da. It contains the loop of residues (62-66) found in pancreatic PLA2s and hence belongs to class IB enzymes. This pancreatic loop is between two proline residues (Pro 59 and Pro 68) and contains several hydrophilic amino acids (Ser and Asp). This region has high degree of conformational flexibility and is on the surface of the molecule, and hence it may be a potential protein-protein interaction site. A relatively low sequence homology is found between OHV A-PLA2 and other known cardiotoxic PLA2s, and hence a contiguous segment could not be identified as a site responsible for the cardiotoxic activity.

  5. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  6. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2012-07-01 2012-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  7. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2014-07-01 2014-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  8. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  9. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2013-07-01 2013-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide...

  10. Purification to homogeneity and partial amino acid sequence of a fragment which includes the methyl acceptor site of the human DNA repair protein for O6-methylguanine.

    PubMed

    Major, G N; Gardner, E J; Carne, A F; Lawley, P D

    1990-03-25

    DNA repair by O6-methylguanine-DNA methyltransferase (O6-MT) is accomplished by removal by the enzyme of the methyl group from premutagenic O6-methylguanine-DNA, thereby restoring native guanine in DNA. The methyl group is transferred to an acceptor site cysteine thiol group in the enzyme, which causes the irreversible inactivation of O6-MT. We detected a variety of different forms of the methylated, inactivated enzyme in crude extracts of human spleen of molecular weights higher and lower than the usually observed 21-24kDa for the human O6-MT. Several apparent fragments of the methylated form of the protein were purified to homogeneity following reaction of partially-purified extract enzyme with O6-[3H-CH3]methylguanine-DNA substrate. One of these fragments yielded amino acid sequence information spanning fifteen residues, which was identified as probably belonging to human methyltransferase by virtue of both its significant sequence homology to three procaryote forms of O6-MT encoded by the ada, ogt (both from E. coli) and dat (B. subtilis) genes, and sequence position of the radiolabelled methyl group which matched the position of the conserved procaryote methyl acceptor site cysteine residue. Statistical prediction of secondary structure indicated good homologies between the human fragment and corresponding regions of the constitutive form of O6-MT in procaryotes (ogt and dat gene products), but not with the inducible ada protein, indicating the possibility that we had obtained partial amino acid sequence for a non-inducible form of the human enzyme. The identity of the fragment sequence as belonging to human methyltransferase was more recently confirmed by comparison with cDNA-derived amino acid sequence from the cloned human O6-MT gene from HeLa cells (1). The two sequences compared well, with only three out of fifteen amino acids being different (and two of them by only one nucleotide in each codon).

  11. Peptide Mapping of Aminoacyl-tRNA Synthetases: Evidence for Internal Sequence Homology in Escherichia coli Leucyl-tRNA Synthetase

    PubMed Central

    Waterson, Robert M.; Konigsberg, William H.

    1974-01-01

    Most aminoacyl-tRNA synthetases contain polypeptide chains of about either 50,000 or 100,000 daltons. Peptide mapping of tryptic, chymotryptic, or Staphylococcus aureus acid protease digests of seryl-tRNA synthetase (100,000, dimer) and leucyl-tRNA synthetase (100,000, monomer) from E. coli was done after selective modification of lysine residues with [14C]succinic anhydride or of methionine residues with [14C]iodoacetate. By use of thin-layer electrophoresis and chromatography on silicagel or cellulose plates followed by radioautography it was possible, depending upon the specific activity of the reagent used, to detect radioactive peptides obtained from as little as l μg of protein. Seryl-tRNA synthetase gave the correct number of tryptic peptides expected for a dimer of identical subunits. Leucyl-tRNA synthetase, on the other hand, gave roughly half the number of radioactive tryptic, chymotryptic, and acid protease peptides expected from the lysine, arginine, and methionine content of the 100,000 monomer. We have interpreted these results as indicating that extensive internal homology exists among lysine- and methionine-containing peptides within the leucyl-tRNA synthetase. The simplest conclusion that can be drawn from these observations is that the NH2- and COOH-terminal halves of leucyl-tRNA synthetase and perhaps other synthetases of 100,000 molecular weight may have evolved through a process of gene duplication and fusion, followed by limited diversification by way of amino-acid substitutions accumulating during evolution. Images PMID:4592690

  12. Human liver apolipoprotein B-100 cDNA: complete nucleic acid and derived amino acid sequence.

    PubMed Central

    Law, S W; Grant, S M; Higuchi, K; Hospattankar, A; Lackner, K; Lee, N; Brewer, H B

    1986-01-01

    Human apolipoprotein B-100 (apoB-100), the ligand on low density lipoproteins that interacts with the low density lipoprotein receptor and initiates receptor-mediated endocytosis and low density lipoprotein catabolism, has been cloned, and the complete nucleic acid and derived amino acid sequences have been determined. ApoB-100 cDNAs were isolated from normal human liver cDNA libraries utilizing immunoscreening as well as filter hybridization with radiolabeled apoB-100 oligodeoxynucleotides. The apoB-100 mRNA is 14.1 kilobases long encoding a mature apoB-100 protein of 4536 amino acids with a calculated amino acid molecular weight of 512,723. ApoB-100 contains 20 potential glycosylation sites, and 12 of a total of 25 cysteine residues are located in the amino-terminal region of the apolipoprotein providing a potential globular structure of the amino terminus of the protein. ApoB-100 contains relatively few regions of amphipathic helices, but compared to other human apolipoproteins it is enriched in beta-structure. The delineation of the entire human apoB-100 sequence will now permit a detailed analysis of the conformation of the protein, the low density lipoprotein receptor binding domain(s), and the structural relationship between apoB-100 and apoB-48 and will provide the basis for the study of genetic defects in apoB-100 in patients with dyslipoproteinemias. PMID:3464946

  13. Computer selection of oligonucleotide probes from amino acid sequences for use in gene library screening.

    PubMed

    Yang, J H; Ye, J H; Wallace, D C

    1984-01-11

    We present a computer program, FINPROBE, which utilizes known amino acid sequence data to deduce minimum redundancy oligonucleotide probes for use in screening cDNA or genomic libraries or in primer extension. The user enters the amino acid sequence of interest, the desired probe length, the number of probes sought, and the constraints on oligonucleotide synthesis. The computer generates a table of possible probes listed in increasing order of redundancy and provides the location of each probe in the protein and mRNA coding sequence. Activation of a next function provides the amino acid and mRNA sequences of each probe of interest as well as the complementary sequence and the minimum dissociation temperature of the probe. A final routine prints out the amino acid sequence of the protein in parallel with the mRNA sequence listing all possible codons for each amino acid.

  14. Somatic homologous recombination in planta: the recombination frequency is dependent on the allelic state of recombining sequences and may be influenced by genomic position effects.

    PubMed

    Swoboda, P; Hohn, B; Gal, S

    1993-02-01

    We have previously described a non-selective method for scoring somatic recombination in the genome of whole plants. The recombination substrate consists of a defective partial dimer of Cauliflower Mosaic Virus (CaMV) sequences, which can code for production of viable virus only upon homologous recombination; this leads to disease symptoms on leaves. Brassica napus plants (rapeseed) harbouring the recombination substrate as a transgene were used to examine the time in plant development at which recombination takes place. The analysis of three transgene loci revealed recombination frequencies specific for each locus. Recombination frequencies were increased if more than one transgene locus was present per genome, either in allelic (homozygosity of the transgene locus) or in non-allelic positions. In both cases, the overall recombination frequency was found to be elevated to approximately the sum of the frequencies for the individual transgene loci or slightly higher, suggesting that the respective transgene loci behave largely independently of each other. For all plants tested (single locus, two or multiple loci) maximal recombination frequencies were of the order of 10(-6) events per cell division.

  15. Compound Mutations Cause Increased Cardiac Events in Children with Long QT Syndrome: Can the Sequence Homology-Based Tools be Applied for Prediction of Phenotypic Severity?

    PubMed

    Izumi, Gaku; Hayama, Emiko; Yamazawa, Hirokuni; Inai, Kei; Shimada, Mitsuyo; Furutani, Michiko; Nishizawa, Tsutomu; Furutani, Yoshiyuki; Matsuoka, Rumiko; Nakanishi, Toshio

    2016-06-01

    Long QT syndrome (LQTS) can cause syncope, ventricular fibrillation, and death. Recently, several disease-causing mutations in ion channel genes have been identified, and compound mutations have also been detected. It is unclear whether children who are carriers of compound mutations exhibit a more severe phenotype than those with single mutations. Although predicting phenotypic severity is clinically important, the availability of prediction tools for LQTS is unknown. To determine whether the severity of the LQTS phenotype can be predicted by the presence of compound mutations in children is needed. We detected 97 single mutations (Group S) and 13 compound mutations (Group C) between 1998 and 2012, age at diagnosis ranging 0-19 years old (median age is 9.0) and 18.0 years of follow-up period. The phenotypes and Kaplan-Meier event-free rates of the two groups were compared for cardiac events. This study investigated phenotypic severity in relation to the location of mutations in the protein sequence, which was analyzed using two sequence homology-based tools. In results, compound mutations in children were associated with a high incidence of syncope within the first decade (Group S: 32 % vs. Group C: 61 %), requiring an ICD in the second decade (Group S: 3 % vs. Group C: 56 %). Mortality in these patients was high within 5 years of birth (23 %). Phenotypic prediction tools correctly predicted the phenotypic severity in both Groups S and C, especially by using their coupling method. The coupling prediction method is useful in the initial evaluation of phenotypes both with single and compound mutations of LQTS patients. However, it should be noted that the compound mutation makes more severe phenotype.

  16. Solid phase sequencing of biopolymers

    DOEpatents

    Cantor, Charles; Koster, Hubert

    2010-09-28

    This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include DNA or RNA in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.

  17. A putative carbohydrate-binding domain of the lactose-binding Cytisus sessilifolius anti-H(O) lectin has a similar amino acid sequence to that of the L-fucose-binding Ulex europaeus anti-H(O) lectin.

    PubMed

    Konami, Y; Yamamoto, K; Osawa, T; Irimura, T

    1995-04-01

    The complete amino acid sequence of a lactose-binding Cytisus sessilifolius anti-H(O) lectin II (CSA-II) was determined using a protein sequencer. After digestion of CSA-II with endoproteinase Lys-C or Asp-N, the resulting peptides were purified by reversed-phase high performance liquid chromatography (HPLC) and then subjected to sequence analysis. Comparison of the complete amino acid sequence of CSA-II with the sequences of other leguminous seed lectins revealed regions of extensive homology. The amino acid sequence of a putative carbohydrate-binding domain of CSA-II was found to be similar to those of several anti-H(O) leguminous lectins, especially to that of the L-fucose-binding Ulex europaeus lectin I (UEA-I).

  18. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  19. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  20. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  1. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  2. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data...

  3. Cloning, sequence analysis and expression of the F1F0-ATPase beta-subunit from wine lactic acid bacteria.

    PubMed

    Sievers, Martin; Uermösi, Christina; Fehlmann, Marc; Krieger, Sibylle

    2003-09-01

    The nucleotide sequences of the genes encoding the F1F0-ATPase beta-subunit from Oenococcus oeni, Leuconostoc mesenteroides subsp. mesenteroides, Pediococcus damnosus, Pediococcus parvulus, Lactobacillus brevis and Lactobacillus hilgardii were determined. Their deduced amino acid sequences showed homology values of 79-98%. Data from the alignment and ATPase tree indicated that O. oeni and L. mesenteroides subsp. mesenteroides formed a group well-separated from P. damnosus and P. parvulus and from the group comprises L. brevis and L. hilgardii. The N-terminus of the F1F0-ATPase beta-subunit of O. oeni contains a stretch of additional 38 amino acid residues. The catalytic site of the ATPase beta-subunit of the investigated strains is characterized by the two conserved motifs GGAGVGKT and GERTRE. The amplified atpD coding sequences were inserted into the pCRT7/CT-TOPO vector using TA-cloning strategy and transformed in Escherichia coli. SDS-PAGE and Western blot analyses confirmed that O. oeni has an ATPase beta-subunit protein which is larger in size than the corresponding molecules from the investigated strains.

  4. The amino acid sequence of protein SCMK-B2C from the high-sulphur fraction of wool keratin

    PubMed Central

    Elleman, T. C.

    1972-01-01

    1. The amino acid sequence of a protein from the reduced and carboxymethylated high-sulphur fraction of wool has been determined. 2. The sequence of this S-carboxymethylkerateine (SCMK-B2C) of 151 amino acid residues displays much internal homology and an unusual residue distribution. Thus a ten-residue sequence occurs four times near the N-terminus and five times near the C-terminus with few changes. These regions contain much of the molecule's half-cystine, whereas between them there is a region of 19 residues that are mainly small and devoid of cystine and proline. 3. Certain models of the wool fibre based on its mechanical and physical properties propose a matrix of small compact globular units linked together to form beaded chains. The unusual distribution of the component residues of protein SCMK-B2C suggests structures in the wool-fibre matrix compatible with certain features of the proposed models. PMID:4678578

  5. Human retroviruses and AIDS 1996. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Foley, B.; Korber, B.; Mellors, J.W.; Jeang, K.T.; Wain-Hobson, S.

    1997-04-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) Nuclear Acid Alignments and Sequences; (2) Amino Acid Alignments; (3) Analysis; (4) Related Sequences; and (5) Database Communications. Information within all the parts is updated throughout the year on the Web site, http://hiv-web.lanl.gov. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions of the parts of the compendium, the user should read the individual introductions for each part.

  6. Nucleotide sequence and spatial expression pattern of a drought- and abscisic Acid-induced gene of tomato.

    PubMed

    Plant, A L; Cohen, A; Moses, M S; Bray, E A

    1991-11-01

    The nucleotide sequence of le16, a tomato (Lycopersicon esculentum Mill.) gene induced by drought stress and regulated by abscisic acid specifically in aerial vegetative tissue, is presented. The single open reading frame contained within the gene has the capacity to encode a polypeptide of 12.7 kilodaltons and is interrupted by a small intron. The predicted polypeptide is rich in leucine, glycine, and alanine and has an isoelectric point of 8.7. The amino terminus is hydrophobic and characteristic of signal sequences that target polypeptides for export from the cytoplasm. There is homology (47.2% identity) between the amino terminus of the LE 16 polypeptide and the corresponding amino terminal domain of the maize phospholipid transfer protein. le16 was expressed in drought-stressed leaf, petiole, and stem tissue and to a much lower extent in the pericarp of mature green tomato fruit and developing seeds. No expression was detected in the pericarp of red fruit or in drought-stressed roots. Expression of le16 was also induced in leaf tissue by a variety of other abiotic stresses including polyethylene glycol-mediated water deficit, salinity, cold stress, and heat stress. None of these stresses or direct applications of abscisic acid induced the expression of le16 in the roots of the same plants. The unique expression characteristics of this gene indicates that novel regulatory mechanisms, in addition to endogenous abscisic acid, are involved in controlling gene expression.

  7. Substitution of a single amino acid residue in the aromatic/arginine selectivity filter alters the transport profiles of tonoplast aquaporin homologs.

    PubMed

    Azad, Abul Kalam; Yoshikawa, Naoki; Ishikawa, Takahiro; Sawa, Yoshihiro; Shibata, Hitoshi

    2012-01-01

    Aquaporins are integral membrane proteins that facilitate the transport of water and some small solutes across cellular membranes. X-ray crystallography of aquaporins indicates that four amino acids constitute an aromatic/arginine (ar/R) pore constriction known as the selectivity filter. On the basis of these four amino acids, tonoplast aquaporins called tonoplast intrinsic proteins (TIPs) are divided into three groups in Arabidopsis. Herein, we describe the characterization of two group I TIP1s (TgTIP1;1 and TgTIP1;2) from tulip (Tulipa gesneriana). TgTIP1;1 and TgTIP1;2 have a novel isoleucine in loop E (LE2 position) of the ar/R filter; the residue at LE2 is a valine in all group I TIPs from model plants. The homologs showed mercury-sensitive water channel activity in a fast kinetics swelling assay upon heterologous expression in Pichia pastoris. Heterologous expression of both homologs promoted the growth of P. pastoris on ammonium or urea as sole sources of nitrogen and decreased growth and survival in the presence of H(2)O(2). TgTIP1;1- and TgTIP1;2-mediated H(2)O(2) conductance was demonstrated further by a fluorescence assay. Substitutions in the ar/R selectivity filter of TgTIP1;1 showed that mutants that mimicked the ar/R constriction of group I TIPs could conduct the same substrates that were transported by wild-type TgTIP1;1. In contrast, mutants that mimicked group II TIPs showed no evidence of urea or H(2)O(2) conductance. These results suggest that the amino acid residue at LE2 position is critical for the transport selectivity of the TIP homologs and group I TIPs might have a broader spectrum of substrate selectivity than group II TIPs.

  8. Transcriptome Sequencing in Response to Salicylic Acid in Salvia miltiorrhiza

    PubMed Central

    Zhang, Xiaoru; Dong, Juane; Liu, Hailong; Wang, Jiao; Qi, Yuexin; Liang, Zongsuo

    2016-01-01

    Salvia miltiorrhiza is a traditional Chinese herbal medicine, whose quality and yield are often affected by diseases and environmental stresses during its growing season. Salicylic acid (SA) plays a significant role in plants responding to biotic and abiotic stresses, but the involved regulatory factors and their signaling mechanisms are largely unknown. In order to identify the genes involved in SA signaling, the RNA sequencing (RNA-seq) strategy was employed to evaluate the transcriptional profiles in S. miltiorrhiza cell cultures. A total of 50,778 unigenes were assembled, in which 5,316 unigenes were differentially expressed among 0-, 2-, and 8-h SA induction. The up-regulated genes were mainly involved in stimulus response and multi-organism process. A core set of candidate novel genes coding SA signaling component proteins was identified. Many transcription factors (e.g., WRKY, bHLH and GRAS) and genes involved in hormone signal transduction were differentially expressed in response to SA induction. Detailed analysis revealed that genes associated with defense signaling, such as antioxidant system genes, cytochrome P450s and ATP-binding cassette transporters, were significantly overexpressed, which can be used as genetic tools to investigate disease resistance. Our transcriptome analysis will help understand SA signaling and its mechanism of defense systems in S. miltiorrhiza. PMID:26808150

  9. Human retroviruses and aids, 1992. A compilation and analysis of nucleic acid and amino acid sequences

    SciTech Connect

    Myers, G.; Korber, B.; Berzofsky, J.A.; Pavlakis, G.N.; Smith, R.F.

    1992-10-01

    This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (1) HIV and SIV Nucleotide Sequences; (H) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium. While this publication could take the form of a review or sequence monograph, it is not so conceived. Instead, the literature from which the database is derived has simply been summarized and some elementary computational analyses have been performed upon the data. Interpretation and commentary have been avoided insofar as possible so that the reader can form his or her own judgments concerning the complex information. In addition to the general descriptions below of the parts of the compendium, the user should read the individual introductions for each part.

  10. KM+, a mannose-binding lectin from Artocarpus integrifolia: amino acid sequence, predicted tertiary structure, carbohydrate recognition, and analysis of the beta-prism fold.

    PubMed Central

    Rosa, J. C.; De Oliveira, P. S.; Garratt, R.; Beltramini, L.; Resing, K.; Roque-Barreira, M. C.; Greene, L. J.

    1999-01-01

    The complete amino acid sequence of the lectin KM+ from Artocarpus integrifolia (jackfruit), which contains 149 residues/mol, is reported and compared to those of other members of the Moraceae family, particularly that of jacalin, also from jackfruit, with which it shares 52% sequence identity. KM+ presents an acetyl-blocked N-terminus and is not posttranslationally modified by proteolytic cleavage as is the case for jacalin. Rather, it possesses a short, glycine-rich linker that unites the regions homologous to the alpha- and beta-chains of jacalin. The results of homology modeling implicate the linker sequence in sterically impeding rotation of the side chain of Asp141 within the binding site pocket. As a consequence, the aspartic acid is locked into a conformation adequate only for the recognition of equatorial hydroxyl groups on the C4 epimeric center (alpha-D-mannose, alpha-D-glucose, and their derivatives). In contrast, the internal cleavage of the jacalin chain permits free rotation of the homologous aspartic acid, rendering it capable of accepting hydrogen bonds from both possible hydroxyl configurations on C4. We suggest that, together with direct recognition of epimeric hydroxyls and the steric exclusion of disfavored ligands, conformational restriction of the lectin should be considered to be a new mechanism by which selectivity may be built into carbohydrate binding sites. Jacalin and KM+ adopt the beta-prism fold already observed in two unrelated protein families. Despite presenting little or no sequence similarity, an analysis of the beta-prism reveals a canonical feature repeatedly present in all such structures, which is based on six largely hydrophobic residues within a beta-hairpin containing two classic-type beta-bulges. We suggest the term beta-prism motif to describe this feature. PMID:10210179

  11. Purification, properties, and partial amino acid sequences of thermostable xylanases from Streptomyces thermoviolaceus OPC-520

    SciTech Connect

    Tsujibo, Hiroshi; Miyamoto, Katsushiro; Kuda, Takashi; Minami, Kazushi; Sakamoto, Takashi; Inamori, Yoshihiko ); Hasegawa, Toru )

    1992-01-01

    Two types of xylanases (1,4-{beta}-D-xylan xylanohydrolase, EC 3.2.1.8) were isolated from the culture filtrate of a thermophilic actinomycete, Streptomyces thermoviolaceus OPC-520. The enzymes (STX-I and STX-II) were purified by chromatography with DEAE-Toyopearl 650 M, CM-Toyopearl 650 M, Sephadex G-75, Phenyl-Toyopearl 650 M, and Mono Q HR. The purified enzymes showed single bands on sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The molecular weights of STX-I and STX-II were 54,000 and 33,000, respectively. The pIs were 4.2 (STX-I) and 8.0 (STX-II). The optimum pH levels for the activity of STX-I and STX-II were pH 7.0. The optimum temperature for the activity of STX-I was 70C, and that for the activity of STX-II was 60C. The enzymes were completely inhibited by N-bromosuccinimide. The enzymes degraded xylan, producing xylose and xylobiose as the predominant products, indicating that they were endoxylanases. STX-I showed high sequence homology with the exoglucanase from Cellulomonas fimi (47% homology), and STX-II showed high sequence homology with the xylanase from Bacillus pumilus (46% homology).

  12. Nucleotide sequence of the capsid protein gene of two serotypes of San Miguel sea lion virus: identification of conserved and non-conserved amino acid sequences among calicivirus capsid proteins.

    PubMed

    Neill, J D

    1992-07-01

    The San Miguel sea lion viruses, members of the calicivirus family, are closely related to the vesicular disease of swine viruses which can cause severe disease in swine. In order to begin the molecular characterization of these viruses, the nucleotide sequence of the capsid protein gene of two San Miguel sea lion viruses (SMSV), serotypes 1 and 4, was determined. The coding sequences for the capsid precursor protein were located within the 3' terminal 2620 bases of the genomic RNAs of both viruses. The encoded capsid precursor proteins were 79,500 and 77,634 Da for SMSV 1 and SMSV 4, respectively. The SMSV 1 protein was 47.7% and SMSV 4 was 48.6% homologous to the feline calicivirus (FCV) capsid precursor protein while the two SMSV capsid precursors were 73% homologous to each other. Six distinct regions within the capsid precursors (denoted as regions A-F) were identified based on amino acid sequence alignment analysis of the two SMSV serotypes with FCV and the rabbit hemorrhagic disease virus (RHDV) capsid protein. Three regions showed similarity among all four viruses (regions B, D and F) and one region showed a very high degree of homology between the SMSV serotypes but only limited similarity with FCV (region A). RHDV contained only a truncated region A. A fifth region, consisting of approximately 100 residues, was not conserved among any of the viruses (region E) and, in SMSV, may contain the serotype-specific determinants. Another small region (region C) contained between 15 and 27 amino acids and showed little sequence conservation. Region B showed the highest degree of conservation among the four viruses and contained the residues which had homology to the picornavirus VP3 structural protein. An open reading frame, found in the 3' terminal 514 bases of the SMSV genomes, encoded small proteins (12,575 and 12,522 Da, respectively for SMSV 1 and SMSV 4) of which 32% of the conserved amino acids were basic residues, implying a possible nucleic acid

  13. Lactobacillus kefiri shows inter-strain variations in the amino acid sequence of the S-layer proteins.

    PubMed

    Malamud, Mariano; Carasi, Paula; Bronsoms, Sílvia; Trejo, Sebastián A; Serradell, María de Los Angeles

    2017-04-01

    The S-layer is a proteinaceous envelope constituted by subunits that self-assemble to form a two-dimensional lattice that covers the surface of different species of Bacteria and Archaea, and it could be involved in cell recognition of microbes among other several distinct functions. In this work, both proteomic and genomic approaches were used to gain knowledge about the sequences of the S-layer protein (SLPs) encoding genes expressed by six aggregative and sixteen non-aggregative strains of potentially probiotic Lactobacillus kefiri. Peptide mass fingerprint (PMF) analysis confirmed the identity of SLPs extracted from L. kefiri, and based on the homology with phylogenetically related species, primers located outside and inside the SLP-genes were employed to amplify genomic DNA. The O-glycosylation site SASSAS was found in all L. kefiri SLPs. Ten strains were selected for sequencing of the complete genes. The total length of the mature proteins varies from 492 to 576 amino acids, and all SLPs have a calculated pI between 9.37 and 9.60. The N-terminal region is relatively conserved and shows a high percentage of positively charged amino acids. Major differences among strains are found in the C-terminal region. Different groups could be distinguished regarding the mature SLPs and the similarities observed in the PMF spectra. Interestingly, SLPs of the aggregative strains are 100% homologous, although these strains were isolated from different kefir grains. This knowledge provides relevant data for better understanding of the mechanisms involved in SLPs functionality and could contribute to the development of products of biotechnological interest from potentially probiotic bacteria.

  14. Completion of the amino acid sequence of the alpha 1 chain from type I calf skin collagen. Amino acid sequence of alpha 1(I)B8.

    PubMed Central

    Glanville, R W; Breitkreutz, D; Meitinger, M; Fietzek, P P

    1983-01-01

    The complete amino acid sequence of the 279-residue CNBr peptide CB8 from the alpha 1 chain of type I calf skin collagen is presented. It was determined by sequencing overlapping fragments of CB8 produced by Staphylococcus aureus V8 proteinase, trypsin, Endoproteinase Arg-C and hydroxylamine. Tryptic cleavages were also made specific for lysine by blocking arginine residues with cyclohexane-1,2-dione. This completes the amino acid sequence analysis of the 1054-residues-long alpha (I) chain of calf skin collagen. PMID:6354180

  15. A bacterial protein has homology with human chorionic gonadotropin (hCG).

    PubMed

    Grover, S; Woodward, S R; Odell, W D

    1993-06-30

    Studies from our laboratory have demonstrated the presence of a 48.5 kD cell wall protein in the bacterium, Xanthomonas maltophilia, which immunologically resembles the beta subunit of human chorionic gonadotropin. Primers were designed from the amino acid sequences of enzymatically cleaved peptide fragments of this protein. These primers were used to obtain PCR amplified products, which were subsequently cloned in a PCR11TA cloning vector, and a 492 base pair nucleotide sequence was obtained with a 164 amino acid open reading frame. When this nucleotide sequence was aligned with exon 2 of genes 5 and 6 of the beta hCG gene, a 53% homology was observed. The translated protein sequence had a 35% homology with hCG and a 25% homology with human luteinizing hormone.

  16. Redesigning Aldolase Stereoselectivity by Homologous Grafting

    PubMed Central

    Henßen, Birgit; Metz, Alexander; Gohlke, Holger; Pietruszka, Jörg

    2016-01-01

    The 2-deoxy-d-ribose-5-phosphate aldolase (DERA) offers access to highly desirable building blocks for organic synthesis by catalyzing a stereoselective C-C bond formation between acetaldehyde and certain electrophilic aldehydes. DERA´s potential is particularly highlighted by the ability to catalyze sequential, highly enantioselective aldol reactions. However, its synthetic use is limited by the absence of an enantiocomplementary enzyme. Here, we introduce the concept of homologous grafting to identify stereoselectivity-determining amino acid positions in DERA. We identified such positions by structural analysis of the homologous aldolases 2-keto-3-deoxy-6-phosphogluconate aldolase (KDPG) and the enantiocomplementary enzyme 2-keto-3-deoxy-6-phosphogalactonate aldolase (KDPGal). Mutation of these positions led to a slightly inversed enantiopreference of both aldolases to the same extent. By transferring these sequence motifs onto DERA we achieved the intended change in enantioselectivity. PMID:27327271

  17. Amino acid substitutions in homologs of the STAY-GREEN protein are responsible for the green-flesh and chlorophyll retainer mutations of tomato and pepper.

    PubMed

    Barry, Cornelius S; McQuinn, Ryan P; Chung, Mi-Young; Besuden, Anna; Giovannoni, James J

    2008-05-01

    Color changes often accompany the onset of ripening, leading to brightly colored fruits that serve as attractants to seed-dispersing organisms. In many fruits, including tomato (Solanum lycopersicum) and pepper (Capsicum annuum), there is a sharp decrease in chlorophyll content and a concomitant increase in the synthesis of carotenoids as a result of the conversion of chloroplasts into chromoplasts. The green-flesh (gf) and chlorophyll retainer (cl) mutations of tomato and pepper, respectively, are inhibited in their ability to degrade chlorophyll during ripening, leading to the production of ripe fruits characterized by both chlorophyll and carotenoid accumulation and are thus brown in color. Using a positional cloning approach, we have identified a point mutation at the gf locus that causes an amino acid substitution in an invariant residue of a tomato homolog of the STAY-GREEN (SGR) protein of rice (Oryza sativa). Similarly, the cl mutation also carries an amino acid substitution at an invariant residue in a pepper homolog of SGR. Both GF and CL expression are highly induced at the onset of fruit ripening, coincident with the ripening-associated decline in chlorophyll. Phylogenetic analysis indicates that there are two distinct groups of SGR proteins in plants. The SGR subfamily is required for chlorophyll degradation and operates through an unknown mechanism. A second subfamily, which we have termed SGR-like, has an as-yet undefined function.

  18. Arabidopsis glutamate receptor homolog3.5 modulates cytosolic Ca2+ level to counteract effect of abscisic acid in seed germination.

    PubMed

    Kong, Dongdong; Ju, Chuanli; Parihar, Aisha; Kim, So; Cho, Daeshik; Kwak, June M

    2015-04-01

    Seed germination is a critical step in a plant's life cycle that allows successful propagation and is therefore strictly controlled by endogenous and environmental signals. However, the molecular mechanisms underlying germination control remain elusive. Here, we report that the Arabidopsis (Arabidopsis thaliana) glutamate receptor homolog3.5 (AtGLR3.5) is predominantly expressed in germinating seeds and increases cytosolic Ca2+ concentration that counteracts the effect of abscisic acid (ABA) to promote germination. Repression of AtGLR3.5 impairs cytosolic Ca2+ concentration elevation, significantly delays germination, and enhances ABA sensitivity in seeds, whereas overexpression of AtGLR3.5 results in earlier germination and reduced seed sensitivity to ABA. Furthermore, we show that Ca2+ suppresses the expression of ABSCISIC ACID INSENSITIVE4 (ABI4), a key transcription factor involved in ABA response in seeds, and that ABI4 plays a fundamental role in modulation of Ca2+-dependent germination. Taken together, our results provide molecular genetic evidence that AtGLR3.5-mediated Ca2+ influx stimulates seed germination by antagonizing the inhibitory effects of ABA through suppression of ABI4. These findings establish, to our knowledge, a new and pivotal role of the plant glutamate receptor homolog and Ca2+ signaling in germination control and uncover the orchestrated modulation of the AtGLR3.5-mediated Ca2+ signal and ABA signaling via ABI4 to fine-tune the crucial developmental process, germination, in Arabidopsis.

  19. Adsorption of the Lighter Homologs of Element 104 and Element 105 on DGA Resin from Various Mineral Acids

    SciTech Connect

    Bennett, M E; Sudowe, R

    2008-11-17

    The goal of studying transactinide elements is to further understand the fundamental principles that govern the periodic table. The current periodic table arrangement allows for the prediction of the chemical behavior of elements. The correct position of a transactinide element can be assessed by investigating its chemical behavior and comparing it to that of the homologs and pseudo-homologs of a transactinide element. Homologs of a transactinide element are the elements in the same group of the periodic table as the transactinide. A pseudo-homolog of a transactinide element is an element with a similar main oxidation state and similar ionic radius to the transactinide element. For example, the homologs of rutherfordium, Rf, are titanium, zirconium and hafnium (Ti, Zr and Hf); the pseudo homologs of Rf are thorium, Th, and plutonium, Pu. Understanding the chemical behavior of a transactinide element compared to its homologs and pseudo-homologs also allows for the assessment of the role of relativistic effects. Relativistic effects occur when the velocity of the s orbital electrons closest to the nucleus approaches the speed of light. These electrons approach the speed of light because they have no orbital momentum. This causes two effects, first there is in a decrease in Bohr radius of the inner electronic orbitals because of this there is an increase in particle mass. A contraction of outer s and p orbitals is also seen. The contraction of these orbitals results in an energy destabilization of the outer most shell, in the case of transactinides this would be the 5f and 6d orbitals. The outer most d shell and all f shells can also experience a radial expansion due to these orbitals being screened from the effective nuclear charge. Another relativistic effect is the 'spin-orbit splitting' for p, d and f orbitals into j = 1 {+-} 1/2 states. Where j is the total angular momentum vector and 1 is angular quantum number. All of these effects have the same order of

  20. Analys. DNA: a computer program for nucleic acid sequence data processing.

    PubMed

    Amthauer, R; Araya, A

    1984-09-01

    A computer program written in BASIC language is described. The program allows processing and analysis of DNA data and has been designed to be used by persons with little or no computer experience. The operator using different options can search for direct homologies with varying degrees of matching, generate complementary strands, find restriction sites, invert the polarity of the sequence and edit a print-out.

  1. Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    2000-01-01

    A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.

  2. Hydroxylation of aspartic acid in domains homologous to the epidermal growth factor precursor is catalyzed by a 2-oxoglutarate-dependent dioxygenase.

    PubMed Central

    Stenflo, J; Holme, E; Lindstedt, S; Chandramouli, N; Huang, L H; Tam, J P; Merrifield, R B

    1989-01-01

    3-Hydroxyaspartic acid and 3-hydroxyasparagine are two rare amino acids that are present in domains homologous to the epidermal growth factor precursor in vitamin K-dependent plasma proteins as well as in proteins that do not require vitamin K for normal biosynthesis. They are formed by posttranslational hydroxylation of aspartic acid and asparagine, respectively. The first epidermal growth factor-like domain in factor IX (residues 45-87) was synthesized with aspartic acid in position 64, replacing 3-hydroxyaspartic acid. It was used as substrate in a hydroxylase assay with rat liver microsomes as the source of enzyme and reaction conditions that satisfy the requirements of 2-oxoglutarate-dependent dioxygenases. The synthetic peptide stimulated the 2-oxoglutarate decarboxylation in contrast to synthetic, modified epidermal growth factor (Met-21 and His-22 deleted and Glu-24 replaced by Asp) and synthetic peptides corresponding to residues 60-71 in human factor IX. This indicates that the hydroxylase is a 2-oxoglutarate-dependent dioxygenase with a selective substrate requirement. Images PMID:2492106

  3. Computational methods for remote homolog identification.

    PubMed

    Wan, Xiu-Feng; Xu, Dong

    2005-12-01

    As more and more protein sequences are available, homolog identification becomes increasingly important for functional, structural, and evolutional studies of proteins. Many homologous proteins were separated a very long time ago in their evolutionary history and thus their sequences share low sequence identity. These remote homologs have become a research focus in bioinformatics over the past decade, and some significant advances have been achieved. In this paper, we provide a comprehensive review on computational techniques used in remote homolog identification based on different methods, including sequence-sequence comparison, and sequence-structure comparison, and structure-structure comparison. Other miscellaneous approaches are also summarized. Pointers to the online resources of these methods and their related databases are provided. Comparisons among different methods in terms of their technical approaches, their strengths, and limitations are followed. Studies on proteins in SARS-CoV are shown as an example for remote homolog identification application.

  4. Complete genome sequence of the probiotic lactic acid bacterium Lactobacillus acidophilus NCFM

    PubMed Central

    Altermann, Eric; Russell, W. Michael; Azcarate-Peril, M. Andrea; Barrangou, Rodolphe; Buck, B. Logan; McAuliffe, Olivia; Souther, Nicole; Dobson, Alleson; Duong, Tri; Callanan, Michael; Lick, Sonja; Hamrick, Alice; Cano, Raul; Klaenhammer, Todd R.

    2005-01-01

    Lactobacillus acidophilus NCFM is a probiotic bacterium that has been produced commercially since 1972. The complete genome is 1,993,564 nt and devoid of plasmids. The average GC content is 34.71% with 1,864 predicted ORFs, of which 72.5% were functionally classified. Nine phage-related integrases were predicted, but no complete prophages were found. However, three unique regions designated as potential autonomous units (PAUs) were identified. These units resemble a unique structure and bear characteristics of both plasmids and phages. Analysis of the three PAUs revealed the presence of two R/M systems and a prophage maintenance system killer protein. A spacers interspersed direct repeat locus containing 32 nearly perfect 29-bp repeats was discovered and may provide a unique molecular signature for this organism. In silico analyses predicted 17 transposase genes and a chromosomal locus for lactacin B, a class II bacteriocin. Several mucus- and fibronectin-binding proteins, implicated in adhesion to human intestinal cells, were also identified. Gene clusters for transport of a diverse group of carbohydrates, including fructooligosaccharides and raffinose, were present and often accompanied by transcriptional regulators of the lacI family. For protein degradation and peptide utilization, the organism encoded 20 putative peptidases, homologs for PrtP and PrtM, and two complete oligopeptide transport systems. Nine two-component regulatory systems were predicted, some associated with determinants implicated in bacteriocin production and acid tolerance. Collectively, these features within the genome sequence of L. acidophilus are likely to contribute to the organisms' gastric survival and promote interactions with the intestinal mucosa and microbiota. PMID:15671160

  5. Purification, properties and amino acid sequence of a low-Mr abundant seed protein from pea (Pisum sativum L.).

    PubMed

    Gatehouse, J A; Gilroy, J; Hoque, M S; Croy, R R

    1985-01-01

    The seeds of pea (Pisum sativum L.) contain several proteins in the albumin solubility fraction that are significant components of total cotyledonary protein (5-10%) and are accumulated in developing seeds concurrently with storage-protein synthesis. One of these proteins, of low Mr and designated 'Psa LA', has been purified, characterized and sequenced. Psa LA has an Mr of 11000 and contains polypeptides of Mr 6000, suggesting that the protein molecules are dimeric. The amino acid sequence contains 54 residues, with a high content (10/54) of asparagine/aspartate. It has no inhibitory action towards trypsin or chymotrypsin, and is distinct from the inhibitors of those enzymes found in pea seeds, nor does it inhibit hog pancreatic alpha-amylase. The protein contains no methionine, but significant amounts of cysteine (four residues per polypeptide), suggesting a possible role as a sulphur storage protein. However, its sequence is not homologous with low-Mr (2S) storage proteins from castor bean (Ricinus communis) or rape (Brassica napus). Psa LA therefore represents a new type of low-Mr seed protein.

  6. Characterization and expression pattern of the novel MIA homolog TANGO.

    PubMed

    Bosserhoff, A K; Moser, M; Buettner, R

    2004-07-01

    A novel human gene, TANGO, encoding a MIA ('melanoma inhibitory activity') homologous protein was identified by a gene bank search. TANGO, together with the homologous genes MIA, OTOR (FPD, MIAL) and MIA2 define a novel gene family sharing important structural features, significant homology at both the nucleotide and protein level, and similar genomic organization. The four members share 34-45% amino acid identity and 47-59% cDNA sequence identity. TANGO encodes a mature protein of 103 amino acids in addition to a hydrophobic secretory signal sequence. Sequence homology confirms the highly conserved SH3 structure present also in MIA, OTOR and MIA2. Thus, it appears that there are a number of extracellular proteins with SH3-fold like structures. Interestingly, in situ hybridization, RT-PCR and Northern Blots revealed very broad TANGO expression patterns in contrast to the highly restricted expression patterns previously determined for the other members of the MIA gene family. The only cells lacking TANGO expression are cells belonging to the hematopoetic system. High levels of TANGO expression were observed both during embryogenesis and in adult tissues.

  7. Amino acid sequences of two novel long-chain neurotoxins from the venom of the sea snake Laticauda colubrina.

    PubMed

    Kim, H S; Tamiya, N

    1982-11-01

    From the venom of a population of the sea snake Laticauda colubrina from the Solomon Islands, a neurotoxic component, Laticauda colubrina a (toxin Lc a), was isolated in 16.6% (A280) yield. Similarly, from the venom of a population of L. colubrina from the Philippines, a neurotoxic component, Laticauda colubrina b (toxin Lc b), was obtained in 10.0% (A280) yield. The LD50 values of these toxins were 0.12 microgram/g body wt. on intramuscular injection in mice. Toxins Lc a and Lc b were each composed of molecules containing 69 amino acid residues with eight half-cystine residues. The complete amino acid sequences of these two toxins were elucidated. Toxins Lc a and Lc b are different from each other at five positions of their sequences, namely at positions 31 (Phe/Ser), 32 (Leu/Ile), 33 (Lys/Arg), 50 (Pro/Arg) and 53 (Asp/His) (residues in parentheses give the residues in toxins Lc a and Lc b respectively). Toxins Lc a and Lc b have a novel structure in that they have only four disulphide bridges, although the whole amino acid sequences are homologous to those of other known long-chain neurotoxins. It is remarkable that toxins Lc a and Lc b are not coexistent at the detection error of 6% of the other toxin. Populations of Laticauda colubrina from the Solomon Islands and from the Philippines have either toxin Lc a or toxin Lc b and not both of them.

  8. Solid phase sequencing of biopolymers

    SciTech Connect

    Cantor, Charles R.; Hubert, Koster

    2014-06-24

    This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Probes may be affixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.

  9. Homology recognition funnel

    NASA Astrophysics Data System (ADS)

    Lee, Dominic; Kornyshev, Alexei A.

    2009-10-01

    The recognition of homologous sequences of DNA before strand exchange is considered to be the most puzzling stage of homologous recombination. A mechanism for two homologous dsDNAs to recognize each other from a distance in electrolytic solution without unzipping had been proposed in an earlier paper [A. A. Kornyshev and S. Leikin, Phys. Rev. Lett. 86, 366 (2001)]. In that work, the difference in the electrostatic interaction energy between homologous duplexes and between nonhomologous duplexes, termed the recognition energy, has been calculated. That calculation was later extended in a series of papers to account for torsional elasticity of the molecules. A recent paper [A. A. Kornyshev and A. Wynveen, Proc. Natl. Acad. Sci. U.S.A. 106, 4683 (2009)] investigated the form of the potential well that homologous DNA molecules may feel when sliding along each other. A simple formula for the shape of the well was obtained. However, this latter study was performed under the approximation that the sliding molecules are torsionally rigid. Following on from this work, in the present article we investigate the effect of torsional flexibility of the molecules on the shape of the well. A variational approach to this problem results in a transcendental equation that is easily solved numerically. Its solutions show that at large interaxial separations the recognition well becomes wider and shallower, whereas at closer distances further unexpected features arise related to an abrupt change in the mean azimuthal alignment of the molecules. The energy surface as a function of interaxial separation and the axial shift defines what we call the recognition funnel. We show that it depends dramatically on the patterns of adsorption of counterions on DNA.

  10. Trichomonas vaginalis acidic phospholipase A2: isolation and partial amino acid sequence.

    PubMed

    Escobedo-Guajardo, Brenda L; González-Salazar, Francisco; Palacios-Corona, Rebeca; Torres de la Cruz, Víctor M; Morales-Vallarta, Mario; Mata-Cárdenas, Benito D; Garza-González, Jesús N; Rivera-Silva, Gerardo; Vargas-Villarreal, Javier

    2013-12-01

    Sexually transmitted diseases are a major cause of acute disease worldwide, and trichomoniasis is the most common and curable disease, generating more than 170 million cases annually worldwide. Trichomonas vaginalis is the causal agent of trichomoniasis and has the ability to destroy in vitro cell monolayers of the vaginal mucosa, where the phospholipases A2 (PLA2) have been reported as potential virulence factors. These enzymes have been partially characterized from the subcellular fraction S30 of pathogenic T. vaginalis strains. The main objective of this study was to purify a phospholipase A2 from T. vaginalis, make a partial characterization, obtain a partial amino acid sequence, and determine its enzymatic participation as hemolytic factor causing lysis of erythrocytes. Trichomonas S30, RF30 and UFF30 sub-fractions from GT-15 strain have the capacity to hydrolyze [2-(14)C-PA]-PC at pH 6.0. Proteins from the UFF30 sub-fraction were separated by affinity chromatography into two eluted fractions with detectable PLA A2 activity. The EDTA-eluted fraction was analyzed by HPLC using on-line HPLC-tandem mass spectrometry and two protein peaks were observed at 8.2 and 13 kDa. Peptide sequences were identified from the proteins present in the eluted EDTA UFF30 fraction; bioinformatic analysis using Protein Link Global Server charged with T. vaginalis protein database suggests that eluted peptides correspond a putative ubiquitin protein in the 8.2 kDa fraction and a phospholipase preserved in the 13 kDa fraction. The EDTA-eluted fraction hydrolyzed [2-(14)C-PA]-PC lyses erythrocytes from Sprague-Dawley in a time and dose-dependent manner. The acidic hemolytic activity decreased by 84% with the addition of 100 μM of Rosenthal's inhibitor.

  11. Plasmid pKM101 encodes two nonhomologous antirestriction proteins (ArdA and ArdB) whose expression is controlled by homologous regulatory sequences.

    PubMed Central

    Belogurov, A A; Delver, E P; Rodzevich, O V

    1993-01-01

    The IncN plasmid pKM101 (a derivative of R46) encodes the antirestriction protein ArdB (alleviation of restriction of DNA) in addition to another antirestriction protein, ArdA, described previously. The relevant gene, ardB, was located in the leading region of pKM101, about 7 kb from oriT. The nucleotide sequence of ardB was determined, and an appropriate polypeptide was identified in maxicells of Escherichia coli. Like ArdA, ArdB efficiently inhibits restriction by members of the three known families of type I systems of E. coli and only slightly affects the type II enzyme, EcoRI. However, in contrast to ArdA, ArdB is ineffective against the modification activity of the type I (EcoK) system. Comparison of deduced amino acid sequences of ArdA and ArdB revealed only one small region of similarity (nine residues), suggesting that this region may be somehow involved in the interaction with the type I restriction systems. We also found that the expression of both ardA and ardB genes is controlled jointly by two pKM101-encoded proteins, ArdK and ArdR, with molecular weights of about 15,000 and 20,000, respectively. The finding that the sequences immediately upstream of ardA and ardB share about 94% identity over 218 bp suggests that their expression may be controlled by ArdK and ArdR at the transcriptional level. Deletion studies and promoter probe analysis of these sequences revealed the regions responsible for the action of ArdK and ArdR as regulatory proteins. We propose that both types of antirestriction proteins may play a pivotal role in overcoming the host restriction barrier by self-transmissible broad-host-range plasmids. It seems likely that the ardKR-dependent regulatory system serves in this case as a genetic switch that controls the expression of plasmid-encoded antirestriction functions during mating. Images PMID:8393008

  12. A vacuolar β-glucosidase homolog that possesses glucose-conjugated abscisic acid hydrolyzing activity plays an important role in osmotic stress responses in Arabidopsis.

    PubMed

    Xu, Zheng-Yi; Lee, Kwang Hee; Dong, Ting; Jeong, Jae Cheol; Jin, Jing Bo; Kanno, Yuri; Kim, Dae Heon; Kim, Soo Youn; Seo, Mitsunori; Bressan, Ray A; Yun, Dae-Jin; Hwang, Inhwan

    2012-05-01

    The phytohormone abscisic acid (ABA) plays a critical role in various physiological processes, including adaptation to abiotic stresses. In Arabidopsis thaliana, ABA levels are increased both through de novo biosynthesis and via β-glucosidase homolog1 (BG1)-mediated hydrolysis of Glc-conjugated ABA (ABA-GE). However, it is not known how many different β-glucosidase proteins produce ABA from ABA-GE and how the multiple ABA production pathways are coordinated to increase ABA levels. Here, we report that a previously undiscovered β-glucosidase homolog, BG2, produced ABA by hydrolyzing ABA-GE and plays a role in osmotic stress response. BG2 localized to the vacuole as a high molecular weight complex and accumulated to high levels under dehydration stress. BG2 hydrolyzed ABA-GE to ABA in vitro. In addition, BG2 increased ABA levels in protoplasts upon application of exogenous ABA-GE. Overexpression of BG2 rescued the bg1 mutant phenotype, as observed for the overexpression of NCED3 in bg1 mutants. Multiple Arabidopsis bg2 alleles with a T-DNA insertion in BG2 were more sensitive to dehydration and NaCl stress, whereas BG2 overexpression resulted in enhanced resistance to dehydration and NaCl stress. Based on these observations, we propose that, in addition to the de novo biosynthesis, ABA is produced in multiple organelles by organelle-specific β-glucosidases in response to abiotic stresses.

  13. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1998-03-24

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.

  14. Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1998-01-01

    A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.

  15. Isolation of novel human cDNA (hGMF-gamma) homologous to Glia Maturation Factor-beta gene.

    PubMed

    Asai, K; Fujita, K; Yamamoto, M; Hotta, T; Morikawa, M; Kokubo, M; Moriyama, A; Kato, T

    1998-03-13

    A novel full-length human cDNA homologous to Glia Maturation Factor-beta (GMF-beta) gene was isolated. Sequence analysis of the entire cDNA revealed an open reading frame of 426 nucleotides with a deduced protein sequence of 142 amino acid residues. The deduced amino acid sequences of its putative product is highly homologous to human GMF-beta (82% identity) and named for GMF-gamma. Northern blot analysis indicated that a message of 0.9 kb long, but not 4.1 kb of GMF-beta, is predominantly expressed in human lung, heart, and placenta.

  16. The amino acid sequence of protein CM-3 from Dendroaspis polylepis polylepis (black mamba) venom.

    PubMed

    Joubert, F J

    1985-01-01

    Protein CM-3 from Dendroaspis polylepis polylepis venom was purified by gel filtration and ion exchange chromatography. It comprises 65 amino acids including eight half-cystines. The complete amino acid sequence of protein CM-3 has been elucidated. The sequence (residues 1-50) resembles that of the N-terminal sequence of the subunits of a synergistic type protein and residues 51-65 that of the C-terminal sequence of an angusticeps type protein. Mixtures of protein CM-3 and angusticeps type proteins showed no apparent synergistic effect, in that their toxicity in combination was no greater than the sum of their individual toxicities.

  17. The amino acid sequences of the Fd fragments of two human γ heavy chains

    PubMed Central

    Press, E. M.; Hogg, N. M.

    1970-01-01

    The amino acid sequences of the Fd fragments of two human pathological immunoglobulins of the immunoglobulin G1 class are reported. Comparison of the two sequences shows that the heavy-chain variable regions are similar in length to those of the light chains. The existence of heavy chain variable region subgroups is also deduced, from a comparison of these two sequences with those of another γ 1 chain, Eu, a μ chain, Ou, and the partial sequence of a fourth γ 1 chain, Ste. Carbohydrate has been found to be linked to an aspartic acid residue in the variable region of one of the γ 1 chains, Cor. PMID:5449120

  18. Asymmetric synthesis of α-amino acids via homologation of Ni(II) complexes of glycine Schiff bases. Part 2: aldol, Mannich addition reactions, deracemization and (S) to (R) interconversion of α-amino acids.

    PubMed

    Sorochinsky, Alexander E; Aceña, José Luis; Moriwaki, Hiroki; Sato, Tatsunori; Soloshonok, Vadim

    2013-11-01

    This review provides a comprehensive treatment of literature data dealing with asymmetric synthesis of α-amino-β-hydroxy and α,β-diamino acids via homologation of chiral Ni(II) complexes of glycine Schiff bases using aldol and Mannich-type reactions. These reactions proceed with synthetically useful chemical yields and thermodynamically controlled stereoselectivity and allow direct introduction of two stereogenic centers in a single operation with predictable stereochemical outcome. Furthermore, new application of Ni(II) complexes of α-amino acids Schiff bases for deracemization of racemic α-amino acids and (S) to (R) interconversion providing additional synthetic opportunities for preparation of enantiomerically pure α-amino acids, is also reviewed. Origin of observed diastereo-/enantioselectivity in the aldol, Mannich-type and deracemization reactions, generality and limitations of these methodologies are critically discussed.

  19. Helicobacter pylori acidic stress response factor HP1286 is a YceI homolog with new binding specificity.

    PubMed

    Sisinni, Lorenza; Cendron, Laura; Favaro, Gabriella; Zanotti, Giuseppe

    2010-04-01

    HP1286 from Helicobacter pylori is among the proteins that play a relevant role in bacterial colonization and persistence in the stomach. Indeed, it was demonstrated to be overexpressed under acidic stress conditions, together with other essential virulence factors. Here we describe its crystal structure, determined at 2.1 A resolution. The molecular model, a dimer characterized by two-fold symmetry, shows that HP1286 structurally belongs to the YceI-like protein family, which in turn is characterized by the lipocalin fold. The latter characterizes proteins possessing an internal cavity with the function of binding and/or transport of amphiphilic molecules. Surprisingly, a molecule of erucamide was found bound in the internal cavity of each monomer of recombinant HP1286, cloned and expressed in an Escherichia coli heterologous system. The shape and length of the cavity indicate that, at variance with other members of the family, HP-YceI has a binding specificity for amphiphilic compounds with a linear chain of about 22 carbon atoms. These features, along with the fact that the protein is secreted by the bacterium and is involved in adaptation to an acidic environment, suggest that its function could be that of sequestering specific fatty acids or amides from the environment, either to supply the bacterium with the fatty acids necessary for its metabolism, or to protect and detoxify it from the detergent-like antimicrobial activity of fatty acids that are eventually present in the external milieu.

  20. The mouse and human excitatory amino acid transporter gene (EAAT1) maps to mouse chromosome 15 and a region of syntenic homology on human chromosome 5

    SciTech Connect

    Kirschner, M.A.; Arriza, J.L.; Amara, S.G.

    1994-08-01

    The gene for human excitatory amino acid transporter (EAAT1) was localized to the distal region of human chromosome 5p13 by in situ hybridization of metaphase chromosome spreads. Interspecific backcross analysis identified the mouse Eaat1 locus in a region of 5p13 homology on mouse chromosome 15. Markers that are linked with EAAT1 on both human and mouse chromosomes include the receptors for leukemia inhibitory factor, interleukin-7, and prolactin. The Eaat1 locus appears not be linked to the epilepsy mutant stg locus, which is also on chromosome 15. The EAAT1 locus is located in a region of 5p deletions that have been associated with mental retardation and microcephaly. 22 refs., 2 figs.

  1. The amino acid sequence of goat beta-lactoglobulin.

    PubMed

    Préaux, G; Braunitzer, G; Schrank, B; Stangl, A

    1979-11-01

    The isolation of beta-lactoglobulin from milk of the goat is described. The purified protein was checked for purity and has been characterized by its gross composition and end groups. The native or the modified protein was then degraded by tryptic and cyanogen bromide cleavage. The cleavage products were isolated and sequenced in the sequenator using a Quadrol and propyne program. These data provide the complete sequence of beta-lactoglobulin of the goat. The results are discussed and compared particularly with bovine beta-lactoglobulin components AB. Some biological aspects are described.

  2. Layered materials with coexisting acidic and basic sites for catalytic one-pot reaction sequences.

    PubMed

    Motokura, Ken; Tada, Mizuki; Iwasawa, Yasuhiro

    2009-06-17

    Acidic montmorillonite-immobilized primary amines (H-mont-NH(2)) were found to be excellent acid-base bifunctional catalysts for one-pot reaction sequences, which are the first materials with coexisting acid and base sites active for acid-base tamdem reactions. For example, tandem deacetalization-Knoevenagel condensation proceeded successfully with the H-mont-NH(2), affording the corresponding condensation product in a quantitative yield. The acidity of the H-mont-NH(2) was strongly influenced by the preparation solvent, and the base-catalyzed reactions were enhanced by interlayer acid sites.

  3. Antigenic and protein sequence homology between VP13/14, a herpes simplex virus type 1 tegument protein, and gp10, a glycoprotein of equine herpesvirus 1 and 4.

    PubMed Central

    Whittaker, G R; Riggio, M P; Halliburton, I W; Killington, R A; Allen, G P; Meredith, D M

    1991-01-01

    Monospecific polyclonal antisera raised against VP13/14, a major tegument protein of herpes simplex virus type 1 cross-reacted with structural equine herpesvirus 1 and 4 proteins of Mr 120,000 and 123,000, respectively; these proteins are identical in molecular weight to the corresponding glycoprotein 10 (gp10) of each virus. Using a combination of immune precipitation and Western immunoblotting techniques, we confirmed that anti-VP13/14 and a monoclonal antibody to gp10 reacted with the same protein. Sequence analysis of a lambda gt11 insert of equine herpesvirus 1 gp10 identified an open reading frame in equine herpesvirus 4 with which it showed strong homology; this open reading frame also shared homology with gene UL47 of herpes simplex virus type 1 and gene 11 of varicella-zoster virus. This showed that, in addition to immunological cross-reactivity, VP13/14 and gp10 have protein sequence homology; it also allowed identification of VP13/14 as the gene product of UL47. Images PMID:1850013

  4. Synthesis of gamma,delta-unsaturated glycolic acids via sequenced brook and Ireland--claisen rearrangements.

    PubMed

    Schmitt, Daniel C; Johnson, Jeffrey S

    2010-03-05

    Organozinc, -magnesium, and -lithium nucleophiles initiate a Brook/Ireland-Claisen rearrangement sequence of allylic silyl glyoxylates resulting in the formation of gamma,delta-unsaturated alpha-silyloxy acids.

  5. Computer Simulation of the Determination of Amino Acid Sequences in Polypeptides

    ERIC Educational Resources Information Center

    Daubert, Stephen D.; Sontum, Stephen F.

    1977-01-01

    Describes a computer program that generates a random string of amino acids and guides the student in determining the correct sequence of a given protein by using experimental analytic data for that protein. (MLH)

  6. Genome sequence of the acid-tolerant strain Rhizobium sp. LPU83.

    PubMed

    Wibberg, Daniel; Tejerizo, Gonzalo Torres; Del Papa, María Florencia; Martini, Carla; Pühler, Alfred; Lagares, Antonio; Schlüter, Andreas; Pistorio, Mariano

    2014-04-20

    Rhizobia are important members of the soil microbiome since they enter into nitrogen-fixing symbiosis with different legume host plants. Rhizobium sp. LPU83 is an acid-tolerant Rhizobium strain featuring a broad-host-range. However, it is ineffective in nitrogen fixation. Here, the improved draft genome sequence of this strain is reported. Genome sequence information provides the basis for analysis of its acid tolerance, symbiotic properties and taxonomic classification.

  7. Cytochromes c-552 from two strains of the hydrogenotrophic bacterium Alcaligenes eutrophus are sequence homologs of the cytochromes c8 from the denitrifying pseudomonads.

    PubMed

    Klarskov, K; Bartsch, R G; Meyer, T E; Cusanovich, M A; Van Beeumen, J J

    1997-12-05

    Soluble cytochromes c-552 were purified from two strains of the hydrogenothrophic species Alcaligenes eutrophus and their amino acid sequences determined. The two cytochromes were found to have 5 differences out of a total of 89 residues. The proteins are clearly related to the cytochromes c8 (formerly called Pseudomonas cytochromes c-551), but require a single residue insertion after the methionine sixth heme ligand relative to the Pseudomonas aeruginosa protein. The consensus residues Trp56 and Trp77, characteristic for the c8 family, are also present in the Alcaligenes proteins. Overall, the Alcaligenes cytochromes are only 43% identical to the Pseudomonas proteins which average 68% identity to one another. They are also only 45% identical to cytochrome c8 from Hydrogenobacter thermophilus, another hydrogenothrophic species, which indicates that the hydrogen utilizing bacteria are not more closely related to one another than they are to other species. The finding of cytochrome c8 in Alcaligenes eutrophus completes the recent characterization of a cytochrome cd1-nitrite reductase from this bacterial species and suggests the existence of the same denitrification pathway as in Pseudomonas where these two proteins are reaction partners.

  8. The amino acid sequence of monal pheasant lysozyme and its activity.

    PubMed

    Araki, T; Matsumoto, T; Torikata, T

    1998-10-01

    The amino acid sequence of monal pheasant lysozyme and its activity were analyzed. Carboxymethylated lysozyme was digested with trypsin and the resulting peptides were sequenced. The established amino acid sequence had one amino acid substitution at position 102 (Arg to Gly) comparing with Indian peafowl lysozyme and four amino acid substitutions at positions 3 (Phe to Tyr), 15 (His to Leu), 41 (Gln to His), and 121 (Gln to His) with chicken lysozyme. Analysis of the time-courses of reaction using N-acetylglucosamine pentamer as a substrate showed a difference of binding free energy change (-0.4 kcal/mol) at subsites A between monal pheasant and Indian peafowl lysozyme. This was assumed to be caused by the amino acid substitution at subsite A with loss of a positive charge at position 102 (Arg102 to Gly).

  9. Sequence of a cDNA clone encoding the polysialic acid-rich and cytoplasmic domains of the neural cell adhesion molecule N-CAM.

    PubMed Central

    Hemperly, J J; Murray, B A; Edelman, G M; Cunningham, B A

    1986-01-01

    Purified fractions of the neural cell-adhesion molecule N-CAM from embryonic chicken brain contain two similar polypeptides (Mr, 160,000 and 130,000), each containing an amino-terminal external binding region, a carbohydrate-rich central region, and a carboxyl-terminal region that is associated with the cell. Previous studies indicate that the two polypeptides arise by alternative splicing of mRNAs transcribed from a single gene. We report here the 3556-nucleotide sequence of a cDNA clone (pEC208) that encodes 964 amino acids from the carbohydrate and cell-associated domains of the larger N-CAM polypeptide followed by 664 nucleotides of 3' untranslated sequence. The predicted protein sequence contains attachment sites for polysialic acid-containing oligosaccharides, four tandem homologous regions of polypeptide resembling those seen in the immunoglobulin superfamily, and a single hydrophobic sequence that appears to be the membrane-spanning segment. The cytoplasmic domain carboxyl terminal to this segment includes a block of approximately equal to 250 amino acids present in the larger but not in the smaller N-CAM polypeptide. We designate these the ld (large domain) polypeptide and the sd (small domain) polypeptide. The intracellular domains of the ld and sd polypeptides are likely to be critical for cell-surface modulation of N-CAM by interacting in a differential fashion with other intrinsic proteins or with the cytoskeleton. PMID:3458261

  10. Single-chain structure of human ceruloplasmin: the complete amino acid sequence of the whole molecule.

    PubMed Central

    Takahashi, N; Ortel, T L; Putnam, F W

    1984-01-01

    We have determined the amino acid sequence of the amino-terminal 67,000-dalton (67-kDa) fragment of human ceruloplasmin and have established overlapping sequences between the 67-kDa and 50-kDa fragments and between the 50-kDa and 19-kDa fragments. The 67-kDa fragment contains 480 amino acid residues and three glucosamine oligosaccharides. These results together with our previous sequence data for the 50-kDa and 19-kDa fragments complete the amino acid sequence of human ceruloplasmin. The polypeptide chain has a total of 1,046 amino acid residues (Mr 120,085) and has attachment sites for four glucosamine oligosaccharides; together these account for the total molecular mass of human ceruloplasmin (132 kDa). The sequence analysis of the peptides overlapping the fragments showed that one additional amino acid, arginine, is present between the 67-kDa and 50-kDa fragments, and another, lysine, is between the 50-kDa and 19-kDa fragments. Only two apparent sites of amino acid interchange have been identified in the polypeptide chain. Both involve a single-point interchange of glycine and lysine that would result in a difference in charge. The results of the complete sequence analysis verified that human ceruloplasmin is composed of a single polypeptide chain and that the subunit-like fragments are produced by proteolytic cleavage during purification (and possibly also in vivo). PMID:6582496

  11. Biosynthesis, glycosylation, and partial N-terminal amino acid sequence of the T-cell-activating protein TAP.

    PubMed Central

    Reiser, H; Coligan, J; Benacerraf, B; Rock, K L

    1987-01-01

    We have characterized the TAP molecule, an Ly-6 linked T-cell-activating glycoprotein. The three TAP bands that are precipitated from metabolically labeled cells display a common migration pattern in isoelectric focusing/NaDodSO4/PAGE gels and have common N-terminal sequences. This sequence is rich in cysteine and is homologous to that previously reported for the Ly-6.1E antigen. We, therefore, compared TAP and Ly-6.1E biochemically and found them to be structurally distinct. Given the role of TAP in T-cell activation, we further studied whether the molecule was phosphorylated. We have not found evidence for phosphorylation of the TAP protein. The carbohydrates present on the TAP molecule are resistant to peptide N-glycosidase F in vitro and tunicamycin in vivo. The upper band of the TAP triplet is susceptible to treatment with trifluoromethanesulfonic acid and thus seems to be of the O-linked rather than of the N-linked variety. The biosynthetic processing of TAP was studied in pulse-chase experiments. The middle band of the TAP triplet appears to be the earliest detectable species. Its conversion to the O-linked high molecular weight species can be blocked by monensin. Images PMID:3033645

  12. Biosynthesis, glycosylation, and partial N-terminal amino acid sequence of the T-cell-activating protein TAP

    SciTech Connect

    Reiser, H.; Coligan, J.; Benacerraf, B.; Rock, K.L.

    1987-05-01

    The authors have characterized the TAP molecule, an Ly-6 linked T-cell-activating glycoprotein. The three TAP bands that are precipitated from metabolically labeled cells display a common migration pattern in isoelectric focusing/NaDodSO/sub 4//PAGE gels and have common N-terminal sequences. This sequence is rich in cysteine and is homologous to that previously reported for the Ly-6.1E antigen. They therefore, compared TAP and Ly-6.1E biochemically and found them to be structurally distinct. Given the role of TAP in T-cell activation, they further studied whether the molecule was phosphorylated. We have not found evidence for phosphorylation of the TAP protein. The carbohydrates present on the TAP molecule are resistant to peptide N-glycosidase F in vitro and tunicamycin in vivo. The upper band of the TAP triplet is susceptible to treatment with trifluoromethanesulfonic acid and thus seems to be of the O-linked rather than of the N-linked variety. The biosynthetic processing of TAP was studied in pulse-chase experiments. The middle band of the TAP triplet appears to be the earliest detectable species. Its conversion to the O-linked high molecular weight species can be blocked by monensin.

  13. Identification, sequencing, and expression of Mycobacterium leprae superoxide dismutase, a major antigen.

    PubMed Central

    Thangaraj, H S; Lamb, F I; Davis, E O; Jenner, P J; Jeyakumar, L H; Colston, M J

    1990-01-01

    The gene encoding a major 28-kilodalton antigen of Mycobacterium leprae has now been sequenced and identified as the enzyme superoxide dismutase (SOD) on the basis of the high degree of homology with known SOD sequences. The deduced amino acid sequence shows 67% homology with a human manganese-utilizing SOD and 55% homology with the Escherichia coli manganese-utilizing enzyme. The gene is not expressed from its own promoter in E. coli but is expressed from its own promoter in Mycobacterium smegmatis. The amino acid sequences of epitopes recognized by monoclonal antibodies against the 28-kilodalton antigen have been determined. Images PMID:1692812

  14. Multiple Genome Sequences of Important Beer-Spoiling Lactic Acid Bacteria

    PubMed Central

    Geissler, Andreas J.; Vogel, Rudi F.

    2016-01-01

    Seven strains of important beer-spoiling lactic acid bacteria were sequenced using single-molecule real-time sequencing. Complete genomes were obtained for strains of Lactobacillus paracollinoides, Lactobacillus lindneri, and Pediococcus claussenii. The analysis of these genomes emphasizes the role of plasmids as the genomic foundation of beer-spoiling ability. PMID:27795248

  15. Correlation between carbohydrate-binding specificity and amino acid sequence of carbohydrate-binding regions of Cytisus-type anti-H(O) lectins.

    PubMed

    Konami, Y; Yamamoto, K; Osawa, T; Irimura, T

    1992-06-15

    A carbohydrate-binding peptide of the di-N-acetylchitobiose-binding Cytisus sessilifolius anti-H(O) lectin I (CSA-I) was isolated from the endoproteinase Asp-N digest of CSA-I by affinity chromatography on a column of N-acetyl-D-glucosamine oligomer-Sepharose (GlcNAc oligomer-Sepharose). The amino acid sequence of the carbohydrate-binding peptide of CSA-I was determined to be DTYFGKTYNPW using a gas-phase protein sequencer. This sequence corresponds to the sequence from Asp-129 to Trp-139 based on the primary structure of CSA-I, and shows a high degree of homology to those of the putative carbohydrate-binding peptide of the Laburnum alpinum lectin I (LAA-I) (DTYFGKAYNPW) and of the Ulex europaeus lectin II (UEA-II) (DSYFGKTYNPW). The binding of these three anti-H(O) lectins is known to be inhibited by di-N-acetylchitobiose but not by L-fucose. These results strongly suggest that there is a good correlation between the carbohydrate-binding specificity and the amino acid sequence of the carbohydrate-binding regions of di-N-acetylchitobiose-binding lectins.

  16. Amino acid sequences of alpha-helical segments from S-carboxymethylkerateine-A. Tryptic and chymotryptic peptides from a type-II segment.

    PubMed Central

    Hogg, D M; Dowling, L M; Crewther, W G

    1978-01-01

    1. Amino acid-sequence studies were done on a peptide of mol.wt. approx. 12500 that was isolated from the highly helical fragments obtained by partial chymotryptic digestion of the low-sulphur proteins (S-carboxymethylkerateine-A) from wool. 2. The peptides obtained by tryptic and chymotryptic digestion of this large peptide were separated by ion-exchange chromatography on DEAE-cellulose at pH8.5 with an (NH4)(2)CO(3) concentration gradient and, where necessary, purified further by paper electrophoresis. 3. Determination of the sequences of many of these peptides showed that a high proportion of the cationic residues occurs in pairs. 4. Although two of the four S-carboxymethylcysteine residues are located in what appears to be a non-helical region near the N-terminus the other two S-carboxymethylcysteine residues occur in or near sequences suggesting a helical conformation. 5. Some peptides were obtained, in low yields, that appeared to be homologues of more major ones. These suggest either homologies in the helical portions of the low-sulphur proteins or the presence of closely related amino acid sequences in helical regions of completely different origins. 6. A partial sequence of the complete peptide is proposed. PMID:581263

  17. Ra5G, a homologue of Ra5 in giant ragweed pollen: isolation, HLA-DR-associated activity and amino acid sequence.

    PubMed

    Goodfriend, L; Choudhury, A M; Klapper, D G; Coulter, K M; Dorval, G; Del Carpio, J; Osterland, C K

    1985-08-01

    Recent studies [Marsh et al. (1982) J. exp. Med. 155, 1439-1451; Coulter (1983) M.Sc. thesis, McGill University, Montreal, Canada; Coulter et al. (1983) in Genetic and Environmental Factors in Clinical Allergy (Edited by Marsh D.G., Blumenthal M.N. and Santilli J., Jr), University of Minnesota Press, Minneapolis, MN] have shown a highly significant association between HLA-Dw2/DR2 and host sensitivity to the 5000-D, 4-disulfide bonded protein Ra5S of short ragweed pollen. To extend these findings, we isolated Ra5G, an Ra5S-like protein, from giant ragweed pollen by gel and ion-exchange chromatography. The protein was homogeneous by polyacrylamide gel electrophoresis (pH 4.3), reverse-phase high-performance liquid chromatography, and antigenic assays. Its mol. wt and amino acid composition (including 8 half-cystine residues) were closely similar to Ra5S, but the two proteins had little or no antigenic or allergenic cross-reactivity. In a study of 200 ragweed-sensitive individuals, host sensitivity simultaneously to Ra5G and Ra5S was significantly associated with the DR2 allele. The amino acid sequence of Ra5G was determined and showed close homology with Ra5S. The potential function of a highly homologous decapeptidyl sequence stretch is discussed in relation to Ir gene control of immune response to the 2 proteins.

  18. PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.

    PubMed

    Mirarab, Siavash; Nguyen, Nam; Guo, Sheng; Wang, Li-San; Kim, Junhyong; Warnow, Tandy

    2015-05-01

    We introduce PASTA, a new multiple sequence alignment algorithm. PASTA uses a new technique to produce an alignment given a guide tree that enables it to be both highly scalable and very accurate. We present a study on biological and simulated data with up to 200,000 sequences, showing that PASTA produces highly accurate alignments, improving on the accuracy and scalability of the leading alignment methods (including SATé). We also show that trees estimated on PASTA alignments are highly accurate--slightly better than SATé trees, but with substantial improvements relative to other methods. Finally, PASTA is faster than SATé, highly parallelizable, and requires relatively little memory.

  19. SETG: Nucleic Acid Extraction and Sequencing for In Situ Life Detection on Mars

    NASA Astrophysics Data System (ADS)

    Mojarro, A.; Hachey, J.; Tani, J.; Smith, A.; Bhattaru, S. A.; Pontefract, A.; Doebler, R.; Brown, M.; Ruvkun, G.; Zuber, M. T.; Carr, C. E.

    2016-10-01

    We are developing an integrated nucleic acid extraction and sequencing instrument: the Search for Extra-Terrestrial Genomes (SETG) for in situ life detection on Mars. Our goals are to identify related or unrelated nucleic acid-based life on Mars.

  20. Draft Genome Sequence of Cyanobacterium sp. Strain IPPAS B-1200 with a Unique Fatty Acid Composition

    PubMed Central

    Starikov, Alexander Y.; Usserbaeva, Aizhan A.; Sinetova, Maria A.; Sarsekeyeva, Fariza K.; Zayadan, Bolatkhan K.; Ustinova, Vera V.; Kupriyanova, Elena V.; Los, Dmitry A.

    2016-01-01

    Here, we report the draft genome of Cyanobacterium sp. IPPAS strain B-1200, isolated from Lake Balkhash, Kazakhstan, and characterized by the unique fatty acid composition of its membrane lipids, which are enriched with myristic and myristoleic acids. The approximate genome size is 3.4 Mb, and the predicted number of coding sequences is 3,119. PMID:27856596

  1. Screening of Israeli Holstein-Friesian cattle for restriction fragment length polymorphisms using homologous and heterologous deoxyribonucleic acid probes.

    PubMed

    Hallerman, E M; Nave, A; Soller, M; Beckmann, J S

    1988-12-01

    Genomic DNA of Israeli Holstein-Friesian dairy cattle were screened with a battery of 17 cloned or subcloned DNA probes in an attempt to document restriction fragment length polymorphisms at a number of genetic loci. Restriction fragment length polymorphisms were observed at the chymosin, oxytocin-neurophysin I, lutropin beta, keratin III, keratin VI, keratin VII, prolactin, and dihydrofolate reductase loci. Use of certain genomic DNA fragments as probes produced hybridization patterns indicative of satellite DNA at the respective loci. Means for distinguishing hybridizations to coding sequences for unique genes from those to satellite DNA were developed. Results of this study are discussed in terms of strategy for the systematic development of large numbers of bovine genomic polymorphisms.

  2. Two ras genes in Dictyostelium minutum show high sequence homology, but different developmental regulation from Dictyostelium discoideum rasD and rasG genes.

    PubMed

    van Es, S; Kooistra, R A; Schaap, P

    1997-03-10

    The social amoeba Dictyostelium discoideum expresses five ras genes at different stages of development. One of them, DdrasD is expressed during postaggregative development and transcription is induced by extracellular cAMP. A homologue of DdrasD, the DdrasG gene, is expressed exclusively during vegetative growth. We cloned two ras homologues Dmras1 and Dmras2 from the primitive species D. minutum, which show high homology to DdrasD and DdrasG and less homology to the other Ddras genes. In contrast to the DdrasD and DdrasG genes, both the Dmras1 and Dmras2 genes are expressed during the entire course of development. The expression levels are low during growth, increase at the onset of starvation and do not decrease until fruiting bodies have formed. Expression of neither Dmras1 or Dmras2 is regulated by cAMP. So even though the high degree of homology between the ras genes of different species suggests conservation of function, this function is apparently not associated with a specific developmental stage.

  3. Parvalbumins from coelacanth muscle. III. Amino acid sequence of the major component.

    PubMed

    Jauregui-Adell, J; Pechere, J F

    1978-09-26

    The primary structure of the major parvalbumin (pI = 4.52) from coelacanth muscle (Latimeria chalumnae) has been determined. Sequence analysis of the tryptic peptides, in some cases obtained with beta-trypsin, accounts for the total amino acid content of the protein. Chymotryptic peptides provide appropriate sequence overlaps, to complete the localization of the tryptic peptides. Examination of the amino acid sequence of this protein shows the typical structure of a beta-parvalbumin. Its position in the dendrogram of related calcium-binding proteins corresponds to that usually accepted for crossopterygians.

  4. Extended amino acid sequences around the active-site lysine residue of class-I fructose 1,6-bisphosphate aldolases from rabbit muscle, sturgeon muscle, trout muscle and ox liver.

    PubMed Central

    Benfield, P A; Forcina, B G; Gibbons, I; Perham, R N

    1979-01-01

    1. Amino acid sequences covering the region between residues 173 and 248 [adopting the numbering system proposed by Lai, Nakai & Chang (1974) Science 183, 1204-1206] were derived for trout (Salmo trutta) muscle aldolase and for ox liver aldolase. A comparable sequence was derived for residues 180-248 of sturgeon (Acipenser transmontanus) muscle aldolase. The close homology with the rabbit muscle enzyme was used to align the peptides of the other aldolases from which the sequences were derived. The results also allowed a partial sequence for the N-terminal 39 residues for the ox liver enzyme to be deduced. 2. In the light of the strong homology evinced for these enzymes, a re-investigation of the amino acid sequence of rabbit muscle aldolase between residues 181 and 185 was undertaken. This indicated the presence of a hitherto unsuspected -Ile-Val-sequence between residues 181 and 182 and the need to invert the sequence -Glu-Val- to -Val-Glx- at positions 184 and 185. 3. Comparison of the available amino acid sequences of these enzymes suggested an early evolutionary divergence of the genes for muscle and liver aldolases. It was also consistent with other evidence that the central region of the primary structure of these enzymes (which includes the active-site lysine-227) forms part of a conserved folding domain in the protein subunit. 4. Detailed evidence for the amino acid sequences proposed has been deposited as Suy Lending Division, Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1978) 169, 5. PMID:534504

  5. The Shigella Virulence Factor IcsA Relieves N-WASP Autoinhibition by Displacing the Verprolin Homology/Cofilin/Acidic (VCA) Domain*

    PubMed Central

    Mauricio, Rui P. M.; Jeffries, Cy M.; Svergun, Dmitri I.; Deane, Janet E.

    2017-01-01

    Shigella flexneri is a bacterial pathogen that invades cells of the gastrointestinal tract, causing severe dysentery. Shigella mediates intracellular motility and spreading via actin comet tail formation. This process is dependent on the surface-exposed, membrane-embedded virulence factor IcsA, which recruits the host actin regulator N-WASP. Although it is clear that Shigella requires N-WASP for this process, the molecular details of this interaction and the mechanism of N-WASP activation remain poorly understood. Here, we show that co-expression of full-length IcsA and the Shigella membrane protease IcsP yields highly pure IcsA passenger domain (residues 53–758). We show that IcsA is monomeric and describe the solution structure of the passenger domain obtained by small-angle X-ray scattering (SAXS) analysis. The SAXS-derived models suggest that IcsA has an elongated shape but, unlike most other autotransporter proteins, possesses a central kink revealing a distinctly curved structure. Pull-down experiments show direct binding of the IcsA passenger domain to both the WASP homology 1 (WH1) domain and the GTPase binding domain (GBD) of N-WASP and no binding to the verprolin homology/cofilin/acidic (VCA) region. Using fluorescence polarization experiments, we demonstrate that IcsA binding to the GBD region displaces the VCA peptide and that this effect is synergistically enhanced upon IcsA binding to the WH1 region. Additionally, domain mapping of the IcsA interaction interface reveals that different regions of IcsA bind to the WH1 and GBD domains of N-WASP. Taken together, our data support a model where IcsA and N-WASP form a tight complex releasing the N-WASP VCA domain to recruit the host cell machinery for actin tail formation. PMID:27881679

  6. Homology and causes.

    PubMed

    Van Valen, L M

    1982-09-01

    Homology is resemblance caused by a continuity of information. In biology it is a unified developmental phenomenon. Homologies among and within individuals intergrade in several ways, so historical homology cannot be separated sharply from repetitive homology. Nevertheless, the consequences of historical and repetitive homologies can be mutually contradictory. A detailed discussion of the rise and fall of the "premolar-analogy" theory of homologies of mammalian molar-tooth cusps exemplifies such a contradiction. All other hypotheses of historical homology which are based on repetitive homology, such as the foliar theory of the flower considered phyletically, are suspect.

  7. Purification, characterization and partial amino acid sequence of glycogen synthase from Saccharomyces cerevisiae.

    PubMed Central

    Carabaza, A; Arino, J; Fox, J W; Villar-Palasi, C; Guinovart, J J

    1990-01-01

    Glycogen synthase from Saccharomyces cerevisiae was purified to homogeneity. The enzyme showed a subunit molecular mass of 80 kDa. The holoenzyme appears to be a tetramer. Antibodies developed against purified yeast glycogen synthase inactivated the enzyme in yeast extracts and allowed the detection of the protein in Western blots. Amino acid analysis showed that the enzyme is very rich in glutamate and/or glutamine residues. The N-terminal sequence (11 amino acid residues) was determined. In addition, selected tryptic-digest peptides were purified by reverse-phase h.p.l.c. and submitted to gas-phase sequencing. Up to eight sequences (79 amino acid residues) could be aligned with the human muscle enzyme sequence. Levels of identity range between 37 and 100%, indicating that, although human and yeast glycogen synthases probably share some conserved regions, significant differences in their primary structure should be expected. Images Fig. 1. Fig. 2. Fig. 3. PMID:2114092

  8. Amino acid sequence of anionic peroxidase from the windmill palm tree Trachycarpus fortunei.

    PubMed

    Baker, Margaret R; Zhao, Hongwei; Sakharov, Ivan Yu; Li, Qing X

    2014-12-10

    Palm peroxidases are extremely stable and have uncommon substrate specificity. This study was designed to fill in the knowledge gap about the structures of a peroxidase from the windmill palm tree Trachycarpus fortunei. The complete amino acid sequence and partial glycosylation were determined by MALDI-top-down sequencing of native windmill palm tree peroxidase (WPTP), MALDI-TOF/TOF MS/MS of WPTP tryptic peptides, and cDNA sequencing. The propeptide of WPTP contained N- and C-terminal signal sequences which contained 21 and 17 amino acid residues, respectively. Mature WPTP was 306 amino acids in length, and its carbohydrate content ranged from 21% to 29%. Comparison to closely related royal palm tree peroxidase revealed structural features that may explain differences in their substrate specificity. The results can be used to guide engineering of WPTP and its novel applications.

  9. Despite sequence homologies to gluten, salivary proline-rich proteins do not elicit immune responses central to the pathogenesis of celiac disease.

    PubMed

    Tian, Na; Leffler, Daniel A; Kelly, Ciaran P; Hansen, Joshua; Marietta, Eric V; Murray, Joseph A; Schuppan, Detlef; Helmerhorst, Eva J

    2015-12-01

    Celiac disease (CD) is an inflammatory disorder triggered by ingested gluten, causing immune-mediated damage to the small-intestinal mucosa. Gluten proteins are strikingly similar in amino acid composition and sequence to proline-rich proteins (PRPs) in human saliva. On the basis of this feature and their shared destination in the gastrointestinal tract, we hypothesized that salivary PRPs may modulate gluten-mediated immune responses in CD. Parotid salivary secretions were collected from CD patients, refractory CD patients, non-CD patients with functional gastrointestinal complaints, and healthy controls. Structural similarities of PRPs with gluten were probed with anti-gliadin antibodies. Immune responses to PRPs were investigated toward CD patient-derived peripheral blood mononuclear cells and in a humanized transgenic HLA-DQ2/DQ8 mouse model for CD. Anti-gliadin antibodies weakly cross-reacted with the abundant salivary amylase but not with PRPs. Likewise, the R5 antibody, recognizing potential antigenic gluten epitopes, showed negligible reactivity to salivary proteins from all groups. Inflammatory responses in peripheral blood mononuclear cells were provoked by gliadins whereas responses to PRPs were similar to control levels, and PRPs did not compete with gliadins in immune stimulation. In vivo, PRP peptides were well tolerated and nonimmunogenic in the transgenic HLA-DQ2/DQ8 mouse model. Collectively, although structurally similar to dietary gluten, salivary PRPs were nonimmunogenic in CD patients and in a transgenic HLA-DQ2/DQ8 mouse model for CD. It is possible that salivary PRPs play a role in tolerance induction to gluten early in life. Deciphering the structural basis for the lack of immunogenicity of salivary PRPs may further our understanding of the toxicity of gluten.

  10. Despite sequence homologies to gluten, salivary proline-rich proteins do not elicit immune responses central to the pathogenesis of celiac disease

    PubMed Central

    Tian, Na; Leffler, Daniel A.; Kelly, Ciaran P.; Hansen, Joshua; Marietta, Eric V.; Murray, Joseph A.; Schuppan, Detlef

    2015-01-01

    Celiac disease (CD) is an inflammatory disorder triggered by ingested gluten, causing immune-mediated damage to the small-intestinal mucosa. Gluten proteins are strikingly similar in amino acid composition and sequence to proline-rich proteins (PRPs) in human saliva. On the basis of this feature and their shared destination in the gastrointestinal tract, we hypothesized that salivary PRPs may modulate gluten-mediated immune responses in CD. Parotid salivary secretions were collected from CD patients, refractory CD patients, non-CD patients with functional gastrointestinal complaints, and healthy controls. Structural similarities of PRPs with gluten were probed with anti-gliadin antibodies. Immune responses to PRPs were investigated toward CD patient-derived peripheral blood mononuclear cells and in a humanized transgenic HLA-DQ2/DQ8 mouse model for CD. Anti-gliadin antibodies weakly cross-reacted with the abundant salivary amylase but not with PRPs. Likewise, the R5 antibody, recognizing potential antigenic gluten epitopes, showed negligible reactivity to salivary proteins from all groups. Inflammatory responses in peripheral blood mononuclear cells were provoked by gliadins whereas responses to PRPs were similar to control levels, and PRPs did not compete with gliadins in immune stimulation. In vivo, PRP peptides were well tolerated and nonimmunogenic in the transgenic HLA-DQ2/DQ8 mouse model. Collectively, although structurally similar to dietary gluten, salivary PRPs were nonimmunogenic in CD patients and in a transgenic HLA-DQ2/DQ8 mouse model for CD. It is possible that salivary PRPs play a role in tolerance induction to gluten early in life. Deciphering the structural basis for the lack of immunogenicity of salivary PRPs may further our understanding of the toxicity of gluten. PMID:26505973

  11. Nucleotide and deduced amino acid sequences of a new subtilisin from an alkaliphilic Bacillus isolate.

    PubMed

    Saeki, Katsuhisa; Magallones, Marietta V; Takimura, Yasushi; Hatada, Yuji; Kobayashi, Tohru; Kawai, Shuji; Ito, Susumu

    2003-10-01

    The gene for a new subtilisin from the alkaliphilic Bacillus sp. KSM-LD1 was cloned and sequenced. The open reading frame of the gene encoded a 97 amino-acid prepro-peptide plus a 307 amino-acid mature enzyme that contained a possible catalytic triad of residues, Asp32, His66, and Ser224. The deduced amino acid sequence of the mature enzyme (LD1) showed approximately 65% identity to those of subtilisins SprC and SprD from alkaliphilic Bacillus sp. LG12. The amino acid sequence identities of LD1 to those of previously reported true subtilisins and high-alkaline proteases were below 60%. LD1 was characteristically stable during incubation with surfactants and chemical oxidants. Interestingly, an oxidizable Met residue is located next to the catalytic Ser224 of the enzyme as in the cases of the oxidation-susceptible subtilisins reported to date.

  12. Molecular cloning of insect pro-phenol oxidase: a copper-containing protein homologous to arthropod hemocyanin.

    PubMed Central

    Kawabata, T; Yasuhara, Y; Ochiai, M; Matsuura, S; Ashida, M

    1995-01-01

    Pro-phenol oxidase [pro-PO; zymogen of phenol oxidase (monophenol, L-dopa:oxygen oxidoreductase, EC 1.14.18.1)] is present in the hemolymph plasma of the silkworm Bombyx mori. Pro-PO is a heterodimeric protein synthesized by hemocytes. A specific serine proteinase activates both subunits through a limited proteolysis. The amino acid sequences of both subunits were deduced from their respective cDNAs; amino acid sequence homology between the subunits was 51%. The deduced amino acid sequences revealed domains highly homologous to the copper-binding site sequences (copper-binding sites A and B) of arthropod hemocyanins. The overall sequence homology between silkworm pro-PO and arthropod hemocyanins ranged from 29 to 39%. Phenol oxidases from prokaryotes, fungi, and vertebrates have sequences homologous to only the copper-binding site B of arthropod hemocyanins. Thus, silkworm pro-PO DNA described here appears distinctive and more closely related to arthropod hemocyanins. The pro-PO-activating serine proteinase was shown to hydrolyze peptide bonds at the carboxyl side of arginine in the sequence-Asn-49-Arg-50-Phe-51-Gly-52- of both subunits. Amino groups of N termini of both subunits were indicated to be N-acetylated. The cDNAs of both pro-PO subunits lacked signal peptide sequences. This result supports our contention that mature pro-PO accumulates in the cytoplasm of hemocytes and is released by cell rupture, as for arthropod hemocyanins. PMID:7644494

  13. An analysis of amino acid sequences surrounding archaeal glycoprotein sequons.

    PubMed

    Abu-Qarn, Mehtap; Eichler, Jerry

    2007-05-01

    Despite having provided the first example of a prokaryal glycoprotein, little is known of the rules governing the N-glycosylation process in Archaea. As in Eukarya and Bacteria, archaeal N-glycosylation takes place at the Asn residues of Asn-X-Ser/Thr sequons. Since not all sequons are utilized, it is clear that other factors, including the context in which a sequon exists, affect glycosylation efficiency. As yet, the contribution to N-glycosylation made by sequon-bordering residues and other related factors in Archaea remains unaddressed. In the following, the surroundings of Asn residues confirmed by experiment as modified were analyzed in an attempt to define sequence rules and requirements for archaeal N-glycosylation.

  14. Hybridization probe for femtomolar quantification of selected nucleic acid sequences on a disposable electrode.

    PubMed

    Jenkins, Daniel M; Chami, Bilal; Kreuzer, Matthias; Presting, Gernot; Alvarez, Anne M; Liaw, Bor Yann

    2006-04-01

    Mixed monolayers of electroactive hybridization probes on gold surfaces of a disposable electrode were investigated as a technology for simple, sensitive, selective, and rapid gene identification. Hybridization to the ferrocene-labeled hairpin probes reproducibly diminished cyclic redox currents, presumably due to a displacement of the label from the electrode. Observed peak current densities were roughly 1000x greater than those observed in previous studies, such that results could easily be interpreted without the use of algorithms to correct for background polarization currents. Probes were sensitive to hybridization with a number of oligonucleotide sequences with varying homology, but target oligonucleotides could be distinguished from competing nontarget sequences based on unique "melting" profiles from the probe. Detection limits were demonstrated down to nearly 100 fM, which may be low enough to identify certain genetic conditions or infections without amplification. This technology has rich potential for use in field devices for gene identification as well as in gene microarrays.

  15. Amino acid sequence and chemical modification of a novel alpha-neurotoxin (Oh-5) from king cobra (Ophiophagus hannah) venom.

    PubMed

    Lin, S R; Leu, L F; Chang, L S; Chang, C C

    1997-04-01

    A novel alpha-neurotoxin, Oh-5, was isolated from king cobra (Ophiophagus hannah) venom and purified by successive SP-Sephadex C-25 column chromatography and reversed-phase HPLC. The complete sequence of Oh-5 was determined by Edman degradation of peptide fragments generated by endopeptidases, i.e., trypsin, Saccharomyces aureus V8 protease and lysyl endopeptidase. This novel toxin comprises 72 amino acid residues with 10 cysteines. The sequence shows 89% sequence homology with Oh-4, and 60% with Toxins a and b from the same venom. The tyrosine, tryptophan, lysine and arginine residues in Oh-5 were modified with tetranitromethane (TNM), 2-nitrophenylsulfenyl (NPS) chloride, trinitrobenzene sulfonate (TNBS), and p-hydroxyphenylglyoxal (HPG), respectively. Modification of Tyr-4 or Trp-27 did not affect the lethal toxicity at all, while the Tyr-4 and 23 nitrated derivative retained about 50% of the lethality of native toxin. Selective trinitrophenylation of Lys-51 or 69 resulted in a decrease in lethality by 29%, and 50% lethality was retained after modification of Lys-2, 51, and 69. A drastic decrease in lethality to 26% was observed when both Arg-35 and 37 were modified. The neurotoxicity was further decreased when Arg-9 was additionally modified. These results suggest that the aromatic residues, Tyr-4 and Trp-27, are not crucial for the neurotoxicity, whereas the cationic residues are involved in multipoint contact between the toxin molecule and the nicotinic acetylcholine receptor (nAChR). The residues Tyr-23 and Arg-35 and 37 in the central loop of Oh-5 seem to contribute greatly to the neurotoxicity.

  16. Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification

    PubMed Central

    Sinclair, Robert M.; Ravantti, Janne J.

    2017-01-01

    ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids

  17. Classification of mouse VK groups based on the partial amino acid sequence to the first invariant tryptophan: impact of 14 new sequences from IgG myeloma proteins.

    PubMed

    Potter, M; Newell, J B; Rudikoff, S; Haber, E

    1982-12-01

    Fourteen new VK sequences derived from BALB/c IgG myeloma proteins were determined to the first invariant tryptophan (Trp 35). These partial sequences were compared with 65 other published VK sequences using a computer program. The 79 sequences were organized according to the length of the sequence from the amino terminus to the first invariant tryptophan (Trp 35), into seven groups (33, 34, 35, 36, 39, 40 and 41aa). A distance matrix of all 79 sequences was then computed, i.e. the number of amino acid substitutions necessary to convert one sequence to another was determined. From these data a dendrogram was constructed. Most of the VK sequences fell into clusters or closely related groups. The definition of a sequence group is arbitrary but facilitates the classification of VK proteins. We used 12 substitutions as the basis for defining a sequence group based on the known number of substitutions that are found in the VK21 proteins. By this criterion there were 18 groups in the Trp 35 dendrogram. Twelve of the 14 new sequences fell into one of these sequence groups; two formed new sequence groups. Collective amino acid sequencing is still encountering new VK structures indicating more sequences will be required to attain an accurate estimate of the total number of VK groups. Updated dendrograms can be quickly generated to include newly generated sequences.

  18. Homology, Analogy, and Ethology.

    ERIC Educational Resources Information Center

    Beer, Colin G.

    1984-01-01

    Because the main criterion of structural homology (the principle of connections) does not exist for behavioral homology, the utility of the ethological concept of homology has been questioned. The confidence with which behavioral homologies can be claimed varies inversely with taxonomic distance. Thus, conjectures about long-range phylogenetic…

  19. Biosynthesis of D-alanyl-lipoteichoic acid: cloning, nucleotide sequence, and expression of the Lactobacillus casei gene for the D-alanine-activating enzyme.

    PubMed Central

    Heaton, M P; Neuhaus, F C

    1992-01-01

    The D-alanine-activating enzyme (Dae; EC 6.3.2.4) encoded by the dae gene from Lactobacillus casei ATCC 7469 is a cytosolic protein essential for the formation of the D-alanyl esters of membrane-bound lipoteichoic acid. The gene has been cloned, sequenced, and expressed in Escherichia coli, an organism which does not possess Dae activity. The open reading frame is 1,518 nucleotides and codes for a protein of 55.867 kDa, a value in agreement with the 56 kDa obtained by electrophoresis. A putative promoter and ribosome-binding site immediately precede the dae gene. A second open reading frame contiguous with the dae gene has also been partially sequenced. The organization of these genetic elements suggests that more than one enzyme necessary for the biosynthesis of D-alanyl-lipoteichoic acid may be present in this operon. Analysis of the amino acid sequence deduced from the dae gene identified three regions with significant homology to proteins in the following groups of ATP-utilizing enzymes: (i) the acid-thiol ligases, (ii) the activating enzymes for the biosynthesis of enterobactin, and (iii) the synthetases for tyrocidine, gramicidin S, and penicillin. From these comparisons, a common motif (GXXGXPK) has been identified that is conserved in the 19 protein domains analyzed. This motif may represent the phosphate-binding loop of an ATP-binding site for this class of enzymes. A DNA fragment (1,568 nucleotides) containing the dae gene and its putative ribosome-binding site has been subcloned and expressed in E. coli. Approximately 0.5% of the total cell protein is active Dae, whereas 21% is in the form of inclusion bodies. The isolation of this minimal fragment without a native promoter sequence provides the basis for designing a genetic system for modulating the D-alanine ester content of lipoteichoic acid. PMID:1385594

  20. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, J.N.; Straume, T.; Bogen, K.T.

    1997-04-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.

  1. Detection and isolation of nucleic acid sequences using competitive hybridization probes

    DOEpatents

    Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

    1997-01-01

    A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.

  2. Amino acid sequence around the active-site serine residue in the acyltransferase domain of goat mammary fatty acid synthetase.

    PubMed Central

    Mikkelsen, J; Højrup, P; Rasmussen, M M; Roepstorff, P; Knudsen, J

    1985-01-01

    Goat mammary fatty acid synthetase was labelled in the acyltransferase domain by formation of O-ester intermediates by incubation with [1-14C]acetyl-CoA and [2-14C]malonyl-CoA. Tryptic-digest and CNBr-cleavage peptides were isolated and purified by high-performance reverse-phase and ion-exchange liquid chromatography. The sequences of the malonyl- and acetyl-labelled peptides were shown to be identical. The results confirm the hypothesis that both acetyl and malonyl groups are transferred to the mammalian fatty acid synthetase complex by the same transferase. The sequence is compared with those of other fatty acid synthetase transferases. PMID:3922356

  3. Ligation with nucleic acid sequence-based amplification.

    PubMed

    Ong, Carmichael; Tai, Warren; Sarma, Aartik; Opal, Steven M; Artenstein, Andrew W; Tripathi, Anubhav

    2012-01-01

    This work presents a novel method for detecting nucleic acid targets using a ligation step along with an isothermal, exponential amplification step. We use an engineered ssDNA with two variable regions on the ends, allowing us to design the probe for optimal reaction kinetics and primer binding. This two-part probe is ligated by T4 DNA Ligase only when both parts bind adjacently to the target. The assay demonstrates that the expected 72-nt RNA product appears only when the synthetic target, T4 ligase, and both probe fragments are present during the ligation step. An extraneous 38-nt RNA product also appears due to linear amplification of unligated probe (P3), but its presence does not cause a false-positive result. In addition, 40 mmol/L KCl in the final amplification mix was found to be optimal. It was also found that increasing P5 in excess of P3 helped with ligation and reduced the extraneous 38-nt RNA product. The assay was also tested with a single nucleotide polymorphism target, changing one base at the ligation site. The assay was able to yield a negative signal despite only a single-base change. Finally, using P3 and P5 with longer binding sites results in increased overall sensitivity of the reaction, showing that increasing ligation efficiency can improve the assay overall. We believe that this method can be used effectively for a number of diagnostic assays.

  4. Thin-film technology for direct visual detection of nucleic acid sequences: applications in clinical research.

    PubMed

    Jenison, Robert D; Bucala, Richard; Maul, Diana; Ward, David C

    2006-01-01

    Certain optical conditions permit the unaided eye to detect thickness changes on surfaces on the order of 20 A, which are of similar dimensions to monomolecular interactions between proteins or hybridization of complementary nucleic acid sequences. Such detection exploits specific interference of reflected white light, wherein thickness changes are perceived as surface color changes. This technology, termed thin-film detection, allows for the visualization of subattomole amounts of nucleic acid targets, even in complex clinical samples. Thin-film technology has been applied to a broad range of clinically relevant indications, including the detection of pathogenic bacterial and viral nucleic acid sequences and the discrimination of sequence variations in human genes causally related to susceptibility or severity of disease.

  5. Conservation of Shannon's redundancy for proteins. [information theory applied to amino acid sequences

    NASA Technical Reports Server (NTRS)

    Gatlin, L. L.

    1974-01-01

    Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.

  6. RNA internal standard synthesis by nucleic acid sequence-based amplification for competitive quantitative amplification reactions.

    PubMed

    Lo, Wan-Yu; Baeumner, Antje J

    2007-02-15

    Nucleic acid sequence-based amplification (NASBA) reactions have been demonstrated to successfully synthesize new sequences based on deletion and insertion reactions. Two RNA internal standards were synthesized for use in competitive amplification reactions in which quantitative analysis can be achieved by coamplifying the internal standard with the wild type sample. The sequences were created in two consecutive NASBA reactions using the E. coli clpB mRNA sequence as model analyte. The primer sequences of the wild type sequence were maintained, and a 20-nt-long segment inside the amplicon region was exchanged for a new segment of similar GC content and melting temperature. The new RNA sequence was thus amplifiable using the wild type primers and detectable via a new inserted sequence. In the first reaction, the forwarding primer and an additional 20-nt-long sequence was deleted and replaced by a new 20-nt-long sequence. In the second reaction, a forwarding primer containing as 5' overhang sequence the wild type primer sequence was used. The presence of pure internal standard was verified using electrochemiluminescence and RNA lateral-flow biosensor analysis. Additional sequence deletion in order to shorten the internal standard amplicons and thus generate higher detection signals was found not to be required. Finally, a competitive NASBA reaction between one internal standard and the wild type sequence was carried out proving its functionality. This new rapid construction method via NASBA provides advantages over the traditional techniques since it requires no traditional cloning procedures, no thermocyclers, and can be completed in less than 4 h.

  7. Uses of phage display in agriculture: sequence analysis and comparative modeling of late embryogenesis abundant client proteins suggest protein-nucleic acid binding functionality.

    PubMed

    Kushwaha, Rekha; Downie, A Bruce; Payne, Christina M

    2013-01-01

    A group of intrinsically disordered, hydrophilic proteins-Late Embryogenesis Abundant (LEA) proteins-has been linked to survival in plants and animals in periods of stress, putatively through safeguarding enzymatic function and prevention of aggregation in times of dehydration/heat. Yet despite decades of effort, the molecular-level mechanisms defining this protective function remain unknown. A recent effort to understand LEA functionality began with the unique application of phage display, wherein phage display and biopanning over recombinant Seed Maturation Protein homologs from Arabidopsis thaliana and Glycine max were used to retrieve client proteins at two different temperatures, with one intended to represent heat stress. From this previous study, we identified 21 client proteins for which clones were recovered, sometimes repeatedly. Here, we use sequence analysis and homology modeling of the client proteins to ascertain common sequence and structural properties that may contribute to binding affinity with the protective LEA protein. Our methods uncover what appears to be a predilection for protein-nucleic acid interactions among LEA client proteins, which is suggestive of subcellular residence. The results from this initial computational study will guide future efforts to uncover the protein protective mechanisms during heat stress, potentially leading to phage-display-directed evolution of synthetic LEA molecules.

  8. A rat gene with sequence homology to the Drosophila gene hairy is rapidly induced by growth factors known to influence neuronal differentiation.

    PubMed Central

    Feder, J N; Jan, L Y; Jan, Y N

    1993-01-01

    Several genes encoding transcription factors with a helix-loop-helix (HLH) motif are involved in the early process of neural development in Drosophila spp. We report the isolation from the rat a homolog of one of these genes, called hairy. The rat-hairy-like (RHL) gene is expressed early during embryogenesis. In contrast to the restricted expression of hairy mRNA in Drosophila spp., however, the mRNA encoded by RHL is detectable in all tissues examined. Stimulation of PC12 pheochromocytoma cells by nerve growth factor, basis fibroblast growth factor, or epidermal growth factor or of Rat-1 fibroblasts by epidermal growth factor causes a rapid and transient induction of the RHL gene. Thus, RHL acts as an immediate-early gene that can potentially transduce growth factor signals during the development of the mammalian embryo. Images PMID:8417318

  9. I-BasI and I-HmuI: two phage intron-encoded endonucleases with homologous DNA recognition sequences but distinct DNA specificities.

    PubMed

    Landthaler, Markus; Shen, Betty W; Stoddard, Barry L; Shub, David A

    2006-05-12

    I-HmuI and I-BasI are two highly similar nicking DNA endonucleases, which are each encoded by a group I intron inserted into homologous sites within the DNA polymerase genes of Bacillus phages SPO1 and Bastille, respectively. Here, we present a comparison of the DNA specificities and cleavage activities of these enconucleases with homologous target sites. I-BasI has properties that are typical of homing endonucleases, nicking the intron-minus polymerase genes in either host genome, three nucleotides downstream of the intron insertion site. In contrast, I-HmuI nicks both the intron-plus and intron-minus site in its own host genome, but does not act on the target from Bastille phage. Although the enzymes have distinct DNA substrate specificities, both bind to an identical 25bp region of their respective intron-minus DNA polymerase genes surrounding the intron insertion site. The endonucleases appear to interact with the DNA substrates in the downstream exon 2 in a similar manner. However, whereas I-HmuI is known to make its only base-specific contacts within this exon region, structural modeling analyses predict that I-BasI might make specific base contacts both upstream and downstream of the site of intron insertion. The predicted requirement for base-specific contacts in exon 1 for cleavage by I-BasI was confirmed experimentally. This explains the difference in substrate specificities between the two enzymes, including the observation that the former enzyme is relatively insensitive to the presence of an intron upstream of exon 2. These differences are likely a consequence of divergent evolutionary constraints.

  10. The amino acid sequence of Canada goose (Branta canadensis) and mute swan (Cygnus olor) hemoglobins. Two different species with identical beta-chains.

    PubMed

    Oberthür, W; Godovac-Zimmermann, J; Braunitzer, G; Wiesner, H

    1982-08-01

    The amino acid sequences of the alpha- and beta-chains from the major hemoglobin component (HbA) of Canada goose (Branta canadensis) and mute swan (Cygnus olor) are given. The alpha-chains are of the alpha A-type, since alpha D-type was expressed but only found in low concentrations. By homologous comparison, greylag goose hemoglobin (Anser anser) and Canada goose hemoglobin alpha-chains differ by two exchanges, and beta-chains by three exchanges. A valine substitution for threonine was found at position alpha 34 (B15). This exchange is a result of a two point mutation. Thus, there are three nucleotide mutations in alpha-chains, as in beta-chains. Substitutions in positions alpha 34 (B15) and beta 125 (H3) have modified intersubunit contacts (alpha 1 beta 1-contacts). A comparison of mute swan hemoglobin with greylag goose hemoglobin shows four exchanges in alpha-chains and three in beta-chains. Canada goose and mute swan have identical beta-chains, while alpha-chains differ in two amino acids. One of these exchanges is implicated in one of the alpha 1 beta 1-contact points (alpha 34) where isoleucine substitution for valine was found. Comparison of hemoglobins from different species in the same tribe (Anserini) shows a high homology between Canada goose and mute swan hemoglobins.

  11. Amino acid sequences of two nonspecific lipid-transfer proteins from germinated castor bean.

    PubMed

    Takishima, K; Watanabe, S; Yamada, M; Suga, T; Mamiya, G

    1988-11-01

    The amino acid sequence of two nonspecific lipid-transfer proteins (nsLTP) B and C from germinated castor bean seeds have been determined. Both the proteins consist of 92 residues, as for nsLTP previously reported, and their calculated Mr values are 9847 and 9593 for nsLTP-B and nsLTP-C, respectively. The sequences of nsLTP-B and nsLTP-C, compared to the known sequence of nsLTP-A from the same source, are 68% and 35% similar, respectively. No variation was found at the positions of the cysteine residues, indicating that they might be involved in disulfide bridges.

  12. A classification of glycosyl hydrolases based on amino acid sequence similarities.

    PubMed Central

    Henrissat, B

    1991-01-01

    The amino acid sequences of 301 glycosyl hydrolases and related enzymes have been compared. A total of 291 sequences corresponding to 39 EC entries could be classified into 35 families. Only ten sequences (less than 5% of the sample) could not be assigned to any family. With the sequences available for this analysis, 18 families were found to be monospecific (containing only one EC number) and 17 were found to be polyspecific (containing at least two EC numbers). Implications on the folding characteristics and mechanism of action of these enzymes and on the evolution of carbohydrate metabolism are discussed. With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised. PMID:1747104

  13. In silico comparative analysis of DNA and amino acid sequences for prion protein gene.

    PubMed

    Kim, Y; Lee, J; Lee, C

    2008-01-01

    Genetic variability might contribute to species specificity of prion diseases in various organisms. In this study, structures of the prion protein gene (PRNP) and its amino acids were compared among species of which sequence data were available. Comparisons of PRNP DNA sequences among 12 species including human, chimpanzee, monkey, bovine, ovine, dog, mouse, rat, wallaby, opossum, chicken and zebrafish allowed us to identify candidate regulatory regions in intron 1 and 3'-untranslated region (UTR) in addition to the coding region. Highly conserved putative binding sites for transcription factors, such as heat shock factor 2 (HSF2) and myocite enhancer factor 2 (MEF2), were discovered in the intron 1. In 3'-UTR, the functional sequence (ATTAAA) for nucleus-specific polyadenylation was found in all the analysed species. The functional sequence (TTTTTAT) for maturation-specific polyadenylation was identically observed only in ovine, and one or two nucleotide mismatches in the other species. A comparison of the amino acid sequences in 53 species revealed a large sequence identity. Especially the octapeptide repeat region was observed in all the species but frog and zebrafish. Functional changes and susceptibility to prion diseases with various isoforms of prion protein could be caused by numeric variability and conformational changes discovered in the repeat sequences.

  14. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  15. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  16. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... nucleotide and/or amino acid sequence submissions in computer readable form. 1.824 Section 1.824 Patents... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... readable form may be created by any means, such as word processors, nucleotide/amino acid sequence...

  17. Homoplasy in genome-wide analysis of rare amino acid replacements: the molecular-evolutionary basis for Vavilov's law of homologous series

    PubMed Central

    Rogozin, Igor B; Thomson, Karen; Csürös, Miklós; Carmel, Liran; Koonin, Eugene V

    2008-01-01

    Background Rare genomic changes (RGCs) that are thought to comprise derived shared characters of individual clades are becoming an increasingly important class of markers in genome-wide phylogenetic studies. Recently, we proposed a new type of RGCs designated RGC_CAMs (after Conserved Amino acids-Multiple substitutions) that were inferred using genome-wide identification of amino acid replacements that were: i) located in unambiguously aligned regions of orthologous genes, ii) shared by two or more taxa in positions that contain a different, conserved amino acid in a much broader range of taxa, and iii) require two or three nucleotide substitutions. When applied to animal phylogeny, the RGC_CAM approach supported the coelomate clade that unites deuterostomes with arthropods as opposed to the ecdysozoan (molting animals) clade. However, a non-negligible level of homoplasy was detected. Results We provide a direct estimate of the level of homoplasy caused by parallel changes and reversals among the RGC_CAMs using 462 alignments of orthologous genes from 19 eukaryotic species. It is shown that the impact of parallel changes and reversals on the results of phylogenetic inference using RGC_CAMs cannot explain the observed support for the Coelomata clade. In contrast, the evidence in support of the Ecdysozoa clade, in large part, can be attributed to parallel changes. It is demonstrated that parallel changes are significantly more common in internal branches of different subtrees that are separated from the respective common ancestor by relatively short times than in terminal branches separated by longer time intervals. A similar but much weaker trend was detected for reversals. The observed evolutionary trend of parallel changes is explained in terms of the covarion model of molecular evolution. As the overlap between the covarion sets in orthologous genes from different lineages decreases with time after divergence, the likelihood of parallel changes decreases as well

  18. Complete amino acid sequence of branched-chain amino acid aminotransferase (transaminase B) of Salmonella typhimurium, identification of the coenzyme-binding site and sequence comparison analysis

    SciTech Connect

    Feild, M.J.

    1988-01-01

    The complete amino acid sequence of the subunit of branched-chain amino acid aminotransferase of Salmonella typhimurium was determined by automated Edman degradation of peptide fragments generated by chemical and enzymatic digestion of S-carboxymethylated and S-pyridylethylated transaminase B. Peptide fragments of transaminase B were generated by treatment of the enzyme with trypsin, Staphylococcus aureus V8 protease, endoproteinase Lys-C, and cyanogen bromide. Protocols were developed for separation of the peptide fragments by reverse-phase high performance liquid chromatography (HPLC), ion-exchange HPLC, and SDS-urea gel electrophoresis. The enzyme subunit contains 308 amino acid residues and has a molecular weight of 33,920 daltons. The coenzyme-binding site was determined by treatment of the enzyme, containing bound pyridoxal 5-phosphate, with tritiated sodium borohydride prior to trypsin digestion. Monitoring radioactivity incorporation and peptide map comparisons with an apoenzyme tryptic digest, allowed identification of the pyridoxylated-peptide which was isolated by reverse-phase HPLC and sequenced. The coenzyme-binding site is a lysyl residue at position 159. Some peptides were further characterized by fast atom bombardment mass spectrometry.

  19. The semaphorontic view of homology

    PubMed Central

    Assis, Leandro C.S.; Rieppel, Olivier

    2015-01-01

    ABSTRACT The relation of homology is generally characterized as an identity relation, or alternatively as a correspondence relation, both of which are transitive. We use the example of the ontogenetic development and evolutionary origin of the gnathostome jaw to discuss identity and transitivity of the homology relation under the transformationist and emergentist paradigms respectively. Token identity and consequent transitivity of homology relations are shown to be requirements that are too strong to allow the origin of genuine evolutionary novelties. We consequently introduce the concept of compositional identity that is grounded in relations prevailing between parts (organs and organ systems) of a whole (organism). We recognize an ontogenetic identity of parts within a whole throughout the sequence of successive developmental stages of those parts: this is an intra‐organismal character identity maintained throughout developmental trajectory. Correspondingly, we recognize a phylogenetic identity of homologous parts within two or more organisms of different species: this is an inter‐species character identity maintained throughout evolutionary trajectory. These different dimensions of character identity—ontogenetic (through development) and phylogenetic (via shared evolutionary history)—break the transitivity of homology relations. Under the transformationist paradigm, the relation of homology reigns over the entire character (‐state) transformation series, and thus encompasses the plesiomorphic as well as the apomorphic condition of form. In contrast, genuine evolutionary novelties originate not through transformation of ancestral characters (‐states), but instead through deviating developmental trajectories that result in alternate characters. Under the emergentist paradigm, homology is thus synonymous with synapomorphy. J. Exp. Zool. (Mol. Dev. Evol.) 324B: 578–587, 2015. © 2015 The Authors. Journal of Experimental Zoology Part B: Molecular and

  20. The characterization of Mycoplasma synoviae EF-Tu protein and proteins involved in hemadherence and their N-terminal amino acid sequences.

    PubMed

    Bencina, D; Narat, M; Dovc, P; Drobnic-Valic, M; Habe, F; Kleven, S H

    1999-04-01

    An abundant cytoplasmic 43-kDa protein from Mycoplasma synoviae, a major pathogen from poultry, was identified as elongation factor Tu. The N-terminal amino acid sequence (AKLDFDRSKEHVNVGTIGHV) has 90% identity with the sequence of the Mycoplasma hominis elongation factor Tu protein. Monoclonal antibodies reacting with the M. synoviae elongation factor Tu protein also reacted with 43-kDa proteins from the avian Mycoplasma species Mycoplasma gallinarum, Mycoplasma gallinaceum, Mycoplasma pullorum, Mycoplasma cloacale, Mycoplasma iners and Mycoplasma meleagridis, but not with the proteins from Mycoplasma gallisepticum, Mycoplasma imitans or Mycoplasma iowae. In addition, two groups of phase variable integral membrane proteins, pMSA and pMSB, associated with hemadherence and pathogenicity of M. synoviae strains AAY-4 and ULB925 were identified. The cleavage of a larger hemagglutinating protein encoded by a gene homologous to the vlhA gene of M. synoviae generates pMSB1 and pMSA1 proteins defined by mAb 125 and by hemagglutination inhibiting mAb 3E10, respectively. The N-terminal amino acid sequences of pMSA proteins (SENKLI ... and SENETQ ...) probably indicate the cleavage site of the M. synoviae strain ULB 925 hemagglutinin.

  1. A Possible Mechanism of Zika Virus Associated Microcephaly: Imperative Role of Retinoic Acid Response Element (RARE) Consensus Sequence Repeats in the Viral Genome

    PubMed Central

    Kumar, Ashutosh; Singh, Himanshu N.; Pareek, Vikas; Raza, Khursheed; Dantham, Subrahamanyam; Kumar, Pavan; Mochan, Sankat; Faiq, Muneeb A.

    2016-01-01

    Owing to the reports of microcephaly as a consistent outcome in the fetuses of pregnant women infected with ZIKV in Brazil, Zika virus (ZIKV)—microcephaly etiomechanistic relationship has recently been implicated. Researchers, however, are still struggling to establish an embryological basis for this interesting causal handcuff. The present study reveals robust evidence in favor of a plausible ZIKV-microcephaly cause-effect liaison. The rationale is based on: (1) sequence homology between ZIKV genome and the response element of an early neural tube developmental marker “retinoic acid” in human DNA and (2) comprehensive similarities between the details of brain defects in ZIKV-microcephaly and retinoic acid embryopathy. Retinoic acid is considered as the earliest factor for regulating anteroposterior axis of neural tube and positioning of structures in developing brain through retinoic acid response elements (RARE) consensus sequence (5′–AGGTCA–3′) in promoter regions of retinoic acid-dependent genes. We screened genomic sequences of already reported virulent ZIKV strains (including those linked to microcephaly) and other viruses available in National Institute of Health genetic sequence database (GenBank) for the RARE consensus repeats and obtained results strongly bolstering our hypothesis that ZIKV strains associated with microcephaly may act through precipitation of dysregulation in retinoic acid-dependent genes by introducing extra stretches of RARE consensus sequence repeats in the genome of developing brain cells. Additional support to our hypothesis comes from our findings that screening of other viruses for RARE consensus sequence repeats is positive only for those known to display neurotropism and cause fetal brain defects (for which maternal-fetal transmission during developing stage may be required). The numbers of RARE sequence repeats appeared to match with the virulence of screened positive viruses. Although, bioinformatic evidence and

  2. The amino acid sequence of cytochromes c-551 from three species of Pseudomonas

    PubMed Central

    Ambler, R. P.; Wynn, Margaret

    1973-01-01

    The amino acid sequences of the cytochromes c-551 from three species of Pseudomonas have been determined. Each resembles the protein from Pseudomonas strain P6009 (now known to be Pseudomonas aeruginosa, not Pseudomonas fluorescens) in containing 82 amino acids in a single peptide chain, with a haem group covalently attached to cysteine residues 12 and 15. In all four sequences 43 residues are identical. Although by bacteriological criteria the organisms are closely related, the differences between pairs of sequences range from 22% to 39%. These values should be compared with the differences in the sequence of mitochondrial cytochrome c between mammals and amphibians (about 18%) or between mammals and insects (about 33%). Detailed evidence for the amino acid sequences of the proteins has been deposited as Supplementary Publication SUP 50015 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1973), 131, 5. PMID:4352718

  3. Draft Genome Sequence of Sorghum Grain Mold Fungus Epicoccum sorghinum, a Producer of Tenuazonic Acid

    PubMed Central

    Oliveira, Rodrigo C.; Davenport, Karen W.; Hovde, Blake; Silva, Danielle; Chain, Patrick S. G.; Correa, Benedito

    2017-01-01

    ABSTRACT The facultative plant pathogen Epicoccum sorghinum is associated with grain mold of sorghum and produces the mycotoxin tenuazonic acid. This fungus can have serious economic impact on sorghum production. Here, we report the draft genome sequence of E. sorghinum (USPMTOX48). PMID:28126937

  4. Draft Genome Sequence of Bacillus coagulans NL01, a Wonderful l-Lactic Acid Producer

    PubMed Central

    Zheng, Zhaojuan; Jiang, Ting; Lin, Xi; Zhou, Jie

    2015-01-01

    Here, we report the draft genome sequence of Bacillus coagulans NL01, which could produce high optically pure l-lactic acid using xylose as a sole carbon source. The draft genome is 3,505,081 bp, with 144 contigs. About 3,903 protein-coding genes and 92 rRNAs are predicted from this assembly. PMID:26089419

  5. Sequences homologous to the human x- and y-borne zinc finger protein genes (ZFX/Y) are autosomal in monotreme mannals

    SciTech Connect

    Watson, J.M.; Frost, C.; Graves, M.J.A. ); Spencer, J.A. )

    1993-02-01

    The human zinc finger protein genes (ZFX/Y) were identified as a result of a systematic search for the testis-determining factor gene on the human Y chromosome. Although they play no direct role in sex determination, they are of particular interest because they are highly conserved among mammals, birds, and amphibians and because, in eutherian mammals at least, they have active alleles on both the X and the Y chromosomes outside the pseudoautosomal region. We used in situ hybridization to localize the homologues of the zinc finger protein gene to chromosome 1 of the Australian echidna and to an equivalent position on chromosomes 1 and 2 of the playtpus. The localization to platypus chromosome 1 was confirmed by Southern analysis of a Chinese hamster [times] platypus cell hybrid retaining most of platypus chromosome 1. This localization is consistent with the cytological homology of chromosome 1 between the two species. The zinc finger protein gene homologues were localized to regions of platypus chromosomes 1 and 2 that included a number of other genes situated near ZFX on the short arm of the human X chromosome. These results support the hypothesis that many of the genes located on the short arm of the human X were originally autosomal and have been translocated to the X chromosome since the eutherian-metatherian divergence. 34 refs., 3 figs., 2 tabs.

  6. Sequence homology requirements for transcriptional silencing of 35S transgenes and post-transcriptional silencing of nitrite reductase (trans)genes by the tobacco 271 locus.

    PubMed

    Thierry, D; Vaucheret, H

    1996-12-01

    The transgene locus of the tobacco plant 271 (271 locus) is located on a telomere and consists of multiple copies of a plasmid carrying an NptII marker gene driven by the cauliflower mosaic virus (CaMV) 19S promoter and the leaf-specific nitrite reductase Nii1 cDNA cloned in the antisense orientation under the control of the CaMV 35S promoter. Previous analysis of gene expression in leaves has shown that this locus triggers both post-transcriptional silencing of the host leaf-specific Nii genes and transcriptional silencing of transgenes driven by the 19S or 35S promoter irrespective of their coding sequence and of their location in the genome. In this paper we show that silencing of transgenes carrying Nii1 sequences occurs irrespective of the promoter driving their expression and of their location within the genome. This phenomenon occurs in roots as well as in leaves although root Nii genes share only 84% identity with leaf-specific Nii1 sequences carried by the 271 locus. Conversely, transgenes carrying the bean Nii gene (which shares 76% identity with the tobacco Nii1 gene) escape silencing by the 271 locus. We also show that transgenes driven by the figwort mosaic virus 34S promoter (which shares 63% identity with the 35S promoter) also escape silencing by the 271 locus. Taken together, these results indicate that a high degree of sequence similarity is required between the sequences of the silencing locus and of the target (trans)genes for both transcriptional and post-transcriptional silencing.

  7. Amino acid sequences of heterotrophic and photosynthetic ferredoxins from the tomato plant (Lycopersicon esculentum Mill.).

    PubMed

    Kamide, K; Sakai, H; Aoki, K; Sanada, Y; Wada, K; Green, L S; Yee, B C; Buchanan, B B

    1995-11-01

    Several forms (isoproteins) of ferredoxin in roots, leaves, and green and red pericarps in tomato plants (Lycopersicon esculentum Mill.) were earlier identified on the basis of N-terminal amino acid sequence and chromatographic behavior (Green et al. 1991). In the present study, a large scale preparation made possible determination of the full length amino acid sequence of the two ferredoxins from leaves. The ferredoxins characteristic of fruit and root were sequenced from the amino terminus to the 30th residue or beyond. The leaf ferredoxins were confirmed to be expressed in pericarp of both green and red fruit. The ferredoxins characteristic of fruit and root appeared to be restricted to those tissue. The results extend earlier findings in demonstrating that ferredoxin occurs in the major organs of the tomato plant where it appears to function irrespective of photosynthetic competence.

  8. Amino acid sequence of myoglobin from white-tailed deer (Odocoileus virginianus).

    PubMed

    Joseph, Poulson; Suman, Surendranath P; Li, Shuting; Fontaine, Michele; Steinke, Laurey

    2012-10-01

    Our objective was to determine the primary structure of white-tailed deer myoglobin (Mb). White-tailed deer Mb was isolated from cardiac muscles employing ammonium sulfate precipitation and gel-filtration chromatography. The amino acid sequence was determined by Edman degradation. Sequence analyses of intact Mb as well as tryptic- and cyanogen bromide-peptides yielded the complete primary structure of white-tailed deer Mb, which shared 100% similarity with red deer Mb. White-tailed deer Mb consists of 153 amino acid residues and shares more than 96% sequence similarity with myoglobins from meat-producing ruminants, such as cattle, buffalo, sheep, and goat. Similar to sheep and goat myoglobins, white-tailed deer Mb contains 12 histidine residues. Proximal (position 93) and distal (position 64) histidine residues responsible for maintaining the stability of heme are conserved in white-tailed deer Mb.

  9. Nucleotide sequence and the encoded amino acids of human apolipoprotein A-I mRNA.

    PubMed Central

    Law, S W; Brewer, H B

    1984-01-01

    The cDNA clones encoding the precursor form of human liver apolipoprotein A-I (apoA-I), preproapoA-I, have been isolated from a cDNA library. A 17-base synthetic oligonucleotide based on residues 108-113 of apoA-I and a 26-base primer-extended, dideoxynucleotide-terminated cDNA were used as hybridization probes to select for recombinant plasmids bearing the apoA-I sequence. The complete nucleic acid sequence of human liver preproapoA-I has been determined by analysis of the cloned cDNA. The sequence is composed of 801 nucleotides encoding 267 amino acid residues. PreproapoA-I contains an 18-amino-acid prepeptide and a 6-amino-acid propeptide connected to the amino terminus of the 243-amino acid mature apoA-I. Southern blotting analysis of chromosomal DNA obtained from peripheral blood indicated the apoA-I gene is contained in a 2.1-kilobase-pair Pst I fragment and there is no gross difference in structural organization between the normal apoA-I gene and the Tangier disease apoA-I gene. Images PMID:6198645

  10. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids.

    PubMed

    Das, Jayanta Kumar; Das, Provas; Ray, Korak Kumar; Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as 'FPKATD' and 'Y/FTNEKL' without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids' pattern in different proteins.

  11. Single-tube nested competitive PCR with homologous competitor for quantitation of DNA target sequences: theoretical description of heteroduplex formation, evaluation of sensitivity, precision and linear range of the method.

    PubMed

    Serth, J; Panitz, F; Herrmann, H; Alves, J

    1998-10-01

    Competitive PCR is a frequently used technique for quantitation of DNA and mRNA. However, the application of the most favourable homologous mutated competitors is impeded by the formation of heteroduplex molecules which complicates the data evaluation and may lead to quantitation errors. Moreover, in most cases a single quantitation of an unknown sample requires multiple competitive reactions for identification of the equivalence point. In the present study, a highly efficient and reliable method as well as the underlying theoretical model is described. The mathematical solutions of this model provide the basis for single-tube quantitation using a homologous mutated competitor. For quantitation of Human Papilloma Virus 16-DNA, it is shown that single tube quantitations using simple PAGE separation and video evaluation for signal analysis permit linear detection within more than two orders of magnitude. In addition, repeated single-tube competitive PCRs exhibited good precision (average standard deviation 5%), even if carried out as nested high cycle PCR for quantitation of low abundant sequences (intraassay sensitivity <2 x 10(2) copies). This evaluation method can be applied to any DNA separation and detection method which is capable of resolving the heteroduplex fraction from both homoduplex molecules.

  12. Software scripts for quality checking of high-throughput nucleic acid sequencers.

    PubMed

    Lazo, G R; Tong, J; Miller, R; Hsia, C; Rausch, C; Kang, Y; Anderson, O D

    2001-06-01

    We have developed a graphical interface to allow the researcher to view and assess the quality of sequencing results using a series of program scripts developed to process data generated by automated sequencers. The scripts are written in Perl programming language and are executable under the cgibin directory of a Web server environment. The scripts direct nucleic acid sequencing trace file data output from automated sequencers to be analyzed by the phred molecular biology program and are displayed as graphical hypertext mark-up language (HTML) pages. The scripts are mainly designed to handle 96-well microtiter dish samples, but the scripts are also able to read data from 384-well microtiter dishes 96 samples at a time. The scripts may be customized for different laboratory environments and computer configurations. Web links to the sources and discussion page are provided.

  13. Amino acid sequence of band-3 protein from rainbow trout erythrocytes derived from cDNA.

    PubMed Central

    Hübner, S; Michel, F; Rudloff, V; Appelhans, H

    1992-01-01

    In this report we present the first complete band-3 cDNA sequence of a poikilothermic lower vertebrate. The primary structure of the anion-exchange protein band 3 (AE1) from rainbow trout erythrocytes was determined by nucleotide sequencing of cDNA clones. The overlapping clones have a total length of 3827 bp with a 5'-terminal untranslated region of 150 bp, a 2754 bp open reading frame and a 3'-untranslated region of 924 bp. Band-3 protein from trout erythrocytes consists of 918 amino acid residues with a calculated molecular mass of 101 827 Da. Comparison of its amino acid sequence revealed a 60-65% identity within the transmembrane spanning sequence of band-3 proteins published so far. An additional insertion of 24 amino acid residues within the membrane-associated domain of trout band-3 protein was identified, which until now was thought to be a general feature only of mammalian band-3-related proteins. PMID:1637296

  14. Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology (Seventh Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting 2012)

    ScienceCinema

    Patel, Kamlesh D [Ken; SNL,

    2016-07-12

    Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology " at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.

  15. Role of the two-component leader sequence and mature amino acid sequences in extracellular export of endoglucanase EGL from Pseudomonas solanacearum.

    PubMed Central

    Huang, J Z; Schell, M A

    1992-01-01

    The egl gene of Pseudomonas solanacearum encodes a 43-kDa extracellular endoglucanase (mEGL) involved in wilt disease caused by this phytopathogen. Egl is initially translated with a 45-residue, two-part leader sequence. The first 19 residues are apparently removed by signal peptidase II during export of Egl across the inner membrane (IM); the remaining residues of the leader sequence (modified with palmitate) are removed during export across the outer membrane (OM). Localization of Egl-PhoA fusion proteins showed that the first 26 residues of the Egl leader sequence are required and sufficient to direct lipid modification, processing, and export of Egl or PhoA across the IM but not the OM. Fusions of the complete 45-residue leader sequence or of the leader and increasing portions of mEgl sequences to PhoA did not cause its export across the OM. In-frame deletion of portions of mEGL-coding sequences blocked export of the truncated polypeptides across the OM without affecting export across the IM. These results indicate that the first part of the leader sequence functions independently to direct export of Egl across the IM while the second part and sequences and structures in mEGL are involved in export across the OM. Computer analysis of the mEgl amino acid sequence obtained from its nucleotide sequence identified a region of mEGL similar in amino acid sequence to regions in other prokaryotic endoglucanases. Images PMID:1735723

  16. Mathematical Characterization of Protein Sequences Using Patterns as Chemical Group Combinations of Amino Acids

    PubMed Central

    Choudhury, Pabitra Pal; Jana, Siddhartha Sankar

    2016-01-01

    Comparison of amino acid sequence similarity is the fundamental concept behind the protein phylogenetic tree formation. By virtue of this method, we can explain the evolutionary relationships, but further explanations are not possible unless sequences are studied through the chemical nature of individual amino acids. Here we develop a new methodology to characterize the protein sequences on the basis of the chemical nature of the amino acids. We design various algorithms for studying the variation of chemical group transitions and various chemical group combinations as patterns in the protein sequences. The amino acid sequence of conventional myosin II head domain of 14 family members are taken to illustrate this new approach. We find two blocks of maximum length 6 aa as ‘FPKATD’ and ‘Y/FTNEKL’ without repeating the same chemical nature and one block of maximum length 20 aa with the repetition of chemical nature which are common among all 14 members. We also check commonality with another motor protein sub-family kinesin, KIF1A. Based on our analysis we find a common block of length 8 aa both in myosin II and KIF1A. This motif is located in the neck linker region which could be responsible for the generation of mechanical force, enabling us to find the unique blocks which remain chemically conserved across the family. We also validate our methodology with different protein families such as MYOI, Myosin light chain kinase (MLCK) and Rho-associated protein kinase (ROCK), Na+/K+-ATPase and Ca2+-ATPase. Altogether, our studies provide a new methodology for investigating the conserved amino acids’ pattern in different proteins. PMID:27930687

  17. Multiple site-selective insertions of non-canonical amino acids into sequence-repetitive polypeptides

    PubMed Central

    Wu, I-Lin; Patterson, Melissa A.; Carpenter Desai, Holly E.; Mehl, Ryan A.; Giorgi, Gianluca

    2013-01-01

    A simple and efficient method is described for introduction of non-canonical amino acids at multiple, structurally defined sites within recombinant polypeptide sequences. E. coli MRA30, a bacterial host strain with attenuated activity for release factor 1 (RF1), is assessed for its ability to support the incorporation of a diverse range of non-canonical amino acids in response to multiple encoded amber (TAG) codons within genetic templates derived from superfolder GFP and an elastin-mimetic protein polymer. Suppression efficiency and isolated protein yield were observed to depend on the identity of the orthogonal aminoacyl-tRNA synthetase/tRNACUA pair and the non-canonical amino acid substrate. This approach afforded elastin-mimetic protein polymers containing non-canonical amino acid derivatives at up to twenty-two positions within the repeat sequence with high levels of substitution. The identity and position of the variant residues was confirmed by mass spectrometric analysis of the full-length polypeptides and proteolytic cleavage fragments resulting from thermolysin digestion. The accumulated data suggest that this multi-site suppression approach permits the preparation of protein-based materials in which novel chemical functionality can be introduced at precisely defined positions within the polypeptide sequence. PMID:23625817

  18. Deduced amino acid sequence of human pulmonary surfactant proteolipid: SPL(pVal)

    SciTech Connect

    Whitsett, J.A.; Glasser, S.W.; Korfhagen, T.R.; Weaver, T.E.; Clark, J.; Pilot-Matias, T.; Meuth, J.; Fox, J.L.

    1987-05-01

    Hydrophobic, proteolipid-like protein of Mr 6500 was isolated from ether/ethanol extracts of human, canine and bovine pulmonary surfactant. Amino acid composition of the protein demonstrated a remarkable abundance of hydrophobic residues, particularly valine and leucine. The N-terminal amino acid sequence of the human protein was determined: N-Leu-Ile-Pro-Cys-Cys-Pro-Val-Asn-Leu-Lys-Arg-Leu-Leu-Ile-Val4... An oligonucleotide probe was used to screen an adult human lung cDNA library and resulted in detection of cDNA clones with predicted amino acid sequence with close identity to the N-terminal amino acid sequence of the human peptide. SPL(pVal) was found within the reading frame of a larger peptide. SPL(pVal) results from proteolytic processing of a larger preprotein. Northern blot analysis detected in a single 1.0 kilobase SPL(pVal) RNA which was less abundant in fetal than in adult lung. Mixtures of purified canine and bovine SPL(pVal) and synthetic phospholipids display properties of rapid adsorption and surface tension lowering activity characteristic of surfactant. Human SPL(pVal) is a pulmonary surfactant proteolipid which may therefore be useful in combination with phospholipids and/or other surfactant proteins for the treatment of surfactant deficiency such as hyaline membrane disease in newborn infants.

  19. SUBGROUPS OF AMINO ACID SEQUENCES IN THE VARIABLE REGIONS OF IMMUNOGLOBULIN HEAVY CHAINS*

    PubMed Central

    Cunningham, Bruce A.; Pflumm, Mollie N.; User, Urs Rutisha; Edelman, Gerald M.

    1969-01-01

    The amino acid sequence of the first 133 residues of the heavy (γ) chain from a human γG immunoglobulin (He) has been determined. This γ-chain is identical in Gm type to that of protein Eu, the complete sequence of which has been reported. Comparison of the two sequences substantiates the previous suggestion that there are subgroups of variable regions of heavy chains. The variable region of Eu has been assigned to subgroup I and that of He to subgroup II; on the other hand, the constant regions of the two proteins appear to be identical. Comparison of the sequence of the heavy chain of He with the heavy chain sequences determined in other laboratories suggests that the variable region of subgroup II is at least 118 residues long. The nature and distribution of amino acid variations in this heavy chain subgroup resemble those observed in light chain subgroups. These studies provide evidence that the translocation hypothesis applies to heavy as well as to light chains, viz., genes for variable regions (V) are somatically translocated to genes for constant regions (C) to form complete VC structural genes. Images PMID:5264153

  20. Complete nucleic acid sequence of Penaeus stylirostris densovirus (PstDNV) from India.

    PubMed

    Rai, Praveen; Safeena, Muhammed P; Karunasagar, Iddya; Karunasagar, Indrani

    2011-06-01

    Infectious hypodermal and hematopoietic necrosis virus (IHHNV) of shrimp, recently been classified as Penaeus stylirostris densovirus (PstDNV). The complete nucleic acid sequence of PstDNV from India was obtained by cloning and sequencing of different DNA fragment of the virus. The genome organisation of PstDNV revealed that there were three major coding domains: a left ORF (NS1) of 2001 bp, a mid ORF (NS2) of 1092 bp and a right ORF (VP) of 990 bp. The complete genome and amino acid sequences of three proteins viz., NS1, NS2 and VP were compared with the genomes of the virus reported from Hawaii, China and Mexico and with partial sequence available from isolates from different regions. The phylogenetic analysis of shrimp, insect and vertebrate parvovirus sequences showed that the Indian PstDNV isolate is phylogenetically more closely related to one of the three isolates from Taiwan (AY355307), and two isolates (AY362547 and AY102034) from Thailand.

  1. Elucidation of the sequence of canine (pro)-calcitonin. A molecular biological and protein chemical approach.

    PubMed

    Mol, J A; Kwant, M M; Arnold, I C; Hazewinkel, H A

    1991-09-03

    From the canine thyroid gland a calcitonin (CT) immunoreactive peptide was purified by successive aqueous acid acetone extraction, gel filtration and HPLC. Gas-phase sequencing of the purified peptide showed that the first 25 amino acids had 65% sequence homology with the amino-terminus of the human CT prohormone. A canine cDNA library was then made from the thyroid gland. A plasmid was isolated containing a sequence that is homologous to part of exon 3, and the complete sequence of exon 4 of the human mRNA encoding preproCT. From this cDNA the amino acid sequence of canine CT is predicted. In comparison with well-known CT sequences of other species, the strongest homology exists with bovine, porcine and ovine CT.

  2. DNA Cloning of Plasmodium falciparum Circumsporozoite Gene: Amino Acid Sequence of Repetitive Epitope

    NASA Astrophysics Data System (ADS)

    Enea, Vincenzo; Ellis, Joan; Zavala, Fidel; Arnot, David E.; Asavanich, Achara; Masuda, Aoi; Quakyi, Isabella; Nussenzweig, Ruth S.

    1984-08-01

    A clone of complementary DNA encoding the circumsporozoite (CS) protein of the human malaria parasite Plasmodium falciparum has been isolated by screening an Escherichia coli complementary DNA library with a monoclonal antibody to the CS protein. The DNA sequence of the complementary DNA insert encodes a four-amino acid sequence: proline-asparagine-alanine-asparagine, tandemly repeated 23 times. The CS β -lactamase fusion protein specifically binds monoclonal antibodies to the CS protein and inhibits the binding of these antibodies to native Plasmodium falciparum CS protein. These findings provide a basis for the development of a vaccine against Plasmodium falciparum malaria.

  3. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

    PubMed Central

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-01-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed). PMID:22638583

  4. Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion.

    PubMed

    Thomsen, Martin Christen Frølund; Nielsen, Morten

    2012-07-01

    Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed).

  5. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.

  6. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

  7. Sequence-specific thermodynamic properties of nucleic acids influence both transcriptional pausing and backtracking in yeast

    PubMed Central

    2017-01-01

    RNA Polymerase II pauses and backtracks during transcription, with many consequences for gene expression and cellular physiology. Here, we show that the energy required to melt double-stranded nucleic acids in the transcription bubble predicts pausing in Saccharomyces cerevisiae far more accurately than nucleosome roadblocks do. In addition, the same energy difference also determines when the RNA polymerase backtracks instead of continuing to move forward. This data-driven model corroborates—in a genome wide and quantitative manner—previous evidence that sequence-dependent thermodynamic features of nucleic acids influence both transcriptional pausing and backtracking. PMID:28301878

  8. Respiratory syncytial virus fusion glycoprotein: nucleotide sequence of mRNA, identification of cleavage activation site and amino acid sequence of N-terminus of F1 subunit.

    PubMed Central

    Elango, N; Satake, M; Coligan, J E; Norrby, E; Camargo, E; Venkatesan, S

    1985-01-01

    The amino acid sequence of respiratory syncytial virus fusion protein (Fo) was deduced from the sequence of a partial cDNA clone of mRNA and from the 5' mRNA sequence obtained by primer extension and dideoxysequencing. The encoded protein of 574 amino acids is extremely hydrophobic and has a molecular weight of 63371 daltons. The site of proteolytic cleavage within this protein was accurately mapped by determining a partial amino acid sequence of the N-terminus of the larger subunit (F1) purified by radioimmunoprecipitation using monoclonal antibodies. Alignment of the N-terminus of the F1 subunit within the deduced amino acid sequence of Fo permitted us to identify a sequence of lys-lys-arg-lys-arg-arg at the C-terminus of the smaller N-terminal F2 subunit that appears to represent the cleavage/activation domain. Five potential sites of glycosylation, four within the F2 subunit, were also identified. Three extremely hydrophobic domains are present in the protein; a) the N-terminal signal sequence, b) the N-terminus of the F1 subunit that is analogous to the N-terminus of the paramyxovirus F1 subunit and the HA2 subunit of influenza virus hemagglutinin, and c) the putative membrane anchorage domain near the C-terminus of F1. Images PMID:2987829

  9. Analysis of protein function and its prediction from amino acid sequence.

    PubMed

    Clark, Wyatt T; Radivojac, Predrag

    2011-07-01

    Understanding protein function is one of the keys to understanding life at the molecular level. It is also important in the context of human disease because many conditions arise as a consequence of alterations of protein function. The recent availability of relatively inexpensive sequencing technology has resulted in thousands of complete or partially sequenced genomes with millions of functionally uncharacterized proteins. Such a large volume of data, combined with the lack of high-throughput experimental assays to functionally annotate proteins, attributes to the growing importance of automated function prediction. Here, we study proteins annotated by Gene Ontology (GO) terms and estimate the accuracy of functional transfer from protein sequence only. We find that the transfer of GO terms by pairwise sequence alignments is only moderately accurate, showing a surprisingly small influence of sequence identity (SID) in a broad range (30-100%). We developed and evaluated a new predictor of protein function, functional annotator (FANN), from amino acid sequence. The predictor exploits a multioutput neural network framework which is well suited to simultaneously modeling dependencies between functional terms. Experiments provide evidence that FANN-GO (predictor of GO terms; available from http://www.informatics.indiana.edu/predrag) outperforms standard methods such as transfer by global or local SID as well as GOtcha, a method that incorporates the structure of GO.

  10. The Complete Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis ssp. lactis IL1403

    PubMed Central

    Bolotin, Alexander; Wincker, Patrick; Mauger, Stéphane; Jaillon, Olivier; Malarme, Karine; Weissenbach, Jean; Ehrlich, S. Dusko; Sorokin, Alexei

    2001-01-01

    Lactococcus lactis is a nonpathogenic AT-rich gram-positive bacterium closely related to the genus Streptococcus and is the most commonly used cheese starter. It is also the best-characterized lactic acid bacterium. We sequenced the genome of the laboratory strain IL1403, using a novel two-step strategy that comprises diagnostic sequencing of the entire genome and a shotgun polishing step. The genome contains 2,365,589 base pairs and encodes 2310 proteins, including 293 protein-coding genes belonging to six prophages and 43 insertion sequence (IS) elements. Nonrandom distribution of IS elements indicates that the chromosome of the sequenced strain may be a product of recent recombination between two closely related genomes. A complete set of late competence genes is present, indicating the ability of L. lactis to undergo DNA transformation. Genomic sequence revealed new possibilities for fermentation pathways and for aerobic respiration. It also indicated a horizontal transfer of genetic information from Lactococcus to gram-negative enteric bacteria of Salmonella-Escherichia group. [The sequence data described in this paper has been submitted to the GenBank data library under accession no. AE005176.] PMID:11337471

  11. Stereochemical Sequence Ion Selectivity: Proline versus Pipecolic-acid-containing Protonated Peptides

    NASA Astrophysics Data System (ADS)

    Abutokaikah, Maha T.; Guan, Shanshan; Bythell, Benjamin J.

    2017-01-01

    Substitution of proline by pipecolic acid, the six-membered ring congener of proline, results in vastly different tandem mass spectra. The well-known proline effect is eliminated and amide bond cleavage C-terminal to pipecolic acid dominates instead. Why do these two ostensibly similar residues produce dramatically differing spectra? Recent evidence indicates that the proton affinities of these residues are similar, so are unlikely to explain the result [Raulfs et al., J. Am. Soc. Mass Spectrom. 25, 1705-1715 (2014)]. An additional hypothesis based on increased flexibility was also advocated. Here, we provide a computational investigation of the "pipecolic acid effect," to test this and other hypotheses to determine if theory can shed additional light on this fascinating result. Our calculations provide evidence for both the increased flexibility of pipecolic-acid-containing peptides, and structural changes in the transition structures necessary to produce the sequence ions. The most striking computational finding is inversion of the stereochemistry of the transition structures leading to "proline effect"-type amide bond fragmentation between the proline/pipecolic acid-congeners: R (proline) to S (pipecolic acid). Additionally, our calculations predict substantial stabilization of the amide bond cleavage barriers for the pipecolic acid congeners by reduction in deleterious steric interactions and provide evidence for the importance of experimental energy regime in rationalizing the spectra.

  12. Self-sequencing of amino acids and origins of polyfunctional protocells

    NASA Technical Reports Server (NTRS)

    Fox, S. W.

    1984-01-01

    The role of proteins in the origin of living things is discussed. It has been experimentally established that amino acids can sequence themselves under simulated geological conditions with highly nonrandom products which accordingly contain diverse information. Multiple copies of each type of macromolecule are formed, resulting in greater power for any protoenzymic molecule than would accrue from a single copy of each type. Thermal proteins are readily incorporated into laboratory protocells. The experimental evidence for original polyfunctional protocells is discussed.

  13. Bovine-like coronaviruses isolated from four species of captive wild ruminants are homologous to bovine coronaviruses, based on complete genomic sequences.

    PubMed

    Alekseev, Konstantin P; Vlasova, Anastasia N; Jung, Kwonil; Hasoksuz, Mustafa; Zhang, Xinsheng; Halpin, Rebecca; Wang, Shiliang; Ghedin, Elodie; Spiro, David; Saif, Linda J

    2008-12-01

    We sequenced and analyzed the full-length genomes of four coronaviruses (CoVs), each from a distinct wild-ruminant species in Ohio: sambar deer (Cervus unicolor), a waterbuck (Kobus ellipsiprymnus), a sable antelope (Hippotragus niger), and a white-tailed deer (Odocoileus virginianus). The fecal samples from the sambar deer, the waterbuck, and the white-tailed deer were collected during winter dysentery outbreaks and sporadic diarrhea cases in 1993 and 1994 (H. Tsunemitsu, Z. R. el-Kanawati, D. R. Smith, H. H. Reed, and L. J. Saif, J. Clin. Microbiol. 33:3264-3269, 1995). A fecal sample from a sable antelope was collected in 2003 from an Ohio wild-animal habitat during the same outbreak when a bovine-like CoV from a giraffe (GiCoV) was isolated (M. Hasoksuz, K. Alekseev, A. Vlasova, X. Zhang, D. Spiro, R. Halpin, S. Wang, E. Ghedin, and L. J. Saif, J. Virol. 81:4981-4990, 2007). For two of the CoVs (sambar deer and waterbuck), complete genomes from both the cell culture-adapted and gnotobiotic-calf-passaged strains were also sequenced and analyzed. Phylogenetically, wild-ruminant CoVs belong to group 2a CoVs, with the closest relatedness to recent bovine CoV (BCoV) strains. High nucleotide identities (99.4 to 99.6%) among the wild-ruminant strains and recent BCoV strains (BCoV-LUN and BCoV-ENT, isolated in 1998) further confirm the close relatedness. Comparative genetic analysis of CoVs of captive wild ruminants with BCoV strains suggests that no specific genomic markers are present that allow discrimination between the bovine strains and bovine-like CoVs from captive wild ruminants; furthermore, no specific genetic markers were identified that defined cell cultured or calf-passaged strains or the host origin of strains. The results of this study confirm prior reports of biologic and antigenic similarities between bovine and wild-ruminant CoVs and suggest that cattle may be reservoirs for CoVs that infect captive wild ruminants or vice versa and that these CoVs may

  14. Amino acid sequence similarity between rabies virus glycoprotein and snake venom curaremimetic neurotoxins.

    PubMed

    Lentz, T L; Wilson, P T; Hawrot, E; Speicher, D W

    1984-11-16

    Evidence was presented earlier that a host-cell receptor for the highly neurotropic rabies virus might be the acetylcholine receptor. The amino acid sequence of the glycoprotein of rabies virus was compared by computer analysis with that of snake venom curaremimetic neurotoxins, potent ligands of the acetylcholine receptor. A statistically significant sequence relation was found between a segment of the rabies glycoprotein and the entire sequence of long neurotoxins. The greatest identity occurs with residues considered most important in neurotoxicity, including those interacting with the acetylcholine binding site of the acetylcholine receptor. Because of the similarity between the glycoprotein and the receptor-binding region of the neurotoxins, this region of the viral glycoprotein may function as a recognition site for the acetylcholine receptor. Direct binding of the rabies virus glycoprotein to the acetylcholine receptor could contribute to the neurotropism of this virus.

  15. Partial amino acid sequence of human pancreatic stone protein, a novel pancreatic secretory protein.

    PubMed Central

    Montalto, G; Bonicel, J; Multigner, L; Rovery, M; Sarles, H; De Caro, A

    1986-01-01

    Pancreatic stone protein (PSP) is the major organic component of human pancreatic stones. With the use of monoclonal antibody immunoadsorbents, five immunoreactive forms (PSP-S) with close Mr values (14,000-19,000) were isolated from normal pancreatic juice. By CM-Trisacryl M chromatography the lowest-Mr form (PSP-S1) was separated from the others and some of its molecular characteristics were investigated. The Mr of the PSP-S1 polypeptide chain calculated from the amino acid composition was about 16,100. The N-terminal sequences (40 residues) of PSP and PSP-S1 are identical, which suggests that the peptide backbone is the same for both of these polypeptides. The PSP-S1 sequence was determined up to residue 65 and was found to be different from all other known protein sequences. Images Fig. 1. PMID:3541906

  16. Low molecular weight (C1-C10) monocarboxylic acids, dissolved organic carbon and major inorganic ions in alpine snow pit sequence from a high mountain site, central Japan

    NASA Astrophysics Data System (ADS)

    Kawamura, Kimitaka; Matsumoto, Kohei; Tachibana, Eri; Aoki, Kazuma

    2012-12-01

    Snowpack samples were collected from a snow pit sequence (6 m in depth) at the Murodo-Daira site near the summit of Mt. Tateyama, central Japan, an outflow region of Asian dusts. The snow samples were analyzed for a homologous series of low molecular weight normal (C1-C10) and branched (iC4-iC6) monocarboxylic acids as well as aromatic (benzoic) and hydroxy (glycolic and lactic) acids, together with major inorganic ions and dissolved organic carbon (DOC). The molecular distributions of organic acids were characterized by a predominance of acetic (range 7.8-76.4 ng g-1-snow, av. 34.8 ng g-1) or formic acid (2.6-48.1 ng g-1, 27.7 ng g-1), followed by propionic acid (0.6-5.2 ng g-1, 2.8 ng g-1). Concentrations of normal organic acids generally decreased with an increase in carbon chain length, although nonanoic acid (C9) showed a maximum in the range of C5-C10. Higher concentrations were found in the snowpack samples containing dust layer. Benzoic acid (0.18-4.1 ng g-1, 1.4 ng g-1) showed positive correlation with nitrate (r = 0.70), sulfate (0.67), Na+ (0.78), Ca2+ (0.86) and Mg+ (0.75), suggesting that this aromatic acid is involved with anthropogenic sources and Asian dusts. Higher concentrations of Ca2+ and SO42- were found in the dusty snow samples. We found a weak positive correlation (r = 0.43) between formic acid and Ca2+, suggesting that gaseous formic acid may react with Asian dusts in the atmosphere during long-range transport. However, acetic acid did not show any positive correlations with major inorganic ions. Hydroxyacids (0.03-5.7 ng g-1, 1.5 ng g-1) were more abundant in the granular and dusty snow. Total monocarboxylic acids (16-130 ng g-1, 74 ng g-1) were found to account for 1-6% of DOC (270-1500 ng g-1, 630 ng g-1) in the snow samples.

  17. Characterization of the microbial acid mine drainage microbial community using culturing and direct sequencing techniques.

    PubMed

    Auld, Ryan R; Myre, Maxine; Mykytczuk, Nadia C S; Leduc, Leo G; Merritt, Thomas J S

    2013-05-01

    We characterized the bacterial community from an AMD tailings pond using both classical culturing and modern direct sequencing techniques and compared the two methods. Acid mine drainage (AMD) is produced by the environmental and microbial oxidation of minerals dissolved from mining waste. Surprisingly, we know little about the microbial communities associated with AMD, despite the fundamental ecological roles of these organisms and large-scale economic impact of these waste sites. AMD microbial communities have classically been characterized by laboratory culturing-based techniques and more recently by direct sequencing of marker gene sequences, primarily the 16S rRNA gene. In our comparison of the techniques, we find that their results are complementary, overall indicating very similar community structure with similar dominant species, but with each method identifying some species that were missed by the other. We were able to culture the majority of species that our direct sequencing results indicated were present, primarily species within the Acidithiobacillus and Acidiphilium genera, although estimates of relative species abundance were only obtained from direct sequencing. Interestingly, our culture-based methods recovered four species that had been overlooked from our sequencing results because of the rarity of the marker gene sequences, likely members of the rare biosphere. Further, direct sequencing indicated that a single genus, completely missed in our culture-based study, Legionella, was a dominant member of the microbial community. Our results suggest that while either method does a reasonable job of identifying the dominant members of the AMD microbial community, together the methods combine to give a more complete picture of the true diversity of this environment.

  18. Structural investigations of the p53/p73 homologs from the tunicate species Ciona intestinalis reveal the sequence requirements for the formation of a tetramerization domain.

    PubMed

    Heering, Jan; Jonker, Hendrik R A; Löhr, Frank; Schwalbe, Harald; Dötsch, Volker

    2016-02-01

    Most members of the p53 family of transcription factors form tetramers. Responsible for determining the oligomeric state is a short oligomerization domain consisting of one β-strand and one α-helix. With the exception of human p53 all other family members investigated so far contain a second α-helix as part of their tetramerization domain. Here we have used nuclear magnetic resonance spectroscopy to characterize the oligomerization domains of the two p53-like proteins from the tunicate Ciona intestinalis, representing the closest living relative of vertebrates. Structure determination reveals for one of the two proteins a new type of packing of this second α-helix on the core domain that was not predicted based on the sequence, while the other protein does not form a second helix despite the presence of crucial residues that are conserved in all other family members that form a second helix. By mutational analysis, we identify a proline as well as large hydrophobic residues in the hinge region between both helices as the crucial determinant for the formation of a second helix.

  19. The amino acid sequence of the aspartate aminotransferase from baker's yeast (Saccharomyces cerevisiae).

    PubMed Central

    Cronin, V B; Maras, B; Barra, D; Doonan, S

    1991-01-01

    1. The single (cytosolic) aspartate aminotransferase was purified in high yield from baker's yeast (Saccharomyces cerevisiae). 2. Amino-acid-sequence analysis was carried out by digestion of the protein with trypsin and with CNBr; some of the peptides produced were further subdigested with Staphylococcus aureus V8 proteinase or with pepsin. Peptides were sequenced by the dansyl-Edman method and/or by automated gas-phase methods. The amino acid sequence obtained was complete except for a probable gap of two residues as indicated by comparison with the structures of counterpart proteins in other species. 3. The N-terminus of the enzyme is blocked. Fast-atom-bombardment m.s. was used to identify the blocking group as an acetyl one. 4. Alignment of the sequence of the enzyme with those of vertebrate cytosolic and mitochondrial aspartate aminotransferases and with the enzyme from Escherichia coli showed that about 25% of residues are conserved between these distantly related forms. 5. Experimental details and confirmatory data for the results presented here are given in a Supplementary Publication (SUP 50164, 25 pages) that has been deposited at the British Library Document Supply Centre, Boston Spa. Wetherby, West Yorkshire LS23 7 BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1991) 273, 5. PMID:1859361

  20. Analysis of amino acid sequence variations and immunoglobulin E-binding epitopes of German cockroach tropomyosin.

    PubMed

    Jeong, Kyoung Yong; Lee, Jongweon; Lee, In-Yong; Ree, Han-Il; Hong, Chein-Soo; Yong, Tai-Soon

    2004-09-01

    The allergenicities of tropomyosins from different organisms have been reported to vary. The cDNA encoding German cockroach tropomyosin (Bla g 7) was isolated, expressed, and characterized previously. In the present study, the amino acid sequence variations in German cockroach tropomyosin were analyzed in order to investigate its influence on allergenicity. We also undertook the identification of immunodominant peptides containing immunoglobulin E (IgE) epitopes which may facilitate the development of diagnostic and immunotherapeutic strategies based on the recombinant proteins. Two-dimensional gel electrophoresis and immunoblot analysis with mouse anti-recombinant German cockroach tropomyosin serum was performed to investigate the isoforms at the protein level. Reverse transcriptase PCR (RT-PCR) was applied to examine the sequence diversity. Eleven different variants of the deduced amino acid sequences were identified by RT-PCR. German cockroach tropomyosin has only minor sequence variations that did not seem to affect its allergenicity significantly. These results support the molecular basis underlying the cross-reactivities of arthropod tropomyosins. Recombinant fragments were also generated by PCR, and IgE-binding epitopes were assessed by enzyme-linked immunosorbent assay. Sera from seven patients revealed heterogeneous IgE-binding responses. This study demonstrates multiple IgE-binding epitope regions in a single molecule, suggesting that full-length tropomyosin should be used for the development of diagnostic and therapeutic reagents.

  1. Nitrogenase and Homologs

    PubMed Central

    2014-01-01

    Nitrogenase catalyzes biological nitrogen fixation, a key step in the global nitrogen cycle. Three homologous nitrogenases have been identified to date, along with several structural and/or functional homologs of this enzyme that are involved in nitrogenase assembly, bacteriochlorophyll biosynthesis and methanogenic process, respectively. In this article, we provide an overview of the structures and functions of nitrogenase and its homologs, which highlights the similarity and disparity of this uniquely versatile group of enzymes. PMID:25491285

  2. Complete amino acid sequence of a histidine-rich proteolytic fragment of human ceruloplasmin.

    PubMed

    Kingston, I B; Kingston, B L; Putnam, F W

    1979-04-01

    The complete amino acid sequence has been determined for a fragment of human ceruloplasmin [ferroxidase; iron(II):oxygen oxidoreductase, EC 1.16.3.1]. The fragment (designated Cp F5) contains 159 amino acid residues and has a molecular weight of 18,650; it lacks carbohydrate, is rich in histidine, and contains one free cysteine that may be part of a copper-binding site. This fragment is present in most commercial preparations of ceruloplasmin, probably owing to proteolytic degradation, but can also be obtained by limited cleavage of single-chain ceruloplasmin with plasmin. Cp F5 probably is an intact domain attached to the COOH-terminal end of single-chain ceruloplasmin via a labile interdomain peptide bond. A model of the secondary structure predicted by empirical methods suggests that almost one-third of the amino acid residues are distributed in alpha helices, about a third in beta-sheet structure, and the remainder in beta turns and unidentified structures. Computer analysis of the amino acid sequence has not demonstrated a statistically significant relationship between this ceruloplasmin fragment and any other protein, but there is some evidence for an internal duplication.

  3. Dualities in Persistent (Co)Homology

    SciTech Connect

    de Silva, Vin; Morozov, Dmitriy; Vejdemo-Johansson, Mikael

    2011-09-16

    We consider sequences of absolute and relative homology and cohomology groups that arise naturally for a filtered cell complex. We establishalgebraic relationships between their persistence modules, and show that they contain equivalent information. We explain how one can use the existingalgorithm for persistent homology to process any of the four modules, and relate it to a recently introduced persistent cohomology algorithm. Wepresent experimental evidence for the practical efficiency of the latter algorithm.

  4. Processing and amino acid sequence analysis of the mouse mammary tumor virus env gene product.

    PubMed Central

    Arthur, L O; Copeland, T D; Oroszlan, S; Schochetman, G

    1982-01-01

    The envelope proteins of mouse mammary tumor virus (MMTV) are synthesized from a subgenomic 24S mRNA as a 75,000-dalton glycosylated precursor polyprotein which is eventually processed to the mature glycoproteins gp52 and gp36. In vivo synthesis of this env precursor in the presence of the core glycosylation inhibitor tunicamycin yielded a precursor of approximately 61,000 daltons (P61env). However, a 67,000-dalton protein (P67env) was obtained from cell-free translation with the MMTV 24S mRNA as the template. To determine whether the portion of the protein cleaved from P67env to give P61env was removed from the NH2-terminal end of P67env and as such would represent a leader sequence, the NH2-terminal amino acid sequence of the terminal peptide gp52 was determined. Glutamic acid, and not methionine, was found to be the amino-terminal residue of gp52, indicating that the cleaved portion was derived from the NH2-terminal end of P67env. The NH2-terminal amino acid sequences of gp52's from endogenous and exogenous C3H MMTVs were determined though 46 residues and found to be identical. However, amino acid composition and type-specific gp52 radioimmunoassays from MMTVs grown in heterologous cells indicated primary structure differences between gp52's of the two viruses. The nucleic acid sequence of cloned MMTV DNA fragments (J. Majors and H. E. Varmus, personal communication) in conjunction with the NH2-terminal sequence of gp52 allowed localization of the env gene in the MMTV genome. Nucleotides coding for the NH2 terminus of gp52 begin approximately 0.8 kilobase to the 3' side of the single EcoRI cleavage site. Localization of the env gene at that point agrees with the proposed gene order -gag-pol-env- and also allows sufficient coding potential for the glycoprotein precursor without extending into the long terminal repeat. Images PMID:6281457

  5. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1

    PubMed Central

    Rhee, Mun Su; Moritz, Brélan E.; Xie, Gary; Glavina del Rio, T.; Dalin, E.; Tice, H.; Bruce, D.; Goodwin, L.; Chertkov, O.; Brettin, T.; Han, C.; Detter, C.; Pitluck, S.; Land, Miriam L.; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O.; Shanmugam, K. T.

    2011-01-01

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed. PMID:22675583

  6. BeadCons: detection of nucleic acid sequences by flow cytometry.

    PubMed

    Horejsh, Douglas; Martini, Federico; Capobianchi, Maria Rosaria

    2005-11-01

    Molecular beacons are single-stranded nucleic acid structures with a terminal fluorophore and a distal, terminal quencher. These molecules are typically used in real-time PCR assays, but have also been conjugated with solid matrices. This unit describes protocols related to molecular beacon-conjugated beads (BeadCons), whose specific hybridization with complementary target sequences can be resolved by cytometry. Assay sensitivity is achieved through the concentration of fluorescence signal on discrete particles. By using molecular beacons with different fluorophores and microspheres of different sizes, it is possible to construct a fluid array system with each bead corresponding to a specific target nucleic acid. Methods are presented for the design, construction, and use of BeadCons for the specific, multiplexed detection of unlabeled nucleic acids in solution. The use of bead-based detection methods will likely lead to the design of new multiplex molecular diagnostic tools.

  7. Measuring nanometer distances in nucleic acids using a sequence-independent nitroxide probe

    PubMed Central

    Qin, Peter Z; Haworth, Ian S; Cai, Qi; Kusnetzow, Ana K; Grant, Gian Paola G; Price, Eric A; Sowa, Glenna Z; Popova, Anna; Herreros, Bruno; He, Honghang

    2008-01-01

    This protocol describes the procedures for measuring nanometer distances in nucleic acids using a nitroxide probe that can be attached to any nucleotide within a given sequence. Two nitroxides are attached to phosphorothioates that are chemically substituted at specific sites of DNA or RNA. Inter-nitroxide distances are measured using a four-pulse double electron–electron resonance technique, and the measured distances are correlated to the parent structures using a Web-accessible computer program. Four to five days are needed for sample labeling, purification and distance measurement. The procedures described herein provide a method for probing global structures and studying conformational changes of nucleic acids and protein/nucleic acid complexes. PMID:17947978

  8. Complete Genome Sequence of a thermotolerant sporogenic lactic acid bacterium, Bacillus coagulans strain 36D1.

    PubMed

    Rhee, Mun Su; Moritz, Brélan E; Xie, Gary; Glavina Del Rio, T; Dalin, E; Tice, H; Bruce, D; Goodwin, L; Chertkov, O; Brettin, T; Han, C; Detter, C; Pitluck, S; Land, Miriam L; Patel, Milind; Ou, Mark; Harbrucker, Roberta; Ingram, Lonnie O; Shanmugam, K T

    2011-12-31

    Bacillus coagulans is a ubiquitous soil bacterium that grows at 50-55 °C and pH 5.0 and ferments various sugars that constitute plant biomass to L (+)-lactic acid. The ability of this sporogenic lactic acid bacterium to grow at 50-55 °C and pH 5.0 makes this organism an attractive microbial biocatalyst for production of optically pure lactic acid at industrial scale not only from glucose derived from cellulose but also from xylose, a major constituent of hemicellulose. This bacterium is also considered as a potential probiotic. Complete genome sequence of a representative strain, B. coagulans strain 36D1, is presented and discussed.

  9. The amino acid sequence of Lady Amherst's pheasant (Chrysolophus amherstiae) and golden pheasant (Chrysolophus pictus) egg-white lysozymes.

    PubMed

    Araki, T; Kuramoto, M; Torikata, T

    1990-09-01

    The amino acids of Lady Amherst's pheasant and golden pheasant egg-white lysozymes have been sequenced. The carboxymethylated lysozymes were digested with trypsin followed by sequencing of the tryptic peptides. Lady Amherst's pheasant lysozyme proved to consist of 129 amino acid residues, and a relative molecular mass of 14,423 Da was calculated. This lysozyme had 6 amino acids substitutions when compared with hen egg-white lysozyme: Phe3 to Tyr, His15 to Leu, Gln41 to His, Asn77 to His, Gln 121 to Asn, and a newly found substitution of Ile124 to Thr. The amino acid sequence of golden pheasant lysozyme was identical to that of Lady Amherst's phesant lysozyme. The phylogenetic tree constructured by the comparison of amino acid sequences of phasianoid birds lysozymes revealed a minimum genetic distance between these pheasants and the turkey-peafowl group.

  10. Random-walk model of homologous recombination

    NASA Astrophysics Data System (ADS)

    Fujitani, Youhei; Kobayashi, Ichizo

    1995-12-01

    Interaction between two homologous (i.e., identical or nearly identical) DNA sequences leads to their homologous recombination in the cell. We present the following stochastic model to explain the dependence of the frequency of homologous recombination on the length of the homologous region. The branch point connecting the two DNAs in a reaction intermediate follows the random-walk process along the homology (N base-pairs). If the branch point reaches either of the homology ends, it bounds back to the homologous region at a probability of γ (reflection coefficient) and is destroyed at a probability of 1-γ. When γ is small, the frequency of homologous recombination is found to be proportional to N3 for smaller N and a linear function of N for larger N. The exponent of the nonlinear dependence for smaller N decreases from three as γ increases. When γ=1, only the linear dependence is left. These theoretical results can explain many experimental data in various systems. (c) 1995 The American Physical Society

  11. A 25-Amino Acid Sequence of the Arabidopsis TGD2 Protein Is Sufficient for Specific Binding of Phosphatidic Acid*

    PubMed Central

    Lu, Binbin; Benning, Christoph

    2009-01-01

    Genetic analysis suggests that the TGD2 protein of Arabidopsis is required for the biosynthesis of endoplasmic reticulum derived thylakoid lipids. TGD2 is proposed to be the substrate-binding protein of a presumed lipid transporter consisting of the TGD1 (permease) and TGD3 (ATPase) proteins. The TGD1, -2, and -3 proteins are localized in the inner chloroplast envelope membrane. TGD2 appears to be anchored with an N-terminal membrane-spanning domain into the inner envelope membrane, whereas the C-terminal domain faces the intermembrane space. It was previously shown that the C-terminal domain of TGD2 binds phosphatidic acid (PtdOH). To investigate the PtdOH binding site of TGD2 in detail, the C-terminal domain of the TGD2 sequence lacking the transit peptide and transmembrane sequences was fused to the C terminus of the Discosoma sp. red fluorescent protein (DR). This greatly improved the solubility of the resulting DR-TGD2C fusion protein following production in Escherichia coli. The DR-TGD2C protein bound PtdOH with high specificity, as demonstrated by membrane lipid-protein overlay and liposome association assays. Internal deletion and truncation mutagenesis identified a previously undescribed minimal 25-amino acid fragment in the C-terminal domain of TGD2 that is sufficient for PtdOH binding. Binding characteristics of this 25-mer were distinctly different from those of TGD2C, suggesting that additional sequences of TGD2 providing the proper context for this 25-mer are needed for wild type-like PtdOH binding. PMID:19416982

  12. Nucleotide sequence of the luxC gene encoding fatty acid reductase of the lux operon from Photobacterium leiognathi.

    PubMed

    Lin, J W; Chao, Y F; Weng, S F

    1993-02-26

    The nucleotide sequence of the luxC gene (EMBL Accession No. 65156) encoding fatty acid reductase (FAR) of the lux operon from Photobacterium leiognathi PL741 was determined and the encoded amino acid sequence deduced. The fatty acid reductase is a component of the fatty acid reductase complex. The complex is responsible for converting fatty acid to aldehyde which serves as the substrate in the luciferase-catalyzed bioluminescent reaction. The protein comprises 478 amino acid residues and has a calculated M(r) of 53,858. Alignment and comparison of the fatty acid reductase of P. leiognathi with that of Vibrio harveyi B392 and Vibrio fischeri ATCC 7744 shows that there is 70% and 59% amino acid residues identity, respectively.

  13. Identification of Protein-Protein Interactions via a Novel Matrix-Based Sequence Representation Model with Amino Acid Contact Information.

    PubMed

    Ding, Yijie; Tang, Jijun; Guo, Fei

    2016-09-24

    Identification of protein-protein interactions (PPIs) is a difficult and important problem in biology. Since experimental methods for predicting PPIs are both expensive and time-consuming, many computational methods have been developed to predict PPIs and interaction networks, which can be used to complement experimental approaches. However, these methods have limitations to overcome. They need a large number of homology proteins or literature to be applied in their method. In this paper, we propose a novel matrix-based protein sequence representation approach to predict PPIs, using an ensemble learning method for classification. We construct the matrix of Amino Acid Contact (AAC), based on the statistical analysis of residue-pairing frequencies in a database of 6323 protein-protein complexes. We first represent the protein sequence as a Substitution Matrix Representation (SMR) matrix. Then, the feature vector is extracted by applying algorithms of Histogram of Oriented Gradient (HOG) and Singular Value Decomposition (SVD) on the SMR matrix. Finally, we feed the feature vector into a Random Forest (RF) for judging interaction pairs and non-interaction pairs. Our method is applied to several PPI datasets to evaluate its performance. On the S . c e r e v i s i a e dataset, our method achieves 94 . 83 % accuracy and 92 . 40 % sensitivity. Compared with existing methods, and the accuracy of our method is increased by 0 . 11 percentage points. On the H . p y l o r i dataset, our method achieves 89 . 06 % accuracy and 88 . 15 % sensitivity, the accuracy of our method is increased by 0 . 76 % . On the H u m a n PPI dataset, our method achieves 97 . 60 % accuracy and 96 . 37 % sensitivity, and the accuracy of our method is increased by 1 . 30 % . In addition, we test our method on a very important PPI network, and it achieves 92 . 71 % accuracy. In the Wnt-related network, the accuracy of our method is increased by 16 . 67 % . The source code and all datasets are available

  14. Identification of Protein–Protein Interactions via a Novel Matrix-Based Sequence Representation Model with Amino Acid Contact Information

    PubMed Central

    Ding, Yijie; Tang, Jijun; Guo, Fei

    2016-01-01

    Identification of protein–protein interactions (PPIs) is a difficult and important problem in biology. Since experimental methods for predicting PPIs are both expensive and time-consuming, many computational methods have been developed to predict PPIs and interaction networks, which can be used to complement experimental approaches. However, these methods have limitations to overcome. They need a large number of homology proteins or literature to be applied in their method. In this paper, we propose a novel matrix-based protein sequence representation approach to predict PPIs, using an ensemble learning method for classification. We construct the matrix of Amino Acid Contact (AAC), based on the statistical analysis of residue-pairing frequencies in a database of 6323 protein–protein complexes. We first represent the protein sequence as a Substitution Matrix Representation (SMR) matrix. Then, the feature vector is extracted by applying algorithms of Histogram of Oriented Gradient (HOG) and Singular Value Decomposition (SVD) on the SMR matrix. Finally, we feed the feature vector into a Random Forest (RF) for judging interaction pairs and non-interaction pairs. Our method is applied to several PPI datasets to evaluate its performance. On the S.cerevisiae dataset, our method achieves 94.83% accuracy and 92.40% sensitivity. Compared with existing methods, and the accuracy of our method is increased by 0.11 percentage points. On the H.pylori dataset, our method achieves 89.06% accuracy and 88.15% sensitivity, the accuracy of our method is increased by 0.76%. On the Human PPI dataset, our method achieves 97.60% accuracy and 96.37% sensitivity, and the accuracy of our method is increased by 1.30%. In addition, we test our method on a very important PPI network, and it achieves 92.71% accuracy. In the Wnt-related network, the accuracy of our method is increased by 16.67%. The source code and all datasets are available at https://figshare.com/s/580c11dce13e63cb9a53. PMID

  15. Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs

    PubMed Central

    Chen, Ke; Kurgan, Lukasz A; Ruan, Jishou

    2007-01-01

    Background Traditionally, it is believed that the native structure of a protein corresponds to a global minimum of its free energy. However, with the growing number of known tertiary (3D) protein structures, researchers have discovered that some proteins can alter their structures in response to a change in their surroundings or with the help of other proteins or ligands. Such structural shifts play a crucial role with respect to the protein function. To this end, we propose a machine learning method for the prediction of the flexible/rigid regions of proteins (referred to as FlexRP); the method is based on a novel sequence representation and feature selection. Knowledge of the flexible/rigid regions may provide insights into the protein folding process and the 3D structure prediction. Results The flexible/rigid regions were defined based on a dataset, which includes protein sequences that have multiple experimental structures, and which was previously used to study the structural conservation of proteins. Sequences drawn from this dataset were represented based on feature sets that were proposed in prior research, such as PSI-BLAST profiles, composition vector and binary sequence encoding, and a newly proposed representation based on frequencies of k-spaced amino acid pairs. These representations were processed by feature selection to reduce the dimensionality. Several machine learning methods for the prediction of flexible/rigid regions and two recently proposed methods for the prediction of conformational changes and unstructured regions were compared with the proposed method. The FlexRP method, which applies Logistic Regression and collocation-based representation with 95 features, obtained 79.5% accuracy. The two runner-up methods, which apply the same sequence representation and Support Vector Machines (SVM) and Naïve Bayes classifiers, obtained 79.2% and 78.4% accuracy, respectively. The remaining considered methods are characterized by accuracies below 70

  16. Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

    DOEpatents

    Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

    2001-01-01

    cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.

  17. Gene sequence and predicted amino acid sequence of the motA protein, a membrane-associated protein required for flagellar rotation in Escherichia coli.

    PubMed Central

    Dean, G E; Macnab, R M; Stader, J; Matsumura, P; Burks, C

    1984-01-01

    The motA and motB gene products of Escherichia coli are integral membrane proteins necessary for flagellar rotation. We determined the DNA sequence of the region containing the motA gene and its promoter. Within this sequence, there is an open reading frame of 885 nucleotides, which with high probability (98% confidence level) meets criteria for a coding sequence. The 295-residue amino acid translation product had a molecular weight of 31,974, in good agreement with the value determined experimentally by gel electrophoresis. The amino acid sequence, which was quite hydrophobic, was subjected to a theoretical analysis designed to predict membrane-spanning alpha-helical segments of integral membrane proteins; four such hydrophobic helices were predicted by this treatment. Additional amphipathic helices may also be present. A remarkable feature of the sequence is the existence of two segments of high uncompensated charge density, one positive and the other negative. Possible organization of the protein in the membrane is discussed. Asymmetry in the amino acid composition of translated DNA sequences was used to distinguish between two possible initiation codons. The use of this method as a criterion for authentication of coding regions is described briefly in an Appendix. PMID:6090403

  18. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3

    PubMed Central

    Xiao, Jingfa; Hao, Lirui; Crowley, David E.; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals. PMID:26301592

  19. Genome Sequence Analysis of the Naphthenic Acid Degrading and Metal Resistant Bacterium Cupriavidus gilardii CR3.

    PubMed

    Wang, Xiaoyu; Chen, Meili; Xiao, Jingfa; Hao, Lirui; Crowley, David E; Zhang, Zhewen; Yu, Jun; Huang, Ning; Huo, Mingxin; Wu, Jiayan

    2015-01-01

    Cupriavidus sp. are generally heavy metal tolerant bacteria with the ability to degrade a variety of aromatic hydrocarbon compounds, although the degradation pathways and substrate versatilities remain largely unknown. Here we studied the bacterium Cupriavidus gilardii strain CR3, which was isolated from a natural asphalt deposit, and which was shown to utilize naphthenic acids as a sole carbon source. Genome sequencing of C. gilardii CR3 was carried out to elucidate possible mechanisms for the naphthenic acid biodegradation. The genome of C. gilardii CR3 was composed of two circular chromosomes chr1 and chr2 of respectively 3,539,530 bp and 2,039,213 bp in size. The genome for strain CR3 encoded 4,502 putative protein-coding genes, 59 tRNA genes, and many other non-coding genes. Many genes were associated with xenobiotic biodegradation and metal resistance functions. Pathway prediction for degradation of cyclohexanecarboxylic acid, a representative naphthenic acid, suggested that naphthenic acid undergoes initial ring-cleavage, after which the ring fission products can be degraded via several plausible degradation pathways including a mechanism similar to that used for fatty acid oxidation. The final metabolic products of these pathways are unstable or volatile compounds that were not toxic to CR3. Strain CR3 was also shown to have tolerance to at least 10 heavy metals, which was mainly achieved by self-detoxification through ion efflux, metal-complexation and metal-reduction, and a powerful DNA self-repair mechanism. Our genomic analysis suggests that CR3 is well adapted to survive the harsh environment in natural asphalts containing naphthenic acids and high concentrations of heavy metals.

  20. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, Heinz-Ulrich G.; Gray, Joe W.

    1995-01-01

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers ard probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.

  1. Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

    DOEpatents

    Weier, H.U.G.; Gray, J.W.

    1995-06-27

    A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.

  2. Sequence-defined bioactive macrocycles via an acid-catalysed cascade reaction

    NASA Astrophysics Data System (ADS)

    Porel, Mintu; Thornlow, Dana N.; Phan, Ngoc N.; Alabi, Christopher A.

    2016-06-01

    Synthetic macrocycles derived from sequence-defined oligomers are a unique structural class whose ring size, sequence and structure can be tuned via precise organization of the primary sequence. Similar to peptides and other peptidomimetics, these well-defined synthetic macromolecules become pharmacologically relevant when bioactive side chains are incorporated into their primary sequence. In this article, we report the synthesis of oligothioetheramide (oligoTEA) macrocycles via a one-pot acid-catalysed cascade reaction. The versatility of the cyclization chemistry and modularity of the assembly process was demonstrated via the synthesis of >20 diverse oligoTEA macrocycles. Structural characterization via NMR spectroscopy revealed the presence of conformational isomers, which enabled the determination of local chain dynamics within the macromolecular structure. Finally, we demonstrate the biological activity of oligoTEA macrocycles designed to mimic facially amphiphilic antimicrobial peptides. The preliminary results indicate that macrocyclic oligoTEAs with just two-to-three cationic charge centres can elicit potent antibacterial activity against Gram-positive and Gram-negative bacteria.

  3. Complete amino acid sequence of ananain and a comparison with stem bromelain and other plant cysteine proteases.

    PubMed Central

    Lee, K L; Albee, K L; Bernasconi, R J; Edmunds, T

    1997-01-01

    The amino acid sequences of ananain (EC3.4.22.31) and stem bromelain (3.4.22.32), two cysteine proteases from pineapple stem, are similar yet ananain and stem bromelain possess distinct specificities towards synthetic peptide substrates and different reactivities towards the cysteine protease inhibitors E-64 and chicken egg white cystatin. We present here the complete amino acid sequence of ananain and compare it with the reported sequences of pineapple stem bromelain, papain and chymopapain from papaya and actinidin from kiwifruit. Ananain is comprised of 216 residues with a theoretical mass of 23464 Da. This primary structure includes a sequence insert between residues 170 and 174 not present in stem bromelain or papain and a hydrophobic series of amino acids adjacent to His-157. It is possible that these sequence differences contribute to the different substrate and inhibitor specificities exhibited by ananain and stem bromelain. PMID:9355753

  4. Microbial community dynamics in bioaugmented sequencing batch reactors for bromoamine acid removal.

    PubMed

    Qu, Yuanyuan; Zhou, Jiti; Wang, Jing; Fu, Xiang; Xing, Linlin

    2005-05-01

    Sphingomonas xenophaga QYY with the ability to degrade bromoamine acid (BAA) was previously isolated from sludge samples. The enhancement of BAA removal by strain QYY in sequencing batch reactors (SBRs) was investigated in this study. The results showed that augmented SBRs exhibited stronger abilities to degrade BAA than the non-augmented control one. In order to estimate the relationship between community dynamics and function of augmented SBRs, a combined method based on fingerprints (ribosomal intergenic spacer analysis, RISA) and 16S rRNA gene sequencing was used. The results indicated that the microbial community dynamics were substantially changed, and the introduced strain QYY was persistent in the augmented systems. This study suggests that it is feasible and potentially useful to enhance BAA removal using BAA-degrading bacteria, such as S. xenophaga QYY.

  5. [Measurement of the amino acid sequence for the fusion protein FP3 with LC-MS/MS].

    PubMed

    Li, Xiang; Gao, Xiang-Dong; Tao, Lei; Pei, De-Ning; Guo, Ying; Rao, Chun-Ming; Wang, Jun-Zhi

    2012-02-01

    The amino acid sequence of the fusion protein FP3 was measured by two types of LC-MS/MS and its primary structure was confirmed. After reduction and alkylation, the protein was digested with trypsin and glycosyl groups in glycopeptide were removed by PNGase F. The mixed peptides were separated by LC, then Q-TOF and Ion trap tandem mass spectrometry were used to measure b, y fragment ions of each peptide to analyze the amino acid sequence of fusion protein FP3. Seventy-six percent of full amino acid sequence of the fusion protein FP3 was measured by LC-ESI-Q-TOF with the remaining 24% completed by LC-ESI-Trap. As LC-MS and tandem mass spectrometry are rapid, sensitive, accurate to measure the protein amino acid sequence, they are important approach to structure analysis and identification of recombinant protein.

  6. Introduction of Ca(2+)-binding amino-acid sequence into the T4 lysozyme.

    PubMed

    Leontiev, V V; Uversky, V N; Permyakov, E A; Murzin, A G

    1993-03-05

    The 51-62 loop of T4 phage lysozyme was altered by site-directed mutagenesis to obtain maximal homology with the typical EF-hand motif. A Ca(2+)-binding site was designed and created by replacing both Gly-51 and Asn-53 with aspartic acid. The mutant T4 lysozyme (G51D/N53D) was expressed in Escherichia coli. The activity of the G51D/N53D-mutant was about 60% of that of the wild-type protein. This mutant can bind Ca2+ ions specifically, while the effective dissociation constant was essentially greater than that of the EF-hand proteins. Stability of the G51D/N53D-mutant apo-form to urea- or temperature-induced denaturation was the same as that of the wild-type protein. In the presence of Ca2+ ions in solution the stability of the mutant T4 phage lysozyme was less than that of the wild-type protein. It is suggested that the binding of Ca2+ by the mutant is accompanied by the considerable conformational changes in the 'corrected' loop, which can lead to the Ca(2+)-induced destabilization of the protein.

  7. NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents

    PubMed Central

    Liu, Sophia S.; Hockenberry, Adam J.; Lancichinetti, Andrea; Jewett, Michael C.

    2016-01-01

    The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems. PMID:27835644

  8. Deduced amino acid sequence, functional expression, and unique enzymatic properties of the form I and form II ribulose bisphosphate carboxylase/oxygenase from the chemoautotrophic bacterium Thiobacillus denitrificans.

    PubMed

    Hernandez, J M; Baker, S H; Lorbach, S C; Shively, J M; Tabita, F R

    1996-01-01

    The cbbL cbbS and cbbM genes of Thiobacillus denitrificans, encoding form I and form II ribulose 1,5-bisphosphate carboxylase/oxygenase (RubisCO), respectively, were found to complement a RubisCO-negative mutant of Rhodobacter sphaeroides to autotrophic growth. Endogenous T. denitrificans promoters were shown to function in R. sphaeroides, resulting in high levels of cbbL cbbS and cbbM expression in the R. sphaeroides host. This expression system provided high levels of both T. denitrificans enzymes, each of which was highly purified. The deduced amino acid sequence of the form I enzyme indicated that the large subunit was closely homologous to previously sequenced form I RubisCO enzymes from sulfur-oxidizing bacteria. The form I T. denitrificans enzyme possessed a very low substrate specificity factor and did not exhibit fallover, and yet this enzyme showed a poor ability to recover from incubation with ribulose 1,5-bisphosphate. The deduced amino acid sequence of the form II T. denitrificans enzyme resembled those of other form II RubisCO enzymes. The substrate specificity factor was characteristically low, and the lack of fallover and the inhibition by ribulose 1,5-bisphosphate were similar to those of form II RubisCO obtained from nonsulfur purple bacteria. Both form I and form II RubisCO from T. denitrificans possessed high KCO2 values, suggesting that this organism might suffer in environments containing low levels of dissolved CO2. These studies present the initial description of the kinetic properties of form I and form II RubisCO from a chemoautotrophic bacterium that synthesizes both types of enzyme.

  9. Deduced amino acid sequence, functional expression, and unique enzymatic properties of the form I and form II ribulose bisphosphate carboxylase/oxygenase from the chemoautotrophic bacterium Thiobacillus denitrificans.

    PubMed Central

    Hernandez, J M; Baker, S H; Lorbach, S C; Shively, J M; Tabita, F R

    1996-01-01

    The cbbL cbbS and cbbM genes of Thiobacillus denitrificans, encoding form I and form II ribulose 1,5-bisphosphate carboxylase/oxygenase (RubisCO), respectively, were found to complement a RubisCO-negative mutant of Rhodobacter sphaeroides to autotrophic growth. Endogenous T. denitrificans promoters were shown to function in R. sphaeroides, resulting in high levels of cbbL cbbS and cbbM expression in the R. sphaeroides host. This expression system provided high levels of both T. denitrificans enzymes, each of which was highly purified. The deduced amino acid sequence of the form I enzyme indicated that the large subunit was closely homologous to previously sequenced form I RubisCO enzymes from sulfur-oxidizing bacteria. The form I T. denitrificans enzyme possessed a very low substrate specificity factor and did not exhibit fallover, and yet this enzyme showed a poor ability to recover from incubation with ribulose 1,5-bisphosphate. The deduced amino acid sequence of the form II T. denitrificans enzyme resembled those of other form II RubisCO enzymes. The substrate specificity factor was characteristically low, and the lack of fallover and the inhibition by ribulose 1,5-bisphosphate were similar to those of form II RubisCO obtained from nonsulfur purple bacteria. Both form I and form II RubisCO from T. denitrificans possessed high KCO2 values, suggesting that this organism might suffer in environments containing low levels of dissolved CO2. These studies present the initial description of the kinetic properties of form I and form II RubisCO from a chemoautotrophic bacterium that synthesizes both types of enzyme. PMID:8550452

  10. Morphological tranformation of calcite crystal growth by prismatic "acidic" polypeptide sequences.

    SciTech Connect

    Kim, I; Giocondi, J L; Orme, C A; Collino, J; Evans, J S

    2007-02-13

    Many of the interesting mechanical and materials properties of the mollusk shell are thought to stem from the prismatic calcite crystal assemblies within this composite structure. It is now evident that proteins play a major role in the formation of these assemblies. Recently, a superfamily of 7 conserved prismatic layer-specific mollusk shell proteins, Asprich, were sequenced, and the 42 AA C-terminal sequence region of this protein superfamily was found to introduce surface voids or porosities on calcite crystals in vitro. Using AFM imaging techniques, we further investigate the effect that this 42 AA domain (Fragment-2) and its constituent subdomains, DEAD-17 and Acidic-2, have on the morphology and growth kinetics of calcite dislocation hillocks. We find that Fragment-2 adsorbs on terrace surfaces and pins acute steps, accelerates then decelerates the growth of obtuse steps, forms clusters and voids on terrace surfaces, and transforms calcite hillock morphology from a rhombohedral form to a rounded one. These results mirror yet are distinct from some of the earlier findings obtained for nacreous polypeptides. The subdomains Acidic-2 and DEAD-17 were found to accelerate then decelerate obtuse steps and induce oval rather than rounded hillock morphologies. Unlike DEAD-17, Acidic-2 does form clusters on terrace surfaces and exhibits stronger obtuse velocity inhibition effects than either DEAD-17 or Fragment-2. Interestingly, a 1:1 mixture of both subdomains induces an irregular polygonal morphology to hillocks, and exhibits the highest degree of acute step pinning and obtuse step velocity inhibition. This suggests that there is some interplay between subdomains within an intra (Fragment-2) or intermolecular (1:1 mixture) context, and sequence interplay phenomena may be employed by biomineralization proteins to exert net effects on crystal growth and morphology.

  11. Biochemical characterization of the murine S100A9 (MRP14) protein suggests that it is functionally equivalent to its human counterpart despite its low degree of sequence homology.

    PubMed

    Nacken, W; Sopalla, C; Pröpper, C; Sorg, C; Kerkhoff, C

    2000-01-01

    Due to the low degree of sequence similarity it has been speculated that murine and human S100A9 (MRP14), an inflammatory marker protein belonging to the S100 protein family, may have different cellular functions in mouse and man. The present study was undertaken to investigate the murine S100A9 protein (mS100A9) biochemically. We demonstrate that in murine peripheral CD11b+ cells up to 20% of the protein of the cytosolic fraction consists of mS100A9 and that several minor mS100A9 isoforms are present. Cell fractionation experiments with CD11b+ murine leukocytes showed that mS100A9 is found in the cytosol as well as in the insoluble fraction. Transient expression of a green fluorescence protein-mS100A9 fusion in mammalian cells revealed that mS100A9 is localized in neither the nucleus nor the vesicles. Recombinantly expressed murine S100A9 interacts in vitro with murine and human S100A8 in an in vitro glutathione S-transferase pull-down assay. Homodimerization was not observed. For further biochemical analysis the myeloid 32D cell line is presented as a suitable model, to study murine myeloid expressed S100 proteins. Both murine S100A9 and its dimerization partner mS100A8 are expressed at the onset of granulocyte-colony stimulating factor induced myeloid differentiation. Substantial amounts of this complex are constitutively secreted by granulocytic 32D cells into the medium. In summary, these data suggest, that the human and murine S100A9 may share a higher degree of functional homology than of sequence similarity.

  12. Sequence selective recognition of double-stranded RNA using triple helix-forming peptide nucleic acids.

    PubMed

    Zengeya, Thomas; Gupta, Pankaj; Rozners, Eriks

    2014-01-01

    Noncoding RNAs are attractive targets for molecular recognition because of the central role they play in gene expression. Since most noncoding RNAs are in a double-helical conformation, recognition of such structures is a formidable problem. Herein, we describe a method for sequence-selective recognition of biologically relevant double-helical RNA (illustrated on ribosomal A-site RNA) using peptide nucleic acids (PNA) that form a triple helix in the major grove of RNA under physiologically relevant conditions. Protocols for PNA preparation and binding studies using isothermal titration calorimetry are described in detail.

  13. Fast computational methods for predicting protein structure from primary amino acid sequence

    DOEpatents

    Agarwal, Pratul Kumar

    2011-07-19

    The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.

  14. Fluorescence energy transfer as a probe for nucleic acid structures and sequences.

    PubMed Central

    Mergny, J L; Boutorine, A S; Garestier, T; Belloc, F; Rougée, M; Bulychev, N V; Koshkin, A A; Bourson, J; Lebedev, A V; Valeur, B

    1994-01-01

    The primary or secondary structure of single-stranded nucleic acids has been investigated with fluorescent oligonucleotides, i.e., oligonucleotides covalently linked to a fluorescent dye. Five different chromophores were used: 2-methoxy-6-chloro-9-amino-acridine, coumarin 500, fluorescein, rhodamine and ethidium. The chemical synthesis of derivatized oligonucleotides is described. Hybridization of two fluorescent oligonucleotides to adjacent nucleic acid sequences led to fluorescence excitation energy transfer between the donor and the acceptor dyes. This phenomenon was used to probe primary and secondary structures of DNA fragments and the orientation of oligodeoxynucleotides synthesized with the alpha-anomers of nucleoside units. Fluorescence energy transfer can be used to reveal the formation of hairpin structures and the translocation of genes between two chromosomes. PMID:8152922

  15. Complete genome sequence of Lactococcus lactis IO-1, a lactic acid bacterium that utilizes xylose and produces high levels of L-lactic acid.

    PubMed

    Kato, Hiroaki; Shiwa, Yuh; Oshima, Kenshiro; Machii, Miki; Araya-Kojima, Tomoko; Zendo, Takeshi; Shimizu-Kadota, Mariko; Hattori, Masahira; Sonomoto, Kenji; Yoshikawa, Hirofumi

    2012-04-01

    We report the complete genome sequence of Lactococcus lactis IO-1 (= JCM7638). It is a nondairy lactic acid bacterium, produces nisin Z, ferments xylose, and produces predominantly L-lactic acid at high xylose concentrations. From ortholog analysis with other five L. lactis strains, IO-1 was identified as L. lactis subsp. lactis.

  16. Complete genome sequence of Bacillus amyloliquefaciens LL3, which exhibits glutamic acid-independent production of poly-γ-glutamic acid.

    PubMed

    Geng, Weitao; Cao, Mingfeng; Song, Cunjiang; Xie, Hui; Liu, Li; Yang, Chao; Feng, Jun; Zhang, Wei; Jin, Yinghong; Du, Yang; Wang, Shufang

    2011-07-01

    Bacillus amyloliquefaciens is one of most prevalent Gram-positive aerobic spore-forming bacteria with the ability to synthesize polysaccharides and polypeptides. Here, we report the complete genome sequence of B. amyloliquefaciens LL3, which was isolated from fermented food and presents the glutamic acid-independent production of poly-γ-glutamic acid.

  17. Formation Sequences of Iron Minerals in the Acidic Alteration Products and Variation of Hydrothermal Fluid Conditions

    NASA Astrophysics Data System (ADS)

    Isobe, H.; Yoshizawa, M.

    2008-12-01

    Iron minerals have important role in environmental issues not only on the Earth but also other terrestrial planets. Iron mineral species related to alteration products of primary minerals with surface or subsurface fluids are characterized by temperature, acidity and redox conditions of the fluids. We can see various iron- bearing alteration products in alteration products around fumaroles in geothermal/volcanic areas. In this study, zonal structures of iron minerals in alteration products of the geothermal area are observed to elucidate temporal and spatial variation of hydrothermal fluids. Alteration of the pyroxene-amphibole andesite of Garan-dake volcano, Oita, Japan occurs by the acidic hydrothermal fluid to form cristobalite leaching out elements other than Si. Hand specimens with unaltered or weakly altered core and cristobalite crust show various sequences of layers. XRD analysis revealed that the alteration degree is represented by abundance of cristobalite. Intermediately altered layers are characterized by occurrence including alunite, pyrite, kaolinite, goethite and hematite. A specimen with reddish brown core surrounded by cristobalite-rich white crust has brown colored layers at the boundary of core and the crust. Reddish core is characterized by occurrence of crystalline hematite by XRD. Another hand specimen has light gray core, which represents reduced conditions, and white cristobalite crust with light brown and reddish brown layers of ferric iron minerals between the core and the crust. On the other hand, hornblende crystals, typical ferrous iron-bearing mineral of the host rock, are well preserved in some samples with strongly decolorized cristobalite-rich groundmass. Hydrothermal alteration experiments of iron-rich basaltic material shows iron mineral species depend on acidity and temperature of the fluid. Oxidation states of the iron-bearing mineral species are strongly influenced by the acidity and redox conditions. Variations of alteration

  18. Design, synthesis, and characterization of a protein sequencing reagent yielding amino acid derivatives with enhanced detectability by mass spectrometry.

    PubMed Central

    Aebersold, R.; Bures, E. J.; Namchuk, M.; Goghari, M. H.; Shushan, B.; Covey, T. C.

    1992-01-01

    We report the design, chemical synthesis, and structural and functional characterization of a novel reagent for protein sequence analysis by the Edman degradation, yielding amino acid derivatives rapidly detectable at high sensitivity by ion-evaporation mass spectrometry. We demonstrate that the reagent 3-[4'(ethylene-N,N,N-trimethylamino)phenyl]-2-isothiocyanate is chemically stable and shows coupling and cyclization/cleavage yields comparable to phenylisothiocyanate, the standard reagent in chemical sequence analysis, under conditions typically encountered in manual or automated sequence analysis. Amino acid derivatives generated with this reagent were detectable by ion-evaporation mass spectrometry at the subfemtomole sensitivity level at a pace of one sample per minute. Furthermore, derivatives were identified by their mass, thus permitting the rapid and highly sensitive determination of the molecular nature of modified amino acids. Derivatives of amino acids with acidic, basic, polar, or hydrophobic side chains were reproducibly detectable at comparable sensitivities. The polar nature of the reagent required covalent immobilization of polypeptides prior to automated sequence analysis. This reagent, used in automated sequence analysis, has the potential for overcoming the limitations in sensitivity, speed, and the ability to characterize modified amino acid residues inherent in the chemical sequencing methods that are currently used. PMID:1304351

  19. Homology model building of the HMG-1 box structural domain.

    PubMed Central

    Baxevanis, A D; Bryant, S H; Landsman, D

    1995-01-01

    Nucleoproteins belonging to the HMG-1/2 family possess homologous domains approximately 75 amino acids in length. These domains, termed HMG-1 boxes, are highly structured, compact, and mediate the interaction between HMG-1 box-containing proteins and DNA in a variety of biological contexts. Homology model building experiments on HMG-1 box sequences 'threaded' through the 1H-NMR structure of an HMG-1 box from rat indicate that the domain does not have rigid sequence requirements for its formation. Energy calculations indicate that the structure of all HMG-1 box domains is stabilized primarily through hydrophobic interactions. We have found structural relationships in the absence of statistically significant sequence similarity, identifying several candidate proteins which could possibly assume the same three-dimensional conformation as the rat HMG-1 box motif. The threading technique provides a method by which significant structural similarities in a diverse protein family can be efficiently detected, and the 'structural alignment' derived by this method provides a rational basis through which phylogenetic relationships and the precise sites of interaction between HMG-1 box proteins and DNA can be deduced. Images PMID:7731789

  20. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production.

    PubMed

    Coulson, Thomas J D; Patten, Cheryl L

    2015-08-06

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences.

  1. Complete Genome Sequence of Enterobacter cloacae UW5, a Rhizobacterium Capable of High Levels of Indole-3-Acetic Acid Production

    PubMed Central

    Coulson, Thomas J. D.

    2015-01-01

    We report the complete genome sequence of Enterobacter cloacae UW5, an indole-3-acetic acid-producing rhizobacterium originally isolated from the rhizosphere of grass. The 4.9-Mbp genome has a G+C content of 54% and contains 4,496 protein-coding sequences. PMID:26251488

  2. Genome Sequence of the Lactic Acid Bacterium Lactococcus lactis subsp. lactis TOMSC161, Isolated from a Nonscalded Curd Pressed Cheese

    PubMed Central

    Velly, H.; Abraham, A.-L.; Loux, V.; Delacroix-Buchet, A.; Fonseca, F.; Bouix, M.

    2014-01-01

    Lactococcus lactis is a lactic