Izumikawa, Tomomi; Kitagawa, Hiroshi
2015-05-01
Thrombomodulin (TM) is a cell-surface glycoprotein and a critical mediator of endothelial anticoagulant function. TM exists as both a chondroitin sulfate (CS) proteoglycan (PG) form and a non-PG form lacking a CS chain (α-TM); therefore, TM can be described as a part-time PG. Previously, we reported that α-TM bears an immature, truncated linkage tetrasaccharide structure (GlcAβ1-3Galβ1-3Galβ1-4Xyl). However, the biosynthetic mechanism to generate part-time PGs remains unclear. In this study, we used several mutants to demonstrate that the amino acid sequence surrounding the CS attachment site influences the efficiency of chondroitin polymerization. In particular, the presence of acidic residues surrounding the CS attachment site was indispensable for the elongation of CS. In addition, mutants defective in CS elongation did not exhibit anti-coagulant activity, as in the case with α-TM. Together, these data support a model for CS chain assembly in which specific core protein determinants are recognized by a key biosynthetic enzyme involved in chondroitin polymerization. Copyright © 2015 Elsevier Inc. All rights reserved.
Christiaens, H; Leer, R J; Pouwels, P H; Verstraete, W
1992-12-01
The conjugated bile acid hydrolase gene from the silage isolate Lactobacillus plantarum 80 was cloned and expressed in Escherichia coli MC1061. For the screening of this hydrolase gene within the gene bank, a direct plate assay developed by Dashkevicz and Feighner (M. P. Dashkevicz and S. D. Feighner, Appl. Environ. Microbiol. 53:331-336, 1989) was adapted to the growth requirements of E. coli. Because of hydrolysis and medium acidification, hydrolase-active colonies were surrounded with big halos of precipitated, free bile acids. This phenomenon was also obtained when the gene was cloned into a multicopy shuttle vector and subsequently reintroduced into the parental Lactobacillus strain. The cbh gene and surrounding regions were characterized by nucleotide sequence analysis. The deduced amino acid sequence was shown to have 52% similarity with a penicillin V amidase from Bacillus sphaericus. Preliminary characterization of the gene product showed that it is a cholylglycine hydrolase (EC 3.5.1.24) with only slight activity against taurine conjugates. The optimum pH was between 4.7 and 5.5. Optimum temperature ranged from 30 to 45 degrees C. Southern blot analysis indicated that the cloned gene has similarity with genomic DNA of bile acid hydrolase-active Lactobacillus spp. of intestinal origin.
In silico analysis of subtilisin from Glaciozyma antarctica PI12
NASA Astrophysics Data System (ADS)
Mustafha, Siti Mardhiah; Murad, Abdul Munir Abdul; Mahadi, Nor Muhammad; Kamaruddin, Shazilah; Bakar, Farah Diba Abu
2015-09-01
Subtilisins are a major class of industrial enzymes with a wide range of applications, especially in the detergent industry. In this study, a cDNA encoding subtilisin (GaSUBT) was isolated from the psychrophilic yeast Glaciozyma antarctica PI12, PCR amplified and sequenced. Various bioinformatics tools were used to characterize GaSUBT. GaSUBT comprises 1587 bp encoding 529 amino acids. The predicted molecular weight of the deduced protein is 55.34 kDa, with an isoelectric point of 6.25. GaSUBT was predicted to possess a signal peptide and a pro-peptide consisting of a peptidase inhibitor I9 sequence. Sequence alignment of the deduced amino acid sequence with other subtilisins in the NCBI database showed that the sequences surrounding the catalytic triad that forms the catalytic domain are well conserved.
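The reported sequence length and protein length can be cross-checked with the three-nucleotides-per-codon rule. The sketch below is only an arithmetic illustration; the function name is hypothetical, and the abstract does not say whether the 1587 bp figure includes a stop codon.

```python
def codons_in_orf(n_nucleotides: int) -> int:
    """Return the number of complete codons in a reading frame of the given length."""
    if n_nucleotides % 3 != 0:
        raise ValueError("length is not a multiple of three")
    return n_nucleotides // 3


# 1587 bp corresponds to 529 codons, matching the 529 amino acids reported.
print(codons_in_orf(1587))
```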
De Rocquigny, H; Ficheux, D; Gabus, C; Allain, B; Fournie-Zaluski, M C; Darlix, J L; Roques, B P
1993-02-25
The 56-amino-acid nucleocapsid protein (NCp10) of Moloney Murine Leukemia Virus contains a CysX2CysX4HisX4Cys zinc finger flanked by basic residues. In vitro, NCp10 promotes genomic RNA dimerization, a process most probably linked to genomic RNA packaging, and annealing of the replication primer tRNA(Pro) to the initiation site of reverse transcription. To characterize the amino acid sequences involved in the various functions of NCp10, we synthesized, by the solid-phase method, the native protein and a series of derived peptides shortened at the N- or C-terminus, with or without the zinc finger domain. In the latter case, the two parts of the protein were linked by a Glycine-Glycine spacer. In vitro studies of these peptides show that the nucleic acid annealing activities of NCp10 do not require a zinc finger but are critically dependent on the presence of specific sequences located on each side of the CCHC domain and containing proline and basic residues. Thus, deletion of 11R or 49PRPQT from the fully active 29-residue peptide 11RQGGERRRSQLDRDGGKKPRGPRGPRPQT53 leads to a complete loss of NCp10 activity. It is therefore proposed that in NCp10, the zinc finger directs the spatial recognition of the target RNAs by the basic domains surrounding the zinc finger.
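The CysX2CysX4HisX4Cys spacing can be written as a regular expression over one-letter amino acid codes. The following is a minimal sketch for locating such a motif in a sequence; the function name and the demonstration sequence are hypothetical, and the demonstration sequence is not the NCp10 sequence.

```python
import re

# Cys, any 2 residues, Cys, any 4 residues, His, any 4 residues, Cys.
CCHC = re.compile(r"C.{2}C.{4}H.{4}C")


def find_cchc(seq):
    """Return (0-based start, matched text) for each non-overlapping CCHC motif."""
    return [(m.start(), m.group()) for m in CCHC.finditer(seq)]


# Hypothetical demonstration sequence built piece by piece so the spacing is explicit.
demo = "AAA" + "C" + "AA" + "C" + "AAAA" + "H" + "AAAA" + "C" + "GG"
print(find_cchc(demo))

# The 29-residue active peptide quoted in the abstract (zinc finger replaced by a Gly-Gly spacer).
peptide = "RQGGERRRSQLDRDGGKKPRGPRGPRPQT"
print(len(peptide), find_cchc(peptide))
```

As expected for a peptide lacking the zinc finger, the search over the 29-residue peptide finds no motif.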
De Rocquigny, H; Ficheux, D; Gabus, C; Allain, B; Fournie-Zaluski, M C; Darlix, J L; Roques, B P
1993-01-01
The 56-amino-acid nucleocapsid protein (NCp10) of Moloney Murine Leukemia Virus contains a CysX2CysX4HisX4Cys zinc finger flanked by basic residues. In vitro, NCp10 promotes genomic RNA dimerization, a process most probably linked to genomic RNA packaging, and annealing of the replication primer tRNA(Pro) to the initiation site of reverse transcription. To characterize the amino acid sequences involved in the various functions of NCp10, we synthesized, by the solid-phase method, the native protein and a series of derived peptides shortened at the N- or C-terminus, with or without the zinc finger domain. In the latter case, the two parts of the protein were linked by a Glycine-Glycine spacer. In vitro studies of these peptides show that the nucleic acid annealing activities of NCp10 do not require a zinc finger but are critically dependent on the presence of specific sequences located on each side of the CCHC domain and containing proline and basic residues. Thus, deletion of 11R or 49PRPQT from the fully active 29-residue peptide 11RQGGERRRSQLDRDGGKKPRGPRGPRPQT53 leads to a complete loss of NCp10 activity. It is therefore proposed that in NCp10, the zinc finger directs the spatial recognition of the target RNAs by the basic domains surrounding the zinc finger. PMID:8451185
Common Amino Acid Subsequences in a Universal Proteome—Relevance for Food Science
Minkiewicz, Piotr; Darewicz, Małgorzata; Iwaniak, Anna; Sokołowska, Jolanta; Starowicz, Piotr; Bucholska, Justyna; Hrynkiewicz, Monika
2015-01-01
A common subsequence is a fragment of the amino acid chain that occurs in more than one protein. Common subsequences may be an object of interest for food scientists as biologically active peptides, epitopes, and/or protein markers that are used in comparative proteomics. An individual bioactive fragment, in particular the shortest fragment containing two or three amino acid residues, may occur in many protein sequences. An individual linear epitope may also be present in multiple sequences of precursor proteins. Although recent recommendations for prediction of allergenicity and cross-reactivity include not only sequence identity, but also similarities in secondary and tertiary structures surrounding the common fragment, local sequence identity may be used to screen protein sequence databases for potential allergens in silico. The main weakness of the screening process is that it overlooks allergens and cross-reactivity cases without identical fragments corresponding to linear epitopes. A single peptide may also serve as a marker of a group of allergens that belong to the same family and, possibly, reveal cross-reactivity. This review article discusses the benefits for food scientists that follow from the common subsequences concept. PMID:26340620
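The idea of screening sequences for shared fragments can be sketched as a search for k-residue fragments that occur in more than one protein. This is an illustrative sketch only, not the procedure used in the review; the function names, protein names and sequences are hypothetical.

```python
def kmers(seq, k):
    """Return the set of all contiguous fragments of length k in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def common_subsequences(proteins, k):
    """Map each k-residue fragment to the proteins containing it,
    keeping only fragments found in more than one protein."""
    index = {}
    for name, seq in proteins.items():
        for frag in kmers(seq, k):
            index.setdefault(frag, set()).add(name)
    return {frag: names for frag, names in index.items() if len(names) > 1}


# Hypothetical toy sequences.
proteins = {"protA": "MKVLPRG", "protB": "GGVLPAA", "protC": "MKAAAAA"}
shared = common_subsequences(proteins, 3)
print(shared)
```

Exact matching of this kind has the weakness the abstract names: it cannot find cross-reactive proteins that lack an identical fragment.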
NASA Astrophysics Data System (ADS)
Kakinoki, Sachiro; Nakayama, Midori; Moritan, Toshiyuki; Yamaoka, Tetsuji
2014-07-01
We developed a microfibrous poly(L-lactic acid) (PLLA) nerve conduit with a three-layered structure to simultaneously enhance nerve regeneration and prevent adhesion of surrounding tissue. The inner layer was composed of PLLA microfibers containing 25% elastin-laminin mimetic protein (AG73-(VPGIG)30), which promotes neurite outgrowth. The thickest, middle layer was constructed of pure PLLA microfibers that impart high mechanical strength to the conduit. Poly(ethylene glycol) (10%) was added to the outer layer to prevent adhesion to the surrounding tissue. AG73-(VPGIG)30, consisting of an elastin-like repetitive sequence (VPGIG)30 and a laminin-derived sequence (RKRLQVQLSIRT: AG73), was biosynthesized using Escherichia coli. The PLLA microfibrous conduits were fabricated using an electrospinning procedure. AG73-(VPGIG)30 was successfully mixed in the PLLA microfibers, and the PLLA/AG73-(VPGIG)30 microfibers were stable under physiological conditions. The PLLA/AG73-(VPGIG)30 microfibers enhanced adhesion and neurite outgrowth of PC12 cells. The electrospun microfibrous conduit with a three-layered structure was implanted to bridge a 2.0-cm gap in the tibial nerve of a rabbit. Two months after implantation, no adhesion of surrounding tissue was observed, and the action potential was slightly improved in the nerve conduit with the PLLA/AG73-(VPGIG)30 inner layer.
Kolpakova, E; Frengen, E; Stokke, T; Olsnes, S
2000-01-01
Acidic fibroblast growth factor (aFGF) intracellular binding protein (FIBP) is a protein found mainly in the nucleus that might be involved in the intracellular function of aFGF. Here we present a comparative analysis of the deduced amino acid sequences of human, murine and Drosophila FIBP analogues and demonstrate that FIBP is an evolutionarily conserved protein. The human gene spans more than 5 kb, comprising ten exons and nine introns, and maps to chromosome 11q13.1. Two slightly different splice variants found in different tissues were isolated and characterized. Sequence analysis of the region surrounding the translation start revealed a CpG island, a classical feature of widely expressed genes. Functional studies of the promoter region with a luciferase reporter system suggested a strong transcriptional activity residing within 600 bp of the 5' flanking region. PMID:11104667
Active site of tripeptidyl peptidase II from human erythrocytes is of the subtilisin type.
Tomkinson, B; Wernstedt, C; Hellman, U; Zetterqvist, O
1987-01-01
The present report presents evidence that the amino acid sequence around the serine of the active site of human tripeptidyl peptidase II is of the subtilisin type. The enzyme from human erythrocytes was covalently labeled at its active site with [3H]diisopropyl fluorophosphate, and the protein was subsequently reduced, alkylated, and digested with trypsin. The labeled tryptic peptides were purified by gel filtration and repeated reversed-phase HPLC, and their amino-terminal sequences were determined. Residue 9 contained the radioactive label and was, therefore, considered to be the active serine residue. The primary structure of the part of the active site (residues 1-10) containing this residue was concluded to be Xaa-Thr-Gln-Leu-Met-Asx-Gly-Thr-Ser-Met. This amino acid sequence is homologous to the sequence surrounding the active serine of the microbial peptidases subtilisin and thermitase. These data demonstrate that human tripeptidyl peptidase II represents a potentially distinct class of human peptidases and raise the question of an evolutionary relationship between the active site of a mammalian peptidase and that of the subtilisin family of serine peptidases. PMID:3313395
Mocz, G.
1995-01-01
Fuzzy cluster analysis has been applied to the 20 amino acids by using 65 physicochemical properties as a basis for classification. The clustering products, the fuzzy sets (i.e., classical sets with associated membership functions), have provided a new measure of amino acid similarities for use in protein folding studies. This work demonstrates that fuzzy sets of simple molecular attributes, when assigned to amino acid residues in a protein's sequence, can predict the secondary structure of the sequence with reasonable accuracy. An approach is presented for discriminating standard folding states, using near-optimum information splitting in half-overlapping segments of the sequence of assigned membership functions. The method is applied to a nonredundant set of 252 proteins and yields approximately 73% matching for correctly predicted and correctly rejected residues with approximately 60% overall success rate for the correctly recognized ones in three folding states: alpha-helix, beta-strand, and coil. The most useful attributes for discriminating these states appear to be related to size, polarity, and thermodynamic factors. Van der Waals volume, apparent average thickness of surrounding molecular free volume, and a measure of dimensionless surface electron density can explain approximately 95% of prediction results. Hydrogen bonding and hydrophobicity indices do not yet enable clear clustering and prediction. PMID:7549882
AFAL: a web service for profiling amino acids surrounding ligands in proteins
NASA Astrophysics Data System (ADS)
Arenas-Salinas, Mauricio; Ortega-Salazar, Samuel; Gonzales-Nilo, Fernando; Pohl, Ehmke; Holmes, David S.; Quatrini, Raquel
2014-11-01
With advancements in crystallographic technology and the increasing wealth of information populating structural databases, there is an increasing need for prediction tools based on spatial information that will support the characterization of proteins and protein-ligand interactions. Herein, a new web service termed amino acid frequency around ligand (AFAL) is presented for determining the types and frequencies of amino acids surrounding ligands within proteins deposited in the Protein Data Bank and for assessing the atoms and atom-ligand distances involved in each interaction (availability: http://structuralbio.utalca.cl/AFAL/index.html). AFAL allows the user to define a wide variety of filtering criteria (protein family, source organism, resolution, sequence redundancy and distance) in order to uncover trends and evolutionary differences in amino acid preferences that define interactions with particular ligands. Results obtained from AFAL provide valuable statistical information about amino acids that may be responsible for establishing particular ligand-protein interactions. The analysis will enable investigators to compare ligand-binding sites of different proteins and to uncover general as well as specific interaction patterns from existing data. Such patterns can be used subsequently to predict ligand binding in proteins that currently have no structural information and to refine the interpretation of existing protein models. The application of AFAL is illustrated by the analysis of proteins interacting with adenosine-5'-triphosphate.
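The kind of count described here can be sketched as a distance filter followed by a tally of residue types. This is not AFAL's implementation, which the abstract does not detail; the function name, the input layout, the 4.0 Å default cutoff and the toy coordinates are all assumptions made for illustration.

```python
import math
from collections import Counter


def residues_near_ligand(protein_atoms, ligand_atoms, cutoff=4.0):
    """Count residue types having at least one atom within `cutoff` of any ligand atom.

    protein_atoms: list of (residue_name, residue_id, (x, y, z))
    ligand_atoms:  list of (x, y, z)
    Each residue is counted once, however many of its atoms are in range.
    """
    close = {}
    for res_name, res_id, xyz in protein_atoms:
        if any(math.dist(xyz, lig) <= cutoff for lig in ligand_atoms):
            close[res_id] = res_name
    return Counter(close.values())


# Hypothetical toy coordinates (in angstroms), not taken from any PDB entry.
protein_atoms = [
    ("LYS", 10, (0.0, 0.0, 0.0)),
    ("LYS", 10, (1.0, 0.0, 0.0)),
    ("GLY", 11, (3.0, 0.0, 0.0)),
    ("ASP", 50, (20.0, 0.0, 0.0)),
]
ligand_atoms = [(2.0, 0.0, 0.0)]
print(residues_near_ligand(protein_atoms, ligand_atoms))
```

Summing such counters over many structures would give the per-ligand amino acid frequencies that the service reports.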
AFAL: a web service for profiling amino acids surrounding ligands in proteins.
Arenas-Salinas, Mauricio; Ortega-Salazar, Samuel; Gonzales-Nilo, Fernando; Pohl, Ehmke; Holmes, David S; Quatrini, Raquel
2014-11-01
With advancements in crystallographic technology and the increasing wealth of information populating structural databases, there is an increasing need for prediction tools based on spatial information that will support the characterization of proteins and protein-ligand interactions. Herein, a new web service termed amino acid frequency around ligand (AFAL) is presented for determining the types and frequencies of amino acids surrounding ligands within proteins deposited in the Protein Data Bank and for assessing the atoms and atom-ligand distances involved in each interaction (availability: http://structuralbio.utalca.cl/AFAL/index.html). AFAL allows the user to define a wide variety of filtering criteria (protein family, source organism, resolution, sequence redundancy and distance) in order to uncover trends and evolutionary differences in amino acid preferences that define interactions with particular ligands. Results obtained from AFAL provide valuable statistical information about amino acids that may be responsible for establishing particular ligand-protein interactions. The analysis will enable investigators to compare ligand-binding sites of different proteins and to uncover general as well as specific interaction patterns from existing data. Such patterns can be used subsequently to predict ligand binding in proteins that currently have no structural information and to refine the interpretation of existing protein models. The application of AFAL is illustrated by the analysis of proteins interacting with adenosine-5'-triphosphate.
Active site of tripeptidyl peptidase II from human erythrocytes is of the subtilisin type
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tomkinson, B.; Wernstedt, C.; Hellman, U.
1987-11-01
The present report presents evidence that the amino acid sequence around the serine of the active site of human tripeptidyl peptidase II is of the subtilisin type. The enzyme from human erythrocytes was covalently labeled at its active site with [3H]diisopropyl fluorophosphate, and the protein was subsequently reduced, alkylated, and digested with trypsin. The labeled tryptic peptides were purified by gel filtration and repeated reversed-phase HPLC, and their amino-terminal sequences were determined. Residue 9 contained the radioactive label and was, therefore, considered to be the active serine residue. The primary structure of the part of the active site (residues 1-10) containing this residue was concluded to be Xaa-Thr-Gln-Leu-Met-Asx-Gly-Thr-Ser-Met. This amino acid sequence is homologous to the sequence surrounding the active serine of the microbial peptidases subtilisin and thermitase. These data demonstrate that human tripeptidyl peptidase II represents a potentially distinct class of human peptidases and raise the question of an evolutionary relationship between the active site of a mammalian peptidase and that of the subtilisin family of serine peptidases.
Musinova, Yana R; Kananykhina, Eugenia Y; Potashnikova, Daria M; Lisitsyna, Olga M; Sheval, Eugene V
2015-01-01
The majority of known nucleolar proteins are freely exchanged between the nucleolus and the surrounding nucleoplasm. One way proteins are retained in the nucleoli is by the presence of specific amino acid sequences, namely nucleolar localization signals (NoLSs). The mechanism by which NoLSs retain proteins inside the nucleoli is still unclear. Here, we present data showing that the charge-dependent (electrostatic) interactions of NoLSs with nucleolar components lead to nucleolar accumulation as follows: (i) known NoLSs are enriched in positively charged amino acids, but the NoLS structure is highly heterogeneous, and it is not possible to identify a consensus sequence for this type of signal; (ii) in two analyzed proteins (NF-κB-inducing kinase and HIV-1 Tat), the NoLS corresponds to a region that is enriched for positively charged amino acid residues; substituting charged amino acids with non-charged ones reduced the nucleolar accumulation in proportion to the charge reduction, and nucleolar accumulation efficiency was strongly correlated with the predicted charge of the tested sequences; and (iii) sequences containing only lysine or arginine residues (which were referred to as imitative NoLSs, or iNoLSs) are accumulated in the nucleoli in a charge-dependent manner. The results of experiments with iNoLSs suggested that charge-dependent accumulation inside the nucleoli was dependent on interactions with nucleolar RNAs. The results of this work are consistent with the hypothesis that nucleolar protein accumulation by NoLSs can be determined by the electrostatic interaction of positively charged regions with nucleolar RNAs rather than by any sequence-specific mechanism. Copyright © 2014 Elsevier B.V. All rights reserved.
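A predicted charge of the kind referred to here can be approximated by counting charged side chains. The abstract does not state how the authors computed charge, so the following is a simplified sketch under stated assumptions: lysine and arginine count as +1, aspartate and glutamate as -1, and histidine, the termini and pH effects are ignored. The function name and test sequences are hypothetical.

```python
# Simplified side-chain charges at roughly neutral pH (assumption: His and termini ignored).
SIDE_CHAIN_CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}


def net_charge(seq):
    """Return the approximate net charge of a peptide given in one-letter codes."""
    return sum(SIDE_CHAIN_CHARGE.get(aa, 0) for aa in seq.upper())


# Hypothetical sequences: a lysine-only stretch and a variant with half the
# lysines replaced by uncharged alanine.
print(net_charge("KKKKKK"))
print(net_charge("KAKAKA"))
```

Under this approximation, replacing charged residues with uncharged ones lowers the score in direct proportion to the number of substitutions, which is the relationship the substitution experiments probe.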
Millot, Benjamin; Montoliu, Lluís; Fontaine, Marie-Louise; Mata, Teresa; Devinoy, Eve
2003-01-01
The upstream regulatory regions of the mouse and rabbit whey acidic protein (WAP) genes have been used extensively to target the efficient expression of foreign genes into the mammary gland of transgenic animals. Therefore both regions have been studied to elucidate fully the mechanisms controlling WAP gene expression. Three DNase I-hypersensitive sites (HSS0, HSS1 and HSS2) have been described upstream of the rabbit WAP gene in the lactating mammary gland and correspond to important regulatory regions. These sites are surrounded by variable chromatin structures during mammary-gland development. In the present study, we describe the upstream sequence of the mouse WAP gene. Analysis of genomic sequences shows that the mouse WAP gene is situated between two widely expressed genes (Cpr2 and Ramp3). We show that the hypersensitive sites found upstream of the rabbit WAP gene are also detected in the mouse WAP gene. Further, they encompass functional signal transducer and activator of transcription 5-binding sites, as has been observed in the rabbit. A new hypersensitive site (HSS3), not specific to the mammary gland, was mapped 8 kb upstream of the rabbit WAP gene. Unlike the three HSSs described above, HSS3 is also detected in the liver, but similar to HSS1, it does not depend on lactogenic hormone treatments during cell culture. The region surrounding HSS3 encompasses a potential matrix attachment region, which is also conserved upstream of the mouse WAP gene and contains a functional transcription factor Ets-1 (E26 transformation-specific-1)-binding site. Finally, we demonstrate for the first time that variations in the chromatin structure are dependent on prolactin alone. PMID:12580766
Brunak, S; Engelbrecht, J
1996-06-01
A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
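The rare-codon cluster hypothesis tested here can be made concrete with a sliding-window count of rare codons along a coding sequence. This is not the neural network analysis the authors used; it is a minimal sketch, and the function name, the window size and the set of codons treated as rare are hypothetical.

```python
def rare_codon_windows(mrna, rare, window=3):
    """Return (first codon index, number of rare codons) for every window
    of `window` consecutive codons in the reading frame starting at position 0."""
    usable = len(mrna) - len(mrna) % 3  # ignore a trailing partial codon
    codons = [mrna[i:i + 3] for i in range(0, usable, 3)]
    return [
        (i, sum(c in rare for c in codons[i:i + window]))
        for i in range(len(codons) - window + 1)
    ]


# Hypothetical toy mRNA; AGG is treated as rare purely for illustration.
mrna = "AUGAGGAGGCUGAGGUAA"
print(rare_codon_windows(mrna, rare={"AGG"}))
```

Windows with high counts would be candidate rare-codon clusters, whose positions could then be compared with the boundaries of helices and sheets.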
Evolution of the arginase fold and functional diversity
Dowling, Daniel P.; Costanzo, Luigi Di; Gennadios, Heather A.; Christianson, David W.
2009-01-01
The large number of protein structures deposited in the Protein Data Bank allows for the identification of novel structural superfamilies based on conservation of fold in addition to conservation of amino acid sequence. Since sequence diverges more rapidly than fold in protein evolution, proteins with little or no significant sequence identity are occasionally observed to adopt similar folds, thereby reflecting unanticipated evolutionary relationships. Here, we review the unique α/β fold first observed in the manganese metalloenzyme rat liver arginase, consisting of a parallel 8 stranded β-sheet surrounded by several helices, and its evolutionary relationship with the zinc-requiring and/or iron-requiring histone deacetylases and acetylpolyamine amidohydrolases. Structural comparisons reveal key features of the core α/β fold that contribute to the divergent metal ion specificity and stoichiometry required for the chemical and biological functions of these enzymes. PMID:18360740
Acid Stress Response Mechanisms of Group B Streptococci
Shabayek, Sarah; Spellerberg, Barbara
2017-01-01
Group B streptococcus (GBS) is a leading cause of neonatal mortality and morbidity in the United States and Europe. It is part of the vaginal microbiota in up to 30% of pregnant women and can be passed on to the newborn through perinatal transmission. GBS has the ability to survive in multiple different host niches. The pathophysiology of this bacterium reveals an outstanding ability to withstand pH fluctuations of the surrounding environments inside the human host. GBS host-pathogen interactions include colonization of the acidic vaginal mucosa, invasion of the neutral human blood or amniotic fluid, breaching of the blood-brain barrier as well as survival within the acidic phagolysosomal compartment of macrophages. However, investigations on GBS responses to acid stress are limited. Technologies such as whole genome sequencing, genome-wide transcription and proteome mapping facilitate large-scale identification of genes and proteins. Mechanisms enabling GBS to cope with acid stress have mainly been studied through these techniques and are summarized in the current review. PMID:28936424
Subcutaneous botryomycosis due to Bibersteinia trehalosi in a Texas Longhorn steer.
Spagnoli, S; Reilly, T J; Calcutt, M J; Fales, W H; Kim, D Y
2012-09-01
A 3-year-old Texas Longhorn steer had a long history of progressive swelling of the soft tissues of the jaw and neck. At necropsy, multifocal to coalescing dermal and subcutaneous pyogranulomas were surrounded by fibrous tissue. Microscopically, the pyogranulomas contained aggregates of gram-negative coccobacilli surrounded by Splendore-Hoeppli material and were separated by bands of fibrovascular tissue (botryomycosis). Phylogenetic analysis of multilocus sequence-typing data revealed that the bacteria recovered in pure culture from swabs of submandibular tissue were most closely related to Bibersteinia [Pasteurella] trehalosi. The bacterial colonies were immunohistochemically reactive with a rabbit polyclonal anti-Pasteurella class C acid phosphatase antibody. Botryomycosis is a pyogranulomatous inflammation caused by a variety of nonbranching, nonfilamentous bacteria that elicit the formation of Splendore-Hoeppli material. This case of botryomycosis is unique for its association with Bibersteinia trehalosi.
Discovery of Escherichia coli CRISPR sequences in an undergraduate laboratory.
Militello, Kevin T; Lazatin, Justine C
2017-05-01
Clustered regularly interspaced short palindromic repeats (CRISPRs) represent a novel type of adaptive immune system found in eubacteria and archaebacteria. CRISPRs have recently generated a lot of attention due to their unique ability to catalog foreign nucleic acids, their ability to destroy foreign nucleic acids in a mechanism that shares some similarity to RNA interference, and the ability to utilize reconstituted CRISPR systems for genome editing in numerous organisms. In order to introduce CRISPR biology into an undergraduate upper-level laboratory, a five-week set of exercises was designed to allow students to examine the CRISPR status of uncharacterized Escherichia coli strains and to allow the discovery of new repeats and spacers. Students started the project by isolating genomic DNA from E. coli and amplifying the iap CRISPR locus using the polymerase chain reaction (PCR). The PCR products were analyzed by Sanger DNA sequencing, and the sequences were examined for the presence of CRISPR repeat sequences. The regions between the repeats, the spacers, were extracted and analyzed with BLASTN searches. Overall, CRISPR loci were sequenced from several previously uncharacterized E. coli strains and one E. coli K-12 strain. Sanger DNA sequencing resulted in the discovery of 36 spacer sequences and their corresponding surrounding repeat sequences. Five of the spacers were homologous to foreign (non-E. coli) DNA. Assessment of the laboratory indicates that improvements were made in the ability of students to answer questions relating to the structure and function of CRISPRs. Future directions of the laboratory are presented and discussed. © 2016 by The International Union of Biochemistry and Molecular Biology, 45(3):262-269, 2017.
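The step of extracting the regions between repeats can be sketched as splitting a sequenced locus on a known repeat. This is an illustration only, assuming every repeat copy matches exactly (real repeats may carry mismatches); the function name and all sequences below are hypothetical and are not the E. coli iap repeat or spacers.

```python
def extract_spacers(locus, repeat):
    """Return the sequences lying between consecutive exact copies of `repeat`.

    The leader before the first repeat and the trailer after the last one
    are discarded, since they are not flanked by repeats on both sides.
    """
    parts = locus.split(repeat)
    return parts[1:-1]


# Hypothetical toy locus: leader, then repeat-spacer units, then trailer.
repeat = "GTGTTCC"
locus = "TTT" + repeat + "AAAC" + repeat + "GGGA" + repeat + "CC"
print(extract_spacers(locus, repeat))
```

Each returned spacer could then be submitted to a BLASTN search, as in the exercise described.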
Barrera, Daniel; Valdecantos, Pablo A; García, E Vanesa; Miceli, Dora C
2012-02-01
The glycoprotein envelope surrounding the Bufo arenarum egg exists in different functional forms. Conversion between types involves proteolysis of specific envelope glycoproteins. When the egg is released from the ovary, the envelope cannot be penetrated by sperm. Conversion to a penetrable state occurs during passage through the pars recta portion of the oviduct, where oviductin, a serine protease with trypsin-like substrate specificity, hydrolyzes two kinds of envelope glycoproteins: gp84 and gp55. The nucleotide sequence of a 3203 bp B. arenarum oviductin cDNA was obtained. The deduced amino acid sequence showed a complete open reading frame encoding 980 amino acids. B. arenarum oviductin is a multi-domain protein with a protease domain at the N-terminal region followed by two CUB domains and, toward the C-terminal region, another protease domain, which lacked an active histidine site, and one CUB domain. Expression of ovochymase 2, the mammalian ortholog of amphibian oviductin, was assayed in the mouse female reproductive tract. Ovochymase 2 mRNA was not detectable in the mouse oviduct, but expression was pronounced in the uterus. The phylogenetic relationship between oviductin and ovochymase 2 opens the possibility of understanding the role of this enzyme in mammalian reproduction.
Scarpeci, Sonia L; Sanchez, Mercedes L; Cabada, Marcelo O
2008-04-01
The egg envelope is an extracellular matrix that surrounds oocytes. In frogs and mammals, a prominent feature of envelope modification following fertilization is the N-terminal proteolysis of the envelope glycoproteins, ZPA [ZP (zona pellucida) A]. It was proposed that ZPA N-terminal proteolysis leads to a conformational change in egg envelope glycoproteins, resulting in the prevention of polyspermy. Bufo arenarum VE (vitelline envelope) is made up of at least four glycoproteins: gp120 (glycoprotein 120), gp75, gp41 and gp38. The aim of the present study was to identify and characterize the baZPA (B. arenarum ZPA homologue). Also, our aim was to evaluate its integrity and functional significance during fertilization. VE components were labelled with FITC in order to study their sperm-binding capacity. The assay showed that gp75, gp41 and gp38 possess sperm-binding activity. We obtained a full-length cDNA of 2062 bp containing one ORF (open reading frame) with a sequence for 687 amino acids. The predicted amino acid sequence had close similarity to that of mammalian ZPA. This result indicates that gp75 is the baZPA. Antibodies raised against an N-terminal sequence recognized baZPA and inhibited sperm-baZPA extracted from VE binding. This protein does not induce the acrosome reaction in homologue sperm. Northern-blot studies indicated that the transcript is exclusively expressed in the ovary. In situ hybridization studies confirmed this and pointed to previtellogenic oocytes and follicle cells surrounding the oocyte as the source of the transcript. baZPA was cleaved during fertilization and the N-terminal peptide fragment remained disulfide bonded to the glycoprotein moiety following proteolysis. From the sequence analysis, it was possible to consider that gp75 is the baZPA. It is expressed by previtellogenic oocytes and follicle cells. Also, it can be considered as a sperm receptor that undergoes N-terminal proteolysis during fertilization. 
The N-terminal peptide could be necessary for sperm binding.
Gibreel, Amera; Sköld, Ola
1999-01-01
The characterization of the genetic basis of sulfonamide resistance in Campylobacter jejuni was attempted. The resistance determinant from a sulfonamide-resistant strain of C. jejuni was cloned and was found to show 42% identity with the folP gene (which codes for dihydropteroate synthase, the target of sulfonamides) of the related bacterium Helicobacter pylori. The sequences of the areas surrounding the folP gene in C. jejuni showed similarity to those of the areas surrounding the corresponding gene in H. pylori. The folP gene of C. jejuni, which mediates the resistance, was observed to show particular features when it was compared to other known folP genes. One of these features is the presence of two pairs of direct repeats (15 and 27 bp) within the coding sequence of the gene. Comparison of the C. jejuni folP genes that mediate susceptibility and resistance revealed the occurrence of mutations that changed four amino acid residues. Resistance of C. jejuni to sulfonamides could be associated with one or several of these four mutational substitutions, which all occurred in the five different resistant isolates studied. The codon for one of these changed amino acids was found to be located in the second direct repeat within the coding sequence of the gene. The change made the repeat perfect. The transformation of both the resistance and the susceptibility variants of the gene into an Escherichia coli folP knockout mutant was found to complement the dihydropteroate synthase deficiency, confirming that the characterized sulfonamide resistance determinant codes for the C. jejuni dihydropteroate synthase enzyme. Kinetic measurements established different affinities of sulfonamide for the dihydropteroate synthase enzyme isolated from the resistant and susceptible strains. In conclusion, sulfonamide resistance in C. jejuni was shown to be associated with mutational changes in the chromosomally located gene for dihydropteroate synthase, the target of sulfonamides. 
PMID:10471557
Human milk is a source of lactic acid bacteria for the infant gut.
Martín, Rocío; Langa, Susana; Reviriego, Carlota; Jiménez, Esther; Marín, María L; Xaus, Jordi; Fernández, Leonides; Rodríguez, Juan M
2003-12-01
To investigate whether human breast milk contains potentially probiotic lactic acid bacteria, and therefore, whether it can be considered a synbiotic food. Study design: Lactic acid bacteria were isolated from milk, mammary areola, and breast skin of eight healthy mothers and oral swabs and feces of their respective breast-fed infants. Some isolates (178 from each mother and newborn pair) were randomly selected and submitted to randomly amplified polymorphic DNA (RAPD) polymerase chain reaction analysis, and those that displayed identical RAPD patterns were identified by 16S rDNA sequencing. Within each mother and newborn pair, some rod-shaped lactic acid bacteria isolated from mammary areola, breast milk, and infant oral swabs and feces displayed identical RAPD profiles. All of them, regardless of the mother and child pair, were identified as Lactobacillus gasseri. Similarly, among coccoid lactic acid bacteria from these different sources, some shared an identical RAPD pattern and were identified as Enterococcus faecium. In contrast, none of the lactic acid bacteria isolated from breast skin shared RAPD profiles with lactic acid bacteria of the other sources. Breast-feeding can be a significant source of lactic acid bacteria to the infant gut. Lactic acid bacteria present in milk may have an endogenous origin and may not be the result of contamination from the surrounding breast skin.
Baum, B; Richter, B; Reifinger, M; Klang, A; Finnberg, C; Loncaric, I; Spergser, J; Eisenberg, T; Künzel, F; Preis, S; Pantchev, N; Rütgen, B; Guija de Arespacochaga, A; Hewicker-Trautwein, M
2015-01-01
One ferret (Mustela putorius furo) from Finland and two ferrets from Austria, aged 1-4.5 years and of both genders, were presented with pyogranulomatous subcutaneous inflammation affecting the inguinal, preputial and femoral regions, respectively. Histologically, microorganisms were detected within the lesions. The organisms had a capsule that stained positively by the periodic acid-Schiff reaction. Pseudomonas spp. were cultured from the lesions in two cases. In the third case, electron microscopy revealed a prokaryotic organism surrounded by an electron lucent matrix. 16S rRNA gene sequencing showed highest sequence homology to Pseudomonas luteola in all three cases. In combination with recent reports of pleuropneumonia in ferrets due to P. luteola infection, these cases might indicate a predisposition of ferrets for infection by these bacteria. Copyright © 2015 Elsevier Ltd. All rights reserved.
Lucas, J.N.; Straume, T.; Bogen, K.T.
1998-03-24
A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.
1998-01-01
A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.
Method for identifying and quantifying nucleic acid sequence aberrations
Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.
1998-01-01
A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.
Method for identifying and quantifying nucleic acid sequence aberrations
Lucas, J.N.; Straume, T.; Bogen, K.T.
1998-07-21
A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.
Nanoscale Bio-engineering Solutions for Space Exploration: The Nanopore Sequencer
NASA Technical Reports Server (NTRS)
Stolc, Viktor; Cozmuta, Ioana
2004-01-01
Characterization of biological systems at the molecular level and extraction of essential information for nano-engineering design to guide the nano-fabrication of solid-state sensors and molecular identification devices is a computational challenge. The alpha hemolysin protein ion channel is used as a model system for structural analysis of nucleic acids like DNA. Applied voltage draws a DNA strand and surrounding ionic solution through the biological nanopore. The subunits in the DNA strand block ion flow by differing amounts. Atomistic scale simulations are employed using NASA supercomputers to study DNA translocation, with the aim to enhance single DNA subunit identification. Compared to protein channels, solid-state nanopores offer a better temporal control of the translocation of DNA and the possibility to easily tune its chemistry to increase the signal resolution. Potential applications for NASA missions, besides real-time genome sequencing include astronaut health, life detection and decoding of various genomes.
Nanoscale Bioengineering Solutions for Space Exploration: The Nanopore Sequencer
NASA Technical Reports Server (NTRS)
Cozmuta, Ioana; Stolc, Viktor
2005-01-01
Characterization of biological systems at the molecular level and extraction of essential information for nano-engineering design to guide the nano-fabrication of solid-state sensors and molecular identification devices is a computational challenge. The alpha hemolysin protein ion channel is used as a model system for structural analysis of nucleic acids like DNA. Applied voltage draws a DNA strand and surrounding ionic solution through the biological nanopore. The subunits in the DNA strand block ion flow by differing amounts. Atomistic scale simulations are employed using NASA supercomputers to study DNA translocation, with the aim to enhance single DNA subunit identification. Compared to protein channels, solid-state nanopores offer a better temporal control of the translocation of DNA and the possibility to easily tune its chemistry to increase the signal resolution. Potential applications for NASA missions, besides real-time genome sequencing, include astronaut health, life detection and decoding of various genomes. http://phenomrph.arc.nasa.gov/index.php
Viruses in diarrhoeic dogs include novel kobuviruses and sapoviruses.
Li, Linlin; Pesavento, Patricia A; Shan, Tongling; Leutenegger, Christian M; Wang, Chunlin; Delwart, Eric
2011-11-01
The close interactions of dogs with humans and surrounding wildlife provide frequent opportunities for cross-species virus transmissions. In order to initiate an unbiased characterization of the eukaryotic viruses in the gut of dogs, this study used deep sequencing of partially purified viral capsid-protected nucleic acids from the faeces of 18 diarrhoeic dogs. Known canine parvoviruses, coronaviruses and rotaviruses were identified, and the genomes of the first reported canine kobuvirus and sapovirus were characterized. Canine kobuvirus, the first sequenced canine picornavirus and the closest genetic relative of the diarrhoea-causing human Aichi virus, was detected at high frequency in the faeces of both healthy and diarrhoeic dogs. Canine sapovirus constituted a novel genogroup within the genus Sapovirus, a group of viruses also associated with human and animal diarrhoea. These results highlight the high frequency of new virus detection possible even in extensively studied animal species using metagenomics approaches, and provide viral genomes for further disease-association studies.
Yokoo, Nozomi; Togashi, Takanari; Umetsu, Mitsuo; Tsumoto, Kouhei; Hattori, Takamitsu; Nakanishi, Takeshi; Ohara, Satoshi; Takami, Seiichi; Naka, Takashi; Abe, Hiroya; Kumagai, Izumi; Adschiri, Tadafumi
2010-01-14
Using an artificial peptide library, we have identified a peptide with affinity for ZnO materials that could be used to selectively accumulate ZnO particles on polypropylene-gold plates. In this study, we fused recombinant green fluorescent protein (GFP) with this ZnO-binding peptide (ZnOBP) and then selectively immobilized the fused protein on ZnO particles. We determined an appropriate condition for selective immobilization of recombinant GFP, and the ZnO-binding function of ZnOBP-fused GFP was examined by elongating the ZnOBP tag from a single amino acid to the intact sequence. The fusion of ZnOBP with GFP enabled specific adsorption of GFP on ZnO substrates in an appropriate solution, and thermodynamic studies showed a predominantly enthalpy-dependent electrostatic interaction between ZnOBP and the ZnO surface. The ZnOBP's binding affinity for the ZnO surface increased first in terms of material selectivity and then in terms of high affinity as the GFP-fused peptide was elongated from a single amino acid to intact ZnOBP. We concluded that the enthalpy-dependent interaction between ZnOBP and ZnO was influenced by the presence of not only charged amino acids but also their surrounding residues in the ZnOBP sequence.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Yanfeng; Zheng, Yi; Qin, Ling
Beta-hydroxyacid dehydrogenase (β-HAD) genes have been identified in all sequenced genomes of eukaryotes and prokaryotes. Their gene products catalyze the NAD+- or NADP+-dependent oxidation of various β-hydroxy acid substrates into their corresponding semialdehyde. In many fungal and bacterial genomes, multiple β-HAD genes are observed, leading to the hypothesis that these gene products may have unique, uncharacterized metabolic roles specific to their species. The genomes of Geobacter sulfurreducens and Geobacter metallireducens each contain two potential β-HAD genes. The protein sequences of one pair of these genes, Gs-βHAD (Q74DE4) and Gm-βHAD (Q39R98), have 65% sequence identity and 77% sequence similarity with each other. Both proteins reduce succinic semialdehyde, a metabolite of the GABA shunt. To further explore the structural and functional characteristics of these two β-HADs with a potentially unique substrate specificity, crystal structures for Gs-βHAD and Gm-βHAD in complex with NADP+ were determined to a resolution of 1.89 Å and 2.07 Å, respectively. The structures of both proteins are similar, composed of 14 α-helices and nine β-strands organized into two domains. Domain One (1-165) adopts a typical Rossmann fold composed of two α/β units: a six-strand parallel β-sheet surrounded by six α-helices (α1 – α6) followed by a mixed three-strand β-sheet surrounded by two α-helices (α7 and α8). Domain Two (166-287) is composed of a bundle of seven α-helices (α9 – α14). Four functional regions conserved in all β-HADs are spatially located near each other at the interdomain cleft in both Gs-βHAD and Gm-βHAD with a buried molecule of NADP+. The structural features of Gs-βHAD and Gm-βHAD are described in relation to the four conserved consensus sequences characteristic of β-HADs and the potential biochemical importance of these enzymes as an alternative pathway for the degradation of succinic semialdehyde.
Chen, C L; Lampe, D J; Robertson, H M; Nardi, J B
1997-01-01
A cell surface protein (3B11) is differentially expressed in the embryonic labial segment of Manduca as two circular monolayers of epithelial cells invaginate and segregate from surrounding epithelial cells. The cells that invaginate and preferentially express 3B11 represent the presumptive prothoracic glands. These cells continue to express protein 3B11 as they rearrange to form first a three-dimensional aggregate and later anastomosing filaments of cells. In the differentiated prothoracic gland, expression of 3B11 is restricted to sites of cell-cell contact. Cloning and sequencing of the cDNA for protein 3B11 revealed that this protein is the Manduca counterpart of Drosophila neuroglian and mouse L1. These surface proteins are known to function as adhesion/recognition molecules during development. Manduca neuroglian shares 58 and 31% identity respectively with the Drosophila and mouse proteins and has a cytoplasmic domain of over 100 amino acids.
Kavianpour, Hamidreza; Vasighi, Mahdi
2017-02-01
Knowledge of the cellular attributes of proteins plays an important role in pharmacy, medical science and molecular biology. These attributes are closely correlated with the function and three-dimensional structure of proteins. Knowledge of protein structural class is used by various methods to better understand protein functionality and folding patterns. Computational methods and intelligent systems can play an important role in the structural classification of proteins. Most protein sequences are stored in databanks as character strings, and a numerical representation is essential for applying machine learning methods. In this work, a binary representation of protein sequences is introduced based on reduced amino acid alphabets according to a surrounding hydrophobicity index. Many important features hidden in these long binary sequences can be clearly displayed through their cellular automata images. The features extracted from these images are used to build a classification model with a support vector machine. Compared with previous studies on several benchmark datasets, the promising classification rates obtained by tenfold cross-validation imply that the current approach can help reveal some inherent features deeply hidden in protein sequences and improve the quality of protein structural class prediction.
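The encoding step described in the abstract above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the hydrophobic residue set, the function names and the choice of elementary rule 90 are all assumptions made for illustration, since the abstract does not give the actual reduced alphabet or automaton rule. The sketch maps a sequence to 0/1 values and evolves that row with an elementary cellular automaton to produce an image-like grid from which features could later be extracted.

```python
# Hypothetical two-class reduced alphabet: 1 = hydrophobic, 0 = everything else.
# This grouping is an illustrative assumption, not the index used in the paper.
HYDROPHOBIC = set("AVLIMFWC")


def encode(seq):
    """Map a protein sequence to a list of 0/1 values."""
    return [1 if aa in HYDROPHOBIC else 0 for aa in seq.upper()]


def ca_step(cells, rule=90):
    """Apply one step of an elementary cellular automaton (periodic boundaries).

    The rule number is an arbitrary choice for this sketch.
    """
    n = len(cells)
    out = []
    for i in range(n):
        left, centre, right = cells[i - 1], cells[i], cells[(i + 1) % n]
        idx = (left << 2) | (centre << 1) | right  # neighbourhood as a 3-bit number
        out.append((rule >> idx) & 1)              # look up the matching bit of the rule
    return out


def ca_image(seq, steps=10, rule=90):
    """Return the initial binary row plus `steps` evolved rows."""
    rows = [encode(seq)]
    for _ in range(steps):
        rows.append(ca_step(rows[-1], rule))
    return rows
```

Calling `ca_image("MKVLA", steps=3)` yields four rows of five cells each; in a full pipeline, texture or statistical features of such a grid would be passed to a classifier such as a support vector machine.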
Method for isolating chromosomal DNA in preparation for hybridization in suspension
Lucas, Joe N.
2000-01-01
A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. Chromosomal DNA in a sample containing cell debris is prepared for hybridization in suspension by treating the mixture with RNase. The treated DNA can also be fixed prior to hybridization.
Gillot, Guillaume; Jany, Jean-Luc; Dominguez-Santos, Rebeca; Poirier, Elisabeth; Debaets, Stella; Hidalgo, Pedro I; Ullán, Ricardo V; Coton, Emmanuel; Coton, Monika
2017-04-01
Mycophenolic acid (MPA) is a secondary metabolite produced by various Penicillium species including Penicillium roqueforti. The MPA biosynthetic pathway was recently described in Penicillium brevicompactum. In this study, an in silico analysis of the P. roqueforti FM164 genome sequence localized a 23.5-kb putative MPA gene cluster. The cluster contains seven genes putatively coding seven proteins (MpaA, MpaB, MpaC, MpaDE, MpaF, MpaG, MpaH) and is highly similar (i.e. gene synteny, sequence homology) to the P. brevicompactum cluster. To confirm the involvement of this gene cluster in MPA biosynthesis, gene silencing using RNA interference targeting mpaC, encoding a putative polyketide synthase, was performed in a high MPA-producing P. roqueforti strain (F43-1). In the obtained transformants, decreased MPA production (measured by LC-Q-TOF/MS) was correlated with reduced mpaC gene expression measured by Q-RT-PCR. In parallel, mycotoxin quantification on multiple P. roqueforti strains suggested strain-dependent MPA production. Thus, the entire MPA cluster was sequenced for P. roqueforti strains with contrasting MPA production, and a 174-bp deletion in mpaC was observed in low MPA producers. PCRs directed towards the deleted region among 55 strains showed an excellent correlation with MPA quantification. Our results indicated the clear involvement of the mpaC gene as well as the surrounding cluster in P. roqueforti MPA biosynthesis. Copyright © 2016 Elsevier Ltd. All rights reserved.
[Microeukaryotic biodiversity in the waste ore samples surrounding an acid mine drainage lake].
Li, Si-Yuan; Hao, Chun-Bo; Wang, Li-Hua; Lü, Zheng; Zhang, Li-Na; Liu, Ying; Feng, Chuan-Ping
2013-10-01
Abandoned mineral samples were collected in an acid mine drainage area in Anhui Province. Molecular ecological methods were used to construct 18S rDNA clone libraries after analyzing the main physicochemical parameters, and the microeukaryotic diversity and community structure in the acid mine drainage area were then studied. The results showed that the region was strongly acidic (pH <3), and the concentrations of Fe, SO₄²⁻, P and NO₃⁻-N showed the same trend, all higher in the bare waste ore samples PD and 1 M than in the vegetation-covered samples LW and XC. Four eukaryotic phyla were detected in the abandoned mineral samples: Ascomycota, Basidiomycota, Glomeromycota and Arthropoda. Glomeromycota can form an obligate symbiotic relationship with plants, and it was a key factor in the adaptation of early plants to the terrestrial environment. The biodiversity of the vegetation-covered samples LW and XC, which contained Glomeromycota, was much higher than that of the bare abandoned rock samples PD and 1 M. Moreover, many sequences in the libraries were closely related to isolated strains that are tolerant to low pH and heavy metals, such as Penicillium purpurogenum, Chaetothyriales sp. and Staninwardia suttonii.
Nucleic acid arrays and methods of synthesis
Sabanayagam, Chandran R.; Sano, Takeshi; Misasi, John; Hatch, Anson; Cantor, Charles
2001-01-01
The present invention generally relates to high density nucleic acid arrays and methods of synthesizing nucleic acid sequences on a solid surface. Specifically, the present invention contemplates the use of stabilized nucleic acid primer sequences immobilized on solid surfaces, and circular nucleic acid sequence templates combined with the use of isothermal rolling circle amplification to thereby increase nucleic acid sequence concentrations in a sample or on an array of nucleic acid sequences.
Targeted Delivery of Drugs to Brain Tumors (LBNL Summer Lecture Series)
Forte, Trudy [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Life Sciences Division; Children's Hospital Oakland Research Inst. (CHORI), Oakland, CA (United States)
2017-12-15
Summer Lecture Series 2007: Trudy Forte of Berkeley Lab's Life Sciences Division will discuss her work developing nano-sized low-density lipoprotein (LDL) particles that can be used as a safe and effective means of delivering anticancer drugs to brain tumors, particularly glioblastoma multiforme. This is the most common malignant brain tumor in adults and one of the deadliest forms of cancer. Her research team found that the synthetic LDL particles can target and kill such tumor cells in vitro. The nanoparticles are composed of a lipid core surrounded by a peptide. The peptide contains an amino acid sequence that recognizes the LDL receptor, and the lipid core has the ability to accumulate anti-cancer drugs.
Rapid NMR Assignments of Proteins by Using Optimized Combinatorial Selective Unlabeling.
Dubey, Abhinav; Kadumuri, Rajashekar Varma; Jaipuria, Garima; Vadrevu, Ramakrishna; Atreya, Hanudatta S
2016-02-15
A new approach for rapid resonance assignments in proteins based on amino acid selective unlabeling is presented. The method involves choosing a set of multiple amino acid types for selective unlabeling and identifying specific tripeptides surrounding the labeled residues from specific 2D NMR spectra in a combinatorial manner. The methodology directly yields sequence-specific assignments, without requiring a contiguous stretch of amino acid residues to be linked, and is applicable to deuterated proteins. We show that a 2D [¹⁵N,¹H] HSQC spectrum with two 2D spectra can result in ∼50 % assignments. The methodology was applied to two proteins: an intrinsically disordered protein (12 kDa) and the 29 kDa (268 residue) α-subunit of Escherichia coli tryptophan synthase, which presents a challenging case with spectral overlaps and missing peaks. The method can augment existing approaches and will be useful for applications such as identifying active-site residues involved in ligand binding, phosphorylation, or protein-protein interactions, even prior to complete resonance assignments. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Zheng, Lu-Lu; Niu, Shen; Hao, Pei; Feng, KaiYan; Cai, Yu-Dong; Li, Yixue
2011-01-01
Pyrrolidone carboxylic acid (PCA) is formed during a common post-translational modification (PTM) of extracellular and multi-pass membrane proteins. In this study, we developed a new predictor of PCA modification sites based on maximum relevance minimum redundancy (mRMR) and incremental feature selection (IFS). We incorporated 727 features belonging to 7 kinds of protein properties to predict the modification sites, including sequence conservation, residual disorder, amino acid factor, secondary structure and solvent accessibility, gain/loss of amino acid during evolution, propensity of amino acid to be conserved at protein-protein interface and protein surface, and deviation of side chain carbon atom number. Among these 727 features, 244 features were selected by mRMR and IFS as the optimized features for the prediction, with which the prediction model achieved a maximum Matthews correlation coefficient (MCC) of 0.7812. Feature analysis showed that all feature types contributed to the modification process. Further site-specific feature analysis showed that the features derived from PCA's surrounding sites contributed more to the determination of PCA sites than other sites. The detailed feature analysis in this paper might provide important clues for understanding the mechanism of PCA formation and guide relevant experimental validations. PMID:22174779
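The MCC reported in the abstract above is the Matthews correlation coefficient, computed from the four cells of a binary confusion matrix as (TP·TN − FP·FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)). A minimal Python sketch follows; the function name and the convention of returning 0.0 when the denominator vanishes are choices made here, not details taken from the paper.

```python
import math


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient for a binary confusion matrix.

    Returns 0.0 when any marginal total is zero (a common convention,
    adopted here as an assumption).
    """
    numerator = tp * tn - fp * fn
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return numerator / denominator if denominator else 0.0
```

The value ranges from -1 (every prediction wrong) through 0 (no better than chance) to 1 (every prediction correct), which makes it a convenient score to track while features are added one at a time in an incremental feature selection loop.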
5-Aminolevulinic Acid Accumulation in a Cerebral Infarction Mimicking High-Grade Glioma.
Behling, Felix; Hennersdorf, Florian; Bornemann, Antje; Tatagiba, Marcos; Skardelly, Marco
2016-08-01
5-Aminolevulinic acid (5-ALA) has become an integral part of the neurosurgical treatment of malignant glioma. Over time, several other tumor entities have been identified to metabolize 5-ALA and show a similar fluorescence pattern during surgical resection. This case report is the first description of 5-ALA accumulation in postischemic cerebral tissue. This evidence questions the assumption that 5-ALA accumulation in glioma is exclusively attributed to tumor infiltration. Instead, 5-ALA accumulation can also occur beyond the tumor borders and may be partially ascribed to inflammatory changes in the surrounding brain tissue. A 64-year-old woman presented with episodes of apraxia and a ring-enhancing lesion in postcontrast T1-weighted magnetic resonance sequences suggestive of high-grade glioma. Strong fluorescence was observed during 5-ALA-guided resection. However, although the frozen section was inconclusive, the final histopathologic examination revealed a stage II cerebral infarction. 5-ALA accumulation in postischemic cerebral tissue should be considered for intended supramarginal resections near eloquent brain regions. Therefore, sufficient preoperative imaging should regularly include magnetic resonance imaging spectroscopy and perfusion sequences to ascertain the proper diagnosis. Moreover, further research is warranted to determine the role of 5-ALA accumulation in postischemic and inflammatory brain tissue. Copyright © 2016 Elsevier Inc. All rights reserved.
Johnson, Paul A; Tencate, James A; Le Bas, Pierre-Yves; Guyer, Robert; Vu, Cung Khac; Skelt, Christopher
2013-10-08
In some aspects of the disclosure, a method and an apparatus are disclosed for investigating material surrounding a borehole. The method includes generating within a borehole an intermittent low frequency vibration that propagates as a tube wave longitudinally to the borehole and induces a nonlinear response in one or more features in the material that are substantially perpendicular to a longitudinal axis of the borehole; generating within the borehole a sequence of high frequency pulses directed such that they travel longitudinally to the borehole within the surrounding material; and receiving, at one or more receivers positionable in the borehole, a signal that includes components from the low frequency vibration and the sequence of high frequency pulses during intermittent generation of the low frequency vibration, to investigate the material surrounding the borehole.
López-Bueno, Alberto; Rubio, Mari-Paz; Bryant, Nathan; McKenna, Robert; Agbandje-McKenna, Mavis; Almendral, José M.
2006-01-01
The role of receptor recognition in the emergence of virulent viruses was investigated in the infection of severe combined immunodeficient (SCID) mice by the apathogenic prototype strain of the parvovirus minute virus of mice (MVMp). Genetic analysis of isolated MVMp viral clones (n = 48) emerging in mice, including lethal variants, showed only one of three single changes (V325M, I362S, or K368R) in the common sequence of the two capsid proteins. As was found for the parental isolates, the constructed recombinant viruses harboring the I362S or the K368R single substitutions in the capsid sequence, or mutations at both sites, showed a large-plaque phenotype and lower avidity than the wild type for cells in the cytotoxic interaction with two permissive fibroblast cell lines in vitro and caused a lethal disease in SCID mice when inoculated by the natural oronasal route. Significantly, the productive adsorption of MVMp variants carrying any of the three mutations selected through parallel evolution in mice showed higher sensitivity to the treatment of cells by neuraminidase than that of the wild type, indicating a lower affinity of the viral particle for the sialic acid component of the receptor. Consistent with this, the X-ray crystal structure of the MVMp capsids soaked with sialic acid (N-acetyl neuraminic acid) showed the sugar allocated in the depression at the twofold axis of symmetry (termed the dimple), immediately adjacent to residues I362 and K368, which are located on the wall of the dimple, and approximately 22 Å away from V325 in a threefold-related monomer. This is the first reported crystal structure identifying an infectious receptor attachment site on a parvovirus capsid. We conclude that the affinity of the interactions of sialic-acid-containing receptors with residues at or surrounding the dimple can evolutionarily regulate parvovirus pathogenicity and adaptation to new hosts. PMID:16415031
Sheng, Zizhang; Schramm, Chaim A; Kong, Rui; Mullikin, James C; Mascola, John R; Kwong, Peter D; Shapiro, Lawrence
2017-01-01
Somatic hypermutation (SHM) plays a critical role in the maturation of antibodies, optimizing recognition initiated by recombination of V(D)J genes. Previous studies have shown that the propensity to mutate is modulated by the context of surrounding nucleotides and that SHM machinery generates biased substitutions. To investigate the intrinsic mutation frequency and substitution bias of SHMs at the amino acid level, we analyzed functional human antibody repertoires and developed mGSSP (method for gene-specific substitution profile), a method to construct amino acid substitution profiles from next-generation sequencing-determined B cell transcripts. We demonstrated that these gene-specific substitution profiles (GSSPs) are unique to each V gene and highly consistent between donors. We also showed that the GSSPs constructed from functional antibody repertoires are highly similar to those constructed from antibody sequences amplified from non-productively rearranged passenger alleles, which do not undergo functional selection. This suggests that the types and frequencies, or mutational space, of a majority of amino acid changes sampled by the SHM machinery are well captured by GSSPs. We further observed the rates of mutational exchange between some amino acids to be both asymmetric and context dependent and to correlate weakly with their biochemical properties. GSSPs provide an improved, position-dependent alternative to standard substitution matrices, and can be used to develop software for accurately modeling the SHM process. GSSPs can also be used for predicting the amino acid mutational space available for antigen-driven selection and for understanding factors modulating the maturation pathways of antibody lineages in a gene-specific context.
The mGSSP method can be used to build, compare, and plot GSSPs; we report the GSSPs constructed for 69 common human V genes (DOI: 10.6084/m9.figshare.3511083) and provide high-resolution logo plots for each (DOI: 10.6084/m9.figshare.3511085).
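The idea of a position-dependent substitution profile can be illustrated with a minimal sketch. This is not the mGSSP implementation; it assumes the reads are already aligned to a germline sequence of equal length, and the function name `substitution_profile` and the toy sequences are hypothetical.

```python
from collections import Counter, defaultdict


def substitution_profile(germline, observed):
    """Per-position frequencies of amino acid changes away from the germline.

    Assumption: every observed sequence is pre-aligned to the germline and
    has the same length; '-' marks a gap and is ignored.
    Returns {position: {amino_acid: fraction of substitutions at that position}}.
    """
    counts = defaultdict(Counter)
    for seq in observed:
        if len(seq) != len(germline):
            raise ValueError("sequences must be pre-aligned to the germline")
        for pos, (g, o) in enumerate(zip(germline, seq)):
            if o != g and o != "-":  # count only real substitutions
                counts[pos][o] += 1
    profile = {}
    for pos, counter in counts.items():
        total = sum(counter.values())
        profile[pos] = {aa: n / total for aa, n in counter.items()}
    return profile


# Toy data (hypothetical, not taken from any repertoire).
germline = "QVQLV"
reads = ["QVQLV", "QIQLV", "QIQLA", "QLQLV"]
profile = substitution_profile(germline, reads)
```

Positions that never mutate are absent from the result, so the profile stays sparse; a real pipeline would also need read filtering and germline assignment, which are outside this sketch.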
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.
Code of Federal Regulations, 2011 CFR
2011-07-01
... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...
A Library of the Nanoscale Self-Assembly of Amino Acids on Metal Surfaces
NASA Astrophysics Data System (ADS)
Iski, Erin; Yitamben, Esmeralda; Guisinger, Nathan
2012-02-01
The investigation of the hierarchical self-assembly of amino acids on surfaces represents a unique test-bed for the origin of enantio-favoritism in biology and the transmission of chirality from single molecules to complete surface layers. These chiral systems, in particular the assembly of isoleucine and alanine on Cu(111), represent a direct link to the understanding of certain biological processes, specifically the preference for some amino acids to form alpha helices vs. beta-pleated sheets in the secondary structure of proteins. Low temperature, ultra-high vacuum, scanning tunneling microscopy (LT UHV-STM) is used to study the hierarchical self-assembly of different amino acids on a Cu(111) single crystal in an effort to build a library of their two-dimensional structure with molecular-scale resolution for enhanced protein and peptide studies. Both enantiopure and racemic structures are studied in order to elucidate how chirality can affect the self-assembly of the amino acids. In some cases, density functional theory (DFT) models can be used to confirm the experimental structure. The advent of such a library with fully resolved, two-dimensional structures at different molecular coverages would address some of the complex questions surrounding the preferential formation of alpha helices vs. beta-pleated sheets in proteins and lead to a better understanding of the key role played by these amino acids in protein sequencing.
Solid phase sequencing of double-stranded nucleic acids
Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.
2002-01-01
This invention relates to methods for detecting and sequencing target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.
Lee, Dong-Hun; Torchetti, Mia Kim; Killian, Mary Lea; Swayne, David E
2017-07-01
In mid-January 2016, an outbreak of H7N8 high-pathogenicity avian influenza virus (HPAIV) in commercial turkeys occurred in Indiana. Surveillance within the 10 km control zone identified H7N8 low-pathogenicity avian influenza virus (LPAIV) in nine surrounding turkey flocks but no other HPAIV-affected premises. We sequenced four of the H7N8 HPAIV isolated from the single farm and nine LPAIV identified during control zone surveillance. Evaluation included phylogenetic network analysis, which indicated close relatedness across the HPAIV and LPAIV and that the progenitor H7N8 LPAIV spread among the affected turkey farms in Indiana, followed by spontaneous mutation to HPAIV on a single premises through acquisition of three basic amino acids at the hemagglutinin cleavage site. Deep sequencing of the available viruses failed to identify subpopulations in either the HPAIV or LPAIV, suggesting that mutation to HPAIV likely occurred on a single farm and that the HPAIV did not spread to epidemiologically linked LPAIV-affected farms. Published by Elsevier Inc.
Solid phase sequencing of biopolymers
Cantor, Charles; Koster, Hubert
2010-09-28
This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include DNA or RNA in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.
Flanking signal and mature peptide residues influence signal peptide cleavage
Choo, Khar Heng; Ranganathan, Shoba
2008-01-01
Background: Signal peptides (SPs) mediate the targeting of secretory precursor proteins to the correct subcellular compartments in prokaryotes and eukaryotes. Identifying these transient peptides is crucial to the medical, food and beverage, and biotechnology industries, yet our understanding of these peptides remains limited. This paper examines the most common type of signal peptides cleavable by the endoprotease signal peptidase I (SPase I), and the residues flanking the cleavage sites of three groups of signal peptide sequences, namely (i) eukaryotes (Euk), (ii) Gram-positive (Gram+) bacteria, and (iii) Gram-negative (Gram-) bacteria. Results: In this study, 2352 secretory peptide sequences from a variety of organisms with amino-terminal SPs are extracted from the manually curated SPdb database for analysis based on physicochemical properties such as pI, aliphatic index, GRAVY score, hydrophobicity, net charge and position-specific residue preferences. Our findings show that the three groups share several similarities in general, but they display distinctive features upon examination in terms of their amino acid compositions and frequencies, and various physico-chemical properties. Thus, analysis or prediction of their sequences should be separated and treated as distinct groups. Conclusion: We conclude that the peptide segment recognized by SPase I extends to the start of the mature protein to a limited extent, upon our survey of the amino acid residues surrounding the cleavage processing site. These flanking residues possibly influence the cleavage processing and contribute to non-canonical cleavage sites. Our findings are applicable in defining more accurate prediction tools for recognition and identification of cleavage site of SPs. PMID:19091014
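Two of the physicochemical properties named above can be sketched in a few lines. The GRAVY score is the mean Kyte-Doolittle hydropathy over all residues. The net-charge rule used here (+1 for K/R, -1 for D/E, ignoring histidine, termini and pH) is a deliberately crude assumption and not necessarily the calculation used in the study; the example peptide is hypothetical.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(seq):
    """Grand average of hydropathy: mean Kyte-Doolittle value per residue."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


def net_charge(seq):
    """Crude integer net charge (assumption: K/R = +1, D/E = -1, rest = 0)."""
    positive = sum(1 for aa in seq if aa in "KR")
    negative = sum(1 for aa in seq if aa in "DE")
    return positive - negative


# Hypothetical peptide with a basic start and a hydrophobic stretch.
peptide = "MKKLLA"
peptide_gravy = gravy(peptide)
peptide_charge = net_charge(peptide)
```

A positive GRAVY value indicates an overall hydrophobic peptide; comparing such values between the Euk, Gram+ and Gram- groups would additionally require the curated sequences themselves.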
Conservation of Fold and Topology of Functional Elements in Thiamin Pyrophosphate Enzymes
NASA Technical Reports Server (NTRS)
Dominiak, P.; Ciszak, E. M.
2005-01-01
Thiamin pyrophosphate (TPP)-dependent enzymes are a highly divergent family of proteins binding both TPP and metal ions. They perform decarboxylation of α-ketoacids and of a common -(O=)C-C(OH)- fragment of α-hydroxyaldehydes. Prior to knowledge of three-dimensional structures of these enzymes, the GDGY25-30NN sequence was used to identify these enzymes. Subsequently, a number of structural studies on those enzymes revealed multi-subunit organization and the features of the two duplicate cofactor binding sites. Analyzing the structures of 44 structurally known enzymes, we found that the common structure of these enzymes is reduced to 180-220 amino acid long fragments of two PP and two PYR domains that form the [PP:PYR]2 binding center of two cofactor molecules. The structures of PP and PYR are arranged in a similar fold consisting of a six-stranded β-sheet with triplets of helices on both sides. Residues surrounding the cofactors are not strictly conserved, but they provide the same interatomic contacts required for the catalytic functions that these enzymes perform while maintaining interactive structural integrity. These structural and functional amino acids are topological counterparts located in the same positions of the conserved fold of sets of PP and PYR domains. Additional parallels include short fragments of sequences that link these amino acids to the fold and function. This report on the structural commonalities amongst TPP-dependent enzymes is thought to contribute new approaches to annotation that may assist in advancing the functional proteomics of TPP-dependent enzymes, and trace their complexity within evolutionary context.
Detection of nucleic acid sequences by invader-directed cleavage
Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert
1999-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge.
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2011 CFR
2011-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2013 CFR
2013-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2012 CFR
2012-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2010 CFR
2010-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2014 CFR
2014-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
Pyrin gene and mutants thereof, which cause familial Mediterranean fever
Kastner, Daniel L [Bethesda, MD; Aksentijevich, Ivona [Bethesda, MD; Centola, Michael [Tacoma Park, MD; Deng, Zuoming [Gaithersburg, MD; Sood, Ramen [Rockville, MD; Collins, Francis S [Rockville, MD; Blake, Trevor [Laytonsville, MD; Liu, P Paul [Ellicott City, MD; Fischel-Ghodsian, Nathan [Los Angeles, CA; Gumucio, Deborah L [Ann Arbor, MI; Richards, Robert I [North Adelaide, AU; Ricke, Darrell O [San Diego, CA; Doggett, Norman A [Santa Cruz, NM; Pras, Mordechai [Tel-Hashomer, IL
2003-09-30
The invention provides the nucleic acid sequence encoding the protein associated with familial Mediterranean fever (FMF). The cDNA sequence is designated as MEFV. The invention is also directed towards fragments of the DNA sequence, as well as the corresponding sequence for the RNA transcript and fragments thereof. Another aspect of the invention provides the amino acid sequence for a protein (pyrin) associated with FMF. The invention is directed towards the full-length amino acid sequence, fusion proteins containing the amino acid sequence, and fragments thereof. The invention is also directed towards mutants of the nucleic acid and amino acid sequences associated with FMF. In particular, the invention discloses three missense mutations, clustered within about 40 to 50 amino acids, in the highly conserved rfp (B30.2) domain at the C-terminus of the protein. These mutants include M680I, M694V, K695R, and V726A. Additionally, the invention includes methods for diagnosing a patient at risk for having FMF and kits therefor.
Feng, You; Hadjikyriacou, Andrea; Clarke, Steven G
2014-11-21
Protein arginine methyltransferase 7 (PRMT7) methylates arginine residues on various protein substrates and is involved in DNA transcription, RNA splicing, DNA repair, cell differentiation, and metastasis. The substrate sequences it recognizes in vivo and the enzymatic mechanism behind it, however, remain to be explored. Here we characterize methylation catalyzed by a bacterially expressed GST-tagged human PRMT7 fusion protein with a broad range of peptide and protein substrates. After confirming its type III activity generating only ω-N(G)-monomethylarginine and its distinct substrate specificity for RXR motifs surrounded by basic residues, we performed site-directed mutagenesis studies on this enzyme, revealing that two acidic residues within the double E loop, Asp-147 and Glu-149, modulate the substrate preference. Furthermore, altering a single acidic residue, Glu-478, on the C-terminal domain to glutamine nearly abolished the activity of the enzyme. Additionally, we demonstrate that PRMT7 has unusual temperature dependence and salt tolerance. These results provide a biochemical foundation for understanding the broad biological functions of PRMT7 in health and disease. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.
Federal Register 2010, 2011, 2012, 2013, 2014
2012-10-29
... DEPARTMENT OF COMMERCE Patent and Trademark Office Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request... Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of...
OPAL: prediction of MoRF regions in intrinsically disordered protein sequences.
Sharma, Ronesh; Raicar, Gaurav; Tsunoda, Tatsuhiko; Patil, Ashwini; Sharma, Alok
2018-06-01
Intrinsically disordered proteins lack stable 3-dimensional structure and play a crucial role in performing various biological functions. Key to their biological function are the molecular recognition features (MoRFs) located within long disordered regions. Computationally identifying these MoRFs from disordered protein sequences is a challenging task. In this study, we present a new MoRF predictor, OPAL, to identify MoRFs in disordered protein sequences. OPAL utilizes two independent sources of information computed using different component predictors. The scores are processed and combined using a common averaging method. The first score is computed using a component MoRF predictor which utilizes composition and sequence similarity of MoRF and non-MoRF regions to detect MoRFs. The second score is calculated using half-sphere exposure (HSE), solvent accessible surface area (ASA) and backbone angle information of the disordered protein sequence, using information from the amino acid properties of flanks surrounding the MoRFs to distinguish MoRF and non-MoRF residues. OPAL is evaluated using test sets that were previously used to evaluate the MoRF predictors MoRFpred, MoRFchibi and MoRFchibi-web. The results demonstrate that OPAL outperforms all the available MoRF predictors and is the most accurate predictor available for MoRF prediction. It is available at http://www.alok-ai-lab.com/tools/opal/. Contact: ashwini@hgc.jp or alok.sharma@griffith.edu.au. Supplementary data are available at Bioinformatics online.
Park, Dongbin; Goh, Chul Jun; Kim, Hyein; Hahn, Yoonsoo
2018-04-01
The genome sequences of two novel monopartite RNA viruses were identified in a common eelgrass (Zostera marina) transcriptome dataset. Sequence comparison and phylogenetic analyses revealed that these two novel viruses belong to the genus Amalgavirus in the family Amalgaviridae. They were named Zostera marina amalgavirus 1 (ZmAV1) and Zostera marina amalgavirus 2 (ZmAV2). Genomes of both ZmAV1 and ZmAV2 contain two overlapping open reading frames (ORFs). ORF1 encodes a putative replication factory matrix-like protein, while ORF2 encodes an RNA-dependent RNA polymerase (RdRp) domain. The fusion protein (ORF1+2) of ORF1 and ORF2, which mediates RNA replication, was produced using the +1 programmed ribosomal frameshifting (PRF) mechanism. The +1 PRF motif sequence, UUU_CGN, which is highly conserved among known amalgaviruses, was also found in ZmAV1 and ZmAV2. Multiple sequence alignment of the ORF1+2 fusion proteins from 24 amalgaviruses revealed that +1 PRF occurred only at three different positions within the 13-amino acid-long segment, which was surrounded by highly conserved regions on both sides. This suggested that the +1 PRF may be constrained by the structure of fusion proteins. Genome sequences of ZmAV1 and ZmAV2, which are the first viruses to be identified in common eelgrass, will serve as useful resources for studying evolution and diversity of amalgaviruses.
Park, Dongbin; Goh, Chul Jun; Kim, Hyein; Hahn, Yoonsoo
2018-01-01
The genome sequences of two novel monopartite RNA viruses were identified in a common eelgrass (Zostera marina) transcriptome dataset. Sequence comparison and phylogenetic analyses revealed that these two novel viruses belong to the genus Amalgavirus in the family Amalgaviridae. They were named Zostera marina amalgavirus 1 (ZmAV1) and Zostera marina amalgavirus 2 (ZmAV2). Genomes of both ZmAV1 and ZmAV2 contain two overlapping open reading frames (ORFs). ORF1 encodes a putative replication factory matrix-like protein, while ORF2 encodes an RNA-dependent RNA polymerase (RdRp) domain. The fusion protein (ORF1+2) of ORF1 and ORF2, which mediates RNA replication, was produced using the +1 programmed ribosomal frameshifting (PRF) mechanism. The +1 PRF motif sequence, UUU_CGN, which is highly conserved among known amalgaviruses, was also found in ZmAV1 and ZmAV2. Multiple sequence alignment of the ORF1+2 fusion proteins from 24 amalgaviruses revealed that +1 PRF occurred only at three different positions within the 13-amino acid-long segment, which was surrounded by highly conserved regions on both sides. This suggested that the +1 PRF may be constrained by the structure of fusion proteins. Genome sequences of ZmAV1 and ZmAV2, which are the first viruses to be identified in common eelgrass, will serve as useful resources for studying evolution and diversity of amalgaviruses. PMID:29628822
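A scan for the UUU_CGN motif can be sketched as follows. The sketch assumes that the underscore marks a codon boundary in the ORF1 reading frame and that N stands for any nucleotide; the function name `find_prf_motifs` and the toy sequence are hypothetical and do not come from the ZmAV1 or ZmAV2 genomes.

```python
import re

# UUU codon followed in frame by a CGN codon (N = any of A, C, G, U).
PRF_MOTIF = re.compile(r"UUUCG[ACGU]")


def find_prf_motifs(rna, frame=0):
    """Return 0-based start positions of in-frame UUU_CGN motifs.

    Assumption: `frame` is the offset of the ORF1 reading frame, so only
    codon-aligned windows are tested. DNA input (T) is converted to RNA (U).
    """
    rna = rna.upper().replace("T", "U")
    hits = []
    for i in range(frame, len(rna) - 5, 3):
        if PRF_MOTIF.fullmatch(rna[i:i + 6]):
            hits.append(i)
    return hits


# Toy sequence: codons AUG GCA UUU CGU GGA, motif starts at position 6.
toy = "AUGGCAUUUCGUGGA"
hits_frame0 = find_prf_motifs(toy, frame=0)
hits_frame1 = find_prf_motifs(toy, frame=1)
```

Restricting the search to codon-aligned windows avoids reporting out-of-frame matches, which would not correspond to a slippery codon pair in the translated frame.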
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2007-12-11
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
1999-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2002-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2010-11-09
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2000-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann; Dahlberg, James E.
2005-04-05
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Parks, Jason C; Patton, Alyssa L; McCallie, Blair R; Griffin, Darren K; Schoolcraft, William B; Katz-Jaffe, Mandy G
2016-05-01
Corona cells surround the oocyte and maintain a close relationship through transzonal processes and gap junctions, and may be used to assess oocyte competence. In this study, the corona cell transcriptome of individual cumulus oocyte complexes (COCs) was investigated. Isolated corona cells were collected from COCs that developed into euploid blastocysts and were transferred in a subsequent frozen embryo transfer. Ten corona cell samples underwent RNA-sequencing to generate unique gene expression profiles. Live birth was compared with negative implantation after the transfer of a euploid blastocyst using bioinformatics and statistical analysis. Individual corona cell samples produced a mean of 21.2 million sequence reads, and 307 differentially expressed transcripts (P < 0.05; fold change ≥ 2). Enriched pathway analysis showed Wnt signalling, mitogen-activated protein kinases signalling, focal adhesion and tricarboxylic acid cycle to be affected by implantation outcome. Genes of the Wnt/beta-catenin signalling pathway, including APC, AXIN and GSK3B, were independently validated by real-time quantitative reverse transcription PCR. Individual corona cell transcriptomes were successfully generated using RNA-sequencing. Key genes and signalling pathways were identified in association with implantation outcome after the transfer of a euploid blastocyst in a frozen embryo transfer. These data could provide novel biomarkers for the non-invasive assessment of embryo viability. Copyright © 2016 Reproductive Healthcare Ltd. Published by Elsevier Ltd. All rights reserved.
Bhatia, Sonu; Batra, Navneet; Pathak, Ashish; Joshi, Amit; Souza, Leila; Almeida, Paulo; Chauhan, Ashvini
2015-09-01
The soil-mousse surrounding a geothermal spring was analyzed for bacterial and archaeal diversity using 16S rRNA gene amplicon metagenomic sequencing which revealed the presence of 18 bacterial phyla distributed across 109 families and 219 genera. Firmicutes, Actinobacteria, and the Deinococcus-Thermus group were the predominant bacterial assemblages with Crenarchaeota and Thaumarchaeota as the main archaeal assemblages in this largely understudied geothermal habitat. Several metagenome sequences remained taxonomically unassigned suggesting the presence of a repertoire of hitherto undescribed microbes in this geothermal soil-mousse econiche.
Method for nucleic acid hybridization using single-stranded DNA binding protein
Tabor, Stanley; Richardson, Charles C.
1996-01-01
Method of nucleic acid hybridization for detecting the presence of a specific nucleic acid sequence in a population of different nucleic acid sequences using a nucleic acid probe. The nucleic acid probe hybridizes with the specific nucleic acid sequence but not with other nucleic acid sequences in the population. The method includes contacting a sample (potentially including the nucleic acid sequence) with the nucleic acid probe under hybridizing conditions in the presence of a single-stranded DNA binding protein provided in an amount which stimulates renaturation of a dilute solution (i.e., one in which the t.sub.1/2 of renaturation is longer than 3 weeks) of single-stranded DNA greater than 500 fold (i.e., to a t.sub.1/2 less than 60 min, preferably less than 5 min, and most preferably about 1 min.) in the absence of nucleotide triphosphates.
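As a quick consistency check on the figures quoted above, the following worked arithmetic compares the stated half-times; it assumes only that "3 weeks" means 21 days and that the fold stimulation is the ratio of half-times.

```latex
\begin{align*}
t_{1/2}^{\text{unstimulated}} &> 3\ \text{weeks} = 3 \times 7 \times 24 \times 60\ \text{min} = 30\,240\ \text{min} \\
\text{fold stimulation} &= \frac{t_{1/2}^{\text{unstimulated}}}{t_{1/2}^{\text{stimulated}}}
  > \frac{30\,240\ \text{min}}{60\ \text{min}} = 504 > 500 \\
\text{for } t_{1/2}^{\text{stimulated}} = 5\ \text{min}: &\quad \frac{30\,240}{5} = 6\,048 \\
\text{for } t_{1/2}^{\text{stimulated}} = 1\ \text{min}: &\quad \frac{30\,240}{1} = 30\,240
\end{align*}
```

The 60 min bound therefore corresponds to the stated "greater than 500 fold" stimulation, while the preferred 5 min and 1 min half-times imply roughly 6,000-fold and 30,000-fold stimulation, respectively.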
Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.
Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami
2012-08-01
Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature (http://hivdb.Stanford.edu). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if they have >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or <0.5% or >15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.
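Two of the flagging rules listed above (ambiguous bases and stop codons) can be illustrated with a minimal sketch. SQUAT itself runs in R; this Python sketch is not its code. It assumes an ungapped DNA sequence that already starts in the correct reading frame, and the names `quality_flags` and `MAX_AMBIGUOUS` are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Thresholds quoted in the abstract: flag if more than 6 (PR) or 18 (RT)
# ambiguous bases, or if any stop codon is present.
MAX_AMBIGUOUS = {"PR": 6, "RT": 18}


def quality_flags(seq, gene):
    """Return a list of flag names raised for one sequence.

    Assumptions: `seq` is ungapped DNA in reading frame 0, and any character
    other than A, C, G or T counts as an ambiguous base.
    """
    seq = seq.upper()
    flags = []
    ambiguous = sum(1 for base in seq if base not in "ACGT")
    if ambiguous > MAX_AMBIGUOUS[gene]:
        flags.append("ambiguous_bases")
    stops = sum(
        1 for i in range(0, len(seq) - 2, 3) if seq[i:i + 3] in STOP_CODONS
    )
    if stops > 0:
        flags.append("stop_codon")
    return flags


clean = quality_flags("CCTCAGATC", "PR")          # no flags expected
with_stop = quality_flags("CCTTAGATC", "PR")      # codons CCT TAG ATC
too_ambiguous = quality_flags("NNNNNNNCCT", "PR") # 7 ambiguous bases
```

Keeping the thresholds in a dictionary mirrors the statement that thresholds are user modifiable; the remaining rules (insertions, deletions, consecutive mutations, genetic distance) need an alignment to a reference and are not covered here.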
Saito, T; Ochiai, H
1999-10-01
cDNA fragments putatively encoding amino acid sequences characteristic of the fatty acid desaturase were obtained using expressed sequence tag (EST) information of the Dictyostelium cDNA project. Using this sequence, we have determined the cDNA sequence and genomic sequence of a desaturase. The cloned cDNA is 1489 nucleotides long and the deduced amino acid sequence comprises 464 amino acid residues containing an N-terminal cytochrome b5 domain. The whole sequence was 38.6% identical to the initially identified Delta5-desaturase of Mortierella alpina. We have confirmed its function as Delta5-desaturase by an overexpression mutation in D. discoideum and also by a gain-of-function mutation in the yeast Saccharomyces cerevisiae. Analysis of the lipids from transformed D. discoideum and yeast demonstrated the accumulation of Delta5-desaturated products. This is the first report concerning fatty acid desaturase in cellular slime molds.
Composition for nucleic acid sequencing
Korlach, Jonas [Ithaca, NY]; Webb, Watt W [Ithaca, NY]; Levene, Michael [Ithaca, NY]; Turner, Stephen [Ithaca, NY]; Craighead, Harold G [Ithaca, NY]; Foquet, Mathieu [Ithaca, NY]
2008-08-26
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
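The deduction step in this abstract (reading the target sequence from the temporal order of base additions to the complementary strand) can be sketched in Python. This is an illustrative toy under stated assumptions, not the patented method: the function name is hypothetical, the input is assumed to be an already-identified, error-free list of incorporated DNA bases, and the target is reported 5'→3'.

```python
# Watson-Crick pairing for DNA bases.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def template_from_incorporations(incorporated):
    """Deduce the target (template) sequence, written 5'->3'.

    incorporated -- bases added to the growing strand, in the temporal
                    order of incorporation (i.e. 5'->3' on the new strand).
    """
    # Each added base pairs with one template base. The polymerase reads
    # the template 3'->5', so the complements are reversed to report the
    # template in the conventional 5'->3' direction.
    return "".join(COMPLEMENT[base] for base in reversed(incorporated))
```

For example, if the identified additions are A, T, G, C in that order, the function returns the reverse complement of that string as the deduced target sequence.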
Method for sequencing nucleic acid molecules
Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu
2006-06-06
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules
Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu
2006-05-30
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC
NASA Astrophysics Data System (ADS)
Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.
2000-02-01
Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.
Effect of Base Sequence "Defects" on the Electrostatic Potential of Dissolved DNA
NASA Astrophysics Data System (ADS)
Adams, Scott V.; Wagner, Katrina; Kephart, Thomas S.; Edwards, Glenn
1997-11-01
An analytical model of the electrostatic potential surrounding dissolved DNA has been developed. The model consists of an all-atom, mathematically helical structure for DNA, in which the atoms are arranged in infinite lines of discrete point charges on concentric cylindrical surfaces. The surrounding solvent and counterions are treated with the Debye-Hückel approximation (Wagner et al., Biophysical Journal 73, 21-30, 1997). Variation in the electrostatic potential due to structural differences between A, B, and Z conformations and homopolymer base sequence is apparent. The most recent modification to the model exploits the principle of superposition to calculate the potential of DNA with a base sequence containing 'defects.' That is, the base sequence is no longer uniform along the polymer. Differences between the potential of homopolymer DNA and the potential of DNA containing base 'defects' are immediately obvious. These results may aid in understanding the role of electrostatics in base-sequence specificity exhibited by DNA-binding proteins.
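The superposition idea in this abstract can be illustrated with a small Python sketch. It is not the authors' analytical model (which uses infinite lines of charges on cylindrical surfaces); it simply sums screened Coulomb (Debye-Hückel) potentials of a finite set of point charges. The function name, the default relative permittivity of water, and any geometry passed in are assumptions for illustration.

```python
import math

EPS0 = 8.8541878128e-12     # vacuum permittivity, F/m
E_CHARGE = 1.602176634e-19  # elementary charge, C


def screened_potential(point, charges, debye_length, eps_r=78.5):
    """Debye-Hückel potential (volts) at `point` from discrete charges.

    point        -- (x, y, z) in metres
    charges      -- list of ((x, y, z), q) with q in coulombs
    debye_length -- screening length in metres
    eps_r        -- relative permittivity of the solvent (assumed value)
    """
    total = 0.0
    for position, q in charges:
        r = math.dist(point, position)
        # Screened Coulomb term: q * exp(-r / lambda_D) / (4 pi eps r)
        total += q * math.exp(-r / debye_length) / (4 * math.pi * EPS0 * eps_r * r)
    return total
```

Because the linearized (Debye-Hückel) potential is linear in the charges, the potential of a sequence with a 'defect' equals the potential of the uniform sequence plus the potential of the charge differences introduced by the defect. Calling `screened_potential` on the combined charge list, or on the two lists separately and adding the results, gives the same value up to rounding.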
Assignment of the human GABA transporter gene (GABATHG) locus to chromosome 3p24-p25
DOE Office of Scientific and Technical Information (OSTI.GOV)
Huang, Fang; Fei, Jian; Guo, Li-He
1995-09-01
An essential regulatory process of synaptic transmission is the inactivation of released neurotransmitters by the transmitter-specific uptake mechanism. γ-Aminobutyric acid (GABA) is an inhibitory transmitter in the vertebrate central nervous system; its activity is terminated by a high-affinity Na+- and Cl−-dependent specific GABA transporter (GAT), which carries the neurotransmitter to the presynaptic neuron and/or glial elements surrounding the synaptic cleft. Deficiency of the transporter may cause epilepsy and some other nervous diseases. The human GAT gene (GABATHG), approximately 25 kb in length, has been cloned and sequenced by our colleagues (7). Here the results of the in situ hybridization mapping with the gene are presented. 10 refs., 1 fig.
Tada, Tatsuya; Shrestha, Shovita; Shimada, Kayo; Ohara, Hiroshi; Sherchand, Jeevan B; Pokhrel, Bharat M; Kirikae, Teruo
2017-03-01
A novel PER-type extended-spectrum β-lactamase, PER-8, was identified in an Acinetobacter baumannii clinical isolate obtained in Nepal. The amino acid sequence of PER-8 has a substitution at position 39 (Gly to Glu) compared with that of PER-7. The kcat/Km ratio of PER-8 for aztreonam was lower than that of PER-7, while the kcat/Km ratio of PER-8 for imipenem was higher than that of PER-7. The genomic environment surrounding blaPER-8 was intI1 blaPSE-1 qacEDI sulI ISCR1-blaPER-8 gts sulI orfX on a 100-kb plasmid. Copyright © 2017 American Society for Microbiology.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2014-02-25
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-05-16
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2008-04-01
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2010-10-12
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVIII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-05-23
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl8, and the corresponding EGVIII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVIII, recombinant EGVIII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2010-10-05
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-06-06
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2009-05-05
The present invention provides an endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2013-07-16
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2012-02-14
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2015-04-14
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
Kit for detecting nucleic acid sequences using competitive hybridization probes
Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.
2001-01-01
A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the target sequence.
Identification of chondrocyte-binding peptides by phage display.
Cheung, Crystal S F; Lui, Julian C; Baron, Jeffrey
2013-07-01
As an initial step toward targeting cartilage tissue for potential therapeutic applications, we sought cartilage-binding peptides using phage display, a powerful technology for selection of peptides that bind to molecules of interest. A library of phage displaying random 12-amino acid peptides was iteratively incubated with cultured chondrocytes to select phage that bind cartilage. The resulting phage clones demonstrated increased affinity to chondrocytes by ELISA, when compared to a wild-type, insertless phage. Furthermore, the selected phage showed little preferential binding to other cell types, including primary skin fibroblast, myocyte and hepatocyte cultures, suggesting a tissue-specific interaction. Immunohistochemical staining revealed that the selected phage bound chondrocytes themselves and the surrounding extracellular matrix. FITC-tagged peptides were synthesized based on the sequence of cartilage-binding phage clones. These peptides, but not a random peptide, bound cultured chondrocytes and extracellular matrix. In conclusion, using phage display, we identified peptide sequences that specifically target chondrocytes. We anticipate that such peptides may be coupled to therapeutic molecules to provide targeted treatment for cartilage disorders. Copyright © 2013 Orthopaedic Research Society.
Chip-based sequencing nucleic acids
Beer, Neil Reginald
2014-08-26
A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.
Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F M; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L; Glocker, Michael O
2015-03-01
Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein G' with great scientific and economic importance. Substantial deviations from the published amino acid sequence (Uniprot Q54181) were found: 46 additional amino acids at the N-terminus, including a so-called "His-tag," as well as an N-terminal partial α-N-gluconoylation and α-N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α-N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to identify the expression vector pET-28b from Novagen, with the Xho I restriction enzyme cleavage site, as the most likely vector used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant (K(d)) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.
Thomsen, Martin Christen Frølund; Nielsen, Morten
2012-01-01
Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and a low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2Logo aims at resolving these issues, allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for a low number of observations, and different logotype representations, each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed). PMID:22638583
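The notion of per-position information content in an MSA, with sequence weights and pseudo counts, can be sketched in Python as follows. This is a minimal illustration and not Seq2Logo's algorithm: it uses the plain Shannon measure with a uniform background, a simple additive pseudo count per amino acid, and caller-supplied weights. The function and parameter names are hypothetical.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def column_information(alignment, weights=None, pseudocount=0.0):
    """Shannon information content (bits) for each alignment column.

    alignment   -- list of equal-length protein sequences
    weights     -- optional per-sequence weights (e.g. to down-weight
                   redundant sequences); defaults to 1.0 each
    pseudocount -- value added to every amino acid count in each column
    """
    if weights is None:
        weights = [1.0] * len(alignment)
    max_bits = math.log2(len(AMINO_ACIDS))
    information = []
    for pos in range(len(alignment[0])):
        counts = {aa: pseudocount for aa in AMINO_ACIDS}
        for sequence, weight in zip(alignment, weights):
            if sequence[pos] in counts:  # gaps and unknown symbols are skipped
                counts[sequence[pos]] += weight
        total = sum(counts.values())
        if total == 0:
            information.append(0.0)  # column with no usable observations
            continue
        entropy = -sum((c / total) * math.log2(c / total)
                       for c in counts.values() if c > 0)
        information.append(max_bits - entropy)
    return information
```

A fully conserved column reaches the maximum of log2(20) bits, and adding a pseudo count lowers that value, which reflects reduced confidence when few sequences are observed.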
Tarr, Sarah J; Cryar, Adam; Thalassinos, Konstantinos; Haldar, Kasturi; Osborne, Andrew R
2013-01-01
The malaria parasite exports proteins across its plasma membrane and a surrounding parasitophorous vacuole membrane, into its host erythrocyte. Most exported proteins contain a Host Targeting motif (HT motif) that targets them for export. In the parasite secretory pathway, the HT motif is cleaved by the protease plasmepsin V, but the role of the newly generated N-terminal sequence in protein export is unclear. Using a model protein that is cleaved by an exogenous viral protease, we show that the new N-terminal sequence, normally generated by plasmepsin V cleavage, is sufficient to target a protein for export, and that cleavage by plasmepsin V is not coupled directly to the transfer of a protein to the next component in the export pathway. Mutations of the fourth and fifth positions of the HT motif, as well as of amino acids further downstream, block or affect the efficiency of protein export, indicating that this region is necessary for efficient export. We also show that the fifth position of the HT motif is important for plasmepsin V cleavage. Our results indicate that plasmepsin V cleavage is required to generate a new N-terminal sequence that is necessary and sufficient to mediate protein export by the malaria parasite. PMID:23279267
Tanaka, Junko; Doi, Nobuhide; Takashima, Hideaki; Yanagawa, Hiroshi
2010-01-01
Screening of functional proteins from a random-sequence library has been used to evolve novel proteins in the field of evolutionary protein engineering. However, random-sequence proteins consisting of the 20 natural amino acids tend to aggregate, and the occurrence rate of functional proteins in a random-sequence library is low. From the viewpoint of the origin of life, it has been proposed that primordial proteins consisted of a limited set of amino acids that could have been abundantly formed early during chemical evolution. We have previously found that members of a random-sequence protein library constructed with five primitive amino acids show high solubility (Doi et al., Protein Eng Des Sel 2005;18:279–284). Although such a library is expected to be appropriate for finding functional proteins, the functionality may be limited, because they have no positively charged amino acid. Here, we constructed three libraries of 120-amino acid, random-sequence proteins using alphabets of 5, 12, and 20 amino acids by preselection using mRNA display (to eliminate sequences containing stop codons and frameshifts) and characterized and compared the structural properties of random-sequence proteins arbitrarily chosen from these libraries. We found that random-sequence proteins constructed with the 12-member alphabet (including five primitive amino acids and positively charged amino acids) have higher solubility than those constructed with the 20-member alphabet, though other biophysical properties are very similar in the two libraries. Thus, a library of moderate complexity constructed from 12 amino acids may be a more appropriate resource for functional screening than one constructed from 20 amino acids. PMID:20162614
DOE Office of Scientific and Technical Information (OSTI.GOV)
Reiser, Steven E.; Somerville, Chris R.
The present invention relates to bacterial enzymes, in particular to an acyl-CoA reductase and a gene encoding an acyl-CoA reductase, the amino acid and nucleic acid sequences corresponding to the reductase polypeptide and gene, respectively, and to methods of obtaining such enzymes, amino acid sequences and nucleic acid sequences. The invention also relates to the use of such sequences to provide transgenic host cells capable of producing fatty alcohols and fatty aldehydes.
BGL7 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2013-01-29
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2012-10-02
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL5 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-02-28
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL5 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2008-03-18
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dunn-Coleman, Nigel; Ward, Michael
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2014-03-04
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2015-04-14
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2014-03-25
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2015-08-11
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2007-09-25
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2008-04-01
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2011-12-06
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-05-16
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2011-06-14
The present invention provides a novel beta-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Ward, Michael [San Francisco, CA]
2009-09-01
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2012-10-30
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2008-01-22
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
Chromatic induction in space and time.
Coia, Andrew J; Shevell, Steven K
2018-04-01
The color appearance of a light depends on variation in the complete visual field over both space and time. In the spatial domain, a chromatic stimulus within a patterned chromatic surround can appear a different hue than the same stimulus within a uniform surround. In the temporal domain, a stimulus presented as an element of a continuously changing chromaticity can appear a different color compared to the identical stimulus, presented simultaneously but viewed alone. This is the flash-lag effect for color, which has an analog in the domain of motion: a pulsed object seen alone can appear to lag behind an identical pulsed object that is an element of a motion sequence. Studies of the flash-lag effect for motion have considered whether it is mediated by a neural representation for the moving physical stimulus or, alternatively, for the perceived motion. The current study addresses this question for the flash-lag effect for color by testing whether the color flash lag depends on a representation of only the changing chromatic stimulus or, alternatively, its color percept, which can be altered by chromatic induction. Baseline measurements for spatial chromatic induction determined the chromaticity of a flashed ring within a uniform surround that matched a flashed ring within a patterned surround. Baseline measurements for the color flash-lag effect determined the chromaticity of a pulsed ring presented alone (within a uniform surround) that matched a pulsed ring presented in a sequence of changing chromaticity over time (also within a uniform surround). Finally, the main experiments combined chromatic induction from a patterned surround and the flash-lag effect, in three conditions: (1) both the changing and pulsed rings were within a patterned chromatic surround; (2) the changing ring was within a patterned surround and the pulsed ring within a uniform surround; and (3) the changing ring was within a uniform surround and the pulsed ring within a patterned surround. The flash-lag measurements for a changing chromaticity were affected by perceptual changes induced by the surrounding chromatic pattern. Thus, the color shifts induced by a chromatic surround are incorporated in the neural representation mediating the flash-lag effect for color.
Asahi, Hiroko; Osman, Ahmed; Cook, Rosemary M.; LoVerde, Philip T.; Stadecker, Miguel J.
2000-01-01
In schistosomiasis mansoni, hepatic granulomatous inflammation surrounding parasite eggs is mediated by CD4+ T helper (Th) cells sensitized to schistosomal egg antigens (SEA). We previously showed that a prominent lymphoproliferative response of CD4+ Th cells from schistosome-infected C57BL/6 (BL/6) mice was directed against a 62-kDa component of SEA. A partial amino acid sequence of the 62-kDa component was found to be identical with one present in the enzyme phosphoenolpyruvate carboxykinase (PEPCK). Based on this sequence, a cDNA clone containing the entire coding region of PEPCK was identified, and the full recombinant Schistosoma mansoni PEPCK (rSm-PEPCK) of 626 amino acids was purified from a prokaryotic expression system. rSm-PEPCK strongly stimulated a specific T-cell hybridoma, 4E6, as well as CD4+ Th cells from SEA-immunized BL/6 mice and from infected BL/6, CBA, and BALB/c mice. In the infected mice, rSm-PEPCK elicited significant gamma interferon production as well as, to a lesser extent, production of interleukin-2 and -5. In BL/6 and BALB/c mice, the CD4+ Th cell response to rSm-PEPCK was greater than that directed against the egg antigen Sm-p40; conversely, CBA mice responded better to Sm-p40 than to Sm-PEPCK. A 12-amino-acid region (residues 398 to 409: DKSKDPKAHPNS) was demonstrated to contain a T-cell epitope; synthetic peptides containing this epitope significantly stimulated specific hybridoma 4E6 and polyclonal CD4+ Th cells. The identification and characterization of immunogenic egg components will contribute to the understanding and possible control of T-cell-mediated schistosomal disease. PMID:10816489
Tulipano, Giovanni; Bulgari, Omar; Chessa, Stefania; Nardone, Alessandro; Cocchi, Daniela; Caroli, Anna
2010-02-25
Casein phosphopeptides (CPPs), obtained by enzymatic hydrolysis of caseins in vitro, have been shown to enhance calcium solubility and to increase the calcification of embryonic rat bones in their diaphyseal area. Little is known about the direct effects of CPPs on cultured osteoblastic cells. Calcium in the microenvironment surrounding bone cells is not only important for the mineralization of the extracellular matrix, but is also believed to provide preosteoblasts with a signal that modulates their proliferation and differentiation. The aim of the present study was to investigate the direct effects of four selected casein phosphopeptides on osteoblastic cell (MC3T3-E1 cells) viability and differentiation. The selected peptides were obtained by chemical synthesis and differed in the number of phosphorylated sites and in the amino acid spacing between two phosphorylated sites, in order to further characterize the relationship between structure and function. The results obtained in this work demonstrated that CPPs may directly affect osteoblast-like cell growth, calcium uptake and ultimately calcium deposition in the extracellular matrix. The effects exerted by distinct CPPs on osteogenesis in vitro can be either stimulatory or inhibitory. Differential short amino acid sequences in their molecules, like the -SpEE- and -SpTSpEE- motifs, are likely the molecular determinants for their biological activities on osteoblastic cells. Moreover, two genetic variants of CPPs showing one amino acid change in their sequence may profoundly differ in their biological activities. Finally, our data may also suggest important clues about the role of intrinsic phosphorylated peptides derived from endogenous phosphorylated proteins in bone metabolism, apart from extrinsic CPPs. Copyright 2009 Elsevier B.V. All rights reserved.
Bhatia, Sonu; Batra, Navneet; Pathak, Ashish; Joshi, Amit; Souza, Leila; Almeida, Paulo; Chauhan, Ashvini
2015-01-01
The soil-mousse surrounding a geothermal spring was analyzed for bacterial and archaeal diversity using 16S rRNA gene amplicon metagenomic sequencing which revealed the presence of 18 bacterial phyla distributed across 109 families and 219 genera. Firmicutes, Actinobacteria, and the Deinococcus-Thermus group were the predominant bacterial assemblages with Crenarchaeota and Thaumarchaeota as the main archaeal assemblages in this largely understudied geothermal habitat. Several metagenome sequences remained taxonomically unassigned suggesting the presence of a repertoire of hitherto undescribed microbes in this geothermal soil-mousse econiche. PMID:26484255
Methods and compositions for efficient nucleic acid sequencing
Drmanac, Radoje
2006-07-04
Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.
Methods and compositions for efficient nucleic acid sequencing
Drmanac, Radoje
2002-01-01
Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.
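The two records above describe sequencing by hybridization with short probes of known sequence. As a rough illustration of the general idea only (not the patented two-probe-set method, whose details are not given in the abstract), the sketch below rebuilds a sequence from the set of short probes that match it by walking overlaps in a de Bruijn graph. The function names `spectrum` and `reconstruct` are hypothetical, and the input is assumed to be an error-free, complete probe set.

```python
from collections import defaultdict


def spectrum(sequence: str, k: int) -> list:
    """Return every length-k substring (probe match) of the sequence, in order."""
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def reconstruct(kmers: list) -> str:
    """Rebuild one sequence consistent with the k-mers via an Eulerian path.

    Assumes an error-free, complete k-mer set. Repeated (k-1)-mers can make
    the answer ambiguous; in that case one valid sequence is returned.
    """
    if not kmers:
        raise ValueError("at least one k-mer is required")
    graph = defaultdict(list)   # (k-1)-mer prefix -> list of (k-1)-mer suffixes
    balance = defaultdict(int)  # out-degree minus in-degree per node
    for kmer in kmers:
        prefix, suffix = kmer[:-1], kmer[1:]
        graph[prefix].append(suffix)
        balance[prefix] += 1
        balance[suffix] -= 1
    # The path starts at the node with one more outgoing than incoming edge,
    # or at an arbitrary node if the graph is balanced.
    start = next((n for n, b in balance.items() if b == 1), kmers[0][:-1])
    # Hierholzer's algorithm: follow unused edges, backtrack when stuck.
    stack, path = [start], []
    while stack:
        node = stack[-1]
        if graph[node]:
            stack.append(graph[node].pop())
        else:
            path.append(stack.pop())
    path.reverse()
    return path[0] + "".join(node[-1] for node in path[1:])


if __name__ == "__main__":
    probes = spectrum("ATGCAGTC", 3)
    print(reconstruct(probes))
```

Real hybridization data contain false positives and negatives, which this sketch does not handle.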
Kloos, Karin; Schloter, Michael; Meyer, Ortwin
2006-11-01
Acid resins are residues produced in a recycling process for used oils that was in use in the 1940s and 1950s. The resin-like material is highly contaminated with mineral oil hydrocarbons, extremely acidic and co-contaminated with substituted and aromatic hydrocarbons, and heavy metals. To determine the potential for microbial biodegradation, the acid resin deposit and its surroundings were screened for microbial activity by soil respiration measurements. No microbial activity was found in the core deposit. However, biodegradation of hydrocarbons was possible in zones with a lower degree of contamination surrounding the deposit. An extremely acidophilic microbial community was detected close to the core deposit. A simple ecotoxicological approach showed that the pure acid resin that formed the major part of the core deposit was toxic to the indigenous microflora due to its extremely low pH of 0-1.
Hybridization and sequencing of nucleic acids using base pair mismatches
Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua
2001-01-01
Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Human jagged polypeptide, encoding nucleic acids and methods of use
Li, Linheng; Hood, Leroy
2000-01-01
The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof
Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter
2016-02-16
The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having swollenin activity and uses thereof
Schooneveld-Bergmans, Margot Elizabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius
2015-11-04
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof
Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius
2015-09-01
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having cellobiohydrolase activity and uses thereof
Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter
2015-09-15
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having acetyl xylan esterase activity and uses thereof
Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter
2015-10-20
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having carbohydrate degrading activity and uses thereof
Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius
2015-08-18
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
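The polypeptide records above define variants by a minimum percent sequence identity to SEQ ID NO: 2 (thresholds of 70% to 96% across the records). The abstracts do not say how identity is computed, so the following is only a minimal sketch under one common assumption: the two sequences are already aligned (gaps written as `-`), and identity is the share of matching residues among columns where neither sequence has a gap. The function names and the example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumption: inputs are pre-aligned and of equal length; other definitions
    (e.g. dividing by the full alignment length) give different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # skip gap columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True if the percent identity is at least the given threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    reference = "ACDEFGHIKL"  # made-up stand-in for a reference sequence
    variant = "ACDEFGHIKV"    # differs at one of ten positions
    print(percent_identity(reference, variant))
    print(meets_threshold(reference, variant, 76.0))
```

Because the denominator convention changes the result, any real comparison against a claimed threshold would need the definition given in the patent text itself.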
Draft genome sequences of 9 LA-MRSA ST5 isolates obtained from humans after short term swine contact
USDA-ARS's Scientific Manuscript database
Livestock-associated methicillin-resistant Staphylococcus aureus (LA-MRSA) sequence type 5 isolates have raised concerns surrounding the potential for these isolates to colonize or cause disease in humans with swine contact. Here, we report draft genome sequences for 9 LA-MRSA ST5 isolates obtained from huma...
Sprincl, L; Terescenko, T L; Kálal, J; Lipatova, T E; Kopecek, J; Pchakadze, G A
1976-01-01
The biocompatibility of three types of hydrophilic poly(glycol methacrylate) gels (homogeneous, microporous and macroporous) was investigated in an experimental subcutaneous implantation. The occurrence of mucopolysaccharide complexes formed by both hyaluronic acid and chondroitin sulphates was examined in the fibrous tissue which surrounds the implant and, in the case of a macroporous polymer, penetrates into it. In the early stage hyaluronic acid prevails, but as collagenization proceeds the chondroitin sulphate part becomes predominant.
Lead identification in soil surrounding a used lead acid battery smelter area in Banten, Indonesia
NASA Astrophysics Data System (ADS)
Adventini, N.; Santoso, M.; Lestiani, D. D.; Syahfitri, W. Y. N.; Rixson, L.
2017-06-01
A used lead acid battery smelter generates lead-containing particulates that can contaminate the surrounding environment. Lead is a heavy metal that is harmful to health if it enters the human body through soil, air, or water. Lead was identified in soil samples surrounding formal and informal used lead acid battery smelter areas in Banten, Indonesia, using EDXRF. The EDXRF accuracy and precision, evaluated with marine sediment IAEA 457, were in good agreement with the certified value. Sixteen soil samples from the formal and informal areas and 2 soil samples from a control area were taken from surface and subsurface soils. The highest lead concentrations from the two lead smelters were approximately 9-fold and 11-fold higher than the reference and control samples. The assessment of lead contamination in soils, expressed as the Cf index, fell in the categories moderately and strongly polluted by lead for the formal and informal lead smelters. Daily lead intake of children in this study exceeded the recommended dietary allowance at all sites. The HI values for adults and children living near both lead smelter areas were greater than the safety threshold of 1. These findings confirm that there is a potential health risk for inhabitants surrounding the used lead acid battery smelter areas in Banten, Indonesia.
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.
Code of Federal Regulations, 2010 CFR
2010-07-01
... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...
Code of Federal Regulations, 2011 CFR
2011-07-01
... from abandonment 1.135 Amino Acid Sequences. (See Nucleotide and/or Amino Acid Sequences) Appeal to... Appeals and Interference 41.47 Of rejection of an application 1.104(a) Nucleotide and/or Amino Acid...) Symbols for nucleotide and/or amino acid sequence data 1.822 T Tables in patent applications 1.58 Terminal...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.
Code of Federal Regulations, 2011 CFR
2011-07-01
... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...
Yang, Wei; Liu, Fuguo; Xu, Chenqi; Sun, Cuixia; Yuan, Fang; Gao, Yanxiang
2015-05-27
The aggregation of lactoferrin and (-)-epigallocatechin gallate (EGCG) was inhibited by polyphenols, oligosaccharides, and collagen peptide in this study. Polyphenols, oligosaccharides, or collagen peptide can effectively prevent the formation of lactoferrin-EGCG aggregates, respectively. The addition sequence of lactoferrin, polyphenols (oligosaccharides or collagen peptide) and EGCG can affect the turbidity and particle size of the ternary complexes in the buffer solution; however, it hardly affected the ζ-potential and fluorescence characteristics. With either positive or negative charge, polyphenols and collagen peptide disrupted the formation of lactoferrin-EGCG aggregate mainly through the mechanism of its competition with EGCG molecules which surrounded the lactoferrin molecule surface with weaker binding affinities, forming polyphenols or a collagen peptide-lactoferrin-EGCG ternary complex; for neutral oligosaccharides, the ternary complex was generated mainly through steric effects, accompanied by a change in the lactoferrin secondary structure induced by gallic acid, chlorogenic acid, and xylo-oligosaccharide. Polyphenols, oligosaccharides, or collagen peptide restraining the formation of lactoferrin-EGCG aggregate could be applied in the design of clear products in the food, pharmaceutical, and cosmetic industries.
Ishii, Tadashi; Matsuoka, Keita; Ono, Hiroshi; Ohnishi-Kameyama, Mayumi; Yaoi, Katsuro; Nakano, Yoshimi; Ohtani, Misato; Demura, Taku; Iwai, Hiroaki; Satoh, Shinobu
2017-11-15
The major polysaccharides present in the primary and secondary walls surrounding plant cells have been well characterized. However, our knowledge of the early stages of secondary wall formation is limited. To address this, cell walls were isolated from differentiating xylem vessel elements of tobacco bright yellow-2 (BY-2) cells induced by VASCULAR-RELATED NAC-DOMAIN7 (VND7). The walls of induced VND7-VP16-GR BY-2 cells consisted of cellulose, pectic polysaccharides, hemicelluloses, and lignin, and contained more xylan and cellulose compared with non-transformed BY-2 and uninduced VND7-VP16-GR BY-2 cells. A reducing end sequence of xylan containing rhamnose and galacturonic acid residues is present in the walls of induced, uninduced, and non-transformed BY-2 cells. Glucuronic acid residues in xylan from walls of induced cells are O-methylated, while those of xylan in non-transformed BY-2 and uninduced cells are not. Our results show that xylan changes in chemical structure and amount during the early stages of xylem differentiation. Copyright © 2017 Elsevier Ltd. All rights reserved.
Substrate Specificity of Human Protein Arginine Methyltransferase 7 (PRMT7)
Feng, You; Hadjikyriacou, Andrea; Clarke, Steven G.
2014-01-01
Protein arginine methyltransferase 7 (PRMT7) methylates arginine residues on various protein substrates and is involved in DNA transcription, RNA splicing, DNA repair, cell differentiation, and metastasis. The substrate sequences it recognizes in vivo and the enzymatic mechanism behind it, however, remain to be explored. Here we characterize methylation catalyzed by a bacterially expressed GST-tagged human PRMT7 fusion protein with a broad range of peptide and protein substrates. After confirming its type III activity generating only ω-NG-monomethylarginine and its distinct substrate specificity for RXR motifs surrounded by basic residues, we performed site-directed mutagenesis studies on this enzyme, revealing that two acidic residues within the double E loop, Asp-147 and Glu-149, modulate the substrate preference. Furthermore, altering a single acidic residue, Glu-478, on the C-terminal domain to glutamine nearly abolished the activity of the enzyme. Additionally, we demonstrate that PRMT7 has unusual temperature dependence and salt tolerance. These results provide a biochemical foundation to understanding the broad biological functions of PRMT7 in health and disease. PMID:25294873
Gene encoding a novel extracellular metalloprotease in Bacillus subtilis.
Sloma, A; Rudolph, C F; Rufo, G A; Sullivan, B J; Theriault, K A; Ally, D; Pero, J
1990-01-01
The gene for a novel extracellular metalloprotease was cloned, and its nucleotide sequence was determined. The gene (mpr) encodes a primary product of 313 amino acids that has little similarity to other known Bacillus proteases. The amino acid sequence of the mature protease was preceded by a signal sequence of approximately 34 amino acids and a pro sequence of 58 amino acids. Four cysteine residues were found in the deduced amino acid sequence of the mature protein, indicating the possible presence of disulfide bonds. The mpr gene mapped in the cysA-aroI region of the chromosome and was not required for growth or sporulation. PMID:2105291
Ruiz-López, Noemi; Sayanova, Olga; Napier, Johnathan A; Haslam, Richard P
2012-04-01
Omega-3 (ω-3) very long chain polyunsaturated fatty acids (VLC-PUFAs) such as eicosapentaenoic acid (EPA; 20:5 Δ5,8,11,14,17) and docosahexaenoic acid (DHA; 22:6 Δ4,7,10,13,16,19) have been shown to have significant roles in human health. Currently the primary dietary source of these fatty acids is marine fish; however, the increasing demand for fish and fish oil (in particular the expansion of the aquaculture industry) is placing enormous pressure on diminishing marine stocks. Such overfishing and concerns related to pollution in the marine environment have directed research towards the development of a viable alternative sustainable source of VLC-PUFAs. As a result, the last decade has seen many genes encoding the primary VLC-PUFA biosynthetic activities identified and characterized. This has allowed the reconstitution of the VLC-PUFA biosynthetic pathway in oilseed crops, producing transgenic plants engineered to accumulate ω-3 VLC-PUFAs at levels approaching those found in native marine organisms. Moreover, as a result of these engineering activities, knowledge of the fundamental processes surrounding acyl exchange and lipid remodelling has progressed. The application of new technologies, for example lipidomics and next-generation sequencing, is providing a better understanding of seed oil biosynthesis and opportunities for increasing the production of unusual fatty acids. Certainly, it is now possible to modify the composition of plant oils successfully, and, in this review, the most recent developments in this field and the challenges of producing VLC-PUFAs in the seed oil of higher plants will be described.
Thermophilic cellobiohydrolase
Sapra, Rajat; Park, Joshua I.; Datta, Supratim; Simmons, Blake A.
2017-04-18
The present invention provides for a composition comprising a polypeptide comprising a first amino acid sequence having at least 70% identity with the amino acid sequence of Csac GH5 wherein said first amino acid sequence has a thermostable or thermophilic cellobiohydrolase (CBH) or exoglucanase activity.
Evolutionary phylodynamics of foot-and-mouth disease virus serotypes O and A circulating in Vietnam.
Le, Van Phan; Vu, Thi Thu Hang; Duong, Hong-Quan; Than, Van Thai; Song, Daesub
2016-11-29
Foot-and-mouth disease virus (FMDV) is one of the highest risk factors affecting the animal industry in Vietnam. The virus causes production losses and high mortality in young cloven-hoofed animals. The VP1 coding gene of 80 FMDV samples (66 samples of serotype O and 14 samples of serotype A) collected from endemic outbreaks during 2006-2014 was analyzed to investigate their phylogeny and genetic relationship with other available FMDVs globally. Phylogenetic analysis indicated that the serotype O strains clustered into two distinct viral topotypes (SEA and ME-SA), while the serotype A strains all clustered into genotype IX. Among the study strains, the amino acid sequence identities were 90.1-100%, 92.9-100%, and 92.8-100% for the topotypes SEA, ME-SA, and genotype IX, respectively. Substitutions leading to amino acid changes that are critical for the VP1 antigenic sites were also identified. Our results showed that the studied strains are most closely related to the recent FMDV isolates from Southeast Asian countries (Myanmar, Thailand, Cambodia, Malaysia, and Laos), but are distinct from the earlier FMDV isolates within the genotypes. This study provides important evidence of the recent movement of FMDV serotypes O and A into Vietnam within the last decade and of their close genetic relationship to strains causing FMD in surrounding countries.
Computer-aided visualization and analysis system for sequence evaluation
Chee, M.S.
1998-08-18
A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device. 27 figs.
Computer-aided visualization and analysis system for sequence evaluation
Chee, Mark S.; Wang, Chunwei; Jevons, Luis C.; Bernhart, Derek H.; Lipshutz, Robert J.
2004-05-11
A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation
Chee, Mark S.
1998-08-18
A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation
Chee, Mark S.
2003-08-19
A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2014-03-18
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6 (SEQ ID NO:1 encodes the full length endoglucanase; SEQ ID NO:4 encodes the mature form), and the corresponding endoglucanase VI amino acid sequence ("EGVI"; SEQ ID NO:3 is the signal sequence; SEQ ID NO:2 is the mature sequence). The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
Labeled nucleotide phosphate (NP) probes
Korlach, Jonas [Ithaca, NY]; Webb, Watt W [Ithaca, NY]; Levene, Michael [Ithaca, NY]; Turner, Stephen [Ithaca, NY]; Craighead, Harold G [Ithaca, NY]; Foquet, Mathieu [Ithaca, NY]
2009-02-03
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Johnson, Paul A; Tencate, James A; Le Bas, Pierre-Yves; Guyer, Robert; Vu, Cung Khac; Skelt, Christopher
2013-11-05
In some aspects of the disclosure, a method and an apparatus are disclosed for investigating material surrounding a borehole. The method includes generating a first low frequency acoustic wave within the borehole, wherein the first low frequency acoustic wave induces a linear and a nonlinear response in one or more features in the material that are substantially perpendicular to a radius of the borehole; directing a first sequence of high frequency pulses in a direction perpendicular to the longitudinal axis of the borehole into the material contemporaneously with the first acoustic wave; and receiving one or more second high frequency pulses at one or more receivers positionable in the borehole produced by an interaction between the first sequence of high frequency pulses and the one or more features undergoing linear and nonlinear elastic distortion due to the first low frequency acoustic wave to investigate the material surrounding the borehole.
Yasuno, Rie; Wada, Hajime
1998-01-01
Lipoic acid is a coenzyme that is essential for the activity of enzyme complexes such as those of pyruvate dehydrogenase and glycine decarboxylase. We report here the isolation and characterization of LIP1 cDNA for lipoic acid synthase of Arabidopsis. The Arabidopsis LIP1 cDNA was isolated using an expressed sequence tag homologous to the lipoic acid synthase of Escherichia coli. This cDNA was shown to code for Arabidopsis lipoic acid synthase by its ability to complement a lipA mutant of E. coli defective in lipoic acid synthase. DNA-sequence analysis of the LIP1 cDNA revealed an open reading frame predicting a protein of 374 amino acids. Comparisons of the deduced amino acid sequence with those of E. coli and yeast lipoic acid synthase homologs showed a high degree of sequence similarity and the presence of a leader sequence presumably required for import into the mitochondria. Southern-hybridization analysis suggested that LIP1 is a single-copy gene in Arabidopsis. Western analysis with an antibody against lipoic acid synthase demonstrated that this enzyme is located in the mitochondrial compartment in Arabidopsis cells as a 43-kD polypeptide. PMID:9808738
Trichoderma β-glucosidase
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-01-03
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
Computer-aided visualization and analysis system for sequence evaluation
Chee, Mark S.
1999-10-26
A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Computer-aided visualization and analysis system for sequence evaluation
Chee, Mark S.
2001-06-05
A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Carbohydrate degrading polypeptide and uses thereof
Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter
2015-10-20
The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Singh, Aditya; Bhatia, Prateek
2016-12-01
Sanger sequencing platforms, such as Applied Biosystems instruments, generate chromatogram files. Generally, each region of a sequence is sequenced with both forward and reverse primers, yielding 2 sequences that must be aligned and merged into a consensus before mutation detection studies. This work is cumbersome and time-consuming, especially if the gene is large with many exons. Hence, we devised a rapid automated command system to filter, build, and align consensus sequences and also optionally extract exonic regions, translate them in all frames, and perform an amino acid alignment starting from raw sequence data within a very short time. At full capability, the Automated Mutation Analysis Pipeline (ASAP) can read "*.ab1" chromatogram files through a command line interface, convert them to the FASTQ format, trim the low-quality regions, reverse-complement the reverse sequence, create a consensus sequence, extract the exonic regions using a reference exonic sequence, translate the sequence in all frames, and align the nucleic acid and amino acid sequences to reference nucleic acid and amino acid sequences, respectively. All output files are saved and can be used for further analysis. ASAP is available as a Python 3.x executable at https://github.com/aditya-88/ASAP. The version described in this paper is 0.28.
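Two of the pipeline steps named above, reverse-complementing the reverse read and translating a sequence in all frames, can be sketched in a few lines. This is an illustration under stated assumptions and is not ASAP's actual code: the function names are hypothetical, the standard genetic code is assumed, and any codon containing an ambiguous base is reported as "X".

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq: str, frame: int = 0) -> str:
    """Translate one reading frame (0, 1 or 2); unknown codons become 'X'."""
    seq = seq.upper()
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def six_frame(seq: str) -> dict:
    """Translate all three forward and all three reverse-strand frames."""
    rc = reverse_complement(seq)
    frames = {f"+{f + 1}": translate(seq, f) for f in range(3)}
    frames.update({f"-{f + 1}": translate(rc, f) for f in range(3)})
    return frames


# Toy sequence (hypothetical)
print(reverse_complement("ATGGCCTAA"))  # TTAGGCCAT
print(six_frame("ATGGCCTAA")["+1"])     # MA*
```

Trimming, consensus building and alignment are left out here because they depend on quality scores and alignment tools that the abstract does not specify.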
Nucleic acid analysis using terminal-phosphate-labeled nucleotides
Korlach, Jonas [Ithaca, NY]; Webb, Watt W [Ithaca, NY]; Levene, Michael [Ithaca, NY]; Turner, Stephen [Ithaca, NY]; Craighead, Harold G [Ithaca, NY]; Foquet, Mathieu [Ithaca, NY]
2008-04-22
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Yang, Hyun-Jong; Shin, Joo-Ho; Diaz-Camacho, Sylvia Paz; Nawa, Yukifumi; Kang, Insug; Kong, Yoon
2012-01-01
Background Fatty acid (FA) binding proteins (FABPs) of helminths are implicated in acquisition and utilization of host-derived hydrophobic substances, as well as in signaling and cellular interactions. We previously demonstrated that secretory hydrophobic ligand binding proteins (HLBPs) of Taenia solium metacestode (TsM), a causative agent of neurocysticercosis (NC), shuttle FAs in the surrounding host tissues and inwardly transport the FAs across the parasite syncytial membrane. However, the protein molecules responsible for the intracellular trafficking and assimilation of FAs have remained elusive. Methodology/Principal Findings We isolated two novel TsMFABP genes (TsMFABP1 and TsMFABP2), which encoded 133- and 136-amino acid polypeptides with predicted molecular masses of 14.3 and 14.8 kDa, respectively. They shared 45% sequence identity with each other and 15–95% with other related members. Homology modeling demonstrated a characteristic β-barrel composed of 10 anti-parallel β-strands and two α-helices. TsMFABP2 harbored two additional loops between β-strands two and three, and β-strands six and seven, respectively. TsMFABP1 was secreted into cyst fluid and surrounding environments, whereas TsMFABP2 was intracellularly confined. Partially purified native proteins migrated to 15 kDa with different isoelectric points of 9.2 (TsMFABP1) and 8.4 (TsMFABP2). Both native and recombinant proteins bound to 11-([5-dimethylaminonaphthalene-1-sulfonyl]amino)undecanoic acid, dansyl-DL-α-amino-caprylic acid, cis-parinaric acid and retinol, which were competitively inhibited by oleic acid. TsMFABP1 exhibited high affinity toward FA analogs. TsMFABPs showed weak binding activity to retinol, but TsMFABP2 showed relatively high affinity. Isolation of two distinct genes from an individual genome strongly suggested their paralogous nature. Abundant expression of TsMFABP1 and TsMFABP2 in the canal region of the worm matched well with the histological distributions of lipids and retinol. Conclusions/Significance The divergent biochemical properties, physiological roles and cellular distributions of the TsMFABPs might be one of the critical mechanisms compensating for inadequate de novo FA synthesis. These proteins might exert harmonized or independent roles on lipid assimilation and intracellular signaling. The specialized distribution of retinol in the canal region further implies that cells in this region might differentiate into diverse cell types during metamorphosis into an adult worm. Identification of bioactive systems pertinent to parasitic homeostasis may provide a valuable target for function-related drug design. PMID:23150743
Studier, F. William
1995-04-18
Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.
Studier, F.W.
1995-04-18
Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.
Anisimov, Andrey P; Panfertsev, Evgeniy A; Svetoch, Tat'yana E; Dentovskaya, Svetlana V
2007-01-01
Sequencing of lcrV genes and comparison of the deduced amino acid sequences from ten Y. pestis strains belonging mostly to the group of atypical rhamnose-positive isolates (non-pestis subspecies or pestoides group) showed that the LcrV proteins analyzed could be classified into five sequence types. This classification was based on major amino acid polymorphisms among LcrV proteins in the four "hot points" of the protein sequences. Some additional minor polymorphisms were found throughout these sequence types. The "hot points" corresponded to amino acids 18 (Lys --> Asn), 72 (Lys --> Arg), 273 (Cys --> Ser), and 324-326 (Ser-Gly-Lys --> Arg) in the LcrV sequence of the reference Y. pestis strain CO92. One possible explanation for polymorphism in amino acid sequences of LcrV among different strains is that strain-specific variation resulted from adaptation of the plague pathogen to different rodent and lagomorph hosts.
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses
USDA-ARS's Scientific Manuscript database
Background: Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected ...
β-glucosidase 5 (BGL5) compositions
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2010-06-01
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
Antell, Gregory C.; Zhong, Wen; Kercher, Katherine; Passic, Shendra; Williams, Jean; Liu, Yucheng; James, Tony; Jacobson, Jeffrey M.; Szep, Zsofia
2017-01-01
Vpr is an HIV-1 accessory protein that plays numerous roles during viral replication, some of which are cell type dependent. To test the hypothesis that HIV-1 tropism extends beyond the envelope into the vpr gene, studies were performed to identify the associations between coreceptor usage and Vpr variation in HIV-1-infected patients. Colinear HIV-1 Env-V3 and Vpr amino acid sequences were obtained from the LANL HIV-1 sequence database and from well-suppressed patients in the Drexel/Temple Medicine CNS AIDS Research and Eradication Study (CARES) Cohort. Genotypic classification of Env-V3 sequences as X4 (CXCR4-utilizing) or R5 (CCR5-utilizing) was used to group colinear Vpr sequences. To reveal the sequences associated with a specific coreceptor usage genotype, Vpr amino acid sequences were assessed for amino acid diversity and Jensen-Shannon divergence between the two groups. Five amino acid alphabets were used to comprehensively examine the impact of amino acid substitutions involving side chains with similar physicochemical properties. Positions 36, 37, 41, 89, and 96 of Vpr were characterized by statistically significant divergence across multiple alphabets when X4 and R5 sequence groups were compared. In addition, consensus amino acid switches were found at positions 37 and 41 in comparisons of the R5 and X4 sequence populations. These results suggest an evolutionary link between Vpr and gp120 in HIV-1-infected patients. PMID:28620613
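The per-position comparison described above rests on the Jensen-Shannon divergence between the residue distributions of the two groups. A minimal sketch of that measure follows, using base-2 logarithms so the value lies between 0 (identical distributions) and 1 (no residues shared). The function names and the toy residue columns are hypothetical, and the study's reduced alphabets and significance testing are not reproduced here.

```python
from collections import Counter
from math import log2

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def column_distribution(residues: str, alphabet: str = AMINO_ACIDS) -> list:
    """Relative frequency of each alphabet symbol in one alignment column."""
    counts = Counter(residues.upper())
    total = sum(counts[symbol] for symbol in alphabet)
    if total == 0:
        raise ValueError("column contains no symbols from the alphabet")
    return [counts[symbol] / total for symbol in alphabet]


def jensen_shannon(p: list, q: list) -> float:
    """Jensen-Shannon divergence (base 2) between two discrete distributions."""
    mixture = [(pi + qi) / 2 for pi, qi in zip(p, q)]

    def kl(a, b):
        # Terms with a == 0 contribute nothing to the Kullback-Leibler sum.
        return sum(ai * log2(ai / bi) for ai, bi in zip(a, b) if ai > 0)

    return 0.5 * kl(p, mixture) + 0.5 * kl(q, mixture)


# Hypothetical residues observed at one Vpr position in each group
x4_column = "RRRRIRRI"
r5_column = "IIIIIRII"
p = column_distribution(x4_column)
q = column_distribution(r5_column)
print(round(jensen_shannon(p, q), 3))
```

Repeating the calculation with a reduced alphabet only requires mapping each residue to its class before calling `column_distribution` with the class symbols as the alphabet.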
Methods of diagnosing Alagille syndrome
Li, Linheng; Hood, Leroy; Krantz, Ian D.; Spinner, Nancy B.
2004-03-09
The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
Complete amino acid sequence of bovine colostrum low-Mr cysteine proteinase inhibitor.
Hirado, M; Tsunasawa, S; Sakiyama, F; Niinobe, M; Fujii, S
1985-07-01
The complete amino acid sequence of bovine colostrum cysteine proteinase inhibitor was determined by sequencing native inhibitor and peptides obtained by cyanogen bromide degradation, Achromobacter lysylendopeptidase digestion and partial acid hydrolysis of reduced and S-carboxymethylated protein. Achromobacter peptidase digestion was successfully used to isolate two disulfide-containing peptides. The inhibitor consists of 112 amino acids with an Mr of 12787. Two disulfide bonds were established between Cys 66 and Cys 77 and between Cys 90 and Cys 110. A high degree of homology in the sequence was found between the colostrum inhibitor and human gamma-trace, human salivary acidic protein and chicken egg-white cystatin.
NASA Astrophysics Data System (ADS)
Mass, T.; Drake, J.; Haramaty, L.; Zelzion, U.; Bhattacharya, D.; Rosenthal, Y.; Falkowski, P. G.
2012-12-01
Atmospheric CO2 levels are rising rapidly, resulting in a decrease in both oceanic pH and the carbonate saturation state (Ω). It has been hypothesized that calcifying marine organisms, including reef-building corals, will be affected by the decline of the carbonate saturation state. However, it is still unclear how corals will respond to these changes, as their skeletal formation is biologically mediated and occurs in an isolated space rather than directly from seawater. In corals, new skeletal material is precipitated in the subcalicoblastic space between the skeleton and the calicoblastic epithelium, which does not exceed a few nanometers and contains the "calcifying fluid". The goal of our project is to understand how these fluids respond to changes in the surrounding seawater and in turn affect the biologically mediated calcification mechanisms at the molecular, cellular and tissue levels. While it is generally thought that an organic matrix, which contains a suite of proteins, lipids and polysaccharides, takes part in the calcification process, the specific mechanism by which the mineral is precipitated is unknown. The organic matrix is composed of two fractions: the soluble organic matrix (SOM) and the insoluble organic matrix (IOM). It is suggested that the IOM plays a role as structural proteins forming a framework for crystal growth whereas the SOM plays a role in nucleation and crystal growth. To address this question we have investigated both the structural framework proteins (Drake et al., abstract submitted to the AGU fall meeting) and the role of proteins in nucleation and crystal growth (this work). Here, we established cell cultures and sequenced the 458-megabase genome of the stony coral, Stylophora pistillata, using next-generation sequencing technology. This genome contains 21,678 predicted protein-coding genes. Many of the known protein components of invertebrate skeletal matrices are acidic and/or contain repeated sequences. We searched for genes encoding proteins with the following characteristics: (1) high content (>35%) of acidic amino acids (aspartate and glutamate), (2) at least 150 residues, and (3) a signal peptide. This approach revealed eight coral acidic proteins (CAPs). We confirmed the sequence of four candidates (CAP1-3 and 8). A search for similarity in the UniProt database and published coral genomes and ESTs reveals high similarity of CAPs 2, 3, and 8 to both invertebrate and vertebrate acidic-rich proteins. CAP1, however, has no significant matches in any of the databases. We expressed and purified these four proteins to examine their role in coral biomineralization. We show that the pure CAPs bind Ca2+, and furthermore, individual CAPs can precipitate calcium carbonate in vitro in artificial seawater. These results strongly suggest that aragonite precipitation in the calcifying region of corals is promoted by a highly conserved set of acidic proteins. Based purely on thermodynamic grounds, the predicted change in surface ocean pH should not affect the binding ability of these acidic proteins. To the extent these proteins are responsible for calcification in this and other corals, we suggest that corals will continue to calcify at pH changes predicted to occur in this century.
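The three search criteria listed in the abstract (more than 35% aspartate plus glutamate, at least 150 residues, and a signal peptide) amount to a simple filter over predicted proteins. The sketch below shows one way to express it; the function names are hypothetical, and the signal-peptide call is assumed to come from an external predictor and is passed in as a boolean, since the abstract does not say how it was made.

```python
def acidic_fraction(protein: str) -> float:
    """Fraction of residues that are aspartate (D) or glutamate (E)."""
    protein = protein.upper().rstrip("*")  # drop a trailing stop symbol if present
    if not protein:
        return 0.0
    return (protein.count("D") + protein.count("E")) / len(protein)


def is_candidate(protein: str, has_signal_peptide: bool,
                 min_acidic: float = 0.35, min_length: int = 150) -> bool:
    """Apply the three criteria: acidic content, length, and signal peptide.

    has_signal_peptide is supplied by the caller (assumption: it comes from
    a separate prediction step that is not modeled here).
    """
    length = len(protein.rstrip("*"))
    return (length >= min_length
            and acidic_fraction(protein) > min_acidic
            and has_signal_peptide)


# Toy sequences (hypothetical, not real coral proteins)
acidic_toy = "M" + "DE" * 80     # 161 residues, almost entirely acidic
neutral_toy = "M" + "A" * 160    # 161 residues, no acidic residues
print(is_candidate(acidic_toy, has_signal_peptide=True))   # True
print(is_candidate(neutral_toy, has_signal_peptide=True))  # False
```

Keeping the thresholds as parameters makes it easy to see how many candidates survive under looser or stricter cutoffs.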
Detection and isolation of nucleic acid sequences using competitive hybridization probes
Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.
1997-01-01
A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.
Detection and isolation of nucleic acid sequences using competitive hybridization probes
Lucas, J.N.; Straume, T.; Bogen, K.T.
1997-04-01
A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.
Mutation analysis in a German family identified a new cataract-causing allele in the CRYBB2 gene
Pauli, Silke; Söker, Torben; Klopp, Norman; Illig, Thomas; Engel, Wolfgang
2007-01-01
Purpose The study demonstrates the functional candidate gene analysis in a cataract family of German descent. Methods We screened a German family, clinically documented to have congenital cataracts, for mutations in the candidate genes CRYG (A to D) and CRYBB2 through polymerase chain reaction analyses and sequencing. Results Congenital cataract was first observed in a daughter of healthy parents. Her two children (a boy and a girl) also suffer from congenital cataracts and underwent surgery within the first weeks after birth. Morphologically, the cataract is characterized as nuclear with an additional ring-shaped cortical opacity. Molecular analysis revealed no causative mutation in any of the CRYG genes. However, sequencing of the exons of the CRYBB2 gene identified a sequence variation in exon 5 (383 A>T) with a substitution of Asp to Val at position 128. All three affected family members revealed this change but it was not observed in any of the unaffected persons of the family. The putative mutation creates a restriction site for the enzyme TaiI. This mutation was checked for in controls of randomly selected DNA samples from ophthalmologically normal individuals from the population-based KORA S4 study (n=96) and no mutation was observed. Moreover, the Asp at position 128 is within a stretch of 12 amino acids, which are highly conserved throughout the animal kingdom. For the mutant protein, the isoelectric point is raised from pH 6.50 to 6.75. Additionally, the random coil structure of the protein between the amino acids 126-139 is interrupted by a short extended strand structure. In addition, this region becomes hydrophobic (from neutral to +1) and the electrostatic potential in the region surrounding the exchanged amino acid alters from a mainly negative potential to an enlarged positive potential. Conclusions The D128V mutation segregates only in affected family members and is not seen in representative controls. It represents the first mutation outside exon 6 of the human CRYBB2 gene. PMID:17653036
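The link between the nucleotide change (383 A>T) and the amino acid change (Asp to Val at position 128) can be checked with a short calculation. The sketch below assumes that position 383 is counted in the coding sequence starting at the first base of the start codon; the abstract does not state the numbering scheme. The function names are hypothetical, and only the four codons needed for the check are listed.

```python
from typing import Tuple


def codon_position(cds_pos: int) -> Tuple[int, int]:
    """Map a 1-based coding-sequence position to (codon number, base within codon)."""
    if cds_pos < 1:
        raise ValueError("coding positions are 1-based")
    return (cds_pos - 1) // 3 + 1, (cds_pos - 1) % 3 + 1


def substitute(codon: str, pos_in_codon: int, new_base: str) -> str:
    """Return the codon with the base at the 1-based position replaced."""
    return codon[:pos_in_codon - 1] + new_base + codon[pos_in_codon:]


# Subset of the standard genetic code needed for this check
CODONS = {"GAT": "Asp", "GAC": "Asp", "GTT": "Val", "GTC": "Val"}

codon_number, base_in_codon = codon_position(383)
print(codon_number, base_in_codon)  # 128 2

# Both Asp codons have A in the second position; A>T there yields a Val codon.
for asp_codon in ("GAT", "GAC"):
    mutant = substitute(asp_codon, base_in_codon, "T")
    print(asp_codon, "->", mutant, CODONS[mutant])
```

Under that numbering assumption, position 383 falls on the second base of codon 128, which is consistent with the D128V substitution reported.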
Detection of nucleic acids by multiple sequential invasive cleavages
Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.
1999-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann; Kwiatkowski, Robert W.; Vavra, Stephanie H.
2005-03-29
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of nucleic acid from various viruses in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages 02
Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.
2002-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages
Hall, Jeff G; Lyamichev, Victor I; Mast, Andrea L; Brow, Mary Ann D
2012-10-16
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B
1986-01-01
A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. PMID:3012461
Hehle, Verena K.; Paul, Matthew J.; Roberts, Victoria A.; van Dolleweerd, Craig J.; Ma, Julian K.-C.
2016-01-01
This study examined the degradation pattern of a murine IgG1κ monoclonal antibody expressed in and extracted from transformed Nicotiana tabacum. Gel electrophoresis of leaf extracts revealed a consistent pattern of recombinant immunoglobulin bands, including intact and full-length antibody, as well as smaller antibody fragments. N-terminal sequencing revealed these smaller fragments to be proteolytic cleavage products and identified a limited number of protease-sensitive sites in the antibody light and heavy chain sequences. No strictly conserved target sequence was evident, although the peptide bonds that were susceptible to proteolysis were predominantly and consistently located within or near to the interdomain or solvent-exposed regions in the antibody structure. Amino acids surrounding identified cleavage sites were mutated in an attempt to increase resistance. Different Guy’s 13 antibody heavy and light chain mutant combinations were expressed transiently in N. tabacum and demonstrated intensity shifts in the fragmentation pattern, resulting in alterations to the full-length antibody-to-fragment ratio. The work strengthens the understanding of proteolytic cleavage of antibodies expressed in plants and presents a novel approach to stabilize full-length antibody by site-directed mutagenesis.—Hehle, V. K., Paul, M. J., Roberts, V. A., van Dolleweerd, C. J., Ma, J. K.-C. Site-targeted mutagenesis for stabilization of recombinant monoclonal antibody expressed in tobacco (Nicotiana tabacum) plants. PMID:26712217
Zhao, Yanting; Duan, Cuilan; Zhang, Xuxiang; Chen, Huangen; Ren, Hongqiang; Yin, Ying; Ye, Lin
2018-04-23
The gut microbiota of aquatic animals plays a crucial role in host health through nutrient acquisition and outcompetition of pathogens. In this study, based on the high-throughput sequencing of 16S rRNA gene amplicons, we examined the bacterial communities in the gut of freshwater shrimp (Macrobrachium nipponense) and in their living environments (sediment and pond water) and analyzed the effects of abiotic and biotic factors on the shrimp gut bacterial communities. High bacterial heterogeneity was observed in the freshwater shrimp gut samples, and the result indicated that both the surrounding bacterial community and water quality factors (particularly dissolved oxygen and temperature) could affect the shrimp gut bacterial community. Despite the observed heterogeneity, 57 genera, constituting 38~99% of the total genera in each of the 40 shrimp gut samples, were identified as the main bacterial population in the gut of M. nipponense. In addition, a high diversity and abundance of lactic acid bacteria (26 genera), which could play significant roles in the digestion process in shrimp, were observed in the shrimp gut samples. Overall, this study provides insights into the gut bacterial communities of freshwater shrimp and basic information for shrimp farming regarding the application of probiotics and disease prevention.
Method of increasing conversion of a fatty acid to its corresponding dicarboxylic acid
Craft, David L.; Wilson, C. Ron; Eirich, Dudley; Zhang, Yeyan
2004-09-14
A nucleic acid sequence including a CYP promoter operably linked to nucleic acid encoding a heterologous protein is provided to increase transcription of the nucleic acid. Expression vectors and host cells containing the nucleic acid sequence are also provided. The methods and compositions described herein are especially useful in the production of polycarboxylic acids by yeast cells.
Konami, Y; Yamamoto, K; Osawa, T; Irimura, T
1995-04-01
The complete amino acid sequence of a lactose-binding Cytisus sessilifolius anti-H(O) lectin II (CSA-II) was determined using a protein sequencer. After digestion of CSA-II with endoproteinase Lys-C or Asp-N, the resulting peptides were purified by reversed-phase high performance liquid chromatography (HPLC) and then subjected to sequence analysis. Comparison of the complete amino acid sequence of CSA-II with the sequences of other leguminous seed lectins revealed regions of extensive homology. The amino acid sequence of a putative carbohydrate-binding domain of CSA-II was found to be similar to those of several anti-H(O) leguminous lectins, especially to that of the L-fucose-binding Ulex europaeus lectin I (UEA-I).
WEB-server for search of a periodicity in amino acid and nucleotide sequences
NASA Astrophysics Data System (ADS)
E Frenkel, F.; Skryabin, K. G.; Korotkov, E. V.
2017-12-01
A new web server (http://victoria.biengi.ac.ru/splinter/login.php) was designed and developed to search for periodicity in nucleotide and amino acid sequences. The web server is based on a new mathematical method of searching for multiple alignments, which relies on optimization of position weight matrices and on two-dimensional dynamic programming. This approach allows the construction of multiple alignments of weakly similar amino acid and nucleotide sequences that have accumulated more than 1.5 substitutions per amino acid or nucleotide, without performing pairwise sequence comparisons. The article examines the principles of the web server operation and two examples of studying amino acid and nucleotide sequences, as well as the information that can be obtained using the web server.
Using Behavior Sequence Analysis to Map Serial Killers' Life Histories.
Keatley, David A; Golightly, Hayley; Shephard, Rebecca; Yaksic, Enzo; Reid, Sasha
2018-03-01
The aim of the current research was to provide a novel method for mapping the developmental sequences of serial killers' life histories. An in-depth biographical account of serial killers' lives, from birth through to conviction, was gained and analyzed using Behavior Sequence Analysis. The analyses highlight similarities in behavioral events across the serial killers' lives, indicating not only which risk factors occur, but the temporal order of these factors. Results focused on early childhood environment, indicating the role of parental abuse; behaviors and events surrounding criminal histories of serial killers, showing that many had previous convictions and were known to police for other crimes; behaviors surrounding their murders, highlighting differences in victim choice and modus operandi; and, finally, trial pleas and convictions. The present research, therefore, provides a novel approach to synthesizing large volumes of data on criminals and presenting results in accessible, understandable outcomes.
Sequence of the chloroplast 16S rRNA gene and its surrounding regions of Chlamydomonas reinhardii.
Dron, M; Rahire, M; Rochaix, J D
1982-01-01
The sequence of a 2 kb DNA fragment containing the chloroplast 16S ribosomal RNA gene from Chlamydomonas reinhardii and its flanking regions has been determined. The algal 16S rRNA sequence (1475 nucleotides) and secondary structure are highly related to those found in bacteria and in the chloroplasts of higher plants. In contrast, the flanking regions are very different. In C. reinhardii the 16S rRNA gene is surrounded by AT rich segments of about 180 bases, which are followed by a long stretch of complementary bases separated from each other by 1833 nucleotides. It is likely that these structures play an important role in the folding and processing of the precursor of 16S rRNA. The primary and secondary structures of the binding sites of two ribosomal proteins in the 16S rRNAs of E. coli and C. reinhardii are considerably related. PMID:6296784
A duplicate gene rooting of seed plants and the phylogenetic position of flowering plants
Mathews, Sarah; Clements, Mark D.; Beilstein, Mark A.
2010-01-01
Flowering plants represent the most significant branch in the tree of land plants, with respect to the number of extant species, their impact on the shaping of modern ecosystems and their economic importance. However, unlike so many persistent phylogenetic problems that have yielded to insights from DNA sequence data, the mystery surrounding the origin of angiosperms has deepened with the advent and advance of molecular systematics. Strong statistical support for competing hypotheses and recent novel trees from molecular data suggest that the accuracy of current molecular trees requires further testing. Analyses of phytochrome amino acids using a duplicate gene-rooting approach yield trees that unite cycads and angiosperms in a clade that is sister to a clade in which Ginkgo and Cupressophyta are successive sister taxa to gnetophytes plus Pinaceae. Application of a cycads + angiosperms backbone constraint in analyses of a morphological dataset yields better resolved trees than do analyses in which extant gymnosperms are forced to be monophyletic. The results have implications both for our assessment of uncertainty in trees from sequence data and for our use of molecular constraints as a way to integrate insights from morphological and molecular evidence. PMID:20047866
Identifying protein kinase target preferences using mass spectrometry
Douglass, Jacqueline; Gunaratne, Ruwan; Bradford, Davis; Saeed, Fahad; Hoffert, Jason D.; Steinbach, Peter J.; Pisitkun, Trairak
2012-01-01
A general question in molecular physiology is how to identify candidate protein kinases corresponding to a known or hypothetical phosphorylation site in a protein of interest. It is generally recognized that the amino acid sequence surrounding the phosphorylation site provides information that is relevant to identification of the cognate protein kinase. Here, we present a mass spectrometry-based method for profiling the target specificity of a given protein kinase as well as a computational tool for the calculation and visualization of the target preferences. The mass spectrometry-based method identifies sites phosphorylated in response to in vitro incubation of protein mixtures with active recombinant protein kinases followed by standard phosphoproteomic methodologies. The computational tool, called “PhosphoLogo,” uses an information-theoretic algorithm to calculate position-specific amino acid preferences and anti-preferences from the mass-spectrometry data (http://helixweb.nih.gov/PhosphoLogo/). The method was tested using protein kinase A (catalytic subunit α), revealing the well-known preference for basic amino acids in positions −2 and −3 relative to the phosphorylated amino acid. It also provides evidence for a preference for amino acids with a branched aliphatic side chain in position +1, a finding compatible with known crystal structures of protein kinase A. The method was also employed to profile target preferences and anti-preferences for 15 additional protein kinases with potential roles in regulation of epithelial transport: CK2, p38, AKT1, SGK1, PKCδ, CaMK2δ, DAPK1, MAPKAPK2, PKD3, PIM1, OSR1, STK39/SPAK, GSK3β, Wnk1, and Wnk4. PMID:22723110
DeWitt, D L; Smith, W L
1988-01-01
Prostaglandin G/H synthase (8,11,14-icosatrienoate, hydrogen-donor:oxygen oxidoreductase, EC 1.14.99.1) catalyzes the first step in the formation of prostaglandins and thromboxanes, the conversion of arachidonic acid to prostaglandin endoperoxides G and H. This enzyme is the site of action of nonsteroidal anti-inflammatory drugs. We have isolated a 2.7-kilobase complementary DNA (cDNA) encompassing the entire coding region of prostaglandin G/H synthase from sheep vesicular glands. This cDNA, cloned from a lambda gt 10 library prepared from poly(A)+ RNA of vesicular glands, hybridizes with a single 2.75-kilobase mRNA species. The cDNA clone was selected using oligonucleotide probes modeled from amino acid sequences of tryptic peptides prepared from the purified enzyme. The full-length cDNA encodes a protein of 600 amino acids, including a signal sequence of 24 amino acids. Identification of the cDNA as coding for prostaglandin G/H synthase is based on comparison of amino acid sequences of seven peptides comprising 103 amino acids with the amino acid sequence deduced from the nucleotide sequence of the cDNA. The molecular weight of the unglycosylated enzyme lacking the signal peptide is 65,621. The synthase is a glycoprotein, and there are three potential sites for N-glycosylation, two of them in the amino-terminal half of the molecule. The serine reported to be acetylated by aspirin is at position 530, near the carboxyl terminus. There is no significant similarity between the sequence of the synthase and that of any other protein in amino acid or nucleotide sequence libraries, and a heme binding site(s) is not apparent from the amino acid sequence. The availability of a full-length cDNA clone coding for prostaglandin G/H synthase should facilitate studies of the regulation of expression of this enzyme and the structural features important for catalysis and for interaction with anti-inflammatory drugs. PMID:3125548
Lazzarato, F; Franceschinis, G; Botta, M; Cordero, F; Calogero, R A
2004-11-01
RRE allows the extraction of non-coding regions surrounding a coding sequence [i.e. gene upstream region, 5'-untranslated region (5'-UTR), introns, 3'-UTR, downstream region] from annotated genomic datasets available at NCBI. RRE parser and web-based interface are accessible at http://www.bioinformatica.unito.it/bioinformatics/rre/rre.html
PubDNA Finder: a web database linking full-text articles to sequences of nucleic acids.
García-Remesal, Miguel; Cuevas, Alejandro; Pérez-Rey, David; Martín, Luis; Anguita, Alberto; de la Iglesia, Diana; de la Calle, Guillermo; Crespo, José; Maojo, Víctor
2010-11-01
PubDNA Finder is an online repository that we have created to link PubMed Central manuscripts to the sequences of nucleic acids appearing in them. It extends the search capabilities provided by PubMed Central by enabling researchers to perform advanced searches involving sequences of nucleic acids. This includes, among other features (i) searching for papers mentioning one or more specific sequences of nucleic acids and (ii) retrieving the genetic sequences appearing in different articles. These additional query capabilities are provided by a searchable index that we created by using the full text of the 176 672 papers available at PubMed Central at the time of writing and the sequences of nucleic acids appearing in them. To automatically extract the genetic sequences occurring in each paper, we used an original method we have developed. The database is updated monthly by automatically connecting to the PubMed Central FTP site to retrieve and index new manuscripts. Users can query the database via the web interface provided. PubDNA Finder can be freely accessed at http://servet.dia.fi.upm.es:8080/pubdnafinder
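As a rough illustration of what extracting nucleic acid sequences from article text can involve, the sketch below uses a simple regular expression. This is not the authors' extraction method, which the abstract describes only as original; the minimum length of 12 bases, the restriction to A, C, G and T, and the function name are all assumptions made for the example.

```python
import re

def find_dna_sequences(text, min_length=12):
    """Return runs of A/C/G/T of at least min_length that are not part of a longer word.
    A naive heuristic: it misses sequences containing spaces, U, or ambiguity codes."""
    pattern = re.compile(
        r"(?<![A-Za-z])[ACGTacgt]{%d,}(?![A-Za-z])" % min_length
    )
    return [match.group(0).upper() for match in pattern.finditer(text)]

sample = "The forward primer 5'-ATGGCCATTGTAATGGGCC-3' was diluted in CAT buffer."
print(find_dna_sequences(sample))
```

Short tokens such as "CAT" are skipped because of the length threshold. A production extractor would also need to handle sequences broken across lines or grouped in blocks of ten bases.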
NASA Astrophysics Data System (ADS)
Wu, Yue-Hong; Liao, Li; Wang, Chun-Sheng; Ma, Wei-Lin; Meng, Fan-Xu; Wu, Min; Xu, Xue-Wei
2013-09-01
Deep-sea polymetallic nodules, rich in metals such as Fe, Mn, and Ni, are potential resources for future exploitation. Early culturing and microscopy studies suggest that polymetallic nodules are at least partially biogenic. To understand the microbial communities in this environment, we compared microbial community composition and diversity inside nodules and in the surrounding sediments. Three sampling sites in the Pacific Ocean containing polymetallic nodules were used for culture-independent investigations of microbial diversity. A total of 1013 near full-length bacterial 16S rRNA gene sequences and 640 archaeal 16S rRNA gene sequences of ~650 bp from nodules and the surrounding sediments were analyzed. Bacteria showed higher diversity than archaea. Interestingly, sediments contained more diverse bacterial communities than nodules, while the opposite was detected for archaea. Bacterial communities tended to be mostly unique to sediments or nodules, with only 13.3% of sequences shared. The most abundant bacterial groups detected only in nodules were Pseudoalteromonas and Alteromonas, which were predicted to play a role in building matrix outside cells to induce or control mineralization. However, archaeal communities were mostly shared between sediments and nodules, including the most abundant OTU containing 290 sequences from marine group I Thaumarchaeota. PCoA analysis indicated that microhabitat (i.e., nodule or sediment) seemed to be a major factor influencing microbial community composition, rather than sampling locations or distances between locations.
Modulation of V1 Spike Response by Temporal Interval of Spatiotemporal Stimulus Sequence
Kim, Taekjun; Kim, HyungGoo R.; Kim, Kayeon; Lee, Choongkil
2012-01-01
The spike activity of single neurons of the primary visual cortex (V1) becomes more selective and reliable in response to wide-field natural scenes compared to smaller stimuli confined to the classical receptive field (RF). However, it is largely unknown what aspects of natural scenes increase the selectivity of V1 neurons. One hypothesis is that modulation by surround interaction is highly sensitive to small changes in spatiotemporal aspects of RF surround. Such a fine-tuned modulation would enable single neurons to hold information about spatiotemporal sequences of oriented stimuli, which extends the role of V1 neurons as a simple spatiotemporal filter confined to the RF. In the current study, we examined the hypothesis in the V1 of awake behaving monkeys, by testing whether the spike response of single V1 neurons is modulated by temporal interval of spatiotemporal stimulus sequence encompassing inside and outside the RF. We used two identical Gabor stimuli that were sequentially presented with a variable stimulus onset asynchrony (SOA): the preceding one (S1) outside the RF and the following one (S2) in the RF. This stimulus configuration enabled us to examine the spatiotemporal selectivity of response modulation from a focal surround region. Although S1 alone did not evoke spike responses, visual response to S2 was modulated for SOA in the range of tens of milliseconds. These results suggest that V1 neurons participate in processing spatiotemporal sequences of oriented stimuli extending outside the RF. PMID:23091631
21 CFR 137.270 - Self-rising white corn meal.
Code of Federal Regulations, 2012 CFR
2012-04-01
... 45 cc. acid used in the decomposition). Observe the temperature of the air surrounding the apparatus... is an intimate mixture of white corn meal, sodium bicarbonate, and one or both of the acid-reacting... dioxide is evolved. The acid-reacting substance is added in sufficient quantity to neutralize the sodium...
21 CFR 137.270 - Self-rising white corn meal.
Code of Federal Regulations, 2014 CFR
2014-04-01
... 45 cc. acid used in the decomposition). Observe the temperature of the air surrounding the apparatus... is an intimate mixture of white corn meal, sodium bicarbonate, and one or both of the acid-reacting... dioxide is evolved. The acid-reacting substance is added in sufficient quantity to neutralize the sodium...
21 CFR 137.270 - Self-rising white corn meal.
Code of Federal Regulations, 2013 CFR
2013-04-01
... 45 cc. acid used in the decomposition). Observe the temperature of the air surrounding the apparatus... is an intimate mixture of white corn meal, sodium bicarbonate, and one or both of the acid-reacting... dioxide is evolved. The acid-reacting substance is added in sufficient quantity to neutralize the sodium...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peters, J.; Peters, M.; Lottspeich, F.
1987-11-01
The complete nucleotide sequence of the gene encoding the surface (hexagonally packed intermediate (HPI))-layer polypeptide of Deinococcus radiodurans Sark was determined and found to encode a polypeptide of 1036 amino acids. Amino acid sequence analysis of about 30% of the residues revealed that the mature polypeptide consists of at least 978 amino acids. The N terminus was blocked to Edman degradation. The results of proteolytic modification of the HPI layer in situ and M_r estimations of the HPI polypeptide expressed in Escherichia coli indicated that there is a leader sequence. The N-terminal region contained a very high percentage (29%) of threonine and serine, including a cluster of nine consecutive serine or threonine residues, whereas a stretch near the C terminus was extremely rich in aromatic amino acids (29%). The protein contained at least two disulfide bridges, as well as tightly bound reducing sugars and fatty acids.
Artificial mismatch hybridization
Guo, Zhen; Smith, Lloyd M.
1998-01-01
An improved nucleic acid hybridization process is provided which employs a modified oligonucleotide and improves the ability to discriminate a control nucleic acid target from a variant nucleic acid target containing a sequence variation. The modified probe contains at least one artificial mismatch relative to the control nucleic acid target in addition to any mismatch(es) arising from the sequence variation. The invention has direct and advantageous application to numerous existing hybridization methods, including applications that employ, for example, the Polymerase Chain Reaction, allele-specific nucleic acid sequencing methods, and diagnostic hybridization methods.
Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe
Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.
2000-01-01
A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.
Kurosu, Y; Murayama, K; Shindo, N; Shisa, Y; Ishioka, N
1996-11-01
This initial report proposes a protein sequence analysis system with DL differentiation using capillary electrophoresis (CE). The system consists of a protein sequencer and a CE system. After fractionation of phenylthiohydantoin (PTH)-amino acids using a protein sequencer, optical resolution of each PTH-amino acid is performed by CE using chiral selectors such as digitonin, beta-escin and others. [D-Ala2]-methionine enkephalin (L-Tyr-D-Ala-Gly-L-Phe-L-Met) was used as a model peptide, and its sequence with DL differentiation was determined using the proposed system, with the exception of the fourth amino acid, L-Phe.
Influence of metakaolin on chemical resistance of concrete
NASA Astrophysics Data System (ADS)
Mlinárik, L.; Kopecskó, K.
2013-12-01
Concrete is today the most suitable and widely used construction material. It can be designed to meet almost any requirement on the properties of the fresh mix and the quality of the hardened material. Demand is rising for special concretes such as high performance and ultra high performance concretes (HPC, UHPC). These can be used in extreme natural conditions or in very corrosive surroundings (for example, sewage farms, sewers, cooling towers, and biogas plants). The pH of commercial sewage is between 7 and 8, but it is often around 4 or less. Concrete pipes that transport sewage corrode because sulphuric acid forms above the liquid level through microbial activity. Acidic surroundings can initiate the corrosion of concrete, and the influence of the acids increases as the pH falls. Sulphuric acid has the most significant influence; its pH is about 1 or less. In the past, the concrete walls of cooling towers in coal-fired thermal power stations were given a special coating. More recently, high performance concrete without a polymeric coating has become more common. Supplementary cementitious materials are widely used to protect concrete in these corrosive surroundings. Commonly used supplementary cementitious materials are ground granulated blast-furnace slag (GGBS), fly ash (FA), and silica fume (SF). In recent years there has been growing interest in the application of metakaolin. Metakaolin is made by heat treatment (calcination) of a natural clay mineral, kaolinite. In the present research, the chemical resistance of mortars in different corrosive surroundings (pH=1 sulphuric acid; pH=3 acetic acid) was studied, and chloride ion migration was measured on series of mortar samples using the rapid chloride migration test. Cement paste and mortar samples were made with 17% metakaolin replacement or without metakaolin. The following cements were used: CEM II/A-S 42.5 N, CEM I 42.5 N-S. 
We concluded that the replacement of cement by metakaolin results in significant increases in compressive and tensile strengths and it prevents the infiltration of harmful substances.
Structural Comparison of Different Antibodies Interacting with Parvovirus Capsids
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hafenstein, Susan; Bowman, Valorie D.; Sun, Tao
2009-05-13
The structures of canine parvovirus (CPV) and feline parvovirus (FPV) complexed with antibody fragments from eight different neutralizing monoclonal antibodies were determined by cryo-electron microscopy (cryoEM) reconstruction to resolutions varying from 8.5 to 18 Å. The crystal structure of one of the Fab molecules and the sequence of the variable domain for each of the Fab molecules have been determined. The structures of Fab fragments not determined crystallographically were predicted by homology modeling according to the amino acid sequence. Fitting of the Fab and virus structures into the cryoEM densities identified the footprints of each antibody on the viral surface. As anticipated from earlier analyses, the Fab binding sites are directed to two epitopes, A and B. The A site is on an exposed part of the surface near an icosahedral threefold axis, whereas the B site is about equidistant from the surrounding five-, three-, and twofold axes. One antibody directed to the A site binds CPV but not FPV. Two of the antibodies directed to the B site neutralize the virus as Fab fragments. The differences in antibody properties have been linked to the amino acids within the antibody footprints, the position of the binding site relative to the icosahedral symmetry elements, and the orientation of the Fab structure relative to the surface of the virus. Most of the exposed surface area was antigenic, although each of the antibodies had a common area of overlap that coincided with the positions of the previously mapped escape mutations.
Hemalatha, G. R.; Rao, D. Satyanarayana; Guruprasad, L.
2007-01-01
We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A “repeat” corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A “domain” corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 57-amino-acid-residue PxV domain, (2) 122-amino-acid-residue FxF domain, (3) 111-amino-acid-residue YEFF domain, (4) 109-amino-acid-residue IMxxH domain, (5) 103-amino-acid-residue VxxT domain, (6) 84-amino-acid-residue ExW domain, (7) 104-amino-acid-residue NTGFIG domain, (8) 36-amino-acid-residue NxGK repeat, (9) 95-amino-acid-residue VYV domain, (10) 75-amino-acid-residue KEWE domain, (11) 59-amino-acid-residue AFL domain, (12) 53-amino-acid-residue RIDVK repeat, (13) (a) 41-amino-acid-residue AGQF repeat and (b) 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure. PMID:17538688
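The length rule stated in the abstract (a repeat has fewer than 55 residues, a domain more than 55) can be applied directly to the listed region lengths as a quick consistency check. The sketch below is only a tally; the function name is hypothetical, and a length of exactly 55 is reported as unspecified because the abstract does not say how that case is treated.

```python
def classify_region(length, threshold=55):
    """Label a conserved region by residue count, following the abstract's definitions."""
    if length < threshold:
        return "repeat"
    if length > threshold:
        return "domain"
    return "unspecified"  # exactly 55 residues is not covered by the stated definitions

# Region names and lengths as listed in the abstract.
REGION_LENGTHS = {
    "PxV": 57, "FxF": 122, "YEFF": 111, "IMxxH": 109, "VxxT": 103,
    "ExW": 84, "NTGFIG": 104, "NxGK": 36, "VYV": 95, "KEWE": 75,
    "AFL": 59, "RIDVK": 53, "AGQF": 41, "GSAL": 42,
}

labels = {name: classify_region(n) for name, n in REGION_LENGTHS.items()}
repeats = sorted(name for name, label in labels.items() if label == "repeat")
domains = sorted(name for name, label in labels.items() if label == "domain")

print(len(repeats), repeats)
print(len(domains), domains)
```

The tally gives four repeats (AGQF, GSAL, NxGK, RIDVK) and ten domains, matching the counts stated at the start of the abstract.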
Speed, Accuracy, and Serial Order in Sequence Production
ERIC Educational Resources Information Center
Pfordresher, Peter Q.; Palmer, Caroline; Jungers, Melissa K.
2007-01-01
The production of complex sequences like music or speech requires the rapid and temporally precise production of events (e.g., notes and chords), often at fast rates. Memory retrieval in these circumstances may rely on the simultaneous activation of both the current event and the surrounding context (Lashley, 1951). We describe an extension to a…
Code of Federal Regulations, 2010 CFR
2010-07-01
... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Form and format for... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... Code for Information Interchange (ASCII) text. No other formats shall be allowed. (3) The computer...
Assessing the Chemical Accuracy of Protein Structures via Peptide Acidity
Anderson, Janet S.; Hernández, Griselda; LeMaster, David M.
2012-01-01
Although the protein native state is a Boltzmann conformational ensemble, practical applications often require a representative model from the most populated region of that distribution. The acidity of the backbone amides, as reflected in hydrogen exchange rates, is exquisitely sensitive to the surrounding charge and dielectric volume distribution. For each of four proteins, three independently determined X-ray structures of differing crystallographic resolution were used to predict exchange for the static solvent-exposed amide hydrogens. The average correlation coefficients range from 0.74 for ubiquitin to 0.93 for Pyrococcus furiosus rubredoxin, reflecting the larger range of experimental exchange rates exhibited by the latter protein. The exchange prediction errors modestly correlate with the crystallographic resolution. MODELLER 9v6-derived homology models at ~60% sequence identity (36% identity for chymotrypsin inhibitor CI2) yielded correlation coefficients that are ~0.1 smaller than for the cognate X-ray structures. The most recently deposited NOE-based ubiquitin structure and the original NMR structure of CI2 fail to provide statistically significant predictions of hydrogen exchange. However, the more recent RECOORD refinement study of CI2 yielded predictions comparable to the X-ray and homology model-based analyses. PMID:23182463
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gantz, I.; Yamada, Tadataka; Tashiro, Takao
1994-01-15
α-Melanocyte stimulating hormone (α-MSH), a hormone originally named for its ability to regulate pigmentation of melanocytes, is a 13-amino-acid post-translational product of the pro-opiomelanocortin (POMC) gene. α-MSH and the other products of POMC processing, which share the core heptapeptide amino acid sequence Met-Glu (Gly)-His-Phe-Arg-Trp-Gly (Asp), the adrenocorticotropic hormone (ACTH), β-MSH, and γ-MSH, are collectively referred to as melanocortins. While best known for their effects on the melanocyte (pigmentation) and adrenal cortical cells (steroidogenesis), melanocortins have been postulated to function in diverse activities, including enhancement of learning and memory, control of the cardiovascular system, analgesia, thermoregulation, immunomodulation, parturition, and neurotrophism. To identify the chromosomal band encoding the human melanocortin-1 receptor gene, 1 μg of an EMBL clone coding region of the human MC1R and approximately 15 kb of surrounding DNA was labeled with biotin and hybridized to human metaphase chromosomes as previously described. The results indicate that the human MC1R gene is localized to 16q24.3. 15 refs., 1 fig.
Renn, S C; Tomkinson, B; Taghert, P H
1998-07-24
We describe the characterization, cloning, and genetic analysis of tripeptidyl peptidase II (TPP II) from Drosophila melanogaster. Mammalian TPP II removes N-terminal tripeptides, has wide distribution, and has been identified as the cholecystokinin-degrading peptidase in rat brain. Size exclusion and ion exchange chromatography produced a 70-fold purification of dTPP II activity from Drosophila tissue extracts. The substrate specificity and the inhibitor sensitivity of dTPP II is comparable to that of the human enzyme. In particular, dTPP II is sensitive to butabindide, a specific inhibitor of the rat cholecystokinin-inactivating activity. We isolated a 4309-base pair dTPP II cDNA which predicts a 1354-amino acid protein. The deduced human and Drosophila TPP II proteins display 38% overall identity. The catalytic triad, its spacing, and the sequences that surround it are highly conserved; the C-terminal end of dTPP II contains a 100-amino acid insert not found in the mammalian proteins. Recombinant dTPP II displays the predicted activity following expression in HEK cells. TPP II maps to cytological position 49F4-7; animals deficient for this interval show reduced TPP II activity.
A delicate case of unidirectional proton transfer from water to an aromatic heterocyclic anion.
Biswas, Sohag; Mallik, Bhabani S
2016-11-21
We present the characteristic proton transfer process from water to the pyrazole anion, infrared signatures of hydroxyl groups and the free energy profile of the process in aqueous solution, combining first principles simulations, wavelet analysis and metadynamics. Our results show that the presence of a minimum of three water molecules in the gas phase cluster with a particular arrangement is sufficient to facilitate the proton transfer process from water to the anion. The overall reaction is very rapid in aqueous solution, and the free energy barrier for this process is found to be 4.2 kcal mol⁻¹. One of the earlier reported fundamental reasons for the transfer of a proton from water to the anion is the change in the acidity of OH groups surrounding the anion. We have correlated the stretching frequencies of the surrounding OH groups with this acidity. We find the development of less energetic vibrational states, and that the OH mode having the lowest average stretching frequency contains the most acidic proton. A large frequency shift of the OH mode belonging to one of the surrounding water molecules is observed during the transfer of the proton from water to the anion; this shift is due to the change in acidity of the adjacent hydroxyl groups in the vicinity of the anion.
Application of 2D graphic representation of protein sequence based on Huffman tree method.
Qi, Zhao-Hui; Feng, Jun; Qi, Xiao-Qin; Li, Ling
2012-05-01
Based on the Huffman tree method, we propose a new 2D graphic representation of protein sequence. This representation can completely avoid loss of information in the transfer of data from a protein sequence to its graphic representation. The method consists of two parts. One is about the 0-1 codes of 20 amino acids by Huffman tree with amino acid frequency. The amino acid frequency is defined as the statistical number of an amino acid in the analyzed protein sequences. The other is about the 2D graphic representation of protein sequence based on the 0-1 codes. Then the applications of the method on ten ND5 genes and seven Escherichia coli strains are presented in detail. The results show that the proposed model may provide us with some new insights to understand the evolution patterns determined from protein sequences and complete genomes. Copyright © 2012 Elsevier Ltd. All rights reserved.
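The first part of the method described above, assigning 0-1 codes to amino acids from their frequencies with a Huffman tree, can be sketched as follows. This is a generic Huffman construction, not the authors' implementation: the toy sequence, the function names, and the tie-breaking order are assumptions, and the paper's mapping from codes to 2D coordinates is not reproduced because the abstract does not specify it.

```python
import heapq
from collections import Counter
from itertools import count


def huffman_codes(freqs):
    """Build a prefix-free 0-1 code from a {symbol: frequency} mapping."""
    tiebreak = count()  # unique counter so the heap never compares dicts
    heap = [(f, next(tiebreak), {sym: ""}) for sym, f in freqs.items()]
    heapq.heapify(heap)
    if len(heap) == 1:  # degenerate case: a single symbol still needs one bit
        return {sym: "0" for sym in heap[0][2]}
    while len(heap) > 1:
        f0, _, codes0 = heapq.heappop(heap)  # two least frequent subtrees
        f1, _, codes1 = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in codes0.items()}
        merged.update({s: "1" + c for s, c in codes1.items()})
        heapq.heappush(heap, (f0 + f1, next(tiebreak), merged))
    return heap[0][2]


def encode(sequence, codes):
    """Concatenate the code of each residue."""
    return "".join(codes[residue] for residue in sequence)


def decode(bits, codes):
    """Invert `encode`; this works because the code is prefix-free."""
    inverse = {code: sym for sym, code in codes.items()}
    out, buffer = [], ""
    for bit in bits:
        buffer += bit
        if buffer in inverse:
            out.append(inverse[buffer])
            buffer = ""
    return "".join(out)


toy_sequence = "MKVLAAGLLAKKLLA"  # hypothetical peptide, for illustration only
codes = huffman_codes(Counter(toy_sequence))
bits = encode(toy_sequence, codes)
print(codes)
print(bits, decode(bits, codes) == toy_sequence)
```

Because the code is prefix-free, the bit string decodes back to the original sequence, which is the sense in which such a representation avoids loss of information.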
Opsin cDNA sequences of a UV and green rhodopsin of the satyrine butterfly Bicyclus anynana.
Vanhoutte, K J A; Eggen, B J L; Janssen, J J M; Stavenga, D G
2002-11-01
The cDNAs of an ultraviolet (UV) and long-wavelength (LW) (green) absorbing rhodopsin of the bush brown Bicyclus anynana were partially identified. The UV sequence, encoding 377 amino acids, is 76-79% identical to the UV sequences of the papilionids Papilio glaucus and Papilio xuthus and the moth Manduca sexta. A dendrogram derived from aligning the amino acid sequences reveals an equidistant position of Bicyclus between Papilio and Manduca. The sequence of the green opsin cDNA fragment, which encodes 242 amino acids, represents six of the seven transmembrane regions. At the amino acid level, this fragment is more than 80% identical to the corresponding LW opsin sequences of Dryas, Heliconius, Papilio (rhodopsin 2) and Manduca. Whereas three LW absorbing rhodopsins were identified in the papilionid butterflies, only one green opsin was found in B. anynana.
Lee, K L; Albee, K L; Bernasconi, R J; Edmunds, T
1997-01-01
The amino acid sequences of ananain (EC 3.4.22.31) and stem bromelain (EC 3.4.22.32), two cysteine proteases from pineapple stem, are similar, yet ananain and stem bromelain possess distinct specificities towards synthetic peptide substrates and different reactivities towards the cysteine protease inhibitors E-64 and chicken egg white cystatin. We present here the complete amino acid sequence of ananain and compare it with the reported sequences of pineapple stem bromelain, papain and chymopapain from papaya and actinidin from kiwifruit. Ananain is comprised of 216 residues with a theoretical mass of 23464 Da. This primary structure includes a sequence insert between residues 170 and 174 not present in stem bromelain or papain and a hydrophobic series of amino acids adjacent to His-157. It is possible that these sequence differences contribute to the different substrate and inhibitor specificities exhibited by ananain and stem bromelain. PMID:9355753
DOE Office of Scientific and Technical Information (OSTI.GOV)
Crooks, Gavin E.
WebLogo is a web-based application designed to make the generation of sequence logos as easy and painless as possible. Sequence logos are a graphical representation of an amino acid or nucleic acid multiple sequence alignment developed by Tom Schneider and Mike Stephens. Each logo consists of stacks of symbols, one stack for each position in the sequence. The overall height of the stack indicates the sequence conservation at that position, while the height of symbols within the stack indicates the relative frequency of each amino or nucleic acid at that position. In general, a sequence logo provides a richer and more precise description of, for example, a binding site, than would a consensus sequence.
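The stack and symbol heights described above can be illustrated with a minimal sketch. It uses the usual information-content definition (maximum entropy minus observed entropy, in bits) and scales each symbol by its relative frequency. It is not WebLogo's code: the function names and the toy alignment are hypothetical, and the small-sample correction is deliberately omitted.

```python
import math
from collections import Counter


def column_heights(column, alphabet_size):
    """Return (stack_height_bits, {symbol: symbol_height_bits}) for one column.

    Assumption: no small-sample correction and no special treatment of gaps.
    """
    counts = Counter(column)
    total = len(column)
    freqs = {sym: c / total for sym, c in counts.items()}
    entropy = -sum(p * math.log2(p) for p in freqs.values())
    stack = math.log2(alphabet_size) - entropy  # conservation at this position
    return stack, {sym: p * stack for sym, p in freqs.items()}


def logo_heights(alignment, alphabet_size=4):
    """Compute heights for every column of an equal-length alignment."""
    return [column_heights(col, alphabet_size) for col in zip(*alignment)]


# Hypothetical DNA alignment: column 1 is fully conserved, column 4 is uniform.
toy_alignment = ["ACGT", "ACGA", "ACCC", "ACTG"]
heights = logo_heights(toy_alignment, alphabet_size=4)
for position, (stack, symbols) in enumerate(heights, start=1):
    print(position, round(stack, 3), {s: round(h, 3) for s, h in symbols.items()})
```

For protein alignments the same sketch applies with `alphabet_size=20`, which raises the maximum stack height to log2(20) bits.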
1987-01-01
identified in the difference spectra, implying that there are five to seven tryptophans within 17 Å of the spin-label hapten. Amino acid sequences...of the heavy and light chains were obtained by a combination of amino acid and DNA sequencing. A molecular model was constructed from the sequence...Clore & acids yields detailed information about the amino acid com- Gronenborn, 1982, 1983). This technique should also identify position of the combining
Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L
1988-01-01
Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. PMID:3052437
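The codon third-position G or C bias quoted above is a simple count over an open reading frame. The sketch below shows how such a figure might be computed; the function name and the toy sequence are hypothetical, and whether the authors included the stop codon in their count is not stated, so no attempt is made to reproduce the 93.1% value.

```python
def gc3_fraction(cds):
    """Fraction of codons whose third base is G or C.

    Assumption: `cds` starts in frame; trailing bases that do not form a
    complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # drop an incomplete final codon
    third_bases = cds[2:usable:3]     # every third base, starting at index 2
    if not third_bases:
        raise ValueError("sequence contains no complete codon")
    return sum(base in "GC" for base in third_bases) / len(third_bases)


toy_cds = "ATGGCCAAGTAA"  # hypothetical ORF: ATG GCC AAG TAA
print(f"GC3 = {gc3_fraction(toy_cds):.2%}")
```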
Attentional awakening: gradual modulation of temporal attention in rapid serial visual presentation.
Ariga, Atsunori; Yokosawa, Kazuhiko
2008-03-01
Orienting attention to a point in time facilitates processing of an item within rapidly changing surroundings. We used a one-target RSVP task to look for differences in accuracy in reporting a target related to when the target temporally appeared in the sequence. The results show that observers correctly report a target early in the sequence less frequently than later in the sequence. Previous RSVP studies predicted equivalently accurate performances for one target wherever it appeared in the sequence. We named this new phenomenon attentional awakening, which reflects a gradual modulation of temporal attention in a rapid sequence.
Sequence of a cDNA encoding pancreatic preprosomatostatin-22.
Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E
1982-01-01
We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. PMID:6127673
Montoya-Ruiz, Carolina; Cajimat, Maria N B; Milazzo, Mary Louise; Diaz, Francisco J; Rodas, Juan David; Valbuena, Gustavo; Fulhorst, Charles F
2015-07-01
The results of a previous study suggested that Cherrie's cane rat (Zygodontomys cherriei) is the principal host of Necoclí virus (family Bunyaviridae, genus Hantavirus) in Colombia. Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences in this study confirmed that Necoclí virus is phylogenetically closely related to Maporal virus, which is principally associated with the delicate pygmy rice rat (Oligoryzomys delicatus) in western Venezuela. In pairwise comparisons, nonidentities between the complete amino acid sequence of the nucleocapsid protein of Necoclí virus and the complete amino acid sequences of the nucleocapsid proteins of other hantaviruses were ≥8.7%. Likewise, nonidentities between the complete amino acid sequence of the glycoprotein precursor of Necoclí virus and the complete amino acid sequences of the glycoprotein precursors of other hantaviruses were ≥11.7%. Collectively, the unique association of Necoclí virus with Z. cherriei in Colombia, results of the Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences, and results of the pairwise comparisons of amino acid sequences strongly support the notion that Necoclí virus represents a novel species in the genus Hantavirus. Further work is needed to determine whether Calabazo virus (a hantavirus associated with Z. brevicauda cherriei in Panama) and Necoclí virus are conspecific.
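The pairwise nonidentity figures reported above are percentages of differing positions between two aligned amino acid sequences. A minimal sketch of that calculation follows; the function name, the toy sequences, and the gap convention (columns where both sequences have a gap are skipped, a gap against a residue counts as a difference) are assumptions, since the abstract does not say how gaps were treated.

```python
def percent_nonidentity(seq_a, seq_b, gap="-"):
    """Percent of aligned positions at which two sequences differ.

    Assumption: inputs are already aligned to equal length.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap and b == gap:
            continue  # column carries no information for this pair
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * differing / compared


# Hypothetical aligned fragments, for illustration only.
print(percent_nonidentity("MSTLQELQ", "MSNLQELK"))
```

A species-level threshold would then be applied to the minimum of such values across all comparisons, which is how the "≥8.7%" and "≥11.7%" statements above read.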
Shayan, P; Jafari, S; Fattahi, R; Ebrahimzade, E; Amininia, N; Changizi, E
2016-05-01
Ovine theileriosis is an important hemoprotozoal disease of sheep and goats in tropical and subtropical regions which causes high economic losses in the livestock industry. Theileria annulata surface protein (TaSp) was used previously as a tool for serological analysis in livestock. Since the amino acid sequence of TaSp is, at least in part, highly conserved in T. annulata, Theileria lestoquardi and Theileria China I and II, it is very important to determine the amino acid sequence of this protein in Theileria ovis as well, to avoid false interpretation of serological data based on this protein in small animals. In the present study, the nucleotide sequence and amino acid sequence of T. ovis surface protein (ToSp) were determined. The comparison of the nucleotide sequence of ToSp showed 96, 96, 99, and 86 % homology to the corresponding nucleotide sequences of the TaSp genes of T. annulata, T. China I, T. China II and T. lestoquardi, previously registered in GenBank under accession nos. AJ316260.1, AY274329.1, DQ120058.1, and EF092924.1, respectively. The amino acid sequence analysis showed 95, 81, 98 and 70 % homology to the corresponding amino acid sequences of T. annulata, T. China I, T. China II and T. lestoquardi, registered in GenBank under accession nos. CAC87478.1, AAP36993.1, AAZ30365.1 and AAP36999.11, respectively. Interestingly, in contrast to the C terminus, a significant difference in amino acid sequence in the N terminus of the ToSp protein could be determined compared to the other known corresponding TaSp sequences, which makes this region attractive for designing a suitable tool for serological diagnosis.
Brain cDNA clone for human cholinesterase
DOE Office of Scientific and Technical Information (OSTI.GOV)
McTiernan, C.; Adkins, S.; Chatonnet, A.
1987-10-01
A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum. The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes, coded for cholinesterase.
Hansen, Mie Johanne; Pennanen, Elin Anna Erica; Bojesen, Anders Miki; Christensen, Henrik; Bertelsen, Mads Frost
2016-02-01
A total of 13 Pasteurellaceae isolates from healthy freshwater turtles were characterized by genotypic and phenotypic tests. Phylogenetic analysis of partial 16S rRNA and rpoB gene sequences showed that the isolates investigated formed a monophyletic group. The closest related species based on 16S rRNA gene sequencing was Chelonobacter oris CCUG 55632T with 94.4 % similarity and the closest related species based on rpoB gene sequence comparison was [Pasteurella] testudinis CCUG 19802T with 91.5 % similarity. All the investigated isolates exhibited phenotypic characteristics of the family Pasteurellaceae. However, they could be separated from existing genera of the Pasteurellaceae by the following test results: indole, ornithine decarboxylase and Voges-Proskauer positive; and methyl red, urease and PNPG (α-glucosidase) negative. No X- or V-factor requirement was observed. A zone of β-haemolysis surrounded the colonies after 24 h of incubation on bovine blood agar at 37 °C. Acid was produced from l-arabinose, dulcitol, d-mannitol, sucrose and trehalose. Representative strain ELNT2xT had a fatty acid profile that was characteristic for members of the Pasteurellaceae. ELNT2xT expressed only one respiratory quinone, ubiquinone-8 (100 %). The DNA G+C content of strain ELNT2xT was 42.8 mol%. On the basis of both phylogenetic and phenotypic evidence, it is proposed that the strains should be classified as representatives of a novel species of a new genus, Testudinibacter aquarius gen. nov., sp. nov. The type strain of Testudinibacter aquarius is ELNT2xT ( = CCUG 65146T = DSM 28140T), which was isolated from the oral cavity of a captive eastern long-necked turtle (Chelodina longicollis) in Denmark in 2012.
Ali, Akhtar; Raddatz, Natalia; Aman, Rashid; Kim, Songmi; Park, Hyeong Cheol; Jan, Masood; Baek, Dongwon; Khan, Irfan Ullah; Oh, Dong-Ha; Lee, Sang Yeol; Bressan, Ray A; Lee, Keun Woo; Maggio, Albino; Pardo, Jose M; Bohnert, Hans J; Yun, Dae-Jin
2016-07-01
A crucial prerequisite for plant growth and survival is the maintenance of potassium uptake, especially when high sodium surrounds the root zone. The Arabidopsis HIGH-AFFINITY K(+) TRANSPORTER1 (HKT1), and its homologs in other salt-sensitive dicots, contributes to salinity tolerance by removing Na(+) from the transpiration stream. However, TsHKT1;2, one of three HKT1 copies in Thellungiella salsuginea, a halophytic Arabidopsis relative, acts as a K(+) transporter in the presence of Na(+) in yeast (Saccharomyces cerevisiae). Amino-acid sequence comparisons indicated differences between TsHKT1;2 and most other published HKT1 sequences with respect to an Asp residue (D207) in the second pore-loop domain. Two additional T. salsuginea and most other HKT1 sequences contain Asn (N) in this position. Wild-type TsHKT1;2 and altered AtHKT1 (AtHKT1(N-D)) complemented K(+)-uptake deficiency of yeast cells. Mutant hkt1-1 plants complemented with both AtHKT1(N-D) and TsHKT1;2 showed higher tolerance to salt stress than lines complemented by the wild-type AtHKT1. Electrophysiological analysis in Xenopus laevis oocytes confirmed the functional properties of these transporters and the differential selectivity for Na(+) and K(+) based on the N/D variance in the pore region. This change also dictated inward-rectification for Na(+) transport. Thus, the introduction of Asp, replacing Asn, in HKT1-type transporters established altered cation selectivity and uptake dynamics. We describe one way, based on a single change in a crucial protein, that enabled some crucifer species to acquire improved salt tolerance, which over evolutionary time may have resulted in further changes that ultimately facilitated colonization of saline habitats. © 2016 American Society of Plant Biologists. All Rights Reserved.
Cloning and expression of cDNA coding for bouganin.
den Hartog, Marcel T; Lubelli, Chiara; Boon, Louis; Heerkens, Sijmie; Ortiz Buijsse, Antonio P; de Boer, Mark; Stirpe, Fiorenzo
2002-03-01
Bouganin is a ribosome-inactivating protein that recently was isolated from Bougainvillea spectabilis Willd. In this work, the cloning and expression of the cDNA encoding bouganin is described. From the cDNA, the amino-acid sequence was deduced, which correlated with the primary sequence data obtained by amino-acid sequencing on the native protein. Bouganin is synthesized as a pro-peptide consisting of 305 amino acids, the first 26 of which act as a leader signal while the 29 C-terminal amino acids are cleaved during processing of the molecule. The mature protein consists of 250 amino acids. Using the cDNA sequence encoding the mature protein of 250 amino acids, a recombinant protein was expressed, purified and characterized. The recombinant molecule had similar activity in a cell-free protein synthesis assay and comparable toxicity on living cells to the isolated native bouganin.
Method for altering antibody light chain interactions
Stevens, Fred J.; Stevens, Priscilla Wilkins; Raffen, Rosemarie; Schiffer, Marianne
2002-01-01
A method for recombinant antibody subunit dimerization including modifying at least one codon of a nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in the interface segment of the light polypeptide variable region, the charged amino acid having a first polarity; and modifying at least one codon of the nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in an interface segment of the heavy polypeptide variable region corresponding to a position in the light polypeptide variable region, the charged amino acid having a second polarity opposite the first polarity. Nucleic acid sequences which code for novel light chain proteins, the latter of which are used in conjunction with the inventive method, are also provided.
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.
Code of Federal Regulations, 2013 CFR
2013-07-01
... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.
Code of Federal Regulations, 2010 CFR
2010-07-01
... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.
Code of Federal Regulations, 2012 CFR
2012-07-01
... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
Use of CYP52A2A promoter to increase gene expression in yeast
Craft, David L.; Wilson, C. Ron; Eirich, Dudley; Zhang, Yeyan
2004-01-06
A nucleic acid sequence including a CYP promoter operably linked to nucleic acid encoding a heterologous protein is provided to increase transcription of the nucleic acid. Expression vectors and host cells containing the nucleic acid sequence are also provided. The methods and compositions described herein are especially useful in the production of polycarboxylic acids by yeast cells.
Method of Identifying a Base in a Nucleic Acid
Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua
1999-01-01
Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Identifying a base in a nucleic acid
Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua
2005-02-08
Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Study of Tranexamic Acid during Air Medical Prehospital Transport (STAAMP) Trial
2014-10-01
AD______________ AWARD NUMBER: W81XWH-13-2-0080 TITLE: Study of Tranexamic acid ... Tranexamic acid during Air Medical Prehospital transport (STAAMP) trial 5b. GRANT NUMBER W81XWH-13-2-0080 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR(S...and explained the purpose of this study to Pittsburgh local and surrounding area. 15. SUBJECT TERMS Prehospital ; Tranexamic acid 16
NASA Astrophysics Data System (ADS)
Hwang, Geelsu; Liu, Yuan; Kim, Dongyeop; Sun, Victor; Aviles-Reyes, Alejandro; Kajfasz, Jessica K.; Lemos, Jose A.; Koo, Hyun
2016-09-01
Biofilms are comprised of bacterial-clusters (microcolonies) enmeshed in an extracellular matrix. Streptococcus mutans can produce exopolysaccharides (EPS)-matrix and assemble microcolonies with acidic microenvironments that can cause tooth-decay despite the surrounding neutral-pH found in oral cavity. How the matrix influences the pH and bacterial activity locally remains unclear. Here, we simultaneously analyzed in situ pH and gene expression within intact biofilms and measured the impact of damage to the surrounding EPS-matrix. The spatiotemporal changes of these properties were characterized at a single-microcolony level following incubation in neutral-pH buffer. The middle and bottom-regions as well as inner-section within the microcolony 3D structure were resistant to neutralization (vs. upper and peripheral-region), forming an acidic core. Concomitantly, we used a green fluorescent protein (GFP) reporter to monitor expression of the pH-responsive atpB (PatpB::gfp) by S. mutans within microcolonies. The atpB expression was induced in the acidic core, but sharply decreased at peripheral/upper microcolony regions, congruent with local pH microenvironment. Enzymatic digestion of the surrounding matrix resulted in nearly complete neutralization of microcolony interior and down-regulation of atpB. Altogether, our data reveal that biofilm matrix facilitates formation of an acidic core within microcolonies which in turn activates S. mutans acid-stress response, mediating both the local environment and bacterial activity in situ.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Deutscher, J.; Pevec, B.; Beyreuther, K.
1986-10-21
The amino acid sequence of histidine-containing protein (HPr) from Streptococcus faecalis has been determined by direct Edman degradation of intact HPr and by amino acid sequence analysis of tryptic peptides, V8 proteolytic peptides, thermolytic peptides, and cyanogen bromide cleavage products. HPr from S. faecalis was found to contain 89 amino acid residues, corresponding to a molecular weight of 9438. The amino acid sequence of HPr from S. faecalis shows extended homology to the primary structure of HPr proteins from other bacteria. Besides the phosphoenolpyruvate-dependent phosphorylation of a histidyl residue in HPr, catalyzed by enzyme I of the bacterial phosphotransferase system, HPr was also found to be phosphorylated at a seryl residue in an ATP-dependent protein kinase catalyzed reaction. The site of ATP-dependent phosphorylation in HPr of S. faecalis has now been determined. [³²P]P-Ser-HPr was digested with three different proteases, and in each case, a single labeled peptide was isolated. Following digestion with subtilisin, they obtained a peptide with the sequence -(P)Ser-Ile-Met-. Using chymotrypsin, they isolated a peptide with the sequence -Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-Gly-Val-Met-. The longest labeled peptide was obtained with V8 staphylococcal protease. According to amino acid analysis, this peptide contained 36 out of the 89 amino acid residues of HPr. The following sequence of 12 amino acid residues of the V8 peptide was determined: -Tyr-Lys-Gly-Lys-Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-. Thus, the site of ATP-dependent phosphorylation was determined to be Ser-46 within the primary structure of HPr.
Methods and compositions for regulating gene expression in plant cells
NASA Technical Reports Server (NTRS)
Dai, Shunhong (Inventor); Beachy, Roger N. (Inventor); Luis, Maria Isabel Ordiz (Inventor)
2010-01-01
Novel chimeric plant promoter sequences are provided, together with plant gene expression cassettes comprising such sequences. In certain preferred embodiments, the chimeric plant promoters comprise the BoxII cis element and/or derivatives thereof. In addition, novel transcription factors are provided, together with nucleic acid sequences encoding such transcription factors and plant gene expression cassettes comprising such nucleic acid sequences. In certain preferred embodiments, the novel transcription factors comprise the acidic domain, or fragments thereof, of the RF2a transcription factor. Methods for using the chimeric plant promoter sequences and novel transcription factors in regulating the expression of at least one gene of interest are provided, together with transgenic plants comprising such chimeric plant promoter sequences and novel transcription factors.
The complete amino acid sequence of human skeletal-muscle fructose-bisphosphate aldolase.
Freemont, P S; Dunbar, B; Fothergill-Gilmore, L A
1988-01-01
The complete amino acid sequence of human skeletal-muscle fructose-bisphosphate aldolase, comprising 363 residues, was determined. The sequence was deduced by automated sequencing of CNBr-cleavage, o-iodosobenzoic acid-cleavage, trypsin-digest and staphylococcal-proteinase-digest fragments. Comparison of the sequence with other class I aldolase sequences shows that the mammalian muscle isoenzyme is one of the most highly conserved enzymes known, with only about 2% of the residues changing per 100 million years. Non-mammalian aldolases appear to be evolving at the same rate as other glycolytic enzymes, with about 4% of the residues changing per 100 million years. Secondary-structure predictions are analysed in an accompanying paper [Sawyer, Fothergill-Gilmore & Freemont (1988) Biochem. J. 249, 789-793]. PMID:3355497
Cloning and sequencing of the allophycocyanin genes from Spirulina maxima (Cyanophyta)
NASA Astrophysics Data System (ADS)
Qin, Song; Hiroyuki, Kojima; Yoshikazu, Kawata; Shin-Ichi, Yano; Zeng, Cheng-Kui
1998-03-01
The genes coding for the α- and β-subunits of allophycocyanin (apcA and apcB) from the cyanophyte Spirulina maxima were cloned and sequenced. The results revealed 44.4% nucleotide sequence similarity and 30.4% similarity of the deduced amino acid sequences between them. The amino acid sequence identities between S. maxima and S. platensis are 99.4% for the α subunit and 100% for the β subunit.
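The percentages quoted in this abstract are pairwise comparisons of aligned sequences. As a rough illustration only, the sketch below computes percent identity over two sequences that are already aligned; the function name `percent_identity` and the toy sequences are hypothetical, and the abstract does not say which alignment method or similarity definition the authors used.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns with identical residues.

    Assumes the two strings are already aligned (equal length, gaps as '-').
    Columns where both sequences carry a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap and y == gap:
            continue  # uninformative column
        columns += 1
        if x == y and x != gap:
            matches += 1
    return 100.0 * matches / columns if columns else 0.0


# Toy example (not real apcA/apcB data): 3 of 4 positions match.
print(percent_identity("MKTA", "MKSA"))
```

Counting a gap opposite a residue as a mismatch is one convention among several; other tools exclude gapped columns from the denominator.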
NASA Astrophysics Data System (ADS)
Garcia, J.; Hidalgo, S. S.; Solis, S. E.; Vazquez, D.; Nuñez, J.; Rodriguez, A. O.
2012-10-01
Susceptibility artifacts can degrade magnetic resonance image quality. Electrodes are an important source of artifacts when performing brain imaging. A dedicated phantom was built using a depth electrode to study the susceptibility effects under different pulse sequences. T2-weighted images were acquired with both gradient- and spin-echo sequences. The spin-echo sequences can significantly attenuate the susceptibility artifacts, allowing a straightforward visualization of the regions surrounding the electrode.
Use of linalool synthase in genetic engineering of scent production
Pichersky, E.
1998-12-15
A purified S-linalool synthase polypeptide from Clarkia breweri is disclosed as is the recombinant polypeptide and nucleic acid sequences encoding the polypeptide. Also disclosed are antibodies immunoreactive with the purified peptide and with recombinant versions of the polypeptide. Methods of using the nucleic acid sequences, as well as methods of enhancing the smell and the flavor of plants expressing the nucleic acid sequences are also disclosed. 5 figs.
Use of linalool synthase in genetic engineering of scent production
Pichersky, Eran
1998-01-01
A purified S-linalool synthase polypeptide from Clarkia breweri is disclosed as is the recombinant polypeptide and nucleic acid sequences encoding the polypeptide. Also disclosed are antibodies immunoreactive with the purified peptide and with recombinant versions of the polypeptide. Methods of using the nucleic acid sequences, as well as methods of enhancing the smell and the flavor of plants expressing the nucleic acid sequences are also disclosed.
Probe kit for identifying a base in a nucleic acid
Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua
2001-01-01
Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Crotoxin: Structural Studies, Mechanism of Action and Cloning of its Gene
1988-03-01
thirteen amino acids being acidic. Sequencing of the three peptides present in the acidic subunit, two of which are blocked by pyroglutamate ...the sequence determination of both the basic and acidic subunits of crotoxin. The acidic subunit peptides were difficult, since two of ...fluorescence spectroscopy. Results indicate a large conformational change occurs upon complex formation between the acidic and basic subunits of all four
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.
Code of Federal Regulations, 2014 CFR
2014-07-01
... base or modified or unusual amino acid may be presented in a given sequence as the corresponding unmodified base or amino acid if the modified base or modified or unusual amino acid is one of those listed... the Feature section. Otherwise, each occurrence of a base or amino acid not appearing in WIPO Standard...
Mouse Vk gene classification by nucleic acid sequence similarity.
Strohal, R; Helmberg, A; Kroemer, G; Kofler, R
1989-01-01
Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.
MR safety and compatibility of a noninvasively expandable total-joint endoprosthesis.
Ogg, Robert J; McDaniel, C Brian; Wallace, Donald; Pitot, Pierre; Neel, Michael D; Kaste, Sue C
2005-09-01
A noninvasively expandable total-joint endoprosthesis is now available for pediatric patients; the prosthesis can be lengthened by external application of a magnetic field. We investigated the risks of unintentional heating or lengthening of the prosthesis during MR imaging and evaluated the effect of the device on the diagnostic efficacy of MR imaging of surrounding tissues. We performed MR imaging at 1.5 T by using standard pulse sequences and pulse sequences with high-gradient and high-radiofrequency duty cycle. MR imaging caused no measurable change in prosthesis length, and the temperature of the prosthesis increased by less than 1 °C during repeated 14-min exposures. Despite significant signal loss and image distortion around the prosthetic joint, clinically useful images were obtained as close as 12 cm from the ends of the prosthetic stems, measured toward the body of the device. Thus, the prosthesis can be safely exposed to MR imaging pulse sequences at 1.5 T, and the visualization of some tissue surrounding the device is clinically useful.
Kimura, Tomohiro; Nakano, Toshiki; Yamaguchi, Toshiyasu; Sato, Minoru; Ogawa, Tomohisa; Muramoto, Koji; Yokoyama, Takehiko; Kan-No, Nobuhiro; Nagahisa, Eizou; Janssen, Frank; Grieshaber, Manfred K
2004-01-01
The complete complementary DNA sequences of genes presumably coding for opine dehydrogenases from Arabella iricolor (sandworm), Haliotis discus hannai (abalone), and Patinopecten yessoensis (scallop) were determined, and partial cDNA sequences were derived for Meretrix lusoria (Japanese hard clam) and Spisula sachalinensis (Sakhalin surf clam). The primers ODH-9F and ODH-11R proved useful for amplifying the sequences for opine dehydrogenases from the 4 mollusk species investigated in this study. The sequence of the sandworm was obtained using primers constructed from the amino acid sequence of tauropine dehydrogenase, the main opine dehydrogenase in A. iricolor. The complete cDNA sequence of A. iricolor, H. discus hannai, and P. yessoensis encode 397, 400, and 405 amino acids, respectively. All sequences were aligned and compared with published databank sequences of Loligo opalescens, Loligo vulgaris (squid), Sepia officinalis (cuttlefish), and Pecten maximus (scallop). As expected, a high level of homology was observed for the cDNA from closely related species, such as for cephalopods or scallops, whereas cDNA from the other species showed lower-level homologies. A similar trend was observed when the deduced amino acid sequences were compared. Furthermore, alignment of these sequences revealed some structural motifs that are possibly related to the binding sites of the substrates. The phylogenetic trees derived from the nucleotide and amino acid sequences were consistent with the classification of species resulting from classical taxonomic analyses.
Methods for making nucleotide probes for sequencing and synthesis
Church, George M; Zhang, Kun; Chou, Joseph
2014-07-08
Compositions and methods for making a plurality of probes for analyzing a plurality of nucleic acid samples are provided. Compositions and methods for analyzing a plurality of nucleic acid samples to obtain sequence information in each nucleic acid sample are also provided.
Automatic classification of protein structures using physicochemical parameters.
Mohan, Abhilash; Rao, M Divya; Sunderrajan, Shruthi; Pennathur, Gautam
2014-09-01
Protein classification is the first step to functional annotation; SCOP and Pfam databases are currently the most relevant protein classification schemes. However, the disproportion in the number of three dimensional (3D) protein structures generated versus their classification into relevant superfamilies/families emphasizes the need for automated classification schemes. Predicting function of novel proteins based on sequence information alone has proven to be a major challenge. The present study focuses on the use of physicochemical parameters in conjunction with machine learning algorithms (Naive Bayes, Decision Trees, Random Forest and Support Vector Machines) to classify proteins into their respective SCOP superfamily/Pfam family, using sequence derived information. Spectrophores™, a 1D descriptor of the 3D molecular field surrounding a structure was used as a benchmark to compare the performance of the physicochemical parameters. The machine learning algorithms were modified to select features based on information gain for each SCOP superfamily/Pfam family. The effect of combining physicochemical parameters and spectrophores on classification accuracy (CA) was studied. Machine learning algorithms trained with the physicochemical parameters consistently classified SCOP superfamilies and Pfam families with a classification accuracy above 90%, while spectrophores performed with a CA of around 85%. Feature selection improved classification accuracy for both physicochemical parameters and spectrophores based machine learning algorithms. Combining both attributes resulted in a marginal loss of performance. Physicochemical parameters were able to classify proteins from both schemes with classification accuracy ranging from 90-96%. These results suggest the usefulness of this method in classifying proteins from amino acid sequences.
Thrombin-like enzymes from snake venom: Structural characterization and mechanism of action.
Ullah, Anwar; Masood, Rehana; Ali, Ijaz; Ullah, Kifayat; Ali, Hamid; Akbar, Haji; Betzel, Christian
2018-07-15
Snake venom thrombin-like enzymes (SVTLEs) constitute the major portion (10-24%) of snake venom and are the second most abundant enzymes present in the crude venom. During envenomation, these enzymes produce prominent pathological effects, such as disturbance of the hemostatic system, fibrinogenolysis, fibrinolysis, platelet aggregation, thrombosis, neurologic disorders, activation of coagulation factors, coagulant and procoagulant effects, etc. These enzymes have also been used as therapeutic agents for the treatment of various diseases such as congestive heart failure, ischemic stroke, thrombotic disorders, etc. Although the crystal structures of five SVTLEs are available in the Protein Data Bank (PDB), no single article in the literature describes all of them. The current work describes the structural aspects, structure-based mechanism of action, processing and inhibition of these enzymes. The sequence analysis indicates that these enzymes show a high sequence identity (57-85%) with each other and low sequence identity with trypsin (36-43%), human alpha-thrombin (29-36%) and other snake venom serine proteinases (57-85%). Three-dimensional structural analysis indicates that the loops surrounding the active site are variable in both amino acid composition and length, which may confer variable substrate specificity on these enzymes. The surface charge distributions also vary in these enzymes. Docking analysis with suramin shows that this inhibitor preferably binds to the C-terminal region of these enzymes and causes the destabilization of their three-dimensional structure. Copyright © 2018 Elsevier B.V. All rights reserved.
Soil amino acid composition across a boreal forest successional sequence
Nancy R. Werdin-Pfisterer; Knut Kielland; Richard D. Boone
2009-01-01
Soil amino acids are important sources of organic nitrogen for plant nutrition, yet few studies have examined which amino acids are most prevalent in the soil. In this study, we examined the composition, concentration, and seasonal patterns of soil amino acids across a primary successional sequence encompassing a natural gradient of plant productivity and soil...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.
Code of Federal Regulations, 2014 CFR
2014-07-01
...” means those amino acids other than “Xaa” and those nucleotide bases other than “n” defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2014-07-01 2014-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.
Code of Federal Regulations, 2013 CFR
2013-07-01
...” means those amino acids other than “Xaa” and those nucleotide bases other than “n” defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2013-07-01 2013-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.
Code of Federal Regulations, 2012 CFR
2012-07-01
...” means those amino acids other than “Xaa” and those nucleotide bases other than “n” defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2012-07-01 2012-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...
Amino-terminal sequence of glycoprotein D of herpes simplex virus types 1 and 2
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eisenberg, R.J.; Long, D.; Hogue-Angeletti, R.
1984-01-01
Glycoprotein D (gD) of herpes simplex virus is a structural component of the virion envelope which stimulates production of high titers of herpes simplex virus type-common neutralizing antibody. The authors carried out automated N-terminal amino acid sequencing studies on radiolabeled preparations of gD-1 (gD of herpes simplex virus type 1) and gD-2 (gD of herpes simplex virus type 2). Although some differences were noted, particularly in the methionine and alanine profiles for gD-1 and gD-2, the amino acid sequence of a number of the first 30 residues of the amino terminus of gD-1 and gD-2 appears to be quite similar. For both proteins, the first residue is a lysine. When we compared our sequence data for gD-1 with those predicted by nucleic acid sequencing, the two sequences could be aligned (with one exception) starting at residue 26 (lysine) of the predicted sequence. Thus, the first 25 amino acids of the predicted sequence are absent from the polypeptides isolated from infected cells.
Lampel, J S; Aphale, J S; Lampel, K A; Strohl, W R
1992-01-01
The gene encoding a novel milk protein-hydrolyzing proteinase was cloned on a 6.56-kb SstI fragment from Streptomyces sp. strain C5 genomic DNA into Streptomyces lividans 1326 by using the plasmid vector pIJ702. The gene encoding the small neutral proteinase (snpA) was located within a 2.6-kb BamHI-SstI restriction fragment that was partially sequenced. The molecular mass of the deduced amino acid sequence of the mature protein was determined to be 15,740, which corresponds very closely with the relative molecular mass of the purified protein (15,500) determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The N-terminal amino acid sequence of the purified neutral proteinase was determined, and the DNA encoding this sequence was found to be located within the sequenced DNA. The deduced amino acid sequence contains a conserved zinc binding site, although secondary ligand binding and active sites typical of thermolysinlike metalloproteinases are absent. The combination of its small size, deduced amino acid sequence, and substrate and inhibition profile indicates that snpA encodes a novel neutral proteinase. PMID:1569011
Makiguchi, Wataru; Tanabe, Junki; Yamada, Hidekazu; Iida, Hiroki; Taura, Daisuke; Ousaka, Naoki; Yashima, Eiji
2015-01-01
Self-recognition and self-discrimination within complex mixtures are of fundamental importance in biological systems, which entirely rely on the preprogrammed monomer sequences and homochirality of biological macromolecules. Here we report artificial chirality- and sequence-selective successive self-sorting of chiral dimeric strands bearing carboxylic acid or amidine groups joined by chiral amide linkers with different sequences through homo- and complementary-duplex formations. A mixture of carboxylic acid dimers linked by racemic-1,2-cyclohexane bis-amides with different amide sequences (NHCO or CONH) self-associate to form homoduplexes in a completely sequence-selective way, the structures of which are different from each other depending on the linker amide sequences. The further addition of an enantiopure amide-linked amidine dimer to a mixture of the racemic carboxylic acid dimers resulted in the formation of a single optically pure complementary duplex with a 100% diastereoselectivity and complete sequence specificity stabilized by the amidinium–carboxylate salt bridges, leading to the perfect chirality- and sequence-selective duplex formation. PMID:26051291
Meiler, Arno; Klinger, Claudia; Kaufmann, Michael
2012-09-08
The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
2012-01-01
Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836
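The quantities this abstract names (codon frequencies and GC-content) are simple to compute for a single coding sequence. The following is a minimal sketch of those two statistics, not ANCAC's implementation; the function names and the toy sequence are hypothetical, and the tool itself works on the NUCOCOG database through a web interface.

```python
from collections import Counter


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def codon_frequencies(cds: str) -> dict[str, float]:
    """Relative frequency of each codon in an in-frame coding sequence.

    Trailing bases that do not fill a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))
    total = sum(counts.values())
    return {codon: n / total for codon, n in counts.items()} if total else {}


# Toy coding sequence (hypothetical, not taken from the COG database).
toy_cds = "ATGGCCGCCAAATAA"
print(round(gc_content(toy_cds), 3))
print(codon_frequencies(toy_cds))
```

Aggregating such per-gene statistics over all members of a COG, or over a chosen set of species, would give the kind of function- or taxon-specific comparison the abstract describes.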
The primary structure of the thymidine kinase gene of fish lymphocystis disease virus.
Schnitzler, P; Handermann, M; Szépe, O; Darai, G
1991-06-01
The DNA nucleotide sequence of the thymidine kinase (TK) gene of fish lymphocystis disease virus (FLDV), which has been localized between the coordinates 0.678 and 0.688 of the viral genome, was determined. The analysis of the DNA nucleotide sequence located between the recognition sites of HindIII (0.669 map unit; nucleotide position 1) and AccI (nucleotide position 2032) revealed the presence of an open reading frame of 954 bp on the lower strand of this region between nucleotide positions 1868 (ATG) and 915 (TAA). It encodes a protein of 318 amino acid residues. The evolutionary relationships of the TK gene of FLDV to the other known TK genes were investigated using the method of progressive sequence alignment. These analyses revealed a high degree of diversity between the protein sequence of FLDV TK gene and the amino acid composition of other TKs tested. However, significant conservation was detected at several regions of amino acid residues of the FLDV TK protein when compared to the amino acid sequence of TKs of African swine fever virus, fowlpox virus, shope fibroma virus, and vaccinia virus and to the amino acid sequences of the cellular cytoplasmic TK of chicken, mouse, and man.
Zou, Jiaqi; Li, Na
2013-09-01
Proper design of nucleic acid sequences is crucial for many applications. We have previously established a thermodynamics-based quantitative model to help design aptamer-based nucleic acid probes by predicting equilibrium concentrations of all interacting species. To facilitate customization of this thermodynamic model for different applications, here we present a generic and easy-to-use platform to implement the algorithm of the model with Microsoft® Excel formulas and VBA (Visual Basic for Applications) macros. Two Excel spreadsheets have been developed: one for applications involving only nucleic acid species, the other for applications involving both nucleic acid and non-nucleic acid species. The spreadsheets take the nucleic acid sequences and the initial concentrations of all species as input, guide the user to retrieve the necessary thermodynamic constants, and finally calculate equilibrium concentrations for all species in various bound and unbound conformations. The validity of both spreadsheets has been verified by comparing the modeling results with the experimental results on nucleic acid sequences reported in the literature. The Excel-based platform described here will allow biomedical researchers to rationalize the sequence design of nucleic acid probes using thermodynamics-based modeling even without relevant theoretical and computational skills. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
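The abstract describes turning thermodynamic constants and initial concentrations into equilibrium concentrations. The simplest instance of that idea is a single 1:1 binding reaction A + B ⇌ AB, sketched below in Python rather than Excel/VBA. This is an illustration under stated assumptions, not the authors' model, which handles multiple species and conformations; the function names and the numbers are hypothetical.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)


def kd_from_delta_g(delta_g_kcal: float, temp_k: float = 298.15) -> float:
    """Dissociation constant (M) from the free energy of association.

    Assumes delta_g_kcal is the standard free energy of A + B -> AB
    (negative for favorable binding), with a 1 M standard state.
    """
    return math.exp(delta_g_kcal / (R_KCAL * temp_k))


def bound_complex(a_total: float, b_total: float, kd: float) -> float:
    """Equilibrium [AB] for A + B <-> AB, all values in the same units.

    Solves Kd = (a_total - x)(b_total - x) / x for the physical root x.
    """
    s = a_total + b_total + kd
    return (s - math.sqrt(s * s - 4.0 * a_total * b_total)) / 2.0


# Hypothetical example: 1 uM probe, 1 uM target, Kd = 0.5 uM.
ab = bound_complex(1.0, 1.0, 0.5)
print(ab, 1.0 - ab, 1.0 - ab)  # bound complex, free A, free B
```

Systems with several competing equilibria generally need a numerical solver instead of this closed-form quadratic.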
Suzuki, Shun'ichi; Takenaka, Yasuhiro; Onishi, Norimasa; Yokozeki, Kenzo
2005-08-01
A DNA fragment from Microbacterium liquefaciens AJ 3912, containing the genes responsible for the conversion of 5-substituted-hydantoins to alpha-amino acids, was cloned in Escherichia coli and sequenced. Seven open reading frames (hyuP, hyuA, hyuH, hyuC, ORF1, ORF2, and ORF3) were identified on the 7.5 kb fragment. The deduced amino acid sequence encoded by the hyuA gene included the N-terminal amino acid sequence of the hydantoin racemase from M. liquefaciens AJ 3912. The hyuA, hyuH, and hyuC genes were heterologously expressed in E. coli; their presence corresponded with the detection of hydantoin racemase, hydantoinase, and N-carbamoyl alpha-amino acid amido hydrolase enzymatic activities respectively. The deduced amino acid sequences of hyuP were similar to those of the allantoin (5-ureido-hydantoin) permease from Saccharomyces cerevisiae, suggesting that hyuP protein might function as a hydantoin transporter.
NASA Astrophysics Data System (ADS)
Son, Gyeongho; Jung, Youngho; Yu, Kyoungsik
2017-04-01
We report a directional-coupler-based refractive index sensor and its cost-effective fabrication method using hydrofluoric acid droplet wet-etching and surface-tension-driven liquid flows. The proposed fiber sensor consists of a pair of twisted tapered optical fibers with low excess losses. The fiber cores in the etched microfiber region are exposed to the surrounding medium for efficient interaction with the guided light. We observe that the etching-based low-loss fiber-optic sensors can measure the water droplet volume by detecting the refractive index changes of the surrounding medium around the etched fiber core region.
Large-Scale Concatenation cDNA Sequencing
Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.
1997-01-01
A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
DOE Office of Scientific and Technical Information (OSTI.GOV)
Myers, G.; Korber, B.; Wain-Hobson, S.
1993-12-31
This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.
Complete cDNA sequence and amino acid analysis of a bovine ribonuclease K6 gene.
Pietrowski, D; Förster, M
2000-01-01
The complete cDNA sequence of a ribonuclease k6 gene of Bos taurus has been determined. It codes for a protein with 154 amino acids and contains the invariant cysteine, histidine and lysine residues as well as the characteristic motifs specific to ribonuclease active sites. The deduced protein sequence is 27 residues longer than other known ribonucleases k6 and shows amino acid exchanges which could reflect a strain specificity or polymorphism within the bovine genome. Based on sequence similarity we have termed the identified gene bovine ribonuclease k6 b (brk6b).
Cloning of an avilamycin biosynthetic gene cluster from Streptomyces viridochromogenes Tü57.
Gaisser, S; Trefzer, A; Stockert, S; Kirschning, A; Bechthold, A
1997-01-01
A 65-kb region of DNA from Streptomyces viridochromogenes Tü57, containing genes encoding proteins involved in the biosynthesis of avilamycins, was isolated. The DNA sequence of a 6.4-kb fragment from this region revealed four open reading frames (ORF1 to ORF4), three of which are fully contained within the sequenced fragment. The deduced amino acid sequence of AviM, encoded by ORF2, shows 37% identity to a 6-methylsalicylic acid synthase from Penicillium patulum. Cultures of S. lividans TK24 and S. coelicolor CH999 containing plasmids with ORF2 on a 5.5-kb PstI fragment were able to produce orsellinic acid, an unreduced version of 6-methylsalicylic acid. The amino acid sequence encoded by ORF3 (AviD) is 62% identical to that of StrD, a dTDP-glucose synthase from S. griseus. The deduced amino acid sequence of AviE, encoded by ORF4, shows 55% identity to a dTDP-glucose dehydratase (StrE) from S. griseus. Gene insertional inactivation experiments of aviE abolished avilamycin production, indicating the involvement of aviE in the biosynthesis of avilamycins. PMID:9335272
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leong, JoAnn Ching
The nucleotide sequence of the IHNV glycoprotein gene has been determined from a cDNA clone containing the entire coding region. The glycoprotein cDNA clone contained a leader sequence of 48 bases, a coding region of 1524 nucleotides, and 39 bases at the 3′ end. The entire cDNA clone contains 1609 nucleotides and encodes a protein of 508 amino acids. The deduced amino acid sequence gave a translated molecular weight of 56,795 daltons. A hydropathicity profile of the deduced amino acid sequence indicated that there were two major hydrophobic domains: one, at the N-terminus, delineating a signal peptide of 18 amino acids and the other, at the C-terminus, delineating the region of the transmembrane. Five possible sites of N-linked glycosylation were identified. Although no nucleic acid homology existed between the IHNV glycoprotein gene and the glycoprotein genes of rabies and VSV, there was significant homology at the amino acid level between all three rhabdovirus glycoproteins.
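A hydropathicity profile of the kind mentioned in this abstract is usually a sliding-window average of per-residue hydrophobicity values. The sketch below uses the Kyte-Doolittle scale as an assumption; the abstract does not state which scale or window size was used, and the function name and toy peptide are hypothetical.

```python
# Kyte-Doolittle hydropathy values (assumed scale; not stated in the abstract).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 9) -> list[float]:
    """Mean hydropathy of each full window along the sequence.

    Returns len(protein) - window + 1 values; an empty list if the
    sequence is shorter than the window.
    """
    values = [KYTE_DOOLITTLE[aa] for aa in protein.upper()]
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


# Toy peptide: a hydrophobic stretch followed by a charged stretch.
profile = hydropathy_profile("LLLLLKKKKK", window=5)
print(profile[0], profile[-1])
```

Runs of windows with a high positive mean are the usual candidates for signal peptides and transmembrane segments, which is how the two hydrophobic domains in the abstract would show up in such a profile.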
Kim, Juhan; Kyung, Dohyun; Yun, Hyungdon; Cho, Byung-Kwan; Seo, Joo-Hyun; Cha, Minho; Kim, Byung-Gee
2007-01-01
A novel β-transaminase gene was cloned from Mesorhizobium sp. strain LUK. By using N-terminal sequence and an internal protein sequence, a digoxigenin-labeled probe was made for nonradioactive hybridization, and a 2.5-kb gene fragment was obtained by colony hybridization of a cosmid library. Through Southern blotting and sequence analysis of the selected cosmid clone, the structural gene of the enzyme (1,335 bp) was identified, which encodes a protein of 47,244 Da with a theoretical pI of 6.2. The deduced amino acid sequence of the β-transaminase showed the highest sequence similarity with glutamate-1-semialdehyde aminomutase of transaminase subgroup II. The β-transaminase showed higher activities toward d-β-aminocarboxylic acids such as 3-aminobutyric acid, 3-amino-5-methylhexanoic acid, and 3-amino-3-phenylpropionic acid. The β-transaminase has an unusually broad specificity for amino acceptors such as pyruvate and α-ketoglutarate/oxaloacetate. The enantioselectivity of the enzyme suggested that the recognition mode of β-aminocarboxylic acids in the active site is reversed relative to that of α-amino acids. After comparison of its primary structure with transaminase subgroup II enzymes, it was proposed that R43 interacts with the carboxylate group of the β-aminocarboxylic acids and the carboxylate group on the side chain of dicarboxylic α-keto acids such as α-ketoglutarate and oxaloacetate. R404 is another conserved residue, which interacts with the α-carboxylate group of the α-amino acids and α-keto acids. The β-transaminase was used for the asymmetric synthesis of enantiomerically pure β-aminocarboxylic acids. (3S)-Amino-3-phenylpropionic acid was produced from the ketocarboxylic acid ester substrate by coupled reaction with a lipase using 3-aminobutyric acid as amino donor. PMID:17259358
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rudwaleit, M.; Bowness, P.; Wordsworth, P.
1996-12-31
The HLA-B27 subtype HLA-B*2704 is virtually absent in Caucasians but common in Orientals, where it is associated with ankylosing spondylitis. The amino acid sequence of HLA-B*2704 has been established by peptide mapping and was shown to differ by two amino acids from HLA-B*2705. HLA-B*2704 is characterized by a serine for aspartic acid substitution at position 77 and glutamic acid for valine at position 152. To date, however, no nucleotide sequence confirming these changes at the DNA level has been published. 13 refs., 2 figs.
Federal Register 2010, 2011, 2012, 2013, 2014
2012-05-15
... (EPO) as the lead, to propose a revised standard for the filing of nucleotide and/or amino acid.... ST.25 uses a controlled vocabulary of feature keys to describe nucleic acid and amino acid sequences... patent data purposes. The XML standard also includes four qualifiers for amino acids. These feature keys...
Molecular cloning of the pheromone biosynthesis-activating neuropeptide in Helicoverpa zea.
Davis, M T; Vakharia, V N; Henry, J; Kempe, T G; Raina, A K
1992-01-01
Pheromone biosynthesis-activating neuropeptide (PBAN) regulates sex pheromone biosynthesis in female Helicoverpa (Heliothis) zea. Two oligonucleotide probes representing two overlapping amino acid regions of PBAN were used to screen 2.5 × 10^5 recombinant plaques, and a positive recombinant clone was isolated. Sequence analysis of the isolated clone showed that the PBAN gene is interrupted after the codon encoding amino acid 14 by a 0.63-kilobase (kb) intron. Preceding the PBAN amino acid sequence is a 10-amino acid sequence containing a pentapeptide Phe-Thr-Pro-Arg-Leu, which is followed by a Gly-Arg-Arg processing site. Immediately after the PBAN amino acid sequence is a Gly-Arg processing site and a short stretch of 10 amino acids. This 10-amino acid sequence contains a repeat of the PBAN C-terminal pentapeptide Phe-Ser-Pro-Arg-Leu and is terminated by another Gly-Arg processing site. It is suggested that the PBAN gene in H. zea might carry, besides PBAN, a 7- and an 8-residue amidated peptide, which share with PBAN the core C-terminal pentapeptide Phe-(Ser or Thr)-Pro-Arg-Leu-NH2. The C-terminal pentapeptide sequence of PBAN represents the minimum sequence required for pheromonotropic activity in H. zea and also bears a high degree of homology to the pyrokinin family of insect peptides with myotropic activity. It is possible that the putative heptapeptide and octapeptide might be new members of the pyrokinin family, with pheromonotropic and/or myotropic activities. Thus, the PBAN gene products, besides affecting sexual behavior, might have broad influence on many biological processes in H. zea. PMID:1729680
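The core motif described in this abstract, Phe-(Ser or Thr)-Pro-Arg-Leu followed by a Gly-Arg processing site, can be written as a pattern over one-letter residue codes. The following is a minimal sketch only: the function name and the toy precursor string are hypothetical illustrations and are not the H. zea sequence, and the pattern encodes nothing beyond what the abstract states.

```python
import re

# F[ST]PRL = Phe-(Ser or Thr)-Pro-Arg-Leu; the lookahead requires a
# Gly-Arg processing site immediately after the pentapeptide.
MOTIF = re.compile(r"F[ST]PRL(?=GR)")


def find_candidate_peptides(precursor):
    """Return (0-based start, pentapeptide) for each motif occurrence."""
    return [(m.start(), m.group()) for m in MOTIF.finditer(precursor)]


# Hypothetical toy precursor, invented for illustration only.
toy_precursor = "AAFTPRLGRRAAAFSPRLGRAA"
hits = find_candidate_peptides(toy_precursor)
print(hits)
```

The lookahead keeps the processing site out of the reported match, so each hit is the pentapeptide alone.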
Host Cell Virus Entry Mediated by Australian Bat Lyssavirus Envelope G glycoprotein
2013-10-24
39 Figure 7. Comparison of the amino acid sequences of Saccolaimus and Pteropus ABLV G mature protein... sequence analysis revealed that the PCR products were identical. Sequence comparisons of the ABLV N and other lyssavirus N proteins showed that ABLV...Saccolaimus flaviventris) (129). Nucleoprotein sequence comparisons revealed that the Saccolaimus N protein shared 96% amino acid homology with the Pteropus
DNA sequence similarity recognition by hybridization to short oligomers
Milosavljevic, Aleksandar
1999-01-01
Methods are disclosed for the comparison of nucleic acid sequences. Data is generated by hybridizing sets of oligomers with target nucleic acids. The data thus generated is manipulated simultaneously with respect to both (i) matching between oligomers and (ii) matching between oligomers and putative reference sequences available in databases. Using data compression methods to manipulate this mutual information, sequences for the target can be constructed.
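To make the idea of matching hybridization data against reference sequences concrete, here is a minimal sketch that treats the hybridization result as the set of oligomers found in the target and scores each reference by the fraction of those oligomers it contains. This overlap score is a deliberately simplified stand-in and is not the compression-based method the record discloses; all function names and the toy sequences are hypothetical.

```python
def oligomer_set(seq, k):
    """All distinct length-k substrings (oligomers) of a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shared_fraction(target_oligos, reference, k):
    """Fraction of the target's oligomers that also occur in the reference."""
    if not target_oligos:
        return 0.0
    return len(target_oligos & oligomer_set(reference, k)) / len(target_oligos)


def rank_references(target_oligos, references, k):
    """Sort reference names by decreasing shared-oligomer fraction."""
    scores = {name: shared_fraction(target_oligos, seq, k)
              for name, seq in references.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Toy data: the "hybridization result" is simulated from a known target.
observed = oligomer_set("ACGTACGT", 4)
toy_references = {"ref_a": "ACGTACGA", "ref_b": "TTTTGGGG"}
ranking = rank_references(observed, toy_references, 4)
print(ranking)
```

Real hybridization data is noisy, so a practical scheme would need to tolerate false positive and false negative oligomer signals; this sketch assumes perfect signals.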
Patel, Kamlesh D.
2018-01-22
Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology" at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June 2012 in Santa Fe, NM.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Patel, Kamlesh D.
2012-06-01
Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology" at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June 2012 in Santa Fe, NM.
Genetic association of polymorphism rs1333049 with gout.
Wang, Binbin; Meng, Dongmei; Wang, Jing; Liu, Shiguo; Zhou, Sirui; Miao, Zhimin; Han, Lin; Chu, Nan; Zhang, Kun; Ma, Xu; Li, Changgui
2011-09-01
We suspect that genes or loci that contribute to coronary artery disease (CAD) may also play a role in the pathogenesis of gout, since hyperuricaemia leads to gout, and serum uric acid (SUA) levels are potential risk factors for CAD. The single nucleotide polymorphism (SNP) rs1333049 (C/G) on chromosome 9p21 has been associated with CAD in previous studies. The aim of this study was to evaluate the relationship between this SNP and gout pathogenesis. Nine hundred Chinese Han individuals were recruited for this study (461 gout patients and 439 gout-free individuals). The rs1333049 SNP and surrounding sequences were PCR-sequenced. The rs1333049 genotypic and allelic frequencies differed significantly between gout cases and controls (χ² = 6.81, df = 2, P = 0.033 by genotype; χ² = 6.63, df = 1, P = 0.01 by allele). There was a significantly increased risk of gout in carriers of the CC genotype (odds ratio = 1.43, 95% CI 1.07, 1.91). To the best of our knowledge, our findings are the first to establish an association of rs1333049 with gout in a Chinese Han population. Meanwhile, this SNP is homologous to miR-519 and miR-520.
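The reported P values can be cross-checked from the chi-square statistics alone, because the upper-tail probability has a closed form for 1 and 2 degrees of freedom. The sketch below is a consistency check on the published figures, not a reanalysis: the genotype counts are not given in the abstract, so none are assumed, and the helper name `chi2_sf` is introduced here for illustration.

```python
import math


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square statistic for df = 1 or 2."""
    if df == 1:
        # A chi-square variable with 1 df is the square of a standard normal.
        return math.erfc(math.sqrt(x / 2.0))
    if df == 2:
        # With 2 df the distribution is exponential with mean 2.
        return math.exp(-x / 2.0)
    raise ValueError("closed forms here cover only df = 1 and df = 2")


p_genotype = chi2_sf(6.81, 2)  # reported as P = 0.033
p_allele = chi2_sf(6.63, 1)    # reported as P = 0.01

# A Wald-type 95% CI is symmetric on the log scale, so the geometric
# mean of its bounds should be close to the reported odds ratio of 1.43.
or_from_ci = math.sqrt(1.07 * 1.91)

print(round(p_genotype, 3), round(p_allele, 3), round(or_from_ci, 2))
```

Both recomputed P values and the odds ratio implied by the interval agree with the abstract to the precision reported.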
Nucleosome regulatory dynamics in response to TGFβ
Enroth, Stefan; Andersson, Robin; Bysani, Madhusudhan; Wallerman, Ola; Termén, Stefan; Tuch, Brian B.; De La Vega, Francisco M.; Heldin, Carl-Henrik; Moustakas, Aristidis; Komorowski, Jan; Wadelius, Claes
2014-01-01
Nucleosomes play important roles in a cell beyond their basal functionality in chromatin compaction. Their placement affects all steps in transcriptional regulation, from transcription factor (TF) binding to messenger ribonucleic acid (mRNA) synthesis. Careful profiling of their locations and dynamics in response to stimuli is important to further our understanding of transcriptional regulation by the state of chromatin. We measured nucleosome occupancy in human hepatic cells before and after treatment with transforming growth factor beta 1 (TGFβ1), using massively parallel sequencing. With a newly developed method, SuMMIt, for precise positioning of nucleosomes, we inferred dynamics of the nucleosomal landscape. Distinct nucleosome positioning has previously been described at transcription start sites and flanking TF binding sites. We found that the average pattern is present at very few sites and, in the case of TF binding, the double peak surrounding the sites is just an artifact of averaging over many loci. We systematically searched for depleted nucleosomes in stimulated cells compared to unstimulated cells and identified 24 318 loci. Depending on genomic annotation, 44–78% of them were over-represented in binding motifs for TFs. Changes in binding affinity were verified for HNF4α by qPCR. Strikingly, many of these loci were associated with expression changes, as measured by RNA sequencing. PMID:24771338
Crotoxin: Structural Studies, Mechanism of Action and Cloning of Its Gene
1987-03-01
other venoms and examine their toxin neutralizing ability. The amino acid sequences of both crotoxin subunits were determined as a prelude to cloning...be examined for their potential as anti-idiotype vaccines. The complete amino acid sequence of the basic subunit and two of the three acidic subunit chains...of crotoxin from the venom of C.d. terrificus has been determined. Sequence comparison data suggest that the non-toxic, acidic subunit was derived
Comparative analysis of the XopD T3S effector family in plant pathogenic bacteria
Kim, Jung-Gun; Taylor, Kyle W.; Mudgett, Mary Beth
2011-01-01
SUMMARY XopD is a type III effector protein that is required for Xanthomonas campestris pathovar vesicatoria (Xcv) growth in tomato. It is a modular protein consisting of an N-terminal DNA-binding domain, two EAR transcriptional repressor motifs, and a C-terminal SUMO protease. In tomato, XopD functions as a transcriptional repressor, resulting in the suppression of defense responses at late stages of infection. A survey of available genome sequences for phytopathogenic bacteria revealed that XopD homologs are limited to species within three Genera of Proteobacteria – Xanthomonas, Acidovorax, and Pseudomonas. While the EAR motif(s) and SUMO protease domain are conserved in all the XopD-like proteins, variation exists in the length and sequence identity of the N-terminal domains. Comparative analysis of the DNA sequences surrounding xopD and xopD-like genes led to revised annotation of the xopD gene. Edman degradation sequence analysis and functional complementation studies confirmed that the xopD gene from Xcv encodes a 760 amino acid protein with a longer N-terminal domain than previously predicted. None of the XopD-like proteins studied complemented Xcv ΔxopD mutant phenotypes in tomato leaves suggesting that the N-terminus of XopD defines functional specificity. Xcv ΔxopD strains expressing chimeric fusion proteins containing the N-terminus of XopD fused to the EAR motif(s) and SUMO protease domain of the XopD-like protein from Xanthomonas campestris pathovar campestris strain B100 were fully virulent in tomato demonstrating that the N-terminus of XopD controls specificity in tomato. PMID:21726373
Genetic Locus for Streptolysin S Production by Group A Streptococcus
Nizet, Victor; Beall, Bernard; Bast, Darrin J.; Datta, Vivekananda; Kilburn, Laurie; Low, Donald E.; De Azavedo, Joyce C. S.
2000-01-01
Group A streptococcus (GAS) is an important human pathogen that causes pharyngitis and invasive infections, including necrotizing fasciitis. Streptolysin S (SLS) is the cytolytic factor that creates the zone of beta-hemolysis surrounding GAS colonies grown on blood agar. We recently reported the discovery of a potential genetic determinant involved in SLS production, sagA, encoding a small peptide of 53 amino acids (S. D. Betschel, S. M. Borgia, N. L. Barg, D. E. Low, and J. C. De Azavedo, Infect. Immun. 66:1671–1679, 1998). Using transposon mutagenesis, chromosomal walking steps, and data from the GAS genome sequencing project (www.genome.ou.edu/strep.html), we have now identified a contiguous nine-gene locus (sagA to sagI) involved in SLS production. The sag locus is conserved among GAS strains regardless of M protein type. Targeted plasmid integrational mutagenesis of each gene in the sag operon resulted in an SLS-negative phenotype. Targeted integrations (i) upstream of the sagA promoter and (ii) downstream of a terminator sequence after sagI did not affect SLS production, establishing the functional boundaries of the operon. A rho-independent terminator sequence between sagA and sagB appears to regulate the amount of sagA transcript produced versus transcript for the entire operon. Reintroduction of the nine-gene sag locus on a plasmid vector restored SLS activity to the nonhemolytic sagA knockout mutant. Finally, heterologous expression of the intact sag operon conferred the SLS beta-hemolytic phenotype to the nonhemolytic Lactococcus lactis. We conclude that gene products of the GAS sag operon are both necessary and sufficient for SLS production. Sequence homologies of sag operon gene products suggest that SLS is related to the bacteriocin family of microbial toxins. PMID:10858242
Erixon, Per; Oxelman, Bengt
2008-01-01
Background Synonymous DNA substitution rates in the plant chloroplast genome are generally relatively slow and lineage dependent. Non-synonymous rates are usually even slower due to purifying selection acting on the genes. Positive selection is expected to speed up non-synonymous substitution rates, whereas synonymous rates are expected to be unaffected. Until recently, positive selection has seldom been observed in chloroplast genes, and large-scale structural rearrangements leading to gene duplications are hitherto supposed to be rare. Methodology/Principal Findings We found high substitution rates in the exons of the plastid clpP1 gene in Oenothera (the Evening Primrose family) and three separate lineages in the tribe Sileneae (Caryophyllaceae, the Carnation family). Introns have been lost in some of the lineages, but where present, the intron sequences have substitution rates similar to those found in other introns of their genomes. The elevated substitution rates of clpP1 are associated with statistically significant whole-gene positive selection in three branches of the phylogeny. In two of the lineages we found multiple copies of the gene. Neighboring genes present in the duplicated fragments do not show signs of elevated substitution rates or positive selection. Although non-synonymous substitutions account for most of the increase in substitution rates, synonymous rates are also markedly elevated in some lineages. Whereas plant clpP1 genes experiencing negative (purifying) selection are characterized by having very conserved lengths, genes under positive selection often have large insertions of more or less repetitive amino acid sequence motifs. Conclusions/Significance We found positive selection of the clpP1 gene in various plant lineages to be correlated with repeated duplication of the clpP1 gene and surrounding regions, repetitive amino acid sequences, and increases in synonymous substitution rates.
The present study sheds light on the controversial issue of whether negative or positive selection is to be expected after gene duplications by providing evidence for the latter alternative. The observed increase in synonymous substitution rates in some of the lineages indicates that the detection of positive selection may be obscured under such circumstances. Future studies are required to explore the functional significance of the large inserted repeated amino acid motifs, as well as the possibility that synonymous substitution rates may be affected by positive selection. PMID:18167545
Cloning, expression pattern and promoter functional analysis of cyp19a1a gene in miiuy croaker.
Huang, Wei; Yang, Pan; Lv, Zhenming; Wu, Changwen; Gui, Jianfang; Lou, Bao
2017-09-05
Gonadal-specific aromatase, encoded by cyp19a1a, is the key enzyme controlling estrogen biosynthesis in teleosts. In the present study, the cDNA sequence of cyp19a1a was cloned and characterized from miiuy croaker Miichthys miiuy. The cDNA encoded a protein of 519 amino acids with five structural regions. High amino acid sequence identities and conserved structural regions were found between Mmcyp19a1a and other cyp19a1a genes. In addition, Mmcyp19a1a clustered together with those of other seawater fishes. Immunohistochemical analysis revealed that Mmcyp19a1a was localized exclusively in the cytoplasm of thecal and granulosa cells surrounding the oocytes. Both the protein and mRNA levels of Mmcyp19a1a increased significantly in stage III (mid-vitellogenic) follicles and then decreased as vitellogenesis progressed. Interestingly, strong immunoreactive signals were also detected in the supporting cells of connective tissues during ovarian development. A 1777-bp promoter fragment of Mmcyp19a1a was also isolated, and functional analysis using an EGFP reporter fusion in zebrafish larvae showed positive signals above the yolk sac, in the region where the pronephros and germ plasm occur. The Mmcyp19a1a:EGFP expression pattern was generally consistent with that of endogenous cyp19a1a. These results indicate that the Mmcyp19a1a gene plays an important role during vitellogenesis and oocyte maturation. The Mmcyp19a1a:EGFP construct may provide a useful tool for genetic analysis of gonad development in teleosts. Copyright © 2017 Elsevier B.V. All rights reserved.
NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents.
Liu, Sophia S; Hockenberry, Adam J; Lancichinetti, Andrea; Jewett, Michael C; Amaral, Luís A N
2016-11-01
The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems.
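As a rough illustration of the kind of sampling this abstract describes, the sketch below back-translates a fixed protein sequence into a random coding sequence, choosing among synonymous codons with weights of the form exp(bias × GC count). That exponential form is the generic maximum-entropy solution for a constraint on expected GC content; it is an assumption made here for illustration and is not taken from the NullSeq package, whose API is not shown in the abstract. All names (`random_cds`, `gc_bias`) are hypothetical, and solving for the bias value that hits a specific GC target is omitted.

```python
import math
import random
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Group synonymous codons by the amino acid they encode.
SYNONYMS = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def gc_count(codon):
    """Number of G or C bases in a codon."""
    return sum(base in "GC" for base in codon)


def random_cds(protein, gc_bias, rng):
    """Sample one codon per residue; gc_bias > 0 favors GC-rich codons."""
    chosen = []
    for aa in protein:
        codons = SYNONYMS[aa]
        weights = [math.exp(gc_bias * gc_count(c)) for c in codons]
        chosen.append(rng.choices(codons, weights=weights)[0])
    return "".join(chosen)


def translate(cds):
    """Translate a coding sequence back to one-letter amino acids."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


rng = random.Random(0)
protein = "MKTAYIAKQR"  # arbitrary toy protein, for illustration only
cds = random_cds(protein, gc_bias=1.0, rng=rng)
print(cds, translate(cds) == protein)
```

With `gc_bias = 0` every synonymous codon is equally likely; the amino acid sequence is preserved for any bias value, which is the property a null model of this kind needs.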
A Novel Cylindrical Representation for Characterizing Intrinsic Properties of Protein Sequences.
Yu, Jia-Feng; Dou, Xiang-Hua; Wang, Hong-Bo; Sun, Xiao; Zhao, Hui-Ying; Wang, Ji-Hua
2015-06-22
The composition and sequence order of amino acid residues are the two most important characteristics to describe a protein sequence. Graphical representations facilitate visualization of biological sequences and produce biologically useful numerical descriptors. In this paper, we propose a novel cylindrical representation by placing the 20 amino acid residue types in a circle and sequence positions along the z axis. This representation allows visualization of the composition and sequence order of amino acids at the same time. Ten numerical descriptors and one weighted numerical descriptor have been developed to quantitatively describe intrinsic properties of protein sequences on the basis of the cylindrical model. Their applications to similarity/dissimilarity analysis of nine ND5 proteins indicated that these numerical descriptors are more effective than several classical numerical matrices. Thus, the cylindrical representation obtained here provides a new useful tool for visualizing and characterizing protein sequences. An online server is available at http://biophy.dzu.edu.cn:8080/CNumD/input.jsp.
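The geometric construction can be sketched as a mapping from each residue to a point on a cylinder: the residue type fixes the angle and the sequence position fixes the height. The abstract does not say how the 20 residue types are ordered around the circle, so the alphabetical ordering, the unit radius, and the function name below are assumptions for illustration; the paper's numerical descriptors are not reproduced.

```python
import math

# Assumed ordering of the 20 residue types around the circle (alphabetical
# by one-letter code); the source does not specify the ordering it uses.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def cylindrical_coords(sequence, radius=1.0):
    """Map residue n (1-based) to (x, y, z) with z = n on a cylinder."""
    step = 2.0 * math.pi / len(AMINO_ACIDS)
    coords = []
    for position, residue in enumerate(sequence, start=1):
        theta = step * AMINO_ACIDS.index(residue)
        coords.append((radius * math.cos(theta),
                       radius * math.sin(theta),
                       float(position)))
    return coords


points = cylindrical_coords("ACA")
print(points)
```

Because identical residue types share an angle, composition shows up as the angular distribution of points while sequence order shows up along the z axis.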
Cloning and sequence analysis of the invertase gene INV 1 from the yeast Pichia anomala.
Pérez, J A; Rodríguez, J; Rodríguez, L; Ruiz, T
1996-02-01
A genomic library from the yeast Pichia anomala has been constructed and employed to clone the gene encoding the sucrose-hydrolysing enzyme invertase by complementation of a sucrose non-fermenting mutant of Saccharomyces cerevisiae. The cloned gene, INV1, was sequenced and found to encode a polypeptide of 550 amino acids which contained a 22 amino-acid signal sequence and ten potential glycosylation sites. The amino-acid sequence shows significant identity with other yeast invertases and also with Kluyveromyces marxianus inulinase, a yeast beta-fructofuranosidase which has a different substrate specificity. The nucleotide sequences of the 5' and 3' non-coding regions were found to contain several consensus motifs probably involved in the initiation and termination of gene transcription.
Deyashiki, Y; Ogasawara, A; Nakayama, T; Nakanishi, M; Miyabe, Y; Sato, K; Hara, A
1994-01-01
Human liver contains two dihydrodiol dehydrogenases, DD2 and DD4, associated with 3 alpha-hydroxysteroid dehydrogenase activity. We have raised polyclonal antibodies that cross-reacted with the two enzymes and isolated two 1.2 kb cDNA clones (C9 and C11) for the two enzymes from a human liver cDNA library using the antibodies. The C9 and C11 clones contained coding sequences corresponding to 306 and 321 amino acid residues respectively, but lacked 5'-coding regions around the initiation codon. Sequence analyses of several peptides obtained by enzymic and chemical cleavages of the two purified enzymes verified that the C9 and C11 clones encoded DD2 and DD4 respectively, and further indicated that the sequence of DD2 had at least 16 additional residues upstream of the N-terminal sequence deduced from the cDNA. There was 82% amino acid sequence identity between the two enzymes, indicating that the enzymes are genetic isoenzymes. A computer-based comparison of the cDNAs of the isoenzymes with the DNA sequence database revealed that the nucleotide and amino acid sequences of DD2 and DD4 are virtually identical with those of human bile-acid binder and human chlordecone reductase cDNAs respectively. PMID:8172617
NASA Astrophysics Data System (ADS)
McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.
2016-05-01
Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonaphthalene (DAN), and extensive sequence-informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charge-localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence-informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence-informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.
López-García, Juan Manuel; Blain, Hugues-Alexandre; Bennàsar, Maria; Euba, Itxaso; Bañuls, Sandra; Bischoff, James; López-Ortega, Esther; Saladié, Palmira; Uzquiano, Paloma; Vallverdú, Josep
2012-01-01
The Cova dels Xaragalls is a small open karst system, located in the municipality of Vimbodí-Poblet (Tarragona, Catalonia, NE Spain). It is an important Holocene archaeological site that was inspected in the 1970s but from which little has been published. New excavations starting in 2008 have exposed a deep Late Pleistocene stratigraphical sequence. In this paper, we present for the first time palaeoenvironmental and palaeoclimatic reconstructions of this Late Pleistocene succession on the basis of both the small-vertebrate assemblages and the charcoals. Results from the small-vertebrate associations along the sequence indicate that the landscape had open-woodland habitats in the vicinity of the Cova dels Xaragalls, with wet points in the surrounding area. Woodland habitats were dominant throughout the sequence, as evidenced by the abundance of the species Apodemus sylvaticus, but were better developed during warm periods (layers C5 and C8), whereas during cold periods (layers C4 and C3) the environment was slightly more humid in response to higher mean annual precipitation and the opening of the landscape. The charcoal analysis indicates that the woodland surrounding the cave was composed mainly of Pinus (more than 90% was identified as Pinus), but that during the cold period (C3–C4) it incorporated some Quercus ilex/coccifera and Angiosperm indet., probably linked with greater precipitation. Comparisons are made with other long palaeoenvironmental sequences from the northeastern Iberian Peninsula and with global marine isotopic curves, providing a scenario for the palaeoclimatic and palaeoenvironmental changes that occurred during the Late Pleistocene in the woodland areas surrounding the Cova dels Xaragalls.
Goh, Falicia; Allen, Michelle A; Leuko, Stefan; Kawaguchi, Tomohiro; Decho, Alan W; Burns, Brendan P; Neilan, Brett A
2009-04-01
The stromatolites at Shark Bay, Western Australia, are analogues of some of the oldest evidence of life on Earth. The aim of this study was to identify and spatially characterize the specific microbial communities associated with Shark Bay intertidal columnar stromatolites. Conventional culturing methods and construction of 16S rDNA clone libraries from community genomic DNA with both universal and specific PCR primers were employed. The estimated coverage, richness and diversity of stromatolite microbial populations were compared with earlier studies on these ecosystems. The estimated coverage for all clone libraries indicated that population coverage was comprehensive. Phylogenetic analyses of stromatolite and surrounding seawater sequences were performed in ARB with the Greengenes database of full-length non-chimaeric 16S rRNA genes. The communities identified exhibited extensive diversity. The most abundant sequences from the stromatolites were alpha- and gamma-proteobacteria (58%), whereas the cyanobacterial community was characterized by sequences related to the genera Euhalothece, Gloeocapsa, Gloeothece, Chroococcidiopsis, Dermocarpella, Acaryochloris, Geitlerinema and Schizothrix. All clones from the archaeal-specific clone libraries were related to the halophilic archaea; however, no archaeal sequence was identified from the surrounding seawater. Fluorescence in situ hybridization also revealed stromatolite surfaces to be dominated by unicellular cyanobacteria, in contrast to the sub-surface archaea and sulphate-reducing bacteria. This study is the first to compare the microbial composition of morphologically similar stromatolites over time and examine the spatial distribution of specific microorganismic groups in these intertidal structures and the surrounding seawater at Shark Bay. The results provide a platform for identifying the key microbial physiology groups and their potential roles in modern stromatolite morphogenesis and ecology.
Hazes, Bart
2014-02-28
Protein-coding DNA sequences and their corresponding amino acid sequences are routinely used to study relationships between sequence, structure, function, and evolution. The rapidly growing size of sequence databases increases the power of such comparative analyses but it makes it more challenging to prepare high quality sequence data sets with control over redundancy, quality, completeness, formatting, and labeling. Software tools for some individual steps in this process exist but manual intervention remains a common and time consuming necessity. CDSbank is a database that stores both the protein-coding DNA sequence (CDS) and amino acid sequence for each protein annotated in Genbank. CDSbank also stores Genbank feature annotation, a flag to indicate incomplete 5' and 3' ends, full taxonomic data, and a heuristic to rank the scientific interest of each species. This rich information allows fully automated data set preparation with a level of sophistication that aims to meet or exceed manual processing. Defaults ensure ease of use for typical scenarios while allowing great flexibility when needed. Access is via a free web server at http://hazeslab.med.ualberta.ca/CDSbank/. CDSbank presents a user-friendly web server to download, filter, format, and name large sequence data sets. Common usage scenarios can be accessed via pre-programmed default choices, while optional sections give full control over the processing pipeline. Particular strengths are: extract protein-coding DNA sequences just as easily as amino acid sequences, full access to taxonomy for labeling and filtering, awareness of incomplete sequences, and the ability to take one protein sequence and extract all synonymous CDS or identical protein sequences in other species. Finally, CDSbank can also create labeled property files to, for instance, annotate or re-label phylogenetic trees.
Structure of the horseradish peroxidase isozyme C genes.
Fujiyama, K; Takemura, H; Shibayama, S; Kobayashi, K; Choi, J K; Shinmyo, A; Takano, M; Yamada, Y; Okada, H
1988-05-02
We have isolated, cloned and characterized three cDNAs and two genomic DNAs corresponding to the mRNAs and genes for the horseradish (Armoracia rusticana) peroxidase isoenzyme C (HRP C). The amino acid sequence of HRP C1, deduced from the nucleotide sequence of one of the cDNA clones, pSK1, contained the same primary sequence as that of the purified enzyme established by Welinder [FEBS Lett. 72, 19-23 (1976)], with additional sequences at the N and C termini. All three inserts in the cDNA clones pSK1, pSK2 and pSK3 coded for a peptide of the same size (308 amino acid residues) if they are processed in the same way, and the amino acid sequences were 91-94% homologous to each other. Functional amino acids, including His40, His170, Tyr185 and Arg183 and the S-S-bond-forming Cys residues, were conserved in the three isozymes, but a few N-glycosylation sites were not the same. Two HRP C isoenzyme genomic genes, prxC1 and prxC2, were arranged in tandem on the chromosomal DNA, and each gene consisted of four exons and three introns. The positions in the exons interrupted by introns were the same in the two genes. We observed a putative promoter sequence 5' upstream and a poly(A) signal 3' downstream in both genes. The gene product of prxC1 might be processed with a signal sequence of 30 amino acid residues at the N terminus and a peptide consisting of 15 amino acid residues at the C terminus.
Useful halophilic, thermostable and ionic liquids tolerant cellulases
Zhang, Tao; Datta, Supratim; Simmons, Blake A.; Rubin, Edward M.
2016-06-28
The present invention provides for an isolated or recombinant polypeptide comprising an amino acid sequence having at least 70% identity with the amino acid sequence of a Halorhabdus utahensis cellulase, such as Hu-CBH1, wherein said amino acid sequence has a halophilic thermostable and/or thermophilic cellobiohydrolase (CBH) activity. In some embodiments, the polypeptide has a CBH activity that is resistant to up to about 20% of ionic liquids. The present invention also provides for compositions comprising and methods using the isolated or recombinant polypeptide.
Khodakov, Dmitriy; Wang, Chunyan; Zhang, David Yu
2016-10-01
Nucleic acid sequence variations have been implicated in many diseases, and reliable detection and quantitation of DNA/RNA biomarkers can inform effective therapeutic action, enabling precision medicine. Nucleic acid analysis technologies being translated into the clinic can broadly be classified into hybridization, PCR, and sequencing, as well as their combinations. Here we review the molecular mechanisms of popular commercial assays, and their progress in translation into in vitro diagnostics. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Zhang, Jing-Nan; Song, Ping; Hu, Jia-Rui; Mo, Sai-Jun; Peng, Mao-Yu; Zhou, Wei; Zou, Ji-Xing; Hu, Yin-Chang
2005-01-01
In this study, full-length cDNAs of the GH (growth hormone) gene were isolated from six economically important fishes: Siniperca kneri, Epinephelus coioides, Monopterus albus, Silurus asotus, Misgurnus anguillicaudatus and Carassius auratus gibelio Bloch. With the exception of E. coioides GH, this is the first time these GH sequences have been cloned. The lengths of the cDNAs are, respectively, 953 bp, 1 023 bp, 825 bp, 1 082 bp, 1 154 bp and 1 180 bp. Each sequence includes an ORF of about 600 bp that encodes a protein of about 200 amino acids: the S. kneri, E. coioides and M. albus GHs have 204 amino acids, the S. asotus GH has 200, and the M. anguillicaudatus and C. auratus gibelio GHs have 210. A detailed sequence analysis of the six GHs together with many other fish sequences was then performed. All six sequences showed high homology to the other sequences, especially to sequences within the same order, and many conserved residues were identified, most of them localized in five domains. The phylogenetic trees (MP and NJ) of many fish GH ORF sequences (including the six new ones), with Amia calva as the outgroup, were generally resolved and largely congruent with the morphology-based tree, although some incongruities were observed, suggesting that the GH ORF deserves more attention in teleostean phylogeny.
Characterization and mapping of cDNA encoding aspartate aminotransferase in rice, Oryza sativa L.
Song, J; Yamamoto, K; Shomura, A; Yano, M; Minobe, Y; Sasaki, T
1996-10-31
Fifteen cDNA clones, putatively identified as encoding aspartate aminotransferase (AST, EC 2.6.1.1), were isolated and partially sequenced. Together with six previously isolated clones putatively identified as encoding ASTs (Sasaki, et al. 1994, Plant Journal 6, 615-624), their sequences were characterized and classified into 4 cDNA species. Two of the isolated clones, C60213 and C2079, were full-length cDNAs, and their complete nucleotide sequences were determined. C60213 was 1612 bp long and its deduced amino acid sequence showed 88% homology with that of Panicum miliaceum L. mitochondrial AST. The C60213-encoded protein had an N-terminal amino acid sequence that was characteristic of a mitochondrial transit peptide. On the other hand, C2079 was 1546 bp long and had 91% amino acid sequence homology with P. miliaceum L. cytosolic AST but lacked the transit peptide sequence. The homologies of nucleotide sequences and deduced amino acid sequences of C2079 and C60213 were 54% and 52%, respectively. C2079 and C60213 were mapped on chromosomes 1 and 6, respectively, by restriction fragment length polymorphism linkage analysis. Northern blot analysis using C2079 as a probe revealed much higher transcript levels in callus and root than in green and etiolated shoots, suggesting tissue-specific variations of AST gene expression.
Human somatostatin I: sequence of the cDNA.
Shen, L P; Pictet, R L; Rutter, W J
1982-01-01
RNA has been isolated from a human pancreatic somatostatinoma and used to prepare a cDNA library. After prescreening, clones containing somatostatin I sequences were identified by hybridization with a cloned anglerfish somatostatin I cDNA probe. From the nucleotide sequence of two of these clones, we have deduced an essentially full-length mRNA sequence, including the preprosomatostatin coding region, 105 nucleotides from the 5' untranslated region and the complete 150-nucleotide 3' untranslated region. The coding region predicts a 116-amino acid precursor protein (Mr, 12,727) that contains somatostatin-14 and -28 at its COOH terminus. The predicted amino acid sequence of human somatostatin-28 is identical to that of somatostatin-28 isolated from the porcine and ovine species. A comparison of the amino acid sequences of human and anglerfish preprosomatostatin I indicated that the COOH-terminal region encoding somatostatin-14 and the adjacent 6 amino acids are highly conserved, whereas the remainder of the molecule, including the signal peptide region, is more divergent. However, many of the amino acid differences found in the pro region of the human and anglerfish proteins are conservative changes. This suggests that the propeptides have a similar secondary structure, which in turn may imply a biological function for this region of the molecule. PMID:6126875
DOE Office of Scientific and Technical Information (OSTI.GOV)
Feild, M.J.; Armstrong, F.B.
1987-05-01
E. coli JA199 pDU11 harbors a multicopy plasmid containing the ilv GEDAY gene cluster of S. typhimurium. TmB, the gene product of ilv E, was purified, crystallized, and subjected to Edman degradation using a gas phase sequencer. The intact protein yielded an amino terminal 31 residue sequence. Both carboxymethylated apoenzyme and (³H)-NaBH-reduced holoenzyme were then subjected to digestion by trypsin. The digests were fractionated using reversed phase HPLC, and the peptides isolated were sequenced. The borohydride-treated holoenzyme was used to isolate the cofactor-binding peptide. The peptide is 27 residues long and a comparison with known sequences of other aminotransferases revealed limited homology. Peptides accounting for 211 of 288 predicted residues have been sequenced, including 9 residues of the carboxyl terminus. Comparison of peptides with the inferred amino acid sequence of the E. coli K-12 enzyme has helped determine the sequence of the amino terminal 59 residues; only two differences between the sequences are noted in this region.
The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.
Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R
1982-01-01
The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. PMID:6296791
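The length figures in the abstract above can be cross-checked with a few lines of arithmetic. The sketch below assumes (this is not stated in the abstract) that the 1251 bp reading frame includes one stop codon and that the 3' distance is counted from the end of that stop codon; the function and constant names are illustrative only.

```python
# Figures quoted in the abstract.
ORF_BP = 1251              # reading frame; assumed to include the stop codon
UTR5_NT = 36               # transcription start to translational start
UTR3_NT_RANGE = (86, 93)   # translational stop to transcription stop


def coded_residues(orf_bp):
    """Amino acids encoded by an ORF whose length includes one stop codon."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return orf_bp // 3 - 1


def mrna_length_range(orf_bp, utr5_nt, utr3_nt_range):
    """Shortest and longest non-polyadenylated transcript lengths."""
    return tuple(utr5_nt + orf_bp + utr3 for utr3 in utr3_nt_range)


residues = coded_residues(ORF_BP)
mrna_range = mrna_length_range(ORF_BP, UTR5_NT, UTR3_NT_RANGE)
print(residues, mrna_range)
```

Under these assumptions the script reproduces the 416 residues and the 1373 to 1380 nucleotide range reported in the abstract.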
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification.
Sinclair, Robert M; Ravantti, Janne J; Bamford, Dennis H
2017-04-15
Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allows them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids.
Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. Copyright © 2017 Sinclair et al.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification
Sinclair, Robert M.; Ravantti, Janne J.
2017-01-01
Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allows them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids.
Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. PMID:28122979
Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.
Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L
1987-08-01
To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded.
Benyo, B; Biro, J C; Benyo, Z
2004-01-01
The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2nd) base in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
Thermal and acid tolerant beta-xylosidases, genes encoding, related organisms, and methods
Thompson, David N [Idaho Falls, ID; Thompson, Vicki S [Idaho Falls, ID; Schaller, Kastli D [Ammon, ID; Apel, William A [Jackson, WY; Lacey, Jeffrey A [Idaho Falls, ID; Reed, David W [Idaho Falls, ID
2011-04-12
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius and variations thereof are provided. Further provided are methods of at least partially degrading xylotriose and/or xylobiose using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius and variations thereof.
Wolffe, E J; Gause, W C; Pelfrey, C M; Holland, S M; Steinberg, A D; August, J T
1990-01-05
We describe the isolation and sequencing of a cDNA encoding mouse Pgp-1. An oligonucleotide probe corresponding to the NH2-terminal sequence of the purified protein was synthesized by the polymerase chain reaction and used to screen a mouse macrophage lambda gt11 library. A cDNA clone with an insert of 1.2 kilobases was selected and sequenced. In Northern blot analysis, only cells expressing Pgp-1 contained mRNA species that hybridized with this Pgp-1 cDNA. The nucleotide sequence of the cDNA has a single open reading frame that yields a protein-coding sequence of 1076 base pairs followed by a 132-base pair 3'-untranslated sequence that includes a putative polyadenylation signal but no poly(A) tail. The translated sequence comprises a 13-amino acid signal peptide followed by a polypeptide core of 345 residues corresponding to an Mr of 37,800. Portions of the deduced amino acid sequence were identical to those obtained by amino acid sequence analysis from the purified glycoprotein, confirming that the cDNA encodes Pgp-1. The predicted structure of Pgp-1 includes an NH2-terminal extracellular domain (residues 14-265), a transmembrane domain (residues 266-286), and a cytoplasmic tail (residues 287-358). Portions of the mouse Pgp-1 sequence are highly similar to that of the human CD44 cell surface glycoprotein implicated in cell adhesion. The protein also shows sequence similarity to the proteoglycan tandem repeat sequences found in cartilage link protein and cartilage proteoglycan core protein which are thought to be involved in binding to hyaluronic acid.
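The residue ranges quoted in the abstract above can be checked for internal consistency. This is a small sketch, assuming the 13-residue signal peptide occupies positions 1-13 (inferred from the abstract, not stated explicitly); the dictionary and helper names are illustrative.

```python
# Inclusive residue ranges quoted in (or inferred from) the abstract.
REGIONS = {
    "signal peptide": (1, 13),          # assumed to span residues 1-13
    "extracellular domain": (14, 265),
    "transmembrane domain": (266, 286),
    "cytoplasmic tail": (287, 358),
}


def region_length(start, end):
    """Number of residues in an inclusive range."""
    return end - start + 1


def is_contiguous(regions):
    """True if the ranges tile the sequence with no gaps or overlaps."""
    bounds = sorted(regions.values())
    return all(prev[1] + 1 == nxt[0] for prev, nxt in zip(bounds, bounds[1:]))


lengths = {name: region_length(*span) for name, span in REGIONS.items()}
mature_core = sum(n for name, n in lengths.items() if name != "signal peptide")
print(lengths, mature_core, is_contiguous(REGIONS))
```

The three domains after the signal peptide sum to the 345-residue polypeptide core given in the abstract.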
Shamoo, Yousif; Sun, Siyang
2014-06-10
Chimeric proteins comprising a sequence nonspecific single-stranded nucleic-acid-binding domain joined to a catalytic nucleic-acid-modifying domain are provided. Methods comprising contacting a nucleic acid molecule with a chimeric protein, as well as systems comprising a nucleic acid molecule, a chimeric protein, and an aqueous solution are also provided. The joining of sequence nonspecific single-stranded nucleic-acid-binding domain and a catalytic nucleic-acid-modifying domain in chimeric proteins, among other things, may prevent the separation of the two domains due to their weak association and thereby enhances processivity while maintaining fidelity.
Zimmermann, Karel; Gibrat, Jean-François
2010-01-04
Sequence comparisons make use of a one-letter representation for amino acids, the necessary quantitative information being supplied by the substitution matrices. This paper deals with the problem of finding a representation that provides a comprehensive description of amino acid intrinsic properties consistent with the substitution matrices. We present a Euclidian vector representation of the amino acids, obtained by the singular value decomposition of the substitution matrices. The substitution matrix entries correspond to the dot product of amino acid vectors. We apply this vector encoding to the study of the relative importance of various amino acid physicochemical properties upon the substitution matrices. We also characterize and compare the PAM and BLOSUM series substitution matrices. This vector encoding introduces a Euclidian metric in the amino acid space, consistent with substitution matrices. Such a numerical description of the amino acid is useful when intrinsic properties of amino acids are necessary, for instance, building sequence profiles or finding consensus sequences, using machine learning algorithms such as Support Vector Machine and Neural Networks algorithms.
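The relation between matrix entries and dot products described above can be written out explicitly. The following is a sketch under a simplifying assumption that the abstract does not state: the substitution matrix $M$ is taken to be symmetric and positive semidefinite, so that its singular value decomposition coincides with its eigendecomposition. The symbols $M$, $U$, $\sigma_k$ and $x_i$ are illustrative and are not taken from the paper.

```latex
M = U \Sigma U^{\top}, \qquad
\Sigma = \operatorname{diag}(\sigma_1, \dots, \sigma_{20}), \quad \sigma_k \ge 0

x_i = \left( \sqrt{\sigma_1}\, U_{i1}, \; \dots, \; \sqrt{\sigma_{20}}\, U_{i,20} \right)

x_i \cdot x_j = \sum_{k=1}^{20} \sigma_k \, U_{ik} U_{jk} = M_{ij}
```

Here $x_i$ is the vector assigned to amino acid $i$. Real substitution matrices may not satisfy the positive-semidefinite assumption, in which case the dot-product identity holds only approximately or after a suitable adjustment.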
MECHANISMS OF HEAVY METAL REMOVAL FROM ACID MINE DRAINAGE USING CHITIN
Acid Mine Drainage (AMD) emanating from inactive or active mine sites contains elevated levels of toxic heavy metals, which can have an adverse impact to the surrounding environment. The major pathway involved in generation of AMD is weathering of pyritic mineral ores, where in s...
Molecular cloning and sequence analysis of stearoyl-CoA desaturase in milkfish, Chanos chanos.
Hsieh, S L; Liao, W L; Kuo, C M
2001-12-01
Stearoyl-CoA desaturase (EC 1.14.99.5) is a key enzyme in the biosynthesis of polyunsaturated fatty acids and the maintenance of the homeoviscous fluidity of biological membranes. The stearoyl-CoA desaturase cDNA in milkfish (Chanos chanos) was cloned by RT-PCR and RACE, and it was compared with the stearoyl-CoA desaturase in cold-tolerant teleosts, common carp and grass carp. Nucleotide sequence analysis revealed that the cDNA clone has a 972-bp open reading frame encoding 323 amino acid residues. Alignments of the deduced amino acid sequence showed that the milkfish stearoyl-CoA desaturase shares 79% and 75% identity with common carp and grass carp, and 63%-64% with other vertebrates such as sheep, hamsters, rats, mice, and humans. As in common carp and grass carp, the deduced milkfish amino acid sequence conserves the three histidine cluster motifs (one HXXXXH and two HXXHH) that are essential for the catalytic activity of stearoyl-CoA desaturase. However, RT-PCR analysis showed that stearoyl-CoA desaturase expression in milkfish is detected in liver, muscle, kidney, brain, and gill tissues, and more expression sites were found in milkfish than in common carp and grass carp. Phylogenetic relationships among the deduced stearoyl-CoA desaturase amino acid sequence in milkfish and those in other vertebrates showed that the milkfish stearoyl-CoA desaturase amino acid sequence is phylogenetically closer to those of common carp and grass carp than to those of other higher vertebrates.
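The histidine cluster motifs named above (HXXXXH and HXXHH, where X is any residue) can be located with regular expressions. The sketch below is illustrative only: the toy sequence is invented for the example and is not the milkfish desaturase, and the function name is hypothetical.

```python
import re

# Motif patterns: H, any four residues, H; and H, any two residues, HH.
MOTIFS = {"HXXXXH": r"H.{4}H", "HXXHH": r"H.{2}HH"}


def find_his_motifs(protein):
    """Return (motif name, 1-based start, matched text), sorted by position."""
    hits = []
    for name, pattern in MOTIFS.items():
        # A lookahead lets overlapping occurrences be reported as well.
        for match in re.finditer(rf"(?=({pattern}))", protein):
            hits.append((name, match.start() + 1, match.group(1)))
    return sorted(hits, key=lambda hit: hit[1])


# Invented toy sequence carrying one HXXXXH and two HXXHH motifs.
toy_sequence = "MKTHRLWSHAAGHRAHHLLPGAHNVHHK"
hits = find_his_motifs(toy_sequence)
print(hits)

# Cross-check of the ORF figures quoted in the abstract,
# assuming the 972 bp include one stop codon.
orf_residues = 972 // 3 - 1
print(orf_residues)
```

On a real sequence the same function would report every occurrence, so additional filtering by expected position may be needed to pick out the catalytic clusters.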
Amino Acid Racemization and the Preservation of Ancient DNA
NASA Technical Reports Server (NTRS)
Poinar, Hendrik N.; Hoss, Matthias
1996-01-01
The extent of racemization of aspartic acid, alanine, and leucine provides criteria for assessing whether ancient tissue samples contain endogenous DNA. In samples in which the D/L ratio of aspartic acid exceeds 0.08, ancient DNA sequences could not be retrieved. Paleontological finds from which DNA sequences purportedly millions of years old have been reported show extensive racemization, and the amino acids present are mainly contaminants. An exception is the amino acids in some insects preserved in amber.
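The aspartic acid criterion above amounts to a simple threshold test. The sketch below encodes only the 0.08 cut-off reported in the abstract; the function name and the sample values are hypothetical, and a ratio at or below the threshold is a necessary condition in this reading, not a guarantee that DNA can be recovered.

```python
ASP_DL_THRESHOLD = 0.08  # D/L ratio of aspartic acid quoted in the abstract


def dna_retrieval_plausible(asp_dl_ratio):
    """False when the Asp D/L ratio exceeds the reported threshold."""
    if asp_dl_ratio < 0:
        raise ValueError("D/L ratio cannot be negative")
    return asp_dl_ratio <= ASP_DL_THRESHOLD


# Hypothetical measurements, for illustration only.
samples = {"sample A": 0.05, "sample B": 0.08, "sample C": 0.25}
screen = {name: dna_retrieval_plausible(ratio) for name, ratio in samples.items()}
print(screen)
```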
Characterization of tannase protein sequences of bacteria and fungi: an in silico study.
Banerjee, Amrita; Jana, Arijit; Pati, Bikash R; Mondal, Keshab C; Das Mohapatra, Pradeep K
2012-04-01
The tannase protein sequences of 149 bacteria and 36 fungi were retrieved from the NCBI database. Of these, only the 77 bacterial and 31 fungal tannase sequences with differing amino acid compositions were retained. These sequences were analysed for physical and chemical properties and subjected to a superfamily search, multiple sequence alignment, phylogenetic tree construction and motif finding in order to identify functional motifs and the evolutionary relationships among them. The superfamily search for these tannases revealed the occurrence of proline iminopeptidase-like, biotin biosynthesis protein BioH, O-acetyltransferase, carboxylesterase/thioesterase 1, carbon-carbon bond hydrolase, haloperoxidase, prolyl oligopeptidase, C-terminal domain and mycobacterial antigens families and the alpha/beta hydrolase superfamily. Some bacterial and fungal sequences showed similarity with different families individually. The multiple sequence alignment of these tannase protein sequences showed conserved regions at different stretches, with maximum homology at amino acid residues 389-469 and 482-523, which could be used for designing degenerate primers or probes specific for tannase-producing bacterial and fungal species. The phylogenetic tree showed two clusters: one contains only bacteria, and the other contains both fungi and bacteria, indicating some relationship between these different genera. In the second cluster, however, nearly all fungal species were grouped together, which indicates sequence-level similarity among the fungal genera. Analysis of the distribution of fourteen motifs revealed that Motif 1, with a signature sequence of 29 amino acids (GCSTGGREALKQAQRWPHDYDGIIANNPA), was uniformly observed in 83.3% of the studied tannase sequences, suggesting its involvement in the structure and enzymatic function.
Nucleic Acid Encoding A Lectin-Derived Progenitor Cell Preservation Factor
Colucci, M. Gabriella; Chrispeels, Maarten J.; Moore, Jeffrey G.
2001-10-30
The invention relates to an isolated nucleic acid molecule that encodes a protein that is effective to preserve progenitor cells, such as hematopoietic progenitor cells. The nucleic acid comprises a sequence defined by SEQ ID NO:1, a homolog thereof, or a fragment thereof. The encoded protein has an amino acid sequence that comprises a sequence defined by SEQ ID NO:2, a homolog thereof, or a fragment thereof that contains an amino acid sequence TNNVLQVT. Methods of using the encoded protein for preserving progenitor cells in vitro, ex vivo, and in vivo are also described. The invention, therefore, includes methods such as myeloablation therapies for cancer treatment wherein myeloid reconstitution is facilitated by means of the specified protein. Other therapeutic utilities are also enabled through the invention, for example, expanding progenitor cell populations ex vivo to increase chances of engraftment, improving conditions for transporting and storing progenitor cells, and facilitating gene therapy to treat and cure a broad range of life-threatening hematologic diseases.
Alkaptonuria and Pompe disease in one patient: metabolic and molecular analysis.
Zouheir Habbal, Mohammad; Bou Assi, Tarek; Mansour, Hicham
2013-04-29
Pompe disease is characterised by deficiency of acid α-glucosidase that results in abnormal glycogen deposition in the muscles. Alkaptonuria is caused by a defect in the enzyme homogentisate 1,2-dioxygenase with subsequent accumulation of homogentisic acid. We report the case of a 6-year-old boy diagnosed with Pompe disease and alkaptonuria. Urine organic acids and α-glucosidase were measured. Homogentisate 1,2-dioxygenase (HGO) and acid alpha-glucosidase (GAA) genes were sequenced by Sanger DNA sequencing. The level of α-glucosidase in white blood cells was markedly decreased (4 nm/mg) while the level of homogentisic acid was markedly increased (15 027 mmol/mol creatine). GAA sequencing detected two heterozygous GAA mutations (c.670C>T and c.1064T>C) while HGO sequencing revealed three polymorphisms in exons 4, 5 and 6, respectively. To the best of our knowledge, this is the first reported instance of Pompe disease and alkaptonuria occurring in the same individual.
Alkaptonuria and Pompe disease in one patient: metabolic and molecular analysis
Habbal, Mohammad Zouheir; Bou Assi, Tarek; Mansour, Hicham
2013-01-01
Pompe disease is characterised by deficiency of acid α-glucosidase that results in abnormal glycogen deposition in the muscles. Alkaptonuria is caused by a defect in the enzyme homogentisate 1,2-dioxygenase with subsequent accumulation of homogentisic acid. We report the case of a 6-year-old boy diagnosed with Pompe disease and alkaptonuria. Urine organic acids and α-glucosidase were measured. Homogentisate 1,2-dioxygenase (HGO) and acid alpha-glucosidase (GAA) genes were sequenced by Sanger DNA sequencing. The level of α-glucosidase in white blood cells was markedly decreased (4 nm/mg) while the level of homogentisic acid was markedly increased (15 027 mmol/mol creatine). GAA sequencing detected two heterozygous GAA mutations (c.670C>T and c.1064T>C) while HGO sequencing revealed three polymorphisms in exons 4, 5 and 6, respectively. To the best of our knowledge, this is the first reported instance of Pompe disease and alkaptonuria occurring in the same individual. PMID:23632174
Amino acid sequence of a trypsin inhibitor from a Spirometra (Spirometra erinaceieuropaei).
Sanda, A; Uchida, A; Itagaki, T; Kobayashi, H; Inokuchi, N; Koyama, T; Iwama, M; Ohgi, K; Irie, M
2001-12-01
A trypsin inhibitor that is highly homologous with bovine pancreatic trypsin inhibitor (BPTI) was co-purified along with RNase from Spirometra (Spirometra erinaceieuropaei). The amino acid sequence of this inhibitor (SETI) and the nucleotide sequence of the cDNA encoding this protein were determined by protein chemistry and gene technology. SETI contains 68 amino acid residues and has a molecular mass of 7,798 Da. SETI has 31 amino acid residues that are identical with BPTI's sequence, including 6 half-cystine and 5 aromatic amino acid residues. The active site Lys residue in BPTI is replaced by an Arg residue in SETI. SETI is an effective inhibitor of trypsin and moderately inhibits α-chymotrypsin, but inhibits elastase and subtilisin less effectively. SETI was expressed by E. coli containing a PelB vector carrying the SETI-encoding cDNA; an expression yield of 0.68 mg/l was obtained. The phylogenetic relationship of SETI and the other BPTI-like trypsin inhibitors was analyzed using maximum likelihood inference methods.
Arrays of nucleic acid probes on biological chips
Chee, Mark; Cronin, Maureen T.; Fodor, Stephen P. A.; Huang, Xiaohua X.; Hubbell, Earl A.; Lipshutz, Robert J.; Lobban, Peter E.; Morris, MacDonald S.; Sheldon, Edward L.
1998-11-17
DNA chips containing arrays of oligonucleotide probes can be used to determine whether a target nucleic acid has a nucleotide sequence identical to or different from a specific reference sequence. The array of probes comprises probes exactly complementary to the reference sequence, as well as probes that differ by one or more bases from the exactly complementary probes.
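The probe design described above can be illustrated with a short sketch that builds, for one window of a reference sequence, the exactly complementary probe and its single-base variants. Placing the substitution at the central probe position is an assumption made for this example (the abstract says only that the variants differ by one or more bases), and the function names and toy reference are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string over the alphabet A, C, G, T."""
    return seq.translate(COMPLEMENT)[::-1]


def probe_set(reference, start, probe_len):
    """Perfect-match probe for one window, plus central single-base variants."""
    target = reference[start:start + probe_len]
    if len(target) != probe_len:
        raise ValueError("window runs past the end of the reference")
    perfect = reverse_complement(target)
    mid = probe_len // 2
    # Substitute each of the three other bases at the central position.
    mismatches = [
        perfect[:mid] + base + perfect[mid + 1:]
        for base in "ACGT"
        if base != perfect[mid]
    ]
    return perfect, mismatches


# Toy reference sequence, invented for illustration.
perfect, mismatches = probe_set("ACGTTGCA", start=0, probe_len=5)
print(perfect, mismatches)
```

Sliding `start` along the reference would give one such probe group per position, which is the sense in which a target can be compared base by base against the reference.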
A proteomic analysis of leaf sheaths from rice.
Shen, Shihua; Matsubae, Masami; Takao, Toshifumi; Tanaka, Naoki; Komatsu, Setsuko
2002-10-01
The proteins extracted from the leaf sheaths of rice seedlings were separated by 2-D PAGE, and analyzed by Edman sequencing and mass spectrometry, followed by database searching. Image analysis revealed 352 protein spots on 2-D PAGE after staining with Coomassie Brilliant Blue. The amino acid sequences of 44 of 84 proteins were determined; for 31 of these proteins, a clear function could be assigned, whereas for 12 proteins, no function could be assigned. Forty proteins did not yield amino acid sequence information, because they were N-terminally blocked, or the obtained sequences were too short and/or did not give unambiguous results. Fifty-nine proteins were analyzed by mass spectrometry; all of these proteins were identified by matching to the protein database. The amino acid sequences of 19 of 27 proteins analyzed by mass spectrometry were similar to the results of Edman sequencing. These results suggest that 2-D PAGE combined with Edman sequencing and mass spectrometry analysis can be effectively used to identify plant proteins.
Nucleic acid constructs containing orthogonal site selective recombinases (OSSRs)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gilmore, Joshua M.; Anderson, J. Christopher; Dueber, John E.
The present invention provides for a recombinant nucleic acid comprising a nucleotide sequence comprising a plurality of constructs, wherein each construct independently comprises a nucleotide sequence of interest flanked by a pair of recombinase recognition sequences. Each pair of recombinase recognition sequences is recognized by a distinct recombinase. Optionally, each construct can, independently, further comprise one or more genes encoding a recombinase capable of recognizing the pair of recombinase recognition sequences of the construct. The recombinase can be an orthogonal (non-cross reacting), site-selective recombinase (OSSR).
Predicting residue-wise contact orders in proteins by support vector regression.
Song, Jiangning; Burrage, Kevin
2006-10-03
The residue-wise contact order (RWCO) describes the sequence separations between the residue of interest and its contacting residues in a protein sequence. It is a new kind of one-dimensional protein structure that represents the extent of long-range contacts and is considered a generalization of contact order. Together with secondary structure, accessible surface area, the B factor, and contact number, RWCO provides comprehensive and indispensable information for reconstructing the protein three-dimensional structure from a set of one-dimensional structural properties. Accurately predicting RWCO values could have many important applications in protein three-dimensional structure prediction and protein folding rate prediction, and give deep insights into protein sequence-structure relationships. We developed a novel approach to predict residue-wise contact order values in proteins based on support vector regression (SVR), starting from primary amino acid sequences. We explored seven different sequence encoding schemes to examine their effects on the prediction performance, including local sequence in the form of PSI-BLAST profiles, local sequence plus amino acid composition, local sequence plus molecular weight, local sequence plus secondary structure predicted by PSIPRED, local sequence plus molecular weight and amino acid composition, local sequence plus molecular weight and predicted secondary structure, and local sequence plus molecular weight, amino acid composition and predicted secondary structure. When using local sequences with multiple sequence alignments in the form of PSI-BLAST profiles, we could predict the RWCO distribution with a Pearson correlation coefficient (CC) between the predicted and observed RWCO values of 0.55, and root mean square error (RMSE) of 0.82, based on a well-defined dataset with 680 protein sequences.
Moreover, by incorporating global features such as molecular weight and amino acid composition we could further improve the prediction performance with the CC to 0.57 and an RMSE of 0.79. In addition, combining the predicted secondary structure by PSIPRED was found to significantly improve the prediction performance and could yield the best prediction accuracy with a CC of 0.60 and RMSE of 0.78, which provided at least comparable performance compared with the other existing methods. The SVR method shows a prediction performance competitive with or at least comparable to the previously developed linear regression-based methods for predicting RWCO values. In contrast to support vector classification (SVC), SVR is very good at estimating the raw value profiles of the samples. The successful application of the SVR approach in this study reinforces the fact that support vector regression is a powerful tool in extracting the protein sequence-structure relationship and in estimating the protein structural profiles from amino acid sequences.
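The quantities named in this abstract can be sketched in a few lines. The example below computes a residue-wise contact order from residue coordinates and the two evaluation measures the abstract reports (Pearson CC and RMSE). The distance cutoff, the minimum sequence separation and the normalization by chain length are assumptions chosen for illustration; the abstract does not state the exact definition used by the authors.

```python
import math


def rwco(coords, cutoff=12.0, min_sep=3):
    """Residue-wise contact order for a list of (x, y, z) residue coordinates.

    Assumed definition: for residue i, sum |i - j| over residues j that are
    within `cutoff` angstroms and at least `min_sep` positions away in
    sequence, then divide by the chain length.
    """
    n = len(coords)
    values = []
    for i in range(n):
        total = 0
        for j in range(n):
            sep = abs(i - j)
            if sep >= min_sep and math.dist(coords[i], coords[j]) <= cutoff:
                total += sep
        values.append(total / n)
    return values


def pearson_cc(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def rmse(x, y):
    """Root mean square error between predicted and observed values."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))


# Toy chain: four residues on a straight line, 5 angstroms apart.
toy = [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0), (10.0, 0.0, 0.0), (15.0, 0.0, 0.0)]
print(rwco(toy, cutoff=16.0))
```

In a prediction setting, `pearson_cc` and `rmse` would be applied to the predicted and observed RWCO profiles of held-out proteins; the regression model itself is not reproduced here.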
Unusual polyphosphate inclusions observed in a marine Beggiatoa strain.
Brock, Jörg; Rhiel, Erhard; Beutler, Martin; Salman, Verena; Schulz-Vogt, Heide N
2012-02-01
Sulfide-oxidizing bacteria of the genus Beggiatoa are known to accumulate phosphate intracellularly as polyphosphate but little is known about the structure and properties of these inclusions. Application of different staining techniques revealed the presence of unusually large polyphosphate inclusions in the marine Beggiatoa strain 35Flor. The inclusions showed a co-occurrence of polyphosphate, calcium and magnesium when analyzed by scanning electron microscopy and energy dispersive X-ray analysis. Similar to polyphosphate-enriched acidocalcisomes of prokaryotes and eukaryotes, the polyphosphate inclusions in Beggiatoa strain 35Flor are enclosed by a lipid layer and store cations. However, they are not notably acidic. 16S rRNA gene sequence-based phylogenetic reconstruction showed an affiliation of Beggiatoa strain 35Flor to a monophyletic branch, comprising other narrow vacuolated and non-vacuolated Beggiatoa species. The polyphosphate inclusions represent a new type of membrane surrounded storage compartment within the genus Beggiatoa, distinct from the mostly nitrate-storing vacuoles known from other marine sulfide-oxidizing bacteria of the family Beggiatoaceae.
Mechanical design of translocating motor proteins.
Hwang, Wonmuk; Lang, Matthew J
2009-01-01
Translocating motors generate force and move along a biofilament track to achieve diverse functions including gene transcription, translation, intracellular cargo transport, protein degradation, and muscle contraction. Advances in single molecule manipulation experiments, structural biology, and computational analysis are making it possible to consider common mechanical design principles of these diverse families of motors. Here, we propose a mechanical parts list that includes track, energy conversion machinery, and moving parts. Energy is supplied not just by burning of a fuel molecule, but there are other sources or sinks of free energy, by binding and release of a fuel or products, or similarly between the motor and the track. Dynamic conformational changes of the motor domain can be regarded as controlling the flow of free energy to and from the surrounding heat reservoir. Multiple motor domains are organized in distinct ways to achieve motility under imposed physical constraints. Transcending amino acid sequence and structure, physically and functionally similar mechanical parts may have evolved as nature's design strategy for these molecular engines.
Mechanical Design of Translocating Motor Proteins
Lang, Matthew J.
2013-01-01
Translocating motors generate force and move along a biofilament track to achieve diverse functions including gene transcription, translation, intracellular cargo transport, protein degradation, and muscle contraction. Advances in single molecule manipulation experiments, structural biology, and computational analysis are making it possible to consider common mechanical design principles of these diverse families of motors. Here, we propose a mechanical parts list that includes track, energy conversion machinery, and moving parts. Energy is supplied not just by burning of a fuel molecule, but there are other sources or sinks of free energy, by binding and release of a fuel or products, or similarly between the motor and the track. Dynamic conformational changes of the motor domain can be regarded as controlling the flow of free energy to and from the surrounding heat reservoir. Multiple motor domains are organized in distinct ways to achieve motility under imposed physical constraints. Transcending amino acid sequence and structure, physically and functionally similar mechanical parts may have evolved as nature’s design strategy for these molecular engines. PMID:19452133
Plant Seeds as Model Vectors for the Transfer of Life Through Space
NASA Astrophysics Data System (ADS)
Tepfer, David; Leach, Sydney
2006-12-01
We consider plant seeds as terrestrial models for a vectored life form that could protect biological information in space. Seeds consist of maternal tissue surrounding and protecting an embryo. Some seeds resist deleterious conditions found in space: ultra low vacuum, extreme temperatures and radiation, including intense UV light. In a receptive environment, seeds could liberate a viable embryo, viable higher cells or a viable free-living organism (an endosymbiont or endophyte). Even if viability is lost, seeds still contain functional macromolecules and small molecules (DNA, RNA, proteins, amino acids, lipids, etc.) that could provide the chemical basis for starting or modifying life. The possible release of endophytes or endosymbionts from a seed-like space traveler suggests that multiple domains of life, defined in DNA sequence phylogenies, could be disseminated simultaneously from Earth. We consider the possibility of exospermia, the outward transfer of life, as well as introspermia, the inward transfer of life, both as contemporary and as ancient events.
The complete nucleotide sequence of RNA 3 of a peach isolate of Prunus necrotic ringspot virus.
Hammond, R W; Crosslin, J M
1995-04-01
The complete nucleotide sequence of RNA 3 of the PE-5 peach isolate of Prunus necrotic ringspot ilarvirus (PNRSV) was obtained from cloned cDNA. The RNA sequence is 1941 nucleotides and contains two open reading frames (ORFs). ORF 1 consisted of 284 amino acids with a calculated molecular weight of 31,729 Da and ORF 2 contained 224 amino acids with a calculated molecular weight of 25,018 Da. ORF 2 corresponds to the coat protein gene. Expression of ORF 2 engineered into a pTrcHis vector in Escherichia coli results in a fusion polypeptide of approximately 28 kDa which cross-reacts with PNRSV polyclonal antiserum. Analysis of the coat protein amino acid sequence reveals a putative "zinc-finger" domain at the amino-terminal portion of the protein. Two tetranucleotide AUGC motifs occur in the 3'-UTR of the RNA and may function in coat protein binding and genome activation. ORF 1 homologies to other ilarviruses and alfalfa mosaic virus are confined to limited regions of conserved amino acids. The translated amino acid sequence of the coat protein gene shows 92% similarity to one isolate of apple mosaic virus, a closely related member of the ilarvirus group of plant viruses, but only 66% similarity to the amino acid sequence of the coat protein gene of a second isolate. These relationships are also reflected at the nucleotide sequence level. These results in one instance confirm the close similarities observed at the biophysical and serological levels between these two viruses, but on the other hand call into question the nomenclature used to describe these viruses.
Replica amplification of nucleic acid arrays
Church, George M.; Mitra, Robi D.
2010-08-31
Disclosed are improved methods of making and using immobilized arrays of nucleic acids, particularly methods for producing replicas of such arrays. Included are methods for producing high density arrays of nucleic acids and replicas of such arrays, as well as methods for preserving the resolution of arrays through rounds of replication. Also included are methods which take advantage of the availability of replicas of arrays for increased sensitivity in detection of sequences on arrays. Improved methods of sequencing nucleic acids immobilized on arrays utilizing single copies of arrays and methods taking further advantage of the availability of replicas of arrays are disclosed. The improvements lead to higher fidelity and longer read lengths of sequences immobilized on arrays. Methods are also disclosed which improve the efficiency of multiplex PCR using arrays of immobilized nucleic acids.
Camicia, Federico; Paredes, Rodolfo; Chalar, Cora; Galanti, Norbel; Kamenetzky, Laura; Gutierrez, Ariana; Rosenzvit, Mara C
2008-03-31
We have sequenced and partially characterized an Echinococcus granulosus cDNA, termed egat1, from a protoscolex signal sequence trap (SST) cDNA library. The isolated 1627 bp long cDNA contains an ORF of 489 amino acids and shows an amino acid identity of 30% with neutral and excitatory amino acid transporter members of the Dicarboxylate/Amino Acid Na+ and/or H+ Cation Symporter family (DAACS) (TC 2.A.23). Additional bioinformatics analysis of EgAT1 confirmed the results obtained by similarity searches and showed the presence of 9 to 10 transmembrane domains, consensus sequences for N-glycosylation between the third and fourth transmembrane domains, a hydropathy profile highly similar to that of ASCT1 (a known member of the DAACS family), a high score with SDF (Sodium Dicarboxylate Family) and motifs similar to EDTRANSPORT, a fingerprint of excitatory amino acid transporters. The localization of the putative amino acid transporter was analyzed by in situ hybridization and immunofluorescence in protoscoleces and associated germinal layer. The in situ hybridization labelling indicates the distribution of egat1 mRNA throughout the tegument. EgAT1 protein, which showed a molecular mass of approximately 60 kD in Western blots, is localized in the subtegumental region of the metacestode, particularly around suckers and rostellum of protoscoleces and layers from brood capsules. The sequence and expression analyses of EgAT1 pave the way for functional analysis of amino acid transporters of E. granulosus and their evaluation as new drug targets against cystic echinococcosis.
Xu, Li; Ding, Zhi-Shan; Zhou, Yun-Kai; Tao, Xue-Fen
2009-06-01
To obtain the full-length cDNA sequence of the Secoisolariciresinol Dehydrogenase gene from Dysosma versipellis by RACE PCR, and then to investigate the characteristics of the gene. The full-length cDNA sequence of the Secoisolariciresinol Dehydrogenase gene was obtained from Dysosma versipellis by 3'-RACE and 5'-RACE. This is the first report of the full cDNA sequence of Secoisolariciresinol Dehydrogenase in Dysosma versipellis. The acquired gene was 991 bp in full length, including a 5' untranslated region of 42 bp and a 3' untranslated region of 112 bp with poly(A). The open reading frame (ORF) encoded 278 amino acids with a molecular weight of 29253.3 Da and an isoelectric point of 6.328. The GenBank accession number of the nucleotide sequence was EU573789. Semi-quantitative RT-PCR analysis revealed that the Secoisolariciresinol Dehydrogenase gene was highly expressed in the stem. Alignment of the amino acid sequence of Secoisolariciresinol Dehydrogenase indicated that there may be some significant amino acid sequence differences among species. The full-length cDNA sequence of the Secoisolariciresinol Dehydrogenase gene from Dysosma versipellis was obtained.
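The lengths quoted in this abstract can be cross-checked with simple arithmetic. The sketch below assumes that the stop codon is counted with the coding region rather than with the 3' untranslated region; under that assumption the three parts add up to the reported full length. The variable names are illustrative only.

```python
# Figures taken from the abstract above.
utr5_bp = 42            # 5' untranslated region
utr3_bp = 112           # 3' untranslated region
orf_residues = 278      # amino acids encoded by the ORF
full_length_bp = 991    # reported full-length cDNA
mol_weight_da = 29253.3

# Assumption: the coding region is 278 codons plus one stop codon.
coding_bp = (orf_residues + 1) * 3

total_bp = utr5_bp + coding_bp + utr3_bp
mean_residue_mass = mol_weight_da / orf_residues

print(coding_bp, total_bp, round(mean_residue_mass, 1))
```

The mean residue mass of about 105 Da is in the range commonly seen for proteins, so the reported molecular weight and residue count appear mutually consistent.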
Anderson, Carl W.; Connelly, Margery A.
2004-10-12
The present invention provides a method for detecting DNA-activated protein kinase (DNA-PK) activity in a biological sample. The method includes contacting a biological sample with a detectably-labeled phosphate donor and a synthetic peptide substrate defined by the following features to provide specific recognition and phosphorylation by DNA-PK: (1) a phosphate-accepting amino acid pair which may include serine-glutamine (Ser-Gln) (SQ), threonine-glutamine (Thr-Gln) (TQ), glutamine-serine (Gln-Ser) (QS), or glutamine-threonine (Gln-Thr) (QT); (2) enhancer amino acids which may include glutamic acid or glutamine immediately adjacent at the amino- or carboxyl- side of the amino acid pair and forming an amino acid pair-enhancer unit; (3) a first spacer sequence at the amino terminus of the amino acid pair-enhancer unit; (4) a second spacer sequence at the carboxyl terminus of the amino acid pair-enhancer unit, which spacer sequences may include any combination of amino acids that does not provide a phosphorylation site consensus sequence motif; and, (5) a tag moiety, which may be an amino acid sequence or another chemical entity that permits separating the synthetic peptide from the phosphate donor. A composition and a kit for the detection of DNA-PK activity are also provided. Methods for detecting DNA, protein phosphatases and substances that alter the activity of DNA-PK are also provided. The present invention also provides a method of monitoring protein kinase and DNA-PK activity in living cells. A composition and a kit for monitoring protein kinase activity in vitro and a composition and a kit for monitoring DNA-PK activities in living cells are also provided. A method for identifying agents that alter protein kinase activity in vitro and a method for identifying agents that alter DNA-PK activity in living cells are also provided.
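Features (1) and (2) of the peptide substrate lend themselves to a simple sequence scan. The sketch below looks for a phosphate-accepting pair (SQ, TQ, QS or QT) with a glutamic acid or glutamine immediately on its amino- or carboxyl-side. The function name and the example peptides are hypothetical and serve only to show the pattern; spacer and tag requirements (features 3 to 5) are not checked.

```python
# Phosphate-accepting pairs and enhancer residues as listed in the abstract.
PAIRS = ("SQ", "TQ", "QS", "QT")
ENHANCERS = "EQ"  # glutamic acid (E) or glutamine (Q)


def pair_enhancer_units(peptide):
    """Return (start_index, unit) for every pair-enhancer unit in `peptide`.

    A unit is an accepting pair with an enhancer residue immediately before
    it or immediately after it. Indices are 0-based. Overlapping hits are
    all reported, so the same unit can appear more than once.
    """
    hits = []
    for i in range(len(peptide) - 1):
        if peptide[i:i + 2] not in PAIRS:
            continue
        if i > 0 and peptide[i - 1] in ENHANCERS:
            hits.append((i - 1, peptide[i - 1:i + 2]))
        if i + 2 < len(peptide) and peptide[i + 2] in ENHANCERS:
            hits.append((i, peptide[i:i + 3]))
    return hits


# Hypothetical peptide, not a sequence from the patent.
print(pair_enhancer_units("PESQEAF"))
```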
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhang, Tao; Datta, Supratim; Simmons, Blake A.
The present invention provides for an isolated or recombinant polypeptide comprising an amino acid sequence having at least 70% identity with the amino acid sequence of a Halorhabdus utahensis cellulase, such as Hu-CBH1, wherein said amino acid sequence has a halophilic thermostable and/or thermophilic cellobiohydrolase (CBH) activity. In some embodiments, the polypeptide has a CBH activity that is resistant to up to about 20% of ionic liquids. The present invention also provides for compositions comprising and methods using the isolated or recombinant polypeptide.
Cloning of the poly(ADP-ribose) Gene from Rat Liver.
1986-09-24
Levinson, Ph.D. (Cetus Corp., Berkeley). 5. Amino acid analysis done in UCSF Bioanal. Lab. TABLE OF CONTENTS Page METHOD I...TABLE I ............. ............................... ... 12 Proteolytic degradation, isolation of peptide and amino acid sequences...technique developed for enzyme quantitation in biological materials. The amino acid sequence of the enzyme has so far been determined because the amino
Thompson, David N; Thompson, Vicki S; Schaller, Kastli D; Apel, William A; Reed, David W; Lacey, Jeffrey A
2013-04-30
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius and variations thereof are provided. Further provided are methods of at least partially degrading xylotriose, xylobiose, and/or arabinofuranose-substituted xylan using isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius and variations thereof.
[Comparative genomics and evolutionary analysis of CRISPR loci in acetic acid bacteria].
Xia, Kai; Liang, Xin-le; Li, Yu-dong
2015-12-01
The clustered regularly interspaced short palindromic repeat (CRISPR) system is a widespread adaptive immune system, present in most archaea and many bacteria, that defends against foreign DNA such as phages, viruses and plasmids. In general, a CRISPR system consists of direct repeat, leader, spacer and CRISPR-associated sequences. Acetic acid bacteria (AAB) play an important role in industrial fermentation of vinegar and bioelectrochemistry. To investigate the polymorphism and evolution pattern of CRISPR loci in acetic acid bacteria, bioinformatic analyses were performed on 48 species from three main genera (Acetobacter, Gluconacetobacter and Gluconobacter) with whole genome sequences available from the NCBI database. The results showed that the CRISPR system existed in 32 of the 48 strains studied. Most of the CRISPR-Cas systems in AAB belonged to the type I CRISPR-Cas system (subtypes E and C), whereas the type II CRISPR-Cas system, which contains the cas9 gene, was found only in the genera Acetobacter and Gluconacetobacter. The repeat sequences of some CRISPR loci were highly conserved among species from different genera, and the leader sequences of some CRISPR loci possessed conserved motifs, which were associated with regulated promoters. Moreover, phylogenetic analysis of cas1 demonstrated that it was suitable for classification of species. The conservation of cas1 genes was associated with that of repeat sequences among different strains, suggesting they were subjected to similar functional constraints. Moreover, the number of spacers was positively correlated with the number of prophages and insertion sequences, indicating that the acetic acid bacteria were continually invaded by new foreign DNA. The comparative analysis of CRISPR loci in acetic acid bacteria provided the basis for investigating the molecular mechanism of different acetic acid tolerance and genome stability in acetic acid bacteria.
Li, Yushuang; Yang, Jiasheng; Zhang, Yi
2016-01-01
In this paper, we have proposed a novel alignment-free method for comparing the similarity of protein sequences. We first encode a protein sequence into a 440 dimensional feature vector consisting of a 400 dimensional Pseudo-Markov transition probability vector among the 20 amino acids, a 20 dimensional content ratio vector, and a 20 dimensional position ratio vector of the amino acids in the sequence. By evaluating the Euclidean distances among the representing vectors, we compare the similarity of protein sequences. We then apply this method to the ND5 dataset consisting of the ND5 protein sequences of 9 species, and to the F10 and G11 datasets representing two of the xylanase-containing glycoside hydrolase families, i.e., families 10 and 11. As a result, our method achieves a correlation coefficient of 0.962 with the canonical protein sequence aligner ClustalW in the ND5 dataset, much higher than those of five other popular alignment-free methods. In addition, we successfully separate the xylanase sequences in the F10 family and the G11 family and illustrate that the F10 family is more heat stable than the G11 family, consistent with a few previous studies. Moreover, we prove mathematically an identity equation involving the Pseudo-Markov transition probability vector and the amino acid content ratio vector. PMID:27918587
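The 440-dimensional encoding (400 + 20 + 20) can be sketched as follows. The abstract does not define the Pseudo-Markov transition probabilities or the position ratio, so this example substitutes plainly labeled stand-ins: ordinary first-order transition frequencies between adjacent residues, and each residue type's share of the summed sequence positions. It shows the shape of the approach, not the authors' formulas.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def feature_vector(seq):
    """Encode a protein sequence as 400 + 20 + 20 = 440 numbers.

    Assumed stand-ins (not the paper's definitions):
      - transition block: P(next = b | current = a) from adjacent-pair counts
      - content ratio: count of each residue type divided by sequence length
      - position ratio: sum of 1-based positions of each residue type divided
        by the sum of all positions
    """
    n = len(seq)
    if n == 0:
        raise ValueError("sequence must not be empty")
    counts = [0] * 20
    pos_sums = [0] * 20
    pair_counts = [[0] * 20 for _ in range(20)]
    for k, aa in enumerate(seq):
        i = INDEX[aa]
        counts[i] += 1
        pos_sums[i] += k + 1
        if k + 1 < n:
            pair_counts[i][INDEX[seq[k + 1]]] += 1
    transitions = []
    for row in pair_counts:
        total = sum(row)
        transitions.extend(c / total if total else 0.0 for c in row)
    content = [c / n for c in counts]
    total_pos = n * (n + 1) / 2
    position = [p / total_pos for p in pos_sums]
    return transitions + content + position


def distance(seq_a, seq_b):
    """Euclidean distance between the feature vectors of two sequences."""
    return math.dist(feature_vector(seq_a), feature_vector(seq_b))


# Toy sequences, chosen only to exercise the functions.
print(len(feature_vector("ACDA")), distance("ACDA", "ACDC"))
```

Pairwise distances computed this way could be fed to any standard clustering or tree-building routine; that step is left out here.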
A Generative Angular Model of Protein Structure Evolution
Golden, Michael; García-Portugués, Eduardo; Sørensen, Michael; Mardia, Kanti V.; Hamelryck, Thomas; Hein, Jotun
2017-01-01
Recently described stochastic models of protein evolution have demonstrated that the inclusion of structural information in addition to amino acid sequences leads to a more reliable estimation of evolutionary parameters. We present a generative, evolutionary model of protein structure and sequence that is valid on a local length scale. The model concerns the local dependencies between sequence and structure evolution in a pair of homologous proteins. The evolutionary trajectory between the two structures in the protein pair is treated as a random walk in dihedral angle space, which is modeled using a novel angular diffusion process on the two-dimensional torus. Coupling sequence and structure evolution in our model allows for modeling both “smooth” conformational changes and “catastrophic” conformational jumps, conditioned on the amino acid changes. The model has interpretable parameters and is comparatively more realistic than previous stochastic models, providing new insights into the relationship between sequence and structure evolution. For example, using the trained model we were able to identify an apparent sequence–structure evolutionary motif present in a large number of homologous protein pairs. The generative nature of our model enables us to evaluate its validity and its ability to simulate aspects of protein evolution conditioned on an amino acid sequence, a related amino acid sequence, a related structure or any combination thereof. PMID:28453724
Regulation of Nutrient Transport in Quiescent, Lactating, and Neoplastic Mammary Epithelia
1998-10-01
collected and solubilized with 1.25% dodecyl maltoside in the presence of 6-aminocaproic acid. After a 30-minute 13000 rpm centrifugation at 4°C, the... acids. Hydropathy plots based on amino acid sequences predicted from cDNA sequence suggest that all share a common topology, which includes... acid intracellular loop midway through the transporter. There is a striking degree of homology among these isoforms, which are 50-65% identical in
1988-01-01
The primary amino acid sequence of contactin, a neuronal cell surface glycoprotein of 130 kD that is isolated in association with components of the cytoskeleton (Ranscht, B., D. J. Moss, and C. Thomas. 1984. J. Cell Biol. 99:1803-1813), was deduced from the nucleotide sequence of cDNA clones and is reported here. The cDNA sequence contains an open reading frame for a 1,071-amino acid transmembrane protein with 962 extracellular and 89 cytoplasmic amino acids. In its extracellular portion, the polypeptide features six type 1 and two type 2 repeats. The six amino-terminal type 1 repeats (I-VI) each consist of 81-99 amino acids and contain two cysteine residues that are in the right context to form globular domains as described for molecules with immunoglobulin structure. Within the proposed globular region, contactin shares 31% identical amino acids with the neural cell adhesion molecule NCAM. The two type 2 repeats (I-II) are each composed of 100 amino acids and lack cysteine residues. They are 20-31% identical to fibronectin type III repeats. Both the structural similarity of contactin to molecules of the immunoglobulin supergene family, in particular the amino acid sequence resemblance to NCAM, and its relationship to fibronectin indicate that contactin could be involved in some aspect of cellular adhesion. This suggestion is further strengthened by its localization in neuropil containing axon fascicles and synapses. PMID:3049624
A Novel Locomotion-based Validation Assay for Candidate Drugs Using Drosophila DYT1 Disease Model
2013-11-01
the genome using the same parental fly line, minimizing the effect of surrounding sequences and genetic variations on the ...locomotion and GTP cyclohydrolase protein levels; (3) supplementation of dopamine can partially rescue the locomotion defects of Drosophila larvae...’-GCGAACAACCAAAAAATCATTGAGATAATAAACTCCTCCATTAG-3’) to make dtorsin cDNA that lacks GAC (D307) (Fig. 1) respectively. After confirming mutated sequences, the insert was again
Sequencing intractable DNA to close microbial genomes.
Hurt, Richard A; Brown, Steven D; Podar, Mircea; Palumbo, Anthony V; Elias, Dwayne A
2012-01-01
Advancement in high-throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult-to-sequence regions in microbial genomes are ruled "intractable", resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC-rich secondary structures, making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.
Mena-Ulecia, Karel; Gonzalez-Norambuena, Fabian; Vergara-Jaque, Ariela; Poblete, Horacio; Tiznado, William; Caballero, Julio
2018-06-15
Protein kinases (PKs) discriminate between closely related sequences that contain serine, threonine, and/or tyrosine residues. Such specificity is defined by the amino acid sequence surrounding the phosphorylatable residue, so that it is possible to identify an optimal recognition motif (ORM) for each PK. The ORM for the protein kinase A (PKA), a well-known member of the PK family, is the sequence RRX(S/T)X, where arginines at the -3 and -2 positions play a key role with respect to the primed phosphorylation site. In this work, differential affinities of PKA for the peptide substrate Kemptide (LRRASLG) and mutants that substitute the arginine residues by the unnatural amino acid homoarginine were evaluated through molecular dynamics (MD) and free energy perturbation (FEP) calculations. The FEP study for the homoarginine mutants required previous elaboration of a CHARMM "arginine to homoarginine" (R2B) hybrid topology file which is available in this manuscript as Supporting Information. Mutants substituting the arginine residues by alanine, lysine, and histidine were also considered in the comparison by using the same protocol. FEP calculations allowed estimating the free energy changes from the free PKA to PKA-substrate complex (ΔΔG_E→ES) when Kemptide structure was mutated. Both ΔΔG_E→ES values for homoarginine mutants were predicted with a difference below 1 kcal/mol. In addition, FEP correctly predicted that all the studied mutations decrease the catalytic efficiency of Kemptide for PKA. © 2018 Wiley Periodicals, Inc.
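The recognition motif RRX(S/T)X quoted above translates directly into a regular expression. The sketch below scans a peptide for the motif and reports the position of the phosphorylatable residue; X is taken to mean any residue, and the function name is hypothetical. A lookahead is used so that overlapping matches are not skipped.

```python
import re

# RRX(S/T)X with X = any residue; the lookahead allows overlapping matches.
PKA_MOTIF = re.compile(r"(?=(RR.[ST].))")


def pka_sites(peptide):
    """Return (index, residue) for each S/T sitting in an RRX(S/T)X motif.

    `index` is the 0-based position of the phosphorylatable residue, which
    lies three positions after the first arginine of the motif.
    """
    sites = []
    for match in PKA_MOTIF.finditer(peptide):
        pos = match.start() + 3
        sites.append((pos, peptide[pos]))
    return sites


print(pka_sites("LRRASLG"))   # Kemptide
print(pka_sites("LKKASLG"))   # both arginines replaced by lysine
```

A pattern match of this kind only flags candidate sites; it says nothing about the binding free energies that the abstract estimates with FEP.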
Vanhoutteghem, Amandine; Maciejewski-Duval, Anna; Bouche, Cyril; Delhomme, Brigitte; Hervé, Françoise; Daubigney, Fabrice; Soubigou, Guillaume; Araki, Masatake; Araki, Kimi; Yamamura, Ken-ichi; Djian, Philippe
2009-01-01
Basonuclin 2 is a recently discovered zinc finger protein of unknown function. Its paralog, basonuclin 1, is associated with the ability of keratinocytes to multiply. The basonuclin zinc fingers are closely related to those of the Drosophila proteins disco and discorelated, but the relation between disco proteins and basonuclins has remained elusive because the function of the disco proteins in larval head development seems to have no relation to that of basonuclin 1 and because the amino acid sequence of disco, apart from the zinc fingers, also has no similarity to that of the basonuclins. We have generated mice lacking basonuclin 2. These mice die within 24 h of birth with a cleft palate and abnormalities of craniofacial bones and tongue. In the embryonic head, expression of the basonuclin 2 gene is restricted to mesenchymal cells in the palate, at the periphery of the tongue, and in the mesenchymal sheaths that surround the brain and the osteocartilagineous structures. In late embryos, the rate of multiplication of these mesenchymal cells is greatly diminished. Therefore, basonuclin 2 is essential for the multiplication of craniofacial mesenchymal cells during embryogenesis. Non-Drosophila insect databases available since 2008 reveal that the basonuclins and the disco proteins share much more extensive sequence and gene structure similarity than noted when only Drosophila sequences were examined. We conclude that basonuclin 2 is both structurally and functionally the vertebrate ortholog of the disco proteins. We also note the possibility that some human craniofacial abnormalities are due to a lack of basonuclin 2. PMID:19706529
Protein location prediction using atomic composition and global features of the amino acid sequence
DOE Office of Scientific and Technical Information (OSTI.GOV)
Cherian, Betsy Sheena, E-mail: betsy.skb@gmail.com; Nair, Achuthsankar S.
2010-01-22
The subcellular location of a protein is useful information for determining its function, screening for drug candidates, vaccine design, annotation of gene products, and selecting relevant proteins for further studies. Computational prediction of subcellular localization deals with predicting the location of a protein from its amino acid sequence. For a computational localization prediction method to be more accurate, it should exploit all possible relevant biological features that contribute to the subcellular localization. In this work, we extracted the biological features from the full-length protein sequence to incorporate more biological information. A new biological feature, the distribution of atomic composition, is effectively used with multiple physicochemical properties, amino acid composition, three-part amino acid composition, and sequence similarity for predicting the subcellular location of the protein. Support Vector Machines are designed for four modules, and prediction is made by a weighted voting system. Our system makes predictions with an accuracy of 100, 82.47, and 88.81 for the self-consistency test, jackknife test, and independent data test, respectively. Our results provide evidence that prediction based on the biological features derived from the full-length amino acid sequence gives better accuracy than prediction based on features derived from the N-terminal alone. Considering the features as a distribution within the entire sequence brings out the underlying property distribution in greater detail and enhances the prediction accuracy.
Reiz, Bela; Li, Liang
2010-09-01
Controlled hydrolysis of proteins to generate peptide ladders combined with mass spectrometric analysis of the resultant peptides can be used for protein sequencing. In this paper, two methods of improving the microwave-assisted protein hydrolysis process are described to enable rapid sequencing of proteins containing disulfide bonds and increase sequence coverage, respectively. It was demonstrated that proteins containing disulfide bonds could be sequenced by MS analysis by first performing hydrolysis for less than 2 min, followed by 1 h of reduction to release the peptides originally linked by disulfide bonds. It was shown that a strong base could be used as a catalyst for microwave-assisted protein hydrolysis, producing complementary sequence information to that generated by microwave-assisted acid hydrolysis. However, using either acid or base hydrolysis, amide bond breakages in small regions of the polypeptide chains of the model proteins (e.g., cytochrome c and lysozyme) were not detected. Dynamic light scattering measurement of the proteins solubilized in an acid or base indicated that protein-protein interaction or aggregation was not the cause of the failure to hydrolyze certain amide bonds. It was speculated that there were some unknown local structures that might play a role in preventing an acid or base from reacting with the peptide bonds therein. 2010 American Society for Mass Spectrometry. Published by Elsevier Inc. All rights reserved.
NASA Technical Reports Server (NTRS)
Funderburgh, J. L.; Funderburgh, M. L.; Brown, S. J.; Vergnes, J. P.; Hassell, J. R.; Mann, M. M.; Conrad, G. W.; Spooner, B. S. (Principal Investigator)
1993-01-01
Amino acid sequence from tryptic peptides of three different bovine corneal keratan sulfate proteoglycan (KSPG) core proteins (designated 37A, 37B, and 25) showed similarities to the sequence of a chicken KSPG core protein lumican. Bovine lumican cDNA was isolated from a bovine corneal expression library by screening with chicken lumican cDNA. The bovine cDNA codes for a 342-amino acid protein, M(r) 38,712, containing amino acid sequences identified in the 37B KSPG core protein. The bovine lumican is 68% identical to chicken lumican, with an 83% identity excluding the N-terminal 40 amino acids. Location of 6 cysteine and 4 consensus N-glycosylation sites in the bovine sequence were identical to those in chicken lumican. Bovine lumican had about 50% identity to bovine fibromodulin and 20% identity to bovine decorin and biglycan. About two-thirds of the lumican protein consists of a series of 10 amino acid leucine-rich repeats that occur in regions of calculated high beta-hydrophobic moment, suggesting that the leucine-rich repeats contribute to beta-sheet formation in these proteins. Sequences obtained from 37A and 25 core proteins were absent in bovine lumican, thus predicting a unique primary structure and separate mRNA for each of the three bovine KSPG core proteins.
Isolated nucleic acids encoding antipathogenic polypeptides and uses thereof
Altier, Daniel J.; Crane, Virginia C.; Ellanskaya, Irina; Ellanskaya, Natalia; Gilliam, Jacob T.; Hunter-Cevera, Jennie; Presnail, James K.; Schepers, Eric J.; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser
2010-04-20
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from fungal fermentation broths. Nucleic acids that encode the antipathogenic polypeptides are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention are also disclosed.
Laboratory procedures to generate viral metagenomes.
Thurber, Rebecca V; Haynes, Matthew; Breitbart, Mya; Wegley, Linda; Rohwer, Forest
2009-01-01
This collection of laboratory protocols describes the steps to collect viruses from various samples with the specific aim of generating viral metagenome sequence libraries (viromes). Viral metagenomics, the study of uncultured viral nucleic acid sequences from different biomes, relies on several concentration, purification, extraction, sequencing and heuristic bioinformatic methods. No single technique can provide an all-inclusive approach, and therefore the protocols presented here will be discussed in terms of hypothetical projects. However, care must be taken to individualize each step depending on the source and type of viral-particles. This protocol is a description of the processes we have successfully used to: (i) concentrate viral particles from various types of samples, (ii) eliminate contaminating cells and free nucleic acids and (iii) extract, amplify and purify viral nucleic acids. Overall, a sample can be processed to isolate viral nucleic acids suitable for high-throughput sequencing in approximately 1 week.
Nakajima, T; Walkup, R D; Tochigi, A; Shearer, T R; Azuma, M
2007-11-01
Lacritin is a mitogen of human salivary gland cells as well as a stimulator of human corneal epithelial cells. It is expected to be an important factor in maintaining the surrounding ocular surface. The monkey would be a relevant animal model in which to study the role of lacritin in ophthalmic physiology and pathology. However, to our knowledge, no cDNA cloning or functional analysis of monkey lacritin has been performed. Thus, the purposes of this study were: (1) to clone the monkey ortholog of lacritin; (2) to characterize lacritin in tears from several species; and (3) to determine the tissues where lacritin is produced and secreted. cDNA for lacritin from rhesus macaque contained 547 bp, with 411 bp in an open reading frame (ORF) encoding a protein of 137 amino acids. Monkey lacritin showed 89% amino acid homology with human lacritin; one amino acid was deleted in all three monkey strains. The predicted MW of mature lacritin was 12.2 kDa, and the isoelectric point was 4.99. Lacritin showed anomalous migration at approximately 21.0 kDa on SDS-PAGE, as confirmed by immunoblotting and amino acid sequencing. Similar to native lacritin in monkey tears, a 21 kDa band was also detected in human tears. In contrast, no lacritin was observed at a similar position on SDS-PAGE in rat, rabbit and dog tears. In the monkey, lacritin mRNA was expressed highly in the lacrimal gland, moderately in the conjunctiva and the meibomian gland, and weakly in corneal epithelium. In primates, lacritin was produced in the lacrimal gland and secreted into tear fluid. These results suggest that lacritin might be important for the maintenance of the ocular surface in higher animals, such as monkeys and humans.
Kimura, M; Kimura, J; Hatakeyama, T
1988-11-21
The complete amino acid sequences of ribosomal proteins S11 from the Gram-positive eubacterium Bacillus stearothermophilus and of S19 from the archaebacterium Halobacterium marismortui have been determined. A search for homologous sequences of these proteins revealed that they belong to the ribosomal protein S11 family. Homologous proteins have previously been sequenced from Escherichia coli as well as from chloroplast, yeast and mammalian ribosomes. A pairwise comparison of the amino acid sequences showed that Bacillus protein S11 shares 68% identical residues with S11 from Escherichia coli and a slightly lower homology (52%) with the homologous chloroplast protein. The halophilic protein S19 is more related to the eukaryotic (45-49%) than to the eubacterial counterparts (35%).
Goonesekere, Nalin Cw
2009-01-01
The large numbers of protein sequences generated by whole genome sequencing projects require rapid and accurate methods of annotation. The detection of homology through computational sequence analysis is a powerful tool in determining the complex evolutionary and functional relationships that exist between proteins. Homology search algorithms employ amino acid substitution matrices to detect similarity between protein sequences. The substitution matrices in common use today are constructed using sequences aligned without reference to protein structure. Here we present amino acid substitution matrices constructed from the alignment of a large number of protein domain structures from the structural classification of proteins (SCOP) database. We show that when incorporated into the homology search algorithms BLAST and PSI-BLAST, the structure-based substitution matrices enhance the efficacy of detecting remote homologs.
Sequencing proteins with transverse ionic transport in nanochannels.
Boynton, Paul; Di Ventra, Massimiliano
2016-05-03
De novo protein sequencing is essential for understanding cellular processes that govern the function of living organisms and all sequence modifications that occur after a protein has been constructed from its corresponding DNA code. By obtaining the order of the amino acids that compose a given protein one can then determine both its secondary and tertiary structures through structure prediction, which is used to create models for protein aggregation diseases such as Alzheimer's Disease. Here, we propose a new technique for de novo protein sequencing that involves translocating a polypeptide through a synthetic nanochannel and measuring the ionic current of each amino acid through an intersecting perpendicular nanochannel. We find that the distribution of ionic currents for each of the 20 proteinogenic amino acids encoded by eukaryotic genes is statistically distinct, showing this technique's potential for de novo protein sequencing.
NASA Technical Reports Server (NTRS)
Gatlin, L. L.
1974-01-01
Concepts of information theory are applied to examine various proteins in terms of their redundancy in natural originators such as animals and plants. The Monte Carlo method is used to derive information parameters for random protein sequences. Real protein sequence parameters are compared with the standard parameters of protein sequences having a specific length. The tendency of a chain to contain some amino acids more frequently than others and the tendency of a chain to contain certain amino acid pairs more frequently than other pairs are used as randomness measures of individual protein sequences. Non-periodic proteins are generally found to have random Shannon redundancies except in cases of constraints due to short chain length and genetic codes. Redundant characteristics of highly periodic proteins are discussed. A degree of periodicity parameter is derived.
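The single-residue part of the analysis described in this abstract can be sketched in a few lines. This is a minimal illustration assuming the common definition of Shannon redundancy, R = 1 - H / log2(20), and a uniform random baseline generated by Monte Carlo sampling; the function names, the number of trials, and the seed are assumptions, and the pair-frequency measure and the periodicity parameter mentioned in the abstract are not reproduced here.

```python
import math
import random
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def shannon_entropy(seq):
    """Entropy in bits per residue of the residue composition of seq."""
    n = len(seq)
    return -sum((c / n) * math.log2(c / n) for c in Counter(seq).values())


def redundancy(seq, alphabet_size=20):
    """Assumed definition: 1 - H / H_max, where H_max = log2(alphabet_size)."""
    return 1.0 - shannon_entropy(seq) / math.log2(alphabet_size)


def random_baseline(length, trials=200, seed=0):
    """Mean redundancy of uniformly random sequences of the given length."""
    rng = random.Random(seed)
    values = [
        redundancy("".join(rng.choice(AMINO_ACIDS) for _ in range(length)))
        for _ in range(trials)
    ]
    return sum(values) / len(values)


if __name__ == "__main__":
    peptide = "LRRASLG"  # arbitrary short example sequence
    print(redundancy(peptide), random_baseline(len(peptide)))
```

Comparing a real sequence against the baseline for the same length matters because short random chains show nonzero redundancy purely from sampling, which is consistent with the abstract's remark about constraints due to short chain length.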
The amino acid sequence of Staphylococcus aureus penicillinase.
Ambler, R P
1975-01-01
The amino acid sequence of the penicillinase (penicillin amido-beta-lactamhydrolase, EC 3.5.2.6) from Staphylococcus aureus strain PC1 was determined. The protein consists of a single polypeptide chain of 257 residues, and the sequence was determined by characterization of tryptic, chymotryptic, peptic and CNBr peptides, with some additional evidence from thermolysin and S. aureus proteinase peptides. A mistake in the preliminary report of the sequence is corrected; residues 113-116 are now thought to be -Lys-Lys-Val-Lys- rather than -Lys-Val-Lys-Lys-. Detailed evidence for the amino acid sequence has been deposited as Supplementary Publication SUP 50056 (91 pages) at the British Library (Lending Division), Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1975) 145, 5. PMID:1218078
Martínez-Castilla, León P.; Rodríguez-Sotres, Rogelio
2010-01-01
Background Despite the remarkable progress of bioinformatics, how the primary structure of a protein leads to a three-dimensional fold, and in turn determines its function remains an elusive question. Alignments of sequences with known function can be used to identify proteins with the same or similar function with high success. However, identification of function-related and structure-related amino acid positions is only possible after a detailed study of every protein. Folding pattern diversity seems to be much narrower than sequence diversity, and the amino acid sequences of natural proteins have evolved under a selective pressure comprising structural and functional requirements acting in parallel. Principal Findings The approach described in this work begins by generating a large number of amino acid sequences using ROSETTA [Dantas G et al. (2003) J Mol Biol 332:449–460], a program with notable robustness in the assignment of amino acids to a known three-dimensional structure. The resulting sequence-sets showed no conservation of amino acids at active sites, or protein-protein interfaces. Hidden Markov models built from the resulting sequence sets were used to search sequence databases. Surprisingly, the models retrieved from the database sequences belonged to proteins with the same or a very similar function. Given an appropriate cutoff, the rate of false positives was zero. According to our results, this protocol, here referred to as Rd.HMM, detects fine structural details on the folding patterns, that seem to be tightly linked to the fitness of a structural framework for a specific biological function. Conclusion Because the sequence of the native protein used to create the Rd.HMM model was always amongst the top hits, the procedure is a reliable tool to score, very accurately, the quality and appropriateness of computer-modeled 3D-structures, without the need for spectroscopy data. 
However, Rd.HMM is very sensitive to the conformational features of the models' backbone. PMID:20830209
Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.
Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L
1987-01-01
To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded. PMID:2823109
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chang, Soo-Ik; Hammes, G.G.
1989-11-01
Homology analyses of the protein sequences of chicken liver and rat mammary gland fatty acid synthases were carried out. The amino acid sequences of the chicken and rat enzymes are 67% identical. If conservative substitutions are allowed, 78% of the amino acids are matched. A region of low homology exists between the functional domains, in particular around amino acid residues 1059-1264 of the chicken enzyme. Homologies between the active sites of chicken and rat and of chicken and yeast enzymes have been analyzed by an alignment method. A high degree of homology exists between the active sites of the chicken and rat enzymes. However, the chicken and yeast enzymes show a lower degree of homology. The NADPH-binding dinucleotide folds of the β-ketoacyl reductase and the enoyl reductase sites were identified by comparison with a known consensus sequence for the NADP- and FAD-binding dinucleotide folds. The active sites of all of the enzymes are primarily in hydrophobic regions of the protein. This study suggests that the genes for the functional domains of fatty acid synthase were originally separated, and these genes were connected to each other by using different connecting nucleotide sequences in different species. An alternative explanation for the differences in rat and chicken is a common ancestry and mutations in the joining regions during evolution.
A single alteration 20 nt 5′ to an editing target inhibits chloroplast RNA editing in vivo
Reed, Martha L.; Peeters, Nemo M.; Hanson, Maureen R.
2001-01-01
Transcripts of typical dicot plant plastid genes undergo C→U RNA editing at approximately 30 locations, but there is no consensus sequence surrounding the C targets of editing. The cis-acting elements required for editing of the C located at tobacco rpoB editing site II were investigated by introducing translatable chimeric minigenes containing sequence –20 to +6 surrounding the C target of editing. When the –20 to +6 sequence specified by the homologous region present in the black pine chloroplast genome was incorporated, virtually no editing of the transcripts occurred in transgenic tobacco plastids. Nucleotides that differ between the black pine and tobacco sequence were tested for their role in C→U editing by designing chimeric genes containing one or more of these divergent nucleotides. Surprisingly, the divergent nucleotide that had the strongest negative effect on editing of the minigene transcript was located –20 nt 5′ to the C target of editing. Expression of transgene transcripts carrying the 27 nt sequence did not affect the editing extent of the endogenous rpoB transcripts, even though the chimeric transcripts were much more abundant than those of the endogenous gene. In plants carrying a 93 nt rpoB editing site sequence, transgene transcripts accumulated to a level three times greater than transgene transcripts in the plants carrying the 27 nt rpoB editing sites and resulted in editing of the endogenous transcripts from 100 to 50%. Both a lower affinity of the 27 nt site for a trans-acting factor and lower abundance of the transcript could explain why expression of minigene transcripts containing the 27 nt sequence did not affect endogenous editing. PMID:11266552
The TGA codons are present in the open reading frame of selenoprotein P cDNA
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hill, K.E.; Lloyd, R.S.; Read, R.
1991-03-11
The TGA codon in DNA has been shown to direct incorporation of selenocysteine into protein. Several proteins from bacteria and animals contain selenocysteine in their primary structures. Each of the cDNA clones of these selenoproteins contains one TGA codon in the open reading frame which corresponds to the selenocysteine in the protein. A cDNA clone for selenoprotein P (SeP), obtained from a γZAP rat liver library, was sequenced by the dideoxy termination method. The correct reading frame was determined by comparison of the deduced amino acid sequence with the amino acid sequence of several peptides from SeP. Using SeP labelled with 75Se in vivo, the selenocysteine content of the peptides was verified by the collection of carboxymethylated 75Se-selenocysteine as it eluted from the amino acid analyzer and determination of the radioactivity contained in the collected samples. Ten TGA codons are present in the open reading frame of the cDNA. Peptide fragmentation studies and the deduced sequence indicate that selenium-rich regions are located close to the carboxy terminus. Nine of the 10 selenocysteines are located in the terminal 26% of the sequence with four in the terminal 15 amino acids. The deduced sequence codes for a protein of 385 amino acids. Cleavage of the signal peptide gives the mature protein with 366 amino acids and a calculated mol wt of 41,052 Da. Searches of PIR and SWISSPROT protein databases revealed no similarity with glutathione peroxidase or other selenoproteins.
A reduced amino acid alphabet for understanding and designing protein adaptation to mutation.
Etchebest, C; Benros, C; Bornot, A; Camproux, A-C; de Brevern, A G
2007-11-01
The protein sequence world is considerably larger than the structure world. In consequence, numerous non-related sequences may adopt similar 3D folds, and different kinds of amino acids may thus be found in similar 3D structures. By grouping the 20 amino acids into a smaller number of representative residues with similar features, the sequence world can be simplified. This clustering hence defines a reduced amino acid alphabet (reduced AAA). Numerous works have shown that protein 3D structures are composed of a limited number of building blocks, defining a structural alphabet. We previously identified such an alphabet composed of 16 representative structural motifs (five residues in length) called Protein Blocks (PBs). This alphabet makes it possible to translate the structure (3D) into a sequence of PBs (1D). Based on these two concepts, reduced AAA and PBs, we analyzed the distributions of the different kinds of amino acids and their equivalences in the structural context. Different reduced sets were considered. Recurrent amino acid associations were found in all the local structures, while others were specific to some local structures (PBs) (e.g., cysteine, histidine, threonine, and serine for the alpha-helix Ncap). Some similar associations are found in other reduced AAAs, e.g., Ile with Val, or the hydrophobic aromatic residues Trp with Phe and Tyr. We highlight interesting alternative associations. This underlines the dependence on the information considered (sequence or structure). This approach, equivalent to a substitution matrix, could be useful for designing protein sequences with different features (for instance, adaptation to environment) while mainly preserving the 3D fold.
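The idea of a reduced amino acid alphabet can be made concrete with a small mapping. The grouping below is purely illustrative and is not the alphabet derived in the paper: only the Ile/Val and Trp/Phe/Tyr associations are mentioned in the abstract, and every other group, along with the names `GROUPS` and `reduce_sequence`, is an assumption chosen for the sketch.

```python
# Hypothetical 9-letter reduced alphabet: representative letter -> members.
# Only the I/V and F/W/Y associations are taken from the abstract; the
# remaining groups are illustrative assumptions.
GROUPS = {
    "I": "IVLM",
    "F": "FWY",
    "A": "AG",
    "S": "ST",
    "C": "C",
    "P": "P",
    "D": "DE",
    "K": "KRH",
    "N": "NQ",
}

# Invert the grouping into a per-residue lookup table.
TO_REDUCED = {aa: rep for rep, members in GROUPS.items() for aa in members}


def reduce_sequence(seq):
    """Rewrite a protein sequence in the reduced alphabet.

    Residues outside the 20 standard amino acids are mapped to 'X'.
    """
    return "".join(TO_REDUCED.get(aa, "X") for aa in seq.upper())


print(reduce_sequence("LRRASLG"))
```

Two sequences that differ only by substitutions within a group collapse to the same reduced string, which is the sense in which such an alphabet behaves like a coarse substitution matrix.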
The cDNA sequence of a neutral horseradish peroxidase.
Bartonek-Roxå, E; Eriksson, H; Mattiasson, B
1991-02-16
A cDNA clone encoding a horseradish (Armoracia rusticana) peroxidase has been isolated and characterized. The cDNA contains 1378 nucleotides excluding the poly(A) tail, and the deduced protein contains 327 amino acids, including a 28 amino acid leader sequence. The predicted amino acid sequence is nine amino acids shorter than the major isoenzyme belonging to the horseradish peroxidase C group (HRP-C), and the sequence shows 53.7% identity with this isoenzyme. The described clone encodes nine cysteines, of which eight correspond well with the cysteines found in HRP-C. Five potential N-glycosylation sites with the general sequence Asn-X-Thr/Ser are present in the deduced sequence. This is three fewer glycosylation sites than in the previously described HRP-C. The shorter sequence and fewer N-glycosylation sites give the native isoenzyme a molecular weight several thousand lower than that of the horseradish peroxidase C isoenzymes. Comparison with the net charge value of HRP-C indicates that the described cDNA clone encodes a peroxidase which has either the same or a slightly less basic pI value, depending on whether the encoded protein is N-terminally blocked or not. This excludes the possibility that HRP-n could belong to either the HRP-A, -D or -E groups. The low sequence identity (53.7%) with HRP-C indicates that the described clone does not belong to the HRP-C isoenzyme group, and comparison of the total amino acid composition with the HRP-B group does not place the described clone within this isoenzyme group. Our conclusion is that the described cDNA clone encodes a neutral horseradish peroxidase which belongs to a new, not previously described, horseradish peroxidase group.
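Potential N-glycosylation sites of the form Asn-X-Thr/Ser, as counted in this abstract, can be located with a simple scan. This is a minimal sketch under stated assumptions: the abstract gives only the general pattern, so excluding proline at the X position (a commonly applied refinement) is exposed as an optional flag, and the test sequence is a made-up toy string, not the peroxidase sequence.

```python
import re


def find_sequons(protein, exclude_proline=True):
    """Return 0-based positions of Asn residues in an N-X-(S/T) context.

    exclude_proline is an assumption layered on top of the general pattern
    quoted in the abstract; set it to False to match any residue at X.
    """
    middle = "[^P]" if exclude_proline else "."
    # Lookahead so that overlapping sequons (e.g. N-N-T-S) are all reported.
    pattern = re.compile(rf"(?=N{middle}[ST])")
    return [m.start() for m in pattern.finditer(protein.upper())]


toy = "MNATQNPSANGS"  # hypothetical sequence for illustration only
print(find_sequons(toy))
print(find_sequons(toy, exclude_proline=False))
```

A match indicates only a potential site; whether a given Asn is actually glycosylated cannot be decided from the sequence pattern alone.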
Fresco, Jacques R.; Johnson, Marion D.
2002-01-01
Disclosed are methods for detecting in situ the presence of a target sequence in a substantially double-stranded nucleic acid segment, which comprises: a) contacting in situ under conditions suitable for hybridization a substantially double-stranded nucleic acid segment with a detectable third strand, said third strand being capable of hybridizing to at least a portion of the target sequence to form a triple-stranded structure, if said target sequence is present; and b) detecting whether hybridization between the third strand and the target sequence has occurred.
NASA Astrophysics Data System (ADS)
Humpula, James F.; Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Stafford, Thomas W.; Smith, James J.; Voorhies, Michael R.; George Corner, R.; Andrews, Phillip C.
2007-12-01
Ancient DNA sequences offer an extraordinary opportunity to unravel the evolutionary history of ancient organisms. Protein sequences offer another reservoir of genetic information that has recently become tractable through the application of mass spectrometric techniques. The extent to which ancient protein sequences resolve phylogenetic relationships, however, has not been explored. We determined the osteocalcin amino acid sequence from the bone of an extinct Camelid (21 ka, Camelops hesternus) excavated from Isleta Cave, New Mexico and three bones of extant camelids: bactrian camel ( Camelus bactrianus); dromedary camel ( Camelus dromedarius) and guanaco ( Llama guanacoe) for a diagenetic and phylogenetic assessment. There was no difference in sequence among the four taxa. Structural attributes observed in both modern and ancient osteocalcin include a post-translation modification, Hyp 9, deamidation of Gln 35 and Gln 39, and oxidation of Met 36. Carbamylation of the N-terminus in ancient osteocalcin may result in blockage and explain previous difficulties in sequencing ancient proteins via Edman degradation. A phylogenetic analysis using osteocalcin sequences of 25 vertebrate taxa was conducted to explore osteocalcin protein evolution and the utility of osteocalcin sequences for delineating phylogenetic relationships. The maximum likelihood tree closely reflected generally recognized taxonomic relationships. For example, maximum likelihood analysis recovered rodents, birds and, within hominins, the Homo-Pan-Gorilla trichotomy. Within Artiodactyla, character state analysis showed that a substitution of Pro 4 for His 4 defines the Capra-Ovis clade within Artiodactyla. Homoplasy in our analysis indicated that osteocalcin evolution is not a perfect indicator of species evolution. Limited sequence availability prevented assigning functional significance to sequence changes. 
Our preliminary analysis of osteocalcin evolution represents an initial step towards a complete character analysis aimed at determining the evolutionary history of this functionally significant protein. We emphasize that ancient protein sequencing and phylogenetic analyses using amino acid sequences must pay close attention to post-translational modifications, amino acid substitutions due to diagenetic alteration and the impacts of isobaric amino acids on mass shifts and sequence alignments.
Kato, Shiro
2017-01-01
ABSTRACT This announcement reports the complete genome sequence of strain LK-145 of Lactobacillus sakei isolated from a Japanese sake cellar as a potent strain for the production of large amounts of d-amino acids. Three putative genes encoding an amino acid racemase were identified. PMID:28818888
Hegeman, Carla E.; Grabau, Elizabeth A.
2001-01-01
Phytic acid (myo-inositol hexakisphosphate) is the major storage form of phosphorus in plant seeds. During germination, stored reserves are used as a source of nutrients by the plant seedling. Phytic acid is degraded by the activity of phytases to yield inositol and free phosphate. Due to the lack of phytases in the non-ruminant digestive tract, monogastric animals cannot utilize dietary phytic acid and it is excreted into manure. High phytic acid content in manure results in elevated phosphorus levels in soil and water and accompanying environmental concerns. The use of phytases to degrade seed phytic acid has potential for reducing the negative environmental impact of livestock production. A phytase was purified to electrophoretic homogeneity from cotyledons of germinated soybeans (Glycine max L. Merr.). Peptide sequence data generated from the purified enzyme facilitated the cloning of the phytase sequence (GmPhy) employing a polymerase chain reaction strategy. The introduction of GmPhy into soybean tissue culture resulted in increased phytase activity in transformed cells, which confirmed the identity of the phytase gene. It is surprising that the soybean phytase was unrelated to previously characterized microbial or maize (Zea mays) phytases, which were classified as histidine acid phosphatases. The soybean phytase sequence exhibited a high degree of similarity to purple acid phosphatases, a class of metallophosphoesterases. PMID:11500558
Positive selection sites in tertiary structure of Leguminosae chalcone isomerase 1.
Wang, R K; Zhan, S F; Zhao, T J; Zhou, X L; Wang, C E
2015-03-20
Isoflavonoids and the related synthesis enzyme, chalcone isomerase 1 (CHI1), are unique in the Leguminosae, with diverse biological functions. Among the Leguminosae, the soybean is an important oil, protein crop, and model plant. In this study, we aimed to detect the generation pattern of Leguminosae CHI1. Genome-wide sequence analysis of CHI in 3 Leguminosae and 3 other closely related model plants was performed; the expression levels of soybean chalcone isomerases were also analyzed. By comparing positively selected sites and their protein structures, we retrieved the evolution patterns for Leguminosae CHI1. A total of 28 CHI and 7 FAP3 (CHI4) genes were identified and separated into 4 clades: CHI1, CHI2, CHI3, and FAP3. Soybean genes belonging to the same chalcone isomerase subfamily had similar expression patterns. CHI1, the unique chalcone isomerase subfamily in Leguminosae, showed signs of significant positive selection as well as special expression characteristics, indicating an accelerated evolution throughout its divergence. Eight sites were identified as undergoing positive selection with high confidence. When mapped onto the tertiary structure of CHI1, these 8 sites were observed surrounding the enzyme substrate only; some of them connected to the catalytic core of CHI. Thus, we inferred that the generation of Leguminosae CHI1 is dependent on the positively selected amino acids surrounding its catalytic substrate. In other words, the evolution of CHI1 was driven by specific selection or processing conditions within the substrate.
Fatima, Tahira; Snyder, Crystal L.; Schroeder, William R.; Cram, Dustin; Datla, Raju; Wishart, David; Weselake, Randall J.; Krishna, Priti
2012-01-01
Background Sea buckthorn (Hippophae rhamnoides L.) is a hardy, fruit-producing plant known historically for its medicinal and nutraceutical properties. The most recognized product of sea buckthorn is its fruit oil, composed of seed oil that is rich in essential fatty acids, linoleic (18∶2ω-6) and α-linolenic (18∶3ω-3) acids, and pulp oil that contains high levels of monounsaturated palmitoleic acid (16∶1ω-7). Sea buckthorn is fast gaining popularity as a source of functional food and nutraceuticals, but currently has few genomic resources; therefore, we explored the fatty acid composition of Canadian-grown cultivars (ssp. mongolica) and the sea buckthorn seed transcriptome using the 454 GS FLX sequencing technology. Results GC-MS profiling of fatty acids in seeds and pulp of berries indicated that the seed oil contained linoleic and α-linolenic acids at 33–36% and 30–36%, respectively, while the pulp oil contained palmitoleic acid at 32–42%. 454 sequencing of sea buckthorn cDNA collections from mature seeds yielded 500,392 sequence reads, which identified 89,141 putative unigenes represented by 37,482 contigs and 51,659 singletons. Functional annotation by Gene Ontology and computational prediction of metabolic pathways indicated that primary metabolism (protein>nucleic acid>carbohydrate>lipid) and fatty acid and lipid biosynthesis pathways were highly represented categories. Sea buckthorn sequences related to fatty acid biosynthesis genes in Arabidopsis were identified, and a subset of these was examined for transcript expression at four developing stages of the berry. Conclusion This study provides the first comprehensive genomic resources represented by expressed sequences for sea buckthorn, and demonstrates that the seed oil of Canadian-grown sea buckthorn cultivars contains high levels of linoleic acid and α-linolenic acid in a close to 1∶1 ratio, which is beneficial for human health. 
These data provide the foundation for further studies on sea buckthorn oil, the enzymes involved in its biosynthesis, and the genes involved in the general hardiness of sea buckthorn against environmental conditions. PMID:22558083
Corfield, M. C.; Fletcher, J. C.
1969-01-01
1. A chymotryptic digest of the protein fraction U.S.3. from oxidized wool was separated into 51 peptide fractions by chromatography on a column of cation-exchange resin. 2. The less acidic fractions were separated into their component peptides by a combination of cation-exchange-resin chromatography, paper chromatography and paper electrophoresis. 3. The amino acid sequences of 34 of these peptides were elucidated, and those of 14 others partially determined. 4. Overlaps between the tryptic and chymotryptic peptides from fraction U.S.3 have enabled ten extended amino acid sequences to be deduced, the longest containing 20 amino acid residues. 5. The relevance of the results to the structures of the helical and non-helical regions of wool is discussed. PMID:5395876
[Study on the genetic difference of SEO type Hantaviruses].
Zhang, X; Zhou, S; Wang, H; Hu, J; Guan, Z; Liu, H
2000-10-01
The aim was to understand the genetic types of the Hantaviruses carried by rodents in Beijing and the differences between them, and to further explore the source of infection. Hantavirus RNA was isolated from the lungs of rodents captured in Beijing that were positive for Hantavirus antigens by frozen sectioning and immunofluorescent assay, then reverse-transcribed and amplified by PCR with Hantavirus-specific primers. Five PCR products were obtained and sequenced, yielding 300 bp of sequence data from the M segment (nt 2003-2302, numbered according to the cDNA of the Seoul 8039 strain). Nucleotide sequence homology showed that they were sequences of SEO-type Hantavirus. Compared with SEO-type Hantavirus, the nucleotide sequence homology of these samples was more than 94%, while the amino acid sequence homology was more than 98%. When compared with HNT-type Hantavirus, the nucleotide sequence homology was less than 72% and the amino acid sequence homology less than 81%. As in other SEO-type Hantaviruses, their nucleotide sequences and deduced amino acid sequences were highly conserved. Phylogenetic tree analysis showed that the five viruses could be divided into at least 4 branches. It is quite likely that at least two subtypes of SEO virus, comprising 4 branches, were circulating in Beijing.
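The homology percentages quoted in the Hantavirus abstract above are pairwise comparisons over a 300 bp window. The sketch below shows one common way such a figure can be computed, as the share of matching positions between two already-aligned, equal-length sequences. The function name and the two 20-nt fragments are hypothetical and invented for illustration; the abstract does not state which alignment or scoring method the authors used.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions at which two aligned, equal-length sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


# Length of the sequenced M-segment window, nt 2003 to 2302 inclusive.
window_length = 2302 - 2003 + 1

# Hypothetical fragments (NOT real Hantavirus sequence), differing at one position.
reference = "ATGGCTAGCTAGGATCCTGA"
sample = "ATGGCTAGTTAGGATCCTGA"
identity = percent_identity(reference, sample)

print(window_length, identity)
```

This simple version treats every column equally and does not handle alignment gaps; a real comparison would first align the sequences with a dedicated tool.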
2012-01-01
Background Aquatic plants differ in their development from terrestrial plants in their morphology and physiology, but little is known about the molecular basis of the major phases of their life cycle. Interestingly, in place of seeds of terrestrial plants their dormant phase is represented by turions, which circumvents sexual reproduction. However, like seeds turions provide energy storage for starting the next growing season. Results To begin a characterization of the transition from the growth to the dormant phase we used abscisic acid (ABA), a plant hormone, to induce controlled turion formation in Spirodela polyrhiza and investigated their differentiation from fronds, representing their growth phase, into turions with respect to morphological, ultra-structural characteristics, and starch content. Turions were rich in anthocyanin pigmentation and had a density that submerged them to the bottom of liquid medium. Transmission electron microscopy (TEM) of turions showed in comparison to fronds shrunken vacuoles, smaller intercellular space, and abundant starch granules surrounded by thylakoid membranes. Turions accumulated more than 60% starch in dry mass after two weeks of ABA treatment. To further understand the mechanism of the developmental switch from fronds to turions, we cloned and sequenced the genes of three large-subunit ADP-glucose pyrophosphorylases (APLs). All three putative protein and exon sequences were conserved, but the corresponding genomic sequences were extremely variable mainly due to the invasion of miniature inverted-repeat transposable elements (MITEs) into introns. A molecular three-dimensional model of the SpAPLs was consistent with their regulatory mechanism in the interaction with the substrate (ATP) and allosteric activator (3-PGA) to permit conformational changes of its structure. Gene expression analysis revealed that each gene was associated with distinct temporal expression during turion formation. 
APL2 and APL3 were highly expressed in earlier stages of turion development, while APL1 expression was reduced throughout turion development. Conclusions These results suggest that the differential expression of APLs could be used to enhance energy flow from photosynthesis to storage of carbon in aquatic plants, making duckweeds a useful alternative biofuel feedstock. PMID:22235974
NASA Astrophysics Data System (ADS)
Sethaphong, Latsavongsakda
This work examines smart material properties of rational self-assembly and molecular recognition found in nano-biosystems. Exploiting the sequence and structural information encoded within nucleic acids and proteins will permit programmed synthesis of nanomaterials and help create molecular machines that may carry out new roles involving chemical catalysis and bioenergy. Responsive to different ionic environments through self-reorganization, nucleic acids (NAs) are nature's signature smart material; organisms such as viruses and bacteria use features of NAs to react to their environment and orchestrate their lifecycle. Furthermore, nucleic acid systems (both RNA and DNA) are currently exploited as scaffolds; recent applications have been showcased to build bioelectronics and biotemplated nanostructures via directed assembly of multidimensional nanoelectronic devices (1). Since the most stable and rudimentary structure of nucleic acids is the helical duplex, these were modeled in order to examine the influence of the microenvironment, sequence, and cation-dependent perturbations of their canonical forms. Due to their negatively charged phosphate backbone, NAs rely on counterions to overcome the inherent repulsive forces that arise from the assembly of two complementary strands. As a realistic model system, we chose the HIV-TAR helix (PDB ID: 397D) to study the effect of specific sequence motifs on cation sequestration. At physiologically relevant concentrations of sodium and potassium ions, we observed sequence-based effects where purine stretches were adept in retaining high-residency cations. The transitional space between adenine and guanosine nucleotides (ApG step) in a sequence proved the most favorable. This work was the first to directly show these subtle interactions of sequence-based cationic sequestration and may be useful for controlling metallization of nucleic acids in conductive nanowires. 
Extending the study further, we explored the degree to which the structure of NA duplexes alone interacted with cations distinct from a specific sequence. Under physiologically relevant conditions, a duplex of RNA polyguanine-polycytidine was highly responsive and able to sequester cations to the middle of the purine stretches. The least responsive structure was a DNA polyadenine-polythymine duplex. A random sequence DNA duplex contorted into an RNA-like helix resulted in cationic dynamics similar to RNA systems. These studies showed that cation diffusive binding events in nucleic acid duplex structures are sequence specific and heavily influenced by structural aspects of the helical forms, which account for much of the differences observed. Although structural information in nucleic acids is encoded within their sequence, linking amino acid sequence to protein structure is murkier; the structural information within proteins is encoded by the folding process itself: a complex phenomenon driven toward the equilibrium state of the active conformation. Upwards of two thirds of a protein's sequence can be substituted with similar amino acids without significantly perturbing its function; conserved residues of about 10% seem to be vital; since evolutionary selection pressure in proteins operates three-dimensionally, a linear sequence is only partially informative. We explored this problem by folding de novo the cytosolic portion of the membrane protein, cellulose synthase, CESA1 from upland cotton, Gossypium hirsutum (Ghcesa1). The cytoplasmic region was generated by homology modeling and refined with molecular dynamics. These mutations impair local structural flexibility which likely results in cellulose that is produced at a lower rate and is less crystalline. Additional modeling of fragments of cellulose synthases from the model plant, Arabidopsis thaliana, offered novel insights into the function of conserved cytosolic domains within plant cellulose synthases. 
Transport mechanisms related to the transmembrane region revealed significant differences between plants and a bacterial complex. These studies generated possible mutations that may allow for the creation of new synthases and identified other avenues of research in order to develop technologies that may alter the crystallinity and other useful properties of cellulose. 1. Karplus, K., SAM-T08, HMM-based protein structure prediction. Nucleic Acids Research, 2009. 37: p. W492-W497.
Neuropeptidomics of the Mosquito Aedes Aegypti
2010-01-01
translational processing ( pyroglutamate formation) was detected for AST-C and CAPA-PVK-2. For the first time in insects, we succeeded in the direct...hormones, trace DNA sequences generated by TIGR and the Broad Institute were first searched by TBLASTN24 using amino acid sequences of candidate peptides...previously described.1 TBLASTN searches, using the amino acid sequences of putative Ae. aegypti neuropeptide and peptide hormone orthologs identified in
Cloning and sequence analysis of Hemonchus contortus HC58cDNA.
Muleke, Charles I; Ruofeng, Yan; Lixin, Xu; Xinwen, Bo; Xiangrui, Li
2007-06-01
The complete coding sequence of Hemonchus contortus HC58cDNA was generated by rapid amplification of cDNA ends and polymerase chain reaction using primers based on the 5' and 3' ends of the parasite mRNA, accession no. AF305964. The HC58cDNA gene was 851 bp long, with an open reading frame of 717 bp encoding a precursor of 239 amino acids, an approximately 27 kDa protein. Analysis of the amino acid sequence revealed conserved cysteine, histidine, and asparagine residues, the occluding loop pattern, the hemoglobinase motif, and the glutamine of the oxyanion hole characteristic of cathepsin B-like proteases (CBL). Comparison of the predicted amino acid sequences showed that the protein shared 33.5-58.7% identity with cathepsin B homologues in the papain clan CA family (family C1). Phylogenetic analysis revealed close evolutionary proximity of the protein sequence to counterpart sequences in the CBL, suggesting that HC58cDNA was a member of the papain family.
Nucleic Acid Detection Methods
Smith, Cassandra L.; Yaar, Ron; Szafranski, Przemyslaw; Cantor, Charles R.
1998-05-19
The invention relates to methods for rapidly determining the sequence and/or length of a target sequence. The target sequence may be a series of known or unknown repeat sequences which are hybridized to an array of probes. The hybridized array is digested with a single-strand nuclease and free 3'-hydroxyl groups are extended with a nucleic acid polymerase. Nuclease-cleaved heteroduplexes can be easily distinguished from nuclease-uncleaved heteroduplexes by differential labeling. Probes and target can be differentially labeled with detectable labels. Matched target can be detected by cleaving resulting loops from the hybridized target and creating free 3'-hydroxyl groups. These groups are recognized and extended by polymerases added into the reaction system, which also adds or releases one label into solution. The resulting products can be analyzed using either solid phase or solution. These methods can be used to detect characteristic nucleic acid sequences, to determine target sequence and to screen for genetic defects and disorders. Assays can be conducted on solid surfaces, allowing for multiple reactions to be conducted in parallel and, if desired, automated.
Development and Translation of a Tissue-Engineered Disc in a Preclinical Rodent Model
2014-12-01
annulus fibrosus tissue into full 3D Disc-like Angle Ply Structures (DAPS), inclusive of a hyaluronic acid hydrogel seeded with adult stem cells, that...AF constructs surrounding an engineered nucleus pulposus (NP) composed of a hyaluronic acid (HA) hydrogel. Measure the disc structural mechanics in...exposure to TGF-ß3 improves the functional properties of MSC-seeded photocrosslinked hyaluronic acid hydrogels by authors Minwook Kim, Isaac E
Somerville, Chris; van de Loo, Frank
2000-01-01
The present invention relates to the identification of nucleic acid sequences and constructs, and methods related thereto, and the use of these sequences and constructs to produce genetically modified plants for the purpose of altering the composition of plant oils, waxes and related compounds.
Hydroxyapatite-binding peptides for bone growth and inhibition
Bertozzi, Carolyn R [Berkeley, CA; Song, Jie [Shrewsbury, MA; Lee, Seung-Wuk [Walnut Creek, CA
2011-09-20
Hydroxyapatite (HA)-binding peptides are selected using combinatorial phage library display. Pseudo-repetitive consensus amino acid sequences possessing periodic hydroxyl side chains in every two or three amino acid sequences are obtained. These sequences resemble the (Gly-Pro-Hyp)x repeat of human type I collagen, a major component of extracellular matrices of natural bone. A consistent presence of basic amino acid residues is also observed. The peptides are synthesized by the solid-phase synthetic method and then used for template-driven HA-mineralization. Microscopy reveals that the peptides template the growth of polycrystalline HA crystals ~40 nm in size.
Medzihradszky, K F; Gibson, B W; Kaur, S; Yu, Z H; Medzihradszky, D; Burlingame, A L; Bass, N M
1992-02-01
The primary structure of a fatty-acid-binding protein (FABP) isolated from the liver of the nurse shark (Ginglymostoma cirratum) was determined by high-performance tandem mass spectrometry (employing multichannel array detection) and Edman degradation. Shark liver FABP consists of 132 amino acids with an acetylated N-terminal valine. The chemical molecular mass of the intact protein determined by electrospray ionization mass spectrometry (Mr = 15124 +/- 2.5) was in good agreement with that calculated from the amino acid sequence (Mr = 15121.3). The amino acid sequence of shark liver FABP displays significantly greater similarity to the FABP expressed in mammalian heart, peripheral nerve myelin and adipose tissue (61-53% sequence similarity) than to the FABP expressed in mammalian liver (22% similarity). Phylogenetic trees derived from the comparison of the shark liver FABP amino acid sequence with the members of the mammalian fatty-acid/retinoid-binding protein gene family indicate the initial divergence of an ancestral gene into two major subfamilies: one comprising the genes for mammalian liver FABP and gastrotropin, the other comprising the genes for mammalian cellular retinol-binding proteins I and II, cellular retinoic-acid-binding protein, myelin P2 protein, adipocyte FABP, heart FABP and shark liver FABP, the latter having diverged from the ancestral gene that ultimately gave rise to the present day mammalian heart-FABP, adipocyte FABP and myelin P2 protein sequences. The sequence for intestinal FABP from the rat could be assigned to either subfamily, depending on the approach used for phylogenetic tree construction, but clearly diverged at a relatively early evolutionary time point. Indeed, sequences proximately ancestral or closely related to mammalian intestinal FABP, liver FABP, gastrotropin and the retinoid-binding group of proteins appear to have arisen prior to the divergence of shark liver FABP and should therefore also be present in elasmobranchs. 
The presence in shark liver of an FABP which differs substantially in primary structure from mammalian liver FABP, while being closely related to the FABP expressed in mammalian heart muscle, peripheral nerve myelin and adipocytes, opens a further dimension regarding the question of the existence of structure-dependent and tissue-specific specialization of FABP function in lipid metabolism.
The bglA Gene of Aspergillus kawachii Encodes Both Extracellular and Cell Wall-Bound β-Glucosidases
Iwashita, Kazuhiro; Nagahara, Tatsuya; Kimura, Hitoshi; Takano, Makoto; Shimoi, Hitoshi; Ito, Kiyoshi
1999-01-01
We cloned the genomic DNA and cDNA of bglA, which encodes β-glucosidase in Aspergillus kawachii, based on a partial amino acid sequence of purified cell wall-bound β-glucosidase CB-1. The nucleotide sequence of the cloned bglA gene revealed a 2,933-bp open reading frame with six introns that encodes an 860-amino-acid protein. Based on the deduced amino acid sequence, we concluded that the bglA gene encodes cell wall-bound β-glucosidase CB-1. The amino acid sequence exhibited high levels of homology with the amino acid sequences of fungal β-glucosidases classified in subfamily B. We expressed the bglA cDNA in Saccharomyces cerevisiae and detected the recombinant β-glucosidase in the periplasm fraction of the recombinant yeast. A. kawachii can produce two extracellular β-glucosidases (EX-1 and EX-2) in addition to the cell wall-bound β-glucosidase. A. kawachii in which the bglA gene was disrupted produced none of the three β-glucosidases, as determined by enzyme assays and a Western blot analysis. Thus, we concluded that the bglA gene encodes both extracellular and cell wall-bound β-glucosidases in A. kawachii. PMID:10584016
Demers, Jill E.; Gugino, Beth K.
2014-01-01
The diversity and genetic differentiation of populations of Fusarium oxysporum associated with tomato fields, both endophytes obtained from tomato plants and isolates obtained from soil surrounding the sampled plants, were investigated. A total of 609 isolates of F. oxysporum were obtained, 295 isolates from a total of 32 asymptomatic tomato plants in two fields and 314 isolates from eight soil cores sampled from the area surrounding the plants. Included in this total were 112 isolates from the stems of all 32 plants, a niche that has not been previously included in F. oxysporum population genetics studies. Isolates were characterized using the DNA sequence of the translation elongation factor 1α gene. A diverse population of 26 sequence types was found, although two sequence types represented nearly two-thirds of the isolates studied. The sequence types were placed in different phylogenetic clades within F. oxysporum, and endophytic isolates were not monophyletic. Multiple sequence types were found in all plants, with an average of 4.2 per plant. The population compositions differed between the two fields but not between soil samples within each field. A certain degree of differentiation was observed between populations associated with different tomato cultivars, suggesting that the host genotype may affect the composition of plant-associated F. oxysporum populations. No clear patterns of genetic differentiation were observed between endophyte populations and soil populations, suggesting a lack of specialization of endophytic isolates. PMID:25304514
Altier, Daniel J.; Dahlbacka, Glen; Ellanskaya, legal representative, Natalia; Herrmann, Rafael; Hunter-Cevera, Jennie; McCutchen, Billy F.; Presnail, James K.; Rice, Janet A.; Schepers, Eric; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser; Ellanskaya, deceased, Irina
2007-12-11
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
Altier, Daniel J.; Dahlbacka, Glen; Elleskaya, Irina; Ellanskaya, legal representative; Natalia; Herrmann, Rafael; Hunter-Cevera, Jennie; McCutchen, Billy F.; Presnail, James K.; Rice, Janet A.; Schepers, Eric; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser
2010-08-10
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
Altier, Daniel J [Waukee, IA; Dahlbacka, Glen [Oakland, CA; Elleskaya, Irina [Kyiv, UA; Ellanskaya, legal representative, Natalia; Herrmann, Rafael [Wilmington, DE; Hunter-Cevera, Jennie [Elliott City, MD; McCutchen, Billy F [College Station, IA; Presnail, James K [Avondale, PA; Rice, Janet A [Wilmington, DE; Schepers, Eric [Port Deposit, MD; Simmons, Carl R [Des Moines, IA; Torok, Tamas [Richmond, CA; Yalpani, Nasser [Johnston, IA
2011-04-12
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
Altier, Daniel J [Granger, IA; Dahlbacka, Glen [Oakland, CA; Ellanskaya, Irina [Kyiv, UA; Ellanskaya, legal representative, Natalia; Herrmann, Rafael [Wilmington, DE; Hunter-Cevera, Jennie [Elliott City, MD; McCutchen, Billy F [College Station, TX; Presnail, James K [Avondale, PA; Rice, Janet A [Wilmington, DE; Schepers, Eric [Port Deposit, MD; Simmons, Carl R [Des Moines, IA; Torok, Tamas [Richmond, CA; Yalpani, Nasser [Johnston, IA
2012-04-03
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
Comparative analysis of the prion protein gene sequences in African lion.
Wu, Chang-De; Pang, Wan-Yong; Zhao, De-Ming
2006-10-01
The prion protein gene of the African lion (Panthera leo) was cloned for the first time and screened for polymorphisms. The results suggest that the prion protein gene of eight African lions is highly homogeneous. The amino acid sequences of the prion protein (PrP) of all samples tested were identical. Four single nucleotide polymorphisms (C42T, C81A, C420T, T600C) were found in the prion protein gene (Prnp) of the African lion, but no amino acid substitutions. Sequence analysis showed that the higher homology was with Felis catus (GenBank accession AF003087; 96.7%) and with sheep (GenBank accession M31313.1; 96.2%). With respect to all the mammalian prion protein sequences compared, the African lion prion protein sequence has three amino acid substitutions. The homology might in turn affect the potential intermolecular interactions critical for cross-species transmission of prion disease.
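The lion Prnp abstract above reports four SNPs with no amino acid substitutions. One quick check on such a result is to see where each SNP falls within its codon, because many (though not all) third-position changes are synonymous. The sketch below assumes the SNP coordinates are 1-based and counted from the first base of the open reading frame; the abstract does not state its numbering convention, and the helper names are hypothetical.

```python
import re


def codon_position(nt_position: int) -> int:
    """Return 1, 2 or 3: the position within its codon of a 1-based ORF coordinate."""
    if nt_position < 1:
        raise ValueError("positions are 1-based")
    return (nt_position - 1) % 3 + 1


def parse_snp(label: str) -> tuple[str, int, str]:
    """Split a label such as 'C42T' into (reference base, position, variant base)."""
    match = re.fullmatch(r"([ACGT])(\d+)([ACGT])", label)
    if match is None:
        raise ValueError(f"unrecognized SNP label: {label}")
    ref, pos, alt = match.groups()
    return ref, int(pos), alt


# SNP labels as listed in the abstract; the ORF-based numbering is an assumption.
snps = ["C42T", "C81A", "C420T", "T600C"]
positions = {label: codon_position(parse_snp(label)[1]) for label in snps}

for label, pos in positions.items():
    print(f"{label}: codon position {pos}")
```

Under that numbering assumption all four coordinates are multiples of three and therefore land on the third base of a codon, which is consistent with, but does not by itself prove, the reported absence of amino acid changes.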
Schaeffer, E; Sninsky, J J
1984-01-01
Proteins that are related evolutionarily may have diverged at the level of primary amino acid sequence while maintaining similar secondary structures. Computer analysis has been used to compare the open reading frames of the hepatitis B virus to those of the woodchuck hepatitis virus at the level of amino acid sequence, and to predict the relative hydrophilic character and the secondary structure of putative polypeptides. Similarity is seen at the levels of relative hydrophilicity and secondary structure, in the absence of sequence homology. These data reinforce the proposal that these open reading frames encode viral proteins. Computer analysis of this type can be more generally used to establish structural similarities between proteins that do not share obvious sequence homology as well as to assess whether an open reading frame is fortuitous or codes for a protein. PMID:6585835
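The kind of hydrophilicity comparison described in the hepatitis virus abstract above is usually done by averaging a per-residue scale over a sliding window along each putative polypeptide. The sketch below uses the Kyte-Doolittle hydropathy scale, on which negative values indicate hydrophilic stretches. Both the choice of scale and the window length of 7 are assumptions made for illustration, since the abstract does not name the method used.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic, negative = hydrophilic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 7) -> list[float]:
    """Mean hydropathy of every full window along the sequence."""
    protein = protein.upper()
    if window < 1 or window > len(protein):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[aa] for aa in protein]  # KeyError on non-standard residues
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(protein) - window + 1)
    ]


# Hypothetical peptide, invented for illustration only.
profile = hydropathy_profile("MKTLLVAGIDEKRRSN", window=7)
print([round(value, 2) for value in profile])
```

Two proteins with little sequence identity can then be compared by plotting or correlating their profiles, which is the sense in which similarity can appear "in the absence of sequence homology".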
Rodríguez-Lázaro, David; D'Agostino, Martin; Pla, Maria; Cook, Nigel
2004-01-01
An important analytical control in molecular amplification-based methods is an internal amplification control (IAC), which should be included in each reaction mixture. An IAC is a nontarget nucleic acid sequence which is coamplified simultaneously with the target sequence. With negative results for the target nucleic acid, the absence of an IAC signal indicates that amplification has failed. A general strategy for the construction of an IAC for inclusion in molecular beacon-based real-time nucleic acid sequence-based amplification (NASBA) assays is presented. Construction proceeds in two phases. In the first phase, a double-stranded DNA molecule that contains nontarget sequences flanked by target sequences complementary to the NASBA primers is produced. At the 5′ end of this DNA molecule is a T7 RNA polymerase binding sequence. In the second phase of construction, RNA transcripts are produced from the DNA by T7 RNA polymerase. This RNA is the IAC; it is amplified by the target NASBA primers and is detected by a molecular beacon probe complementary to the internal nontarget sequences. As a practical example, an IAC for use in an assay for the detection of Mycobacterium avium subsp. paratuberculosis is described, its incorporation and optimization within the assay are detailed, and its application to spiked and natural clinical samples is shown to illustrate the correct interpretation of the diagnostic results. PMID:15583319
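The interpretation rule stated in the IAC abstract above can be written as a small decision function. The sketch below is a minimal illustration: the rule that a target-negative, IAC-negative reaction indicates failed amplification comes from the abstract, while treating any target signal as a positive result regardless of the IAC signal is an assumption, and the function name and result strings are hypothetical.

```python
def interpret_reaction(target_signal: bool, iac_signal: bool) -> str:
    """Classify one amplification reaction from its target and IAC signals."""
    if target_signal:
        # Assumption: a target signal is reported as positive whether or not
        # the IAC was also detected.
        return "positive"
    if iac_signal:
        # The IAC amplified, so the reaction worked and the target is absent.
        return "negative"
    # Neither signal: amplification failed, so no conclusion can be drawn.
    return "invalid: amplification failed"


for target, iac in [(True, True), (True, False), (False, True), (False, False)]:
    print(target, iac, "->", interpret_reaction(target, iac))
```

The value of the control lies in the last branch: without an IAC, a failed reaction would be indistinguishable from a true negative.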
Bahramnejad, Bahman
2014-01-01
P. atlantica subsp. Kurdica, with the local name of Baneh, is a wild medicinal plant which grows in Kurdistan, Iran. The identification of resistance gene analogs holds great promise for the development of resistant cultivars. A PCR approach with degenerate primers designed according to conserved NBS-LRR (nucleotide binding site-leucine rich repeat) regions of known disease-resistance (R) genes was used to amplify and clone homologous sequences from P. atlantica subsp. Kurdica. A DNA fragment of the expected 500-bp size was amplified. The nucleotide sequence of this amplicon was obtained through sequencing, and comparison of the predicted amino acid sequence with the amino acid sequences of known R-genes revealed significant sequence similarity. Alignment of the deduced amino acid sequence of the P. atlantica subsp. Kurdica resistance gene analog (RGA) showed strong identity, ranging from 68% to 77%, to the non-toll interleukin receptor (non-TIR) R-gene subfamily from other plants. A P-loop motif (GMMGGEGKTT), a conserved hydrophobic motif (GLPLAL), a kinase-2a motif (LLVLDDV, replaced by IAVFDDI in PAKRGA1), and a kinase-3a motif (FGPGSRIII) were present in all RGAs. A phylogenetic tree based on the deduced amino acid sequences of PAKRGA1 and RGAs from different species indicated that they separated into two clusters, with PAKRGA1 in cluster II. The isolated NBS analogs can eventually be used as guides to isolate numerous R-genes in pistachio. PMID:27843981
Aymerich, T; Holo, H; Håvarstein, L S; Hugas, M; Garriga, M; Nes, I F
1996-01-01
A new bacteriocin has been isolated from an Enterococcus faecium strain. The bacteriocin, termed enterocin A, was purified to homogeneity as judged by sodium dodecyl sulfate-polyacrylamide gel electrophoresis, N-terminal amino acid sequencing, and mass spectrometry analysis. By combining the data obtained from amino acid and DNA sequencing, the primary structure of enterocin A was determined. It consists of 47 amino acid residues, and the molecular weight was calculated to be 4,829, assuming that the four cysteine residues form intramolecular disulfide bridges. This molecular weight was confirmed by mass spectrometry analysis. The amino acid sequence of enterocin A shared significant homology with a group of bacteriocins (now termed pediocin-like bacteriocins) isolated from a variety of lactic acid-producing bacteria, which include members of the genera Lactobacillus, Pediococcus, Leuconostoc, and Carnobacterium. Sequencing of the structural gene of enterocin A, which is located on the bacterial chromosome, revealed an N-terminal leader sequence of 18 amino acid residues, which was removed during the maturation process. The enterocin A leader belongs to the double-glycine leaders which are found among most other small nonlantibiotic bacteriocins, some lantibiotics, and colicin V. Downstream of the enterocin A gene was located a second open reading frame, encoding a putative protein of 103 amino acid residues. This gene may encode the immunity factor of enterocin A, and it shares 40% identity with a similar open reading frame in the operon of leucocin AUL 187, another pediocin-like bacteriocin. PMID:8633865
Taravat, Elham; Zebarjadi, Alireza; Kahrizi, Danial; Yari, Kheirollah
2015-05-01
Among the essential amino acids, phenylalanine, tryptophan, and tyrosine are aromatic amino acids which are synthesized by the shikimate pathway in plants and bacteria. The herbicide glyphosate can inhibit the biosynthesis of these amino acids, so identification of genes that confer tolerance to glyphosate is very important. It has been shown that the common reed, Phragmites australis Cav. (Poaceae), is relatively tolerant to glyphosate. The aim of the current research was the identification, cloning, sequencing, and registration of a partial aro A gene of the common reed P. australis. The partial aro A gene of the common reed was cloned in Escherichia coli and its amino acid sequence was determined for the first time. This is the first report of the isolation, cloning, and sequencing of a part of the aro A gene from the common reed. A 670 bp fragment including two introns (86 bp and 289 bp) was obtained. The open reading frame (ORF) in this part of the gene encodes 98 amino acids. Alignment showed high similarity of this region with Zea mays (L.) (Poaceae) (94.6%), Eleusine indica L. Gaertn (Poaceae) (94.2%), and Zoysia japonica Steud. (Poaceae) (94.2%). The alignment of the amino acid sequence of the investigated part of the gene showed homology with aro A from several other plants. This conserved region forms the enzyme active site. The alignment of nucleotide and amino acid residues with related sequences showed that there are some differences among them. The relative glyphosate tolerance of the common reed may be related to these differences.
Comparative analysis of ribosomal protein L5 sequences from bacteria of the genus Thermus.
Jahn, O; Hartmann, R K; Boeckh, T; Erdmann, V A
1991-06-01
The genes for the ribosomal 5S rRNA binding protein L5 have been cloned from three extremely thermophilic eubacteria, Thermus flavus, Thermus thermophilus HB8 and Thermus aquaticus (Jahn et al, submitted). Genes for protein L5 from the three Thermus strains display 95% G/C in third positions of codons. Amino acid sequences deduced from the DNA sequence were shown to be identical for T flavus and T thermophilus, although the corresponding DNA sequences differed by two T to C transitions in the T thermophilus gene. Protein L5 sequences from T flavus and T thermophilus are 95% homologous to L5 from T aquaticus and 56.5% homologous to the corresponding E coli sequence. The lowest degrees of homology were found between the T flavus/T thermophilus L5 proteins and those of yeast L16 (27.5%), Halobacterium marismortui (34.0%) and Methanococcus vannielii (36.6%). From sequence comparison it becomes clear that thermostability of Thermus L5 proteins is achieved by an increase in hydrophobic interactions and/or by restriction of steric flexibility due to the introduction of amino acids with branched aliphatic side chains such as leucine. Alignment of the nine protein sequences equivalent to Thermus L5 proteins led to identification of a conserved internal segment, rich in acidic amino acids, which shows homology to subsequences of E coli L18 and L25. The occurrence of conserved sequence elements in 5S rRNA binding proteins and ribosomal proteins in general is discussed in terms of evolution and function.
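Two of the quantities quoted above, the G/C content at third codon positions and the percentage of identical residues between aligned proteins, are simple to compute. The sketch below is illustrative only: the function names and the short example sequences are hypothetical, and `percent_identity` assumes the two sequences are already aligned to equal length.

```python
def gc3_fraction(cds):
    """Fraction of codons whose third base is G or C."""
    cds = cds.upper()
    if len(cds) == 0 or len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a non-zero multiple of 3")
    third_bases = cds[2::3]  # every third base, starting at codon position 3
    return sum(base in "GC" for base in third_bases) / len(third_bases)


def percent_identity(aligned_a, aligned_b):
    """Percentage of aligned columns with the same residue (gaps never match)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


# Hypothetical toy inputs, not taken from the Thermus genes.
toy_cds = "ATGGCCCTGAAGGAGTAA"          # codons: ATG GCC CTG AAG GAG TAA
toy_gc3 = gc3_fraction(toy_cds)         # third bases G, C, G, G, G, A
toy_identity = percent_identity("MKLV", "MKIV")
```

A silent T-to-C transition at a third codon position, like those reported between the T. flavus and T. thermophilus genes, raises `gc3_fraction` without changing the encoded protein.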
Yarimizu, Tohru; Nakamura, Mikiko; Hoshida, Hisashi; Akada, Rinji
2015-02-14
Targeting of cellular proteins to the extracellular environment is directed by a secretory signal sequence located at the N-terminus of a secretory protein. These signal sequences usually contain an N-terminal basic amino acid followed by a stretch containing hydrophobic residues, although no consensus signal sequence has been identified. In this study, simple modeling of signal sequences was attempted using Gaussia princeps secretory luciferase (GLuc) in the yeast Kluyveromyces marxianus, which allowed comprehensive recombinant gene construction to substitute synthetic signal sequences. Mutational analysis of the GLuc signal sequence revealed that the GLuc hydrophobic peptide length was at the lower limit for effective secretion and that the N-terminal basic residue was indispensable. Deletion of the 16th Glu caused enhanced levels of secreted protein, suggesting that this hydrophilic residue defined the boundary of a hydrophobic peptide stretch. Consequently, we redesigned this domain as a repeat of a single hydrophobic amino acid between the N-terminal Lys and C-terminal Glu. Stretches consisting of Phe, Leu, Ile, or Met were effective for secretion but the number of residues affected secretory activity. A stretch containing sixteen consecutive methionine residues (M16) showed the highest activity; the M16 sequence was therefore utilized for the secretory production of human leukemia inhibitory factor protein in yeast, resulting in enhanced secreted protein yield. We present a new concept for the provision of secretory signal sequence ability in the yeast K. marxianus, determined by the number of repeats of a single hydrophobic residue located between N-terminal basic and C-terminal acidic amino acid boundaries.
Phenolic acid esterases, coding sequences and methods
Blum, David L.; Kataeva, Irina; Li, Xin-Liang; Ljungdahl, Lars G.
2002-01-01
Described herein are four phenolic acid esterases, three of which correspond to domains of previously unknown function within bacterial xylanases, from XynY and XynZ of Clostridium thermocellum and from a xylanase of Ruminococcus. The fourth specifically exemplified xylanase is a protein encoded within the genome of Orpinomyces PC-2. The amino acids of these polypeptides and nucleotide sequences encoding them are provided. Recombinant host cells, expression vectors and methods for the recombinant production of phenolic acid esterases are also provided.
Oroszlan, Stephen; Henderson, Louis E.; Stephenson, John R.; Copeland, Terry D.; Long, Cedric W.; Ihle, James N.; Gilden, Raymond V.
1978-01-01
The amino- and carboxyl-terminal amino acid sequences of proteins (p10, p12, p15, and p30) coded by the gag gene of Rauscher and AKR murine leukemia viruses were determined. Among these proteins, p15 from both viruses appears to have a blocked amino end. Proline was found to be the common NH2 terminus of both p30s and both p12s, and alanine of both p10s. The amino-terminal sequences of p30s are identical, as are those of p10s, while the p12 sequences are clearly distinctive but also show substantial homology. The carboxyl-terminal amino acids of both viral p30s and p12s are leucine and phenylalanine, respectively. Rauscher leukemia virus p15 has tyrosine as the carboxyl terminus while AKR virus p15 has phenylalanine in this position. The compositional and sequence data provide definite chemical criteria for the identification of analogous gag gene products and for the comparison of viral proteins isolated in different laboratories. On the basis of amino acid sequences and the previously proposed H-p15-p12-p30-p10-COOH peptide sequence in the precursor polyprotein, a model for cleavage sites involved in the post-translational processing of the precursor coded for by the gag gene is proposed. PMID:206897
Using Maximum Entropy to Find Patterns in Genomes
NASA Astrophysics Data System (ADS)
Liu, Sophia; Hockenberry, Adam; Lancichinetti, Andrea; Jewett, Michael; Amaral, Luis
The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. To accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. This approach can also easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs, which will lead to a better understanding of biological processes. National Institute of General Medical Science, Northwestern University Presidential Fellowship, National Science Foundation, David and Lucile Packard Foundation, Camille Dreyfus Teacher Scholar Award.
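One way to read the maximum-entropy idea is sketched below; it is an assumption-laden illustration, not the authors' tool. The assumptions are: the amino acid sequence is fixed, only the expected (not exact) GC fraction is constrained, and the standard genetic code applies. Under those assumptions the maximum-entropy distribution picks each synonymous codon with probability proportional to exp(λ · GC count), with a single multiplier λ tuned so that the expected GC fraction hits the target. All function names are hypothetical.

```python
import math
import random

# Standard genetic code, written in the usual TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}
SYNONYMS = {}
for _codon, _aa in CODON_TABLE.items():
    SYNONYMS.setdefault(_aa, []).append(_codon)


def gc_count(codon):
    return sum(base in "GC" for base in codon)


def codon_distribution(aa, lam):
    """Synonymous codons of aa with probabilities proportional to exp(lam * GC)."""
    codons = SYNONYMS[aa]
    weights = [math.exp(lam * gc_count(c)) for c in codons]
    total = sum(weights)
    return codons, [w / total for w in weights]


def expected_gc(protein, lam):
    """Expected GC fraction of a coding sequence sampled with multiplier lam."""
    gc = 0.0
    for aa in protein:
        codons, probs = codon_distribution(aa, lam)
        gc += sum(p * gc_count(c) for c, p in zip(codons, probs))
    return gc / (3 * len(protein))


def solve_lambda(protein, target_gc, lo=-50.0, hi=50.0, iterations=200):
    """Bisection on lam; expected GC is non-decreasing in lam."""
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if expected_gc(protein, mid) < target_gc:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def random_cds(protein, target_gc, rng):
    """Sample one coding sequence that encodes protein under the fitted lam."""
    lam = solve_lambda(protein, target_gc)
    picked = []
    for aa in protein:
        codons, probs = codon_distribution(aa, lam)
        picked.append(rng.choices(codons, weights=probs, k=1)[0])
    return "".join(picked)


def translate(cds):
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
```

For example, `random_cds("MKTLLVAGSRPE", 0.5, random.Random(0))` returns a 36-nucleotide sequence that translates back to the input peptide. The target must lie between the minimum and maximum GC fraction achievable for the given protein, otherwise the bisection simply saturates at one end of the search interval.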
Matsuoka, Masanari; Sugita, Masatake; Kikuchi, Takeshi
2014-09-18
Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem. It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation. The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.
Yang, Xiaoxia; Wang, Jia; Sun, Jun; Liu, Rong
2015-01-01
Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder) by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.
Crimean-Congo Hemorrhagic Fever
2004-01-01
aminocaproic acid were also indicated. Much emphasis was also placed on preventing reinfection, including the necessity of removing blood crusts from...The sequence is approximately 60% identical both at the nucleotide and amino acid levels to the L segment of Dugbe virus, the only other Nairovirus...However, more recent data based on nucleic acid sequence analysis have revealed extensive genetic diversity. The first published CCHFV sequence
DOE Office of Scientific and Technical Information (OSTI.GOV)
Akabayov, B.; Lee, S; Akabayov, S
2009-01-01
Synthesis of oligoribonucleotide primers for lagging-strand DNA synthesis in the DNA replication system of bacteriophage T7 is catalyzed by the primase domain of the gene 4 helicase-primase. The primase consists of a zinc-binding domain (ZBD) and an RNA polymerase domain (RPD). The ZBD is responsible for recognition of a specific sequence in the ssDNA template whereas catalytic activity resides in the RPD. The ZBD contains a zinc ion coordinated with four cysteine residues. We have examined the ligation state of the zinc ion by X-ray absorption spectroscopy and biochemical analysis of genetically altered primases. The ZBD of primase engaged in catalysis exhibits considerable asymmetry in coordination to zinc, as evidenced by a gradual increase in electron density of the zinc together with elongation of the zinc-sulfur bonds. Both wild-type primase and primase reconstituted from purified ZBD and RPD have a similar electronic change in the level of the zinc ion as well as the configuration of the ZBD. Single amino acid replacements in the ZBD (H33A and C36S) result in the loss of both zinc binding and its structural integrity. Thus the zinc in the ZBD may act as a charge modulation indicator for the surrounding sulfur atoms necessary for recognition of specific DNA sequences.
J chain in the nurse shark: implications for function in a lower vertebrate.
Hohman, Valerie S; Stewart, Sue E; Rumfelt, Lynn L; Greenberg, Andrew S; Avila, David W; Flajnik, Martin F; Steiner, Lisa A
2003-06-15
J chain is a small polypeptide covalently attached to polymeric IgA and IgM. In humans and mice, it plays a role in binding Ig to the polymeric Ig receptor for transport into secretions. The putative orthologue of mammalian J chain has been identified in the nurse shark by sequence analysis of cDNA and the polypeptide isolated from IgM. Conservation with J chains from other species is relatively poor, especially in the carboxyl-terminal portion, and, unlike other J chains, the shark protein is not acidic. The only highly conserved segment in all known J chains is a block of residues surrounding an N-linked glycosylation site. Of the eight half-cystine residues that are conserved in mammalian J chains, three are lacking in the nurse shark, including two in the carboxyl-terminal segment that have been reported to be required for binding of human J chain-containing IgA to secretory component. Taken together with these data, the relative abundance of J chain transcripts in the spleen and their absence in the spiral valve (intestine) suggest that J chain in nurse sharks may not have a role in Ig secretion. Analysis of J chain sequences in diverse species is in agreement with accepted phylogenetic relationships, with the exception of the earthworm, suggesting that the reported presence of J chain in invertebrates should be reassessed.
Koda, Toshiaki; Kijimoto-Ochiai, Shigeko; Uemura, Satoshi; Inokuchi, Jin-ichi
2009-10-02
Neu2 mRNA from the mouse thymus, as we have reported [K. Kotani, A. Kuroiwa, T. Saito, Y. Matsuda, T. Koda, S. Kijimoto-Ochiai, Cloning, chromosomal mapping, and characteristic 5'-UTR sequence of murine cytosolic sialidase, Biochem. Biophys. Res. Commun. 286 (2001) 250-258], has a novel sequence at the 5' terminus that can encode 6 more amino acids at the N-terminus than the muscle transcript. In this paper, we analyzed the cDNA and EST databases and found five types of alternative splicing of Neu2 mRNA: A, B, C, D and N. We studied the expression of these types in the immune tissues and found that the thymus expressed only type B. We constructed 2 types of plasmid that encode the long (B) or short (C) form of Neu2 protein, and transfected them into COS7 cells to study them under the same conditions. We found that 30-40% of the activity of both forms of Neu2 was located in the crude membrane fraction and hydrolyzed ganglioside effectively, while both soluble fractions showed particular behavior with respect to substrate specificity. Microscopic study by active staining with X-NANA showed that they were located not only in the cytoplasm but also in areas surrounding the nucleus and in the peripheral ruffled spot.
Schlesinger, D H; Hay, D I
1977-03-10
The complete amino acid sequence of human salivary statherin, a peptide which strongly inhibits precipitation from supersaturated calcium phosphate solutions, and therefore stabilizes supersaturated saliva, has been determined. The NH2-terminal half of this Mr=5380 (43 amino acids) polypeptide was determined by automated Edman degradations (liquid phase) on native statherin. The peptide was digested separately with trypsin, chymotrypsin, and Staphylococcus aureus protease, and the resulting peptides were purified by gel filtration. Manual Edman degradations on purified peptide fragments yielded peptides that completed the amino acid sequence through the penultimate COOH-terminal residue. These analyses, together with carboxypeptidase digestion of native statherin and of peptide fragments of statherin, established the complete sequence of the molecule. The 2 serine residues (positions 2 and 3) in statherin were identified as phosphoserine. The amino acid sequence of human salivary statherin is striking in a number of ways. The NH2-terminal one-third is highly polar and includes three polar dipeptides: H2PO3-Ser-Ser-H2PO3, -Arg-Arg-, and -Glu-Glu-. The COOH-terminal two-thirds of the molecule is hydrophobic, containing several repeating dipeptides: four of -Gln-Pro-, three of -Tyr-Gln-, two of -Gly-Tyr-, two of -Gln-Tyr-, and two of the tetrapeptide sequence -Pro-Tyr-Gln-Pro-. Unusual cleavage sites in the statherin sequence obtained with chymotrypsin and S. aureus protease were also noted.
Hongpattarakere, Tipparat; Komeda, Hidenobu; Asano, Yasuhisa
2005-12-01
The D-amino acid amidase-producing bacterium was isolated from soil samples using an enrichment culture technique in medium broth containing D-phenylalanine amide as a sole source of nitrogen. The strain exhibiting the strongest activity was identified as Delftia acidovorans strain 16. This strain produced intracellular D-amino acid amidase constitutively. The enzyme was purified about 380-fold to homogeneity and its molecular mass was estimated to be about 50 kDa, on sodium dodecyl sulfate polyacrylamide gel electrophoresis. The enzyme was active preferentially toward D-amino acid amides rather than their L-counterparts. It exhibited strong amino acid amidase activity toward aromatic amino acid amides including D-phenylalanine amide, D-tryptophan amide and D-tyrosine amide, yet it was not specifically active toward low-molecular-weight D-amino acid amides such as D-alanine amide, L-alanine amide and L-serine amide. Moreover, it was not specifically active toward oligopeptides. The enzyme showed maximum activity at 40 degrees C and pH 8.5 and appeared to be very stable, with 92.5% remaining activity after the reaction was performed at 45 degrees C for 30 min. However, it was mostly inactivated in the presence of phenylmethanesulfonyl fluoride or Cd2+, Ag+, Zn2+, Hg2+ and As3+. The NH2-terminal and internal amino acid sequences of the enzyme were determined, and the gene was cloned and sequenced. The enzyme gene damA encodes a 466-amino-acid protein (molecular mass 49,860.46 Da), and the deduced amino acid sequence exhibits homology to the D-amino acid amidase from Variovorax paradoxus (67.9% identity), the amidotransferase A subunit from Burkholderia fungorum (50% identity) and other enantioselective amidases.
The Mystery and Misery of Acid Reflux in Children
ERIC Educational Resources Information Center
Davenport, Mike; Davenport, Tracy
2006-01-01
When a child is sick, parents want answers. They want to know what is wrong, what they can do, and how to get their child healthy--pronto. Regrettably, there are some puzzling illnesses affecting children that are surrounded by mystery. One of them is gastroesophageal reflux (GER), otherwise known as acid reflux--or "reflux" for short. Reflux…
2014-01-01
Background: Ambiscript is a graphically-designed nucleic acid notation that uses symbol symmetries to support sequence complementation, highlight biologically-relevant palindromes, and facilitate the analysis of consensus sequences. Although the original Ambiscript notation was designed to easily represent consensus sequences for multiple sequence alignments, the notation’s black-on-white ambiguity characters are unable to reflect the statistical distribution of nucleotides found at each position. We now propose a color-augmented ambigraphic notation to encode the frequency of positional polymorphisms in these consensus sequences. Results: We have implemented this color-coding approach by creating an Adobe Flash® application (http://www.ambiscript.org) that shades and colors modified Ambiscript characters according to the prevalence of the encoded nucleotide at each position in the alignment. The resulting graphic helps viewers perceive biologically-relevant patterns in multiple sequence alignments by uniquely combining color, shading, and character symmetries to highlight palindromes and inverted repeats in conserved DNA motifs. Conclusion: Juxtaposing an intuitive color scheme over the deliberate character symmetries of an ambigraphic nucleic acid notation yields a highly-functional nucleic acid notation that maximizes information content and successfully embodies key principles of graphic excellence put forth by the statistician and graphic design theorist, Edward Tufte. PMID:24447494
Du, Q S; Ma, Y; Xie, N Z; Huang, R B
2014-01-01
In the design of peptide inhibitors the huge possible variety of the peptide sequences is of high concern. In collaboration with the fast accumulation of the peptide experimental data and database, a statistical method is suggested for peptide inhibitor design. In the two-level peptide prediction network (2L-QSAR) one level is the physicochemical properties of amino acids and the other level is the peptide sequence position. The activity contributions of amino acids are the functions of physicochemical properties and the sequence positions. In the prediction equation two weight coefficient sets {ak} and {bl} are assigned to the physicochemical properties and to the sequence positions, respectively. After the two coefficient sets are optimized based on the experimental data of known peptide inhibitors using the iterative double least square (IDLS) procedure, the coefficients are used to evaluate the bioactivities of new designed peptide inhibitors. The two-level prediction network can be applied to the peptide inhibitor design that may aim for different target proteins, or different positions of a protein. A notable advantage of the two-level statistical algorithm is that there is no need for host protein structural information. It may also provide useful insight into the amino acid properties and the roles of sequence positions.
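The abstract does not print the prediction equation, so the following is only one plausible reading of it, written out for orientation. The bilinear form, the symbols $p_k(r_{n,l})$, $u_{n,k}$ and $v_{n,l}$, and the alternating scheme are assumptions; only the coefficient sets $\{a_k\}$ and $\{b_l\}$ come from the text.

```latex
% Assumed bilinear form: r_{n,l} is the residue at position l of peptide n,
% and p_k(r) is the k-th physicochemical property of residue r.
\hat{y}_n \;=\; \sum_{l=1}^{L} b_l \sum_{k=1}^{K} a_k \, p_k(r_{n,l})

% Assumed "iterative double least square" reading: alternate two ordinary
% least-squares fits until the coefficients stop changing.
% Step 1 (positions fixed): u_{n,k} = \sum_{l} b_l \, p_k(r_{n,l})
\{a_k\} \;\leftarrow\; \arg\min_{a} \sum_{n} \Bigl( y_n - \sum_{k} a_k \, u_{n,k} \Bigr)^{2}
% Step 2 (properties fixed): v_{n,l} = \sum_{k} a_k \, p_k(r_{n,l})
\{b_l\} \;\leftarrow\; \arg\min_{b} \sum_{n} \Bigl( y_n - \sum_{l} b_l \, v_{n,l} \Bigr)^{2}
```

Under this reading each step is linear in the coefficients being updated, which is why no structural information about the host protein enters the fit: the inputs are only residue properties, sequence positions, and measured activities $y_n$.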
Dolinšek, Jan; Dorninger, Christiane; Lagkouvardos, Ilias; Wagner, Michael
2013-01-01
Many studies of molecular microbial ecology rely on the characterization of microbial communities by PCR amplification, cloning, sequencing, and phylogenetic analysis of genes encoding rRNAs or functional marker enzymes. However, if the established clone libraries are dominated by one or a few sequence types, the cloned diversity is difficult to analyze by random clone sequencing. Here we present a novel approach to deplete unwanted sequence types from complex nucleic acid mixtures prior to cloning and downstream analyses. It employs catalytically active oligonucleotides containing locked nucleic acids (LNAzymes) for the specific cleavage of selected RNA targets. When combined with in vitro transcription and reverse transcriptase PCR, this LNAzyme-based technique can be used with DNA or RNA extracts from microbial communities. The simultaneous application of more than one specific LNAzyme allows the concurrent depletion of different sequence types from the same nucleic acid preparation. This new method was evaluated with defined mixtures of cloned 16S rRNA genes and then used to identify accompanying bacteria in an enrichment culture dominated by the nitrite oxidizer “Candidatus Nitrospira defluvii.” In silico analysis revealed that the majority of publicly deposited rRNA-targeted oligonucleotide probes may be used as specific LNAzymes with no or only minor sequence modifications. This efficient and cost-effective approach will greatly facilitate tasks such as the identification of microbial symbionts in nucleic acid preparations dominated by plastid or mitochondrial rRNA genes from eukaryotic hosts, the detection of contaminants in microbial cultures, and the analysis of rare organisms in microbial communities of highly uneven composition. PMID:23263968
GCPred: a web tool for guanylyl cyclase functional centre prediction from amino acid sequence.
Xu, Nuo; Fu, Dongfang; Li, Shiang; Wang, Yuxuan; Wong, Aloysius
2018-06-15
GCPred is a webserver for the prediction of guanylyl cyclase (GC) functional centres from amino acid sequence. GCs are enzymes that generate the signalling molecule cyclic guanosine 3', 5'-monophosphate from guanosine-5'-triphosphate. A novel class of GC centres (GCCs) has been identified in complex plant proteins. Using currently available experimental data, GCPred was created to automate and facilitate the identification of similar GCCs. The server reports GCC values whose calculation considers the physicochemical properties of the amino acids constituting the GCC and the conserved amino acids within the centre. From a user-supplied amino acid sequence, the server returns a table of GCC values and graphs depicting deviations from mean values. The utility of this server is demonstrated using plant proteins and the human interleukin-1 receptor-associated kinase family of proteins as examples. The GCPred server is available at http://gcpred.com. Supplementary data are available at Bioinformatics online.
The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase.
Haggarty, N W; Dunbar, B; Fothergill, L A
1983-01-01
The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase, comprising 239 residues, was determined. The sequence was deduced from the four cyanogen bromide fragments, and from the peptides derived from these fragments after digestion with a number of proteolytic enzymes. Comparison of this sequence with that of the yeast glycolytic enzyme, phosphoglycerate mutase, shows that these enzymes are 47% identical. Most, but not all, of the residues implicated as being important for the activity of the glycolytic mutase are conserved in the erythrocyte diphosphoglycerate mutase. PMID:6313356
Complete complementary DNA-derived amino acid sequence of canine cardiac phospholamban.
Fujii, J; Ueno, A; Kitano, K; Tanaka, S; Kadoma, M; Tada, M
1987-01-01
Complementary DNA (cDNA) clones specific for phospholamban of sarcoplasmic reticulum membranes have been isolated from a canine cardiac cDNA library. The amino acid sequence deduced from the cDNA sequence indicates that phospholamban consists of 52 amino acid residues and lacks an amino-terminal signal sequence. The protein has an inferred mol wt 6,080 that is in agreement with its apparent monomeric mol wt 6,000, estimated previously by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. Phospholamban contains two distinct domains, a hydrophilic region at the amino terminus (domain I) and a hydrophobic region at the carboxy terminus (domain II). We propose that domain I is localized at the cytoplasmic surface and offers phosphorylatable sites whereas domain II is anchored into the sarcoplasmic reticulum membrane. PMID:3793929
Rottloff, Sandy; Stieber, Regina; Maischak, Heiko; Turini, Florian G; Heubl, Günther; Mithöfer, Axel
2011-08-01
Carnivory in plants is an adaptation strategy to nutrient-poor environments and soils. Carnivorous plants obtain some additional mineral nutrients by trapping and digesting prey; the genus Nepenthes is helped by its specialized pitcher traps. To make the nutrients available, the caught prey needs to be digested, a process that requires the concerted activity of several hydrolytic enzymes. To identify and investigate the various enzymes involved in this process, fluid from Nepenthes traps has been analysed in detail. In this study, a novel type of Nepenthes endochitinase was identified in the digestion fluid of closed pitchers. The encoding endochitinase genes have been cloned from eight different Nepenthes species. Among these, the deduced amino acid sequence similarity was at least 94.9%. The corresponding cDNA from N. rafflesiana was heterologously expressed, and the purified protein, NrChit1, was biochemically characterized. The enzyme, classified as a class III acid endochitinase belonging to family 18 of the glycoside hydrolases, is secreted into the pitcher fluid very probably due to the presence of an N-terminal signal peptide. Transcriptome analyses using real-time PCR indicated that the presence of prey in the pitcher up-regulates the endochitinase gene not only in the glands, which are responsible for enzyme secretion, but at an even higher level, in the glands' surrounding tissue. These results suggest that in the pitchers' tissues, the endochitinase as well as other proteins from the pitcher fluid might fulfil a different, primary function as pathogenesis-related proteins. © 2011 The Author(s).
Auvynet, Constance; Seddiki, Nabila; Dunia, Irene; Nicolas, Pierre; Amiche, Mohamed; Lacombe, Claire
2006-01-01
The dermal glands of the South American hylid frog Phyllomedusa bicolor synthesize and expel huge amounts of cationic, alpha-helical, 24- to 33-residue antimicrobial peptides, the dermaseptins B. These glands also produce a wide array of peptides that are similar to mammalian hormones and neuropeptides, including a heptapeptide opioid containing a D-amino acid, deltorphin I (Tyr-DAla-Phe-Asp-Val-Val-Gly NH2). Its biological activity is due to the racemization of L-Ala2 to D-Ala. The dermaseptins B and deltorphins are all derived from a single family of precursor polypeptides that have an N-terminal preprosequence that is remarkably well conserved, although the progenitor sequences giving rise to mature opioid or antimicrobial peptides are markedly different. Monoclonal and polyclonal antibodies were used to examine the cellular and ultrastructural distributions of deltorphin I and dermaseptin B in the serous glands by immunofluorescence confocal microscopy and immunogold-electron microscopy. Preprodeltorphin I and preprodermaseptins B are sorted into the regulated pathway of secretion, where they are processed to give the mature products. Deltorphin I, [l-Ala2]-deltorphin I and dermaseptin B are all stored together in secretion granules which accumulate in the cytoplasm of all serous glands. We conclude that the L- to D-amino acid isomerization of the deltorphin I occurs in the secretory granules as a post-translational event. Thus the specificity of isomerization depends on the presence of structural and/or conformational determinants in the peptide N-terminus surrounding the isomerization site.
NASA Astrophysics Data System (ADS)
Lu, Cheng-Tsung; Chen, Shu-An; Bretaña, Neil Arvin; Cheng, Tzu-Hsiu; Lee, Tzong-Yi
2011-10-01
In proteins, glutamate (Glu) residues are transformed into γ-carboxyglutamate (Gla) residues in a process called carboxylation. The process of protein carboxylation catalyzed by γ-glutamyl carboxylase is deemed to be important due to its involvement in biological processes such as the blood clotting cascade and bone growth. There is an increasing interest within the scientific community in identifying protein carboxylation sites. However, experimental identification of carboxylation sites via mass spectrometry-based methods is expensive, time-consuming, and labor-intensive. Thus, we were motivated to design a computational method for identifying protein carboxylation sites. This work aims to investigate protein carboxylation by considering the composition of amino acids that surround modification sites. Because a modified residue tends to be accessible on the surface of a protein, the solvent-accessible surface area (ASA) around carboxylation sites is also investigated. A radial basis function network is then employed to build a predictive model using various features for identifying carboxylation sites. Based on a five-fold cross-validation evaluation, a predictive model trained using the combined features of amino acid sequence (AA20D), amino acid composition, and ASA yields the highest accuracy at 0.874. Furthermore, an independent test involving data not included in the cross-validation process indicates that in silico identification is a feasible means of preliminary analysis. Additionally, the predictive method presented in this work is implemented as Carboxylator (http://csb.cse.yzu.edu.tw/Carboxylator/), a web-based tool for identifying carboxylated proteins with modification sites in order to help users in investigating γ-glutamyl carboxylation.
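The amino acid composition feature mentioned in the abstract above can be illustrated with a minimal sketch. The function name `window_composition`, the flank size, and the example sequence are all hypothetical choices made for illustration; the abstract does not state the window length or the exact encoding the authors used.

```python
from collections import Counter

# The 20 standard amino acids, one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def window_composition(sequence, site, flank=6):
    """Fraction of each standard amino acid in the window spanning
    `flank` residues on each side of `site` (0-based index).
    The window is clipped at the sequence ends."""
    start = max(0, site - flank)
    end = min(len(sequence), site + flank + 1)
    window = sequence[start:end]
    counts = Counter(window)
    return {aa: counts.get(aa, 0) / len(window) for aa in AMINO_ACIDS}

# Hypothetical peptide; the candidate Glu site is at index 4.
seq = "MKELEEGAEELLRK"
features = window_composition(seq, site=4, flank=2)
```

A vector like `features` could be concatenated with other descriptors (for example ASA values) before training a classifier; that step is not shown here.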
Chen, Xiaochi; Ansai, Toshihiro; Awano, Shuji; Iida, Toshiya; Barik, Sailen; Takehara, Tadamichi
1999-01-01
A novel acid phosphatase containing phosphotyrosyl phosphatase (PTPase) activity, designated PiACP, from Prevotella intermedia ATCC 25611, an anaerobe implicated in progressive periodontal disease, has been purified and characterized. PiACP, a monomer with an apparent molecular mass of 30 kDa, did not require divalent metal cations for activity and was sensitive to orthovanadate but highly resistant to okadaic acid. The enzyme exhibited substantial activity against tyrosine phosphate-containing peptides derived from the epidermal growth factor receptor. On the basis of N-terminal and internal amino acid sequences of purified PiACP, the gene coding for PiACP was isolated and sequenced. The PiACP gene consisted of 792 bp and coded for a basic protein with an Mr of 29,164. The deduced amino acid sequence exhibited striking similarity (25 to 64%) to those of members of class A bacterial acid phosphatases, including PhoC of Morganella morganii, and involved a conserved phosphatase sequence motif that is shared among several lipid phosphatases and the mammalian glucose-6-phosphatases. The highly conservative motif HCXAGXXR in the active domain of PTPase was not found in PiACP. Mutagenesis of recombinant PiACP showed that His-170 and His-209 were essential for activity. Thus, the class A bacterial acid phosphatases including PiACP may function as atypical PTPases, the biological functions of which remain to be determined. PMID:10559178
Rapid Threat Organism Recognition Pipeline
DOE Office of Scientific and Technical Information (OSTI.GOV)
Williams, Kelly P.; Solberg, Owen D.; Schoeniger, Joseph S.
2013-05-07
The RAPTOR computational pipeline identifies microbial nucleic acid sequences present in sequence data from clinical samples. It takes as input raw short-read genomic sequence data (in particular, the type generated by the Illumina sequencing platforms) and outputs taxonomic evaluation of detected microbes in various human-readable formats. This software was designed to assist in the diagnosis or characterization of infectious disease, by detecting pathogen sequences in nucleic acid sequence data from clinical samples. It has also been applied in the detection of algal pathogens, when algal biofuel ponds became unproductive. RAPTOR first trims and filters genomic sequence reads based on quality and related considerations, then performs a quick alignment to the human (or other host) genome to filter out host sequences, then performs a deeper search against microbial genomes. Alignment to a protein sequence database is optional. Alignment results are summarized and placed in a taxonomic framework using the Lowest Common Ancestor algorithm.
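The Lowest Common Ancestor step named in the abstract above can be sketched in a few lines. This is a generic illustration, not RAPTOR's implementation: the function name, the representation of a hit as a root-to-leaf list of taxon names, and the abbreviated example lineages are assumptions.

```python
def lowest_common_ancestor(lineages):
    """Return the deepest taxon shared by all lineages.
    Each lineage is a list of taxon names ordered from root to leaf.
    Returns None if there are no lineages or no shared root."""
    if not lineages:
        return None
    lca = None
    # zip stops at the shortest lineage, so ranks are compared level by level.
    for ranks in zip(*lineages):
        if all(rank == ranks[0] for rank in ranks):
            lca = ranks[0]
        else:
            break
    return lca

# Hypothetical case: one read aligns equally well to two genomes.
# Lineages are abbreviated for illustration.
hits = [
    ["Bacteria", "Proteobacteria", "Enterobacteriaceae", "Escherichia", "Escherichia coli"],
    ["Bacteria", "Proteobacteria", "Enterobacteriaceae", "Salmonella", "Salmonella enterica"],
]
assigned = lowest_common_ancestor(hits)
```

Here the read would be reported at the family level, since the two hits disagree from the genus downward.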
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Brady D.; Thompson, David N.; Apel, William A.
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of modulating transcription or transcriptional control using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Lee, Brady Deneys; Thompson, David N; Apel, William A.; Thompson, Vicki Slavchev; Reed, David W; Lacey, Jeffrey A
2014-05-06
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of modulating transcription or transcriptional control using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Lee, Brady D.; Thompson, David N.; Apel, William A.; Thompson, Vicki S.; Reed, David W.; Lacey, Jeffrey A.
2015-11-17
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of modulating transcription or transcriptional control using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Lee, Brady D; Thompson, David N; Apel, William A; Thompson, Vicki S; Reed, David W; Lacey, Jeffrey A
2016-11-22
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods of modulating transcription or transcriptional control using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Aggregation of peptides in the tube model with correlated sidechain orientations
NASA Astrophysics Data System (ADS)
Hung, Nguyen Ba; Hoang, Trinh Xuan
2015-06-01
The ability of proteins and peptides to aggregate and form toxic amyloid fibrils is associated with a range of diseases including BSE (or mad cow), Alzheimer's and Parkinson's Diseases. In this study, we investigate the role of amino acid sequence in the aggregation propensity by using a modified tube model with a new procedure for hydrophobic interaction. In this model, the amino acid sidechains are not considered explicitly, but their orientations are taken into account in the formation of hydrophobic contacts. Extensive Monte Carlo simulations for systems of short peptides are carried out using the parallel tempering technique. Our results show that the propensity to form aggregates, and the structures of those aggregates, strongly depend on the amino acid sequence and the number of peptides. Some sequences may not aggregate at all at a presumable physiological temperature, while others can easily form fibril-like β-sheet structures. Our study provides an insight into the principles of how the formation of amyloid can be governed by amino acid sequence.
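The parallel tempering technique mentioned in the abstract above relies on occasional swaps of configurations between replicas held at different temperatures. A minimal sketch of the standard Metropolis swap criterion follows; the function name and the numerical values are illustrative assumptions, and nothing here reproduces the tube model or the authors' simulation parameters.

```python
import math

def swap_acceptance_probability(beta_i, beta_j, energy_i, energy_j):
    """Standard replica-exchange criterion: the probability of swapping
    the configurations of two replicas with inverse temperatures
    beta_i, beta_j and current energies energy_i, energy_j is
    min(1, exp((beta_i - beta_j) * (energy_i - energy_j)))."""
    exponent = (beta_i - beta_j) * (energy_i - energy_j)
    if exponent >= 0:
        return 1.0
    return math.exp(exponent)

# Illustrative values in reduced units (assumed, not from the study).
# Cold replica (beta = 1.0) holds the higher energy: swap always accepted.
p_favourable = swap_acceptance_probability(1.0, 0.5, -5.0, -10.0)
# Cold replica already holds the lower energy: swap accepted only sometimes.
p_unfavourable = swap_acceptance_probability(1.0, 0.5, -10.0, -5.0)
```

In a full simulation each replica would also perform ordinary Monte Carlo moves between swap attempts, and a swap would be accepted when a uniform random number falls below the returned probability.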
Nucleic acids encoding antifungal polypeptides and uses thereof
Altier, Daniel J.; Ellanskaya, I. A.; Gilliam, Jacob T.; Hunter-Cevera, Jennie; Presnail, James K; Schepers, Eric; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser
2010-11-02
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include an amino acid sequence, and variants and fragments thereof, for an antipathogenic polypeptide that was isolated from a fungal fermentation broth. Nucleic acid molecules that encode the antipathogenic polypeptides of the invention, and antipathogenic domains thereof, are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention are also disclosed.
Cloning and High-Level Expression of α-Galactosidase cDNA from Penicillium purpurogenum
Shibuya, Hajime; Nagasaki, Hiroaki; Kaneko, Satoshi; Yoshida, Shigeki; Park, Gwi Gun; Kusakabe, Isao; Kobayashi, Hideyuki
1998-01-01
The cDNA coding for Penicillium purpurogenum α-galactosidase (αGal) was cloned and sequenced. The deduced amino acid sequence of the α-Gal cDNA showed that the mature enzyme consisted of 419 amino acid residues with a molecular mass of 46,334 Da. The derived amino acid sequence of the enzyme showed similarity to eukaryotic αGals from plants, animals, yeasts, and filamentous fungi. The highest similarity observed (57% identity) was to Trichoderma reesei AGLI. The cDNA was expressed in Saccharomyces cerevisiae under the control of the yeast GAL10 promoter. Almost all of the enzyme produced was secreted into the culture medium, and the expression level reached was approximately 0.2 g/liter. The recombinant enzyme purified to homogeneity was highly glycosylated, showed slightly higher specific activity, and exhibited properties almost identical to those of the native enzyme from P. purpurogenum in terms of the N-terminal amino acid sequence, thermoactivity, pH profile, and mode of action on galacto-oligosaccharides. PMID:9797312
Regulation of Glucose Transport in Quiescent, Lactating, and Neoplastic Mammary Epithelia
1998-10-01
17000g pellet iodixanol density gradient was collected and solubilized with 1.25% dodecyl maltoside in the presence of 6-aminocaproic acid. After a...regulatory properties, tissue distributions, and kinetics. However, they are all integral membrane proteins containing approximately 500 amino acids...Hydropathy plots based on amino acid sequences predicted from cDNA sequence suggest that all share a common topology, which includes cytoplasmic N- and C
Yu, Bo; Su, Fei; Wang, Limin; Zhao, Bo; Qin, Jiayang; Ma, Cuiqing; Xu, Ping; Ma, Yanhe
2011-01-01
Lactobacillus rhamnosus is a type of probiotic bacteria with industrial potential for l-lactic acid production. We announce the draft genome sequence of L. rhamnosus CASL (2,855,156 bp with a G+C content of 46.6%), which is an efficient producer of l-lactic acid from cheap, nonfood substrate cassava with a high production titer. PMID:22123765
Su, Fei; Yu, Bo; Sun, Jibin; Ou, Hong-Yu; Zhao, Bo; Wang, Limin; Qin, Jiayang; Tang, Hongzhi; Tao, Fei; Jarek, Michael; Scharfe, Maren; Ma, Cuiqing; Ma, Yanhe; Xu, Ping
2011-09-01
Bacillus coagulans 2-6 is an efficient producer of lactic acid. B. coagulans 2-6 has the smallest genome among the members of the genus Bacillus known to date. The frameshift mutation at the start of the d-lactate dehydrogenase sequence might be responsible for the production of high-optical-purity l-lactic acid.
Epitaxial Nucleation on Rationally Designed Peptide Functionalized Interface
2011-07-19
of 17 amino acid peptides. In this report, we focus on the findings from several variants of these sequences, including the role of charge...separation and histidine-gold coordination. We find that these 17 amino acid peptide sequences behave robustly, where periodicity appears to dominate the...26,27 Secondary structure propensity refers to the intrinsic inclination of individual amino acids to a given secondary structure, where side-group
Crotoxin: Structural Studies, Mechanism of Action and Cloning of Its gene
1989-12-01
B-chain. Sequencing of the three peptides present in the acidic subunit, two of which are blocked by pyroglutamate, represents a significant...We have completed the sequence determination of both the basic and acidic subunits of crotoxin. The acidic subunit peptides were difficult, since two...of the three peptides were blocked at the amino-terminus by pyroglutamate. Earlier structural studies on crotoxin and related crotalid dimeric
Molecular Cloning of Adenosinediphosphoribosyl Transferase.
1987-09-08
nature of the blocking group is unknown, except its identity with pyroglutamic acid was ruled out by its insensitivity to pyroglutaminase (not shown...Adenosinediphosphoribosyl Transferase (ADPRT) is: 1) the complete amino acid sequence of this large protein is best determined from the DNA sequence of the gene, 2...enzyme (I), determination of its peptide structure (II) and application of synthetic DNA probes (III) derived from amino acid sequences, resulting in the
Kimura, J; Kimura, M
1987-09-05
The amino acid sequences of two ribosomal proteins, S14 and S16, from the archaebacterium Halobacterium marismortui have been determined. Sequence data were obtained by the manual and solid-phase sequencing of peptides derived from enzymatic digestions with trypsin, chymotrypsin, pepsin, and Staphylococcus aureus protease as well as by chemical cleavage with cyanogen bromide. Proteins S14 and S16 contain 109 and 126 amino acid residues and have Mr values of 11,964 and 13,515, respectively. Comparison of the sequences with those of ribosomal proteins from other organisms demonstrates that S14 has a significant homology with the rat liver ribosomal protein S11 (36% identity) as well as with the Escherichia coli ribosomal protein S17 (37%), and that S16 is related to the yeast ribosomal protein YS22 (40%) and proteins S8 from E. coli (28%) and Bacillus stearothermophilus (30%). A comparison of the amino acid residues in the homologous regions of halophilic and nonhalophilic ribosomal proteins reveals that halophilic proteins have more glutamic acids, aspartic acids, prolines, and alanines, and fewer lysines, arginines, and isoleucines than their nonhalophilic counterparts. These amino acid substitutions probably contribute to the structural stability of halophilic ribosomal proteins.
NASA Astrophysics Data System (ADS)
Mandre, Shreyas; Akella, Sathish; Singh, Dhiraj; Singh, Ravi; Bandi, Mahesh
2016-11-01
A camphoric-acid boat (c-boat for short), a cylindrical gel tablet infused with camphoric acid, moves spontaneously when placed on an air-water interface. This system is a classic example of propulsion driven by Marangoni forces. Despite the rich history of work on particles propelled by Marangoni forces, including contributions by figures such as Benjamin Franklin, Alessandro Volta, and Giovanni Venturi, the underlying fluid dynamics remains poorly understood. A key missing piece is the nature of the surfactant; in our case, the question is whether the camphoric acid is dissolved in the bulk or adsorbed onto the interface. We gain insight into this piece by holding the c-boat stationary and measuring the surrounding axisymmetric flow velocity to the precision needed to distinguish between the two possibilities. For soluble surfactants, it is known that the velocity field decays as $r^{-2/3}$, where $r$ is the distance from the center of the c-boat. For surfactant adsorbed onto the air-water interface, by contrast, we derive that the surrounding velocity field decays as $r^{-3/5}$. Based on our measurements we deduce that, even though camphoric acid is soluble in water, the Marangoni flow results from a layer of camphoric acid adsorbed to the air-water interface.
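Distinguishing a decay exponent of -2/3 from one of -3/5 amounts to estimating the slope of speed against distance on log-log axes. The sketch below fits that slope by ordinary least squares; the function name and the synthetic data are assumptions for illustration and are not the authors' measurements or analysis.

```python
import math

def fit_power_law_exponent(radii, speeds):
    """Least-squares slope of log(speed) against log(radius),
    i.e. the exponent n in speed ~ radius**n."""
    xs = [math.log(r) for r in radii]
    ys = [math.log(u) for u in speeds]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

# Synthetic, noise-free data generated with the adsorbed-layer exponent.
radii = [1.0, 2.0, 4.0, 8.0, 16.0]
speeds = [r ** (-3 / 5) for r in radii]
exponent = fit_power_law_exponent(radii, speeds)

# Which candidate exponent lies closer to the fitted slope?
closer_to_adsorbed = abs(exponent + 3 / 5) < abs(exponent + 2 / 3)
```

The two candidate exponents differ by only 1/15, which is why the abstract stresses measurement precision; with noisy data, a confidence interval on the fitted slope would be needed before drawing a conclusion.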
Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences
Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.
2012-01-01
ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136
Semi-Immersive Virtual Turbine Engine Simulation System
NASA Astrophysics Data System (ADS)
Abidi, Mustufa H.; Al-Ahmari, Abdulrahman M.; Ahmad, Ali; Darmoul, Saber; Ameen, Wadea
2018-05-01
The design and verification of assembly operations is essential for planning product production operations. Recently, virtual prototyping has witnessed tremendous progress, and has reached a stage where current environments enable rich and multi-modal interaction between designers and models through stereoscopic visuals, surround sound, and haptic feedback. The benefits of building and using Virtual Reality (VR) models in assembly process verification are discussed in this paper. In this paper, we present the virtual assembly (VA) of an aircraft turbine engine. The assembly parts and sequences are explained using a virtual reality design system. The system enables stereoscopic visuals, surround sounds, and ample and intuitive interaction with developed models. A special software architecture is suggested to describe the assembly parts and assembly sequence in VR. A collision detection mechanism is employed that provides visual feedback to check the interference between components. The system is tested for virtual prototype and assembly sequencing of a turbine engine. We show that the developed system is comprehensive in terms of VR feedback mechanisms, which include visual, auditory, tactile, as well as force feedback. The system is shown to be effective and efficient for validating the design of assembly, part design, and operations planning.
Sata, F; Sapone, A; Elizondo, G; Stocker, P; Miller, V P; Zheng, W; Raunio, H; Crespi, C L; Gonzalez, F J
2000-01-01
To determine the existence of mutant and variant CYP3A4 alleles in three racial groups and to assess functions of the variant alleles by complementary deoxyribonucleic acid (cDNA) expression. A bacterial artificial chromosome that contains the complete CYP3A4 gene was isolated and the exons and surrounding introns were directly sequenced to develop primers to polymerase chain reaction (PCR) amplify and sequence the gene from lymphocyte DNA. DNA samples from Chinese, black, and white subjects were screened. Mutating the affected amino acid in the wild-type cDNA and expressing the variant enzyme with use of the baculovirus system was used to functionally evaluate the variant allele having a missense mutation. To investigate the existence of mutant and variant CYP3A4 alleles in humans, all 13 exons and the 5'-flanking region of the human CYP3A4 gene in three racial groups were sequenced and four alleles were identified. An A-->G point mutation in the 5'-flanking region of the human CYP3A4 gene, designated CYP3A4*1B, was found in the three different racial groups. The frequency of this allele in a white population was 4.2%, whereas it was 66.7% in black subjects. The CYP3A4*1B allele was not found in Chinese subjects. A second variant allele, designated CYP3A4*2, having a Ser222Pro change, was found at a frequency of 2.7% in the white population and was absent in the black subjects and Chinese subjects analyzed. Baculovirus-directed cDNA expression revealed that the CYP3A4*2 P450 had a lower intrinsic clearance for the CYP3A4 substrate nifedipine compared with the wild-type enzyme but was not significantly different from the wild-type enzyme for testosterone 6beta-hydroxylation. Another rare allele, designated CYP3A4*3, was found in a single Chinese subject who had a Met445Thr change in the conserved heme-binding region of the P450.
These are the first examples of potential function polymorphisms resulting from missense mutations in the CYP3A4 gene. The CYP3A4*2 allele was found to encode a P450 with substrate-dependent altered kinetics compared with the wild-type P450.
High speed nucleic acid sequencing
Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY
2011-05-17
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid. Each type of labeled nucleotide comprises an acceptor fluorophore attached to a phosphate portion of the nucleotide such that the fluorophore is removed upon incorporation into a growing strand. Fluorescent signal is emitted via fluorescent resonance energy transfer between the donor fluorophore and the acceptor fluorophore as each nucleotide is incorporated into the growing strand. The sequence is deduced by identifying which base is being incorporated into the growing strand.
Jung, Woongsic; Kim, Eun Jae; Han, Se Jong; Choi, Han-Gu; Kim, Sanghee
2016-10-01
Stearoyl-CoA desaturase is a key regulator in fatty acid metabolism that catalyzes the desaturation of stearic acid to oleic acid and controls the intracellular levels of monounsaturated fatty acids (MUFAs). Two stearoyl-CoA desaturase (SCD, Δ9 desaturase) genes were identified in an Antarctic copepod, Tigriopus kingsejongensis, that was collected in a tidal pool near the King Sejong Station, King George Island, Antarctica. Full-length complementary DNA (cDNA) sequences of two T. kingsejongensis SCDs (TkSCDs) were obtained from next-generation sequencing and isolated by reverse transcription PCR. DNA sequence lengths of the open reading frames of TkSCD-1 and TkSCD-2 were determined to be 1110 and 681 bp, respectively. The molecular weights deduced from the corresponding genes were estimated to be 43.1 kDa (TkSCD-1) and 26.1 kDa (TkSCD-2). The amino acid sequences were compared with those of fatty acid desaturases and sterol desaturases from various organisms and used to analyze the relationships among TkSCDs. Heterologous expression of the recombinant proteins in Escherichia coli showed that both stearoyl-CoA desaturases were functional: the amounts of C16:1 and C18:1 fatty acids increased by greater than 3-fold after induction with isopropyl β-D-thiogalactopyranoside. In particular, C18:1 fatty acid production increased greater than 10-fold in E. coli expressing TkSCD-1 and TkSCD-2. The results of this study suggest that both SCD genes from an Antarctic marine copepod encode a functional desaturase that is capable of increasing the amounts of palmitoleic acid and oleic acid in a prokaryotic expression system.
Conversion of amino-acid sequence in proteins to classical music: search for auditory patterns
2007-01-01
We have converted genome-encoded protein sequences into musical notes to reveal auditory patterns without compromising musicality. We derived a reduced range of 13 base notes by pairing similar amino acids and distinguishing them using variations of three-note chords and codon distribution to dictate rhythm. The conversion will help make genomic coding sequences more approachable for the general public, young children, and vision-impaired scientists. PMID:17477882
Nam, Bo-Hye; Seo, Jung-Kil; Lee, Min Jeong; Kim, Young-Ok; Kim, Dong-Gyun; An, Cheul Min; Park, Nam Gyu
2015-07-01
An antimicrobial peptide, ∼5 kDa in size, was isolated and purified in its active form from the mantle of the Pacific oyster Crassostrea gigas by C18 reversed-phase high-performance liquid chromatography. Matrix-assisted laser desorption ionisation time-of-flight analysis revealed 4656.4 Da of the purified and unreduced peptide. A comparison of the N-terminal amino acid sequence of oyster antimicrobial peptide with deduced amino acid sequences in our local expressed sequence tag (EST) database of C. gigas (unpublished data) revealed that the oyster antimicrobial peptide sequence entirely matched the deduced amino acid sequence of an EST clone (HM-8_A04), which was highly homologous with the β-thymosin of other species. The cDNA possessed a 126-bp open reading frame that encoded a protein of 41 amino acids. To confirm the antimicrobial activity of C. gigas β-thymosin, we overexpressed a recombinant β-thymosin (rcgTβ) using a pET22 expression plasmid in an Escherichia coli system. The antimicrobial activity of rcgTβ was evaluated and demonstrated using a bacterial growth inhibition test in both liquid and solid cultures. Copyright © 2015 Elsevier Ltd. All rights reserved.
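The relationship between the 126-bp open reading frame and the 41-residue protein reported above can be checked with a short calculation. The sketch assumes that the stated ORF length includes the stop codon, which the abstract does not say explicitly; the function name is hypothetical.

```python
def protein_length_from_orf(orf_bp, includes_stop=True):
    """Number of encoded residues for an ORF of `orf_bp` base pairs.
    Each codon is 3 bp; if the ORF length counts the stop codon,
    that codon does not contribute a residue."""
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons

# 126 bp -> 42 codons -> 41 residues plus one stop codon.
residues = protein_length_from_orf(126)
```

Under that assumption the two figures in the abstract are mutually consistent.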
NASA Technical Reports Server (NTRS)
Lee, W. Y.; Brune, D. C.; LoBrutto, R.; Blankenship, R. E.
1995-01-01
Rubredoxin is a small nonheme iron protein that serves as an electron carrier in bacterial systems. Rubredoxin has now been isolated and characterized from the strictly anaerobic phototroph, Heliobacillus mobilis. The molecular mass (5671.3 Da from the amino acid sequence) was confirmed and partial formylation of the N-terminal methionyl residue was established by matrix-assisted laser desorption mass spectroscopy. The complete 52-amino-acid sequence was determined by a combination of N-terminal sequencing by Edman degradation and C-terminal sequencing by a novel method using carboxypeptidase treatment in conjunction with amino acid analysis and laser desorption time of flight mass spectrometry. The molar absorption coefficient of Hc. mobilis rubredoxin at 490 nm is 6.9 mM-1 cm-1 and the midpoint redox potential at pH 8.0 is -46 mV. The EPR spectrum of the oxidized form shows resonances at g = 9.66 and 4.30 due to a high-spin ferric iron. The amino acid sequence is homologous to those of rubredoxins from other species, in particular, the gram-positive bacteria, and the phototrophic green sulfur bacteria, and the evolutionary implications of this are discussed.
Tan, Lianjiang; Liu, Yazhi; Li, Xiaowei; Wu, Xin-Yan; Gong, Bing; Shen, Yu-Mei; Shao, Zhifeng
2016-02-11
An acid-cleavable linker based on a dimethylketal moiety was synthesized and used to connect a nucleotide with a fluorophore to produce a 3'-OH unblocked nucleotide analogue as an excellent reversible terminator for DNA sequencing by synthesis.
Type II restriction modification system methylation subunit of Alicyclobacillus acidocaldarius
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lee, Brady D.; Newby, Deborah T.; Lacey, Jeffrey A.
2018-02-13
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for modulating or altering recombination inside or outside of a cell using isolated and/or purified polypeptides and/or nucleic acid sequences from Alicyclobacillus acidocaldarius.
Tang, Khanh G; Kent, Greggory T; Erden, Ihsan; Wu, Weiming
2017-10-04
cis-β-Bromostyrene derivatives were synthesized stereospecifically from cinnamic acids through β-lactone intermediates. The synthetic sequence did not require the purification of the β-lactone intermediates although they were found to be stable and readily purified in most cases.
Type II restriction-modification system methylation subunit of Alicyclobacillus acidocaldarius
Lee, Brady D; Newby, Deborah T; Lacey, Jeffrey A; Thompson, David N; Thompson, Vicki S; Apel, William A; Roberto, Francisco F; Reed, David W
2013-10-29
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for modulating or altering recombination inside or outside of a cell using isolated and/or purified polypeptides and/or nucleic acid sequences from Alicyclobacillus acidocaldarius.
Type II restriction-modification system methylation subunit of Alicyclobacillus acidocaldarius
Lee, Brady D; Newby, Deborah T; Lacey, Jeffrey A; Thompson, David N; Thompson, Vicki S; Apel, William A; Roberto, Francisco F; Reed, David W
2015-05-12
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for modulating or altering recombination inside or outside of a cell using isolated and/or purified polypeptides and/or nucleic acid sequences from Alicyclobacillus acidocaldarius.
Type II restriction modification system methylation subunit of Alicyclobacillus acidocaldarius
Lee, Brady D.; Newby, Deborah T.; Lacey, Jeffrey A.; Thompson, David N.; Thompson, Vicki S.; Apel, William A.; Roberto, Francisco F.; Reed, David W.
2017-02-14
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for modulating or altering recombination inside or outside of a cell using isolated and/or purified polypeptides and/or nucleic acid sequences from Alicyclobacillus acidocaldarius.
Thompson, David N.; Apel, William A.; Thompson, Vicki S.; Reed, David W.; Lacey, Jeffrey A.
2013-01-15
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for transporting sugars across cell membranes using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Thompson, Vicki S.; Apel, William A.; Reed, David William; Lee, Brady D.; Thompson, David N.; Roberto, Francisco F.; Lacey, Jeffrey A.
2015-12-29
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for modulating or altering metabolism in a cell using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Thompson, Vicki S; Apel, William A; Reed, David W; Lee, Brady D; Thompson, David N; Roberto, Francisco F; Lacey, Jeffrey A
2014-05-20
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for modulating or altering metabolism in a cell using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Thompson, David N; Apel, William A; Thompson, Vicki S; Reed, David W; Lacey, Jeffrey A
2017-06-14
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for glycosylating and/or post-translationally modifying proteins using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Thompson, David N [Idaho Falls, ID]; Apel, William A [Jackson, WY]; Thompson, Vicki S [Idaho Falls, ID]; Reed, David W [Idaho Falls, ID]; Lacey, Jeffrey A [Idaho Falls, ID]
2011-12-06
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for transporting sugars across cell membranes using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Thompson, David N [Idaho Falls, ID]; Apel, William A [Jackson, WY]; Thompson, Vicki S [Idaho Falls, ID]; Reed, David W [Idaho Falls, ID]; Lacey, Jeffrey A [Idaho Falls, ID]
2011-06-14
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for transporting sugars across cell membranes using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Thompson, David N.; Apel, William A.; Thompson, Vicki S.; Reed, David W.; Lacey, Jeffrey A.
2013-01-29
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for transporting sugars across cell membranes using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Thompson, David N.; Apel, William A.; Thompson, Vicki S.; Reed, David W.; Lacey, Jeffrey A.
2016-01-12
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for glycosylating and/or post-translationally modifying proteins using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
Thompson, David N; Apel, William A; Thompson, Vicki S; Reed, David W; Lacey, Jeffrey A
2013-11-05
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for transporting sugars across cell membranes using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Thompson, Vicki S.; Apel, William A.; Lacey, Jeffrey A.
Isolated and/or purified polypeptides and nucleic acid sequences encoding polypeptides from Alicyclobacillus acidocaldarius are provided. Further provided are methods for modulating or altering metabolism in a cell using isolated and/or purified polypeptides and nucleic acid sequences from Alicyclobacillus acidocaldarius.
2013-01-01
Background: Hypodontus macropi is a common intestinal nematode of a range of kangaroos and wallabies (macropodid marsupials). Based on previous multilocus enzyme electrophoresis (MEE) and nuclear ribosomal DNA sequence data sets, H. macropi has been proposed to be a complex of species. To test this proposal using independent molecular data, we sequenced the whole mitochondrial (mt) genomes of individuals of H. macropi from three different species of hosts (Macropus robustus robustus, Thylogale billardierii and Macropus [Wallabia] bicolor) as well as that of Macropicola ocydromi (a related nematode), and undertook a comparative analysis of the amino acid sequence datasets derived from these genomes. Results: The mt genomes sequenced by next-generation (454) technology from H. macropi from the three host species varied from 13,634 bp to 13,699 bp in size. Pairwise comparisons of the amino acid sequences predicted from these three mt genomes revealed differences of 5.8% to 18%. Phylogenetic analysis of the amino acid sequence data sets using Bayesian Inference (BI) showed that H. macropi from the three different host species formed distinct, well-supported clades. In addition, sliding window analysis of the mt genomes defined variable regions for future population genetic studies of H. macropi in different macropodid hosts and geographical regions around Australia. Conclusions: The present analyses of inferred mt protein sequence datasets clearly supported the hypothesis that H. macropi from M. robustus robustus, M. bicolor and T. billardierii represent distinct species. PMID:24261823
Stolterfoht, Holly; Schwendenwein, Daniel; Sensen, Christoph W; Rudroff, Florian; Winkler, Margit
2017-09-10
Increasing demand for chemicals from renewable resources calls for the development of new biotechnological methods for the reduction of oxidized bio-based compounds. Enzymatic carboxylate reduction is highly selective, both in terms of chemo- and product selectivity, but not many carboxylate reductase enzymes (CARs) have been identified on the sequence level to date. Thus far, their phylogeny is unexplored and very little is known about their structure-function-relationship. CARs minimally contain an adenylation domain, a phosphopantetheinylation domain and a reductase domain. We have recently identified new enzymes of fungal origin, using similarity searches against genomic sequences from organisms in which aldehydes were detected upon incubation with carboxylic acids. Analysis of sequences with known CAR functionality and CAR enzymes recently identified in our laboratory suggests that the three-domain architecture mentioned above is modular. The construction of a distance tree with a subsequent 1000-replicate bootstrap analysis showed that the CAR sequences included in our study fall into four distinct subgroups (one of bacterial origin and three of fungal origin, respectively), each with a bootstrap value of 100%. The multiple sequence alignment of all experimentally confirmed CAR protein sequences revealed fingerprint sequences of residues which are likely to be involved in substrate and co-substrate binding and one of the three catalytic substeps, respectively. The fingerprint sequences broaden our understanding of the amino acids that might be essential for the reduction of organic acids to the corresponding aldehydes in CAR proteins. Copyright © 2017 Elsevier B.V. All rights reserved.
Preferential amino acid sequences in alumina-catalyzed peptide bond formation.
Bujdák, J; Rode, B M
2002-05-21
The catalytic effect of activated alumina on amino acid condensation was investigated. The readiness of amino acids to form peptide sequences was estimated on the basis of the yield of dipeptides and was found to decrease in the order glycine (Gly), alanine (Ala), leucine (Leu), valine (Val), proline (Pro). For example, approximately 15% Gly was converted to the dipeptide (Gly(2)), 5% to cyclic anhydride (cyc(Gly(2))) and small amounts of tri- (Gly(3)) and tetrapeptide (Gly(4)) were formed after 28 days. On the other hand, only trace amounts of Pro(2) were formed from proline under the same conditions. Preferential formation of certain sequences was observed in the mixed reaction systems containing two amino acids. For example, almost ten times more Gly-Val than Val-Gly was formed in the Gly+Val reaction system. The preferred sequences can be explained on the basis of an inductive effect that side groups have on the nucleophilicity and electrophilicity, respectively, of the amino and carboxyl groups. A comparison with published data of amino acid reactions in other reaction systems revealed that the main trends of preferential sequence formation were the same as those described for the salt-induced peptide formation (SIPF) reaction. The results of this work and other previously published papers show that alumina and related mineral surfaces might have played a crucial role in the prebiotic formation of the first peptides on the primitive earth.
Ross, Cody T.; Roodgar, Morteza; Smith, David Glenn
2015-01-01
We use the Reciprocal Smallest Distance (RSD) algorithm to identify amino acid sequence orthologs in the Chinese and Indian rhesus macaque draft sequences and estimate the evolutionary distance between such orthologs. We then use GOanna to map gene function annotations and human gene identifiers to the rhesus macaque amino acid sequences. We conclude methodologically by cross-tabulating a list of amino acid orthologs with large divergence scores with a list of genes known to be involved in SIV or HIV pathogenesis. We find that many of the amino acid sequences with large evolutionary divergence scores, as calculated by the RSD algorithm, have been shown to be related to HIV pathogenesis in previous laboratory studies. Four of the strongest candidate genes for SIVmac resistance in Chinese rhesus macaques identified in this study are CDK9, CXCL12, TRIM21, and TRIM32. Additionally, ANKRD30A, CTSZ, GORASP2, GTF2H1, IL13RA1, MUC16, NMDAR1, Notch1, NT5M, PDCD5, RAD50, and TM9SF2 were identified as possible candidates, among others. We failed to find many laboratory experiments contrasting the effects of Indian and Chinese orthologs at these sites on SIVmac pathogenesis, but future comparative studies might hold fertile ground for research into the biological mechanisms underlying innate resistance to SIVmac in Chinese rhesus macaques. PMID:25884674
CLAVATA3-like genes are differentially expressed in grape vine (Vitis vinifera) tissues.
Tominaga-Wada, Rumi; Nukumizu, Yuka; Wada, Takuji; Sawa, Shinichiro; Tetsumura, Takuya
2013-10-15
The CLAVATA3 (CLV3)/endosperm surrounding region [(ESR) CLE] peptides function as intercellular signaling molecules that regulate various physiological and developmental processes in diverse plant species. We identified five CLV3-like genes from grape vine (Vitis vinifera var. Pinot Noir): VvCLE 6, VvCLE 25-1, VvCLE 25-2, VvCLE 43 and VvCLE TDIF. These CLV3-like genes encode short proteins containing 43-128 amino acids. Except VvCLE TDIF, grape vine CLV3-like proteins possess a consensus amino acid sequence known as the CLE domain. Phylogenic analysis suggests that the VvCLE 6, VvCLE25-1, VvCLE25-2 and VvCLE43 genes have evolved from a single common ancestor to the Arabidopsis CLV3 gene. Expression analyses showed that the five grape CLV3-like genes are expressed in leaves, stems, roots and axillary buds with significant differences in their levels of expression. For example, while all of them were strongly expressed in axillary buds, VvCLE6 and VvCLE43 expression prevailed in roots, and VvCLE25-1, VvCLE25-2 and VvCLE TDIF expression in stems. The differential expression of the five grape CLV3-like peptides suggests that they play different roles in different organs and developmental stages. Copyright © 2013 Elsevier GmbH. All rights reserved.
Maurya, Anay Kumar; Parashar, Deepak; Satyanarayana, T
2017-01-01
The thermophilic mold Sporotrichum thermophile secretes an acid-stable and thermostable phytase, which finds application as a food and feed additive because of its adequate thermostability, acid stability, protease insensitivity and broad substrate spectrum. Low extracellular phytase production by the mold is a major bottleneck for its application on a commercial scale. We have successfully overcome this problem by constitutive secretory expression of codon-optimized rStPhy under the glyceraldehyde phosphate dehydrogenase (GAP) promoter in Pichia pastoris. A ∼41-fold improvement in rStPhy production has been achieved. Circular Dichroism (CD) spectra revealed that rStPhy is composed of 26.65% α-helices, 5.26% β-sheets and 68.09% random coils at pH 5.0 and 60°C, the optima for the enzyme activity. The melting temperature (Tm) of the enzyme is ∼73°C. The 3D structure of rStPhy displayed characteristic signature sequences (RHGXRXP and HD) of HAP phytase. The catalytically important amino acids (Arg74, His75, Arg78, His368 and Asp369) were identified by docking and site-directed mutagenesis. Fluorescence quenching by N-bromosuccinimide (NBS) and CsCl exposed tryptophan residues surrounded by negative charges, which play a key role in maintaining structural integrity of rStPhy. Copyright © 2016 Elsevier B.V. All rights reserved.
Cotten, Matthew; Oude Munnink, Bas; Canuti, Marta; Deijs, Martin; Watson, Simon J; Kellam, Paul; van der Hoek, Lia
2014-01-01
We have developed a full genome virus detection process that combines sensitive nucleic acid preparation optimised for virus identification in fecal material with Illumina MiSeq sequencing and a novel post-sequencing virus identification algorithm. Enriched viral nucleic acid was converted to double-stranded DNA and subjected to Illumina MiSeq sequencing. The resulting short reads were processed with a novel iterative Python algorithm SLIM for the identification of sequences with homology to known viruses. De novo assembly was then used to generate full viral genomes. The sensitivity of this process was demonstrated with a set of fecal samples from HIV-1 infected patients. A quantitative assessment of the mammalian, plant, and bacterial virus content of this compartment was generated and the deep sequencing data were sufficient to assemble 12 complete viral genomes from 6 virus families. The method detected high levels of enteropathic viruses that are normally controlled in healthy adults, but may be involved in the pathogenesis of HIV-1 infection and will provide a powerful tool for virus detection and for analyzing changes in the fecal virome associated with HIV-1 progression and pathogenesis.
Cotten, Matthew; Oude Munnink, Bas; Canuti, Marta; Deijs, Martin; Watson, Simon J.; Kellam, Paul; van der Hoek, Lia
2014-01-01
We have developed a full genome virus detection process that combines sensitive nucleic acid preparation optimised for virus identification in fecal material with Illumina MiSeq sequencing and a novel post-sequencing virus identification algorithm. Enriched viral nucleic acid was converted to double-stranded DNA and subjected to Illumina MiSeq sequencing. The resulting short reads were processed with a novel iterative Python algorithm SLIM for the identification of sequences with homology to known viruses. De novo assembly was then used to generate full viral genomes. The sensitivity of this process was demonstrated with a set of fecal samples from HIV-1 infected patients. A quantitative assessment of the mammalian, plant, and bacterial virus content of this compartment was generated and the deep sequencing data were sufficient to assemble 12 complete viral genomes from 6 virus families. The method detected high levels of enteropathic viruses that are normally controlled in healthy adults, but may be involved in the pathogenesis of HIV-1 infection and will provide a powerful tool for virus detection and for analyzing changes in the fecal virome associated with HIV-1 progression and pathogenesis. PMID:24695106
Wang, Yongkang; Song, Xiaodan; Li, Xiaorong; Yang, Sang-tian; Zou, Xiang
2017-01-04
The aim was to explore the genome sequence of Aureobasidium pullulans CCTCC M2012223, analyze the key genes related to the biosynthesis of important metabolites, and provide genetic background for metabolic engineering. The complete genome of A. pullulans CCTCC M2012223 was sequenced on the Illumina HiSeq high-throughput sequencing platform. Fragment assembly, gene prediction, functional annotation, and GO/COG clustering were then performed and compared with those of five other A. pullulans varieties. The complete genome sequence of A. pullulans CCTCC M2012223 was 30756831 bp with an average GC content of 47.49%, and 9452 genes were successfully predicted. Genome-wide analysis showed that A. pullulans CCTCC M2012223 had the largest genome assembly size. Protein sequences involved in the pullulan and polymalic acid pathways were highly conserved in all six A. pullulans varieties. Although A. pullulans CCTCC M2012223 and A. pullulans var. melanogenum are closely related, some point mutations and insertions occurred in protein sequences involved in melanin biosynthesis. Genome information of A. pullulans CCTCC M2012223 was annotated and genes involved in the melanin, pullulan and polymalic acid pathways were compared, providing a theoretical basis for genetic modification of metabolic pathways in A. pullulans.
Sloma, A; Rufo, G A; Theriault, K A; Dwyer, M; Wilson, S W; Pero, J
1991-11-01
We have purified a minor extracellular serine protease from a strain of Bacillus subtilis bearing null mutations in five extracellular protease genes: apr, npr, epr, bpr, and mpr (A. Sloma, C. Rudolph, G. Rufo, Jr., B. Sullivan, K. Theriault, D. Ally, and J. Pero, J. Bacteriol. 172:1024-1029, 1990). During purification, this novel protease (Vpr) was found bound in a complex in the void volume after gel filtration chromatography. The amino-terminal sequence of the purified protein was determined, and an oligonucleotide probe was constructed on the basis of the amino acid sequence. This probe was used to clone the structural gene (vpr) for this protease. The gene encodes a primary product of 806 amino acids. The amino acid sequence of the mature protein was preceded by a signal sequence of approximately 28 amino acids and a prosequence of approximately 132 amino acids. The mature protein has a predicted molecular weight of 68,197; however, the isolated protein has an apparent molecular weight of 28,500, suggesting that Vpr undergoes C-terminal processing or proteolysis. The vpr gene maps in the ctrA-sacA-epr region of the chromosome and is not required for growth or sporulation.
Sorimachi, Kenji; Okayasu, Teiji; Ohhira, Shuji
2015-04-01
Normalized nucleotide and amino acid contents of complete genome sequences can be visualized as radar charts. The shapes of these charts depict the characteristics of an organism's genome. The normalized values calculated from the genome sequence theoretically exclude experimental errors. Further, because normalization is independent of both target size and kind, this procedure is applicable not only to single genes but also to whole genomes, which consist of a huge number of different genes. In this review, we discuss the applications of the normalization of the nucleotide and predicted amino acid contents of complete genomes to the investigation of genome structure and to evolutionary research from primitive organisms to Homo sapiens. Some of the results could never have been obtained from the analysis of individual nucleotide or amino acid sequences but were revealed only after the normalization of nucleotide and amino acid contents was applied to genome research. The discovery that genome structure was homogeneous was obtained only after normalization methods were applied to the nucleotide or predicted amino acid contents of genome sequences. Normalization procedures are also applicable to evolutionary research. Thus, normalization of the contents of whole genomes is a useful procedure that can help to characterize organisms.
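The size-independence of normalized contents described in the abstract above can be illustrated with a minimal Python sketch. It assumes that "normalized" means each symbol's count divided by the total number of counted symbols; the function name `normalized_content` and the toy sequences are hypothetical and are not taken from the review.

```python
from collections import Counter


def normalized_content(sequence, alphabet="ACGT"):
    """Return each symbol's share of the counted symbols in `sequence`.

    Assumption: normalization = count / total counted symbols.
    Symbols outside `alphabet` (e.g. N) are ignored.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in alphabet)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no symbols from the alphabet")
    return {ch: counts[ch] / total for ch in alphabet}


# A toy "gene" and a toy "genome" built from the same composition:
# the normalized profile is the same regardless of sequence length.
gene_profile = normalized_content("ACGTACGGTT")
genome_profile = normalized_content("ACGTACGGTT" * 1000)

for base in "ACGT":
    print(base, gene_profile[base], genome_profile[base])
```

The resulting dictionary is the kind of vector that could be drawn as one axis per symbol on a radar chart; passing the 20 amino acid letters as `alphabet` would give the corresponding protein profile.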
Motomura, Kenta; Nakamura, Morikazu; Otaki, Joji M.
2013-01-01
Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs. PMID:24688703
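The idea of an availability score, an observed word frequency relative to its expected frequency, can be sketched as follows. This is only an illustration under stated assumptions: the expected frequency is taken to be the product of the single-residue frequencies (an independence model), the score is taken to be a plain ratio, and the function name `availability_scores` is hypothetical. The SCS Package itself may define and compute the score differently.

```python
from collections import Counter
from math import prod


def availability_scores(sequences, k=3):
    """Observed / expected frequency for every length-k word in `sequences`.

    Assumption: the expected frequency of a word is the product of the
    frequencies of its residues, i.e. residues are treated as independent.
    """
    residue_counts = Counter()
    word_counts = Counter()
    for seq in sequences:
        residue_counts.update(seq)
        word_counts.update(seq[i:i + k] for i in range(len(seq) - k + 1))

    total_residues = sum(residue_counts.values())
    total_words = sum(word_counts.values())
    residue_freq = {aa: n / total_residues for aa, n in residue_counts.items()}

    scores = {}
    for word, n in word_counts.items():
        observed = n / total_words
        expected = prod(residue_freq[aa] for aa in word)
        scores[word] = observed / expected
    return scores


# Toy input: in "ABAB" the word AB occurs twice and BA once.
toy_scores = availability_scores(["ABAB"], k=2)
print(toy_scores)
```

A score above 1 marks a word used more often than the independence model predicts, and a score below 1 marks an under-used word; words that never occur do not appear in the result and would need to be enumerated separately.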
Motomura, Kenta; Nakamura, Morikazu; Otaki, Joji M
2013-01-01
Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs.
Domier, L L; Latorre, I J; Steinlage, T A; McCoppin, N; Hartman, G L
2003-10-01
The variability of North American and Asian strains and isolates of Soybean mosaic virus was investigated. First, polymerase chain reaction (PCR) products representing the coat protein (CP)-coding regions of 38 SMVs were analyzed for restriction fragment length polymorphisms (RFLP). Second, the nucleotide and predicted amino acid sequence variability of the P1-coding region of 18 SMVs and the helper component/protease (HC/Pro) and CP-coding regions of 25 SMVs were assessed. The CP nucleotide and predicted amino acid sequences were the most similar and predicted phylogenetic relationships similar to those obtained from RFLP analysis. Neither RFLP nor sequence analyses of the CP-coding regions grouped the SMVs by geographical origin. The P1 and HC/Pro sequences were more variable and separated the North American and Asian SMV isolates into two groups similar to previously reported differences in pathogenic diversity of the two sets of SMV isolates. The P1 region was the most informative of the three regions analyzed. To assess the biological relevance of the sequence differences in the HC/Pro and CP coding regions, the transmissibility of 14 SMV isolates by Aphis glycines was tested. All field isolates of SMV were transmitted efficiently by A. glycines, but the laboratory isolates analyzed were transmitted poorly. The amino acid sequences from most, but not all, of the poorly transmitted isolates contained mutations in the aphid transmission-associated DAG and/or KLSC amino acid sequence motifs of CP and HC/Pro, respectively.
Nucleic acid detection methods
Smith, C.L.; Yaar, R.; Szafranski, P.; Cantor, C.R.
1998-05-19
The invention relates to methods for rapidly determining the sequence and/or length of a target sequence. The target sequence may be a series of known or unknown repeat sequences which are hybridized to an array of probes. The hybridized array is digested with a single-strand nuclease and free 3'-hydroxyl groups extended with a nucleic acid polymerase. Nuclease cleaved heteroduplexes can be easily distinguished from nuclease uncleaved heteroduplexes by differential labeling. Probes and target can be differentially labeled with detectable labels. Matched target can be detected by cleaving resulting loops from the hybridized target and creating free 3'-hydroxyl groups. These groups are recognized and extended by polymerases added into the reaction system, which also adds or releases one label into solution. The resulting products can be analyzed in either solid phase or solution. These methods can be used to detect characteristic nucleic acid sequences, to determine target sequence and to screen for genetic defects and disorders. Assays can be conducted on solid surfaces allowing for multiple reactions to be conducted in parallel and, if desired, automated. 18 figs.
Cloning and purification of alpha-neurotoxins from king cobra (Ophiophagus hannah).
He, Ying-Ying; Lee, Wei-Hui; Zhang, Yun
2004-09-01
Thirteen complete and three partial cDNA sequences were cloned from the constructed king cobra (Ophiophagus hannah) venom gland cDNA library. Phylogenetic analysis of nucleotide sequences of king cobra with those from other snake venoms revealed that obtained cDNAs are highly homologous to snake venom alpha-neurotoxins. Alignment of deduced mature peptide sequences of the obtained clones with those of other reported alpha-neurotoxins from the king cobra venom indicates that our obtained 16 clones belong to long-chain neurotoxins (seven), short-chain neurotoxins (seven), weak toxin (one) and variant (one), respectively. Up to now, two out of 16 newly cloned king cobra alpha-neurotoxins have identical amino acid sequences with CM-11 and Oh-6A/6B, which have been characterized from the same venom. Furthermore, five long-chain alpha-neurotoxins and two short-chain alpha-neurotoxins were purified from crude venom and their N-terminal amino acid sequences were determined. The cDNAs encoding the putative precursors of the purified native peptide were also determined based on the N-terminal amino acid sequencing. The purified alpha-neurotoxins showed different lethal activities on mice.
Canine hippocampal formation composited into three-dimensional structure using MPRAGE.
Jung, Mi-Ae; Nahm, Sang-Soep; Lee, Min-Su; Lee, In-Hye; Lee, Ah-Ra; Jang, Dong-Pyo; Kim, Young-Bo; Cho, Zang-Hee; Eom, Ki-Dong
2010-07-01
This study was performed to anatomically illustrate the living canine hippocampal formation in three-dimensions (3D), and to evaluate its relationship to surrounding brain structures. Three normal beagle dogs were scanned on a MR scanner with inversion recovery segmented 3D gradient echo sequence (known as MP-RAGE: Magnetization Prepared Rapid Gradient Echo). The MRI data was manually segmented and reconstructed into a 3D model using the 3D slicer software tool. From the 3D model, the spatial relationships between hippocampal formation and surrounding structures were evaluated. With the increased spatial resolution and contrast of the MPRAGE, the canine hippocampal formation was easily depicted. The reconstructed 3D image allows easy understanding of the hippocampal contour and demonstrates the structural relationship of the hippocampal formation to surrounding structures in vivo.
Nishizawa, M; Nishizawa, K
2000-10-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.
Nishizawa, Manami; Nishizawa, Kazuhisa
2000-01-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed. PMID:11000273
Complete Amino Acid Sequence of a Copper/Zinc-Superoxide Dismutase from Ginger Rhizome.
Nishiyama, Yuki; Fukamizo, Tamo; Yoneda, Kazunari; Araki, Tomohiro
2017-04-01
Superoxide dismutase (SOD) is an antioxidant enzyme protecting cells from oxidative stress. Ginger (Zingiber officinale) is known for its antioxidant properties; however, there are no data on SODs from ginger rhizomes. In this study, we purified SOD from the rhizome of Z. officinale (Zo-SOD) and determined its complete amino acid sequence using N-terminal sequencing, amino acid analysis, and de novo sequencing by tandem mass spectrometry. Zo-SOD consists of 151 amino acids with two signature Cu/Zn-SOD motifs and has high similarity to other plant Cu/Zn-SODs. Multiple sequence alignment showed that Cu/Zn-binding residues and cysteines forming a disulfide bond, which are highly conserved in Cu/Zn-SODs, are also present in Zo-SOD. Phylogenetic analysis revealed that plant Cu/Zn-SODs clustered into distinct chloroplastic, cytoplasmic, and intermediate groups. Among them, only chloroplastic enzymes carried amino acid substitutions in the region functionally important for enzymatic activity, suggesting that chloroplastic SODs may have a function distinct from those of SODs localized in other subcellular compartments. The nucleotide sequence of the Zo-SOD coding region was obtained by reverse-translation, and the gene was synthesized, cloned, and expressed. The recombinant Zo-SOD demonstrated pH stability in the range of 5-10, which is similar to other reported Cu/Zn-SODs, and thermal stability in the range of 10-60 °C, which is higher than that for most plant Cu/Zn-SODs but lower than that of the enzyme from a Z. officinale relative, Curcuma aromatica.
Lathe, R
1985-05-05
Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.
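The trade-off described in the abstract above between a mixed probe pool and a single longer probe can be illustrated with a small calculation. The following is a minimal sketch, not the method of the paper: it assumes the standard genetic code, and the function names `probe_degeneracy` and `expected_chance_matches` are hypothetical. The chance-match estimate uses a uniform random-sequence model, which the abstract itself notes is too optimistic for real gene banks, so it should be read only as a lower-bound baseline.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def probe_degeneracy(peptide: str) -> int:
    """Number of distinct oligonucleotides needed to cover every codon
    combination that could encode the given peptide."""
    return prod(CODON_COUNTS[aa] for aa in peptide.upper())


def expected_chance_matches(probe_length: int, library_bases: int) -> float:
    """Expected number of exact matches to one probe of the given length,
    assuming (unrealistically) uniform, independent bases in the library."""
    return library_bases / 4 ** probe_length


if __name__ == "__main__":
    # A peptide rich in Met/Trp needs few probes; Leu/Ser/Arg need many.
    print(probe_degeneracy("MWK"))   # 1 * 1 * 2
    print(probe_degeneracy("LSR"))   # 6 * 6 * 6
    print(expected_chance_matches(17, 3_000_000_000))
```

Under these assumptions, peptides containing leucine, serine, or arginine inflate the mixed pool quickly, which is one reason a single longer probe chosen by codon usage can be attractive.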
Chang, Elizabeth; Pourmal, Sergei; Zhou, Chun; Kumar, Rupesh; Teplova, Marianna; Pavletich, Nikola P; Marians, Kenneth J; Erdjument-Bromage, Hediye
2016-07-01
In recent history, alternative approaches to Edman sequencing have been investigated, and to this end, the Association of Biomolecular Resource Facilities (ABRF) Protein Sequencing Research Group (PSRG) initiated studies in 2014 and 2015, looking into bottom-up and top-down N-terminal (Nt) dimethyl derivatization of standard quantities of intact proteins with the aim to determine Nt sequence information. We have expanded this initiative and used low picomole amounts of myoglobin to determine the efficiency of Nt-dimethylation. Application of this approach on protein domains, generated by limited proteolysis of overexpressed proteins, confirms that it is a universal labeling technique and is very sensitive when compared with Edman sequencing. Finally, we compared Edman sequencing and Nt-dimethylation of the same polypeptide fragments; results confirm that there is agreement in the identity of the Nt amino acid sequence between these 2 methods.
Use of plant fatty acyl hydroxylases to produce hydroxylated fatty acids and derivatives in plants
Somerville, Chris; van de Loo, Frank
1998-01-01
The present invention relates to the identification of nucleic acid sequences and constructs, and methods related thereto, and the use of these sequences and constructs to produce genetically modified plants for the purpose of altering the composition of plant oils, waxes and related compounds.
Use of plant fatty acyl hydroxylases to produce hydroxylated fatty acids and derivatives in plants
Somerville, Chris; van de Loo, Frank
2002-01-01
The present invention relates to the identification of nucleic acid sequences and constructs, and methods related thereto, and the use of these sequences and constructs to produce genetically modified plants for the purpose of altering the composition of plant oils, waxes and related compounds.
Use of plant fatty acyl hydroxylases to produce hydroxylated fatty acids and derivatives in plants
Somerville, Chris; van de Loo, Frank
1997-01-01
The present invention relates to the identification of nucleic acid sequences and constructs, and methods related thereto, and the use of these sequences and constructs to produce genetically modified plants for the purpose of altering the composition of plant oils, waxes and related compounds.
Close, Dan; Ojumu, John O.; Zhang, Gui X.
2016-11-03
Cryptococcus terricola JCM 24523 has recently been identified as an oleaginous yeast capable of converting starch into fatty acids. Here, this draft genome sequence provides a platform for elucidating its fatty acid production potential and supporting comparisons with other oleaginous species.
Ultra high-throughput nucleic acid sequencing as a tool for virus discovery in the turkey gut.
USDA-ARS's Scientific Manuscript database
Recently, the use of the next generation of nucleic acid sequencing technology (i.e., 454 pyrosequencing, as developed by Roche/454 Life Sciences) has allowed an in-depth look at the uncultivated microorganisms present in complex environmental samples, including samples with agricultural importance....
USDA-ARS's Scientific Manuscript database
Vinylglycines are non-proteinogenic amino acids that inhibit amino acid metabolism and ethylene production. In this report, we describe the draft genome sequences of seven isolates of Pseudomonas that produce 4-formylaminooxyvinylglycine, a compound known to inhibit the germination of grasses and t...
Leonard, Michael T; Valladares, Ricardo B; Ardissone, Alexandria; Gonzalez, Claudio F; Lorca, Graciela L; Triplett, Eric W
2014-05-08
We report here the complete genome sequences of Lactobacillus johnsonii strain N6.2, a homofermentative lactic acid intestinal bacterium, and Lactobacillus reuteri strain TD1, a heterofermentative lactic acid intestinal bacterium, both isolated from a type 1 diabetes-resistant rat model.
Bao, Weichen; Mi, Zhihui; Xu, Haiyan; Zheng, Yi; Kwok, Lai Yu; Zhang, Heping; Zhang, Wenyi
2016-01-01
The present study applied the PacBio single molecule, real-time sequencing technology (SMRT) in evaluating the quality of silage production. Specifically, we produced four types of Medicago sativa silages by using four different lactic acid bacteria-based additives (AD-I, AD-II, AD-III and AD-IV). We monitored the changes in pH, organic acids (including butyric acid, the ratio of acetic acid/lactic acid, γ-aminobutyric acid, 4-hydroxybenzoic acid and phenyl lactic acid), mycotoxins, and bacterial microbiota during silage fermentation. Our results showed that the use of the additives was beneficial to the silage fermentation by enhancing the overall reduction in pH and mycotoxins, while increasing the organic acid content. By SMRT analysis of the microbial composition in eight silage samples, we found that the bacterial species number and relative abundances shifted noticeably after fermentation. Such changes were specific to the LAB species in the additives. In particular, Bacillus megaterium was the initial dominant species in the raw materials; after the fermentation process, Pediococcus acidilactici and Lactobacillus plantarum became the most prevalent species, both of which were intrinsically present in the LAB additives. Our data have demonstrated that the SMRT sequencing platform is applicable in assessing the quality of silage. PMID:27340760
Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection.
Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T; Carr, Christopher E
2017-08-01
Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry-dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a "universal" nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars. Key Words: Life-detection instruments-Nucleic acids-Mars-Panspermia. Astrobiology 17, 747-760.
Bao, Weichen; Mi, Zhihui; Xu, Haiyan; Zheng, Yi; Kwok, Lai Yu; Zhang, Heping; Zhang, Wenyi
2016-06-24
The present study applied the PacBio single molecule, real-time sequencing technology (SMRT) in evaluating the quality of silage production. Specifically, we produced four types of Medicago sativa silages by using four different lactic acid bacteria-based additives (AD-I, AD-II, AD-III and AD-IV). We monitored the changes in pH, organic acids (including butyric acid, the ratio of acetic acid/lactic acid, γ-aminobutyric acid, 4-hydroxybenzoic acid and phenyl lactic acid), mycotoxins, and bacterial microbiota during silage fermentation. Our results showed that the use of the additives was beneficial to the silage fermentation by enhancing the overall reduction in pH and mycotoxins, while increasing the organic acid content. By SMRT analysis of the microbial composition in eight silage samples, we found that the bacterial species number and relative abundances shifted noticeably after fermentation. Such changes were specific to the LAB species in the additives. In particular, Bacillus megaterium was the initial dominant species in the raw materials; after the fermentation process, Pediococcus acidilactici and Lactobacillus plantarum became the most prevalent species, both of which were intrinsically present in the LAB additives. Our data have demonstrated that the SMRT sequencing platform is applicable in assessing the quality of silage.
A statistical view of FMRFamide neuropeptide diversity.
Espinoza, E; Carrigan, M; Thomas, S G; Shaw, G; Edison, A S
2000-01-01
FMRFamide-like peptide (FLP) amino acid sequences have been collected and statistically analyzed. FLP amino acid composition as a function of position in the peptide is graphically presented for several major phyla. Total amino acid composition and frequencies of pairs of FLP amino acids have been computed and compared with corresponding values from the entire GenBank protein sequence database. The data for pairwise distributions of amino acids should help in future structure-function studies of FLPs. To aid in future peptide discovery, a computer program and search protocol were developed to identify FLPs from the GenBank protein database without the use of keywords.
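The two statistics named in the abstract above, composition by position and frequencies of adjacent amino acid pairs, can be sketched in a few lines. This is an illustrative sketch only and not the authors' program: the function names are hypothetical, and counting positions from the C-terminus is an assumption made here because FLPs share a C-terminal motif. The two example peptides are used purely as sample input.

```python
from collections import Counter


def positional_counts(peptides):
    """Residue counts per position, indexed from the C-terminus.
    counts[0] describes the last residue of each peptide."""
    if not peptides:
        return []
    longest = max(len(p) for p in peptides)
    counts = [Counter() for _ in range(longest)]
    for peptide in peptides:
        for offset, residue in enumerate(reversed(peptide)):
            counts[offset][residue] += 1
    return counts


def pair_frequencies(peptides):
    """Relative frequency of each adjacent residue pair across all peptides."""
    pairs = Counter()
    for peptide in peptides:
        pairs.update(peptide[i:i + 2] for i in range(len(peptide) - 1))
    total = sum(pairs.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in pairs.items()}


if __name__ == "__main__":
    sample = ["FMRF", "FLRF"]  # illustrative input only
    print(positional_counts(sample))
    print(pair_frequencies(sample))
```

Comparing such pair frequencies against the same statistic computed over a whole protein database would show which pairs are enriched in the peptide family, which is the kind of comparison the abstract describes.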
The amino acid motif L/IIxxFE defines a novel actin-binding sequence in PDZ-RhoGEF
Banerjee, Jayashree; Fischer, Christopher C.; Wedegaertner, Philip B.
2009-01-01
PDZ-RhoGEF is a member of the regulator of G protein signaling (RGS) domain-containing RhoGEFs (RGS-RhoGEFs) that link activated heterotrimeric G protein α subunits of the G12 family to activation of the small GTPase RhoA. Unique among the RGS-RhoGEFs, PDZ-RhoGEF contains a short sequence that localizes the protein to the actin cytoskeleton. In this report, we demonstrate that the actin-binding domain, located between amino acids 561–585, directly binds to F-actin in vitro. Extensive mutagenesis identifies isoleucine 568, isoleucine 569, phenylalanine 572, and glutamic acid 573 as necessary for binding to actin and for co-localization with the actin cytoskeleton in cells. These results define a novel actin-binding sequence in PDZ-RhoGEF with a critical amino acid motif of IIxxFE. Moreover, sequence analysis identifies a similar actin-binding motif in the N-terminus of the RhoGEF frabin, and, as with PDZ-RhoGEF, mutagenesis and actin interaction experiments demonstrate a motif of LIxxFE, consisting of the key amino acids leucine 23, isoleucine 24, phenylalanine 27, and glutamic acid 28. Taken together, results with PDZ-RhoGEF and frabin identify a novel actin binding sequence. Lastly, inducible dimerization of the actin-binding region of PDZ-RhoGEF revealed a dimerization-dependent actin bundling activity in vitro. PDZ-RhoGEF exists in cells as a dimer, raising the possibility that PDZ-RhoGEF could influence actin structure independent of its ability to activate RhoA. PMID:19618964
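A motif written as L/IIxxFE (leucine or isoleucine, isoleucine, any two residues, phenylalanine, glutamic acid) maps directly onto a regular expression. The sketch below shows one way to scan a protein sequence for it; the function name `find_motif` and the test sequence are made up for illustration and are not taken from the paper. A pattern match of this kind only flags candidates and says nothing about actual actin binding.

```python
import re

# [LI] = Leu or Ile, I = Ile, .. = any two residues, F = Phe, E = Glu.
# The lookahead lets overlapping occurrences be reported as well.
MOTIF = re.compile(r"(?=([LI]I..FE))")


def find_motif(protein: str):
    """Return (1-based start position, matched residues) for each occurrence."""
    return [(m.start() + 1, m.group(1)) for m in MOTIF.finditer(protein.upper())]


if __name__ == "__main__":
    made_up_sequence = "MKLIAAFEGGIIPQFE"  # hypothetical sequence, not a real protein
    for position, residues in find_motif(made_up_sequence):
        print(position, residues)
```

Positions are reported 1-based to match the residue numbering convention used in the abstract.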
Koch, P J; Goldschmidt, M D; Walsh, M J; Zimbelmann, R; Schmelz, M; Franke, W W
1991-05-01
Desmosomes are cell-type-specific intercellular junctions found in epithelium, myocardium and certain other tissues. They consist of assemblies of molecules involved in the adhesion of specific cell types and in the anchorage of cell-type-specific cytoskeletal elements, the intermediate-size filaments, to the plasma membrane. To explore the individual desmosomal components and their functions we have isolated DNA clones encoding the desmosomal glycoprotein, desmocollin, using antibodies and a cDNA expression library from bovine muzzle epithelium. The cDNA-deduced amino-acid sequence of desmocollin (presently we cannot decide to which of the two desmocollins, DC I or DC II, this clone relates) defines a polypeptide with a calculated molecular weight of 85,000, with a single candidate sequence of 24 amino acids sufficiently long for a transmembrane arrangement, and an extracellular aminoterminal portion of 561 amino acid residues, compared to a cytoplasmic part of only 176 amino acids. Amino acid sequence comparisons have revealed that desmocollin is highly homologous to members of the cadherin family of cell adhesion molecules, including the previously sequenced desmoglein, another desmosome-specific cadherin. Using riboprobes derived from cDNAs for Northern-blot analyses, we have identified an mRNA of approximately 6 kb in stratified epithelia such as muzzle epithelium and tongue mucosa but not in two epithelial cell culture lines containing desmosomes and desmoplakins. The difference may indicate drastic differences in mRNA concentration or the existence of cell-type-specific desmocollin subforms. The molecular topology of desmocollin(s) is discussed in relation to possible functions of the individual molecular domains.
A DNA sequence element that advances replication origin activation time in Saccharomyces cerevisiae.
Pohl, Thomas J; Kolor, Katherine; Fangman, Walton L; Brewer, Bonita J; Raghuraman, M K
2013-11-06
Eukaryotic origins of DNA replication undergo activation at various times in S-phase, allowing the genome to be duplicated in a temporally staggered fashion. In the budding yeast Saccharomyces cerevisiae, the activation times of individual origins are not intrinsic to those origins but are instead governed by surrounding sequences. Currently, there are two examples of DNA sequences that are known to advance origin activation time, centromeres and forkhead transcription factor binding sites. By combining deletion and linker scanning mutational analysis with two-dimensional gel electrophoresis to measure fork direction in the context of a two-origin plasmid, we have identified and characterized a 19- to 23-bp and a larger 584-bp DNA sequence that are capable of advancing origin activation time.
Tong, C G; Reichler, S; Blumenthal, S; Balk, J; Hsieh, H L; Roux, S J
1997-01-01
A cDNA encoding a nucleolar protein was selected from a pea (Pisum sativum) plumule library, cloned, and sequenced. The translated sequence of the cDNA has significant percent identity to Xenopus laevis nucleolin (31%), the alfalfa (Medicago sativa) nucleolin homolog (66%), and the yeast (Saccharomyces cerevisiae) nucleolin homolog (NSR1) (28%). It also has sequence patterns in its primary structure that are characteristic of all nucleolins, including an N-terminal acidic motif, RNA recognition motifs, and a C-terminal Gly- and Arg-rich domain. By immunoblot analysis, the polyclonal antibodies used to select the cDNA bind selectively to a 90-kD protein in purified pea nuclei and nucleoli and to an 88-kD protein in extracts of Escherichia coli expressing the cDNA. In immunolocalization assays of pea plumule cells, the antibodies stained primarily a region surrounding the fibrillar center of nucleoli, where animal nucleolins are typically found. Southern analysis indicated that the pea nucleolin-like protein is encoded by a single gene, and northern analysis showed that the labeled cDNA binds to a single band of RNA, approximately the same size as the cDNA. After irradiation of etiolated pea seedlings by red light, the mRNA level in plumules decreased during the 1st hour and then increased to a peak of six times the 0-h level at 12 h. Far-red light reversed this effect of red light, and the mRNA accumulation from red/far-red light irradiation was equal to that found in the dark control. This indicates that phytochrome may regulate the expression of this gene. PMID:9193096
Folic Acid Food Fortification—Its History, Effect, Concerns, and Future Directions
Crider, Krista S.; Bailey, Lynn B.; Berry, Robert J.
2011-01-01
Periconceptional intake of folic acid is known to reduce a woman’s risk of having an infant affected by a neural tube birth defect (NTD). National programs to mandate fortification of food with folic acid have reduced the prevalence of NTDs worldwide. Uncertainty surrounding possible unintended consequences has led to concerns about higher folic acid intake and food fortification programs. This uncertainty emphasizes the need to continually monitor fortification programs for accurate measures of their effect and the ability to address concerns as they arise. This review highlights the history, effect, concerns, and future directions of folic acid food fortification programs. PMID:22254102
Complete genome sequence of duck Tembusu virus, isolated from Muscovy ducks in southern China.
Zhu, Wanjun; Chen, Jidang; Wei, Chunya; Wang, Heng; Huang, Zhen; Zhang, Minze; Tang, Fengfeng; Xie, Jiexiong; Liang, Huanbin; Zhang, Guihong; Su, Shuo
2012-12-01
We report here the complete genomic sequence of the duck Tembusu virus (DTMUV) WJ-1 strain, isolated from Muscovy ducks. This is the first complete genome sequence of DTMUV reported in southern China. Compared with the other strains (TA, GH-2, YY5, and ZJ-407) that were previously found in eastern China, WJ-1 bears a few differences in the nucleotide and amino acid sequences. We found that there are 47 mutations of amino acids encoded by the whole open reading frame (ORF) among these five strains. The whole-genome sequence of DTMUV will help in understanding the epidemiology and molecular characteristics of duck Tembusu virus in southern China.
Liu, Bin; Ertesvåg, Helga; Aasen, Inga Marie; Vadstein, Olav; Brautaset, Trygve; Heggeset, Tonje Marita Bjerkan
2016-06-01
Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA). Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276), with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data has been deposited at DDBJ/EMBL/Genbank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids.
Reference System of DNA and Protein Sequences on CD-ROM
NASA Astrophysics Data System (ADS)
Nasu, Hisanori; Ito, Toshiaki
DNASIS-DBREF31 is a database of DNA and protein sequences on optical Compact Disc (CD) ROM, developed and commercialized by Hitachi Software Engineering Co., Ltd. Both nucleic acid base sequences and protein amino acid sequences can be retrieved from a single CD-ROM. Existing databases are offered as on-line services, on floppy disks, or on magnetic tape, each of which has drawbacks in usability or storage capacity. DNASIS-DBREF31 adopts CD-ROM as the database medium to achieve both mass storage and personal use of the database.
Compositions and methods for the expression of selenoproteins in eukaryotic cells
Gladyshev, Vadim [Lincoln, NE; Novoselov, Sergey [Puschino, RU
2012-09-25
Recombinant nucleic acid constructs for the efficient expression of eukaryotic selenoproteins and related methods for production of recombinant selenoproteins are provided. The nucleic acid constructs comprise novel selenocysteine insertion sequence (SECIS) elements. Certain novel SECIS elements of the invention contain non-canonical quartet sequences. Other novel SECIS elements provided by the invention are chimeric SECIS elements comprising a canonical SECIS element that contains a non-canonical quartet sequence and chimeric SECIS elements comprising a non-canonical SECIS element that contains a canonical quartet sequence. The novel SECIS elements of the invention facilitate the insertion of selenocysteine residues into recombinant polypeptides.
2013-01-01
predicted amino acid sequences of the three encoded BmAChEs were no more closely related to one another than AChEs from different organisms and their...solely on nucleotide and amino acid sequence similarity; however, the cholinesterase gene family contains a number of related enzymes and structural...acetylcholinesterase of P. papatasi was cloned, sequenced, and expressed in the baculovirus system to generate a recombinant enzyme for biochemical
Betsuyaku, Shigeyuki; Katou, Shinpei; Takebayashi, Yumiko; Sakakibara, Hitoshi; Nomura, Nobuhiko; Fukuda, Hiroo
2018-01-01
The innate immune response is, in the first place, elicited at the site of infection. Thus, the host response can be different among the infected cells and the cells surrounding them. Effector-triggered immunity (ETI), a form of innate immunity in plants, is triggered by specific recognition between pathogen effectors and their corresponding plant cytosolic immune receptors, resulting in rapid localized cell death known as hypersensitive response (HR). HR cell death is usually limited to a few cells at the infection site, and is surrounded by a few layers of cells massively expressing defense genes such as Pathogenesis-Related Gene 1 (PR1). This virtually concentric pattern of the cellular responses in ETI is proposed to be regulated by a concentration gradient of salicylic acid (SA), a phytohormone accumulated around the infection site. Recent studies demonstrated that jasmonic acid (JA), another phytohormone known to be mutually antagonistic to SA in many cases, is also accumulated in and required for ETI, suggesting that ETI is a unique case. However, the molecular basis for this uniqueness remained largely to be solved. Here, using intravital time-lapse imaging, we found that the JA signaling pathway is activated in the cells surrounding the central SA-active cells around the infection sites in Arabidopsis thaliana. This distinct spatial organization explains how these two phytohormone pathways in a mutually antagonistic relationship can be activated simultaneously during ETI. Our results re-emphasize that the spatial consideration is a key strategy to gain mechanistic insights into the apparently complex signaling cross-talk in immunity. © The Author 2017. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists.
Roy, Avijit; Stone, Andrew L; Shao, Jonathan; Otero-Colina, Gabriel; Wei, Gang; Choudhary, Nandlal; Achor, Diann; Levy, Laurene; Nakhla, Mark K; Hartung, John S; Schneider, William L; Brlansky, Ronald H
2015-04-01
Citrus leprosis is one of the most destructive diseases of Citrus spp. and is associated with two unrelated virus groups that produce particles primarily in either the cytoplasm or nucleus of infected plant cells. Symptoms of leprosis, including chlorotic spots surrounded by yellow haloes on leaves and necrotic spots on twigs and fruit, were observed on leprosis-affected mandarin and navel sweet orange trees in the state of Querétaro, Mexico. Serological and molecular assays showed that the cytoplasmic types of Citrus leprosis virus (CiLV-C) often associated with leprosis symptomatic tissues were absent. However, using transmission electron microscopy, bullet-shaped rhabdovirus-like virions were observed in the nuclei and cytoplasm of the citrus leprosis-infected leaf tissues. An analysis of small RNA populations from symptomatic tissue was carried out to determine the genome sequence of the rhabdovirus-like particles observed in the citrus leprosis samples. The complete genome sequence showed that the nuclear type of CiLV (CiLV-N) present in the samples consisted of two negative-sense RNAs: 6,268-nucleotide (nt)-long RNA1 and 5,847-nt-long RNA2, excluding the poly(A) tails. CiLV-N had a genome organization identical to that of Orchid fleck virus (OFV), with the exception of shorter 5' untranslated regions in RNA1 (53 versus 205 nt) and RNA2 (34 versus 182 nt). Phylogenetic trees constructed with the amino acid sequences of the nucleocapsid (N) and glycoproteins (G) and the RNA polymerase (L protein) showed that CiLV-N clusters with OFV. Furthermore, phylogenetic analyses of N protein established CiLV-N as a member of the proposed genus Dichorhavirus. Reverse-transcription polymerase chain reaction primers for the detection of CiLV-N were designed based on the sequence of the N gene and the assay was optimized and tested to detect the presence of CiLV-N in both diseased and symptom-free plants.
Lerner, D R; Raikhel, N V
1992-06-05
Chitin-binding proteins are present in a wide range of plant species, including both monocots and dicots, even though these plants contain no chitin. To investigate the relationship between in vitro antifungal and insecticidal activities of chitin-binding proteins and their unknown endogenous functions, the stinging nettle lectin (Urtica dioica agglutinin, UDA) cDNA was cloned using a synthetic gene as the probe. The nettle lectin cDNA clone contained an open reading frame encoding 374 amino acids. Analysis of the deduced amino acid sequence revealed a 21-amino acid putative signal sequence and the 86 amino acids encoding the two chitin-binding domains of nettle lectin. These domains were fused to a 19-amino acid "spacer" domain and a 244-amino acid carboxyl extension with partial identity to a chitinase catalytic domain. The authenticity of the cDNA clone was confirmed by deduced amino acid sequence identity with sequence data obtained from tryptic digests, RNA gel blot, and polymerase chain reaction analyses. RNA gel blot analysis also showed the nettle lectin message was present primarily in rhizomes and inflorescence (with immature seeds) but not in leaves or stems. Chitinase enzymatic activity was found when the chitinase-like domain alone or the chitinase-like domain with the chitin-binding domains were expressed in Escherichia coli. This is the first example of a chitin-binding protein with both a duplication of the 43-amino acid chitin-binding domain and a fusion of the chitin-binding domains to a structurally unrelated domain, the chitinase domain.