Du, Q S; Ma, Y; Xie, N Z; Huang, R B
2014-01-01
In the design of peptide inhibitors, the enormous variety of possible peptide sequences is a major concern. Building on the rapid accumulation of experimental peptide data and databases, a statistical method is suggested for peptide inhibitor design. In the two-level peptide prediction network (2L-QSAR), one level is the physicochemical properties of amino acids and the other level is the peptide sequence position. The activity contributions of amino acids are functions of the physicochemical properties and the sequence positions. In the prediction equation, two weight coefficient sets {ak} and {bl} are assigned to the physicochemical properties and to the sequence positions, respectively. After the two coefficient sets are optimized against the experimental data of known peptide inhibitors using the iterative double least square (IDLS) procedure, the coefficients are used to evaluate the bioactivities of newly designed peptide inhibitors. The two-level prediction network can be applied to the design of peptide inhibitors aimed at different target proteins, or at different positions of a protein. A notable advantage of the two-level statistical algorithm is that it needs no structural information about the host protein. It may also provide useful insight into the amino acid properties and the roles of sequence positions.
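The abstract does not give the prediction equation, so the following is only a minimal sketch of how an alternating least-squares fit of two coefficient sets could look. It assumes a bilinear model in which the activity of peptide n is the sum over positions l and properties k of b_l * a_k * X[n, l, k]; the array layout `X`, the function names `idls_fit` and `predict`, and the use of NumPy are all illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def idls_fit(X, y, n_iter=100):
    """Alternating least squares for y[n] ~ sum_l sum_k b[l] * a[k] * X[n, l, k].

    X : array (n_peptides, n_positions, n_properties); X[n, l, k] is property k
        of the residue at position l of peptide n (hypothetical encoding).
    y : array (n_peptides,) of measured bioactivities.
    """
    _, n_pos, n_prop = X.shape
    a = np.ones(n_prop)  # property weights {a_k}
    b = np.ones(n_pos)   # position weights {b_l}
    for _ in range(n_iter):
        # Hold b fixed: the model is linear in a, so solve for a directly.
        A = np.einsum("nlk,l->nk", X, b)
        a = np.linalg.lstsq(A, y, rcond=None)[0]
        # Hold a fixed: the model is linear in b, so solve for b directly.
        B = np.einsum("nlk,k->nl", X, a)
        b = np.linalg.lstsq(B, y, rcond=None)[0]
    return a, b

def predict(X, a, b):
    """Predicted activity of each peptide under the assumed bilinear model."""
    return np.einsum("nlk,k,l->n", X, a, b)
```

Under this assumed model the two sets are only determined up to a common scale factor (multiplying every a_k by c and dividing every b_l by c leaves the predictions unchanged), so the fitted weights are best compared as ratios within each set.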
Schaeffer, E; Sninsky, J J
1984-01-01
Proteins that are related evolutionarily may have diverged at the level of primary amino acid sequence while maintaining similar secondary structures. Computer analysis has been used to compare the open reading frames of the hepatitis B virus to those of the woodchuck hepatitis virus at the level of amino acid sequence, and to predict the relative hydrophilic character and the secondary structure of putative polypeptides. Similarity is seen at the levels of relative hydrophilicity and secondary structure, in the absence of sequence homology. These data reinforce the proposal that these open reading frames encode viral proteins. Computer analysis of this type can be more generally used to establish structural similarities between proteins that do not share obvious sequence homology as well as to assess whether an open reading frame is fortuitous or codes for a protein. PMID:6585835
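A comparison of this kind can be sketched as a sliding-window profile for each protein followed by a correlation between the two profiles. The sketch below assumes the Kyte-Doolittle hydropathy scale and a Pearson correlation over equal-length, already-aligned profiles; the abstract names neither the scale nor the similarity measure, so both choices, and the function names, are illustrative.

```python
# Kyte-Doolittle hydropathy values (higher = more hydrophobic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def hydropathy_profile(seq, window=7):
    """Mean hydropathy over each window of `window` consecutive residues."""
    vals = [KD[r] for r in seq.upper()]
    return [sum(vals[i:i + window]) / window
            for i in range(len(vals) - window + 1)]

def pearson(x, y):
    """Pearson correlation of two profiles, truncated to the shorter length."""
    n = min(len(x), len(y))
    x, y = x[:n], y[:n]
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((p - mx) * (q - my) for p, q in zip(x, y))
    sxx = sum((p - mx) ** 2 for p in x)
    syy = sum((q - my) ** 2 for q in y)
    return sxy / (sxx * syy) ** 0.5
```

A high correlation between the profiles of two sequences that share few identical residues would be the numerical counterpart of "similarity at the level of hydrophilicity in the absence of sequence homology". Truncating to the shorter profile is a simplification; real comparisons need the profiles brought into register first.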
Alkaptonuria and Pompe disease in one patient: metabolic and molecular analysis.
Zouheir Habbal, Mohammad; Bou Assi, Tarek; Mansour, Hicham
2013-04-29
Pompe disease is characterised by deficiency of acid α-glucosidase that results in abnormal glycogen deposition in the muscles. Alkaptonuria is caused by a defect in the enzyme homogentisate 1,2-dioxygenase with subsequent accumulation of homogentisic acid. We report the case of a 6-year-old boy diagnosed with Pompe disease and alkaptonuria. Urine organic acids and α-glucosidase were measured. Homogentisate 1,2-dioxygenase (HGO) and acid alpha-glucosidase (GAA) genes were sequenced by Sanger DNA sequencing. The level of α-glucosidase in white blood cells was markedly decreased (4 nm/mg) while the level of homogentisic acid was markedly increased (15 027 mmol/mol creatine). GAA sequencing detected two heterozygous GAA mutations (c.670C>T and c.1064T>C) while HGO sequencing revealed three polymorphisms in exons 4, 5 and 6, respectively. To the best of our knowledge, this is the first reported instance of Pompe disease and alkaptonuria occurring in the same individual.
Alkaptonuria and Pompe disease in one patient: metabolic and molecular analysis
Habbal, Mohammad Zouheir; Bou Assi, Tarek; Mansour, Hicham
2013-01-01
Pompe disease is characterised by deficiency of acid α-glucosidase that results in abnormal glycogen deposition in the muscles. Alkaptonuria is caused by a defect in the enzyme homogentisate 1,2-dioxygenase with subsequent accumulation of homogentisic acid. We report the case of a 6-year-old boy diagnosed with Pompe disease and alkaptonuria. Urine organic acids and α-glucosidase were measured. Homogentisate 1,2-dioxygenase (HGO) and acid alpha-glucosidase (GAA) genes were sequenced by Sanger DNA sequencing. The level of α-glucosidase in white blood cells was markedly decreased (4 nm/mg) while the level of homogentisic acid was markedly increased (15 027 mmol/mol creatine). GAA sequencing detected two heterozygous GAA mutations (c.670C>T and c.1064T>C) while HGO sequencing revealed three polymorphisms in exons 4, 5 and 6, respectively. To the best of our knowledge, this is the first reported instance of Pompe disease and alkaptonuria occurring in the same individual. PMID:23632174
Kimura, Tomohiro; Nakano, Toshiki; Yamaguchi, Toshiyasu; Sato, Minoru; Ogawa, Tomohisa; Muramoto, Koji; Yokoyama, Takehiko; Kan-No, Nobuhiro; Nagahisa, Eizou; Janssen, Frank; Grieshaber, Manfred K
2004-01-01
The complete complementary DNA sequences of genes presumably coding for opine dehydrogenases from Arabella iricolor (sandworm), Haliotis discus hannai (abalone), and Patinopecten yessoensis (scallop) were determined, and partial cDNA sequences were derived for Meretrix lusoria (Japanese hard clam) and Spisula sachalinensis (Sakhalin surf clam). The primers ODH-9F and ODH-11R proved useful for amplifying the sequences for opine dehydrogenases from the four mollusk species investigated in this study. The sequence of the sandworm was obtained using primers constructed from the amino acid sequence of tauropine dehydrogenase, the main opine dehydrogenase in A. iricolor. The complete cDNA sequences of A. iricolor, H. discus hannai, and P. yessoensis encode 397, 400, and 405 amino acids, respectively. All sequences were aligned and compared with published databank sequences of Loligo opalescens, Loligo vulgaris (squid), Sepia officinalis (cuttlefish), and Pecten maximus (scallop). As expected, a high level of homology was observed for the cDNA from closely related species, such as cephalopods or scallops, whereas cDNA from the other species showed lower-level homologies. A similar trend was observed when the deduced amino acid sequences were compared. Furthermore, alignment of these sequences revealed some structural motifs that are possibly related to the binding sites of the substrates. The phylogenetic trees derived from the nucleotide and amino acid sequences were consistent with the classification of species resulting from classical taxonomic analyses.
Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences
Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.
2012-01-01
ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136
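Identity figures such as the 63% and 54% quoted above are computed from a pairwise alignment. The following is a minimal sketch of one common convention, assuming the two sequences are already aligned to equal length with `-` as the gap character and that gapped columns are excluded from the denominator; the function name is illustrative, and the abstract does not say which convention or tool the authors used.

```python
def percent_identity(aln_a, aln_b, gap="-"):
    """Percent of identical residues over aligned, non-gap columns."""
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aln_a.upper(), aln_b.upper()):
        if x == gap or y == gap:
            continue  # skip columns where either sequence has a gap
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared

# Five ungapped columns, four of them identical.
print(percent_identity("ACGT-A", "ACCTGA"))  # 80.0
```

Tools differ on the denominator (alignment length, shorter sequence length, or ungapped columns), so identity values are only comparable when the same convention is used throughout.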
Materials and methods for the alteration of enzyme and acetyl CoA levels in plants
Nikolau, Basil J.; Wurtele, Eve S.; Oliver, David J.; Behal, Robert; Schnable, Patrick S.; Ke, Jinshan; Johnson, Jerry L.; Allred, Carolyn C.; Fatland, Beth; Lutziger, Isabelle; Wen, Tsui-Jung
2005-09-13
The present invention provides nucleic acid and amino acid sequences of acetyl CoA synthetase (ACS), plastidic pyruvate dehydrogenase (pPDH), ATP citrate lyase (ACL), Arabidopsis pyruvate decarboxylase (PDC), and Arabidopsis aldehyde dehydrogenase (ALDH), specifically ALDH-2 and ALDH-4. The present invention also provides a recombinant vector comprising a nucleic acid sequence encoding one of the aforementioned enzymes, an antisense sequence thereto or a ribozyme therefor, a cell transformed with such a vector, antibodies to the enzymes, a plant cell, a plant tissue, a plant organ or a plant in which the level of an enzyme has been altered, and a method of producing such a plant cell, plant tissue, plant organ or plant. Desirably, alteration of the level of enzyme results in an alteration of the level of acetyl CoA in the plant cell, plant tissue, plant organ or plant. In addition, the present invention provides a recombinant vector comprising an antisense sequence of a nucleic acid sequence encoding pyruvate decarboxylase (PDC), the E1α subunit of pPDH, the E1β subunit of pPDH, the E2 subunit of pPDH, mitochondrial pyruvate dehydrogenase (mtPDH) or aldehyde dehydrogenase (ALDH) or a ribozyme that can cleave an RNA molecule encoding PDC, E1α pPDH, E1β pPDH, E2 pPDH, mtPDH or ALDH.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nikolau, Basil J; Wurtele, Eve S; Oliver, David J
The present invention provides nucleic acid and amino acid sequences of acetyl CoA synthetase (ACS), plastidic pyruvate dehydrogenase (pPDH), ATP citrate lyase (ACL), Arabidopsis pyruvate decarboxylase (PDC), and Arabidopsis aldehyde dehydrogenase (ALDH), specifically ALDH-2 and ALDH-4. The present invention also provides a recombinant vector comprising a nucleic acid sequence encoding one of the aforementioned enzymes, an antisense sequence thereto or a ribozyme therefor, a cell transformed with such a vector, antibodies to the enzymes, a plant cell, a plant tissue, a plant organ or a plant in which the level of an enzyme has been altered, and a method of producing such a plant cell, plant tissue, plant organ or plant. Desirably, alteration of the level of enzyme results in an alteration of the level of acetyl CoA in the plant cell, plant tissue, plant organ or plant. In addition, the present invention provides a recombinant vector comprising an antisense sequence of a nucleic acid sequence encoding pyruvate decarboxylase (PDC), the E1α subunit of pPDH, the E1β subunit of pPDH, the E2 subunit of pPDH, mitochondrial pyruvate dehydrogenase (mtPDH) or aldehyde dehydrogenase (ALDH) or a ribozyme that can cleave an RNA molecule encoding PDC, E1α pPDH, E1β pPDH, E2 pPDH, mtPDH or ALDH.
Cloning and High-Level Expression of α-Galactosidase cDNA from Penicillium purpurogenum
Shibuya, Hajime; Nagasaki, Hiroaki; Kaneko, Satoshi; Yoshida, Shigeki; Park, Gwi Gun; Kusakabe, Isao; Kobayashi, Hideyuki
1998-01-01
The cDNA coding for Penicillium purpurogenum α-galactosidase (αGal) was cloned and sequenced. The deduced amino acid sequence of the α-Gal cDNA showed that the mature enzyme consisted of 419 amino acid residues with a molecular mass of 46,334 Da. The derived amino acid sequence of the enzyme showed similarity to eukaryotic αGals from plants, animals, yeasts, and filamentous fungi. The highest similarity observed (57% identity) was to Trichoderma reesei AGLI. The cDNA was expressed in Saccharomyces cerevisiae under the control of the yeast GAL10 promoter. Almost all of the enzyme produced was secreted into the culture medium, and the expression level reached was approximately 0.2 g/liter. The recombinant enzyme purified to homogeneity was highly glycosylated, showed slightly higher specific activity, and exhibited properties almost identical to those of the native enzyme from P. purpurogenum in terms of the N-terminal amino acid sequence, thermoactivity, pH profile, and mode of action on galacto-oligosaccharides. PMID:9797312
Opsin cDNA sequences of a UV and green rhodopsin of the satyrine butterfly Bicyclus anynana.
Vanhoutte, K J A; Eggen, B J L; Janssen, J J M; Stavenga, D G
2002-11-01
The cDNAs of an ultraviolet (UV) and long-wavelength (LW) (green) absorbing rhodopsin of the bush brown Bicyclus anynana were partially identified. The UV sequence, encoding 377 amino acids, is 76-79% identical to the UV sequences of the papilionids Papilio glaucus and Papilio xuthus and the moth Manduca sexta. A dendrogram derived from aligning the amino acid sequences reveals an equidistant position of Bicyclus between Papilio and Manduca. The sequence of the green opsin cDNA fragment, which encodes 242 amino acids, represents six of the seven transmembrane regions. At the amino acid level, this fragment is more than 80% identical to the corresponding LW opsin sequences of Dryas, Heliconius, Papilio (rhodopsin 2) and Manduca. Whereas three LW absorbing rhodopsins were identified in the papilionid butterflies, only one green opsin was found in B. anynana.
Rudwaleit, M.; Bowness, P.; Wordsworth, P.
1996-12-31
The HLA-B27 subtype HLA-B*2704 is virtually absent in Caucasians but common in Orientals, where it is associated with ankylosing spondylitis. The amino acid sequence of HLA-B*2704 has been established by peptide mapping and was shown to differ by two amino acids from HLA-B*2705. HLA-B*2704 is characterized by a serine for aspartic acid substitution at position 77 and glutamic acid for valine at position 152. To date, however, no nucleotide sequence confirming these changes at the DNA level has been published. 13 refs., 2 figs.
Crimean-Congo Hemorrhagic Fever
2004-01-01
aminocaproic acid were also indicated. Much emphasis was also placed on preventing reinfection, including the necessity of removing blood crusts from... The sequence is approximately 60% identical both at the nucleotide and amino acid levels to the L segment of Dugbe virus, the only other Nairovirus... However, more recent data based on nucleic acid sequence analysis have revealed extensive genetic diversity. The first published CCHFV sequence
SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage
Pouyet, Fanny; Bailly-Bechet, Marc; Mouchiroud, Dominique; Guéguen, Laurent
2016-01-01
Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences. PMID:27401173
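The second layer described above, preferences between synonymous codons, is often summarized descriptively before any model is fitted. The sketch below computes relative synonymous codon usage (RSCU) for a coding sequence under the standard genetic code. It is not SENCA and is not taken from the paper; it is a simple counting statistic, and the function name `rscu` is an illustrative choice.

```python
from collections import Counter

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}

def rscu(cds):
    """Relative synonymous codon usage for an in-frame DNA coding sequence.

    For each amino acid, a codon's RSCU is its count divided by the mean count
    of all codons encoding that amino acid; 1.0 means no preference.
    Amino acids absent from the sequence are omitted from the result.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))
    result = {}
    for aa in set(CODE.values()) - {"*"}:
        synonymous = [codon for codon, a in CODE.items() if a == aa]
        total = sum(counts[codon] for codon in synonymous)
        if total == 0:
            continue
        for codon in synonymous:
            result[codon] = counts[codon] * len(synonymous) / total
    return result
```

RSCU mixes mutational bias and selection into one number per codon, which is the confounding that a layered model of the kind described in the abstract is meant to separate.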
Nikolau, Basil J.; Wurtele, Eve S.; Oliver, David J.
The present invention provides nucleic acid and amino acid sequences of acetyl CoA synthetase (ACS), plastidic pyruvate dehydrogenase (pPDH), ATP citrate lyase (ACL), Arabidopsis pyruvate decarboxylase (PDC), and Arabidopsis aldehyde dehydrogenase (ALDH), specifically ALDH-2 and ALDH-4. The present invention also provides a recombinant vector comprising a nucleic acid sequence encoding one of the aforementioned enzymes, an antisense sequence thereto or a ribozyme therefor, a cell transformed with such a vector, antibodies to the enzymes, a plant cell, a plant tissue, a plant organ or a plant in which the level of an enzyme has been altered, and a method of producing such a plant cell, plant tissue, plant organ or plant. Desirably, alteration of the level of enzyme results in an alteration of the level of acetyl CoA in the plant cell, plant tissue, plant organ or plant. In addition, the present invention provides a recombinant vector comprising an antisense sequence of a nucleic acid sequence encoding pyruvate decarboxylase (PDC), the E1α subunit of pPDH, the E1β subunit of pPDH, the E2 subunit of pPDH, mitochondrial pyruvate dehydrogenase (mtPDH) or aldehyde dehydrogenase (ALDH) or a ribozyme that can cleave an RNA molecule encoding PDC, E1α pPDH, E1β pPDH, E2 pPDH, mtPDH or ALDH.
Detection and characterization of hepatitis A virus circulating in Egypt.
Hamza, Hazem; Abd-Elshafy, Dina Nadeem; Fayed, Sayed A; Bahgat, Mahmoud Mohamed; El-Esnawy, Nagwa Abass; Abdel-Mobdy, Emam
2017-07-01
Hepatitis A virus (HAV) still poses a considerable problem worldwide. In the current study, hepatitis A virus was recovered from wastewater samples collected from three wastewater treatment plants over one year. Using RT-PCR, HAV was detected in 43 out of 68 samples (63.2%) representing both inlet and outlet. Eleven positive samples were subjected to sequencing targeting the VP1-2A junction region. Phylogenetic analysis revealed that all samples belonged to subgenotype IB with few substitutions at the amino acid level. The complete sequence of one isolate (HAV/Egy/BI-11/2015) showed that the similarity at the amino acid level was not reflected at the nucleotide level. However, the deduced amino acid sequence derived from the complete nucleotide sequence showed distinct substitutions in the 2B, 2C, and 3A regions. Recombination analysis revealed a recombination event between X75215 (subgenotype IA) and AF268396 (subgenotype IB) involving a portion of the 2B nonstructural protein coding region (nucleotides 3757-3868), suggesting that the sequence characterized here is an actual recombinant. Despite the role of recombination in picornavirus evolution, its involvement in HAV evolution has rarely been reported, possibly because few complete HAV sequences are available. To our knowledge, this represents the first characterized complete sequence of an Egyptian isolate, and the described recombination event provides an important update on the circulating HAV strains in Egypt.
The complete nucleotide sequence of RNA 3 of a peach isolate of Prunus necrotic ringspot virus.
Hammond, R W; Crosslin, J M
1995-04-01
The complete nucleotide sequence of RNA 3 of the PE-5 peach isolate of Prunus necrotic ringspot ilarvirus (PNRSV) was obtained from cloned cDNA. The RNA sequence is 1941 nucleotides and contains two open reading frames (ORFs). ORF 1 consisted of 284 amino acids with a calculated molecular weight of 31,729 Da and ORF 2 contained 224 amino acids with a calculated molecular weight of 25,018 Da. ORF 2 corresponds to the coat protein gene. Expression of ORF 2 engineered into a pTrcHis vector in Escherichia coli results in a fusion polypeptide of approximately 28 kDa which cross-reacts with PNRSV polyclonal antiserum. Analysis of the coat protein amino acid sequence reveals a putative "zinc-finger" domain at the amino-terminal portion of the protein. Two tetranucleotide AUGC motifs occur in the 3'-UTR of the RNA and may function in coat protein binding and genome activation. ORF 1 homologies to other ilarviruses and alfalfa mosaic virus are confined to limited regions of conserved amino acids. The translated amino acid sequence of the coat protein gene shows 92% similarity to one isolate of apple mosaic virus, a closely related member of the ilarvirus group of plant viruses, but only 66% similarity to the amino acid sequence of the coat protein gene of a second isolate. These relationships are also reflected at the nucleotide sequence level. These results in one instance confirm the close similarities observed at the biophysical and serological levels between these two viruses, but on the other hand call into question the nomenclature used to describe these viruses.
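Locating open reading frames in a sequence like this one can be sketched as a scan of the three forward reading frames for an ATG followed in frame by a stop codon. The sketch below assumes the standard stop codons, forward strand only, and a hypothetical minimum-length cutoff `min_codons`; it is an illustration of the general idea, not the procedure the authors used.

```python
STOPS = {"TAA", "TAG", "TGA"}

def find_orfs(seq, min_codons=50):
    """Find forward-strand ORFs (ATG ... stop) in all three reading frames.

    Returns a sorted list of (start, end, n_codons) tuples, where `start` is
    the 0-based index of the A in ATG, `end` is the index just past the stop
    codon, and `n_codons` counts sense codons (the stop is excluded).
    """
    seq = seq.upper().replace("U", "T")  # accept RNA or cDNA input
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame ATG opens the ORF
            elif start is not None and codon in STOPS:
                n_codons = (i - start) // 3
                if n_codons >= min_codons:
                    orfs.append((start, i + 3, n_codons))
                start = None  # look for the next ORF in this frame
    return sorted(orfs)

# Toy sequence: ATG + three GCT codons + TAA, starting at index 2.
print(find_orfs("CCATGGCTGCTGCTTAAG", min_codons=1))  # [(2, 17, 4)]
```

For an ORF reported as 284 amino acids, `n_codons` would be 284 and the ORF would span 284 * 3 + 3 = 855 nucleotides including the stop codon.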
Sequence Alignment to Predict Across Species Susceptibility ...
Conservation of a molecular target across species can be used as a line-of-evidence to predict the likelihood of chemical susceptibility. The web-based Sequence Alignment to Predict Across Species Susceptibility (SeqAPASS) tool was developed to simplify, streamline, and quantitatively assess protein sequence/structural similarity across taxonomic groups as a means to predict relative intrinsic susceptibility. The intent of the tool is to allow for evaluation of any potential protein target, so it is amenable to variable degrees of protein characterization, depending on available information about the chemical/protein interaction and the molecular target itself. To allow for flexibility in the analysis, a layered strategy was adopted for the tool. The first level of the SeqAPASS analysis compares primary amino acid sequences to a query sequence, calculating a metric for sequence similarity (including detection of candidate orthologs), the second level evaluates sequence similarity within selected domains (e.g., ligand-binding domain, DNA binding domain), and the third level of analysis compares individual amino acid residue positions identified as being of importance for protein conformation and/or ligand binding upon chemical perturbation. Each level of the SeqAPASS analysis provides increasing evidence to apply toward rapid, screening-level assessments of probable cross species susceptibility. Such analyses can support prioritization of chemicals for further ev
ADS genes for reducing saturated fatty acid levels in seed oils
Heilmann, Ingo H; Shanklin, John
2014-03-18
The present invention relates to enzymes involved in lipid metabolism. In particular, the present invention provides coding sequences for Arabidopsis Desaturases (ADS), the encoded ADS polypeptides, and methods for using the sequences and encoded polypeptides, where such methods include decreasing and increasing saturated fatty acid content in plant seed oils.
ADS genes for reducing saturated fatty acid levels in seed oils
Heilmann, Ingo H.; Shanklin, John
2010-02-02
The present invention relates to enzymes involved in lipid metabolism. In particular, the present invention provides coding sequences for Arabidopsis Desaturases (ADS), the encoded ADS polypeptides, and methods for using the sequences and encoded polypeptides, where such methods include decreasing and increasing saturated fatty acid content in plant seed oils.
Leong, JoAnn Ching
The nucleotide sequence of the IHNV glycoprotein gene has been determined from a cDNA clone containing the entire coding region. The glycoprotein cDNA clone contained a leader sequence of 48 bases, a coding region of 1524 nucleotides, and 39 bases at the 3′ end. The entire cDNA clone contains 1609 nucleotides and encodes a protein of 508 amino acids. The deduced amino acid sequence gave a translated molecular weight of 56,795 daltons. A hydropathicity profile of the deduced amino acid sequence indicated that there were two major hydrophobic domains: one, at the N-terminus, delineating a signal peptide of 18 amino acids and the other, at the C-terminus, delineating the transmembrane region. Five possible sites of N-linked glycosylation were identified. Although no nucleic acid homology existed between the IHNV glycoprotein gene and the glycoprotein genes of rabies and VSV, there was significant homology at the amino acid level between all three rhabdovirus glycoproteins.
Zhang, Luan; Xiong, Zhi-ting; Xu, Zhong-rui; Liu, Chen; Cai, Shen-wen
2014-06-01
The roots of metallophytes serve as the key interface between plants and heavy metal-contaminated underground environments. It is known that the roots of metallicolous plants show a higher activity of acid invertase enzymes than those of non-metallicolous plants when under copper stress. To test whether the higher activity of acid invertases is the result of increased expression of acid invertase genes or variations in the amino acid sequences between the two population types, we isolated full cDNAs for acid invertases from two populations of Kummerowia stipulacea (from metalliferous and non-metalliferous soils), determined their nucleotide sequences, expressed them in Pichia pastoris, and conducted real-time PCR to determine differences in transcript levels during Cu stress. Heterologous expression of acid invertase cDNAs in P. pastoris indicated that variations in the amino acid sequences of acid invertases between the two populations played no significant role in determining enzyme characteristics. Seedlings of K. stipulacea were exposed to 0.3 µM Cu(2+) (control) and 10 µM Cu(2+) for 7 days under hydroponic conditions. The transcript levels of acid invertases in metallicolous plants were significantly higher than in non-metallicolous plants when under copper stress. The results suggest that the expression of acid invertase genes in metallicolous plants of K. stipulacea differed from that in non-metallicolous plants under such conditions. In addition, sugars may play an important role in regulating the transcript level of acid invertase genes, and acid invertase genes may also be involved in root/shoot biomass allocation.
Xu, Ting; Xie, Jiasong; Yang, Shoubao; Ye, Shigen; Luo, Ming; Wu, Xinzhong
2016-08-01
Cyclophilins (CyPs) are a family of proteins that bind the immunosuppressive agent cyclosporin A (CsA) with high affinity and belong to one of the three superfamilies of peptidyl-prolyl cis-trans isomerases (PPIases). In this report, three cyclophilin genes (Ca-CyPs), Ca-CyPA, Ca-CyPB and Ca-PPIL3, were identified from the oyster Crassostrea ariakensis Gould. Ca-CyPA encodes a protein of 165 amino acids, Ca-CyPB a protein of 217 amino acids and Ca-PPIL3 a protein of 162 amino acids. All three Ca-CyP genes contain a typical CyP-PPIase domain with its signature sequences, and Ca-CyPB contains an N-terminal signal peptide sequence. A tissue distribution study revealed that the Ca-CyPs were ubiquitously expressed in all examined tissues, with the highest levels observed in hemocytes. RLO incubation upregulated the mRNA expression levels of the Ca-CyPs, indicating that the three Ca-CyPs might be involved in the oyster immune response against RLO infection.
Demorest, Zachary L; Coffman, Andrew; Baltes, Nicholas J; Stoddard, Thomas J; Clasen, Benjamin M; Luo, Song; Retterath, Adam; Yabandith, Ann; Gamo, Maria Elena; Bissen, Jeff; Mathis, Luc; Voytas, Daniel F; Zhang, Feng
2016-10-13
The ability to modulate levels of individual fatty acids within soybean oil has potential to increase shelf-life and frying stability and to improve nutritional characteristics. Commodity soybean oil contains high levels of polyunsaturated linoleic and linolenic acid, which contribute to oxidative instability - a problem that has been addressed through partial hydrogenation. However, partial hydrogenation increases levels of trans-fatty acids, which have been associated with cardiovascular disease. Previously, we generated soybean lines with knockout mutations within fatty acid desaturase 2-1A (FAD2-1A) and FAD2-1B genes, resulting in oil with increased levels of monounsaturated oleic acid (18:1) and decreased levels of linoleic (18:2) and linolenic acid (18:3). Here, we stack mutations within FAD2-1A and FAD2-1B with mutations in fatty acid desaturase 3A (FAD3A) to further decrease levels of linolenic acid. Mutations were introduced into FAD3A by directly delivering TALENs into fad2-1a fad2-1b soybean plants. Oil from fad2-1a fad2-1b fad3a plants had significantly lower levels of linolenic acid (2.5 %), as compared to fad2-1a fad2-1b plants (4.7 %). Furthermore, oil had significantly lower levels of linoleic acid (2.7 % compared to 5.1 %) and significantly higher levels of oleic acid (82.2 % compared to 77.5 %). Transgene-free fad2-1a fad2-1b fad3a soybean lines were identified. The methods presented here provide an efficient means for using sequence-specific nucleases to stack quality traits in soybean. The resulting product comprised oleic acid levels above 80 % and linoleic and linolenic acid levels below 3 %.
Nishizawa, M; Nishizawa, K
2000-10-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.
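The abstract does not define how repetitiveness was measured, so the following is only one plausible sketch. It assumes a hypothetical statistic: the observed fraction of identical residue pairs separated by at most `max_dist` positions, divided by the fraction expected from the overall composition of the sequence. The name `repeat_ratio` and the window size are illustrative.

```python
from collections import Counter

def repeat_ratio(seq, max_dist=5):
    """Observed / expected frequency of identical residues at short range.

    Observed: fraction of pairs (i, i + d), 1 <= d <= max_dist, that carry
    the same symbol. Expected: sum of squared symbol frequencies, i.e. the
    chance that two independently drawn positions match.
    A ratio above 1 suggests local repetitiveness beyond composition alone.
    """
    seq = seq.upper()
    n = len(seq)
    if n < 2:
        raise ValueError("sequence too short")
    expected = sum((count / n) ** 2 for count in Counter(seq).values())
    same = 0
    pairs = 0
    for d in range(1, max_dist + 1):
        for i in range(n - d):
            pairs += 1
            if seq[i] == seq[i + d]:
                same += 1
    return (same / pairs) / expected

print(repeat_ratio("QQQQAAAA", max_dist=1) > 1)  # True: clustered residues
print(repeat_ratio("QAQAQAQA", max_dist=1))      # 0.0: no adjacent repeats
```

The same function works on nucleotide strings, and applying it to every third base of a coding sequence would give a crude analogue of the codon-position analyses the abstract mentions.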
Nishizawa, Manami; Nishizawa, Kazuhisa
2000-01-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed. PMID:11000273
Fatima, Tahira; Snyder, Crystal L.; Schroeder, William R.; Cram, Dustin; Datla, Raju; Wishart, David; Weselake, Randall J.; Krishna, Priti
2012-01-01
Background: Sea buckthorn (Hippophae rhamnoides L.) is a hardy, fruit-producing plant known historically for its medicinal and nutraceutical properties. The most recognized product of sea buckthorn is its fruit oil, composed of seed oil that is rich in essential fatty acids, linoleic (18∶2ω-6) and α-linolenic (18∶3ω-3) acids, and pulp oil that contains high levels of monounsaturated palmitoleic acid (16∶1ω-7). Sea buckthorn is fast gaining popularity as a source of functional food and nutraceuticals, but currently has few genomic resources; therefore, we explored the fatty acid composition of Canadian-grown cultivars (ssp. mongolica) and the sea buckthorn seed transcriptome using the 454 GS FLX sequencing technology. Results: GC-MS profiling of fatty acids in seeds and pulp of berries indicated that the seed oil contained linoleic and α-linolenic acids at 33–36% and 30–36%, respectively, while the pulp oil contained palmitoleic acid at 32–42%. 454 sequencing of sea buckthorn cDNA collections from mature seeds yielded 500,392 sequence reads, which identified 89,141 putative unigenes represented by 37,482 contigs and 51,659 singletons. Functional annotation by Gene Ontology and computational prediction of metabolic pathways indicated that primary metabolism (protein>nucleic acid>carbohydrate>lipid) and fatty acid and lipid biosynthesis pathways were highly represented categories. Sea buckthorn sequences related to fatty acid biosynthesis genes in Arabidopsis were identified, and a subset of these was examined for transcript expression at four developing stages of the berry. Conclusion: This study provides the first comprehensive genomic resources represented by expressed sequences for sea buckthorn, and demonstrates that the seed oil of Canadian-grown sea buckthorn cultivars contains high levels of linoleic acid and α-linolenic acid in a close to 1∶1 ratio, which is beneficial for human health.
These data provide the foundation for further studies on sea buckthorn oil, the enzymes involved in its biosynthesis, and the genes involved in the general hardiness of sea buckthorn against environmental conditions. PMID:22558083
Liu, Bin; Ertesvåg, Helga; Aasen, Inga Marie; Vadstein, Olav; Brautaset, Trygve; Heggeset, Tonje Marita Bjerkan
2016-06-01
Thraustochytrids are unicellular, marine protists, and there is a growing industrial interest in these organisms, particularly because some species, including strains belonging to the genus Aurantiochytrium, accumulate high levels of docosahexaenoic acid (DHA). Here, we report the draft genome sequence of Aurantiochytrium sp. T66 (ATCC PRA-276), with a size of 43 Mbp, and 11,683 predicted protein-coding sequences. The data has been deposited at DDBJ/EMBL/Genbank under the accession LNGJ00000000. The genome sequence will contribute new insight into DHA biosynthesis and regulation, providing a basis for metabolic engineering of thraustochytrids.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wilkins, T.A.
1993-06-01
This study investigates the molecular events of vacuole ontogeny in rapidly elongated cotton plant cells. Within the DNA coding region, the cotton and carrot cDNA clones exhibit 82.2% nucleotide sequence homology; at the amino acid level cotton and carrot catalytic subunits exhibited 95.7% identity and 2.1% amino acid similarity. When aligned with the analogous sequences from yeast, the cotton protein shared only 60.5% amino acid identity and 12.7% similarity. 10 refs., 1 tab.
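The distinction drawn above between amino acid identity and amino acid similarity can be illustrated with a minimal Python sketch. It assumes two sequences that are already aligned (gaps written as "-"), counts identical columns and conservative-substitution columns separately, and divides by the number of ungapped columns. The function name, the toy sequences, and the conservative residue groups are illustrative assumptions, not the scoring scheme used in the study.

```python
# Hypothetical conservative-substitution groups (an assumption for
# illustration; real analyses usually rely on a substitution matrix).
CONSERVATIVE_GROUPS = [
    set("ILVM"), set("FWY"), set("KRH"), set("DE"), set("ST"), set("NQ"),
]


def identity_and_similarity(seq_a, seq_b):
    """Return (percent identity, percent similarity) for two aligned sequences.

    Columns containing a gap in either sequence are skipped. "Similarity"
    here counts only non-identical, conservative substitutions, so the two
    percentages are reported separately.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    identical = similar = compared = 0
    for x, y in zip(seq_a, seq_b):
        if x == "-" or y == "-":
            continue  # ignore gapped columns
        compared += 1
        if x == y:
            identical += 1
        elif any(x in group and y in group for group in CONSERVATIVE_GROUPS):
            similar += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared


# Toy aligned pair (made up for this example).
identity, similarity = identity_and_similarity("MKVLDE-STA", "MRVIDEQSSA")
print(f"identity {identity:.1f}%, similarity {similarity:.1f}%")
```

Whether gapped columns belong in the denominator is a convention that varies between tools, so percentages computed this way are only comparable when the same convention is used throughout.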
Adderson, Elisabeth E.; Boudreaux, Jan W.; Cummings, Jessica R.; Pounds, Stanley; Wilson, Deborah A.; Procop, Gary W.; Hayden, Randall T.
2008-01-01
We compared the relative levels of effectiveness of three commercial identification kits and three nucleic acid amplification tests for the identification of coryneform bacteria by testing 50 diverse isolates, including 12 well-characterized control strains and 38 organisms obtained from pediatric oncology patients at our institution. Between 33.3 and 75.0% of control strains were correctly identified to the species level by phenotypic systems or nucleic acid amplification assays. The most sensitive tests were the API Coryne system and amplification and sequencing of the 16S rRNA gene using primers optimized for coryneform bacteria, which correctly identified 9 of 12 control isolates to the species level, and all strains with a high-confidence call were correctly identified. Organisms not correctly identified were species not included in the test kit databases or not producing a pattern of reactions included in kit databases or which could not be differentiated among several genospecies based on reaction patterns. Nucleic acid amplification assays had limited abilities to identify some bacteria to the species level, and comparison of sequence homologies was complicated by the inclusion of allele sequences obtained from uncultivated and uncharacterized strains in databases. The utility of rpoB genotyping was limited by the small number of representative gene sequences that are currently available for comparison. The correlation between identifications produced by different classification systems was poor, particularly for clinical isolates. PMID:18160450
High levels of MHC class II allelic diversity in lake trout from Lake Superior
Dorschner, M.O.; Duris, T.; Bronte, C.R.; Burnham-Curtis, M. K.; Phillips, R.B.
2000-01-01
Sequence variation in a 216 bp portion of the major histocompatibility complex (MHC) II B1 domain was examined in 74 individual lake trout (Salvelinus namaycush) from different locations in Lake Superior. Forty-three alleles were obtained which encoded 71-72 amino acids of the mature protein. These sequences were compared with previous data obtained from five Pacific salmon species and Atlantic salmon using the same primers. Although all of the lake trout alleles clustered together in the neighbor-joining analysis of amino acid sequences, one amino acid allelic lineage was shared with Atlantic salmon (Salmo salar), a species in another genus which probably diverged from Salvelinus more than 10-20 million years ago. As shown previously in other salmonids, the level of nonsynonymous nucleotide substitution (d(N)) exceeded the level of synonymous substitution (d(S)). The level of nucleotide diversity at the MHC class II B1 locus was considerably higher in lake trout than in the Pacific salmon (genus Oncorhynchus). These results are consistent with the hypothesis that lake trout colonized Lake Superior from more than one refuge following the Wisconsin glaciation. Recent population bottlenecks may have reduced nucleotide diversity in Pacific salmon populations.
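Nucleotide diversity, the quantity compared between lake trout and Pacific salmon above, is commonly computed as the average number of pairwise differences per site among aligned sequences. The sketch below shows that calculation in Python under simplifying assumptions: equal-length, gap-free sequences, each distinct sequence weighted equally. The function name and the toy alleles are hypothetical and are not data from the study.

```python
from itertools import combinations


def nucleotide_diversity(sequences):
    """Average pairwise differences per site over all sequence pairs."""
    if len(sequences) < 2:
        raise ValueError("need at least two sequences")
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("sequences must be aligned to equal length")
    pairs = list(combinations(sequences, 2))
    # Count mismatching sites for every pair of sequences.
    total_differences = sum(
        sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs
    )
    return total_differences / (len(pairs) * length)


# Three made-up 8 bp alleles: pairwise differences are 1, 2, and 1.
alleles = ["ACGTACGT", "ACGTACGA", "ACCTACGA"]
pi = nucleotide_diversity(alleles)
print(round(pi, 4))
```

A frequency-weighted version would multiply each pair's contribution by the allele frequencies; the unweighted form is used here only to keep the example short.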
Characterization and mapping of cDNA encoding aspartate aminotransferase in rice, Oryza sativa L.
Song, J; Yamamoto, K; Shomura, A; Yano, M; Minobe, Y; Sasaki, T
1996-10-31
Fifteen cDNA clones, putatively identified as encoding aspartate aminotransferase (AST, EC 2.6.1.1), were isolated and partially sequenced. Together with six previously isolated clones putatively identified to encode ASTs (Sasaki et al. 1994, Plant Journal 6, 615-624), their sequences were characterized and classified into 4 cDNA species. Two of the isolated clones, C60213 and C2079, were full-length cDNAs, and their complete nucleotide sequences were determined. C60213 was 1612 bp long and its deduced amino acid sequence showed 88% homology with that of Panicum miliaceum L. mitochondrial AST. The C60213-encoded protein had an N-terminal amino acid sequence that was characteristic of a mitochondrial transit peptide. On the other hand, C2079 was 1546 bp long and had 91% amino acid sequence homology with P. miliaceum L. cytosolic AST but lacked the transit peptide sequence. The homologies of nucleotide sequences and deduced amino acid sequences of C2079 and C60213 were 54% and 52%, respectively. C2079 and C60213 were mapped on chromosomes 1 and 6, respectively, by restriction fragment length polymorphism linkage analysis. Northern blot analysis using C2079 as a probe revealed much higher transcript levels in callus and root than in green and etiolated shoots, suggesting tissue-specific variations of AST gene expression.
Comparison of the Heme Iron Utilization Systems of Pathogenic Vibrios
O’Malley, S. M.; Mouton, S. L.; Occhino, D. A.; Deanda, M. T.; Rashidi, J. R.; Fuson, K. L.; Rashidi, C. E.; Mora, M. Y.; Payne, S. M.; Henderson, D. P.
1999-01-01
Vibrio alginolyticus, Vibrio fluvialis, and Vibrio parahaemolyticus utilized heme and hemoglobin as iron sources and contained chromosomal DNA similar to several Vibrio cholerae heme iron utilization genes. A V. parahaemolyticus gene that performed the function of V. cholerae hutA was isolated. A portion of the tonB1 locus of V. parahaemolyticus was sequenced and found to encode proteins similar in amino acid sequence to V. cholerae HutW, TonB1, and ExbB1. A recombinant plasmid containing the V. cholerae tonB1 and exbB1D1 genes complemented a V. alginolyticus heme utilization mutant. These data suggest that the heme iron utilization systems of the pathogenic vibrios tested, particularly V. parahaemolyticus and V. alginolyticus, are similar at the DNA level, the functional level, and, in the case of V. parahaemolyticus, the amino acid sequence or protein level to that of V. cholerae. PMID:10348876
USDA-ARS's Scientific Manuscript database
Background: Perilla (Perilla frutescens (L.) var frutescens) produces high levels of α-linolenic acid (ALA), an omega-3 fatty acid important to health and development. To uncover key genes involved in fatty acid (FA) and triacylglycerol (TAG) synthesis in perilla, we conducted deep sequencing of cD...
A Therapeutic Uricase with Reduced Immunogenicity Risk and Improved Development Properties
Nyborg, Andrew C.; Ward, Chris; Zacco, Anna; Chacko, Benoy; Grinberg, Luba; Geoghegan, James C.; Bean, Ryan; Wendeler, Michaela; Bartnik, Frank; O’Connor, Ellen; Gruia, Flaviu; Iyer, Vidyashankara; Feng, Hui; Roy, Varnika; Berge, Mark; Miner, Jeffrey N.; Wilson, David M.; Zhou, Dongmei; Nicholson, Simone; Wilker, Clynn; Wu, Chi Y.; Wilson, Susan; Jermutus, Lutz; Wu, Herren; Owen, David A.; Osbourn, Jane; Coats, Steven; Baca, Manuel
2016-01-01
Humans and higher primates are unique in that they lack uricase, the enzyme capable of oxidizing uric acid. As a consequence of this enzyme deficiency, humans have high serum uric acid levels. In some people, uric acid levels rise above the solubility limit, resulting in crystallization in joints; acute inflammation in response to those crystals causes severe pain, a condition known as gout. Treatment for severe gout includes injection of non-human uricase to reduce serum uric acid levels. Krystexxa® is a hyper-PEGylated pig-baboon chimeric uricase indicated for chronic refractory gout that induces an immunogenic response in 91% of treated patients, including infusion reactions (26%) and anaphylaxis (6.5%). These properties limit its use and effectiveness. An innovative approach has been used to develop a therapeutic uricase with improved properties such as soluble expression, neutral pH solubility, high E. coli expression level, thermal stability, and excellent activity. More than 200 diverse uricase sequences were aligned to guide protein engineering and reduce putative sequence liabilities. A single uricase lead candidate was identified, which showed low potential for immunogenicity in >200 human donor samples selected to represent diverse HLA haplotypes. Cysteines were engineered into the lead sequence for site-specific PEGylation, and studies demonstrated >95% PEGylation efficiency. PEGylated uricase retains enzymatic activity in vitro at neutral pH, in human serum and in vivo (rats and canines) and has an extended half-life. In canines, an 85% reduction in serum uric acid levels was observed with a single subcutaneous injection. This PEGylated, non-immunogenic uricase has the potential to provide meaningful benefits to patients with gout. PMID:28002433
Sakoda, H; Imanaka, T
1992-01-01
Using Bacillus subtilis as a host and pTB524 as a vector plasmid, we cloned the thermostable alcohol dehydrogenase (ADH-T) gene (adhT) from Bacillus stearothermophilus NCA1503 and determined its nucleotide sequence. The deduced amino acid sequence (337 amino acids) was compared with the sequences of ADHs from four different origins. The amino acid residues responsible for the catalytic activity of horse liver ADH had been clarified on the basis of three-dimensional structure. Since those catalytic amino acid residues were fairly conserved in ADH-T and other ADHs, ADH-T was inferred to have basically the same proton release system as horse liver ADH. The putative proton release system of ADH-T was elucidated by introducing point mutations at the catalytic amino acid residues, Cys-38 (cysteine at position 38), Thr-40, and His-43, with site-directed mutagenesis. The mutant enzyme Thr-40-Ser (Thr-40 was replaced by serine) showed a slightly lower level of activity than wild-type ADH-T did. The result indicates that the OH group of serine instead of threonine can also be used for the catalytic activity. To change the pKa value of the putative system, His-43 was replaced by the more basic amino acid arginine. As a result, the optimum pH of the mutant enzyme His-43-Arg was shifted from 7.8 (wild-type enzyme) to 9.0. His-43-Arg exhibited a higher level of activity than wild-type enzyme at the optimum pH. PMID:1735726
Santagati, Vito Davide; Sestili, Francesco; Lafiandra, Domenico; D'Ovidio, Renato; Rogniaux, Helene; Masci, Stefania
2016-07-01
Wheat high molecular weight glutenin subunit variation is important because of its great influence on glutenin polymer structure, which is related to dough technological properties. Among the different subunits, the pair Bx20 and By20 is known to have a negative effect on quality, but the reasons are not clear: Bx20 has two cysteines, which theoretically make this subunit a chain extender of the glutenin polymer, just like the other Bx subunits, showing four cysteines, two of which should be involved in intra-molecular disulfide bonds. By20 has not previously been characterized at the molecular level. Here we report the nucleotide sequences of Bx20 and By20 genes isolated from the durum wheat cultivar 'Lira 45' and the validation of the corresponding deduced amino acid sequences by using MALDI-TOF and LC-MS/MS. Four nucleotide differences were identified in the Bx20 gene with respect to the deduced sequence present in NCBI, causing two amino acid substitutions. For the By20 subunit, nucleotide and amino acid sequences revealed a great similarity to By15, both at gene and protein levels, showing five nucleotide changes generating two amino acid differences. No evidence of post-translational modifications has been found. Hypotheses are formulated in regard to relationships with technological quality. Copyright © 2016 John Wiley & Sons, Ltd.
Song, Zhewei; Du, Hai; Zhang, Yan; Xu, Yan
2017-01-01
Fermentation microbiota are specific microorganisms that generate different types of metabolites in many production processes. In traditional solid-state fermentation, the structural composition and functional capacity of the core microbiota determine the quality and quantity of products. As a typical example of food fermentation, Chinese Maotai-flavor liquor production involves a complex of various microorganisms and a wide variety of metabolites. However, the microbial succession and functional shift of the core microbiota in this traditional food fermentation remain unclear. Here, high-throughput amplicon sequencing (16S rRNA gene amplicon sequencing and internal transcribed spacer amplicon sequencing) and metatranscriptomic sequencing technologies were combined to reveal the structure and function of the core microbiota in Chinese soy sauce aroma type liquor production. In addition, ultra-performance liquid chromatography and headspace-solid phase microextraction-gas chromatography-mass spectrometry were employed to provide qualitative and quantitative analysis of the major flavor metabolites. A total of 10 fungal and 11 bacterial genera were identified as the core microbiota. In addition, metatranscriptomic analysis revealed that pyruvate metabolism in yeasts (genera Pichia, Schizosaccharomyces, Saccharomyces, and Zygosaccharomyces) and lactic acid bacteria (genus Lactobacillus) could be classified into two stages in the production of flavor components. Stage I involved high-level alcohol (ethanol) production, with the genus Schizosaccharomyces serving as the core functional microorganism. Stage II involved high-level acid (lactic acid and acetic acid) production, with the genus Lactobacillus serving as the core functional microorganism. The functional shift from the genus Schizosaccharomyces to the genus Lactobacillus drives flavor component conversion from alcohol (ethanol) to acid (lactic acid and acetic acid) in Chinese Maotai-flavor liquor production.
Our findings provide insight into the effects of the core functional microbiota in soy sauce aroma type liquor production and the characteristics of the fermentation microbiota under different environmental conditions.
Takaesu, Azusa; Watanabe, Kiyotaka; Takai, Shinji; Sasaki, Yukako; Orino, Koichi
2008-01-01
Background: The iron-storage protein ferritin plays a central role in iron metabolism. Ferritin has a dual function: storing iron and segregating it for protection against iron-catalyzed reactive oxygen species. Tissue ferritin is composed of two kinds of subunits (H: heavy chain or heart-type subunit; L: light chain or liver-type subunit). Ferritin gene expression is controlled at the translational level in an iron-dependent manner or at the transcriptional level in an iron-independent manner. However, sequencing analysis of marine mammalian ferritin subunits has not yet been performed fully. The purpose of this study is to reveal cDNA-derived amino acid sequences of cetacean ferritin H and L subunits, and to demonstrate the possibility of iron-dependent expression of these subunits, especially the H subunit. Methods: Sequence analyses of cetacean ferritin H and L subunits were performed by direct sequencing of polymerase chain reaction (PCR) fragments from cDNAs generated via reverse transcription-PCR of leukocyte total RNA prepared from blood samples of six different dolphin species (Pseudorca crassidens, Lagenorhynchus obliquidens, Grampus griseus, Globicephala macrorhynchus, Tursiops truncatus, and Delphinapterus leucas). The putative iron-responsive element sequence in the 5'-untranslated region of the six different dolphin species was revealed by direct sequencing of PCR fragments obtained using leukocyte genomic DNA. Results: Dolphin H and L subunits consist of 182 and 174 amino acids, respectively, and amino acid sequence identities of ferritin subunits among these dolphins are highly conserved (H: 99–100%, (99→98) ; L: 98–100%). The conserved 28 bp IRE sequence was located -144 bp upstream from the initiation codon in the six different dolphin species. Conclusion: These results indicate that six different dolphin species have conserved ferritin sequences, and suggest that these genes are iron-dependently expressed. PMID:18954429
Sugihara, K; Hanagata, N; Dubinsky, Z; Baba, S; Karube, I
2000-11-01
Young plants of the common Okinawa mangrove species Bruguiera gymnorrhiza were transferred from freshwater to a medium with seawater salt level (500 mM NaCl). Two-dimensional gel electrophoresis of the leaf extract revealed a 33 kDa protein with pI 5.2, whose quantity increased as a result of NaCl treatment. The N-terminal amino acid sequence of this protein had significant homology with the mature region of the oxygen evolving enhancer protein 1 (OEE1) precursor. A cDNA fragment of the OEE1 precursor was cloned by reverse transcription-PCR (RT-PCR) using degenerate primers. Both 3'- and 5'-regions were isolated by the rapid amplification of cDNA ends (RACE) method. The deduced amino acid sequence consisted of 322 amino acids and was 87% identical to that of Nicotiana tabacum. In B. gymnorrhiza, the predicted amino acid sequence of the mature protein starts at residue number 85 of the open reading frame. The first 84 amino acid residues correspond to a typical transit sequence for the signal directing OEE1 to its appropriate compartment of the chloroplast. The expression of OEE1 was analyzed together with other OEE subunits and the D1 protein of photosystem II. The transcript levels of all three OEEs were enhanced by NaCl treatment, but no significant increase in D1 protein was observed.
Hazes, Bart
2014-02-28
Protein-coding DNA sequences and their corresponding amino acid sequences are routinely used to study relationships between sequence, structure, function, and evolution. The rapidly growing size of sequence databases increases the power of such comparative analyses but it makes it more challenging to prepare high quality sequence data sets with control over redundancy, quality, completeness, formatting, and labeling. Software tools for some individual steps in this process exist but manual intervention remains a common and time consuming necessity. CDSbank is a database that stores both the protein-coding DNA sequence (CDS) and amino acid sequence for each protein annotated in Genbank. CDSbank also stores Genbank feature annotation, a flag to indicate incomplete 5' and 3' ends, full taxonomic data, and a heuristic to rank the scientific interest of each species. This rich information allows fully automated data set preparation with a level of sophistication that aims to meet or exceed manual processing. Defaults ensure ease of use for typical scenarios while allowing great flexibility when needed. Access is via a free web server at http://hazeslab.med.ualberta.ca/CDSbank/. CDSbank presents a user-friendly web server to download, filter, format, and name large sequence data sets. Common usage scenarios can be accessed via pre-programmed default choices, while optional sections give full control over the processing pipeline. Particular strengths are: extract protein-coding DNA sequences just as easily as amino acid sequences, full access to taxonomy for labeling and filtering, awareness of incomplete sequences, and the ability to take one protein sequence and extract all synonymous CDS or identical protein sequences in other species. Finally, CDSbank can also create labeled property files to, for instance, annotate or re-label phylogenetic trees.
Bai, Yang; Dougherty, Laura; Li, Mingjun; Fazio, Gennaro; Cheng, Lailiang; Xu, Kenong
2012-08-01
Acidity levels greatly affect the taste and flavor of fruit, and consequently its market value. In mature apple fruit, malic acid is the predominant organic acid. Several studies have confirmed that the major quantitative trait locus Ma largely controls the variation of fruit acidity levels. The Ma locus has recently been defined in a region of 150 kb that contains 44 predicted genes on chromosome 16 in the Golden Delicious genome. In this study, we identified two aluminum-activated malate transporter-like genes, designated Ma1 and Ma2, as strong candidates of Ma by narrowing down the Ma locus to 65-82 kb containing 12-19 predicted genes depending on the haplotypes. The Ma haplotypes were determined by sequencing two bacterial artificial chromosome clones from G.41 (an apple rootstock of genotype Mama) that cover the two distinct haplotypes at the Ma locus. Gene expression profiling in 18 apple germplasm accessions suggested that Ma1 is the major determinant at the Ma locus controlling fruit acidity as Ma1 is expressed at a much higher level than Ma2 and the Ma1 expression is significantly correlated with fruit titratable acidity (R² = 0.4543, P = 0.0021). In the coding sequences of low acidity alleles of Ma1 and Ma2, sequence variations at the amino acid level between Golden Delicious and G.41 were not detected. But the alleles for high acidity vary considerably between the two genotypes. The low acidity allele of Ma1, Ma1-1455A, is mainly characterized by a mutation at base 1455 in the open reading frame. The mutation leads to a premature stop codon that truncates the carboxyl terminus of Ma1-1455A by 84 amino acids compared with Ma1-1455G.
A survey of 29 apple germplasm accessions using marker CAPS(1455) that targets the SNP(1455) in Ma1 showed that the CAPS(1455A) allele was associated completely with high pH and highly with low titratable acidity, suggesting that the natural mutation-led truncation is most likely responsible for the abolished function of Ma for low pH or high acidity in apple.
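How a single-base change in an open reading frame can create a premature stop codon and shorten the encoded protein, as described for the mutation at base 1455, can be sketched in a few lines of Python. The example uses the standard genetic code and a made-up 12-base coding sequence; it is not the Ma1 sequence, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_to_stop(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def substitute(cds, position, base):
    """Return cds with the base at a 1-based position replaced."""
    index = position - 1
    return cds[:index] + base + cds[index + 1:]


# Toy open reading frame: ATG TGG AAA TAA encodes M-W-K followed by a stop.
wild_type = "ATGTGGAAATAA"
# Changing base 6 from G to A turns codon TGG (Trp) into TGA (stop).
mutant = substitute(wild_type, 6, "A")

wild_protein = translate_to_stop(wild_type)
mutant_protein = translate_to_stop(mutant)
residues_lost = len(wild_protein) - len(mutant_protein)
print(wild_protein, mutant_protein, residues_lost)
```

The same comparison applied to two real alleles would report how many carboxyl-terminal residues the truncated allele lacks, provided both sequences start at the same initiation codon.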
Lucas, J.N.; Straume, T.; Bogen, K.T.
1998-03-24
A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
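The selection logic of the two immobilization steps can be pictured as two successive filters, where the second filter operates only on what the first one retained. The Python sketch below models nucleic acid fragments as records tagged with the sequence types they carry; the data structure, the function name, and the labels "chr1" and "chr2" are illustrative assumptions and do not represent the laboratory procedure itself.

```python
def capture(fragments, probe_type):
    """Keep only fragments that contain the sequence type a probe binds."""
    return [f for f in fragments if probe_type in f["types"]]


# Hypothetical sample: each fragment lists the sequence types it carries.
sample = [
    {"id": "frag1", "types": {"chr1"}},
    {"id": "frag2", "types": {"chr2"}},
    {"id": "frag3", "types": {"chr1", "chr2"}},  # carries both types
]

# Step 1: the first immobilized probe isolates fragments with the first type.
first_set = capture(sample, "chr1")
# Step 2: the second probe isolates, from within the first set only,
# fragments that also carry the second type.
second_set = capture(first_set, "chr2")

aberrant_ids = [f["id"] for f in second_set]
print(aberrant_ids)
```

Only fragments carrying both sequence types survive both steps, which is why a non-empty second set is read as evidence of an aberration.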
Pudupakam, Raghavendra Sumanth; Raghunath, Shobana; Pudupakam, Meghanath; Daggupati, Sreenivasulu
2017-03-01
Sequence analysis and phylogenetic studies based on non-structural protein-3 (NS3) gene are important in understanding the evolution and epidemiology of bluetongue virus (BTV). This study was aimed at characterizing the NS3 gene sequence of Indian BTV serotype-2 (BTV2) to elucidate its genetic relationship to global BTV isolates. The NS3 gene of BTV2 was amplified from infected BHK-21 cell cultures, cloned and subjected to sequence analysis. The generated NS3 gene sequence was compared with the corresponding sequences of different BTV serotypes across the world, and a phylogenetic relationship was established. The NS3 gene of BTV2 showed moderate levels of variability in comparison to different BTV serotypes, with nucleotide sequence identities ranging from 81% to 98%. The region showed high sequence homology of 93-99% at amino acid level with various BTV serotypes. The PPXY/PTAP late domain motifs, glycosylation sites, hydrophobic domains, and the amino acid residues critical for virus-host interactions were conserved in NS3 protein. Phylogenetic analysis revealed that BTV isolates segregate into four topotypes and that the Indian BTV2 in subclade IA is closely related to Asian and Australian origin strains. Analysis of the NS3 gene indicated that Indian BTV2 isolate is closely related to strains from Asia and Australia, suggesting a common origin of infection. Although the pattern of evolution of BTV2 isolate is different from other global isolates, the deduced amino acid sequence of NS3 protein demonstrated high molecular stability.
Pudupakam, Raghavendra Sumanth; Raghunath, Shobana; Pudupakam, Meghanath; Daggupati, Sreenivasulu
2017-01-01
Aim: Sequence analysis and phylogenetic studies based on non-structural protein-3 (NS3) gene are important in understanding the evolution and epidemiology of bluetongue virus (BTV). This study was aimed at characterizing the NS3 gene sequence of Indian BTV serotype-2 (BTV2) to elucidate its genetic relationship to global BTV isolates. Materials and Methods: The NS3 gene of BTV2 was amplified from infected BHK-21 cell cultures, cloned and subjected to sequence analysis. The generated NS3 gene sequence was compared with the corresponding sequences of different BTV serotypes across the world, and a phylogenetic relationship was established. Results: The NS3 gene of BTV2 showed moderate levels of variability in comparison to different BTV serotypes, with nucleotide sequence identities ranging from 81% to 98%. The region showed high sequence homology of 93-99% at amino acid level with various BTV serotypes. The PPXY/PTAP late domain motifs, glycosylation sites, hydrophobic domains, and the amino acid residues critical for virus-host interactions were conserved in NS3 protein. Phylogenetic analysis revealed that BTV isolates segregate into four topotypes and that the Indian BTV2 in subclade IA is closely related to Asian and Australian origin strains. Conclusion: Analysis of the NS3 gene indicated that Indian BTV2 isolate is closely related to strains from Asia and Australia, suggesting a common origin of infection. Although the pattern of evolution of BTV2 isolate is different from other global isolates, the deduced amino acid sequence of NS3 protein demonstrated high molecular stability. PMID:28435199
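Percent identity figures such as those reported above are typically computed from a pairwise alignment. The sketch below is an assumption-laden illustration rather than the authors' procedure: it takes two already aligned sequences of equal length, skips any column containing a gap character (`-`), and reports matches over the remaining columns. Other tools use different denominators, so values may differ slightly.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent identity over ungapped columns of two pre-aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # ignore columns where either sequence has a gap
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy sequences, not BTV data: one mismatch in ten columns.
print(percent_identity("ATGCATGCAT", "ATGCATGCAA"))  # 90.0
```

The same function works for aligned amino acid sequences, since it only compares characters column by column.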
Trends of amino acid usage in the proteins from the unicellular parasite Giardia lamblia.
Garat, B; Musto, H
2000-12-29
Correspondence analysis of amino acid frequencies was applied to 75 complete coding sequences from the unicellular parasite Giardia lamblia, and it was found that three major factors influence the variability of amino acidic composition of proteins. The first trend strongly correlated with (a) the cysteine content and (b) the mean weight of the amino acids used in each protein. The second trend correlated with the global levels of hydropathy and aromaticity of each protein. Both axes might be related with the defense of the parasite to oxygen free radicals. Finally, the third trend correlated with the expressivity of each gene, indicating that in G. lamblia highly expressed sequences display a tendency to preferentially use a subset of the total amino acids.
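The per-protein quantities that the trends correlate with can be computed directly from a sequence. The sketch below does not perform correspondence analysis; it only derives three descriptors named in the abstract. It assumes the Kyte-Doolittle scale for hydropathy and the common definition of aromaticity as the fraction of Phe, Trp and Tyr residues; the abstract does not state which definitions the authors used.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def composition_descriptors(protein: str) -> dict:
    """Return cysteine fraction, mean hydropathy (GRAVY) and aromaticity."""
    seq = protein.upper()
    if not seq:
        raise ValueError("empty sequence")
    n = len(seq)
    return {
        "cys_fraction": seq.count("C") / n,
        # A KeyError here signals a non-standard residue code.
        "gravy": sum(KYTE_DOOLITTLE[aa] for aa in seq) / n,
        "aromaticity": sum(seq.count(aa) for aa in "FWY") / n,
    }


print(composition_descriptors("MKCCFWYL"))
```

A table of such descriptors, one row per coding sequence, is the kind of input against which the axes of a correspondence analysis can then be correlated.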
The bglA Gene of Aspergillus kawachii Encodes Both Extracellular and Cell Wall-Bound β-Glucosidases
Iwashita, Kazuhiro; Nagahara, Tatsuya; Kimura, Hitoshi; Takano, Makoto; Shimoi, Hitoshi; Ito, Kiyoshi
1999-01-01
We cloned the genomic DNA and cDNA of bglA, which encodes β-glucosidase in Aspergillus kawachii, based on a partial amino acid sequence of purified cell wall-bound β-glucosidase CB-1. The nucleotide sequence of the cloned bglA gene revealed a 2,933-bp open reading frame with six introns that encodes an 860-amino-acid protein. Based on the deduced amino acid sequence, we concluded that the bglA gene encodes cell wall-bound β-glucosidase CB-1. The amino acid sequence exhibited high levels of homology with the amino acid sequences of fungal β-glucosidases classified in subfamily B. We expressed the bglA cDNA in Saccharomyces cerevisiae and detected the recombinant β-glucosidase in the periplasm fraction of the recombinant yeast. A. kawachii can produce two extracellular β-glucosidases (EX-1 and EX-2) in addition to the cell wall-bound β-glucosidase. A. kawachii in which the bglA gene was disrupted produced none of the three β-glucosidases, as determined by enzyme assays and a Western blot analysis. Thus, we concluded that the bglA gene encodes both extracellular and cell wall-bound β-glucosidases in A. kawachii. PMID:10584016
Method for identifying and quantifying nucleic acid sequence aberrations
Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.
1998-01-01
A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.
Method for identifying and quantifying nucleic acid sequence aberrations
Lucas, J.N.; Straume, T.; Bogen, K.T.
1998-07-21
A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.
Characterization of tannase protein sequences of bacteria and fungi: an in silico study.
Banerjee, Amrita; Jana, Arijit; Pati, Bikash R; Mondal, Keshab C; Das Mohapatra, Pradeep K
2012-04-01
The tannase protein sequences of 149 bacteria and 36 fungi were retrieved from the NCBI database. Of these, only the 77 bacterial and 31 fungal tannase sequences with different amino acid compositions were retained. These sequences were analysed for physical and chemical properties, superfamily membership, multiple sequence alignment, phylogenetic tree construction and motif finding, in order to identify functional motifs and the evolutionary relationships among them. The superfamily search for these tannases revealed the occurrence of proline iminopeptidase-like, biotin biosynthesis protein BioH, O-acetyltransferase, carboxylesterase/thioesterase 1, carbon-carbon bond hydrolase, haloperoxidase, prolyl oligopeptidase, C-terminal domain and mycobacterial antigens families and the alpha/beta hydrolase superfamily. Some bacterial and fungal sequences showed similarity with different families individually. The multiple sequence alignment of these tannase protein sequences showed conserved regions at different stretches, with maximum homology at amino acid residues 389-469 and 482-523, which could be used for designing degenerate primers or probes specific for tannase-producing bacterial and fungal species. The phylogenetic tree showed two different clusters: one contains only bacteria and the other contains both fungi and bacteria, showing some relationship between these different genera. In the second cluster, however, nearly all fungal species were found together in one corner, which indicates sequence-level similarity among fungal genera. Analysis of the distribution of fourteen motifs revealed that Motif 1, with a signature sequence of 29 amino acids (GCSTGGREALKQAQRWPHDYDGIIANNPA), was uniformly observed in 83.3% of the studied tannase sequences, suggesting its participation in the structure and enzymatic function.
Hegeman, Carla E.; Grabau, Elizabeth A.
2001-01-01
Phytic acid (myo-inositol hexakisphosphate) is the major storage form of phosphorus in plant seeds. During germination, stored reserves are used as a source of nutrients by the plant seedling. Phytic acid is degraded by the activity of phytases to yield inositol and free phosphate. Due to the lack of phytases in the non-ruminant digestive tract, monogastric animals cannot utilize dietary phytic acid and it is excreted into manure. High phytic acid content in manure results in elevated phosphorus levels in soil and water and accompanying environmental concerns. The use of phytases to degrade seed phytic acid has potential for reducing the negative environmental impact of livestock production. A phytase was purified to electrophoretic homogeneity from cotyledons of germinated soybeans (Glycine max L. Merr.). Peptide sequence data generated from the purified enzyme facilitated the cloning of the phytase sequence (GmPhy) employing a polymerase chain reaction strategy. The introduction of GmPhy into soybean tissue culture resulted in increased phytase activity in transformed cells, which confirmed the identity of the phytase gene. It is surprising that the soybean phytase was unrelated to previously characterized microbial or maize (Zea mays) phytases, which were classified as histidine acid phosphatases. The soybean phytase sequence exhibited a high degree of similarity to purple acid phosphatases, a class of metallophosphoesterases. PMID:11500558
An oleate 12-hydroxylase from Ricinus communis L. is a fatty acyl desaturase homolog
DOE Office of Scientific and Technical Information (OSTI.GOV)
Van De Loo, F.J.; Broun, P.; Turner, S.
1995-07-18
Recent spectroscopic evidence implicating a binuclear iron site at the reaction center of fatty acyl desaturases suggested to us that certain fatty acyl hydroxylases may share significant amino acid sequence similarity with desaturases. To test this theory, we prepared a cDNA library from developing endosperm of the castor-oil plant (Ricinus communis L.) and obtained partial nucleotide sequences for 468 anonymous clones that were not expressed at high levels in leaves, a tissue deficient in 12-hydroxyoleic acid. This resulted in the identification of several cDNA clones encoding a polypeptide of 387 amino acids with a predicted molecular weight of 44,407 and with approximately 67% sequence homology to microsomal oleate desaturase from Arabidopsis. Expression of a full-length clone under control of the cauliflower mosaic virus 35S promoter in transgenic tobacco resulted in the accumulation of low levels of 12-hydroxyoleic acid in seeds, indicating that the clone encodes the castor oleate hydroxylase. These results suggest that fatty acyl desaturases and hydroxylases share similar reaction mechanisms and provide an example of enzyme evolution. 26 refs., 6 figs., 1 tab.
Somerville, Chris; Broun, Pierre; van de Loo, Frank
2001-01-01
This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants. In addition, the use of genes encoding fatty acid hydroxylases or desaturases to alter the level of lipid fatty acid unsaturation in transgenic plants is described.
Cotten, Matthew; Oude Munnink, Bas; Canuti, Marta; Deijs, Martin; Watson, Simon J; Kellam, Paul; van der Hoek, Lia
2014-01-01
We have developed a full genome virus detection process that combines sensitive nucleic acid preparation optimised for virus identification in fecal material with Illumina MiSeq sequencing and a novel post-sequencing virus identification algorithm. Enriched viral nucleic acid was converted to double-stranded DNA and subjected to Illumina MiSeq sequencing. The resulting short reads were processed with a novel iterative Python algorithm SLIM for the identification of sequences with homology to known viruses. De novo assembly was then used to generate full viral genomes. The sensitivity of this process was demonstrated with a set of fecal samples from HIV-1 infected patients. A quantitative assessment of the mammalian, plant, and bacterial virus content of this compartment was generated and the deep sequencing data were sufficient to assemble 12 complete viral genomes from 6 virus families. The method detected high levels of enteropathic viruses that are normally controlled in healthy adults, but may be involved in the pathogenesis of HIV-1 infection and will provide a powerful tool for virus detection and for analyzing changes in the fecal virome associated with HIV-1 progression and pathogenesis.
Cotten, Matthew; Oude Munnink, Bas; Canuti, Marta; Deijs, Martin; Watson, Simon J.; Kellam, Paul; van der Hoek, Lia
2014-01-01
We have developed a full genome virus detection process that combines sensitive nucleic acid preparation optimised for virus identification in fecal material with Illumina MiSeq sequencing and a novel post-sequencing virus identification algorithm. Enriched viral nucleic acid was converted to double-stranded DNA and subjected to Illumina MiSeq sequencing. The resulting short reads were processed with a novel iterative Python algorithm SLIM for the identification of sequences with homology to known viruses. De novo assembly was then used to generate full viral genomes. The sensitivity of this process was demonstrated with a set of fecal samples from HIV-1 infected patients. A quantitative assessment of the mammalian, plant, and bacterial virus content of this compartment was generated and the deep sequencing data were sufficient to assemble 12 complete viral genomes from 6 virus families. The method detected high levels of enteropathic viruses that are normally controlled in healthy adults, but may be involved in the pathogenesis of HIV-1 infection and will provide a powerful tool for virus detection and for analyzing changes in the fecal virome associated with HIV-1 progression and pathogenesis. PMID:24695106
Tan, Yen Hock; Huang, He; Kihara, Daisuke
2006-08-15
Aligning distantly related protein sequences is a long-standing problem in bioinformatics, and a key for successful protein structure prediction. Its importance is increasing recently in the context of structural genomics projects because more and more experimentally solved structures are available as templates for protein structure modeling. Toward this end, recent structure prediction methods employ profile-profile alignments, and various ways of aligning two profiles have been developed. More fundamentally, a better amino acid similarity matrix can improve a profile itself; thereby resulting in more accurate profile-profile alignments. Here we have developed novel amino acid similarity matrices from knowledge-based amino acid contact potentials. Contact potentials are used because the contact propensity to the other amino acids would be one of the most conserved features of each position of a protein structure. The derived amino acid similarity matrices are tested on benchmark alignments at three different levels, namely, the family, the superfamily, and the fold level. Compared to BLOSUM45 and the other existing matrices, the contact potential-based matrices perform comparably in the family level alignments, but clearly outperform in the fold level alignments. The contact potential-based matrices perform even better when suboptimal alignments are considered. Comparing the matrices themselves with each other revealed that the contact potential-based matrices are very different from BLOSUM45 and the other matrices, indicating that they are located in a different basin in the amino acid similarity matrix space.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification.
Sinclair, Robert M; Ravantti, Janne J; Bamford, Dennis H
2017-04-15
Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. 
Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. Copyright © 2017 Sinclair et al.
Nucleic and Amino Acid Sequences Support Structure-Based Viral Classification
Sinclair, Robert M.; Ravantti, Janne J.
2017-01-01
ABSTRACT Viral capsids ensure viral genome integrity by protecting the enclosed nucleic acids. Interactions between the genome and capsid and between individual capsid proteins (i.e., capsid architecture) are intimate and are expected to be characterized by strong evolutionary conservation. For this reason, a capsid structure-based viral classification has been proposed as a way to bring order to the viral universe. The seeming lack of sufficient sequence similarity to reproduce this classification has made it difficult to reject structural convergence as the basis for the classification. We reinvestigate whether the structure-based classification for viral coat proteins making icosahedral virus capsids is in fact supported by previously undetected sequence similarity. Since codon choices can influence nascent protein folding cotranslationally, we searched for both amino acid and nucleotide sequence similarity. To demonstrate the sensitivity of the approach, we identify a candidate gene for the pandoravirus capsid protein. We show that the structure-based classification is strongly supported by amino acid and also nucleotide sequence similarities, suggesting that the similarities are due to common descent. The correspondence between structure-based and sequence-based analyses of the same proteins shown here allow them to be used in future analyses of the relationship between linear sequence information and macromolecular function, as well as between linear sequence and protein folds. IMPORTANCE Viral capsids protect nucleic acid genomes, which in turn encode capsid proteins. This tight coupling of protein shell and nucleic acids, together with strong functional constraints on capsid protein folding and architecture, leads to the hypothesis that capsid protein-coding nucleotide sequences may retain signatures of ancient viral evolution. We have been able to show that this is indeed the case, using the major capsid proteins of viruses forming icosahedral capsids. 
Importantly, we detected similarity at the nucleotide level between capsid protein-coding regions from viruses infecting cells belonging to all three domains of life, reproducing a previously established structure-based classification of icosahedral viral capsids. PMID:28122979
M Naresh Kumar, C V; Anthony Johnson, A M; R Sai Gopal, D V
2007-12-01
Chikungunya virus has caused numerous large outbreaks in India. Suspected blood samples from the epidemic were collected from the Rayalaseema region of Andhra Pradesh and characterized to identify the causative agent. RT-PCR was used for screening of suspected blood samples. Primers were designed to amplify a partial E1 gene, and the amplified fragment was cloned and sequenced. The sequence was analyzed and compared with other geographical isolates to determine the phylogenetic relationship. The sequence was submitted to the GenBank DNA database (accession DQ888620). Comparative homology analysis of the AP Ra-CTR isolate (CHIKAPRa-CTR) with other isolates of Chikungunya virus revealed 94.7 ± 3.6% homology at the nucleotide level and 96.8 ± 3.2% homology at the amino acid level. The current epidemic was caused by the Central African genotype of CHIKV, grouped in the Central Africa cluster in phylogenetic trees generated from nucleotide and amino acid sequences.
Method for isolating chromosomal DNA in preparation for hybridization in suspension
Lucas, Joe N.
2000-01-01
A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. Chromosomal DNA in a sample containing cell debris is prepared for hybridization in suspension by treating the mixture with RNase. The treated DNA can also be fixed prior to hybridization.
Phylogenetic analysis of Hungarian goose parvovirus isolates and vaccine strains.
Tatár-Kis, Tímea; Mató, Tamás; Markos, Béla; Palya, Vilmos
2004-08-01
Polymerase chain reaction and sequencing were used to analyse goose parvovirus field isolates and vaccine strains. Two fragments of the genome were amplified. Fragment "A" represents a region of VP3 gene, while fragment "B" represents a region upstream of the VP3 gene, encompassing part of the VP1 gene. In the region of fragment "A" the deduced amino acid sequence of the strains was identical, therefore differentiation among strains could be done only at the nucleotide level, which resulted in the formation of three groups: Hungarian, West-European and Asian strains. In the region of fragment "B", separation of groups could be done by both nucleotide and deduced amino acid sequence level. The nucleotide sequences resulted in the same groups as for fragment "A" but with a different clustering pattern among the Hungarian strains. Within the "Hungarian" group most of the recent field isolates fell into one cluster, very closely related or identical to each other, indicating a very slow evolutionary change. The attenuated strains and field isolates from 1979/80 formed a separate cluster. When vaccine strains and field isolates were compared, two specific amino acid differences were found that can be considered as possible markers for vaccinal strains. Sequence analysis of fragment "B" seems to be a suitable method for differentiation of attenuated vaccine strains from virulent strains. Copyright 2004 Houghton Trust Ltd
Chen, Yuhuang; Duan, Ran; Li, Xu; Li, Kewei; Liang, Junrong; Liu, Chang; Qiu, Haiyan; Xiao, Yuchun; Jing, Huaiqi; Wang, Xin
2015-12-01
The outer membrane protein A (OmpA) is one of the intra-species conserved proteins with immunogenicity widely found in the family Enterobacteriaceae. Here we first confirmed that OmpA is conserved in the three pathogenic Yersinia (Yersinia pestis, Yersinia pseudotuberculosis and pathogenic Yersinia enterocolitica), with high homology at the nucleotide level and at the amino acid sequence level. The ompA sequence identities for 262 Y. pestis strains, 134 Y. pseudotuberculosis strains and 219 pathogenic Y. enterocolitica strains are 100%, 98.8% and 97.7%, respectively. The main patterns of OmpA of pathogenic Yersinia are 86.2% and 88.8% identical at the nucleotide and amino acid sequence levels, respectively. Immunological analysis showed the immunogenicity of each OmpA and the cross-immunogenicity of OmpA among pathogenic Yersinia, suggesting that OmpA may be a vaccine candidate for Y. pestis and other pathogenic Yersinia. Copyright © 2015 Elsevier Ltd. All rights reserved.
[Cloning and bioinformatics analysis of abscisic acid 8'-hydroxylase from Pseudostellariae Radix].
Li, Jun; Long, Deng-Kai; Zhou, Tao; Ding, Ling; Zheng, Wei; Jiang, Wei-Ke
2016-07-01
Abscisic acid 8'-hydroxylase is one of the key enzymes in the metabolism of abscisic acid (ABA). Seven abscisic acid 8'-hydroxylase members were identified from Pseudostellaria heterophylla transcriptome sequencing results by sequence homology. The expression profiles of these genes were analyzed using transcriptome data. The coding sequence of ABA8ox1 was cloned and analyzed bioinformatically. The full-length cDNA of ABA8ox1 was 1,401 bp, with 480 encoded amino acids. The predicted isoelectric point (pI) and relative molecular mass (MW) were 8.55 and 53 kDa, respectively. Transmembrane structure analysis showed that 21 amino acids were located inside and 445 amino acids outside the membrane. High transcript levels were detected in root bark and fibrous roots. Multiple alignment and phylogenetic analysis both showed that ABA8ox1 had high similarity with the CYP707As from other plants, especially AtCYP707A1 and AtCYP707A3 in Arabidopsis thaliana. These results lay a foundation for studying the molecular mechanism of tuberous root expansion and the response to adversity stress. Copyright© by the Chinese Pharmaceutical Association.
Yarimizu, Tohru; Nakamura, Mikiko; Hoshida, Hisashi; Akada, Rinji
2015-02-14
Targeting of cellular proteins to the extracellular environment is directed by a secretory signal sequence located at the N-terminus of a secretory protein. These signal sequences usually contain an N-terminal basic amino acid followed by a stretch containing hydrophobic residues, although no consensus signal sequence has been identified. In this study, simple modeling of signal sequences was attempted using Gaussia princeps secretory luciferase (GLuc) in the yeast Kluyveromyces marxianus, which allowed comprehensive recombinant gene construction to substitute synthetic signal sequences. Mutational analysis of the GLuc signal sequence revealed that the GLuc hydrophobic peptide length was at the lower limit for effective secretion and that the N-terminal basic residue was indispensable. Deletion of the 16th Glu caused enhanced levels of secreted protein, suggesting that this hydrophilic residue defined the boundary of a hydrophobic peptide stretch. Consequently, we redesigned this domain as a repeat of a single hydrophobic amino acid between the N-terminal Lys and C-terminal Glu. Stretches consisting of Phe, Leu, Ile, or Met were effective for secretion, but the number of residues affected secretory activity. A stretch containing sixteen consecutive methionine residues (M16) showed the highest activity; the M16 sequence was therefore utilized for the secretory production of human leukemia inhibitory factor protein in yeast, resulting in enhanced secreted protein yield. We present a new concept for the provision of secretory signal sequence ability in the yeast K. marxianus, determined by the number of residues of a single hydrophobic residue located between N-terminal basic and C-terminal acidic amino acid boundaries.
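The redesigned domain described above, a repeat of one hydrophobic residue bounded by Lys and Glu, can be written down as a small string-building sketch. This is a hypothetical illustration only: the function name is invented, and any residues that flank the Lys or Glu in the actual constructs are not stated in the abstract and are deliberately left out.

```python
# Hydrophobic residues reported as effective for secretion in the abstract.
EFFECTIVE_RESIDUES = {"F", "L", "I", "M"}


def synthetic_core(residue: str, repeats: int) -> str:
    """Build Lys + (residue x repeats) + Glu in one-letter amino acid code.

    Hypothetical helper: it models only the bounded hydrophobic stretch,
    not a complete signal peptide.
    """
    if residue not in EFFECTIVE_RESIDUES:
        raise ValueError("expected one of F, L, I or M")
    if repeats < 1:
        raise ValueError("repeats must be positive")
    return "K" + residue * repeats + "E"


# The M16 design: sixteen consecutive Met between Lys and Glu.
print(synthetic_core("M", 16))  # KMMMMMMMMMMMMMMMME
```

Varying `repeats` gives the series of stretch lengths whose effect on secretory activity the study compared.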
Xie, P; Wan, X P; Bu, Z; Zou, X T
2016-11-01
Ghrelin and cholecystokinin (CCK) are multifunctional peptides. In the current study, complete sequences of ghrelin (800 bp) and CCK (739 bp) were cloned for the first time in Columba livia by using the rapid amplification of cDNA ends (RACE) method. The open reading frames of ghrelin (351 bp) and CCK (393 bp) encoded 116 amino acids and 130 amino acids, respectively. Sequence comparison indicated that pigeon ghrelin and CCK shared high identity with those reported in other avian species. Quantitative real-time PCR analysis found that ghrelin and CCK mRNAs were expressed in three intestinal segments of pigeon during development. Both ghrelin and CCK showed generally higher expression at posthatch days than in embryonic periods, regardless of intestinal segment. In duodenum and ileum, the expression of ghrelin and CCK mRNA reached peak values at 8 d posthatch. Jejunum CCK mRNA level increased linearly after hatching, and reached the highest point at posthatch 28 d. The effects of long chain fatty acids (LCFAs) on pigeon ghrelin and CCK expression were also investigated in vitro. Higher concentrations (50 μM or 250 μM) of linoleic acid, α-linolenic acid or arachidonic acid significantly increased ghrelin mRNA level in pigeon jejunum. However, for oleic acid, the induction of ghrelin gene expression needed a lower concentration (5 μM). Linoleic acid, α-linolenic acid or arachidonic acid at 5 μM and palmitic acid at 250 μM repressed CCK expression significantly. A higher concentration (250 μM) of oleic acid or α-linolenic acid up-regulated CCK mRNA level significantly. Our results indicated that ghrelin and CCK may play key roles in pigeon intestine development and that their expression could be regulated by LCFAs. © 2016 Poultry Science Association Inc.
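The relationship between the reported open reading frame lengths and the encoded protein lengths can be checked with a short calculation. The sketch assumes, as is conventional, that the stated ORF length includes the stop codon; the function name is hypothetical.

```python
def encoded_residues(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of `orf_bp` base pairs."""
    if orf_bp <= 0 or orf_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_bp // 3
    # The stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


print(encoded_residues(351))  # 116 (ghrelin)
print(encoded_residues(393))  # 130 (CCK)
```

Both results agree with the 116 and 130 amino acids given in the abstract.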
Molecular identification of catalases from Nicotiana plumbaginifolia (L.).
Willekens, H; Villarroel, R; Van Montagu, M; Inzé, D; Van Camp, W
1994-09-19
We have isolated three different catalase cDNAs from Nicotiana plumbaginifolia (cat1, cat2, and cat3) and a partial sequence of a fourth catalase gene (cat4) that shows no discernible expression based on Northern analysis. The catalase sequences were used to determine the similarity with other plant catalases and to study the transcriptional response to paraquat, 3-aminotriazole, and salicylic acid. 3-Aminotriazole induces mRNA levels of cat1, cat2 and cat3, indicating that a reduction in catalase activity positively affects catalase mRNA abundance. Salicylic acid that binds catalase in vitro, had no effect on catalase transcript levels at physiological concentrations. Paraquat resulted in the induction of cat1.
Molecular identification of Mango, Mangifera indica L.var. totupura
Jagarlamudi, Sankar; G, Rosaiah; Kurapati, Ravi Kumar; Pinnamaneni, Rajasekhar
2011-01-01
Mango (Mangifera indica) belonging to Anacardiaceae family is a fruit that grows in tropical regions. It is considered as the King of fruits. The present work was taken up to identify a tool in identifying the mango species at the molecular level. The chloroplast trnL-F region was amplified from extracted total genomic DNA using the polymerase chain reaction (PCR) and sequenced. Sequence of the dominant DGGE band revealed that Mangifera indica in tested leaves was Mangifera indica (100% similarity to the ITS sequences of Mangifera indica). This sequence was deposited in NCBI with the accession no. GQ927757. Abbreviations AFLP - Amplified fragment length polymorphism, cpDNA - Chloroplast DNA, DGGE - Denaturing gradient gel electrophoresis, DNA - Deoxyribonucleic acid, EDTA - Ethylenediamine tetraacetic acid, HCl - Hydrochloric acid, ISSR - Inter simple sequence repeats, ITS - Internal transcribed spacer, MATAB - Methyl Ammonium Bromide, Na2SO3 - Sodium sulphite, NaCl - Sodium chloride, NCBI - National Centre for Biotechnology Information, PCR - Polymerase chain reaction, PEG - Polyethylene glycol, RAPD - Randomly amplified polymorphic DNA, trnL-F - Transfer RNA genes start codon- termination codon. PMID:21423885
Isolation and characterization of the chicken trypsinogen gene family.
Wang, K; Gan, L; Lee, I; Hood, L
1995-01-01
Based on genomic Southern hybridizations and cDNA sequence analyses, the chicken trypsinogen gene family can be divided into two multi-member subfamilies, a six-member trypsinogen I subfamily which encodes the cationic trypsin isoenzymes and a three-member trypsinogen II subfamily which encodes the anionic trypsin isoenzymes. The chicken cDNA and genomic clones containing these two subfamilies were isolated and characterized by DNA sequence analysis. The results indicated that the chicken trypsinogen genes encoded a signal peptide of 15 to 16 amino acid residues, an activation peptide of 9 to 10 residues and a trypsin of 223 amino acid residues. The chicken trypsinogens contain all the common catalytic and structural features for trypsins, including the catalytic triad His, Asp and Ser and the six disulphide bonds. The trypsinogen I and II subfamilies share approximately 70% sequence identity at the nucleotide and amino acid level. The sequence comparison among chicken trypsinogen subfamily members and trypsin sequences from other species suggested that the chicken trypsinogen genes may have evolved in coincidental or concerted fashion. PMID:7733885
Cloning and baculovirus expression of a desiccation stress gene from the beetle, Tenebrio molitor.
Graham, L A; Bendena, W G; Walker, V K
1996-02-01
The cDNA sequence encoding a novel desiccation stress protein (dsp28) found in the hemolymph of the common yellow mealworm beetle, Tenebrio molitor, has been determined. The sequence encodes a 225 amino acid protein containing a 20 amino acid signal peptide. Dsp28 shows no significant similarity to any known nucleic acid or protein sequence. Levels of dsp28 mRNA were found to increase approx 5-fold following desiccation. Dsp28 cDNA has been cloned into a baculovirus expression vector and the expressed protein was compared to native dsp28. Both dsp28 expressed by recombinant baculovirus and native dsp28 are glycosylated and N-terminally processed. Although dsp28 is induced by cold in addition to desiccation stress, it does not contribute to the freezing point depression (thermal hysteresis) observed in Tenebrio hemolymph.
Production of hydroxylated fatty acids in genetically modified plants
Somerville, Chris [Portola Valley, CA]; Broun, Pierre [Burlingame, CA]; van de Loo, Frank [Weston, AU]; Boddupalli, Sekhar S [Manchester, MI]
2011-08-23
This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants. In addition, the use of genes encoding fatty acid hydroxylases or desaturases to alter the level of lipid fatty acid unsaturation in transgenic plants is described.
Production of hydroxylated fatty acids in genetically modified plants
Somerville, Chris; Broun, Pierre; van de Loo, Frank; Boddupalli, Sekhar S.
2005-08-30
This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants. In addition, the use of genes encoding fatty acid hydroxylases or desaturases to alter the level of lipid fatty acid unsaturation in transgenic plants is described.
Nucleic acid sequence detection using multiplexed oligonucleotide PCR
Nolan, John P [Santa Fe, NM]; White, P Scott [Los Alamos, NM]
2006-12-26
Methods for rapidly detecting single or multiple sequence alleles in a sample nucleic acid are described. Provided are all of the oligonucleotide pairs capable of annealing specifically to a target allele and discriminating among possible sequences thereof, and ligating to each other to form an oligonucleotide complex when a particular sequence feature is present (or, alternatively, absent) in the sample nucleic acid. The design of each oligonucleotide pair permits the subsequent high-level PCR amplification of a specific amplicon when the oligonucleotide complex is formed, but not when the oligonucleotide complex is not formed. The presence or absence of the specific amplicon is used to detect the allele. Detection of the specific amplicon may be achieved using a variety of methods well known in the art, including without limitation, oligonucleotide capture onto DNA chips or microarrays, oligonucleotide capture onto beads or microspheres, electrophoresis, and mass spectrometry. Various labels and address-capture tags may be employed in the amplicon detection step of multiplexed assays, as further described herein.
Molecular Characterization of an Avian Astrovirus
Koci, Matthew D.; Seal, Bruce S.; Schultz-Cherry, Stacey
2000-01-01
Astroviruses are known to cause enteric disease in several animal species, including turkeys. However, only human astroviruses have been well characterized at the nucleotide level. Herein we report the nucleotide sequence, genomic organization, and predicted amino acid sequence of a turkey astrovirus isolated from poults with an emerging enteric disease. PMID:10846102
Tao, Junjie; Feng, Chao; Ai, Bin; Kang, Ming
2016-01-01
Background and Aims Limestone karst areas possess high floral diversity and endemism. The genus Primulina, which contributes to the unique calcicole flora, has high species richness and exhibits specific soil-based habitat associations that are mainly distributed on calcareous karst soils. The adaptive molecular evolutionary mechanism of the genus to karst calcium-rich environments is still not well understood. The Ca2+-permeable channel TPC1 was used in this study to test whether its gene is involved in the local adaptation of Primulina to karst high-calcium soil environments. Methods Specific amplification and sequencing primers were designed and used to amplify the full-length coding sequences of TPC1 from cDNA of 76 Primulina species. The sequence alignment without recombination and the corresponding reconstructed phylogeny tree were used in molecular evolutionary analyses at the nucleic acid level and amino acid level, respectively. Finally, the identified sites under positive selection were labelled on the predicted secondary structure of TPC1. Key Results Seventy-six full-length coding sequences of Primulina TPC1 were obtained. The length of the sequences varied between 2220 and 2286 bp and the insertion/deletion was located at the 5′ end of the sequences. No signal of substitution saturation was detected in the sequences, while significant recombination breakpoints were detected. The molecular evolutionary analyses showed that TPC1 was dominated by purifying selection and the selective pressures were not significantly different among species lineages. However, significant signals of positive selection were detected at both TPC1 codon level and amino acid level, and five sites under positive selective pressure were identified by at least three different methods. Conclusions The Ca2+-permeable channel TPC1 may be involved in the local adaptation of Primulina to karst Ca2+-rich environments. Different species lineages suffered similar selective pressure associated with calcium in karst environments, and episodic diversifying selection at a few sites may play a major role in the molecular evolution of Primulina TPC1. PMID:27582362
USDA-ARS's Scientific Manuscript database
Seeds of Momordica charantia (bitter melon) produce high levels of eleostearic acid, an unusual conjugated fatty acid with industrial value. Deep sequencing of non-normalized and normalized cDNAs from developing bitter melon seeds was conducted to uncover key genes required for biotechnological tran...
Prasad, B. C. Narasimha; Kumar, Vinod; Gururaj, H. B.; Parimalan, R.; Giridhar, P.; Ravishankar, G. A.
2006-01-01
Capsaicin is a unique alkaloid of the plant kingdom restricted to the genus Capsicum. Capsaicin is the pungency factor, a bioactive molecule of food and of medicinal importance. Capsaicin is useful as a counterirritant, antiarthritic, analgesic, antioxidant, and anticancer agent. Capsaicin biosynthesis involves condensation of vanillylamine and 8-methyl nonenoic acid, brought about by capsaicin synthase (CS). We found that CS activity correlated with genotype-specific capsaicin levels. We purified and characterized CS (≈35 kDa). Immunolocalization studies confirmed that CS is specifically localized to the placental tissues of Capsicum fruits. Western blot analysis revealed concomitant enhancement of CS levels and capsaicin accumulation during fruit development. We determined the N-terminal amino acid sequence of purified CS, cloned the CS gene (csy1) and sequenced full-length cDNA (981 bp). The deduced amino acid sequence of CS from full-length cDNA was 38 kDa. Functionality of csy1 through heterologous expression in recombinant Escherichia coli was also demonstrated. Here we report the gene responsible for capsaicin biosynthesis, which is unique to Capsicum spp. With this information on the CS gene, speculation on the gene for pungency is unequivocally resolved. Our findings have implications in the regulation of capsaicin levels in Capsicum genotypes. PMID:16938870
Song, Zhewei; Du, Hai; Zhang, Yan; Xu, Yan
2017-01-01
Fermentation microbiota are specific microorganisms that generate different types of metabolites in many production processes. In traditional solid-state fermentation, the structural composition and functional capacity of the core microbiota determine the quality and quantity of products. As a typical example of food fermentation, Chinese Maotai-flavor liquor production involves a complex of various microorganisms and a wide variety of metabolites. However, the microbial succession and functional shift of the core microbiota in this traditional food fermentation remain unclear. Here, high-throughput amplicon sequencing (16S rRNA gene amplicon sequencing and internal transcribed spacer amplicon sequencing) and metatranscriptomics sequencing technologies were combined to reveal the structure and function of the core microbiota in Chinese soy sauce aroma type liquor production. In addition, ultra-performance liquid chromatography and headspace-solid phase microextraction-gas chromatography-mass spectrometry were employed to provide qualitative and quantitative analysis of the major flavor metabolites. A total of 10 fungal and 11 bacterial genera were identified as the core microbiota. In addition, metatranscriptomic analysis revealed pyruvate metabolism in yeasts (genera Pichia, Schizosaccharomyces, Saccharomyces, and Zygosaccharomyces) and lactic acid bacteria (genus Lactobacillus) classified into two stages in the production of flavor components. Stage I involved high-level alcohol (ethanol) production, with the genus Schizosaccharomyces serving as the core functional microorganism. Stage II involved high-level acid (lactic acid and acetic acid) production, with the genus Lactobacillus serving as the core functional microorganism. The functional shift from the genus Schizosaccharomyces to the genus Lactobacillus drives flavor component conversion from alcohol (ethanol) to acid (lactic acid and acetic acid) in Chinese Maotai-flavor liquor production. Our findings provide insight into the effects of the core functional microbiota in soy sauce aroma type liquor production and the characteristics of the fermentation microbiota under different environmental conditions. PMID:28769888
Voelker, T A; Staswick, P; Chrispeels, M J
1986-12-01
Phytohemagglutinin (PHA), the seed lectin of the common bean, Phaseolus vulgaris, is encoded by two highly homologous, tandemly linked genes, dlec1 and dlec2, which are coordinately expressed at high levels in developing cotyledons. Their respective transcripts translate into closely related polypeptides, PHA-E and PHA-L, constituents of the tetrameric lectin which accumulates at high levels in developing seeds. In the bean cultivar Pinto UI111, PHA-E is not detectable, and PHA-L accumulates at very reduced levels. To investigate the cause of the Pinto phenotype, we cloned and sequenced the two PHA genes of Pinto, called Pdlec1 and Pdlec2, and determined the abundance of their respective mRNAs in developing cotyledons. Both genes are more than 90% homologous to the normal PHA genes found in other cultivars. Pdlec1 carries a 1-bp frameshift mutation close to the 5' end of its coding sequence. Only very truncated polypeptides could be made from its mRNA. The gene Pdlec2 encodes a polypeptide, which resembles PHA-L and its predicted amino acid sequence agrees with the available Pinto PHA amino acid sequence data. Analysis of the mRNA of developing cotyledons revealed that the Pdlec1 message is reduced 600-fold, and Pdlec2 mRNA is reduced 20-fold with respect to mRNA levels in normal cultivars. A comparison of the sequences which are upstream from the coding sequence shows that Pdlec2 has a 100-bp deletion compared to the other genes (dlec1, dlec2 and Pdlec1). This deletion which contains a large tandem repeat may be responsible for the low level of expression of Pdlec2. The very low expression of Pdlec1 is as yet unexplained.
Stolterfoht, Holly; Schwendenwein, Daniel; Sensen, Christoph W; Rudroff, Florian; Winkler, Margit
2017-09-10
Increasing demand for chemicals from renewable resources calls for the development of new biotechnological methods for the reduction of oxidized bio-based compounds. Enzymatic carboxylate reduction is highly selective, both in terms of chemo- and product selectivity, but not many carboxylate reductase enzymes (CARs) have been identified on the sequence level to date. Thus far, their phylogeny is unexplored and very little is known about their structure-function-relationship. CARs minimally contain an adenylation domain, a phosphopantetheinylation domain and a reductase domain. We have recently identified new enzymes of fungal origin, using similarity searches against genomic sequences from organisms in which aldehydes were detected upon incubation with carboxylic acids. Analysis of sequences with known CAR functionality and CAR enzymes recently identified in our laboratory suggests that the three-domain architecture mentioned above is modular. The construction of a distance tree with a subsequent 1000-replicate bootstrap analysis showed that the CAR sequences included in our study fall into four distinct subgroups (one of bacterial origin and three of fungal origin, respectively), each with a bootstrap value of 100%. The multiple sequence alignment of all experimentally confirmed CAR protein sequences revealed fingerprint sequences of residues which are likely to be involved in substrate and co-substrate binding and one of the three catalytic substeps, respectively. The fingerprint sequences broaden our understanding of the amino acids that might be essential for the reduction of organic acids to the corresponding aldehydes in CAR proteins. Copyright © 2017 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Akileswaran, L.; Brock, B.J.; Cereghino, J.L.
1999-02-01
A cDNA clone encoding a quinone reductase (QR) from the white rot basidiomycete Phanerochaete chrysosporium was isolated and sequenced. The cDNA consisted of 1,007 nucleotides and a poly(A) tail and encoded a deduced protein containing 271 amino acids. The experimentally determined eight-amino-acid N-terminal sequence of the purified QR protein from P. chrysosporium matched amino acids 72 to 79 of the predicted translation product of the cDNA. The M_r of the predicted translation product, beginning with Pro-72, was essentially identical to the experimentally determined M_r of one monomer of the QR dimer, and this finding suggested that QR is synthesized as a proenzyme. The results of in vitro transcription-translation experiments suggested that QR is synthesized as a proenzyme with a 71-amino-acid leader sequence. This leader sequence contains two potential KEX2 cleavage sites and numerous potential cleavage sites for dipeptidyl aminopeptidase. The QR activity in cultures of P. chrysosporium increased following the addition of 2-dimethoxybenzoquinone, vanillic acid, or several other aromatic compounds. An immunoblot analysis indicated that induction resulted in an increase in the amount of QR protein, and a Northern blot analysis indicated that this regulation occurs at the level of the qr mRNA.
Zheng, Ling; Shockey, Jay; Bian, Fei; Chen, Gao; Shan, Lei; Li, Xinguo; Wan, Shubo; Peng, Zhenying
2017-01-01
Diacylglycerol acyltransferase (DGAT) catalyzes the final step in triacylglycerol (TAG) biosynthesis via the acyl-CoA-dependent acylation of diacylglycerol. This reaction is a major control point in the Kennedy pathway for biosynthesis of TAG, which is the most important form of stored metabolic energy in most oil-producing plants. In this study, Arachis hypogaea type 2 DGAT (AhDGAT2) genes were cloned from the peanut cultivar ‘Luhua 14.’ Sequence analysis of 11 different peanut cultivars revealed a gene family of 8 peanut DGAT2 genes (designated AhDGAT2a-h). Sequence alignments revealed 21 nucleotide differences between the eight ORFs, but only six differences result in changes to the predicted amino acid (AA) sequences. A representative full-length cDNA clone (AhDGAT2a) was characterized in detail. The biochemical effects of altering the AhDGAT2a sequence to include single variable AA residues were tested by mutagenesis and functional complementation assays in transgenic yeast systems. All six mutant variants retained enzyme activity and produced lipid droplets in vivo. The N6D and A26P mutants also displayed increased enzyme activity and/or total cellular fatty acid (FA) content. N6D mutant mainly increased the content of palmitoleic acid, and A26P mutant mainly increased the content of palmitic acid. The A26P mutant grew well both in the presence of oleic and C18:2, but the other mutants grew better in the presence of C18:2. AhDGAT2 is expressed in all peanut organs analyzed, with high transcript levels in leaves and flowers. These levels are comparable to that found in immature seeds, where DGAT2 expression is most abundant in other plants. Over-expression of AhDGAT2a in tobacco substantially increased the FA content of transformed tobacco seeds. Expression of AhDGAT2a also altered transcription levels of endogenous tobacco lipid metabolic genes in transgenic tobacco, apparently creating a larger carbon ‘sink’ that supports increased FA levels. PMID:29085382
Rajendran, Senthilnathan; Jothi, Arunachalam
2018-05-16
The three-dimensional structure of a protein depends on the interactions between its amino acid residues. These interactions are in turn influenced by various biophysical properties of the amino acids. There are several examples of proteins that share the same fold but are very dissimilar at the sequence level. For proteins to share a common fold, some crucial interactions should be maintained despite insignificant sequence similarity. Since the interactions arise from the biophysical properties of the amino acids, we should be able to detect descriptive patterns for folds at such a property level. In this line, the main focus of our research is to analyze such proteins and to characterize them in terms of their biophysical properties. Protein structures with sequence similarity less than 40% were selected for ten different subfolds from three different mainfolds (according to CATH classification) and were used for this analysis. We used the normalized values of the 49 physicochemical, energetic and conformational properties of amino acids. We characterize the folds based on the average biophysical property values. We also observed a fold-specific correlational behavior of biophysical properties despite a very low sequence similarity in our data. We further trained three different binary classification models (Naive Bayes-NB, Support Vector Machines-SVM and Bayesian Generalized Linear Model-BGLM) which could discriminate mainfold based on the biophysical properties. We also show that among the three generated models, the BGLM classifier model was able to discriminate protein sequences coming under all beta category with 81.43% accuracy and all alpha, alpha-beta proteins with 83.37% accuracy. Copyright © 2018 Elsevier Ltd. All rights reserved.
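The idea of characterizing a sequence by the average of a normalized amino acid property can be sketched in a few lines of Python. The abstract does not list its 49 properties, so this sketch substitutes a single well-known scale, the Kyte-Doolittle hydropathy index, purely as a stand-in; the min-max normalization and the function names are likewise assumptions, not the authors' procedure.

```python
# Kyte-Doolittle hydropathy index, used here only as an illustrative
# stand-in for one of the (unspecified) 49 properties in the abstract.
KYTE_DOOLITTLE = {
    "I": 4.5, "V": 4.2, "L": 3.8, "F": 2.8, "C": 2.5, "M": 1.9, "A": 1.8,
    "G": -0.4, "T": -0.7, "S": -0.8, "W": -0.9, "Y": -1.3, "P": -1.6,
    "H": -3.2, "E": -3.5, "Q": -3.5, "D": -3.5, "N": -3.5, "K": -3.9, "R": -4.5,
}


def normalize(scale: dict) -> dict:
    """Min-max normalize a property scale to the range [0, 1] (an assumed scheme)."""
    lo, hi = min(scale.values()), max(scale.values())
    return {aa: (value - lo) / (hi - lo) for aa, value in scale.items()}


def mean_property(sequence: str, scale: dict) -> float:
    """Average property value over the residues of a one-letter protein sequence."""
    values = [scale[aa] for aa in sequence.upper() if aa in scale]
    if not values:
        raise ValueError("sequence contains no residues covered by the scale")
    return sum(values) / len(values)


norm_kd = normalize(KYTE_DOOLITTLE)
print(mean_property("MKTAYIAKQR", norm_kd))  # one feature value for one sequence
```

Repeating `mean_property` over many property scales would give one feature vector per protein, which is the kind of input a binary classifier such as those named in the abstract could be trained on.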
Clark, A M; Jacobsen, K R; Bostwick, D E; Dannenhoffer, J M; Skaggs, M I; Thompson, G A
1997-07-01
Sieve elements in the phloem of most angiosperms contain proteinaceous filaments and aggregates called P-protein. In the genus Cucurbita, these filaments are composed of two major proteins: PP1, the phloem filament protein, and PP2, the phloem lectin. The gene encoding the phloem filament protein in pumpkin (Cucurbita maxima Duch.) has been isolated and characterized. Nucleotide sequence analysis of the reconstructed gene gPP1 revealed a continuous 2430 bp protein coding sequence, with no introns, encoding an 809 amino acid polypeptide. The deduced polypeptide had characteristics of PP1 and contained a 15 amino acid sequence determined by N-terminal peptide sequence analysis of PP1. The sequence of PP1 was highly repetitive with four 200 amino acid sequence domains containing structural motifs in common with cysteine proteinase inhibitors. Expression of the PP1 gene was detected in roots, hypocotyls, cotyledons, stems, and leaves of pumpkin plants. PP1 and its mRNA accumulated in pumpkin hypocotyls during the period of rapid hypocotyl elongation after which mRNA levels declined, while protein levels remained elevated. PP1 was immunolocalized in slime plugs and P-protein bodies in sieve elements of the phloem. Occasionally, PP1 was detected in companion cells. PP1 mRNA was localized by in situ hybridization in companion cells at early stages of vascular differentiation. The developmental accumulation and localization of PP1 and its mRNA paralleled the phloem lectin, further suggesting an interaction between these phloem-specific proteins.
Nucleic acid arrays and methods of synthesis
Sabanayagam, Chandran R.; Sano, Takeshi; Misasi, John; Hatch, Anson; Cantor, Charles
2001-01-01
The present invention generally relates to high density nucleic acid arrays and methods of synthesizing nucleic acid sequences on a solid surface. Specifically, the present invention contemplates the use of stabilized nucleic acid primer sequences immobilized on solid surfaces, and circular nucleic acid sequence templates combined with the use of isothermal rolling circle amplification to thereby increase nucleic acid sequence concentrations in a sample or on an array of nucleic acid sequences.
Recoding method that removes inhibitory sequences and improves HIV gene expression
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rabadan, Raul; Krasnitz, Michael; Robins, Harlan
The invention relates to inhibitory nucleotide signal sequences or "INS" sequences in the genomes of lentiviruses. In particular the invention relates to the AGG motif present in all viral genomes. The AGG motif may have an inhibitory effect on a virus, for example by reducing the levels of, or maintaining low steady-state levels of, viral RNAs in host cells, and inducing and/or maintaining viral latency. In one aspect, the invention provides vaccines that contain, or are produced from, viral nucleic acids in which the AGG sequences have been mutated. In another aspect, the invention provides methods and compositions for affecting the function of the AGG motif, and methods for identifying other INS sequences in viral genomes.
Yu, Fulai; Zhang, Guoqing; Zhong, Xiangfu; Han, Na; Song, Yunfeng; Zhao, Ling; Cui, Min; Rayner, Simon; Fu, Zhen F
2014-07-01
Rabies is a global problem, but its impact and prevalence vary across different regions. In some areas, such as parts of Africa and Asia, the virus is prevalent in the domestic dog population, leading to epidemic waves and large numbers of human fatalities. In other regions, such as the Americas, the virus predominates in wildlife and bat populations, with sporadic spillover into domestic animals. In this work, we attempted to investigate whether these distinct environments led to selective pressures that result in measurable changes within the genome at the amino acid level. To this end, we collected and sequenced the full genome of two isolates from divergent environments. The first isolate (DRV-AH08) was from China, where the virus is present in the dog population and the country is experiencing a serious epidemic. The second isolate (DRV-Mexico) was taken from Mexico, where the virus is present in both wildlife and domestic dog populations, but at low levels as a consequence of an effective vaccination program. We then combined and compared these with other full genome sequences to identify distinct amino acid changes that might be associated with environment. Phylogenetic analysis identified strain DRV-AH08 as belonging to the China-I lineage, which has emerged to become the dominant lineage in the current epidemic. The Mexico strain was placed in the D11 Mexico lineage, associated with the West USA-Mexico border clade. Amino acid sequence analysis identified only 17 amino acid differences in the N, G and L proteins. These differences may be associated with virus replication and virulence-for example, the short incubation period observed in the current epidemic in China.
Comino, Cinzia; Lanteri, Sergio; Portis, Ezio; Acquadro, Alberto; Romani, Annalisa; Hehn, Alain; Larbat, Romain; Bourgaud, Frédéric
2007-01-01
Background Cynara cardunculus L. is an edible plant of pharmaceutical interest, in particular with respect to the polyphenolic content of its leaves. It includes three taxa: globe artichoke, cultivated cardoon, and wild cardoon. The dominating phenolics are the di-caffeoylquinic acids (such as cynarin), which are largely restricted to Cynara species, along with their precursor, chlorogenic acid (CGA). The scope of this study is to better understand CGA synthesis in this plant. Results A gene sequence encoding a hydroxycinnamoyltransferase (HCT) involved in the synthesis of CGA was identified. Isolation of the gene sequence was achieved by using a PCR strategy with degenerate primers targeted to conserved regions of available orthologous HCT sequences. We have isolated a 717 bp cDNA which shares 84% amino acid identity and 92% similarity with a tobacco gene responsible for the biosynthesis of CGA from p-coumaroyl-CoA and quinic acid. In silico studies revealed the globe artichoke HCT sequence clustering with one of the main acyltransferase groups (i.e. anthranilate N-hydroxycinnamoyl/benzoyltransferase). Heterologous expression of the full length HCT (GenBank accession DQ104740) cDNA in E. coli demonstrated that the recombinant enzyme efficiently synthesizes both chlorogenic acid and p-coumaroyl quinate from quinic acid and caffeoyl-CoA or p-coumaroyl-CoA, respectively, confirming its identity as a hydroxycinnamoyl-CoA: quinate HCT. Variable levels of HCT expression were shown among wild and cultivated forms of C. cardunculus subspecies. The level of expression was correlated with CGA content. Conclusion The data support the predicted involvement of the Cynara cardunculus HCT in the biosynthesis of CGA before and/or after the hydroxylation step of hydroxycinnamoyl esters. PMID:17374149
Mosaic protein and nucleic acid vaccines against hepatitis C virus
Yusim, Karina; Korber, Bette T. M.; Kuiken, Carla L.; Fischer, William M.
2013-06-11
The invention relates to immunogenic compositions useful as HCV vaccines. Provided are HCV mosaic polypeptide and nucleic acid compositions which provide higher levels of T-cell epitope coverage while minimizing the occurrence of unnatural and rare epitopes compared to natural HCV polypeptides and consensus HCV sequences.
Jung, Woongsic; Kim, Eun Jae; Han, Se Jong; Choi, Han-Gu; Kim, Sanghee
2016-10-01
Stearoyl-CoA desaturase is a key regulator in fatty acid metabolism that catalyzes the desaturation of stearic acid to oleic acid and controls the intracellular levels of monounsaturated fatty acids (MUFAs). Two stearoyl-CoA desaturases (SCD, Δ9 desaturases) genes were identified in an Antarctic copepod, Tigriopus kingsejongensis, that was collected in a tidal pool near the King Sejong Station, King George Island, Antarctica. Full-length complementary DNA (cDNA) sequences of two T. kingsejongensis SCDs (TkSCDs) were obtained from next-generation sequencing and isolated by reverse transcription PCR. DNA sequence lengths of the open reading frames of TkSCD-1 and TkSCD-2 were determined to be 1110 and 681 bp, respectively. The molecular weights deduced from the corresponding genes were estimated to be 43.1 kDa (TkSCD-1) and 26.1 kDa (TkSCD-2). The amino acid sequences were compared with those of fatty acid desaturases and sterol desaturases from various organisms and used to analyze the relationships among TkSCDs. As assessed by heterologous expression of recombinant proteins in Escherichia coli, the enzymatic functions of both stearoyl-CoA desaturases revealed that the amount of C16:1 and C18:1 fatty acids increased by greater than 3-fold after induction with isopropyl β-D-thiogalactopyranoside. In particular, C18:1 fatty acid production increased greater than 10-fold in E. coli expressing TkSCD-1 and TkSCD-2. The results of this study suggest that both SCD genes from an Antarctic marine copepod encode a functional desaturase that is capable of increasing the amounts of palmitoleic acid and oleic acid in a prokaryotic expression system.
Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N
2014-03-01
DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea.
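The "guanine and cytosine load in the third codon position" mentioned above is commonly summarized as a GC3 fraction. The following is a minimal sketch of that calculation, not the authors' pipeline: the function name gc3_fraction and the example sequence are made up for illustration, and the input is assumed to be an in-frame coding sequence.

```python
def gc3_fraction(cds: str) -> float:
    """Fraction of G or C bases at third codon positions.

    Assumes `cds` is an in-frame coding sequence; any trailing
    bases that do not complete a codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3   # drop an incomplete final codon
    third = cds[2:usable:3]            # bases at positions 3, 6, 9, ...
    if not third:
        raise ValueError("sequence is shorter than one codon")
    return sum(base in "GC" for base in third) / len(third)


# Hypothetical five-codon sequence: ATG GCC AAG CGT TAA
example_cds = "ATGGCCAAGCGTTAA"
print(gc3_fraction(example_cds))
```

Comparing this fraction across genomes is one simple way to look at the third-position compositional bias the abstract describes.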
Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N.
2014-01-01
DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea. PMID:24371267
Vakili Azghandi, Masoume; Nasiri, Mohammadreza; Shamsa, Ali; Jalali, Mohsen; Shariati, Mohammad Mahdi
2016-04-01
The SRY gene (SRY) provides instructions for making a transcription factor called the sex-determining region Y protein. The sex-determining region Y protein causes a fetus to develop as a male. In this study, SRY sequences from 15 species (human, chimpanzee, dog, pig, rat, cattle, buffalo, goat, sheep, horse, zebra, frog, urial, dolphin and killer whale) were used to determine bioinformatic differences. Nucleotide sequences of SRY were retrieved from the NCBI databank. Bioinformatic analysis of SRY was done with CLC Main Workbench version 5.5, ClustalW (http://www.ebi.ac.uk/clustalw/) and MEGA6 software. The multiple sequence alignment results indicated that SRY protein sequences from Orcinus orca (killer whale) and Tursiops aduncus (dolphin) have the least genetic distance, 0.33, among these 15 species and are 99.67% identical at the amino acid level. Homo sapiens and Pan troglodytes (chimpanzee) have the next lowest genetic distance, 1.35, and are 98.65% identical at the amino acid level. These findings indicate that the SRY proteins are conserved in the 15 species, and their evolutionary relationships are similar.
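In the figures above, each reported distance and identity sum to 100 (0.33 with 99.67%, 1.35 with 98.65%), so the distance appears to be a simple percent difference between aligned sequences. The sketch below shows that arithmetic under that assumption. It is not the CLC, ClustalW or MEGA6 implementation; the function name percent_identity and the toy aligned sequences are invented for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over columns where neither sequence has a gap.

    Both inputs are assumed to be rows of the same alignment,
    with '-' marking gaps.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == "-" or y == "-":
            continue                  # skip gapped columns
        compared += 1
        matches += x == y
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy alignment (not real SRY data): 4 matches over 5 ungapped columns.
identity = percent_identity("MKT-AL", "MKTQAV")
distance = 100.0 - identity           # percent difference
print(identity, distance)
```

Other distance measures (for example, model-corrected distances) would not generally sum to 100 with identity, which is why the percent-difference reading is only an assumption here.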
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kita, Keiko; Fukura, Takanobu; Nakase, Kohichi
The authors cloned and sequenced the gene encoding an NADPH-dependent aldehyde reductase (ARII) in Sporobolomyces salmonicolor AKU4429, which reduces ethyl 4-chloro-3-oxobutanoate (4-COBE) to ethyl (S)-4-chloro-3-hydroxybutanoate. The ARII gene is 1,032 bp long, is interrupted by four introns, and encodes a 37,315-Da polypeptide. The deduced amino acid sequence exhibited significant levels of similarity to the amino acid sequences of members of the mammalian 3β-hydroxysteroid dehydrogenase-plant dihydroflavonol 4-reductase superfamily but not to the amino acid sequences of members of the aldo-keto reductase superfamily or to the amino acid sequence of an aldehyde reductase previously isolated from the same organism. The ARII protein was overproduced in Escherichia coli about 2,000-fold compared to the production in the original yeast cells. The enzyme expressed in E. coli was purified to homogeneity and had the same catalytic properties as ARII purified from S. salmonicolor. To examine the contribution of the dinucleotide-binding motif G19-X-X-G22-X-X-A25, which is located in the N-terminal region, during ARII catalysis, they replaced three amino acid residues in the motif and purified the resulting mutant enzymes. Substrate inhibition of the G19→A and G22→A mutant enzymes by 4-COBE did not occur. The A25→G mutant enzyme could reduce 4-COBE when NADPH was replaced by an equimolar concentration of NADH.
NASA Astrophysics Data System (ADS)
Wang, Bin; Shao, Yanchun; Chen, Tao; Chen, Wanping; Chen, Fusheng
2015-12-01
Acetobacter pasteurianus (Ap) CICC 20001 and CGMCC 1.41 are two acetic acid bacteria strains that, because of their strong abilities to produce and tolerate high concentrations of acetic acid, have been widely used to brew vinegar in China. To globally understand the fermentation characteristics, acid-tolerant mechanisms and genetic stabilities, their genomes were sequenced. Genomic comparisons with 9 other sequenced Ap strains revealed that their chromosomes were evolutionarily conserved, whereas the plasmids were unique compared with other Ap strains. Analysis of the acid-tolerant metabolic pathway at the genomic level indicated that the metabolism of some amino acids and the known mechanisms of acetic acid tolerance, might collaboratively contribute to acetic acid resistance in Ap strains. The balance of instability factors and stability factors in the genomes of Ap CICC 20001 and CGMCC 1.41 strains might be the basis for their genetic stability, consistent with their stable industrial performances. These observations provide important insights into the acid resistance mechanism and the genetic stability of Ap strains and lay a foundation for future genetic manipulation and engineering of these two strains.
Wang, Bin; Shao, Yanchun; Chen, Tao; Chen, Wanping; Chen, Fusheng
2015-12-22
Acetobacter pasteurianus (Ap) CICC 20001 and CGMCC 1.41 are two acetic acid bacteria strains that, because of their strong abilities to produce and tolerate high concentrations of acetic acid, have been widely used to brew vinegar in China. To globally understand the fermentation characteristics, acid-tolerant mechanisms and genetic stabilities, their genomes were sequenced. Genomic comparisons with 9 other sequenced Ap strains revealed that their chromosomes were evolutionarily conserved, whereas the plasmids were unique compared with other Ap strains. Analysis of the acid-tolerant metabolic pathway at the genomic level indicated that the metabolism of some amino acids and the known mechanisms of acetic acid tolerance, might collaboratively contribute to acetic acid resistance in Ap strains. The balance of instability factors and stability factors in the genomes of Ap CICC 20001 and CGMCC 1.41 strains might be the basis for their genetic stability, consistent with their stable industrial performances. These observations provide important insights into the acid resistance mechanism and the genetic stability of Ap strains and lay a foundation for future genetic manipulation and engineering of these two strains.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Andersen, Mikael R.; Salazar, Margarita; Schaap, Peter
2011-06-01
The filamentous fungus Aspergillus niger exhibits great diversity in its phenotype. It is found globally, both as marine and terrestrial strains, produces both organic acids and hydrolytic enzymes in high amounts, and some isolates exhibit pathogenicity. Although the genome of an industrial enzyme-producing A. niger strain (CBS 513.88) has already been sequenced, the versatility and diversity of this species compel additional exploration. We therefore undertook whole genome sequencing of the acidogenic A. niger wild type strain (ATCC 1015), and produced a genome sequence of very high quality. Only 15 gaps are present in the sequence and half the telomeric regions have been elucidated. Moreover, sequence information from ATCC 1015 was utilized to improve the genome sequence of CBS 513.88. Chromosome-level comparisons uncovered several genome rearrangements, deletions, a clear case of strain-specific horizontal gene transfer, and identification of 0.8 megabase of novel sequence. Single nucleotide polymorphisms per kilobase (SNPs/kb) between the two strains were found to be exceptionally high (average: 7.8, maximum: 160 SNPs/kb). High variation within the species was confirmed with exo-metabolite profiling and phylogenetics. Detailed lists of alleles were generated, and genotypic differences were observed to accumulate in metabolic pathways essential to acid production and protein synthesis. A transcriptome analysis revealed up-regulation of the electron transport chain, specifically the alternative oxidative pathway in ATCC 1015, while CBS 513.88 showed significant up-regulation of genes associated with biosynthesis of amino acids that are abundant in glucoamylase A, tRNA-synthases and protein transporters.
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.
Code of Federal Regulations, 2011 CFR
2011-07-01
... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...
Saravanan, Konda Mani; Dunker, A Keith; Krishnaswamy, Sankaran
2017-12-27
More than 60 prediction methods for intrinsically disordered proteins (IDPs) have been developed over the years, many of which are accessible on the World Wide Web. Nearly all of these predictors give balanced accuracies in the ~65%-~80% range. Since predictors are not perfect, further studies are required to uncover the role of amino acid residues in native IDP as compared to predicted IDP regions. In the present work, we make use of sequences of 100% predicted IDP regions, false positive disorder predictions, and experimentally determined IDP regions to distinguish the characteristics of native versus predicted IDP regions. A higher occurrence of asparagine is observed in sequences of native IDP regions but not in sequences of false positive predictions of IDP regions. The occurrences of certain combinations of amino acids at the pentapeptide level provide a distinguishing feature in the IDPs with respect to globular proteins. The distinguishing features presented in this paper provide insights into the sequence fingerprints of amino acid residues in experimentally determined as compared to predicted IDP regions. These observations and additional work along these lines should enable the development of improvements in the accuracy of disorder prediction algorithms.
USDA-ARS's Scientific Manuscript database
Sesame germplasm harbors genetic diversity which can be useful for sesame improvement in breeding programs. Seven accessions with different levels of oleic acid were selected from the entire USDA sesame germplasm collection (1232 accessions) and planted for morphological observation and re-examinati...
A dehydrin cognate protein from pea (Pisum sativum L.) with an atypical pattern of expression.
Robertson, M; Chandler, P M
1994-11-01
Dehydrins are a family of proteins characterised by conserved amino acid motifs, and induced in plants by dehydration or treatment with ABA. An antiserum was raised against a synthetic oligopeptide based on the most highly conserved dehydrin amino acid motif, the lysine-rich block (core sequence KIKEK-LPG). This antiserum detected a novel M(r) 40,000 polypeptide and enabled isolation of a corresponding cDNA clone, pPsB61 (B61). The deduced amino acid sequence contained two lysine-rich blocks; however, the remainder of the sequence differed markedly from other pea dehydrins. Surprisingly, the sequence contained a stretch of serine residues, a characteristic common to dehydrins from many plant species but which is missing in pea dehydrin. The expression patterns of B61 mRNA and polypeptide were distinctively different from those of the pea dehydrins during seed development, germination and in young seedlings exposed to dehydration stress or treated with ABA. In particular, dehydration stress led to slightly reduced levels of B61 RNA, and ABA application to young seedlings had no marked effect on its abundance. The M(r) 40,000 polypeptide is thus related to pea dehydrin by the presence of the most highly conserved amino acid sequence motifs, but lacks the characteristic expression pattern of dehydrin. By analogy with heat shock cognate proteins we refer to this protein as a dehydrin cognate.
Cho, Young Sun; Choi, Buyl Nim; Ha, En-Mi; Kim, Ki Hong; Kim, Sung Koo; Kim, Dong Soo; Nam, Yoon Kwon
2005-01-01
Novel metallothionein (MT) complementary DNA and genomic sequences were isolated from a cartilaginous shark species, Scyliorhinus torazame. The full-length open reading frame (ORF) of shark MT cDNA encoded 68 amino acids with a high cysteine content (29%). The genomic ORF sequence (932 bp) of shark MT isolated by polymerase chain reaction (PCR) comprised 3 exons with 2 intervening introns. Shark MT sequence shared many conserved features with other vertebrate MTs: overall amino acid identities of shark MT ranged from 47% to 57% with fish MTs, and 41% to 62% with mammalian MTs. However, in addition to these conserved characteristics, shark MT sequence exhibited some unique characteristics. It contained 4 extra amino acids (Lys-Ala-Gly-Arg) at the end of the beta-domain, which have not been reported in any other vertebrate MTs. The last amino acid residue at the C-terminus was Ser, which also has not been reported in fish and mammalian MTs. The MT messenger RNA levels in shark liver and kidney, assessed by semiquantitative reverse transcriptase PCR and RNA blot hybridization, were significantly affected by experimental exposures to heavy metals (cadmium, copper, and zinc). Generally, the transcriptional activation of shark MT gene was dependent on the dose (0-10 mg/kg body weight for injection and 0-20 microM for immersion) and duration (1-10 days); zinc was a more potent inducer than copper and cadmium.
Zhang, L J; Dong, W X; Guo, S M; Wang, Y X; Wang, A D; Lu, X J
2015-11-19
This study aims to explore the roles of somatic embryogenesis receptor-like kinase (SERK) in Malus hupehensis (Pingyi Tiancha). The full-length sequences of SERK1 in triploid Pingyi Tiancha (3n) and a tetraploid hybrid strain 33# (4n) were cloned, sequenced, and designated as MhSERK1 and MhdSERK1, respectively. Multiple alignments of amino acid sequences were conducted to identify similarity between MhSERK1 and MhdSERK1 and SERK sequences in other species, and a neighbor-joining phylogenetic tree was constructed to elucidate their phylogenetic relations. Expression levels of MhSERK1 and MhdSERK1 in different tissues and developmental stages were investigated using quantitative real-time PCR. The coding sequence lengths of MhSERK1 and MhdSERK1 were 1899 bp (encoding 632 amino acids) and 1881 bp (encoding 626 amino acids), respectively. Sequence analysis demonstrated that MhSERK1 and MhdSERK1 display high similarity to SERKs in other species, with a conserved intron/exon structure that is unique to members of the SERK family. Additionally, the phylogenetic tree showed that MhSERK1 and MhdSERK1 clustered with orange CitSERK (93%). Furthermore, MhSERK1 and MhdSERK1 were mainly expressed in the reproductive organs, in particular the ovary. Their expression levels were highest in young flowers and they differed among different tissues and organs. Our results suggest that MhSERK1 and MhdSERK1 are related to plant reproduction, and that MhSERK1 is related to apomixis in triploid Pingyi Tiancha.
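The coding-sequence lengths and residue counts quoted above are consistent with the usual convention that the final codon of a coding sequence is a stop codon and is not translated (1899 / 3 = 633 codons, minus one stop, gives 632 residues). A minimal sketch of that check follows; the function name residues_from_cds and the includes_stop flag are illustrative assumptions, not anything taken from the study.

```python
def residues_from_cds(cds_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of `cds_bp` bases.

    Assumes the length is a whole number of codons; when `includes_stop`
    is True, the final (stop) codon is not counted as a residue.
    """
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("coding-sequence length must be a positive multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if includes_stop else codons


# Lengths reported for MhSERK1 and MhdSERK1
print(residues_from_cds(1899))  # 632
print(residues_from_cds(1881))  # 626
```

A mismatch between a reported length and residue count under this convention usually means the length was quoted without the stop codon, which the includes_stop flag is meant to cover.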
A retrotransposable element from the mosquito Anopheles gambiae .
Besansky, N J
1990-01-01
A family of middle repetitive elements from the African malaria vector Anopheles gambiae is described. Approximately 100 copies of the element, designated T1Ag, are dispersed in the genome. Full-length elements are 4.6 kilobase pairs in length, but truncation of the 5' end is common. Nucleotide sequences of one full-length, two 5'-truncated, and two 5' ends of T1Ag elements were determined and aligned to define a consensus sequence. Sequence analysis revealed two long, overlapping open reading frames followed by a polyadenylation signal, AATAAA, and a tail consisting of tandem repetitions of the motif TGAAA. No direct or inverted long terminal repeats (LTRs) were detected. The first open reading frame, 442 amino acids in length, includes a domain resembling that of nucleic acid-binding proteins. The second open reading frame, 975 amino acids long, resembles the reverse transcriptases of a category of retrotransposable elements without LTRs, variously termed class II retrotransposons, class III elements or non-LTR retrotransposons. Similarity at the sequence and structural levels places T1Ag in this category. PMID:1689457
Solid phase sequencing of double-stranded nucleic acids
Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.
2002-01-01
This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.
Genome sequences of five Lactobacillus sp. isolates from traditional Turkish sourdough
USDA-ARS's Scientific Manuscript database
A high level of variation in microflora can be observed in lactic acid bacteria (LAB) profiles of sourdoughs. Here, we present draft genome sequences of Lactobacillus reuteri E81, L. reuteri LR5A, L. rhamnosus LR2, L. plantarum PFC-311 and a novel Lactobacillus sp. PFC-70 isolated from traditional T...
Song, B; Hou, Y L; Ding, X; Wang, T; Wang, F; Zhong, J C; Xu, T; Zhong, J; Hou, W R; Shuai, S R
2014-02-20
Fatty acid binding proteins (FABPs) are a family of small, highly conserved cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. In this study, cDNA and genomic sequences of FABP4 and FABP5 were cloned successfully from the giant panda (Ailuropoda melanoleuca) using reverse transcription polymerase chain reaction (RT-PCR) technology and touchdown-PCR. The cDNAs of FABP4 and FABP5 cloned from the giant panda were 400 and 413 bp in length, containing an open reading frame of 399 and 408 bp, encoding 132 and 135 amino acids, respectively. The genomic sequences of FABP4 and FABP5 were 3976 and 3962 bp, respectively, which each contained four exons and three introns. Sequence alignment indicated a high degree of homology with reported FABP sequences of other mammals at both the amino acid and DNA levels. Topology prediction revealed seven protein kinase C phosphorylation sites, two casein kinase II phosphorylation sites, two N-myristoylation sites, and one cytosolic fatty acid-binding protein signature in the FABP4 protein, and three N-glycosylation sites, three protein kinase C phosphorylation sites, one casein kinase II phosphorylation site, one N-myristoylation site, one amidation site, and one cytosolic fatty acid-binding protein signature in the FABP5 protein. The FABP4 and FABP5 genes were overexpressed in Escherichia coli BL21 and they produced the expected 16.8- and 17.0-kDa polypeptides. The results obtained in this study provide information for further in-depth research of this system, which is of both theoretical and practical significance.
Satoh, Dan; Hiraoka, Yasutaka; Colman, Brian; Matsuda, Yusuke
2001-01-01
A single intracellular carbonic anhydrase (CA) was detected in air-grown and, at reduced levels, in high CO2-grown cells of the marine diatom Phaeodactylum tricornutum (UTEX 642). No external CA activity was detected irrespective of growth CO2 conditions. Ethoxyzolamide (0.4 mM), a CA-specific inhibitor, severely inhibited high-affinity photosynthesis at low concentrations of dissolved inorganic carbon, whereas 2 mM acetazolamide had little effect on the affinity for dissolved inorganic carbon, suggesting that internal CA is crucial for the operation of a carbon concentrating mechanism in P. tricornutum. Internal CA was purified 36.7-fold relative to cell homogenates by ammonium sulfate precipitation, and two-step column chromatography on diethylaminoethyl-sephacel and p-aminomethylbenzene sulfone amide agarose. The purified CA was shown, by SDS-PAGE, to comprise an electrophoretically single polypeptide of 28 kD under both reduced and nonreduced conditions. The entire sequence of the cDNA of this CA was obtained by the rapid amplification of cDNA ends method and indicated that the cDNA encodes 282 amino acids. Comparison of this putative precursor sequence with the N-terminal amino acid sequence of the purified CA indicated that it included a possible signal sequence of up to 46 amino acids at the N terminus. The mature CA was found to consist of 236 amino acids and the sequence was homologous to β-type CAs. Even though the zinc-ligand amino acid residues were shown to be completely conserved, the amino acid residues that may constitute a CO2-binding site appeared to be unique among the β-CAs so far reported. PMID:11500545
Benne, R; De Vries, B F; Van den Burg, J; Klaver, B
1983-01-01
The nucleotide sequence of a 2.5-kb segment of the maxi-circle of Trypanosoma brucei mtDNA has been determined. The segment contains the gene for apocytochrome b, which displays about 25% homology at the amino acid level to the apocytochrome b gene from fungal and mammalian mtDNAs. Northern blot and S1 nuclease analyses have yielded accurate map positions of an RNA species in an area that coincides with the reading frame. The segment also contains two pairs of overlapping unassigned reading frames, which lack homology with any known mitochondrial gene or URF. The DNA sequence in these areas is AG-rich (70%), resulting in URFs with an unusually high level of glycine and charged amino acids (60%). They may not encode proteins, in spite of their size and the fact that abundant transcripts are mapped in these areas. PMID:6314266
Brendolise, Cyril; Yauk, Yar-Khing; Eberhard, Ellen D; Wang, Mindy; Chagne, David; Andre, Christelle; Greenwood, David R; Beuning, Lesley L
2011-07-01
The pentacyclic triterpenes, in particular ursolic acid and oleanolic acid and their derivatives, exist abundantly in the plant kingdom, where they are well known for their anti-inflammatory, antitumour and antimicrobial properties. α-Amyrin and β-amyrin are the precursors of ursolic and oleanolic acids, respectively, formed by concerted cyclization of squalene epoxide by a complex synthase reaction. We identified three full-length expressed sequence tag sequences in cDNA libraries constructed from apple (Malus × domestica 'Royal Gala') that were likely to encode triterpene synthases. Two of these expressed sequence tag sequences were essentially identical (> 99% amino acid similarity; MdOSC1 and MdOSC3). MdOSC1 and MdOSC2 were expressed by transient expression in Nicotiana benthamiana leaves and by expression in the yeast Pichia methanolica. The resulting products were analysed by GC and GC-MS. MdOSC1 was shown to be a mixed amyrin synthase (a 5 : 1 ratio of α-amyrin to β-amyrin). MdOSC1 is the only triterpene synthase so far identified in which the level of α-amyrin produced is > 80% of the total product and is, therefore, primarily an α-amyrin synthase. No product was evident for MdOSC2 when expressed either transiently or in yeast, suggesting that this putative triterpene synthase is either encoded by a pseudogene or does not express well in these systems. Transcript expression analysis in Royal Gala indicated that the genes are mostly expressed in apple peel, and that the MdOSC2 expression level was much lower than that of MdOSC1 and MdOSC3 in all the tissues tested. Amyrin content analysis was undertaken by LC-MS, and demonstrated that levels and ratios differ between tissues, but that the true consequence of synthase activity is reflected in the ursolic/oleanolic acid content and in further triterpenoids derived from them. 
Phylogenetic analysis placed the three triterpene synthase sequences with other triterpene synthases that encoded either α-amyrin and/or β-amyrin synthase. MdOSC1 and MdOSC3 clustered with the multifunctional triterpene synthases, whereas MdOSC2 was most similar to the β-amyrin synthases. © 2011 The New Zealand Institute for Plant and Food Research Limited. Journal compilation © 2011 FEBS.
Heparin Characterization: Challenges and Solutions
NASA Astrophysics Data System (ADS)
Jones, Christopher J.; Beni, Szabolcs; Limtiaco, John F. K.; Langeslay, Derek J.; Larive, Cynthia K.
2011-07-01
Although heparin is an important and widely prescribed pharmaceutical anticoagulant, its high degree of sequence microheterogeneity and size polydispersity make molecular-level characterization challenging. Unlike nucleic acids and proteins that are biosynthesized through template-driven assembly processes, heparin and the related glycosaminoglycan heparan sulfate are actively remodeled during biosynthesis through a series of enzymatic reactions that lead to variable levels of O- and N-sulfonation and uronic acid epimers. As summarized in this review, heparin sequence information is determined through a bottom-up approach that relies on depolymerization reactions, size- and charge-based separations, and sensitive mass spectrometric and nuclear magnetic resonance experiments to determine the structural identity of component oligosaccharides. The structure-elucidation process, along with its challenges and opportunities for future analytical improvements, is reviewed and illustrated for a heparin-derived hexasaccharide.
Sequence repeats and protein structure
NASA Astrophysics Data System (ADS)
Hoang, Trinh X.; Trovato, Antonio; Seno, Flavio; Banavar, Jayanth R.; Maritan, Amos
2012-11-01
Repeats are frequently found in known protein sequences. The level of sequence conservation in tandem repeats correlates with their propensities to be intrinsically disordered. We employ a coarse-grained model of a protein with a two-letter amino acid alphabet, hydrophobic (H) and polar (P), to examine the sequence-structure relationship in the realm of repeated sequences. A fraction of repeated sequences comprises a distinct class of bad folders, whose folding temperatures are much lower than those of random sequences. Imperfection in sequence repetition improves the folding properties of the bad folders while deteriorating those of the good folders. Our results may explain why nature has utilized repeated sequences for their versatility and especially to design functional proteins that are intrinsically unstructured at physiological temperatures.
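The two-letter hydrophobic/polar reduction described above can be illustrated by mapping a sequence onto H and P and then asking whether the result is an exact tandem repeat. This is only a sketch of the alphabet reduction, not the authors' coarse-grained folding model: the choice of which residues count as hydrophobic, and the names HYDROPHOBIC, to_hp and shortest_repeat_unit, are assumptions made here for illustration.

```python
# Assumed hydrophobic set; different HP-model studies draw this line differently.
HYDROPHOBIC = set("AVLIMFWC")


def to_hp(sequence: str) -> str:
    """Reduce a one-letter amino acid sequence to the H/P alphabet."""
    return "".join("H" if res in HYDROPHOBIC else "P" for res in sequence.upper())


def shortest_repeat_unit(hp: str) -> str:
    """Return the shortest unit whose exact tandem repetition gives `hp`.

    If the string is not a perfect repeat, the whole string is returned.
    """
    n = len(hp)
    for k in range(1, n + 1):
        if n % k == 0 and hp[:k] * (n // k) == hp:
            return hp[:k]
    return hp


# Toy peptide (not from the paper): alternating Leu and Lys.
hp = to_hp("LKLKLK")
print(hp, shortest_repeat_unit(hp))
```

An imperfect repeat, the case the abstract says can improve folding of the bad folders, would make shortest_repeat_unit return the full string, so a tolerant (mismatch-allowing) comparison would be needed to detect it.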
Solid phase sequencing of biopolymers
Cantor, Charles; Koster, Hubert
2010-09-28
This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include DNA or RNA in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.
Andersen, Mikael R.; Salazar, Margarita P.; Schaap, Peter J.; van de Vondervoort, Peter J.I.; Culley, David; Thykaer, Jette; Frisvad, Jens C.; Nielsen, Kristian F.; Albang, Richard; Albermann, Kaj; Berka, Randy M.; Braus, Gerhard H.; Braus-Stromeyer, Susanna A.; Corrochano, Luis M.; Dai, Ziyu; van Dijck, Piet W.M.; Hofmann, Gerald; Lasure, Linda L.; Magnuson, Jon K.; Menke, Hildegard; Meijer, Martin; Meijer, Susan L.; Nielsen, Jakob B.; Nielsen, Michael L.; van Ooyen, Albert J.J.; Pel, Herman J.; Poulsen, Lars; Samson, Rob A.; Stam, Hein; Tsang, Adrian; van den Brink, Johannes M.; Atkins, Alex; Aerts, Andrea; Shapiro, Harris; Pangilinan, Jasmyn; Salamov, Asaf; Lou, Yigong; Lindquist, Erika; Lucas, Susan; Grimwood, Jane; Grigoriev, Igor V.; Kubicek, Christian P.; Martinez, Diego; van Peij, Noël N.M.E.; Roubos, Johannes A.; Nielsen, Jens; Baker, Scott E.
2011-01-01
The filamentous fungus Aspergillus niger exhibits great diversity in its phenotype. It is found globally, both as marine and terrestrial strains, produces both organic acids and hydrolytic enzymes in high amounts, and some isolates exhibit pathogenicity. Although the genome of an industrial enzyme-producing A. niger strain (CBS 513.88) has already been sequenced, the versatility and diversity of this species compel additional exploration. We therefore undertook whole-genome sequencing of the acidogenic A. niger wild-type strain (ATCC 1015) and produced a genome sequence of very high quality. Only 15 gaps are present in the sequence, and half the telomeric regions have been elucidated. Moreover, sequence information from ATCC 1015 was used to improve the genome sequence of CBS 513.88. Chromosome-level comparisons uncovered several genome rearrangements, deletions, a clear case of strain-specific horizontal gene transfer, and identification of 0.8 Mb of novel sequence. Single nucleotide polymorphisms per kilobase (SNPs/kb) between the two strains were found to be exceptionally high (average: 7.8, maximum: 160 SNPs/kb). High variation within the species was confirmed with exo-metabolite profiling and phylogenetics. Detailed lists of alleles were generated, and genotypic differences were observed to accumulate in metabolic pathways essential to acid production and protein synthesis. A transcriptome analysis supported up-regulation of genes associated with biosynthesis of amino acids that are abundant in glucoamylase A, tRNA-synthases, and protein transporters in the protein producing CBS 513.88 strain. Our results and data sets from this integrative systems biology analysis resulted in a snapshot of fungal evolution and will support further optimization of cell factories based on filamentous fungi. PMID:21543515
Cell cycle, differentiation and tissue-independent expression of ribosomal protein L37.
Su, S; Bird, R C
1995-09-15
A unique human cDNA (hG1.16) that encodes a mRNA of 450 nucleotides was isolated from a subtractive library derived from HeLa cells. The relative expression level of hG1.16 during different cell-cycle phases was determined by Northern-blot analysis of cells synchronized by double-thymidine block and serum deprivation/refeeding. hG1.16 was constitutively expressed during all phases of the cell cycle, including the quiescent phase when even most constitutively expressed genes experience some suppression of expression. The expression level of hG1.16 did not change during terminal differentiation of myoblasts to myotubes, during which cells become permanently post-mitotic. Examination of other tissues revealed that the relative expression level of hG1.16 was constitutive in all embryonic mouse tissues examined, including brain, eye, heart, kidney, liver, lung and skeletal muscle. This was unusual in that expression was not down-modulated during differentiation and did not vary appreciably between tissue types. Analysis by inter-species Northern-blot analysis revealed that hG1.16 was highly conserved among all vertebrates studied (from fish to humans but not in insects). DNA sequence analysis of hG1.16 revealed a high level of similarity to rat ribosomal protein L37, identifying hG1.16 as a new member of this multigene family. The deduced amino acid sequence of hG1.16 was identical to rat ribosomal protein L37 that contained 97 amino acids, many of which are highly positively charged (15 arginine and 14 lysine residues with a predicted M(r) of 11,065). hG1.16 protein has a single C2-C2 zinc-finger-like motif which is also present in rat ribosomal protein L37. Using primers designed from the sequence of hG1.16, unique bovine and rat cDNAs were also isolated by 5'-rapid-amplification of cDNA ends. 
DNA sequences of bovine and rat G1.16 clones were 92.8% and 92.2% similar to human G1.16, respectively, while the deduced amino acid sequences derived from bovine and rat cDNAs each differed by a single amino acid from the sequence of hG1.16 and the published rat L37 sequence. Southern-blot analysis revealed that hG1.16 exists in multiple copies in human, rat and mouse genomes. These G1.16 clones encode unique human, rat and bovine members of the ribosomal protein L37 gene family, which are constitutively expressed even during transitions from quiescence to active cell proliferation or terminal differentiation, in all tissues and all vertebrates investigated.
Brewer, Michael S; Swafford, Lynn; Spruill, Chad L; Bond, Jason E
2013-01-01
Arthropods are the most diverse group of eukaryotic organisms, but their phylogenetic relationships are poorly understood. Herein, we describe three mitochondrial genomes representing orders of millipedes for which complete genomes had not been characterized. Newly sequenced genomes are combined with existing data to characterize the protein coding regions of myriapods and to attempt to reconstruct the evolutionary relationships within the Myriapoda and Arthropoda. The newly sequenced genomes are similar to previously characterized millipede sequences in terms of synteny and length. Unique translocations occurred within the newly sequenced taxa, including one half of the Appalachioria falcifera genome, which is inverted with respect to other millipede genomes. Across myriapods, amino acid conservation levels are highly dependent on the gene region. Additionally, individual loci varied in the level of amino acid conservation. Overall, most gene regions showed low levels of conservation at many sites. Attempts to reconstruct the evolutionary relationships suffered from questionable relationships and low support values. Analyses of phylogenetic informativeness show the lack of signal deep in the trees (i.e., genes evolve too quickly). As a result, the myriapod tree resembles previously published results but lacks convincing support, and, within the arthropod tree, well established groups were recovered as polyphyletic. The novel genome sequences described herein provide useful genomic information concerning millipede groups that had not been investigated. Taken together with existing sequences, the variety of compositions and evolution of myriapod mitochondrial genomes are shown to be more complex than previously thought. Unfortunately, the use of mitochondrial protein-coding regions in deep arthropod phylogenetics appears problematic, a result consistent with previously published studies. Lack of phylogenetic signal renders the resulting tree topologies as suspect. 
As such, these data are likely inappropriate for investigating such ancient relationships.
Voelker, Toni A.; Staswick, Paul; Chrispeels, Maarten J.
1986-01-01
Phytohemagglutinin (PHA), the seed lectin of the common bean, Phaseolus vulgaris, is encoded by two highly homologous, tandemly linked genes, dlec1 and dlec2, which are coordinately expressed at high levels in developing cotyledons. Their respective transcripts translate into closely related polypeptides, PHA-E and PHA-L, constituents of the tetrameric lectin which accumulates at high levels in developing seeds. In the bean cultivar Pinto UI111, PHA-E is not detectable, and PHA-L accumulates at very reduced levels. To investigate the cause of the Pinto phenotype, we cloned and sequenced the two PHA genes of Pinto, called Pdlec1 and Pdlec2, and determined the abundance of their respective mRNAs in developing cotyledons. Both genes are more than 90% homologous to the normal PHA genes found in other cultivars. Pdlec1 carries a 1-bp frameshift mutation close to the 5' end of its coding sequence. Only very truncated polypeptides could be made from its mRNA. The gene Pdlec2 encodes a polypeptide that resembles PHA-L, and its predicted amino acid sequence agrees with the available Pinto PHA amino acid sequence data. Analysis of the mRNA of developing cotyledons revealed that the Pdlec1 message is reduced 600-fold, and Pdlec2 mRNA is reduced 20-fold with respect to mRNA levels in normal cultivars. A comparison of the sequences which are upstream from the coding sequence shows that Pdlec2 has a 100-bp deletion compared to the other genes (dlec1, dlec2 and Pdlec1). This deletion, which contains a large tandem repeat, may be responsible for the low level of expression of Pdlec2. The very low expression of Pdlec1 is as yet unexplained. PMID:16453730
Astell, C R; Gardiner, E M; Tattersall, P
1986-02-01
The sequence of molecular clones of the genome of MVM(i), a lymphotropic variant of minute virus of mice, was determined and compared with that of MVM(p), the fibrotropic prototype strain. At the nucleotide level there are 163 base changes: 129 transitions and 34 transversions. Most nucleotide changes are silent, with only 27 amino acid changes predicted, of which 22 are conservative. Notable differences between the MVM(i) and MVM(p) genomes which may account for the cell specificities of these viruses occur within the 3' nontranslated regions. The differences discussed include the absence of a 65-base-pair direct repeat in MVM(i), the presence of only two polyadenylation sites in MVM(i) compared with four in MVM(p), and sequences that bear a resemblance to enhancer sequences. Also included in this paper is an important correction to the MVM(p) sequence (C.R. Astell, M. Thomson, M. Merchlinsky, and D. C. Ward, Nucleic Acids Res. 11:999-1018, 1983).
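The transition/transversion split reported above (129 + 34 = 163) follows the standard definition: a transition swaps purine for purine (A/G) or pyrimidine for pyrimidine (C/T), while a transversion swaps between the two classes. A minimal sketch of that classification, assuming two aligned sequences of equal length; the function name is hypothetical and this is not the authors' procedure.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitutions(seq_a: str, seq_b: str) -> tuple[int, int]:
    """Count (transitions, transversions) between two aligned DNA sequences.

    Positions that are identical, or that contain anything other than
    A/C/G/T (gaps, ambiguity codes), are skipped.
    """
    transitions = 0
    transversions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == b or a not in "ACGT" or b not in "ACGT":
            continue
        pair = {a, b}
        if pair <= PURINES or pair <= PYRIMIDINES:
            transitions += 1  # same chemical class
        else:
            transversions += 1  # purine <-> pyrimidine
    return transitions, transversions
```

For example, `classify_substitutions("AGCT", "GACA")` counts A→G and G→A as transitions and T→A as a transversion.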
Gao, Yang; He, Jie; He, Zhuliu; Li, Zhiwei; Zhao, Bo; Mu, Yi; Lee, Jeong-Yeol; Chu, Zhangjie
2017-03-01
A 60-day feeding trial was conducted to determine the effect of dietary fulvic acid supplements on intestinal digestive activity (enzymatic analysis), antioxidant activity, immune enzyme activity and microflora composition of juvenile loach (initial weight of 6.2 ± 0.1 g) reared in experimental aquaria. Five test diets containing 0, 0.5, 1.0, 1.5, and 2% fulvic acid were each randomly assigned to three aquaria. Elevated growth performance including final weight, weight gain (WG), specific growth rate (SGR) and feed conversion ratio (FCR) was observed in loaches that were fed fulvic acid. Maximal weight gain rates and specific growth rates occurred at the 1.5% additive level. The optimal dietary fulvic acid requirement for maximal growth of juvenile loach is 16.4 g per kg of the diet based on the quadratic regression analysis of specific growth rate against dietary fulvic acid levels. Furthermore, intestinal protease activity, antioxidant activity, lysozyme activity (LZM), complement 3 (C3) content, immunoglobulin M (IgM) content, acid phosphatase activity (ACP) and alkaline phosphatase activity (AKP) were significantly elevated with concomitant increasing levels of dietary fulvic acid. Following a deep sequencing analysis, a total of 42,058 valid reads and 609 OTUs (operational taxonomic units) obtained from the control group and the group displaying the optimal growth rate were analyzed. Fulvic acid supplementation resulted in a greater abundance of Firmicutes and Actinobacteria sequences, with a concomitant reduction in the abundance of Proteobacteria. Results indicated that fulvic acid supplementation resulted in a reduction in the relative abundance of Serratia, Acinetobacter, Aeromonas and Edwardsiella, and a relative increase in the abundance of Lactobacillus in the intestine. 
In conclusion, these results suggest that fulvic acid improves growth performance and intestinal health condition of loach, indicating that fulvic acid could be used as an immunoenhancer in loach culture. Copyright © 2017. Published by Elsevier Ltd.
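The optimum quoted in this entry comes from a quadratic regression of specific growth rate against dietary fulvic acid level. The reasoning behind such an estimate can be sketched as follows; the coefficient symbols are assumptions introduced here, since the abstract does not report the fitted values.

```latex
% Fitted model (x = dietary fulvic acid level, \beta_2 < 0 for a maximum)
\mathrm{SGR}(x) = \beta_0 + \beta_1 x + \beta_2 x^2

% Setting the derivative to zero gives the estimated optimum
\frac{d\,\mathrm{SGR}}{dx} = \beta_1 + 2\beta_2 x = 0
\quad\Longrightarrow\quad
x^{*} = -\frac{\beta_1}{2\beta_2}
```

The reported optimum of 16.4 g per kg corresponds to 1.64% of the diet, which lies between the 1.5% and 2% test levels.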
Shimajiri, Yasuka; Oonishi, Takayuki; Ozaki, Kae; Kainou, Kumiko; Akama, Kazuhito
2013-06-01
Gamma-aminobutyric acid (GABA) is a non-protein amino acid commonly present in all organisms. Because cellular levels of GABA in plants are mainly regulated by synthesis (glutamate decarboxylase, GAD) and catabolism (GABA-transaminase, GABA-T), we attempted seed-specific manipulation of the GABA shunt to achieve stable GABA accumulation in rice. A truncated GAD2 sequence, one of five GAD genes, controlled by the glutelin (GluB-1) or rice embryo globulin promoters (REG) and GABA-T-based trigger sequences in RNA interference (RNAi) cassettes controlled by one of these promoters as well, was introduced into rice (cv. Koshihikari) to establish stable transgenic lines under herbicide selection using pyriminobac. T₁ and T₂ generations of rice lines displayed high GABA concentrations (2-100 mg/100 g grain). In analyses of two selected lines from the T₃ generation, there was a strong correlation between GABA level and the expression of truncated GAD2, whereas the inhibitory effect of GABA-T expression was relatively weak. In these two lines, both with two T-DNA copies, starch, amylose, and protein levels were slightly lower than in non-transformed cv. Koshihikari. Free amino acid analysis of mature kernels of these lines demonstrated elevated levels of GABA (75-350 mg/100 g polished rice) and also high levels of several amino acids, such as Ala, Ser, and Val. Because seeds of these lines could sustain their GABA content after harvest (up to 6 months), the strategy in this study could lead to the accumulation of GABA and its retention in the edible parts. © 2013 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Molecular Simulations of Sequence-Specific Association of Transmembrane Proteins in Lipid Bilayers
NASA Astrophysics Data System (ADS)
Doxastakis, Manolis; Prakash, Anupam; Janosi, Lorant
2011-03-01
Association of membrane proteins is central in material and information flow across the cellular membranes. Amino-acid sequence and the membrane environment are two critical factors controlling association; however, quantitative knowledge of such contributions is limited. In this work, we study the dimerization of helices in lipid bilayers using extensive parallel Monte Carlo simulations with recently developed algorithms. The dimerization of Glycophorin A is examined employing a coarse-grain model that retains a level of amino-acid specificity, in three different phospholipid bilayers. Association is driven by a balance of protein-protein and lipid-induced interactions with the latter playing a major role at short separations. Following a different approach, the effect of amino-acid sequence is studied using the four transmembrane domains of the epidermal growth factor receptor family in identical lipid environments. Detailed characterization of dimer formation and estimates of the free energy of association reveal that these helices present significant affinity to self-associate with certain dimers forming non-specific interfaces.
Kim, Sunhwa; Matsuo, Ichiro; Ajisaka, Katsumi; Nakajima, Harushi; Kitamoto, Katsuhiko
2002-10-01
We isolated a beta-N-acetylglucosaminidase encoding gene and its cDNA from the filamentous fungus Aspergillus nidulans, and designated it nagA. The nagA gene contained no intron and encoded a polypeptide of 603 amino acids with a putative 19-amino acid signal sequence. The deduced amino acid sequence was very similar to the sequence of Candida albicans Hex1 and Trichoderma harzianum Nag1. Yeast cells containing the nagA cDNA under the control of the GAL1 promoter expressed beta-N-acetylglucosaminidase activity. The chromosomal nagA gene of A. nidulans was disrupted by replacement with the argB marker gene. The disruptant strains expressed low levels of beta-N-acetylglucosaminidase activity and showed poor growth on a medium containing chitobiose as a carbon source. Aspergillus oryzae strain carrying the nagA gene under the control of the improved glaA promoter produced large amounts of beta-N-acetylglucosaminidase in a wheat bran solid culture.
Zn-metalloprotease sequences in extremophiles
NASA Astrophysics Data System (ADS)
Holden, T.; Dehipawala, S.; Golebiewska, U.; Cheung, E.; Tremberger, G., Jr.; Williams, E.; Schneider, P.; Gadura, N.; Lieberman, D.; Cheung, T.
2010-09-01
The Zn-metalloprotease family contains conserved amino acid structures such that the nucleotide fluctuation at the DNA level would exhibit correlated randomness as described by fractal dimension. A nucleotide sequence fractal dimension can be calculated from a numerical series consisting of the atomic numbers of each nucleotide. The structure's vibration modes can also be studied using a Gaussian Network Model. The vibration measure and fractal dimension values form a two-dimensional plot with a standard vector metric that can be used for comparison of structures. The preference for amino acid usage in extremophiles may suppress nucleotide fluctuations that could be analyzed in terms of fractal dimension and Shannon entropy. A protein-level cold adaptation study of the thermolysin Zn-metalloprotease family using molecular dynamics simulation was reported recently, and our results show that the associated nucleotide fluctuation suppression is consistent with a regression pattern generated from the sequences' fractal dimension and entropy values (R-square ~ 0.98, N = 5). It was observed that cold adaptation selected for high entropy and low fractal dimension values. Extension to the Archaemetzincin M54 family in extremophiles reveals a similar regression pattern (R-square = 0.98, N = 6). It was observed that the metalloprotease sequences of extremely halophilic organisms possess high fractal dimension and low entropy values as compared with non-halophiles. The zinc atom is usually bonded to the histidine residue, which shows limited levels of vibration in the Gaussian Network Model. The variability of the fractal dimension and entropy for a given protein structure suggests that extremophiles would have evolved after mesophiles, consistent with the biased usage of non-prebiotic amino acids by extremophiles. It may be argued that extremophiles have the capacity to offer extinction protection during drastic changes in astrobiological environments.
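The abstract above maps each nucleotide to a number and then measures the resulting series. As a rough sketch of the first step and of the Shannon entropy measure only: the mapping below assumes "atomic number of a nucleotide" means the summed atomic numbers of the free base (adenine C5H5N5 = 70, cytosine C4H5N3O = 58, guanine C5H5N5O = 78, thymine C5H6N2O2 = 66), and it assumes entropy is taken over symbol frequencies in bits. Neither choice is confirmed by the abstract, and the fractal dimension step is not sketched because the method is not stated.

```python
import math
from collections import Counter

# Assumed mapping: sum of atomic numbers over the atoms of each free base.
ATOMIC_NUMBER_SUM = {"A": 70, "C": 58, "G": 78, "T": 66}


def atomic_number_series(seq: str) -> list[int]:
    """Convert a DNA string into a numerical series, one value per base."""
    return [ATOMIC_NUMBER_SUM[base] for base in seq.upper()]


def shannon_entropy(values) -> float:
    """Shannon entropy (bits) of the empirical distribution of `values`."""
    counts = Counter(values)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())
```

A sequence using all four bases equally reaches the maximum of 2 bits, while a homopolymer has zero entropy.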
Detection of nucleic acid sequences by invader-directed cleavage
Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert
1999-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge.
Grizzly bear corticosteroid binding globulin: Cloning and serum protein expression.
Chow, Brian A; Hamilton, Jason; Alsop, Derek; Cattet, Marc R L; Stenhouse, Gordon; Vijayan, Mathilakath M
2010-06-01
Serum corticosteroid levels are routinely measured as markers of stress in wild animals. However, corticosteroid levels rise rapidly in response to the acute stress of capture and restraint for sampling, limiting its use as an indicator of chronic stress. We hypothesized that serum corticosteroid binding globulin (CBG), the primary transport protein for corticosteroids in circulation, may be a better marker of the stress status prior to capture in grizzly bears (Ursus arctos). To test this, a full-length CBG cDNA was cloned and sequenced from grizzly bear testis and polyclonal antibodies were generated for detection of this protein in bear sera. The deduced nucleotide and protein sequences were 1218 bp and 405 amino acids, respectively. Multiple sequence alignments showed that grizzly bear CBG (gbCBG) was 90% and 83% identical to the dog CBG nucleotide and amino acid sequences, respectively. The affinity purified rabbit gbCBG antiserum detected grizzly bear but not human CBG. There were no sex differences in serum total cortisol concentration, while CBG expression was significantly higher in adult females compared to males. Serum cortisol levels were significantly higher in bears captured by leg-hold snare compared to those captured by remote drug delivery from helicopter. However, serum CBG expression between these two groups did not differ significantly. Overall, serum CBG levels may be a better marker of chronic stress, especially because this protein is not modulated by the stress of capture and restraint in grizzly bears. Copyright 2010 Elsevier Inc. All rights reserved.
Wang, Bin; Shao, Yanchun; Chen, Tao; Chen, Wanping; Chen, Fusheng
2015-01-01
Acetobacter pasteurianus (Ap) CICC 20001 and CGMCC 1.41 are two acetic acid bacteria strains that, because of their strong abilities to produce and tolerate high concentrations of acetic acid, have been widely used to brew vinegar in China. To globally understand the fermentation characteristics, acid-tolerant mechanisms and genetic stabilities, their genomes were sequenced. Genomic comparisons with 9 other sequenced Ap strains revealed that their chromosomes were evolutionarily conserved, whereas the plasmids were unique compared with other Ap strains. Analysis of the acid-tolerant metabolic pathway at the genomic level indicated that the metabolism of some amino acids and the known mechanisms of acetic acid tolerance, might collaboratively contribute to acetic acid resistance in Ap strains. The balance of instability factors and stability factors in the genomes of Ap CICC 20001 and CGMCC 1.41 strains might be the basis for their genetic stability, consistent with their stable industrial performances. These observations provide important insights into the acid resistance mechanism and the genetic stability of Ap strains and lay a foundation for future genetic manipulation and engineering of these two strains. PMID:26691589
Johnson, Alexander A. T.
2017-01-01
Iron (Fe) uptake in graminaceous plant species occurs via the release and uptake of Fe-chelating compounds known as mugineic acid family phytosiderophores (MAs). In the MAs biosynthetic pathway, nicotianamine aminotransferase (NAAT) and deoxymugineic acid synthase (DMAS) enzymes catalyse the formation of 2’-deoxymugineic acid (DMA) from nicotianamine (NA). Here we describe the identification and characterisation of six TaNAAT and three TaDMAS1 genes in bread wheat (Triticum aestivum L.). The coding sequences of all six TaNAAT homeologs consist of seven exons with ≥88.0% nucleotide sequence identity and most sequence variation present in the first exon. The coding sequences of the three TaDMAS1 homeologs consist of three exons with ≥97.8% nucleotide sequence identity. Phylogenetic analysis revealed that the TaNAAT and TaDMAS1 proteins are most closely related to the HvNAAT and HvDMAS1 proteins of barley and that there are two distinct groups of TaNAAT proteins (TaNAAT1 and TaNAAT2) that correspond to the HvNAATA and HvNAATB proteins, respectively. Quantitative reverse transcription-PCR analysis revealed that the TaNAAT2 genes are expressed at highest levels in anther tissues whilst the TaNAAT1 and TaDMAS1 genes are expressed at highest levels in root tissues of bread wheat. Furthermore, the TaNAAT1, TaNAAT2 and TaDMAS1 genes were differentially regulated by plant Fe status and their expression was significantly upregulated in root tissues from day five onwards during a seven-day Fe deficiency treatment. The identification and characterization of the TaNAAT1, TaNAAT2 and TaDMAS1 genes provides a valuable genetic resource for improving bread wheat growth on Fe deficient soils and enhancing grain Fe nutrition. PMID:28475636
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2011 CFR
2011-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2013 CFR
2013-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2012 CFR
2012-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2010 CFR
2010-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.
Code of Federal Regulations, 2014 CFR
2014-07-01
... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Denef, Vincent; Shah, Manesh B; Verberkmoes, Nathan C
The recent surge in microbial genomic sequencing, combined with the development of high-throughput liquid chromatography-mass-spectrometry-based (LC/LC-MS/MS) proteomics, has raised the question of the extent to which genomic information of one strain or environmental sample can be used to profile proteomes of related strains or samples. Even with decreasing sequencing costs, it remains impractical to obtain genomic sequence for every strain or sample analyzed. Here, we evaluate how shotgun proteomics is affected by amino acid divergence between the sample and the genomic database using a probability-based model and a random mutation simulation model constrained by experimental data. To assess the effects of nonrandom distribution of mutations, we also evaluated identification levels using in silico peptide data from sequenced isolates with average amino acid identities (AAI) varying between 76 and 98%. We compared the predictions to experimental protein identification levels for a sample that was evaluated using a database that included genomic information for the dominant organism and for a closely related variant (95% AAI). The range of models set the boundaries at which half of the proteins in a proteomic experiment can be identified to be 77-92% AAI between orthologs in the sample and database. Consistent with this prediction, experimental data indicated loss of half the identifiable proteins at 90% AAI. Additional analysis indicated a 6.4% reduction of the initial protein coverage per 1% amino acid divergence and total identification loss at 86% AAI. Consequently, shotgun proteomics is capable of cross-strain identifications but avoids most cross-species false positives.
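The abstract does not spell out its probability-based model. One simple way to see why peptide matching degrades quickly with divergence is to assume substitutions are independent and uniformly spread, so a peptide of length L matches the database exactly with probability AAI^L. The sketch below encodes only that assumption; the function names are hypothetical and this is not the authors' model.

```python
def peptide_intact_probability(aai: float, length: int) -> float:
    """Probability that a peptide of `length` residues carries no substitution.

    Assumes each residue is conserved independently with probability `aai`
    (average amino acid identity as a fraction, e.g. 0.95).
    """
    if not 0.0 <= aai <= 1.0:
        raise ValueError("aai must be a fraction between 0 and 1")
    if length < 0:
        raise ValueError("length must be non-negative")
    return aai ** length


def expected_intact_fraction(aai: float, peptide_lengths: list[int]) -> float:
    """Expected fraction of peptides that still match the database exactly."""
    if not peptide_lengths:
        raise ValueError("peptide_lengths must not be empty")
    total = sum(peptide_intact_probability(aai, n) for n in peptide_lengths)
    return total / len(peptide_lengths)
```

Under this assumption longer peptides are lost first, which is one reason mutation clustering (the nonrandom distribution the abstract mentions) matters for real data.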
Isolation and Characterization of the PKAr Gene From a Plant Pathogen, Curvularia lunata.
Liu, T; Ma, B C; Hou, J M; Zuo, Y H
2014-09-01
Using an EST database from a full-length cDNA library of Curvularia lunata, we isolated a 2.9 kb cDNA, termed PKAr. An ORF of 1,383 bp encoding a polypeptide of 460 amino acids with a molecular weight of 50.1 kDa (GenBank Acc. No. KF675744) was cloned. The deduced amino acid sequence of PKAr shows 90% and 88% identity with the cAMP-dependent protein kinase A regulatory subunit from Alternaria alternata and Pyrenophora tritici-repentis Pt-1C-BFP, respectively. Database analysis revealed that the deduced amino acid sequence of PKAr shares considerable similarity with that of PKA regulatory subunits in other organisms, particularly in the conserved regions. No introns were identified within the 1,383 bp ORF when compared with the PKAr genomic DNA sequence. Southern blot analysis indicated that PKAr exists as a single copy per genome. The mRNA expression level of PKAr at different developmental stages was determined using real-time quantitative PCR. The results showed that PKAr expression was highest in vegetatively growing mycelium, which indicates that it might play an important role in the vegetative growth of C. lunata. These results provide a foundation for further research on the function of PKAr in the plant pathogen C. lunata.
Henderson, R A; Krissansen, G W; Yong, R Y; Leung, E; Watson, J D; Dholakia, J N
1994-12-02
Protein synthesis in mammalian cells is regulated at the level of the guanine nucleotide exchange factor, eIF-2B, which catalyzes the exchange of eukaryotic initiation factor 2-bound GDP for GTP. We have isolated and sequenced cDNA clones encoding the delta-subunit of murine eIF-2B. The cDNA sequence encodes a polypeptide of 544 amino acids with molecular mass of 60 kDa. Antibodies against a synthetic polypeptide of 30 amino acids deduced from the cDNA sequence specifically react with the delta-subunit of mammalian eIF-2B. The cDNA-derived amino acid sequence shows significant homology with the yeast translational regulator Gcd2, supporting the hypothesis that Gcd2 may be the yeast homolog of the delta-subunit of mammalian eIF-2B. Primer extension studies and anchor polymerase chain reaction analysis were performed to determine the 5'-end of the transcript for the delta-subunit of eIF-2B. Results of these experiments demonstrate two different mRNAs for the delta-subunit of eIF-2B in murine cells. The isolation and characterization of two different full-length cDNAs also predicts the presence of two alternate forms of the delta-subunit of eIF-2B in murine cells. These differ at their amino-terminal end but have identical nucleotide sequences coding for amino acids 31-544.
Sturm, A; Chrispeels, M J
1990-11-01
We isolated a full-length cDNA for apoplastic (extracellular or cell wall-bound) beta-fructosidase (invertase), determined its nucleotide sequence, and used it as a probe to measure changes in mRNA as a result of wounding of carrot storage roots and infection of carrot plants with the bacterial pathogen Erwinia carotovora. The derived amino acid sequence of extracellular beta-fructosidase shows that it is a basic protein (pI 9.9) with a signal sequence for entry into the endoplasmic reticulum and a propeptide at the N terminus that is not present in the mature protein. Amino acid sequence comparison with yeast and bacterial invertases shows that the overall homology is only about 28%, but that there are short conserved motifs, one of which is at the active site. Maturing carrot storage roots contain barely detectable levels of mRNA for extracellular beta-fructosidase and these levels rise slowly but dramatically after wounding with maximal expression after 12 hours. Infection of roots and leaves of carrot plants with E. carotovora results in a very fast increase in the mRNA levels with maximal expression after 1 hour. These results indicate that apoplastic beta-fructosidase is probably a new and hitherto unrecognized pathogenesis-related protein [Van Loon, L.C. (1985). Plant Mol. Biol. 4, 111-116]. Suspension-cultured carrot cells contain high levels of mRNA for extracellular beta-fructosidase and these levels remain the same whether the cells are grown on sucrose, glucose, or fructose.
Sturm, A; Chrispeels, M J
1990-01-01
We isolated a full-length cDNA for apoplastic (extracellular or cell wall-bound) beta-fructosidase (invertase), determined its nucleotide sequence, and used it as a probe to measure changes in mRNA as a result of wounding of carrot storage roots and infection of carrot plants with the bacterial pathogen Erwinia carotovora. The derived amino acid sequence of extracellular beta-fructosidase shows that it is a basic protein (pI 9.9) with a signal sequence for entry into the endoplasmic reticulum and a propeptide at the N terminus that is not present in the mature protein. Amino acid sequence comparison with yeast and bacterial invertases shows that the overall homology is only about 28%, but that there are short conserved motifs, one of which is at the active site. Maturing carrot storage roots contain barely detectable levels of mRNA for extracellular beta-fructosidase and these levels rise slowly but dramatically after wounding with maximal expression after 12 hours. Infection of roots and leaves of carrot plants with E. carotovora results in a very fast increase in the mRNA levels with maximal expression after 1 hour. These results indicate that apoplastic beta-fructosidase is probably a new and hitherto unrecognized pathogenesis-related protein [Van Loon, L.C. (1985). Plant Mol. Biol. 4, 111-116]. Suspension-cultured carrot cells contain high levels of mRNA for extracellular beta-fructosidase and these levels remain the same whether the cells are grown on sucrose, glucose, or fructose. PMID:2152110
Pyrin gene and mutants thereof, which cause familial Mediterranean fever
Kastner, Daniel L [Bethesda, MD; Aksentijevichh, Ivona [Bethesda, MD; Centola, Michael [Tacoma Park, MD; Deng, Zuoming [Gaithersburg, MD; Sood, Ramen [Rockville, MD; Collins, Francis S [Rockville, MD; Blake, Trevor [Laytonsville, MD; Liu, P Paul [Ellicott City, MD; Fischel-Ghodsian, Nathan [Los Angeles, CA; Gumucio, Deborah L [Ann Arbor, MI; Richards, Robert I [North Adelaide, AU; Ricke, Darrell O [San Diego, CA; Doggett, Norman A [Santa Cruz, NM; Pras, Mordechai [Tel-Hashomer, IL
2003-09-30
The invention provides the nucleic acid sequence encoding the protein associated with familial Mediterranean fever (FMF). The cDNA sequence is designated as MEFV. The invention is also directed towards fragments of the DNA sequence, as well as the corresponding sequence for the RNA transcript and fragments thereof. Another aspect of the invention provides the amino acid sequence for a protein (pyrin) associated with FMF. The invention is directed towards the full-length amino acid sequence, fusion proteins containing the amino acid sequence, and fragments thereof. The invention is also directed towards mutants of the nucleic acid and amino acid sequences associated with FMF. In particular, the invention discloses three missense mutations, clustered within about 40 to 50 amino acids, in the highly conserved rfp (B30.2) domain at the C-terminus of the protein. These mutants include M680I, M694V, K695R, and V726A. Additionally, the invention includes methods for diagnosing a patient at risk for having FMF and kits therefor.
Molecular cloning and expression analysis of annexin A2 gene in sika deer antler tip.
Xia, Yanling; Qu, Haomiao; Lu, Binshan; Zhang, Qiang; Li, Heping
2018-04-01
Molecular cloning and bioinformatics analysis of the annexin A2 (ANXA2) gene in sika deer antler tip were conducted, and the role of the ANXA2 gene in the growth and development of the antler was analyzed initially. Reverse transcriptase polymerase chain reaction (RT-PCR) was used to clone the cDNA sequence of the ANXA2 gene from the antler tip of sika deer (Cervus nippon hortulorum), and bioinformatics methods were applied to analyze the amino acid sequence of the Anxa2 protein. The mRNA expression levels of the ANXA2 gene in different growth stages were examined by real time reverse transcriptase polymerase chain reaction (real time RT-PCR). The nucleotide sequence analysis revealed an open reading frame of 1,020 bp encoding a protein of 339 amino acids with a calculated molecular weight of 38.6 kDa and an isoelectric point of 6.09. Homologous sequence alignment and phylogenetic analysis indicated that the Anxa2 mature protein of sika deer had the closest genetic distance to those of Cervus elaphus and Bos mutus. Real time RT-PCR results showed that the gene had differential expression levels in different growth stages, and the expression level of the ANXA2 gene was highest at metaphase (rapid growing period). The ANXA2 gene may promote cell proliferation, and the findings suggest Anxa2 as an important candidate for regulating the growth and development of deer antler.
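The reported ORF and protein lengths can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the 1,020 bp figure includes the terminal stop codon (the abstract does not say so explicitly); the function name is hypothetical.

```python
def orf_protein_length(orf_nt, includes_stop=True):
    """Return the number of amino acids encoded by an ORF of orf_nt nucleotides.

    Assumption: when includes_stop is True, the last codon is a stop codon
    and does not contribute an amino acid.
    """
    if orf_nt % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_nt // 3
    return codons - 1 if includes_stop else codons


# 1,020 bp -> 340 codons -> 339 amino acids plus one stop codon
print(orf_protein_length(1020))
```

Under that assumption the two numbers in the abstract agree: 340 codons, of which 339 encode amino acids.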
MIPS: a calmodulin-binding protein of Gracilaria lemaneiformis under heat shock.
Zhang, Xuan; Zhou, Huiyue; Zang, Xiaonan; Gong, Le; Sun, Hengyi; Zhang, Xuecheng
2014-08-01
To study the Ca(2+)/Calmodulin (CaM) signal transduction pathway of Gracilaria lemaneiformis under heat stress, myo-inositol-1-phosphate synthase (MIPS), a calmodulin-binding protein, was isolated using the yeast two-hybrid system. cDNA and DNA sequences of mips were cloned from G. lemaneiformis by using 5'RACE and genome walking procedures. The MIPS DNA sequence was 2,067 nucleotides long, containing an open reading frame (ORF) of 1,623 nucleotides with no intron. The mips ORF was predicted to encode 540 amino acids, which included the conserved MIPS domain and was 61-67 % similar to that of other species. After analyzing the amino acid sequence of MIPS, the CaM-Binding Domain (CaMBD) was inferred to be at a site spanning from amino acid 212 to amino acid 236. The yeast two-hybrid results proved that MIPS can interact with CaM and that MIPS is a type of calmodulin-binding protein. Next, the expression of CaM and MIPS in wild-type G. lemaneiformis and a heat-tolerant G. lemaneiformis cultivar, "981," were analyzed using real-time PCR under a heat shock of 32 °C. The expression level displayed a cyclical upward trend. Compared with wild type, the CaM expression levels of cultivar 981 were higher, which might directly relate to its resistance to high temperatures. This paper indicates that MIPS and CaM may play important roles in the high-temperature resistance of G. lemaneiformis.
Bonen, Linda; Boer, Poppo H.; Gray, Michael W.
1984-01-01
We have determined the sequence of the wheat mitochondrial gene for cytochrome oxidase subunit II (COII) and find that its derived protein sequence differs from that of maize at only three amino acid positions. Unexpectedly, all three replacements are non-conservative ones. The wheat COII gene has a highly-conserved intron at the same position as in maize, but the wheat intron is 1.5 times longer because of an insert relative to its maize counterpart. Hybridization analysis of mitochondrial DNA from rye, pea, broad bean and cucumber indicates strong sequence conservation of COII coding sequences among all these higher plants. However, only rye and maize mitochondrial DNA show homology with wheat COII intron sequences and rye alone with intron-insert sequences. We find that a sequence identical to the region of the 5' exon corresponding to the transmembrane domain of the COII protein is present at a second genomic location in wheat mitochondria. These variations in COII gene structure and size, as well as the presence of repeated COII sequences, illustrate at the DNA sequence level, factors which contribute to higher plant mitochondrial DNA diversity and complexity. PMID:16453565
Federal Register 2010, 2011, 2012, 2013, 2014
2012-10-29
... DEPARTMENT OF COMMERCE Patent and Trademark Office Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request... Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of...
Automated sequence analysis and editing software for HIV drug resistance testing.
Struck, Daniel; Wallis, Carole L; Denisov, Gennady; Lambert, Christine; Servais, Jean-Yves; Viana, Raquel V; Letsoalo, Esrom; Bronze, Michelle; Aitken, Sue C; Schuurman, Rob; Stevens, Wendy; Schmit, Jean Claude; Rinke de Wit, Tobias; Perez Bercoff, Danielle
2012-05-01
Access to antiretroviral treatment in resource-limited settings is inevitably paralleled by the emergence of HIV drug resistance. Monitoring treatment efficacy and HIV drug resistance testing are therefore of increasing importance in resource-limited settings. Yet low-cost technologies and procedures suited to the particular context and constraints of such settings are still lacking. The ART-A (Affordable Resistance Testing for Africa) consortium brought together public and private partners to address this issue. The objective was to develop automated sequence analysis and editing software to support high-throughput automated sequencing. The ART-A Software was designed to automatically process and edit ABI chromatograms or FASTA files from HIV-1 isolates. The ART-A Software performs the basecalling, assigns quality values, aligns query sequences against a set reference, infers a consensus sequence, identifies the HIV type and subtype, translates the nucleotide sequence to amino acids and reports insertions/deletions, premature stop codons, ambiguities and mixed calls. The results can be automatically exported to Excel to identify mutations. Automated analysis was compared to manual analysis using a panel of 1624 PR-RT sequences generated in 3 different laboratories. Discrepancies between manual and automated sequence analysis were 0.69% at the nucleotide level and 0.57% at the amino acid level (668,047 AA analyzed), and discordances at major resistance mutations were recorded in 62 cases (4.83% of differences, 0.04% of all AA) for PR and 171 cases (6.18% of differences, 0.03% of all AA) for RT. The ART-A Software is a time-sparing tool for pre-analyzing HIV and viral quasispecies sequences in high-throughput laboratories and highlighting positions requiring attention. Copyright © 2012 Elsevier B.V. All rights reserved.
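The translation and reporting step described in this abstract can be illustrated with a short sketch. This is not the ART-A implementation: it is a minimal Python example, assuming an in-frame, ungapped nucleotide string and the standard genetic code. The function name is hypothetical, and codons containing any non-ACGT character are simply reported as ambiguous (X) rather than resolved as mixed calls.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate_and_flag(nt_seq):
    """Translate an in-frame sequence; report premature stops and ambiguities.

    Returns (protein, premature_stops, ambiguous), where the two lists hold
    1-based codon positions. A stop in the final codon is not "premature".
    """
    seq = nt_seq.upper()
    n_codons = len(seq) // 3          # trailing partial codon is ignored
    protein, premature_stops, ambiguous = [], [], []
    for i in range(n_codons):
        codon = seq[3 * i:3 * i + 3]
        aa = CODON_TABLE.get(codon)
        if aa is None:                # contains a non-ACGT character
            protein.append("X")
            ambiguous.append(i + 1)
            continue
        protein.append(aa)
        if aa == "*" and i < n_codons - 1:
            premature_stops.append(i + 1)
    return "".join(protein), premature_stops, ambiguous


print(translate_and_flag("ATGTAAGGRTTT"))
```

A production tool would also resolve IUPAC ambiguity codes into the set of possible amino acids; that step is deliberately left out here.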
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2007-12-11
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
1999-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2002-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2010-11-09
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.
2000-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann; Dahlberg, James E.
2005-04-05
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Song, Jiangning; Tan, Hao; Wang, Mingjun; Webb, Geoffrey I.; Akutsu, Tatsuya
2012-01-01
Protein backbone torsion angles (Phi) and (Psi) involve two rotation angles rotating around the Cα-N bond (Phi) and the Cα-C bond (Psi). Due to the planarity of the linked rigid peptide bonds, these two angles can essentially determine the backbone geometry of proteins. Accordingly, the accurate prediction of protein backbone torsion angle from sequence information can assist the prediction of protein structures. In this study, we develop a new approach called TANGLE (Torsion ANGLE predictor) to predict the protein backbone torsion angles from amino acid sequences. TANGLE uses a two-level support vector regression approach to perform real-value torsion angle prediction using a variety of features derived from amino acid sequences, including the evolutionary profiles in the form of position-specific scoring matrices, predicted secondary structure, solvent accessibility and natively disordered region as well as other global sequence features. When evaluated based on a large benchmark dataset of 1,526 non-homologous proteins, the mean absolute errors (MAEs) of the Phi and Psi angle prediction are 27.8° and 44.6°, respectively, which are 1% and 3% respectively lower than that using one of the state-of-the-art prediction tools ANGLOR. Moreover, the prediction of TANGLE is significantly better than a random predictor that was built on the amino acid-specific basis, with the p-value<1.46e-147 and 7.97e-150, respectively by the Wilcoxon signed rank test. As a complementary approach to the current torsion angle prediction algorithms, TANGLE should prove useful in predicting protein structural properties and assisting protein fold recognition by applying the predicted torsion angles as useful restraints. TANGLE is freely accessible at http://sunflower.kuicr.kyoto-u.ac.jp/~sjn/TANGLE/. PMID:22319565
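The mean absolute error reported for torsion angles can be sketched as follows. The abstract does not state how TANGLE handles angle periodicity, so this example assumes that the error between two angles is the shorter way around the circle (an angle of 170° and one of -170° differ by 20°, not 340°). The function name is hypothetical.

```python
def angular_mae(predicted, observed):
    """Mean absolute error in degrees between two equal-length angle lists.

    Assumption: angles are periodic with period 360 degrees, so each error is
    the smaller of the two arc lengths between prediction and observation.
    """
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    total = 0.0
    for p, o in zip(predicted, observed):
        diff = abs(p - o) % 360.0
        total += min(diff, 360.0 - diff)
    return total / len(predicted)


# Errors are 20 and 10 degrees, so the mean is 15 degrees.
print(angular_mae([170.0, -60.0], [-170.0, -50.0]))
```

Without the wrap-around step, predictions near the ±180° boundary would be penalized far more heavily than their true geometric error.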
Rule, G S; Pratt, E A; Chin, C C; Wold, F; Ho, C
1985-01-01
Recombinant DNA plasmids containing the gene for the membrane-bound D-lactate dehydrogenase (D-LDH) of Escherichia coli linked to the promoter PL from lambda were constructed. After induction, the levels of D-LDH were elevated 300-fold over that of the wild type and amounted to 35% of the total cellular protein. The nucleotide sequence of the D-LDH gene was determined and shown to agree with the amino acid composition and the amino-terminal sequence of the purified enzyme. Removal of the amino-terminal formyl-Met from D-LDH was not inhibited in cells which contained these high levels of D-LDH. PMID:3882663
Method for nucleic acid hybridization using single-stranded DNA binding protein
Tabor, Stanley; Richardson, Charles C.
1996-01-01
Method of nucleic acid hybridization for detecting the presence of a specific nucleic acid sequence in a population of different nucleic acid sequences using a nucleic acid probe. The nucleic acid probe hybridizes with the specific nucleic acid sequence but not with other nucleic acid sequences in the population. The method includes contacting a sample (potentially including the nucleic acid sequence) with the nucleic acid probe under hybridizing conditions in the presence of a single-stranded DNA binding protein provided in an amount which stimulates renaturation of a dilute solution (i.e., one in which the t1/2 of renaturation is longer than 3 weeks) of single-stranded DNA greater than 500-fold (i.e., to a t1/2 less than 60 min, preferably less than 5 min, and most preferably about 1 min) in the absence of nucleotide triphosphates.
Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.
Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami
2012-08-01
Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature ( http://hivdb.Stanford.edu ). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if they have >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or <0.5% or >15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.
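The threshold-based flagging described above lends itself to a small illustration. SQUAT itself runs in R; the sketch below is written in Python purely for illustration and is not SQUAT code. It covers only three of the listed criteria (ambiguous bases, ambiguous amino acids, stop codons), uses the PR and RT limits quoted in the abstract, and assumes ungapped input in which ambiguous amino acids are written as X. All names are hypothetical.

```python
# Limits taken from the abstract; a sequence is flagged when a count EXCEEDS them.
THRESHOLDS = {
    "PR": {"ambiguous_bases": 6, "ambiguous_aa": 3, "stop_codons": 0},
    "RT": {"ambiguous_bases": 18, "ambiguous_aa": 6, "stop_codons": 0},
}


def quality_flags(region, nt_seq, aa_seq, thresholds=THRESHOLDS):
    """Return the names of the quality criteria that a sequence violates.

    region is "PR" or "RT"; nt_seq is the ungapped nucleotide sequence and
    aa_seq its translation (assumed: 'X' = ambiguous residue, '*' = stop).
    """
    limits = thresholds[region]
    counts = {
        "ambiguous_bases": sum(1 for b in nt_seq.upper() if b not in "ACGT"),
        "ambiguous_aa": aa_seq.upper().count("X"),
        "stop_codons": aa_seq.count("*"),
    }
    return [name for name, n in counts.items() if n > limits[name]]


# Seven ambiguous bases and one stop codon: both exceed the PR limits.
print(quality_flags("PR", "ACGTRYKMSWN", "PQIT*"))
```

Passing a modified `thresholds` dictionary mirrors the abstract's statement that thresholds are user modifiable.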
Detection of Emerging Vaccine-Related Polioviruses by Deep Sequencing.
Sahoo, Malaya K; Holubar, Marisa; Huang, ChunHong; Mohamed-Hadley, Alisha; Liu, Yuanyuan; Waggoner, Jesse J; Troy, Stephanie B; Garcia-Garcia, Lourdes; Ferreyra-Reyes, Leticia; Maldonado, Yvonne; Pinsky, Benjamin A
2017-07-01
Oral poliovirus vaccine can mutate to regain neurovirulence. To date, evaluation of these mutations has been performed primarily on culture-enriched isolates by using conventional Sanger sequencing. We therefore developed a culture-independent, deep-sequencing method targeting the 5' untranslated region (UTR) and P1 genomic region to characterize vaccine-related poliovirus variants. Error analysis of the deep-sequencing method demonstrated reliable detection of poliovirus mutations at levels of <1%, depending on read depth. Sequencing of viral nucleic acids from the stool of vaccinated, asymptomatic children and their close contacts collected during a prospective cohort study in Veracruz, Mexico, revealed no vaccine-derived polioviruses. This was expected given that the longest duration between sequenced sample collection and the end of the most recent national immunization week was 66 days. However, we identified many low-level variants (<5%) distributed across the 5' UTR and P1 genomic region in all three Sabin serotypes, as well as vaccine-related viruses with multiple canonical mutations associated with phenotypic reversion present at high levels (>90%). These results suggest that monitoring emerging vaccine-related poliovirus variants by deep sequencing may aid in the poliovirus endgame and efforts to ensure global polio eradication. Copyright © 2017 Sahoo et al.
Tharia, Hazel A; Shrive, Annette K; Mills, John D; Arme, Chris; Williams, Gwyn T; Greenhough, Trevor J
2002-02-22
The serum amyloid P component (SAP)-like pentraxin Limulus polyphemus SAP is a recently discovered, distinct pentraxin species, of known structure, which does not bind phosphocholine and whose N-terminal sequence has been shown to differ markedly from the highly conserved N terminus of all other known horseshoe crab pentraxins. The complete cDNA sequence of Limulus SAP, and the derived amino acid sequence, the first invertebrate SAP-like pentraxin sequence, have been determined. Two sequences were identified that differed only in the length of the 3' untranslated region. Limulus SAP is synthesised as a precursor protein of 234 amino acid residues, the first 17 residues encoding a signal peptide that is absent from the mature protein. Phylogenetic analysis clusters Limulus SAP pentraxin with the horseshoe crab C-reactive proteins (CRPs) rather than the mammalian SAPs, which are clustered with mammalian CRPs. The deduced amino acid sequence shares 22% identity with both human SAP and CRP, which are 51% identical, and 31-35% with horseshoe crab CRPs. These analyses indicate that gene duplication of CRP (or SAP), followed by sequence divergence and the evolution of CRP and/or SAP function, occurred independently along the chordate and arthropod evolutionary lines rather than in a common ancestor. They further indicate that the CRP/SAP gene duplication event in Limulus occurred before both the emergence of the Limulus CRP variants and the mammalian CRP/SAP gene duplication. Limulus SAP, which does not exhibit the CRP characteristic of calcium-dependent binding to phosphocholine, is established as a pentraxin species distinct from all other known horseshoe crab pentraxins that exist in many variant forms sharing a high level of sequence homology. Copyright 2002 Elsevier Science Ltd.
Mashima, Izumi; Liao, Yu-Chieh; Miyakawa, Hiroshi; Theodorea, Citra F; Thawboon, Boonyanit; Thaweboon, Sroisiri; Scannapieco, Frank A; Nakazawa, Futoshi
2018-04-01
A strain of a novel anaerobic, Gram-stain-negative coccus was isolated from the tongue biofilm of a Thai child. This strain was shown, at the phenotypic level and based on 16S rRNA gene sequencing, to be a member of the genus Veillonella. Comparative analysis of the 16S rRNA, dnaK and rpoB gene sequences indicated that phylogenetically the strain comprised a distinct novel branch within the genus Veillonella. The novel strain showed 99.8, 95.1 and 95.9 % similarity in partial 16S rRNA, dnaK and rpoB gene sequences, respectively, to the type strains of the two most closely related species, Veillonella dispar ATCC 17748T and Veillonella tobetsuensis ATCC BAA-2400T. The novel strain could be discriminated from previously reported species of the genus Veillonella based on partial dnaK and rpoB gene sequencing and average nucleotide identity values. The major acid end-product produced by this strain was acetic acid under anaerobic conditions in trypticase-yeast extract-haemin with 1 % (w/v) glucose or fructose medium. Lactate was fermented to acetic acid and propionic acid. Based on these observations, this strain represents a novel species, for which the name Veillonella infantium sp. nov. is proposed. The type strain is T11011-4T (=JCM 31738T =TSD-88T).
Marron, Alan O; Akam, Michael; Walker, Giselle
2013-01-01
Cultures of heterotrophic protists often require co-culturing with bacteria to act as a source of nutrition. Such cultures will contain varying levels of intrinsic bacterial contamination that can interfere with molecular research and cause problems with the collection of sufficient material for sequencing. Measuring the levels of bacterial contamination for the purposes of molecular biology research is non-trivial, and can be complicated by the presence of a diverse bacterial flora, or by differences in the relative nucleic acid yield per bacterial or eukaryotic cell. Here we describe a duplex PCR-based assay that can be used to measure the levels of contamination from marine bacteria in a culture of loricate choanoflagellates. By comparison to a standard culture of known target sequence content, the assay can be used to quantify the relative proportions of bacterial and choanoflagellate material in DNA or RNA samples extracted from a culture. We apply the assay to compare methods of purifying choanoflagellate cultures prior to DNA extraction, to determine their effectiveness in reducing bacterial contamination. Together with measurements of the total nucleic acid concentration, the assay can then be used as the basis for determining the absolute amounts of choanoflagellate DNA or RNA present in a sample. The assay protocol we describe here is a simple and relatively inexpensive method of measuring contamination levels in nucleic acid samples. This provides a new way to establish quantification and purification protocols for molecular biology and genomics in novel heterotrophic protist species. Guidelines are provided to develop a similar protocol for use with any protistan culture. This assay method is recommended where qPCR equipment is unavailable, where qPCR is not viable because of the nature of the bacterial contamination or starting material, or where prior sequence information is insufficient to develop qPCR protocols.
van den Berg, M; Verbaarschot, P; Hontelez, S; Vet, L E M; Dicke, M; Smid, H M
2010-06-01
The cAMP/PKA signalling pathway and transcription factor cAMP response element-binding protein (CREB) play key roles in long-term memory (LTM) formation. We used two closely related parasitic wasp species, Cotesia glomerata and Cotesia rubecula, which were previously shown to be different in LTM formation, and sequenced at least nine different CREB transcripts in both wasp species. The splicing patterns, functional domains and amino acid sequences were similar to those found in the CREB genes of other organisms. The predicted amino acid sequences of the CREB isoforms were identical in both wasp species. Using real-time quantitative PCR we found that two low-abundance CREB transcripts are differentially expressed in the two wasps, whereas the expression levels of high-abundance transcripts are similar.
On the inhibition of muscle membrane chloride conductance by aromatic carboxylic acids
Palade, PT; Barchi, RL
1977-01-01
Twenty-five aromatic carboxylic acids which are analogs of benzoic acid were tested in the rat diaphragm preparation for effects on chloride conductance (G(Cl)). Of the 25, 19 were shown to reduce membrane G(Cl) with little effect on other membrane parameters, although their apparent K(i) varied widely. This inhibition was reversible if exposure times were not prolonged. The most effective analog studied was anthracene-9-COOH (9-AC; K(i) = 1.1 × 10^-5 M). Active analogs produced concentration-dependent inhibition of a type consistent with interaction at a single site or group of sites having similar binding affinities, although a correlation could also be shown between lipophilicity and K(i). Structure-activity analysis indicated that hydrophobic ring substitution usually increased inhibitory activity while para polar substitutions reduced effectiveness. These compounds do not appear to inhibit G(Cl) by altering membrane surface charge and the inhibition produced is not voltage dependent. Qualitative characteristics of the I-V relationship for Cl- current are not altered. Conductance to all anions is not uniformly altered by these acids as would be expected from steric occlusion of a common channel. Concentrations of 9-AC reducing G(Cl) by more than 90 percent resulted in slight augmentation of G(I). The complete conductance sequence obtained at high levels of 9-AC was the reverse of that obtained under control conditions. Permeability sequences underwent progressive changes with increasing 9-AC concentration and ultimately inverted at high levels of the analog. Aromatic carboxylic acids appear to inhibit G(Cl) by binding to a specific intramembrane site and altering the selectivity sequence of the membrane anion channel. PMID:894246
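The statement that inhibition is consistent with interaction at a single site can be illustrated numerically. The sketch below assumes the simplest one-site binding isotherm, in which fractional inhibition equals [I] / ([I] + K(i)); the abstract does not give the fitted equation, so this model and the function names are assumptions. The K(i) value is the one reported for 9-AC.

```python
KI_9AC = 1.1e-5  # mol/L, K(i) reported for anthracene-9-COOH


def fractional_inhibition(conc, ki):
    """Fraction of conductance inhibited under an assumed one-site model."""
    return conc / (conc + ki)


def conc_for_inhibition(fraction, ki):
    """Inhibitor concentration giving the requested fractional inhibition."""
    if not 0.0 <= fraction < 1.0:
        raise ValueError("fraction must be in [0, 1)")
    return ki * fraction / (1.0 - fraction)


# At [I] = K(i) the model predicts half-maximal inhibition.
print(fractional_inhibition(KI_9AC, KI_9AC))
# 90 percent inhibition requires about nine times K(i), roughly 1e-4 M.
print(conc_for_inhibition(0.9, KI_9AC))
```

Under this model, the concentrations that reduce G(Cl) by more than 90 percent would lie above roughly 10^-4 M for 9-AC.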
Saito, T; Ochiai, H
1999-10-01
cDNA fragments putatively encoding amino acid sequences characteristic of the fatty acid desaturase were obtained using expressed sequence tag (EST) information of the Dictyostelium cDNA project. Using this sequence, we have determined the cDNA sequence and genomic sequence of a desaturase. The cloned cDNA is 1489 nucleotides long and the deduced amino acid sequence comprises 464 amino acid residues containing an N-terminal cytochrome b5 domain. The whole sequence was 38.6% identical to the initially identified Delta5-desaturase of Mortierella alpina. We have confirmed its function as Delta5-desaturase by overexpression mutation in D. discoideum and also by gain-of-function mutation in the yeast Saccharomyces cerevisiae. Analysis of the lipids from transformed D. discoideum and yeast demonstrated the accumulation of Delta5-desaturated products. This is the first report concerning fatty acid desaturase in cellular slime molds.
Methods for determining the genetic affinity of microorganisms and viruses
NASA Technical Reports Server (NTRS)
Fox, George E. (Inventor); Willson, III, Richard C. (Inventor); Zhang, Zhengdong (Inventor)
2012-01-01
The method selects which sub-sequences in a database of nucleic acid sequences, such as 16S rRNA, are highly characteristic of particular groupings of bacteria, microorganisms, fungi, etc. on a substantially phylogenetic tree. It is also applicable to viruses comprising viral genomic RNA or DNA. A catalogue of highly characteristic sequences identified by this method is assembled to establish the genetic identity of an unknown organism. The characteristic sequences are used to design nucleic acid hybridization probes that include the characteristic sequence or its complement, or are derived from one or more characteristic sequences. A plurality of these characteristic sequences is used in hybridization to determine the phylogenetic tree position of the organism(s) in a sample. For target organisms represented in the original sequence database, sufficient characteristic sequences can identify the organism to the species or subspecies level. Oligonucleotide arrays of many probes are especially preferred. A hybridization signal can comprise fluorescence, chemiluminescence, or isotopic labeling, etc.; or sequences in a sample can be detected by direct means, e.g. mass spectrometry. The method's characteristic sequences can also be used to design specific PCR primers. The method uniquely identifies the phylogenetic affinity of an unknown organism without requiring prior knowledge of what is present in the sample. Even if the organism has not been previously encountered, the method still provides useful information about which phylogenetic tree bifurcation nodes encompass the organism.
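The idea of selecting sub-sequences that are characteristic of a grouping can be sketched in a few lines. The abstract does not define the selection criterion, so this example assumes the strictest one: a sub-sequence of fixed length k is characteristic of a group if it occurs in every sequence of that group and in no sequence outside it. The function names and the toy sequences are hypothetical.

```python
def kmers(seq, k):
    """Return the set of all length-k sub-sequences of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def characteristic_kmers(groups, target, k):
    """Find k-mers present in every target sequence and absent elsewhere.

    groups maps a group name to a list of sequences; target is the name of
    the group of interest (assumed to contain at least one sequence).
    """
    shared = set.intersection(*(kmers(s, k) for s in groups[target]))
    for name, seqs in groups.items():
        if name == target:
            continue
        for s in seqs:
            shared -= kmers(s, k)   # drop anything seen outside the group
    return shared


toy = {
    "A": ["ACGTAC", "TACGTA"],   # both contain ACGT and CGTA
    "B": ["GGCGTA"],             # also contains CGTA
}
print(characteristic_kmers(toy, "A", 4))
```

In the toy data only ACGT survives for group A, because CGTA also occurs in group B. Applying the same test at each node of a tree would yield node-specific candidates for probe or primer design.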
Protein Design Using Unnatural Amino Acids
NASA Astrophysics Data System (ADS)
Bilgiçer, Basar; Kumar, Krishna
2003-11-01
With the increasing availability of whole organism genome sequences, understanding protein structure and function is of capital importance. Recent developments in the methodology of incorporation of unnatural amino acids into proteins allow the exploration of proteins at a very detailed level. Furthermore, de novo design of novel protein structures and function is feasible with unprecedented sophistication. Using examples from the literature, this article describes the available methods for unnatural amino acid incorporation and highlights some recent applications including the design of hyperstable protein folds.
Composition for nucleic acid sequencing
Korlach, Jonas [Ithaca, NY]; Webb, Watt W [Ithaca, NY]; Levene, Michael [Ithaca, NY]; Turner, Stephen [Ithaca, NY]; Craighead, Harold G [Ithaca, NY]; Foquet, Mathieu [Ithaca, NY]
2008-08-26
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules
Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu
2006-06-06
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules
Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu
2006-05-30
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC
NASA Astrophysics Data System (ADS)
Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.
2000-02-01
Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.
Maruri-López, Israel; Rodríguez-Kessler, Margarita; Rodríguez-Hernández, Aída Araceli; Becerra-Flora, Alicia; Olivares-Grajales, Juan Elías; Jiménez-Bremont, Juan Francisco
2014-05-01
Polyamines are low molecular weight aliphatic compounds involved in various biochemical, cellular and physiological processes in all organisms. In plants, genes involved in polyamine biosynthesis and catabolism are regulated at the transcriptional, translational, and posttranslational levels. In this research, we focused on the characterization of a PEST sequence (rich in proline, glutamic acid, serine, and threonine) of the maize spermine synthase 1 (ZmSPMS1). To this end, 123 bp encoding 40 amino acids of the C-terminal region of the ZmSPMS1 enzyme containing the PEST sequence were fused to the GUS reporter gene. This fusion was evaluated in Arabidopsis thaliana transgenic lines and in an onion monolayer transient expression system. The ZmSPMS1 PEST sequence leads to specific degradation of the GUS reporter protein. It is suggested that the 26S proteasome may be involved in GUS::PEST fusion degradation in both onion and Arabidopsis. PEST sequences appear to be present in plant spermine synthases, mainly in monocots. Copyright © 2014 Elsevier Masson SAS. All rights reserved.
Fabrication of a New Lineage of Artificial Luciferases from Natural Luciferase Pools.
Kim, Sung Bae; Nishihara, Ryo; Citterio, Daniel; Suzuki, Koji
2017-09-11
The fabrication of artificial luciferases (ALucs) with unique optical properties has a fundamental impact on bioassays and molecular imaging. In this study, we developed a new lineage of ALucs with unique substrate preferences by extracting consensus amino acids from the alignment of 25 copepod luciferase sequences available in natural luciferase pools. The primary sequence was first created with a sequence logo generator, resulting in a total of 11 sibling sequences. Phylogenetic analysis shows that the newly fabricated ALucs form an independent branch, genetically isolated from the natural luciferases and from a prior series of ALucs produced by our laboratory using a smaller basis set. The new lineage of ALucs was strongly luminescent in living mammalian cells, with specific substrate selectivity for native coelenterazine. A single-residue-level comparison of the C-terminal sequences of the new ALucs reveals that some amino acids at the C-terminal ends strongly influence the optical intensities but have limited effect on the color variance. The success of this approach offers guidance on how to engineer and functionalize marine luciferases for bioluminescence imaging and assays.
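The consensus-extraction step mentioned in this abstract can be illustrated with a minimal sketch. This is not the authors' procedure (they report using a sequence logo generator on 25 copepod luciferase sequences); it is a plain majority-rule consensus over a toy alignment, and the function name `consensus`, the gap symbol, the alphabetical tie-breaking rule, and the toy sequences are all assumptions made for illustration.

```python
from collections import Counter


def consensus(aligned, gap="-"):
    """Majority-rule consensus of equal-length aligned sequences (illustrative only)."""
    if not aligned:
        raise ValueError("need at least one sequence")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("aligned sequences must have equal length")

    result = []
    for column in zip(*aligned):
        # Count residues in this column, ignoring gap characters.
        counts = Counter(residue for residue in column if residue != gap)
        if not counts:
            # A column made only of gaps stays a gap in the consensus.
            result.append(gap)
            continue
        # Most frequent residue wins; ties are broken alphabetically (an assumption).
        best = min(counts.items(), key=lambda item: (-item[1], item[0]))[0]
        result.append(best)
    return "".join(result)


# Hypothetical toy alignment, not real luciferase data.
toy_alignment = ["MKTLV", "MKSLV", "MRTL-", "MKTIV"]
print(consensus(toy_alignment))
```

A real pipeline would normally weight redundant sequences and handle ambiguous columns more carefully than a simple majority vote does.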
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2014-02-25
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-05-16
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2008-04-01
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2010-10-12
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVIII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-05-23
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl8, and the corresponding EGVIII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVIII, recombinant EGVIII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2010-10-05
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-06-06
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2009-05-05
The present invention provides an endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2013-07-16
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2012-02-14
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2015-04-14
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
Neill, John D; Dubovi, Edward J; Ridpath, Julia F
2015-09-30
Bovine viral diarrhea viruses (BVDV) are most commonly associated with infections of cattle. However, BVDV are often isolated from closely related ruminants with a number of BVDV-1b viruses being isolated from alpacas that were both acutely and persistently infected. The complete nucleotide sequence of the open reading frame of eleven alpaca-adapted BVDV isolates and the region encoding the envelope glycoproteins of an additional three isolates were determined. With the exception of one, all alpaca isolates were >99.2% similar at the nucleotide level. The Hercules isolate was more divergent, with 95.7% sequence identity to the other viruses. Sequence similarity of the 14 viruses indicated they were isolates of a single BVDV strain that had adapted to and were circulating through alpaca herds. Hercules was a more distantly related strain that has been isolated only once in Canada and represented a separate adaptation event that possessed the same adaptive changes. Comparison of amino acid sequences of alpaca and bovine-derived BVDV strains revealed three regions with amino acid sequences unique to all alpaca isolates. The first contained two small in-frame deletions near the N-terminus of the E2 glycoprotein. The second was found near the C-terminus of the E2 protein where four altered amino acids were located within a 30 amino acid domain that participates in E2 homodimerization. The third region contained three variable amino acids in the C-terminus of the E(rns) within the amphipathic helix membrane anchor. These changes were found in the polar side of the amphipathic helix and resulted in an increased charge within the polar face. Titration of bovine and alpaca viruses in both bovine and alpaca cells indicated that with increased charge in the amphipathic helix, the ability to infect alpaca cells also increased. Published by Elsevier B.V.
Kit for detecting nucleic acid sequences using competitive hybridization probes
Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.
2001-01-01
A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the target sequence.
Peng, Qisheng; Wang, Zijian; Wu, Donglin; Li, Xiaoou; Liu, Xiaofeng; Sun, Wanchun; Liu, Ning
2016-08-01
Amino acid substitutions in the neuraminidase of the influenza virus are the main cause of the emergence of resistance to zanamivir or oseltamivir during seasonal influenza treatment; they are the result of non-synonymous mutations in the viral genome that can be successfully detected by polymerase chain reaction (PCR)-based approaches. There is always an urgent need to detect variation in amino acid sequences directly at the protein level. Mass spectrometry coupled with de novo sequencing has also been explored as an alternative and straightforward strategy for detecting amino acid substitutions; this approach is the primary focus of the present study. Influenza virus (A/Puerto Rico/8/1934 H1N1) propagated in embryonated chicken eggs was purified by ultracentrifugation, followed by PNGase F treatment. The deglycosylated virion was lysed and separated by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). The gel band corresponding to neuraminidase was excised and subjected to liquid chromatography tandem mass spectrometry (LC-MS/MS) analysis. LC-MS/MS analyses, coupled with manual de novo sequencing, allowed the determination of three amino acid substitutions, R346K, S349N, and S370I/L, in the neuraminidase from the influenza virus (A/Puerto Rico/8/1934 H1N1); these were located in three mutated peptides of the neuraminidase: YGNGVWIGK, TKNHSSR, and PNGWTETDI/LK, respectively. We found that the amino acid substitutions in the proteins of RNA viruses (including influenza A virus) resulting from non-synonymous gene mutations can indeed be directly analyzed via mass spectrometry, and that manual interpretation of the MS/MS data may be beneficial. Copyright © 2016 John Wiley & Sons, Ltd.
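The "I/L" notation in S370I/L reflects a general property of mass spectrometry: isoleucine and leucine residues have the same elemental composition and therefore the same mass. The sketch below is an illustration under stated assumptions, not the authors' analysis workflow: it uses standard monoisotopic residue masses, and the helper names `peptide_mass` and `substitution_shift` are hypothetical. It shows that the two candidate peptides are indistinguishable by mass, whereas S to N and R to K produce distinct mass shifts.

```python
# Standard monoisotopic residue masses in daltons.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565  # one water is added for the free N- and C-termini


def peptide_mass(peptide):
    """Monoisotopic mass of an unmodified, uncharged linear peptide."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER


def substitution_shift(original, replacement):
    """Mass change caused by replacing one residue with another."""
    return RESIDUE_MASS[replacement] - RESIDUE_MASS[original]


# The two candidate peptides for S370I/L have identical masses.
for variant in ("PNGWTETDIK", "PNGWTETDLK"):
    print(variant, round(peptide_mass(variant), 4))

# The other two reported substitutions shift the mass measurably.
print("S->N shift:", round(substitution_shift("S", "N"), 4))
print("R->K shift:", round(substitution_shift("R", "K"), 4))
```

Distinguishing I from L generally requires additional evidence, such as side-chain fragment ions or the underlying nucleotide sequence, which is outside the scope of this sketch.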
Huang, Shengbing; Song, Wei; Lin, Qishui
2005-08-01
A membrane-bound protein was purified from rat liver mitochondria. After digestion with V8 protease, two peptides containing identical 14-amino-acid residue sequences were obtained. Using a DNA sequence derived from the 14-amino-acid peptide as a gene-specific primer, the 5'-terminal and 3'-terminal cDNA of the corresponding gene were obtained by the RACE technique. The full-length cDNA, which encoded a protein of 616 amino acids and included the above-mentioned peptide sequence, was thus cloned. The full-length cDNA was highly homologous to that of human ETF-QO, indicating that it may be the cDNA of rat ETF-QO. ETF-QO is an iron-sulfur protein located in the mitochondrial inner membrane that contains two kinds of redox centers: FAD and a [4Fe-4S] center. Comparison of the sequence deduced from the cDNA of the 616-amino-acid protein with that of the mature protein from rat liver mitochondria showed that the N-terminal 32 amino acid residues were absent from the mature protein, indicating that the cDNA was that of ETF-QOp. When the cDNA was expressed in Saccharomyces cerevisiae with inducible vectors, the protein product was enriched in the mitochondrial fraction and exhibited the electron transfer activity (NBT reductase activity) of ETF-QO. The results demonstrated that the 32-amino-acid peptide was a mitochondrial targeting peptide, and that both FAD and the iron-sulfur cluster were inserted properly into the expressed ETF-QO. ETF-QO was expressed at a high level in rat heart, liver and kidney. The GFP-ETF-QO fusion protein co-localized with mitochondria in COS-7 cells.
Álvarez-Cervantes, Jorge; Díaz-Godínez, Gerardo; Mercado-Flores, Yuridia; Gupta, Vijai Kumar; Anducho-Reyes, Miguel Angel
2016-01-01
In this paper, the amino acid sequence of the β-xylanase SRXL1 of Sporisorium reilianum, a pathogenic fungus of maize, was used as a model protein to determine its phylogenetic relationship with other xylanases of Ascomycetes and Basidiomycetes; the information obtained allowed a hypothesis of monophyly and of biological role to be established. Eighty-four amino acid sequences of β-xylanase obtained from the GenBank database were used. Higher-level grouping analysis in the Pfam database showed that the proteins under study were classified into the GH10 and GH11 families, based on the regions of highly conserved amino acids, 233–318 and 180–193 respectively, where glutamate residues are responsible for catalysis. PMID:27040368
Chip-based sequencing nucleic acids
Beer, Neil Reginald
2014-08-26
A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.
Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F M; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L; Glocker, Michael O
2015-03-01
Mass spectrometric de novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein G' of great scientific and economic importance. Substantial deviations from the published amino acid sequence (Uniprot Q54181) were found: 46 additional amino acids at the N-terminus, including a so-called "His-tag", as well as an N-terminal partial α-N-gluconoylation and α-N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Because of the higher mass caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α-N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to identify the expression vector pET-28b from Novagen, with the Xho I restriction enzyme cleavage site, as the best candidate for the vector that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant (K(d)) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.
Thomsen, Martin Christen Frølund; Nielsen, Morten
2012-01-01
Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and a low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2Logo aims at resolving these issues by allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for a low number of observations, and different logotype representations, each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed). PMID:22638583
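The "information content" that a sequence logo displays can be sketched for a single alignment column as follows. This is the classical Shannon formulation with a uniform pseudo count, offered as an assumption-laden illustration rather than Seq2Logo's own algorithm (the abstract describes sequence weighting and several logo types that are not reproduced here). The function name `column_information` and the `pseudocount` parameter are hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def column_information(column, pseudocount=0.0):
    """Shannon information content (bits) of one alignment column.

    Computed as log2(20) minus the column entropy. A uniform pseudocount
    is added to every amino acid count (an illustrative choice) to damp
    over-confident estimates when few sequences are observed.
    """
    counts = Counter(residue for residue in column if residue in AMINO_ACIDS)
    total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
    if total == 0:
        return 0.0  # nothing observed, nothing to report

    entropy = 0.0
    for aa in AMINO_ACIDS:
        p = (counts[aa] + pseudocount) / total
        if p > 0:
            entropy -= p * math.log2(p)
    return math.log2(len(AMINO_ACIDS)) - entropy


# A fully conserved column carries the maximum information, log2(20) bits.
print(column_information("AAAA"))
# With a pseudo count, four observations are no longer treated as certain.
print(column_information("AAAA", pseudocount=1.0))
```

With only four observations, the pseudo count lowers the reported information, which is the kind of small-sample correction the abstract refers to.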
Tanaka, Junko; Doi, Nobuhide; Takashima, Hideaki; Yanagawa, Hiroshi
2010-01-01
Screening of functional proteins from a random-sequence library has been used to evolve novel proteins in the field of evolutionary protein engineering. However, random-sequence proteins consisting of the 20 natural amino acids tend to aggregate, and the occurrence rate of functional proteins in a random-sequence library is low. From the viewpoint of the origin of life, it has been proposed that primordial proteins consisted of a limited set of amino acids that could have been abundantly formed early during chemical evolution. We have previously found that members of a random-sequence protein library constructed with five primitive amino acids show high solubility (Doi et al., Protein Eng Des Sel 2005;18:279–284). Although such a library is expected to be appropriate for finding functional proteins, the functionality may be limited, because they have no positively charged amino acid. Here, we constructed three libraries of 120-amino acid, random-sequence proteins using alphabets of 5, 12, and 20 amino acids by preselection using mRNA display (to eliminate sequences containing stop codons and frameshifts) and characterized and compared the structural properties of random-sequence proteins arbitrarily chosen from these libraries. We found that random-sequence proteins constructed with the 12-member alphabet (including five primitive amino acids and positively charged amino acids) have higher solubility than those constructed with the 20-member alphabet, though other biophysical properties are very similar in the two libraries. Thus, a library of moderate complexity constructed from 12 amino acids may be a more appropriate resource for functional screening than one constructed from 20 amino acids. PMID:20162614
DOE Office of Scientific and Technical Information (OSTI.GOV)
Reiser, Steven E.; Somerville, Chris R.
The present invention relates to bacterial enzymes, in particular to an acyl-CoA reductase and a gene encoding an acyl-CoA reductase, the amino acid and nucleic acid sequences corresponding to the reductase polypeptide and gene, respectively, and to methods of obtaining such enzymes, amino acid sequences and nucleic acid sequences. The invention also relates to the use of such sequences to provide transgenic host cells capable of producing fatty alcohols and fatty aldehydes.
Insights into the diversity of eukaryotes in acid mine drainage biofilm communities.
Baker, Brett J; Tyson, Gene W; Goosherst, Lindsey; Banfield, Jillian F
2009-04-01
Microscopic eukaryotes are known to have important ecosystem functions, but their diversity in most environments remains vastly unexplored. Here we analyzed an 18S rRNA gene library from a subsurface iron- and sulfur-oxidizing microbial community growing in highly acidic (pH < 0.9) runoff within the Richmond Mine at Iron Mountain (northern California). Phylogenetic analysis revealed that the majority (68%) of the sequences belonged to fungi. Protists falling into the deeply branching lineage named the acidophilic protist clade (APC) and the class Heterolobosea were also present. The APC group represents kingdom-level novelty, with <76% sequence similarity to 18S rRNA gene sequences of organisms from other environments. Fluorescently labeled oligonucleotide rRNA probes were designed to target each of these groups in biofilm samples, enabling abundance and morphological characterization. Results revealed that the populations vary significantly with the habitat and no group is ubiquitous. Surprisingly, many of the eukaryotic lineages (with the exception of the APC) are closely related to neutrophiles, suggesting that they recently adapted to this extreme environment. Molecular analyses presented here confirm that the number of eukaryotic species associated with the acid mine drainage (AMD) communities is low. This finding is consistent with previous results showing a limited diversity of archaea, bacteria, and viruses in AMD environments and suggests that the environmental pressures and interplay between the members of these communities limit species diversity at all trophic levels.
BGL7 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2013-01-29
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2012-10-02
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL5 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-02-28
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL5 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2008-03-18
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dunn-Coleman, Nigel; Ward, Michael
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2014-03-04
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2015-04-14
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2014-03-25
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Ward, Michael
2015-08-11
The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2007-09-25
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2008-04-01
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2011-12-06
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL4 β-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-05-16
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2011-06-14
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Ward, Michael [San Francisco, CA]
2009-09-01
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2012-10-30
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same
Dunn-Coleman, Nigel [Los Gatos, CA]; Goedegebuur, Frits [Vlaardingen, NL]; Ward, Michael [San Francisco, CA]; Yao, Jian [Sunnyvale, CA]
2008-01-22
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
ESTs from Seeds to Assist the Selective Breeding of Jatropha curcas L. for Oil and Active Compounds
Gomes, Kleber A; Almeida, Tiago C; Gesteira, Abelmon S; Lôbo, Ivon P; Guimarães, Ana Carolina R; de Miranda, Antonio B; Van Sluys, Marie-Anne; da Cruz, Rosenira S; Cascardo, Júlio CM; Carels, Nicolas
2010-01-01
We report here on the characterization of a cDNA library from seeds of Jatropha curcas L. at three stages of fruit maturation before yellowing. We sequenced a total of 2200 clones and obtained a set of 931 non-redundant sequences (unigenes) after trimming and quality control, i.e., 140 contigs and 791 singlets with PHRED quality ≥10. We found low levels of sequence redundancy and extensive metabolic coverage by homology comparison to GO. After comparison of 5841 non-redundant ESTs from a total of 13193 reads from GenBank with KEGG, we identified tags with nucleotide variations among J. curcas accessions for genes of the fatty acid, terpene, alkaloid, quinone and hormone biosynthesis pathways. More specifically, the expression levels of four genes (palmitoyl-acyl carrier protein thioesterase, 3-ketoacyl-CoA thiolase B, lysophosphatidic acid acyltransferase and geranyl pyrophosphate synthase), measured by real-time PCR, proved to be significantly different between leaves and fruits. Since the nucleotide polymorphism of these tags is associated with a higher level of gene expression in fruits compared to leaves, we propose this approach to speed up the search for quantitative traits in selective breeding of J. curcas. We also discuss its potential utility for the selective breeding of economically important traits in J. curcas. PMID:26217103
Ae Kim, Sun; Hong Park, Si; In Lee, Sang; Owens, Casey M.; Ricke, Steven C.
2017-01-01
The purpose of this study was to 1) identify microbial compositional changes on chicken carcasses during processing, 2) determine the antimicrobial efficacy of peracetic acid (PAA) and Amplon (a blend of sulfuric acid and sodium sulfate) at a poultry processing pilot plant scale, and 3) compare microbial communities between chicken carcass rinsates and bacteria recovered from media. Birds were collected from each processing step and rinsates were used to estimate aerobic plate count (APC) and Campylobacter as well as Salmonella prevalence. Microbiome sequencing was used to identify microbial population changes over processing and antimicrobial treatments. Only the PAA treatment exhibited a significant reduction of APC at the post-chilling step, while both Amplon and PAA yielded detectable Campylobacter reductions at all steps. Based on microbiome sequencing, Firmicutes were the predominant bacterial group at the phylum level, with over 50% frequency in all steps, while the relative abundance of Proteobacteria decreased as processing progressed. The overall microbiota of rinsate and APC plate microbial populations showed generally similar patterns at the phylum level but differed at the genus level. Both antimicrobials appeared to be effective in reducing problematic bacteria, and microbiome sequencing can be used to identify optimal indicator microorganisms for enhancing product quality. PMID:28230180
Lei, Yong-Liang; Wang, Xiao-Guang; Tao, Xiao-Yan; Li, Hao; Meng, Sheng-Li; Chen, Xiu-Ying; Liu, Fu-Ming; Ye, Bi-Feng; Tang, Qing
2010-01-01
Based on full-length genome sequencing of four rabies viruses from Chinese ferret-badger and dog hosts, we analyzed the genetic variation of rabies viruses at the molecular level, obtained information on rabies virus prevalence and variation in Zhejiang, and enriched the genome database of rabies virus street strains isolated in China. Rabies viruses were isolated in suckling mice, overlapping fragments were amplified by RT-PCR, and full-length genomes were assembled. Nucleotide and deduced protein similarities were determined, and phylogenetic analyses were performed against strains from Chinese ferret-badger, dog, sika deer and vole and against the vaccine strain in use. The four full-length genomes were sequenced completely and had the same genetic structure, with a length of 11,923 nt or 11,925 nt, including a 58-nt leader, 1353-nt NP, 894-nt PP, 609-nt MP, 1575-nt GP, 6386-nt LP, intergenic regions (IGRs) of 2, 5 and 5 nt, a 423-nt pseudogene-like sequence (psi) and a 70-nt trailer. BLAST and multiple sequence alignment showed that the four genomes were consistent with the properties of the genus Lyssavirus, family Rhabdoviridae. The nucleotide and amino acid sequences were most similar among Chinese strains, especially among strains from the same animal species. Among the four genomes, similarity at the amino acid level was markedly higher than at the nucleotide level, so most of the nucleotide mutations in these genomes were synonymous. Compared with the reference rabies viruses, the lengths of the five protein-coding regions were unchanged, with no recombination and only a few point mutations, indicating that the five proteins are stable. The variation sites and types in the four genomes were similar to those of the reference vaccine and street strains. According to the multiple sequence alignment and phylogenetic analyses, the four strains belonged to genotype 1 and showed distinct regional characteristics of China. Therefore, these four rabies viruses are likely to be street viruses already existing in the natural world.
Thraustochytrids as production organisms for docosahexaenoic acid (DHA), squalene, and carotenoids.
Aasen, Inga Marie; Ertesvåg, Helga; Heggeset, Tonje Marita Bjerkan; Liu, Bin; Brautaset, Trygve; Vadstein, Olav; Ellingsen, Trond E
2016-05-01
Thraustochytrids have been used for industrial production of the omega-3 fatty acid docosahexaenoic acid (DHA) since the 1990s. During more than 20 years of research on this group of marine, heterotrophic microorganisms, considerable increases in DHA productivities have been obtained by process and medium optimization. Strains of thraustochytrids also produce high levels of squalene and carotenoids, two other commercially interesting compounds with a rapidly growing market potential, but for which few studies on process optimization have yet been reported. Thraustochytrids use two pathways for fatty acid synthesis. The saturated fatty acids are produced by the standard fatty acid synthesis, while DHA is synthesized by a polyketide synthase. However, fundamental knowledge about the relationship between the two pathways is still lacking. In the present review, we extract the main findings from the large number of reports on process optimization for DHA production and interpret these in the light of the current knowledge of DHA synthesis in thraustochytrids and lipid accumulation in oleaginous microorganisms in general. We also summarize published reports on squalene and carotenoid production and review the current status of strain improvement, which has been hampered by the very few genome sequences published so far and the lack of tools for gene transfer to the organisms. As more sequences are now becoming available, targets for strain improvement can be identified, opening the way for systems-level metabolic engineering for improved productivities.
Methods and compositions for efficient nucleic acid sequencing
Drmanac, Radoje
2006-07-04
Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.
Methods and compositions for efficient nucleic acid sequencing
Drmanac, Radoje
2002-01-01
Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.
Kaplan, Oktay I; Berber, Burak; Hekim, Nezih; Doluca, Osman
2016-11-02
Many studies show that short non-coding sequences are widely conserved among regulatory elements. More and more conserved sequences are being discovered since the development of next generation sequencing technology. A common approach to identify conserved sequences with regulatory roles relies on topological changes such as hairpin formation at the DNA or RNA level. G-quadruplexes, non-canonical nucleic acid topologies with few established biological roles, are increasingly considered for conserved regulatory element discovery. Since the tertiary structure of G-quadruplexes is strongly dependent on the loop sequence, which is disregarded by the generally accepted algorithm, we hypothesized that G-quadruplexes with similar topology and, indirectly, similar interaction patterns can be determined using phylogenetic clustering based on differences in the loop sequences. Phylogenetic analysis of 52 G-quadruplex forming sequences in the Escherichia coli genome revealed two conserved G-quadruplex motifs with a potential regulatory role. Further analysis revealed that both motifs tend to form hairpins and G-quadruplexes, as supported by circular dichroism studies. The phylogenetic analysis as described in this work can greatly improve the discovery of functional G-quadruplex structures and may explain unknown regulatory patterns. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
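The loop-based idea in the abstract above can be illustrated with a minimal sketch. It assumes that the "generally accepted algorithm" refers to the widely used pattern of four runs of at least three guanines separated by loops of 1 to 7 nucleotides; the abstract does not state this, and the function name `find_g4_loops` is hypothetical. The sketch only extracts the three loop sequences of each putative motif, which is the information a loop-based clustering would start from; it does not reproduce the authors' phylogenetic method.

```python
import re

# Assumed motif (not stated in the abstract): G{3,} N{1,7} G{3,} N{1,7} G{3,} N{1,7} G{3,}.
# Loops are captured non-greedily so the shortest valid loops are reported.
G4_PATTERN = re.compile(
    r"G{3,}([ACGT]{1,7}?)G{3,}([ACGT]{1,7}?)G{3,}([ACGT]{1,7}?)G{3,}"
)


def find_g4_loops(seq):
    """Return (start, (loop1, loop2, loop3)) for each non-overlapping putative motif."""
    seq = seq.upper()
    return [(m.start(), m.groups()) for m in G4_PATTERN.finditer(seq)]


# Example call on a made-up sequence (illustrative only, not from the E. coli genome).
hits = find_g4_loops("TTGGGAGGGTCGGGATTGGGCC")
```

Only one strand is scanned here; a fuller search would also scan the reverse complement.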
Ito, Toshihiko; Konno, Mahito; Shimura, Yoichiro; Watanabe, Seiei; Takahashi, Hitoshi; Hashizume, Katsumi
2016-06-08
The formation of guaiacol, a potent phenolic off-odor compound in the Japanese sake brewing process, was investigated. Eight rice koji samples were analyzed, and one contained guaiacol and 4-vinylguaiacol (4-VG) at extraordinarily high levels: 374 and 2433 μg/kg dry mass koji, respectively. All samples contained ferulic and vanillic acids at concentrations of mg/kg dry mass koji. Guaiacol forming microorganisms were isolated from four rice koji samples. They were identified as Bacillus subtilis, B. amyloliquefaciens/subtilis, and Staphylococcus gallinarum using 16S rRNA gene sequence. These spoilage bacteria convert vanillic acid to guaiacol and ferulic acid to 4-VG. However, they convert very little ferulic acid or 4-VG to guaiacol. Nine strains of koji fungi tested produced vanillic acid at the mg/kg dry mass koji level after cultivation. These results indicated that spoilage bacteria form guaiacol from vanillic acid, which is a product of koji cultivation in the sake brewing process.
Hughes, M. S.; Hoey, E. M.; Coyle, P. V.
1993-01-01
Ten coxsackievirus B4 (CVB4) strains isolated from clinical and environmental sources in Northern Ireland in 1985-7 were compared at the nucleotide sequence level. Dideoxynucleotide sequencing of a polymerase chain reaction (PCR) amplified fragment, spanning the VP1/P2A genomic region, classified the isolates into two distinct groups or genotypes as defined by Rico-Hesse and colleagues for poliovirus type 1. Isolates within each group shared approximately 99% sequence identity at the nucleotide level whereas ≤ 86% sequence identity was shared between groups. One isolate derived from a clinical specimen in 1987 was grouped with six CVB4 isolates recovered from the aquatic environment in 1986-7. The second group comprised CVB4 isolates from clinical specimens in 1985-6. Both groups were different at the nucleotide level from the prototype strain isolated in 1950. It was concluded that the method could be used to sub-type CVB4 isolates and would be of value in epidemiological studies of CVB4. Predicted amino acid sequences revealed non-conservation of the tyrosine residue at the VP1/P2A cleavage site but were of little value in distinguishing CVB4 variants. PMID:8386098
Iterative reactions of transient boronic acids enable sequential C-C bond formation
NASA Astrophysics Data System (ADS)
Battilocchio, Claudio; Feist, Florian; Hafner, Andreas; Simon, Meike; Tran, Duc N.; Allwood, Daniel M.; Blakemore, David C.; Ley, Steven V.
2016-04-01
The ability to form multiple carbon-carbon bonds in a controlled sequence and thus rapidly build molecular complexity in an iterative fashion is an important goal in modern chemical synthesis. In recent times, transition-metal-catalysed coupling reactions have dominated the development of C-C bond forming processes. A desire to reduce the reliance on precious metals and a need to obtain products with very low levels of metal impurities have brought a renewed focus on metal-free coupling processes. Here, we report the in situ preparation of reactive allylic and benzylic boronic acids, obtained by reacting flow-generated diazo compounds with boronic acids, and describe their application in controlled iterative C-C bond forming reactions. Thus far we have shown the formation of up to three C-C bonds in a sequence, including the final trapping of a reactive boronic acid species with an aldehyde to generate a range of new chemical structures.
Hybridization and sequencing of nucleic acids using base pair mismatches
Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua
2001-01-01
Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
A re-evaluation of the final step of vanillin biosynthesis in the orchid Vanilla planifolia.
Yang, Hailian; Barros-Rios, Jaime; Kourteva, Galina; Rao, Xiaolan; Chen, Fang; Shen, Hui; Liu, Chenggang; Podstolski, Andrzej; Belanger, Faith; Havkin-Frenkel, Daphna; Dixon, Richard A
2017-07-01
A recent publication describes an enzyme from the vanilla orchid Vanilla planifolia with the ability to convert ferulic acid directly to vanillin. The authors propose that this represents the final step in the biosynthesis of vanillin, which is then converted to its storage form, glucovanillin, by glycosylation. The existence of such a "vanillin synthase" could enable biotechnological production of vanillin from ferulic acid using a "natural" vanilla enzyme. The proposed vanillin synthase exhibits high identity to cysteine proteases, and is identical at the protein sequence level to a protein identified in 2003 as being associated with the conversion of 4-coumaric acid to 4-hydroxybenzaldehyde. We here demonstrate that the recombinant cysteine protease-like protein, whether expressed in an in vitro transcription-translation system, E. coli, yeast, or plants, is unable to convert ferulic acid to vanillin. Rather, the protein is a component of an enzyme complex that preferentially converts 4-coumaric acid to 4-hydroxybenzaldehyde, as demonstrated by the purification of this complex and peptide sequencing. Furthermore, RNA sequencing provides evidence that this protein is expressed in many tissues of V. planifolia irrespective of whether or not they produce vanillin. On the basis of our results, V. planifolia does not appear to contain a cysteine protease-like "vanillin synthase" that can, by itself, directly convert ferulic acid to vanillin. The pathway to vanillin in V. planifolia is yet to be conclusively determined. Copyright © 2017 Elsevier Ltd. All rights reserved.
Human jagged polypeptide, encoding nucleic acids and methods of use
Li, Linheng; Hood, Leroy
2000-01-01
The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof
Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter
2016-02-16
The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
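Claims of the form "at least 76% sequence identity" depend on how identity is computed, and the abstract does not give the definition used. The sketch below is one common convention, offered as an assumption: identity is the number of matching residues divided by the number of aligned columns (columns where both sequences have a gap are ignored), computed on two sequences that have already been aligned. The function names and the example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity between two pre-aligned sequences ('-' marks a gap).

    Assumed convention: matches / aligned columns, skipping columns where
    both sequences have a gap. Other definitions give different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a, aligned_b, threshold):
    """True if the pair reaches the given percent-identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Made-up four-residue example: three of four columns match.
example = percent_identity("MKTA", "MKTG")
```

Because the denominator can also be defined as the length of the shorter sequence or of the reference sequence, the same pair of sequences may fall above or below a stated threshold depending on the convention chosen.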
Polypeptide having beta-glucosidase activity and uses thereof
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having swollenin activity and uses thereof
Schoonneveld-Bergmans, Margot Elizabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius
2015-11-04
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof
Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius
2015-09-01
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having cellobiohydrolase activity and uses thereof
Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter
2015-09-15
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having acetyl xylan esterase activity and uses thereof
Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter
2015-10-20
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having carbohydrate degrading activity and uses thereof
Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius
2015-08-18
The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Hashimoto, Y; Tanimoto, K; Ozawa, Y; Murata, T; Ike, Y
2000-04-15
The vancomycin-resistant enterococci GV1, GV2 and GV3, which were isolated from droppings from broiler farms in Japan, have been characterized as VanA-type VRE, which express high-level vancomycin resistance (MIC 256 or 512 μg ml⁻¹) and low-level teicoplanin resistance (MIC 1 or 2 μg ml⁻¹). The vancomycin resistances were encoded on plasmids. The vancomycin resistance conjugative plasmid pMG2 was isolated from the GV2 strain. The VanA determinant of pMG2 showed the same genetic organization as that of the VanA genes encoded on the representative transposon Tn1546, which comprises vanRSHAXYZ. The nucleotide sequences of all the genes, except the gene related to the vanS gene on Tn1546, were completely identical to the genes encoded on Tn1546. Three amino acid substitutions in the N-terminal region of the deduced VanS were detected in the nucleotide sequence of vanS encoded on pMG2. There were also three amino acid substitutions in the vanS gene of the GV1 and GV3 strains in the same positions as in the vanS gene of pMG2. Vancomycin induced increased teicoplanin resistance in these strains.
Wu, Qinglong; Shah, Nagendra P
2017-11-22
γ-Aminobutyric acid (GABA) and GABA-rich foods have shown anti-hypertensive and anti-depressant activities as the major functions in humans and animals. Hence, high GABA-producing lactic acid bacteria (LAB) could be used as functional starters for manufacturing novel fermented dairy foods. Glutamic acid decarboxylases (GADs) from LAB are highly conserved at the species level based on the phylogenetic tree of GADs from LAB. Moreover, two functionally distinct GADs and one intact gad operon were observed in all the completely sequenced Lactobacillus brevis strains suggesting its common capability to synthesize GABA. Difficulties and strategies for the manufacture of GABA-rich fermented dairy foods have been discussed and proposed, respectively. In addition, a genetic survey on the sequenced LAB strains demonstrated the absence of cell envelope proteinases in the majority of LAB including Lb. brevis, which diminishes their cell viabilities in milk environments due to their non-proteolytic nature. Thus, several strategies have been proposed to overcome the non-proteolytic nature of Lb. brevis in order to produce GABA-rich dairy foods.
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.
Code of Federal Regulations, 2010 CFR
2010-07-01
... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...
Code of Federal Regulations, 2011 CFR
2011-07-01
... from abandonment 1.135 Amino Acid Sequences. (See Nucleotide and/or Amino Acid Sequences) Appeal to... Appeals and Interference 41.47 Of rejection of an application 1.104(a) Nucleotide and/or Amino Acid...) Symbols for nucleotide and/or amino acid sequence data 1.822 T Tables in patent applications 1.58 Terminal...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.
Code of Federal Regulations, 2011 CFR
2011-07-01
... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sharrock, R.A.; Quail, P.H.
1989-01-01
Phytochrome is a plant regulatory photoreceptor that mediates red light effects on a wide variety of physiological and molecular responses. DNA blot analysis indicates that the Arabidopsis thaliana genome contains four to five phytochrome-related gene sequences. The authors have isolated and sequenced cDNA clones corresponding to three of these genes and have deduced the amino acid sequence of the full-length polypeptide encoded in each case. One of these proteins (phyA) shows 65-80% amino acid sequence identity with the major, etiolated-tissue phytochrome apoproteins described previously in other plant species. The other two polypeptides (phyB and phyC) are unique in that they have low sequence identity with each other, with phyA, and with all previously described phytochromes. The phyA, phyB, and phyC proteins are of similar molecular mass, have related hydropathic profiles, and contain a conserved chromophore attachment region. However, the sequence comparison data indicate that the three phy genes diverged early in plant evolution, well before the divergence of the two major groups of angiosperms, the monocots and dicots. The steady-state level of the phyA transcript is high in dark-grown A. thaliana seedlings and is down-regulated by light. In contrast, the phyB and phyC transcripts are present at lower levels and are not strongly light-regulated. These findings indicate that the red/far red light-responsive phytochrome photoreceptor system in A. thaliana, and perhaps in all higher plants, consists of a family of chromoproteins that are heterogeneous in structure and regulation.
Gene encoding a novel extracellular metalloprotease in Bacillus subtilis.
Sloma, A; Rudolph, C F; Rufo, G A; Sullivan, B J; Theriault, K A; Ally, D; Pero, J
1990-01-01
The gene for a novel extracellular metalloprotease was cloned, and its nucleotide sequence was determined. The gene (mpr) encodes a primary product of 313 amino acids that has little similarity to other known Bacillus proteases. The amino acid sequence of the mature protease was preceded by a signal sequence of approximately 34 amino acids and a pro sequence of 58 amino acids. Four cysteine residues were found in the deduced amino acid sequence of the mature protein, indicating the possible presence of disulfide bonds. The mpr gene mapped in the cysA-aroI region of the chromosome and was not required for growth or sporulation. PMID:2105291
Kobayashi, M; Takatori, T; Nakajima, M; Saka, K; Iwase, H; Nagao, M; Niijima, H; Matsuda, Y
1999-01-01
We examined the postmortem changes in the levels of ATP, glycogen and lactic acid in two masticatory muscles and three leg muscles of rats. The proportion of fibre types of the muscles was determined with NIH image software. The ATP levels in the white muscles did not decrease up to 1 h after death, and the ATP levels 1 and 2 h after death in the white muscles were higher than those in the red muscles with a single exception. The glycogen level at death and 1 h after death and the lactic acid level 1 h after death in masticatory muscles were lower than in the leg muscles. It is possible that the differences in the proportion of muscle fibre types and in glycogen level in muscles influence the postmortem change in ATP and lactic acid, which would accelerate or retard rigor mortis of the muscles.
Pittet, Vanessa; Phister, Trevor G.; Ziola, Barry
2013-01-01
Growth of specific lactic acid bacteria in beer leads to spoiled product and economic loss for the brewing industry. Microbial growth is typically inhibited by the combined stresses found in beer (e.g., ethanol, hops, low pH, minimal nutrients); however, certain bacteria have adapted to grow in this harsh environment. Considering little is known about the mechanisms used by bacteria to grow in and spoil beer, transcriptome sequencing was performed on a variant of the beer-spoilage organism Pediococcus claussenii ATCC BAA-344T (Pc344-358). Illumina sequencing was used to compare the transcript levels in Pc344-358 growing mid-exponentially in beer to those in nutrient-rich MRS broth. Various operons demonstrated high gene expression in beer, several of which are involved in nutrient acquisition and overcoming the inhibitory effects of hop compounds. As well, genes functioning in cell membrane modification and biosynthesis demonstrated significantly higher transcript levels in Pc344-358 growing in beer. Three plasmids had the majority of their genes showing increased transcript levels in beer, whereas the two cryptic plasmids showed slightly decreased gene expression. Follow-up analysis of plasmid copy number in both growth environments revealed similar trends, where more copies of the three non-cryptic plasmids were found in Pc344-358 growing in beer. Transcriptome sequencing also enabled the addition of several genes to the P. claussenii ATCC BAA-344T genome annotation, some of which are putatively transcribed as non-coding RNAs. The sequencing results not only provide the first transcriptome description of a beer-spoilage organism while growing in beer, but they also highlight several targets for future exploration, including genes that may have a role in the general stress response of lactic acid bacteria. PMID:24040005
Liu, X J; Jin, C; Wu, L M; Dong, S J; Zeng, S M; Li, J L
2016-07-29
Matrix proteins that are either weakly acidic or unusually highly acidic have important roles in shell biomineralization. In this study, we have identified and characterized hic22, a weakly acidic matrix protein, from the nacreous layer of Hyriopsis cumingii. Total protein was extracted from the nacre using 5 M EDTA and hic22 was purified using a DEAE-sepharose column. The N-terminal amino acid sequence of hic22 was determined and the complete cDNA encoding hic22 was cloned and sequenced by rapid amplification of cDNA ends-polymerase chain reaction. Finally, the localization and distribution of hic22 were determined by in situ hybridization. Our results revealed that hic22 encodes a 22-kDa protein composed of 185 amino acids. Tissue expression analysis and in situ hybridization indicated that hic22 is expressed in the dorsal epithelial cells of the mantle pallial; moreover, significant expression levels of hic22 were observed after the early formation of the pearl sac (days 19-77), implying that hic22 may play an important role in biomineralization of the nacreous layer.
(Mechanisms of inhibition of viral replication in plants)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Not Available
1991-01-01
During the last year we have made a number of important observations in the fields of virology and plant molecular biology. By directly sequencing Tomato Mosaic Virus (ToMV) movement genes, previously undetected sequence alterations common to specific viral strains were found. The difficulty in regenerating transgenic tomato plants containing the Tm-2 gene was overcome. Tobacco plants transformed with Cucumber Mosaic Virus (CMV) are being characterized. Analysis of transgenic tobacco plants expressing CMV coat protein has shown no correlation between coat protein expression and level of resistance. Specific amino acid changes have been found to correlate with CMV resistance breaking and degree of pathogenicity. Satellite RNAs are shown to be too unstable for use as a biological control agent. The aphid transmission domain of CMV has been localized to one (or more) of three amino acids; constructs have been made to determine the exact amino acids involved. 15 refs.
Thermophilic cellobiohydrolase
Sapra, Rajat; Park, Joshua I.; Datta, Supratim; Simmons, Blake A.
2017-04-18
The present invention provides for a composition comprising a polypeptide comprising a first amino acid sequence having at least 70% identity with the amino acid sequence of Csac GH5 wherein said first amino acid sequence has a thermostable or thermophilic cellobiohydrolase (CBH) or exoglucanase activity.
Takakuwa, Naoya; Saito, Katsuichi
2010-01-01
Eighty-one yeast isolates from raw milk were surveyed for the production of fatty acid methyl esters (FAME). Only one species, identified as Cryptococcus curvatus, produced FAME at a detectable level. Cr. curvatus TYC-19 produced more FAME from beet molasses and cheese whey medium than other strains of the same species. In both media, the major FAME produced were linoleic and oleic acid methyl esters. Sequence analysis of the internal transcribed spacer region of ribosomal DNA indicated that TYC-19 diverged from the same species.
NASA Astrophysics Data System (ADS)
Ren, Hai; Li, Jian; Li, Jitao; Liu, Ping; Liang, Zhongxiu; Wu, Jianhua
2015-05-01
Superoxide dismutase (SOD) is one of the most important antioxidant defense enzymes, and is considered the first line of defense against oxidative stress. In this study, we cloned a mitochondrial manganese (Mn) SOD (mMnSOD) cDNA from the ridgetail white prawn Exopalaemon carinicauda by using rapid amplification of cDNA ends (RACE) methods. The full-length cDNA for mMnSOD was 1,014-bp long, containing a 5'-untranslated region (UTR) of 37-bp, a 3'-UTR of 321-bp with a poly (A) tail, and included a 657-bp open reading frame encoding a protein of 218 amino acids with a 16-amino-acid signal peptide. The protein had a calculated molecular weight of 23.87 kDa and a theoretical isoelectric point of 6.75. The mMnSOD sequence included two putative N-glycosylation sites (NHT and NLS), the MnSOD signature sequence 180DVWEHAYY187, and four putative Mn binding sites (H48, H96, D180, and H184). Sequence comparison showed that the mMnSOD deduced amino acid sequence of E. carinicauda shared 97%, 95%, 89%, 84%, 82%, 72%, and 69% identity with that of Macrobrachium rosenbergii, Macrobrachium nipponense, Fenneropenaeus chinensis, Callinectes sapidus, Perisesarma bidens, Danio rerio, and Homo sapiens, respectively. Quantitative real-time RT-PCR analysis showed that mMnSOD transcripts were present in all E. carinicauda tissues examined, with the highest levels in the hepatopancreas. During an ammonia stress treatment, the transcript levels of mMnSOD and cMnSOD were up-regulated at 12 h in hemocytes and at 24 h in the hepatopancreas. As the duration of the ammonia stress treatment extended to 72 h, the transcript levels of mMnSOD and cMnSOD significantly decreased both in hemocytes and hepatopancreas. These findings indicate that the SOD system is induced to respond to acute ammonia stress, and may be involved in environmental stress responses in E. carinicauda.
Takeuchi, Y; Yoshikawa, M; Takeba, G; Tanaka, K; Shibata, D; Horino, O
1990-06-01
Soybean (Glycine max) beta-1,3-endoglucanase (EC 3.2.1.39) is involved in one of the earliest plant-pathogen interactions that may lead to active disease resistance by releasing elicitor-active carbohydrates from the cell walls of fungal pathogens. Ethylene induced beta-1,3-endoglucanase activity to 2- to 3-fold higher levels in cotyledons of soybean seedlings. A specific polyclonal antiserum raised against purified soybean beta-1,3-endoglucanase was used to immunoprecipitate in vitro translation products, demonstrating that ethylene induction increased translatable beta-1,3-endoglucanase mRNA. Several cDNA clones for the endoglucanase gene were obtained by antibody screening of a lambda-gt11 expression library prepared from soybean cotyledons. Hybrid-select translation experiments indicated that the cloned cDNA encoded a 36-kilodalton precursor protein product that was specifically immunoprecipitated with beta-1,3-endoglucanase antiserum. Escherichia coli cells expressing the cloned cDNA also synthesized an immunologically positive protein. Nucleotide sequence of three independent clones revealed a single uninterrupted open reading frame of 1041 nucleotides, corresponding to a polypeptide 347 residues long. The primary amino acid sequence of beta-1,3-endoglucanase as deduced from the nucleotide sequence was confirmed by direct amino acid sequencing of trypsin digests of the glucanase. The soybean beta-1,3-endoglucanase exhibited 53% amino acid homology to a beta-1,3-glucanase cloned from cultured tobacco cells and 48% homology to a beta-(1,3-1,4)-glucanase from barley. Utilizing the largest cloned cDNA (pEG488) as a hybridization probe, it was found that the increase in translatable beta-1,3-endoglucanase mRNA seen upon ethylene treatment of soybean seedlings was due to a 50- to 100-fold increase in steady state mRNA levels, indicating that ethylene regulates gene expression of this enzyme important in disease resistance at the level of gene transcription.
Computer-aided visualization and analysis system for sequence evaluation
Chee, M.S.
1998-08-18
A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device. 27 figs.
Computer-aided visualization and analysis system for sequence evaluation
Chee, Mark S.; Wang, Chunwei; Jevons, Luis C.; Bernhart, Derek H.; Lipshutz, Robert J.
2004-05-11
A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation
Chee, Mark S.
1998-08-18
A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation
Chee, Mark S.
2003-08-19
A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yiao, Jian
2014-03-18
The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6 (SEQ ID NO:1 encodes the full length endoglucanase; SEQ ID NO:4 encodes the mature form), and the corresponding endoglucanase VI amino acid sequence ("EGVI"; SEQ ID NO:3 is the signal sequence; SEQ ID NO:2 is the mature sequence). The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
Sequence heuristics to encode phase behaviour in intrinsically disordered protein polymers
Quiroz, Felipe García; Chilkoti, Ashutosh
2015-01-01
Proteins and synthetic polymers that undergo aqueous phase transitions mediate self-assembly in nature and in man-made material systems. Yet little is known about how the phase behaviour of a protein is encoded in its amino acid sequence. Here, by synthesizing intrinsically disordered, repeat proteins to test motifs that we hypothesized would encode phase behaviour, we show that the proteins can be designed to exhibit tunable lower or upper critical solution temperature (LCST and UCST, respectively) transitions in physiological solutions. We also show that mutation of key residues at the repeat level abolishes phase behaviour or encodes an orthogonal transition. Furthermore, we provide heuristics to identify, at the proteome level, proteins that might exhibit phase behaviour and to design novel protein polymers consisting of biologically active peptide repeats that exhibit LCST or UCST transitions. These findings set the foundation for the prediction and encoding of phase behaviour at the sequence level. PMID:26390327
Novel beta-lactamase genes from two environmental isolates of Vibrio harveyi.
Teo, J W; Suwanto, A; Poh, C L
2000-05-01
Two ampicillin-resistant (Amp(r)) isolates of Vibrio harveyi, W3B and HB3, were obtained from the coastal waters of the Indonesian island of Java. Strain W3B was isolated from marine water near a shrimp farm in North Java while HB3 was from pristine seawater in South Java. In this study, novel beta-lactamase genes from W3B (bla(VHW-1)) and HB3 (bla(VHH-1)) were cloned and their nucleotide sequences were determined. An open reading frame (ORF) of 870 bp encoding a deduced protein of 290 amino acids (VHW-1) was revealed for the bla gene of strain W3B while an ORF of 849 bp encoding a 283-amino-acid protein (VHH-1) was deduced for bla(VHH-1). At the DNA level, genes for VHW-1 and VHH-1 have a 97% homology, while at the protein level they have a 91% homology of amino acid sequences. Neither gene sequence showed homology to any other beta-lactamases in the databases. The deduced proteins were found to be class A beta-lactamases bearing low levels of homology (<50%) to other beta-lactamases of the same class. The highest level of identity was obtained with beta-lactamases from Pseudomonas aeruginosa, i.e., PSE-1, PSE-4, and CARB-3, and Vibrio cholerae CARB-6. Our study showed that both strains W3B and HB3 possess an endogenous plasmid of approximately 60 kb in size. However, Southern hybridization analysis employing bla(VHW-1) as a gene probe demonstrated that the bla gene was not located in the plasmid. A total of nine ampicillin-resistant V. harveyi strains, including W3B and HB3, were examined by pulsed-field gel electrophoresis of NotI-digested genomic DNA. Despite a high level of intrastrain genetic diversity, the bla(VHW-1) probe hybridized only to an 80- or 160-kb NotI genomic fragment in different isolates.
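The ORF and protein lengths quoted in the abstract above can be cross-checked with simple codon arithmetic. The following is a minimal sketch, not part of the original study; the helper name `codons_in_orf` is hypothetical, and the abstract does not say whether the stop codon is counted in the reported ORF lengths.

```python
def codons_in_orf(orf_length_bp: int) -> int:
    """Return the number of codons in an ORF of the given length.

    Hypothetical helper for illustration: an ORF read in a single
    frame must be a whole number of 3-nucleotide codons.
    """
    if orf_length_bp <= 0 or orf_length_bp % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    return orf_length_bp // 3


# Figures quoted in the abstract: (protein name, ORF length in bp, residues)
reported = [("VHW-1", 870, 290), ("VHH-1", 849, 283)]

for name, orf_bp, residues in reported:
    codons = codons_in_orf(orf_bp)
    status = "consistent" if codons == residues else "differs"
    print(f"{name}: {orf_bp} bp = {codons} codons; reported {residues} aa ({status})")
```

Both reported pairs divide exactly, so the quoted lengths are internally consistent under this simple reading.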
Labeled nucleotide phosphate (NP) probes
Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY
2009-02-03
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
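The deduction step described above (each incorporated nucleotide analog is complementary to the template base at the active site) can be illustrated with a short sketch. This is only a toy model under stated assumptions: the function name `template_from_incorporations` is hypothetical, base identification is assumed to be error-free, and only the four standard DNA bases are handled. It is not the patented method itself.

```python
# Watson-Crick pairing for the four standard DNA bases.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def template_from_incorporations(incorporated: str) -> str:
    """Deduce the template sequence (written 5'->3') from the bases
    added to the growing strand, listed in temporal order.

    The growing strand is synthesized 5'->3' and is antiparallel to the
    template, so the template read 5'->3' is the reverse complement of
    the incorporation order.
    """
    try:
        return "".join(COMPLEMENT[base] for base in reversed(incorporated.upper()))
    except KeyError as exc:
        raise ValueError(f"unexpected base: {exc.args[0]!r}") from None


# Hypothetical read-out: bases identified one by one during polymerization.
observed = "ATGC"
print(template_from_incorporations(observed))
```

Real-time data would also carry timing and signal-quality information per incorporation event; the sketch keeps only the order of identified bases, which is what the sequence deduction depends on.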
An atypical topoisomerase II sequence from the slime mold Physarum polycephalum.
Hugodot, Yannick; Dutertre, Murielle; Duguet, Michel
2004-01-21
We have determined the complete nucleotide sequence of the cDNA encoding DNA topoisomerase II from Physarum polycephalum. Using degenerate primers, based on the conserved amino acid sequences of other eukaryotic enzymes, a 250-bp fragment was polymerase chain reaction (PCR) amplified. This fragment was used as a probe to screen a Physarum cDNA library. A partial cDNA clone was isolated that was truncated at the 3' end. Rapid amplification of cDNA ends (RACE)-PCR was employed to isolate the remaining portion of the gene. The complete sequence of 4613 bp contains an open reading frame of 4494 bp that codes for 1498 amino acid residues with a theoretical molecular weight of 167 kDa. The predicted amino acid sequence shares similarity with those of other eukaryotes and shows the highest degree of identity with the enzyme of Dictyostelium discoideum. However, the enzyme of P. polycephalum contains an atypical amino-terminal domain very rich in serine and proline, whose function is unknown. Remarkably, both a mitochondrial targeting sequence and a nuclear localization signal were predicted respectively in the amino and carboxy-terminus of the protein, as in the case of human topoisomerase III alpha. At the Physarum genomic level, the topoisomerase II gene encompasses a region of about 16 kbp suggesting a large proportion of intronic sequences, an unusual situation for a gene of a lower eukaryote, often free of introns. Finally, expression of topoisomerase II mRNA does not appear significantly dependent on the plasmodium cycle stage, possibly due to the lack of G1 phase or (and) to a mitochondrial localization of the enzyme.
Amino Acid Properties Conserved in Molecular Evolution
Rudnicki, Witold R.; Mroczek, Teresa; Cudek, Paweł
2014-01-01
That amino acid properties are responsible for the way protein molecules evolve is natural and is also reasonably well supported both by the structure of the genetic code and, to a large extent, by the experimental measures of the amino acid similarity. Nevertheless, there remains a significant gap between observed similarity matrices and their reconstructions from amino acid properties. Therefore, we introduce a simple theoretical model of amino acid similarity matrices, which allows splitting the matrix into two parts – one that depends only on mutabilities of amino acids and another that depends on pairwise similarities between them. Then the new synthetic amino acid properties are derived from the pairwise similarities and used to reconstruct similarity matrices covering a wide range of information entropies. Our model allows us to explain up to 94% of the variability in the BLOSUM family of the amino acids similarity matrices in terms of amino acid properties. The new properties derived from amino acid similarity matrices correlate highly with properties known to be important for molecular evolution such as hydrophobicity, size, shape and charge of amino acids. This result closes the gap in our understanding of the influence of amino acids on evolution at the molecular level. The method was applied to a single family of similarity matrices often used in general sequence homology searches, but it is general and can also be used for more specific matrices. The new synthetic properties can be used in analyses of protein sequences in various biological applications. PMID:24967708
Yasuno, Rie; Wada, Hajime
1998-01-01
Lipoic acid is a coenzyme that is essential for the activity of enzyme complexes such as those of pyruvate dehydrogenase and glycine decarboxylase. We report here the isolation and characterization of LIP1 cDNA for lipoic acid synthase of Arabidopsis. The Arabidopsis LIP1 cDNA was isolated using an expressed sequence tag homologous to the lipoic acid synthase of Escherichia coli. This cDNA was shown to code for Arabidopsis lipoic acid synthase by its ability to complement a lipA mutant of E. coli defective in lipoic acid synthase. DNA-sequence analysis of the LIP1 cDNA revealed an open reading frame predicting a protein of 374 amino acids. Comparisons of the deduced amino acid sequence with those of E. coli and yeast lipoic acid synthase homologs showed a high degree of sequence similarity and the presence of a leader sequence presumably required for import into the mitochondria. Southern-hybridization analysis suggested that LIP1 is a single-copy gene in Arabidopsis. Western analysis with an antibody against lipoic acid synthase demonstrated that this enzyme is located in the mitochondrial compartment in Arabidopsis cells as a 43-kD polypeptide. PMID:9808738
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dussossoy, D.; Carayon, P.; Feraut, D.
1996-05-01
Based on the amino acid sequence deduced from the cloned human peripheral benzodiazepine receptor (PBR) gene, monoclonal antibody (Mab 8D7) was produced against the C-terminal fragment of the receptor. Immunoblot experiments, performed against purified PBR, indicated that the antipeptide antibody recognized, under denaturing conditions, the corresponding amino acid sequence of the PBR. When mitochondrial membranes from PBR-transfected yeast or from THP1 and U937 cells were used on immunoblot analysis, a high level of immunoreactivity was observed at 18 kDa, the PBR molecular mass deduced from cDNA, establishing the specificity of the antibody for the receptor. Moreover, binding experiments realized with intact mitochondria demonstrated that the immunogenic sequence was accessible to the antibody, indicating that the C-terminal fragment of the PBR faces the cytosol. Using this Mab we developed a technique which allowed precise quantification of PBR density per cell. Furthermore, cellular localization studies by flow cytometric analysis and confocal microscopy on cell lines displaying different levels of PBR showed that Mab 8D7 was entirely colocalized with an antimitochondria Mab. 34 refs., 7 figs.
Microbial ecological succession during municipal solid waste decomposition.
Staley, Bryan F; de Los Reyes, Francis L; Wang, Ling; Barlaz, Morton A
2018-04-28
The decomposition of landfilled refuse proceeds through distinct phases, each defined by varying environmental factors such as volatile fatty acid concentration, pH, and substrate quality. The succession of microbial communities in response to these changing conditions was monitored in a laboratory-scale simulated landfill to minimize measurement difficulties experienced at field scale. 16S rRNA gene sequences retrieved at separate stages of decomposition showed significant succession in both Bacteria and methanogenic Archaea. A majority of Bacteria sequences in landfilled refuse belong to members of the phylum Firmicutes, while Proteobacteria levels fluctuated and Bacteroidetes levels increased as decomposition proceeded. Roughly 44% of archaeal sequences retrieved under conditions of low pH and high acetate were strictly hydrogenotrophic (Methanomicrobiales, Methanobacteriales). Methanosarcina was present at all stages of decomposition. Correspondence analysis showed bacterial population shifts were attributed to carboxylic acid concentration and solids hydrolysis, while archaeal populations were affected to a higher degree by pH. T-RFLP analysis showed specific taxonomic groups responded differently and exhibited unique responses during decomposition, suggesting that species composition and abundance within Bacteria and Archaea are highly dynamic. This study shows landfill microbial demographics are highly variable across both spatial and temporal transects.
Iimura, Yosuke; Tatsumi, Kenji
2002-07-01
We isolated and analysed two genomic DNAs that encode the heat-shock protein Hsp30 from Coriolus versicolor. The deduced amino acid sequences differ by only three amino acid substitutions. The promoter regions contain the consensus heat-shock element, a xenobiotic-response element, a stress-response element, and a metal-response element. The levels of mRNAs for Hsp30 increased markedly after exposure of C. versicolor to pentachlorophenol and levels were higher than those after heat shock.
Li, Guang-Qi; Zang, Xiao-Nan; Zhang, Xue-Cheng; Lu, Ning; Ding, Yan; Gong, Le; Chen, Wen-Chao
2014-03-15
To study the response of Gracilaria lemaneiformis to heat stress, two key enzymes - ubiquitin-activating enzyme (E1) and ubiquitin-conjugating enzyme (E2) - of the Ubiquitin/26S proteasome pathway (UPP) were studied in three strains of G. lemaneiformis: wild type, heat-tolerant cultivar 981 and heat-tolerant cultivar 07-2. The full length DNA sequence of E1 contained only one exon. The open reading frame (ORF) sequence was 981 nucleotides encoding 326 amino acids, which contained conserved ATP binding sites (LYDRQIRLWGLE, ELAKNVLLAGV, LKEMN, VVCAI) and the ubiquitin-activating domains (VVCAI…LMTEAC, VFLDLGDEYSYQ, AIVGGMWGRE). The gene sequence of E2 contained four exons and three introns. The sum of the four exons gave an open reading frame sequence of 444 nucleotides encoding 147 amino acids, which contained a conserved ubiquitin-activating domain (GSICLDIL), ubiquitin-conjugating domains (RIYHPNIN, KVLLSICSLL, DDPLV) and ubiquitin-ligase (E3) recognition sites (KRI, YPF, WSP). Real-time-PCR analysis of transcription levels of E1 and E2 under heat shock conditions (28°C and 32°C) showed that in wild type, transcription of E1 and E2 was up-regulated at 28°C, while at 32°C, transcription of the two enzymes was below the normal level. In cultivar 981 and cultivar 07-2 of G. lemaneiformis, the transcription levels of the two enzymes were up-regulated at 32°C, and the transcription level of cultivar 07-2 was even higher than that of cultivar 981. These results suggest that the UPP plays an important role in high temperature resistance of G. lemaneiformis and the bioactivity of UPP is directly related to the heat-resistant ability of G. lemaneiformis.
Molecular characterization of KGH, the first human isolate of rabies virus in Korea.
Park, Jun-Sun; Kim, Chi-Kyeong; Kim, Su Yeon; Ju, Young Ran
2013-04-01
The complete genome sequence was determined for the KGH strain, the first human rabies virus isolate in Korea, which was obtained from a skin biopsy of a patient with rabies whose symptoms developed after bites from a raccoon dog in 2001. The size of the KGH strain genome was determined to be 11,928 nucleotides (nt) with a leader sequence of 58 nt, nucleoprotein gene of 1,353 nt, phosphoprotein gene of 894 nt, matrix protein gene of 609 nt, glycoprotein gene of 1,575 nt, RNA-dependent RNA polymerase gene of 6,384 nt, and trailer region of 69 nt. Sequence similarity was compared with 39 fully sequenced rabies virus genomes currently available, and the result showed 70.6-91.6 % at the nucleotide level, and 82.8-97.9 % at the amino acid level. The deduced amino acids in the viral protein were compared with those of other rabies viruses, and various functional regions were investigated. As a result, we found that the KGH strain only had a unique amino acid substitution that was identified to be associated either with host immune response and pathogenicity in the N protein, or with a related region regulating STAT1 in the P protein, and related to pathogenicity in G protein. Based on phylogenetic analyses using the complete genome of 39 rabies viruses, the KGH strain was determined to be closely related to the NNV-RAB-H strain and transplant rabies virus serotype 1, which are Indian isolates, and was confirmed to belong to the Arctic-like 2 clade. The KGH strain was most closely related to the SKRRD0204HC and SKRRD0205HC strains when compared with Korean animal isolates, which were isolated around the same time and place, and belonged to the Gangwon III subgroup.
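The region lengths itemised in this abstract do not add up to the full genome length. A short Python sketch makes the bookkeeping explicit. The dictionary keys are our own labels, and reading the remainder as intergenic or untranslated sequence is an assumption rather than something the abstract states.

```python
# Region lengths (nt) as quoted in the abstract; the key names are illustrative.
REGIONS_NT = {
    "leader": 58,
    "nucleoprotein": 1353,
    "phosphoprotein": 894,
    "matrix": 609,
    "glycoprotein": 1575,
    "polymerase": 6384,
    "trailer": 69,
}
GENOME_NT = 11928

itemised = sum(REGIONS_NT.values())
# Assumption: the remainder is sequence not itemised in the abstract,
# for example intergenic and untranslated regions.
remainder = GENOME_NT - itemised

if __name__ == "__main__":
    print(f"itemised regions: {itemised} nt")
    print(f"not itemised:     {remainder} nt")
```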
Steinberg, Lisa M; Regan, John M
2008-11-01
Methanogens play a critical role in the decomposition of organics under anaerobic conditions. The methanogenic consortia in saturated wetland soils are often subjected to large temperature fluctuations and acidic conditions, imposing a selective pressure for psychro- and acidotolerant community members; however, methanogenic communities in engineered digesters are frequently maintained within a narrow range of mesophilic and circumneutral conditions to retain system stability. To investigate the hypothesis that these two disparate environments have distinct methanogenic communities, the methanogens in an oligotrophic acidic fen and a mesophilic anaerobic digester treating municipal wastewater sludge were characterized by creating clone libraries for the 16S rRNA and methyl coenzyme M reductase alpha subunit (mcrA) genes. A quantitative framework was developed to assess the differences between these two communities by calculating the average sequence similarity for 16S rRNA genes and mcrA within a genus and family using sequences of isolated and characterized methanogens within the approved methanogen taxonomy. The average sequence similarities for 16S rRNA genes within a genus and family were 96.0 and 93.5%, respectively, and the average sequence similarities for mcrA within a genus and family were 88.9 and 79%, respectively. The clone libraries of the bog and digester environments showed no overlap at the species level and almost no overlap at the family level. Both libraries were dominated by clones related to uncultured methanogen groups within the Methanomicrobiales, although members of the Methanosarcinales and Methanobacteriales were also found in both libraries. Diversity indices for the 16S rRNA gene library of the bog and both mcrA libraries were similar, but these indices indicated much lower diversity in the 16S digester library than in the other three libraries.
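The quantitative framework described here rests on averaging pairwise sequence similarity within a taxonomic group. The sketch below shows one way such an average could be computed in Python. It assumes the sequences are already aligned to equal length and uses simple percent identity; the abstract does not specify the authors' similarity measure, and the example sequences are made up.

```python
from itertools import combinations


def percent_identity(a: str, b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = matches = 0
    for x, y in zip(a, b):
        if x == "-" and y == "-":
            continue  # ignore columns that are gaps in both sequences
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def mean_within_group(groups: dict) -> dict:
    """Average pairwise identity for each group (e.g. a genus or family)."""
    result = {}
    for name, seqs in groups.items():
        pairs = list(combinations(seqs, 2))
        if pairs:  # a group with a single sequence has no pairs
            result[name] = sum(percent_identity(a, b) for a, b in pairs) / len(pairs)
    return result


if __name__ == "__main__":
    # Hypothetical aligned fragments, for illustration only.
    toy = {"genusA": ["ACGTACGT", "ACGTACGA", "ACGTTCGA"]}
    print(mean_within_group(toy))
```

Thresholds such as the reported 96.0% (genus) and 93.5% (family) for 16S rRNA genes could then be compared against clone-to-reference similarities computed the same way.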
Böer, Erik; Bode, Rüdiger; Mock, Hans-Peter; Piontek, Michael; Kunze, Gotthard
2009-06-01
The tannase-encoding Arxula adeninivorans gene ATAN1 was isolated from genomic DNA by PCR, using as primers oligonucleotide sequences derived from peptides obtained after tryptic digestion of the purified tannase protein. The gene harbours an ORF of 1764 bp, encoding a 587-amino acid protein, preceded by an N-terminal secretion sequence comprising 28 residues. The deduced amino acid sequence was similar to those of tannases from Aspergillus oryzae (50% identity), A. niger (48%) and putative tannases from A. fumigatus (52%) and A. nidulans (50%). The sequence contains the consensus pentapeptide motif (-Gly-X-Ser-X-Gly-) which forms part of the catalytic centre of serine hydrolases. Expression of ATAN1 is regulated by the carbon source. Supplementation with tannic acid or gallic acid leads to induction of ATAN1, and accumulation of the native tannase enzyme in the medium. The enzymes recovered from both wild-type and recombinant strains were essentially indistinguishable. A molecular mass of approximately 320 kDa was determined, indicating that the native, glycosylated tannase consists of four identical subunits. The enzyme has a temperature optimum at 35-40 degrees C and a pH optimum at approximately 6.0. The enzyme is able to remove gallic acid from both condensed and hydrolysable tannins. The wild-type strain LS3 secreted amounts of tannase equivalent to 100 U/l under inducing conditions, while the transformant strain, which overexpresses the ATAN1 gene from the strong, constitutively active A. adeninivorans TEF1 promoter, produced levels of up to 400 U/l when grown in glucose medium in shake flasks. Copyright (c) 2009 John Wiley & Sons, Ltd.
Trichoderma β-glucosidase
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2006-01-03
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
Computer-aided visualization and analysis system for sequence evaluation
Chee, Mark S.
1999-10-26
A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Computer-aided visualization and analysis system for sequence evaluation
Chee, Mark S.
2001-06-05
A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Carbohydrate degrading polypeptide and uses thereof
Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter
2015-10-20
The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Westin, Maria A K; Alexson, Stefan E H; Hunt, Mary C
2004-05-21
Peroxisomes are organelles that function in the beta-oxidation of long- and very long-chain acyl-CoAs, bile acid-CoA intermediates, prostaglandins, leukotrienes, thromboxanes, dicarboxylic fatty acids, pristanic acid, and xenobiotic carboxylic acids. The very long- and long-chain acyl-CoAs are mainly chain-shortened and then transported to mitochondria for further metabolism. We have now identified and characterized two peroxisomal acyl-CoA thioesterases, named PTE-Ia and PTE-Ic, that hydrolyze acyl-CoAs to the free fatty acid and coenzyme A. PTE-Ia and PTE-Ic show 82% sequence identity at the amino acid level, and a putative peroxisomal type 1 targeting signal of -AKL was identified at the carboxyl-terminal end of both proteins. Localization experiments using green fluorescent fusion protein showed PTE-Ia and PTE-Ic to be localized in peroxisomes. Despite their high level of sequence identity, we show that PTE-Ia is mainly active on long-chain acyl-CoAs, whereas PTE-Ic is mainly active on medium-chain acyl-CoAs. Lack of regulation of enzyme activity by free CoASH suggests that PTE-Ia and PTE-Ic regulate intraperoxisomal levels of acyl-CoA, and they may have a function in termination of beta-oxidation of fatty acids of different chain lengths. Tissue expression studies revealed that PTE-Ia is highly expressed in kidney, whereas PTE-Ic is most highly expressed in spleen, brain, testis, and proximal and distal intestine. Both PTE-Ia and PTE-Ic were highly up-regulated in mouse liver by treatment with the peroxisome proliferator WY-14,643 and by fasting in a peroxisome proliferator-activated receptor alpha-dependent manner. These data show that PTE-Ia and PTE-Ic have different functions based on different substrate specificities and tissue expression.
Yafremava, Liudmila S; Di Giulio, Massimo; Caetano-Anollés, Gustavo
2013-01-01
Amino acid substitution patterns between the nonbarophilic Pyrococcus furiosus and its barophilic relative P. abyssi confirm that hydrostatic pressure asymmetry indices reflect the extent to which amino acids are preferred by barophilic archaeal organisms. Substitution patterns in entire protein sequences, shared protein domains defined at fold superfamily level, domains in homologous sequence pairs, and domains of very ancient and very recent origin now provide further clues about the environment that led to the genetic code and diversified life. The pyrococcal proteomes are very similar and share a very early ancestor. Relative amino acid abundance analyses showed that biases in the use of amino acids are due to their shared fold superfamilies. Within these repertoires, only two of the five amino acids that are preferentially barophilic, aspartic acid and arginine, displayed this preference significantly and consistently across structure and in domains appearing in the ancestor. The more primordial asparagine, lysine and threonine displayed a consistent preference for nonbarophily across structure and in the ancestor. Since barophilic preferences are already evident in ancient domains that are at least ~3 billion year old, we conclude that barophily is a very ancient trait that unfolded concurrently with genetic idiosyncrasies in convergence towards a universal code.
Assier, E; Bouzinba-Segard, H; Stolzenberg, M C; Stephens, R; Bardos, J; Freemont, P; Charron, D; Trowsdale, J; Rich, T
1999-04-16
A novel human gene RED, and the murine homologue, MuRED, were cloned. These genes were named after the extensive stretch of alternating arginine (R) and glutamic acid (E) or aspartic acid (D) residues that they contain. We term this the 'RED' repeat. The genes of both species were expressed in a wide range of tissues and we have mapped the human gene to chromosome 5q22-24. MuRED and RED shared 98% sequence identity at the amino acid level. The open reading frame of both genes encodes a 557 amino acid protein. RED fused to a fluorescent tag was expressed in nuclei of transfected cells and localised to nuclear dots. Co-localisation studies showed that these nuclear dots did not contain either PML or Coilin, which are commonly found in the POD or coiled body nuclear compartments. Deletion of the amino terminal 265 amino acids resulted in a failure to sort efficiently to the nucleus, though nuclear dots were formed. Deletion of a further 50 amino acids from the amino terminus generates a protein that can sort to the nucleus but is unable to generate nuclear dots. Neither construct localised to the nucleolus. The characteristics of RED and its nuclear localisation implicate it as a regulatory protein, possibly involved in transcription.
Matsuno, Michiyo; Nagatsu, Akito; Ogihara, Yukio; Ellis, Brian E; Mizukami, Hajime
2002-03-13
Rosmarinic acid is the dominant hydroxycinnamic acid ester accumulated in Boraginaceae and Lamiaceae plants. A cytochrome P450 cDNA was isolated by differential display from cultured cells of Lithospermum erythrorhizon, and the gene product was designated CYP98A6 based on the deduced amino acid sequence. After expression in yeast, the P450 was shown to catalyze the 3-hydroxylation of 4-coumaroyl-4'-hydroxyphenyllactic acid, one of the final two steps leading to rosmarinic acid. The expression level of CYP98A6 is dramatically increased by addition of yeast extract or methyl jasmonate to L. erythrorhizon cells, and its expression pattern reflected the elicitor-induced change in rosmarinic acid production, indicating that CYP98A6 plays an important role in regulation of rosmarinic acid biosynthesis.
Pseudomonas kribbensis sp. nov., isolated from garden soils in Daejeon, Korea.
Chang, Dong-Ho; Rhee, Moon-Soo; Kim, Ji-Sun; Lee, Yookyung; Park, Mi Young; Kim, Haseong; Lee, Seung-Goo; Kim, Byoung-Chan
2016-11-01
Two bacterial strains, 46-1 and 46-2 T , were isolated from garden soil. These strains were observed to be aerobic, Gram-stain negative, rod-shaped, non-spore-forming, motile and catalase and oxidase positive. Phylogenetic analysis based on 16S rRNA gene sequences showed that the two strains shared 100 % sequence similarity with each other and belong to the genus Pseudomonas in the class Gammaproteobacteria. The concatenated 16S rRNA, gyrB, rpoB and rpoD gene sequences further confirmed that the isolates belong to the Pseudomonas koreensis subgroup (SG), with P. koreensis Ps 9-14 T , Pseudomonas moraviensis 1B4 T and Pseudomonas granadensis F-278,770 T as their close relatives (>96 % pairwise similarity). DNA-DNA hybridization with the closely related type strain P. koreensis SG revealed a low level of relatedness (<50 %). A cladogram constructed using whole-cell matrix-assisted laser desorption/ionization time-of-flight (WC-MALDI-TOF) MS analysis showed the isolates formed a completely separate monophyletic group. The isolates were negative for utilization of glycogen, D-psicose, α-keto butyric acid, α-keto valeric acid, succinamic acid and D, L-α-glycerol phosphate. In contrast, all these reactions were positive in P. koreensis JCM 14769 T and P. moraviensis DSM 16007 T . The fatty acid C 17:0 cyclo was detected as one of the major cellular fatty acids (>15 %) in the isolates but it was a minor component (<4 %) in both reference type strains. In contrast, the fatty acid, C 12:0 was not observed in the isolates but was present in both reference strains. Based on differences such as phylogenetic position, low-level DNA-DNA hybridization, WC-MALDI-TOF MS analysis, fluorescence pigmentation, fatty acid profiles, and substrate utilization, we propose that the isolates 46-1 and 46-2 T represent a novel species of the genus Pseudomonas, for which the name Pseudomonas kribbensis sp. nov. is proposed; the type strain is 46-2 T (=KCTC 32541 T = DSM 100278 T ).
Cloning and characterization of an abalone (Haliotis discus hannai) actin gene
NASA Astrophysics Data System (ADS)
Ma, Hongming; Xu, Wei; Mai, Kangsen; Liufu, Zhiguo; Chen, Hong
2004-10-01
An actin-encoding gene was cloned from the abalone Haliotis discus hannai by using RT-PCR, 3′ RACE and 5′ RACE. The full length of the gene is 1532 base pairs, which contains a long 3′ untranslated region of 307 base pairs and 79 base pairs of 5′ untranslated sequence. The open reading frame encodes 376 amino acid residues. Sequence comparison with those of human and other mollusks showed high conservation among species at the amino acid level. The identities were 96%, 97% and 96%, respectively, compared with Aplysia californica, Biomphalaria glabrata and Homo sapiens β-actin. It is also indicated that this actin is more similar to the human cytoplasmic actin (β-actin) than to human muscle actin.
Singh, Aditya; Bhatia, Prateek
2016-12-01
Sanger sequencing platforms, such as Applied Biosystems instruments, generate chromatogram files. Generally, a given region is sequenced with both forward and reverse primers, yielding 2 sequences that need to be aligned and merged into a consensus before mutation detection studies. This work is cumbersome and takes time, especially if the gene is large with many exons. Hence, we devised a rapid automated command system to filter, build, and align consensus sequences and also optionally extract exonic regions, translate them in all frames, and perform an amino acid alignment starting from raw sequence data within a very short time. At full capability, the Automated Mutation Analysis Pipeline (ASAP) is able to read "*.ab1" chromatogram files through a command line interface, convert them to the FASTQ format, trim the low-quality regions, reverse-complement the reverse sequence, create a consensus sequence, extract the exonic regions using a reference exonic sequence, translate the sequence in all frames, and align the nucleic acid and amino acid sequences to reference nucleic acid and amino acid sequences, respectively. All files are created and can be used for further analysis. ASAP is available as a Python 3.x executable at https://github.com/aditya-88/ASAP. The version described in this paper is 0.28.
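Two of the pipeline steps named above, reverse-complementing the reverse read and translating in all frames, can be illustrated with a short standalone Python sketch. This is not ASAP's code. It assumes the standard genetic code, and the function names are our own.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse-complement a DNA sequence (A<->T, C<->G)."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq: str) -> str:
    """Translate complete codons from position 0; unknown codons become 'X'."""
    seq = seq.upper()
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3))


def six_frame(seq: str) -> dict:
    """Translate the three forward and three reverse-strand reading frames."""
    rc = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(seq[offset:])
        frames[f"-{offset + 1}"] = translate(rc[offset:])
    return frames


if __name__ == "__main__":
    for name, protein in six_frame("ATGGCCTAA").items():
        print(name, protein)
```

Stop codons are rendered as `*`. A real pipeline would also need quality trimming and consensus building, which are omitted here.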
Nucleic acid analysis using terminal-phosphate-labeled nucleotides
Korlach, Jonas [Ithaca, NY]; Webb, Watt W [Ithaca, NY]; Levene, Michael [Ithaca, NY]; Turner, Stephen [Ithaca, NY]; Craighead, Harold G [Ithaca, NY]; Foquet, Mathieu [Ithaca, NY]
2008-04-22
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Investigation of mRNA quadruplex formation in Escherichia coli.
Wieland, Markus; Hartig, Jörg S
2009-01-01
The protocol presented here allows for the investigation of the formation of unusual nucleic acid structures in the 5'-untranslated region (UTR) of bacteria by correlating gene expression levels to the in vitro stability of the respective structure. In particular, we describe the introduction of G-quadruplex forming sequences close to the ribosome-binding site (RBS) on the mRNA of a reporter gene and the subsequent read-out of the expression levels. Insertion of a stable secondary structure results in the cloaking of RBS and eventually reduced gene expression levels. The structures and stability of the introduced sequences are further characterized by circular dichroism (CD) spectroscopy and thermal melting experiments. The extent of inhibition is then correlated to the stability of the respective quadruplex structure, allowing judgement of whether factors other than thermodynamic stability affect the formation of a given quadruplex sequence in vivo. Measuring gene expression levels takes 2 d including cloning; CD experiments take 5 hours per experiment.
Multiple DNA and protein sequence alignment on a workstation and a supercomputer.
Tajima, K
1988-11-01
This paper describes a multiple alignment method using a workstation and supercomputer. The method is based on the alignment of a set of aligned sequences with the new sequence, and uses a recursive procedure of such alignment. The alignment is executed in a reasonable computation time on diverse levels from a workstation to a supercomputer, from the viewpoint of alignment results and computational speed by parallel processing. The application of the algorithm is illustrated by several examples of multiple alignment of 12 amino acid and DNA sequences of HIV (human immunodeficiency virus) env genes. Colour graphic programs on a workstation and parallel processing on a supercomputer are discussed.
SeqAPASS: Sequence alignment to predict across-species ...
Efforts to shift the toxicity testing paradigm from whole organism studies to those focused on the initiation of toxicity and relevant pathways have led to increased utilization of in vitro and in silico methods. Hence the emergence of high-throughput screening (HTS) programs, such as U.S. EPA ToxCast, and application of the adverse outcome pathway (AOP) framework for identifying and defining biological key events triggered upon perturbation of molecular initiating events and leading to adverse outcomes occurring at a level of organization relevant for risk assessment [1]. With these recent initiatives to harness the power of “the pathway” in describing and evaluating toxicity comes the need to extrapolate data beyond the model species. Sequence alignment to predict across-species susceptibility (SeqAPASS) is a web-based tool that allows the user to begin to understand how broadly HTS data or AOP constructs may plausibly be extrapolated across species, while describing the relative intrinsic susceptibility of different taxa to chemicals with known modes of action (e.g., pharmaceuticals and pesticides). The tool rapidly and strategically assesses available molecular target information to describe protein sequence similarity at the primary amino acid sequence, conserved domain, and individual amino acid residue levels. This in silico approach to species extrapolation was designed to automate and streamline the relatively complex and time-consuming process of co
Studier, F. William
1995-04-18
Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.
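To put the quoted library sizes in context, it helps to compare them with the number of all possible primers of each length, which is 4^k for length k. The Python sketch below does that arithmetic. The function names are illustrative, and the fraction is only a size comparison, not a description of how the libraries were chosen.

```python
def total_kmers(k: int) -> int:
    """Number of distinct DNA sequences of length k (4 bases per position)."""
    return 4 ** k


def library_fraction(library_size: int, k: int) -> float:
    """Fraction of all possible k-mers that a library of the given size covers."""
    return library_size / total_kmers(k)


if __name__ == "__main__":
    # Library sizes as quoted in the abstract.
    for k, size in [(8, 10_000), (9, 14_200), (10, 44_000)]:
        print(f"{k}-mers: {size} of {total_kmers(k)} ({library_fraction(size, k):.1%})")
```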
Anisimov, Andrey P; Panfertsev, Evgeniy A; Svetoch, Tat'yana E; Dentovskaya, Svetlana V
2007-01-01
Sequencing of lcrV genes and comparison of the deduced amino acid sequences from ten Y. pestis strains belonging mostly to the group of atypical rhamnose-positive isolates (non-pestis subspecies or pestoides group) showed that the LcrV proteins analyzed could be classified into five sequence types. This classification was based on major amino acid polymorphisms among LcrV proteins in the four "hot points" of the protein sequences. Some additional minor polymorphisms were found throughout these sequence types. The "hot points" corresponded to amino acids 18 (Lys --> Asn), 72 (Lys --> Arg), 273 (Cys --> Ser), and 324-326 (Ser-Gly-Lys --> Arg) in the LcrV sequence of the reference Y. pestis strain CO92. One possible explanation for polymorphism in amino acid sequences of LcrV among different strains is that strain-specific variation resulted from adaptation of the plague pathogen to different rodent and lagomorph hosts.
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses
USDA-ARS's Scientific Manuscript database
Background: Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected ...
Valliere-Douglass, John F; Kodama, Paul; Mujacic, Mirna; Brady, Lowell J; Wang, Wes; Wallace, Alison; Yan, Boxu; Reddy, Pranhitha; Treuheit, Michael J; Balland, Alain
2009-11-20
We report that N-linked oligosaccharide structures can be present on an asparagine residue not adhering to the consensus site motif NX(S/T), where X is not proline, described in the literature. We have observed oligosaccharides on a non-consensus asparaginyl residue in the C(H)1 constant domain of IgG1 and IgG2 antibodies. The initial findings were obtained from characterization of charge variant populations evident in a recombinant human antibody of the IgG2 subclass. HPLC-MS results indicated that cation-exchange chromatography acidic variant populations were enriched in antibody with a second glycosylation site, in addition to the well documented canonical glycosylation site located in the C(H)2 domain. Subsequent tryptic and chymotryptic peptide map data indicated that the second glycosylation site was associated with the amino acid sequence TVSWN(162)SGAL in the C(H)1 domain of the antibody. This highly atypical modification is present at levels of 0.5-2.0% on most of the recombinant antibodies that have been tested and has also been observed in IgG1 antibodies derived from human donors. Site-directed mutagenesis of the C(H)1 domain sequence in a recombinant-human IgG1 antibody resulted in an increase in non-consensus glycosylation to 3.15%, a greater than 4-fold increase over the level observed in the wild type, by changing the -1 and +1 amino acids relative to the asparagine residue at position 162. We believe that further understanding of the phenomenon of non-consensus glycosylation can be used to gain fundamental insights into the fidelity of the cellular glycosylation machinery.
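The consensus motif NX(S/T), with X not proline, maps directly onto a regular expression. The Python sketch below scans a sequence for consensus sites and shows that the C(H)1 peptide quoted in the abstract contains none, which is what makes the observed glycosylation non-consensus. The function name is our own, and positions are 1-based within the string passed in, not antibody numbering.

```python
import re

# N, then any residue except P, then S or T.
# The lookahead allows overlapping motifs to be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def consensus_sites(seq: str) -> list:
    """Return 1-based positions of Asn residues in an N-X-(S/T) motif, X != Pro."""
    return [m.start() + 1 for m in SEQUON.finditer(seq.upper())]


if __name__ == "__main__":
    # Peptide from the abstract: N is followed by S then G, so no consensus site.
    print(consensus_sites("TVSWNSGAL"))
    # Hypothetical sequence with one consensus site and one proline-blocked site.
    print(consensus_sites("AANGTNPS"))
```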
Yu, Meng; Jia, Hongmei; Zhou, Chao; Yang, Yong; Zhao, Yang; Yang, Maohua; Zou, Zhongmei
2017-05-10
As a prevalent, life-threatening and highly recurrent psychiatric illness, depression is characterized by a wide range of pathological changes; however, its etiology remains incompletely understood. Accumulating evidence supports that gut microbiota affects not only gastrointestinal physiology but also central nervous system (CNS) function and behavior through the microbiota-gut-brain axis. To assess the impact of gut microbiota on fecal metabolic phenotype in depressive conditions, an integrated approach of 16S rRNA gene sequencing combined with ultra high-performance liquid chromatography-mass spectrometry (UHPLC-MS) based metabolomics was performed in chronic variable stress (CVS)-induced depression rat model. Interestingly, depression led to significant gut microbiota changes, at the phylum and genus levels in rats treated with CVS compared to controls. The relative abundances of the bacterial genera Marvinbryantia, Corynebacterium, Psychrobacter, Christensenella, Lactobacillus, Peptostreptococcaceae incertae sedis, Anaerovorax, Clostridiales incertae sedis and Coprococcus were significantly decreased, whereas Candidatus Arthromitus and Oscillibacter were markedly increased in model rats compared with normal controls. Meanwhile, distinct changes in fecal metabolic phenotype of depressive rats were also found, including lower levels of amino acids, and fatty acids, and higher amounts of bile acids, hypoxanthine and stercobilins. Moreover, there were substantial associations of perturbed gut microbiota genera with the altered fecal metabolites, especially compounds involved in the metabolism of tryptophan and bile acids. These results showed that the gut microbiota was altered in association with fecal metabolism in depressive conditions. These findings suggest that the 16S rRNA gene sequencing and LC-MS based metabolomics approach can be further applied to assess pathogenesis of depression. Copyright © 2017 Elsevier B.V. All rights reserved.
β-glucosidase 5 (BGL5) compositions
Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian
2010-06-01
The present invention provides a novel β-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
Antell, Gregory C.; Zhong, Wen; Kercher, Katherine; Passic, Shendra; Williams, Jean; Liu, Yucheng; James, Tony; Jacobson, Jeffrey M.; Szep, Zsofia
2017-01-01
Vpr is an HIV-1 accessory protein that plays numerous roles during viral replication, some of which are cell type dependent. To test the hypothesis that HIV-1 tropism extends beyond the envelope into the vpr gene, studies were performed to identify the associations between coreceptor usage and Vpr variation in HIV-1-infected patients. Colinear HIV-1 Env-V3 and Vpr amino acid sequences were obtained from the LANL HIV-1 sequence database and from well-suppressed patients in the Drexel/Temple Medicine CNS AIDS Research and Eradication Study (CARES) Cohort. Genotypic classification of Env-V3 sequences as X4 (CXCR4-utilizing) or R5 (CCR5-utilizing) was used to group colinear Vpr sequences. To reveal the sequences associated with a specific coreceptor usage genotype, Vpr amino acid sequences were assessed for amino acid diversity and Jensen-Shannon divergence between the two groups. Five amino acid alphabets were used to comprehensively examine the impact of amino acid substitutions involving side chains with similar physicochemical properties. Positions 36, 37, 41, 89, and 96 of Vpr were characterized by statistically significant divergence across multiple alphabets when X4 and R5 sequence groups were compared. In addition, consensus amino acid switches were found at positions 37 and 41 in comparisons of the R5 and X4 sequence populations. These results suggest an evolutionary link between Vpr and gp120 in HIV-1-infected patients. PMID:28620613
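Jensen-Shannon divergence between two groups at a single alignment position can be computed from the residue frequencies in each group. The Python sketch below uses base-2 logarithms, so the result lies between 0 and 1. The residue columns in the usage example are invented, and the study's significance testing and reduced alphabets are not reproduced here.

```python
from collections import Counter
from math import log2


def distribution(residues) -> dict:
    """Relative frequency of each residue observed at one alignment position."""
    counts = Counter(residues)
    total = sum(counts.values())
    return {aa: n / total for aa, n in counts.items()}


def kl_divergence(p: dict, q: dict) -> float:
    """Kullback-Leibler divergence D(p || q) in bits; q must cover p's support."""
    return sum(pi * log2(pi / q[aa]) for aa, pi in p.items() if pi > 0)


def jensen_shannon(p: dict, q: dict) -> float:
    """Jensen-Shannon divergence between two residue distributions (0 to 1)."""
    keys = set(p) | set(q)
    # Mixture distribution m = (p + q) / 2 covers the support of both inputs.
    m = {aa: 0.5 * (p.get(aa, 0.0) + q.get(aa, 0.0)) for aa in keys}
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


if __name__ == "__main__":
    # Hypothetical residues at one position in each group, for illustration only.
    x4_column = "IIILLI"
    r5_column = "LLLLLI"
    print(jensen_shannon(distribution(x4_column), distribution(r5_column)))
```

To mimic the reduced-alphabet analysis, each residue could first be mapped to its physicochemical class before calling `distribution`.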
Wu, Hao-Yang; Wang, Yan-Hui; Xie, Qiang; Ke, Yun-Ling; Bu, Wen-Jun
2016-06-17
With the great development of sequencing technologies and systematic methods, our understanding of evolutionary relationships at deeper levels within the tree of life has greatly improved over the last decade. However, the current taxonomic methodology is insufficient to describe the growing levels of diversity in both a standardised and general way due to the limitations of using only morphological traits to describe clades. Herein, we propose the idea of a molecular classification based on hierarchical and discrete amino acid characters. Clades are classified based on the results of phylogenetic analyses and described using amino acids with group specificity in phylograms. Practices based on the recently published phylogenomic datasets of insects together with 15 de novo sequenced transcriptomes in this study demonstrate that such a methodology can accommodate various higher ranks of taxonomy. Such an approach has the advantage of describing organisms in a standard and discrete way within a phylogenetic framework, thereby facilitating the recognition of clades from the view of the whole lineage, as indicated by PhyloCode. By combining identification keys and phylogenies, the molecular classification based on hierarchical and discrete characters may greatly boost the progress of integrative taxonomy.
Wu, Hao-Yang; Wang, Yan-Hui; Xie, Qiang; Ke, Yun-Ling; Bu, Wen-Jun
2016-01-01
With the great development of sequencing technologies and systematic methods, our understanding of evolutionary relationships at deeper levels within the tree of life has greatly improved over the last decade. However, the current taxonomic methodology is insufficient to describe the growing levels of diversity in both a standardised and general way due to the limitations of using only morphological traits to describe clades. Herein, we propose the idea of a molecular classification based on hierarchical and discrete amino acid characters. Clades are classified based on the results of phylogenetic analyses and described using amino acids with group specificity in phylograms. Practices based on the recently published phylogenomic datasets of insects together with 15 de novo sequenced transcriptomes in this study demonstrate that such a methodology can accommodate various higher ranks of taxonomy. Such an approach has the advantage of describing organisms in a standard and discrete way within a phylogenetic framework, thereby facilitating the recognition of clades from the view of the whole lineage, as indicated by PhyloCode. By combining identification keys and phylogenies, the molecular classification based on hierarchical and discrete characters may greatly boost the progress of integrative taxonomy. PMID:27312960
Methods of diagnosing Alagille syndrome
Li, Linheng; Hood, Leroy; Krantz, Ian D.; Spinner, Nancy B.
2004-03-09
The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
Principles of protein folding--a perspective from simple exact models.
Dill, K. A.; Bromberg, S.; Yue, K.; Fiebig, K. M.; Yee, D. P.; Thomas, P. D.; Chan, H. S.
1995-01-01
General principles of protein structure, stability, and folding kinetics have recently been explored in computer simulations of simple exact lattice models. These models represent protein chains at a rudimentary level, but they involve few parameters, approximations, or implicit biases, and they allow complete explorations of conformational and sequence spaces. Such simulations have resulted in testable predictions that are sometimes unanticipated: The folding code is mainly binary and delocalized throughout the amino acid sequence. The secondary and tertiary structures of a protein are specified mainly by the sequence of polar and nonpolar monomers. More specific interactions may refine the structure, rather than dominate the folding code. Simple exact models can account for the properties that characterize protein folding: two-state cooperativity, secondary and tertiary structures, and multistage folding kinetics--fast hydrophobic collapse followed by slower annealing. These studies suggest the possibility of creating "foldable" chain molecules other than proteins. The encoding of a unique compact chain conformation may not require amino acids; it may require only the ability to synthesize specific monomer sequences in which at least one monomer type is solvent-averse. PMID:7613459
A new ALF from Litopenaeus vannamei and its SNPs related to WSSV resistance
NASA Astrophysics Data System (ADS)
Liu, Jingwen; Yu, Yang; Li, Fuhua; Zhang, Xiaojun; Xiang, Jianhai
2014-11-01
Anti-lipopolysaccharide factors (ALFs) are basic components of the crustacean immune system that defend against a range of pathogens. The cDNA sequence of a new ALF, designated nLvALF2, with an open reading frame encoding 132 amino acids was cloned. Its deduced amino acid sequence contained the conserved functional domain of ALFs, the LPS binding domain (LBD). Its genomic sequence consisted of three exons and four introns. nLvALF2 was mainly expressed in the Oka organ and gills of shrimps. The transcriptional level of nLvALF2 increased significantly after white spot syndrome virus (WSSV) infection, suggesting its important roles in protecting shrimps from WSSV. Single nucleotide polymorphisms (SNPs) were found in the genomic sequence of nLvALF2, of which 38 were analyzed for associations with the susceptibility/resistance of shrimps to WSSV. The loci g.2422 A>G, g.2466 T>C, and g.2529 G>A were significantly associated with the resistance to WSSV (P<0.05). These SNP loci could be developed as markers for selection of WSSV-resistant varieties of Litopenaeus vannamei.
L'vov, D K; Al'khovskiĭ, S V; Shchelkanov, M Iu; Shchetinin, A M; Deriabin, P G; Aristova, V A; Gitel'man, A K; Samokhvalov, E I; Botikov, A G
2014-01-01
The Tyulek virus (TLKV) was isolated from the ticks Argas vulgaris Filippova, 1961 (Argasidae), collected from burrow biotopes in a multispecies bird colony in the Aksu river floodplain near Tyulek village (northern part of the Chu Valley, Kyrgyzstan). Recently, TLKV was assigned to the Quaranfil group (including the Quaranfil virus (QRFV), Johnston Atoll virus (JAV), and Lake Chad virus), which forms the novel genus Quaranjavirus in the family Orthomyxoviridae. In this work, the complete genome sequence of TLKV (GenBank ID KJ438647-8) was determined using next-generation sequencing (Illumina platform). Comparison of deduced amino acid sequences shows a close relationship of TLKV with QRFV and JAV (86% and 84% identity for PB1 and about 70% for PB2 and PA, respectively). The identity level of TLKV and QRFV in the outer glycoprotein GP is 72% and 80% for nucleotide and amino acid sequences, respectively. The phylogenetic analysis showed that TLKV belongs to the genus Quaranjavirus in the family Orthomyxoviridae.
Structure-Specific Ribonucleases for MS-Based Elucidation of Higher-Order RNA Structure
NASA Astrophysics Data System (ADS)
Scalabrin, Matteo; Siu, Yik; Asare-Okai, Papa Nii; Fabris, Daniele
2014-07-01
Supported by high-throughput sequencing technologies, structure-specific nucleases are experiencing a renaissance as biochemical probes for genome-wide mapping of nucleic acid structure. This report explores the benefits and pitfalls of the application of Mung bean (Mb) and V1 nuclease, which attack specifically single- and double-stranded regions of nucleic acids, as possible structural probes to be employed in combination with MS detection. Both enzymes were found capable of operating in ammonium-based solutions that are preferred for high-resolution analysis by direct infusion electrospray ionization (ESI). Sequence analysis by tandem mass spectrometry (MS/MS) was performed to confirm mapping assignments and to resolve possible ambiguities arising from the concomitant formation of isobaric products with identical base composition and different sequences. The observed products grouped together into ladder-type series that facilitated their assignment to unique regions of the substrate, but revealed also a certain level of uncertainty in identifying the boundaries between paired and unpaired regions. Various experimental factors that are known to stabilize nucleic acid structure, such as higher ionic strength, presence of Mg(II), etc., increased the accuracy of cleavage information, but did not completely eliminate deviations from expected results. These observations suggest extreme caution in interpreting the results afforded by these types of reagents. Regardless of the analytical platform of choice, the results highlighted the need to repeat probing experiments under the most diverse possible conditions to recognize potential artifacts and to increase the level of confidence in the observed structural information.
Ma, G X; Zhou, R Q; Hu, L; Luo, Y L; Luo, Y F; Zhu, H H
2018-03-01
Toxocara canis is an important but neglected zoonotic parasite, and is the causative agent of human toxocariasis. Chondroitin proteoglycans are biological macromolecules, widely distributed in extracellular matrices, with a great diversity of functions in mammals. However, there is limited information regarding chondroitin proteoglycans in nematode parasites. In the present study, a female-enriched chondroitin proteoglycan 2 gene of T. canis (Tc-cpg-2) was cloned and characterized. Quantitative real-time polymerase chain reaction (qRT-PCR) was employed to measure the transcription levels of Tc-cpg-2 among tissues of male and female adult worms. A 485-amino-acid (aa) polypeptide was predicted from a continuous 1458-nucleotide open reading frame and designated as TcCPG2, which contains a 21-aa signal peptide. Conserved domain searching indicated three chitin-binding peritrophin-A (CBM_14) domains in the amino acid sequence of TcCPG2. Multiple alignment with the inferred amino acid sequences of Caenorhabditis elegans and Ascaris suum showed that CBM_14 domains were well conserved among these species. Phylogenetic analysis suggested that TcCPG2 was closely related to the sequence of chondroitin proteoglycan 2 of A. suum. Interestingly, a high level of Tc-cpg-2 was detected in female germline tissues, particularly in the oviduct, suggesting potential roles of this gene in reproduction (e.g. oogenesis and embryogenesis) of adult T. canis. The functional roles of Tc-cpg-2 in reproduction and development in this parasite and related parasitic nematodes warrant further study.
Complete amino acid sequence of bovine colostrum low-Mr cysteine proteinase inhibitor.
Hirado, M; Tsunasawa, S; Sakiyama, F; Niinobe, M; Fujii, S
1985-07-01
The complete amino acid sequence of bovine colostrum cysteine proteinase inhibitor was determined by sequencing native inhibitor and peptides obtained by cyanogen bromide degradation, Achromobacter lysylendopeptidase digestion and partial acid hydrolysis of reduced and S-carboxymethylated protein. Achromobacter peptidase digestion was successfully used to isolate two disulfide-containing peptides. The inhibitor consists of 112 amino acids with an Mr of 12787. Two disulfide bonds were established between Cys 66 and Cys 77 and between Cys 90 and Cys 110. A high degree of homology in the sequence was found between the colostrum inhibitor and human gamma-trace, human salivary acidic protein and chicken egg-white cystatin.
Itoh, Nobuya; Takagi, Shinya; Miki, Asami; Kurokawa, Junji
2016-01-01
Epitheaflagallin 3-O-gallate (ETFGg) is a minor polyphenol found in black tea extract, which has good physiological functions. It is synthesized from epigallocatechin gallate (EGCg) with gallic acid via laccase oxidation. Various basidiomycetes and fungi were screened to find a suitable laccase for the production of ETFGg. A basidiomycete, Hericium coralloides NBRC 7716, produced an appropriate extracellular laccase. The purified laccase produced twice the level of ETFGg compared with commercially available laccase from Trametes sp. The enzyme, termed Lcc2, is a monomeric protein with an apparent molecular mass of 67.2 kDa. The N-terminal amino acid sequence of Lcc2 is quite different from that of laccase isolated from the fruiting bodies of Hericium. Lcc2 showed similar substrate specificity to known laccases and could oxidize various phenolic substrates, including pyrogallol, gallic acid, and 2,6-dimethoxyphenol. The full-length lcc2 gene was obtained by PCR using degenerate primers, which were designed based on the N-terminal amino acid sequence of Lcc2 and conserved copper-binding sites of laccases, and by 5'- and 3'-RACE PCR with mRNA. The lcc2 gene showed homology with Lentinula edodes laccase (sharing 77% amino acid identity with Lcc6). We successfully produced extracellular Lcc2 using a heterologous expression system with Saccharomyces cerevisiae. Moreover, it was confirmed that the recombinant laccase generates similar levels of ETFGg as the native enzyme. Copyright © 2015 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Qi, Fei; Guo, Huarong; Wang, Jian
2008-02-01
Reversible protein phosphorylation, catalyzed by protein kinases and phosphatases, is an important and versatile mechanism by which eukaryotic cells regulate almost all signaling processes. Protein phosphatase 1 (PP1) is the first and a well-characterized member of the protein serine/threonine phosphatase family. In the present study, a full-length cDNA encoding the beta isoform of the catalytic subunit of protein phosphatase 1 (PP1cb), designated SmPP1cb, was isolated and sequenced for the first time from the skin tissue of the flatfish turbot Scophthalmus maximus by the rapid amplification of cDNA ends (RACE) technique. The SmPP1cb cDNA contains a 984 bp open reading frame (ORF), flanked by a complete 39 bp 5' untranslated region (UTR) and a 462 bp 3' UTR. The ORF encodes a putative 327 amino acid protein with a calculated molecular mass of 37 193 Da and a pI of 5.8. The N-terminal section of this protein, Met-Ala-Glu-Gly-Glu-Leu-Asp-Val-Asp, is highly acidic, a common feature of PP1 catalytic subunits that is absent in protein phosphatase 2B (PP2B). Sequence analysis indicated that SmPP1cb is extremely conserved at both the amino acid and nucleotide levels compared with the PP1cb of other vertebrates and invertebrates. The Kozak motif in the 5' UTR around the ATG start codon is GXXAXXGXX ATGG, which differs from the mammalian motif at two positions, A-6 and G-3, indicating the possibility of different translation initiation in turbot. The 3' UTR of SmPP1cb is highly diverse in sequence similarity and length compared with other animals, especially zebrafish. The cloning and sequencing of the SmPP1cb gene lays a good foundation for future work on the biological functions of PP1 in the flatfish turbot.
Detection and isolation of nucleic acid sequences using competitive hybridization probes
Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.
1997-01-01
A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.
Detection and isolation of nucleic acid sequences using competitive hybridization probes
Lucas, J.N.; Straume, T.; Bogen, K.T.
1997-04-01
A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.
NASA Technical Reports Server (NTRS)
Wu, Liu-Lai; Song, Il; Karuppiah, Nadarajah; Kaufman, Peter B.
1993-01-01
An asymmetric (top vs. bottom halves of pulvini) induction of invertase mRNA by gravistimulation was analyzed in oat shoot pulvini. Total RNA and poly(A)(+) RNA, isolated from oat pulvini, and two oligonucleotide primers, corresponding to two conserved amino acid sequences (NDPNG and WECPD) found in invertase from other species, were used for the polymerase chain reaction (PCR). A partial length cDNA (550 bp) was obtained and characterized. A 62% nucleotide sequence homology and 58% deduced amino acid sequence homology, as compared to beta-fructosidase of carrot cell wall, was found. Northern blot analysis showed that there was an obviously transient induction of invertase mRNA by gravistimulation in the oat pulvinus system. The mRNA was rapidly induced to a maximum level at 1 hour after gravistimulation treatment and gradually decreased afterwards. The mRNA level in the bottom half of the oat pulvinus was significantly higher than that in the top half of the pulvinus tissue. The kinetic induction of invertase mRNA was consistent with the transient accumulation of invertase activity during the graviresponse of the pulvinus. This indicates that the expression of the invertase gene(s) could be regulated by gravistimulation at the transcriptional level. Southern blot analysis showed that there were two to three genomic DNA fragments which hybridized with the partial-length invertase cDNA.
Yadav, Kamlesh Kumar; Rajasekharan, Ram
2016-11-01
PHM8 is a very important enzyme in nonpolar lipid metabolism because of its role in triacylglycerol (TAG) biosynthesis under phosphate stress conditions. It is positively regulated by the PHO4 transcription factor under low phosphate conditions; however, its regulation has not been explored under normal physiological conditions. General control nonderepressible (GCN4), a basic leucine-zipper transcription factor, activates the transcription of amino acid and purine biosynthesis genes and many stress response genes under various stress conditions. In this study, we demonstrate that the level of TAG is regulated by the transcription factor GCN4. GCN4 directly binds to its consensus recognition sequence (TGACTC) in the PHM8 promoter and controls its expression. The analysis of cells expressing the P(PHM8)-lacZ reporter gene showed that mutations (TGACTC-GGGCCC) in the GCN4-binding sequence caused a significant increase in β-galactosidase activity. Mutation in the GCN4 binding sequence causes an increase in PHM8 expression, lysophosphatidic acid phosphatase activity and TAG level. PHM8, in conjunction with DGA1, a mono- and diacylglycerol transferase, controls the level of TAG. These results revealed that GCN4 negatively regulates PHM8 and that deletion of GCN4 causes de-repression of PHM8, which is responsible for the increased TAG content in gcn4∆ cells.
Applications of Single-Cell Sequencing for Multiomics.
Xu, Yungang; Zhou, Xiaobo
2018-01-01
Single-cell sequencing interrogates the sequence or chromatin information from individual cells with advanced next-generation sequencing technologies. It provides a higher resolution of cellular differences and a better understanding of the underlying genetic and epigenetic mechanisms of an individual cell in the context of its survival and adaptation to microenvironment. However, it is more challenging to perform single-cell sequencing and downstream data analysis, owing to the minimal amount of starting materials, sample loss, and contamination. In addition, due to the picogram level of the amount of nucleic acids used, heavy amplification is often needed during sample preparation of single-cell sequencing, resulting in the uneven coverage, noise, and inaccurate quantification of sequencing data. All these unique properties raise challenges in and thus high demands for computational methods that specifically fit single-cell sequencing data. We here comprehensively survey the current strategies and challenges for multiple single-cell sequencing, including single-cell transcriptome, genome, and epigenome, beginning with a brief introduction to multiple sequencing techniques for single cells.
Li, Yantao; Fu, Tuo; Liu, Tao; Guo, Huaizu; Guo, Qingcheng; Xu, Jin; Zhang, Dapeng; Qian, Weizhu; Dai, Jianxin; Li, Bohua; Guo, Yajun; Hou, Sheng; Wang, Hao
2016-07-01
Nivolumab is a therapeutic fully human IgG4 antibody to programmed death 1 (PD-1). In this study, a nivolumab biosimilar, which was produced in our laboratory, was analyzed and characterized. Sequence variants that contain undesired amino acid sequences may cause concern during biosimilar bioprocess development. We found that low levels of sequence variants were detected in the heavy chain of the nivolumab biosimilar by ultra performance liquid chromatography (UPLC) and tandem mass spectrometry. It was further identified with UPLC-MS/MS by IdeS or trypsin digestion. The sequence variant was confirmed through addition of synthetic mutant peptide. Subsequently, the mixing base signal of normal and mutant sequence was detected through DNA sequencing. The relative levels of mutant A424V in the Fc region of the heavy chain have been detected and demonstrated to be 12.25% and 13.54%, via base peak intensity (BPI) and UV chromatography of the tryptic peptide mapping, respectively. A424V variant was also quantified by real-time PCR (RT-PCR) at the DNA and RNA level, which was 19.2% and 16.8%, respectively. The relative content of the mutant was consistent at the DNA, RNA and protein level, indicating that the A424V mutation may have little influence at transcriptional or translational levels. These results demonstrate that orthogonal state-of-the-art techniques such as LC-UV-MS and RT-PCR should be implemented to characterize recombinant proteins and cell lines for development of biosimilars. Our study suggests that it is important to establish an integrated and effective analytical method to monitor and characterize sequence variants during antibody drug development, especially for antibody biosimilar products.
Detection of nucleic acids by multiple sequential invasive cleavages
Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.
1999-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann; Kwiatkowski, Robert W.; Vavra, Stephanie H.
2005-03-29
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of nucleic acid from various viruses in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages 02
Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.
2002-01-01
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages
Hall, Jeff G; Lyamichev, Victor I; Mast, Andrea L; Brow, Mary Ann D
2012-10-16
The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
[Analysis of structural characteristics of alpha-tubulins in plants with enhanced cold tolerance].
Nyporko, A Iu; Demchuk, O N; Blium, Ia B
2003-01-01
The uniqueness of the point substitutions in the sequences of two alpha-tubulin isotypes from the psychrophilic alga Chloromonas that may determine the increased cold tolerance of this alga was analyzed. Comparison of all known amino acid sequences of plant alpha-tubulins established that only the M268-->V replacement is unique and may have a significant influence on the spatial structure of plant alpha-tubulins. Modeling of the molecular surfaces of alpha-tubulins from Chloromonas, Chlamydomonas reinhardtii and goose grass Eleusine indica showed that introducing the amino acid replacement M268-->V into the sequence of goose grass tubulin made the surface of this protein resemble that of native alpha-tubulin from Chloromonas. The replacement was shown to alter the local hydrophobic properties of the alpha-tubulin molecular surface in the interdimeric contact zone, which may play an important role in increasing the cold resistance of microtubules. The crucial role of the amino acid residue at position 268 in forming the interdimeric contact surface of the alpha-tubulin molecule was revealed. It is proposed that replacements at this position are important for plant tolerance to abiotic factors of different nature (cold, herbicides).
Zenno, S; Saigo, K; Kanoh, H; Inouye, S
1994-01-01
The gene encoding the major NAD(P)H-flavin oxidoreductase (flavin reductase) of the luminous bacterium Vibrio fischeri ATCC 7744 was isolated by using synthetic oligonucleotide probes corresponding to the N-terminal amino acid sequence of the enzyme. Nucleotide sequence analysis suggested that the major flavin reductase of V. fischeri consisted of 218 amino acids and had a calculated molecular weight of 24,562. Cloned flavin reductase expressed in Escherichia coli was purified virtually to homogeneity, and its basic biochemical properties were examined. As in the major flavin reductase in crude extracts of V. fischeri, cloned flavin reductase showed broad substrate specificity and served well as a catalyst to supply reduced flavin mononucleotide (FMNH2) to the bioluminescence reaction. The major flavin reductase of V. fischeri not only showed significant similarity in amino acid sequence to oxygen-insensitive NAD(P)H nitroreductases of Salmonella typhimurium, Enterobacter cloacae, and E. coli but also was associated with a low level of nitroreductase activity. The major flavin reductase of V. fischeri and the nitroreductases of members of the family Enterobacteriaceae would thus appear closely related in evolution and form a novel protein family. PMID:8206830
Bhattacharyya, Pallab K; Phillips, Micheal D; Stone, Lael A; Lowe, Mark J
2011-04-01
Gamma-aminobutyric acid (GABA) is a major inhibitory neurotransmitter in the brain. Understanding the GABA concentration, in vivo, is important to understand normal brain function. Using MEGA point-resolved spectroscopy sequence with interleaved water scans to detect subject motion, GABA level of sensorimotor cortex was measured using a voxel identified from a functional magnetic resonance imaging scan. The GABA level in a 20×20×20-mm(3) voxel consisting of 37%±7% gray matter, 52%±12% white matter and 11%±8% cerebrospinal fluid in the sensorimotor region was measured to be 1.43±0.48 mM. In addition, using linear regression analysis, GABA concentrations within gray and white matter were calculated to be 2.87±0.61 and 0.33±0.11 mM, respectively. Copyright © 2011 Elsevier Inc. All rights reserved.
[Hepatitis C virus: sequence homology of a European isolate and divergence from the prototype].
Seelig, R; Seelig, H P; Renz, M
1991-08-01
The polymerase chain reaction (PCR) detected specific hepatitis C viral (HCV) RNA sequences in liver biopsies from two patients with chronic hepatitis, in the tissue of a liver transplant, in plasma from four chronic non-A, non-B hepatitis (NANBH) patients and, for the first time, in an infectious anti-D-immunoglobulin preparation. A comparison of the viral sequences coding for a region of the nonstructural NS3 protein from the liver tissues revealed only a very small degree of sequence divergence at the cDNA as well as the amino acid level (between 0 and 5%). The sequence similarities of the RNA isolated from plasma of the four chronic NANBH patients and from the anti-D-immunoglobulin preparation were in part somewhat lower but overall still high (between 90 and 100%). In contrast, all eight cDNA and amino acid sequences exhibited a significantly higher degree of divergence from the HCV prototype sequence (between 29 and 32%) than among themselves (between 0 and 10%). This unexpectedly high sequence similarity among the eight European isolates, together with their low homology to the North American prototype sequence, indicates the existence of different types of HCV. This will be important not only for epidemiological studies but also for the development of effective diagnostic procedures and vaccines. Concerning the pathogenesis of NANBH, a double infection or a helper mechanism has to be considered: in addition to the C virus, sequences of another virus particle were found in the infectious IgG preparation as well as in the liver biopsies.
Characterization of sams genes of Amoeba proteus and the endosymbiotic X-bacteria.
Jeon, Taeck J; Jeon, Kwang W
2003-01-01
As a result of harboring obligatory bacterial endosymbionts, the xD strain of Amoeba proteus no longer produces its own S-adenosylmethionine synthetase (SAMS). When symbiont-free D amoebae are infected with symbionts (X-bacteria), the amount of amoeba SAMS decreases to a negligible level within four weeks, but about 47% of the SAMS activity, which apparently comes from another source, is still detected. Complete nucleotide sequences of sams genes of D and xD amoebae are presented and show that there are no differences between the two. Long-established xD amoebae contain an intact sams gene and thus the loss of xD amoeba's SAMS is not due to the loss of the gene itself. The open reading frame of the amoeba's sams gene has 1,281 nucleotides, encoding SAMS of 426 amino acids with a mass of 48 kDa and pI of 6.5. The amino acid sequence of amoeba SAMS is longer than the SAMS of other organisms by having an extra internal stretch of 28 amino acids. The 5'-flanking region of amoeba sams contains consensus-binding sites for several transcription factors that are related to the regulation of sams genes in E. coli and yeast. The complete nucleotide sequence of the symbiont's sams gene is also presented. The open reading frame of X-bacteria sams is 1,146 nucleotides long, encoding SAMS of 381 amino acids with a mass of 41 kDa and pI of 6.0. The X-bacteria SAMS has 45% sequence identity with that of A. proteus.
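The ORF and protein lengths quoted in the abstract above agree with each other if each ORF length is taken to include the stop codon. That reading is an assumption, since the abstract does not state it. A minimal sketch of the arithmetic follows; the function name `protein_length_from_orf` is hypothetical and introduced only for illustration.

```python
def protein_length_from_orf(orf_nt: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of `orf_nt` nucleotides.

    Assumption (not stated in the abstract): the quoted ORF length
    includes the terminal stop codon, which encodes no residue.
    """
    if orf_nt <= 0 or orf_nt % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of 3")
    codons = orf_nt // 3
    return codons - 1 if includes_stop else codons


# Lengths taken from the abstract: amoeba sams and X-bacteria sams.
amoeba_residues = protein_length_from_orf(1281)
symbiont_residues = protein_length_from_orf(1146)
print(amoeba_residues, symbiont_residues)
```

Under this assumption the two ORFs give 426 and 381 residues, matching the protein lengths reported for the amoeba and the symbiont SAMS.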
Lashbrook, C C; Gonzalez-Bosch, C; Bennett, A B
1994-01-01
Two structurally divergent endo-beta-1,4-glucanase (EGase) cDNAs were cloned from tomato. Although both cDNAs (Cel1 and Cel2) encode potentially glycosylated, basic proteins of 51 to 53 kD and possess multiple amino acid domains conserved in both plant and microbial EGases, Cel1 and Cel2 exhibit only 50% amino acid identity at the overall sequence level. Amino acid sequence comparisons to other plant EGases indicate that tomato Cel1 is most similar to bean abscission zone EGase (68%), whereas Cel2 exhibits greatest sequence identity to avocado fruit EGase (57%). Sequence comparisons suggest the presence of at least two structurally divergent EGase families in plants. Unlike ripening avocado fruit and bean abscission zones in which a single EGase mRNA predominates, EGase expression in tomato reflects the overlapping accumulation of both Cel1 and Cel2 transcripts in ripening fruit and in plant organs undergoing cell separation. Cel1 mRNA contributes significantly to total EGase mRNA accumulation within plant organs undergoing cell separation (abscission zones and mature anthers), whereas Cel2 mRNA is most abundant in ripening fruit. The overlapping expression of divergent EGase genes within a single species may suggest that multiple activities are required for the cooperative disassembly of cell wall components during fruit ripening, floral abscission, and anther dehiscence. PMID:7994180
Sousa, Juliana C; Berto, Raquel F; Gois, Elicélia A; Fontenele-Cardi, Nauíla C; Honório, José E R; Konno, Katsuhiro; Richardson, Michael; Rocha, Marcos F G; Camargo, Antônio A C M; Pimenta, Daniel C; Cardi, Bruno A; Carvalho, Krishnamurti M
2009-07-01
Antimicrobial peptides are components of innate immunity, the first line of defense against invading pathogens in a wide range of organisms. Here, we describe the isolation, biological characterization and amino acid sequencing of a novel neutral glycine/leucine-rich antimicrobial peptide, named leptoglycin, from the skin secretion of Leptodactylus pentadactylus. The amino acid sequence of the peptide purified by RP-HPLC (C(18) column) was deduced by mass spectrometric de novo sequencing and confirmed by Edman degradation: GLLGGLLGPLLGGGGGGGGGLL. Leptoglycin inhibited the growth of the Gram-negative bacteria Pseudomonas aeruginosa, Escherichia coli and Citrobacter freundii with minimal inhibitory concentrations (MICs) of 8 microM, 50 microM, and 75 microM, respectively, but it showed no antimicrobial activity against Gram-positive bacteria (Staphylococcus aureus, Micrococcus luteus and Enterococcus faecalis), yeasts (Candida albicans and Candida tropicalis) or dermatophyte fungi (Microsporum canis and Trichophyton rubrum). No hemolytic activity was observed in the 2-200 microM concentration range. The amino acid sequence of leptoglycin, with its high content of glycine (59.1%) and leucine (36.4%) and an unusual central proline, suggests the existence of a new class of Gly/Leu-rich antimicrobial peptides. Taken together, these results suggest that this natural antimicrobial peptide could be a tool to develop new antibiotics.
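The glycine and leucine percentages quoted for leptoglycin can be recomputed directly from the 22-residue sequence given in the abstract. The sketch below is only an illustration; the helper name `composition_percent` is hypothetical, and the sequence is written in pieces so that the run of nine glycines is easy to check.

```python
from collections import Counter

# Leptoglycin sequence from the abstract, split so the central
# nine-glycine run can be verified by eye.
LEPTOGLYCIN = "GLLGGLLGPLL" + "G" * 9 + "LL"


def composition_percent(seq: str) -> dict:
    """Percentage of each residue in `seq`, rounded to one decimal."""
    counts = Counter(seq)
    return {aa: round(100 * n / len(seq), 1) for aa, n in counts.items()}


composition = composition_percent(LEPTOGLYCIN)
print(len(LEPTOGLYCIN), composition)
```

With 13 glycines, 8 leucines and 1 proline in 22 residues, the rounded values agree with the 59.1% glycine and 36.4% leucine stated in the abstract.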
Tei, Meina; Uchida, Kazuyuki; Chambers, James K; Watanabe, Ken-Ichi; Tamamoto, Takashi; Ohno, Koichi; Nakayama, Hiroyuki
2018-02-02
Amyloid A (AA) amyloidosis, a fatal systemic amyloid disease, occurs secondary to chronic inflammatory conditions in humans. Although persistently elevated serum amyloid A (SAA) levels are required for its pathogenesis, not all individuals with chronic inflammation necessarily develop AA amyloidosis. Furthermore, many diseases in cats are associated with the elevated production of SAA, whereas only a small number actually develop AA amyloidosis. We hypothesized that a genetic mutation in the SAA gene may strongly contribute to the pathogenesis of feline AA amyloidosis. In the present study, genomic DNA from four Japanese domestic cats (JDCs) with AA amyloidosis and from five without amyloidosis was analyzed using polymerase chain reaction (PCR) amplification and direct sequencing. We identified the novel variation combination of 45R-51A in the deduced amino acid sequences of four JDCs with amyloidosis and five without. However, there was no relationship between amino acid variations and the distribution of AA amyloid deposits, indicating that differences in SAA sequences do not contribute to the pathogenesis of AA amyloidosis. Immunohistochemical analysis using antisera against three different parts of the feline SAA protein (i.e., the N-terminal, central, and C-terminal regions) revealed that feline AA contained the C-terminus, unlike human AA. These results indicate that the cleavage and degradation of the C-terminus are not essential for amyloid fibril formation in JDCs.
Biological production of organic compounds
Yu, Jianping; Paddock, Troy; Carrieri, Damian; Maness, Pin-Ching; Seibert, Michael
2016-04-12
Strains of cyanobacteria that produce high levels of alpha ketoglutarate (AKG) and pyruvate are disclosed herein. Methods of culturing these cyanobacteria to produce AKG or pyruvate and recover AKG or pyruvate from the culture are also described herein. Nucleic acid sequences encoding polypeptides that function as ethylene-forming enzymes and their use in the production of ethylene are further disclosed herein. These nucleic acids may be expressed in hosts such as cyanobacteria, which in turn may be cultured to produce ethylene.
Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B
1986-01-01
A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine, the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. PMID:3012461
Vanillin formation from ferulic acid in Vanilla planifolia is catalysed by a single enzyme.
Gallage, Nethaji J; Hansen, Esben H; Kannangara, Rubini; Olsen, Carl Erik; Motawia, Mohammed Saddik; Jørgensen, Kirsten; Holme, Inger; Hebelstrup, Kim; Grisoni, Michel; Møller, Birger Lindberg
2014-06-19
Vanillin is a popular and valuable flavour compound. It is the key constituent of the natural vanilla flavour obtained from cured vanilla pods. Here we show that a single hydratase/lyase type enzyme designated vanillin synthase (VpVAN) catalyses direct conversion of ferulic acid and its glucoside into vanillin and its glucoside, respectively. The enzyme shows high sequence similarity to cysteine proteinases and is specific to the substitution pattern at the aromatic ring and does not metabolize caffeic acid and p-coumaric acid as demonstrated by coupled transcription/translation assays. VpVAN localizes to the inner part of the vanilla pod and high transcript levels are found in single cells located a few cell layers from the inner epidermis. Transient expression of VpVAN in tobacco and stable expression in barley in combination with the action of endogenous alcohol dehydrogenases and UDP-glucosyltransferases result in vanillyl alcohol glucoside formation from endogenous ferulic acid. A gene encoding an enzyme showing 71% sequence identity to VpVAN was identified in another vanillin-producing plant species Glechoma hederacea and was also shown to be a vanillin synthase as demonstrated by transient expression in tobacco.
Vanillin formation from ferulic acid in Vanilla planifolia is catalysed by a single enzyme
Gallage, Nethaji J.; Hansen, Esben H.; Kannangara, Rubini; Olsen, Carl Erik; Motawia, Mohammed Saddik; Jørgensen, Kirsten; Holme, Inger; Hebelstrup, Kim; Grisoni, Michel; Møller, Birger Lindberg
2014-01-01
Vanillin is a popular and valuable flavour compound. It is the key constituent of the natural vanilla flavour obtained from cured vanilla pods. Here we show that a single hydratase/lyase type enzyme designated vanillin synthase (VpVAN) catalyses direct conversion of ferulic acid and its glucoside into vanillin and its glucoside, respectively. The enzyme shows high sequence similarity to cysteine proteinases and is specific to the substitution pattern at the aromatic ring and does not metabolize caffeic acid and p-coumaric acid as demonstrated by coupled transcription/translation assays. VpVAN localizes to the inner part of the vanilla pod and high transcript levels are found in single cells located a few cell layers from the inner epidermis. Transient expression of VpVAN in tobacco and stable expression in barley in combination with the action of endogenous alcohol dehydrogenases and UDP-glucosyltransferases result in vanillyl alcohol glucoside formation from endogenous ferulic acid. A gene encoding an enzyme showing 71% sequence identity to VpVAN was identified in another vanillin-producing plant species Glechoma hederacea and was also shown to be a vanillin synthase as demonstrated by transient expression in tobacco. PMID:24941968
Method of increasing conversion of a fatty acid to its corresponding dicarboxylic acid
Craft, David L.; Wilson, C. Ron; Eirich, Dudley; Zhang, Yeyan
2004-09-14
A nucleic acid sequence including a CYP promoter operably linked to nucleic acid encoding a heterologous protein is provided to increase transcription of the nucleic acid. Expression vectors and host cells containing the nucleic acid sequence are also provided. The methods and compositions described herein are especially useful in the production of polycarboxylic acids by yeast cells.
Sadkowska-Todys, M
2000-01-01
The aims of these studies were the genetic characterization of street rabies virus strains isolated from different animal species in Poland and the determination of their phylogenetic relationships to reference laboratory strains of the street rabies viruses belonging to genotypes 1 and 5. The variability of rabies isolates and their phylogenetic relationships were studied by comparing the nucleotide sequence of a fragment of the virus genome. The Polish strains of genotype 1 belong to four phylogenetic groups (NE, CE, NEE, EE) corresponding to four variants: fox-raccoon dog (F-RD), European fox 1 (F1), European fox 2 (F2) and European fox 3 (F3). In Poland there are no rabies strains representing the dog-wolf variant or the variant typical of the arctic fox. The similarity of the nucleotide and amino acid sequences of street rabies strains belonging to genotype 1 to those of the laboratory strain CVS is very high: about 91% at the nucleotide level and 95% at the amino acid level. Rabies strain CVS is only about 69% and 74% similar to genotype 5 bat strains (EBL 1) at the nucleotide and amino acid levels, respectively. The genetic divergence of rabies strains circulating in Poland calls for permanent epidemiological and virological surveillance. The genotype and variant of isolated strains should be determined (using PCR and RFLP methods).
Fowler, Elizabeth V; Peters, Jennifer M; Gatton, Michelle L; Chen, Nanhua; Cheng, Qin
2002-03-01
In Plasmodium falciparum a highly polymorphic multi-copy gene family, var, encodes the variant surface antigen P. falciparum erythrocyte membrane protein 1 (PfEMP1), which has an important role in cytoadherence and immune evasion. Using previously described universal PCR primers for the first Duffy binding-like domain (DBLalpha) of var we analysed the DBLalpha repertoires of Dd2 (originally from Thailand) and eight isolates from the Solomon Islands (n=4), Philippines (n=2), Papua New Guinea (n=1) and Africa (n=1). We found 15-32 unique DBLalpha sequence types among these isolates and estimated detectable DBLalpha repertoire sizes ranging from 33-38 to 52-57 copies per genome. Our data suggest that var gene repertoires generally consist of 40-50 copies per genome. Eighteen DBLalpha sequences appeared in more than one Asia-Pacific isolate with the number of sequences shared between any two isolates ranging from 0 to 6 (mean=2.0 +/-1.6). At the amino acid level DBLalpha sequence similarity within isolates ranged from 45.2 +/- 7.1 to 50.2 +/- 6.9%, and was not significantly different from the DBLalpha amino acid sequence similarity among isolates (P>0.1). Comparisons with published sequences also revealed little overlap among DBLalpha sequences from different regions. High DBLalpha sequence diversity and minimal overlap among these isolates suggest that the global var gene repertoire is immense, and may potentially be selected for by the host's protective immune response to the var gene products, PfEMP1.
Oliveira-Neto, Osmundo B; Batista, João A N; Rigden, Daniel J; Fragoso, Rodrigo R; Silva, Rodrigo O; Gomes, Eliane A; Franco, Octávio L; Dias, Simoni C; Cordeiro, Célia M T; Monnerat, Rose G; Grossi-De-Sá, Maria F
2004-09-01
Fourteen different cDNA fragments encoding serine proteinases were isolated by reverse transcription-PCR from cotton boll weevil (Anthonomus grandis) larvae. A large diversity between the sequences was observed, with a mean pairwise identity of 22% in the amino acid sequence. The cDNAs encompassed 11 trypsin-like sequences classifiable into three families and three chymotrypsin-like sequences belonging to a single family. Using a combination of 5' and 3' RACE, the full-length sequence was obtained for five of the cDNAs, named Agser2, Agser5, Agser6, Agser10 and Agser21. The encoded proteins included amino acid sequence motifs of serine proteinase active sites, conserved cysteine residues, and both zymogen activation and signal peptides. Southern blotting analysis suggested that one or two copies of these serine proteinase genes exist in the A. grandis genome. Northern blotting analysis of Agser2 and Agser5 showed that for both genes, expression is induced upon feeding and is concentrated in the gut of larvae and adult insects. Reverse northern analysis of the 14 cDNA fragments showed that only two trypsin-like and two chymotrypsin-like were expressed at detectable levels. Under the effect of the serine proteinase inhibitors soybean Kunitz trypsin inhibitor and black-eyed pea trypsin/chymotrypsin inhibitor, expression of one of the trypsin-like sequences was upregulated while expression of the two chymotrypsin-like sequences was downregulated. Copyright 2004 Elsevier Ltd.
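The "mean pairwise identity" reported above is a summary statistic over all pairs of aligned sequences. The sketch below shows one common way to compute it and is not necessarily the procedure the authors used. Identity definitions vary; here, as an assumption, columns containing a gap in either sequence are skipped. The function names and the toy sequences in the test are hypothetical.

```python
from itertools import combinations


def percent_identity(a: str, b: str) -> float:
    """Percent identity of two pre-aligned, equal-length sequences.

    Assumption: columns with a gap ('-') in either sequence are ignored.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        return 0.0
    matches = sum(x == y for x, y in compared)
    return 100.0 * matches / len(compared)


def mean_pairwise_identity(aligned: list) -> float:
    """Average percent identity over every unordered pair of sequences."""
    pairs = list(combinations(aligned, 2))
    if not pairs:
        raise ValueError("need at least two sequences")
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)
```

For 14 sequences this averages over 91 pairs, so a low mean such as 22% can coexist with much higher identity inside individual families.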
Konami, Y; Yamamoto, K; Osawa, T; Irimura, T
1995-04-01
The complete amino acid sequence of a lactose-binding Cytisus sessilifolius anti-H(O) lectin II (CSA-II) was determined using a protein sequencer. After digestion of CSA-II with endoproteinase Lys-C or Asp-N, the resulting peptides were purified by reversed-phase high performance liquid chromatography (HPLC) and then subjected to sequence analysis. Comparison of the complete amino acid sequence of CSA-II with the sequences of other leguminous seed lectins revealed regions of extensive homology. The amino acid sequence of a putative carbohydrate-binding domain of CSA-II was found to be similar to those of several anti-H(O) leguminous lectins, especially to that of the L-fucose-binding Ulex europaeus lectin I (UEA-I).
Margaglione, M; Santacroce, R; Colaizzo, D; Seripa, D; Vecchione, G; Lupone, M R; De Lucia, D; Fortina, P; Grandone, E; Perricone, C; Di Minno, G
2000-10-01
Congenital afibrinogenemia is a rare autosomal recessive disorder characterized by a hemorrhagic diathesis of variable severity. Although more than 100 families with this disorder have been described, genetic defects have been characterized in few cases. An investigation of a young propositus, offspring of a consanguineous marriage, with undetectable levels of functional and quantitative fibrinogen, was conducted. Sequence analysis of the fibrinogen genes showed a homozygous G-to-A mutation at the fifth nucleotide (nt 2395) of the third intervening sequence (IVS) of the gamma-chain gene. Her first-degree relatives, who had approximately half the normal fibrinogen values and showed concordance between functional and immunologic levels, were heterozygotes. The G-to-A change predicts the disappearance of a donor splice site. After transfection with a construct containing either the wild-type or the mutated sequence, cells with the mutant construct showed an aberrant messenger RNA (mRNA), consistent with skipping of exon 3, but not the expected mRNA. Sequencing of the abnormal mRNA showed the complete absence of exon 3. Skipping of exon 3 predicts the deletion of amino acid sequence from residue 16 to residue 75 and shifting of reading frame at amino acid 76 with a premature stop codon within exon 4 at position 77. Thus, the truncated gamma-chain gene product would not interact with other chains to form the mature fibrinogen molecule. The current findings show that mutations within highly conserved IVS regions of fibrinogen genes could affect the efficiency of normal splicing, giving rise to congenital afibrinogenemia.
Xiang, Kuan-Hui; Michailidis, Eleftherios; Ding, Hai; Peng, Ya-Qin; Su, Ming-Ze; Li, Yao; Liu, Xue-En; Dao Thi, Viet Loan; Wu, Xian-Fang; Schneider, William M; Rice, Charles M; Zhuang, Hui; Li, Tong
2017-02-01
As important virological markers, serum hepatitis B surface antigen (HBsAg) and hepatitis B virus (HBV) DNA levels show large fluctuations among chronic hepatitis B patients. The aim of this study was to reveal the potential impact and mechanisms of amino acid substitutions in small hepatitis B surface proteins (SHBs) on serum HBsAg and HBV DNA levels. Serum samples from 230 untreated chronic hepatitis B patients with genotype C HBV were analyzed in terms of HBV DNA levels, serological markers of HBV infection and SHBs sequences. In vitro functional analysis of the identified SHBs mutants was performed. Among 230 SHBs sequences, there were 39 (16.96%) sequences with no mutation detected (wild-type) and 191 (83.04%) with single or multiple mutations. SHBs consist of 226 amino acids, of which 104 (46.02%) had mutations in our study. Some mutations (e.g., sE2G, sL21S, sR24K, sT47A/K, sC69stop (sC69∗), sL95W, sL98V, and sG145R) negatively correlated with serum HBsAg levels. HBsAg and HBV DNA levels from this group of patients had a positive correlation (r=0.61, p<0.001). In vitro analysis showed that these mutations reduced extracellular HBsAg and HBV DNA levels by restricting virion secretion and antibody binding capacity. Virion secretion could be rescued for sE2G, sC69∗, and sG145R by co-expression of wild-type HBsAg. The serum HBsAg levels were lower in untreated CHB patients with novel SHBs mutations outside the major antigenic region than those without mutations. Underlying mechanisms include impairment of virion secretion and lower binding affinity to antibodies used for HBsAg measurements. The hepatitis B surface antigen (HBsAg) is a major viral protein of the hepatitis B virus (HBV) secreted into patient blood serum and its quantification value serves as an important marker for the evaluation of chronic HBV infection and antiviral response. We found a few new amino acid substitutions in HBsAg associated with lower serum HBsAg and HBV DNA levels. 
These different substitutions might impair virion secretion, change the ability of HBsAg to bind to antibodies, or impact HBV replication. These could all result in decreased detectable levels of serum HBsAg. The factors affecting circulating HBsAg level and HBsAg detection are varied and caution is needed when interpreting clinical significance of serum HBsAg levels. Clinical trial number: NCT01088009. Copyright © 2016 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.
WEB-server for search of a periodicity in amino acid and nucleotide sequences
NASA Astrophysics Data System (ADS)
Frenkel, F. E.; Skryabin, K. G.; Korotkov, E. V.
2017-12-01
A new web server (http://victoria.biengi.ac.ru/splinter/login.php) was designed and developed to search for periodicity in nucleotide and amino acid sequences. The web server is based on a new mathematical method of searching for multiple alignments, which relies on the optimization of position weight matrices and on two-dimensional dynamic programming. This approach allows the construction of multiple alignments of weakly similar amino acid and nucleotide sequences that have accumulated more than 1.5 substitutions per amino acid or nucleotide, without performing pairwise sequence comparisons. The article examines the principles of the web server operation and two examples of studying amino acid and nucleotide sequences, as well as the information that can be obtained using the web server.
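To make the notion of sequence periodicity concrete, the sketch below scores each candidate period by the fraction of positions that match the residue one period further along. This is a deliberately naive exact-match statistic chosen for illustration. It is not the position weight matrix and two-dimensional dynamic programming method the web server is described as using, and it would miss the heavily diverged repeats that method targets. All names are hypothetical.

```python
def lag_match_fraction(seq: str, period: int) -> float:
    """Fraction of positions i where seq[i] equals seq[i + period]."""
    if not 0 < period < len(seq):
        raise ValueError("period must be between 1 and len(seq) - 1")
    compared = len(seq) - period
    matches = sum(seq[i] == seq[i + period] for i in range(compared))
    return matches / compared


def best_period(seq: str, max_period: int):
    """Return (period, scores) for the highest-scoring period up to max_period.

    Ties are resolved in favor of the shortest period.
    """
    scores = {p: lag_match_fraction(seq, p) for p in range(1, max_period + 1)}
    return max(scores, key=scores.get), scores


period, scores = best_period("ACGT" * 5, max_period=6)
print(period, scores[period])
```

A perfect tandem repeat scores 1.0 at its period; substitutions and insertions lower the score quickly, which is the limitation that alignment-based approaches are designed to overcome.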
DeWitt, D L; Smith, W L
1988-01-01
Prostaglandin G/H synthase (8,11,14-icosatrienoate, hydrogen-donor:oxygen oxidoreductase, EC 1.14.99.1) catalyzes the first step in the formation of prostaglandins and thromboxanes, the conversion of arachidonic acid to prostaglandin endoperoxides G and H. This enzyme is the site of action of nonsteroidal anti-inflammatory drugs. We have isolated a 2.7-kilobase complementary DNA (cDNA) encompassing the entire coding region of prostaglandin G/H synthase from sheep vesicular glands. This cDNA, cloned from a lambda gt 10 library prepared from poly(A)+ RNA of vesicular glands, hybridizes with a single 2.75-kilobase mRNA species. The cDNA clone was selected using oligonucleotide probes modeled from amino acid sequences of tryptic peptides prepared from the purified enzyme. The full-length cDNA encodes a protein of 600 amino acids, including a signal sequence of 24 amino acids. Identification of the cDNA as coding for prostaglandin G/H synthase is based on comparison of amino acid sequences of seven peptides comprising 103 amino acids with the amino acid sequence deduced from the nucleotide sequence of the cDNA. The molecular weight of the unglycosylated enzyme lacking the signal peptide is 65,621. The synthase is a glycoprotein, and there are three potential sites for N-glycosylation, two of them in the amino-terminal half of the molecule. The serine reported to be acetylated by aspirin is at position 530, near the carboxyl terminus. There is no significant similarity between the sequence of the synthase and that of any other protein in amino acid or nucleotide sequence libraries, and a heme binding site(s) is not apparent from the amino acid sequence. The availability of a full-length cDNA clone coding for prostaglandin G/H synthase should facilitate studies of the regulation of expression of this enzyme and the structural features important for catalysis and for interaction with anti-inflammatory drugs. PMID:3125548
Nguyen, Thuy Thi Thu; Nguyen, Hai Trong; Wang, Pei-Chyi; Chen, Shih-Chu
2017-08-01
Tumor necrosis factor-alpha (TNF-α) and interleukin-8 (IL-8/CXCL8) play pivotal roles in mediating inflammatory responses to invading pathogens. In this study, we identified cobia TNF-α and IL-8 and analyzed their expression during Streptococcus dysgalactiae infection. The cloned cDNA transcript of cobia TNF-α comprised 1281 base pairs (bp), with a 774-bp open reading frame (ORF) encoding 257 amino acids. The deduced amino acid sequence of cobia TNF-α showed a close relationship (84% similarity) with TNF-α of yellowtail amberjack. The cloned IL-8 cDNA sequence was 828 bp long, including a 300-bp ORF encoding 99 amino acids. The deduced amino acid sequence of cobia IL-8 shared 90% identity with IL-8 of striped trumpeter. Cobia challenged with a virulent S. dysgalactiae strain displayed an early significant up-regulation of TNF-α and IL-8 in head kidney, liver, and spleen. Notably, the IL-8 expression level increased dramatically in the liver at the severe stage of infection (72 h). In conclusion, a better understanding of TNF-α and IL-8 allows more detailed investigation of immune responses in cobia and supports further study on controlling the infectious disease caused by S. dysgalactiae. Copyright © 2017 Elsevier Ltd. All rights reserved.
The advent of biosimilars: challenges and risks.
Müller, Rüdiger; Renner, Christoph; Gabay, Cem; Cassata, Giuseppe; Lohri, Andreas; Hasler, Paul
2014-01-01
Biosimilars represent a new class of medicinal products that will have significant impact on clinical use. They are identical on an amino acid sequence level to existing reference biopharmaceutical products (originals). However, they may exhibit differences on a protein level. This paper provides a brief overview of biosimilar development and describes the risk and challenges that should be considered during the admission of biosimilars into the clinic.
Martínez-Quintana, José A; Peregrino-Uriarte, Alma B; Gollas-Galván, Teresa; Gómez-Jiménez, Silvia; Yepiz-Plascencia, Gloria
2014-12-01
During hypoxia the shrimp Litopenaeus vannamei accelerates anaerobic glycolysis to obtain energy; therefore, a correct supply of glucose to the cells is needed. Facilitated glucose transport across the cell membrane is mediated by a group of membrane-embedded integral proteins called GLUT, with GLUT1 being the most ubiquitous form. In this work, we report the first cDNA nucleotide and deduced amino acid sequences of a glucose transporter 1 from L. vannamei. A 1619 bp sequence was obtained by RT-PCR and RACE approaches. The 5' UTR is 161 bp and the poly A tail is exactly after the stop codon in the mRNA. The ORF is 1485 bp and codes for 485 amino acids. The deduced protein sequence has high identity to GLUT1 proteins from several species and contains all the main features of glucose transporter proteins, including twelve transmembrane domains and the conserved motifs and amino acids involved in transport activity, ligand binding and membrane anchoring. Therefore, we decided to name this sequence glucose transporter 1 of L. vannamei (LvGLUT1). A partial gene sequence of 8.87 Kbp was also obtained; it contains the complete coding sequence divided into 10 exons. LvGlut1 expression was detected in hemocytes, hepatopancreas, intestine, gills, muscle and pleopods. The highest relative expression was found in gills and the lowest in hemocytes. This indicates that LvGlut1 is ubiquitously expressed but that its levels are tissue-specific; upon short-term hypoxia, the GLUT1 transcripts increase 3.7-fold in hepatopancreas and gills. To our knowledge, this is the first evidence of expression of GLUT1 in crustaceans.
Analysis of the mitochondrial genome of cheetahs (Acinonyx jubatus) with neurodegenerative disease.
Burger, Pamela A; Steinborn, Ralf; Walzer, Christian; Petit, Thierry; Mueller, Mathias; Schwarzenberger, Franz
2004-08-18
The complete mitochondrial genome of Acinonyx jubatus was sequenced and mitochondrial DNA (mtDNA) regions were screened for polymorphisms as candidates for the cause of a neurodegenerative demyelinating disease affecting captive cheetahs. The mtDNA reference sequences were established on the basis of the complete sequences of two diseased and two nondiseased animals as well as partial sequences of 26 further individuals. The A. jubatus mitochondrial genome is 17,047-bp long and shows a high sequence similarity (91%) to the domestic cat. Based on single nucleotide polymorphisms (SNPs) in the control region (CR) and pedigree information, the 18 myelopathic and 12 non-myelopathic cheetahs included in this study were classified into haplotypes I, II and III. In view of the phenotypic comparability of the neurodegenerative disease observed in cheetahs and human mtDNA-associated diseases, specific coding regions including the tRNAs leucine UUR, lysine, serine UCN, and partial complex I and V sequences were screened. We identified a heteroplasmic and a homoplasmic SNP at codon 507 in the subunit 5 (MTND5) of complex I. The heteroplasmic haplotype I-specific valine to methionine substitution represents a nonconservative amino acid change and was found in 11 myelopathic and eight non-myelopathic cheetahs with levels ranging from 29% to 79%. The homoplasmic conservative amino acid substitution valine to alanine was identified in two myelopathic animals of haplotype II. In addition, a synonymous SNP in the codon 76 of the MTND4L gene was found in the single haplotype III animal. The amino acid exchanges in the MTND5 gene were not associated with the occurrence of neurodegenerative disease in captive cheetahs.
Krishnan, Neeraja M; Seligmann, Hervé; Rao, Basuthkar J
2008-01-28
Synonymous sites are freer to vary because of redundancy in genetic code. Messenger RNA secondary structure restricts this freedom, as revealed by previous findings in mitochondrial genes that mutations at third codon position nucleotides in helices are more selected against than those in loops. This motivated us to explore the constraints imposed by mRNA secondary structure on evolutionary variability at all codon positions in general, in chloroplast systems. We found that the evolutionary variability and intrinsic secondary structure stability of these sequences share an inverse relationship. Simulations of most likely single nucleotide evolution in Psilotum nudum and Nephroselmis olivacea mRNAs, indicate that helix-forming propensities of mutated mRNAs are greater than those of the natural mRNAs for short sequences and vice-versa for long sequences. Moreover, helix-forming propensity estimated by the percentage of total mRNA in helices increases gradually with mRNA length, saturating beyond 1000 nucleotides. Protection levels of functionally important sites vary across plants and proteins: r-strategists minimize mutation costs in large genes; K-strategists do the opposite. Mrna length presumably predisposes shorter mRNAs to evolve under different constraints than longer mRNAs. The positive correlation between secondary structure protection and functional importance of sites suggests that some sites might be conserved due to packing-protection constraints at the nucleic acid level in addition to protein level constraints. Consequently, nucleic acid secondary structure a priori biases mutations. The converse (exposure of conserved sites) apparently occurs in a smaller number of cases, indicating a different evolutionary adaptive strategy in these plants. The differences between the protection levels of functionally important sites for r- and K-strategists reflect their respective molecular adaptive strategies. 
These converge with increasing domestication levels of K-strategists, perhaps because domestication increases reproductive output.
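The "percentage of total mRNA in helices" measure mentioned above can be illustrated with a minimal sketch. It assumes the predicted secondary structure is already available as a dot-bracket string (parentheses for paired bases, dots for unpaired ones); the abstract does not say which folding program or notation the authors used, and the function name and example structure are hypothetical.

```python
def helix_percentage(dot_bracket: str) -> float:
    """Return the percentage of nucleotides that are base-paired.

    Assumes dot-bracket notation: '(' and ')' mark paired bases,
    '.' marks unpaired bases (loops and single-stranded regions).
    """
    if not dot_bracket:
        raise ValueError("structure string is empty")
    paired = sum(1 for ch in dot_bracket if ch in "()")
    return 100.0 * paired / len(dot_bracket)


# Made-up toy structure, for illustration only.
toy_structure = "((((....))))..((...))"
print(round(helix_percentage(toy_structure), 1))
```

Computing this value for mRNAs of different lengths would give the kind of length-versus-helix-content curve the abstract describes, under the stated assumptions.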
PubDNA Finder: a web database linking full-text articles to sequences of nucleic acids.
García-Remesal, Miguel; Cuevas, Alejandro; Pérez-Rey, David; Martín, Luis; Anguita, Alberto; de la Iglesia, Diana; de la Calle, Guillermo; Crespo, José; Maojo, Víctor
2010-11-01
PubDNA Finder is an online repository that we have created to link PubMed Central manuscripts to the sequences of nucleic acids appearing in them. It extends the search capabilities provided by PubMed Central by enabling researchers to perform advanced searches involving sequences of nucleic acids. This includes, among other features (i) searching for papers mentioning one or more specific sequences of nucleic acids and (ii) retrieving the genetic sequences appearing in different articles. These additional query capabilities are provided by a searchable index that we created by using the full text of the 176 672 papers available at PubMed Central at the time of writing and the sequences of nucleic acids appearing in them. To automatically extract the genetic sequences occurring in each paper, we used an original method we have developed. The database is updated monthly by automatically connecting to the PubMed Central FTP site to retrieve and index new manuscripts. Users can query the database via the web interface provided. PubDNA Finder can be freely accessed at http://servet.dia.fi.upm.es:8080/pubdnafinder
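The abstract does not describe the extraction method itself, so the following is only a hedged sketch of the general idea: find runs of nucleotide letters in full text and build an inverted index from each sequence to the articles that mention it. The regular expression, the minimum length of 10 bases, and the article identifiers are all assumptions made for illustration, not PubDNA Finder's actual implementation.

```python
import re
from collections import defaultdict


def find_sequences(text, min_len=10):
    """Return upper-cased runs of A/C/G/T/U of at least min_len letters.

    Hypothetical heuristic: a run must not be embedded in a longer word.
    """
    pattern = re.compile(
        r"(?<![A-Za-z])[ACGTUacgtu]{%d,}(?![A-Za-z])" % min_len
    )
    return [m.group(0).upper() for m in pattern.finditer(text)]


def build_index(articles):
    """Map each extracted sequence to the set of article IDs containing it."""
    index = defaultdict(set)
    for article_id, text in articles.items():
        for seq in find_sequences(text):
            index[seq].add(article_id)
    return index


# Made-up article IDs and text, for illustration only.
articles = {
    "article-A": "The forward primer 5'-ATGGCGTACGTTAGC-3' was used.",
    "article-B": "We reused primer ATGGCGTACGTTAGC and probe GGCCTTAAGGCC.",
}
index = build_index(articles)
print(sorted(index["ATGGCGTACGTTAGC"]))
```

A lookup in such an index corresponds to the first query type in the abstract (papers mentioning a given sequence); iterating over one article's text with `find_sequences` corresponds to the second.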
Decoding DNA, RNA and peptides with quantum tunnelling
NASA Astrophysics Data System (ADS)
di Ventra, Massimiliano; Taniguchi, Masateru
2016-02-01
Drugs and treatments could be precisely tailored to an individual patient by extracting their cellular- and molecular-level information. For this approach to be feasible on a global scale, however, information on complete genomes (DNA), transcriptomes (RNA) and proteomes (all proteins) needs to be obtained quickly and at low cost. Quantum mechanical phenomena could potentially be of value here, because the biological information needs to be decoded at an atomic level and quantum tunnelling has recently been shown to be able to differentiate single nucleobases and amino acids in short sequences. Here, we review the different approaches to using quantum tunnelling for sequencing, highlighting the theoretical background to the method and the experimental capabilities demonstrated to date. We also explore the potential advantages of the approach and the technical challenges that must be addressed to deliver practical quantum sequencing devices.
Lei, Yong-Liang; Wang, Xiao-Guang; Liu, Fu-Ming; Chen, Xiu-Ying; Ye, Bi-Feng; Mei, Jian-Hua; Lan, Jin-Quan; Tang, Qing
2009-08-01
Based on sequencing the full-length genomes of rabies viruses from two Chinese ferret badgers, we analyzed the genetic variation of the viruses at the molecular level to obtain information on the prevalence and variation of rabies viruses in Zhejiang, and to enrich the genome database of rabies virus street strains isolated from Chinese wildlife. Overlapping fragments were amplified by RT-PCR and the full-length genomes were assembled to analyze the nucleotide and deduced protein similarities and to perform phylogenetic analyses of the N genes from Chinese ferret badger, sika deer, vole, dog. Vaccine strains were then determined. The two full-length genomes were completely sequenced and found to have the same genetic structure of 11 923 nts, including 58 nts-Leader, 1353 nts-NP, 894 nts-PP, 609 nts-MP, 1575 nts-GP, 6386 nts-LP, and 2, 5, 5 nts-intergenic regions (IGRs), 423 nts-Pseudogene-like sequence (Psi), 70 nts-Trailer. By BLAST and multiple sequence alignment, the two full-length genomes were consistent with the properties of Rhabdoviridae Lyssavirus. The nucleotide and amino acid sequences among Chinese strains had the highest similarity, especially among animals of the same species. For the two full-length genomes, the similarity at the amino acid level was markedly higher than that at the nucleotide level, so the nucleotide mutations in these two genomes were most probably synonymous. Compared with the reference rabies viruses, the lengths of the five protein-coding regions showed no changes or recombination, only a few point mutations. The five proteins therefore appeared to be stable. The variation sites and types of the two ferret badger genomes were similar to those of the reference vaccine or street strains. According to the multiple sequence alignment and phylogenetic analyses, the two strains were genotype 1 and possessed the distinct geographic characteristics of China. 
All the evidence suggested that these two ferret badger rabies viruses were likely street viruses already circulating in wildlife.
Schoone, G J; Oskam, L; Kroon, N C; Schallig, H D; Omar, S A
2000-11-01
A quantitative nucleic acid sequence-based amplification (QT-NASBA) assay for the detection of Plasmodium parasites has been developed. Primers and probes were selected on the basis of the sequence of the small-subunit rRNA gene. Quantification was achieved by coamplification of the RNA in the sample with one modified in vitro RNA as a competitor in a single-tube NASBA reaction. Parasite densities ranging from 10 to 10(8) Plasmodium falciparum parasites per ml could be demonstrated and quantified in whole blood. This is approximately 1,000 times more sensitive than conventional microscopy analysis of thick blood smears. Comparison of the parasite densities obtained by microscopy and QT-NASBA with 120 blood samples from Kenyan patients with clinical malaria revealed that for 112 of 120 (93%) of the samples results were within a 1-log difference. QT-NASBA may be especially useful for the detection of low parasite levels in patients with early-stage malaria and for the monitoring of the efficacy of drug treatment.
The domestication of the probiotic bacterium Lactobacillus acidophilus
Bull, Matthew J.; Jolley, Keith A.; Bray, James E.; Aerts, Maarten; Vandamme, Peter; Maiden, Martin C. J.; Marchesi, Julian R.; Mahenthiralingam, Eshwar
2014-01-01
Lactobacillus acidophilus is a Gram-positive lactic acid bacterium that has had widespread historical use in the dairy industry and more recently as a probiotic. Although L. acidophilus has been designated as safe for human consumption, increasing commercial regulation and clinical demands for probiotic validation has resulted in a need to understand its genetic diversity. By drawing on large, well-characterised collections of lactic acid bacteria, we examined L. acidophilus isolates spanning 92 years and including multiple strains in current commercial use. Analysis of the whole genome sequence data set (34 isolate genomes) demonstrated L. acidophilus was a low diversity, monophyletic species with commercial isolates essentially identical at the sequence level. Our results indicate that commercial use has domesticated L. acidophilus with genetically stable, invariant strains being consumed globally by the human population. PMID:25425319
DOE Office of Scientific and Technical Information (OSTI.GOV)
Peters, J.; Peters, M.; Lottspeich, F.
1987-11-01
The complete nucleotide sequence of the gene encoding the surface (hexagonally packed intermediate (HPI))-layer polypeptide of Deinococcus radiodurans Sark was determined and found to encode a polypeptide of 1036 amino acids. Amino acid sequence analysis of about 30% of the residues revealed that the mature polypeptide consists of at least 978 amino acids. The N terminus was blocked to Edman degradation. The results of proteolytic modification of the HPI layer in situ and M_r estimations of the HPI polypeptide expressed in Escherichia coli indicated that there is a leader sequence. The N-terminal region contained a very high percentage (29%) of threonine and serine, including a cluster of nine consecutive serine or threonine residues, whereas a stretch near the C terminus was extremely rich in aromatic amino acids (29%). The protein contained at least two disulfide bridges, as well as tightly bound reducing sugars and fatty acids.
Artificial mismatch hybridization
Guo, Zhen; Smith, Lloyd M.
1998-01-01
An improved nucleic acid hybridization process is provided which employs a modified oligonucleotide and improves the ability to discriminate a control nucleic acid target from a variant nucleic acid target containing a sequence variation. The modified probe contains at least one artificial mismatch relative to the control nucleic acid target in addition to any mismatch(es) arising from the sequence variation. The invention has direct and advantageous application to numerous existing hybridization methods, including, applications that employ, for example, the Polymerase Chain Reaction, allele-specific nucleic acid sequencing methods, and diagnostic hybridization methods.
Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe
Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.
2000-01-01
A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.
2011-01-01
Background: A deficiency in phaseolin and phytohemagglutinin is associated with a near doubling of sulfur amino acid content in genetically related lines of common bean (Phaseolus vulgaris), particularly cysteine, elevated by 70%, and methionine, elevated by 10%. This mostly takes place at the expense of an abundant non-protein amino acid, S-methyl-cysteine. The deficiency in phaseolin and phytohemagglutinin is mainly compensated by increased levels of the 11S globulin legumin and residual lectins. Legumin, albumin-2, defensin and albumin-1 were previously identified as contributing to the increased sulfur amino acid content in the mutant line, on the basis of similarity to proteins from other legumes. Results: Profiling of free amino acids in developing seeds of the BAT93 reference genotype revealed a biphasic accumulation of gamma-glutamyl-S-methyl-cysteine, the main soluble form of S-methyl-cysteine, with a lag phase occurring during storage protein accumulation. A collection of 30,147 expressed sequence tags (ESTs) was generated from four developmental stages, corresponding to distinct phases of gamma-glutamyl-S-methyl-cysteine accumulation, and covering the transitions to reserve accumulation and desiccation. Analysis of gene ontology categories indicated the occurrence of multiple sulfur metabolic pathways, including all enzymatic activities responsible for sulfate assimilation, de novo cysteine and methionine biosynthesis. Integration of genomic and proteomic data enabled the identification and isolation of cDNAs coding for legumin, albumin-2, defensin D1 and albumin-1A and -B induced in the absence of phaseolin and phytohemagglutinin. Their deduced amino acid sequences have a higher content of cysteine than methionine, providing an explanation for the preferential increase of cysteine in the mutant line. 
Conclusion: The EST collection provides a foundation to further investigate sulfur metabolism and the differential accumulation of sulfur amino acids in seed of common bean. Identification of sulfur-rich proteins whose levels are elevated in seed lacking phaseolin and phytohemagglutinin and sulfur metabolic genes may assist the improvement of protein quality. PMID:21615926
Kirsch, Christoph; Takamiya-Wik, Monica; Reinold, Susanne; Hahlbrock, Klaus; Somssich, Imre E.
1997-01-01
Parsley (Petroselinum crispum) plants and suspension-cultured cells have been used extensively for studies of non-host-resistance mechanisms in plant/pathogen interactions. We now show that treatment of cultured parsley cells with a defined peptide elicitor of fungal origin causes rapid and large changes in the levels of various unsaturated fatty acids. While linoleic acid decreased and linolenic acid increased steadily for several hours, comparatively sharp increases in oleic acid followed a biphasic time course. In contrast, the overall level of stearic acid remained unaffected. Using a PCR-based approach, a parsley cDNA was isolated sharing high sequence similarity with ω-3 fatty acid desaturases. Subsequent isolation and characterization of a full-length cDNA enabled its functional identification as a plastid-localized ω-3 fatty acid desaturase by complementation of the Arabidopsis thaliana fad7/8 double mutant which is low in trienoic fatty acids. ω-3 Fatty acid desaturase mRNA accumulated rapidly and transiently in elicitor-treated cultured parsley cells, protoplasts, and leaves, as well as highly localized around fungal infection sites in parsley leaf buds. These results indicate that unsaturated fatty acid metabolism is yet another component of the highly complex, transcriptionally regulated pathogen defense response in plants. PMID:9050908
An artificial molecular machine that builds an asymmetric catalyst
NASA Astrophysics Data System (ADS)
De Bo, Guillaume; Gall, Malcolm A. Y.; Kuschel, Sonja; De Winter, Julien; Gerbaux, Pascal; Leigh, David A.
2018-05-01
Biomolecular machines perform types of complex molecular-level tasks that artificial molecular machines can aspire to. The ribosome, for example, translates information from the polymer track it traverses (messenger RNA) to the new polymer it constructs (a polypeptide). The sequence and number of codons read determines the sequence and number of building blocks incorporated into the biomachine-synthesized polymer. However, neither control of sequence nor the transfer of length information from one polymer to another (which to date has only been accomplished in man-made systems through template synthesis) is easily achieved in the synthesis of artificial macromolecules. Rotaxane-based molecular machines have been developed that successively add amino acids (including β-amino acids) to a growing peptide chain by the action of a macrocycle moving along a mono-dispersed oligomeric track derivatized with amino-acid phenol esters. The threaded macrocycle picks up groups that block its path and links them through successive native chemical ligation reactions to form a peptide sequence corresponding to the order of the building blocks on the track. Here, we show that as an alternative to translating sequence information, a rotaxane molecular machine can transfer the narrow polydispersity of a leucine-ester-derivatized polystyrene chain synthesized by atom transfer radical polymerization to a molecular-machine-made homo-leucine oligomer. The resulting narrow-molecular-weight oligomer folds to an α-helical secondary structure that acts as an asymmetric catalyst for the Juliá-Colonna epoxidation of chalcones.
Kurosu, Y; Murayama, K; Shindo, N; Shisa, Y; Ishioka, N
1996-11-01
This initial report proposes a protein sequence analysis system with DL differentiation using capillary electrophoresis (CE). The system consists of a protein sequencer and a CE system. After fractionation of phenyl-thiohydantoin (PTH)-amino acids using the protein sequencer, optical resolution of each PTH-amino acid is performed by CE using chiral selectors such as digitonin, beta-escin and others. [D-Ala2]-methionine enkephalin (L-Tyr-D-Ala-Gly-L-Phe-L-Met) was used as a model peptide, and its sequence with DL differentiation was determined, with the exception of the fourth amino acid, L-Phe, using the proposed system.
Li, Yang; Ren, Yi
2017-01-01
Pseudomonas sp. QTF5 was isolated from the continuous permafrost near the bitumen layers in the Qiangtang basin of Qinghai-Tibetan Plateau in China (5,111 m above sea level). It is psychrotolerant and highly and widely tolerant to heavy metals and has the ability to metabolize benzoic acid and salicylic acid. To gain insight into the genetic basis for its adaptation, we performed whole genome sequencing and analyzed the resistance genes and metabolic pathways. Based on 120 published and annotated genomes representing 31 species in the genus Pseudomonas, in silico genomic DNA-DNA hybridization (<54%) and average nucleotide identity calculation (<94%) revealed that QTF5 is closest to Pseudomonas lini and should be classified into a novel species. This study provides the genetic basis to identify the genes linked to its specific mechanisms for adaptation to extreme environments and for application of this microorganism in environmental conservation. PMID:29270429
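The species-level reasoning in this abstract can be sketched as a simple threshold check. The cutoffs below (70% in silico DNA-DNA hybridization and 95% average nucleotide identity) are the conventional species boundaries and are an assumption here; the abstract reports only the upper bounds observed for QTF5 (<54% and <94%) and does not state which cutoffs the authors applied. Names are hypothetical.

```python
# Conventional species-delineation cutoffs (assumed, not taken from the abstract).
DDH_SPECIES_CUTOFF = 70.0   # percent, in silico DNA-DNA hybridization
ANI_SPECIES_CUTOFF = 95.0   # percent, average nucleotide identity


def below_species_cutoffs(best_ddh: float, best_ani: float) -> bool:
    """True if the best match to any reference genome falls below both cutoffs."""
    return best_ddh < DDH_SPECIES_CUTOFF and best_ani < ANI_SPECIES_CUTOFF


# Upper bounds reported for QTF5 against the 120 reference genomes.
print(below_species_cutoffs(best_ddh=54.0, best_ani=94.0))
```

Because both reported bounds sit below the assumed cutoffs, the check is consistent with the abstract's conclusion that QTF5 represents a novel species.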
Verma, Pankaj; Pandey, Prashant Kumar; Gupta, Arvind Kumar; Seong, Chi Nam; Park, Seong Chan; Choe, Han Na; Baik, Keun Sik; Patole, Milind Shivaji; Shouche, Yogesh Shreepad
2012-10-01
We have carried out a polyphasic taxonomic characterization of Bacillus beijingensis DSM 19037(T) and Bacillus ginsengi DSM 19038(T), which are closely related phylogenetically to Bhargavaea cecembensis LMG 24411(T). All three strains are Gram-stain-positive, non-motile, moderately halotolerant and non-spore-forming. 16S rRNA gene sequence analyses showed that the strains constituted a coherent cluster, with sequence similarities between 99.7 and 98.7 %. The percentage similarity on the basis of amino acid sequences deduced from partial gyrB gene nucleotide sequences of these three type strains was 96.1-92.7 %. Phylogenetic trees based on the 16S rRNA gene and GyrB amino acid sequences, obtained by using three different algorithms, were consistent and showed that these three species constituted a deeply rooted cluster separated from the clades represented by the genera Bacillus, Planococcus, Planomicrobium, Sporosarcina, Lysinibacillus, Viridibacillus, Kurthia and Geobacillus, supporting their placement in the genus Bhargavaea. All three type strains have menaquinone MK-8 as the major respiratory quinone and showed similar fatty acid profiles. The main polar lipids present in the three type strains were diphosphatidylglycerol and phosphatidylglycerol, and the three strains showed peptidoglycan type A4α with L-lysine as the diagnostic diamino acid. The DNA G+C contents of Bacillus beijingensis DSM 19037(T), Bacillus ginsengi DSM 19038(T) and Bhargavaea cecembensis LMG 24411(T) were 53.1, 50.2 and 53.7 mol%, respectively. The level of DNA-DNA hybridization among the three strains was 57-39 %, indicating that they are members of different species of the genus Bhargavaea. The phenotypic data are consistent with the placement of these three species in a single genus and support their differentiation at the species level. 
On the basis of these data, we have emended the description of the genus Bhargavaea and propose the reclassification of Bacillus beijingensis and Bacillus ginsengi to the genus Bhargavaea, as Bhargavaea beijingensis comb. nov. (type strain ge10(T) = DSM 19037(T) = CGMCC 1.6762(T)) and Bhargavaea ginsengi comb. nov. (type strain ge14(T) = DSM 19038(T) = CGMCC 1.6763(T)).
Averbeck, N B; Borghouts, C; Hamann, A; Specke, V; Osiewacz, H D
2001-01-01
The lifespan of the ascomycete Podospora anserina was previously demonstrated to be significantly increased in a copper-uptake mutant, suggesting that copper is a potential stressor involved in degenerative processes. In order to determine whether changes in copper stress occur in the cells during normal aging of cultures, we cloned and characterized a gene coding for a component of the molecular machinery involved in the control of copper homeostasis. This gene, PaMt1, is a single-copy gene that encodes a metallothionein of 26 amino acids. The coding sequence of PaMt1 is interrupted by a single intron. The deduced amino acid sequence shows a high degree of sequence identity to metallothioneins of the filamentous ascomycete Neurospora crassa and the basidiomycete Agaricus bisporus, and to the N-terminal portion of mammalian metallothioneins. Levels of PaMt1 transcript increase in response to elevated amounts of copper in the growth medium and during aging of wild-type cultures. In contrast, in the long-lived mutant grisea, transcript levels first increase but then decrease again. The ability of wild-type cultures to respond to exogenous copper stress via the induction of PaMt1 transcription is not affected as they grow older.
Hsieh, S L; Liu, R W; Wu, C H; Cheng, W T; Kuo, Ching-Ming
2003-12-01
A cDNA sequence of stearoyl-CoA desaturase (SCD) was determined from zebrafish (Danio rerio) and compared to the corresponding genes in several teleosts. Zebrafish SCD cDNA has a size of 1,061 bp, encodes a polypeptide of 325 amino acids, and shares 88, 85, 84, and 83% similarities with tilapia (Oreochromis mossambicus), grass carp (Ctenopharyngodon idella), common carp (Cyprinus carpio), and milkfish (Chanos chanos), respectively. This 1,061 bp sequence specifies a protein that, in common with other fatty acid desaturases, contains three histidine boxes, believed to be involved in catalysis. These observations suggested that SCD genes are highly conserved. In addition, an oligonucleotide probe complementary to zebrafish SCD mRNA was hybridized to mRNA of approximately 396 bases with Northern blot analysis. The Northern blot and RT-PCR analyses showed that the SCD mRNA was expressed predominantly in the liver, intestine, gill, and muscle, while a lower level was found in the brain. Furthermore, we utilized whole-mount in situ hybridization and real-time quantitative RT-PCR to identify expression of the zebrafish SCD gene at five different stages of development. This revealed that very high levels of transcripts were found in zebrafish at all stages during embryogenesis and early development. Copyright 2003 Wiley-Liss, Inc.
Characterization of Two Cysteine Transfer RNA Genes from Xenopus Laevis
1984-07-12
containing amino acids glycine, alanine and serine, are produced by the posterior silk gland of Bombyx mori and therefore high level of tRNAgly, tRNA^Ia...1979) Studies on tRNA adaptation, tRNA turnover, precursor tRNA and tRNA gene distribution in Bombyx mori by using two-dimensional polyacrylamide gel...Nucleic Acids Research, 1^, 8537-8546. 26. Garber, R.L. and Gage, L.P. (1979) Transcription of a cloned Bombyx mori tRNA^2 gene: Nucleotide sequence of
Campion, S R; Ameen, A S; Lai, L; King, J M; Munzenmaier, T N
2001-08-15
This report describes the application of a simple computational tool, AAPAIR.TAB, for the systematic analysis of the cysteine-rich EGF, Sushi, and Laminin motif/sequence families at the two-amino acid level. Automated dipeptide frequency/bias analysis detects preferences in the distribution of amino acids in established protein families, by determining which "ordered dipeptides" occur most frequently in comprehensive motif-specific sequence data sets. Graphic display of the dipeptide frequency/bias data revealed family-specific preferences for certain dipeptides, but more importantly detected a shared preference for employment of the ordered dipeptides Gly-Tyr (GY) and Gly-Phe (GF) in all three protein families. The dipeptide Asn-Gly (NG) also exhibited high-frequency and bias in the EGF and Sushi motif families, whereas Asn-Thr (NT) was distinguished in the Laminin family. Evaluation of the distribution of dipeptides identified by frequency/bias analysis subsequently revealed the highly restricted localization of the G(F/Y) and N(G/T) sequence elements at two separate sites of extreme conservation in the consensus sequence of all three sequence families. The similar employment of the high-frequency/bias dipeptides in three distinct protein sequence families was further correlated with the concurrence of these shared molecular determinants at similar positions within the distinctive scaffolds of three structurally divergent, but similarly employed, motif modules.
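A dipeptide frequency/bias analysis of the kind described above can be sketched as follows. This is not AAPAIR.TAB itself, whose internals the abstract does not give; it is a minimal illustration that counts ordered dipeptides and defines bias as observed dipeptide frequency divided by the frequency expected from single-residue composition. That bias definition, the function name, and the toy sequences are assumptions.

```python
from collections import Counter


def dipeptide_bias(sequences):
    """Return {dipeptide: (count, bias)} for a collection of protein sequences.

    bias = observed dipeptide frequency / (freq(first residue) * freq(second residue)),
    so values above 1 indicate an over-represented ordered pair.
    """
    singles = Counter()
    pairs = Counter()
    for seq in sequences:
        seq = seq.upper()
        singles.update(seq)
        pairs.update(seq[i:i + 2] for i in range(len(seq) - 1))

    total_singles = sum(singles.values())
    total_pairs = sum(pairs.values())
    result = {}
    for pair, count in pairs.items():
        observed = count / total_pairs
        expected = (singles[pair[0]] / total_singles) * (singles[pair[1]] / total_singles)
        result[pair] = (count, observed / expected)
    return result


# Made-up toy sequences, for illustration only.
toy_family = ["CNGYCGF", "CGYNGC"]
table = dipeptide_bias(toy_family)
for pair, (count, bias) in sorted(table.items(), key=lambda kv: -kv[1][0])[:3]:
    print(pair, count, round(bias, 2))
```

Note that "GY" and "YG" are counted separately, which matches the abstract's emphasis on ordered dipeptides.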
Brunak, S; Engelbrecht, J
1996-06-01
A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
Heim, Crystal B; Gillman, Jason D
2017-01-05
Soybean oil is highly unsaturated but oxidatively unstable, rendering it nonideal for food applications. Until recently, the majority of soybean oil underwent partial chemical hydrogenation, which produces trans fats as an unavoidable consequence. Dietary intake of trans fats and most saturated fats is conclusively linked to negative impacts on cholesterol levels and cardiovascular health. Two major soybean oil breeding targets are: (1) to reduce or eliminate the need for chemical hydrogenation, and (2) to replace the functional properties of partially hydrogenated soybean oil. One potential solution is the elevation of seed stearic acid, a saturated fat which has no negative impacts on cardiovascular health, from 3 to 4% in typical cultivars to > 20% of the seed oil. We performed QTL analysis of a population developed by crossing two mutant lines: one with a missense mutation affecting a stearoyl-acyl-carrier protein desaturase gene, resulting in ∼11% seed stearic acid, and another, A6, which has 24-28% seed stearic acid. Genotyping-by-sequencing (GBS)-based QTL mapping identified 21 minor and major effect QTL for six seed oil related traits and plant height. The inheritance of a large genomic deletion affecting chromosome 14 is the basis for the largest-effect QTL, resulting in ∼18% seed stearic acid. This deletion contains SACPD-C and another gene(s); loss of both genes boosts seed stearic acid levels to ≥ 18%. Unfortunately, this genomic deletion has been shown in previous studies to be inextricably correlated with reduced seed yield. Our results will help inform and guide ongoing breeding efforts to improve soybean oil oxidative stability. Copyright © 2017 Heim and Gillman.
Gordonia caeni sp. nov., isolated from sludge of a sewage disposal plant.
Srinivasan, Sathiyaraj; Park, Giho; Yang, Hyejin; Hwang, Supyong; Bae, Yoonjung; Jung, Yong-An; Kim, Myung Kyum; Lee, Myungjin
2012-11-01
A Gram-stain-positive, strictly aerobic, short-rod-shaped, non-motile strain (designated MJ32(T)) was isolated from a sludge sample of the Daejeon sewage disposal plant in South Korea. A polyphasic approach was applied to study the taxonomic position of strain MJ32(T). Strain MJ32(T) showed highest 16S rRNA gene sequence similarity to Gordonia hirsuta DSM 44140(T) (98.1%) and Gordonia hydrophobica DSM 44015(T) (97.0%); levels of sequence similarity to the type strains of other recognized Gordonia species were less than 97.0%. Phylogenetic analysis based on 16S rRNA gene sequences showed that strain MJ32(T) belonged to the clade formed by members of the genus Gordonia in the family Gordoniaceae. The G+C content of the genomic DNA of strain MJ32(T) was 69.2 mol%. Chemotaxonomically, strain MJ32(T) showed features typical of the genus Gordonia. The predominant respiratory quinone was MK-9(H(2)), the mycolic acids present had C(56)-C(60) carbon atoms, and the major fatty acids were C(16:0) (34.6%), tuberculostearic acid (21.8%), C(16:1)ω7c (19.5%) and C(18:1)ω9c (12.7%). The peptidoglycan type was based on meso-2,6-diaminopimelic acid as the diagnostic diamino acid with glycolated sugars. On the basis of phylogenetic inference, fatty acid profile and other phenotypic properties, strain MJ32(T) is considered to represent a novel species of the genus Gordonia, for which the name Gordonia caeni sp. nov. is proposed. The type strain is MJ32(T) (=KCTC 19771(T)=JCM 16923(T)).
Sounds of silence: synonymous nucleotides as a key to biological regulation and complexity
Shabalina, Svetlana A.; Spiridonov, Nikolay A.; Kashina, Anna
2013-01-01
Messenger RNA is a key component of an intricate regulatory network of its own. It accommodates numerous nucleotide signals that overlap protein coding sequences and are responsible for multiple levels of regulation and generation of biological complexity. A wealth of structural and regulatory information, which mRNA carries in addition to the encoded amino acid sequence, raises the question of how these signals and overlapping codes are delineated along non-synonymous and synonymous positions in protein coding regions, especially in eukaryotes. Silent or synonymous codon positions, which do not determine amino acid sequences of the encoded proteins, define mRNA secondary structure and stability and affect the rate of translation, folding and post-translational modifications of nascent polypeptides. RNA-level selection acts on synonymous sites in both prokaryotes and eukaryotes and is more common than previously thought. Selection pressure on the coding gene regions follows a three-nucleotide periodic pattern of nucleotide base-pairing in mRNA, which is imposed by the genetic code. Synonymous positions of the coding regions have a higher level of hybridization potential relative to non-synonymous positions, and are multifunctional in their regulatory and structural roles. Recent experimental evidence and analysis of mRNA structure and interspecies conservation suggest that there is an evolutionary tradeoff between selective pressure acting at the RNA and protein levels. Here we provide a comprehensive overview of the studies that define the role of silent positions in regulating RNA structure and processing that exert downstream effects on proteins and their functions. PMID:23293005
Hemalatha, G. R.; Rao, D. Satyanarayana; Guruprasad, L.
2007-01-01
We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A “repeat” corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A “domain” corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 57-amino-acid-residue PxV domain, (2) 122-amino-acid-residue FxF domain, (3) 111-amino-acid-residue YEFF domain, (4) 109-amino-acid-residue IMxxH domain, (5) 103-amino-acid-residue VxxT domain, (6) 84-amino-acid-residue ExW domain, (7) 104-amino-acid-residue NTGFIG domain, (8) 36-amino-acid-residue NxGK repeat, (9) 95-amino-acid-residue VYV domain, (10) 75-amino-acid-residue KEWE domain, (11) 59-amino-acid-residue AFL domain, (12) 53-amino-acid-residue RIDVK repeat, (13) (a) 41-amino-acid-residue AGQF repeat and (b) 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure. PMID:17538688
Code of Federal Regulations, 2010 CFR
2010-07-01
... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Form and format for... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... Code for Information Interchange (ASCII) text. No other formats shall be allowed. (3) The computer...
Carrot Juice Fermentations as Man-Made Microbial Ecosystems Dominated by Lactic Acid Bacteria.
Wuyts, Sander; Van Beeck, Wannes; Oerlemans, Eline F M; Wittouck, Stijn; Claes, Ingmar J J; De Boeck, Ilke; Weckx, Stefan; Lievens, Bart; De Vuyst, Luc; Lebeer, Sarah
2018-06-15
Spontaneous vegetable fermentations, with their rich flavors and postulated health benefits, are regaining popularity. However, their microbiology is still poorly understood, therefore raising concerns about food safety. In addition, such spontaneous fermentations form interesting cases of man-made microbial ecosystems. Here, samples from 38 carrot juice fermentations were collected through a citizen science initiative, in addition to three laboratory fermentations. Culturing showed that Enterobacteriaceae were outcompeted by lactic acid bacteria (LAB) between 3 and 13 days of fermentation. Metabolite-target analysis showed that lactic acid and mannitol were highly produced, as well as the biogenic amine cadaverine. High-throughput 16S rRNA gene sequencing revealed that mainly species of Leuconostoc and Lactobacillus (as identified by 8 and 20 amplicon sequence variants [ASVs], respectively) mediated the fermentations in subsequent order. The analyses at the DNA level still detected a high number of Enterobacteriaceae, but their relative abundance was low when RNA-based sequencing was performed to detect presumptive metabolically active bacterial cells. In addition, this method greatly reduced host read contamination. Phylogenetic placement indicated a high LAB diversity, with ASVs from nine different phylogenetic groups of the Lactobacillus genus complex. However, fermentation experiments with isolates showed that only strains belonging to the most prevalent phylogenetic groups preserved the fermentation dynamics. The carrot juice fermentation thus forms a robust man-made microbial ecosystem suitable for studies on LAB diversity and niche specificity. IMPORTANCE The usage of fermented food products by professional chefs is steadily growing worldwide. Meanwhile, this interest has also increased at the household level. However, many of these artisanal food products remain understudied.
Here, an extensive microbial analysis was performed of spontaneous fermented carrot juices which are used as nonalcoholic alternatives for wine in a Belgian Michelin star restaurant. Samples were collected through an active citizen science approach with 38 participants, in addition to three laboratory fermentations. Identification of the main microbial players revealed that mainly species of Leuconostoc and Lactobacillus mediated the fermentations in subsequent order. In addition, a high diversity of lactic acid bacteria was found; however, fermentation experiments with isolates showed that only strains belonging to the most prevalent lactic acid bacteria preserved the fermentation dynamics. Finally, this study showed that the usage of RNA-based 16S rRNA amplicon sequencing greatly reduces host read contamination. Copyright © 2018 American Society for Microbiology.
Cryptic splice site in the complementary DNA of glucocerebrosidase causes inefficient expression.
Bukovac, Scott W; Bagshaw, Richard D; Rigat, Brigitte A; Callahan, John W; Clarke, Joe T R; Mahuran, Don J
2008-10-15
The low levels of human lysosomal glucocerebrosidase activity expressed in transiently transfected Chinese hamster ovary (CHO) cells were investigated. Reverse transcription PCR (RT-PCR) demonstrated that a significant portion of the transcribed RNA was misspliced owing to the presence of a cryptic splice site in the complementary DNA (cDNA). Missplicing results in the deletion of 179 bp of coding sequence and a premature stop codon. A repaired cDNA was constructed abolishing the splice site without changing the amino acid sequence. The level of glucocerebrosidase expression was increased sixfold. These data demonstrate that for maximum expression of any cDNA construct, the transcription products should be examined.
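The repair described above, abolishing a splice site without changing the amino acid sequence, amounts to choosing a synonymous substitution. The following is a minimal sketch of how such an edit could be checked, assuming the standard genetic code; the function names and the toy sequences are hypothetical illustrations and are not taken from the glucocerebrosidase cDNA.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; a trailing partial codon is ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def is_synonymous(original: str, edited: str) -> bool:
    """True if the edited sequence has the same length and encodes the same protein."""
    return len(original) == len(edited) and translate(original) == translate(edited)


# Toy example (hypothetical): GTA -> GTC keeps valine, GTA -> GAA changes it to glutamate.
print(is_synonymous("AAGGTAAGT", "AAGGTCAGT"))
print(is_synonymous("AAGGTAAGT", "AAGGAAAGT"))
```

A check like this only confirms that the protein is unchanged; whether the edit actually removes the cryptic site still has to be verified at the RNA level, as the abstract recommends.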
Characterization of a novel organic solute transporter homologue from Clonorchis sinensis
Dai, Fuhong; Lee, Ji-Yun; Pak, Jhang Ho; Sohn, Woon-Mok
2018-01-01
Clonorchis sinensis is a liver fluke that can dwell in the bile ducts of mammals. Bile acid transporters function to maintain the homeostasis of bile acids in C. sinensis, as they induce physiological changes or have harmful effects on C. sinensis survival. The organic solute transporter (OST) transports mainly bile acid and belongs to the SLC51 subfamily of solute carrier transporters. OST plays a critical role in the recirculation of bile acids in higher animals. In this study, we cloned full-length cDNA of the 480-amino acid OST from C. sinensis (CsOST). Genomic analysis revealed 11 exons and nine introns. The CsOST protein had a ‘Solute_trans_a’ domain with 67% homology to Schistosoma japonicum OST. For further analysis, the CsOST protein sequence was split into the ordered domain (CsOST-N) at the N-terminus and disordered domain (CsOST-C) at the C-terminus. The tertiary structure of each domain was built using a threading-based method and determined by manual comparison. In a phylogenetic tree, the CsOST-N domain belonged to the OSTα and CsOST-C to the OSTβ clade. These two domains were more highly conserved with the OST α- and β-subunits at the structure level than at sequence level. These findings suggested that CsOST comprised the OST α- and β-subunits. CsOST was localized in the oral and ventral suckers and in the mesenchymal tissues abundant around the intestine, vitelline glands, uterus, and testes. This study provides fundamental data for the further understanding of homologues in other flukes. PMID:29702646
González-Mellado, Damián; von Wettstein-Knowles, Penny; Garcés, Rafael; Martínez-Force, Enrique
2010-05-01
The beta-ketoacyl-acyl carrier protein synthase III (KAS III; EC 2.3.1.180) is a condensing enzyme catalyzing the initial step of fatty acid biosynthesis using acetyl-CoA as primer. To determine the mechanisms involved in the biosynthesis of fatty acids in sunflower (Helianthus annuus L.) developing seeds, a cDNA coding for HaKAS III (EF514400) was isolated, cloned and sequenced. Its protein sequence is as much as 72% identical to other KAS III-like ones such as those from Perilla frutescens, Jatropha curcas, Ricinus communis or Cuphea hookeriana. Phylogenetic study of the HaKAS III homologous proteins infers its origin from cyanobacterial ancestors. A genomic DNA gel blot analysis revealed that HaKAS III is a single copy gene. Expression levels of this gene, examined by Q-PCR, revealed higher levels in developing seeds storing oil than in leaves, stems, roots or seedling cotyledons. Heterologous expression of HaKAS III in Escherichia coli altered their fatty acid content and composition implying an interaction of HaKAS III with the bacterial FAS complex. Testing purified HaKAS III recombinant protein by adding to a reconstituted E. coli FAS system lacking condensation activity revealed a novel substrate specificity. In contrast to all hitherto characterized plant KAS IIIs, the activities of which are limited to the first cycles of intraplastidial fatty acid biosynthesis yielding C6 chains, HaKAS III participates in at least four cycles resulting in C10 chains.
Application of 2D graphic representation of protein sequence based on Huffman tree method.
Qi, Zhao-Hui; Feng, Jun; Qi, Xiao-Qin; Li, Ling
2012-05-01
Based on the Huffman tree method, we propose a new 2D graphic representation of protein sequence. This representation can completely avoid loss of information in the transfer of data from a protein sequence to its graphic representation. The method consists of two parts. One is about the 0-1 codes of 20 amino acids by Huffman tree with amino acid frequency. The amino acid frequency is defined as the statistical number of an amino acid in the analyzed protein sequences. The other is about the 2D graphic representation of protein sequence based on the 0-1 codes. Then the applications of the method on ten ND5 genes and seven Escherichia coli strains are presented in detail. The results show that the proposed model may provide us with some new insights to understand the evolution patterns determined from protein sequences and complete genomes. Copyright © 2012 Elsevier Ltd. All rights reserved.
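The first part of the method, deriving 0-1 codes for amino acids from their frequencies with a Huffman tree, can be sketched as follows. This is an illustrative sketch only: the tie-breaking rule, the function names and the toy input are assumptions, and the mapping from the 0-1 codes to the 2D graph is not specified in the abstract, so it is not attempted here.

```python
import heapq
from collections import Counter


def huffman_codes(sequences):
    """Build 0-1 Huffman codes from residue counts in the given protein sequences."""
    freq = Counter("".join(sequences))
    # Heap entries are (count, tie_breaker, {residue: code_so_far}).
    # The unique tie_breaker ensures the dicts are never compared.
    heap = [(n, i, {aa: ""}) for i, (aa, n) in enumerate(sorted(freq.items()))]
    heapq.heapify(heap)
    if not heap:
        return {}
    if len(heap) == 1:
        # A single distinct residue still needs a one-bit code.
        return {aa: "0" for aa in heap[0][2]}
    next_id = len(heap)
    while len(heap) > 1:
        n1, _, left = heapq.heappop(heap)   # least frequent subtree
        n2, _, right = heapq.heappop(heap)  # second least frequent subtree
        merged = {aa: "0" + code for aa, code in left.items()}
        merged.update({aa: "1" + code for aa, code in right.items()})
        heapq.heappush(heap, (n1 + n2, next_id, merged))
        next_id += 1
    return heap[0][2]


def encode(sequence, codes):
    """Concatenate the 0-1 codes of each residue in the sequence."""
    return "".join(codes[aa] for aa in sequence)


codes = huffman_codes(["AAAAGGC"])  # toy input, not real ND5 data
print(codes)
print(encode("AGC", codes))
```

Because a Huffman code is prefix-free, the bit string can be decoded back to the original residues, which is consistent with the abstract's claim that no information is lost.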
Lee, K L; Albee, K L; Bernasconi, R J; Edmunds, T
1997-01-01
The amino acid sequences of ananain (EC3.4.22.31) and stem bromelain (3.4.22.32), two cysteine proteases from pineapple stem, are similar yet ananain and stem bromelain possess distinct specificities towards synthetic peptide substrates and different reactivities towards the cysteine protease inhibitors E-64 and chicken egg white cystatin. We present here the complete amino acid sequence of ananain and compare it with the reported sequences of pineapple stem bromelain, papain and chymopapain from papaya and actinidin from kiwifruit. Ananain is comprised of 216 residues with a theoretical mass of 23464 Da. This primary structure includes a sequence insert between residues 170 and 174 not present in stem bromelain or papain and a hydrophobic series of amino acids adjacent to His-157. It is possible that these sequence differences contribute to the different substrate and inhibitor specificities exhibited by ananain and stem bromelain. PMID:9355753
Chen, Y; Feng, H; Chen, D; Abuduwaili, K; Li, X; Zhang, H
2018-01-01
The protective effects of folic acid on DNA damage and DNA methylation induced by N-methyl-N'-nitro-N-nitrosoguanidine (MNNG) in Kazakh esophageal epithelial cells were investigated using a 3 × 3 factorial design trial. The cells were cultured in vitro and exposed to media containing different concentrations of folic acid and MNNG, after which growth indices were detected. DNA damage levels were measured using comet assays, and genome-wide DNA methylation levels (MLs) were measured using high-performance liquid chromatography. The DNA methylation of methylenetetrahydrofolate reductase (MTHFR) and folate receptor-α (FRα) genes was detected by bisulfite sequencing polymerase chain reaction (PCR). The results showed significant increases in tail DNA concentration, tail length, and Olive tail moment (p < 0.01); a significant reduction of genome-wide DNA MLs (p < 0.01); and an increase in the methylation frequencies of MTHFR and FRα genes. In particular, significant differences were observed in the promoter regions of both genes (p < 0.01). Our study indicated that a reduction in folic acid concentration promotes DNA damage and DNA methylation in Kazakh esophageal epithelial cells upon MNNG exposure. Thus, sufficient folic acid levels could play a protective role against the damage induced by this compound.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gu, Huiya; Jinkerson, Robert E.; Davies, Fiona K.; ...
2016-05-26
The isolation or engineering of algal cells synthesizing high levels of medium-chain fatty acids (MCFAs) is attractive to mitigate the high clouding point of longer chain fatty acids in algal based biodiesel. To develop a more informed understanding of MCFA synthesis in photosynthetic microorganisms, we isolated several algae from Great Salt Lake and screened this collection for MCFA accumulation to identify strains naturally accumulating high levels of MCFA. A diatom, Chaetoceros sp. GSL56, accumulated particularly high levels of C14 (up to 40%), with the majority of C14 fatty acids allocated in triacylglycerols. Using whole cell transcriptome sequencing and de novo assembly, putative genes encoding fatty acid synthesis enzymes were identified. Enzymes from this Chaetoceros sp. were expressed in the cyanobacterium Synechococcus sp. PCC 7002 to validate gene function and to determine whether eukaryotic enzymes putatively lacking bacterial evolutionary control mechanisms could be used to improve MCFA production in this promising production strain. Replacement of the Synechococcus 7002 native FabH with a Chaetoceros ketoacyl-ACP synthase III increased MCFA synthesis up to fivefold. In conclusion, the level of increase is dependent on promoter strength and culturing conditions.
From Ramachandran Maps to Tertiary Structures of Proteins.
DasGupta, Debarati; Kaushik, Rahul; Jayaram, B
2015-08-27
Sequence to structure of proteins is an unsolved problem. A possible coarse grained resolution to this entails specification of all the torsional (Φ, Ψ) angles along the backbone of the polypeptide chain. The Ramachandran map quite elegantly depicts the allowed conformational (Φ, Ψ) space of proteins which is still very large for the purposes of accurate structure generation. We have divided the allowed (Φ, Ψ) space in Ramachandran maps into 27 distinct conformations sufficient to regenerate a structure to within 5 Å from the native, at least for small proteins, thus reducing the structure prediction problem to a specification of an alphanumeric string, i.e., the amino acid sequence together with one of the 27 conformations preferred by each amino acid residue. This still theoretically results in 27^n conformations for a protein comprising "n" amino acids. We then investigated the spatial correlations at the two-residue (dipeptide) and three-residue (tripeptide) levels in what may be described as higher order Ramachandran maps, with the premise that the allowed conformational space starts to shrink as we introduce neighborhood effects. We found, for instance, for a tripeptide which potentially can exist in any of the 27^3 "allowed" conformations, three-fourths of these conformations are redundant to the 95% confidence level, suggesting sequence context dependent preferred conformations. We then created a look-up table of preferred conformations at the tripeptide level and correlated them with energetically favorable conformations. We found in particular that Boltzmann probabilities calculated from van der Waals energies for each conformation of tripeptides correlate well with the observed populations in the structural database (the average correlation coefficient is ∼0.8).
An alpha-numeric string and hence the tertiary structure can be generated for any sequence from the look-up table within minutes on a single processor and to a higher level of accuracy if secondary structure can be specified. We tested the methodology on 100 small proteins, and in 90% of the cases, a structure within 5 Å is recovered. We thus believe that the method presented here provides the missing link between Ramachandran maps and tertiary structures of proteins. A Web server to convert a tertiary structure to an alphanumeric string and to predict the tertiary structure from the sequence of a protein using the above methodology is created and made freely accessible at http://www.scfbio-iitd.res.in/software/proteomics/rm2ts.jsp.
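The Boltzmann probabilities mentioned above follow the usual form p_i = exp(-E_i / kT) / Σ_j exp(-E_j / kT). A minimal sketch of that calculation is given below, assuming energies in kcal/mol and a temperature of 300 K; the units, the temperature and the example energies are assumptions for illustration and are not values reported by the authors.

```python
import math

# Gas constant in kcal/(mol*K); plays the role of the Boltzmann constant
# when energies are expressed per mole.
R_KCAL = 0.0019872041


def boltzmann_probabilities(energies, temperature=300.0):
    """Convert conformer energies (kcal/mol) into normalized Boltzmann probabilities."""
    beta = 1.0 / (R_KCAL * temperature)
    e_min = min(energies)  # shift by the minimum for numerical stability
    weights = [math.exp(-beta * (e - e_min)) for e in energies]
    z = sum(weights)       # partition function over the listed conformers
    return [w / z for w in weights]


# Number of tripeptide conformation strings when each residue has 27 states.
print(27 ** 3)  # 19683

# Hypothetical van der Waals energies for three tripeptide conformations.
print(boltzmann_probabilities([-5.0, -4.0, -2.5]))
```

In the approach described in the abstract, probabilities of this kind are compared with the populations observed in a structural database; the comparison itself is not reproduced here.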
Biessen, Erik A L; Sliedregt-Bol, Karen; 'T Hoen, Peter A Chr; Prince, Perry; Van der Bilt, Erica; Valentijn, A Rob P M; Meeuwenoord, Nico J; Princen, Hans; Bijsterbosch, Martin K; Van der Marel, Gijs A; Van Boom, Jacques H; Van Berkel, Theo J C
2002-01-01
In this study, we present the design and synthesis of an antisense peptide nucleic acid (asPNA) prodrug, which displays an improved biodistribution profile and an equally improved capacity to reduce the levels of target mRNA. The prodrug, K(GalNAc)(2)-asPNA, comprised of a 14-mer sequence complementary to the human microsomal triglyceride transfer protein (huMTP) gene, conjugated to a high-affinity tag for the hepatic asialoglycoprotein receptor (K(GalNAc)(2)). The prodrug was avidly bound and rapidly internalized by HepG2s. After iv injection into mice, K(GalNAc)(2)-asPNA accumulated in the parenchymal liver cells to a much greater extent than nonconjugated PNA (46% +/- 1% vs 3.1% +/- 0.5% of the injected dose, respectively). The prodrug was able to reduce MTP mRNA levels in HepG2 cells by 35-40% (P < 0.02) at 100 nM in an asialoglycoprotein receptor- and sequence-dependent fashion. In conclusion, hepatocyte-targeted PNA prodrugs combine a greatly improved tropism with an enhanced local intracellular availability and activity, making them attractive therapeutics to lower the expression level of hepatic target genes such as MTP.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Crooks, Gavin E.
WebLogo is a web based application designed to make the generation of sequence logos as easy and painless as possible. Sequence logos are a graphical representation of an amino acid or nucleic acid multiple sequence alignment developed by Tom Schneider and Mike Stephens. Each logo consists of stacks of symbols, one stack for each position in the sequence. The overall height of the stack indicates the sequence conservation at that position, while the height of symbols within the stack indicates the relative frequency of each amino or nucleic acid at that position. In general, a sequence logo provides a richer and more precise description of, for example, a binding site, than would a consensus sequence.
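The stack and symbol heights described above can be sketched with the commonly used information-content formula: the stack height at a position is log2(alphabet size) minus the Shannon entropy of that column, and each symbol's height is its relative frequency times the stack height. The sketch below is an illustration under stated assumptions, not WebLogo's own code: it omits the small-sample correction, does not treat gaps specially, and all function names are hypothetical.

```python
import math
from collections import Counter


def column_information(column, alphabet_size=20):
    """Information content in bits of one alignment column (no small-sample correction)."""
    counts = Counter(column)
    total = sum(counts.values())
    entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
    return math.log2(alphabet_size) - entropy


def logo_heights(alignment, alphabet_size=20):
    """Return, for each alignment position, a dict of symbol -> height in bits."""
    heights = []
    for column in zip(*alignment):  # iterate over positions of equal-length sequences
        info = column_information(column, alphabet_size)
        counts = Counter(column)
        heights.append({sym: info * n / len(column) for sym, n in counts.items()})
    return heights


# Toy nucleic acid alignment (alphabet size 4): position 1 is fully conserved,
# position 2 is split evenly between two symbols.
print(logo_heights(["AA", "AC"], alphabet_size=4))
```

A fully conserved nucleotide position reaches the 2-bit maximum, while a consensus sequence would report only the single most frequent symbol at each position.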
1987-01-01
identified in the difference spectra, implying that there are five to seven tryptophans within 17 Å of the spin-label hapten. Amino acid sequences...of the heavy and light chains were obtained by a combination of amino acid and DNA sequencing. A molecular model was constructed from the sequence...Clore & Gronenborn, 1982, 1983). This technique should also identify...acids yields detailed information about the amino acid composition of the combining
Koyama, Hiroki; Mizusawa, Nanami; Hoashi, Masataka; Tan, Engkong; Yasumoto, Ko; Jimbo, Mitsuru; Ikeda, Daisuke; Yokoyama, Takehiko; Asakawa, Shuichi; Piyapattanakorn, Sanit; Watabe, Shugo
2018-06-05
Shrimps inhabiting coastal waters can survive in a wide range of salinity. However, the molecular mechanisms involved in their acclimation to different environmental salinities have remained largely unknown. In the present study, we acclimated kuruma shrimp (Marsupenaeus japonicus) at 1.7%, 3.4% and 4.0% salinities. After acclimating for 6, 12, 24 and 72 h, we determined free amino acid concentrations in their abdominal muscle, and performed RNA sequencing analysis on this muscle. The concentrations of free amino acids were clearly altered depending on salinity after 24 h of acclimation. Glutamine and alanine concentrations were markedly increased following the increase of salinity. In association with such changes, many genes related to amino acid metabolism changed their expression levels. In particular, the increase of the expression level of the gene encoding glutamate-ammonia ligase, which functions in glutamine metabolism, appeared to be associated with the increased glutamine concentration at high salinity. Furthermore, the increased alanine concentration at high salinity was likely associated with the decrease in the expression levels of the gene encoding alanine-glyoxylate transaminase. Thus, there is a possibility that changes in the concentration of free amino acids for osmoregulation in kuruma shrimp are regulated by changes in the expression levels of genes related to amino acid metabolism. © 2018. Published by The Company of Biologists Ltd.
Detection of the High-Level Aminoglycoside Resistance Gene aph(2")-Ib in Enterococcus faecium
Kao, Susan J.; You, Il; Clewell, Don B.; Donabedian, Susan M.; Zervos, Marcus J.; Petrin, Joanne; Shaw, Karen J.; Chow, Joseph W.
2000-01-01
A new high-level gentamicin resistance gene, designated aph(2")-Ib, was cloned from Enterococcus faecium SF11770. The deduced amino acid sequence of the 897-bp open reading frame of aph(2")-Ib shares homology with the aminoglycoside-modifying enzymes AAC(6′)-APH(2"), APH(2")-Ic, and APH(2")-Id. The observed phosphotransferase activity is designated APH(2")-Ib. PMID:10991878
Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L
1988-01-01
Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. PMID:3052437
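The third-position G or C bias reported above (often called GC3) is simply the fraction of codons whose third base is G or C. A minimal sketch of the calculation follows; the function name and the short test sequences are hypothetical and unrelated to the Thermus thermophilus gene.

```python
def gc3_fraction(cds: str) -> float:
    """Fraction of codons in an in-frame coding sequence whose third base is G or C."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3   # ignore a trailing partial codon
    third_bases = cds[2:usable:3]      # bases at codon positions 3, 6, 9, ...
    if not third_bases:
        raise ValueError("sequence contains no complete codon")
    return sum(base in "GC" for base in third_bases) / len(third_bases)


# Toy sequences (hypothetical): third bases are G, C, G and G, A, T respectively.
print(gc3_fraction("ATGGCCAAG"))
print(gc3_fraction("ATGGCAAAT"))
```

Applied to a full open reading frame, a value near 0.93 would correspond to the 93.1% bias described in the abstract.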
Sequence of a cDNA encoding pancreatic preprosomatostatin-22.
Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E
1982-01-01
We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. PMID:6127673
NASA Astrophysics Data System (ADS)
Ertel, John R.; Hedges, John I.
1984-10-01
Vanillyl, syringyl and cinnamyl phenols occur as CuO oxidation products of humic, fulvic and base-insoluble residual fractions from soils, peat and nearshore marine sediments. However, none of these lignin-derived phenols were released by CuO oxidation of deep-sea sediment or its base-extractable organic fractions. Lignin analysis indicated that peat and coastal marine sediments contained significantly higher levels of recognizable vascular plant carbon (20-50%) than soils and offshore marine sediments (0-10%). Although accounting for less than 20% of the total sedimentary (bulk) lignin, lignin components of humic acid fractions compositionally and quantitatively resembled the corresponding bulk samples and base-insoluble residues. Recognizable lignin, presumably present as intact phenylpropanoid units, accounted for up to 5% of the carbon in peat and coastal humic acids but less than 1% in soil humic acids. Fulvic acid fractions uniformly yielded less lignin-derived phenols in mixtures that were depleted in syringyl and cinnamyl phenols relative to the corresponding humic acid fractions. Within the vanillyl and syringyl families the relative distribution of acidic and aldehydic phenols is a sensitive measure of the degree of oxidative alteration of the lignin component. The high acid/aldehyde ratios and the low phenol yields of soils and their humic fractions compared to peat and coastal sediments indicate extensive degradation of the lignin source material. Likewise, the progressively higher acid/aldehyde ratios and lower phenol yields along the sequence: plant tissues (plant debris)-humic acids-fulvic acids suggest that this pattern represents the diagenetic sequence for the aerobic degradation of lignin biopolymers.
Miller, Thomas F.
2017-01-01
We present a coarse-grained simulation model that is capable of simulating the minute-timescale dynamics of protein translocation and membrane integration via the Sec translocon, while retaining sufficient chemical and structural detail to capture many of the sequence-specific interactions that drive these processes. The model includes accurate geometric representations of the ribosome and Sec translocon, obtained directly from experimental structures, and interactions parameterized from nearly 200 μs of residue-based coarse-grained molecular dynamics simulations. A protocol for mapping amino-acid sequences to coarse-grained beads enables the direct simulation of trajectories for the co-translational insertion of arbitrary polypeptide sequences into the Sec translocon. The model reproduces experimentally observed features of membrane protein integration, including the efficiency with which polypeptide domains integrate into the membrane, the variation in integration efficiency upon single amino-acid mutations, and the orientation of transmembrane domains. The central advantage of the model is that it connects sequence-level protein features to biological observables and timescales, enabling direct simulation for the mechanistic analysis of co-translational integration and for the engineering of membrane proteins with enhanced membrane integration efficiency. PMID:28328943
Nucleotide sequence of a resistance breaking mutant of southern bean mosaic virus.
Lee, L; Anderson, E J
1998-01-01
SBMV-S is a resistance-breaking mutant of an Arkansas isolate of the bean strain of southern bean mosaic virus (SBMV-BARK) that is able to move systemically in Phaseolus vulgaris cvs. Pinto and Great Northern, whereas the wild-type SBMV-BARK causes local necrotic lesions and is restricted to the inoculated leaves of these hosts. Sequence analysis of the 4136 nucleotide genomes of SBMV-BARK and SBMV-S revealed seven nucleotide differences, but only four deduced amino acid changes. A single amino acid change occurred in the C-terminal region of the putative RNA-dependent RNA polymerase and three differences were identified in the N-terminal portion of the virus coat protein. SBMV-BARK and SBMV-S were compared with other sobemoviruses and were found to contain a high level of nucleotide sequence identity (91.3%) to SBMV-B. Unlike SBMV-B however, SBMV-BARK and SBMV-S contained four putative overlapping open reading frames, making them more similar in genome organization to the cowpea strain, SBMV-C. The possibility exists that mutations or even errors, that resulted in mis-identification of open reading frames, occurred in previously published information on nucleotide sequence and genomic organization for SBMV-B.
Montoya-Ruiz, Carolina; Cajimat, Maria N B; Milazzo, Mary Louise; Diaz, Francisco J; Rodas, Juan David; Valbuena, Gustavo; Fulhorst, Charles F
2015-07-01
The results of a previous study suggested that Cherrie's cane rat (Zygodontomys cherriei) is the principal host of Necoclí virus (family Bunyaviridae, genus Hantavirus) in Colombia. Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences in this study confirmed that Necoclí virus is phylogenetically closely related to Maporal virus, which is principally associated with the delicate pygmy rice rat (Oligoryzomys delicatus) in western Venezuela. In pairwise comparisons, nonidentities between the complete amino acid sequence of the nucleocapsid protein of Necoclí virus and the complete amino acid sequences of the nucleocapsid proteins of other hantaviruses were ≥8.7%. Likewise, nonidentities between the complete amino acid sequence of the glycoprotein precursor of Necoclí virus and the complete amino acid sequences of the glycoprotein precursors of other hantaviruses were ≥11.7%. Collectively, the unique association of Necoclí virus with Z. cherriei in Colombia, results of the Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences, and results of the pairwise comparisons of amino acid sequences strongly support the notion that Necoclí virus represents a novel species in the genus Hantavirus. Further work is needed to determine whether Calabazo virus (a hantavirus associated with Z. brevicauda cherriei in Panama) and Necoclí virus are conspecific.
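The pairwise nonidentity values quoted above are percentages of aligned positions at which two amino acid sequences differ. A minimal sketch of such a calculation is shown below, assuming the two sequences are already aligned to equal length; the handling of gaps (columns where both sequences have a gap are skipped, a gap against a residue counts as a difference) is an assumption and may differ from the convention used in the study. The sequences are hypothetical.

```python
def percent_nonidentity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percentage of compared alignment columns at which two aligned sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap and b == gap:
            continue  # column carries no information for this pair
        compared += 1
        if a != b:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * mismatches / compared


# Toy aligned fragments (hypothetical), differing at one of eight positions.
print(percent_nonidentity("MKTAYIAK", "MKTAYIAR"))  # 12.5
```

Repeating this calculation for every pair of sequences yields the kind of nonidentity matrix from which thresholds such as those quoted in the abstract are read.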
Evolutionary Pattern of the FAE1 Gene in Brassicaceae and Its Correlation with the Erucic Acid Trait
Li, Mimi; Peng, Bin; Guo, Haisong; Yan, Qinqin; Hang, Yueyu
2013-01-01
The fatty acid elongase 1 (FAE1) gene catalyzes the initial condensation step in the elongation pathway of VLCFA (very long chain fatty acid) biosynthesis and is thus a key gene in erucic acid biosynthesis. Based on a worldwide collection of 62 accessions representing 14 tribes, 31 genera, 51 species, 4 subspecies and 7 varieties, we conducted a phylogenetic reconstruction and correlation analysis between genetic variations in the FAE1 gene and the erucic acid trait, attempting to gain insight into the evolutionary patterns and the correlations between genetic variations in FAE1 and trait variations. The five clear, deeply diverged clades detected in the phylogenetic reconstruction are largely congruent with a previous multiple gene-derived phylogeny. The Ka/Ks ratio (<1) and overall low level of nucleotide diversity in the FAE1 gene suggest that purifying selection is the major evolutionary force acting on this gene. Sequence variations in FAE1 show a strong correlation with the content of erucic acid in seeds, suggesting a causal link between the two. Furthermore, we detected 16 mutations that were fixed between the low and high phenotypes of the FAE1 gene, which constitute candidate active sites in this gene for altering the content of erucic acid in seeds. Our findings begin to shed light on the evolutionary pattern of this important gene and represent the first step in elucidating how the sequence variations impact the production of erucic acid in plants. PMID:24358289
Shayan, P; Jafari, S; Fattahi, R; Ebrahimzade, E; Amininia, N; Changizi, E
2016-05-01
Ovine theileriosis is an important hemoprotozoal disease of sheep and goats in tropical and subtropical regions that causes high economic losses in the livestock industry. Theileria annulata surface protein (TaSp) was used previously as a tool for serological analysis in livestock. Since the amino acid sequence of TaSp is, at least in part, highly conserved in T. annulata, Theileria lestoquardi and Theileria china I and II, it is very important to determine the amino acid sequence of this protein in Theileria ovis as well, to avoid false interpretation of serological data based on this protein in small animals. In the present study, the nucleotide sequence and amino acid sequence of T. ovis surface protein (ToSp) were determined. The comparison of the nucleotide sequence of ToSp showed 96, 96, 99, and 86 % homology to the corresponding nucleotide sequences of the TaSp genes of T. annulata, T. china I, T. china II and T. lestoquardi, previously registered in GenBank under accession nos. AJ316260.1, AY274329.1, DQ120058.1, and EF092924.1, respectively. The amino acid sequence analysis showed 95, 81, 98 and 70 % homology to the corresponding amino acid sequences of T. annulata, T. china I, T. china II and T. lestoquardi, registered in GenBank under accession nos. CAC87478.1, AAP36993.1, AAZ30365.1 and AAP36999.11, respectively. Interestingly, in contrast to the C terminus, a significant difference in amino acid sequence in the N terminus of the ToSp protein could be determined compared to the other known corresponding TaSp sequences, which makes this region attractive for designing a suitable tool for serological diagnosis.
Baron, S F; Franklund, C V; Hylemon, P B
1991-01-01
Southern blot analysis indicated that the gene encoding the constitutive, NADP-linked bile acid 7 alpha-hydroxysteroid dehydrogenase of Eubacterium sp. strain VPI 12708 was located on a 6.5-kb EcoRI fragment of the chromosomal DNA. This fragment was cloned into bacteriophage lambda gt11, and a 2.9-kb piece of this insert was subcloned into pUC19, yielding the recombinant plasmid pBH51. DNA sequence analysis of the 7 alpha-hydroxysteroid dehydrogenase gene in pBH51 revealed a 798-bp open reading frame, coding for a protein with a calculated molecular weight of 28,500. A putative promoter sequence and ribosome binding site were identified. The 7 alpha-hydroxysteroid dehydrogenase mRNA transcript in Eubacterium sp. strain VPI 12708 was about 0.94 kb in length, suggesting that it is monocistronic. An Escherichia coli DH5 alpha transformant harboring pBH51 had approximately 30-fold greater levels of 7 alpha-hydroxysteroid dehydrogenase mRNA, immunoreactive protein, and specific activity than Eubacterium sp. strain VPI 12708. The 7 alpha-hydroxysteroid dehydrogenase purified from the pBH51 transformant was similar in subunit molecular weight, specific activity, and kinetic properties to that from Eubacterium sp. strain VPI 12708, and it reacted with antiserum raised against the authentic enzyme on Western immunoblots. Alignment of the amino acid sequence of the 7 alpha-hydroxysteroid dehydrogenase with those of 10 other pyridine nucleotide-linked alcohol/polyol dehydrogenases revealed six conserved amino acid residues in the N-terminal regions thought to function in coenzyme binding. PMID:1856160
Silva, Roberta N; Oliveira, Lilian C G; Parise, Carolina B; Oliveira, Juliana R; Severino, Beatrice; Corvino, Angela; di Vaio, Paola; Temussi, Piero A; Caliendo, Giuseppe; Santagada, Vincenzo; Juliano, Luiz; Juliano, Maria A
2017-05-01
Human kallikrein 6 (KLK6) is highly expressed in the central nervous system, with elevated levels in demyelinating disease. KLK6 has a very restricted specificity for arginine (R) and hydrolyses myelin basic protein, protein activator receptors and human ionotropic glutamate receptor subunits. Here we report a previously unreported activity of KLK6 on peptides containing clusters of basic amino acids, as in synthetic fluorogenic peptidyl-Arg-7-amino-4-carbamoylmethylcoumarin (peptidyl-ACC) peptides and FRET peptides in the format of Abz-peptidyl-Q-EDDnp (where Abz=ortho-aminobenzoic acid and Q-EDDnp=glutaminyl-N-(2,4-dinitrophenyl) ethylenediamine), in which pairs or sequences of basic amino acids (R or K) were introduced. Surprisingly, KLK6 hydrolyzed the fluorogenic peptides Bz-A-R ↓ R-ACC and Z-R ↓ R-MCA between the two R groups, resulting in non-fluorescent products. FRET peptides containing furin processing sequences of human MMP-14, nerve growth factor (NGF), Neurotrophin-3 (NT-3) and Neurotrophin-4 (NT-4) were cleaved by KLK6 at the same position expected for furin. Finally, KLK6 cleaved FRET peptides derived from human proenkephalin after the KR pair, the most frequent basic residues flanking enkephalins in the human proenkephalin sequence. This result suggests that KLK6 is able to release enkephalin from proenkephalin precursors and, in this respect, resembles furin, a canonical processing proteolytic enzyme. Molecular models of peptides were built into the KLK6 structure, and the marked preference for cleavage between the two R residues of the examined peptides was related to the extended conformation of the substrates. Copyright © 2017 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Yu, Shuiyan; Liu, Shicheng; Li, Chunyang; Zhou, Zhigang
2011-01-01
Myrmecia incisa is a green coccoid freshwater microalga that is rich in arachidonic acid (ArA, C20:4ω-6, Δ5,8,11,14), a long chain polyunsaturated fatty acid (PUFA), especially under nitrogen starvation stress. A cDNA library of M. incisa was constructed with λ phage vectors and a 545 nt expressed sequence tag (EST) was screened from this library as a putative elongase gene due to its 56% and 49% identity to Marchantia polymorpha L. and Ostreococcus tauri Courties et Chrétiennot-Dinet, respectively. Based upon this EST sequence, an elongase gene designated MiFAE was isolated from M. incisa via 5'/3' rapid amplification of cDNA ends (RACE). The cDNA sequence was 1,331 bp long and included a 33 bp 5'-untranslated region (UTR) and a 431 bp 3'-UTR with a typical poly-A tail. The 867 bp ORF encoded a predicted protein of 288 amino acids. This protein was characterized by a conserved histidine-rich box and a MYxYY motif that was present in other members of the elongase family. The genomic DNA sequence of MiFAE was found to be interrupted by three introns, with splicing sites of Introns I (81 bp), II (81 bp), and III (67 bp) that conformed to the GT-AG rule. Quantitative real-time PCR showed that the transcription level of MiFAE in this microalga under nitrogen starvation was higher than that under normal conditions. Prior to the ArA content accumulation, the transcription of MiFAE was enhanced, suggesting that it was possibly responsible for the ArA accumulation in this microalga cultured under nitrogen starvation conditions.
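The lengths reported for the MiFAE cDNA can be cross-checked with simple arithmetic. The sketch below uses only the figures quoted in the abstract; the variable names are illustrative, and it assumes that the 867 bp ORF length includes the stop codon.

```python
# Figures quoted in the abstract for the MiFAE cDNA.
utr5_bp = 33               # 5'-untranslated region
orf_bp = 867               # open reading frame
utr3_bp = 431              # 3'-untranslated region
reported_cdna_bp = 1331
reported_protein_aa = 288

# The three segments should add up to the full cDNA length.
total_bp = utr5_bp + orf_bp + utr3_bp

# An ORF must contain a whole number of codons.
codons, remainder = divmod(orf_bp, 3)

# Assumption: the ORF length counts the stop codon, which encodes no residue.
protein_aa = codons - 1

print(total_bp == reported_cdna_bp)       # segments are consistent with the cDNA
print(remainder == 0)                     # ORF length is a multiple of three
print(protein_aa == reported_protein_aa)  # codon count matches the protein length
```

Under that assumption the reported values agree with one another: 289 codons, of which 288 encode amino acids.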
Brain cDNA clone for human cholinesterase
DOE Office of Scientific and Technical Information (OSTI.GOV)
McTiernan, C.; Adkins, S.; Chatonnet, A.
1987-10-01
A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum. The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes coded for cholinesterase.
Rubin, D A; Dores, R M
1995-06-01
In order to obtain a more resolute phylogeny of teleosts based on growth hormone (GH) sequences, phylogenetic analyses were performed in which deletions (gaps), which appear to be order specific, were upheld to maintain GH's structural information. Sequences were analyzed at 194 amino acid positions. In addition, the two closest genealogically related groups to the teleosts, Amia calva and Acipenser guldenstadti, were used as outgroups. Modified sequence alignments were also analyzed to determine clade stability. Analyses indicated, in the most parsimonious cladogram, that molecular and morphological relationships for the orders of fishes are congruent. With GH molecular sequence data it was possible to resolve all clades at the familial level. Analyses of the primary sequence data indicate that: (a) the halecomorphean and chondrostean GH sequences are the appropriate outgroups for generating the most parsimonious cladogram for teleosts; (b) proper alignment of teleost GH sequence by the inclusion of gaps is necessary for resolution of the Percomorpha; and (c) removal of sequence information by deleting improperly aligned sequence decreases the phylogenetic signal obtained.
Cloning and expression of cDNA coding for bouganin.
den Hartog, Marcel T; Lubelli, Chiara; Boon, Louis; Heerkens, Sijmie; Ortiz Buijsse, Antonio P; de Boer, Mark; Stirpe, Fiorenzo
2002-03-01
Bouganin is a ribosome-inactivating protein that was recently isolated from Bougainvillea spectabilis Willd. In this work, the cloning and expression of the cDNA encoding bouganin is described. From the cDNA, the amino-acid sequence was deduced, which correlated with the primary sequence data obtained by amino-acid sequencing of the native protein. Bouganin is synthesized as a pro-peptide consisting of 305 amino acids, the first 26 of which act as a leader signal while the 29 C-terminal amino acids are cleaved during processing of the molecule. The mature protein consists of 250 amino acids. Using the cDNA sequence encoding the mature protein of 250 amino acids, a recombinant protein was expressed, purified and characterized. The recombinant molecule had similar activity in a cell-free protein synthesis assay and comparable toxicity on living cells to the isolated native bouganin.
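The processing described above (a 305-residue precursor losing a 26-residue leader and 29 C-terminal residues) fixes both the length and the precursor coordinates of the mature protein. The short sketch below works this out; the function name is hypothetical, and the 1-based coordinates assume the leader and the C-terminal extension are each removed as one contiguous block.

```python
def mature_span(precursor_len: int, n_leader: int, n_cterm: int) -> tuple[int, int, int]:
    """Return (first, last, length) of the mature chain in 1-based
    precursor numbering, after removing an N-terminal leader and a
    C-terminal extension."""
    first = n_leader + 1
    last = precursor_len - n_cterm
    if last < first:
        raise ValueError("cleaved segments overlap")
    return first, last, last - first + 1


# Figures from the abstract: 305-residue precursor, 26-residue leader,
# 29 C-terminal residues removed.
first, last, length = mature_span(305, 26, 29)
print(first, last, length)
```

The resulting length, 305 - 26 - 29 = 250, matches the 250 amino acids reported for mature bouganin.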
Method for altering antibody light chain interactions
Stevens, Fred J.; Stevens, Priscilla Wilkins; Raffen, Rosemarie; Schiffer, Marianne
2002-01-01
A method for recombinant antibody subunit dimerization including modifying at least one codon of a nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in the interface segment of the light polypeptide variable region, the charged amino acid having a first polarity; and modifying at least one codon of the nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in an interface segment of the heavy polypeptide variable region corresponding to a position in the light polypeptide variable region, the charged amino acid having a second polarity opposite the first polarity. Nucleic acid sequences which code for novel light chain proteins, the latter of which are used in conjunction with the inventive method, are also provided.
Multiple copies of a bile acid-inducible gene in Eubacterium sp. strain VPI 12708.
Gopal-Srivastava, R; Mallonee, D H; White, W B; Hylemon, P B
1990-01-01
Eubacterium sp. strain VPI 12708 is an anaerobic intestinal bacterium which possesses inducible bile acid 7-dehydroxylation activity. Several new polypeptides are produced in this strain following induction with cholic acid. Genes coding for two copies of a bile acid-inducible 27,000-dalton polypeptide (baiA1 and baiA2) have been previously cloned and sequenced. We now report on a gene coding for a third copy of this 27,000-dalton polypeptide (baiA3). The baiA3 gene has been cloned in lambda DASH on an 11.2-kilobase DNA fragment from a partial Sau3A digest of the Eubacterium DNA. DNA sequence analysis of the baiA3 gene revealed 100% homology with the baiA1 gene within the coding region of the 27,000-dalton polypeptides. The baiA2 gene shares 81% sequence identity with the other two genes at the nucleotide level. The flanking nucleotide sequences associated with the baiA1 and baiA3 genes are identical for 930 bases in the 5' direction from the initiation codon and for at least 325 bases in the 3' direction from the stop codon, including the putative promoter regions for the genes. An additional open reading frame (occupying from 621 to 648 bases, depending on the correct start codon) was found in the identical 5' regions associated with the baiA1 and baiA3 clones. The 5' sequence 930 bases upstream from the baiA1 and baiA3 genes was totally divergent. The baiA2 gene, which is part of a large bile acid-inducible operon, showed no homology with the other two genes either in the 5' or 3' direction from the polypeptide coding region, except for a 15-base-pair presumed ribosome-binding site in the 5' region. These studies strongly suggest that a gene duplication (baiA1 and baiA3) has occurred and is stably maintained in this bacterium. PMID:2376563
Trucco, Verónica; de Breuil, Soledad; Bejerman, Nicolás; Lenardon, Sergio; Giolitti, Fabián
2014-06-01
The complete nucleotide sequence of an Alfalfa mosaic virus (AMV) isolate infecting alfalfa (Medicago sativa L.) in Argentina, AMV-Arg, was determined. The virus genome has the typical organization described for AMV, and comprises 3,643, 2,593, and 2,038 nucleotides for RNA1, 2 and 3, respectively. The whole genome sequence and each encoding region were compared with those of four other isolates that have been completely sequenced, from China, Italy, Spain and the USA. The nucleotide identity percentages ranged from 95.9 to 99.1 % for the three RNAs and from 93.7 to 99 % for the protein 1 (P1), protein 2 (P2), movement protein and coat protein (CP) encoding regions, whereas the amino acid identity percentages of these proteins ranged from 93.4 to 99.5 %, the lowest value corresponding to P2. CP sequences of AMV-Arg were compared with those of 25 other available isolates, and a phylogenetic analysis based on the CP gene was carried out. The highest percentage of nucleotide sequence identity of the CP gene was 98.3 % with a Chinese isolate, and the highest identity at the amino acid level was 98.6 % with four isolates, two from Italy, one from Brazil and the remaining one from China. The phylogenetic analysis showed that AMV-Arg is closely related to subgroup I of AMV isolates. To our knowledge, this is the first report of a complete nucleotide sequence of AMV from South America and the first worldwide report of a complete nucleotide sequence of AMV isolated from alfalfa as natural host.
Ruff, Kiersten M; Roberts, Stefan; Chilkoti, Ashutosh; Pappu, Rohit V
2018-06-24
Proteins and synthetic polymers can undergo phase transitions in response to changes to intensive solution parameters such as temperature, proton chemical potentials (pH), and hydrostatic pressure. For proteins and protein-based polymers, the information required for stimulus responsive phase transitions is encoded in their amino acid sequence. Here, we review some of the key physical principles that govern the phase transitions of archetypal intrinsically disordered protein polymers (IDPPs). These are disordered proteins with highly repetitive amino acid sequences. Advances in recombinant technologies have enabled the design and synthesis of protein sequences of a variety of sequence complexities and lengths. We summarize insights that have been gleaned from the design and characterization of IDPPs that undergo thermo-responsive phase transitions and build on these insights to present a general framework for IDPPs with pH and pressure responsive phase behavior. In doing so, we connect the stimulus responsive phase behavior of IDPPs with repetitive sequences to the coil-to-globule transitions that these sequences undergo at the single chain level in response to changes in stimuli. The proposed framework and ongoing studies of stimulus responsive phase behavior of designed IDPPs have direct implications in bioengineering, where designing sequences with bespoke material properties broadens the spectrum of applications, and in biology and medicine for understanding the sequence-specific driving forces for the formation of protein-based membraneless organelles as well as biological matrices that act as scaffolds for cells and mediators of cell-to-cell communication. Copyright © 2018. Published by Elsevier Ltd.
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.
Code of Federal Regulations, 2013 CFR
2013-07-01
... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.
Code of Federal Regulations, 2010 CFR
2010-07-01
... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.
Code of Federal Regulations, 2012 CFR
2012-07-01
... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
Diniz, Erik Trovão; Jorge, Alexander A L; Arnhold, Ivo J P; Rosenbloom, Arlan L; Bandeira, Francisco
2008-11-01
To date, about sixty different mutations within the GH receptor (GHR) gene have been described in patients with GH insensitivity syndrome (GHI). In this report, we describe a novel nonsense mutation of GHR. The patient was evaluated at the age of 6 yr for short stature associated with a clinical phenotype of GHI. GH, IGF-1, and GHBP levels were determined. The PCR products from exons 2-10 were sequenced. The patient had high GH (26 microg/L), low IGF-1 (22.5 ng/ml) and undetectable GHBP levels. The sequencing of GHR exon 5 disclosed an adenine duplication at nucleotide 338 of the GHR coding sequence (c.338dupA) in the homozygous state. We describe a novel mutation that causes a truncated GHR and a loss of receptor function due to the lack of the amino acids comprising the transmembrane and intracellular regions of the GHR protein, leading to GHI.
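A coding-sequence position such as the one in c.338dupA can be mapped to a codon number and a position within that codon by integer arithmetic. The sketch below is illustrative only: the function name is hypothetical, and it assumes numbering that starts at 1 on the first base of the start codon.

```python
def codon_of(cds_pos: int) -> tuple[int, int]:
    """Map a 1-based coding-sequence position to
    (codon number, position within the codon), both 1-based."""
    if cds_pos < 1:
        raise ValueError("coding positions start at 1")
    index = cds_pos - 1
    return index // 3 + 1, index % 3 + 1


codon, offset = codon_of(338)
print(codon, offset)

# Duplicating a single base changes the length by one nucleotide,
# which is not a multiple of three, so every downstream codon is
# read in a shifted frame.
shifts_frame = (1 % 3) != 0
print(shifts_frame)
```

Under this numbering, nucleotide 338 is the second base of codon 113. The shifted reading frame downstream of the duplication is consistent with the truncated receptor described in the abstract.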
Use of CYP52A2A promoter to increase gene expression in yeast
Craft, David L.; Wilson, C. Ron; Eirich, Dudley; Zhang, Yeyan
2004-01-06
A nucleic acid sequence including a CYP promoter operably linked to nucleic acid encoding a heterologous protein is provided to increase transcription of the nucleic acid. Expression vectors and host cells containing the nucleic acid sequence are also provided. The methods and compositions described herein are especially useful in the production of polycarboxylic acids by yeast cells.
Method of Identifying a Base in a Nucleic Acid
Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua
1999-01-01
Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Identifying a base in a nucleic acid
Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua
2005-02-08
Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Genetic characterization of a novel astrovirus in Pekin ducks.
Liao, Qinfeng; Liu, Ning; Wang, Xiaoyan; Wang, Fumin; Zhang, Dabing
2015-06-01
Three divergent groups of duck astroviruses (DAstVs), namely DAstV-1, DAstV-2 (formerly duck hepatitis virus type 3) and DAstV-3 (isolate CPH), and other avastroviruses are known to infect domestic ducks. To provide more data regarding the molecular epidemiology of astroviruses in domestic ducks, we examined the prevalence of astroviruses in 136 domestic duck samples collected from four different provinces of China. Nineteen goose samples were also included. Using an astrovirus-specific reverse transcription-PCR assay, two groups of astroviruses were detected from our samples. A group of astroviruses detected from Pekin ducks, Shaoxing ducks and Landes geese were highly similar to the newly discovered DAstV-3. More interestingly, a novel group of avastroviruses, which we named DAstV-4, was detected in Pekin ducks. Following full-length sequencing and sequence analysis, the variation between DAstV-4 and other avastroviruses in terms of lengths of genome and internal component was highlighted. Sequence identity and phylogenetic analyses based on the amino acid sequences of the three open reading frames (ORFs) clearly demonstrated that DAstV-4 was highly divergent from all other avastroviruses. Further analyses showed that DAstV-4 shared low levels of genome identities (50-58%) and high levels of mean amino acid genetic distances in the ORF2 sequences (0.520-0.801) with other avastroviruses, suggesting DAstV-4 may represent an additional avastrovirus species although the taxonomic relationship of DAstV-4 to DAstV-3 remains to be resolved. The present work contributes to the understanding of the epidemiology, ecology and taxonomy of astroviruses in ducks. Copyright © 2015 Elsevier B.V. All rights reserved.
Zhao, Guozhong; Yao, Yunping; Wang, Chunling; Tian, Fengwei; Liu, Xiaoming; Hou, Lihua; Yang, Zhen; Zhao, Jianxin; Zhang, Hao
2015-01-01
Amino acids comprise the majority of the flavor compounds in soy sauce. A portion of these amino acids are formed from the biosynthesis and metabolism of the fungus Aspergillus oryzae; however, the metabolic pathways leading to the formation of these amino acids in A. oryzae remain largely unknown. We sequenced the transcriptomes of A. oryzae 100-8 and A. oryzae 3.042 under similar soy sauce fermentation conditions. 2D gel electrophoresis was also used to find some differences in protein expression. We found that many amino acid hydrolases (endopeptidases, aminopeptidases, and X-pro-dipeptidyl aminopeptidase) were expressed at much higher levels (mostly greater than double) in A. oryzae 100-8 than in A. oryzae 3.042. Our results indicated that glutamate dehydrogenase may activate the metabolism of amino acids. We also found that the expression levels of some genes changed simultaneously in the metabolic pathways of tyrosine and leucine and that these conserved genes may modulate the function of the metabolic pathway. Such variation in the metabolic pathways of amino acids is important as it can significantly alter the flavor of fermented soy sauce. PMID:25945335
Zhao, Guozhong; Yao, Yunping; Wang, Chunling; Tian, Fengwei; Liu, Xiaoming; Hou, Lihua; Yang, Zhen; Zhao, Jianxin; Zhang, Hao; Cao, Xiaohong
2015-01-01
Amino acids comprise the majority of the flavor compounds in soy sauce. A portion of these amino acids are formed from the biosynthesis and metabolism of the fungus Aspergillus oryzae; however, the metabolic pathways leading to the formation of these amino acids in A. oryzae remain largely unknown. We sequenced the transcriptomes of A. oryzae 100-8 and A. oryzae 3.042 under similar soy sauce fermentation conditions. 2D gel electrophoresis was also used to find some differences in protein expression. We found that many amino acid hydrolases (endopeptidases, aminopeptidases, and X-pro-dipeptidyl aminopeptidase) were expressed at much higher levels (mostly greater than double) in A. oryzae 100-8 than in A. oryzae 3.042. Our results indicated that glutamate dehydrogenase may activate the metabolism of amino acids. We also found that the expression levels of some genes changed simultaneously in the metabolic pathways of tyrosine and leucine and that these conserved genes may modulate the function of the metabolic pathway. Such variation in the metabolic pathways of amino acids is important as it can significantly alter the flavor of fermented soy sauce.
Kourennaia, Olga V; Tsujikawa, Laura; Dehaseth, Pieter L
2005-10-01
Upon the exposure of Escherichia coli to high temperature (heat shock), cellular levels of the transcription factor sigma32 rise greatly, resulting in the increased formation of the sigma32 holoenzyme, which is capable of transcription initiation at heat shock promoters. Higher levels of heat shock proteins render the cell better able to cope with the effects of higher temperatures. To conduct structure-function studies on sigma32 in vivo, we have carried out site-directed mutagenesis and employed a previously developed system involving sigma32 expression from one plasmid and a beta-galactosidase reporter gene driven by the sigma32-dependent groE promoter on another in order to monitor the effects of single amino acid substitutions on sigma32 activity. It was found that the recognition of the -35 region involves similar amino acid residues in regions 4.2 of E. coli sigma32 and sigma70. Three conserved amino acids in region 2.3 of sigma32 were found to be only marginally important in determining activity in vivo. Differences between sigma32 and sigma70 in the effects of mutation in region 2.4 on the activities of the two sigma factors are consistent with the pronounced differences between both the amino acid sequences in this region and the recognized promoter DNA sequences.
Liang, Xiao; Gao, Jian; Li, Dapeng; Cao, Xiaojuan
2016-12-02
Peroxisome proliferator-activated receptor alpha1 and alpha2 (PPARα1 and PPARα2) were investigated in loach (Misgurnus anguillicaudatus) by RACE (rapid amplification of cDNA ends) and qPCR (real-time quantitative PCR) for the first time. The cDNA sequences of PPARα1 and PPARα2 were 2042 bp and 2407 bp, encoding 467 and 465 amino acids, respectively. Sequence alignment of the deduced amino acid sequences showed significant homology between the two subtypes of PPARα, with 70% identity. The two genes showed appreciable changes in transcription during early life stages of the loach, and the highest transcription levels of both genes appeared after hatching. PPARα1 was predominantly expressed in liver, while PPARα2 was markedly expressed in heart. The regulation of PPARα1 and PPARα2 expression in response to dietary fatty acids was determined in livers of loaches fed diets containing fish oil (FO group) or soybean oil (SO group) for 75 days. The expression level of PPARα1 in the FO group was significantly higher than that in the SO group (P < 0.01), and the expression level of PPARα2 in the FO group was also significantly higher than that in the SO group (P < 0.05). There was no significant difference in expression level between PPARα1 and PPARα2 in the SO group, whereas there was a significant difference in the FO group. These results indicated that lipid sources could regulate the expression of these two genes in the loach. Our results will provide opportunities to better understand the functional characterization of PPARα1 and PPARα2 in further studies. Copyright © 2016 Elsevier Inc. All rights reserved.
Porcine MYF6 gene: sequence, homology analysis, and variation in the promoter region.
Wyszyńska-Koko, J; Kurył, J
2004-01-01
The MYF6 gene codes for a bHLH transcription factor belonging to the MyoD family. Its expression accompanies the processes of differentiation and maturation of myotubes during embryogenesis and continues at a relatively high level after birth, affecting the muscle phenotype. The porcine MYF6 gene was amplified, sequenced and compared with MYF6 gene sequences of other species. The amino acid sequence was deduced and an interspecies homology analysis was performed. The Myf-6 protein is highly conserved among species, with 99 and 97% identity when comparing pig with cow and human, respectively, and 93% when comparing pig with mouse and rat. A single nucleotide polymorphism (SNP) was revealed within the promoter region, which appeared to be a T → C transition recognized by the MspI restriction enzyme.
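MspI cuts at the sequence CCGG, so a T → C transition can be genotyped by restriction digestion when one allele completes that site and the other does not. The sketch below illustrates the principle on two hypothetical 12-base fragments; they are not the porcine MYF6 promoter sequence, and the assumption that the C allele is the one carrying the site is made only for illustration.

```python
MSPI_SITE = "CCGG"  # MspI recognition sequence


def mspi_sites(seq: str) -> list[int]:
    """Return 0-based start positions of every CCGG in seq."""
    seq = seq.upper()
    width = len(MSPI_SITE)
    return [i for i in range(len(seq) - width + 1) if seq[i:i + width] == MSPI_SITE]


# Hypothetical fragments differing by a single T -> C transition.
allele_t = "AGGTCTGGATCA"  # ...CTGG...: no MspI site
allele_c = "AGGTCCGGATCA"  # ...CCGG...: MspI site present

print(mspi_sites(allele_t))
print(mspi_sites(allele_c))
```

In a PCR-RFLP assay, the allele carrying the site would be cut into two fragments while the other would remain intact, which is what allows the two alleles to be told apart on a gel.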
T box riboswitches in Actinobacteria: Translational regulation via novel tRNA interactions
Sherwood, Anna V.; Grundy, Frank J.; Henkin, Tina M.
2015-01-01
The T box riboswitch regulates many amino acid-related genes in Gram-positive bacteria. T box riboswitch-mediated gene regulation was shown previously to occur at the level of transcription attenuation via structural rearrangements in the 5′ untranslated (leader) region of the mRNA in response to binding of a specific uncharged tRNA. In this study, a novel group of isoleucyl-tRNA synthetase gene (ileS) T box leader sequences found in organisms of the phylum Actinobacteria was investigated. The Stem I domains of these RNAs lack several highly conserved elements that are essential for interaction with the tRNA ligand in other T box RNAs. Many of these RNAs were predicted to regulate gene expression at the level of translation initiation through tRNA-dependent stabilization of a helix that sequesters a sequence complementary to the Shine–Dalgarno (SD) sequence, thus freeing the SD sequence for ribosome binding and translation initiation. We demonstrated specific binding to the cognate tRNAIle and tRNAIle-dependent structural rearrangements consistent with regulation at the level of translation initiation, providing the first biochemical demonstration, to our knowledge, of translational regulation in a T box riboswitch. PMID:25583497
Transcriptional regulation of fatty acid biosynthesis in mycobacteria
Mondino, S.; Gago, G.; Gramajo, H.
2013-01-01
SUMMARY The main purpose of our study is to understand how mycobacteria exert control over the biosynthesis of their membrane lipids and to identify the key components of the regulatory network that controls fatty acid biosynthesis at the transcriptional level. In this paper we describe the identification and purification of FasR, a transcriptional regulator from Mycobacterium sp. that controls the expression of the fatty acid synthase (fas) and the 4-phosphopantetheinyl transferase (acpS) encoding genes, whose products are involved in the fatty acid and mycolic acid biosynthesis pathways. In vitro studies demonstrated that the fas and acpS genes are part of the same transcriptional unit and that FasR specifically binds to three conserved operator sequences present in the fas-acpS promoter region (Pfas). The construction and further characterization of a fasR conditional mutant confirmed that FasR is a transcriptional activator of the fas-acpS operon and that this protein is essential for mycobacterial viability. Furthermore, the combined use of Pfas-lacZ fusions in different fasR backgrounds and electrophoretic mobility shift assay experiments strongly suggested that long-chain acyl-CoAs are the effector molecules that modulate the affinity of FasR for its DNA binding sequences and therefore the expression of the essential fas-acpS operon. PMID:23721164
DOE Office of Scientific and Technical Information (OSTI.GOV)
Deutscher, J.; Pevec, B.; Beyreuther, K.
1986-10-21
The amino acid sequence of histidine-containing protein (HPr) from Streptococcus faecalis has been determined by direct Edman degradation of intact HPr and by amino acid sequence analysis of tryptic peptides, V8 proteolytic peptides, thermolytic peptides, and cyanogen bromide cleavage products. HPr from S. faecalis was found to contain 89 amino acid residues, corresponding to a molecular weight of 9438. The amino acid sequence of HPr from S. faecalis shows extended homology to the primary structure of HPr proteins from other bacteria. Besides the phosphoenolpyruvate-dependent phosphorylation of a histidyl residue in HPr, catalyzed by enzyme I of the bacterial phosphotransferase system, HPr was also found to be phosphorylated at a seryl residue in an ATP-dependent protein kinase catalyzed reaction. The site of ATP-dependent phosphorylation in HPr of S. faecalis has now been determined. [32P]P-Ser-HPr was digested with three different proteases, and in each case, a single labeled peptide was isolated. Following digestion with subtilisin, they obtained a peptide with the sequence -(P)Ser-Ile-Met-. Using chymotrypsin, they isolated a peptide with the sequence -Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-Gly-Val-Met-. The longest labeled peptide was obtained with V8 staphylococcal protease. According to amino acid analysis, this peptide contained 36 out of the 89 amino acid residues of HPr. The following sequence of 12 amino acid residues of the V8 peptide was determined: -Tyr-Lys-Gly-Lys-Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-. Thus, the site of ATP-dependent phosphorylation was determined to be Ser-46 within the primary structure of HPr.
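The peptide overlaps reported in this abstract can be checked mechanically. The following is a minimal sketch, not the authors' procedure: it merges the V8 and chymotryptic fragments at their shared residues and locates the phosphoserine. The label `pSer` for (P)Ser and the helper names are assumptions made for illustration; the inferred start residue simply follows from the Ser-46 assignment stated in the abstract.

```python
def merge_overlapping(left, right):
    """Merge two residue lists where a suffix of `left` equals a prefix of `right`."""
    for k in range(min(len(left), len(right)), 0, -1):
        if left[-k:] == right[:k]:
            return left + right[k:]
    raise ValueError("fragments do not overlap")


def contains(seq, sub):
    """Return True if `sub` occurs as a contiguous run inside `seq`."""
    return any(seq[i:i + len(sub)] == sub for i in range(len(seq) - len(sub) + 1))


# Fragments as quoted in the abstract; "pSer" is a hypothetical label for (P)Ser.
v8 = "Tyr Lys Gly Lys Ser Val Asn Leu Lys pSer Ile Met".split()
chymo = "Ser Val Asn Leu Lys pSer Ile Met Gly Val Met".split()
subtilisin = "pSer Ile Met".split()

merged = merge_overlapping(v8, chymo)

# If the phosphoserine is Ser-46 (as the abstract states), the first residue
# of the merged fragment would be residue 46 minus its 0-based index.
implied_start = 46 - merged.index("pSer")

print("-".join(merged))
print("subtilisin peptide consistent:", contains(merged, subtilisin))
print("implied first residue number:", implied_start)
```

All three labeled peptides share the same (P)Ser-Ile-Met core, which is why a single phosphorylation site can be assigned.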
Selection in Europeans on Fatty Acid Desaturases Associated with Dietary Changes
Buckley, Matthew T.; Racimo, Fernando; Allentoft, Morten E.; Jensen, Majken K.; Jonsson, Anna; Huang, Hongyan; Hormozdiari, Farhad; Sikora, Martin; Marnetto, Davide; Eskin, Eleazar; Jørgensen, Marit E.; Grarup, Niels; Pedersen, Oluf; Hansen, Torben; Kraft, Peter; Willerslev, Eske
2017-01-01
Abstract FADS genes encode fatty acid desaturases that are important for the conversion of short chain polyunsaturated fatty acids (PUFAs) to long chain fatty acids. Prior studies indicate that the FADS genes have been subjected to strong positive selection in Africa, South Asia, Greenland, and Europe. By comparing FADS sequencing data from present-day and Bronze Age (5–3k years ago) Europeans, we identify possible targets of selection in the European population, which suggest that selection has targeted different alleles in the FADS genes in Europe than it has in South Asia or Greenland. The alleles showing the strongest changes in allele frequency since the Bronze Age show associations with expression changes and multiple lipid-related phenotypes. Furthermore, the selected alleles are associated with a decrease in linoleic acid and an increase in arachidonic and eicosapentaenoic acids among Europeans; this is an opposite effect of that observed for selected alleles in Inuit from Greenland. We show that multiple SNPs in the region affect expression levels and PUFA synthesis. Additionally, we find evidence for a gene–environment interaction influencing low-density lipoprotein (LDL) levels between alleles affecting PUFA synthesis and PUFA dietary intake: carriers of the derived allele display lower LDL cholesterol levels with a higher intake of PUFAs. We hypothesize that the selective patterns observed in Europeans were driven by a change in dietary composition of fatty acids following the transition to agriculture, resulting in a lower intake of arachidonic acid and eicosapentaenoic acid, but a higher intake of linoleic acid and α-linolenic acid. PMID:28333262
Methods and compositions for regulating gene expression in plant cells
NASA Technical Reports Server (NTRS)
Dai, Shunhong (Inventor); Beachy, Roger N. (Inventor); Luis, Maria Isabel Ordiz (Inventor)
2010-01-01
Novel chimeric plant promoter sequences are provided, together with plant gene expression cassettes comprising such sequences. In certain preferred embodiments, the chimeric plant promoters comprise the BoxII cis element and/or derivatives thereof. In addition, novel transcription factors are provided, together with nucleic acid sequences encoding such transcription factors and plant gene expression cassettes comprising such nucleic acid sequences. In certain preferred embodiments, the novel transcription factors comprise the acidic domain, or fragments thereof, of the RF2a transcription factor. Methods for using the chimeric plant promoter sequences and novel transcription factors in regulating the expression of at least one gene of interest are provided, together with transgenic plants comprising such chimeric plant promoter sequences and novel transcription factors.
The complete amino acid sequence of human skeletal-muscle fructose-bisphosphate aldolase.
Freemont, P S; Dunbar, B; Fothergill-Gilmore, L A
1988-01-01
The complete amino acid sequence of human skeletal-muscle fructose-bisphosphate aldolase, comprising 363 residues, was determined. The sequence was deduced by automated sequencing of CNBr-cleavage, o-iodosobenzoic acid-cleavage, trypsin-digest and staphylococcal-proteinase-digest fragments. Comparison of the sequence with other class I aldolase sequences shows that the mammalian muscle isoenzyme is one of the most highly conserved enzymes known, with only about 2% of the residues changing per 100 million years. Non-mammalian aldolases appear to be evolving at the same rate as other glycolytic enzymes, with about 4% of the residues changing per 100 million years. Secondary-structure predictions are analysed in an accompanying paper [Sawyer, Fothergill-Gilmore & Freemont (1988) Biochem. J. 249, 789-793]. PMID:3355497
DOE Office of Scientific and Technical Information (OSTI.GOV)
Claffey, K.P.; Herrera, V.L.; Brecher, P.
1987-12-01
A fatty acid binding protein (FABP) has been identified and characterized in rat heart, but the function and regulation of this protein are unclear. In this study the cDNA for rat heart FABP was cloned from a lambda gt11 library. Sequencing of the cDNA showed an open reading frame coding for a protein with 133 amino acids and a calculated size of 14,776 daltons. Several differences were found between the sequence determined from the cDNA and that reported previously by protein sequencing techniques. Northern blot analysis using rat heart FABP cDNA as a probe established the presence of an abundant mRNA in rat heart about 0.85 kilobases in length. This mRNA was detected, but was not abundant, in fetal heart tissue. Tissue distribution studies showed a similar mRNA species in red, but not white, skeletal muscle. In general, the mRNA tissue distribution was similar to that of the protein detected by Western immunoblot analysis, suggesting that heart FABP expression may be regulated at the transcriptional level. S1 nuclease mapping studies confirmed that the mRNA hybridized to rat heart FABP cDNA was identical in heart and red skeletal muscle throughout the entire open reading frame. The structural differences between heart FABP and other members of this multigene family may be related to the functional requirements of oxidative muscle for fatty acids as a fuel source.
Putative Porin of Bradyrhizobium sp. (Lupinus) Bacteroids Induced by Glyphosate
de María, Nuria; Guevara, Ángeles; Serra, M. Teresa; García-Luque, Isabel; González-Sama, Alfonso; de Lacoba, Mario García; de Felipe, M. Rosario; Fernández-Pascual, Mercedes
2007-01-01
Application of glyphosate (N-[phosphonomethyl] glycine) to Bradyrhizobium sp. (Lupinus)-nodulated lupin plants caused modifications in the protein pattern of bacteroids. The most significant change was the presence of a 44-kDa polypeptide in bacteroids from plants treated with the higher doses of glyphosate employed (5 and 10 mM). The polypeptide has been characterized by the amino acid sequencing of its N terminus and the isolation and nucleic acid sequencing of its encoding gene. It is putatively encoded by a single gene, and the protein has been identified as a putative porin. Protein modeling revealed the existence of several domains sharing similarity to different porins, such as a transmembrane beta-barrel. The protein has been designated BLpp, for Bradyrhizobium sp. (Lupinus) putative porin, and would be the first porin described in Bradyrhizobium sp. (Lupinus). In addition, a putative conserved domain of porins has been identified which consists of 87 amino acids, located in the BLpp sequence 30 amino acids downstream of the N-terminal region. In bacteroids, mRNA of the BLpp gene shows a basal constitutive expression that increases under glyphosate treatment, and the expression of the gene is seemingly regulated at the transcriptional level. By contrast, in free-living bacteria glyphosate treatment leads to an inhibition of BLpp mRNA accumulation, indicating a different effect of glyphosate on BLpp gene expression in bacteroids and free-living bacteria. The possible role of BLpp in a metabolite interchange between Bradyrhizobium and lupin is discussed. PMID:17557843
Pseudoclavibacter caeni sp. nov., isolated from sludge of a sewage disposal plant.
Srinivasan, Sathiyaraj; Kim, Hyun Sook; Kim, Myung Kyum; Lee, Myungjin
2012-04-01
A Gram-positive, strictly aerobic, rod-shaped, non-motile bacterial strain, designated MJ28T, was isolated from a sludge sample from the Daejeon sewage disposal plant in South Korea. A polyphasic approach was applied to study the taxonomic position of strain MJ28T. Strain MJ28T showed highest 16S rRNA gene sequence similarity to Pseudoclavibacter soli KP02T (95.2 %). Levels of 16S rRNA gene sequence similarity to the type strains of other Pseudoclavibacter species were less than 94.0 %. Phylogenetic analysis based on 16S rRNA gene sequences showed that strain MJ28T belonged to the clade formed by members of the genus Pseudoclavibacter in the family Microbacteriaceae. The G+C content of the genomic DNA of strain MJ28T was 65.8 mol%. The chemotaxonomic characteristics of strain MJ28T showed features typical of the genus Pseudoclavibacter, with MK-9 as the predominant respiratory quinone, 2,4-diaminobutyric acid as the diamino acid in the peptidoglycan, and anteiso-C17:0 (44.6 %), anteiso-C15:0 (35.7 %) and C16:0 (9.5 %) as the major fatty acids. On the basis of phylogenetic inference, fatty acid profile and other phenotypic properties, strain MJ28T is considered to represent a novel species of the genus Pseudoclavibacter, for which the name Pseudoclavibacter caeni sp. nov. is proposed. The type strain is MJ28T (=KCTC 19773T=JCM 16921T).
Mouw, M; Pintel, D J
1998-11-10
GST-NS1 purified from Escherichia coli and insect cells binds double-strand DNA in an (ACCA)2-3-dependent fashion under similar ionic conditions, independent of the presence of anti-NS1 antisera or exogenously supplied ATP and interacts with single-strand DNA and RNA in a sequence-independent manner. An amino-terminal domain (amino acids 1-275) of NS1 [GST-NS1(1-275)], representing 41% of the full-length NS1 molecule, includes a domain that binds double-strand DNA in a sequence-specific manner at levels comparable to full-length GST-NS1, as well as single-strand DNA and RNA in a sequence-independent manner. The deletion of 15 additional amino-terminal amino acids yielded a molecule [GST-NS1(16-275)] that maintained (ACCA)2-3-specific double-strand DNA binding; however, this molecule was more sensitive to increasing ionic conditions than full-length GST-NS1 and GST-NS1(1-275) and could not be demonstrated to bind single-strand nucleic acids. A quantitative filter binding assay showed that E. coli- and baculovirus-expressed GST-NS1 and E. coli GST-NS1(1-275) specifically bound double-strand DNA with similar equilibrium kinetics [as measured by their apparent equilibrium DNA binding constants (KD)], whereas GST-NS1(16-275) bound 4- to 8-fold less well. Copyright 1998 Academic Press.
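As an illustration of what an (ACCA)2-3 element looks like in sequence data, here is a minimal sketch that scans one DNA strand for two or three tandem copies of ACCA. Treating the motif as uninterrupted tandem repeats on the given strand is an assumption made for this example, and the test sequences are invented.

```python
import re

# Two or three uninterrupted tandem copies of ACCA (assumed reading of "(ACCA)2-3").
ACCA_REPEAT = re.compile(r"(?:ACCA){2,3}")


def find_acca_repeats(seq):
    """Return (0-based start, matched text) for each (ACCA)2-3 run in `seq`."""
    return [(m.start(), m.group()) for m in ACCA_REPEAT.finditer(seq.upper())]


# Invented example sequences, not taken from the paper.
print(find_acca_repeats("ttACCAACCAgg"))
print(find_acca_repeats("GGACCAACCAACCATT"))
print(find_acca_repeats("GGACCATT"))
```

A real search would normally scan the reverse complement as well, since a double-strand binding site can be read from either strand.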
Cloning and sequencing of the allophycocyanin genes from Spirulina maxima (Cyanophyta)
NASA Astrophysics Data System (ADS)
Qin, Song; Hiroyuki, Kojima; Yoshikazu, Kawata; Shin-Ichi, Yano; Zeng, Cheng-Kui
1998-03-01
The genes coding for the α- and β-subunits of allophycocyanin (apcA and apcB) from the cyanophyte Spirulina maxima were cloned and sequenced. The results revealed 44.4% of nucleotide sequence similarity and 30.4% of similarity of deduced amino acid sequence between them. The amino acid sequence identities between S. maxima and S. platensis are 99.4% for α subunit and 100% for β subunit.
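Percent identity figures such as those quoted above are computed from a pairwise alignment. The sketch below shows one common convention (matches divided by the number of aligned, ungapped columns); the abstract does not state which convention the authors used, and the two short sequences are invented for the example.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns where neither aligned sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != gap and y != gap]
    if not pairs:
        return 0.0
    matches = sum(x == y for x, y in pairs)
    return 100.0 * matches / len(pairs)


# Invented toy alignment: 4 matches over 5 ungapped columns.
print(percent_identity("MKT-LV", "MKSALV"))
```

Other conventions divide by the full alignment length or by the shorter sequence length, so reported percentages are only comparable when the denominator is the same.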
Wang, Zhiwei; Qiao, Yan; Zhang, Jingjing; Shi, Wenhui; Zhang, Jinwen
2017-07-01
Rapeseed (Brassica napus) is an important cash crop considered the third largest oil crop worldwide. Rapeseed oil contains various saturated and unsaturated fatty acids; these fatty acids, which can be incorporated into TAG to form the lipids stored in seeds, play various roles in metabolic activity. The different fatty acids in B. napus seeds determine oil quality and define whether the oil is edible or must be used as industrial material. miRNAs are a kind of non-coding sRNA that can regulate gene expression through post-transcriptional modification of their target transcripts, playing important roles in plant metabolic activities. We employed high-throughput sequencing to identify the miRNAs and their target transcripts involved in fatty acid and lipid metabolism at different developmental stages of B. napus seeds. As a result, we identified 826 miRNA sequences, including 523 conserved and 303 novel miRNAs. From the degradome sequencing, we found that 589 mRNAs could be targeted by 236 miRNAs, including 49 novel miRNAs and 187 conserved miRNAs. The miRNA-target pairs suggest that bna-5p-163957_18, bna-5p-396192_7, miR9563a-p3, miR9563b-p5, miR838-p3, miR156e-p3, miR159c and miR1134 could target PDP, LACS9, MFPA, ADSL1, ACO32, C0401, GDL73, PlCD6, OLEO3 and WSD1. These target transcripts are involved in acetyl-CoA generation and carbon chain desaturation, regulation of the levels of very long chain fatty acids, β-oxidation, and lipid transport and metabolism. At the same time, we employed q-PCR to validate the expression of the miRNAs and their target transcripts involved in fatty acid and lipid metabolism; the results suggested that the expression of the miRNAs and that of their target transcripts are negatively correlated, in accord with the expected relationship between a miRNA and its target transcript. The study findings suggest that the identified miRNAs may play important roles in fatty acid and lipid metabolism in seeds of B. napus. Copyright © 2017 The Author(s). Published by Elsevier B.V. 
All rights reserved.
Complete Genome Sequence of Lymphocystis Disease Virus Isolated from China
Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang
2004-01-01
Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. 
The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1. PMID:15194775
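The genome statistics quoted in this abstract (G+C percentage, coding density) are simple ratios. The sketch below shows how such figures are typically computed; the function names are hypothetical, the toy sequence is invented, and applying the 702 bp average to the 176 nonoverlapping genes is an assumption, since the abstract does not say which ORF set the average refers to.

```python
def gc_percent(seq):
    """Percentage of G and C among unambiguous bases (A, C, G, T) in `seq`."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


def coding_density_percent(n_genes, mean_gene_length_bp, genome_length_bp):
    """Percentage of the genome covered by nonoverlapping genes."""
    return 100.0 * n_genes * mean_gene_length_bp / genome_length_bp


# Invented toy sequence: 2 of 6 bases are G or C.
print(round(gc_percent("ATGCAT"), 2))

# Figures from the abstract, under the stated assumption about the 702 bp average.
print(round(coding_density_percent(176, 702, 186_250), 1))
```

Under that assumption the estimate comes out at about 66%, close to the 67% coding density reported in the abstract.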
Use of linalool synthase in genetic engineering of scent production
Pichersky, E.
1998-12-15
A purified S-linalool synthase polypeptide from Clarkia breweri is disclosed as is the recombinant polypeptide and nucleic acid sequences encoding the polypeptide. Also disclosed are antibodies immunoreactive with the purified peptide and with recombinant versions of the polypeptide. Methods of using the nucleic acid sequences, as well as methods of enhancing the smell and the flavor of plants expressing the nucleic acid sequences are also disclosed. 5 figs.
Urade, Y; Oberdick, J; Molinar-Rode, R; Morgan, J I
1991-01-01
The cerebellum contains a hexadecapeptide, termed cerebellin, that is conserved in sequence from human to chicken. Three independent, overlapping cDNA clones have been isolated from a human cerebellum cDNA library that encode the cerebellin sequence. The longest clone codes for a protein of 193 amino acids that we term precerebellin. This protein has a significant similarity (31.3% identity, 52.2% similarity) to the globular (non-collagen-like) region of the B chain of human complement component C1q. The region of relatedness extends over approximately 145 amino acids located in the carboxyl terminus of both proteins. Unlike C1q B chain, no collagen-like motifs are present in the amino-terminal regions of precerebellin. The amino terminus of precerebellin contains three possible N-linked glycosylation sites. Although hydrophobic amino acids are clustered at the amino terminus, they do not conform to the classical signal-peptide motif, and no other obvious membrane-spanning domains are predicted from the cDNA sequence. The cDNA predicts that the cerebellin peptide is flanked by Val-Arg and Glu-Pro residues. Therefore, cerebellin is not liberated from precerebellin by the classical dibasic amino acid proteolytic-cleavage mechanism seen in many neuropeptide precursors. In Northern (RNA) blots, precerebellin transcripts, with four distinct sizes (1.8, 2.3, 2.7, and 3.0 kilobases), are abundant in cerebellum. These transcripts are present at either very low or undetectable levels in other brain areas and extraneural structures. A similar pattern of cerebellin precursor transcripts are seen in rat, mouse, and human cerebellum. Furthermore, a partial genomic fragment from mouse shows the same bands in Northern blots as the human cDNA clone. During rat development, precerebellin transcripts mirror the level of cerebellin peptide. Low levels of precerebellin mRNA are seen at birth. 
Levels increase modestly from postpartum day 1 to 8, then increase more dramatically between day 5 and 15, and eventually reach peak values between day 21 and 56. Because cerebellin-like immunoreactivity is associated with Purkinje cell postsynaptic structures, these data raise interesting possibilities concerning the function of the cerebellin precursor in synaptic physiology. PMID:1704129
Sequence Diversity Diagram for comparative analysis of multiple sequence alignments.
Sakai, Ryo; Aerts, Jan
2014-01-01
The sequence logo is a graphical representation of a set of aligned sequences, commonly used to depict conservation of amino acid or nucleotide sequences. Although it effectively communicates the amount of information present at every position, this visual representation falls short when the domain task is to compare between two or more sets of aligned sequences. We present a new visual presentation called a Sequence Diversity Diagram and validate our design choices with a case study. Our software was developed using the open-source program called Processing. It loads multiple sequence alignment FASTA files and a configuration file, which can be modified as needed to change the visualization. The redesigned figure improves on the visual comparison of two or more sets, and it additionally encodes information on sequential position conservation. In our case study of the adenylate kinase lid domain, the Sequence Diversity Diagram reveals unexpected patterns and new insights, for example the identification of subgroups within the protein subfamily. Our future work will integrate this visual encoding into interactive visualization tools to support higher level data exploration tasks.
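For context, the "amount of information present at every position" that a sequence logo displays is usually the maximum possible entropy minus the observed column entropy, in bits. The sketch below computes that quantity for an invented nucleotide alignment. It omits the small-sample correction and is not the encoding used by the Sequence Diversity Diagram itself, which the abstract does not specify.

```python
import math
from collections import Counter


def column_information(column, alphabet_size, gap="-"):
    """Information content in bits: log2(alphabet size) minus Shannon entropy."""
    symbols = [c for c in column if c != gap]
    if not symbols:
        return 0.0
    n = len(symbols)
    entropy = -sum((k / n) * math.log2(k / n) for k in Counter(symbols).values())
    return math.log2(alphabet_size) - entropy


def alignment_information(sequences, alphabet_size):
    """Per-column information content for equal-length aligned sequences."""
    return [column_information(col, alphabet_size) for col in zip(*sequences)]


# Invented toy alignment; use alphabet_size=20 for amino acid alignments.
toy = ["ACGT", "ACGA", "ACCA", "ATCA"]
print([round(x, 3) for x in alignment_information(toy, alphabet_size=4)])
```

A fully conserved nucleotide column scores 2 bits and a column split evenly between two bases scores 1 bit, which is why a logo alone makes it hard to compare two alignments position by position.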
Perczel, András; Jákli, Imre; McAllister, Michael A; Csizmadia, Imre G
2003-06-06
Folding properties of small globular proteins are determined by their amino acid sequence (primary structure). This holds both for local (secondary structure) and for global conformational features of linear polypeptides and proteins composed from natural amino acid derivatives. It thus provides the rational basis of structure prediction algorithms. The shortest secondary structure element, the beta-turn, most typically adopts either a type I or a type II form, depending on the amino acid composition. Herein we investigate the sequence-dependent folding stability of both major types of beta-turns using simple dipeptide models (-Xxx-Yyy-). Gas-phase ab initio properties of 16 carefully selected and suitably protected dipeptide models (for example Val-Ser, Ala-Gly, Ser-Ser) were studied. For each backbone fold most probable side-chain conformers were considered. Fully optimized 3-21G RHF molecular structures were employed in medium level [B3LYP/6-311++G(d,p)//RHF/3-21G] energy calculations to estimate relative populations of the different backbone conformers. Our results show that the preference for beta-turn forms as calculated by quantum mechanics and observed in X-ray determined proteins correlates significantly.
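One standard way to turn relative conformer energies into relative populations is Boltzmann weighting. The sketch below assumes that approach; the abstract does not describe the authors' exact procedure, the energies shown are invented, and entropy and degeneracy effects are ignored.

```python
import math

R_KCAL_PER_MOL_K = 0.0019872041  # gas constant in kcal/(mol*K)


def boltzmann_populations(relative_energies_kcal, temperature_k=298.15):
    """Fractional populations from relative energies via Boltzmann weights."""
    e_min = min(relative_energies_kcal)
    rt = R_KCAL_PER_MOL_K * temperature_k
    weights = [math.exp(-(e - e_min) / rt) for e in relative_energies_kcal]
    total = sum(weights)
    return [w / total for w in weights]


# Invented example: a second conformer lying 1.0 kcal/mol above the first.
populations = boltzmann_populations([0.0, 1.0])
print([round(p, 3) for p in populations])
```

At room temperature a gap of 1 kcal/mol already shifts the balance to roughly 84:16, so small energy differences between turn types translate into clear population preferences.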
Payne, G; Ahl, P; Moyer, M; Harper, A; Beck, J; Meins, F; Ryals, J
1990-01-01
Complementary DNA clones encoding two isoforms of the acidic endochitinase (chitinase, EC 3.2.1.14) from tobacco were isolated. Comparison of amino acid sequences deduced from the cDNA clones and the sequence of peptides derived from purified proteins show that these clones encode the pathogenesis-related proteins PR-P and PR-Q. The cDNA inserts were not homologous to either the bacterial form of chitinase or the form from cucumber but shared significant homology to the basic form of chitinase from tobacco and bean. The acidic isoforms of tobacco chitinase did not contain the amino-terminal, cysteine-rich "hevein" domain found in the basic isoforms, indicating that this domain, which binds chitin, is not essential for chitinolytic activity. The accumulation of mRNA for the pathogenesis-related proteins PR-1, PR-R, PR-P, and PR-Q in Xanthi.nc tobacco leaves following infection with tobacco mosaic virus was measured by primer extension. The results indicate that the induction of these proteins during the local necrotic lesion response to the virus is coordinated at the mRNA level. PMID:2296608
Probe kit for identifying a base in a nucleic acid
Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua
2001-01-01
Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Crotoxin: Structural Studies, Mechanism of Action and Cloning of its Gene
1988-03-01
thirteen amino acids being acidic. Sequencing of the three peptides present in the acidic subunit, two of which are blocked by pyroglutamate ... the sequence determination of both the basic and acidic subunits of crotoxin. The acidic subunit peptides were difficult, since two of ... fluorescence spectroscopy. Results indicate a large conformational change occurs upon complex formation between the acidic and basic subunits of all four
Kaiya, Hiroyuki; Sakata, Ichiro; Kojima, Masayasu; Hosoda, Hiroshi; Sakai, Takafumi; Kangawa, Kenji
2004-08-01
We purified ghrelin peptide and determined the cDNA sequence encoding the precursor protein from the stomach of the red-eared slider turtle, Trachemys scripta elegans. The Trachemys ghrelin is composed of 25 amino acids and has the sequence GSSFLSPEYQNTQQRKDPKKHTKLN. The third serine residue was modified by n-octanoic (C8:0), decanoic (C10:0) or unsaturated decanoic acid (C10:1). The carboxyl-terminal end of the peptide was not amidated, as seen in the ghrelins of other land vertebrates. Quantitative real-time PCR analysis revealed high levels of gene expression in the stomach and moderate levels in the large intestine and pancreas. Histochemical studies of turtle stomach revealed that ghrelin-immunopositive (ghrelin-ip) cells, which were small and round, were observed in the mucosal layer of the stomach but not in the myenteric plexus, and ghrelin-mRNA-expressing (ghrelin-ex) cells detected by in situ hybridization were scattered in a similar distribution as ghrelin-ip cells. These results indicate that ghrelin is present in reptiles.
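The sequence given in this abstract is short enough to check directly. The following minimal sketch confirms the stated length and the identity of the acylated third residue; the helper name `residue_at` is hypothetical.

```python
# Peptide sequence as quoted in the abstract (one-letter amino acid codes).
TRACHEMYS_GHRELIN = "GSSFLSPEYQNTQQRKDPKKHTKLN"


def residue_at(sequence, position):
    """Return the residue at a 1-based position, as used in protein numbering."""
    if not 1 <= position <= len(sequence):
        raise IndexError("position outside the sequence")
    return sequence[position - 1]


print(len(TRACHEMYS_GHRELIN))            # number of residues
print(residue_at(TRACHEMYS_GHRELIN, 3))  # residue carrying the acyl modification
```

The 1-based helper avoids the off-by-one errors that arise when residue numbers from the literature are used as Python indices.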
te Biesebeke, Rob; Levasseur, Anthony; Boussier, Amandine; Record, Eric; van den Hondel, Cees A M J J; Punt, Peter J
2010-01-01
The fhbA genes encoding putative flavohemoglobins (FHb) from Aspergillus niger and Aspergillus oryzae were isolated. Comparison of the deduced amino acid sequence of the A. niger fhbA gene and other putative filamentous fungal FHb-encoding genes to that of Ralstonia eutropha shows an overall conserved gene structure and completely conserved catalytic amino acids. Several yeasts and filamentous fungi, including both Aspergillus species have been found to contain a small FHb gene family mostly consisting of two family members. Based on these sequences the evolutionary history of the fungal FHb family was reconstructed. The isolated fhbA genes from A. oryzae and A. niger belong to a phylogenetic group, which exclusively contains Aspergillus genes. Different experimental approaches show that fhbA transcript levels appear during active hyphal growth. Moreover, in a pclA-disrupted strain with a hyperbranching growth phenotype, the transcript levels of the fhbA gene were 2–5 times higher compared to the wild-type. These results suggest that FHb from filamentous fungi have a function that is correlated to the hyphal growth phenotype.
Zhang, Chong; Wang, Jin-gang; Yang, Ting
2006-06-01
To study the effects of Bushen Yin'ao Tablet (BSYNT) on physiology and cerebral gene expression in senescence-accelerated mice (SAM). The change of cerebral tissue mRNA expression in SAM was analyzed and compared by messenger ribonucleic acid reverse transcription differential display polymerase chain reaction (mRNA DDRT-PCR) between the medicated group and the control group. BSYNT could increase the level of hemoglobin (Hb) and the number of erythrocytes (RBC) in blood deficiency mice, and improve the spatial learning and memory function and the escape response to conditional stimulus. In this study, 14 differential display bands were discerned, and three of them were sequenced. The sequences of the three fragments were similar to fatty acid binding protein 7, ubiquinol-cytochrome C reductase complex (7.2 kD) and 60S ribosomal protein L21, respectively, with homology of 97%, 100%, and 99%, respectively. BSYNT affects the physiology of mice, and its effect on cerebral tissue mRNA expression may play an important role in anti-aging at the molecular level.
De novo peptide sequencing by deep learning
Tran, Ngoc Hieu; Zhang, Xianglilan; Xin, Lei; Shan, Baozhen; Li, Ming
2017-01-01
De novo peptide sequencing from tandem MS data is the key technology in proteomics for the characterization of proteins, especially for new sequences, such as mAbs. In this study, we propose a deep neural network model, DeepNovo, for de novo peptide sequencing. DeepNovo architecture combines recent advances in convolutional neural networks and recurrent neural networks to learn features of tandem mass spectra, fragment ions, and sequence patterns of peptides. The networks are further integrated with local dynamic programming to solve the complex optimization task of de novo sequencing. We evaluated the method on a wide variety of species and found that DeepNovo considerably outperformed state of the art methods, achieving 7.7–22.9% higher accuracy at the amino acid level and 38.1–64.0% higher accuracy at the peptide level. We further used DeepNovo to automatically reconstruct the complete sequences of antibody light and heavy chains of mouse, achieving 97.5–100% coverage and 97.2–99.5% accuracy, without assisting databases. Moreover, DeepNovo is retrainable to adapt to any sources of data and provides a complete end-to-end training and prediction solution to the de novo sequencing problem. Not only does our study extend the deep learning revolution to a new field, but it also shows an innovative approach in solving optimization problems by using deep learning and dynamic programming. PMID:28720701
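The two accuracy levels mentioned in this abstract can be illustrated with a simplified metric. The sketch below counts position-wise residue matches for the amino acid level and exact sequence matches for the peptide level. It is not DeepNovo's evaluation code: published de novo sequencing benchmarks typically match residues by mass and position, and the peptides shown are invented.

```python
def matched_residues(predicted, true):
    """Number of positions where predicted and true residues agree."""
    return sum(p == t for p, t in zip(predicted, true))


def sequencing_accuracy(predictions, truths):
    """Return (amino-acid-level, peptide-level) accuracy over paired peptides."""
    if len(predictions) != len(truths) or not truths:
        raise ValueError("need equally sized, non-empty lists")
    total_residues = sum(len(t) for t in truths)
    matched = sum(matched_residues(p, t) for p, t in zip(predictions, truths))
    exact = sum(p == t for p, t in zip(predictions, truths))
    return matched / total_residues, exact / len(truths)


# Invented example: one perfect prediction, one with a wrong final residue.
aa_acc, pep_acc = sequencing_accuracy(["PEPTIDE", "ACDK"], ["PEPTIDE", "ACDR"])
print(round(aa_acc, 3), pep_acc)
```

A single wrong residue costs little at the amino acid level but fails the whole peptide, which is why the two levels can differ widely for the same model.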
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.
Code of Federal Regulations, 2014 CFR
2014-07-01
... base or modified or unusual amino acid may be presented in a given sequence as the corresponding unmodified base or amino acid if the modified base or modified or unusual amino acid is one of those listed... the Feature section. Otherwise, each occurrence of a base or amino acid not appearing in WIPO Standard...
Akins, R A; Grant, D M; Stohl, L L; Bottorff, D A; Nargang, F E; Lambowitz, A M
1988-11-05
The Mauriceville and Varkud mitochondrial plasmids of Neurospora are closely related, closed circular DNAs (3.6 and 3.7 kb, respectively; 1 kb = 10(3) bases or base-pairs), whose characteristics suggest relationships to mitochondrial DNA introns and retrotransposons. Here, we characterized the structure of the Varkud plasmid, determined its complete nucleotide sequence and mapped its major transcripts. The Mauriceville and Varkud plasmids have more than 97% positional identity. Both plasmids contain a 710 amino acid open reading frame that encodes a reverse transcriptase-like protein. The amino acid sequence of this open reading frame is strongly conserved between the two plasmids (701/710 amino acids) as expected for a functionally important protein. Both plasmids have a 0.4 kb region that contains five PstI palindromes and a direct repeat of approximately 160 base-pairs. Comparison of sequences in this region suggests that the Varkud plasmid has diverged less from a common ancestor than has the Mauriceville plasmid. Two major transcripts of the Varkud plasmid were detected by Northern hybridization experiments: a full-length linear RNA of 3.7 kb and an additional prominent transcript of 4.9 kb, 1.2 kb longer than monomer plasmid. Remarkably, we find that the 4.9 kb transcript is a hybrid RNA consisting of the full-length 3.7 kb Varkud plasmid transcript plus a 5' leader of 1.2 kb that is derived from the 5' end of the mitochondrial small rRNA. This and other findings suggest that the Varkud plasmid, like certain RNA viruses, has a mechanism for joining heterologous RNAs to the 5' end of its major transcript, and that, under some circumstances, nucleotide sequences in mitochondria may be recombined at the RNA level.
Mouse Vk gene classification by nucleic acid sequence similarity.
Strohal, R; Helmberg, A; Kroemer, G; Kofler, R
1989-01-01
Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.
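The family criterion described above (members sharing greater than 80% nucleic acid sequence similarity) can be illustrated with a small sketch. The abstract does not state how pairwise similarity was computed or how genes were grouped, so the position-wise identity on pre-aligned sequences and the single-linkage grouping used here are assumptions, and the function names and toy sequences are hypothetical.

```python
from itertools import combinations


def percent_identity(a: str, b: str) -> float:
    """Percent of matching positions between two pre-aligned, equal-length sequences.

    Columns where either sequence has a gap ('-') are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    compared = [(x, y) for x, y in zip(a.upper(), b.upper()) if x != "-" and y != "-"]
    if not compared:
        return 0.0
    matches = sum(x == y for x, y in compared)
    return 100.0 * matches / len(compared)


def group_families(seqs: dict, threshold: float = 80.0) -> list:
    """Single-linkage grouping (an assumption): link any pair above the threshold."""
    parent = {name: name for name in seqs}

    def find(name):
        # Follow parent pointers to the representative of the group.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for a, b in combinations(seqs, 2):
        if percent_identity(seqs[a], seqs[b]) > threshold:
            parent[find(a)] = find(b)

    families = {}
    for name in seqs:
        families.setdefault(find(name), set()).add(name)
    return list(families.values())


# Toy aligned sequences, not real Vk genes.
toy = {
    "geneA": "ACGTACGTAC",
    "geneB": "ACGTACGTAA",  # 9 of 10 positions match geneA
    "geneC": "TTTTTTTTTT",
}
families = group_families(toy)
```

With single linkage, two genes can land in the same family through an intermediate member even if their own pairwise similarity is below the threshold, which is one reason families defined this way can be less cleanly separated than the cutoff suggests.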
NASA Astrophysics Data System (ADS)
Ferreira, M.; Creveling, J.; Hilburn, I.; Karlsson, E.; Pepe-Ranney, C.; Spear, J.; Dawson, S.; Geobio2008, I.
2008-12-01
Silicified structures that exhibit a putative biologic component in their formation permeate the rock record as stromatolites. We have studied a silicified microbial structure from a hot spring in Yellowstone National Park using phenotypic, phylogenetic, and metagenomic analyses to determine microbial carbon metabolic pathways and the phylogenetic affiliations of microbes present in this unique structure. In this multi-faceted approach, dominant physiologies, specifically with regard to anaerobic and aerobic metabolisms, were inferred from 16S rRNA gene sequences and 454 sequencing data from bulk DNA samples of the structure. Carbon utilization as indicated by ECO Biolog plates showed abundant heterotrophy and heterotrophic diversity throughout the microbial structure. Microbes within the structure are able to utilize all tested sources of carbohydrates, lipids/fatty acids, and protein/amino acids as carbon sources. ECO plate testing of the hot spring water yielded considerably less carbohydrate consumption (only 4 out of 13 tested carbohydrates) and similar lipids/fatty acids and protein/amino acids consumption (2 out of 3 and 5 out of 5 tested sources, respectively). Full-length 16S rRNA gene sequences and metagenomic 454 pyrosequencing of community DNA showed limited diversity among primary producers. From the 16S data, the majority of the autotrophs are inferred to utilize the Calvin cycle for CO2 fixation, followed by 3-hydroxypropionate/4-hydroxybutyrate CO2 fixation. However, an analysis of the metagenomic data compared to the KEGG database does not show genes directly involved with Calvin cycle carbon fixation. Further BLAST searches of our data failed to find significant matches within our 6514 metagenomic sequences to known RuBisCo sequences taken from the NCBI database. This is likely due to a far under-sampled dataset of metagenomic sequences, and the low number (958) that had matches to the KEGG pathways database.
Anaerobic versus aerobic physiology also can be estimated from the 16S clone libraries. Phylogenetic analysis of recovered 16S sequences suggests that 15% of the 16S sequences can be attributed to anaerobic microbes while 42% likely come from aerobes. The remaining 43% of 16S rRNA gene sequences belong to metabolically unassigned phyla both known and novel. This preliminary study demonstrates that the small spatially stratified silicified microbial structure present on the margins of a hot spring contains a rich and complex microbial community with different trophic levels and enzymatic pathways.
Relative Amino Acid Composition Signatures of Organisms and Environments
Moura, Alexandra; Savageau, Michael A.; Alves, Rui
2013-01-01
Background: Identifying organism-environment interactions at the molecular level is crucial to understanding how organisms adapt to and change the chemical and molecular landscape of their habitats. In this work we investigated whether relative amino acid compositions could be used as a molecular signature of an environment and whether such a signature could also be observed at the level of the cellular amino acid composition of the microorganisms that inhabit that environment. Methodologies/Principal Findings: To address these questions we collected and analyzed environmental amino acid determinations from the literature, and estimated from complete genomic sequences the global relative amino acid abundances of organisms that are cognate to the different types of environment. Environmental relative amino acid abundances clustered into broad groups (ocean waters, host-associated environments, grass land environments, sandy soils and sediments, and forest soils), indicating the presence of amino acid signatures specific for each environment. These signatures correlate to those found in organisms. Nevertheless, relative amino acid abundance of organisms was more influenced by GC content than habitat or phylogeny. Conclusions: Our results suggest that relative amino acid composition can be used as a signature of an environment. In addition, we observed that the relative amino acid composition of organisms is not highly determined by environment, reinforcing previous studies that find GC content to be the major factor correlating to amino acid composition in living organisms. PMID:24204807
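As an illustration of what a "relative amino acid abundance" estimated from protein sequences could look like, the following is a minimal sketch. It assumes abundance is simply each residue's share of all standard residues in a set of protein sequences; the abstract does not give the exact procedure, and the function name and toy input are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def relative_composition(proteins):
    """Return the fraction of each standard amino acid across all given sequences.

    Non-standard symbols (for example 'X' or '*') are ignored.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(c for c in seq.upper() if c in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no standard amino acids found")
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


# Toy proteome of two short sequences, not real data.
composition = relative_composition(["AAGG", "AG"])
```

Vectors of this kind, one per organism or environment, are the sort of input that could then be clustered or correlated against GC content.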
Methods for making nucleotide probes for sequencing and synthesis
Church, George M; Zhang, Kun; Chou, Joseph
2014-07-08
Compositions and methods for making a plurality of probes for analyzing a plurality of nucleic acid samples are provided. Compositions and methods for analyzing a plurality of nucleic acid samples to obtain sequence information in each nucleic acid sample are also provided.
Soil amino acid composition across a boreal forest successional sequence
Nancy R. Werdin-Pfisterer; Knut Kielland; Richard D. Boone
2009-01-01
Soil amino acids are important sources of organic nitrogen for plant nutrition, yet few studies have examined which amino acids are most prevalent in the soil. In this study, we examined the composition, concentration, and seasonal patterns of soil amino acids across a primary successional sequence encompassing a natural gradient of plant productivity and soil...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.
Code of Federal Regulations, 2014 CFR
2014-07-01
...” means those amino acids other than “Xaa” and those nucleotide bases other than “n”defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2014-07-01 2014-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.
Code of Federal Regulations, 2013 CFR
2013-07-01
...” means those amino acids other than “Xaa” and those nucleotide bases other than “n”defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2013-07-01 2013-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.
Code of Federal Regulations, 2012 CFR
2012-07-01
...” means those amino acids other than “Xaa” and those nucleotide bases other than “n”defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2012-07-01 2012-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...
Amino-terminal sequence of glycoprotein D of herpes simplex virus types 1 and 2
DOE Office of Scientific and Technical Information (OSTI.GOV)
Eisenberg, R.J.; Long, D.; Hogue-Angeletti, R.
1984-01-01
Glycoprotein D (gD) of herpes simplex virus is a structural component of the virion envelope which stimulates production of high titers of herpes simplex virus type-common neutralizing antibody. The authors carried out automated N-terminal amino acid sequencing studies on radiolabeled preparations of gD-1 (gD of herpes simplex virus type 1) and gD-2 (gD of herpes simplex virus type 2). Although some differences were noted, particularly in the methionine and alanine profiles for gD-1 and gD-2, the amino acid sequence of a number of the first 30 residues of the amino terminus of gD-1 and gD-2 appears to be quite similar. For both proteins, the first residue is a lysine. When we compared our sequence data for gD-1 with those predicted by nucleic acid sequencing, the two sequences could be aligned (with one exception) starting at residue 26 (lysine) of the predicted sequence. Thus, the first 25 amino acids of the predicted sequence are absent from the polypeptides isolated from infected cells.
Lampel, J S; Aphale, J S; Lampel, K A; Strohl, W R
1992-01-01
The gene encoding a novel milk protein-hydrolyzing proteinase was cloned on a 6.56-kb SstI fragment from Streptomyces sp. strain C5 genomic DNA into Streptomyces lividans 1326 by using the plasmid vector pIJ702. The gene encoding the small neutral proteinase (snpA) was located within a 2.6-kb BamHI-SstI restriction fragment that was partially sequenced. The molecular mass of the deduced amino acid sequence of the mature protein was determined to be 15,740, which corresponds very closely with the relative molecular mass of the purified protein (15,500) determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The N-terminal amino acid sequence of the purified neutral proteinase was determined, and the DNA encoding this sequence was found to be located within the sequenced DNA. The deduced amino acid sequence contains a conserved zinc binding site, although secondary ligand binding and active sites typical of thermolysinlike metalloproteinases are absent. The combination of its small size, deduced amino acid sequence, and substrate and inhibition profile indicate that snpA encodes a novel neutral proteinase. PMID:1569011
Irukayama-Tomobe, Yoko; Tanaka, Hirokazu; Yokomizo, Takehiko; Hashidate-Yoshida, Tomomi; Yanagisawa, Masashi; Sakurai, Takeshi
2009-03-10
GPR109B (HM74) is a putative G protein-coupled receptor (GPCR) whose cognate ligands have yet to be characterized. GPR109B shows a high degree of sequence similarity to GPR109A, another GPCR that was identified as a high-affinity nicotinic acid (niacin) receptor. However, the affinity of nicotinic acid to GPR109B is very low. In this study, we found that certain aromatic D-amino acids, including D-phenylalanine, D-tryptophan, and the metabolite of the latter, D-kynurenine, decreased the activity of adenylate cyclase in cells transfected with GPR109B cDNA through activation of pertussis toxin (PTX)-sensitive G proteins. These D-amino acids also elicited a transient rise of intracellular Ca(2+) level in cells expressing GPR109B in a PTX-sensitive manner. In contrast, these D-amino acids did not show any effects on cells expressing GPR109A. We found that the GPR109B mRNA is abundantly expressed in human neutrophils. D-phenylalanine and D-tryptophan induced a transient increase of intracellular Ca(2+) level and a reduction of cAMP levels in human neutrophils. Furthermore, knockdown of GPR109B by RNA interference inhibited the D-amino acids-induced decrease of cellular cAMP levels in human neutrophils. These D-amino acids induced chemotactic activity of freshly prepared human neutrophils. We also found that D-phenylalanine and D-tryptophan induced chemotactic responses in Jurkat cells transfected with the GPR109B cDNA but not in mock-transfected Jurkat cells. These results suggest that these aromatic D-amino acids elicit a chemotactic response in human neutrophils via activation of GPR109B.
Makiguchi, Wataru; Tanabe, Junki; Yamada, Hidekazu; Iida, Hiroki; Taura, Daisuke; Ousaka, Naoki; Yashima, Eiji
2015-01-01
Self-recognition and self-discrimination within complex mixtures are of fundamental importance in biological systems, which entirely rely on the preprogrammed monomer sequences and homochirality of biological macromolecules. Here we report artificial chirality- and sequence-selective successive self-sorting of chiral dimeric strands bearing carboxylic acid or amidine groups joined by chiral amide linkers with different sequences through homo- and complementary-duplex formations. A mixture of carboxylic acid dimers linked by racemic-1,2-cyclohexane bis-amides with different amide sequences (NHCO or CONH) self-associate to form homoduplexes in a completely sequence-selective way, the structures of which are different from each other depending on the linker amide sequences. The further addition of an enantiopure amide-linked amidine dimer to a mixture of the racemic carboxylic acid dimers resulted in the formation of a single optically pure complementary duplex with a 100% diastereoselectivity and complete sequence specificity stabilized by the amidinium–carboxylate salt bridges, leading to the perfect chirality- and sequence-selective duplex formation. PMID:26051291
Amexis, Georgios; Rubin, Steven; Chatterjee, Nando; Carbone, Kathryn; Chumakov, Kostantin
2003-06-01
A single clinical isolate of mumps virus designated 88-1961 was obtained from a patient hospitalized with a clinical history of upper respiratory tract infection, parotitis, severe headache, fever and lymphadenopathy. We have sequenced the full-length genome of 88-1961 and compared it against all available full-length sequences of mumps virus. Based upon its nucleotide sequence of the SH gene 88-1961 was identified as a genotype H mumps strain. The overall extent of nucleotide and amino acid differences between each individual gene and protein of 88-1961 and the full-length mumps samples showed that the missense to silent ratios were unevenly distributed. Upon evaluation of the consensus sequence of 88-1961, four positions were found to be clearly heterogeneous at the nucleotide level (NP 315C/T, NP 318C/T, F 271A/C, and HN 855C/T). Sequence analysis revealed that the amino acid sequences for the NP, M, and the L protein were the most conserved, whereas the SH protein exhibited the highest variability among the compared mumps genotypes A, B, and G. No identifying molecular patterns in the non-coding (intergenic) or coding regions of 88-1961 were found when we compared it against relatively virulent (Urabe AM9 B, Glouc1/UK96, 87-1004 and 87-1005) and non-virulent mumps strains (Jeryl Lynn and all Urabe Am9 A substrains). Copyright 2003 Wiley-Liss, Inc.
Genomic structure of the human D-site binding protein (DBP) gene
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shutler, G.; Glassco, T.; Kang, Xiaolin
1996-06-15
The human gene for the D-Site Binding Protein (DBP) has been sequenced and characterized. This gene is a member of the b/ZIP family of transcription factors and is one of three genes forming the PAR sub-family. DBP has been implicated in the diurnal regulation of a variety of liver-specific genes. Examination of the genomic structure of DBP reveals that the gene is divided into four exons and is contained within a relatively compact region of approximately 6 kb. These exons appear to correspond to functional divisions of the DBP protein. Exon 1 contains a long 5' UTR, and conservation between the rat and the human genes of the presence of small open reading frames within this region suggests that it may play a role in translational control. Exon 2 contains a limited region of similarity to the other PAR domain genes, which may be part of a potential activation domain. Exon 3 contains the PAR domain and differs by only 1 of 71 amino acids between rat and human. Exon 4, containing both the basic and the leucine zipper domains, is likewise highly conserved. The overall degree of homology between the rat and the human cDNA sequences is 82% for the nucleic acid sequence and 92% for the protein sequence. Comparison of the rat and human proximal promoters reveals extensive sequence conservation, with two previously characterized DNA binding sites being conserved at the functional and sequence levels. 31 refs., 4 figs.
Goettel, Wolfgang; Xia, Eric; Upchurch, Robert; Wang, Ming-Li; Chen, Pengyin; An, Yong-Qiang Charles
2014-04-23
Variation in seed oil composition and content among soybean varieties is largely attributed to differences in transcript sequences and/or transcript accumulation of oil production related genes in seeds. Discovery and analysis of sequence and expression variations in these genes will accelerate soybean oil quality improvement. In an effort to identify these variations, we sequenced the transcriptomes of soybean seeds from nine lines varying in oil composition and/or total oil content. Our results showed that 69,338 distinct transcripts from 32,885 annotated genes were expressed in seeds. A total of 8,037 transcript expression polymorphisms and 50,485 transcript sequence polymorphisms (48,792 SNPs and 1,693 small Indels) were identified among the lines. Effects of the transcript polymorphisms on their encoded protein sequences and functions were predicted. The studies also provided independent evidence that the lack of FAD2-1A gene activity and a non-synonymous SNP in the coding sequence of FAB2C caused elevated oleic acid and stearic acid levels in soybean lines M23 and FAM94-41, respectively. As a proof-of-concept, we developed an integrated RNA-seq and bioinformatics approach to identify and functionally annotate transcript polymorphisms, and demonstrated its high effectiveness for discovery of genetic and transcript variations that result in altered oil quality traits. The collection of transcript polymorphisms coupled with their predicted functional effects will be a valuable asset for further discovery of genes, gene variants, and functional markers to improve soybean oil quality.
Meiler, Arno; Klinger, Claudia; Kaufmann, Michael
2012-09-08
The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
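The kinds of quantities this abstract mentions (GC content and codon frequencies of coding sequences) can be computed in a few lines. The sketch below is not ANCAC's implementation, which the abstract does not describe; the function names and the toy coding sequence are hypothetical, and the codon count simply reads the sequence in frame from its first base.

```python
from collections import Counter


def gc_content(dna: str) -> float:
    """Fraction of G and C among the unambiguous bases (A, C, G, T)."""
    bases = [b for b in dna.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


def codon_frequencies(cds: str) -> dict:
    """Relative frequency of each codon, reading in frame from position 0.

    Trailing bases that do not fill a complete codon are dropped.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(codons)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {codon: n / total for codon, n in counts.items()}


# Toy coding sequence: Met-Ala-Ala-stop.
freqs = codon_frequencies("ATGGCCGCCTAA")
gc = gc_content("ATGGCCGCCTAA")
```

Aggregating such per-gene values by species or by functional category is the general idea behind composition analysis in a phylogenetic or functional context.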
2012-01-01
Background: The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results: Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions: Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836
Hashimoto, Mikako; Orikasa, Yoshitake; Hayashi, Hidenori; Watanabe, Kentaro; Yoshida, Kiyohito; Okuyama, Hidetoshi
2015-07-01
Colwellia psychrerythraea strain 34H is an obligately psychrophilic bacterium that has been used as a model cold-adapted microorganism because of its psychrophilic growth profile, significant production of cold-active enzymes, and cryoprotectant extracellular polysaccharide substances. However, its fatty acid components, particularly trans unsaturated fatty acids and long-chain polyunsaturated fatty acids (LC-PUFAs), have not been fully investigated. In this study, we biochemically identified Δ9-trans hexadecenoic acid [16:1(9t)] and LC-PUFAs such as docosahexaenoic acid. These results are comparable with the fact that the strain 34H genome sequence includes pfa and cti genes that are responsible for the biosynthesis of LC-PUFAs and trans unsaturated fatty acids, respectively. Strain 34H cells grown under static conditions at 5 °C had higher levels of 16:1(9t) than those grown under shaken conditions, and this change was accompanied by an antiparallel decrease in the levels of Δ9-cis hexadecenoic acid [16:1(9c)], suggesting that the cis-to-trans isomerization reaction of 16:1(9c) is activated under static (microanaerobic) culture conditions, that is, the enzyme could be activated by the decreased dissolved oxygen concentration of cultures. On the other hand, the levels of LC-PUFAs were too low (less than 3% of the total), even for cells grown at 5 °C, to evaluate their cold-adaptive function in this bacterium. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The primary structure of the thymidine kinase gene of fish lymphocystis disease virus.
Schnitzler, P; Handermann, M; Szépe, O; Darai, G
1991-06-01
The DNA nucleotide sequence of the thymidine kinase (TK) gene of fish lymphocystis disease virus (FLDV), which has been localized between the coordinates 0.678 and 0.688 of the viral genome, was determined. The analysis of the DNA nucleotide sequence located between the recognition sites of HindIII (0.669 map unit; nucleotide position 1) and AccI (nucleotide position 2032) revealed the presence of an open reading frame of 954 bp on the lower strand of this region between nucleotide positions 1868 (ATG) and 915 (TAA). It encodes a protein of 318 amino acid residues. The evolutionary relationships of the TK gene of FLDV to the other known TK genes were investigated using the method of progressive sequence alignment. These analyses revealed a high degree of diversity between the protein sequence of the FLDV TK gene and the amino acid composition of other TKs tested. However, significant conservation was detected in several regions of the FLDV TK protein when compared to the amino acid sequences of the TKs of African swine fever virus, fowlpox virus, shope fibroma virus, and vaccinia virus and to the amino acid sequences of the cellular cytoplasmic TK of chicken, mouse, and man.
Zou, Jiaqi; Li, Na
2013-09-01
Proper design of nucleic acid sequences is crucial for many applications. We have previously established a thermodynamics-based quantitative model to help design aptamer-based nucleic acid probes by predicting equilibrium concentrations of all interacting species. To facilitate customization of this thermodynamic model for different applications, here we present a generic and easy-to-use platform to implement the algorithm of the model with Microsoft(®) Excel formulas and VBA (Visual Basic for Applications) macros. Two Excel spreadsheets have been developed: one for the applications involving only nucleic acid species, the other for the applications involving both nucleic acid and non-nucleic acid species. The spreadsheets take the nucleic acid sequences and the initial concentrations of all species as input, guide the user to retrieve the necessary thermodynamic constants, and finally calculate equilibrium concentrations for all species in various bound and unbound conformations. The validity of both spreadsheets has been verified by comparing the modeling results with the experimental results on nucleic acid sequences reported in the literature. This Excel-based platform described here will allow biomedical researchers to rationalize the sequence design of nucleic acid probes using the thermodynamics-based modeling even without relevant theoretical and computational skills. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
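To give a feel for what "calculating equilibrium concentrations" from thermodynamic constants involves, here is a minimal sketch of the simplest possible case: a single 1:1 binding reaction A + B ⇌ AB. The model in the abstract handles many interacting species and is implemented in Excel/VBA, so this Python sketch is only an assumption-laden illustration; the function names and the example numbers are hypothetical.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)


def kd_from_delta_g(delta_g_kcal: float, temp_k: float = 298.15) -> float:
    """Dissociation constant (M) from the standard free energy of association.

    Ka = exp(-dG/RT), so Kd = 1/Ka = exp(dG/RT); dG is negative for favorable binding.
    """
    return math.exp(delta_g_kcal / (R_KCAL * temp_k))


def bound_complex(a_total: float, b_total: float, kd: float) -> float:
    """Equilibrium [AB] for A + B <-> AB, from the mass-balance quadratic.

    Solves Kd = (a_total - x)(b_total - x) / x for the physically valid root x.
    """
    s = a_total + b_total + kd
    return (s - math.sqrt(s * s - 4.0 * a_total * b_total)) / 2.0


# Hypothetical example: 1 uM probe, 1 uM target, Kd of 1 uM.
ab = bound_complex(1e-6, 1e-6, 1e-6)
free_a = 1e-6 - ab
free_b = 1e-6 - ab
```

Systems with several coupled equilibria generally have no closed-form solution and are solved numerically, which is presumably part of what the spreadsheet macros automate.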
Antarctic ice core samples: culturable bacterial diversity.
Shivaji, Sisinthy; Begum, Zareena; Shiva Nageswara Rao, Singireesu Soma; Vishnu Vardhan Reddy, Puram V; Manasa, Poorna; Sailaja, Buddi; Prathiba, Mambatta S; Thamban, Meloth; Krishnan, Kottekkatu P; Singh, Shiv M; Srinivas, Tanuku N R
2013-01-01
Culturable bacterial abundance at 11 different depths of a 50.26 m ice core from the Tallaksenvarden Nunatak, Antarctica, varied from 0.02 to 5.8 × 10(3) CFU ml(-1) of the melt water. A total of 138 bacterial strains were recovered from the 11 different depths of the ice core. Based on 16S rRNA gene sequence analyses, the 138 isolates could be categorized into 25 phylotypes belonging to phyla Actinobacteria, Bacteroidetes, Firmicutes and Proteobacteria. All isolates had 16S rRNA sequences similar to previously determined sequences (97.2-100%). No correlation was observed in the distribution of the isolates at the various depths either at the phylum, genus or species level. The 25 phylotypes varied in growth temperature range, tolerance to NaCl, growth pH range and ability to produce eight different extracellular enzymes at either 4 or 18 °C. Iso-, anteiso-, unsaturated and saturated fatty acids together constituted a significant proportion of the total fatty acid composition. Copyright © 2012 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Ma, Ben-Yuan; Wei, Lian; Sun, Sheng-Zhen; Wang, Duo-Wei; Wei, Deng-Bang
2014-04-25
Plateau zokor (Myospalax baileyi) is a subterranean mammal. Plateau zokor has high learning and memory ability, and can determine the location of blocking obstacles in their tunnels. Forkhead box p2 (FOXP2) is a transcription factor implicated in the neural control of orofacial coordination and sensory-motor integration, particularly with respect to learning, memory and vocalization. To explore the association of foxP2 with the high learning and memory ability of plateau zokor, the cDNA of foxP2 of plateau zokor was sequenced; by using plateau pika as control, the expression levels of foxP2 mRNA and FOXP2 protein in brain of plateau zokor were determined by real-time PCR and Western blot, respectively; and the location of FOXP2 protein in the brain of plateau zokor was determined by immunohistochemistry. The result showed that the cDNA sequence of plateau zokor foxP2 was similar to that of other mammals and the amino acid sequences showed a relatively high degree of conservation, with the exception of two particular amino acid substitutions [a Gln (Q)-to-His (H) change at position 231 and a Ser (S)-to-Ile (I) change at position 235]. Higher expression levels of foxP2 mRNA (3-fold higher) and FOXP2 protein (>2-fold higher) were detected in plateau zokor brain relative to plateau pika brain. In plateau zokor brain, FOXP2 protein was highly expressed in the cerebral cortex, thalamus and the striatum (a basal ganglia brain region). The results suggest that the high learning and memory ability of plateau zokor is related to the high expression levels of foxP2 in the brain.
Suzuki, Shun'ichi; Takenaka, Yasuhiro; Onishi, Norimasa; Yokozeki, Kenzo
2005-08-01
A DNA fragment from Microbacterium liquefaciens AJ 3912, containing the genes responsible for the conversion of 5-substituted-hydantoins to alpha-amino acids, was cloned in Escherichia coli and sequenced. Seven open reading frames (hyuP, hyuA, hyuH, hyuC, ORF1, ORF2, and ORF3) were identified on the 7.5 kb fragment. The deduced amino acid sequence encoded by the hyuA gene included the N-terminal amino acid sequence of the hydantoin racemase from M. liquefaciens AJ 3912. The hyuA, hyuH, and hyuC genes were heterologously expressed in E. coli; their presence corresponded with the detection of hydantoin racemase, hydantoinase, and N-carbamoyl alpha-amino acid amido hydrolase enzymatic activities respectively. The deduced amino acid sequences of hyuP were similar to those of the allantoin (5-ureido-hydantoin) permease from Saccharomyces cerevisiae, suggesting that hyuP protein might function as a hydantoin transporter.
Agarwala, Prachi; Pandey, Satyaprakash; Mapa, Koyeli; Maiti, Souvik
2013-03-05
Transforming growth factor β2 (TGFβ2) is a versatile cytokine with a prominent role in cell migration, invasion, cellular development, and immunomodulation. TGFβ2 promotes the malignancy of tumors by inducing epithelial-mesenchymal transition, angiogenesis, and immunosuppression. As it is well-documented that nucleic acid secondary structure can regulate gene expression, we assessed whether any secondary motif regulates its expression at the post-transcriptional level. Bioinformatics analysis predicts the existence of a 23-nucleotide putative G-quadruplex sequence (PG4) in the 5' untranslated region (UTR) of TGFβ2 mRNA. The ability of this stretch of sequence to form a highly stable, intramolecular parallel quadruplex was demonstrated using ultraviolet and circular dichroism spectroscopy. Footprinting studies further validated its existence in the presence of a neighboring nucleotide sequence. Following structural characterization, we evaluated the biological relevance of this secondary motif using a dual luciferase assay. Although PG4 alone inhibits the expression of the reporter gene, its presence in the context of the entire 5' UTR sequence enhances gene expression. Mutation or removal of the G-quadruplex sequence from the 5' UTR of the gene diminished the level of expression of this gene at the translational level. Thus, here we highlight an activating role of the G-quadruplex in modulating gene expression of TGFβ2 at the translational level and its potential to be used as a target for the development of therapeutics against cancer.
Detecting Coevolution in and among Protein Domains
Yeang, Chen-Hsiang; Haussler, David
2007-01-01
Correlated changes of nucleic or amino acids have provided strong information about the structures and interactions of molecules. Despite the rich literature in coevolutionary sequence analysis, previous methods often have to trade off between generality, simplicity, phylogenetic information, and specific knowledge about interactions. Furthermore, despite the evidence of coevolution in selected protein families, a comprehensive screening of coevolution among all protein domains is still lacking. We propose an augmented continuous-time Markov process model for sequence coevolution. The model can handle different types of interactions, incorporate phylogenetic information and sequence substitution, has only one extra free parameter, and requires no knowledge about interaction rules. We apply this model to large-scale screening of the entire protein domain database (Pfam). Strikingly, with 0.1 trillion tests executed, the majority of the inferred coevolving protein domains are functionally related, and the coevolving amino acid residues are spatially coupled. Moreover, many of the coevolving positions are located at functionally important sites of proteins/protein complexes, such as the subunit linkers of superoxide dismutase, the tRNA binding sites of ribosomes, the DNA binding region of RNA polymerase, and the active and ligand binding sites of various enzymes. The results suggest that sequence coevolution manifests structural and functional constraints of proteins. The intricate relations between sequence coevolution and various selective constraints are worth pursuing at a deeper level. PMID:17983264
Schmidt, DJ; Pickett, BE; Camacho, D; Comach, G; Xhaja, K; Lennon, NJ; Rizzolo, K; de Bosch, N; Becerra, A; Nogueira, ML; Mondini, A; da Silva, EV; Vasconcelos, PF; Muñoz-Jordán, JL; Santiago, GA; Ocazionez, R; Gehrke, L; Lefkowitz, EJ; Birren, BW; Henn, MR; Bosch, I
2013-01-01
Dengue virus currently causes 50-100 million infections annually. Comprehensive knowledge about the evolution of Dengue in response to selection pressure is currently unavailable, but would greatly enhance vaccine design efforts. In the current study, we sequenced 187 new dengue virus serotype 3 (DENV-3) genotype III whole genomes isolated from Asia and the Americas. We analyzed them together with previously-sequenced isolates to gain a more detailed understanding of the evolutionary adaptations existing in this prevalent American serotype. In order to analyze the phylogenetic dynamics of DENV-3 during outbreak periods, we incorporated datasets of 48 and 11 sequences spanning two major outbreaks in Venezuela during 2001 and 2007-2008, respectively. Our phylogenetic analysis of newly sequenced viruses shows that subsets of genomes cluster primarily by geographic location, and secondarily by time of virus isolation. DENV-3 genotype III sequences from Asia are significantly divergent from those from the Americas due to their geographical separation and subsequent speciation. We measured amino acid variation for the E protein by calculating the Shannon entropy at each position between Asian and American genomes. We found a cluster of 7 amino acid substitutions having high variability within E protein domain III, which has previously been implicated in serotype-specific neutralization escape mutants. No novel mutations were found in the E protein of sequences isolated during either Venezuelan outbreak. Shannon entropy analysis of the NS5 polymerase mature protein revealed that a G374E mutation, in a region that contributes to interferon resistance in other flaviviruses by interfering with JAK-STAT signaling, was present in both the Asian and American sequences from the 2007-2008 Venezuelan outbreak, but was absent in the sequences from the 2001 Venezuelan outbreak.
In addition to E, several NS5 amino acid changes were unique to the 2007-2008 epidemic in Venezuela and may give additional insight into the adaptive response of DENV-3 at the population level. PMID:21964598
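The per-position Shannon entropy measure mentioned in the abstract above can be illustrated with a minimal sketch. The function names and the toy alignment are hypothetical and are not taken from the study; the sketch assumes pre-aligned, equal-length sequences, uses log base 2, and treats a gap character as just another symbol, none of which the abstract specifies.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (bits) of the residues observed in one alignment column."""
    counts = Counter(column)
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def positional_entropy(aligned_seqs):
    """Entropy at every position of a set of aligned, equal-length sequences."""
    if len({len(s) for s in aligned_seqs}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    # zip(*seqs) walks the alignment column by column.
    return [column_entropy(col) for col in zip(*aligned_seqs)]


# Toy alignment (hypothetical data): position 1 is invariant, position 3 is
# split evenly between two residues.
toy_alignment = ["AKT", "AKS", "ART", "AKS"]
entropies = positional_entropy(toy_alignment)
```

A fully conserved position scores 0 bits, and a position split evenly between two residues scores 1 bit, so clusters of high-entropy positions flag variable regions.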
Large-Scale Concatenation cDNA Sequencing
Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.
1997-01-01
A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
Tkavc, Rok; Matrosova, Vera Y; Grichenko, Olga E; Gostinčar, Cene; Volpe, Robert P; Klimenkova, Polina; Gaidamakova, Elena K; Zhou, Carol E; Stewart, Benjamin J; Lyman, Mathew G; Malfatti, Stephanie A; Rubinfeld, Bonnee; Courtot, Melanie; Singh, Jatinder; Dalgard, Clifton L; Hamilton, Theron; Frey, Kenneth G; Gunde-Cimerman, Nina; Dugan, Lawrence; Daly, Michael J
2017-01-01
Highly concentrated radionuclide waste produced during the Cold War era is stored at US Department of Energy (DOE) production sites. This radioactive waste was often highly acidic and mixed with heavy metals, and has been leaking into the environment since the 1950s. Because of the danger and expense of cleanup of such radioactive sites by physicochemical processes, in situ bioremediation methods are being developed for cleanup of contaminated ground and groundwater. To date, the most developed microbial treatment proposed for high-level radioactive sites employs the radiation-resistant bacterium Deinococcus radiodurans. However, the use of Deinococcus spp. and other bacteria is limited by their sensitivity to low pH. We report the characterization of 27 diverse environmental yeasts for their resistance to ionizing radiation (chronic and acute), heavy metals, pH minima, temperature maxima and optima, and their ability to form biofilms. Remarkably, many yeasts are extremely resistant to ionizing radiation and heavy metals. They also excrete carboxylic acids and are exceptionally tolerant to low pH. A special focus is placed on Rhodotorula taiwanensis MD1149, which was the most resistant to acid and gamma radiation. MD1149 is capable of growing under 66 Gy/h at pH 2.3 and in the presence of high concentrations of mercury and chromium compounds, and forming biofilms under high-level chronic radiation and low pH. We present the whole genome sequence and annotation of R. taiwanensis strain MD1149, with a comparison to other Rhodotorula species. This survey elevates yeasts to the frontier of biology's most radiation-resistant representatives, presenting a strong rationale for a role of fungi in bioremediation of acidic radioactive waste sites.
Tkavc, Rok; Matrosova, Vera Y.; Grichenko, Olga E.; Gostinčar, Cene; Volpe, Robert P.; Klimenkova, Polina; Gaidamakova, Elena K.; Zhou, Carol E.; Stewart, Benjamin J.; Lyman, Mathew G.; Malfatti, Stephanie A.; Rubinfeld, Bonnee; Courtot, Melanie; Singh, Jatinder; Dalgard, Clifton L.; Hamilton, Theron; Frey, Kenneth G.; Gunde-Cimerman, Nina; Dugan, Lawrence; Daly, Michael J.
2018-01-01
Highly concentrated radionuclide waste produced during the Cold War era is stored at US Department of Energy (DOE) production sites. This radioactive waste was often highly acidic and mixed with heavy metals, and has been leaking into the environment since the 1950s. Because of the danger and expense of cleanup of such radioactive sites by physicochemical processes, in situ bioremediation methods are being developed for cleanup of contaminated ground and groundwater. To date, the most developed microbial treatment proposed for high-level radioactive sites employs the radiation-resistant bacterium Deinococcus radiodurans. However, the use of Deinococcus spp. and other bacteria is limited by their sensitivity to low pH. We report the characterization of 27 diverse environmental yeasts for their resistance to ionizing radiation (chronic and acute), heavy metals, pH minima, temperature maxima and optima, and their ability to form biofilms. Remarkably, many yeasts are extremely resistant to ionizing radiation and heavy metals. They also excrete carboxylic acids and are exceptionally tolerant to low pH. A special focus is placed on Rhodotorula taiwanensis MD1149, which was the most resistant to acid and gamma radiation. MD1149 is capable of growing under 66 Gy/h at pH 2.3 and in the presence of high concentrations of mercury and chromium compounds, and forming biofilms under high-level chronic radiation and low pH. We present the whole genome sequence and annotation of R. taiwanensis strain MD1149, with a comparison to other Rhodotorula species. This survey elevates yeasts to the frontier of biology's most radiation-resistant representatives, presenting a strong rationale for a role of fungi in bioremediation of acidic radioactive waste sites. PMID:29375494
Lorsirigool, Athip; Saeng-Chuto, Kepalee; Madapong, Adthakorn; Temeeyasen, Gun; Tripipat, Thitima; Kaewprommal, Pavita; Tantituvanont, Angkana; Piriyapongsa, Jittima; Nilubol, Dachrit
2017-04-01
Porcine deltacoronavirus (PDCoV) was identified in intestinal samples collected from piglets with diarrhea in Thailand in 2015. Two Thai PDCoV isolates, P23_15_TT_1115 and P24_15_NT1_1215, were isolated and identified. The full-length genome sequences of the P23_15_TT_1115 and P24_15_NT1_1215 isolates were 25,404 and 25,407 nucleotides in length, respectively, which is slightly shorter than those of US and China PDCoV. The phylogenetic analysis based on the full-length genome demonstrated that Thai PDCoV isolates form a new cluster separated from US and China PDCoV but are relatively more closely related to China PDCoV than to US isolates. The genetic analyses demonstrated that Thai PDCoVs have 97.0-97.8% and 92.2-94.0% similarity with China PDCoV at the nucleotide and amino acid levels, respectively, but share 97.1-97.3% and 92.5-93.0% similarity with US PDCoV at the nucleotide and amino acid levels, respectively. Thai PDCoV possesses two discontinuous deletions of five amino acids in the ORF1a/b region. One additional deletion of one amino acid was identified in P23_15_TT_1115. The variation analyses demonstrated that six regions (nt 1317-1436, 2997-3096, 19,737-19,836, 20,277-20,376, 21,177-21,276, and 22,371-22,416) in the ORF1a/b and spike genes exhibit high sequence variation between Thai and other PDCoV. The analyses of amino acid changes suggested that they could potentially be from different lineages.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Myers, G.; Korber, B.; Wain-Hobson, S.
1993-12-31
This compendium and the accompanying floppy diskettes are the result of an effort to compile and rapidly publish all relevant molecular data concerning the human immunodeficiency viruses (HIV) and related retroviruses. The scope of the compendium and database is best summarized by the five parts that it comprises: (I) HIV and SIV Nucleotide Sequences; (II) Amino Acid Sequences; (III) Analyses; (IV) Related Sequences; and (V) Database Communications. Information within all the parts is updated at least twice in each year, which accounts for the modes of binding and pagination in the compendium.
Complete cDNA sequence and amino acid analysis of a bovine ribonuclease K6 gene.
Pietrowski, D; Förster, M
2000-01-01
The complete cDNA sequence of a ribonuclease k6 gene of Bos taurus has been determined. It codes for a protein of 154 amino acids and contains the invariant cysteine, histidine and lysine residues as well as the characteristic motifs specific to ribonuclease active sites. The deduced protein sequence is 27 residues longer than other known k6 ribonucleases and shows amino acid exchanges which could reflect a strain specificity or polymorphism within the bovine genome. Based on sequence similarity we have termed the identified gene bovine ribonuclease k6 b (brk6b).
Cloning of an avilamycin biosynthetic gene cluster from Streptomyces viridochromogenes Tü57.
Gaisser, S; Trefzer, A; Stockert, S; Kirschning, A; Bechthold, A
1997-01-01
A 65-kb region of DNA from Streptomyces viridochromogenes Tü57, containing genes encoding proteins involved in the biosynthesis of avilamycins, was isolated. The DNA sequence of a 6.4-kb fragment from this region revealed four open reading frames (ORF1 to ORF4), three of which are fully contained within the sequenced fragment. The deduced amino acid sequence of AviM, encoded by ORF2, shows 37% identity to a 6-methylsalicylic acid synthase from Penicillium patulum. Cultures of S. lividans TK24 and S. coelicolor CH999 containing plasmids with ORF2 on a 5.5-kb PstI fragment were able to produce orsellinic acid, an unreduced version of 6-methylsalicylic acid. The amino acid sequence encoded by ORF3 (AviD) is 62% identical to that of StrD, a dTDP-glucose synthase from S. griseus. The deduced amino acid sequence of AviE, encoded by ORF4, shows 55% identity to a dTDP-glucose dehydratase (StrE) from S. griseus. Gene insertional inactivation experiments of aviE abolished avilamycin production, indicating the involvement of aviE in the biosynthesis of avilamycins. PMID:9335272
Follin, Elna; Karlsson, Maria; Lundegaard, Claus; Nielsen, Morten; Wallin, Stefan; Paulsson, Kajsa; Westerdahl, Helena
2013-04-01
The major histocompatibility complex (MHC) genes are the most polymorphic genes found in the vertebrate genome, and they encode proteins that play an essential role in the adaptive immune response. Many songbirds (passerines) have been shown to have a large number of transcribed MHC class I genes compared to most mammals. To elucidate the reason for this large number of genes, we compared 14 MHC class I alleles (α1-α3 domains), from great reed warbler, house sparrow and tree sparrow, via phylogenetic analysis, homology modelling and in silico peptide-binding predictions to investigate their functional and genetic relationships. We found more pronounced clustering of the MHC class I allomorphs (allele specific proteins) in regards to their function (peptide-binding specificities) compared to their genetic relationships (amino acid sequences), indicating that the high number of alleles is of functional significance. The MHC class I allomorphs from house sparrow and tree sparrow, species that diverged 10 million years ago (MYA), had overlapping peptide-binding specificities, and these similarities across species were also confirmed in phylogenetic analyses based on amino acid sequences. Notably, there were also overlapping peptide-binding specificities in the allomorphs from house sparrow and great reed warbler, although these species diverged 30 MYA. This overlap was not found in a tree based on amino acid sequences. Our interpretation is that convergent evolution on the level of the protein function, possibly driven by selection from shared pathogens, has resulted in allomorphs with similar peptide-binding repertoires, although trans-species evolution in combination with gene conversion cannot be ruled out.
Vinueza-Burgos, Christian; Wautier, Magali; Martiny, Delphine; Cisneros, Marco; Van Damme, Inge; De Zutter, Lieven
2017-01-01
Thermotolerant Campylobacter spp. are a major cause of foodborne gastrointestinal infections worldwide. The linkage of human campylobacteriosis and poultry has been widely described. In this study we aimed to investigate the prevalence, antimicrobial resistance and genetic diversity of C. coli and C. jejuni in broilers from Ecuador. Caecal content from 379 randomly selected broiler batches originating from 115 farms was collected from 6 slaughterhouses located in the province of Pichincha during 1 year. Microbiological isolation was performed by direct plating on mCCDA agar. Identification of Campylobacter species was done by PCR. Minimum inhibitory concentration (MIC) values for gentamicin, ciprofloxacin, nalidixic acid, tetracycline, streptomycin, and erythromycin were obtained. Genetic variation was assessed by RFLP-flaA typing and Multilocus Sequence Typing (MLST) of selected isolates. Prevalence at batch level was 64.1%. Of the positive batches 68.7% were positive for C. coli, 18.9% for C. jejuni, and 12.4% for both C. coli and C. jejuni. Resistance rates above 67% were shown for tetracycline, ciprofloxacin, and nalidixic acid. The resistance pattern tetracycline, ciprofloxacin, and nalidixic acid was the dominant one in both Campylobacter species. RFLP-flaA typing analysis showed that C. coli and C. jejuni strains belonged to 38 and 26 profiles, respectively. MLST typing, on the other hand, revealed that all C. coli strains except one belonged to CC-828, while all C. jejuni strains except 2 belonged to 12 assigned clonal complexes (CCs). Furthermore, 4 new sequence types (STs) for both species were described, whereby 2 new STs for C. coli were based on new allele sequences. Further research is necessary to estimate the impact of the slaughter of Campylobacter positive broiler batches on the contamination level of carcasses in slaughterhouses and at retail in Ecuador. PMID:28339716
Elkins, Christopher A.; Savage, Dwayne C.
1998-01-01
Cytosolic extracts of Lactobacillus johnsonii 100-100 (previously reported as Lactobacillus sp. strain 100-100) contain four heterotrimeric isozymes composed of two peptides, α and β, with conjugated bile salt hydrolase (BSH) activity. We now report cloning, from the genome of strain 100-100, a 2,977-bp DNA segment that expresses BSH activity in Escherichia coli. The sequencing of this segment showed that it contained one complete and two partial open reading frames (ORFs). The 3′ partial ORF (927 nucleotides) was predicted by BLAST and confirmed with 5′ and 3′ deletions to be a BSH gene. Thermal asymmetric interlaced PCR was used to extend and complete the 948-nucleotide sequence of the BSH gene 3′ of the cloned segment. The predicted amino acid sequence of the 5′ partial ORF (651 nucleotides) was about 80% similar to the C-terminal half of the largest, complete ORF (1,353 nucleotides), and these two putative proteins were similar to several amine, multidrug resistance, and sugar transport proteins of the major facilitator superfamily. E. coli DH5α cells transformed with a construct containing these ORFs, in concert with an extracellular factor produced by strain 100-100, demonstrated levels of uptake of [14C]taurocholic acid that were increased as much as threefold over control levels. [14C]Cholic acid was taken up in similar amounts by strain DH5α pSportI (control) and DH5α p2000 (transport clones). These findings support a hypothesis that the ORFs are conjugated bile salt transport genes which may be arranged in an operon with BSH genes. PMID:9721268
Molecular Characterization of a Novel N-Acetyltransferase from Chryseobacterium sp.
Yoshida, Kenji; Tanaka, Kosei; Yoshida, Ken-ichi
2014-01-01
N-Acetyltransferase from Chryseobacterium sp. strain 5-3B is an acetyl coenzyme A (acetyl-CoA)-dependent enzyme that catalyzes the enantioselective transfer of an acetyl group from acetyl-CoA to the amino group of l-2-phenylglycine to produce (2S)-2-acetylamino-2-phenylacetic acid. We purified the enzyme from strain 5-3B and deduced the N-terminal amino acid sequence. The gene, designated natA, was cloned with two other hypothetical protein genes; the three genes probably form a 2.5-kb operon. The deduced amino acid sequence of NatA showed high levels of identity to sequences of putative N-acetyltransferases of Chryseobacterium spp. but not to other known arylamine and arylalkylamine N-acetyltransferases. Phylogenetic analysis indicated that NatA forms a distinct lineage from known N-acetyltransferases. We heterologously expressed recombinant NatA (rNatA) in Escherichia coli and purified it. rNatA showed high activity for l-2-phenylglycine and its chloro- and hydroxyl-derivatives. The Km and Vmax values for l-2-phenylglycine were 0.145 ± 0.026 mM and 43.6 ± 2.39 μmol · min−1 · mg protein−1, respectively. The enzyme showed low activity for 5-aminosalicylic acid and 5-hydroxytryptamine, which are reported as good substrates of a known arylamine N-acetyltransferase and an arylalkylamine N-acetyltransferase. rNatA had a comparatively broad acyl donor specificity, transferring acyl groups to l-2-phenylglycine and producing the corresponding 2-acetylamino-2-phenylacetic acids (relative activity with acetyl donors acetyl-CoA, propanoyl-CoA, butanoyl-CoA, pentanoyl-CoA, and hexanoyl-CoA, 100:108:122:10:<1). PMID:24375143
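The reported Km and Vmax values can be read through the standard Michaelis-Menten rate law, v = Vmax·[S]/(Km + [S]). The sketch below is illustrative only: it assumes simple Michaelis-Menten kinetics (the abstract reports the two constants but does not state the fitted model), and the function name is hypothetical.

```python
def michaelis_menten_rate(s_mM, km_mM=0.145, vmax=43.6):
    """Initial rate (umol/min/mg protein) at substrate concentration s_mM.

    Defaults are the Km (mM) and Vmax reported for l-2-phenylglycine in the
    abstract; the Michaelis-Menten form itself is an assumption of this sketch.
    """
    if s_mM < 0:
        raise ValueError("substrate concentration must be non-negative")
    return vmax * s_mM / (km_mM + s_mM)


# At [S] = Km the rate is half of Vmax by definition of Km.
half_max_rate = michaelis_menten_rate(0.145)
```

At substrate concentrations far above Km the computed rate approaches, but never reaches, Vmax.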
García-Cobos, Silvia; Campos, José; Lázaro, Edurne; Román, Federico; Cercenado, Emilia; García-Rey, César; Pérez-Vázquez, María; Oteo, Jesús; de Abajo, Francisco
2007-01-01
The sequence of the ftsI gene encoding the transpeptidase domain of penicillin-binding protein 3 (PBP 3) was determined for 354 nonconsecutive Haemophilus influenzae isolates from Spain; 17.8% of them were ampicillin susceptible, 56% were β-lactamase nonproducing ampicillin resistant (BLNAR), 15.8% were β-lactamase producers and ampicillin resistant, and 10.4% displayed both resistance mechanisms. The ftsI gene sequences had 28 different mutation patterns and amino acid substitutions at 23 positions. Some 93.2% of the BLNAR strains had amino acid substitutions at the Lys-Thr-Gly (KTG) motif, the two most common being Asn526 to Lys (83.9%) and Arg517 to His (9.3%). Amino acid substitutions at positions 377, 385, and 389, which conferred cefotaxime and cefixime MICs 10 to 60 times higher than those of susceptible strains, were found for the first time in Europe. In 72 isolates for which the repressor acrR gene of the AcrAB efflux pump was sequenced, numerous amino acid substitutions were found. Eight isolates with ampicillin MICs of 0.25 to 2 μg/ml showed changes that predicted the early termination of the acrR reading frame. Pulsed-field gel electrophoresis analysis demonstrated that most BLNAR strains were genetically diverse, although clonal dissemination was detected in a group of isolates presenting with increased resistance to cefotaxime and cefixime. Background antibiotic use at the community level revealed a marked trend toward increased amoxicillin-clavulanic acid consumption. BLNAR H. influenzae strains have arisen by vertical and horizontal spread and have evolved to adapt rapidly to the increased selective pressures posed by the use of oral penicillins and cephalosporins. PMID:17470649
Apparent founder effect during the early years of the San Francisco HIV type 1 epidemic (1978-1979).
Foley, B; Pan, H; Buchbinder, S; Delwart, E L
2000-10-10
HIV-1 envelope sequence variants were RT-PCR amplified from serum samples cryopreserved in San Francisco in 1978-1979. The HIV-1 subtype B env V3-V5 sequences from four homosexual men clustered phylogenetically, with a median nucleotide distance of 2.8%, reflecting a recent common origin. These early U.S. HIV-1 env variants mapped close to the phylogenetic root of the subtype B tree while env variants collected in the United States throughout the 1980s and 1990s showed, on average, increasing genetic diversity and divergence from the subtype B consensus sequence. These results indicate that the majority of HIV-1 currently circulating in the United States may be descended from an initial introduction and rapid spread during the mid- to late 1970s of subtype B viruses with limited variability (i.e., a founder effect). As expected from the starburst-shaped phylogeny of HIV-1 subtype B, contemporary U.S. strains were, on average, more closely related at the nucleic acid and amino acid levels to the earlier 1978-1979 env variants than to each other. The growing levels of HIV-1 genetic diversity, one of multiple obstacles in designing a protective vaccine, may therefore be mitigated by using epidemic founding variants as antigenic strains for protection against contemporary strains.
Identification of two allelic IgG1 C(H) coding regions (Cgamma1) of cat.
Kanai, T H; Ueda, S; Nakamura, T
2000-01-31
Two types of cDNA encoding IgG1 heavy chain (gamma1) were isolated from a single domestic short-hair cat. Sequence analysis indicated a higher level of similarity of these Cgamma1 sequences to the human Cgamma1 sequence (76.9 and 77.0%) than to the mouse sequence (70.0 and 69.7%) at the nucleotide level. Predicted primary structures of both the feline Cgamma1 genes, designated as Cgamma1a and Cgamma1b, were similar to that of the human Cgamma1 gene, for instance, as to the size of constant domains, the presence of six conserved cysteine residues involved in formation of the domain structure, and the location of a conserved N-linked glycosylation site. Sequence comparison between the two alleles showed that 7 out of 10 nucleotide differences were within the C(H)3 domain coding region, all leading to nonsynonymous changes in amino acid residues. Partial sequence analysis of genomic clones showed three nucleotide substitutions between the two Cgamma1 alleles in the intron between the C(H)2 and C(H)3 domain coding regions. In 12 domestic short-hair cats used in this study, the frequency of the Cgamma1a allele (62.5%) was higher than that of the Cgamma1b allele (37.5%).
Kim, Juhan; Kyung, Dohyun; Yun, Hyungdon; Cho, Byung-Kwan; Seo, Joo-Hyun; Cha, Minho; Kim, Byung-Gee
2007-01-01
A novel β-transaminase gene was cloned from Mesorhizobium sp. strain LUK. By using N-terminal sequence and an internal protein sequence, a digoxigenin-labeled probe was made for nonradioactive hybridization, and a 2.5-kb gene fragment was obtained by colony hybridization of a cosmid library. Through Southern blotting and sequence analysis of the selected cosmid clone, the structural gene of the enzyme (1,335 bp) was identified, which encodes a protein of 47,244 Da with a theoretical pI of 6.2. The deduced amino acid sequence of the β-transaminase showed the highest sequence similarity with glutamate-1-semialdehyde aminomutase of transaminase subgroup II. The β-transaminase showed higher activities toward d-β-aminocarboxylic acids such as 3-aminobutyric acid, 3-amino-5-methylhexanoic acid, and 3-amino-3-phenylpropionic acid. The β-transaminase has an unusually broad specificity for amino acceptors such as pyruvate and α-ketoglutarate/oxaloacetate. The enantioselectivity of the enzyme suggested that the recognition mode of β-aminocarboxylic acids in the active site is reversed relative to that of α-amino acids. After comparison of its primary structure with transaminase subgroup II enzymes, it was proposed that R43 interacts with the carboxylate group of the β-aminocarboxylic acids and the carboxylate group on the side chain of dicarboxylic α-keto acids such as α-ketoglutarate and oxaloacetate. R404 is another conserved residue, which interacts with the α-carboxylate group of the α-amino acids and α-keto acids. The β-transaminase was used for the asymmetric synthesis of enantiomerically pure β-aminocarboxylic acids. (3S)-Amino-3-phenylpropionic acid was produced from the ketocarboxylic acid ester substrate by coupled reaction with a lipase using 3-aminobutyric acid as amino donor. PMID:17259358
Ma, Siming; Upneja, Akhil; Galecki, Andrzej; Tsai, Yi-Miau; Burant, Charles F; Raskind, Sasha; Zhang, Quanwei; Zhang, Zhengdong D; Seluanov, Andrei; Gorbunova, Vera; Clish, Clary B; Miller, Richard A; Gladyshev, Vadim N
2016-11-22
Mammalian lifespan differs by >100 fold, but the mechanisms associated with such longevity differences are not understood. Here, we conducted a study on primary skin fibroblasts isolated from 16 species of mammals and maintained under identical cell culture conditions. We developed a pipeline for obtaining species-specific ortholog sequences, profiled gene expression by RNA-seq and small molecules by metabolite profiling, and identified genes and metabolites correlating with species longevity. Cells from longer lived species up-regulated genes involved in DNA repair and glucose metabolism, down-regulated proteolysis and protein transport, and showed high levels of amino acids but low levels of lysophosphatidylcholine and lysophosphatidylethanolamine. The amino acid patterns were recapitulated by further analyses of primate and bird fibroblasts. The study suggests that fibroblast profiling captures differences in longevity across mammals at the level of global gene expression and metabolite levels and reveals pathways that define these differences.
Bastien, Olivier; Maréchal, Eric
2008-08-07
Confidence in pairwise alignments of biological sequences, obtained by various methods such as Blast or Smith-Waterman, is critical for automatic analyses of genomic data. Two statistical models have been proposed. In the asymptotic limit of long sequences, the Karlin-Altschul model is based on the computation of a P-value, assuming that the number of high-scoring matching regions above a threshold is Poisson distributed. Alternatively, the Lipman-Pearson model is based on the computation of a Z-value from a random score distribution obtained by a Monte-Carlo simulation. Z-values allow the deduction of an upper bound of the P-value (1/Z-value²) following the TULIP theorem. Simulated Z-value distributions are known to fit a Gumbel law. This remarkable property was not demonstrated and had no obvious biological support. We built a model of evolution of sequences based on aging, as meant in Reliability Theory, using the fact that the amount of information shared between an initial sequence and the sequences in its lineage (i.e., mutual information in Information Theory) is a decreasing function of time. This quantity is simply measured by a sequence alignment score. In systems aging, the failure rate is related to the system's longevity. The system can be a machine with structured components, or a living entity or population. "Reliability" refers to the ability to operate properly according to a standard. Here, the "reliability" of a sequence refers to the ability to conserve a sufficient functional level at the folded and maturated protein level (positive selection pressure). Homologous sequences were considered as systems 1) having a high redundancy of information reflected by the magnitude of their alignment scores, and 2) whose components are the amino acids, which can independently be damaged by random DNA mutations.
From these assumptions, we deduced that information shared at each amino acid position evolved with a constant rate, corresponding to the information hazard rate, and that pairwise sequence alignment scores should follow a Gumbel distribution, whose parameters can be given a theoretical rationale. In particular, one parameter corresponds to the information hazard rate. The extreme value distribution of alignment scores, assessed from high-scoring segment pairs following the Karlin-Altschul model, can also be deduced from the Reliability Theory applied to molecular sequences. It reflects the redundancy of information between homologous sequences, under functional conservative pressure. This model also provides a link between concepts of biological sequence analysis and of systems biology.
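The Z-value procedure described in this abstract can be illustrated with a small Python sketch. It estimates a Z-value by shuffling one sequence (a Monte-Carlo null distribution) and then evaluates the 1/Z² upper bound on the P-value. The toy scoring function (count of identical ungapped positions), the function names, and the number of shuffles are assumptions made for illustration; they are not taken from the paper, which works with real alignment scores.

```python
import random
import statistics

def identity_score(a, b):
    """Toy stand-in for an alignment score: identical positions, no gaps."""
    return sum(x == y for x, y in zip(a, b))

def z_value(a, b, score=identity_score, n_shuffles=200, seed=0):
    """Monte-Carlo Z-value: (observed - mean) / stdev over shuffles of b."""
    rng = random.Random(seed)
    observed = score(a, b)
    letters = list(b)
    null_scores = []
    for _ in range(n_shuffles):
        rng.shuffle(letters)  # preserves composition, destroys order
        null_scores.append(score(a, "".join(letters)))
    mu = statistics.mean(null_scores)
    sigma = statistics.stdev(null_scores)
    if sigma == 0:
        raise ValueError("null distribution has zero variance")
    return (observed - mu) / sigma

def p_value_upper_bound(z):
    """Upper bound 1/Z^2 on the P-value, capped at 1 (informative for Z > 1)."""
    if z <= 0:
        return 1.0
    return min(1.0, 1.0 / (z * z))
```

For example, a Z-value of 4 gives an upper bound of 1/16 on the P-value, whatever the shape of the underlying score distribution.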
Using video-oriented instructions to speed up sequence comparison.
Wozniak, A
1997-04-01
This document presents an implementation of the well-known Smith-Waterman algorithm for comparison of protein and nucleic acid sequences, using specialized video instructions. These instructions, SIMD-like in their design, make it possible to parallelize the algorithm at the instruction level. Benchmarks on an ULTRA SPARC running at 167 MHz show a speed-up factor of two compared to the same algorithm implemented with integer instructions on the same machine. Performance reaches over 18 million matrix cells per second on a single processor, giving, to our knowledge, the fastest implementation of the Smith-Waterman algorithm on a workstation. The accelerated procedure was introduced in LASSAP (LArge Scale Sequence compArison Package), software developed at INRIA that handles parallelism at a higher level. On a SUN Enterprise 6000 server with 12 processors, a speed of nearly 200 million matrix cells per second has been obtained. A sequence of length 300 amino acids is scanned against SWISSPROT R33 (18,531,385 residues) in 29 s. This procedure is not restricted to databank scanning. It applies to all cases handled by LASSAP (intra- and inter-bank comparisons, Z-score computation, etc.).
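For readers unfamiliar with the recurrence being accelerated, here is a minimal scalar Python sketch of Smith-Waterman local alignment scoring. It is not the SIMD implementation described in the abstract. The match/mismatch/gap values and the linear gap penalty are assumptions chosen for brevity, and the helper `cells_per_second` is a hypothetical name used only to reproduce the throughput arithmetic.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score with a linear gap penalty.

    Fills the dynamic-programming matrix row by row, keeping only two rows,
    so time is O(len(a) * len(b)) and memory is O(len(b)).
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            if curr[j] > best:
                best = curr[j]
        prev = curr
    return best

def cells_per_second(query_len, db_residues, seconds):
    """Matrix cells evaluated per second for one query against a databank."""
    return query_len * db_residues / seconds
```

Taking the databank size as 18,531,385 residues, `cells_per_second(300, 18_531_385, 29)` comes to roughly 1.9 × 10^8, which agrees with the "nearly 200 million matrix cells per second" quoted for the 12-processor server.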
Alam, Syed Benazir; Reade, Ron; Theilmann, Jane; Rochon, D'Ann
2017-12-01
Cucumber necrosis virus (CNV) is a T = 3 icosahedral virus with a (+)ssRNA genome. The N-terminal CNV coat protein arm contains a conserved, highly basic sequence ("KGRKPR"), which we postulate is involved in RNA encapsidation during virion assembly. Seven mutants were constructed by altering the CNV "KGRKPR" sequence; the four basic residues were mutated to alanine individually, in pairs, or in total. Virion accumulation and vRNA encapsidation were significantly reduced in mutants containing two or four substitutions and virion morphology was also affected, where both T = 1 and intermediate-sized particles were produced. Mutants with two or four substitutions encapsidated significantly greater levels of truncated RNA than that of WT, suggesting that basic residues in the "KGRKPR" sequence are important for encapsidation of full-length CNV RNA. Interestingly, "KGRKPR" mutants also encapsidated relatively higher levels of host RNA, suggesting that the "KGRKPR" sequence also contributes to selective encapsidation of CNV RNA. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.
Federal Register 2010, 2011, 2012, 2013, 2014
2012-05-15
... (EPO) as the lead, to propose a revised standard for the filing of nucleotide and/or amino acid.... ST.25 uses a controlled vocabulary of feature keys to describe nucleic acid and amino acid sequences... patent data purposes. The XML standard also includes four qualifiers for amino acids. These feature keys...
Molecular cloning of the pheromone biosynthesis-activating neuropeptide in Helicoverpa zea.
Davis, M T; Vakharia, V N; Henry, J; Kempe, T G; Raina, A K
1992-01-01
Pheromone biosynthesis-activating neuropeptide (PBAN) regulates sex pheromone biosynthesis in female Helicoverpa (Heliothis) zea. Two oligonucleotide probes representing two overlapping amino acid regions of PBAN were used to screen 2.5 × 10^5 recombinant plaques, and a positive recombinant clone was isolated. Sequence analysis of the isolated clone showed that the PBAN gene is interrupted after the codon encoding amino acid 14 by a 0.63-kilobase (kb) intron. Preceding the PBAN amino acid sequence is a 10-amino acid sequence containing a pentapeptide Phe-Thr-Pro-Arg-Leu, which is followed by a Gly-Arg-Arg processing site. Immediately after the PBAN amino acid sequence is a Gly-Arg processing site and a short stretch of 10 amino acids. This 10-amino acid sequence contains a repeat of the PBAN C-terminal pentapeptide Phe-Ser-Pro-Arg-Leu and is terminated by another Gly-Arg processing site. It is suggested that the PBAN gene in H. zea might carry, besides PBAN, a 7- and an 8-residue amidated peptide, which share with PBAN the core C-terminal pentapeptide Phe-(Ser or Thr)-Pro-Arg-Leu-NH2. The C-terminal pentapeptide sequence of PBAN represents the minimum sequence required for pheromonotropic activity in H. zea and also bears a high degree of homology to the pyrokinin family of insect peptides with myotropic activity. It is possible that the putative heptapeptide and octapeptide might be new members of the pyrokinin family, with pheromonotropic and/or myotropic activities. Thus, the PBAN gene products, besides affecting sexual behavior, might have broad influence on many biological processes in H. zea. PMID:1729680
Host Cell Virus Entry Mediated by Australian Bat Lyssavirus Envelope G glycoprotein
2013-10-24
39 Figure 7. Comparison of the amino acid sequences of Saccolaimus and Pteropus ABLV G mature protein... sequence analysis revealed that the PCR products were identical. Sequence comparisons of the ABLV N and other lyssavirus N proteins showed that ABLV...Saccolaimus flaviventris) (129). Nucleoprotein sequence comparisons revealed that the Saccolaimus N protein shared 96% amino acid homology with the Pteropus
DNA sequence similarity recognition by hybridization to short oligomers
Milosavljevic, Aleksandar
1999-01-01
Methods are disclosed for the comparison of nucleic acid sequences. Data is generated by hybridizing sets of oligomers with target nucleic acids. The data thus generated is manipulated simultaneously with respect to both (i) matching between oligomers and (ii) matching between oligomers and putative reference sequences available in databases. Using data compression methods to manipulate this mutual information, sequences for the target can be constructed.
Patel, Kamlesh D.
2018-01-22
Kamlesh (Ken) Patel from Sandia National Laboratories (Livermore, California) presents "Preparation of Nucleic Acid Libraries for Personalized Sequencing Systems Using an Integrated Microfluidic Hub Technology " at the 7th Annual Sequencing, Finishing, Analysis in the Future (SFAF) Meeting held in June, 2012 in Santa Fe, NM.
Discovery of Escherichia coli CRISPR sequences in an undergraduate laboratory.
Militello, Kevin T; Lazatin, Justine C
2017-05-01
Clustered regularly interspaced short palindromic repeats (CRISPRs) represent a novel type of adaptive immune system found in eubacteria and archaebacteria. CRISPRs have recently generated a lot of attention due to their unique ability to catalog foreign nucleic acids, their ability to destroy foreign nucleic acids in a mechanism that shares some similarity to RNA interference, and the ability to utilize reconstituted CRISPR systems for genome editing in numerous organisms. In order to introduce CRISPR biology into an undergraduate upper-level laboratory, a five-week set of exercises was designed to allow students to examine the CRISPR status of uncharacterized Escherichia coli strains and to allow the discovery of new repeats and spacers. Students started the project by isolating genomic DNA from E. coli and amplifying the iap CRISPR locus using the polymerase chain reaction (PCR). The PCR products were analyzed by Sanger DNA sequencing, and the sequences were examined for the presence of CRISPR repeat sequences. The regions between the repeats, the spacers, were extracted and analyzed with BLASTN searches. Overall, CRISPR loci were sequenced from several previously uncharacterized E. coli strains and one E. coli K-12 strain. Sanger DNA sequencing resulted in the discovery of 36 spacer sequences and their corresponding surrounding repeat sequences. Five of the spacers were homologous to foreign (non-E. coli) DNA. Assessment of the laboratory indicates that improvements were made in the ability of students to answer questions relating to the structure and function of CRISPRs. Future directions of the laboratory are presented and discussed. © 2016 by The International Union of Biochemistry and Molecular Biology, 45(3):262-269, 2017.
Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue
2016-01-01
DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962
Crotoxin: Structural Studies, Mechanism of Action and Cloning of Its Gene
1987-03-01
other venoms and examine their toxin-neutralizing ability. The amino acid sequences of both crotoxin subunits were determined as a prelude to cloning...be examined for their potential as anti-idiotype vaccines. The complete amino acid sequence of the basic subunit and two of the three acidic subunit chains...of crotoxin from the venom of C.d. terrificus has been determined. Sequence comparison data suggest that the non-toxic, acidic subunit was derived
Shi, Yu; Liu, Xiaochun; Zhang, Haifa; Zhang, Yong; Lu, Danqi; Lin, Haoran
2012-09-01
Androgens play a crucial role in sex differentiation, sexual maturation, and spermatogenesis in vertebrates. The action of androgens is mediated via androgen receptors (ARs). The present study reports the cloning of the cDNA sequence of ar in the orange-spotted grouper, which is highly expressed in the testis and expressed at relatively low levels in subdivisions of the brain. The cDNA sequence of ar was 2358 bp, encoding a protein of 759 amino acids (aa). Phylogenetic analysis showed that the ar cDNA sequence was closely related to that of threespot wrasse (Halichoeres trimaculatus) and medaka (Oryzias latipes) arβ. As deduced from the phylogenetic tree and the high amino acid identity with the ARβ subtype of other teleosts, grouper ar seems to be more closely related to the beta than to the alpha subtype cloned to date. In the first week after 17α-methyltestosterone (MT) implantation, the transcript levels of ar in the hypothalamus declined significantly and remained low through the second week, but returned to control levels in the third and fourth weeks. In the gonad, the mRNA expression of ar was unchanged in the first week compared with the control, increased significantly in the second week, reached its highest level in the third week, and dropped slightly in the fourth week while remaining higher than that of the control. The expression pattern of ar in the hypothalamus and gonad during MT-induced sex reversal suggests the involvement of ar in regulating this process in the orange-spotted grouper. The present study details the changes in the mRNA levels of ar during MT-induced sex reversal to help understand the complicated signals underlying sex reversal. Copyright © 2012 Elsevier Inc. All rights reserved.
Gao, F; Cao, X F; Si, J P; Chen, Z Y; Duan, C L
2016-05-06
Dendrobium officinale is one of the most well-known traditional Chinese medicines, and polysaccharide is its main active ingredient. Many studies have investigated the synthesis and accumulation mechanisms of polysaccharide, but until recently, little was known about the molecular mechanism of how polysaccharide is synthesized because no related genes have been cloned. In this study, we cloned an alkaline/neutral invertase gene from D. officinale (DoNI) by the rapid amplification of cDNA ends (RACE) method. DoNI was 2231 bp long and contained an open reading frame that predicted a 62.8-kDa polypeptide with 554-amino acid residues. An alkaline/neutral invertase conserved domain was predicted from this deduced amino acid sequence, and DoNI had a similar deduced amino acid sequence to Setaria italica and Oryza brachyantha. We also found that DoNI expression in different tissues was closely related to DoNI activity, and more importantly, polysaccharide level. Our results indicate that DoNI is associated with polysaccharide accumulation in D. officinale.
Primary structure and glycosylation of the S-layer protein of Haloferax volcanii.
Sumper, M; Berg, E; Mengele, R; Strobel, I
1990-01-01
The outer surface of the archaebacterium Haloferax volcanii (formerly named Halobacterium volcanii) is covered with a hexagonally packed surface (S) layer. The gene coding for the S-layer protein was cloned and sequenced. The mature polypeptide is composed of 794 amino acids and is preceded by a typical signal sequence of 34 amino acid residues. A highly hydrophobic stretch of 20 amino acids at the C-terminal end probably serves as a transmembrane domain. Clusters of threonine residues are located adjacent to this membrane anchor. The S-layer protein is a glycoprotein containing both N- and O-glycosidic bonds. Glucosyl-(1→2)-galactose disaccharides are linked to threonine residues. The primary structure and the glycosylation pattern of the S-layer glycoproteins from Haloferax volcanii and from Halobacterium halobium were compared and found to exhibit distinct differences, despite the fact that three-dimensional reconstructions from electron micrographs revealed no structural differences at least to the 2.5-nm level attained so far (M. Kessel, I. Wildhaber, S. Cohe, and W. Baumeister, EMBO J. 7:1549-1554, 1988). PMID:2123862
Nagahashi, S; Endoh, H; Suzuki, Y; Okada, N
1991-11-20
A previous report from this laboratory showed that in vitro transcription of total genomic DNA of the newt Cynops pyrrhogaster resulted in a discrete sized 8 S RNA, which represented highly repetitive and transcribable sequences with a glutamic acid tRNA-like structure in the newt genome. We isolated four independent clones from a newt genomic library and determined the complete sequences of three 2000 to 2400 base-pair PstI fragments spanning the 8 S RNA gene. The glutamic acid tRNA-related segment in the 8 S RNA gene contains the CCA sequence expected as the 3' terminus of a tRNA molecule. Further, the 11 nucleotides located 13 nucleotides upstream from one of the two transcription initiation sites of the 8 S RNA were found to be repeated in the region upstream from the termination site, suggesting that the original unit, which is shorter than the 8 S RNA, was retrotransposed via cDNA intermediates from the PolIII transcript. In the upstream region of the 8 S RNA gene, a 360 nucleotide unit containing the glutamic acid tRNA-related segment was found to be duplicated (clones NE1 and NE10) or triplicated (clone NE3). Except for the difference in the number of the 360 nucleotide unit, the three sequences of the 2000 to 2400 base-pair PstI fragment were essentially the same with only a few mutations and minor deletions. Inverse polymerase chain reaction and sequence determination of the products, together with a Southern hybridization experiment, demonstrated that the family consists of a tandemly repeated unit of 3300, 3700 or 4100 base-pairs. Thus during evolution, this family in the newt was created by retroposition via cDNA intermediates, followed by duplication or triplication of the 360 nucleotide unit and multiplication of the 3300 to 4100 base-pair region at the DNA level.
Schmitt, Bryan H; Cunningham, Scott A; Dailey, Aaron L; Gustafson, Daniel R; Patel, Robin
2013-03-01
Identification of anaerobic bacteria using phenotypic methods is often time-consuming; methods such as 16S rRNA gene sequencing are costly and may not be readily available. We evaluated 253 clinical isolates of anaerobic bacteria using the Bruker MALDI Biotyper (Bruker Daltonics, Billerica, MA) matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) system with a user-supplemented database and an on-plate formic acid-based preparation method and compared results to those of conventional identification using biochemical testing or 16S rRNA gene sequencing. A total of 179 (70.8%) and 232 (91.7%) isolates were correctly identified to the species and genus levels, respectively, using manufacturer-recommended score cutoffs. MALDI-TOF MS offers a rapid, inexpensive method for identification of anaerobic bacteria.
Lactobacillus pobuzihii sp. nov., isolated from pobuzihi (fermented cummingcordia).
Chen, Yi-Sheng; Miyashita, Mika; Suzuki, Ken-ichiro; Sato, Hajime; Hsu, Jar-Sheng; Yanagida, Fujitoshi
2010-08-01
Twenty-one homofermentative lactic acid bacteria were isolated from fermented cummingcordia (pobuzihi), a traditional food in Taiwan. The isolates had identical 16S rRNA gene sequences that were distinct from those of other lactobacilli, and their closest neighbours in the 16S rRNA gene sequence phylogenetic tree were strains of Lactobacillus acidipiscis. Levels of DNA-DNA relatedness between representative pobuzihi isolates and strains of L. acidipiscis were 17% and below. Furthermore, the new isolates could be differentiated clearly from L. acidipiscis NBRC 102163T and NBRC 102164 in terms of acid production from L-arabinose, rhamnose, mannitol, lactose and 5-ketogluconate. It was concluded that the new isolates represent a single novel species of the genus Lactobacillus, for which the name Lactobacillus pobuzihii sp. nov. is proposed. The type strain is E100301T (=RIFY 6501T =NBRC 103219T =KCTC 13174T).
Ma, Tracy Hoi Tung; Tiu, Shirley Hiu Kwan; He, Jian-Guo; Chan, Siu-Ming
2007-08-01
C-type lectins are among the pattern-recognition proteins of the innate immune system that recognize non-self in invertebrates. In this study, a lectin-like cDNA (LvLT) of Litopenaeus vannamei was cloned and characterized. The LvLT cDNA consists of 1035 nt encoding a protein with 345 amino acid residues. The deduced LvLT contains two putative carbohydrate-recognition domains (CRDs), as found in most C-type lectins. The first CRD contains an amino acid motif (QPD) for the binding of galactose and the other CRD contains an amino acid motif (EPN) for the binding of mannose. Except for some conserved amino acid residues, the CRDs of LvLT share an overall low amino acid sequence identity with the CRDs of other lectins. Unlike other shrimp lectins, LvLT is expressed only in the hepatopancreas and not in the hemocytes, as revealed by RT-PCR. When juvenile shrimp were challenged with shrimp extracts containing white spot syndrome virus (WSSV), the expression levels of LvLT decreased initially in the first 2 h and then increased to a much higher level after 4 h. The results suggest that the initial reduction in LvLT transcript level may be related to the WSSV infection in shrimp.
D'Onofrio, Giuseppe; Ghosh, Tapash Chandra
2005-01-17
Fluctuations and increments of both C(3) and G(3) levels along the human coding sequences were investigated comparing two sets of Xenopus/human orthologous genes. The first set of genes shows minor differences of the GC(3) levels, the second shows considerable increments of the GC(3) levels in the human genes. In both data sets, the fluctuations of C(3) and G(3) levels along the coding sequences correlated with the secondary structures of the encoded proteins. The human genes that underwent the compositional transition showed a different increment of the C(3) and G(3) levels within and among the structural units of the proteins. The relative synonymous codon usage (RSCU) of several amino acids was also affected during the compositional transition, showing that there exists a correlation between RSCU and protein secondary structures in human genes. The importance of natural selection for the formation of isochore organization of the human genome has been discussed on the basis of these results.
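The two quantities this abstract relies on, third-codon-position base levels and RSCU, can be computed as in the following Python sketch. It assumes the standard genetic code and the usual RSCU definition (a codon's observed count divided by the mean count over its synonymous codons). The function names are hypothetical, and nothing here reproduces the authors' own analysis pipeline.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def codons(cds):
    """Split a coding sequence into complete codons (trailing bases dropped)."""
    cds = cds.upper()
    return [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]

def third_position_levels(cds):
    """Fractions of C, G, and G+C at the third codon position."""
    thirds = [c[2] for c in codons(cds)]
    n = len(thirds)
    c3 = thirds.count("C") / n
    g3 = thirds.count("G") / n
    return {"C3": c3, "G3": g3, "GC3": c3 + g3}

def rscu(cds):
    """RSCU per codon, for amino acids that occur at least once in cds."""
    counts = Counter(c for c in codons(cds) if c in CODON_TABLE)
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":  # stop codons are excluded
            families[aa].append(codon)
    result = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue
        for c in family:
            # observed count / (total / number of synonymous codons)
            result[c] = counts[c] * len(family) / total
    return result
```

An RSCU of 1 means a codon is used as often as expected under equal synonymous usage; values above 1 indicate preferred codons.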
Sequence variation and phylogenetic analysis of envelope glycoprotein of hepatitis G virus.
Lim, M Y; Fry, K; Yun, A; Chong, S; Linnen, J; Fung, K; Kim, J P
1997-11-01
A transfusion-transmissible agent provisionally designated hepatitis G virus (HGV) was recently identified. In this study, we examined the variability of the HGV genome by analysing sequences in the putative envelope region from 72 isolates obtained from diverse geographical sources. The 1561 nucleotide sequence of the E1/E2/NS2a region of HGV was determined from 12 isolates, and compared with three published sequences. The most variability was observed in 400 nucleotides at the N terminus of E2. We next analysed this 400 nucleotide envelope variable region (EV) from an additional 60 HGV isolates. This sequence varied considerably among the 75 isolates, with overall identity ranging from 79.3% to 99.5% at the nucleotide level, and from 83.5% to 100% at the amino acid level. However, hypervariable regions were not identified. Phylogenetic analyses indicated that the 75 HGV isolates belong to a single genotype. A single-tier distribution of evolutionary distances was observed among the 15 E1/E2/NS2a sequences and the 75 EV sequences. In contrast, 11 isolates of HCV were analysed and showed a three-tiered distribution, representing genotypes, subtypes, and isolates. The 75 isolates of HGV fell into four clusters on the phylogenetic tree. Tight geographical clustering was observed among the HGV isolates from Japan and Korea.
NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents.
Liu, Sophia S; Hockenberry, Adam J; Lancichinetti, Andrea; Jewett, Michael C; Amaral, Luís A N
2016-11-01
The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems.
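To make the idea of a GC-constrained random coding sequence concrete, the Python sketch below samples synonymous codons for a fixed protein sequence with probabilities proportional to exp(λ · GC count), and tunes λ by bisection so that the expected GC fraction matches a target. This exponential weighting is one standard maximum-entropy form and is an assumption here; the abstract does not describe NullSeq's implementation, and none of the function names below belong to that package.

```python
import math
import random
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
SYNONYMS = {}
for _codon, _aa in zip(("".join(c) for c in product(BASES, repeat=3)), AMINO):
    SYNONYMS.setdefault(_aa, []).append(_codon)

def gc_count(codon):
    return sum(base in "GC" for base in codon)

def codon_probs(aa, lam):
    """Synonymous codons of aa with probabilities proportional to exp(lam * GC)."""
    family = SYNONYMS[aa]
    weights = [math.exp(lam * gc_count(c)) for c in family]
    total = sum(weights)
    return family, [w / total for w in weights]

def expected_gc(protein, lam):
    """Expected GC fraction of a coding sequence sampled with parameter lam."""
    gc = 0.0
    for aa in protein:
        family, probs = codon_probs(aa, lam)
        gc += sum(p * gc_count(c) for c, p in zip(family, probs))
    return gc / (3 * len(protein))

def solve_lambda(protein, target_gc, lo=-20.0, hi=20.0, iters=60):
    """Bisection on lam; expected_gc never decreases as lam grows."""
    for _ in range(iters):
        mid = (lo + hi) / 2
        if expected_gc(protein, mid) < target_gc:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def random_cds(protein, target_gc, seed=None):
    """Sample one coding sequence that encodes protein exactly."""
    rng = random.Random(seed)
    lam = solve_lambda(protein, target_gc)
    chosen = []
    for aa in protein:
        family, probs = codon_probs(aa, lam)
        chosen.append(rng.choices(family, weights=probs, k=1)[0])
    return "".join(chosen)
```

In this sketch the GC target is met only in expectation, so individual samples scatter around it. If the target lies outside the range the protein allows, the bisection simply settles at the nearest extreme.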
Tan, Philip K.; Farrar, Jennifer E.; Gaucher, Eric A.; Miner, Jeffrey N.
2016-01-01
Uric acid is the highly insoluble end-product of purine metabolism in humans. Serum levels exceeding the solubility threshold can trigger formation of urate crystals resulting in gouty arthritis. Uric acid is primarily excreted through the kidneys with 90% reabsorbed back into the bloodstream through the uric acid transporter URAT1. This reabsorption process is essential for the high serum uric acid levels found in humans. We discovered that URAT1 proteins from humans and baboons have higher affinity for uric acid compared with transporters from rats and mice. This difference in transport kinetics of URAT1 orthologs, along with inability of modern apes to oxidize uric acid due to loss of the uricase enzyme, prompted us to ask whether these events occurred concomitantly during primate evolution. Ancestral URAT1 sequences were computationally inferred and ancient transporters were resurrected and assayed, revealing that affinity for uric acid was increased during the evolution of primates. This molecular fine-tuning occurred between the origins of simians and their diversification into New- and Old-World monkey and ape lineages. Remarkably, it was driven in large part by only a few amino acid replacements within the transporter. This alteration in primate URAT1 coincided with changes in uricase that greatly diminished the enzymatic activity and took place 27–77 Ma. These results suggest that the modifications to URAT1 transporters were potentially adaptive and that maintaining more constant, high levels of serum uric acid may have provided an advantage to our primate ancestors. PMID:27352852
A Novel Cylindrical Representation for Characterizing Intrinsic Properties of Protein Sequences.
Yu, Jia-Feng; Dou, Xiang-Hua; Wang, Hong-Bo; Sun, Xiao; Zhao, Hui-Ying; Wang, Ji-Hua
2015-06-22
The composition and sequence order of amino acid residues are the two most important characteristics to describe a protein sequence. Graphical representations facilitate visualization of biological sequences and produce biologically useful numerical descriptors. In this paper, we propose a novel cylindrical representation by placing the 20 amino acid residue types in a circle and sequence positions along the z axis. This representation allows visualization of the composition and sequence order of amino acids at the same time. Ten numerical descriptors and one weighted numerical descriptor have been developed to quantitatively describe intrinsic properties of protein sequences on the basis of the cylindrical model. Their applications to similarity/dissimilarity analysis of nine ND5 proteins indicated that these numerical descriptors are more effective than several classical numerical matrices. Thus, the cylindrical representation obtained here provides a new useful tool for visualizing and characterizing protein sequences. An online server is available at http://biophy.dzu.edu.cn:8080/CNumD/input.jsp.
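The geometric idea of the cylindrical representation can be sketched in a few lines of Python: each of the 20 residue types is assigned an angle on a circle, and the sequence position becomes the z coordinate. The alphabetical ordering of residues around the circle, the unit radius, and the function name are assumptions for illustration; the abstract does not specify the ordering used by the authors, and their numerical descriptors are not reproduced here.

```python
import math

# Assumed ordering of the 20 residue types around the circle (alphabetical).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def cylindrical_coordinates(seq, radius=1.0):
    """Map each residue to (x, y, z): angle from residue type, z from position."""
    step = 2 * math.pi / len(AMINO_ACIDS)
    points = []
    for z, residue in enumerate(seq.upper(), start=1):
        k = AMINO_ACIDS.index(residue)  # ValueError for non-standard residues
        theta = k * step
        points.append((radius * math.cos(theta), radius * math.sin(theta), float(z)))
    return points
```

Residues of the same type line up along a vertical line on the cylinder surface, so composition shows up as the density of each line and sequence order as the heights at which points occur.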
Zheng, Weiwei; Zhang, Ze; Liu, Cuihua; Qiao, Yuanyuan; Zhou, Dianrong; Qu, Jia; An, Huaijie; Xiong, Ming; Zhu, Zhiming; Zhao, Xiaohang
2015-01-01
Seafaring is a difficult occupation, and sailors face higher health risks than individuals on land. Commensal microbiota participates in the host immune system and metabolism, reflecting the host's health condition. However, the interaction mechanisms between the microbiota and the host's health condition remain unclear. This study reports the influence of long sea voyages on human health by utilising a metagenomic analysis of variation in the microbiota of the buccal mucosa. Paired samples collected before and after a sea voyage were analysed. After more than 120 days of ocean sailing, the oral microbial diversity of sailors was reduced by approximately 5-fold, and the levels of several pathogens (e.g., Streptococcus pneumoniae) increased. Moreover, 69.46% of the identified microbial sequences were unclassified microbiota. Notably, several metabolic pathways were dramatically decreased, including folate biosynthesis, carbohydrate, lipid and amino acid pathways. Clinical examination of the hosts confirmed the identified metabolic changes, as demonstrated by decreased serum levels of haemoglobin and folic acid, a decreased neutrophil-to-lymphocyte ratio, and increased levels of triglycerides, cholesterol and homocysteine, which are consistent with the observed microbial variation. Our study suggests that oral mucosal bacteria may reflect host health conditions and could provide approaches for improving the health of sailors. PMID:26154405