acid sequence obtained: Topics by Science.gov

Sample records for acid sequence obtained

DOE Office of Scientific and Technical Information (OSTI.GOV)

Reiser, Steven E.; Somerville, Chris R.

The present invention relates to bacterial enzymes, in particular to an acyl-CoA reductase and a gene encoding an acyl-CoA reductase, the amino acid and nucleic acid sequences corresponding to the reductase polypeptide and gene, respectively, and to methods of obtaining such enzymes, amino acid sequences and nucleic acid sequences. The invention also relates to the use of such sequences to provide transgenic host cells capable of producing fatty alcohols and fatty aldehydes.
[Cloning and sequence analysis of full-length cDNA of secoisolariciresinol dehydrogenase of Dysosma versipellis].

PubMed

Xu, Li; Ding, Zhi-Shan; Zhou, Yun-Kai; Tao, Xue-Fen

2009-06-01

To obtain the full-length cDNA sequence of Secoisolariciresinol Dehydrogenase gene from Dysosma versipellis by RACE PCR,then investigate the character of Secoisolariciresinol Dehydrogenase gene. The full-length cDNA sequence of Secoisolariciresinol Dehydrogenase gene was obtained by 3'-RACE and 5'-RACE from Dysosma versipellis. We first reported the full cDNA sequences of Secoisolariciresinol Dehydrogenase in Dysosma versipellis. The acquired gene was 991bp in full length, including 5' untranslated region of 42bp, 3' untranslated region of 112bp with Poly (A). The open reading frame (ORF) encoding 278 amino acid with molecular weight 29253.3 Daltons and isolectric point 6.328. The gene accession nucleotide sequence number in GeneBank was EU573789. Semi-quantitative RT-PCR analysis revealed that the Secoisolariciresinol Dehydrogenase gene was highly expressed in stem. Alignment of the amino acid sequence of Secoisolariciresinol Dehydrogenase indicated there may be some significant amino acid sequence difference among different species. Obtain the full-length cDNA sequence of Secoisolariciresinol Dehydrogenase gene from Dysosma versipellis.
Methods for making nucleotide probes for sequencing and synthesis

DOEpatents

Church, George M; Zhang, Kun; Chou, Joseph

2014-07-08

Compositions and methods for making a plurality of probes for analyzing a plurality of nucleic acid samples are provided. Compositions and methods for analyzing a plurality of nucleic acid samples to obtain sequence information in each nucleic acid sample are also provided.
Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

NASA Astrophysics Data System (ADS)

McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

2016-05-01

Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.
Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides.

PubMed

McMillen, Chelsea L; Wright, Patience M; Cassady, Carolyn J

2016-05-01

Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.
Cloning and purification of alpha-neurotoxins from king cobra (Ophiophagus hannah).

PubMed

He, Ying-Ying; Lee, Wei-Hui; Zhang, Yun

2004-09-01

Thirteen complete and three partial cDNA sequences were cloned from the constructed king cobra (Ophiophagus hannah) venom gland cDNA library. Phylogenetic analysis of nucleotide sequences of king cobra with those from other snake venoms revealed that obtained cDNAs are highly homologous to snake venom alpha-neurotoxins. Alignment of deduced mature peptide sequences of the obtained clones with those of other reported alpha-neurotoxins from the king cobra venom indicates that our obtained 16 clones belong to long-chain neurotoxins (seven), short-chain neurotoxins (seven), weak toxin (one) and variant (one), respectively. Up to now, two out of 16 newly cloned king cobra alpha-neurotoxins have identical amino acid sequences with CM-11 and Oh-6A/6B, which have been characterized from the same venom. Furthermore, five long-chain alpha-neurotoxins and two short-chain alpha-neurotoxins were purified from crude venom and their N-terminal amino acid sequences were determined. The cDNAs encoding the putative precursors of the purified native peptide were also determined based on the N-terminal amino acid sequencing. The purified alpha-neurotoxins showed different lethal activities on mice.
Identification of Delta5-fatty acid desaturase from the cellular slime mold dictyostelium discoideum.

PubMed

Saito, T; Ochiai, H

1999-10-01

cDNA fragments putatively encoding amino acid sequences characteristic of the fatty acid desaturase were obtained using expressed sequence tag (EST) information of the Dictyostelium cDNA project. Using this sequence, we have determined the cDNA sequence and genomic sequence of a desaturase. The cloned cDNA is 1489 nucleotides long and the deduced amino acid sequence comprised 464 amino acid residues containing an N-terminal cytochrome b5 domain. The whole sequence was 38.6% identical to the initially identified Delta5-desaturase of Mortierella alpina. We have confirmed its function as Delta5-desaturase by over expression mutation in D. discoideum and also the gain of function mutation in the yeast Saccharomyces cerevisiae. Analysis of the lipids from transformed D. discoideum and yeast demonstrated the accumulation of Delta5-desaturated products. This is the first report concering fatty acid desaturase in cellular slime molds.
Complete amino acid sequence of bovine colostrum low-Mr cysteine proteinase inhibitor.

PubMed

Hirado, M; Tsunasawa, S; Sakiyama, F; Niinobe, M; Fujii, S

1985-07-01

The complete amino acid sequence of bovine colostrum cysteine proteinase inhibitor was determined by sequencing native inhibitor and peptides obtained by cyanogen bromide degradation, Achromobacter lysylendopeptidase digestion and partial acid hydrolysis of reduced and S-carboxymethylated protein. Achromobacter peptidase digestion was successfully used to isolate two disulfide-containing peptides. The inhibitor consists of 112 amino acids with an Mr of 12787. Two disulfide bonds were established between Cys 66 and Cys 77 and between Cys 90 and Cys 110. A high degree of homology in the sequence was found between the colostrum inhibitor and human gamma-trace, human salivary acidic protein and chicken egg-white cystatin.
Contribution of Tryptophan Residues to the Combining Site of a Monoclonal Anti Dinitrophenyl Spin-Label Antibody

DTIC Science & Technology

1987-01-01

identified in the difference spectra, implying that: there are five to seven tryptophans within 17 A of the spin-label hapten. Amino acid sequences...of the heavy, and light chains were obtained by a combination of amino acid and DNA sequencing. A molecular model’ was constructed from the sequence...Clore & acids yields detailed information about the amino acid com- Gronenborn, 1982, 1983). This technique should also identify position of the combining
The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.

PubMed Central

Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R

1982-01-01

The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. Images PMID:6296791
Streptococcal phosphoenolpyruvate-sugar phosphotransferase system: amino acid sequence and site of ATP-dependent phosphorylation of HPr

DOE Office of Scientific and Technical Information (OSTI.GOV)

Deutscher, J.; Pevec, B.; Beyreuther, K.

1986-10-21

The amino acid sequence of histidine-containing protein (HPr) from Streptococcus faecalis has been determined by direct Edman degradation of intact HPr and by amino acid sequence analysis of tryptic peptides, V8 proteolyptic peptides, thermolytic peptides, and cyanogen bromide cleavage products. HPr from S. faecalis was found to contain 89 amino acid residues, corresponding to a molecular weight of 9438. The amino acid sequence of HPr from S. faecalis shows extended homology to the primary structure of HPr proteins from other bacteria. Besides the phosphoenolpyruvate-dependent phosphorylation of a histidyl residue in HPr, catalyzed by enzyme I of the bacterial phosphotransferase system,more » HPr was also found to be phosphorylated at a seryl residue in an ATP-dependent protein kinase catalyzed reaction. The site of ATP-dependent phosphorylation in HPr of S faecalis has now been determined. (/sup 32/P)P-Ser-HPr was digested with three different proteases, and in each case, a single labeled peptide was isolated. Following digestion with subtilisin, they obtained a peptide with the sequence -(P)Ser-Ile-Met-. Using chymotrypsin, they isolated a peptide with the sequence -Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-Gly-Val-Met-. The longest labeled peptide was obtained with V8 staphylococcal protease. According to amino acid analysis, this peptide contained 36 out of the 89 amino acid residues of HPr. The following sequence of 12 amino acid residues of the V8 peptide was determined: -Tyr-Lys-Gly-Lys-Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-. Thus, the site of ATP-dependent phosphorylation was determined to be Ser-46 within the primary structure of HPr.« less
WEB-server for search of a periodicity in amino acid and nucleotide sequences

NASA Astrophysics Data System (ADS)

E Frenkel, F.; Skryabin, K. G.; Korotkov, E. V.

2017-12-01

A new web server (http://victoria.biengi.ac.ru/splinter/login.php) was designed and developed to search for periodicity in nucleotide and amino acid sequences. The web server operation is based upon a new mathematical method of searching for multiple alignments, which is founded on the position weight matrices optimization, as well as on implementation of the two-dimensional dynamic programming. This approach allows the construction of multiple alignments of the indistinctly similar amino acid and nucleotide sequences that accumulated more than 1.5 substitutions per a single amino acid or a nucleotide without performing the sequences paired comparisons. The article examines the principles of the web server operation and two examples of studying amino acid and nucleotide sequences, as well as information that could be obtained using the web server.
Normalization of Complete Genome Characteristics: Application to Evolution from Primitive Organisms to Homo sapiens.

PubMed

Sorimachi, Kenji; Okayasu, Teiji; Ohhira, Shuji

2015-04-01

Normalized nucleotide and amino acid contents of complete genome sequences can be visualized as radar charts. The shapes of these charts depict the characteristics of an organism's genome. The normalized values calculated from the genome sequence theoretically exclude experimental errors. Further, because normalization is independent of both target size and kind, this procedure is applicable not only to single genes but also to whole genomes, which consist of a huge number of different genes. In this review, we discuss the applications of the normalization of the nucleotide and predicted amino acid contents of complete genomes to the investigation of genome structure and to evolutionary research from primitive organisms to Homo sapiens. Some of the results could never have been obtained from the analysis of individual nucleotide or amino acid sequences but were revealed only after the normalization of nucleotide and amino acid contents was applied to genome research. The discovery that genome structure was homogeneous was obtained only after normalization methods were applied to the nucleotide or predicted amino acid contents of genome sequences. Normalization procedures are also applicable to evolutionary research. Thus, normalization of the contents of whole genomes is a useful procedure that can help to characterize organisms.
Technology-Enhanced Research in the Science Classroom.

ERIC Educational Resources Information Center

Francis, Joseph W.

1997-01-01

Describes a project where students use the Internet as a research tool. Discusses using e-mail to access molecular biology databases and identify proteins using amino acid sequences, obtaining complete amino acid sequences using the world wide web, using telnet to access library resources on the Internet, and various stages of protein analysis…
Cloning and expression of cDNA coding for bouganin.

PubMed

den Hartog, Marcel T; Lubelli, Chiara; Boon, Louis; Heerkens, Sijmie; Ortiz Buijsse, Antonio P; de Boer, Mark; Stirpe, Fiorenzo

2002-03-01

Bouganin is a ribosome-inactivating protein that recently was isolated from Bougainvillea spectabilis Willd. In this work, the cloning and expression of the cDNA encoding for bouganin is described. From the cDNA, the amino-acid sequence was deduced, which correlated with the primary sequence data obtained by amino-acid sequencing on the native protein. Bouganin is synthesized as a pro-peptide consisting of 305 amino acids, the first 26 of which act as a leader signal while the 29 C-terminal amino acids are cleaved during processing of the molecule. The mature protein consists of 250 amino acids. Using the cDNA sequence encoding the mature protein of 250 amino acids, a recombinant protein was expressed, purified and characterized. The recombinant molecule had similar activity in a cell-free protein synthesis assay and had comparable toxicity on living cells as compared to the isolated native bouganin.
Hydroxyapatite-binding peptides for bone growth and inhibition

DOEpatents

Bertozzi, Carolyn R [Berkeley, CA; Song, Jie [Shrewsbury, MA; Lee, Seung-Wuk [Walnut Creek, CA

2011-09-20

Hydroxyapatite (HA)-binding peptides are selected using combinatorial phage library display. Pseudo-repetitive consensus amino acid sequences possessing periodic hydroxyl side chains in every two or three amino acid sequences are obtained. These sequences resemble the (Gly-Pro-Hyp).sub.x repeat of human type I collagen, a major component of extracellular matrices of natural bone. A consistent presence of basic amino acid residues is also observed. The peptides are synthesized by the solid-phase synthetic method and then used for template-driven HA-mineralization. Microscopy reveal that the peptides template the growth of polycrystalline HA crystals .about.40 nm in size.
Evidence of Divergent Amino Acid Usage in Comparative Analyses of R5- and X4-Associated HIV-1 Vpr Sequences

PubMed Central

Antell, Gregory C.; Zhong, Wen; Kercher, Katherine; Passic, Shendra; Williams, Jean; Liu, Yucheng; James, Tony; Jacobson, Jeffrey M.; Szep, Zsofia

2017-01-01

Vpr is an HIV-1 accessory protein that plays numerous roles during viral replication, and some of which are cell type dependent. To test the hypothesis that HIV-1 tropism extends beyond the envelope into the vpr gene, studies were performed to identify the associations between coreceptor usage and Vpr variation in HIV-1-infected patients. Colinear HIV-1 Env-V3 and Vpr amino acid sequences were obtained from the LANL HIV-1 sequence database and from well-suppressed patients in the Drexel/Temple Medicine CNS AIDS Research and Eradication Study (CARES) Cohort. Genotypic classification of Env-V3 sequences as X4 (CXCR4-utilizing) or R5 (CCR5-utilizing) was used to group colinear Vpr sequences. To reveal the sequences associated with a specific coreceptor usage genotype, Vpr amino acid sequences were assessed for amino acid diversity and Jensen-Shannon divergence between the two groups. Five amino acid alphabets were used to comprehensively examine the impact of amino acid substitutions involving side chains with similar physiochemical properties. Positions 36, 37, 41, 89, and 96 of Vpr were characterized by statistically significant divergence across multiple alphabets when X4 and R5 sequence groups were compared. In addition, consensus amino acid switches were found at positions 37 and 41 in comparisons of the R5 and X4 sequence populations. These results suggest an evolutionary link between Vpr and gp120 in HIV-1-infected patients. PMID:28620613
The primary structure of aspartate aminotransferase from pig heart muscle. Partial sequences determined by digestion with thermolysin and elastase

PubMed Central

Bossa, Francesco; Barra, Donatella; Carloni, Massimo; Fasella, Paolo; Riva, Francesca; Doonan, Shawn; Doonan, Hilary J.; Hanford, Robin; Vernon, Charles A.; Walker, John M.

1973-01-01

Peptides produced by thermolytic digestion of aminoethylated aspartate aminotransferase and of the oxidized enzyme were isolated and their amino acid sequences determined. Digestion by elastase of the carboxymethylated enzyme gave peptides representing approximately 40% of the primary structure. Fragments from these digests overlapped with previously reported sequences of peptides obtained by peptic and tryptic digestion (Doonan et al., 1972), giving ten composite peptides containing 395 amino acid residues. The amino acid composition of these composite peptides agrees well with that of the intact enzyme. Confirmatory results for some of the present data have been deposited as Supplementary Publication 50018 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1973) 131, 5. PMID:4748834
Sequence and phylogenetic analysis of chicken anaemia virus obtained from backyard and commercial chickens in Nigeria.

PubMed

Oluwayelu, D O; Todd, D; Olaleye, O D

2008-12-01

This work reports the first molecular analysis study of chicken anaemia virus (CAV) in backyard chickens in Africa using molecular cloning and sequence analysis to characterize CAV strains obtained from commercial chickens and Nigerian backyard chickens. Partial VP1 gene sequences were determined for three CAVs from commercial chickens and for six CAV variants present in samples from a backyard chicken. Multiple alignment analysis revealed that the 6% and 4% nucleotide diversity obtained respectively for the commercial and backyard chicken strains translated to only 2% amino acid diversity for each breed. Overall, the amino acid composition of Nigerian CAVs was found to be highly conserved. Since the partial VP1 gene sequence of two backyard chicken cloned CAV strains (NGR/CI-8 and NGR/CI-9) were almost identical and evolutionarily closely related to the commercial chicken strains NGR-1, and NGR-4 and NGR-5, respectively, we concluded that CAV infections had crossed the farm boundary.
Purification and characterization of gamma poly glutamic acid from newly Bacillus licheniformis NRC20.

PubMed

Tork, Sanaa E; Aly, Magda M; Alakilli, Saleha Y; Al-Seeni, Madeha N

2015-03-01

γ-poly glutamic acid (γ-PGA) has received considerable attention for pharmaceutical and biomedical applications. γ-PGA from the newly isolate Bacillus licheniformis NRC20 was purified and characterized using diffusion distance agar plate, mass spectrometry and thin layer chromatography. All analysis indicated that γ-PGA is a homopolymer composed of glutamic acid. Its molecular weight was determined to be 1266 kDa. It was composed of L- and D-glutamic acid residues. An amplicon of 3050 represents the γ-PGA-coding genes was obtained, sequenced and submitted in genbank database. Its amino acid sequence showed high similarity with that obtained from B. licheniformis strains. The bacterium NRC 20 was independent of L-glutamic acid but the polymer production enhanced when cultivated in medium containing L-glutamic acid as the sole nitrogen source. Finally we can conclude that γ-PGA production from B. licheniformis NRC20 has many promised applications in medicine, industry and nanotechnology. Copyright © 2014 Elsevier B.V. All rights reserved.

A Novel Cylindrical Representation for Characterizing Intrinsic Properties of Protein Sequences.

PubMed

Yu, Jia-Feng; Dou, Xiang-Hua; Wang, Hong-Bo; Sun, Xiao; Zhao, Hui-Ying; Wang, Ji-Hua

2015-06-22

The composition and sequence order of amino acid residues are the two most important characteristics to describe a protein sequence. Graphical representations facilitate visualization of biological sequences and produce biologically useful numerical descriptors. In this paper, we propose a novel cylindrical representation by placing the 20 amino acid residue types in a circle and sequence positions along the z axis. This representation allows visualization of the composition and sequence order of amino acids at the same time. Ten numerical descriptors and one weighted numerical descriptor have been developed to quantitatively describe intrinsic properties of protein sequences on the basis of the cylindrical model. Their applications to similarity/dissimilarity analysis of nine ND5 proteins indicated that these numerical descriptors are more effective than several classical numerical matrices. Thus, the cylindrical representation obtained here provides a new useful tool for visualizing and charactering protein sequences. An online server is available at http://biophy.dzu.edu.cn:8080/CNumD/input.jsp .
Sequencing proteins with transverse ionic transport in nanochannels.

PubMed

Boynton, Paul; Di Ventra, Massimiliano

2016-05-03

De novo protein sequencing is essential for understanding cellular processes that govern the function of living organisms and all sequence modifications that occur after a protein has been constructed from its corresponding DNA code. By obtaining the order of the amino acids that compose a given protein one can then determine both its secondary and tertiary structures through structure prediction, which is used to create models for protein aggregation diseases such as Alzheimer's Disease. Here, we propose a new technique for de novo protein sequencing that involves translocating a polypeptide through a synthetic nanochannel and measuring the ionic current of each amino acid through an intersecting perpendicular nanochannel. We find that the distribution of ionic currents for each of the 20 proteinogenic amino acids encoded by eukaryotic genes is statistically distinct, showing this technique's potential for de novo protein sequencing.
The amino acid sequence of Staphylococcus aureus penicillinase.

PubMed Central

Ambler, R P

1975-01-01

The amino acid sequence of the penicillinase (penicillin amido-beta-lactamhydrolase, EC 3.5.2.6) from Staphylococcus aureus strain PC1 was determined. The protein consists of a single polypeptide chain of 257 residues, and the sequence was determined by characterization of tryptic, chymotryptic, peptic and CNBr peptides, with some additional evidence from thermolysin and S. aureus proteinase peptides. A mistake in the preliminary report of the sequence is corrected; residues 113-116 are now thought to be -Lys-Lys-Val-Lys- rather than -Lys-Val-Lys-Lys-. Detailed evidence for the amino acid sequence has been deposited as Supplementary Publication SUP 50056 (91 pages) at the British Library (Lending Division), Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1975) 145, 5. PMID:1218078
Complete genome sequence of a novel genotype of squash mosaic virus

USDA-ARS?s Scientific Manuscript database

Complete genome sequence of a novel genotype of Squash mosaic virus (SqMV) infecting squash plants in Spain was obtained using deep sequencing of small ribonucleic acids and assembly. The low nucleotide sequence identities, with 87-88% on RNA1 and 84-86% on RNA2 to known SqMV isolates, suggest a new...
Molecular cloning of crustins from the hemocytes of Brazilian penaeid shrimps.

PubMed

Rosa, Rafael Diego; Bandeira, Paula Terra; Barracco, Margherita Anna

2007-09-01

Crustins are antimicrobial peptides initially identified in the hemocytes of the crab Carcinus maenas (11.5-kDa peptide or carcinin) and recently also recognized in penaeid shrimps and other crustacean species. The aim of this study was to identify sequences encoding for crustins from the hemocytes of four Brazilian penaeid species: Farfantepenaeus paulensis, Farfantepenaeus subtilis, Farfantepenaeus brasiliensis and Litopenaeus schmitti. Using primers based on consensus nucleotide alignment of crustins from different crustaceans, cDNA sequences coding for crustins in all indigenous penaeid species were amplified. The obtained four crustin sequences encoded for peptides containing a hydrophobic N-terminal region rich in glycine repeats and a C-terminal part with 12 cysteine residues and a conserved whey acidic protein domain. All obtained crustin sequences showed high amino acidic similarity among each other and with crustins from litopenaeid shrimps (76-98%). This is the first report of crustins in native Brazilian penaeid shrimps.
Amino acid sequence of a trypsin inhibitor from a Spirometra (Spirometra erinaceieuropaei).

PubMed

Sanda, A; Uchida, A; Itagaki, T; Kobayashi, H; Inokuchi, N; Koyama, T; Iwama, M; Ohgi, K; Irie, M

2001-12-01

A trypsin inhibitor that is highly homologous with bovine pancreatic trypsin inhibitor (BPTI) was co-purified along with RNase from Spirometra (Spirometra erinaceieuropaei). The amino acid sequence of this inhibitor (SETI) and the nucleotide sequence of the cDNA encoding this protein were determined by protein chemistry and gene technology. SETI contains 68 amino acid residues and has a molecular mass of 7,798 Da. SETI has 31 amino acid residues that are identical with BPTI's sequence, including 6 half-cystine and 5 aromatic amino acid residues. The active site Lys residue in BPTI is replaced by an Arg residue in SETI. SETI is an effective inhibitor of trypsin and moderately inhibits a-chymotrypsin, but less inhibits elastase or subtilisin. SETI was expressed by E. coli containing a PelB vector carrying the SETI encoding cDNA; an expression yield of 0.68 mg/l was obtained. The phylogenetic relationship of SETI and the other BPTI-like trypsin inhibitors was analyzed using most likelihood inference methods.
A proteomic analysis of leaf sheaths from rice.

PubMed

Shen, Shihua; Matsubae, Masami; Takao, Toshifumi; Tanaka, Naoki; Komatsu, Setsuko

2002-10-01

The proteins extracted from the leaf sheaths of rice seedlings were separated by 2-D PAGE, and analyzed by Edman sequencing and mass spectrometry, followed by database searching. Image analysis revealed 352 protein spots on 2-D PAGE after staining with Coomassie Brilliant Blue. The amino acid sequences of 44 of 84 proteins were determined; for 31 of these proteins, a clear function could be assigned, whereas for 12 proteins, no function could be assigned. Forty proteins did not yield amino acid sequence information, because they were N-terminally blocked, or the obtained sequences were too short and/or did not give unambiguous results. Fifty-nine proteins were analyzed by mass spectrometry; all of these proteins were identified by matching to the protein database. The amino acid sequences of 19 of 27 proteins analyzed by mass spectrometry were similar to the results of Edman sequencing. These results suggest that 2-D PAGE combined with Edman sequencing and mass spectrometry analysis can be effectively used to identify plant proteins.
The complete CDS of the prion protein (PRNP) gene of African lion (Panthera leo).

PubMed

Maj, Andrzej; Spellman, Garth M; Sarver, Shane K

2008-04-01

We provide the complete PRNP CDS sequence for the African lion, which is different from the previously published sequence and more similar to other carnivore sequences. The newly obtained prion protein sequence differs from the domestic cat sequence at three amino acid positions and contains only four octapeptide repeats. We recommend that this sequence be used as the reference sequence for future studies of the PRNP gene for this species.
Amino acid "little Big Bang": representing amino acid substitution matrices as dot products of Euclidian vectors.

PubMed

Zimmermann, Karel; Gibrat, Jean-François

2010-01-04

Sequence comparisons make use of a one-letter representation for amino acids, the necessary quantitative information being supplied by the substitution matrices. This paper deals with the problem of finding a representation that provides a comprehensive description of amino acid intrinsic properties consistent with the substitution matrices. We present a Euclidian vector representation of the amino acids, obtained by the singular value decomposition of the substitution matrices. The substitution matrix entries correspond to the dot product of amino acid vectors. We apply this vector encoding to the study of the relative importance of various amino acid physicochemical properties upon the substitution matrices. We also characterize and compare the PAM and BLOSUM series substitution matrices. This vector encoding introduces a Euclidian metric in the amino acid space, consistent with substitution matrices. Such a numerical description of the amino acid is useful when intrinsic properties of amino acids are necessary, for instance, building sequence profiles or finding consensus sequences, using machine learning algorithms such as Support Vector Machine and Neural Networks algorithms.
Production of hydroxylated fatty acids in genetically modified plants

DOEpatents

Somerville, Chris; Broun, Pierre; van de Loo, Frank

2001-01-01

This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants.
Porcine insulin receptor substrate 4 (IRS4) gene: cloning, polymorphism and association study

USDA-ARS?s Scientific Manuscript database

Using PCR and IPCR techniques we obtained a 4498 bp nucleotide sequence FN424076 encompassing the complete coding sequence of the porcine IRS4 gene and its proximal promoter. The 1269-amino acid porcine protein deduced from the nucleotide sequence shares 92% identity with the human IRS4 and possesse...
Complementary DNA cloning and molecular evolution of opine dehydrogenases in some marine invertebrates.

PubMed

Kimura, Tomohiro; Nakano, Toshiki; Yamaguchi, Toshiyasu; Sato, Minoru; Ogawa, Tomohisa; Muramoto, Koji; Yokoyama, Takehiko; Kan-No, Nobuhiro; Nagahisa, Eizou; Janssen, Frank; Grieshaber, Manfred K

2004-01-01

The complete complementary DNA sequences of genes presumably coding for opine dehydrogenases from Arabella iricolor (sandworm), Haliotis discus hannai (abalone), and Patinopecten yessoensis (scallop) were determined, and partial cDNA sequences were derived for Meretrix lusoria (Japanese hard clam) and Spisula sachalinensis (Sakhalin surf clam). The primers ODH-9F and ODH-11R proved useful for amplifying the sequences for opine dehydrogenases from the 4 mollusk species investigated in this study. The sequence of the sandworm was obtained using primers constructed from the amino acid sequence of tauropine dehydrogenase, the main opine dehydrogenase in A. iricolor. The complete cDNA sequence of A. iricolor, H. discus hannai, and P. yessoensis encode 397, 400, and 405 amino acids, respectively. All sequences were aligned and compared with published databank sequences of Loligo opalescens, Loligo vulgaris (squid), Sepia officinalis (cuttlefish), and Pecten maximus (scallop). As expected, a high level of homology was observed for the cDNA from closely related species, such as for cephalopods or scallops, whereas cDNA from the other species showed lower-level homologies. A similar trend was observed when the deduced amino acid sequences were compared. Furthermore, alignment of these sequences revealed some structural motifs that are possibly related to the binding sites of the substrates. The phylogenetic trees derived from the nucleotide and amino acid sequences were consistent with the classification of species resulting from classical taxonomic analyses.
DNA Music.

ERIC Educational Resources Information Center

Miner, Carol; della Villa, Paula

1997-01-01

Describes an activity in which students reverse-translate proteins from their amino acid sequences back to their DNA sequences then assign musical notes to represent the adenine, guanine, cytosine, and thymine bases. Data is obtained from the National Institutes of Health (NIH) on the Internet. (DDR)
Characterization of the novel antifungal protein PgAFP and the encoding gene of Penicillium chrysogenum.

PubMed

Rodríguez-Martín, Andrea; Acosta, Raquel; Liddell, Susan; Núñez, Félix; Benito, M José; Asensio, Miguel A

2010-04-01

The strain RP42C from Penicillium chrysogenum produces a small protein PgAFP that inhibits the growth of some toxigenic molds. The molecular mass of the protein determined by electrospray ionization mass spectrometry (ESI-MS) was 6 494Da. PgAFP showed a cationic character with an estimated pI value of 9.22. Upon chemical and enzymatic treatments of PgAFP, no evidence for N- or O-glycosylations was obtained. Five partial sequences of PgAFP were obtained by Edman degradation and by ESI-MS/MS after trypsin and chymotrypsin digestions. Using degenerate primers from these peptide sequences, a segment of 70bp was amplified by PCR from pgafp gene. 5'- and 3'-ends of pgafp were obtained by RACE-PCR with gene-specific primers designed from the 70bp segment. The complete pgafp sequence of 404bp was obtained using primers designed from 5'- and 3'-ends. Comparison of genomic and cDNA sequences revealed a 279bp coding region interrupted by two introns of 63 and 62bp. The precursor of the antifungal protein consists of 92 amino acids and appears to be processed to the mature 58 amino acids PgAFP. The deduced amino acid sequence of the mature protein shares 79% identity to the antifungal protein Anafp from Aspergillus niger. PgAFP is a new protein that belongs to the group of small, cysteine-rich, and basic proteins with antifungal activity produced by ascomycetes. Given that P. chrysogenum is regarded as safe mold commonly found in foods, PgAFP may be useful to prevent growth of toxigenic molds in food and agricultural products. Copyright (c) 2009 Elsevier Inc. All rights reserved.
Characterization, production, and purification of leucocin H, a two-peptide bacteriocin from Leuconostoc MF215B.

PubMed

Blom, H; Katla, T; Holck, A; Sletten, K; Axelsson, L; Holo, H

1999-07-01

Leuconostoc MF215B was found to produce a two-peptide bacteriocin referred to as leucocin H. The two peptides were termed leucocin Halpha and leucocin Hbeta. When acting together, they inhibit, among others, Listeria monocytogenes, Bacillus cereus, and Clostridium perfringens. Production of leucocin H in growth medium takes place at temperatures down to 6 degrees C and at pH below 7. The highest activity of leucocin H in growth medium was demonstrated in the late exponential growth phase. The bacteriocin was purified by precipitation with ammonium sulfate, ion-exchange (SP Sepharose) and reverse phase chromatography. Upon purification, specific activity increased 10(5)-fold, and the final specific activity was 2 x 10(7) BU/OD280. Amino acid composition analyses of leucocin Halpha and leucocin Hbeta indicated that both peptides consisted of around 40 amino acid residues. Their N-termini were blocked for Edman degradation, and the methionin residues of leucocin Hbeta did not respond to Cyanogen Bromide (CNBr) cleavage. Absorbance at 280 nm indicated the presence of tryptophan residues and tryptophan-fracturing opened for partial sequencing by Edman degradation. From leucocin Halpha, the sequence of 20 amino acids was obtained; from leucocin Hbeta the sequence of 28 amino acid residues was obtained. No sequence homology to other known bacteriocins could be demonstrated. It also appeared that the two peptides themselves shared little or no sequence homology. The presence of soy oil did not affect the activity of leucocin H in agar.
Arrays of probes for positional sequencing by hybridization

DOEpatents

Cantor, Charles R [Boston, MA; Prezetakiewiczr, Marek [East Boston, MA; Smith, Cassandra L [Boston, MA; Sano, Takeshi [Waltham, MA

2008-01-15

This invention is directed to methods and reagents useful for sequencing nucleic acid targets utilizing sequencing by hybridization technology comprising probes, arrays of probes and methods whereby sequence information is obtained rapidly and efficiently in discrete packages. That information can be used for the detection, identification, purification and complete or partial sequencing of a particular target nucleic acid. When coupled with a ligation step, these methods can be performed under a single set of hybridization conditions. The invention also relates to the replication of probe arrays and methods for making and replicating arrays of probes which are useful for the large scale manufacture of diagnostic aids used to screen biological samples for specific target sequences. Arrays created using PCR technology may comprise probes with 5'- and/or 3'-overhangs.
Domestic dog origin of canine distemper virus in free-ranging wolves in Portugal as revealed by hemagglutinin gene characterization.

PubMed

Müller, Alexandra; Silva, Eliane; Santos, Nuno; Thompson, Gertrude

2011-07-01

Serologic evidence for canine distemper virus (CDV) has been described in grey wolves but, to our knowledge, virus strains circulating in wolves have not been characterized genetically. The emergence of CDV in several non-dog hosts has been associated with amino acid substitutions at sites 530 and 549 of the hemagglutinin (H) protein. We sequenced the H gene of wild-type canine distemper virus obtained from two free-ranging Iberian wolves (Canis lupus signatus) and from one domestic dog (Canis familiaris). More differences were found between the two wolf sequences than between one of the wolves (wolf 75) and the dog. The latter two had a very high nucleotide similarity resulting in identical H gene amino acid sequences. Possible explanations include geographic and especially temporal proximity of the CDV obtained from wolf 75 and the domestic dog, taken in 2007-2008, as opposed to that from wolf 3 taken more distantly in 1998. Analysis of the deduced amino acids of the viral hemagglutinin revealed a glycine (G) and a tyrosine (Y) at amino acid positions 530 and 549, respectively, of the partial signaling lymphocytic activation molecule (SLAM)-receptor binding region which is typically found in viral strains obtained from domestic dogs. This suggests that the CDV found in these wolves resulted from transmission events from local domestic dogs rather than from wildlife species.
Molecular cloning of two human liver 3 alpha-hydroxysteroid/dihydrodiol dehydrogenase isoenzymes that are identical with chlordecone reductase and bile-acid binder.

PubMed Central

Deyashiki, Y; Ogasawara, A; Nakayama, T; Nakanishi, M; Miyabe, Y; Sato, K; Hara, A

1994-01-01

Human liver contains two dihydrodiol dehydrogenases, DD2 and DD4, associated with 3 alpha-hydroxysteroid dehydrogenase activity. We have raised polyclonal antibodies that cross-reacted with the two enzymes and isolated two 1.2 kb cDNA clones (C9 and C11) for the two enzymes from a human liver cDNA library using the antibodies. The clones of C9 and C11 contained coding sequences corresponding to 306 and 321 amino acid residues respectively, but lacked 5'-coding regions around the initiation codon. Sequence analyses of several peptides obtained by enzymic and chemical cleavages of the two purified enzymes verified that the C9 and C11 clones encoded DD2 and DD4 respectively, and further indicated that the sequence of DD2 had at least additional 16 residues upward from the N-terminal sequence deduced from the cDNA. There was 82% amino acid sequence identity between the two enzymes, indicating that the enzymes are genetic isoenzymes. A computer-based comparison of the cDNAs of the isoenzymes with the DNA sequence database revealed that the nucleotide and amino acid sequences of DD2 and DD4 are virtually identical with those of human bile-acid binder and human chlordecone reductase cDNAs respectively. Images Figure 1 PMID:8172617
The glucose transporter 1 -GLUT1- from the white shrimp Litopenaeus vannamei is up-regulated during hypoxia.

PubMed

Martínez-Quintana, José A; Peregrino-Uriarte, Alma B; Gollas-Galván, Teresa; Gómez-Jiménez, Silvia; Yepiz-Plascencia, Gloria

2014-12-01

During hypoxia the shrimp Litopenaeus vannamei accelerates anaerobic glycolysis to obtain energy; therefore, a correct supply of glucose to the cells is needed. Facilitated glucose transport across the cells is mediated by a group of membrane embedded integral proteins called GLUT; being GLUT1 the most ubiquitous form. In this work, we report the first cDNA nucleotide and deduced amino acid sequences of a glucose transporter 1 from L. vannamei. A 1619 bp sequence was obtained by RT-PCR and RACE approaches. The 5´ UTR is 161 bp and the poly A tail is exactly after the stop codon in the mRNA. The ORF is 1485 bp and codes for 485 amino acids. The deduced protein sequence has high identity to GLUT1 proteins from several species and contains all the main features of glucose transporter proteins, including twelve transmembrane domains, the conserved motives and amino acids involved in transport activity, ligands binding and membrane anchor. Therefore, we decided to name this sequence, glucose transporter 1 of L. vannamei (LvGLUT1). A partial gene sequence of 8.87 Kbp was also obtained; it contains the complete coding sequence divided in 10 exons. LvGlut1 expression was detected in hemocytes, hepatopancreas, intestine gills, muscle and pleopods. The higher relative expression was found in gills and the lower in hemocytes. This indicates that LvGlut1 is ubiquitously expressed but its levels are tissue-specific and upon short-term hypoxia, the GLUT1 transcripts increase 3.7-fold in hepatopancreas and gills. To our knowledge, this is the first evidence of expression of GLUT1 in crustaceans.
Plant fatty acid hydroxylases

DOEpatents

Somerville, Chris; Broun, Pierre; van de Loo, Frank

2001-01-01

This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants. In addition, the use of genes encoding fatty acid hydroxylases or desaturases to alter the level of lipid fatty acid unsaturation in transgenic plants is described.

Phylogenetic analysis of β-xylanase SRXL1 of Sporisorium reilianum and its relationship with families (GH10 and GH11) of Ascomycetes and Basidiomycetes

PubMed Central

Álvarez-Cervantes, Jorge; Díaz-Godínez, Gerardo; Mercado-Flores, Yuridia; Gupta, Vijai Kumar; Anducho-Reyes, Miguel Angel

2016-01-01

In this paper, the amino acid sequence of the β-xylanase SRXL1 of Sporisorium reilianum, which is a pathogenic fungus of maize was used as a model protein to find its phylogenetic relationship with other xylanases of Ascomycetes and Basidiomycetes and the information obtained allowed to establish a hypothesis of monophyly and of biological role. 84 amino acid sequences of β-xylanase obtained from the GenBank database was used. Groupings analysis of higher-level in the Pfam database allowed to determine that the proteins under study were classified into the GH10 and GH11 families, based on the regions of highly conserved amino acids, 233–318 and 180–193 respectively, where glutamate residues are responsible for the catalysis. PMID:27040368
NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents.

PubMed

Liu, Sophia S; Hockenberry, Adam J; Lancichinetti, Andrea; Jewett, Michael C; Amaral, Luís A N

2016-11-01

The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems.
Obtaining a more resolute teleost growth hormone phylogeny by the introduction of gaps in sequence alignment.

PubMed

Rubin, D A; Dores, R M

1995-06-01

In order to obtain a more resolute phylogeny of teleosts based on growth hormone (GH) sequences, phylogenetic analyses were performed in which deletions (gaps), which appear to be order specific, were upheld to maintain GH's structural information. Sequences were analyzed at 194 amino acid positions. In addition, the two closest genealogically related groups to the teleosts, Amia calva and Acipenser guldenstadti, were used as outgroups. Modified sequence alignments were also analyzed to determine clade stability. Analyses indicated, in the most parsimonious cladogram, that molecular and morphological relationships for the orders of fishes are congruent. With GH molecular sequence data it was possible to resolve all clades at the familial level. Analyses of the primary sequence data indicate that: (a) the halecomorphean and chondrostean GH sequences are the appropriate outgroups for generating the most parsimonious cladogram for teleosts; (b) proper alignment of teleost GH sequence by the inclusion of gaps is necessary for resolution of the Percomorpha; and (c) removal of sequence information by deleting improperly aligned sequence decreases the phylogenetic signal obtained.
A population of endogenous pararetrovirus genomes in carrizo citrange

USDA-ARS?s Scientific Manuscript database

The complete genomes of three related endogenous pararetroviruses (EPRVs) were obtained by 454 sequencing of nucleic acid extracts from ‘Carrizo’citrange, used as a citrus rootstock. Numerous homologous sequences have been found in the sweet orange genome. The new EPRVs are most closely related to...
Using Maximum Entropy to Find Patterns in Genomes

NASA Astrophysics Data System (ADS)

Liu, Sophia; Hockenberry, Adam; Lancichinetti, Andrea; Jewett, Michael; Amaral, Luis

The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. To accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. This approach can also be easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes. National Institute of General Medical Science, Northwestern University Presidential Fellowship, National Science Foundation, David and Lucile Packard Foundation, Camille Dreyfus Teacher Scholar Award.
Iterative reactions of transient boronic acids enable sequential C-C bond formation

NASA Astrophysics Data System (ADS)

Battilocchio, Claudio; Feist, Florian; Hafner, Andreas; Simon, Meike; Tran, Duc N.; Allwood, Daniel M.; Blakemore, David C.; Ley, Steven V.

2016-04-01

The ability to form multiple carbon-carbon bonds in a controlled sequence and thus rapidly build molecular complexity in an iterative fashion is an important goal in modern chemical synthesis. In recent times, transition-metal-catalysed coupling reactions have dominated in the development of C-C bond forming processes. A desire to reduce the reliance on precious metals and a need to obtain products with very low levels of metal impurities has brought a renewed focus on metal-free coupling processes. Here, we report the in situ preparation of reactive allylic and benzylic boronic acids, obtained by reacting flow-generated diazo compounds with boronic acids, and their application in controlled iterative C-C bond forming reactions is described. Thus far we have shown the formation of up to three C-C bonds in a sequence including the final trapping of a reactive boronic acid species with an aldehyde to generate a range of new chemical structures.
cDNA cloning, functional expression and cellular localization of rat liver mitochondrial electron-transfer flavoprotein-ubiquinone oxidoreductase protein.

PubMed

Huang, Shengbing; Song, Wei; Lin, Qishui

2005-08-01

A membrane-bound protein was purified from rat liver mitochondria. After being digested with V8 protease, two peptides containing identical 14 amino acid residue sequences were obtained. Using the 14 amino acid peptide derived DNA sequence as gene specific primer, the cDNA of correspondent gene 5'-terminal and 3'-terminal were obtained by RACE technique. The full-length cDNA that encoded a protein of 616 amino acids was thus cloned, which included the above mentioned peptide sequence. The full length cDNA was highly homologous to that of human ETF-QO, indicating that it may be the cDNA of rat ETF-QO. ETF-QO is an iron sulfur protein located in mitochondria inner membrane containing two kinds of redox center: FAD and [4Fe-4S] center. After comparing the sequence from the cDNA of the 616 amino acids protein with that of the mature protein of rat liver mitochondria, it was found that the N terminal 32 amino acid residues did not exist in the mature protein, indicating that the cDNA was that of ETF-QOp. When the cDNA was expressed in Saccharomyces cerevisiae with inducible vectors, the protein product was enriched in mitochondrial fraction and exhibited electron transfer activity (NBT reductase activity) of ETF-QO. Results demonstrated that the 32 amino acid peptide was a mitochondrial targeting peptide, and both FAD and iron-sulfur cluster were inserted properly into the expressed ETF-QO. ETF-QO had a high level expression in rat heart, liver and kidney. The fusion protein of GFP-ETF-QO co-localized with mitochondria in COS-7 cells.
Characterization of Clostridium perfringens iota-toxin genes and expression in Escherichia coli.

PubMed

Perelle, S; Gibert, M; Boquet, P; Popoff, M R

1993-12-01

The iota toxin which is produced by Clostridium perfringens type E, is a binary toxin consisting of two independent polypeptides: Ia, which is an ADP-ribosyltransferase, and Ib, which is involved in the binding and internalization of the toxin into the cell. Two degenerate oligonucleotide probes deduced from partial amino acid sequence of each component of C. spiroforme toxin, which is closely related to the iota toxin, were used to clone three overlapping DNA fragments containing the iota-toxin genes from C. perfringens type E plasmid DNA. Two genes, in the same orientation, coding for Ia (387 amino acids) and Ib (875 amino acids) and separated by 243 noncoding nucleotides were identified. A predicted signal peptide was found for each component, and the secreted Ib displays two domains, the propeptide (172 amino acids) and the mature protein (664 amino acids). The Ia gene has been expressed in Escherichia coli and C. perfringens, under the control of its own promoter. The recombinant polypeptide obtained was recognized by Ia antibodies and ADP-ribosylated actin. The expression of the Ib gene was obtained in E. coli harboring a recombinant plasmid encompassing the putative promoter upstream of the Ia gene and the Ia and Ib genes. Two residues which have been found to be involved in the NAD+ binding site of diphtheria and pseudomonas toxins are conserved in the predicted Ia sequence (Glu-14 and Trp-19). The predicted amino acid Ib sequence shows 33.9% identity with and 54.4% similarity to the protective antigen of the anthrax toxin complex. In particular, the central region of Ib, which contains a predicted transmembrane segment (Leu-292 to Ser-308), presents 45% identity with the corresponding protective antigen sequence which is involved in the translocation of the toxin across the cell membrane.
Purification and characterization of enantioselective N-acetyl-β-Phe acylases from Burkholderia sp. AJ110349.

PubMed

Imabayashi, Yuki; Suzuki, Shun'ichi; Kawasaki, Hisashi; Nakamatsu, Tsuyoshi

2016-01-01

For the production of enantiopure β-amino acids, enantioselective resolution of N-acyl β-amino acids using acylases, especially those recognizing N-acetyl-β-amino acids, is one of the most attractive methods. Burkholderia sp. AJ110349 had been reported to exhibit either (R)- or (S)-enantiomer selective N-acetyl-β-Phe amidohydrolyzing activity, and in this study, both (R)- and (S)-enantioselective N-acetyl-β-Phe acylases were purified to be electrophoretically pure and determined the sequences, respectively. They were quite different in terms of enantioselectivities and in their amino acids sequences and molecular weights. Although both the purified acylases were confirmed to catalyze N-acetyl hydrolyzing activities, neither of them show sequence similarities to the N-acetyl-α-amino acid acylases reported thus far. Both (R)- and (S)-enantioselective N-acetyl-β-Phe acylase were expressed in Escherichia coli. Using these recombinant strains, enantiomerically pure (R)-β-Phe (>99% ee) and (S)-β-Phe (>99% ee) were obtained from the racemic substrate.
Sequence and structural implications of a bovine corneal keratan sulfate proteoglycan core protein. Protein 37B represents bovine lumican and proteins 37A and 25 are unique

NASA Technical Reports Server (NTRS)

Funderburgh, J. L.; Funderburgh, M. L.; Brown, S. J.; Vergnes, J. P.; Hassell, J. R.; Mann, M. M.; Conrad, G. W.; Spooner, B. S. (Principal Investigator)

1993-01-01

Amino acid sequence from tryptic peptides of three different bovine corneal keratan sulfate proteoglycan (KSPG) core proteins (designated 37A, 37B, and 25) showed similarities to the sequence of a chicken KSPG core protein lumican. Bovine lumican cDNA was isolated from a bovine corneal expression library by screening with chicken lumican cDNA. The bovine cDNA codes for a 342-amino acid protein, M(r) 38,712, containing amino acid sequences identified in the 37B KSPG core protein. The bovine lumican is 68% identical to chicken lumican, with an 83% identity excluding the N-terminal 40 amino acids. Location of 6 cysteine and 4 consensus N-glycosylation sites in the bovine sequence were identical to those in chicken lumican. Bovine lumican had about 50% identity to bovine fibromodulin and 20% identity to bovine decorin and biglycan. About two-thirds of the lumican protein consists of a series of 10 amino acid leucine-rich repeats that occur in regions of calculated high beta-hydrophobic moment, suggesting that the leucine-rich repeats contribute to beta-sheet formation in these proteins. Sequences obtained from 37A and 25 core proteins were absent in bovine lumican, thus predicting a unique primary structure and separate mRNA for each of the three bovine KSPG core proteins.
Identification of multiple mRNA and DNA sequences from small tissue samples isolated by laser-assisted microdissection.

PubMed

Bernsen, M R; Dijkman, H B; de Vries, E; Figdor, C G; Ruiter, D J; Adema, G J; van Muijen, G N

1998-10-01

Molecular analysis of small tissue samples has become increasingly important in biomedical studies. Using a laser dissection microscope and modified nucleic acid isolation protocols, we demonstrate that multiple mRNA as well as DNA sequences can be identified from a single-cell sample. In addition, we show that the specificity of procurement of tissue samples is not compromised by smear contamination resulting from scraping of the microtome knife during sectioning of lesions. The procedures described herein thus allow for efficient RT-PCR or PCR analysis of multiple nucleic acid sequences from small tissue samples obtained by laser-assisted microdissection.
Improved purification, crystallization and primary structure of pyruvate:ferredoxin oxidoreductase from Halobacterium halobium.

PubMed

Plaga, W; Lottspeich, F; Oesterhelt, D

1992-04-01

An improved purification procedure, including nickel chelate affinity chromatography, is reported which resulted in a crystallizable pyruvate:ferredoxin oxidoreductase preparation from Halobacterium halobium. Crystals of the enzyme were obtained using potassium citrate as the precipitant. The genes coding for pyruvate:ferredoxin oxidoreductase were cloned and their nucleotide sequences determined. The genes of both subunits were adjacent to one another on the halobacterial genome. The derived amino acid sequences were confirmed by partial primary structure analysis of the purified protein. The structural motif of thiamin-diphosphate-binding enzymes was unequivocally located in the deduced amino acid sequence of the small subunit.
DNA and RNA sequencing by nanoscale reading through programmable electrophoresis and nanoelectrode-gated tunneling and dielectric detection

DOEpatents

Lee, James W.; Thundat, Thomas G.

2005-06-14

An apparatus and method for performing nucleic acid (DNA and/or RNA) sequencing on a single molecule. The genetic sequence information is obtained by probing through a DNA or RNA molecule base by base at nanometer scale as though looking through a strip of movie film. This DNA sequencing nanotechnology has the theoretical capability of performing DNA sequencing at a maximal rate of about 1,000,000 bases per second. This enhanced performance is made possible by a series of innovations including: novel applications of a fine-tuned nanometer gap for passage of a single DNA or RNA molecule; thin layer microfluidics for sample loading and delivery; and programmable electric fields for precise control of DNA or RNA movement. Detection methods include nanoelectrode-gated tunneling current measurements, dielectric molecular characterization, and atomic force microscopy/electrostatic force microscopy (AFM/EFM) probing for nanoscale reading of the nucleic acid sequences.
Asymmetric scoring functions for proteins

NASA Astrophysics Data System (ADS)

Lezon, Timothy; Holter, Neal; Maritan, Amos; Banavar, Jayanth

2003-03-01

The protein folding problem entails the prediction of the native state structure of a protein given the sequence of amino acids. In a coarse-grained description of a protein, an important ingredient for attempting this task is the determination of the effective energies of interaction between amino acids. We will discuss a simple approach for determining such interaction potentials from a training set of protein sequences and their experimentally determined native state structures. The key new ingredient in our study is the incorporation of the lack of symmetry in the effective interactions between amino acids. Our results, obtained using a set of 513 proteins, and their implications will be discussed.
Production of hydroxylated fatty acids in genetically modified plants

DOEpatents

Somerville, Chris [Portola Valley, CA; Broun, Pierre [Burlingame, CA; van de Loo, Frank [Weston, AU; Boddupalli, Sekhar S [Manchester, MI

2011-08-23

This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants. In addition, the use of genes encoding fatty acid hydroxylases or desaturases to alter the level of lipid fatty acid unsaturation in transgenic plants is described.
Production of hydroxylated fatty acids in genetically modified plants

DOEpatents

Somerville, Chris; Broun, Pierre; van de Loo, Frank; Boddupalli, Sekhar S.

2005-08-30

This invention relates to plant fatty acyl hydroxylases. Methods to use conserved amino acid or nucleotide sequences to obtain plant fatty acyl hydroxylases are described. Also described is the use of cDNA clones encoding a plant hydroxylase to produce a family of hydroxylated fatty acids in transgenic plants. In addition, the use of genes encoding fatty acid hydroxylases or desaturases to alter the level of lipid fatty acid unsaturation in transgenic plants is described.
Cloning and Characterization of a Novel β-Transaminase from Mesorhizobium sp. Strain LUK: a New Biocatalyst for the Synthesis of Enantiomerically Pure β-Amino Acids▿

PubMed Central

Kim, Juhan; Kyung, Dohyun; Yun, Hyungdon; Cho, Byung-Kwan; Seo, Joo-Hyun; Cha, Minho; Kim, Byung-Gee

2007-01-01

A novel β-transaminase gene was cloned from Mesorhizobium sp. strain LUK. By using N-terminal sequence and an internal protein sequence, a digoxigenin-labeled probe was made for nonradioactive hybridization, and a 2.5-kb gene fragment was obtained by colony hybridization of a cosmid library. Through Southern blotting and sequence analysis of the selected cosmid clone, the structural gene of the enzyme (1,335 bp) was identified, which encodes a protein of 47,244 Da with a theoretical pI of 6.2. The deduced amino acid sequence of the β-transaminase showed the highest sequence similarity with glutamate-1-semialdehyde aminomutase of transaminase subgroup II. The β-transaminase showed higher activities toward d-β-aminocarboxylic acids such as 3-aminobutyric acid, 3-amino-5-methylhexanoic acid, and 3-amino-3-phenylpropionic acid. The β-transaminase has an unusually broad specificity for amino acceptors such as pyruvate and α-ketoglutarate/oxaloacetate. The enantioselectivity of the enzyme suggested that the recognition mode of β-aminocarboxylic acids in the active site is reversed relative to that of α-amino acids. After comparison of its primary structure with transaminase subgroup II enzymes, it was proposed that R43 interacts with the carboxylate group of the β-aminocarboxylic acids and the carboxylate group on the side chain of dicarboxylic α-keto acids such as α-ketoglutarate and oxaloacetate. R404 is another conserved residue, which interacts with the α-carboxylate group of the α-amino acids and α-keto acids. The β-transaminase was used for the asymmetric synthesis of enantiomerically pure β-aminocarboxylic acids. (3S)-Amino-3-phenylpropionic acid was produced from the ketocarboxylic acid ester substrate by coupled reaction with a lipase using 3-aminobutyric acid as amino donor. PMID:17259358
The cDNA sequence of mouse Pgp-1 and homology to human CD44 cell surface antigen and proteoglycan core/link proteins.

PubMed

Wolffe, E J; Gause, W C; Pelfrey, C M; Holland, S M; Steinberg, A D; August, J T

1990-01-05

We describe the isolation and sequencing of a cDNA encoding mouse Pgp-1. An oligonucleotide probe corresponding to the NH2-terminal sequence of the purified protein was synthesized by the polymerase chain reaction and used to screen a mouse macrophage lambda gt11 library. A cDNA clone with an insert of 1.2 kilobases was selected and sequenced. In Northern blot analysis, only cells expressing Pgp-1 contained mRNA species that hybridized with this Pgp-1 cDNA. The nucleotide sequence of the cDNA has a single open reading frame that yields a protein-coding sequence of 1076 base pairs followed by a 132-base pair 3'-untranslated sequence that includes a putative polyadenylation signal but no poly(A) tail. The translated sequence comprises a 13-amino acid signal peptide followed by a polypeptide core of 345 residues corresponding to an Mr of 37,800. Portions of the deduced amino acid sequence were identical to those obtained by amino acid sequence analysis from the purified glycoprotein, confirming that the cDNA encodes Pgp-1. The predicted structure of Pgp-1 includes an NH2-terminal extracellular domain (residues 14-265), a transmembrane domain (residues 266-286), and a cytoplasmic tail (residues 287-358). Portions of the mouse Pgp-1 sequence are highly similar to that of the human CD44 cell surface glycoprotein implicated in cell adhesion. The protein also shows sequence similarity to the proteoglycan tandem repeat sequences found in cartilage link protein and cartilage proteoglycan core protein which are thought to be involved in binding to hyaluronic acid.
Sequencing, bioinformatic characterization and expression pattern of a putative amino acid transporter from the parasitic cestode Echinococcus granulosus.

PubMed

Camicia, Federico; Paredes, Rodolfo; Chalar, Cora; Galanti, Norbel; Kamenetzky, Laura; Gutierrez, Ariana; Rosenzvit, Mara C

2008-03-31

We have sequenced and partially characterized an Echinococcus granulosus cDNA, termed egat1, from a protoscolex signal sequence trap (SST) cDNA library. The isolated 1627 bp long cDNA contains an ORF of 489 amino acids and shows an amino acid identity of 30% with neutral and excitatory amino acid transporters members of the Dicarboxylate/Amino Acid Na+ and/or H+ Cation Symporter family (DAACS) (TC 2.A.23). Additional bioinformatics analysis of EgAT1, confirmed the results obtained by similarity searches and showed the presence of 9 to 10 transmembrane domains, consensus sequences for N-glycosylation between the third and fourth transmembrane domain, a highly similar hydropathy profile with ASCT1 (a known member of DAACS family), high score with SDF (Sodium Dicarboxilate Family) and similar motifs with EDTRANSPORT, a fingerprint of excitatory amino acid transporters. The localization of the putative amino acid transporter was analyzed by in situ hybridization and immunofluorescence in protoscoleces and associated germinal layer. The in situ hybridization labelling indicates the distribution of egat1 mRNA throughout the tegument. EgAT1 protein, which showed in Western blots a molecular mass of approximately 60 kD, is localized in the subtegumental region of the metacestode, particularly around suckers and rostellum of protoscoleces and layers from brood capsules. The sequence and expression analyses of EgAT1 pave the way for functional analysis of amino acids transporters of E. granulosus and its evaluation as new drug targets against cystic echinococcosis.
A highly conserved N-terminal sequence for teleost vitellogenin with potential value to the biochemistry, molecular biology and pathology of vitellogenesis

USGS Publications Warehouse

Folmar, L.D.; Denslow, N.D.; Wallace, R.A.; LaFleur, G.; Gross, T.S.; Bonomelli, S.; Sullivan, C.V.

1995-01-01

N-terminal amino acid sequences for vitellogenin (Vtg) from six species of teleost fish (striped bass, mummichog, pinfish, brown bullhead, medaka, yellow perch and the sturgeon) are compared with published N-terminal Vtg sequences for the lamprey, clawed frog and domestic chicken. Striped bass and mummichog had 100% identical amino acids between positions 7 and 21, while pinfish, brown bullhead, sturgeon, lamprey, Xenopus and chicken had 87%, 93%, 60%, 47%, 47-60%) for four transcripts and had 40% identical, respectively, with striped bass for the same positions. Partial sequences obtained for medaka and yellow perch were 100% identical between positions 5 to 10. The potential utility of this conserved sequence for studies on the biochemistry, molecular biology and pathology of vitellogenesis is discussed.

The primary structures of ribosomal proteins S14 and S16 from the archaebacterium Halobacterium marismortui. Comparison with eubacterial and eukaryotic ribosomal proteins.

PubMed

Kimura, J; Kimura, M

1987-09-05

The amino acid sequences of two ribosomal proteins, S14 and S16, from the archaebacterium Halobacterium marismortui have been determined. Sequence data were obtained by the manual and solid-phase sequencing of peptides derived from enzymatic digestions with trypsin, chymotrypsin, pepsin, and Staphylococcus aureus protease as well as by chemical cleavage with cyanogen bromide. Proteins S14 and S16 contain 109 and 126 amino acid residues and have Mr values of 11,964 and 13,515, respectively. Comparison of the sequences with those of ribosomal proteins from other organisms demonstrates that S14 has a significant homology with the rat liver ribosomal protein S11 (36% identity) as well as with the Escherichia coli ribosomal protein S17 (37%), and that S16 is related to the yeast ribosomal protein YS22 (40%) and proteins S8 from E. coli (28%) and Bacillus stearothermophilus (30%). A comparison of the amino acid residues in the homologous regions of halophilic and nonhalophilic ribosomal proteins reveals that halophilic proteins have more glutamic acids, asparatic acids, prolines, and alanines, and less lysines, arginines, and isoleucines than their nonhalophilic counterparts. These amino acid substitutions probably contribute to the structural stability of halophilic ribosomal proteins.
The complete nucleotide sequence of RNA 3 of a peach isolate of Prunus necrotic ringspot virus.

PubMed

Hammond, R W; Crosslin, J M

1995-04-01

The complete nucleotide sequence of RNA 3 of the PE-5 peach isolate of Prunus necrotic ringspot ilarvirus (PNRSV) was obtained from cloned cDNA. The RNA sequence is 1941 nucleotides and contains two open reading frames (ORFs). ORF 1 consisted of 284 amino acids with a calculated molecular weight of 31,729 Da and ORF 2 contained 224 amino acids with a calculated molecular weight of 25,018 Da. ORF 2 corresponds to the coat protein gene. Expression of ORF 2 engineered into a pTrcHis vector in Escherichia coli results in a fusion polypeptide of approximately 28 kDa which cross-reacts with PNRSV polyclonal antiserum. Analysis of the coat protein amino acid sequence reveals a putative "zinc-finger" domain at the amino-terminal portion of the protein. Two tetranucleotide AUGC motifs occur in the 3'-UTR of the RNA and may function in coat protein binding and genome activation. ORF 1 homologies to other ilarviruses and alfalfa mosaic virus are confined to limited regions of conserved amino acids. The translated amino acid sequence of the coat protein gene shows 92% similarity to one isolate of apple mosaic virus, a closely related member of the ilarvirus group of plant viruses, but only 66% similarity to the amino acid sequence of the coat protein gene of a second isolate. These relationships are also reflected at the nucleotide sequence level. These results in one instance confirm the close similarities observed at the biophysical and serological levels between these two viruses, but on the other hand call into question the nomenclature used to describe these viruses.
The TGA codons are present in the open reading frame of selenoprotein P cDNA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hill, K.E.; Lloyd, R.S.; Read, R.

1991-03-11

The TGA codon in DNA has been shown to direct incorporation of selenocysteine into protein. Several proteins from bacteria and animals contain selenocysteine in their primary structures. Each of the cDNA clones of these selenoproteins contains one TGA codon in the open reading frame which corresponds to the selenocysteine in the protein. A cDNA clone for selenoprotein P (SeP), obtained from a {gamma}ZAP rat liver library, was sequenced by the dideoxy termination method. The correct reading frame was determined by comparison of the deduced amino acid sequence with the amino acid sequence of several peptides from SeP. Using SeP labelledmore » with {sup 75}Se in vivo, the selenocysteine content of the peptides was verified by the collection of carboxymethylated {sup 77}Se-selenocysteine as it eluted from the amino acid analyzer and determination of the radioactivity contained in the collected samples. Ten TGA codons are present in the open reading frame of the cDNA. Peptide fragmentation studies and the deduced sequence indicate that selenium-rich regions are located close to the carboxy terminus. Nine of the 10 selenocysteines are located in the terminal 26% of the sequence with four in the terminal 15 amino acids. The deduced sequence codes for a protein of 385 amino acids. Cleavage of the signal peptide gives the mature protein with 366 amino acids and a calculated mol wt of 41,052 Da. Searches of PIR and SWISSPROT protein databases revealed no similarity with glutathione peroxidase or other selenoproteins.« less
Identification and characterization of a NBS–LRR class resistance gene analog in Pistacia atlantica subsp. Kurdica

PubMed Central

Bahramnejad, Bahman

2014-01-01

P. atlantica subsp. Kurdica, with the local name of Baneh, is a wild medicinal plant which grows in Kurdistan, Iran. The identification of resistance gene analogs holds great promise for the development of resistant cultivars. A PCR approach with degenerate primers designed according to conserved NBS-LRR (nucleotide binding site-leucine rich repeat) regions of known disease-resistance (R) genes was used to amplify and clone homologous sequences from P. atlantica subsp. Kurdica. A DNA fragment of the expected 500-bp size was amplified. The nucleotide sequence of this amplicon was obtained through sequencing and the predicted amino acid sequence compared to the amino acid sequences of known R-genes revealed significant sequence similarity. Alignment of the deduced amino acid sequence of P. atlantica subsp. Kurdica resistance gene analog (RGA) showed strong identity, ranging from 68% to 77%, to the non-toll interleukin receptor (non-TIR) R-gene subfamily from other plants. A P-loop motif (GMMGGEGKTT), a conserved and hydrophobic motif GLPLAL, a kinase-2a motif (LLVLDDV), when replaced by IAVFDDI in PAKRGA1 and a kinase-3a (FGPGSRIII) were presented in all RGA. A phylogenetic tree, based on the deduced amino-acid sequences of PAKRGA1 and RGAs from different species indicated that they were separated in two clusters, PAKRGA1 being on cluster II. The isolated NBS analogs can be eventually used as guidelines to isolate numerous R-genes in Pistachio. PMID:27843981
Acid lipase inhibitor in chicken plasma identified as apolipoprotein A-I.

PubMed

Fujii, M; Higuchi, T; Mukai, S; Yonekura, M; Yano, T; Kawaguchi, H; Nonaka, K; Fukunaga, T; Sugimoto, Y; Yamada, S

1996-10-01

We have reported a inhibitor of acid lipases in liver lysosomes and erythrocytes from chickens [M. Fujii et al., Int. J. Biochem., 22, 895-898 (1990)]. In this paper, the properties of the inhibitor were described in comparison with those of apo A-I of chicken. The purified inhibitor migrated with the same mobility on SDS-PAGE as apo A-I, and had a molecular weight of 27,000. The peptide map from the lipase inhibitor was similar to that of apo A-I. Antibodies to the acid lipase inhibitor also reacted with apo A-I. Apo A-I inhibited the acid lipase activities of liver lysosomes and erythrocytes from chickens as strongly as the lipase inhibitor. The N-terminal amino acid sequence of lipase inhibitor was identical to that of apo A-I as far as residue 20. The amino acid sequence of peptides obtained from the inhibitor by cleavage with CNBr corresponded to internal sequence of apo A-I, and so the CNBr-peptides were derived by cleavage after the methionine residues in apo A-I. The findings showed that the inhibitor of the acid lipases in liver lysosomes and erythrocytes from chickens was identical to apo A-I.
Extension of the COG and arCOG databases by amino acid and nucleotide sequences

PubMed Central

Meereis, Florian; Kaufmann, Michael

2008-01-01

Background The current versions of the COG and arCOG databases, both excellent frameworks for studies in comparative and functional genomics, do not contain the nucleotide sequences corresponding to their protein or protein domain entries. Results Using sequence information obtained from GenBank flat files covering the completely sequenced genomes of the COG and arCOG databases, we constructed NUCOCOG (nucleotide sequences containing COG databases) as an extended version including all nucleotide sequences and in addition the amino acid sequences originally utilized to construct the current COG and arCOG databases. We make available three comprehensive single XML files containing the complete databases including all sequence information. In addition, we provide a web interface as a utility suitable to browse the NUCOCOG database for sequence retrieval. The database is accessible at . Conclusion NUCOCOG offers the possibility to analyze any sequence related property in the context of the COG and arCOG framework simply by using script languages such as PERL applied to a large but single XML document. PMID:19014535
Complete amino acid sequence of the myoglobin from the Pacific sei whale, Balaenoptera borealis.

PubMed

Jones, B N; Rothgeb, T M; England, R D; Gurd, F R

1979-04-25

The complete amino acid sequence of the major component myoglobin from Pacific sei whale, Balaenoptera borealis, was determined by specific cleavage of the protein to obtain large peptides which are readily degraded by the automatic sequencer. The acetimidated apomyoglobin was selectively cleaved at its two methionyl residues with cyanogen bromide and at its three arginyl residues by trypsin. From the sequence analysis of four of these peptides and the apomyoglobin, over 75% of the covalent structure of the protein was obtained. The remainder of the primary structure was determined by the sequence analysis of peptides that resulted from further digestion of the amino-terminal and central cyanogen bromide fragments. The amino-terminal fragment was specifically cleaved at its two tryptophanyl residues with N-chlorosuccinimide and the central cyanogen bromide fragment was cleaved at its glutamyl residues with staphylococcal protease and at its single tyrosyl residue with N-bromosuccinimide. The primary structure of this myoglobin proved identical with that from the gray whale but differs from that of the finback whale at four positions, from that of the minke whale at three positions and from the myoglobin of the humpback whale at one position. The above sequence identities and differences reflect the close taxonomic relationship of these five species of Cetacea.
Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences

PubMed Central

Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.

2012-01-01

ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136
Complete covalent structure of statherin, a tyrosine-rich acidic peptide which inhibits calcium phosphate precipitation from human parotid saliva.

PubMed

Schlesinger, D H; Hay, D I

1977-03-10

The complete amino acid sequence of human salivary statherin, a peptide which strongly inhibits precipitation from supersaturated calcium phosphate solutions, and therefore stabilizes supersaturated saliva, has been determined. The NH2-terminal half of this Mr=5380 (43 amino acids) polypeptide was determined by automated Edman degradations (liquid phase) on native statherin. The peptide was digested separately with trypsin, chymotrypsin, and Staphylococcus aureus protease, and the resulting peptides were purified by gel filtration. Manual Edman degradations on purified peptide fragments yielded peptides that completed the amino acid sequence through the penultimate COOH-terminal residue. These analyses, together with carboxypeptidase digestion of native statherin and of peptide fragments of statherin, established the complete sequence of the molecule. The 2 serine residues (positions 2 and 3) in statherin were identified as phosphoserine. The amino acid sequence of human salivary statherin is striking in a number of ways. The NH2-terminal one-third is highly polar and includes three polar dipeptides: H2PO3-Ser-Ser-H2PO3-Arg-Arg-, and Glu-Glu-. The COOH-terminal two-thirds of the molecule is hydrophobic, containing several repeating dipeptides: four of -Gn-Pro-, three of -Tyr-Gln-, two of -Gly-Tyr-, two of-Gln-Tyr-, and two of the tetrapeptide sequence -Pro-Tyr-Gln-Pro-. Unusual cleavage sites in the statherin sequence obtained with chymotrypsin and S. aureus protease were also noted.
Identification of Clinical Coryneform Bacterial Isolates: Comparison of Biochemical Methods and Sequence Analysis of 16S rRNA and rpoB Genes▿

PubMed Central

Adderson, Elisabeth E.; Boudreaux, Jan W.; Cummings, Jessica R.; Pounds, Stanley; Wilson, Deborah A.; Procop, Gary W.; Hayden, Randall T.

2008-01-01

We compared the relative levels of effectiveness of three commercial identification kits and three nucleic acid amplification tests for the identification of coryneform bacteria by testing 50 diverse isolates, including 12 well-characterized control strains and 38 organisms obtained from pediatric oncology patients at our institution. Between 33.3 and 75.0% of control strains were correctly identified to the species level by phenotypic systems or nucleic acid amplification assays. The most sensitive tests were the API Coryne system and amplification and sequencing of the 16S rRNA gene using primers optimized for coryneform bacteria, which correctly identified 9 of 12 control isolates to the species level, and all strains with a high-confidence call were correctly identified. Organisms not correctly identified were species not included in the test kit databases or not producing a pattern of reactions included in kit databases or which could not be differentiated among several genospecies based on reaction patterns. Nucleic acid amplification assays had limited abilities to identify some bacteria to the species level, and comparison of sequence homologies was complicated by the inclusion of allele sequences obtained from uncultivated and uncharacterized strains in databases. The utility of rpoB genotyping was limited by the small number of representative gene sequences that are currently available for comparison. The correlation between identifications produced by different classification systems was poor, particularly for clinical isolates. PMID:18160450
Identification of a novel vitivirus from grapevines in New Zealand.

PubMed

Blouin, Arnaud G; Keenan, Sandi; Napier, Kathryn R; Barrero, Roberto A; MacDiarmid, Robin M

2018-01-01

We report a sequence of a novel vitivirus from Vitis vinifera obtained using two high-throughput sequencing (HTS) strategies on RNA. The initial discovery from small-RNA sequencing was confirmed by HTS of the total RNA and Sanger sequencing. The new virus has a genome structure similar to the one reported for other vitiviruses, with five open reading frames (ORFs) coding for the conserved domains described for members of that genus. Phylogenetic analysis of the complete genome sequence confirmed its affiliation to the genus Vitivirus, with the closest described viruses being grapevine virus E (GVE) and Agave tequilana leaf virus (ATLV). However, the virus we report is distinct and shares only 51% amino acid sequence identity with GVE in the replicase polyprotein and 66.8% amino acid sequence identity with ATLV in the coat protein. This is well below the threshold determined by the ICTV for species demarcation, and we propose that this virus represents a new species. It is provisionally named "grapevine virus G".
Purification and sequencing of the active site tryptic peptide from penicillin-binding protein 1b of Escherichia coli

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nicholas, R.A.; Suzuki, H.; Hirota, Y.

This paper reports the sequence of the active site peptide of penicillin-binding protein 1b from Escherichia coli. Purified penicillin-binding protein 1b was labeled with (/sup 14/C)penicillin G, digested with trypsin, and partially purified by gel filtration. Upon further purification by high-pressure liquid chromatography, two radioactive peaks were observed, and the major peak, representing over 75% of the applied radioactivity, was submitted to amino acid analysis and sequencing. The sequence Ser-Ile-Gly-Ser-Leu-Ala-Lys was obtained. The active site nucleophile was identified by digesting the purified peptide with aminopeptidase M and separating the radioactive products on high-pressure liquid chromatography. Amino acid analysis confirmed thatmore » the serine residue in the middle of the sequence was covalently bonded to the (/sup 14/C)penicilloyl moiety. A comparison of this sequence to active site sequences of other penicillin-binding proteins and beta-lactamases is presented.« less
High levels of MHC class II allelic diversity in lake trout from Lake Superior

USGS Publications Warehouse

Dorschner, M.O.; Duris, T.; Bronte, C.R.; Burnham-Curtis, M. K.; Phillips, R.B.

2000-01-01

Sequence variation in a 216 bp portion of the major histocompatibility complex (MHC) II B1 domain was examined in 74 individual lake trout (Salvelinus namaycush) from different locations in Lake Superior. Forty-three alleles were obtained which encoded 71-72 amino acids of the mature protein. These sequences were compared with previous data obtained from five Pacific salmon species and Atlantic salmon using the same primers. Although all of the lake trout alleles clustered together in the neighbor-joining analysis of amino acid sequences, one amino acid allelic lineage was shared with Atlantic salmon (Salmo salar), a species in another genus which probably diverged from Salvelinus more than 10-20 million years ago. As shown previously in other salmonids, the level of nonsynonymous nucleotide substitution (d(N)) exceeded the level of synonymous substitution (d(S)). The level of nucleotide diversity at the MHC class II B1 locus was considerably higher in lake trout than in the Pacific salmon (genus Oncorhynchus). These results are consistent with the hypothesis that lake trout colonized Lake Superior from more than one refuge following the Wisconsin glaciation. Recent population bottlenecks may have reduced nucleotide diversity in Pacific salmon populations.
Biochemical and genetic characterization of enterocin A from Enterococcus faecium, a new antilisterial bacteriocin in the pediocin family of bacteriocins.

PubMed Central

Aymerich, T; Holo, H; Håvarstein, L S; Hugas, M; Garriga, M; Nes, I F

1996-01-01

A new bacteriocin has been isolated from an Enterococcus faecium strain. The bacteriocin, termed enterocin A, was purified to homogeneity as judged by sodium dodecyl sulfate-polyacrylamide gel electrophoresis, N-terminal amino acid sequencing, and mass spectrometry analysis. By combining the data obtained from amino acid and DNA sequencing, the primary structure of enterocin A was determined. It consists of 47 amino acid residues, and the molecular weight was calculated to be 4,829, assuming that the four cysteine residues form intramolecular disulfide bridges. This molecular weight was confirmed by mass spectrometry analysis. The amino acid sequence of enterocin A shared significant homology with a group of bacteriocins (now termed pediocin-like bacteriocins) isolated from a variety of lactic acid-producing bacteria, which include members of the genera Lactobacillus, Pediococcus, Leuconostoc, and Carnobacterium. Sequencing of the structural gene of enterocin A, which is located on the bacterial chromosome, revealed an N-terminal leader sequence of 18 amino acid residues, which was removed during the maturation process. The enterocin A leader belongs to the double-glycine leaders which are found among most other small nonlantibiotic bacteriocins, some lantibiotics, and colicin V. Downstream of the enterocin A gene was located a second open reading frame, encoding a putative protein of 103 amino acid residues. This gene may encode the immunity factor of enterocin A, and it shares 40% identity with a similar open reading frame in the operon of leucocin AUL 187, another pediocin-like bacteriocin. PMID:8633865
Isolation, cloning, and characterization of a partial novel aro A gene in common reed (Phragmites australis).

PubMed

Taravat, Elham; Zebarjadi, Alireza; Kahrizi, Danial; Yari, Kheirollah

2015-05-01

Among the essential amino acids, phenylalanine, tryptophan, and tyrosine are aromatic amino acids which are synthesized by the shikimate pathway in plants and bacteria. Herbicide glyphosate can inhibit the biosynthesis of these amino acids. So, identification of the gene tolerant to glyphosate is very important. It has been shown that the common reed or Phragmites australis Cav. (Poaceae) is relatively tolerant to glyphosate. The aim of the current research is identification, cloning, sequencing, and registering of partial aro A gene of the common reed P. australis. The partial aro A gene of common reed (P. australis) was cloned in Escherichia coli and the amino acid sequence was identified/determined for the first time. This is the first report for isolation, cloning, and sequencing of a part of aro A gene from the common reed. A 670 bp fragment including two introns (86 bp and 289 bp) was obtained. The open reading frame (ORF) region in part of gene was encoded for 98 amino acids. Alignment showed high similarity among this region with Zea mays (L.) (Poaceae) (94.6%), Eleusine indica L. Gaertn (Poaceae) (94.2%), and Zoysia japonica Steud. (Poaceae) (94.2%). The alignment of amino acid sequence of the investigated part of the gene showed a homology with aro A from several other plants. This conserved region forms the enzyme active site. The alignment results of nucleotide and amino acid residues with related sequences showed that there are some differences among them. The relative glyphosate tolerance in the common reed may be related to these differences.
Improving protein complex classification accuracy using amino acid composition profile.

PubMed

Huang, Chien-Hung; Chou, Szu-Yu; Ng, Ka-Lok

2013-09-01

Protein complex prediction approaches are based on the assumptions that complexes have dense protein-protein interactions and high functional similarity between their subunits. We investigated those assumptions by studying the subunits' interaction topology, sequence similarity and molecular function for human and yeast protein complexes. Inclusion of amino acids' physicochemical properties can provide better understanding of protein complex properties. Principal component analysis is carried out to determine the major features. Adopting amino acid composition profile information with the SVM classifier serves as an effective post-processing step for complexes classification. Improvement is based on primary sequence information only, which is easy to obtain. Copyright © 2013 Elsevier Ltd. All rights reserved.
Peptide array-based interaction assay of solid-bound peptides and anchorage-dependant cells and its effectiveness in cell-adhesive peptide design.

PubMed

Kato, Ryuji; Kaga, Chiaki; Kunimatsu, Mitoshi; Kobayashi, Takeshi; Honda, Hiroyuki

2006-06-01

Peptide array, the designable peptide library covalently synthesized on cellulose support, was applied to assay peptide-cell interaction, between solid-bound peptides and anchorage-dependant cells, to study objective peptide design. As a model case, cell-adhesive peptides that could enhance cell growth as tissue engineering scaffold material, was studied. On the peptide array, the relative cell-adhesion ratio of NIH/3T3 cells was 2.5-fold higher on the RGDS (Arg-Gly-Asp-Ser) peptide spot as compared to the spot with no peptide, thus indicating integrin-mediated peptide-cell interaction. Such strong cell adhesion mediated by the RGDS peptide was easily disrupted by single residue substitution on the peptide array, thus indicating that the sequence recognition accuracy of cells was strictly conserved in our optimized scheme. The observed cellular morphological extension with active actin stress-fiber on the RGD motif-containing peptide supported our strategy that peptide array-based interaction assay of solid-bound peptide and anchorage-dependant cells (PIASPAC) could provide quantitative data on biological peptide-cell interaction. The analysis of 180 peptides obtained from fibronectin type III domain (no. 1447-1629) yielded 18 novel cell-adhesive peptides without the RGD motif. Taken together with the novel candidates, representative rules of ineffective amino acid usage were obtained from non-effective candidate sequences for the effective designing of cell-adhesive peptides. On comparing the amino acid usage of the top 20 and last 20 peptides from the 180 peptides, the following four brief design rules were indicated: (i) Arg or Lys of positively charged amino acids (except His) could enhance cell adhesion, (ii) small hydrophilic amino acids are favored in cell-adhesion peptides, (iii) negatively charged amino acids and small amino acids (except Gly) could reduce cell adhesion, and (iv) Cys and Met could be excluded from the sequence combination since they have less influence on the peptide design. Such rules that are indicative of the nature of the functional peptide sequence can be obtained only by the mass comparison analysis of PIASPAC using peptide array. By following such indicative rules, numerous amino acid combinations can be effectively screened for further examination of novel peptide design.
Species-specific identification of commercial probiotic strains.

PubMed

Yeung, P S M; Sanders, M E; Kitts, C L; Cano, R; Tong, P S

2002-05-01

Products containing probiotic bacteria are gaining popularity, increasing the importance of their accurate speciation. Unfortunately, studies have suggested that improper labeling of probiotic species is common in commercial products. Species identification of a bank of commercial probiotic strains was attempted using partial 16S rDNA sequencing, carbohydrate fermentation analysis, and cellular fatty acid methyl ester analysis. Results from partial 16S rDNA sequencing indicated discrepancies between species designations for 26 out of 58 strains tested, including two ATCC Lactobacillus strains. When considering only the commercial strains obtained directly from the manufacturers, 14 of 29 strains carried species designations different from those obtained by partial 16S rDNA sequencing. Strains from six commercial products were species not listed on the label. The discrepancies mainly occurred in Lactobacillus acidophilus and Lactobacillus casei groups. Carbohydrate fermentation analysis was not sensitive enough to identify species within the L. acidophilus group. Fatty acid methyl ester analysis was found to be variable and inaccurate and is not recommended to identify probiotic lactobacilli.
Leuconostoc pseudomesenteroides WCFur3 partial 16S rRNA gene

USDA-ARS?s Scientific Manuscript database

This study used a partial 535 base pair 16S rRNA gene sequence to identify a bacterial isolate. Fatty acid profiles are consistent with the 16S rRNA gene sequence identification of this bacterium. The isolate was obtained from a compost bin in Fort Collins, Colorado, USA. The 16S rRNA gene sequen...
Variability and transmission by Aphis glycines of North American and Asian Soybean mosaic virus isolates.

PubMed

Domier, L L; Latorre, I J; Steinlage, T A; McCoppin, N; Hartman, G L

2003-10-01

The variability of North American and Asian strains and isolates of Soybean mosaic virus was investigated. First, polymerase chain reaction (PCR) products representing the coat protein (CP)-coding regions of 38 SMVs were analyzed for restriction fragment length polymorphisms (RFLP). Second, the nucleotide and predicted amino acid sequence variability of the P1-coding region of 18 SMVs and the helper component/protease (HC/Pro) and CP-coding regions of 25 SMVs were assessed. The CP nucleotide and predicted amino acid sequences were the most similar and predicted phylogenetic relationships similar to those obtained from RFLP analysis. Neither RFLP nor sequence analyses of the CP-coding regions grouped the SMVs by geographical origin. The P1 and HC/Pro sequences were more variable and separated the North American and Asian SMV isolates into two groups similar to previously reported differences in pathogenic diversity of the two sets of SMV isolates. The P1 region was the most informative of the three regions analyzed. To assess the biological relevance of the sequence differences in the HC/Pro and CP coding regions, the transmissibility of 14 SMV isolates by Aphis glycines was tested. All field isolates of SMV were transmitted efficiently by A. glycines, but the laboratory isolates analyzed were transmitted poorly. The amino acid sequences from most, but not all, of the poorly transmitted isolates contained mutations in the aphid transmission-associated DAG and/or KLSC amino acid sequence motifs of CP and HC/Pro, respectively.

Infusion of Autologous Lysed Plasma Into the Baboon: Assessment of Coagulation, Platelet, and Pulmonary Function

DTIC Science & Technology

1993-06-03

obtained from whole blood collected into a commercially available tube containing thrombin and epsilon aminocaproic acid (Wellcome 44 Diagnostics...first proposed by Hall & Slayter in 1959 as an extended, multidomained molecule. Electron microscopy, amino acid sequencing and proteolytic studies have...Plasminogen (Figure 7) is a single chain, 88 kilodalton glycoprotein. It contains 790 amino acids , 24 disulfide bridges and five homologous triple loop
Amino acid sequence of tyrosinase from Neurospora crassa.

PubMed Central

Lerch, K

1978-01-01

The amino-acid sequence of tyrosinase from Neurospora crassa (monophenol,dihydroxyphenylalanine:oxygen oxidoreductase, EC 1.14.18.1) is reported. This copper-containing oxidase consists of a single polypeptide chain of 407 amino acids. The primary structure was determined by automated and manual sequence analysis on fragments produced by cleavage with cyanogen bromide and on peptides obtained by digestion with trypsin, pepsin, thermolysin, or chymotrypsin. The amino terminus of the protein is acetylated and the single cysteinyl residue 96 is covalently linked via a thioether bridge to histidyl residue 94. The formation and the possible role of this unusual structure in Neurospora tyrosinase is discussed. Dye-sensitized photooxidation of apotyrosinase and active-site-directed inactivation of the native enzyme indicate the possible involvement of histidyl residues 188, 192, 289, and 305 or 306 as ligands to the active-site copper as well as in the catalytic mechanism of this monooxygenase. PMID:151279
Cloning and nucleotide sequence of the Pseudomonas aeruginosa glucose-selective OprB porin gene and distribution of OprB within the family Pseudomonadaceae.

PubMed

Wylie, J L; Worobec, E A

1994-03-01

OprB is a glucose-selective porin known to be produced by Pseudomonas aeruginosa and Pseudomonas putida. We have cloned and sequenced the oprB gene of P. aeruginosa and obtained expression of OprB in Escherichia coli. The mature protein consists of 423 amino acid residues with a deduced molecular mass of 47597 Da. Several clusters of amino acid residues, potentially involved in the structure or function of the protein, were identified. An area of regional homology with E. coli LamB was also identified. Carbohydrate-inducible proteins, potentially homologous to OprB, were identified in several rRNA homology-group-I pseudomonads by sodium dodecyl sulfate/polyacrylamide gel electrophoresis analysis, Western immunoblotting and N-terminal amino acid sequencing. These species also contained DNA that hybridized to a P. aeruginosa oprB gene probe.
Characterization of Clostridium perfringens iota-toxin genes and expression in Escherichia coli.

PubMed Central

Perelle, S; Gibert, M; Boquet, P; Popoff, M R

1993-01-01

The iota toxin which is produced by Clostridium perfringens type E, is a binary toxin consisting of two independent polypeptides: Ia, which is an ADP-ribosyltransferase, and Ib, which is involved in the binding and internalization of the toxin into the cell. Two degenerate oligonucleotide probes deduced from partial amino acid sequence of each component of C. spiroforme toxin, which is closely related to the iota toxin, were used to clone three overlapping DNA fragments containing the iota-toxin genes from C. perfringens type E plasmid DNA. Two genes, in the same orientation, coding for Ia (387 amino acids) and Ib (875 amino acids) and separated by 243 noncoding nucleotides were identified. A predicted signal peptide was found for each component, and the secreted Ib displays two domains, the propeptide (172 amino acids) and the mature protein (664 amino acids). The Ia gene has been expressed in Escherichia coli and C. perfringens, under the control of its own promoter. The recombinant polypeptide obtained was recognized by Ia antibodies and ADP-ribosylated actin. The expression of the Ib gene was obtained in E. coli harboring a recombinant plasmid encompassing the putative promoter upstream of the Ia gene and the Ia and Ib genes. Two residues which have been found to be involved in the NAD+ binding site of diphtheria and pseudomonas toxins are conserved in the predicted Ia sequence (Glu-14 and Trp-19). The predicted amino acid Ib sequence shows 33.9% identity with and 54.4% similarity to the protective antigen of the anthrax toxin complex. In particular, the central region of Ib, which contains a predicted transmembrane segment (Leu-292 to Ser-308), presents 45% identity with the corresponding protective antigen sequence which is involved in the translocation of the toxin across the cell membrane. Images PMID:8225592
From a marine neuropeptide to antimicrobial pseudopeptides containing aza-β(3)-amino acids: structure and activity

PubMed Central

Laurencin, Mathieu; Legrand, Baptiste; Duval, Emilie; Henry, Joël; Baudy-Floc'H, Michèle; Zatylny-Gaudin, Céline; Bondon, Arnaud

2012-01-01

Incorporation of aza-β3-amino acids into endogenous neuropeptide from mollusks (ALSGDAFLRF-NH2) with weak antimicrobial activities allows us to design new AMPs sequences. We find that, depending on the nature of the substitution, these could result either in inactive pseudopeptides or in a drastic enhancement of the antimicrobial activity without high cytotoxicity resulted. Structural studies perform by NMR and circular dichroism on the pseudopeptides show the impact of aza-β3-amino acids on the peptide structures. We obtain the first three-dimensional structures of pseudopeptides containing aza-β3-amino acids in aqueous micellar SDS and demonstrate that hydrazino turn can be formed in aqueous solution. Overall, these results demonstrate the ability to modulate AMPs activities through structural modifications induced by the nature and the position of these amino acid analogs in the peptide sequences. PMID:22320306
Complete Amino Acid Sequence of a Copper/Zinc-Superoxide Dismutase from Ginger Rhizome.

PubMed

Nishiyama, Yuki; Fukamizo, Tamo; Yoneda, Kazunari; Araki, Tomohiro

2017-04-01

Superoxide dismutase (SOD) is an antioxidant enzyme protecting cells from oxidative stress. Ginger (Zingiber officinale) is known for its antioxidant properties, however, there are no data on SODs from ginger rhizomes. In this study, we purified SOD from the rhizome of Z. officinale (Zo-SOD) and determined its complete amino acid sequence using N terminal sequencing, amino acid analysis, and de novo sequencing by tandem mass spectrometry. Zo-SOD consists of 151 amino acids with two signature Cu/Zn-SOD motifs and has high similarity to other plant Cu/Zn-SODs. Multiple sequence alignment showed that Cu/Zn-binding residues and cysteines forming a disulfide bond, which are highly conserved in Cu/Zn-SODs, are also present in Zo-SOD. Phylogenetic analysis revealed that plant Cu/Zn-SODs clustered into distinct chloroplastic, cytoplasmic, and intermediate groups. Among them, only chloroplastic enzymes carried amino acid substitutions in the region functionally important for enzymatic activity, suggesting that chloroplastic SODs may have a function distinct from those of SODs localized in other subcellular compartments. The nucleotide sequence of the Zo-SOD coding region was obtained by reverse-translation, and the gene was synthesized, cloned, and expressed. The recombinant Zo-SOD demonstrated pH stability in the range of 5-10, which is similar to other reported Cu/Zn-SODs, and thermal stability in the range of 10-60 °C, which is higher than that for most plant Cu/Zn-SODs but lower compared to the enzyme from a Z. officinale relative Curcuma aromatica.
Characterization of Stearoyl-CoA Desaturases from a Psychrophilic Antarctic Copepod, Tigriopus kingsejongensis.

PubMed

Jung, Woongsic; Kim, Eun Jae; Han, Se Jong; Choi, Han-Gu; Kim, Sanghee

2016-10-01

Stearoyl-CoA desaturase is a key regulator in fatty acid metabolism that catalyzes the desaturation of stearic acid to oleic acid and controls the intracellular levels of monounsaturated fatty acids (MUFAs). Two stearoyl-CoA desaturases (SCD, Δ9 desaturases) genes were identified in an Antarctic copepod, Tigriopus kingsejongensis, that was collected in a tidal pool near the King Sejong Station, King George Island, Antarctica. Full-length complementary DNA (cDNA) sequences of two T. kingsejongensis SCDs (TkSCDs) were obtained from next-generation sequencing and isolated by reverse transcription PCR. DNA sequence lengths of the open reading frames of TkSCD-1 and TkSCD-2 were determined to be 1110 and 681 bp, respectively. The molecular weights deduced from the corresponding genes were estimated to be 43.1 kDa (TkSCD-1) and 26.1 kDa (TkSCD-2). The amino acid sequences were compared with those of fatty acid desaturases and sterol desaturases from various organisms and used to analyze the relationships among TkSCDs. As assessed by heterologous expression of recombinant proteins in Escherichia coli, the enzymatic functions of both stearoyl-CoA desaturases revealed that the amount of C16:1 and C18:1 fatty acids increased by greater than 3-fold after induction with isopropyl β-D-thiogalactopyranoside. In particular, C18:1 fatty acid production increased greater than 10-fold in E. coli expressing TkSCD-1 and TkSCD-2. The results of this study suggest that both SCD genes from an Antarctic marine copepod encode a functional desaturase that is capable of increasing the amounts of palmitoleic acid and oleic acid in a prokaryotic expression system.
FASMA: a service to format and analyze sequences in multiple alignments.

PubMed

Costantini, Susan; Colonna, Giovanni; Facchiano, Angelo M

2007-12-01

Multiple sequence alignments are successfully applied in many studies for under- standing the structural and functional relations among single nucleic acids and protein sequences as well as whole families. Because of the rapid growth of sequence databases, multiple sequence alignments can often be very large and difficult to visualize and analyze. We offer a new service aimed to visualize and analyze the multiple alignments obtained with different external algorithms, with new features useful for the comparison of the aligned sequences as well as for the creation of a final image of the alignment. The service is named FASMA and is available at http://bioinformatica.isa.cnr.it/FASMA/.
The gene for stinging nettle lectin (Urtica dioica agglutinin) encodes both a lectin and a chitinase.

PubMed

Lerner, D R; Raikhel, N V

1992-06-05

Chitin-binding proteins are present in a wide range of plant species, including both monocots and dicots, even though these plants contain no chitin. To investigate the relationship between in vitro antifungal and insecticidal activities of chitin-binding proteins and their unknown endogenous functions, the stinging nettle lectin (Urtica dioica agglutinin, UDA) cDNA was cloned using a synthetic gene as the probe. The nettle lectin cDNA clone contained an open reading frame encoding 374 amino acids. Analysis of the deduced amino acid sequence revealed a 21-amino acid putative signal sequence and the 86 amino acids encoding the two chitin-binding domains of nettle lectin. These domains were fused to a 19-amino acid "spacer" domain and a 244-amino acid carboxyl extension with partial identity to a chitinase catalytic domain. The authenticity of the cDNA clone was confirmed by deduced amino acid sequence identity with sequence data obtained from tryptic digests, RNA gel blot, and polymerase chain reaction analyses. RNA gel blot analysis also showed the nettle lectin message was present primarily in rhizomes and inflorescence (with immature seeds) but not in leaves or stems. Chitinase enzymatic activity was found when the chitinase-like domain alone or the chitinase-like domain with the chitin-binding domains were expressed in Escherichia coli. This is the first example of a chitin-binding protein with both a duplication of the 43-amino acid chitin-binding domain and a fusion of the chitin-binding domains to a structurally unrelated domain, the chitinase domain.
Covalent structure of chicken pepsinogen.

PubMed

Baudys, M; Kostka, V

1983-10-17

Chicken pepsinogen is a glycoprotein consisting of a single polypeptide chain and containing the following 367 amino acid residues: Asp23, Asn16, Thr26, Ser41, Glu14, Gln11, Pro18, Gly31, Ala17, Cys7, Val25, Met9, Ile23, Leu28, Tyr22, Phe20, His8, Lys17, Arg7, Trp4. The Mr-value of the protein is 42 074. This value includes the carbohydrate moiety of the protein, i.e. Man3, (GlcNAc)7, (-SO3H)5. The primary fragmentation of the molecule was effected by limited trypsinolysis at arginine residues after preceding modification of the lysines with citraconic anhydride. All eight peptides expected in theory were obtained and their size, amino acid composition, and N-terminal amino acid sequence were characterized. To elucidate the amino acid sequence of these large fragments the latter were subjected to secondary cleavage by CNBr, trypsin (after removal of the protecting groups from the lysines), the proteinase from Staphylococcus aureus V8 strain, alpha-chymotrypsin, hydroxylamine, or dilute acid; the resulting peptides were isolated by gel permeation and ion-exchange chromatography and by the fingerprint techniques. Overlaps at sites of the arginine residues were obtained in an earlier study [Baudys, M. & Kostka, V. (1982) Collect. Czech. Chem. Commun. 47, 2814-2832]. Chicken pepsinogen shows the highest degree of homology with the primary structures of pepsinogens A. The internal homologies are apparent in the neighborhood of the two active aspartic acid residues. We have assigned tentatively chicken pepsinogen to the group of pepsinogens A (EC 3.4.23.1); this assignment is a result both of our sequence studies and of an investigation of the kinetic characteristics of the enzyme.
Characterization of acid-tolerant H/CO-utilizing methanogenic enrichment cultures from an acidic peat bog in New York State.

PubMed

Bräuer, Suzanna L; Yashiro, Erika; Ueno, Norikiyo G; Yavitt, Joseph B; Zinder, Stephen H

2006-08-01

Two methanogenic cultures were enriched from acidic peat soil using a growth medium buffered to c. pH 5. One culture, 6A, was obtained from peat after incubation with H(2)/CO(2), whereas culture NTA was derived from a 10(-4) dilution of untreated peat into a modified medium. 16S rRNA gene clone libraries from each culture contained one methanogen and two bacterial sequences. The methanogen 16S rRNA gene sequences were 99% identical with each other and belonged to the novel "R-10/Fen cluster" family of the Methanomicrobiales, whereas their mcrA sequences were 96% identical. One bacterial 16S rRNA gene sequence from culture 6A belonged to the Bacteroidetes and showed 99% identity with sequences from methanogenic enrichments from German and Russian bogs. The other sequence belonged to the Firmicutes and was identical to a thick rod-shaped citrate-utilizing organism isolated from culture 6A, the numbers of which decreased when the Ti (III) chelator was switched from citrate to nitrilotriacetate. Bacterial clones from the NTA culture clustered in the Delta- and Betaproteobacteria. Both cultures contained thin rods, presumably the methanogens, as the predominant morphotype, and represent a significant advance in characterization of the novel acidiphilic R-10 family methanogens.
[Identification of new conserved and variable regions in the 16S rRNA gene of acetic acid bacteria and acetobacteraceae family].

PubMed

Chakravorty, S; Sarkar, S; Gachhui, R

2015-01-01

The Acetobacteraceae family of the class Alpha Proteobacteria is comprised of high sugar and acid tolerant bacteria. The Acetic Acid Bacteria are the economically most significant group of this family because of its association with food products like vinegar, wine etc. Acetobacteraceae are often hard to culture in laboratory conditions and they also maintain very low abundances in their natural habitats. Thus identification of the organisms in such environments is greatly dependent on modern tools of molecular biology which require a thorough knowledge of specific conserved gene sequences that may act as primers and or probes. Moreover unconserved domains in genes also become markers for differentiating closely related genera. In bacteria, the 16S rRNA gene is an ideal candidate for such conserved and variable domains. In order to study the conserved and variable domains of the 16S rRNA gene of Acetic Acid Bacteria and the Acetobacteraceae family, sequences from publicly available databases were aligned and compared. Near complete sequences of the gene were also obtained from Kombucha tea biofilm, a known Acetobacteraceae family habitat, in order to corroborate the domains obtained from the alignment studies. The study indicated that the degree of conservation in the gene is significantly higher among the Acetic Acid Bacteria than the whole Acetobacteraceae family. Moreover it was also observed that the previously described hypervariable regions V1, V3, V5, V6 and V7 were more or less conserved in the family and the spans of the variable regions are quite distinct as well.
Determination of a mutational spectrum

DOEpatents

Thilly, William G.; Keohavong, Phouthone

1991-01-01

A method of resolving (physically separating) mutant DNA from nonmutant DNA and a method of defining or establishing a mutational spectrum or profile of alterations present in nucleic acid sequences from a sample to be analyzed, such as a tissue or body fluid. The present method is based on the fact that it is possible, through the use of DGGE, to separate nucleic acid sequences which differ by only a single base change and on the ability to detect the separate mutant molecules. The present invention, in another aspect, relates to a method for determining a mutational spectrum in a DNA sequence of interest present in a population of cells. The method of the present invention is useful as a diagnostic or analytical tool in forensic science in assessing environmental and/or occupational exposures to potentially genetically toxic materials (also referred to as potential mutagens); in biotechnology, particularly in the study of the relationship between the amino acid sequence of enzymes and other biologically-active proteins or protein-containing substances and their respective functions; and in determining the effects of drugs, cosmetics and other chemicals for which toxicity data must be obtained.
Cloning and characterization of transferrin cDNA and rapid detection of transferrin gene polymorphism in rainbow trout (Oncorhynchus mykiss).

PubMed

Tange, N; Jong-Young, L; Mikawa, N; Hirono, I; Aoki, T

1997-12-01

A cDNA clone of rainbow trout (Oncorhynchus mykiss) transferrin was obtained from a liver cDNA library. The 2537-bp cDNA sequence contained an open reading frame encoding 691 amino acids and the 5' and 3' noncoding regions. The amino acid sequences at the iron-binding sites and the two N-linked glycosylation sites, and the cysteine residues were consistent with known, conserved vertebrate transferrin cDNA sequences. Single N-linked glycosylation sites existed on the N- and C-lobe. The deduced amino acid sequence of the rainbow trout transferrin cDNA had 92.9% identities with transferrin of coho salmon (Oncorhynchus kisutch); 85%, Atlantic salmon (Salmo salar); 67.3%, medaka (Oryzias latipes); 61.3% Atlantic cod (Gadus morhua); and 59.7%, Japanese flounder (Paralichthys olivaceus). The long and accurate polymerase chain reaction (LA-PCR) was used to amplify approximately 6.5 kb of the transferrin gene from rainbow trout genomic DNA. Restriction fragment length polymorphisms (RFLPs) of the LA-PCR products revealed three digestion patterns in 22 samples.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-03-24

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.
Sequence-based screening for self-sufficient P450 monooxygenase from a metagenome library.

PubMed

Kim, B S; Kim, S Y; Park, J; Park, W; Hwang, K Y; Yoon, Y J; Oh, W K; Kim, B Y; Ahn, J S

2007-05-01

Cytochrome P450 monooxygenases (CYPs) are useful catalysts for oxidation reactions. Self-sufficient CYPs harbour a reductive domain covalently connected to a P450 domain and are known for their robust catalytic activity with great potential as biocatalysts. In an effort to expand genetic sources of self-sufficient CYPs, we devised a sequence-based screening system to identify them in a soil metagenome. We constructed a soil metagenome library and performed sequence-based screening for self-sufficient CYP genes. A new CYP gene, syk181, was identified from the metagenome library. Phylogenetic analysis revealed that SYK181 formed a distinct phylogenic line with 46% amino-acid-sequence identity to CYP102A1 which has been extensively studied as a fatty acid hydroxylase. The heterologously expressed SYK181 showed significant hydroxylase activity towards naphthalene and phenanthrene as well as towards fatty acids. Sequence-based screening of metagenome libraries is expected to be a useful approach for searching self-sufficient CYP genes. The translated product of syk181 shows self-sufficient hydroxylase activity towards fatty acids and aromatic compounds. SYK181 is the first self-sufficient CYP obtained directly from a metagenome library. The genetic and biochemical information on SYK181 are expected to be helpful for engineering self-sufficient CYPs with broader catalytic activities towards various substrates, which would be useful for bioconversion of natural products and biodegradation of organic chemicals.
Isolation, cDNA cloning and gene expression of an antibacterial protein from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros.

PubMed

Yang, J; Yamamoto, M; Ishibashi, J; Taniai, K; Yamakawa, M

1998-08-01

An antibacterial protein, designated rhinocerosin, was purified to homogeneity from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros immunized with Escherichia coli. Based on the amino acid sequence of the N-terminal region, a degenerate primer was synthesized and reverse-transcriptase PCR was performed to clone rhinocerosin cDNA. As a result, a 279-bp fragment was obtained. The complete nucleotide sequence was determined by sequencing the extended rhinocerosin cDNA clone by 5' rapid amplification of cDNA ends. The deduced amino acid sequence of the mature portion of rhinocerosin was composed of 72 amino acids without cystein residues and was shown to be rich in glycine (11.1%) and proline (11.1%) residues. Comparison of the deduced amino acid sequence of rhinocerosin with those of other antibacterial proteins indicated that it has 77.8% and 44.6% identity with holotricin 2 and coleoptrecin, respectively. Rhinocerosin had strong antibacterial activity against E. coli, Streptococcus pyogenes, Staphylococcus aureus but not against Pseudomonas aeruginosa. Results of reverse-transcriptase PCR analysis of gene expression in different tissues indicated that the rhinocerosin gene is strongly expressed in the fat body and the Malpighian tubule, and weakly expressed in hemocytes and midgut. In addition, gene expression was inducible by bacteria in the fat body, the Malpighian tubule and hemocyte but constitutive expression was observed in the midgut.
Development of chemiluminescent probe hybridization, RT-PCR and nucleic acid cycle sequencing assays of Sabin type 3 isolates to identify base pair 472 Sabin type 3 mutants associated with vaccine associated paralytic poliomyelitis.

PubMed

Old, M O; Logan, L H; Maldonado, Y A

1997-11-01

Sabin type 3 polio vaccine virus is the most common cause of poliovaccine associated paralytic poliomyelitis. Vaccine associated paralytic poliomyelitis cases have been associated with Sabin type 3 revertants containing a single U to C substitution at bp 472 of Sabin type 3. A rapid method of identification of Sabin type 3 bp 472 mutants is described. An enterovirus group-specific probe for use in a chemiluminescent dot blot hybridization assay was developed to identify enterovirus positive viral lysates. A reverse transcription-polymerase chain reaction (RT-PCR) assay producing a 319 bp PCR product containing the Sabin type 3 bp 472 mutation site was then employed to identify Sabin type 3 isolates. Chemiluminescent nucleic acid cycle sequencing of the purified 319 bp PCR product was then employed to identify nucleic acid sequences at bp 472. The enterovirus group probe hybridization procedure and isolation of the Sabin type 3 PCR product were highly sensitive and specific; nucleic acid cycle sequencing corresponded to the known sequence of stock Sabin type 3 isolates. These methods will be used to identify the Sabin type 3 reversion rate from sequential stool samples of infants obtained after the first and second doses of oral poliovirus vaccine.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.

Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-07-21

A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.
Subcellular location prediction of proteins using support vector machines with alignment of block sequences utilizing amino acid composition.

PubMed

Tamura, Takeyuki; Akutsu, Tatsuya

2007-11-30

Subcellular location prediction of proteins is an important and well-studied problem in bioinformatics. This is a problem of predicting which part in a cell a given protein is transported to, where an amino acid sequence of the protein is given as an input. This problem is becoming more important since information on subcellular location is helpful for annotation of proteins and genes and the number of complete genomes is rapidly increasing. Since existing predictors are based on various heuristics, it is important to develop a simple method with high prediction accuracies. In this paper, we propose a novel and general predicting method by combining techniques for sequence alignment and feature vectors based on amino acid composition. We implemented this method with support vector machines on plant data sets extracted from the TargetP database. Through fivefold cross validation tests, the obtained overall accuracies and average MCC were 0.9096 and 0.8655 respectively. We also applied our method to other datasets including that of WoLF PSORT. Although there is a predictor which uses the information of gene ontology and yields higher accuracy than ours, our accuracies are higher than existing predictors which use only sequence information. Since such information as gene ontology can be obtained only for known proteins, our predictor is considered to be useful for subcellular location prediction of newly-discovered proteins. Furthermore, the idea of combination of alignment and amino acid frequency is novel and general so that it may be applied to other problems in bioinformatics. Our method for plant is also implemented as a web-system and available on http://sunflower.kuicr.kyoto-u.ac.jp/~tamura/slpfa.html.
Guiding principles for peptide nanotechnology through directed discovery.

PubMed

Lampel, A; Ulijn, R V; Tuttle, T

2018-05-21

Life's diverse molecular functions are largely based on only a small number of highly conserved building blocks - the twenty canonical amino acids. These building blocks are chemically simple, but when they are organized in three-dimensional structures of tremendous complexity, new properties emerge. This review explores recent efforts in the directed discovery of functional nanoscale systems and materials based on these same amino acids, but that are not guided by copying or editing biological systems. The review summarises insights obtained using three complementary approaches of searching the sequence space to explore sequence-structure relationships for assembly, reactivity and complexation, namely: (i) strategic editing of short peptide sequences; (ii) computational approaches to predicting and comparing assembly behaviours; (iii) dynamic peptide libraries that explore the free energy landscape. These approaches give rise to guiding principles on controlling order/disorder, complexation and reactivity by peptide sequence design.
Construction and characterization of a normalized cDNA library of Nannochloropsis oculata (Eustigmatophyceae)

NASA Astrophysics Data System (ADS)

Yu, Jianzhong; Ma, Xiaolei; Pan, Kehou; Yang, Guanpin; Yu, Wengong

2010-07-01

We constructed and characterized a normalized cDNA library of Nannochloropsis oculata CS-179, and obtained 905 nonredundant sequences (NRSs) ranging from 431-1 756 bp in length. Among them, 496 were very similar to nonredundant ones in the GenBank ( E ≤1.0e-05), and 349 ESTs had significant hits with the clusters of eukaryotic orthologous groups (KOG). Bases G and/or C at the third position of codons of 14 amino acid residues suggested a strong bias in the conserved domain of 362 NRSs (>60%). We also identified the unigenes encoding phosphorus and nitrogen transporters, suggesting that N. oculata could efficiently transport and metabolize phosphorus and nitrogen, and recognized the unigenes that involved in biosynthesis and storage of both fatty acids and polyunsaturated fatty acids (PUFAs), which will facilitate the demonstration of eicosapentaenoic acid (EPA) biosynthesis pathway of N. oculata. In comparison with the original cDNA library, the normalized library significantly increased the efficiencies of random sequencing and rarely expressed genes discovering, and decreased the frequency of abundant gene sequences.
Insights into the phylogenetic positions of photosynthetic bacteria obtained from 5S rRNA and 16S rRNA sequence data

NASA Technical Reports Server (NTRS)

Fox, G. E.

1985-01-01

Comparisons of complete 16S ribosomal ribonucleic acid (rRNA) sequences established that the secondary structure of these molecules is highly conserved. Earlier work with 5S rRNA secondary structure revealed that when structural conservation exists the alignment of sequences is straightforward. The constancy of structure implies minimal functional change. Under these conditions a uniform evolutionary rate can be expected so that conditions are favorable for phylogenetic tree construction.
Hepatitis delta genotypes in chronic delta infection in the northeast of Spain (Catalonia).

PubMed

Cotrina, M; Buti, M; Jardi, R; Quer, J; Rodriguez, F; Pascual, C; Esteban, R; Guardia, J

1998-06-01

Based on genetic analysis of variants obtained around the world, three genotypes of the hepatitis delta virus have been defined. Hepatitis delta virus variants have been associated with different disease patterns and geographic distributions. To determine the prevalence of hepatitis delta virus genotypes in the northeast of Spain (Catalonia) and the correlation with transmission routes and clinical disease, we studied the nucleotide divergence of the consensus sequence of HDV RNA obtained from 33 patients with chronic delta hepatitis (24 were intravenous drug users and nine had no risk factors), and four patients with acute self-limited delta infection. Serum HDV RNA was amplified by the polymerase chain reaction technique and a fragment of 350 nucleotides (nt 910 to 1259) was directly sequenced. Genetic analysis of the nucleotide consensus sequence obtained showed a high degree of conservation among sequences (93% of mean). Comparison of these sequences with those derived from different geographic areas and pertaining to genotypes I, II and III, showed a mean sequence identity of 92% with genotype I, 73% with genotype II and 61% with genotype III. At the amino acid level (aa 115 to 214), the mean identity was 87% with genotype I, 63% with genotype II and 56% with genotype III. Conserved regions included the RNA editing domain, the carboxyl terminal 19 amino acids of the hepatitis delta antigen and the polyadenylation signal of the viral mRNA. Hepatitis delta virus isolates in the northeast of Spain are exclusively genotype I, independently of the transmission route and the type of infection. No hepatitis delta virus subgenotypes were found, suggesting that the origin of hepatitis delta virus infection in our geographical area is homogeneous.
The region of CQQQKPQRRP of PGC-1{alpha} interacts with the DNA-binding complex of FXR/RXR{alpha}

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kanaya, Eiko; Jingami, Hisato

2006-04-14

PGC-1{alpha} co-activates transcription by several nuclear receptors. To study the interaction among PGC-1{alpha}, RXR{alpha}/FXR, and DNA, we performed electrophoresis mobility shift assays. The RXR{alpha}/FXR proteins specifically bound to DNA containing the IR-1 sequence in the absence of ligand. When the fusion protein of GST-PGC-1{alpha} was added to the mixture of RXR{alpha}/FXR/DNA, the ligand-influenced retardation of the mobility was observed. The ligand for RXR{alpha} (9-cis-retinoic acid) was necessary for this retardation, whereas, the ligand for FXR, chenodeoxycholic acid, barely had an effect. The results obtained using truncated PGC-1{alpha} proteins suggested that two regions are necessary for PGC-1{alpha} to interact with themore » DNA-binding complex of RXR{alpha}/FXR. One is the region of the second leucine-rich motif, and the other is that of the amino acid sequence CQQQKPQRRP, present between the second and third leucine-rich motifs. The results obtained with the SPQSS mutation for KPQRR suggested that the basic amino acids are important for the interaction.« less
Nucleotide and deduced amino acid sequence of the envelope gene of the Vasilchenko strain of TBE virus; comparison with other flaviviruses.

PubMed

Gritsun, T S; Frolova, T V; Pogodina, V V; Lashkevich, V A; Venugopal, K; Gould, E A

1993-02-01

A strain of tick-borne encephalitis virus known as Vasilchenko (Vs) exhibits relatively low virulence characteristics in monkeys, Syrian hamsters and humans. The gene encoding the envelope glycoprotein of this virus was cloned and sequenced. Alignment of the sequence with those of other known tick-borne flaviviruses and identification of the recognised amino acid genetic marker EHLPTA confirmed its identity as a member of the TBE complex. However, Vs virus was distinguishable from eastern and western tick-borne serotypes by the presence of the sequence AQQ at amino acid positions 232-234 and also by the presence of other specific amino acid substitutions which may be genetic markers for these viruses and could determine their pathogenetic characteristics. When compared with other tick-borne flaviviruses, Vs virus had 12 unique amino acid substitutions including an additional potential glycosylation site at position (315-317). The Vs virus strain shared closest nucleotide and amino acid homology (84.5% and 95.5% respectively) with western and far eastern strains of tick-borne encephalitis virus. Comparison with the far eastern serotype of tick-borne encephalitis virus, by cross-immunoelectrophoresis of Vs virions and PAGE analysis of the extracted virion proteins, revealed differences in surface charge and virus stability that may account for the different virulence characteristics of Vs virus. These results support and enlarge upon previous data obtained from molecular and serological analysis.
Structure and characterization of a cDNA clone for phenylalanine ammonia-lyase from cut-injured roots of sweet potato

DOE Office of Scientific and Technical Information (OSTI.GOV)

Tanaka, Yoshiyuki; Matsuoka, Makoto; Yamanoto, Naoki

A cDNA clone for phenylalanine ammonia-lyase (PAL) induced in wounded sweet potato (Ipomoea batatas Lam.) root was obtained by immunoscreening a cDNA library. The protein produced in Escherichia coli cells containing the plasmid pPAL02 was indistinguishable from sweet potato PAL as judged by Ouchterlony double diffusion assays. The M{sub r} of its subunit was 77,000. The cells converted ({sup 14}C)-L-phenylalanine into ({sup 14}C)-t-cinnamic acid and PAL activity was detected in the homogenate of the cells. The activity was dependent on the presence of the pPAL02 plasmid DNA. The nucleotide sequence of the cDNA contained a 2,121-base pair (bp) open-reading framemore » capable of coding for a polypeptide with 707 amino acids (M{sub r} 77,137), a 22-bp 5{prime}-noncoding region and a 207-bp 3{prime}-noncoding region. The results suggest that the insert DNA fully encoded the amino acid sequence for sweet potato PAL that is induced by wounding. Comparison of the deduced amino acid sequence with that of a PAL cDNA fragment from Phaseolus vulgaris revealed 78.9% homology. The sequence from amino acid residues 258 to 494 was highly conserved, showing 90.7% homology.« less
Biochemical and Genetic Evidence that Enterococcus faecium L50 Produces Enterocins L50A and L50B, the sec-Dependent Enterocin P, and a Novel Bacteriocin Secreted without an N-Terminal Extension Termed Enterocin Q

PubMed Central

Cintas, Luis M.; Casaus, Pilar; Herranz, Carmen; Håvarstein, Leiv Sigve; Holo, Helge; Hernández, Pablo E.; Nes, Ingolf F.

2000-01-01

Enterococcus faecium L50 grown at 16 to 32°C produces enterocin L50 (EntL50), consisting of EntL50A and EntL50B, two unmodified non-pediocin-like peptides synthesized without an N-terminal leader sequence or signal peptide. However, the bacteriocin activity found in the cell-free culture supernatants following growth at higher temperatures (37 to 47°C) is not due to EntL50. A purification procedure including cation-exchange, hydrophobic interaction, and reverse-phase liquid chromatography has shown that the antimicrobial activity is due to two different bacteriocins. Amino acid sequences obtained by Edman degradation and DNA sequencing analyses revealed that one is identical to the sec-dependent pediocin-like enterocin P produced by E. faecium P13 (L. M. Cintas, P. Casaus, L. S. Håvarstein, P. E. Hernández, and I. F. Nes, Appl. Environ. Microbiol. 63:4321–4330, 1997) and the other is a novel unmodified non-pediocin-like bacteriocin termed enterocin Q (EntQ), with a molecular mass of 3,980. DNA sequencing analysis of a 963-bp region of E. faecium L50 containing the enterocin P structural gene (entP) and the putative immunity protein gene (entiP) reveals a genetic organization identical to that previously found in E. faecium P13. DNA sequencing analysis of a 1,448-bp region identified two consecutive but diverging open reading frames (ORFs) of which one, termed entQ, encodes a 34-amino-acid protein whose deduced amino acid sequence was identical to that obtained for EntQ by amino acid sequencing, showing that EntQ, similarly to EntL50A and EntL50B, is synthesized without an N-terminal leader sequence or signal peptide. The second ORF, termed orf2, was located immediately upstream of and in opposite orientation to entQ and encodes a putative immunity protein composed of 221 amino acids. Bacteriocin production by E. faecium L50 showed that EntP and EntQ are produced in the temperature range from 16 to 47°C and maximally detected at 47 and 37 to 47°C, respectively, while EntL50A and EntL50B are maximally synthesized at 16 to 25°C and are not detected at 37°C or above. PMID:11073927
A novel alignment-free method to classify protein folding types by combining spectral graph clustering with Chou's pseudo amino acid composition.

PubMed

Tripathi, Pooja; Pandey, Paras N

2017-07-07

The present work employs pseudo amino acid composition (PseAAC) for encoding the protein sequences in their numeric form. Later this will be arranged in the similarity matrix, which serves as input for spectral graph clustering method. Spectral methods are used previously also for clustering of protein sequences, but they uses pair wise alignment scores of protein sequences, in similarity matrix. The alignment score depends on the length of sequences, so clustering short and long sequences together may not good idea. Therefore the idea of introducing PseAAC with spectral clustering algorithm came into scene. We extensively tested our method and compared its performance with other existing machine learning methods. It is consistently observed that, the number of clusters that we obtained for a given set of proteins is close to the number of superfamilies in that set and PseAAC combined with spectral graph clustering shows the best classification results. Copyright © 2017 Elsevier Ltd. All rights reserved.
Isolation and Structural Characterization of Antioxidant Peptides from Degreased Apricot Seed Kernels.

PubMed

Zhang, Haisheng; Xue, Jing; Zhao, Huanxia; Zhao, Xinshuai; Xue, Huanhuan; Sun, Yuhan; Xue, Wanrui

2018-05-03

Background : The composition and sequence of amino acids have a prominent influence on theantioxidant activities of peptides. Objective : A series of isolation and purification experiments was conducted to explore the amino acid sequence of antioxidant peptides, which led to its antioxidation causes. Methods : The degreased apricot seed kernels were hydrolyzed by compound proteases of alkaline protease and flavor protease (3:2, u/u) to prepare apricot seed kernel hydrolysates (ASKH). ASKH were separated into ASKH-A and ASKH-B by dialysis bag. ASKH-B (MW < 3.5 kDa) was further separated into fractions by Sephadex G-25 and G-15 gel-filtration chromatography. Reversed-phase HPLC (RP-HPLC) was performed to separate fraction B4b into two antioxidant peptides (peptide B4b-4 and B4b-6). Results : The amino acid sequences were Val-Leu-Tyr-Ile-Trp and Ser-Val-Pro-Tyr-Glu, respectively. Conclusions : The results suggested that ASKH antioxidant peptides may have potential utility as healthy ingredients and as food preservatives due to their antioxidant activity. Highlights : Materials with regional characteristics were selected to explore, and hydrolysates were identified by RP-HPLC and matrix-assisted laser desorption ionization-time-of-flight-MS to obtain amino acid sequences.
Characterization and expression profiles of MaACS and MaACO genes from mulberry (Morus alba L.)*

PubMed Central

Liu, Chang-ying; Lü, Rui-hua; Li, Jun; Zhao, Ai-chun; Wang, Xi-ling; Diane, Umuhoza; Wang, Xiao-hong; Wang, Chuan-hong; Yu, Ya-sheng; Han, Shu-mei; Lu, Cheng; Yu, Mao-de

2014-01-01

1-Aminocyclopropane-1-carboxylic acid synthase (ACS) and 1-aminocyclopropane-1-carboxylic acid oxidase (ACO) are encoded by multigene families and are involved in fruit ripening by catalyzing the production of ethylene throughout the development of fruit. However, there are no reports on ACS or ACO genes in mulberry, partly because of the limited molecular research background. In this study, we have obtained five ACS gene sequences and two ACO gene sequences from Morus Genome Database. Sequence alignment and phylogenetic analysis of MaACO1 and MaACO2 showed that their amino acids are conserved compared with ACO proteins from other species. MaACS1 and MaACS2 are type I, MaACS3 and MaACS4 are type II, and MaACS5 is type III, with different C-terminal sequences. Quantitative reverse transcriptase polymerase chain reaction (qRT-PCR) expression analysis showed that the transcripts of MaACS genes were strongly expressed in fruit, and more weakly in other tissues. The expression of MaACO1 and MaACO2 showed different patterns in various mulberry tissues. MaACS and MaACO genes demonstrated two patterns throughout the development of mulberry fruit, and both of them were strongly up-regulated by abscisic acid (ABA) and ethephon. PMID:25001221
Comparative analysis and molecular characterization of genomic sequences and proteins of FABP4 and FABP5 from the giant panda (Ailuropoda melanoleuca).

PubMed

Song, B; Hou, Y L; Ding, X; Wang, T; Wang, F; Zhong, J C; Xu, T; Zhong, J; Hou, W R; Shuai, S R

2014-02-20

Fatty acid binding proteins (FABPs) are a family of small, highly conserved cytoplasmic proteins that bind long-chain fatty acids and other hydrophobic ligands. In this study, cDNA and genomic sequences of FABP4 and FABP5 were cloned successfully from the giant panda (Ailuropoda melanoleuca) using reverse transcription polymerase chain reaction (RT-PCR) technology and touchdown-PCR. The cDNAs of FABP4 and FABP5 cloned from the giant panda were 400 and 413 bp in length, containing an open reading frame of 399 and 408 bp, encoding 132 and 135 amino acids, respectively. The genomic sequences of FABP4 and FABP5 were 3976 and 3962 bp, respectively, which each contained four exons and three introns. Sequence alignment indicated a high degree of homology with reported FABP sequences of other mammals at both the amino acid and DNA levels. Topology prediction revealed seven protein kinase C phosphorylation sites, two casein kinase II phosphorylation sites, two N-myristoylation sites, and one cytosolic fatty acid-binding protein signature in the FABP4 protein, and three N-glycosylation sites, three protein kinase C phosphorylation sites, one casein kinase II phosphorylation site, one N-myristoylation site, one amidation site, and one cytosolic fatty acid-binding protein signature in the FABP5 protein. The FABP4 and FABP5 genes were overexpressed in Escherichia coli BL21 and they produced the expected 16.8- and 17.0-kDa polypeptides. The results obtained in this study provide information for further in-depth research of this system, which has great value of both theoretical and practical significance.
Physiological and Molecular Biological Characterization of Intracellular Carbonic Anhydrase from the Marine Diatom Phaeodactylum tricornutum1

PubMed Central

Satoh, Dan; Hiraoka, Yasutaka; Colman, Brian; Matsuda, Yusuke

2001-01-01

A single intracellular carbonic anhydrase (CA) was detected in air-grown and, at reduced levels, in high CO2-grown cells of the marine diatom Phaeodactylum tricornutum (UTEX 642). No external CA activity was detected irrespective of growth CO2 conditions. Ethoxyzolamide (0.4 mm), a CA-specific inhibitor, severely inhibited high-affinity photosynthesis at low concentrations of dissolved inorganic carbon, whereas 2 mm acetazolamide had little effect on the affinity for dissolved inorganic carbon, suggesting that internal CA is crucial for the operation of a carbon concentrating mechanism in P. tricornutum. Internal CA was purified 36.7-fold of that of cell homogenates by ammonium sulfate precipitation, and two-step column chromatography on diethylaminoethyl-sephacel and p-aminomethylbenzene sulfone amide agarose. The purified CA was shown, by SDS-PAGE, to comprise an electrophoretically single polypeptide of 28 kD under both reduced and nonreduced conditions. The entire sequence of the cDNA of this CA was obtained by the rapid amplification of cDNA ends method and indicated that the cDNA encodes 282 amino acids. Comparison of this putative precursor sequence with the N-terminal amino acid sequence of the purified CA indicated that it included a possible signal sequence of up to 46 amino acids at the N terminus. The mature CA was found to consist of 236 amino acids and the sequence was homologous to β-type CAs. Even though the zinc-ligand amino acid residues were shown to be completely conserved, the amino acid residues that may constitute a CO2-binding site appeared to be unique among the β-CAs so far reported. PMID:11500545
Cloning and characterization of the novel D-aspartyl endopeptidase, paenidase, from Paenibacillus sp. B38.

PubMed

Nirasawa, Satoru; Nakahara, Kazuhiko; Takahashi, Saori

2018-02-27

Paenidase is the first microorganism-derived D-aspartyl endopeptidase that specifically recognizes an internal D-Asp residue to cleave [D-Asp]-X peptide bonds. Using peptide sequences obtained from the protein, we performed PCR with degenerate primers to amplify the paenidase I-encoding gene. Nucleotide sequencing revealed that mature paenidase I consists of 322 amino acid residues and that the protein is encoded as a pro-protein with a 197-amino-acid N-terminal extension compared to the mature protein. Paenidase I exhibits amino acid sequence similarity to several penicillin-binding proteins. In addition, paenidase I was classified into peptidase family S12 based on a MEROPS database search. Family S12 contains serine-type D-Ala-D-Ala carboxypeptidases that have three active site residues (Ser, Lys, and Tyr) in the conserved motifs Ser-Xaa-Thr-Lys and Tyr-Xaa-Asn. These motifs were conserved in the primary structure of paenidase I, and the role of these residues was confirmed by site-directed mutagenesis.
Identification of potential platelet alloantigens in the Equidae family by comparison of gene sequences encoding major platelet membrane glycoproteins.

PubMed

Boudreaux, Mary K; Humphries, Drew M

2013-12-01

Platelet alloantigens in horses may play an important role in the development of neonatal alloimmune thrombocytopenia (NAIT). The objective of this study was to evaluate genes encoding major platelet glycoproteins within the Equidae family in an effort to identify potential alloantigens. DNA was isolated from blood samples obtained from Equidae family members, including a Holsteiner-Oldenburg cross, a Quarter horse, a donkey, and a Plains zebra (Equus burchelli). Gene sequences encoding equine platelet membrane glycoproteins IIb, IIIa (integrin subunits αIIb and β3), Ia (integrin subunit α2), and Ibα were determined using PCR. Gene sequences were compared to the equine genome available on GenBank. Polymorphisms that would be predicted to result in amino acid changes on platelet surfaces were documented and compared with known alloantigenic sites documented on human platelets. Amino acid differences were predicted based on nucleotide sequences for all 4 genes. Nine differences were documented for αIIb, 5 differences were documented for β3, 7 differences were documented for α2, and 16 differences were documented for Ibα outside the macroglycopeptide region. This study represents the first effort at identifying potential platelet alloantigens in members of the Equidae Family based on evaluation of gene sequences. The data obtained form the groundwork for identifying potential platelet alloantigens involved in transfusion reactions and neonatal alloimmune thrombocytopenia (NAIT). More work is required to determine whether the predicted amino acid differences documented in this study play a role in alloimmunity, and whether other polymorphisms not detected in this study are present that may result in alloimmunity. © 2013 American Society for Veterinary Clinical Pathology.
Sequence Analysis and Domain Motifs in the Porcine Skin Decorin Glycosaminoglycan Chain*

PubMed Central

Zhao, Xue; Yang, Bo; Solakylidirim, Kemal; Joo, Eun Ji; Toida, Toshihiko; Higashi, Kyohei; Linhardt, Robert J.; Li, Lingyun

2013-01-01

Decorin proteoglycan is comprised of a core protein containing a single O-linked dermatan sulfate/chondroitin sulfate glycosaminoglycan (GAG) chain. Although the sequence of the decorin core protein is determined by the gene encoding its structure, the structure of its GAG chain is determined in the Golgi. The recent application of modern MS to bikunin, a far simpler chondroitin sulfate proteoglycans, suggests that it has a single or small number of defined sequences. On this basis, a similar approach to sequence the decorin of porcine skin much larger and more structurally complex dermatan sulfate/chondroitin sulfate GAG chain was undertaken. This approach resulted in information on the consistency/variability of its linkage region at the reducing end of the GAG chain, its iduronic acid-rich domain, glucuronic acid-rich domain, and non-reducing end. A general motif for the porcine skin decorin GAG chain was established. A single small decorin GAG chain was sequenced using MS/MS analysis. The data obtained in the study suggest that the decorin GAG chain has a small or a limited number of sequences. PMID:23423381
An improved TCF sequence for biobleaching kenaf pulp: influence of the hexenuronic acid content and the use of xylanase.

PubMed

Andreu, Glòria; Vidal, Teresa

2014-01-01

Enzymatic delignification with laccase from Trametes villosa used in combination with chemical mediators (acetosyringone, acetovanillone and 1-hydroxybenzotriazole) to improve the totally chlorine-free (TCF) bleaching of kenaf pulp was studied. The best final pulp properties were obtained by using an LHBTQPo sequence developed by incorporating a laccase-mediator stage into an industrial bleaching sequence involving chelation and peroxide stages. The new sequence resulted in increased kenaf pulp delignification (90.4%) and brightness (77.2%ISO) relative to a conventional TCF chemical sequence (74.5% delignification and 74.5% brightness). Also, the sequence provided bleached kenaf fibers with high cellulose content (pulp viscosity of 890 g·mL(-1) vs 660 g·mL(-1)). Scanning electron micrographs revealed that xylanase altered fiber surfaces and facilitated reagent access as a result. However, the LHBTX (xylanase) stage removed 21% of hexenuronic acids in kenaf pulp. These recalcitrant compounds spent additional bleaching reagents and affected pulp properties after peroxide stage. Copyright © 2013 Elsevier Ltd. All rights reserved.
An artificial intelligence approach fit for tRNA gene studies in the era of big sequence data.

PubMed

Iwasaki, Yuki; Abe, Takashi; Wada, Kennosuke; Wada, Yoshiko; Ikemura, Toshimichi

2017-09-12

Unsupervised data mining capable of extracting a wide range of knowledge from big data without prior knowledge or particular models is a timely application in the era of big sequence data accumulation in genome research. By handling oligonucleotide compositions as high-dimensional data, we have previously modified the conventional self-organizing map (SOM) for genome informatics and established BLSOM, which can analyze more than ten million sequences simultaneously. Here, we develop BLSOM specialized for tRNA genes (tDNAs) that can cluster (self-organize) more than one million microbial tDNAs according to their cognate amino acid solely depending on tetra- and pentanucleotide compositions. This unsupervised clustering can reveal combinatorial oligonucleotide motifs that are responsible for the amino acid-dependent clustering, as well as other functionally and structurally important consensus motifs, which have been evolutionarily conserved. BLSOM is also useful for identifying tDNAs as phylogenetic markers for special phylotypes. When we constructed BLSOM with 'species-unknown' tDNAs from metagenomic sequences plus 'species-known' microbial tDNAs, a large portion of metagenomic tDNAs self-organized with species-known tDNAs, yielding information on microbial communities in environmental samples. BLSOM can also enhance accuracy in the tDNA database obtained from big sequence data. This unsupervised data mining should become important for studying numerous functionally unclear RNAs obtained from a wide range of organisms.

Method for isolating chromosomal DNA in preparation for hybridization in suspension

DOEpatents

Lucas, Joe N.

2000-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. Chromosomal DNA in a sample containing cell debris is prepared for hybridization in suspension by treating the mixture with RNase. The treated DNA can also be fixed prior to hybridization.
Assessment of the microbial community in a constructed wetland that receives acid coal mine drainage

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nicomrat, D.; Dick, W.A.; Tuovinen, O.H.

2006-01-15

Constructed wetlands are used to treat acid drainage from surface or underground coal mines. However, little is known about the microbial communities in the receiving wetland cells. The purpose of this work was to characterize the microbial population present in a wetland that was receiving acid coal mine drainage (AMD). Samples were collected from the oxic sediment zone of a constructed wetland cell in southeastern Ohio that was treating acid drainage from an underground coal mine seep. Samples comprised Fe(Ill) precipitates and were pretreated with ammonium oxalate to remove interfering iron, and the DNA was extracted and purified by agarosemore » gel electrophoresis prior to amplification of portions of the 16S rRNA gene. Amplified products were separated by denaturing gradient gel electrophoresis and DNA from seven distinct bands was excised from the gel and sequenced. The sequences were matched to sequences in the GenBank bacterial 16S rDNA database. The DNA in two of the bands yielded matches with Acidithiobacillus ferrooxidans and the DNA in each of the remaining five bands was consistent with one of the following microorganisms: Acidithiobacillus thiooxidans, strain TRA3-20 (a eubacterium), strain BEN-4 (an arsenite-oxidizing bacterium), an Alcaligenes sp., and a Bordetella sp. Low bacterial diversity in these samples reflects the highly inorganic nature of the oxic sediment layer where high abundance of iron- and sulfur-oxidizing bacteria would be expected. The results we obtained by molecular methods supported our findings, obtained using culture methods, that the dominant microbial species in an acid receiving, oxic wetland are A. thiooxidans and A. ferrooxidans.« less
On the inhibition of muscle membrane chloride conductance by aromatic carboxylic acids

PubMed Central

Palade, PT; Barchi, RL

1977-01-01

25 aromatic carboxylic acids which are analogs of benzoic acid were tested in the rat diaphragm preparation for effects on chloride conductance (G(Cl)). Of the 25, 19 were shown to reduce membrane G(Cl) with little effect on other membrane parameters, although their apparent K(i) varied widely. This inhibition was reversible if exposure times were not prolonged. The most effective analog studied was anthracene-9-COOH (9-AC; K(i) = 1.1 x 10(-5) M). Active analogs produced concentration-dependent inhibition of a type consistent with interaction at a single site or group of sites having similar binding affinities, although a correlation could also be shown between lipophilicity and K(i). Structure-activity analysis indicated that hydrophobic ring substitution usually increased inhibitory activity while para polar substitutions reduced effectiveness. These compounds do not appear to inhibit G(Cl) by altering membrane surface charge and the inhibition produced is not voltage dependent. Qualitative characteristics of the I-V relationship for Cl(-) current are not altered. Conductance to all anions is not uniformly altered by these acids as would be expected from steric occlusion of a common channel. Concentrations of 9-AC reducing G(Cl) by more than 90 percent resulted in slight augmentation of G(I). The complete conductance sequence obtained at high levels of 9-AC was the reverse of that obtained under control conditions. Permeability sequences underwent progressive changes with increasing 9-AC concentration and ultimately inverted at high levels of the analog. Aromatic carboxylic acids appear to inhibit G(Cl) by binding to a specific intramembrane site and altering the selectivity sequence of the membrane anion channel. PMID:894246
Natural proteins: Sources, isolation, characterization and applications

PubMed Central

Nehete, Jitendra Y.; Bhambar, Rajendra S.; Narkhede, Minal R.; Gawali, Sonali R.

2013-01-01

Worldwide, plant protein contributes substantially as a food resource because it contains essential amino acids for meeting human physiological requirements. However, many versatile plant proteins are used as medicinal agents as they are produced by using molecular tools of biotechnology. Proteins can be obtained from plants, animals and microorganism cells. The abundant economical proteins can be obtained from plant seeds. These natural proteins are obtained by isolation procedures depending on the physicochemical properties of proteins. Isolation and purification of single protein from cells containing mixtures of unrelated proteins is achievable due to the physical and chemical attributes of proteins. The following characteristics are unique to each protein: Amino acid composition, sequence, subunit structures, size, shape, net charge, isoelectric point, solubility, heat stability and hydrophobicity. Based on these properties, various methods of isolation exist, like salting out and isoionic precipitation. Purification of proteins is quiet challenging and, therefore, several approaches like sodium dodecyl sulfate gel electrophoresis and chromatography are available. Characterization of proteins can be performed by mass spectrometry/liquid chromatography-mass spectrometry (LC-MS). The amino acid sequence of a protein can be detected by using tandem mass spectrometry. In this article, a review has been made on the sources, isolation, purification and characterization of natural proteins. PMID:24347918
Efficient production of artificially designed gelatins with a Bacillus brevis system.

PubMed

Kajino, T; Takahashi, H; Hirai, M; Yamada, Y

2000-01-01

Artificially designed gelatins comprising tandemly repeated 30-amino-acid peptide units derived from human alphaI collagen were successfully produced with a Bacillus brevis system. The DNA encoding the peptide unit was synthesized by taking into consideration the codon usage of the host cells, but no clones having a tandemly repeated gene were obtained through the above-mentioned strategy. Minirepeat genes could be selected in vivo from a mixture of every possible sequence encoding an artificial gelatin by randomly ligating the mixed sequence unit and transforming it into Escherichia coli. Larger repeat genes constructed by connecting minirepeat genes obtained by in vivo selection were also stable in the expression host cells. Gelatins derived from the eight-unit and six-unit repeat genes were extracellularly produced at the level of 0.5 g/liter and easily purified by ammonium sulfate fractionation and anion-exchange chromatography. The purified artificial gelatins had the predicted N-terminal sequences and amino acid compositions and a solgel property similar to that of the native gelatin. These results suggest that the selection of a repeat unit sequence stable in an expression host is a shortcut for the efficient production of repetitive proteins and that it can conveniently be achieved by the in vivo selection method. This study revealed the possible industrial application of artificially designed repetitive proteins.
Isolation and N-terminal sequencing of a novel cadmium-binding protein from Boletus edulis

NASA Astrophysics Data System (ADS)

Collin-Hansen, C.; Andersen, R. A.; Steinnes, E.

2003-05-01

A Cd-binding protein was isolated from the popular edible mushroom Boletus edulis, which is a hyperaccumulator of both Cd and Hg. Wild-growing samples of B. edulis were collected from soils rich in Cd. Cd radiotracer was added to the crude protein preparation obtained from ethanol precipitation of heat-treated cytosol. Proteins were then further separated in two consecutive steps; gel filtration and anion exchange chromatography. In both steps the Cd radiotracer profile showed only one distinct peak, which corresponded well with the profiles of endogenous Cd obtained by atomic absorption spectrophotometry (AAS). Concentrations of the essential elements Cu and Zn were low in the protein fractions high in Cd. N-terminal sequencing performed on the Cd-binding protein fractions revealed a protein with a novel amino acid sequence, which contained aromatic amino acids as well as proline. Both the N-terminal sequencing and spectrofluorimetric analysis with EDTA and ABD-F (4-aminosulfonyl-7-fluoro-2, 1, 3-benzoxadiazole) failed to detect cysteine in the Cd-binding fractions. These findings conclude that the novel protein does not belong to the metallothionein family. The results suggest a role for the protein in Cd transport and storage, and they are of importance in view of toxicology and food chemistry, but also for environmental protection.
Terminal sequence importance of de novo proteins from binary-patterned library: stable artificial proteins with 11- or 12-amino acid alphabet.

PubMed

Okura, Hiromichi; Takahashi, Tsuyoshi; Mihara, Hisakazu

2012-06-01

Successful approaches of de novo protein design suggest a great potential to create novel structural folds and to understand natural rules of protein folding. For these purposes, smaller and simpler de novo proteins have been developed. Here, we constructed smaller proteins by removing the terminal sequences from stable de novo vTAJ proteins and compared stabilities between mutant and original proteins. vTAJ proteins were screened from an α3β3 binary-patterned library which was designed with polar/ nonpolar periodicities of α-helix and β-sheet. vTAJ proteins have the additional terminal sequences due to the method of constructing the genetically repeated library sequences. By removing the parts of the sequences, we successfully obtained the stable smaller de novo protein mutants with fewer amino acid alphabets than the originals. However, these mutants showed the differences on ANS binding properties and stabilities against denaturant and pH change. The terminal sequences, which were designed just as flexible linkers not as secondary structure units, sufficiently affected these physicochemical details. This study showed implications for adjusting protein stabilities by designing N- and C-terminal sequences.
Sphaeridiotrema globulus and Sphaeridiotrema pseudoglobulus (Digenea): Species Differentiation Based On mtDNA (Barcode) and Partial LSU–rDNA Sequences

USGS Publications Warehouse

Bergmame, Laura; Huffman, Jane; Cole, Rebecca; Dayanandan, Selvadurai; Tkach, Vasyl; McLaughlin, J. Daniel

2011-01-01

Flukes belonging to Sphaeridiotrema are important parasites of waterfowl, and 2 morphologically similar species Sphaeridiotrema globulus and Sphaeridiotrema pseudoglobulus, have been implicated in waterfowl mortality in North America. Cytochrome oxidase I (barcode region) and partial LSU-rDNA sequences from specimens of S. globulus and S. pseudoglobulus, obtained from naturally and experimentally infected hosts from New Jersey and Quebec, respectively, confirmed that these species were distinct. Barcode sequences of the 2 species differed at 92 of 590 nucleotide positions (15.6%) and the translated sequences differed by 13 amino acid residues. Partial LSU-rDNA sequences differed at 29 of 1,208 nucleotide positions (2.4%). Additional barcode sequences from specimens collected from waterfowl in Wisconsin and Minnesota and morphometric data obtained from specimens acquired along the north shore of Lake Superior revealed the presence of S. pseudoglobulus in these areas. Although morphometric data suggested the presence of S. globulus in the Lake Superior sample, it was not found among the specimens sequenced from Wisconsin or Minnesota.
Sphaeridiotrema globulus and Sphaeridiotrema pseudoglobulus (Digenea): Species Differentiation Based on mtDNA (Barcode) and Partial LSUrDNA Sequences

USGS Publications Warehouse

Bergmame, L.; Huffman, J.; Cole, R.; Dayanandan, S.; Tkach, V.; McLaughlin, J.D.

2011-01-01

Flukes belonging to Sphaeridiotrema are important parasites of waterfowl, and 2 morphologically similar species Sphaeridiotrema globulus and Sphaeridiotrema pseudoglobulus, have been implicated in waterfowl mortality in North America. Cytochrome oxidase I (barcode region) and partial LSU-rDNA sequences from specimens of S. globulus and S. pseudoglobulus, obtained from naturally and experimentally infected hosts from New Jersey and Quebec, respectively, confirmed that these species were distinct. Barcode sequences of the 2 species differed at 92 of 590 nucleotide positions (15.6%) and the translated sequences differed by 13 amino acid residues. Partial LSU-rDNA sequences differed at 29 of 1,208 nucleotide positions (2.4%). Additional barcode sequences from specimens collected from waterfowl in Wisconsin and Minnesota and morphometric data obtained from specimens acquired along the north shore of Lake Superior revealed the presence of S. pseudoglobulus in these areas. Although morphometric data suggested the presence of S. globulus in the Lake Superior sample, it was not found among the specimens sequenced from Wisconsin or Minnesota. ?? 2011 American Society of Parasitologists.
Streptomyces pharmamarensis sp. nov. isolated from a marine sediment.

PubMed

Carro, Lorena; Zúñiga, Paz; de la Calle, Fernando; Trujillo, Martha E

2012-05-01

A Gram-stain-positive actinobacterium, strain PM267(T), was isolated from a marine sediment sample in the Mediterranean Sea. The novel strain produced extensively branched substrate and aerial hyphae that carried spiral spore chains. Substrate and aerial mycelia were cream-white and white, respectively. Diffusible pigments were not observed. 16S rRNA gene sequence analysis revealed that strain PM267(T) belonged to the genus Streptomyces and shared a gene sequence similarity of 97.1 % with Streptomyces artemisiae YIM 63135(T) and Streptomyces armeniacus JCM 3070(T). Values <97 % were obtained with other sequences representing members of the genus Streptomyces. The cell wall peptidoglycan contained ll-diaminopimelic acid. MK-9(H(8)) was the major menaquinone. The phospholipid pattern included phosphatidylethanolamine as diagnostic lipid (type II). Major fatty acids found were iso- and anteiso- fatty acids. The G+C content of the DNA was 71.2 mol%. The strain was halotolerant and was able to grow in the presence of 9 % (w/v) NaCl (with an optimum of 2 %). On the basis of these results and additional physiological data obtained in the present study, strain PM267(T) represents a novel species within the genus Streptomyces for which the name Streptomyces pharmamarensis sp. nov. is proposed (type strain PM267(T) = CECT 7841(T) = DSM 42032(T)).
Biochemical and Genetic Characterization of Coagulin, a New Antilisterial Bacteriocin in the Pediocin Family of Bacteriocins, Produced by Bacillus coagulans I4

PubMed Central

Le Marrec, Claire; Hyronimus, Bertrand; Bressollier, Philippe; Verneuil, Bernard; Urdaci, Maria C.

2000-01-01

A plasmid-linked antimicrobial peptide, named coagulin, produced by Bacillus coagulans I4 has recently been reported (B. Hyronimus, C. Le Marrec and M. C. Urdaci, J. Appl. Microbiol. 85:42–50, 1998). In the present study, the complete, unambiguous primary amino acid sequence of the peptide was obtained by a combination of both N-terminal sequencing of purified peptide and the complete sequence deduced from the structural gene harbored by plasmid I4. Data revealed that this peptide of 44 residues has an amino acid sequence similar to that described for pediocins AcH and PA-1, produced by different Pediococcus acidilactici strains and 100% identical. Coagulin and pediocin differed only by a single amino acid at their C terminus. Analysis of the genetic determinants revealed the presence, on the pI4 DNA, of the entire 3.5-kb operon of four genes described for pediocin AcH and PA-1 production. No extended homology was observed between pSMB74 from P. acidilactici and pI4 when analyzing the regions upstream and downstream of the operon. An oppositely oriented gene immediately dowstream of the bacteriocin operon specifies a 474-amino-acid protein which shows homology to Mob-Pre (plasmid recombination enzyme) proteins encoded by several small plasmids extracted from gram-positive bacteria. This is the first report of a pediocin-like peptide appearing naturally in a non-lactic acid bacterium genus. PMID:11097892
Optimization of Reversed-Phase Peptide Liquid Chromatography Ultraviolet Mass Spectrometry Analyses Using an Automated Blending Methodology

PubMed Central

Chakraborty, Asish B.; Berger, Scott J.

2005-01-01

The balance between chromatographic performance and mass spectrometric response has been evaluated using an automated series of experiments where separations are produced by the real-time automated blending of water with organic and acidic modifiers. In this work, the concentration effects of two acidic modifiers (formic acid and trifluoroacetic acid) were studied on the separation selectivity, ultraviolet, and mass spectrometry detector response, using a complex peptide mixture. Peptide retention selectivity differences were apparent between the two modifiers, and under the conditions studied, trifluoroacetic acid produced slightly narrower (more concentrated) peaks, but significantly higher electrospray mass spectrometry suppression. Trifluoroacetic acid suppression of electrospray signal and influence on peptide retention and selectivity was dominant when mixtures of the two modifiers were analyzed. Our experimental results indicate that in analyses where the analyzed components are roughly equimolar (e.g., a peptide map of a recombinant protein), the selectivity of peptide separations can be optimized by choice and concentration of acidic modifier, without compromising the ability to obtain effective sequence coverage of a protein. In some cases, these selectivity differences were explored further, and a rational basis for differentiating acidic modifier effects from the underlying peptide sequences is described. PMID:16522853
Cloning of a cDNA encoding 1-aminocyclopropane-1-carboxylate synthase and expression of its mRNA in ripening apple fruit.

PubMed

Dong, J G; Kim, W T; Yip, W K; Thompson, G A; Li, L; Bennett, A B; Yang, S F

1991-08-01

1-Aminocyclopropane-1-carboxylate (ACC) synthase (EC 4.4.1.14) purified from apple (Malus sylvestris Mill.) fruit was subjected to trypsin digestion. Following separation by reversed-phase high-pressure liquid chromatography, ten tryptic peptides were sequenced. Based on the sequences of three tryptic peptides, three sets of mixed oligonucleotide probes were synthesized and used to screen a plasmid cDNA library prepared from poly(A)(+) RNA of ripe apple fruit. A 1.5-kb (kilobase) cDNA clone which hybridized to all three probes were isolated. The clone contained an open reading frame of 1214 base pairs (bp) encoding a sequence of 404 amino acids. While the polyadenine tail at the 3'-end was intact, it lacked a portion of sequence at the 5'-end. Using the RNA-based polymerase chain reaction, an additional sequence of 148 bp was obtained at the 5'-end. Thus, 1362 bp were sequenced and they encode 454 amino acids. The deduced amino-acid sequence contained peptide sequences corresponding to all ten tryptic fragments, confirming the identity of the cDNA clone. Comparison of the deduced amino-acid sequence between ACC synthase from apple fruit and those from tomato (Lycopersicon esculentum Mill.) and winter squash (Cucurbita maxima Duch.) fruits demonstrated the presence of seven highly conserved regions, including the previously identified region for the active site. The size of the translation product of ACC-synthase mRNA was similar to that of the mature protein on sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), indicating that apple ACC-synthase undergoes only minor, if any, post-translational proteolytic processing. Analysis of ACC-synthase mRNA by in-vitro translation-immunoprecipitation, and by Northern blotting indicates that the ACC-synthase mRNA was undetectable in unripe fruit, but was accumulated massively during the ripening proccess. These data demonstrate that the expression of the ACC-synthase gene is developmentally regulated.
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp.

PubMed

Deng, Peng; Tan, Xiaoqing; Wu, Ying; Bai, Qunhua; Jia, Yan; Xiao, Hong

2015-03-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica , which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function.
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp

PubMed Central

DENG, PENG; TAN, XIAOQING; WU, YING; BAI, QUNHUA; JIA, YAN; XIAO, HONG

2015-01-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function. PMID:25667630
A comparison of anaerobic 2, 4-dichlorophenoxy acetic acid degradation in single-fed and sequencing batch reactor systems

NASA Astrophysics Data System (ADS)

Elefsiniotis, P.; Wareham, D. G.; Fongsatitukul, P.

2017-08-01

This paper compares the practical limits of 2, 4-dichlorophenoxy acetic acid (2,4-D) degradation that can be obtained in two laboratory-scale anaerobic digestion systems; namely, a sequencing batch reactor (SBR) and a single-fed batch reactor (SFBR) system. The comparison involved synthesizing a decade of research conducted by the lead author and drawing summative conclusions about the ability of each system to accommodate industrial-strength concentrations of 2,4-D. In the main, 2 L liquid volume anaerobic SBRs were used with glucose as a supplemental carbon source for both acid-phase and two-phase conditions. Volatile fatty acids however were used as a supplemental carbon source for the methanogenic SBRs. The anaerobic SBRs were operated at an hydraulic retention time of 48 hours, while being subjected to increasing concentrations of 2,4-D. The SBRs were able to degrade between 130 and 180 mg/L of 2,4-D depending upon whether they were operated in the acid-phase or two-phase regime. The methanogenic-only phase did not achieve 2,4-D degradation however this was primarily attributed to difficulties with obtaining a sufficiently long SRT. For the two-phase SFBR system, 3.5 L liquid-volume digesters were used and no difficulty was experienced with degrading 100 % of the 2,4-D concentration applied (300 mg/L).
Sequence analysis of dolphin ferritin H and L subunits and possible iron-dependent translational control of dolphin ferritin gene

PubMed Central

Takaesu, Azusa; Watanabe, Kiyotaka; Takai, Shinji; Sasaki, Yukako; Orino, Koichi

2008-01-01

Background Iron-storage protein, ferritin plays a central role in iron metabolism. Ferritin has dual function to store iron and segregate iron for protection of iron-catalyzed reactive oxygen species. Tissue ferritin is composed of two kinds of subunits (H: heavy chain or heart-type subunit; L: light chain or liver-type subunit). Ferritin gene expression is controlled at translational level in iron-dependent manner or at transcriptional level in iron-independent manner. However, sequencing analysis of marine mammalian ferritin subunits has not yet been performed fully. The purpose of this study is to reveal cDNA-derived amino acid sequences of cetacean ferritin H and L subunits, and demonstrate the possibility of expression of these subunits, especially H subunit, by iron. Methods Sequence analyses of cetacean ferritin H and L subunits were performed by direct sequencing of polymerase chain reaction (PCR) fragments from cDNAs generated via reverse transcription-PCR of leukocyte total RNA prepared from blood samples of six different dolphin species (Pseudorca crassidens, Lagenorhynchus obliquidens, Grampus griseus, Globicephala macrorhynchus, Tursiops truncatus, and Delphinapterus leucas). The putative iron-responsive element sequence in the 5'-untranslated region of the six different dolphin species was revealed by direct sequencing of PCR fragments obtained using leukocyte genomic DNA. Results Dolphin H and L subunits consist of 182 and 174 amino acids, respectively, and amino acid sequence identities of ferritin subunits among these dolphins are highly conserved (H: 99–100%, (99→98) ; L: 98–100%). The conserved 28 bp IRE sequence was located -144 bp upstream from the initiation codon in the six different dolphin species. Conclusion These results indicate that six different dolphin species have conserved ferritin sequences, and suggest that these genes are iron-dependently expressed. PMID:18954429
Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

PubMed

Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

1988-02-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.
Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

PubMed Central

Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

1988-01-01

Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators. Images PMID:3257578
Amino acid sequence of the smaller basic protein from rat brain myelin

PubMed Central

Dunkley, Peter R.; Carnegie, Patrick R.

1974-01-01

1. The complete amino acid sequence of the smaller basic protein from rat brain myelin was determined. This protein differs from myelin basic proteins of other species in having a deletion of a polypeptide of 40 amino acid residues from the centre of the molecule. 2. A detailed comparison is made of the constant and variable regions in a group of myelin basic proteins from six species. 3. An arginine residue in the rat protein was found to be partially methylated. The ratio of methylated to unmethylated arginine at this position differed from that found for the human basic protein. 4. Three tryptic peptides were isolated in more than one form. The differences between the two forms of each peptide are discussed in relation to the electrophoretic heterogeneity of myelin basic proteins, which is known to occur at alkaline pH values. 5. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50029 at the British Library (Lending Division) (formerly the National Lending Library for Science and Technology), Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1973) 131, 5. PMID:4141893

An oleate 12-hydroxylase from Ricinus communis L. is a fatty acyl desaturase homolog

DOE Office of Scientific and Technical Information (OSTI.GOV)

Van De Loo, F.J.; Broun, P.; Turner, S.

1995-07-18

Recent spectroscopic evidence implicating a binuclear iron site at the reaction center of fatty acyl desaturases suggested to us that certain fatty acyl hydroxylases may share significant amino acid sequence similarity with desaturases. To test this theory, we prepared a cDNA library from developing endosperm of the castor-oil plant (Ricinus communis L.) and obtained partial nucleotide sequences for 468 anonymous clones that were not expressed at high levels in leaves, a tissue deficient in 12-hydroxyoleic acid. This resulted in the identification of several cDNA clones encoding a polypeptide of 387 amino acids with a predicted molecular weight of 44,407 andmore » with {approx}67% sequence homology to microsomal oleate desaturase from Arabidopsis. Expression of a full-length clone under control of the cauliflower mosaic virus 35S promoter in transgenic tobacco resulted in the accumulation of low levels of 12-hydroxyoleic acid in seeds, indicating that the clone encodes the castor oleate hydroxylase. These results suggest that fatty acyl desaturases and hydroxylases share similar reaction mechanisms and provide an example of enzyme evolution. 26 refs., 6 figs., 1 tab.« less
Prediction of glutathionylation sites in proteins using minimal sequence information and their experimental validation.

PubMed

Pal, Debojyoti; Sharma, Deepak; Kumar, Mukesh; Sandur, Santosh K

2016-09-01

S-glutathionylation of proteins plays an important role in various biological processes and is known to be protective modification during oxidative stress. Since, experimental detection of S-glutathionylation is labor intensive and time consuming, bioinformatics based approach is a viable alternative. Available methods require relatively longer sequence information, which may prevent prediction if sequence information is incomplete. Here, we present a model to predict glutathionylation sites from pentapeptide sequences. It is based upon differential association of amino acids with glutathionylated and non-glutathionylated cysteines from a database of experimentally verified sequences. This data was used to calculate position dependent F-scores, which measure how a particular amino acid at a particular position may affect the likelihood of glutathionylation event. Glutathionylation-score (G-score), indicating propensity of a sequence to undergo glutathionylation, was calculated using position-dependent F-scores for each amino-acid. Cut-off values were used for prediction. Our model returned an accuracy of 58% with Matthew's correlation-coefficient (MCC) value of 0.165. On an independent dataset, our model outperformed the currently available model, in spite of needing much less sequence information. Pentapeptide motifs having high abundance among glutathionylated proteins were identified. A list of potential glutathionylation hotspot sequences were obtained by assigning G-scores and subsequent Protein-BLAST analysis revealed a total of 254 putative glutathionable proteins, a number of which were already known to be glutathionylated. Our model predicted glutathionylation sites in 93.93% of experimentally verified glutathionylated proteins. Outcome of this study may assist in discovering novel glutathionylation sites and finding candidate proteins for glutathionylation.
Elman RNN based classification of proteins sequences on account of their mutual information.

PubMed

Mishra, Pooja; Nath Pandey, Paras

2012-10-21

In the present work we have employed the method of estimating residue correlation within the protein sequences, by using the mutual information (MI) of adjacent residues, based on structural and solvent accessibility properties of amino acids. The long range correlation between nonadjacent residues is improved by constructing a mutual information vector (MIV) for a single protein sequence, like this each protein sequence is associated with its corresponding MIVs. These MIVs are given to Elman RNN to obtain the classification of protein sequences. The modeling power of MIV was shown to be significantly better, giving a new approach towards alignment free classification of protein sequences. We also conclude that sequence structural and solvent accessible property based MIVs are better predictor. Copyright © 2012 Elsevier Ltd. All rights reserved.
Characterization of papain-like isoenzymes from latex of Asclepias curassavica by molecular biology validated by proteomic approach.

PubMed

Obregón, Walter D; Liggieri, Constanza S; Trejo, Sebastian A; Avilés, Francesc X; Vairo-Cavalli, Sandra E; Priolo, Nora S

2009-01-01

Latices from Asclepias spp are used in wound healing and the treatment of some digestive disorders. These pharmacological actions have been attributed to the presence of cysteine proteases in these milky latices. Asclepias curassavica (Asclepiadaceae), "scarlet milkweed" is a perennial subshrub native to South America. In the current paper we report a new approach directed at the selective biochemical and molecular characterization of asclepain cI (acI) and asclepain cII (acII), the enzymes responsible for the proteolytic activity of the scarlet milkweed latex. SDS-PAGE spots of both purified peptidases were digested with trypsin and Peptide Mass Fingerprints (PMFs) obtained showed no equivalent peptides. No identification was possible by MASCOT search due to the paucity of information concerning Asclepiadaceae latex cysteine proteinases available in databases. From total RNA extracted from latex samples, cDNA of both peptidases was obtained by RT-PCR using degenerate primers encoding Asclepiadaceae cysteine peptidase conserved domains. Theoretical PMFs of partial polypeptide sequences obtained by cloning (186 and 185 amino acids) were compared with empirical PMFs, confirming that the sequences of 186 and 185 amino acids correspond to acI and acII, respectively. N-terminal sequences of acI and acII, characterized by Edman sequencing, were overlapped with those coming from the cDNA to obtain the full-length sequence of both mature peptidases (212 and 211 residues respectively). Alignment and phylogenetic analysis confirmed that acI and acII belong to the subfamily C1A forming a new group of papain-like cysteine peptidases together with asclepain f from Asclepias fruticosa. We conclude that PMF could be adopted as an excellent tool to differentiate, in a fast and unequivocal way, peptidases with very similar physicochemical and functional properties, with advantages over other conventional methods (for instance enzyme kinetics) that are time consuming and afford less reliable results.
Sequence diversity of hepatitis C virus 6a within the extended interferon sensitivity-determining region correlates with interferon-alpha/ribavirin treatment outcomes.

PubMed

Zhou, Daniel X M; Chan, Paul K S; Zhang, Tiejun; Tully, Damien C; Tam, John S

2010-10-01

Studies on the association between sequence variability of the interferon sensitivity-determining region (ISDR) of hepatitis C virus and the outcome of treatment have reached conflicting results. In this study, 25 patients infected with HCV 6a who had received interferon-alpha/ribavirin combination treatment were analyzed for the sequence variations. 14 of them had the full genome sequences obtained from a previous study, whereas the other 11 samples were sequenced for the extended ISDR (eISDR). This eISDR fragment covers 192 bp (64 amino acids) upstream and 201 bp (67 amino acids) downstream from the ISDR previously defined for HCV 1b. The comparison between interferon-alpha resistance and response groups for the amino acid mutations located in the full genome (6 and 8 patients respectively) as well as the mutations located in the eISDR (10 and 15 patients respectively) showed that the mutations I2160V, I2256V, V2292I (P<0.05) within eISDR were significantly associated with resistance to treatment. However, the extent of amino acid variations within previously defined ISDR was not associated with resistance to treatment as previously reported. Four amino acid variations I248V (P=0.03-0.06) within E1, R445K (P=0.02-0.05) and S747T (P=0.03) within E2, I861V (P=0.01) within NS2 which located outside the eISDR may also associate with treatment outcome as identified by a prescreening of variations within 14 HCV 6a full genomes. (c) 2010 Elsevier B.V. All rights reserved.
Cloning and Sequence Analysis of Vibrio halioticoli Genes Encoding Three Types of Polyguluronate Lyase.

PubMed

Sugimura; Sawabe; Ezura

2000-01-01

The alginate lyase-coding genes of Vibrio halioticoli IAM 14596(T), which was isolated from the gut of the abalone Haliotis discus hannai, were cloned using plasmid vector pUC 18, and expressed in Escherichia coli. Three alginate lyase-positive clones, pVHB, pVHC, and pVHE, were obtained, and all clones expressed the enzyme activity specific for polyguluronate. Three genes, alyVG1, alyVG2, and alyVG3, encoding polyguluronate lyase were sequenced: alyVG1 from pVHB was composed of a 1056-bp open reading frame (ORF) encoding 352 amino acid residues; alyVG2 gene from pVHC was composed of a 993-bp ORF encoding 331 amino acid residues; and alyVG3 gene from pVHE was composed of a 705-bp ORF encoding 235 amino acid residues. Comparison of nucleotide and deduced amino acid sequences among AlyVG1, AlyVG2, and AlyVG3 revealed low homologies. The identity value between AlyVG1 and AlyVG2 was 18.7%, and that between AlyVG2 and AlyVG3 was 17.0%. A higher identity value (26.0%) was observed between AlyVG1 and AlyVG3. Sequence comparison among known polyguluronate lyases including AlyVG1, AlyVG2, and AlyVG3 also did not reveal an identical region in these sequences. However, AlyVG1 showed the highest identity value (36.2%) and the highest similarity (73.3%) to AlyA from Klebsiella pneumoniae. A consensus region comprising nine amino acid (YFKAGXYXQ) in the carboxy-terminal region previously reported by Mallisard and colleagues was observed only in AlyVG1 and AlyVG2.
Characterization of HIV Type 1 Envelope Sequence Among Viral Isolates Circulating in the Northern Region of Colombia, South America

PubMed Central

Villarreal, José-Luis; Gutiérrez, Jaime; Palacio, Lucy; Peñuela, Martha; Hernández, Robin; Lemay, Guy

2012-01-01

Abstract To characterize human immunodeficiency virus (HIV-1) strains circulating in the Northern region of Colombia in South America, sequences of the viral envelope C2V3C3 region were obtained from patients with different high-risk practices. Close to 60% of the sequences were predicted to belong to macrophage-tropic viruses, according to the positions of acidic amino acids and putative N-linked glycosylation sites. This is in agreement with the fact that most of the patients were recently diagnosed individuals. Phylogenic analysis then allowed assignment of all 35 samples to subtype B viruses. This same subtype was found in previous studies carried out in other Colombian regions. This study thus expands previous analyses with previously missing data from the Northern region of the country. The number and the length of the sequences examined also help to provide a clearer picture of the prevailing situation of the present HIV epidemics in this country. PMID:22482735
RaptorX server: a resource for template-based protein structure modeling.

PubMed

Källberg, Morten; Margaryan, Gohar; Wang, Sheng; Ma, Jianzhu; Xu, Jinbo

2014-01-01

Assigning functional properties to a newly discovered protein is a key challenge in modern biology. To this end, computational modeling of the three-dimensional atomic arrangement of the amino acid chain is often crucial in determining the role of the protein in biological processes. We present a community-wide web-based protocol, RaptorX server ( http://raptorx.uchicago.edu ), for automated protein secondary structure prediction, template-based tertiary structure modeling, and probabilistic alignment sampling.Given a target sequence, RaptorX server is able to detect even remotely related template sequences by means of a novel nonlinear context-specific alignment potential and probabilistic consistency algorithm. Using the protocol presented here it is thus possible to obtain high-quality structural models for many target protein sequences when only distantly related protein domains have experimentally solved structures. At present, RaptorX server can perform secondary and tertiary structure prediction of a 200 amino acid target sequence in approximately 30 min.
Peptides derivatized with bicyclic quaternary ammonium ionization tags. Sequencing via tandem mass spectrometry.

PubMed

Setner, Bartosz; Rudowska, Magdalena; Klem, Ewelina; Cebrat, Marek; Szewczuk, Zbigniew

2014-10-01

Improving the sensitivity of detection and fragmentation of peptides to provide reliable sequencing of peptides is an important goal of mass spectrometric analysis. Peptides derivatized by bicyclic quaternary ammonium ionization tags: 1-azabicyclo[2.2.2]octane (ABCO) or 1,4-diazabicyclo[2.2.2]octane (DABCO), are characterized by an increased detection sensitivity in electrospray ionization mass spectrometry (ESI-MS) and longer retention times on the reverse-phase (RP) chromatography columns. The improvement of the detection limit was observed even for peptides dissolved in 10 mM NaCl. Collision-induced dissociation tandem mass spectrometry of quaternary ammonium salts derivatives of peptides showed dominant a- and b-type ions, allowing facile sequencing of peptides. The bicyclic ionization tags are stable in collision-induced dissociation experiments, and the resulted fragmentation pattern is not significantly influenced by either acidic or basic amino acid residues in the peptide sequence. Obtained results indicate the general usefulness of the bicyclic quaternary ammonium ionization tags for ESI-MS/MS sequencing of peptides. Copyright © 2014 John Wiley & Sons, Ltd.
Nucleic acid arrays and methods of synthesis

DOEpatents

Sabanayagam, Chandran R.; Sano, Takeshi; Misasi, John; Hatch, Anson; Cantor, Charles

2001-01-01

The present invention generally relates to high density nucleic acid arrays and methods of synthesizing nucleic acid sequences on a solid surface. Specifically, the present invention contemplates the use of stabilized nucleic acid primer sequences immobilized on solid surfaces, and circular nucleic acid sequence templates combined with the use of isothermal rolling circle amplification to thereby increase nucleic acid sequence concentrations in a sample or on an array of nucleic acid sequences.
Identification and biochemical characterization of a GDSL-motif carboxylester hydrolase from Carica papaya latex.

PubMed

Abdelkafi, Slim; Ogata, Hiroyuki; Barouh, Nathalie; Fouquet, Benjamin; Lebrun, Régine; Pina, Michel; Scheirlinckx, Frantz; Villeneuve, Pierre; Carrière, Frédéric

2009-11-01

An esterase (CpEst) showing high specific activities on tributyrin and short chain vinyl esters was obtained from Carica papaya latex after an extraction step with zwitterionic detergent and sonication, followed by gel filtration chromatography. Although the protein could not be purified to complete homogeneity due to its presence in high molecular mass aggregates, a major protein band with an apparent molecular mass of 41 kDa was obtained by SDS-PAGE. This material was digested with trypsin and the amino acid sequences of the tryptic peptides were determined by LC/ESI/MS/MS. These sequences were used to identify a partial cDNA (679 bp) from expressed sequence tags (ESTs) of C. papaya. Based upon EST sequences, a full-length gene was identified in the genome of C. papaya, with an open reading frame of 1029 bp encoding a protein of 343 amino acid residues, with a theoretical molecular mass of 38 kDa. From sequence analysis, CpEst was identified as a GDSL-motif carboxylester hydrolase belonging to the SGNH protein family and four potential N-glycosylation sites were identified. The putative catalytic triad was localised (Ser(35)-Asp(307)-His(310)) with the nucleophile serine being part of the GDSL-motif. A 3D-model of CpEst was built from known X-ray structures and sequence alignments and the catalytic triad was found to be exposed at the surface of the molecule, thus confirming the results of CpEst inhibition by tetrahydrolipstatin suggesting a direct accessibility of the inhibitor to the active site.
Purification, characterization and molecular cloning of chymotrypsin inhibitor peptides from the venom of Burmese Daboia russelii siamensis.

PubMed

Guo, Chun-Teng; McClean, Stephen; Shaw, Chris; Rao, Ping-Fan; Ye, Ming-Yu; Bjourson, Anthony J

2013-05-01

One novel Kunitz BPTI-like peptide designated as BBPTI-1, with chymotrypsin inhibitory activity was identified from the venom of Burmese Daboia russelii siamensis. It was purified by three steps of chromatography including gel filtration, cation exchange and reversed phase. A partial N-terminal sequence of BBPTI-1, HDRPKFCYLPADPGECLAHMRSF was obtained by automated Edman degradation and a Ki value of 4.77nM determined. Cloning of BBPTI-1 including the open reading frame and 3' untranslated region was achieved from cDNA libraries derived from lyophilized venom using a 3' RACE strategy. In addition a cDNA sequence, designated as BBPTI-5, was also obtained. Alignment of cDNA sequences showed that BBPTI-5 exhibited an identical sequence to BBPTI-1 cDNA except for an eight nucleotide deletion in the open reading frame. Gene variations that represented deletions in the BBPTI-5 cDNA resulted in a novel protease inhibitor analog. Amino acid sequence alignment revealed that deduced peptides derived from cloning of their respective precursor cDNAs from libraries showed high similarity and homology with other Kunitz BPTI proteinase inhibitors. BBPTI-1 and BBPTI-5 consist of 60 and 66 amino acid residues respectively, including six conserved cysteine residues. As these peptides have been reported to have influence on the processes of coagulation, fibrinolysis and inflammation, their potential application in biomedical contexts warrants further investigation. Copyright © 2013 Elsevier Inc. All rights reserved.
In silico Analysis for Predicting Fatty Acids of Black Cumin Oil as Inhibitors of P-Glycoprotein.

PubMed

Ali, Babar; Jamal, Qazi Mohd Sajid; Mir, Showkat R; Shams, Saiba; Al-Wabel, Naser A; Kamal, Mohammad A

2015-10-01

Black cumin oil is obtained from the seeds of Nigella sativa L. which belongs to family Ranunculaceae. The seed oil has been reported to possess antitumor, antioxidant, antibacterial, anti-inflammatory, hypoglycemic, central nervous system depressant, antioxidant, and immunostimulatory activities. These bioactivities have been attributed to the fixed oil, volatile oil, or their components. Seed oil consisted of 15 saturated fatty acids (17%) and 17 unsaturated fatty acids (82.9%). Long chain fatty acids and medium chain fatty acids have been reported to increase oral bioavailability of peptides, antibiotics, and other important therapeutic agents. In earlier studies, permeation enhancement and bioenhancement of drugs has been done with black cumin oil. In order to recognize the mechanism of binding of fatty acids to P-glycoprotein (P-gp), linoleic acid, oleic acid, margaric acid, cis-11, 14-eicosadienoic acid, and stearic acid were selected for in silico studies, which were carried out using AutoDock 4.2, based on the Lamarckian genetic algorithm principle. Template search with BLAST and HHblits has been performed against the SWISS-MODEL template library. The target sequence was searched with BLAST against the primary amino acid sequence of P-gp from Rattus norvegicus. The amount of energy needed by linoleic acid, oleic acid, eicosadienoic acid, margaric acid, and stearic acid to bind with P-gp were found to be - 10.60, -10.48, -9.95, -11.92, and - 10.37 kcal/mol, respectively. The obtained data support that all the selected fatty acids have contributed to inhibit P-gp activity thereby enhances the bioavailability of drugs. This study plays a significant role in finding hot spots in P-gp and may offer the further scope of designing potent and specific inhibitors of P-gp. Generation of 3D structure of fatty acid compounds from Black cumin oil and 3D homology modeling of Rat P glycoprotein as a receptor.Rat P-gp structure quality shows 88.5% residues in favored region obtained by Ramchandran plot analysis.Docking analysis revealed that Some amino acids common for all compounds like Ser221, Pro222, Ile224, Gly225, Ser228, Ala229, Lys233, Tyr302, Tyr309, Ile337, Leu338 and Thr341 in the P-gp and ligands binding patterns.Eicosadeinoic acid has highest binding affinity with P-gp as the amount of energy needed to bind with P-gp was lowest (-11.92 kcal/mol). Abbreviations used: P-gp: P-glycoprotein.
Identifying functionally informative evolutionary sequence profiles.

PubMed

Gil, Nelson; Fiser, Andras

2018-04-15

Multiple sequence alignments (MSAs) can provide essential input to many bioinformatics applications, including protein structure prediction and functional annotation. However, the optimal selection of sequences to obtain biologically informative MSAs for such purposes is poorly explored, and has traditionally been performed manually. We present Selection of Alignment by Maximal Mutual Information (SAMMI), an automated, sequence-based approach to objectively select an optimal MSA from a large set of alternatives sampled from a general sequence database search. The hypothesis of this approach is that the mutual information among MSA columns will be maximal for those MSAs that contain the most diverse set possible of the most structurally and functionally homogeneous protein sequences. SAMMI was tested to select MSAs for functional site residue prediction by analysis of conservation patterns on a set of 435 proteins obtained from protein-ligand (peptides, nucleic acids and small substrates) and protein-protein interaction databases. Availability and implementation: A freely accessible program, including source code, implementing SAMMI is available at https://github.com/nelsongil92/SAMMI.git. andras.fiser@einstein.yu.edu. Supplementary data are available at Bioinformatics online.
2-Aminobenzamide and 2-Aminobenzoic Acid as New MALDI Matrices Inducing Radical Mediated In-Source Decay of Peptides and Proteins

NASA Astrophysics Data System (ADS)

Smargiasso, Nicolas; Quinton, Loic; de Pauw, Edwin

2012-03-01

One of the mechanisms leading to MALDI in-source decay (MALDI ISD) is the transfer of hydrogen radicals to analytes upon laser irradiation. Analytes such as peptides or proteins may undergo ISD and this method can therefore be exploited for top-down sequencing. When performed on peptides, radical-induced ISD results in production of c- and z-ions, as also found in ETD and ECD activation. Here, we describe two new compounds which, when used as MALDI matrices, are able to efficiently induce ISD of peptides and proteins: 2-aminobenzamide and 2-aminobenzoic acid. In-source reduction of the disulfide bridge containing peptide Calcitonin further confirmed the radicalar mechanism of the ISD process. ISD of peptides led, in addition to c- and z-ions, to the generation of a-, x-, and y-ions both in positive and in negative ion modes. Finally, good sequence coverage was obtained for the sequencing of myoglobin (17 kDa protein), confirming the effectiveness of both 2-aminobenzamide and 2-aminobenzoic acid as MALDI ISD matrices.
2-Aminobenzamide and 2-aminobenzoic acid as new MALDI matrices inducing radical mediated in-source decay of peptides and proteins.

PubMed

Smargiasso, Nicolas; Quinton, Loic; De Pauw, Edwin

2012-03-01

One of the mechanisms leading to MALDI in-source decay (MALDI ISD) is the transfer of hydrogen radicals to analytes upon laser irradiation. Analytes such as peptides or proteins may undergo ISD and this method can therefore be exploited for top-down sequencing. When performed on peptides, radical-induced ISD results in production of c- and z-ions, as also found in ETD and ECD activation. Here, we describe two new compounds which, when used as MALDI matrices, are able to efficiently induce ISD of peptides and proteins: 2-aminobenzamide and 2-aminobenzoic acid. In-source reduction of the disulfide bridge containing peptide Calcitonin further confirmed the radicalar mechanism of the ISD process. ISD of peptides led, in addition to c- and z-ions, to the generation of a-, x-, and y-ions both in positive and in negative ion modes. Finally, good sequence coverage was obtained for the sequencing of myoglobin (17 kDa protein), confirming the effectiveness of both 2-aminobenzamide and 2-aminobenzoic acid as MALDI ISD matrices.
Molecular characterization and phylogenetic analysis of a yak (Bos grunniens) κ-casein cDNA from lactating mammary gland.

PubMed

Bai, W L; Yin, R H; Dou, Q L; Jiang, W Q; Zhao, S J; Ma, Z J; Luo, G B; Zhao, Z H

2011-04-01

κ-Casein is one of the major proteins in the milk of mammals. It plays an important role in determining the size and specific function of milk micelles. We have previously identified and characterized a genetic variant of yak κ-casein by evaluating genomic DNA. Here, we isolate and characterize a yak κ-casein cDNA harboring the full-length open reading frame (ORF) from lactating mammary gland. Total RNA was extracted from mammary tissue of lactating female yak, and the κ-casein cDNA were synthesized by RT-PCR technique, then cloned and sequenced. The obtained cDNA of 660-bp contained an ORF sufficient to encode the entire amino acid sequence of κ-casein precursor protein consisting of 190 amino acids with a signal peptide of 21 amino acids. Yak κ-casein has a predicted molecular mass of 19,006.588 Da with a calculated isoelectric point of 7.245. Compared with the corresponding sequences in GenBank of cattle, buffalo, sheep, goat, Arabian camel, horse, and rabbit, yak κ-casein sequence had identity of 64.76-98.78% in cDNA, and identity of 44.79-98.42% and similarity of 53.65-98.42% in deduced amino acids, revealing a high homology with the other livestock species. Based on κ-casein cDNA sequences, the phylogenetic analysis indicated that yak κ-casein had a close relationship with that of cattle. This work might be useful in the genetic engineering researches for yak κ-casein.
Score distributions of gapped multiple sequence alignments down to the low-probability tail

NASA Astrophysics Data System (ADS)

Fieth, Pascal; Hartmann, Alexander K.

2016-08-01

Assessing the significance of alignment scores of optimally aligned DNA or amino acid sequences can be achieved via the knowledge of the score distribution of random sequences. But this requires obtaining the distribution in the biologically relevant high-scoring region, where the probabilities are exponentially small. For gapless local alignments of infinitely long sequences this distribution is known analytically to follow a Gumbel distribution. Distributions for gapped local alignments and global alignments of finite lengths can only be obtained numerically. To obtain result for the small-probability region, specific statistical mechanics-based rare-event algorithms can be applied. In previous studies, this was achieved for pairwise alignments. They showed that, contrary to results from previous simple sampling studies, strong deviations from the Gumbel distribution occur in case of finite sequence lengths. Here we extend the studies to multiple sequence alignments with gaps, which are much more relevant for practical applications in molecular biology. We study the distributions of scores over a large range of the support, reaching probabilities as small as 10-160, for global and local (sum-of-pair scores) multiple alignments. We find that even after suitable rescaling, eliminating the sequence-length dependence, the distributions for multiple alignment differ from the pairwise alignment case. Furthermore, we also show that the previously discussed Gaussian correction to the Gumbel distribution needs to be refined, also for the case of pairwise alignments.
Structural and genetic analysis of a mutant of Rhodobacter sphaeroides WS8 deficient in hook length control.

PubMed Central

González-Pedrajo, B; Ballado, T; Campos, A; Sockett, R E; Camarena, L; Dreyfus, G

1997-01-01

Motility in the photosynthetic bacterium Rhodobacter sphaeroides is achieved by the unidirectional rotation of a single subpolar flagellum. In this study, transposon mutagenesis was used to obtain nonmotile flagellar mutants from this bacterium. We report here the isolation and characterization of a mutant that shows a polyhook phenotype. Morphological characterization of the mutant was done by electron microscopy. Polyhooks were obtained by shearing and were used to purify the hook protein monomer (FlgE). The apparent molecular mass of the hook protein was 50 kDa. N-terminal amino acid sequencing and comparisons with the hook proteins of other flagellated bacteria indicated that the Rhodobacter hook protein has consensus sequences common to axial flagellar components. A 25-kb fragment from an R. sphaeroides WS8 cosmid library restored wild-type flagellation and motility to the mutant. Using DNA adjacent to the inserted transposon as a probe, we identified a 4.6-kb SalI restriction fragment that contained the gene responsible for the polyhook phenotype. Nucleotide sequence analysis of this region revealed an open reading frame with a deduced amino acid sequence that was 23.4% identical to that of FliK of Salmonella typhimurium, the polypeptide responsible for hook length control in that enteric bacterium. The relevance of a gene homologous to fliK in the uniflagellated bacterium R. sphaeroides is discussed. PMID:9352903
Mitochondrial DNA Sequence Divergence among Meloidogyne incognita, Romanomermis culicivorax, Ascaris suum, and Caenorhabditis elegans

PubMed Central

Powers, T. O.; Harris, T. S.; Hyman, B. C.

1993-01-01

Mitochondrial DNA sequences were obtained from the NADH dehydrogenase subunit 3 (ND3), large rRNA, and cytochrome b genes from Meloidogyne incognita and Romanomermis culicivorax. Both species show considerable genetic distance within these same genes when compared with Caenorhabditis elegans or Ascaris suum, two species previously analyzed. Caenorhabditis, Ascaris, and Meloidogyne were selected as representatives of three subclasses in the nematode class Secernentea: Rhabditia, Spiruria, and Diplogasteria, respectively. Romanomermis served as a representative out-group of the class Adenophorea. The divergence between the phytoparasitic lineage (represented by Meloidogyne) and the three other species is so great that virtually every variable position in these genes appears to have accumulated multiple mutations, obscuring the phylogenetic information obtainable from these comparisons. The 39 and 42% amino acid similarity between the M. incognita and C. elegans ND3 and cytochrome b coding sequences, respectively, are approximately the same as those of C. elegans-mouse comparisons for the same genes (26 and 44%). This discovery calls into question the feasibility of employing cloned C. elegans probes as reagents to isolate phytoparasitic nematode genes. The genetic distance between the phytoparasitic nematode lineage and C. elegans markedly contrasts with the 79% amino acid similarity between C. elegans and A. suum for the same sequences. The molecular data suggest that Caenorhabditis and Ascaris belong to the same subclass. PMID:19279810

Generation of a glucose de-repressed mutant of Trichoderma reesei using disparity mutagenesis.

PubMed

Iwakuma, Hidekazu; Koyama, Yoshiyuki; Miyachi, Ayako; Nasukawa, Masashi; Matsumoto, Hitoshi; Yano, Shuntaro; Ogihara, Jun; Kasumi, Takafumi

2016-01-01

We obtained a novel glucose de-repressed mutant of Trichoderma reesei using disparity mutagenesis. A plasmid containing DNA polymerase δ lacking proofreading activity, and AMAI, an autonomously replicating sequence was introduced into T. reesei ATCC66589. The rate of mutation evaluated with 5-fluoroorotic acid resistance was approximately 30-fold higher than that obtained by UV irradiation. The transformants harboring incompetent DNA polymerase δ were then selected on 2-deoxyglucose agar plates with hygromycin B. The pNP-lactoside hydrolyzing activities of mutants were 2 to 5-fold higher than the parent in liquid medium containing glucose. Notably, the amino acid sequence of cre1, a key gene involved in glucose repression, was identical in the mutant and parent strains, and further, the cre1 expression levels was not abolished in the mutant. Taken together, these results demonstrate that the strains of T. reesei generated by disparity mutagenesis are glucose de-repressed variants that contain mutations in yet-unidentified factors other than cre1.
Prediction of delayed retention of antibodies in hydrophobic interaction chromatography from sequence using machine learning.

PubMed

Jain, Tushar; Boland, Todd; Lilov, Asparouh; Burnina, Irina; Brown, Michael; Xu, Yingda; Vásquez, Maximiliano

2017-12-01

The hydrophobicity of a monoclonal antibody is an important biophysical property relevant for its developability into a therapeutic. In addition to characterizing heterogeneity, Hydrophobic Interaction Chromatography (HIC) is an assay that is often used to quantify the hydrophobicity of an antibody to assess downstream risks. Earlier studies have shown that retention times in this assay can be correlated to amino-acid or atomic propensities weighted by the surface areas obtained from protein 3-dimensional structures. The goal of this study is to develop models to enable prediction of delayed HIC retention times directly from sequence. We utilize the randomforest machine learning approach to estimate the surface exposure of amino-acid side-chains in the variable region directly from the antibody sequence. We obtain mean-absolute errors of 4.6% for the prediction of surface exposure. Using experimental HIC data along with the estimated surface areas, we derive an amino-acid propensity scale that enables prediction of antibodies likely to have delayed retention times in the assay. We achieve a cross-validation Area Under Curve of 0.85 for the Receiver Operating Characteristic curve of our model. The low computational expense and high accuracy of this approach enables real-time assessment of hydrophobic character to enable prioritization of antibodies during the discovery process and rational engineering to reduce hydrophobic liabilities. Structure data, aligned sequences, experimental data and prediction scores for test-cases, and R scripts used in this work are provided as part of the Supplementary Material. tushar.jain@adimab.com. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Identification of branched-chain amino acid aminotransferases active towards (R)-(+)-1-phenylethylamine among PLP fold type IV transaminases.

PubMed

Bezsudnova, Ekaterina Yu; Dibrova, Daria V; Nikolaeva, Alena Yu; Rakitina, Tatiana V; Popov, Vladimir O

2018-04-10

New class IV transaminases with activity towards L-Leu, which is typical of branched-chain amino acid aminotransferases (BCAT), and with activity towards (R)-(+)-1-phenylethylamine ((R)-PEA), which is typical of (R)-selective (R)-amine:pyruvate transaminases, were identified by bioinformatics analysis, obtained in recombinant form, and analyzed. The values of catalytic activities in the reaction with L-Leu and (R)-PEA are comparable to those measured for characteristic transaminases with the corresponding specificity. Earlier, (R)-selective class IV transaminases were found to be active, apart from (R)-PEA, only with some other (R)-primary amines and D-amino acids. Sequences encoding new transaminases with mixed type of activity were found by searching for changes in the conserved motifs of sequences of BCAT by different bioinformatics tools. Copyright © 2018 Elsevier B.V. All rights reserved.
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2011 CFR

2011-07-01

... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...
Sequence-dependent DNA deformability studied using molecular dynamics simulations.

PubMed

Fujii, Satoshi; Kono, Hidetoshi; Takenaka, Shigeori; Go, Nobuhiro; Sarai, Akinori

2007-01-01

Proteins recognize specific DNA sequences not only through direct contact between amino acids and bases, but also indirectly based on the sequence-dependent conformation and deformability of the DNA (indirect readout). We used molecular dynamics simulations to analyze the sequence-dependent DNA conformations of all 136 possible tetrameric sequences sandwiched between CGCG sequences. The deformability of dimeric steps obtained by the simulations is consistent with that by the crystal structures. The simulation results further showed that the conformation and deformability of the tetramers can highly depend on the flanking base pairs. The conformations of xATx tetramers show the most rigidity and are not affected by the flanking base pairs and the xYRx show by contrast the greatest flexibility and change their conformations depending on the base pairs at both ends, suggesting tetramers with the same central dimer can show different deformabilities. These results suggest that analysis of dimeric steps alone may overlook some conformational features of DNA and provide insight into the mechanism of indirect readout during protein-DNA recognition. Moreover, the sequence dependence of DNA conformation and deformability may be used to estimate the contribution of indirect readout to the specificity of protein-DNA recognition as well as nucleosome positioning and large-scale behavior of nucleic acids.
Identification of a Herbal Powder by Deoxyribonucleic Acid Barcoding and Structural Analyses.

PubMed

Sheth, Bhavisha P; Thaker, Vrinda S

2015-10-01

Authentic identification of plants is essential for exploiting their medicinal properties as well as to stop the adulteration and malpractices with the trade of the same. To identify a herbal powder obtained from a herbalist in the local vicinity of Rajkot, Gujarat, using deoxyribonucleic acid (DNA) barcoding and molecular tools. The DNA was extracted from a herbal powder and selected Cassia species, followed by the polymerase chain reaction (PCR) and sequencing of the rbcL barcode locus. Thereafter the sequences were subjected to National Center for Biotechnology Information (NCBI) basic local alignment search tool (BLAST) analysis, followed by the protein three-dimension structure determination of the rbcL protein from the herbal powder and Cassia species namely Cassia fistula, Cassia tora and Cassia javanica (sequences obtained in the present study), Cassia Roxburghii, and Cassia abbreviata (sequences retrieved from Genbank). Further, the multiple and pairwise structural alignment were carried out in order to identify the herbal powder. The nucleotide sequences obtained from the selected species of Cassia were submitted to Genbank (Accession No. JX141397, JX141405, JX141420). The NCBI BLAST analysis of the rbcL protein from the herbal powder showed an equal sequence similarity (with reference to different parameters like E value, maximum identity, total score, query coverage) to C. javanica and C. roxburghii. In order to solve the ambiguities of the BLAST result, a protein structural approach was implemented. The protein homology models obtained in the present study were submitted to the protein model database (PM0079748-PM0079753). The pairwise structural alignment of the herbal powder (as template) and C. javanica and C. roxburghii (as targets individually) revealed a close similarity of the herbal powder with C. javanica. A strategy as used here, incorporating the integrated use of DNA barcoding and protein structural analyses could be adopted, as a novel rapid and economic procedure, especially in cases when protein coding loci are considered. Authentic identification of plants is essential for exploiting their medicinal properties as well as to stop the adulteration and malpractices with the trade of the same. A herbal powder was obtained from a herbalist in the local vicinity of Rajkot, Gujarat. An integrated approach using DNA barcoding and structural analyses was carried out to identify the herbal powder. The herbal powder was identified as Cassia javanica L.
Mammalian evolution: timing and implications from using the LogDeterminant transform for proteins of differing amino acid composition.

PubMed

Penny, D; Hasegawa, M; Waddell, P J; Hendy, M D

1999-03-01

We explore the tree of mammalian mtDNA sequences, using particularly the LogDet transform on amino acid sequences, the distance Hadamard transform, and the Closest Tree selection criterion. The amino acid composition of different species show significant differences, even within mammals. After compensating for these differences, nearest-neighbor bootstrap results suggest that the tree is locally stable, though a few groups show slightly greater rearrangements when a large proportion of the constant sites are removed. Many parts of the trees we obtain agree with those on published protein ML trees. Interesting results include a preference for rodent monophyly. The detection of a few alternative signals to those on the optimal tree were obtained using the distance Hadamard transform (with results expressed as a Lento plot). One rearrangement suggested was the interchange of the position of primates and rodents on the optimal tree. The basic stability of the tree, combined with two calibration points (whale/cow and horse/rhinoceros), together with a distant secondary calibration from the mammal/bird divergence, allows inferences of the times of divergence of putative clades. Allowing for sampling variances due to finite sequence length, most major divergences amongst lineages leading to modern orders, appear to occur well before the Cretaceous/Tertiary (K/T) boundary. Implications arising from these early divergences are discussed, particularly the possibility of competition between the small dinosaurs and the new mammal clades.
Production, purification, sequencing and activity spectra of mutacins D-123.1 and F-59.1

PubMed Central

2011-01-01

Background The increase in bacterial resistance to antibiotics impels the development of new anti-bacterial substances. Mutacins (bacteriocins) are small antibacterial peptides produced by Streptococcus mutans showing activity against bacterial pathogens. The objective of the study was to produce and characterise additional mutacins in order to find new useful antibacterial substances. Results Mutacin F-59.1 was produced in liquid media by S. mutans 59.1 while production of mutacin D-123.1 by S. mutans 123.1 was obtained in semi-solid media. Mutacins were purified by hydrophobic chromatography. The amino acid sequences of the mutacins were obtained by Edman degradation and their molecular mass was determined by mass spectrometry. Mutacin F-59.1 consists of 25 amino acids, containing the YGNGV consensus sequence of pediocin-like bacteriocins with a molecular mass calculated at 2719 Da. Mutacin D-123.1 has an identical molecular mass (2364 Da) with the same first 9 amino acids as mutacin I. Mutacins D-123.1 and F-59.1 have wide activity spectra inhibiting human and food-borne pathogens. The lantibiotic mutacin D-123.1 possesses a broader activity spectrum than mutacin F-59.1 against the bacterial strains tested. Conclusion Mutacin F-59.1 is the first pediocin-like bacteriocin identified and characterised that is produced by Streptococcus mutans. Mutacin D-123.1 appears to be identical to mutacin I previously identified in different strains of S. mutans. PMID:21477375
Production, purification, sequencing and activity spectra of mutacins D-123.1 and F-59.1.

PubMed

Nicolas, Guillaume G; LaPointe, Gisèle; Lavoie, Marc C

2011-04-10

The increase in bacterial resistance to antibiotics impels the development of new anti-bacterial substances. Mutacins (bacteriocins) are small antibacterial peptides produced by Streptococcus mutans showing activity against bacterial pathogens. The objective of the study was to produce and characterise additional mutacins in order to find new useful antibacterial substances. Mutacin F-59.1 was produced in liquid media by S. mutans 59.1 while production of mutacin D-123.1 by S. mutans 123.1 was obtained in semi-solid media. Mutacins were purified by hydrophobic chromatography. The amino acid sequences of the mutacins were obtained by Edman degradation and their molecular mass was determined by mass spectrometry. Mutacin F-59.1 consists of 25 amino acids, containing the YGNGV consensus sequence of pediocin-like bacteriocins with a molecular mass calculated at 2719 Da. Mutacin D-123.1 has an identical molecular mass (2364 Da) with the same first 9 amino acids as mutacin I. Mutacins D-123.1 and F-59.1 have wide activity spectra inhibiting human and food-borne pathogens. The lantibiotic mutacin D-123.1 possesses a broader activity spectrum than mutacin F-59.1 against the bacterial strains tested. Mutacin F-59.1 is the first pediocin-like bacteriocin identified and characterised that is produced by Streptococcus mutans. Mutacin D-123.1 appears to be identical to mutacin I previously identified in different strains of S. mutans.
Identification, Classification, and Phylogeny of the Pathogenic Species Exophiala jeanselmei and Related Species by Mitochondrial Cytochrome b Gene Analysis

PubMed Central

Wang, Li; Yokoyama, Koji; Miyaji, Makoto; Nishimura, Kazuko

2001-01-01

We analyzed a 402-bp sequence of the mitochondrial cytochrome b gene of 34 strains of Exophiala jeanselmei and 16 strains representing 12 related species. The strains of E. jeanselmei were classified into 20 DNA types and 17 amino acid types. The differences between these strains were found in 1 to 60 nucleotides and 1 to 17 amino acids. On the basis of the identities and similarities of nucleotide and amino acid sequences, some strains were reidentified: i.e., two strains of E. jeanselmei var. hetermorpha and one strain of E. castellanii as E. dermatitidis (including the type strain), three strains of E. jeanselmei as E. jeanselmei var. lecanii-corni (including the type strain), three strains of E. jeanselmei as E. bergeri (including the type strain), seven strains of E. jeanselmei as E. pisciphila (including the type strain), seven strains of E. jeanselmei as E. jeanselmei var. jeanselmei (including the type strain), one strain of E. jeanselmei as Fonsecaea pedrosoi (including the type strain), and one strain of E. jeanselmei as E. spinifera (including the type strain). Some E. jeanselmei strains showed distinct nucleotide and amino acid sequences. The amino-acid-based UPGMA (unweighted pair group method with the arithmetic mean) tree exhibited nearly the same topology as those of the DNA-based trees obtained by neighbor joining, maximum parsimony, and maximum likelihood methods. PMID:11724862
Isolation, Purification and Molecular Mechanism of a Peanut Protein-Derived ACE-Inhibitory Peptide

PubMed Central

Shi, Aimin; Liu, Hongzhi; Liu, Li; Hu, Hui; Wang, Qiang; Adhikari, Benu

2014-01-01

Although a number of bioactive peptides are capable of angiotensin I-converting enzyme (ACE) inhibitory effects, little is known regarding the mechanism of peanut peptides using molecular simulation. The aim of this study was to obtain ACE inhibiting peptide from peanut protein and provide insight on the molecular mechanism of its ACE inhibiting action. Peanut peptides having ACE inhibitory activity were isolated through enzymatic hydrolysis and ultrafiltration. Further chromatographic fractionation was conducted to isolate a more potent peanut peptide and its antihypertensive activity was analyzed through in vitro ACE inhibitory tests and in vivo animal experiments. MALDI-TOF/TOF-MS was used to identify its amino acid sequence. Mechanism of ACE inhibition of P8 was analyzed using molecular docking and molecular dynamics simulation. A peanut peptide (P8) having Lys-Leu-Tyr-Met-Arg-Pro amino acid sequence was obtained which had the highest ACE inhibiting activity of 85.77% (half maximal inhibitory concentration (IC50): 0.0052 mg/ml). This peanut peptide is a competitive inhibitor and show significant short term (12 h) and long term (28 days) antihypertensive activity. Dynamic tests illustrated that P8 can be successfully docked into the active pocket of ACE and can be combined with several amino acid residues. Hydrogen bond, electrostatic bond and Pi-bond were found to be the three main interaction contributing to the structural stability of ACE-peptide complex. In addition, zinc atom could form metal-carboxylic coordination bond with Tyr, Met residues of P8, resulting into its high ACE inhibiting activity. Our finding indicated that the peanut peptide (P8) having a Lys-Leu-Tyr-Met-Arg-Pro amino acid sequence can be a promising candidate for functional foods and prescription drug aimed at control of hypertension. PMID:25347076
Properties and cDNA cloning of antihemorrhagic factors in sera of Chinese and Japanese mamushi (Gloydius blomhoffi).

PubMed

Aoki, Narumi; Tsutsumi, Kadzuyo; Deshimaru, Masanobu; Terada, Shigeyuki

2008-02-01

An antihemorrhagic protein has been isolated from the serum of Chinese mamushi (Gloydius blomhoffi brevicaudus) by using a combination of ethanol precipitation and a reverse-phase high-performance liquid chromatography (HPLC) on a C8 column. This protein-designated Chinese mamushi serum factor (cMSF)-suppressed mamushi venom-induced hemorrhage in a dose-dependent manner. It had no effect on trypsin, chymotrypsin, thermolysin, and papain but inhibited the proteinase activities of several snake venom metalloproteinases (SVMPs) including hemorrhagic enzymes isolated from the venoms of mamushi and habu (Trimeresurus flavoviridis). A similar protein (Japanese MSF, jMSF) with antihemorrhagic activity has also been purified from the sera of Japanese mamushi (G. blomhoffi). The N-terminal 70 and 51 residues of the intact cMSF and jMSF were directly analyzed; a similarity between the sequences of two MSFs to that of antihemorrhagic protein (HSF) from habu serum was noticed. To obtain the complete amino acid sequences of MSFs, cDNAs encoding these proteins were cloned from the liver mRNA of Chinese and Japanese vipers based on their N-terminal amino acid sequences. The mature forms of both MSFs consisted of 305 amino acids with a 19-residue signal sequence, and a unique 17-residue deletion was detected in their His-rich domains.
Complete amino acid sequences of the ribosomal proteins L25, L29 and L31 from the archaebacterium Halobacterium marismortui.

PubMed

Hatakeyama, T; Kimura, M

1988-03-15

Ribosomal proteins were extracted from 50S ribosomal subunits of the archaebacterium Halobacterium marismortui by decreasing the concentration of Mg2+ and K+, and the proteins were separated and purified by ion-exchange column chromatography on DEAE-cellulose. Ten proteins were purified to homogeneity and three of these proteins were subjected to sequence analysis. The complete amino acid sequences of the ribosomal proteins L25, L29 and L31 were established by analyses of the peptides obtained by enzymatic digestion with trypsin, Staphylococcus aureus protease, chymotrypsin and lysylendopeptidase. Proteins L25, L29 and L31 consist of 84, 115 and 95 amino acid residues with the molecular masses of 9472 Da, 12293 Da and 10418 Da respectively. A comparison of their sequences with those of other large-ribosomal-subunit proteins from other organisms revealed that protein L25 from H. marismortui is homologous to protein L23 from Escherichia coli (34.6%), Bacillus stearothermophilus (41.8%), and tobacco chloroplasts (16.3%) as well as to protein L25 from yeast (38.0%). Proteins L29 and L31 do not appear to be homologous to any other ribosomal proteins whose structures are so far known.
Cloning of a coconut endosperm cDNA encoding a 1-acyl-sn-glycerol-3-phosphate acyltransferase that accepts medium-chain-length substrates.

PubMed Central

Knutzon, D S; Lardizabal, K D; Nelsen, J S; Bleibaum, J L; Davies, H M; Metz, J G

1995-01-01

Immature coconut (Cocos nucifera) endosperm contains a 1-acyl-sn-glycerol-3-phosphate acyltransferase (LPAAT) activity that shows a preference for medium-chain-length fatty acyl-coenzyme A substrates (H.M. Davies, D.J. Hawkins, J.S. Nelsen [1995] Phytochemistry 39:989-996). Beginning with solubilized membrane preparations, we have used chromatographic separations to identify a polypeptide with an apparent molecular mass of 29 kD, whose presence in various column fractions correlates with the acyltransferase activity detected in those same fractions. Amino acid sequence data obtained from several peptides generated from this protein were used to isolate a full-length clone from a coconut endosperm cDNA library. Clone pCGN5503 contains a 1325-bp cDNA insert with an open reading frame encoding a 308-amino acid protein with a calculated molecular mass of 34.8 kD. Comparison of the deduced amino acid sequence of pCGN5503 to sequences in the data banks revealed significant homology to other putative LPAAT sequences. Expression of the coconut cDNA in Escherichia coli conferred upon those cells a novel LPAAT activity whose substrate activity profile matched that of the coconut enzyme. PMID:8552723
Classifying Membrane Proteins in the Proteome by Using Artificial Neural Networks Based on the Preferential Parameters of Amino Acids

NASA Astrophysics Data System (ADS)

Bose, Subrata K.; Browne, Antony; Kazemian, Hassan; White, Kenneth

Membrane proteins (MPs) are large set of biological macromolecules that play a fundamental role in physiology and pathophysiology for survival. From a pharma-economical perspective, though it is the fact that MPs constitute ˜75% of possible targets for novel drugs but MPs are one of the most understudied groups of proteins in biochemical research. This is mainly because of the technical difficulties of obtaining structural information about trans-membrane regions (these are small sequences that crossways the bilayer lipid membrane). It is quite useful to predict the location of transmembrane segments down the sequence, since these are the elementary structural building blocks defining their topology. There have been several attempts over the last 20 years to develop tools for predicting membrane-spanning regions but current tools are far away from achieving a considerable reliability in prediction. This study aims to exploit the knowledge and current understanding in the field of artificial neural networks (ANNs) in particular data representation through the development of a system to identify and predict membrane-spanning regions by analysing primary amino acids sequence. In this paper we present a novel neural network (NNs) architecture and algorithms for predicting membrane spanning regions from primary amino acids sequences by using their preference parameters.
The perils of pathogen discovery: origin of a novel parvovirus-like hybrid genome traced to nucleic acid extraction spin columns.

PubMed

Naccache, Samia N; Greninger, Alexander L; Lee, Deanna; Coffey, Lark L; Phan, Tung; Rein-Weston, Annie; Aronsohn, Andrew; Hackett, John; Delwart, Eric L; Chiu, Charles Y

2013-11-01

Next-generation sequencing was used for discovery and de novo assembly of a novel, highly divergent DNA virus at the interface between the Parvoviridae and Circoviridae. The virus, provisionally named parvovirus-like hybrid virus (PHV), is nearly identical by sequence to another DNA virus, NIH-CQV, previously detected in Chinese patients with seronegative (non-A-E) hepatitis. Although we initially detected PHV in a wide range of clinical samples, with all strains sharing ∼99% nucleotide and amino acid identity with each other and with NIH-CQV, the exact origin of the virus was eventually traced to contaminated silica-binding spin columns used for nucleic acid extraction. Definitive confirmation of the origin of PHV, and presumably NIH-CQV, was obtained by in-depth analyses of water eluted through contaminated spin columns. Analysis of environmental metagenome libraries detected PHV sequences in coastal marine waters of North America, suggesting that a potential association between PHV and diatoms (algae) that generate the silica matrix used in the spin columns may have resulted in inadvertent viral contamination during manufacture. The confirmation of PHV/NIH-CQV as laboratory reagent contaminants and not bona fide infectious agents of humans underscores the rigorous approach needed to establish the validity of new viral genomes discovered by next-generation sequencing.
Genotypic diversity of stress response in Lactobacillus plantarum, Lactobacillus paraplantarum and Lactobacillus pentosus.

PubMed

Ricciardi, Annamaria; Parente, Eugenio; Guidone, Angela; Ianniello, Rocco Gerardo; Zotta, Teresa; Abu Sayem, S M; Varcamonti, Mario

2012-07-02

Lactobacillus plantarum, Lactobacillus pentosus and Lactobacillus paraplantarum are three closely related species which are widespread in food and non-food environments, and are important as starter bacteria or probiotics. In order to evaluate the phenotypic diversity of stress tolerance in the L. plantarum group and the ability to mount an adaptive heat shock response, the survival of exponential and stationary phase and of heat adapted exponential phase cells of six L. plantarum subsp. plantarum, one L. plantarum subsp. argentoratensis, one L. pentosus and two L. paraplantarum strains selected in a previous work upon exposure to oxidative, heat, detergent, starvation and acid stresses was compared to that of the L. plantarum WCFS1 strain. Furthermore, to evaluate the genotypic diversity in stress response genes, ten genes (encoding for chaperones DnaK, GroES and GroEL, regulators CtsR, HrcA and CcpA, ATPases/proteases ClpL, ClpP, ClpX and protease FtsH) were amplified using primers derived from the WCFS1 genome sequence and submitted to restriction with one or two endonucleases. The results were compared by univariate and multivariate statistical methods. In addition, the amplicons for hrcA and ctsR were sequenced and compared by multiple sequence alignment and polymorphism analysis. Although there was evidence of a generalized stress response in the stationary phase, with increase of oxidative, heat, and, to a lesser extent, starvation stress tolerance, and for adaptive heat stress response, with increased tolerance to heat, acid and detergent, different growth phases and adaptation patterns were found. Principal component analysis showed that while heat, acid and detergent stresses respond similarly to growth phase and adaptation, tolerance to oxidative and starvation stresses implies completely unrelated mechanisms. A dendrogram obtained using the data from multilocus restriction typing (MLRT) of stress response genes clearly separated two groups of L. plantarum strains from the other species but there was no correlation between genotypic grouping and grouping obtained on the basis of the stress response pattern, nor with the phylograms obtained from hrcA and ctsR sequences. Differences in sequence in L. plantarum strains were mostly due to single nucleotide polymorphisms with a high frequency of synonymous nucleotide changes and, while hrcA was characterized by an excess of low frequency polymorphism, very low diversity was found in ctsR sequences. Sequence alignment of hrcA allowed a correct discrimination of the strains at the species level, thus confirming the relevance of stress response genes for taxonomy. Copyright © 2012 Elsevier B.V. All rights reserved.
Cloning and characterisation of cDNA sequences encoding for anti-lipopolysaccharide factors (ALFs) in Brazilian palaemonid and penaeid shrimps.

PubMed

Rosa, Rafael Diego; Stoco, Patricia Hermes; Barracco, Margherita Anna

2008-11-01

Anti-lipopolysaccharide factors (ALFs) are antimicrobial peptides found in limulids and crustaceans that have a potent and broad range of antimicrobial activity. We report here the identification and molecular characterisation of new sequences encoding for ALFs in the haemocytes of the freshwater prawn Macrobrachium olfersi and also in two Brazilian penaeid species, Farfantepenaeus paulensis and Litopenaeus schmitti. All obtained sequences encoded for highly cationic peptides containing two conserved cysteine residues flanking a putative LPS-binding domain. They exhibited a significant amino acid similarity with crustacean and limulid ALF sequences, especially with those of penaeid shrimps. This is the first identification of ALF in a freshwater prawn.
Isolation and distribution of a novel iron-oxidizing crenarchaeon from acidic geothermal springs in Yellowstone National Park.

PubMed

Kozubal, M; Macur, R E; Korf, S; Taylor, W P; Ackerman, G G; Nagy, A; Inskeep, W P

2008-02-01

Novel thermophilic crenarchaea have been observed in Fe(III) oxide microbial mats of Yellowstone National Park (YNP); however, no definitive work has identified specific microorganisms responsible for the oxidation of Fe(II). The objectives of the current study were to isolate and characterize an Fe(II)-oxidizing member of the Sulfolobales observed in previous 16S rRNA gene surveys and to determine the abundance and distribution of close relatives of this organism in acidic geothermal springs containing high concentrations of dissolved Fe(II). Here we report the isolation and characterization of the novel, Fe(II)-oxidizing, thermophilic, acidophilic organism Metallosphaera sp. strain MK1 obtained from a well-characterized acid-sulfate-chloride geothermal spring in Norris Geyser Basin, YNP. Full-length 16S rRNA gene sequence analysis revealed that strain MK1 exhibits only 94.9 to 96.1% sequence similarity to other known Metallosphaera spp. and less than 89.1% similarity to known Sulfolobus spp. Strain MK1 is a facultative chemolithoautotroph with an optimum pH range of 2.0 to 3.0 and an optimum temperature range of 65 to 75 degrees C. Strain MK1 grows optimally on pyrite or Fe(II) sorbed onto ferrihydrite, exhibiting doubling times between 10 and 11 h under aerobic conditions (65 degrees C). The distribution and relative abundance of MK1-like 16S rRNA gene sequences in 14 acidic geothermal springs containing Fe(III) oxide microbial mats were evaluated. Highly related MK1-like 16S rRNA gene sequences (>99% sequence similarity) were consistently observed in Fe(III) oxide mats at temperatures ranging from 55 to 80 degrees C. Quantitative PCR using Metallosphaera-specific primers confirmed that organisms highly similar to strain MK1 comprised up to 40% of the total archaeal community at selected sites. The broad distribution of highly related MK1-like 16S rRNA gene sequences in acidic Fe(III) oxide microbial mats is consistent with the observed characteristics and growth optima of Metallosphaera-like strain MK1 and emphasizes the importance of this newly described taxon in Fe(II) chemolithotrophy in acidic high-temperature environments of YNP.
Solid phase sequencing of double-stranded nucleic acids

DOEpatents

Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

2002-01-01

This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probe comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.

Nucleotide sequence analysis establishes the role of endogenous murine leukemia virus DNA segments in formation of recombinant mink cell focus-forming murine leukemia viruses.

PubMed Central

Khan, A S

1984-01-01

The sequence of 363 nucleotides near the 3' end of the pol gene and 564 nucleotides from the 5' terminus of the env gene in an endogenous murine leukemia viral (MuLV) DNA segment, cloned from AKR/J mouse DNA and designated as A-12, was obtained. For comparison, the nucleotide sequence in an analogous portion of AKR mink cell focus-forming (MCF) 247 MuLV provirus was also determined. Sequence features unique to MCF247 MuLV DNA in the 3' pol and 5' env regions were identified by comparison with nucleotide sequences in analogous regions of NFS -Th-1 xenotropic and AKR ecotropic MuLV proviruses. These included (i) an insertion of 12 base pairs encoding four amino acids located 60 base pairs from the 3' terminus of the pol gene and immediately preceding the env gene, (ii) the deletion of 12 base pairs (encoding four amino acids) and the insertion of 3 base pairs (encoding one amino acid) in the 5' portion of the env gene, and (iii) single base substitutions resulting in 2 MCF247 -specific amino acids in the 3' pol and 23 in the 5' env regions. Nucleotide sequence comparison involving the 3' pol and 5' env regions of AKR MCF247 , NFS xenotropic, and AKR ecotropic MuLV proviruses with the cloned endogenous MuLV DNA indicated that MCF247 proviral DNA sequences were conserved in the cloned endogenous MuLV proviral segment. In fact, total nucleotide sequence identity existed between the endogenous MuLV DNA and the MCF247 MuLV provirus in the 3' portion of the pol gene. In the 5' env region, only 4 of 564 nucleotides were different, resulting in three amino acid changes between AKR MCF247 MuLV DNA and the endogenous MuLV DNA present in clone A-12. In addition, nucleotide sequence comparison indicated that Moloney-and Friend-MCF MuLVs were also highly related in the 3' pol and 5' env regions to the cloned endogenous MuLV DNA. These results establish the role of endogenous MuLV DNA segments in generation of recombinant MCF viruses. PMID:6328017
Comparison of ZP3 protein sequences among vertebrate species: to obtain a consensus sequence for immunocontraception.

PubMed

Zhu, X; Naz, R K

1999-03-01

The deduced ZP3 amino acid (aa) sequences of 13 vertebrate species namely mouse, hamster, rabbit, pig, porcine, cow, dog, cat, human, bonnet, marmoset, carp, and frog were compared using the PILEUP and PRETTY alignment programs (GCG, Wisconsin, USA). The published aa sequences obtained from 13 vertebrate species indicated the overall evolutionarily conservation in the N-terminus, central region, and C-terminus of the ZP3 polypeptide. More variations of ZP3 polypeptide sequences were seen in the alignments of carp and frog from the 11 mammalian species making the leader sequence more prominent. The canonical furin proteolytic processing signal at the C-terminus was found in all the ZP3 polypeptide sequences except of carp and frog. In the central region, the ZP3 deduced aa sequences of all the 13 vertebrate species aligned well, and six relatively conserved sequences were found. There are 11 conserved cysteine residues in the central region across all species including carp and frog, indicating that these residues have longer evolutionary history. The ZP3 aa sequence similarities were examined using the GAP program (GCG). The highest aa similarities are observed between the members of the same order within the class mammalia, and also (95.4%) between pig (ungulata) and rabbit (lagomorpha). The deduced ZP3 aa sequences per se may not be enough to build a phylogenetic tree.
Typing of canine parvovirus isolates using mini-sequencing based single nucleotide polymorphism analysis.

PubMed

Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A

2012-05-01

The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.
Phylogenetic analysis of mitochondrial protein coding genes confirms the reciprocal paraphyly of Hexapoda and Crustacea

PubMed Central

Carapelli, Antonio; Liò, Pietro; Nardi, Francesco; van der Wath, Elizabeth; Frati, Francesco

2007-01-01

Background The phylogeny of Arthropoda is still a matter of harsh debate among systematists, and significant disagreement exists between morphological and molecular studies. In particular, while the taxon joining hexapods and crustaceans (the Pancrustacea) is now widely accepted among zoologists, the relationships among its basal lineages, and particularly the supposed reciprocal paraphyly of Crustacea and Hexapoda, continues to represent a challenge. Several genes, as well as different molecular markers, have been used to tackle this problem in molecular phylogenetic studies, with the mitochondrial DNA being one of the molecules of choice. In this study, we have assembled the largest data set available so far for Pancrustacea, consisting of 100 complete (or almost complete) sequences of mitochondrial genomes. After removal of unalignable sequence regions and highly rearranged genomes, we used nucleotide and inferred amino acid sequences of the 13 protein coding genes to reconstruct the phylogenetic relationships among major lineages of Pancrustacea. The analysis was performed with Bayesian inference, and for the amino acid sequences a new, Pancrustacea-specific, matrix of amino acid replacement was developed and used in this study. Results Two largely congruent trees were obtained from the analysis of nucleotide and amino acid datasets. In particular, the best tree obtained based on the new matrix of amino acid replacement (MtPan) was preferred over those obtained using previously available matrices (MtArt and MtRev) because of its higher likelihood score. The most remarkable result is the reciprocal paraphyly of Hexapoda and Crustacea, with some lineages of crustaceans (namely the Malacostraca, Cephalocarida and, possibly, the Branchiopoda) being more closely related to the Insecta s.s. (Ectognatha) than two orders of basal hexapods, Collembola and Diplura. Our results confirm that the mitochondrial genome, unlike analyses based on morphological data or nuclear genes, consistently supports the non monophyly of Hexapoda. Conclusion The finding of the reciprocal paraphyly of Hexapoda and Crustacea suggests an evolutionary scenario in which the acquisition of the hexapod condition may have occurred several times independently in lineages descending from different crustacean-like ancestors, possibly as a consequence of the process of terrestrialization. If this hypothesis was confirmed, we should therefore re-think our interpretation of the evolution of the Arthropoda, where terrestrialization may have led to the acquisition of similar anatomical features by convergence. At the same time, the disagreement between reconstructions based on morphological, nuclear and mitochondrial data sets seems to remain, despite the use of larger data sets and more powerful analytical methods. PMID:17767736
Draft Genome Sequence of Enterococcus casseliflavus PAVET15 Obtained from the Oviduct Infection of the Cattle Tick (Rhipicephalus microplus) in Jiutepec, Morelos, Mexico

PubMed Central

Cossío-Bayúgar, R.; Miranda-Miranda, E.; Arreguín-Pérez, C. A.; Lozano, L.; Peréz de la Rosa, D.; Rocha-Martínez, M. K.; Bravo-Díaz, M. A.

2017-01-01

ABSTRACT Enterococcus spp. are Gram-positive lactic acid-producing bacteria found in the intestinal tracts of animals, like mammals, birds, and arthropods. Enterococcus spp. may cause oportunistic infections in vertebrate and invertebrate hosts. We report here the draft genome sequence of Enterococcus casseliflavus PAVET15 containing 3,722,480 bp, with 80 contigs, an N50 of 179,476 bp, and 41.93% G+C content. PMID:28428300
Solid phase sequencing of biopolymers

DOEpatents

Cantor, Charles; Koster, Hubert

2010-09-28

This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include DNA or RNA in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.
Cloning and sequence analysis of a cDNA encoding the alpha-subunit of mouse beta-N-acetylhexosaminidase and comparison with the human enzyme.

PubMed Central

Beccari, T; Hoade, J; Orlacchio, A; Stirling, J L

1992-01-01

cDNAs encoding the mouse beta-N-acetylhexosaminidase alpha-subunit were isolated from a mouse testis library. The longest of these (1.7 kb) was sequenced and showed 83% similarity with the human alpha-subunit cDNA sequence. The 5' end of the coding sequence was obtained from a genomic DNA clone. Alignment of the human and mouse sequences showed that all three putative N-glycosylation sites are conserved, but that the mouse alpha-subunit has an additional site towards the C-terminus. All eight cysteines in the human sequence are conserved in the mouse. There are an additional two cysteines in the mouse alpha-subunit signal peptide. All amino acids affected in Tay-Sachs-disease mutations are conserved in the mouse. Images Fig. 1. PMID:1379046
An in-silico insight into the characteristics of β-propeller phytase.

PubMed

Mathew, Akash; Verma, Anukriti; Gaur, Smriti

2014-06-01

Phytase is an enzyme that is found extensively in the plant kingdom and in some species of bacteria and fungi. This paper identifies and analyses the available full length sequences of β-propeller phytases (BPP). BPP was chosen due to its potential applicability in the field of aquaculture. The sequences were obtained from the Uniprot database and subject to various online bioinformatics tools to elucidate the physio-chemical characteristics, secondary structures and active site compositions of BPP. Protparam and SOPMA were used to analyse the physiochemical and secondary structure characteristics, while the Expasy online modelling tool and CASTp were used to model the 3-D structure and identify the active sites of the BPP sequences. The amino acid compositions of the four sequences were compared and composed in a graphical format to identify similarities and highlight the potentially important amino acids that form the active site of BPP. This study aims to analyse BPP and contribute to the clarification of the molecular mechanism involved in the enzyme activity of BPP and contribute in part to the possibility of constructing a synthetic version of BPP.
Lipoxygenase in Caragana jubata responds to low temperature, abscisic acid, methyl jasmonate and salicylic acid.

PubMed

Bhardwaj, Pardeep Kumar; Kaur, Jagdeep; Sobti, Ranbir Chander; Ahuja, Paramvir Singh; Kumar, Sanjay

2011-09-01

Lipoxygenase (LOX) catalyses oxygenation of free polyunsaturated fatty acids into oxylipins, and is a critical enzyme of the jasmonate signaling pathway. LOX has been shown to be associated with biotic and abiotic stress responses in diverse plant species, though limited data is available with respect to low temperature and the associated cues. Using rapid amplification of cDNA ends, a full-length cDNA (CjLOX) encoding lipoxygenase was cloned from apical buds of Caragana jubata, a temperate plant species that grows under extreme cold. The cDNA obtained was 2952bp long consisting of an open reading frame of 2610bp encoding 869 amino acids protein. Multiple alignment of the deduced amino acid sequence with those of other plants demonstrated putative LH2/ PLAT domain, lipoxygenase iron binding catalytic domain and lipoxygenase_2 signature sequences. CjLOX exhibited up- and down-regulation of gene expression pattern in response to low temperature (LT), abscisic acid (ABA), methyl jasmonate (MJ) and salicylic acid (SA). Among all the treatments, a strong up-regulation was observed in response to MJ. Data suggests an important role of jasmonate signaling pathway in response to LT in C. jubata. Copyright © 2011 Elsevier B.V. All rights reserved.
Molecular cloning and expression of rat liver bile acid CoA ligase.

PubMed

Falany, Charles N; Xie, Xiaowei; Wheeler, James B; Wang, Jin; Smith, Michelle; He, Dongning; Barnes, Stephen

2002-12-01

Bile acid CoA ligase (BAL) is responsible for catalyzing the first step in the conjugation of bile acids with amino acids. Sequencing of putative rat liver BAL cDNAs identified a cDNA (rBAL-1) possessing a 51 nucleotide 5'-untranslated region, an open reading frame of 2,070 bases encoding a 690 aa protein with a molecular mass of 75,960 Da, and a 138 nucleotide 3'-nontranslated region followed by a poly(A) tail. Identity of the cDNA was established by: 1) the rBAL-1 open reading frame encoded peptides obtained by chemical sequencing of the purified rBAL protein; 2) expressed rBAL-1 protein comigrated with purified rBAL during SDS-polyacrylamide gel electrophoresis; and 3) rBAL-1 expressed in insect Sf9 cells had enzymatic properties that were comparable to the enzyme isolated from rat liver. Evidence for a relationship between fatty acid and bile acid metabolism is suggested by specific inhibition of rBAL-1 by cis-unsaturated fatty acids and its high homology to a human very long chain fatty acid CoA ligase. In summary, these results indicate that the cDNA for rat liver BAL has been isolated and expression of the rBAL cDNA in insect Sf9 cells results in a catalytically active enzyme capable of utilizing several different bile acids as substrates.
Using msa-2b as a molecular marker for genotyping Mexican isolates of Babesia bovis.

PubMed

Genis, Alma D; Perez, Jocelin; Mosqueda, Juan J; Alvarez, Antonio; Camacho, Minerva; Muñoz, Maria de Lourdes; Rojas, Carmen; Figueroa, Julio V

2009-12-01

Variable merozoite surface antigens of Babesia bovis are exposed glycoproteins having a role in erythrocyte invasion. Members of this gene family include msa-1 and msa-2 (msa-2c, msa-2a(1), msa-2a(2) and msa-2b). To determine the sequence variation among B. bovis Mexican isolates using msa-2b as a genetic marker, PCR amplicons corresponding to msa-2b were cloned and plasmids carrying the corresponding inserts were purified and sequenced. Comparative analysis of nucleotide and deduced amino acid sequences revealed distinct degrees of variability and identity among the coding gene sequences obtained from 16 geographically different Mexican B. bovis isolates and a reference strain. Clustal-W multiple alignments of the MSA-2b deduced amino acid sequences performed with the 17 B. bovis Mexican isolates, revealed the identification of three genotypes with a distinct set each of amino acid residues present at the variable region: Genotype I represented by the MO7 strain (in vitro culture-derived from the Mexico isolate) as well as RAD, Chiapas-1, Tabasco and Veracruz-3 isolates; Genotype II, represented by the Jalisco, Mexico and Veracruz-2 isolates; and Genotype III comprising the sequences from most of the isolates studied, Tamaulipas-1, Chiapas-2, Guerrero-1, Nayarit, Quintana Roo, Nuevo Leon, Tamaulipas-2, Yucatan and Guerrero-2. Moreover, these three genotypes could be discriminated against each other by using a PCR-RFLP approach. The results suggest that occurrence of indels within the variable region of msa-2b sequences can be useful markers for identifying a particular genotype present in field populations of B. bovis isolated from infected cattle in Mexico.
Preparation, crystallization and preliminary X-ray diffraction analysis of two intestinal fatty-acid binding proteins in the presence of 11-(dansylamino)undecanoic acid

PubMed Central

Laguerre, Aisha; Wielens, Jerome; Parker, Michael W.; Porter, Christopher J. H.; Scanlon, Martin J.

2011-01-01

Fatty-acid binding proteins (FABPs) are abundantly expressed proteins that bind a range of lipophilic molecules. They have been implicated in the import and intracellular distribution of their ligands and have been linked with metabolic and inflammatory responses in the cells in which they are expressed. Despite their high sequence identity, human intestinal FABP (hIFABP) and rat intestinal FABP (rIFABP) bind some ligands with different affinities. In order to address the structural basis of this differential binding, diffraction-quality crystals have been obtained of hIFABP and rIFABP in complex with the fluorescent fatty-acid analogue 11-(dansylamino)undecanoic acid. PMID:21301109
Analysis of the Transcriptome of Erigeron breviscapus Uncovers Putative Scutellarin and Chlorogenic Acids Biosynthetic Genes and Genetic Markers

PubMed Central

Zhang, Jia-Jin; Shu, Li-Ping; Zhang, Wei; Long, Guang-Qiang; Liu, Tao; Meng, Zheng-Gui; Chen, Jun-Wen; Yang, Sheng-Chao

2014-01-01

Background Erigeron breviscapus (Vant.) Hand-Mazz. is a famous medicinal plant. Scutellarin and chlorogenic acids are the primary active components in this herb. However, the mechanisms of biosynthesis and regulation for scutellarin and chlorogenic acids in E. breviscapus are considerably unknown. In addition, genomic information of this herb is also unavailable. Principal Findings Using Illumina sequencing on GAIIx platform, a total of 64,605,972 raw sequencing reads were generated and assembled into 73,092 non-redundant unigenes. Among them, 44,855 unigenes (61.37%) were annotated in the public databases Nr, Swiss-Prot, KEGG, and COG. The transcripts encoding the known enzymes involved in flavonoids and in chlorogenic acids biosynthesis were discovered in the Illumina dataset. Three candidate cytochrome P450 genes were discovered which might encode flavone 6-hydroase converting apigenin to scutellarein. Furthermore, 4 unigenes encoding the homologues of maize P1 (R2R3-MYB transcription factors) were defined, which might regulate the biosynthesis of scutellarin. Additionally, a total of 11,077 simple sequence repeat (SSR) were identified from 9,255 unigenes. Of SSRs, tri-nucleotide motifs were the most abundant motif. Thirty-six primer pairs for SSRs were randomly selected for validation of the amplification and polymorphism. The result revealed that 34 (94.40%) primer pairs were successfully amplified and 19 (52.78%) primer pairs exhibited polymorphisms. Conclusion Using next generation sequencing (NGS) technology, this study firstly provides abundant genomic data for E. breviscapus. The candidate genes involved in the biosynthesis and transcriptional regulation of scutellarin and chlorogenic acids were obtained in this study. Additionally, a plenty of genetic makers were generated by identification of SSRs, which is a powerful tool for molecular breeding and genetics applications in this herb. PMID:24956277
Atan1p-an extracellular tannase from the dimorphic yeast Arxula adeninivorans: molecular cloning of the ATAN1 gene and characterization of the recombinant enzyme.

PubMed

Böer, Erik; Bode, Rüdiger; Mock, Hans-Peter; Piontek, Michael; Kunze, Gotthard

2009-06-01

The tannase-encoding Arxula adeninivorans gene ATAN1 was isolated from genomic DNA by PCR, using as primers oligonucleotide sequences derived from peptides obtained after tryptic digestion of the purified tannase protein. The gene harbours an ORF of 1764 bp, encoding a 587-amino acid protein, preceded by an N-terminal secretion sequence comprising 28 residues. The deduced amino acid sequence was similar to those of tannases from Aspergillus oryzae (50% identity), A. niger (48%) and putative tannases from A. fumigatus (52%) and A. nidulans (50%). The sequence contains the consensus pentapeptide motif (-Gly-X-Ser-X-Gly-) which forms part of the catalytic centre of serine hydrolases. Expression of ATAN1 is regulated by the carbon source. Supplementation with tannic acid or gallic acid leads to induction of ATAN1, and accumulation of the native tannase enzyme in the medium. The enzymes recovered from both wild-type and recombinant strains were essentially indistinguishable. A molecular mass of approximately 320 kDa was determined, indicating that the native, glycosylated tannase consists of four identical subunits. The enzyme has a temperature optimum at 35-40 degrees C and a pH optimum at approximately 6.0. The enzyme is able to remove gallic acid from both condensed and hydrolysable tannins. The wild-type strain LS3 secreted amounts of tannase equivalent to 100 U/l under inducing conditions, while the transformant strain, which overexpresses the ATAN1 gene from the strong, constitutively active A. adeninivorans TEF1 promoter, produced levels of up to 400 U/l when grown in glucose medium in shake flasks. Copyright (c) 2009 John Wiley & Sons, Ltd.
Analysis of the transcriptome of Erigeron breviscapus uncovers putative scutellarin and chlorogenic acids biosynthetic genes and genetic markers.

PubMed

Jiang, Ni-Hao; Zhang, Guang-Hui; Zhang, Jia-Jin; Shu, Li-Ping; Zhang, Wei; Long, Guang-Qiang; Liu, Tao; Meng, Zheng-Gui; Chen, Jun-Wen; Yang, Sheng-Chao

2014-01-01

Erigeron breviscapus (Vant.) Hand-Mazz. is a famous medicinal plant. Scutellarin and chlorogenic acids are the primary active components in this herb. However, the mechanisms of biosynthesis and regulation for scutellarin and chlorogenic acids in E. breviscapus are considerably unknown. In addition, genomic information of this herb is also unavailable. Using Illumina sequencing on GAIIx platform, a total of 64,605,972 raw sequencing reads were generated and assembled into 73,092 non-redundant unigenes. Among them, 44,855 unigenes (61.37%) were annotated in the public databases Nr, Swiss-Prot, KEGG, and COG. The transcripts encoding the known enzymes involved in flavonoids and in chlorogenic acids biosynthesis were discovered in the Illumina dataset. Three candidate cytochrome P450 genes were discovered which might encode flavone 6-hydroase converting apigenin to scutellarein. Furthermore, 4 unigenes encoding the homologues of maize P1 (R2R3-MYB transcription factors) were defined, which might regulate the biosynthesis of scutellarin. Additionally, a total of 11,077 simple sequence repeat (SSR) were identified from 9,255 unigenes. Of SSRs, tri-nucleotide motifs were the most abundant motif. Thirty-six primer pairs for SSRs were randomly selected for validation of the amplification and polymorphism. The result revealed that 34 (94.40%) primer pairs were successfully amplified and 19 (52.78%) primer pairs exhibited polymorphisms. Using next generation sequencing (NGS) technology, this study firstly provides abundant genomic data for E. breviscapus. The candidate genes involved in the biosynthesis and transcriptional regulation of scutellarin and chlorogenic acids were obtained in this study. Additionally, a plenty of genetic makers were generated by identification of SSRs, which is a powerful tool for molecular breeding and genetics applications in this herb.
SNP in Chalcone Synthase gene is associated with variation of 6-gingerol content in contrasting landraces of Zingiber officinale.Roscoe.

PubMed

Ghosh, Subhabrata; Mandi, Swati Sen

2015-07-25

Zingiber officinale, medicinally the most important species within Zingiber genus, contains 6-gingerol as the active principle. This compound obtained from rhizomes of Z.officinale, has immense medicinal importance and is used in various herbal drug formulations. Our record of variation in content of this active principle, viz. 6-gingerol, in land races of this drug plant collected from different locations correlated with our Gene expression studies exhibiting high Chalcone Synthase gene (Chalcone Synthase is the rate limiting enzyme of 6-gingerol biosynthesis pathway) expression in high 6-gingerol containing landraces than in the low 6-gingerol containing landraces. Sequencing of Chalcone Synthase cDNA and subsequent multiple sequence alignment revealed seven SNPs between these contrasting genotypes. Converting this nucleotide sequence to amino acid sequence, alteration of two amino acids becomes evident; one amino acid change (asparagine to serine at position 336) is associated with base change (A→G) and another change (serine to leucine at position 142) is associated with the base change (C→T). Since asparagine at position 336 is one of the critical amino acids of the catalytic triad of Chalcone Synthase enzyme, responsible for substrate binding, our study suggests that landraces with a specific amino acid change viz. Asparagine (found in high 6-gingerol containing landraces) to serine causes low 6-gingerol content. This is probably due to a weak enzyme substrate association caused by the absence of asparagine in the catalytic triad. Detailed study of this finding could also help to understand molecular mechanism associated with variation in 6-gingerol content in Z.officinale genotypes and thereby strategies for developing elite genotypes containing high 6-gingerol content. Copyright © 2015 Elsevier B.V. All rights reserved.
Toscana Virus Genome Stability: Data from a Meningoencephalitis Case in Mantua, Italy

PubMed Central

Baggieri, Melissa; Gattuso, Gianni; Fortuna, Claudia; Remoli, Maria Elena; Vaccari, Gabriele; Zaccaria, Guendalina; Marchi, Antonella; Bucci, Paola; Benedetti, Eleonora; Fiorentini, Cristiano; Nicoletti, Loredana

2014-01-01

Abstract In July of 2013, samples from a patient with a neurological syndrome were collected from Mantua hospital and sent to the National Reference Laboratory for Arboviruses (National Institute of Health, Rome). On the basis of the symptoms, serological and molecular assays were performed to diagnose either West Nile virus (WNV) or Toscana virus (TOSV) infection. Molecular and serological tests confirmed TOSV infection. Virus isolation was obtained from cerebrospinal fluid. A full genome sequence was determined from this TOSV strain with next-generation sequencing using Ion Torrent technology. Nucleotide and amino acidic sequences grouped phylogenetically with lineage TOSV A and showed a low genome variability. PMID:25514123
Candidate new rotavirus species in Schreiber's bats, Serbia.

PubMed

Bányai, Krisztián; Kemenesi, Gábor; Budinski, Ivana; Földes, Fanni; Zana, Brigitta; Marton, Szilvia; Varga-Kugler, Renáta; Oldal, Miklós; Kurucz, Kornélia; Jakab, Ferenc

2017-03-01

The genus Rotavirus comprises eight species designated A to H and one tentative species, Rotavirus I. In a virus metagenomic analysis of Schreiber's bats sampled in Serbia in 2014 we obtained sequences likely representing novel rotavirus species. Whole genome sequencing and phylogenetic analysis classified the representative strain into a tentative tenth rotavirus species, we provisionally called Rotavirus J. The novel virus shared a maximum of 50% amino acid sequence identity within the VP6 gene to currently known members of the genus. This study extends our understanding of the genetic diversity of rotaviruses in bats. Copyright © 2016 Elsevier B.V. All rights reserved.
Integrating De Novo Transcriptome Assembly and Cloning to Obtain Chicken Ovocleidin-17 Full-Length cDNA

PubMed Central

Ning, ZhongHua; Hincke, Maxwell T.; Yang, Ning; Hou, ZhuoCheng

2014-01-01

Efficiently obtaining full-length cDNA for a target gene is the key step for functional studies and probing genetic variations. However, almost all sequenced domestic animal genomes are not ‘finished’. Many functionally important genes are located in these gapped regions. It can be difficult to obtain full-length cDNA for which only partial amino acid/EST sequences exist. In this study we report a general pipeline to obtain full-length cDNA, and illustrate this approach for one important gene (Ovocleidin-17, OC-17) that is associated with chicken eggshell biomineralization. Chicken OC-17 is one of the best candidates to control and regulate the deposition of calcium carbonate in the calcified eggshell layer. OC-17 protein has been purified, sequenced, and has had its three-dimensional structure solved. However, researchers still cannot conduct OC-17 mRNA related studies because the mRNA sequence is unknown and the gene is absent from the current chicken genome. We used RNA-Seq to obtain the entire transcriptome of the adult hen uterus, and then conducted de novo transcriptome assembling with bioinformatics analysis to obtain candidate OC-17 transcripts. Based on this sequence, we used RACE and PCR cloning methods to successfully obtain the full-length OC-17 cDNA. Temporal and spatial OC-17 mRNA expression analyses were also performed to demonstrate that OC-17 is predominantly expressed in the adult hen uterus during the laying cycle and barely at immature developmental stages. Differential uterine expression of OC-17 was observed in hens laying eggs with weak versus strong eggshell, confirming its important role in the regulation of eggshell mineralization and providing a new tool for genetic selection for eggshell quality parameters. This study is the first one to report the full-length OC-17 cDNA sequence, and builds a foundation for OC-17 mRNA related studies. We provide a general method for biologists experiencing difficulty in obtaining candidate gene full-length cDNA sequences. PMID:24676480
Integrating de novo transcriptome assembly and cloning to obtain chicken Ovocleidin-17 full-length cDNA.

PubMed

Zhang, Quan; Liu, Long; Zhu, Feng; Ning, ZhongHua; Hincke, Maxwell T; Yang, Ning; Hou, ZhuoCheng

2014-01-01

Efficiently obtaining full-length cDNA for a target gene is the key step for functional studies and probing genetic variations. However, almost all sequenced domestic animal genomes are not 'finished'. Many functionally important genes are located in these gapped regions. It can be difficult to obtain full-length cDNA for which only partial amino acid/EST sequences exist. In this study we report a general pipeline to obtain full-length cDNA, and illustrate this approach for one important gene (Ovocleidin-17, OC-17) that is associated with chicken eggshell biomineralization. Chicken OC-17 is one of the best candidates to control and regulate the deposition of calcium carbonate in the calcified eggshell layer. OC-17 protein has been purified, sequenced, and has had its three-dimensional structure solved. However, researchers still cannot conduct OC-17 mRNA related studies because the mRNA sequence is unknown and the gene is absent from the current chicken genome. We used RNA-Seq to obtain the entire transcriptome of the adult hen uterus, and then conducted de novo transcriptome assembling with bioinformatics analysis to obtain candidate OC-17 transcripts. Based on this sequence, we used RACE and PCR cloning methods to successfully obtain the full-length OC-17 cDNA. Temporal and spatial OC-17 mRNA expression analyses were also performed to demonstrate that OC-17 is predominantly expressed in the adult hen uterus during the laying cycle and barely at immature developmental stages. Differential uterine expression of OC-17 was observed in hens laying eggs with weak versus strong eggshell, confirming its important role in the regulation of eggshell mineralization and providing a new tool for genetic selection for eggshell quality parameters. This study is the first one to report the full-length OC-17 cDNA sequence, and builds a foundation for OC-17 mRNA related studies. We provide a general method for biologists experiencing difficulty in obtaining candidate gene full-length cDNA sequences.

Detection of nucleic acid sequences by invader-directed cleavage

DOEpatents

Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2011 CFR

2011-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2013 CFR

2013-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2012 CFR

2012-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2010 CFR

2010-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2014 CFR

2014-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
Methods of biological dosimetry employing chromosome-specific staining

DOEpatents

Gray, Joe W.; Pinkel, Daniel

2000-01-01

Methods and compositions for staining based upon nucleic acid sequence that employ nucleic acid probes are provided. Said methods produce staining patterns that can be tailored for specific cytogenetic analyses. Said probes are appropriate for in situ hybridization and stain both interphase and metaphase chromosomal material with reliable signals. The nucleic acid probes are typically of a complexity greater than 50 kb, the complexity depending upon the cytogenetic application. Methods are provided to disable the hybridization capacity of shared, high copy repetitive sequences and/or remove such sequences to provide for useful contrast. Still further methods are provided to produce chromosome-specific staining reagents which are made specific to the targeted chromosomal material, which can be one or more whole chromosomes, one or more regions on one or more chromosomes, subsets of chromosomes and/or the entire genome. Probes and test kits are provided for use in tumor cytogenetics, in the detection of disease related loci, in analysis of structural abnormalities, such as translocations, and for biological dosimetry. Further, methods and prenatal test kits are provided to stain targeted chromosomal material of fetal cells, including fetal cells obtained from maternal blood. Still further, the invention provides for automated means to detect and analyse chromosomal abnormalities.
Methods And Compositions For Chromosome-Specific Staining

DOEpatents

Gray, Joe W.; Pinkel, Daniel

2003-08-19

Methods and compositions for staining based upon nucleic acid sequence that employ nucleic acid probes are provided. Said methods produce staining patterns that can be tailored for specific cytogenetic analyses. Said probes are appropriate for in situ hybridization and stain both interphase and metaphase chromosomal material with reliable signals. The nucleic acid probes are typically of a complexity greater than 50 kb, the complexity depending upon the cytogenetic application. Methods are provided to disable the hybridization capacity of shared, high copy repetitive sequences and/or remove such sequences to provide for useful contrast. Still further methods are provided to produce chromosome-specific staining reagents which are made specific to the targeted chromosomal material, which can be one or more whole chromosomes, one or more regions on one or more chromosomes, subsets of chromosomes and/or the entire genome. Probes and test kits are provided for use in tumor cytogenetics, in the detection of disease related loci, in analysis of structural abnormalities, such as translocations, and for biological dosimetry. Further, methods and prenatal test kits are provided to stain targeted chromosomal material of fetal cells, including fetal cells obtained from maternal blood. Still further, the invention provides for automated means to detect and analyse chromosomal abnormalities.
Compositions for chromosome-specific staining

DOEpatents

Gray, Joe W.; Pinkel, Daniel

1998-01-01

Methods and compositions for staining based upon nucleic acid sequence that employ nucleic acid probes are provided. Said methods produce staining patterns that can be tailored for specific cytogenetic analyses. Said probes are appropriate for in situ hybridization and stain both interphase and metaphase chromosomal material with reliable signals. The nucleic acid probes are typically of a complexity greater than 50 kb, the complexity depending upon the cytogenetic application. Methods are provided to disable the hybridization capacity of shared, high copy repetitive sequences and/or remove such sequences to provide for useful contrast. Still further methods are provided to produce chromosome-specific staining reagents which are made specific to the targeted chromosomal material, which can be one or more whole chromosomes, one or more regions on one or more chromosomes, subsets of chromosomes and/or the entire genome. Probes and test kits are provided for use in tumor cytogenetics, in the detection of disease related loci, in analysis of structural abnormalities, such as translocations, and for biological dosimetry. Further, methods and prenatal test kits are provided to stain targeted chromosomal material of fetal cells, including fetal cells obtained from maternal blood. Still further, the invention provides for automated means to detect and analyse chromosomal abnormalities.
Prediction protein structural classes with pseudo-amino acid composition: approximate entropy and hydrophobicity pattern.

PubMed

Zhang, Tong-Liang; Ding, Yong-Sheng; Chou, Kuo-Chen

2008-01-07

Compared with the conventional amino acid (AA) composition, the pseudo-amino acid (PseAA) composition as originally introduced for protein subcellular location prediction can incorporate much more information of a protein sequence, so as to remarkably enhance the power of using a discrete model to predict various attributes of a protein. In this study, based on the concept of PseAA composition, the approximate entropy and hydrophobicity pattern of a protein sequence are used to characterize the PseAA components. Also, the immune genetic algorithm (IGA) is applied to search the optimal weight factors in generating the PseAA composition. Thus, for a given protein sequence sample, a 27-D (dimensional) PseAA composition is generated as its descriptor. The fuzzy K nearest neighbors (FKNN) classifier is adopted as the prediction engine. The results thus obtained in predicting protein structural classification are quite encouraging, indicating that the current approach may also be used to improve the prediction quality of other protein attributes, or at least can play a complimentary role to the existing methods in the relevant areas. Our algorithm is written in Matlab that is available by contacting the corresponding author.
Compositions for chromosome-specific staining

DOEpatents

Gray, J.W.; Pinkel, D.

1998-05-26

Methods and compositions for staining based upon nucleic acid sequence that employ nucleic acid probes are provided. The methods produce staining patterns that can be tailored for specific cytogenetic analyses. The probes are appropriate for in situ hybridization and stain both interphase and metaphase chromosomal material with reliable signals. The nucleic acid probes are typically of a complexity greater than 50 kb, the complexity depending upon the cytogenetic application. Methods are provided to disable the hybridization capacity of shared, high copy repetitive sequences and/or remove such sequences to provide for useful contrast. Still further methods are provided to produce chromosome-specific staining reagents which are made specific to the targeted chromosomal material, which can be one or more whole chromosomes, one or more regions on one or more chromosomes, subsets of chromosomes and/or the entire genome. Probes and test kits are provided for use in tumor cytogenetics, in the detection of disease related loci, in analysis of structural abnormalities, such as translocations, and for biological dosimetry. Methods and prenatal test kits are provided to stain targeted chromosomal material of fetal cells, including fetal cells obtained from maternal blood. The invention provides for automated means to detect and analyze chromosomal abnormalities. 17 figs.
Comprehensive global amino acid sequence analysis of PB1F2 protein of influenza A H5N1 viruses and the influenza A virus subtypes responsible for the 20th‐century pandemics

PubMed Central

Pasricha, Gunisha; Mishra, Akhilesh C.; Chakrabarti, Alok K.

2012-01-01

Please cite this paper as: Pasricha et al. (2012) Comprehensive global amino acid sequence analysis of PB1F2 protein of influenza A H5N1 viruses and the Influenza A virus subtypes responsible for the 20th‐century pandemics. Influenza and Other Respiratory Viruses 7(4), 497–505. Background PB1F2 is the 11th protein of influenza A virus translated from +1 alternate reading frame of PB1 gene. Since the discovery, varying sizes and functions of the PB1F2 protein of influenza A viruses have been reported. Selection of PB1 gene segment in the pandemics, variable size and pleiotropic effect of PB1F2 intrigued us to analyze amino acid sequences of this protein in various influenza A viruses. Methods Amino acid sequences for PB1F2 protein of influenza A H5N1, H1N1, H2N2, and H3N2 subtypes were obtained from Influenza Research Database. Multiple sequence alignments of the PB1F2 protein sequences of the aforementioned subtypes were used to determine the size, variable and conserved domains and to perform mutational analysis. Results Analysis showed that 96·4% of the H5N1 influenza viruses harbored full‐length PB1F2 protein. Except for the 2009 pandemic H1N1 virus, all the subtypes of the 20th‐century pandemic influenza viruses contained full‐length PB1F2 protein. Through the years, PB1F2 protein of the H1N1 and H3N2 viruses has undergone much variation. PB1F2 protein sequences of H5N1 viruses showed both human‐ and avian host‐specific conserved domains. Global database of PB1F2 protein revealed that N66S mutation was present only in 3·8% of the H5N1 strains. We found a novel mutation, N84S in the PB1F2 protein of 9·35% of the highly pathogenic avian influenza H5N1 influenza viruses. Conclusions Varying sizes and mutations of the PB1F2 protein in different influenza A virus subtypes with pandemic potential were obtained. There was genetic divergence of the protein in various hosts which highlighted the host‐specific evolution of the virus. However, studies are required to correlate this sequence variability with the virulence and pathogenicity. PMID:22788742
Numeric promoter description - A comparative view on concepts and general application.

PubMed

Beier, Rico; Labudde, Dirk

2016-01-01

Nucleic acid molecules play a key role in a variety of biological processes. Starting from storage and transfer tasks, this also comprises the triggering of biological processes, regulatory effects and the active influence gained by target binding. Based on the experimental output (in this case promoter sequences), further in silico analyses aid in gaining new insights into these processes and interactions. The numerical description of nucleic acids thereby constitutes a bridge between the concrete biological issues and the analytical methods. Hence, this study compares 26 descriptor sets obtained by applying well-known numerical description concepts to an established dataset of 38 DNA promoter sequences. The suitability of the description sets was evaluated by computing partial least squares regression models and assessing the model accuracy. We conclude that the major importance regarding the descriptive power is attached to positional information rather than to explicitly incorporated physico-chemical information, since a sufficient amount of implicit physico-chemical information is already encoded in the nucleobase classification. The regression models especially benefited from employing the information that is encoded in the sequential and structural neighborhood of the nucleobases. Thus, the analyses of n-grams (short fragments of length n) suggested that they are valuable descriptors for DNA target interactions. A mixed n-gram descriptor set thereby yielded the best description of the promoter sequences. The corresponding regression model was checked and found to be plausible as it was able to reproduce the characteristic binding motifs of promoter sequences in a reasonable degree. As most functional nucleic acids are based on the principle of molecular recognition, the findings are not restricted to promoter sequences, but can rather be transferred to other kinds of functional nucleic acids. Thus, the concepts presented in this study could provide advantages for future nucleic acid-based technologies, like biosensoring, therapeutics and molecular imaging. Copyright © 2015 Elsevier Inc. All rights reserved.
Isolation and characterization of NBS-LRR- resistance gene candidates in turmeric (Curcuma longa cv. surama).

PubMed

Joshi, R K; Mohanty, S; Subudhi, E; Nayak, S

2010-09-08

Turmeric (Curcuma longa), an important asexually reproducing spice crop of the family Zingiberaceae is highly susceptible to bacterial and fungal pathogens. The identification of resistance gene analogs holds great promise for development of resistant turmeric cultivars. Degenerate primers designed based on known resistance genes (R-genes) were used in combinations to elucidate resistance gene analogs from Curcuma longa cultivar surama. The three primers resulted in amplicons with expected sizes of 450-600 bp. The nucleotide sequence of these amplicons was obtained through sequencing; their predicted amino acid sequences compared to each other and to the amino acid sequences of known R-genes revealed significant sequence similarity. The finding of conserved domains, viz., kinase-1a, kinase-2 and hydrophobic motif, provided evidence that the sequences belong to the NBS-LRR class gene family. The presence of tryptophan as the last residue of kinase-2 motif further qualified them to be in the non-TIR-NBS-LRR subfamily of resistance genes. A cluster analysis based on the neighbor-joining method was carried out using Curcuma NBS analogs together with several resistance gene analogs and known R-genes, which classified them into two distinct subclasses, corresponding to clades N3 and N4 of non-TIR-NBS sequences described in plants. The NBS analogs that we isolated can be used as guidelines to eventually isolate numerous R-genes in turmeric.
Xylopsora canopeorum (Umbilicariaceae), a new lichen species from the canopy of Sequoia sempervirens.

PubMed

Bendiksby, Mika; Næsborg, Rikke Reese; Timdal, Einar

2018-01-01

Xylopsora canopeorum Timdal, Reese Næsborg & Bendiksby is described as a new species occupying the crowns of large Sequoia sempervirens trees in California, USA. The new species is supported by morphology, anatomy, secondary chemistry and DNA sequence data. While similar in external appearance to X. friesii , it is distinguished by forming smaller, partly coralloid squamules, by the occurrence of soralia and, in some specimens, by the presence of thamnolic acid in addition to friesiic acid in the thallus. Molecular phylogenetic results are based on nuclear (ITS and LSU) as well as mitochondrial (SSU) ribosomal DNA sequence alignments. Phylogenetic hypotheses obtained using Bayesian Inference, Maximum Likelihood and Maximum Parsimony all support X. canopeorum as a distinct evolutionary lineage belonging to the X. caradocensis - X. friesii clade.
Xylopsora canopeorum (Umbilicariaceae), a new lichen species from the canopy of Sequoia sempervirens

PubMed Central

Bendiksby, Mika; Næsborg, Rikke Reese; Timdal, Einar

2018-01-01

Abstract Xylopsora canopeorum Timdal, Reese Næsborg & Bendiksby is described as a new species occupying the crowns of large Sequoia sempervirens trees in California, USA. The new species is supported by morphology, anatomy, secondary chemistry and DNA sequence data. While similar in external appearance to X. friesii, it is distinguished by forming smaller, partly coralloid squamules, by the occurrence of soralia and, in some specimens, by the presence of thamnolic acid in addition to friesiic acid in the thallus. Molecular phylogenetic results are based on nuclear (ITS and LSU) as well as mitochondrial (SSU) ribosomal DNA sequence alignments. Phylogenetic hypotheses obtained using Bayesian Inference, Maximum Likelihood and Maximum Parsimony all support X. canopeorum as a distinct evolutionary lineage belonging to the X. caradocensis–X. friesii clade. PMID:29559828
Pyrin gene and mutants thereof, which cause familial Mediterranean fever

DOEpatents

Kastner, Daniel L [Bethesda, MD; Aksentijevichh, Ivona [Bethesda, MD; Centola, Michael [Tacoma Park, MD; Deng, Zuoming [Gaithersburg, MD; Sood, Ramen [Rockville, MD; Collins, Francis S [Rockville, MD; Blake, Trevor [Laytonsville, MD; Liu, P Paul [Ellicott City, MD; Fischel-Ghodsian, Nathan [Los Angeles, CA; Gumucio, Deborah L [Ann Arbor, MI; Richards, Robert I [North Adelaide, AU; Ricke, Darrell O [San Diego, CA; Doggett, Norman A [Santa Cruz, NM; Pras, Mordechai [Tel-Hashomer, IL

2003-09-30

The invention provides the nucleic acid sequence encoding the protein associated with familial Mediterranean fever (FMF). The cDNA sequence is designated as MEFV. The invention is also directed towards fragments of the DNA sequence, as well as the corresponding sequence for the RNA transcript and fragments thereof. Another aspect of the invention provides the amino acid sequence for a protein (pyrin) associated with FMF. The invention is directed towards both the full length amino acid sequence, fusion proteins containing the amino acid sequence and fragments thereof. The invention is also directed towards mutants of the nucleic acid and amino acid sequences associated with FMF. In particular, the invention discloses three missense mutations, clustered in within about 40 to 50 amino acids, in the highly conserved rfp (B30.2) domain at the C-terminal of the protein. These mutants include M6801, M694V, K695R, and V726A. Additionally, the invention includes methods for diagnosing a patient at risk for having FMF and kits therefor.
Vanillin formation from ferulic acid in Vanilla planifolia is catalysed by a single enzyme.

PubMed

Gallage, Nethaji J; Hansen, Esben H; Kannangara, Rubini; Olsen, Carl Erik; Motawia, Mohammed Saddik; Jørgensen, Kirsten; Holme, Inger; Hebelstrup, Kim; Grisoni, Michel; Møller, Birger Lindberg

2014-06-19

Vanillin is a popular and valuable flavour compound. It is the key constituent of the natural vanilla flavour obtained from cured vanilla pods. Here we show that a single hydratase/lyase type enzyme designated vanillin synthase (VpVAN) catalyses direct conversion of ferulic acid and its glucoside into vanillin and its glucoside, respectively. The enzyme shows high sequence similarity to cysteine proteinases and is specific to the substitution pattern at the aromatic ring and does not metabolize caffeic acid and p-coumaric acid as demonstrated by coupled transcription/translation assays. VpVAN localizes to the inner part of the vanilla pod and high transcript levels are found in single cells located a few cell layers from the inner epidermis. Transient expression of VpVAN in tobacco and stable expression in barley in combination with the action of endogenous alcohol dehydrogenases and UDP-glucosyltransferases result in vanillyl alcohol glucoside formation from endogenous ferulic acid. A gene encoding an enzyme showing 71% sequence identity to VpVAN was identified in another vanillin-producing plant species Glechoma hederacea and was also shown to be a vanillin synthase as demonstrated by transient expression in tobacco.
Vanillin formation from ferulic acid in Vanilla planifolia is catalysed by a single enzyme

PubMed Central

Gallage, Nethaji J.; Hansen, Esben H.; Kannangara, Rubini; Olsen, Carl Erik; Motawia, Mohammed Saddik; Jørgensen, Kirsten; Holme, Inger; Hebelstrup, Kim; Grisoni, Michel; Møller, Birger Lindberg

2014-01-01

Vanillin is a popular and valuable flavour compound. It is the key constituent of the natural vanilla flavour obtained from cured vanilla pods. Here we show that a single hydratase/lyase type enzyme designated vanillin synthase (VpVAN) catalyses direct conversion of ferulic acid and its glucoside into vanillin and its glucoside, respectively. The enzyme shows high sequence similarity to cysteine proteinases and is specific to the substitution pattern at the aromatic ring and does not metabolize caffeic acid and p-coumaric acid as demonstrated by coupled transcription/translation assays. VpVAN localizes to the inner part of the vanilla pod and high transcript levels are found in single cells located a few cell layers from the inner epidermis. Transient expression of VpVAN in tobacco and stable expression in barley in combination with the action of endogenous alcohol dehydrogenases and UDP-glucosyltransferases result in vanillyl alcohol glucoside formation from endogenous ferulic acid. A gene encoding an enzyme showing 71% sequence identity to VpVAN was identified in another vanillin-producing plant species Glechoma hederacea and was also shown to be a vanillin synthase as demonstrated by transient expression in tobacco. PMID:24941968
Molecular Cloning and Characterization of Novel Morus alba Germin-Like Protein Gene Which Encodes for a Silkworm Gut Digestion-Resistant Antimicrobial Protein

PubMed Central

Patnaik, Bharat Bhusan; Kim, Dong Hyun; Oh, Seung Han; Song, Yong-Su; Chanh, Nguyen Dang Minh; Kim, Jong Sun; Jung, Woo-jin; Saha, Atul Kumar; Bindroo, Bharat Bhushan; Han, Yeon Soo

2012-01-01

Background Silkworm fecal matter is considered one of the richest sources of antimicrobial and antiviral protein (substances) and such economically feasible and eco-friendly proteins acting as secondary metabolites from the insect system can be explored for their practical utility in conferring broad spectrum disease resistance against pathogenic microbial specimens. Methodology/Principal Findings Silkworm fecal matter extracts prepared in 0.02 M phosphate buffer saline (pH 7.4), at a temperature of 60°C was subjected to 40% saturated ammonium sulphate precipitation and purified by gel-filtration chromatography (GFC). SDS-PAGE under denaturing conditions showed a single band at about 21.5 kDa. The peak fraction, thus obtained by GFC wastested for homogeneityusing C18reverse-phase high performance liquid chromatography (HPLC). The activity of the purified protein was tested against selected Gram +/− bacteria and phytopathogenic Fusarium species with concentration-dependent inhibitionrelationship. The purified bioactive protein was subjected to matrix-assisted laser desorption and ionization-time of flight mass spectrometry (MALDI-TOF-MS) and N-terminal sequencing by Edman degradation towards its identification. The N-terminal first 18 amino acid sequence following the predicted signal peptide showed homology to plant germin-like proteins (Glp). In order to characterize the full-length gene sequence in detail, the partial cDNA was cloned and sequenced using degenerate primers, followed by 5′- and 3′-rapid amplification of cDNA ends (RACE-PCR). The full-length cDNA sequence composed of 630 bp encoding 209 amino acids and corresponded to germin-like proteins (Glps) involved in plant development and defense. Conclusions/Significance The study reports, characterization of novel Glpbelonging to subfamily 3 from M. alba by the purification of mature active protein from silkworm fecal matter. The N-terminal amino acid sequence of the purified protein was found similar to the deduced amino acid sequence (without the transit peptide sequence) of the full length cDNA from M. alba. PMID:23284650

Molecular characterization of the spike gene of the porcine epidemic diarrhea virus in Mexico, 2013-2016.

PubMed

Lara-Romero, Rocío; Gómez-Núñez, Luis; Cerriteño-Sánchez, José Luis; Márquez-Valdelamar, Laura; Mendoza-Elvira, Susana; Ramírez-Mendoza, Humberto; Rivera-Benítez, José Francisco

2018-04-01

In Mexico, the first outbreaks suggestive of the circulation of the porcine epidemic diarrhea virus (PEDV) were identified at the beginning of July 2013. To identify the molecular characteristics of the PEDV Spike (S) gene in Mexico, 116 samples of the intestine and diarrhea of piglets with clinical signs of porcine epidemic diarrhea (PED) were obtained. Samples were collected from 14 farms located in six states of Mexico (Jalisco, Puebla, Sonora, Veracruz, Guanajuato, and Michoacán) from 2013 to 2016. To identify PEDV, we used real-time RT-PCR to discriminate between non-INDEL and INDEL strains. We chose samples according to state and year to characterize the S gene. After amplification of the S gene, the obtained products were sequenced and assembled. The complete amino acid sequences of the spike protein were used to perform an epitope analysis, which was used to determine null mutations in regions SS2, SS6, and 2C10 compared to the sequences of G2. A phylogenetic analysis determined the circulation of G2b and INDEL strains in Mexico. However, several mutations were recorded in the collagenase equivalent (COE) region that were related to the change in polarity and charge of the amino acid residues. The PEDV strain circulating in Jalisco in 2016 has an insertion of three amino acids ( 232 LGL 234 ) and one change in the antigenic site of the COE region, and strains from the years 2015 and 2016 changed the index of the surface probability, which could be related to the re-emergence of disease outbreaks.
Isolation of acetic, propionic and butyric acid-forming bacteria from biogas plants.

PubMed

Cibis, Katharina Gabriela; Gneipel, Armin; König, Helmut

2016-02-20

In this study, acetic, propionic and butyric acid-forming bacteria were isolated from thermophilic and mesophilic biogas plants (BGP) located in Germany. The fermenters were fed with maize silage and cattle or swine manure. Furthermore, pressurized laboratory fermenters digesting maize silage were sampled. Enrichment cultures for the isolation of acid-forming bacteria were grown in minimal medium supplemented with one of the following carbon sources: Na(+)-dl-lactate, succinate, ethanol, glycerol, glucose or a mixture of amino acids. These substrates could be converted by the isolates to acetic, propionic or butyric acid. In total, 49 isolates were obtained, which belonged to the phyla Firmicutes, Tenericutes or Thermotogae. According to 16S rRNA gene sequences, most isolates were related to Clostridium sporosphaeroides, Defluviitoga tunisiensis and Dendrosporobacter quercicolus. Acetic, propionic or butyric acid were produced in cultures of isolates affiliated to Bacillus thermoamylovorans, Clostridium aminovalericum, Clostridium cochlearium/Clostridium tetani, C. sporosphaeroides, D. quercicolus, Proteiniborus ethanoligenes, Selenomonas bovis and Tepidanaerobacter sp. Isolates related to Thermoanaerobacterium thermosaccharolyticum produced acetic, butyric and lactic acid, and isolates related to D. tunisiensis formed acetic acid. Specific primer sets targeting 16S rRNA gene sequences were designed and used for real-time quantitative PCR (qPCR). The isolates were physiologically characterized and their role in BGP discussed. Copyright © 2016 Elsevier B.V. All rights reserved.
TALEN-mediated targeted mutagenesis of fatty acid desaturase 2 (FAD2) in peanut (Arachis hypogaea L.) promotes the accumulation of oleic acid.

PubMed

Wen, Shijie; Liu, Hao; Li, Xingyu; Chen, Xiaoping; Hong, Yanbin; Li, Haifen; Lu, Qing; Liang, Xuanqiang

2018-05-01

A first creation of high oleic acid peanut varieties by using transcription activator-like effecter nucleases (TALENs) mediated targeted mutagenesis of Fatty Acid Desaturase 2 (FAD2). Transcription activator like effector nucleases (TALENs), which allow the precise editing of DNA, have already been developed and applied for genome engineering in diverse organisms. However, they are scarcely used in higher plant study and crop improvement, especially in allopolyploid plants. In the present study, we aimed to create targeted mutagenesis by TALENs in peanut. Targeted mutations in the conserved coding sequence of Arachis hypogaea fatty acid desaturase 2 (AhFAD2) were created by TALENs. Genetic stability of AhFAD2 mutations was identified by DNA sequencing in up to 9.52 and 4.11% of the regeneration plants at two different targeted sites, respectively. Mutation frequencies among AhFAD2 mutant lines were significantly correlated to oleic acid accumulation. Genetically, stable individuals of positive mutant lines displayed a 0.5-2 fold increase in the oleic acid content compared with non-transgenic controls. This finding suggested that TALEN-mediated targeted mutagenesis could increase the oleic acid content in edible peanut oil. Furthermore, this was the first report on peanut genome editing event, and the obtained high oleic mutants could serve for peanut breeding project.
The bean. alpha. -amylase inhibitor is encoded by a lectin gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Moreno, J.; Altabella, T.; Chrispeels, M.J.

The common bean, Phaseolus vulgaris, contains an inhibitor of insect and mammalian {alpha}-amylases that does not inhibit plant {alpha}-amylase. This inhibitor functions as an anti-feedant or seed-defense protein. We purified this inhibitor by affinity chromatography and found that it consists of a series of glycoforms of two polypeptides (Mr 14,000-19,000). Partial amino acid sequencing was carried out, and the sequences obtained are identical with portions of the derived amino acid sequence of a lectin-like gene. This lectin gene encodes a polypeptide of MW 28,000, and the primary in vitro translation product identified by antibodies to the {alpha}-amylase inhibitor has themore » same size. Co- and posttranslational processing of this polypeptide results in glycosylated polypeptides of 14-19 kDa. Our interpretation of these results is that the bean lectins constitute a gene family that encodes diverse plant defense proteins, including phytohemagglutinin, arcelin and {alpha}-amylase inhibitor.« less
Detection and quantification of Plasmodium falciparum in blood samples using quantitative nucleic acid sequence-based amplification.

PubMed

Schoone, G J; Oskam, L; Kroon, N C; Schallig, H D; Omar, S A

2000-11-01

A quantitative nucleic acid sequence-based amplification (QT-NASBA) assay for the detection of Plasmodium parasites has been developed. Primers and probes were selected on the basis of the sequence of the small-subunit rRNA gene. Quantification was achieved by coamplification of the RNA in the sample with one modified in vitro RNA as a competitor in a single-tube NASBA reaction. Parasite densities ranging from 10 to 10(8) Plasmodium falciparum parasites per ml could be demonstrated and quantified in whole blood. This is approximately 1,000 times more sensitive than conventional microscopy analysis of thick blood smears. Comparison of the parasite densities obtained by microscopy and QT-NASBA with 120 blood samples from Kenyan patients with clinical malaria revealed that for 112 of 120 (93%) of the samples results were within a 1-log difference. QT-NASBA may be especially useful for the detection of low parasite levels in patients with early-stage malaria and for the monitoring of the efficacy of drug treatment.
Peracetic acid-ionic liquid pretreatment to enhance enzymatic saccharification of lignocellulosic biomass.

PubMed

Uju; Abe, Kojiro; Uemura, Nobuyuki; Oshima, Toyoji; Goto, Masahiro; Kamiya, Noriho

2013-06-01

To enhance enzymatic saccharification of pine biomass, the pretreatment reagents peracetic acid (PAA) and ionic liquid (IL) were validated in single reagent pretreatments or combination pretreatments with different sequences. In a 1h saccharification, 5-25% cellulose conversion was obtained from the single pretreatment of PAA or IL. In contrast, a marked enhancement in conversion rates was achieved by PAA-IL combination pretreatments (45-70%). The PAA followed by IL (PAA+IL) pretreatment sequence was the most effective for preparing an enzymatic digestible regenerated biomass with 250-fold higher glucose formation rates than untreated biomass and 2- to 12-fold higher than single pretreatments with PAA or IL alone. Structural analysis confirmed that this pretreatment resulted in biomass with highly porous structural fibers associated with the reduction of lignin content and acetyl groups. Using the PAA+IL sequence, biomass loading in the pretreatment step can be increased from 5% to 15% without significant decrease in cellulose conversion. Copyright © 2013 Elsevier Ltd. All rights reserved.
Genetic diversity of the movement and coat protein genes of South American isolates of Prunus necrotic ringspot virus.

PubMed

Fiore, Nicola; Fajardo, Thor V M; Prodan, Simona; Herranz, María Carmen; Aparicio, Frederic; Montealegre, Jaime; Elena, Santiago F; Pallás, Vicente; Sánchez-Navarro, Jesús

2008-01-01

Prunus necrotic ringspot virus (PNRSV) is distributed worldwide, but no molecular data have been previously reported from South American isolates. The nucleotide sequences corresponding to the movement (MP) and coat (CP) proteins of 23 isolates of PNRSV from Chile, Brazil, and Uruguay, and from different Prunus species, have been obtained. Phylogenetic analysis performed with full-length MP and CP sequences from all the PNRSV isolates confirmed the clustering of the isolates into the previously reported PV32-I, PV96-II and PE5-III phylogroups. No association was found between specific sequences and host, geographic origin or symptomatology. Comparative analysis showed that both MP and CP have phylogroup-specific amino acids and all of the motifs previously characterized for both proteins. The study of the distribution of synonymous and nonsynonymous changes along both open reading frames revealed that most amino acid sites are under the effect of negative purifying selection.
77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-10-29

... DEPARTMENT OF COMMERCE Patent and Trademark Office Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request... Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of...
The Perils of Pathogen Discovery: Origin of a Novel Parvovirus-Like Hybrid Genome Traced to Nucleic Acid Extraction Spin Columns

PubMed Central

Naccache, Samia N.; Greninger, Alexander L.; Lee, Deanna; Coffey, Lark L.; Phan, Tung; Rein-Weston, Annie; Aronsohn, Andrew; Hackett, John; Delwart, Eric L.

2013-01-01

Next-generation sequencing was used for discovery and de novo assembly of a novel, highly divergent DNA virus at the interface between the Parvoviridae and Circoviridae. The virus, provisionally named parvovirus-like hybrid virus (PHV), is nearly identical by sequence to another DNA virus, NIH-CQV, previously detected in Chinese patients with seronegative (non-A-E) hepatitis. Although we initially detected PHV in a wide range of clinical samples, with all strains sharing ∼99% nucleotide and amino acid identity with each other and with NIH-CQV, the exact origin of the virus was eventually traced to contaminated silica-binding spin columns used for nucleic acid extraction. Definitive confirmation of the origin of PHV, and presumably NIH-CQV, was obtained by in-depth analyses of water eluted through contaminated spin columns. Analysis of environmental metagenome libraries detected PHV sequences in coastal marine waters of North America, suggesting that a potential association between PHV and diatoms (algae) that generate the silica matrix used in the spin columns may have resulted in inadvertent viral contamination during manufacture. The confirmation of PHV/NIH-CQV as laboratory reagent contaminants and not bona fide infectious agents of humans underscores the rigorous approach needed to establish the validity of new viral genomes discovered by next-generation sequencing. PMID:24027301
Partial characterization of the lettuce infectious yellows virus genomic RNAs, identification of the coat protein gene and comparison of its amino acid sequence with those of other filamentous RNA plant viruses.

PubMed

Klaassen, V A; Boeshore, M; Dolja, V V; Falk, B W

1994-07-01

Purified virions of lettuce infectious yellows virus (LIYV), a tentative member of the closterovirus group, contained two RNAs of approximately 8500 and 7300 nucleotides (RNAs 1 and 2 respectively) and a single coat protein species with M(r) of approximately 28,000. LIYV-infected plants contained multiple dsRNAs. The two largest were the correct size for the replicative forms of LIYV virion RNAs 1 and 2. To assess the relationships between LIYV RNAs 1 and 2, cDNAs corresponding to the virion RNAs were cloned. Northern blot hybridization analysis showed no detectable sequence homology between these RNAs. A partial amino acid sequence obtained from purified LIYV coat protein was found to align in the most upstream of four complete open reading frames (ORFs) identified in a LIYV RNA 2 cDNA clone. The identity of this ORF was confirmed as the LIYV coat protein gene by immunological analysis of the gene product expressed in vitro and in Escherichia coli. Computer analysis of the LIYV coat protein amino acid sequence indicated that it belongs to a large family of proteins forming filamentous capsids of RNA plant viruses. The LIYV coat protein appears to be most closely related to the coat proteins of two closteroviruses, beet yellows virus and citrus tristeza virus.
Cloning and sequence analysis of a full-length cDNA of SmPP1cb encoding turbot protein phosphatase 1 beta catalytic subunit

NASA Astrophysics Data System (ADS)

Qi, Fei; Guo, Huarong; Wang, Jian

2008-02-01

Reversible protein phosphorylation, catalyzed by protein kinases and phosphatases, is an important and versatile mechanism by which eukaryotic cells regulate almost all the signaling processes. Protein phosphatase 1 (PP1) is the first and well-characterized member of the protein serine/threonine phosphatase family. In the present study, a full-length cDNA encoding the beta isoform of the catalytic subunit of protein phosphatase 1(PP1cb), was for the first time isolated and sequenced from the skin tissue of flatfish turbot Scophthalmus maximus, designated SmPP1cb, by the rapid amplification of cDNA ends (RACE) technique. The cDNA sequence of SmPP1cb we obtained contains a 984 bp open reading frame (ORF), flanked by a complete 39 bp 5' untranslated region and 462 bp 3' untranslated region. The ORF encodes a putative 327 amino acid protein, and the N-terminal section of this protein is highly acidic, Met-Ala-Glu-Gly-Glu-Leu-Asp-Val-Asp, a common feature for PP1 catalytic subunit but absent in protein phosphatase 2B (PP2B). And its calculated molecular mass is 37 193 Da and pI 5.8. Sequence analysis indicated that, SmPP1cb is extremely conserved in both amino acid and nucleotide acid levels compared with the PP1cb of other vertebrates and invertebrates, and its Kozak motif contained in the 5'UTR around ATG start codon is GXXAXXGXX ATGG, which is different from mammalian in two positions A-6 and G-3, indicating the possibility of different initiation of translation in turbot, and also the 3'UTR of SmPP1cb is highly diverse in the sequence similarity and length compared with other animals, especially zebrafish. The cloning and sequencing of SmPP1cb gene lays a good foundation for the future work on the biological functions of PP1 in the flatfish turbot.
Biosynthesis of riboflavin: an unusual riboflavin synthase of Methanobacterium thermoautotrophicum.

PubMed Central

Eberhardt, S; Korn, S; Lottspeich, F; Bacher, A

1997-01-01

Riboflavin synthase was purified by a factor of about 1,500 from cell extract of Methanobacterium thermoautotrophicum. The enzyme had a specific activity of about 2,700 nmol mg(-1) h(-1) at 65 degrees C, which is relatively low compared to those of riboflavin synthases of eubacteria and yeast. Amino acid sequences obtained after proteolytic cleavage had no similarity with known riboflavin synthases. The gene coding for riboflavin synthase (designated ribC) was subsequently cloned by marker rescue with a ribC mutant of Escherichia coli. The ribC gene of M. thermoautotrophicum specifies a protein of 153 amino acid residues. The predicted amino acid sequence agrees with the information gleaned from Edman degradation of the isolated protein and shows 67% identity with the sequence predicted for the unannotated reading frame MJ1184 of Methanococcus jannaschii. The ribC gene is adjacent to a cluster of four genes with similarity to the genes cbiMNQO of Salmonella typhimurium, which form part of the cob operon (this operon contains most of the genes involved in the biosynthesis of vitamin B12). The amino acid sequence predicted by the ribC gene of M. thermoautotrophicum shows no similarity whatsoever to the sequences of riboflavin synthases of eubacteria and yeast. Most notably, the M. thermoautotrophicum protein does not show the internal sequence homology characteristic of eubacterial and yeast riboflavin synthases. The protein of M. thermoautotrophicum can be expressed efficiently in a recombinant E. coli strain. The specific activity of the purified, recombinant protein is 1,900 nmol mg(-1) h(-1) at 65 degrees C. In contrast to riboflavin synthases from eubacteria and fungi, the methanobacterial enzyme has an absolute requirement for magnesium ions. The 5' phosphate of 6,7-dimethyl-8-ribityllumazine does not act as a substrate. The findings suggest that riboflavin synthase has evolved independently in eubacteria and methanobacteria. PMID:9139911
Comprehensive global amino acid sequence analysis of PB1F2 protein of influenza A H5N1 viruses and the influenza A virus subtypes responsible for the 20th-century pandemics.

PubMed

Pasricha, Gunisha; Mishra, Akhilesh C; Chakrabarti, Alok K

2013-07-01

PB1F2 is the 11th protein of influenza A virus translated from +1 alternate reading frame of PB1 gene. Since the discovery, varying sizes and functions of the PB1F2 protein of influenza A viruses have been reported. Selection of PB1 gene segment in the pandemics, variable size and pleiotropic effect of PB1F2 intrigued us to analyze amino acid sequences of this protein in various influenza A viruses. Amino acid sequences for PB1F2 protein of influenza A H5N1, H1N1, H2N2, and H3N2 subtypes were obtained from Influenza Research Database. Multiple sequence alignments of the PB1F2 protein sequences of the aforementioned subtypes were used to determine the size, variable and conserved domains and to perform mutational analysis. Analysis showed that 96·4% of the H5N1 influenza viruses harbored full-length PB1F2 protein. Except for the 2009 pandemic H1N1 virus, all the subtypes of the 20th-century pandemic influenza viruses contained full-length PB1F2 protein. Through the years, PB1F2 protein of the H1N1 and H3N2 viruses has undergone much variation. PB1F2 protein sequences of H5N1 viruses showed both human- and avian host-specific conserved domains. Global database of PB1F2 protein revealed that N66S mutation was present only in 3·8% of the H5N1 strains. We found a novel mutation, N84S in the PB1F2 protein of 9·35% of the highly pathogenic avian influenza H5N1 influenza viruses. Varying sizes and mutations of the PB1F2 protein in different influenza A virus subtypes with pandemic potential were obtained. There was genetic divergence of the protein in various hosts which highlighted the host-specific evolution of the virus. However, studies are required to correlate this sequence variability with the virulence and pathogenicity. © 2012 John Wiley & Sons Ltd.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor L.; Brow, Mary Ann D.; Dahlberg, James E.

2007-12-11

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

2002-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow; Mary Ann D.; Dahlberg, James E.

2010-11-09

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

2000-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Nucleic acid detection assays

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann; Dahlberg, James E.

2005-04-05

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Structure, organization and expression of common carp (Cyprinus carpio L.) SLP-76 gene.

PubMed

Huang, Rong; Sun, Xiao-Feng; Hu, Wei; Wang, Ya-Ping; Guo, Qiong-Lin

2008-05-01

SLP-76 is an important member of the SLP-76 family of adapters, and it plays a key role in TCR signaling and T cell function. Partial cDNA sequence of SLP-76 of common carp (Cyprinus carpio L.) was isolated from thymus cDNA library by the method of suppression subtractive hybridization (SSH). Subsequently, the full length cDNA of carp SLP-76 was obtained by means of 3' RACE and 5' RACE, respectively. The full length cDNA of carp SLP-76 was 2007 bp, consisting of a 5'-terminal untranslated region (UTR) of 285 bp, a 3'-terminal UTR of 240 bp, and an open reading frame of 1482 bp. Sequence comparison showed that the deduced amino acid sequence of carp SLP-76 had an overall similarity of 34-73% to that of other species homologues, and it was composed of an NH2-terminal domain, a central proline-rich domain, and a C-terminal SH2 domain. Amino acid sequence analysis indicated the existence of a Gads binding site R-X-X-K, a 10-aa-long sequence which binds to the SH3 domain of LCK in vitro, and three conserved tyrosine-containing sequence in the NH2-terminal domain. Then we used PCR to obtain a genomic DNA which covers the entire coding region of carp SLP-76. In the 9.2k-long genomic sequence, twenty one exons and twenty introns were identified. RT-PCR results showed that carp SLP-76 was expressed predominantly in hematopoietic tissues, and was upregulated in thymus tissue of four-month carp compared to one-year old carp. RT-PCR and virtual northern hybridization results showed that carp SLP-76 was also upregulated in thymus tissue of GH transgenic carp at the age of four-months. These results suggest that the expression level of SLP-76 gene may be related to thymocyte development in teleosts.

NMR structure determination of a synthetic analogue of bacillomycin Lc reveals the strategic role of L-Asn1 in the natural iturinic antibiotics

NASA Astrophysics Data System (ADS)

Volpon, Laurent; Tsan, Pascale; Majer, Zsuzsa; Vass, Elemer; Hollósi, Miklós; Noguéra, Valérie; Lancelin, Jean-Marc; Besson, Françoise

2007-08-01

Iturins are a group of antifungal produced by Bacillus subtilis. All are cyclic lipopeptides with seven α-amino acids of configuration LDDLLDL and one β-amino fatty acid. The bacillomycin L is a member of this family and its NMR structure was previously resolved using the sequence Asp-Tyr-Asn-Ser-Gln-Ser-Thr. In this work, we carefully examined the NMR spectra of this compound and detected an error in the sequence. In fact, Asp1 and Gln5 need to be changed into Asn1 and Glu5, which therefore makes it identical to bacillomycin Lc. As a consequence, it now appears that all iturinic peptides with antibiotic activity share the common β-amino fatty acid 8- L-Asn1- D-Tyr2- D-Asn3 sequence. To better understand the conformational influence of the acidic residue L-Asp1, present, for example in the inactive iturin C, the NMR structure of the synthetic analogue SCP [cyclo ( L-Asp1- D-Tyr2- D-Asn3- L-Ser4- L-Gln5- D-Ser6- L-Thr7-β-Ala8)] was determined and compared with bacillomycin Lc recalculated with the corrected sequence. In both cases, the conformers obtained were separated into two families of similar energy which essentially differ in the number and type of turns. A detailed analysis of both cyclopeptide structures is presented here. In addition, CD and FTIR spectra were performed and confirmed the conformational differences observed by NMR between both cyclopeptides.
Sequence diversity among badnavirus isolates infecting yam (Dioscorea spp.) in Ghana, Togo, Benin and Nigeria.

PubMed

Eni, A O; Hughes, J d'A; Asiedu, R; Rey, M E C

2008-01-01

We analysed the sequence diversity in the reverse transcriptase (RT)/ribonuclease H (RNaseH) coding region of 19 badnavirus isolates infecting yam (Dioscorea spp.) in Ghana, Togo, Benin, and Nigeria. Phylogenetic analysis of the deduced amino acid sequences revealed that the isolates are broadly divided into two distinct species, each clustering with Dioscorea alata bacilliform virus (DaBV) and Dioscorea sansibarensis bacilliform virus (DsBV). Fourteen isolates had 90-96% amino acid identity with DaBV, while four isolates had 83-84% amino acid identity with DsBV. One isolate from Benin, BN4Dr, was distinct and had 77 and 75% amino acid identity with DaBV and DsBV, respectively, and may be a member of a new badnavirus species infecting yam in West Africa. Viruses of the two main species were present in Ghana, Togo and Benin and were observed to infect both D. alata and D. rotundata indiscriminately. This is the first confirmed report of DsBV infection in yam in Ghana and Togo. The results of this study demonstrate that members of two distinct species of badnaviruses infect yam in the West African yam zone and suggest a putative new species, BN4Dr. We also conclude that these species are not confined to limited geographic regions or specific for yam host species. However, the three badnavirus species are serologically related. The sequence information obtained from this study can be used to develop PCR-based diagnostics to detect members of the various species and/or strains of badnaviruses infecting yam in West Africa.
High-Quality Draft Genome Sequence of Candida apicola NRRL Y-50540

PubMed Central

Vega-Alvarado, Leticia; Gómez-Angulo, Jorge; Escalante-García, Zazil; Grande, Ricardo; Gschaedler-Mathis, Anne; Amaya-Delgado, Lorena

2015-01-01

Candida apicola, a highly osmotolerant ascomycetes yeast, produces sophorolipids (biosurfactants), membrane fatty acids, and enzymes of biotechnological interest. The genome obtained has a high-quality draft for this species and can be used as a reference to perform further analyses, such as differential gene expression in yeast from Candida genera. PMID:26067948
A diverse family of serine proteinase genes expressed in cotton boll weevil (Anthonomus grandis): implications for the design of pest-resistant transgenic cotton plants.

PubMed

Oliveira-Neto, Osmundo B; Batista, João A N; Rigden, Daniel J; Fragoso, Rodrigo R; Silva, Rodrigo O; Gomes, Eliane A; Franco, Octávio L; Dias, Simoni C; Cordeiro, Célia M T; Monnerat, Rose G; Grossi-De-Sá, Maria F

2004-09-01

Fourteen different cDNA fragments encoding serine proteinases were isolated by reverse transcription-PCR from cotton boll weevil (Anthonomus grandis) larvae. A large diversity between the sequences was observed, with a mean pairwise identity of 22% in the amino acid sequence. The cDNAs encompassed 11 trypsin-like sequences classifiable into three families and three chymotrypsin-like sequences belonging to a single family. Using a combination of 5' and 3' RACE, the full-length sequence was obtained for five of the cDNAs, named Agser2, Agser5, Agser6, Agser10 and Agser21. The encoded proteins included amino acid sequence motifs of serine proteinase active sites, conserved cysteine residues, and both zymogen activation and signal peptides. Southern blotting analysis suggested that one or two copies of these serine proteinase genes exist in the A. grandis genome. Northern blotting analysis of Agser2 and Agser5 showed that for both genes, expression is induced upon feeding and is concentrated in the gut of larvae and adult insects. Reverse northern analysis of the 14 cDNA fragments showed that only two trypsin-like and two chymotrypsin-like were expressed at detectable levels. Under the effect of the serine proteinase inhibitors soybean Kunitz trypsin inhibitor and black-eyed pea trypsin/chymotrypsin inhibitor, expression of one of the trypsin-like sequences was upregulated while expression of the two chymotrypsin-like sequences was downregulated. Copyright 2004 Elsevier Ltd.
Isolation and characterization of a new bacteriocin, termed enterocin M, produced by environmental isolate Enterococcus faecium AL41.

PubMed

Mareková, Mária; Lauková, Andrea; Skaugen, Morten; Nes, Ingolf

2007-08-01

The new bacteriocin, termed enterocin M, produced by Enterococcus faecium AL 41 showed a wide spectrum of inhibitory activity against the indicator organisms from different sources. It was purified by (NH4)2SO4 precipitation, cation-exchange chromatography and reverse phase chromatography (FPLC). The purified peptide was sequenced by N-terminal amino acid Edman degradation and a mass spectrometry analysis was performed. By combining the data obtained from amino acid sequence (39 N-terminal amino acid residues was determined) and the molecular weight (determined to be 4628 Da) it was concluded that the purified enterocin M is a new bacteriocin, which is very similar to enterocin P. However, its molecular weight is different from enterocin P (4701.25). Of the first 39 N-terminal residues of enterocin M, valine was found in position 20 and a lysine in position 35, while enterocin P has tryptophane residues in these positions.
Rapid identification of acetic acid bacteria using MALDI-TOF mass spectrometry fingerprinting.

PubMed

Andrés-Barrao, Cristina; Benagli, Cinzia; Chappuis, Malou; Ortega Pérez, Ruben; Tonolla, Mauro; Barja, François

2013-03-01

Acetic acid bacteria (AAB) are widespread microorganisms characterized by their ability to transform alcohols and sugar-alcohols into their corresponding organic acids. The suitability of matrix-assisted laser desorption-time of flight mass spectrometry (MALDI-TOF MS) for the identification of cultured AAB involved in the industrial production of vinegar was evaluated on 64 reference strains from the genera Acetobacter, Gluconacetobacter and Gluconobacter. Analysis of MS spectra obtained from single colonies of these strains confirmed their basic classification based on comparative 16S rRNA gene sequence analysis. MALDI-TOF analyses of isolates from vinegar cross-checked by comparative sequence analysis of 16S rRNA gene fragments allowed AAB to be identified, and it was possible to differentiate them from mixed cultures and non-AAB. The results showed that MALDI-TOF MS analysis was a rapid and reliable method for the clustering and identification of AAB species. Copyright © 2012 Elsevier GmbH. All rights reserved.
Characterization of myosin heavy chain and its gene in Amoeba proteus.

PubMed

Oh, S W; Jeon, K W

1998-01-01

Monoclonal antibodies against the myosin heavy chain of Amoeba proteus were obtained and used to localize myosin inside amoebae and to clone cDNAs encoding myosin. Myosin was found throughout the amoeba cytoplasm but was more concentrated in the ectoplasmic regions as determined by indirect immunofluorescence microscopy. In symbiont-bearing xD amoebae, myosin was also found on the symbiosome membranes, as checked by indirect immunofluorescence microscopy and by immunoelectron microscopy. The open reading frame of a cloned myosin cDNA contained 6,414 nucleotides, coding for a polypeptide of 2,138 amino acids. While the amino-acid sequence of the globular head region of amoeba's myosin had a high degree of similarity with that of myosins from various organisms, the tail region building a coiled-coil structure did not show a significant sequence similarity. There appeared to be at least three different isoforms of myosins in amoebae, with closely related amino acids in the globular head region.
Molecular characterization of two prunus necrotic ringspot virus isolates from Canada.

PubMed

Cui, Hongguang; Hong, Ni; Wang, Guoping; Wang, Aiming

2012-05-01

We determined the entire RNA1, 2 and 3 sequences of two prunus necrotic ringspot virus (PNRSV) isolates, Chr3 from cherry and Pch12 from peach, obtained from an orchard in the Niagara Fruit Belt, Canada. The RNA1, 2 and 3 of the two isolates share nucleotide sequence identities of 98.6%, 98.4% and 94.5%, respectively. Their RNA1- and 2-encoded amino acid sequences are about 98% identical to the corresponding sequences of a cherry isolate, CH57, the only other PNRSV isolate with complete RNA1 and 2 sequences available. Phylogenetic analysis of the coat protein and movement protein encoded by RNA3 of Pch12 and Chr3 and published PNRSV isolates indicated that Chr3 belongs to the PV96 group and Pch12 belongs to the PV32 group.
Method for nucleic acid hybridization using single-stranded DNA binding protein

DOEpatents

Tabor, Stanley; Richardson, Charles C.

1996-01-01

Method of nucleic acid hybridization for detecting the presence of a specific nucleic acid sequence in a population of different nucleic acid sequences using a nucleic acid probe. The nucleic acid probe hybridizes with the specific nucleic acid sequence but not with other nucleic acid sequences in the population. The method includes contacting a sample (potentially including the nucleic acid sequence) with the nucleic acid probe under hybridizing conditions in the presence of a single-stranded DNA binding protein provided in an amount which stimulates renaturation of a dilute solution (i.e., one in which the t.sub.1/2 of renaturation is longer than 3 weeks) of single-stranded DNA greater than 500 fold (i.e., to a t.sub.1/2 less than 60 min, preferably less than 5 min, and most preferably about 1 min.) in the absence of nucleotide triphosphates.
Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.

PubMed

Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami

2012-08-01

Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature ( http://hivdb.Stanford.edu ). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if with >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or <0.5% or >15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.
The Endocannabinoid System in the Baboon (Papio SPP.) as a Complex Framework for Developmental Pharmacology

PubMed Central

Rodriguez-Sanchez, Iram P.; Guindon, Josee; Ruiz, Marco; Tejero, Maria E.; Hubbard, Gene; Martinez-De-Villarreal, Laura E.; Barrera-Saldaña, Hugo A.; Dick, Edward J.; Commuzzie, Anthony G; Schlabritz-Loutsevitch, Natalia E

2017-01-01

Introduction The consumption of marijuana (exogenous cannabinoid) almost doubled in adults during last decade. Consumption of exogenous cannabinoids interferes with the endogenous cannabinoid (or “endocannabinoid” (eCB)) system (ECS), which comprises N-arachidonylethanolamide (anandamide, AEA), 2-arachidonoyl glycerol (2-AG), endocannabinoid receptors (cannabinoid receptors 1 and 2 (CB1R and CB2R), encoded by CNR1 and CNR2, respectively), and synthesizing/degrading enzymes (FAAH, fatty-acid amide hydrolase; MAGL, monoacylglycerol lipase; DAGL-α, diacylglycerol lipase-alpha). Reports regarding the toxic and therapeutic effects of pharmacological compounds targeting the ECS are sometimes contradictory. This may be caused by the fact that structure of the eCBs varies in the species studied. Objectives First: to clone and characterize the cDNAs of selected members of ECS in a non-human primate (baboon, Papio spp.), and second: to compare those cDNA sequences to known human structural variants (single nucleotide polymorphisms and haplotypes). Materials and methods Polymerase chain reaction-amplified gene products from baboon tissues were transformed into Escherichia coli. Amplicon-positive clones were sequenced, and the obtained sequences were conceptually translated into amino-acid sequences using the genetic code. Results Among the ECS members, CNR1 was the best conserved gene between humans and baboons. The phenotypes associated with mutations in the untranslated regions of this gene in humans have not been described in baboons. One difference in the structure of CNR2 between humans and baboons was detected in the region with the only known clinically relevant polymorphism in a human receptor. All of the differences in the amino-acid structure of DAGL-α between humans and baboons were located in the hydroxylase domain, close to phosphorylation sites. None of the differences in the amino-acid structure of MAGL observed between baboons and humans were located in the area critical for enzyme function. Conclusion The evaluation of the data, obtained in non-human primate model of cannabis-related developmental exposure should take into consideration possible evolutionary-determined species-specific differences in the CB1R expression, CB2R transduction pathway, and FAAH and DAGLα substrate-enzyme interactions. PMID:27327781
The endocannabinoid system in the baboon (Papio spp.) as a complex framework for developmental pharmacology.

PubMed

Rodriguez-Sanchez, Iram P; Guindon, Josee; Ruiz, Marco; Tejero, M Elizabeth; Hubbard, Gene; Martinez-de-Villarreal, Laura E; Barrera-Saldaña, Hugo A; Dick, Edward J; Comuzzie, Anthony G; Schlabritz-Loutsevitch, Natalia E

The consumption of marijuana (exogenous cannabinoid) almost doubled in adults during last decade. Consumption of exogenous cannabinoids interferes with the endogenous cannabinoid (or "endocannabinoid" (eCB)) system (ECS), which comprises N-arachidonylethanolamide (anandamide, AEA), 2-arachidonoyl glycerol (2-AG), endocannabinoid receptors (cannabinoid receptors 1 and 2 (CB1R and CB2R), encoded by CNR1 and CNR2, respectively), and synthesizing/degrading enzymes (FAAH, fatty-acid amide hydrolase; MAGL, monoacylglycerol lipase; DAGL-α, diacylglycerol lipase-alpha). Reports regarding the toxic and therapeutic effects of pharmacological compounds targeting the ECS are sometimes contradictory. This may be caused by the fact that structure of the eCBs varies in the species studied. First: to clone and characterize the cDNAs of selected members of ECS in a non-human primate (baboon, Papio spp.), and second: to compare those cDNA sequences to known human structural variants (single nucleotide polymorphisms and haplotypes). Polymerase chain reaction-amplified gene products from baboon tissues were transformed into Escherichia coli. Amplicon-positive clones were sequenced, and the obtained sequences were conceptually translated into amino-acid sequences using the genetic code. Among the ECS members, CNR1 was the best conserved gene between humans and baboons. The phenotypes associated with mutations in the untranslated regions of this gene in humans have not been described in baboons. One difference in the structure of CNR2 between humans and baboons was detected in the region with the only known clinically relevant polymorphism in a human receptor. All of the differences in the amino-acid structure of DAGL-α between humans and baboons were located in the hydroxylase domain, close to phosphorylation sites. None of the differences in the amino-acid structure of MAGL observed between baboons and humans were located in the area critical for enzyme function. The evaluation of the data, obtained in non-human primate model of cannabis-related developmental exposure should take into consideration possible evolutionary-determined species-specific differences in the CB1R expression, CB2R transduction pathway, and FAAH and DAGLα substrate-enzyme interactions. Copyright © 2016 Elsevier Inc. All rights reserved.
Simultaneous determination of Ca, Cu, Ni, Zn and Cd binding strengths with fulvic acid fractions by Schubert's method

USGS Publications Warehouse

Brown, G.K.; MacCarthy, P.; Leenheer, J.A.

1999-01-01

The equilibrium binding of Ca2+, Ni2+, Cd2+, Cu2+ and Zn2+ with unfractionated Suwannee river fulvic acid (SRFA) and an enhanced metal binding subfraction of SRFA was measured using Schubert's ion-exchange method at pH 6.0 and at an ionic strength (??) of 0.1 (NaNO3). The fractionation and subfractionation were directed towards obtaining an isolate with an elevated metal binding capacity or binding strength as estimated by Cu2+ potentiometry (ISE). Fractions were obtained by stepwise eluting an XAD-8 column loaded with SRFA with water eluents of pH 1.0 to pH 12.0. Subfractions were obtained by loading the fraction eluted from XAD-8 at pH 5.0 onto a silica gel column and eluting with solvents of increasing polarity. Schuberts ion exchange method was rigorously tested by measuring simultaneously the conditional stability constants (K) of citric acid complexed with the five metals at pH 3.5 and 6.0. The logK of SRFA with Ca2+, Ni2+, Cd2+, Cu2+ and Zn2+ determined simultaneously at pH 6.0 follow the sequence of Cu2+>Cd2+>Ni2+>Zn2+>Ca2+ while all logK values increased for the enhanced metal binding subfraction and followed a different sequence of Cu2+>Cd2+>Ca2+>Ni2+>Zn2+. Both fulvic acid samples and citric acid exhibited a 1:1 metal to ligand stochiometry under the relatively low metal loading conditions used here. Quantitative 13C nuclear magnetic resonance spectroscopy showed increases in aromaticity and ketone content and decreases in aliphatic carbon for the elevated metal binding fraction while the carboxyl carbon, and elemental nitrogen, phosphorus, and sulfur content did not change. The more polar, elevated metal binding fraction did show a significant increase in molecular weight over the unfractionated SRFA. Copyright (C) 1999 Elsevier Science B.V.
Heterologous overproduction of β-fructofuranosidase from yeast Xanthophyllomyces dendrorhous, an enzyme producing prebiotic sugars.

PubMed

Gimeno-Pérez, María; Linde, Dolores; Fernández-Arrojo, Lucía; Plou, Francisco J; Fernández-Lobato, María

2015-04-01

The β-fructofuranosidase Xd-INV from the yeast Xanthophyllomyces dendrorhous is the largest microbial enzyme producing neo-fructooligosaccharides (neo-FOS) known to date. It mainly synthesizes neokestose and neonystose, oligosaccharides with potentially improved prebiotic properties. The Xd-INV gene comprises an open reading frame of 1995 bp, which encodes a 665-amino acid protein. Initial N-terminal sequencing of Xd-INV pointed to a majority extracellular protein of 595 amino acids lacking the first 70 residues (potential signal peptide). Functionality of the last 1785 bp of Xd-INV gene was previously proved in Saccharomyces cerevisiae but only weak β-fructofuranosidase activity was quantified. In this study, different strategies to improve this enzyme level in a heterologous system have been used. Curiously, best results were obtained by increasing the protein N-terminus sequence in 39 amino acids, protein of 634 residues. The higher β-fructofuranosidase activity detected in this study, about 15 U/mL, was obtained using Pichia pastoris and represents an improvement of about 1500 times the level previously obtained in a heterologous organism and doubles the best level of activity obtained by the natural producer. Heterologously expressed protein was purified and characterized biochemically and kinetically. Except by its glycosylation degree (10 % lower) and thermal stability (4-5 °C lower in the 60-85 °C range), the properties of the heterologous enzyme, including ability to produce neo-FOS, remained unchanged. Interestingly, besides the neo-FOS referred before blastose was also detected (8-22 g/L) in the reaction mixtures, making Xd-INV the first yeast enzyme producing this non-conventional disaccharide reported to date.
Purification, characterization and preliminary crystallographic studies of a PR-10 protein from Pachyrrhizus erosus seeds.

PubMed

Wu, Fang; Li, Yikun; Chang, Shaojie; Zhou, Zhaocai; Wang, Fang; Song, Xiaomin; Lin, Yujuan; Gong, Weimin

2002-12-01

A 16 kDa protein SPE16 was purified from the seeds of Pachyrrhizus erosus. Its N-terminal amino-acid sequence showed significant sequence homology to pathogenesis-related proteins from the PR-10 family. An activity assay indicated that SPE16 possesses ribonuclease activity as do some other PR-10 proteins. SPE16 crystals were obtained by the hanging-drop vapour-diffusion method. The space group is P2(1)2(1)2(1), with unit-cell parameters a = 53.36, b = 63.70, c = 72.96 A.
Draft Genome Sequence of Enterococcus casseliflavus PAVET15 Obtained from the Oviduct Infection of the Cattle Tick (Rhipicephalus microplus) in Jiutepec, Morelos, Mexico.

PubMed

Cossío-Bayúgar, R; Miranda-Miranda, E; Arreguín-Pérez, C A; Lozano, L; Peréz de la Rosa, D; Rocha-Martínez, M K; Bravo-Díaz, M A; Sachman-Ruiz, B

2017-04-20

Enterococcus spp. are Gram-positive lactic acid-producing bacteria found in the intestinal tracts of animals, like mammals, birds, and arthropods. Enterococcus spp. may cause oportunistic infections in vertebrate and invertebrate hosts. We report here the draft genome sequence of Enterococcus casseliflavus PAVET15 containing 3,722,480 bp, with 80 contigs, an N 50 of 179,476 bp, and 41.93% G+C content. Copyright © 2017 Cossío-Bayúgar et al.
Isolation and Distribution of a Novel Iron-Oxidizing Crenarchaeon from Acidic Geothermal Springs in Yellowstone National Park▿ †

PubMed Central

Kozubal, M.; Macur, R. E.; Korf, S.; Taylor, W. P.; Ackerman, G. G.; Nagy, A.; Inskeep, W. P.

2008-01-01

Novel thermophilic crenarchaea have been observed in Fe(III) oxide microbial mats of Yellowstone National Park (YNP); however, no definitive work has identified specific microorganisms responsible for the oxidation of Fe(II). The objectives of the current study were to isolate and characterize an Fe(II)-oxidizing member of the Sulfolobales observed in previous 16S rRNA gene surveys and to determine the abundance and distribution of close relatives of this organism in acidic geothermal springs containing high concentrations of dissolved Fe(II). Here we report the isolation and characterization of the novel, Fe(II)-oxidizing, thermophilic, acidophilic organism Metallosphaera sp. strain MK1 obtained from a well-characterized acid-sulfate-chloride geothermal spring in Norris Geyser Basin, YNP. Full-length 16S rRNA gene sequence analysis revealed that strain MK1 exhibits only 94.9 to 96.1% sequence similarity to other known Metallosphaera spp. and less than 89.1% similarity to known Sulfolobus spp. Strain MK1 is a facultative chemolithoautotroph with an optimum pH range of 2.0 to 3.0 and an optimum temperature range of 65 to 75°C. Strain MK1 grows optimally on pyrite or Fe(II) sorbed onto ferrihydrite, exhibiting doubling times between 10 and 11 h under aerobic conditions (65°C). The distribution and relative abundance of MK1-like 16S rRNA gene sequences in 14 acidic geothermal springs containing Fe(III) oxide microbial mats were evaluated. Highly related MK1-like 16S rRNA gene sequences (>99% sequence similarity) were consistently observed in Fe(III) oxide mats at temperatures ranging from 55 to 80°C. Quantitative PCR using Metallosphaera-specific primers confirmed that organisms highly similar to strain MK1 comprised up to 40% of the total archaeal community at selected sites. The broad distribution of highly related MK1-like 16S rRNA gene sequences in acidic Fe(III) oxide microbial mats is consistent with the observed characteristics and growth optima of Metallosphaera-like strain MK1 and emphasizes the importance of this newly described taxon in Fe(II) chemolithotrophy in acidic high-temperature environments of YNP. PMID:18083851
Complementary DNA sequencing and identification of mRNAs from the venomous gland of Agkistrodon piscivorus leucostoma.

PubMed

Jia, Ying; Cantu, Bruno A; Sánchez, Elda E; Pérez, John C

2008-06-15

To advance our knowledge on the snake venom composition and transcripts expressed in venom gland at the molecular level, we constructed a cDNA library from the venom gland of Agkistrodon piscivorus leucostoma for the generation of expressed sequence tags (ESTs) database. From the randomly sequenced 2112 independent clones, we have obtained ESTs for 1309 (62%) cDNAs, which showed significant deduced amino acid sequence similarity (scores >80) to previously characterized proteins in National Center for Biotechnology Information (NCBI) database. Ribosomal proteins make up 47 clones (2%) and the remaining 756 (36%) cDNAs represent either unknown identity or show BLASTX sequence identity scores of <80 with known GenBank accessions. The most highly expressed gene encoding phospholipase A(2) (PLA(2)) accounting for 35% of A. p. leucostoma venom gland cDNAs was identified and further confirmed by crude venom applied to sodium dodecyl sulfate/polyacrylamide gel electrophoresis (SDS-PAGE) electrophoresis and protein sequencing. A total of 180 representative genes were obtained from the sequence assemblies and deposited to EST database. Clones showing sequence identity to disintegrins, thrombin-like enzymes, hemorrhagic toxins, fibrinogen clotting inhibitors and plasminogen activators were also identified in our EST database. These data can be used to develop a research program that will help us identify genes encoding proteins that are of medical importance or proteins involved in the mechanisms of the toxin venom.
Electron-Transfer Ion/Ion Reactions of Doubly Protonated Peptides: Effect of Elevated Bath Gas Temperature

PubMed Central

Pitteri, Sharon J.; Chrisman, Paul A.; McLuckey, Scott A.

2005-01-01

In this study, the electron-transfer dissociation (ETD) behavior of cations derived from 27 different peptides (22 of which are tryptic peptides) has been studied in a 3D quadrupole ion trap mass spectrometer. Ion/ion reactions between peptide cations and nitrobenzene anions have been examined at both room temperature and in an elevated temperature bath gas environment to form ETD product ions. From the peptides studied, the ETD sequence coverage tends to be inversely related to peptide size. At room temperature, very high sequence coverage (~100%) was observed for small peptides (≤7 amino acids). For medium-sized peptides composed of 8–11 amino acids, the average sequence coverage was 46%. Larger peptides with 14 or more amino acids yielded an average sequence coverage of 23%. Elevated-temperature ETD provided increased sequence coverage over room-temperature experiments for the peptides of greater than 7 residues, giving an average of 67% for medium-sized peptides and 63% for larger peptides. Percent ETD, a measure of the extent of electron transfer, has also been calculated for the peptides and also shows an inverse relation with peptide size. Bath gas temperature does not have a consistent effect on percent ETD, however. For the tryptic peptides, fragmentation is localized at the ends of the peptides suggesting that the distribution of charge within the peptide may play an important role in determining fragmentation sites. A triply protonated peptide has also been studied and shows behavior similar to the doubly charged peptides. These preliminary results suggest that for a given charge state there is a maximum size for which high sequence coverage is obtained and that increasing the bath gas temperature can increase this maximum. PMID:16131079
Molecular Cloning and Characterization of cDNA Encoding a Putative Stress-Induced Heat-Shock Protein from Camelus dromedarius

PubMed Central

Elrobh, Mohamed S.; Alanazi, Mohammad S.; Khan, Wajahatullah; Abduljaleel, Zainularifeen; Al-Amri, Abdullah; Bazzi, Mohammad D.

2011-01-01

Heat shock proteins are ubiquitous, induced under a number of environmental and metabolic stresses, with highly conserved DNA sequences among mammalian species. Camelus dromedaries (the Arabian camel) domesticated under semi-desert environments, is well adapted to tolerate and survive against severe drought and high temperatures for extended periods. This is the first report of molecular cloning and characterization of full length cDNA of encoding a putative stress-induced heat shock HSPA6 protein (also called HSP70B′) from Arabian camel. A full-length cDNA (2417 bp) was obtained by rapid amplification of cDNA ends (RACE) and cloned in pET-b expression vector. The sequence analysis of HSPA6 gene showed 1932 bp-long open reading frame encoding 643 amino acids. The complete cDNA sequence of the Arabian camel HSPA6 gene was submitted to NCBI GeneBank (accession number HQ214118.1). The BLAST analysis indicated that C. dromedaries HSPA6 gene nucleotides shared high similarity (77–91%) with heat shock gene nucleotide of other mammals. The deduced 643 amino acid sequences (accession number ADO12067.1) showed that the predicted protein has an estimated molecular weight of 70.5 kDa with a predicted isoelectric point (pI) of 6.0. The comparative analyses of camel HSPA6 protein sequences with other mammalian heat shock proteins (HSPs) showed high identity (80–94%). Predicted camel HSPA6 protein structure using Protein 3D structural analysis high similarities with human and mouse HSPs. Taken together, this study indicates that the cDNA sequences of HSPA6 gene and its amino acid and protein structure from the Arabian camel are highly conserved and have similarities with other mammalian species. PMID:21845074

Prediction of cis/trans isomerization in proteins using PSI-BLAST profiles and secondary structure information.

PubMed

Song, Jiangning; Burrage, Kevin; Yuan, Zheng; Huber, Thomas

2006-03-09

The majority of peptide bonds in proteins are found to occur in the trans conformation. However, for proline residues, a considerable fraction of Prolyl peptide bonds adopt the cis form. Proline cis/trans isomerization is known to play a critical role in protein folding, splicing, cell signaling and transmembrane active transport. Accurate prediction of proline cis/trans isomerization in proteins would have many important applications towards the understanding of protein structure and function. In this paper, we propose a new approach to predict the proline cis/trans isomerization in proteins using support vector machine (SVM). The preliminary results indicated that using Radial Basis Function (RBF) kernels could lead to better prediction performance than that of polynomial and linear kernel functions. We used single sequence information of different local window sizes, amino acid compositions of different local sequences, multiple sequence alignment obtained from PSI-BLAST and the secondary structure information predicted by PSIPRED. We explored these different sequence encoding schemes in order to investigate their effects on the prediction performance. The training and testing of this approach was performed on a newly enlarged dataset of 2424 non-homologous proteins determined by X-Ray diffraction method using 5-fold cross-validation. Selecting the window size 11 provided the best performance for determining the proline cis/trans isomerization based on the single amino acid sequence. It was found that using multiple sequence alignments in the form of PSI-BLAST profiles could significantly improve the prediction performance, the prediction accuracy increased from 62.8% with single sequence to 69.8% and Matthews Correlation Coefficient (MCC) improved from 0.26 with single local sequence to 0.40. Furthermore, if coupled with the predicted secondary structure information by PSIPRED, our method yielded a prediction accuracy of 71.5% and MCC of 0.43, 9% and 0.17 higher than the accuracy achieved based on the singe sequence information, respectively. A new method has been developed to predict the proline cis/trans isomerization in proteins based on support vector machine, which used the single amino acid sequence with different local window sizes, the amino acid compositions of local sequence flanking centered proline residues, the position-specific scoring matrices (PSSMs) extracted by PSI-BLAST and the predicted secondary structures generated by PSIPRED. The successful application of SVM approach in this study reinforced that SVM is a powerful tool in predicting proline cis/trans isomerization in proteins and biological sequence analysis.
Composition for nucleic acid sequencing

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2008-08-26

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-06-06

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-05-30

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC

NASA Astrophysics Data System (ADS)

Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.

2000-02-01

Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.
Involvement of the ornithine decarboxylase gene in acid stress response in probiotic Lactobacillus delbrueckii UFV H2b20.

PubMed

Ferreira, A B; Oliveira, M N V de; Freitas, F S; Paiva, A D; Alfenas-Zerbini, P; Silva, D F da; Queiroz, M V de; Borges, A C; Moraes, C A de

2015-01-01

Amino acid decarboxylation is important for the maintenance of intracellular pH under acid stress. This study aims to carry out phylogenetic and expression analysis by real-time PCR of two genes that encode proteins involved in ornithine decarboxylation in Lactobacillus delbrueckii UFV H2b20 exposed to acid stress. Sequencing and phylogeny analysis of genes encoding ornithine decarboxylase and amino acid permease in L. delbrueckii UFV H2b20 showed their high sequence identity (99%) and grouping with those of L. delbrueckii subsp. bulgaricus ATCC 11842. Exposure of L. delbrueckii UFV H2b20 cells in MRS pH 3.5 for 30 and 60 min caused a significant increase in expression of the gene encoding ornithine decarboxylase (up to 8.1 times higher when compared to the control treatment). Increased expression of the ornithine decarboxylase gene demonstrates its involvement in acid stress response in L. delbrueckii UFV H2b20, evidencing that the protein encoded by that gene could be involved in intracellular pH regulation. The results obtained show ornithine decarboxylation as a possible mechanism of adaptation to an acidic environmental condition, a desirable and necessary characteristic for probiotic cultures and certainly important to the survival and persistence of the L. delbrueckii UFV H2b20 in the human gastrointestinal tract.
Nucleic acid molecules encoding isopentenyl monophosphate kinase, and methods of use

DOEpatents

Croteau, Rodney B.; Lange, Bernd M.

2001-01-01

A cDNA encoding isopentenyl monophosphate kinase (IPK) from peppermint (Mentha x piperita) has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID NO:1) is provided which codes for the expression of isopentenyl monophosphate kinase (SEQ ID NO:2), from peppermint (Mentha x piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for isopentenyl monophosphate kinase, or for a base sequence sufficiently complementary to at least a portion of isopentenyl monophosphate kinase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding isopentenyl monophosphate kinase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant isopentenyl monophosphate kinase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant isopentenyl monophosphate kinase may be used to obtain expression or enhanced expression of isopentenyl monophosphate kinase in plants in order to enhance the production of isopentenyl monophosphate kinase, or isoprenoids derived therefrom, or may be otherwise employed for the regulation or expression of isopentenyl monophosphate kinase, or the production of its products.
Acid-base chemical reaction model for nucleation rates in the polluted atmospheric boundary layer.

PubMed

Chen, Modi; Titcombe, Mari; Jiang, Jingkun; Jen, Coty; Kuang, Chongai; Fischer, Marc L; Eisele, Fred L; Siepmann, J Ilja; Hanson, David R; Zhao, Jun; McMurry, Peter H

2012-11-13

Climate models show that particles formed by nucleation can affect cloud cover and, therefore, the earth's radiation budget. Measurements worldwide show that nucleation rates in the atmospheric boundary layer are positively correlated with concentrations of sulfuric acid vapor. However, current nucleation theories do not correctly predict either the observed nucleation rates or their functional dependence on sulfuric acid concentrations. This paper develops an alternative approach for modeling nucleation rates, based on a sequence of acid-base reactions. The model uses empirical estimates of sulfuric acid evaporation rates obtained from new measurements of neutral molecular clusters. The model predicts that nucleation rates equal the sulfuric acid vapor collision rate times a prefactor that is less than unity and that depends on the concentrations of basic gaseous compounds and preexisting particles. Predicted nucleation rates and their dependence on sulfuric acid vapor concentrations are in reasonable agreement with measurements from Mexico City and Atlanta.
Molecular identification and partial sequence analysis of an aryl hydrocarbon receptor from beluga (Delphinapterus leucas)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jensen, B.A.; Hahn, M.E.

1995-12-31

The aryl hydrocarbon receptor (AhR) mediates the effects of many common and potentially toxic organic hydrocarbons, including some polychlorinated biphenyls and dioxins. Since small cetaceans often inhabit industrially polluted coastal waters, comparison of the molecular structure and function of this protein in cetaeans with other marine and mammalian species is important for evaluating the sensitivity of cetaceans to these pollutants. An AhR protein has been identified in beluga liver by photoaffinity labeling. In the present study, the authors sought to clone and sequence an AhR cDNA from beluga as a prelude to studying its structure and function, using reverse-transcription polymerasemore » chain reaction (RT-PCR) and degenerate primers, a 515 base pair fragment was amplified, cloned and sequenced, revealing homology to the PAS domain (ligand binding and dimerization region) of AhRs from terrestrial mammals. This portion of the putative beluga AhR has 82% amino acid and 81% nucleotide sequence identity to the mouse AhR, and 63% amino acid and 64% nucleotide sequence identity to an AhR from the marine fish Fundulus heteroclitus. A beluga cDNA library was synthesized and is currently being screened with the PCR-generated fragment to obtain the complete coding sequence. This is the first molecular evidence of AhR presence in cetaceans.« less
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2014-02-25

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-05-16

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-04-01

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2010-10-12

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVIII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-05-23

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl8, and the corresponding EGVIII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVIII, recombinant EGVIII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2010-10-05

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-06-06

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2009-05-05

The present invention provides an endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2013-07-16

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2012-02-14

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2015-04-14

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.

The complete amino acid sequence of echinoidin, a lectin from the coelomic fluid of the sea urchin Anthocidaris crassispina. Homologies with mammalian and insect lectins.

PubMed

Giga, Y; Ikai, A; Takahashi, K

1987-05-05

The complete amino acid sequence of echinoidin, the proposed name for a lectin from the coelomic fluid of the sea urchin Anthocidaris crassispina, has been determined by sequencing the peptides obtained from tryptic, Staphylococcus aureus V8 protease, chymotryptic, and thermolysin digestions. Echinoidin is a multimeric protein (Giga, Y., Sutoh, K., and Ikai, A. (1985) Biochemistry 24, 4461-4467) whose subunit consists of a total of 147 amino acid residues and one carbohydrate chain attached to Ser38. The molecular weight of the polypeptide without carbohydrate was calculated to be 16,671. Each polypeptide chain contains seven half-cystines, and six of them form three disulfide bonds in the single polypeptide chain (Cys3-Cys14, Cys31-Cys141, and Cys116-Cys132), while Cys2 is involved in an interpolypeptide disulfide linkage. From secondary structure prediction by the method of Chou and Fasman (Chou, P. Y., and Fasman, G. D. (1974) Biochemistry 13, 211-222) the protein appears to be rich in beta-sheet and beta-turn structures and poor in alpha-helical structure. The sequence of the COOH-terminal half of echinoidin is highly homologous to those of the COOH-terminal carbohydrate recognition portions of rat liver mannose-binding protein and several other hepatic lectins. This COOH-terminal region of echinoidin is also homologous to the central portion of the lectin from the flesh fly Sarcophaga peregrina. Moreover, echinoidin contains an Arg-Gly-Asp sequence which has been proposed to be a basic functional unit in cellular recognition proteins.
Kit for detecting nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

2001-01-01

A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the target sequence.
Low molecular weight squash trypsin inhibitors from Sechium edule seeds.

PubMed

Laure, Hélen J; Faça, Vítor M; Izumi, Clarice; Padovan, Júlio C; Greene, Lewis J

2006-02-01

Nine chromatographic components containing trypsin inhibitor activity were isolated from Sechium edule seeds by acetone fractionation, gel filtration, affinity chromatography and RP-HPLC in an overall yield of 46% of activity and 0.05% of protein. The components obtained with highest yield of total activity and highest specific activity were sequenced by Edman degradation and their molecular masses determined by mass spectrometry. The inhibitors contained 31, 32 and 27 residues per molecule and their sequences were: SETI-IIa, EDRKCPKILMRCKRDSDCLAKCTCQESGYCG; SETI-IIb, EEDRKCPKILMRCKRDSDCLAKCTCQESGYCG and SETI-V, CPRILMKCKLDTDCFPTCTCRPSGFCG. SETI-IIa and SETI-IIb, which differed by an amino-terminal E in the IIb form, were not separable under the conditions employed. The sequences are consistent with consensus sequences obtained from 37 other inhibitors: CPriI1meCk_DSDCla_C_C_G_CG, where capital letters are invariant amino acid residues and lower case letters are the most preserved in this position. SETI-II and SETI-V form complexes with trypsin with a 1:1 stoichiometry and have dissociation constants of 5.4x10(-11)M and 1.1x10(-9)M, respectively.
Neisseria arctica sp. nov. isolated from nonviable eggs of greater white-fronted geese (Anser albifrons) in Arctic Alaska

USGS Publications Warehouse

Hansen, Cristina M.; Himschoot, Elizabeth; Hare, Rebekah F.; Meixell, Brandt W.; Van Hemert, Caroline R.; Hueffer, Karsten

2017-01-01

During the summers of 2013 and 2014, isolates of a novel Gram-negative coccus in the Neisseria genus were obtained from the contents of nonviable greater white-fronted goose (Anser albifrons) eggs on the Arctic Coastal Plain of Alaska. We used a polyphasic approach to determine whether these isolates represent a novel species. 16S rRNA gene sequences, 23S rRNA gene sequences, and chaperonin 60 gene sequences suggested that these Alaskan isolates are members of a distinct species that is most closely related to Neisseria canis, N. animaloris, and N. shayeganii. Analysis of the rplF gene additionally showed that our isolates are unique and most closely related to N. weaveri. Average nucleotide identity of the whole genome sequence of our type strain was between 71.5% and 74.6% compared to close relatives, further supporting designation as a novel species. Fatty acid methyl ester analysis showed a predominance of C14:0, C16:0, and C16:1ω7c fatty acids. Finally, biochemical characteristics distinguished our isolates from other Neisseria species. The name Neisseria arctica (type strain KH1503T = ATCC TSD-57T = DSM 103136T) is proposed.
Structurally detailed coarse-grained model for Sec-facilitated co-translational protein translocation and membrane integration

PubMed Central

Miller, Thomas F.

2017-01-01

We present a coarse-grained simulation model that is capable of simulating the minute-timescale dynamics of protein translocation and membrane integration via the Sec translocon, while retaining sufficient chemical and structural detail to capture many of the sequence-specific interactions that drive these processes. The model includes accurate geometric representations of the ribosome and Sec translocon, obtained directly from experimental structures, and interactions parameterized from nearly 200 μs of residue-based coarse-grained molecular dynamics simulations. A protocol for mapping amino-acid sequences to coarse-grained beads enables the direct simulation of trajectories for the co-translational insertion of arbitrary polypeptide sequences into the Sec translocon. The model reproduces experimentally observed features of membrane protein integration, including the efficiency with which polypeptide domains integrate into the membrane, the variation in integration efficiency upon single amino-acid mutations, and the orientation of transmembrane domains. The central advantage of the model is that it connects sequence-level protein features to biological observables and timescales, enabling direct simulation for the mechanistic analysis of co-translational integration and for the engineering of membrane proteins with enhanced membrane integration efficiency. PMID:28328943
Extracting features from protein sequences to improve deep extreme learning machine for protein fold recognition.

PubMed

Ibrahim, Wisam; Abadeh, Mohammad Saniee

2017-05-21

Protein fold recognition is an important problem in bioinformatics to predict three-dimensional structure of a protein. One of the most challenging tasks in protein fold recognition problem is the extraction of efficient features from the amino-acid sequences to obtain better classifiers. In this paper, we have proposed six descriptors to extract features from protein sequences. These descriptors are applied in the first stage of a three-stage framework PCA-DELM-LDA to extract feature vectors from the amino-acid sequences. Principal Component Analysis PCA has been implemented to reduce the number of extracted features. The extracted feature vectors have been used with original features to improve the performance of the Deep Extreme Learning Machine DELM in the second stage. Four new features have been extracted from the second stage and used in the third stage by Linear Discriminant Analysis LDA to classify the instances into 27 folds. The proposed framework is implemented on the independent and combined feature sets in SCOP datasets. The experimental results show that extracted feature vectors in the first stage could improve the performance of DELM in extracting new useful features in second stage. Copyright © 2017 Elsevier Ltd. All rights reserved.
Structure characterization of lipocyclopeptide antibiotics, aspartocins A, B & C, by ESI-MSMS and ESI-nozzle-skimmer-MSMS.

PubMed

Siegel, Marshall M; Kong, Fangming; Feng, Xidong; Carter, Guy T

2009-12-01

Three lipocyclopeptide antibiotics, aspartocins A (1), B (2), and C (3), were obtained from the aspartocin complex by HPLC separation methodology. Their structures were elucidated using previously published chemical degradation results coupled with spectroscopic studies including ESI-MS, ESI-Nozzle Skimmer-MSMS and NMR. All three aspartocin compounds share the same cyclic decapeptide core of cyclo [Dab2 (Asp1-FA)-Pip3-MeAsp4-Asp5-Gly6-Asp7-Gly8-Dab9-Val10-Pro11]. They differ only in the fatty acid side chain moiety (FA) corresponding to (Z)-13-methyltetradec-3-ene-carbonyl, (+,Z)-12-methyltetradec-3-ene-carbonyl and (Z)-12-methyltridec-3-ene-carbonyl for aspartocins A (1), B (2), and C (3), respectively. All of the sequence ions were observed by ESI-MSMS of the doubly charged parent ions. However, a number of the sequence ions observed were of low abundance. To fully sequence the lipocyclopeptide antibiotic structures, these low abundance sequence ions together with complementary sequence ions were confirmed by ESI-Nozzle-Skimmer-MSMS of the singly charged linear peptide parent fragment ions H-Asp5-Gly6-Asp7-Gly8-Dab9-Val10-Pro11-Dab2(1+)-Asp1-FA. Cyclization of the aspartocins was demonstrated to occur via the beta-amino group of Dab2 from ions of moderate intensity in the ESI-MSMS spectra. As the fatty acid moieties do not undergo internal fragmentations under the experimental ESI mass spectral conditions used, the 14 Da mass difference between the fatty acid moieties of aspartocins A (1) and B (2) versus aspartocin C (3) was used as an internal mass tag to differentiate fragment ions containing fatty acid moieties and those not containing the fatty acid moieties. The most numerous and abundant fragment ions observed in the tandem mass spectra are due to the cleavage of the tertiary nitrogen amide of the pipecolic acid residue-3 (16 fragment ions) and the proline residue-11 (7 fragment ions). In addition, the neutral loss of ethanimine from alpha,beta-diaminobutyric acid residue 9 was observed for the parent molecular ion and for 7 fragment ions. Copyright 2009 John Wiley & Sons, Ltd.
Chip-based sequencing nucleic acids

DOEpatents

Beer, Neil Reginald

2014-08-26

A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.
"De-novo" amino acid sequence elucidation of protein G'e by combined "top-down" and "bottom-up" mass spectrometry.

PubMed

Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F M; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L; Glocker, Michael O

2015-03-01

Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein G´ with great scientific and economic importance. Substantial deviations to the published amino acid sequence (Uniprot Q54181) were found by the presence of 46 additional amino acids at the N-terminus, including a so-called "His-tag" as well as an N-terminal partial α-N-gluconoylation and α-N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α-N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to determine the expression vector pET-28b from Novagen with the Xho I restriction enzyme cleavage site as the best option that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant (K(d)) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.
Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

PubMed Central

Thomsen, Martin Christen Frølund; Nielsen, Morten

2012-01-01

Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed). PMID:22638583
Comparative characterization of random-sequence proteins consisting of 5, 12, and 20 kinds of amino acids

PubMed Central

Tanaka, Junko; Doi, Nobuhide; Takashima, Hideaki; Yanagawa, Hiroshi

2010-01-01

Screening of functional proteins from a random-sequence library has been used to evolve novel proteins in the field of evolutionary protein engineering. However, random-sequence proteins consisting of the 20 natural amino acids tend to aggregate, and the occurrence rate of functional proteins in a random-sequence library is low. From the viewpoint of the origin of life, it has been proposed that primordial proteins consisted of a limited set of amino acids that could have been abundantly formed early during chemical evolution. We have previously found that members of a random-sequence protein library constructed with five primitive amino acids show high solubility (Doi et al., Protein Eng Des Sel 2005;18:279–284). Although such a library is expected to be appropriate for finding functional proteins, the functionality may be limited, because they have no positively charged amino acid. Here, we constructed three libraries of 120-amino acid, random-sequence proteins using alphabets of 5, 12, and 20 amino acids by preselection using mRNA display (to eliminate sequences containing stop codons and frameshifts) and characterized and compared the structural properties of random-sequence proteins arbitrarily chosen from these libraries. We found that random-sequence proteins constructed with the 12-member alphabet (including five primitive amino acids and positively charged amino acids) have higher solubility than those constructed with the 20-member alphabet, though other biophysical properties are very similar in the two libraries. Thus, a library of moderate complexity constructed from 12 amino acids may be a more appropriate resource for functional screening than one constructed from 20 amino acids. PMID:20162614
Using One's Hands for Naming Optical Isomers and Other Stereochemical Positions.

ERIC Educational Resources Information Center

Mezl, Vasek A.

1996-01-01

Presents a method that allows students to use their hands to obtain the stereochemistry of chiral centers without redrawing the structure. Discusses the use of the model in: determining the configurations of amino acids, determining if sugars are D or L isomers, the sequence rule procedure, prochirality, naming the sides of trigonal carbons, and…
Draft Genome Sequence of a Novel Thermofilum sp. Strain from a New Zealand Hot Spring Enrichment Culture

DOE PAGES

Reysenbach, Anna-Louise; Donaho, John; Hinsch, Todd; ...

2018-02-22

A draft genome of a newThermofilumsp. strain was obtained from an enrichment culture metagenome. Like its relatives,Thermofilumsp. strain NZ13 is adapted to organic-rich thermal environments and has to depend on other organisms and the environment for some key amino acids, purines, and cofactors.
Draft Genome Sequence of a Novel Thermofilum sp. Strain from a New Zealand Hot Spring Enrichment Culture

DOE Office of Scientific and Technical Information (OSTI.GOV)

Reysenbach, Anna-Louise; Donaho, John; Hinsch, Todd

A draft genome of a newThermofilumsp. strain was obtained from an enrichment culture metagenome. Like its relatives,Thermofilumsp. strain NZ13 is adapted to organic-rich thermal environments and has to depend on other organisms and the environment for some key amino acids, purines, and cofactors.
The Genome Sequence of the Leaf-Cutter Ant Atta cephalotes Reveals Insights into Its Obligate Symbiotic Lifestyle

PubMed Central

Suen, Garret; Holt, Carson; Abouheif, Ehab; Bornberg-Bauer, Erich; Bouffard, Pascal; Caldera, Eric J.; Cash, Elizabeth; Cavanaugh, Amy; Denas, Olgert; Elhaik, Eran; Favé, Marie-Julie; Gadau, Jürgen; Gibson, Joshua D.; Graur, Dan; Grubbs, Kirk J.; Hagen, Darren E.; Harkins, Timothy T.; Helmkampf, Martin; Hu, Hao; Johnson, Brian R.; Kim, Jay; Marsh, Sarah E.; Moeller, Joseph A.; Muñoz-Torres, Mónica C.; Murphy, Marguerite C.; Naughton, Meredith C.; Nigam, Surabhi; Overson, Rick; Rajakumar, Rajendhran; Reese, Justin T.; Scott, Jarrod J.; Smith, Chris R.; Tao, Shu; Tsutsui, Neil D.; Viljakainen, Lumi; Wissler, Lothar; Yandell, Mark D.; Zimmer, Fabian; Taylor, James; Slater, Steven C.; Clifton, Sandra W.; Warren, Wesley C.; Elsik, Christine G.; Smith, Christopher D.; Weinstock, George M.; Gerardo, Nicole M.; Currie, Cameron R.

2011-01-01

Leaf-cutter ants are one of the most important herbivorous insects in the Neotropics, harvesting vast quantities of fresh leaf material. The ants use leaves to cultivate a fungus that serves as the colony's primary food source. This obligate ant-fungus mutualism is one of the few occurrences of farming by non-humans and likely facilitated the formation of their massive colonies. Mature leaf-cutter ant colonies contain millions of workers ranging in size from small garden tenders to large soldiers, resulting in one of the most complex polymorphic caste systems within ants. To begin uncovering the genomic underpinnings of this system, we sequenced the genome of Atta cephalotes using 454 pyrosequencing. One prediction from this ant's lifestyle is that it has undergone genetic modifications that reflect its obligate dependence on the fungus for nutrients. Analysis of this genome sequence is consistent with this hypothesis, as we find evidence for reductions in genes related to nutrient acquisition. These include extensive reductions in serine proteases (which are likely unnecessary because proteolysis is not a primary mechanism used to process nutrients obtained from the fungus), a loss of genes involved in arginine biosynthesis (suggesting that this amino acid is obtained from the fungus), and the absence of a hexamerin (which sequesters amino acids during larval development in other insects). Following recent reports of genome sequences from other insects that engage in symbioses with beneficial microbes, the A. cephalotes genome provides new insights into the symbiotic lifestyle of this ant and advances our understanding of host–microbe symbioses. PMID:21347285
BGL7 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2013-01-29

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2012-10-02

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL5 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-02-28

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL5 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-03-18

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dunn-Coleman, Nigel; Ward, Michael

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.

BGL6 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2014-03-04

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2015-04-14

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2014-03-25

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2015-08-11

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2007-09-25

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-04-01

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2011-12-06

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL4 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-05-16

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2011-06-14

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Ward, Michael [San Francisco, CA

2009-09-01

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2012-10-30

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-01-22

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
Fluorimetric determinations of nucleic acids using iron, osmium and samarium complexes of 4,7-diphenyl-1,10-phenanthroline

NASA Astrophysics Data System (ADS)

Salem, A. A.

2006-09-01

New sensitive, reliable and reproducible fluorimetric methods for determining microgram amounts of nucleic acids based on their reactions with Fe(II), Os(III) or Sm(III) complexes of 4,7-diphenyl-1,10-phenanthroline are proposed. Two complementary single stranded synthetic DNA sequences based on calf thymus as well as their hybridized double stranded were used. Nucleic acids were found to react instantaneously at room temperature in Tris-Cl buffer pH 7, with the investigated complexes resulting in decreasing their fluorescence emission. Two fluorescence peaks around 388 and 567 nm were obtained for the three complexes using excitation λmax of 280 nm and were used for this investigation. Linear calibration graphs in the range 1-6 μg/ml were obtained. Detection limits of 0.35-0.98 μg/ml were obtained. Using the calibration graphs for the synthetic dsDNA, relative standard deviations of 2.0-5.0% were obtained for analyzing DNA in the extraction products from calf thymus and human blood. Corresponding Recovery% of 80-114 were obtained. Student's t-values at 95% confidence level showed insignificant difference between the real and measured values. Results obtained by these methods were compared with the ethidium bromide method using the F-test and satisfactory results were obtained. The association constants and number of binding sites of synthetic ssDNA and dsDNA with the three complexes were estimated using Rosenthanl graphic method. The interaction mechanism was discussed and an intercalation mechanism was suggested for the binding reaction between nucleic acids and the three complexes.
Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes.

PubMed

Chou, Kuo-Chen

2005-01-01

With protein sequences entering into databanks at an explosive pace, the early determination of the family or subfamily class for a newly found enzyme molecule becomes important because this is directly related to the detailed information about which specific target it acts on, as well as to its catalytic process and biological function. Unfortunately, it is both time-consuming and costly to do so by experiments alone. In a previous study, the covariant-discriminant algorithm was introduced to identify the 16 subfamily classes of oxidoreductases. Although the results were quite encouraging, the entire prediction process was based on the amino acid composition alone without including any sequence-order information. Therefore, it is worthy of further investigation. To incorporate the sequence-order effects into the predictor, the 'amphiphilic pseudo amino acid composition' is introduced to represent the statistical sample of a protein. The novel representation contains 20 + 2lambda discrete numbers: the first 20 numbers are the components of the conventional amino acid composition; the next 2lambda numbers are a set of correlation factors that reflect different hydrophobicity and hydrophilicity distribution patterns along a protein chain. Based on such a concept and formulation scheme, a new predictor is developed. It is shown by the self-consistency test, jackknife test and independent dataset tests that the success rates obtained by the new predictor are all significantly higher than those by the previous predictors. The significant enhancement in success rates also implies that the distribution of hydrophobicity and hydrophilicity of the amino acid residues along a protein chain plays a very important role to its structure and function.
Microbial Community Structure and Arsenic Biogeochemistry in an Acid Vapor-Formed Spring in Tengchong Geothermal Area, China.

PubMed

Jiang, Zhou; Li, Ping; Jiang, Dawei; Dai, Xinyue; Zhang, Rui; Wang, Yanhong; Wang, Yanxin

2016-01-01

Arsenic biogeochemistry has been studied extensively in acid sulfate-chloride hot springs, but not in acid sulfate hot springs with low chloride. In this study, Zhenzhuquan in Tengchong geothermal area, a representative acid sulfate hot spring with low chloride, was chosen to study arsenic geochemistry and microbial community structure using Illumina MiSeq sequencing. Over 0.3 million 16S rRNA sequence reads were obtained from 6-paired parallel water and sediment samples along its outflow channel. Arsenic oxidation occurred in the Zhenxhuquan pool, with distinctly high ratios of arsenate to total dissolved arsenic (0.73-0.86). Coupled with iron and sulfur oxidation along the outflow channel, arsenic accumulated in downstream sediments with concentrations up to 16.44 g/kg and appeared to significantly constrain their microbial community diversity. These oxidations might be correlated with the appearance of some putative functional microbial populations, such as Aquificae and Pseudomonas (arsenic oxidation), Sulfolobus (sulfur and iron oxidation), Metallosphaera and Acidicaldus (iron oxidation). Temperature, total organic carbon and dissolved oxygen significantly shaped the microbial community structure of upstream and downstream samples. In the upstream outflow channel region, most microbial populations were microaerophilic/anaerobic thermophiles and hyperthermophiles, such as Sulfolobus, Nocardia, Fervidicoccus, Delftia, and Ralstonia. In the downstream region, aerobic heterotrophic mesophiles and thermophiles were identified, including Ktedonobacteria, Acidicaldus, Chthonomonas and Sphingobacteria. A total of 72.41-95.91% unassigned-genus sequences were derived from the downstream high arsenic sediments 16S rRNA clone libraries. This study could enable us to achieve an integrated understanding on arsenic biogeochemistry in acid hot springs.
Molecular characterization of infectious bursal disease virus isolates from Nepal based on hypervariable region of VP2 gene.

PubMed

Sharma, K; Hair-Bejo, M; Omar, A R; Aini, I

2005-01-01

Two Infectious bursal disease virus (IBDV) isolates, NP1SSH and NP2K were obtained from a severe infectious bursal disease (IBD) outbreak in Nepal in 2002. The hypervariable (HV) region of VP2 gene (1326 bp) of the isolates was generated by RT-PCR and sequenced. The obtained nucleotide sequences were compared with those of twenty other IBDV isolates/strains. Phylogenetic analysis based on this comparison revealed that NP1SSH and NP2K clustered with very virulent (vv) IBDV strains of serotype 1. In contrast, classical, Australian classical and attenuated strains of serotype 1 and avirulent IBDV strains of serotype 2 formed a different cluster. The deduced amino acid sequences of the two isolates showed a 98.3% identity with each other and 97.1% and 98.3% identities, respectively with very virulent IBDV (vvIBDV) isolates/strains. Three amino acids substitutions at positions 300 (E-->A), 308 (I-->F) and 334 (A-->P) within the HV region were common for both the isolates. The amino acids substitutions at positions 27 (S-->T), 28 (I-->T), 31 (D-->A), 36 (H-->Y), 135 (E-->G), 223 (G-->S), 225 (V-->I), 351 (L-->I), 352 (V-->E) and 399 (I-->S) for NP1SSH and at position 438 (I-->S) for NP2K were unique and differed from other IBDV isolates/strains. NP1SSH and NP2K showed highest similarity (97.8%) with the BD399 strain from Bangladesh as compared with other vvIBDV isolates/strains. We conclude that the NP1SSH and NP2K isolates of IBDV from Nepal represent vvIBDV of serotype 1.
The primary structure of rat liver ribosomal protein L37. Homology with yeast and bacterial ribosomal proteins.

PubMed

Lin, A; McNally, J; Wool, I G

1983-09-10

The covalent structure of the rat liver 60 S ribosomal subunit protein L37 was determined. Twenty-four tryptic peptides were purified and the sequence of each was established; they accounted for all 111 residues of L37. The sequence of the first 30 residues of L37, obtained previously by automated Edman degradation of the intact protein, provided the alignment of the first 9 tryptic peptides. Three peptides (CN1, CN2, and CN3) were produced by cleavage of protein L37 with cyanogen bromide. The sequence of CN1 (65 residues) was established from the sequence of secondary peptides resulting from cleavage with trypsin and chymotrypsin. The sequence of CN1 in turn served to order tryptic peptides 1 through 14. The sequence of CN2 (15 residues) was determined entirely by a micromanual procedure and allowed the alignment of tryptic peptides 14 through 18. The sequence of the NH2-terminal 28 amino acids of CN3 (31 residues) was determined; in addition the complete sequences of the secondary tryptic and chymotryptic peptides were done. The sequence of CN3 provided the order of tryptic peptides 18 through 24. Thus the sequence of the three cyanogen bromide peptides also accounted for the 111 residues of protein L37. The carboxyl-terminal amino acids were identified after carboxypeptidase A treatment. There is a disulfide bridge between half-cystinyl residues at positions 40 and 69. Rat liver ribosomal protein L37 is homologous with yeast YP55 and with Escherichia coli L34. Moreover, there is a segment of 17 residues in rat L37 that occurs, albeit with modifications, in yeast YP55 and in E. coli S4, L20, and L34.
Methods and compositions for efficient nucleic acid sequencing

DOEpatents

Drmanac, Radoje

2006-07-04

Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.
Methods and compositions for efficient nucleic acid sequencing

DOEpatents

Drmanac, Radoje

2002-01-01

Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.
Population genomic analysis of strain variation in Leptospirillum group II bacteria involved in acid mine drainage formation.

PubMed

Simmons, Sheri L; Dibartolo, Genevieve; Denef, Vincent J; Goltsman, Daniela S Aliaga; Thelen, Michael P; Banfield, Jillian F

2008-07-22

Deeply sampled community genomic (metagenomic) datasets enable comprehensive analysis of heterogeneity in natural microbial populations. In this study, we used sequence data obtained from the dominant member of a low-diversity natural chemoautotrophic microbial community to determine how coexisting closely related individuals differ from each other in terms of gene sequence and gene content, and to uncover evidence of evolutionary processes that occur over short timescales. DNA sequence obtained from an acid mine drainage biofilm was reconstructed, taking into account the effects of strain variation, to generate a nearly complete genome tiling path for a Leptospirillum group II species closely related to L. ferriphilum (sampling depth approximately 20x). The population is dominated by one sequence type, yet we detected evidence for relatively abundant variants (>99.5% sequence identity to the dominant type) at multiple loci, and a few rare variants. Blocks of other Leptospirillum group II types ( approximately 94% sequence identity) have recombined into one or more variants. Variant blocks of both types are more numerous near the origin of replication. Heterogeneity in genetic potential within the population arises from localized variation in gene content, typically focused in integrated plasmid/phage-like regions. Some laterally transferred gene blocks encode physiologically important genes, including quorum-sensing genes of the LuxIR system. Overall, results suggest inter- and intrapopulation genetic exchange involving distinct parental genome types and implicate gain and loss of phage and plasmid genes in recent evolution of this Leptospirillum group II population. Population genetic analyses of single nucleotide polymorphisms indicate variation between closely related strains is not maintained by positive selection, suggesting that these regions do not represent adaptive differences between strains. Thus, the most likely explanation for the observed patterns of polymorphism is divergence of ancestral strains due to geographic isolation, followed by mixing and subsequent recombination.

Population Genomic Analysis of Strain Variation in Leptospirillum Group II Bacteria Involved in Acid Mine Drainage Formation

PubMed Central

Denef, Vincent J; Goltsman, Daniela S. Aliaga; Thelen, Michael P; Banfield, Jillian F

2008-01-01

Deeply sampled community genomic (metagenomic) datasets enable comprehensive analysis of heterogeneity in natural microbial populations. In this study, we used sequence data obtained from the dominant member of a low-diversity natural chemoautotrophic microbial community to determine how coexisting closely related individuals differ from each other in terms of gene sequence and gene content, and to uncover evidence of evolutionary processes that occur over short timescales. DNA sequence obtained from an acid mine drainage biofilm was reconstructed, taking into account the effects of strain variation, to generate a nearly complete genome tiling path for a Leptospirillum group II species closely related to L. ferriphilum (sampling depth ∼20×). The population is dominated by one sequence type, yet we detected evidence for relatively abundant variants (>99.5% sequence identity to the dominant type) at multiple loci, and a few rare variants. Blocks of other Leptospirillum group II types (∼94% sequence identity) have recombined into one or more variants. Variant blocks of both types are more numerous near the origin of replication. Heterogeneity in genetic potential within the population arises from localized variation in gene content, typically focused in integrated plasmid/phage-like regions. Some laterally transferred gene blocks encode physiologically important genes, including quorum-sensing genes of the LuxIR system. Overall, results suggest inter- and intrapopulation genetic exchange involving distinct parental genome types and implicate gain and loss of phage and plasmid genes in recent evolution of this Leptospirillum group II population. Population genetic analyses of single nucleotide polymorphisms indicate variation between closely related strains is not maintained by positive selection, suggesting that these regions do not represent adaptive differences between strains. Thus, the most likely explanation for the observed patterns of polymorphism is divergence of ancestral strains due to geographic isolation, followed by mixing and subsequent recombination. PMID:18651792
Complementary DNA cloning of the pear 1-aminocyclopropane-1-carboxylic acid oxidase gene and agrobacterium-mediated anti-sense genetic transformation.

PubMed

Qi, Jing; Dong, Zhen; Zhang, Yu-Xing

2015-12-01

The aim of the present study was to genetically modify plantlets of the Chinese yali pear to reduce their expression of ripening-associated 1-aminocyclopropane-1-carboxylic acid oxidase (ACO) and therefore increase the shelf-life of the fruit. Primers were designed with selectivity for the conserved regions of published ACO gene sequences, and yali complementary DNA (cDNA) cloning was performed by reverse transcription quantitative polymerase chain reaction (PCR). The obtained cDNA fragment contained 831 base pairs, encoding 276 amino acid residues, and shared no less than 94% nucleotide sequence identity with other published ACO genes. The cDNA fragment was inversely inserted into a pBI121 expression vector, between the cauliflower mosaic virus 35S promoter and the nopaline synthase terminator, in order to construct the anti‑sense expression vector of the ACO gene; it was transfected into cultured yali plants using Agrobacterium LBA4404. Four independent transgenic lines of pear plantlets were obtained and validated by PCR analysis. A Southern blot assay revealed that there were three transgenic lines containing a single copy of exogenous gene and one line with double copies. The present study provided germplasm resources for the cultivation of novel storage varieties of pears, therefore providing a reference for further applications of anti‑sense RNA technology in the genetic improvement of pears and other fruit.
Relevance and Diversity of Nitrospira Populations in Biofilters of Brackish RAS

PubMed Central

Kruse, Myriam; Keuter, Sabine; Bakker, Evert; Spieck, Eva; Eggers, Till; Lipski, André

2013-01-01

Lithoautotrophic nitrite-oxidizing bacterial populations from moving-bed biofilters of brackish recirculation aquaculture systems (RAS; shrimp and barramundi) were tested for their metabolic activity and phylogenetic diversity. Samples from the biofilters were labeled with 13C-bicarbonate and supplemented with nitrite at concentrations of 0.3, 3 and 10 mM, and incubated at 17 and 28°C, respectively. The biofilm material was analyzed by fatty acid methyl ester - stable isotope probing (FAME-SIP). High portions of up to 45% of Nitrospira-related labeled lipid markers were found confirming that Nitrospira is the major autotrophic nitrite oxidizer in these brackish systems with high nitrogen loads. Other nitrite-oxidizing bacteria such as Nitrobacter or Nitrotoga were functionally not relevant in the investigated biofilters. Nitrospira-related 16S rRNA gene sequences were obtained from the samples with 10 mM nitrite and analyzed by a cloning approach. Sequence studies revealed four different phylogenetic clusters within the marine sublineage IV of Nitrospira, though most sequences clustered with the type strain of Nitrospira marina and with a strain isolated from a marine RAS. Three lipids dominated the whole fatty acid profiles of nitrite-oxidizing marine and brackish enrichments of Nitrospira sublineage IV organisms. The membranes included two marker lipids (16∶1 cis7 and 16∶1 cis11) combined with the non-specific acid 16∶0 as major compounds and confirmed these marker lipids as characteristic for sublineage IV species. The predominant labeling of these characteristic fatty acids and the phylogenetic sequence analyses of the marine Nitrospira sublineage IV identified organisms of this sublineage as main autotrophic nitrite-oxidizers in the investigated brackish biofilter systems. PMID:23705006
Hybridization and sequencing of nucleic acids using base pair mismatches

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2001-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Cloning and expression of a conjugated bile acid hydrolase gene from Lactobacillus plantarum by using a direct plate assay.

PubMed

Christiaens, H; Leer, R J; Pouwels, P H; Verstraete, W

1992-12-01

The conjugated bile acid hydrolase gene from the silage isolate Lactobacillus plantarum 80 was cloned and expressed in Escherichia coli MC1061. For the screening of this hydrolase gene within the gene bank, a direct plate assay developed by Dashkevicz and Feighner (M. P. Dashkevicz and S. D. Feighner, Appl. Environ. Microbiol. 53:331-336, 1989) was adapted to the growth requirements of E. coli. Because of hydrolysis and medium acidification, hydrolase-active colonies were surrounded with big halos of precipitated, free bile acids. This phenomenon was also obtained when the gene was cloned into a multicopy shuttle vector and subsequently reintroduced into the parental Lactobacillus strain. The cbh gene and surrounding regions were characterized by nucleotide sequence analysis. The deduced amino acid sequence was shown to have 52% similarity with a penicillin V amidase from Bacillus sphaericus. Preliminary characterization of the gene product showed that it is a cholylglycine hydrolase (EC 3.5.1.24) with only slight activity against taurine conjugates. The optimum pH was between 4.7 and 5.5. Optimum temperature ranged from 30 to 45 degrees C. Southern blot analysis indicated that the cloned gene has similarity with genomic DNA of bile acid hydrolase-active Lactobacillus spp. of intestinal origin.
Human jagged polypeptide, encoding nucleic acids and methods of use

DOEpatents

Li, Linheng; Hood, Leroy

2000-01-01

The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
Epitope mapping of the variable repetitive region with the MB antigen of Ureaplasma urealyticum.

PubMed Central

Zheng, X; Lau, K; Frazier, M; Cassell, G H; Watson, H L

1996-01-01

One of the major surface structures of Ureaplasma urealyticum recognized by antibodies of patients during infection is the MB antigen. Previously, we showed by Western blot (immunoblot) analysis that any one of the anti-MB monoclonal antibodies (MAbs) 3B1.5, 5B1.1, and 10C6.6 could block the binding of patient antibodies to MB. Subsequent DNA sequencing revealed that a unique six-amino-acid direct tandem repeat region composed the carboxy two-thirds of this antigen. In the present study, using antibody-reactive peptide scanning of this repeat region, we demonstrated that the amino acids defining the epitopes for MAbs 3B1.5 5B1.1 and 10C6.6 are EQP, GK, and KEQPA, respectively. Peptide scanning analysis of an infected patient's serum antibody response showed that the dominant epitope was defined by the sequence PAGK. Mapping of these continuous epitopes revealed overlap between all MAb and patient polyclonal antibody binding sites, thus explaining the ability of a single MAb to apparently block all polyclonal antibody binding sites. We also show that a single amino acid difference in the sequence of the repeats of serovars 3 and 14 accounts for the lack of reactivity with serovar 14 of two of the serovar 3-specific MAbs. Finally, the data demonstrate the need to obtain the sequences of the mba genes of all serovars before an effective serovar-specific antibody detection method can be developed. PMID:8914774
Characterization of the cDNA coding for rat brain cysteine sulfinate decarboxylase: brain and liver enzymes are identical proteins encoded by two distinct mRNAs.

PubMed

Tappaz, M; Bitoun, M; Reymond, I; Sergeant, A

1999-09-01

Cysteine sulfinate decarboxylase (CSD) is considered as the rate-limiting enzyme in the biosynthesis of taurine, a possible osmoregulator in brain. Through cloning and sequencing of RT-PCR and RACE-PCR products of rat brain mRNAs, a 2,396-bp cDNA sequence was obtained encoding a protein of 493 amino acids (calculated molecular mass, 55.2 kDa). The corresponding fusion protein showed a substrate specificity similar to that of the endogenous enzyme. The sequence of the encoded protein is identical to that encoded by liver CSD cDNA. Among other characterized amino acid decarboxylases, CSD shows the highest homology (54%) with either isoform of glutamic acid decarboxylase (GAD65 and GAD67). A single mRNA band, approximately 2.5 kb, was detected by northern blot in RNA extracts of brain, liver, and kidney. However, brain and liver CSD cDNA sequences differed in the 5' untranslated region. This indicates two forms of CSD mRNA. Analysis of PCR-amplified products of genomic DNA suggests that the brain form results from the use of a 3' alternative internal splicing site within an exon specifically found in liver CSD mRNA. Through selective RT-PCR the brain form was detected in brain only, whereas the liver form was found in liver and kidney. These results indicate a tissue-specific regulation of CSD genomic expression.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Denef, Vincent; Shah, Manesh B; Verberkmoes, Nathan C

The recent surge in microbial genomic sequencing, combined with the development of high-throughput liquid chromatography-mass-spectrometry-based (LC/LC-MS/MS) proteomics, has raised the question of the extent to which genomic information of one strain or environmental sample can be used to profile proteomes of related strains or samples. Even with decreasing sequencing costs, it remains impractical to obtain genomic sequence for every strain or sample analyzed. Here, we evaluate how shotgun proteomics is affected by amino acid divergence between the sample and the genomic database using a probability-based model and a random mutation simulation model constrained by experimental data. To assess the effectsmore » of nonrandom distribution of mutations, we also evaluated identification levels using in silico peptide data from sequenced isolates with average amino acid identities (AAI) varying between 76 and 98%. We compared the predictions to experimental protein identification levels for a sample that was evaluated using a database that included genomic information for the dominant organism and for a closely related variant (95% AAI). The range of models set the boundaries at which half of the proteins in a proteomic experiment can be identified to be 77-92% AAI between orthologs in the sample and database. Consistent with this prediction, experimental data indicated loss of half the identifiable proteins at 90% AAI. Additional analysis indicated a 6.4% reduction of the initial protein coverage per 1% amino acid divergence and total identification loss at 86% AAI. Consequently, shotgun proteomics is capable of cross-strain identifications but avoids most crossspecies false positives.« less
Detection of Bacillus anthracis DNA in Complex Soil and Air Samples Using Next-Generation Sequencing

PubMed Central

Be, Nicholas A.; Thissen, James B.; Gardner, Shea N.; McLoughlin, Kevin S.; Fofanov, Viacheslav Y.; Koshinsky, Heather; Ellingson, Sally R.; Brettin, Thomas S.; Jackson, Paul J.; Jaing, Crystal J.

2013-01-01

Bacillus anthracis is the potentially lethal etiologic agent of anthrax disease, and is a significant concern in the realm of biodefense. One of the cornerstones of an effective biodefense strategy is the ability to detect infectious agents with a high degree of sensitivity and specificity in the context of a complex sample background. The nature of the B. anthracis genome, however, renders specific detection difficult, due to close homology with B. cereus and B. thuringiensis. We therefore elected to determine the efficacy of next-generation sequencing analysis and microarrays for detection of B. anthracis in an environmental background. We applied next-generation sequencing to titrated genome copy numbers of B. anthracis in the presence of background nucleic acid extracted from aerosol and soil samples. We found next-generation sequencing to be capable of detecting as few as 10 genomic equivalents of B. anthracis DNA per nanogram of background nucleic acid. Detection was accomplished by mapping reads to either a defined subset of reference genomes or to the full GenBank database. Moreover, sequence data obtained from B. anthracis could be reliably distinguished from sequence data mapping to either B. cereus or B. thuringiensis. We also demonstrated the efficacy of a microbial census microarray in detecting B. anthracis in the same samples, representing a cost-effective and high-throughput approach, complementary to next-generation sequencing. Our results, in combination with the capacity of sequencing for providing insights into the genomic characteristics of complex and novel organisms, suggest that these platforms should be considered important components of a biosurveillance strategy. PMID:24039948
Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2016-02-16

The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well asmore » the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.« less
Polypeptide having swollenin activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elizabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius

2015-11-04

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius

2015-09-01

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having cellobiohydrolase activity and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-09-15

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having acetyl xylan esterase activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having carbohydrate degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius

2015-08-18

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
A Generalized Michaelis-Menten Equation in Protein Synthesis: Effects of Mis-Charged Cognate tRNA and Mis-Reading of Codon.

PubMed

Dutta, Annwesha; Chowdhury, Debashish

2017-05-01

The sequence of amino acid monomers in the primary structure of a protein is decided by the corresponding sequence of codons (triplets of nucleic acid monomers) on the template messenger RNA (mRNA). The polymerization of a protein, by incorporation of the successive amino acid monomers, is carried out by a molecular machine called ribosome. We develop a stochastic kinetic model that captures the possibilities of mis-reading of mRNA codon and prior mis-charging of a tRNA. By a combination of analytical and numerical methods, we obtain the distribution of the times taken for incorporation of the successive amino acids in the growing protein in this mathematical model. The corresponding exact analytical expression for the average rate of elongation of a nascent protein is a 'biologically motivated' generalization of the Michaelis-Menten formula for the average rate of enzymatic reactions. This generalized Michaelis-Menten-like formula (and the exact analytical expressions for a few other quantities) that we report here display the interplay of four different branched pathways corresponding to selection of four different types of tRNA.
Comparative RNA-Sequence Transcriptome Analysis of Phenolic Acid Metabolism in Salvia miltiorrhiza, a Traditional Chinese Medicine Model Plant

PubMed Central

Song, Zhenqiao; Guo, Linlin; Liu, Tian; Lin, Caicai; Wang, Jianhua

2017-01-01

Salvia miltiorrhiza Bunge is an important traditional Chinese medicine (TCM). In this study, two S. miltiorrhiza genotypes (BH18 and ZH23) with different phenolic acid concentrations were used for de novo RNA sequencing (RNA-seq). A total of 170,787 transcripts and 56,216 unigenes were obtained. There were 670 differentially expressed genes (DEGs) identified between BH18 and ZH23, 250 of which were upregulated in ZH23, with genes involved in the phenylpropanoid biosynthesis pathway being the most upregulated genes. Nine genes involved in the lignin biosynthesis pathway were upregulated in BH18 and thus result in higher lignin content in BH18. However, expression profiles of most genes involved in the core common upstream phenylpropanoid biosynthesis pathway were higher in ZH23 than that in BH18. These results indicated that genes involved in the core common upstream phenylpropanoid biosynthesis pathway might play an important role in downstream secondary metabolism and demonstrated that lignin biosynthesis was a putative partially competing pathway with phenolic acid biosynthesis. The results of this study expanded our understanding of the regulation of phenolic acid biosynthesis in S. miltiorrhiza. PMID:28194403
A novel cysteine-rich antifungal peptide ToAMP4 from Taraxacum officinale Wigg. flowers.

PubMed

Astafieva, A A; Rogozhin, Eugene A; Andreev, Yaroslav A; Odintsova, T I; Kozlov, S A; Grishin, Eugene V; Egorov, Tsezi A

2013-09-01

A novel peptide named ToAMP4 was isolated from Taraxacum officinale Wigg. flowers by a combination of acetic acid extraction and different types of chromatography: affinity, size-exclusion, and RP-HPLC. The amino acid sequence of ToAMP4 was determined by automated Edman degradation. The peptide is basic, consists of 41 amino acids, and incorporates three disulphide bonds. Due to the unusual cysteine spacing pattern, ToAMP4 does not belong to any known plant AMP family, but classifies together with two other antimicrobial peptides ToAMP1 and ToAMP2 previously isolated from the dandelion flowers. To study the biological activity of ToAMP4, it was successfully produced in a prokaryotic expression system as a fusion protein with thioredoxin. The recombinant peptide was shown to be identical to the native ToAMP4 by chromatographic behavior, molecular mass, and N-terminal amino acid sequence. The peptide displays broad-spectrum antifungal activity against important phytopathogens. Two ToAMP4-mediated inhibition strategies depending on the fungus were demonstrated. The results obtained add to our knowledge on the structural and functional diversity of AMPs in plants. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...
37 CFR 5.31-5.33 - [Reserved

Code of Federal Regulations, 2011 CFR

2011-07-01

... from abandonment 1.135 Amino Acid Sequences. (See Nucleotide and/or Amino Acid Sequences) Appeal to... Appeals and Interference 41.47 Of rejection of an application 1.104(a) Nucleotide and/or Amino Acid...) Symbols for nucleotide and/or amino acid sequence data 1.822 T Tables in patent applications 1.58 Terminal...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

Code of Federal Regulations, 2011 CFR

2011-07-01

... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...
Characterization of the amino acid contribution to the folding degree of proteins.

PubMed

Estrada, Ernesto

2004-03-01

The folding degree index (Estrada, Bioinformatics 2002;18:697-704) is extended to account for the contribution of amino acids to folding. First, the mathematical formalism for extending the folding degree index is presented. Then, the amino acid contributions to folding degree of several proteins are used to analyze its relation to secondary structure. The possibilities of using these contributions in helping or checking the assignation of secondary structure to amino acids are also introduced. The influence of external factors to the amino acids contribution to folding degree is studied through the temperature effect on ribonuclease A. Finally, the analysis of 3D protein similarity through the use of amino acid contributions to folding degree is studied by selecting a series of lysozymes. These results are compared to that obtained by sequence alignment (2D similarity) and 3D superposition of the structures, showing the uniqueness of the current approach. Copyright 2004 Wiley-Liss, Inc.
A novel chaotic image encryption scheme using DNA sequence operations

NASA Astrophysics Data System (ADS)

Wang, Xing-Yuan; Zhang, Ying-Qian; Bao, Xue-Mei

2015-10-01

In this paper, we propose a novel image encryption scheme based on DNA (Deoxyribonucleic acid) sequence operations and chaotic system. Firstly, we perform bitwise exclusive OR operation on the pixels of the plain image using the pseudorandom sequences produced by the spatiotemporal chaos system, i.e., CML (coupled map lattice). Secondly, a DNA matrix is obtained by encoding the confused image using a kind of DNA encoding rule. Then we generate the new initial conditions of the CML according to this DNA matrix and the previous initial conditions, which can make the encryption result closely depend on every pixel of the plain image. Thirdly, the rows and columns of the DNA matrix are permuted. Then, the permuted DNA matrix is confused once again. At last, after decoding the confused DNA matrix using a kind of DNA decoding rule, we obtain the ciphered image. Experimental results and theoretical analysis show that the scheme is able to resist various attacks, so it has extraordinarily high security.
Flexibility of nucleic acids: From DNA to RNA

NASA Astrophysics Data System (ADS)

Lei, Bao; Xi, Zhang; Lei, Jin; Zhi-Jie, Tan

2016-01-01

The structural flexibility of nucleic acids plays a key role in many fundamental life processes, such as gene replication and expression, DNA-protein recognition, and gene regulation. To obtain a thorough understanding of nucleic acid flexibility, extensive studies have been performed using various experimental methods and theoretical models. In this review, we will introduce the progress that has been made in understanding the flexibility of nucleic acids including DNAs and RNAs, and will emphasize the experimental findings and the effects of salt, temperature, and sequence. Finally, we will discuss the major unanswered questions in understanding the flexibility of nucleic acids. Project supported by the National Basic Research Program of China (Grant No. 2011CB933600), the National Natural Science Foundation of China (Grant Nos. 11175132, 11575128, and 11374234), and the Program for New Century Excellent Talents, China (Grant No. NCET 08-0408).
Electron Transfer Dissociation with Supplemental Activation to Differentiate Aspartic and Isoaspartic Residues in Doubly Charged Peptide Cations

PubMed Central

Chan, Wai Yi Kelly; Chan, T. W. Dominic; O’Connor, Peter B.

2011-01-01

Electron-transfer dissociation (ETD) with supplemental activation of the doubly charged deamidated tryptic digested peptide ions allows differentiation of isoaspartic acid and aspartic acid residues using c + 57 or z• − 57 peaks. The diagnostic peak clearly localizes and characterizes the isoaspartic acid residue. Supplemental activation in ETD of the doubly charged peptide ions involves resonant excitation of the charge reduced precursor radical cations and leads to further dissociation, including extra backbone cleavages and secondary fragmentation. Supplemental activation is essential to obtain a high quality ETD spectrum (especially for doubly charged peptide ions) with sequence information. Unfortunately, the low-resolution of the ion trap mass spectrometer makes detection of the diagnostic peak for the aspartic acid residue difficult due to interference with side-chain loss from arginine and glutamic acid residues. PMID:20304674
A Snapshot of a Coral “Holobiont”: A Transcriptome Assembly of the Scleractinian Coral, Porites, Captures a Wide Variety of Genes from Both the Host and Symbiotic Zooxanthellae

PubMed Central

Shinzato, Chuya; Inoue, Mayuri; Kusakabe, Makoto

2014-01-01

Massive scleractinian corals of the genus Porites are important reef builders in the Indo-Pacific, and they are more resistant to thermal stress than other stony corals, such as the genus Acropora. Because coral health and survival largely depend on the interaction between a coral host and its symbionts, it is important to understand the molecular interactions of an entire “coral holobiont”. We simultaneously sequenced transcriptomes of Porites australiensis and its symbionts using the Illumina Hiseq2000 platform. We obtained 14.3 Gbp of sequencing data and assembled it into 74,997 contigs (average: 1,263 bp, N50 size: 2,037 bp). We successfully distinguished contigs originating from the host (Porites) and the symbiont (Symbiodinium) by aligning nucleotide sequences with the decoded Acropora digitifera and Symbiodinium minutum genomes. In contrast to previous coral transcriptome studies, at least 35% of the sequences were found to have originated from the symbionts, indicating that it is possible to analyze both host and symbiont transcriptomes simultaneously. Conserved protein domain and KEGG analyses showed that the dataset contains broad gene repertoires of both Porites and Symbiodinium. Effective utilization of sequence reads revealed that the polymorphism rate in P. australiensis is 1.0% and identified the major symbiotic Symbiodinium as Type C15. Analyses of amino acid biosynthetic pathways suggested that this Porites holobiont is probably able to synthesize most of the common amino acids and that Symbiodinium is potentially able to provide essential amino acids to its host. We believe this to be the first molecular evidence of complementarity in amino acid metabolism between coral hosts and their symbionts. We successfully assembled genes originating from both the host coral and the symbiotic Symbiodinium to create a snapshot of the coral holobiont transcriptome. This dataset will facilitate a deeper understanding of molecular mechanisms of coral symbioses and stress responses. PMID:24454815
A snapshot of a coral "holobiont": a transcriptome assembly of the scleractinian coral, porites, captures a wide variety of genes from both the host and symbiotic zooxanthellae.

PubMed

Shinzato, Chuya; Inoue, Mayuri; Kusakabe, Makoto

2014-01-01

Massive scleractinian corals of the genus Porites are important reef builders in the Indo-Pacific, and they are more resistant to thermal stress than other stony corals, such as the genus Acropora. Because coral health and survival largely depend on the interaction between a coral host and its symbionts, it is important to understand the molecular interactions of an entire "coral holobiont". We simultaneously sequenced transcriptomes of Porites australiensis and its symbionts using the Illumina Hiseq2000 platform. We obtained 14.3 Gbp of sequencing data and assembled it into 74,997 contigs (average: 1,263 bp, N50 size: 2,037 bp). We successfully distinguished contigs originating from the host (Porites) and the symbiont (Symbiodinium) by aligning nucleotide sequences with the decoded Acropora digitifera and Symbiodinium minutum genomes. In contrast to previous coral transcriptome studies, at least 35% of the sequences were found to have originated from the symbionts, indicating that it is possible to analyze both host and symbiont transcriptomes simultaneously. Conserved protein domain and KEGG analyses showed that the dataset contains broad gene repertoires of both Porites and Symbiodinium. Effective utilization of sequence reads revealed that the polymorphism rate in P. australiensis is 1.0% and identified the major symbiotic Symbiodinium as Type C15. Analyses of amino acid biosynthetic pathways suggested that this Porites holobiont is probably able to synthesize most of the common amino acids and that Symbiodinium is potentially able to provide essential amino acids to its host. We believe this to be the first molecular evidence of complementarity in amino acid metabolism between coral hosts and their symbionts. We successfully assembled genes originating from both the host coral and the symbiotic Symbiodinium to create a snapshot of the coral holobiont transcriptome. This dataset will facilitate a deeper understanding of molecular mechanisms of coral symbioses and stress responses.
Gene encoding a novel extracellular metalloprotease in Bacillus subtilis.

PubMed Central

Sloma, A; Rudolph, C F; Rufo, G A; Sullivan, B J; Theriault, K A; Ally, D; Pero, J

1990-01-01

The gene for a novel extracellular metalloprotease was cloned, and its nucleotide sequence was determined. The gene (mpr) encodes a primary product of 313 amino acids that has little similarity to other known Bacillus proteases. The amino acid sequence of the mature protease was preceded by a signal sequence of approximately 34 amino acids and a pro sequence of 58 amino acids. Four cysteine residues were found in the deduced amino acid sequence of the mature protein, indicating the possible presence of disulfide bonds. The mpr gene mapped in the cysA-aroI region of the chromosome and was not required for growth or sporulation. Images FIG. 2 FIG. 7 PMID:2105291
Automated side-chain model building and sequence assignment by template matching.

PubMed

Terwilliger, Thomas C

2003-01-01

An algorithm is described for automated building of side chains in an electron-density map once a main-chain model is built and for alignment of the protein sequence to the map. The procedure is based on a comparison of electron density at the expected side-chain positions with electron-density templates. The templates are constructed from average amino-acid side-chain densities in 574 refined protein structures. For each contiguous segment of main chain, a matrix with entries corresponding to an estimate of the probability that each of the 20 amino acids is located at each position of the main-chain model is obtained. The probability that this segment corresponds to each possible alignment with the sequence of the protein is estimated using a Bayesian approach and high-confidence matches are kept. Once side-chain identities are determined, the most probable rotamer for each side chain is built into the model. The automated procedure has been implemented in the RESOLVE software. Combined with automated main-chain model building, the procedure produces a preliminary model suitable for refinement and extension by an experienced crystallographer.
Culturable Facultative Methylotrophic Bacteria from the Cactus Neobuxbaumia macrocephala Possess the Locus xoxF and Consume Methanol in the Presence of Ce3+ and Ca2+

PubMed Central

del Rocío Bustillos-Cristales, María; Corona-Gutierrez, Ivan; Castañeda-Lucio, Miguel; Águila-Zempoaltécatl, Carolina; Seynos-García, Eduardo; Hernández-Lucas, Ismael; Muñoz-Rojas, Jesús; Medina-Aparicio, Liliana; Fuentes-Ramírez, Luis Ernesto

2017-01-01

Methanol-consuming culturable bacteria were isolated from the plant surface, rhizosphere, and inside the stem of Neobuxbaumia macrocephala. All 38 isolates were facultative methylotrophic microorganisms. Their classification included the Classes Actinobacteria, Sphingobacteriia, Alpha-, Beta-, and Gammaproteobacteria. The deduced amino acid sequences of methanol dehydrogenase obtained by PCR belonging to Actinobacteria, Alpha-, Beta-, and Gammaproteobacteria showed high similarity to rare-earth element (REE)-dependent XoxF methanol dehydrogenases, particularly the group XoxF5. The sequences included Asp301, the REE-coordinating amino acid, present in all known XoxF dehydrogenases and absent in MxaF methanol dehydrogenases. The quantity of the isolates showed positive hybridization with a xoxF probe, but not with a mxaF probe. Isolates of all taxonomic groups showed methylotrophic growth in the presence of Ce3+ or Ca2+. The presence of xoxF-like sequences in methylotrophic bacteria from N. macrocephala and its potential relationship with their adaptability to xerophytic plants are discussed. PMID:28855445
Characterization and Screening of Native Scenedesmus sp. Isolates Suitable for Biofuel Feedstock.

PubMed

Gour, Rakesh Singh; Chawla, Aseem; Singh, Harvinder; Chauhan, Rajinder Singh; Kant, Anil

2016-01-01

In current study isolates of two native microalgae species were screened on the basis of growth kinetics and lipid accumulation potential. On the basis of data obtained on growth parameters and lipid accumulation, it is concluded that Scenedesmus dimorphus has better potential as biofuel feedstock. Two of the isolates of Scenedesmus dimorphus performed better than other isolates with respect to important growth parameters with lipid content of ~30% of dry biomass. Scenedesmus dimorphus was found to be more suitable as biodiesel feedstock candidate on the basis of cumulative occurrence of five important biodiesel fatty acids, relative occurrence of SFA (53.04%), MUFA (23.81%) and PUFA (19.69%), and more importantly that of oleic acid in its total lipids. The morphological observations using light and Scanning Electron Microscope and molecular characterization using amplified 18S rRNA gene sequences of microalgae species under study were also performed. Amplified 18S rRNA gene fragments of the microalgae species were sequenced, annotated at the NCBI website and phylogenetic analysis was done. We have published eight 18S rRNA gene sequences of microalgae species in NCBI GenBank.
Development of a Rapid Identification Method for a Variety of Antibody Candidates Using High-throughput Sequencing.

PubMed

Ito, Yuji

2017-01-01

As an alternative to hybridoma technology, the antibody phage library system can also be used for antibody selection. This method enables the isolation of antigen-specific binders through an in vitro selection process known as biopanning. While it has several advantages, such as an avoidance of animal immunization, the phage cloning and screening steps of biopanning are time-consuming and problematic. Here, we introduce a novel biopanning method combined with high-throughput sequencing (HTS) using a next-generation sequencer (NGS) to save time and effort in antibody selection, and to increase the diversity of acquired antibody sequences. Biopannings against a target antigen were performed using a human single chain Fv (scFv) antibody phage library. VH genes in pooled phages at each round of biopanning were analyzed by HTS on a NGS. The obtained data were trimmed, merged, and translated into amino acid sequences. The frequencies (%) of the respective VH sequences at each biopanning step were calculated, and the amplification factor (change of frequency through biopanning) was obtained to estimate the potential for antigen binding. A phylogenetic tree was drawn using the top 50 VH sequences with high amplification factors. Representative VH sequences forming the cluster were then picked up and used to reconstruct scFv genes harboring these VHs. Their derived scFv-Fc fusion proteins showed clear antigen binding activity. These results indicate that a combination of biopanning and HTS enables the rapid and comprehensive identification of specific binders from antibody phage libraries.
Thermophilic cellobiohydrolase

DOEpatents

Sapra, Rajat; Park, Joshua I.; Datta, Supratim; Simmons, Blake A.

2017-04-18

The present invention provides for a composition comprising a polypeptide comprising a first amino acid sequence having at least 70% identity with the amino acid sequence of Csac GH5 wherein said first amino acid sequence has a thermostable or thermophilic cellobiohydrolase (CBH) or exoglucanase activity.
Evolution of biological sequences implies an extreme value distribution of type I for both global and local pairwise alignment scores.

PubMed

Bastien, Olivier; Maréchal, Eric

2008-08-07

Confidence in pairwise alignments of biological sequences, obtained by various methods such as Blast or Smith-Waterman, is critical for automatic analyses of genomic data. Two statistical models have been proposed. In the asymptotic limit of long sequences, the Karlin-Altschul model is based on the computation of a P-value, assuming that the number of high scoring matching regions above a threshold is Poisson distributed. Alternatively, the Lipman-Pearson model is based on the computation of a Z-value from a random score distribution obtained by a Monte-Carlo simulation. Z-values allow the deduction of an upper bound of the P-value (1/Z-value2) following the TULIP theorem. Simulations of Z-value distribution is known to fit with a Gumbel law. This remarkable property was not demonstrated and had no obvious biological support. We built a model of evolution of sequences based on aging, as meant in Reliability Theory, using the fact that the amount of information shared between an initial sequence and the sequences in its lineage (i.e., mutual information in Information Theory) is a decreasing function of time. This quantity is simply measured by a sequence alignment score. In systems aging, the failure rate is related to the systems longevity. The system can be a machine with structured components, or a living entity or population. "Reliability" refers to the ability to operate properly according to a standard. Here, the "reliability" of a sequence refers to the ability to conserve a sufficient functional level at the folded and maturated protein level (positive selection pressure). Homologous sequences were considered as systems 1) having a high redundancy of information reflected by the magnitude of their alignment scores, 2) which components are the amino acids that can independently be damaged by random DNA mutations. From these assumptions, we deduced that information shared at each amino acid position evolved with a constant rate, corresponding to the information hazard rate, and that pairwise sequence alignment scores should follow a Gumbel distribution, which parameters could find some theoretical rationale. In particular, one parameter corresponds to the information hazard rate. Extreme value distribution of alignment scores, assessed from high scoring segments pairs following the Karlin-Altschul model, can also be deduced from the Reliability Theory applied to molecular sequences. It reflects the redundancy of information between homologous sequences, under functional conservative pressure. This model also provides a link between concepts of biological sequence analysis and of systems biology.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, M.S.

1998-08-18

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device. 27 figs.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.; Wang, Chunwei; Jevons, Luis C.; Bernhart, Derek H.; Lipshutz, Robert J.

2004-05-11

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

1998-08-18

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

2003-08-19

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.

Cell culture compositions

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yiao, Jian

2014-03-18

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6 (SEQ ID NO:1 encodes the full length endoglucanase; SEQ ID NO:4 encodes the mature form), and the corresponding endoglucanase VI amino acid sequence ("EGVI"; SEQ ID NO:3 is the signal sequence; SEQ ID NO:2 is the mature sequence). The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
Purification and partial characterization of bacteriocin produced by Lactococcus lactis ssp. lactis LL171.

PubMed

Kumari, Archana; Akkoç, Nefise; Akçelik, Mustafa

2012-04-01

Lactic acid bacteria (LAB) are possessing ability to synthesize antimicrobial compounds (like bacteriocin) during their growth. In this regard, novel bacteriocin compound secreting capability of LAB isolated from Tulum Cheese in Turkey was demonstrated. The synthesized bacteriocin was purified by ammonium sulphate precipitation, dialysis and gel filtration. The molecular weight (≈3.4 kDa) of obtained bacteriocin was confirmed by SDS-PAGE, which revealed single peptide band. Molecular identification of LAB strain isolated from Tulum Cheese was conducted using 16S rDNA gene sequencing as Lactococcus lactis ssp. lactis LL171. The amino acid sequences (KKIDTRTGKTMEKTEKKIELSLKNMKTAT) of the bacteriocin from Lactococcus lactis ssp. lactis LL171 was found unique and novel than reported bacteriocins. Further, the bacteriocin was possessed the thermostable property and active at wide range of pH values from 1 to 11. Thus, bacteriocin reported in this study has the potential applications property as food preservative agent.
A recombinant isoform of the Ole e 7 olive pollen allergen assembled by de novo mass spectrometry retains the allergenic ability of the natural allergen.

PubMed

Oeo-Santos, Carmen; Mas, Salvador; Benedé, Sara; López-Lucendo, María; Quiralte, Joaquín; Blanca, Miguel; Mayorga, Cristobalina; Villalba, Mayte; Barderas, Rodrigo

2018-06-05

The allergenic non-specific lipid transfer protein Ole e 7 from olive pollen is a major allergen associated with severe symptoms in areas with high olive pollen levels. Despite its clinical importance, its cloning and recombinant production has been unable by classical approaches. This study aimed at determining by mass-spectrometry based proteomics its complete amino acid sequence for its subsequent expression and characterization. To this end, the natural protein was in-2D-gel tryptic digested, and CID and HCD fragmentation spectra obtained by nLC-MS/MS analyzed using PEAKS software. Thirteen out of the 457 de novo sequenced peptides obtained allowed assembling its full-length amino acid sequence. Then, Ole e 7-encoding cDNA was synthesized and cloned in pPICZαA vector for its expression in Pichia pastoris yeast. The analyses by Circular Dichroism, and WB, ELISA and cell-based tests using sera and blood from olive pollen-sensitized patients showed that rOle e 7 mostly retained the structural, allergenic and antigenic properties of the natural allergen. In summary, rOle e 7 allergen assembled by de novo peptide sequencing by MS behaved immunologically similar to the natural allergen scarcely isolated from pollen. Olive pollen is an important cause of allergy. The non-specific lipid binding protein Ole e 7 is a major allergen with a high incidence and a phenotype associated to severe clinical symptoms. Despite its relevance, its cloning and recombinant expression has been unable by classical techniques. Here, we have inferred the primary amino acid sequence of Ole e 7 by mass-spectrometry. We separated Ole e 7 isolated from pollen by 2DE. After in-gel digestion with trypsin and a direct analysis by nLC-MS/MS in an LTQ-Orbitrap Velos, we got the complete de novo sequenced peptides repertoire that allowed the assembling of the primary sequence of Ole e 7. After its protein expression, purification to homogeneity, and structural and immunological characterization using sera from olive pollen allergic patients and cell-based assays, we observed that the recombinant allergen retained the antigenic and allergenic properties of the natural allergen. Collectively, we show that the recombinant protein assembled by proteomics would be suitable for a better in vitro diagnosis of olive pollen allergic patients. Copyright © 2018. Published by Elsevier B.V.
Labeled nucleotide phosphate (NP) probes

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2009-02-03

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Development of a peptide nucleic acid polymerase chain reaction clamping assay for semiquantitative evaluation of genetically modified organism content in food.

PubMed

Peano, C; Lesignoli, F; Gulli, M; Corradini, R; Samson, M C; Marchelli, R; Marmiroli, N

2005-09-15

In the present study a peptide nucleic acid (PNA)-mediated polymerase chain reaction (PCR) clamping method was developed and applied to the detection of genetically modified organisms (GMO), to test PCR products for band identity and to obtain a semiquantitative evaluation of GMO content. The minimal concentration of PNA necessary to block the PCR was determined by comparing PCRs containing a constant amount of DNA in the presence of increasing concentration of target-specific PNA. The lowest PNA concentration at which specific inhibition took place, by the inhibition of primer extension and/or steric hindrance, was the most efficient condition. Optimization of PCR clamping by PNA was observed by testing five different PNAs with a minimum of 13 bp to a maximum of 15 bp, designed on the target sequence of Roundup Ready soybean. The results obtained on the DNA extracted from Roundup Ready soybean standard flour were verified also on DNA extracted from standard flours of maize GA21, Bt176, Bt11, and MON810. A correlation between the PNA concentration necessary for inducing PCR clamping and the percentage of the GMO target sequence in the sample was found.
Development of designed site-directed pseudopeptide-peptido-mimetic immunogens as novel minimal subunit-vaccine candidates for malaria.

PubMed

Lozano, José Manuel; Lesmes, Liliana P; Carreño, Luisa F; Gallego, Gina M; Patarroyo, Manuel Elkin

2010-12-06

Synthetic vaccines constitute the most promising tools for controlling and preventing infectious diseases. When synthetic immunogens are designed from the pathogen native sequences, these are normally poorly immunogenic and do not induce protection, as demonstrated in our research. After attempting many synthetic strategies for improving the immunogenicity properties of these sequences, the approach consisting of identifying high binding motifs present in those, and then performing specific changes on amino-acids belonging to such motifs, has proven to be a workable strategy. In addition, other strategies consisting of chemically introducing non-natural constraints to the backbone topology of the molecule and modifying the α-carbon asymmetry are becoming valuable tools to be considered in this pursuit. Non-natural structural constraints to the peptide backbone can be achieved by introducing peptide bond isosters such as reduced amides, partially retro or retro-inverso modifications or even including urea motifs. The second can be obtained by strategically replacing L-amino-acids with their enantiomeric forms for obtaining both structurally site-directed designed immunogens as potential vaccine candidates and their Ig structural molecular images, both having immuno-therapeutic effects for preventing and controlling malaria.
Cloning, characterization, expression and comparative analysis of pig Golgi membrane sphingomyelin synthase 1.

PubMed

Guillén, Natalia; Navarro, María A; Surra, Joaquín C; Arnal, Carmen; Fernández-Juan, Marta; Cebrián-Pérez, Jose Alvaro; Osada, Jesús

2007-02-15

Pig sphingomyelin synthase 1 (SMS1) cDNA was cloned, characterized and compared to the human ortholog. Porcine protein consists of 413 amino acids and displays a 97% sequence identity with human protein. A phylogenic tree of proteins reveals that porcine SMS1 is more closely related to bovine and rodent proteins than to human. Analysis of protein mass was higher than the theoretical prediction based on amino acid sequence suggesting a kind of posttranslational modification. Quantitative representation of tissue distribution obtained by real-time RT-PCR showed that it was widely expressed although important variations in levels were obtained among organs. Thus, the cardiovascular system, especially the heart, showed the highest value of all the tissues studied. Regional differences of expression were observed in the central nervous system and intestinal tract. Analysis of the hepatic mRNA and protein expressions of SMS1 following turpentine treatment revealed a progressive decrease in the former paralleled by a decrease in the protein concentration. These findings indicate the variation in expression in the different tissues might suggest a different requirement of Golgi sphingomyelin for the specific function in each organ and a regulation of the enzyme in response to turpentine-induced hepatic injury.
Heavy metal-binding proteins from metal-stimulated bacteria as a novel adsorbent for metal removal technology.

PubMed

Sano, D; Myojo, K; Omura, T

2006-01-01

Water pollution with toxic heavy metals is of growing concern because heavy metals could bring about serious problems for not only ecosystems in the water environment but also human health. Some metal removal technologies have been in practical use, but much energy and troublesome treatments for chemical wastes are required to operate these conventional technologies. In this study, heavy metal-binding proteins (HMBPs) were obtained from metal-stimulated activated sludge culture with affinity chromatography using copper ion as a ligand. Two-dimensional electrophoresis revealed that a number of proteins in activated sludge culture were recovered as HMBPs for copper ion. N-termini of five HMBPs were determined, and two of them were found to be newly discovered proteins for which no amino acid sequences in protein databases were retrieved at more than 80% identities. Metal-coordinating amino acids occupied 38% of residues in one of the N-terminal sequences of the newly discovered HMBPs. Since these HMBPs were expected to be stable under conditions of water and wastewater treatments, it would be possible to utilize HMBPs as novel adsorbents for heavy metal removal if mass volume of HMBPs can be obtained with protein cloning techniques.
Biosynthesis of Lipoic Acid in Arabidopsis: Cloning and Characterization of the cDNA for Lipoic Acid Synthase1

PubMed Central

Yasuno, Rie; Wada, Hajime

1998-01-01

Lipoic acid is a coenzyme that is essential for the activity of enzyme complexes such as those of pyruvate dehydrogenase and glycine decarboxylase. We report here the isolation and characterization of LIP1 cDNA for lipoic acid synthase of Arabidopsis. The Arabidopsis LIP1 cDNA was isolated using an expressed sequence tag homologous to the lipoic acid synthase of Escherichia coli. This cDNA was shown to code for Arabidopsis lipoic acid synthase by its ability to complement a lipA mutant of E. coli defective in lipoic acid synthase. DNA-sequence analysis of the LIP1 cDNA revealed an open reading frame predicting a protein of 374 amino acids. Comparisons of the deduced amino acid sequence with those of E. coli and yeast lipoic acid synthase homologs showed a high degree of sequence similarity and the presence of a leader sequence presumably required for import into the mitochondria. Southern-hybridization analysis suggested that LIP1 is a single-copy gene in Arabidopsis. Western analysis with an antibody against lipoic acid synthase demonstrated that this enzyme is located in the mitochondrial compartment in Arabidopsis cells as a 43-kD polypeptide. PMID:9808738
SeSaM-Tv-II generates a protein sequence space that is unobtainable by epPCR.

PubMed

Mundhada, Hemanshu; Marienhagen, Jan; Scacioc, Andreea; Schenk, Alexander; Roccatano, Danilo; Schwaneberg, Ulrich

2011-07-04

Generating high-quality mutant libraries in which each amino acid is equally targeted and substituted in a chemically diverse manner is crucial to obtain improved variants in small mutant libraries. The sequence saturation mutagenesis method (SeSaM-Tv(+) ) offers the opportunity to generate such high-quality mutant libraries by introducing consecutive mutations and by enriching transversions. In this study, automated gel electrophoresis, real-time quantitative PCR, and a phosphorimager quantification system were developed and employed to optimize each step of previously reported SeSaM-Tv(+) method. Advancements of the SeSaM-Tv(+) protocol and the use of a novel DNA polymerase quadrupled the number of transversions, by doubling the fraction of consecutive mutations (from 16.7 to 37.1 %). About 33 % of all amino acid substitutions observed in a model library are rarely introduced by epPCR methods, and around 10 % of all clones carried amino acid substitutions that are unobtainable by epPCR. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Identification of a new genotype H wild-type mumps virus strain and its molecular relatedness to other virulent and attenuated strains.

PubMed

Amexis, Georgios; Rubin, Steven; Chatterjee, Nando; Carbone, Kathryn; Chumakov, Kostantin

2003-06-01

A single clinical isolate of mumps virus designated 88-1961 was obtained from a patient hospitalized with a clinical history of upper respiratory tract infection, parotitis, severe headache, fever and lymphadenopathy. We have sequenced the full-length genome of 88-1961 and compared it against all available full-length sequences of mumps virus. Based upon its nucleotide sequence of the SH gene 88-1961 was identified as a genotype H mumps strain. The overall extent of nucleotide and amino acid differences between each individual gene and protein of 88-1961 and the full-length mumps samples showed that the missense to silent ratios were unevenly distributed. Upon evaluation of the consensus sequence of 88-1961, four positions were found to be clearly heterogeneous at the nucleotide level (NP 315C/T, NP 318C/T, F 271A/C, and HN 855C/T). Sequence analysis revealed that the amino acid sequences for the NP, M, and the L protein were the most conserved, whereas the SH protein exhibited the highest variability among the compared mumps genotypes A, B, and G. No identifying molecular patterns in the non-coding (intergenic) or coding regions of 88-1961 were found when we compared it against relatively virulent (Urabe AM9 B, Glouc1/UK96, 87-1004 and 87-1005) and non-virulent mumps strains (Jeryl Lynn and all Urabe Am9 A substrains). Copyright 2003 Wiley-Liss, Inc.
Trichoderma .beta.-glucosidase

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-01-03

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

1999-10-26

A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

2001-06-05

A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Carbohydrate degrading polypeptide and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Microsporidia, amitochondrial protists, possess a 70-kDa heat shock protein gene of mitochondrial evolutionary origin.

PubMed

Peyretaillade, E; Broussolle, V; Peyret, P; Méténier, G; Gouy, M; Vivarès, C P

1998-06-01

An intronless gene encoding a protein of 592 amino acid residues with similarity to 70-kDa heat shock proteins (HSP70s) has been cloned and sequenced from the amitochondrial protist Encephalitozoon cuniculi (phylum Microsporidia). Southern blot analyses show the presence of a single gene copy located on chromosome XI. The encoded protein exhibits an N-terminal hydrophobic leader sequence and two motifs shared by proteobacterial and mitochondrially expressed HSP70 homologs. Phylogenetic analysis using maximum likelihood and evolutionary distances place the E. cuniculi sequence in the cluster of mitochondrially expressed HSP70s, with a higher evolutionary rate than those of homologous sequences. Similar results were obtained after cloning a fragment of the homologous gene in the closely related species E. hellem. The presence of a nuclear targeting signal-like sequence supports a role of the Encephalitozoon HSP70 as a molecular chaperone of nuclear proteins. No evidence for cytosolic or endoplasmic reticulum forms of HSP70 was obtained through PCR amplification. These data suggest that Encephalitozoon species have evolved from an ancestor bearing mitochondria, which is in disagreement with the postulated presymbiotic origin of Microsporidia. The specific role and intracellular localization of the mitochondrial HSP70-like protein remain to be elucidated.
Automated Sanger Analysis Pipeline (ASAP): A Tool for Rapidly Analyzing Sanger Sequencing Data with Minimum User Interference.

PubMed

Singh, Aditya; Bhatia, Prateek

2016-12-01

Sanger sequencing platforms, such as applied biosystems instruments, generate chromatogram files. Generally, for 1 region of a sequence, we use both forward and reverse primers to sequence that area, in that way, we have 2 sequences that need to be aligned and a consensus generated before mutation detection studies. This work is cumbersome and takes time, especially if the gene is large with many exons. Hence, we devised a rapid automated command system to filter, build, and align consensus sequences and also optionally extract exonic regions, translate them in all frames, and perform an amino acid alignment starting from raw sequence data within a very short time. In full capabilities of Automated Mutation Analysis Pipeline (ASAP), it is able to read "*.ab1" chromatogram files through command line interface, convert it to the FASTQ format, trim the low-quality regions, reverse-complement the reverse sequence, create a consensus sequence, extract the exonic regions using a reference exonic sequence, translate the sequence in all frames, and align the nucleic acid and amino acid sequences to reference nucleic acid and amino acid sequences, respectively. All files are created and can be used for further analysis. ASAP is available as Python 3.x executable at https://github.com/aditya-88/ASAP. The version described in this paper is 0.28.
Nucleic acid analysis using terminal-phosphate-labeled nucleotides

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2008-04-22

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
The partial sequence of RNA 1 of the ophiovirus Ranunculus white mottle virus indicates its relationship to rhabdoviruses and provides candidate primers for an ophiovirus-specific RT-PCR test.

PubMed

Vaira, A M; Accotto, G P; Costantini, A; Milne, R G

2003-06-01

A 4018 nucleotide sequence was obtained for RNA 1 of Ranunculus white mottle virus (RWMV), genus Ophiovirus, representing an incomplete ORF of 1339 aa. Amino acid sequence analysis revealed significant similarities with RNA polymerases of viruses in the family Rhabdoviridae and a conserved domain of 685 aa, corresponding to the RdRp domain of those in the order Mononegavirales. Phylogenetic analysis indicated that the genus Ophiovirus is not related to the genus Tenuivirus or the family Bunyaviridae, with which it has been linked, and probably deserves a special taxonomic position, within a new family. A pair of degenerate primers was designed from a consensus sequence obtained from a relatively conserved region in the RNA 1 of two members of the genus, Citrus psorosis virus (CPsV) and RWMV. The primers, used in RT-PCR experiments, amplified a 136 bp DNA fragment from all the three recognized members of the genus, i.e. CPsV, RWMV and Tulip mild mottle mosaic virus (TMMMV) and from two tentative ophioviruses from lettuce and freesia. The amplified DNAs were sequenced and compared with the corresponding sequences of CPsV and RWMV and phylogenetic relationships were evaluated. Assays using extracts from plants infected by viruses belonging to the genera Tospovirus, Tenuivirus, Rhabdovirus and Varicosavirus indicated that the primers are genus-specific.
Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

DOEpatents

Studier, F. William

1995-04-18

Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.

Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

DOEpatents

Studier, F.W.

1995-04-18

Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.
Variability of the protein sequences of lcrV between epidemic and atypical rhamnose-positive strains of Yersinia pestis.

PubMed

Anisimov, Andrey P; Panfertsev, Evgeniy A; Svetoch, Tat'yana E; Dentovskaya, Svetlana V

2007-01-01

Sequencing of lcrV genes and comparison of the deduced amino acid sequences from ten Y. pestis strains belonging mostly to the group of atypical rhamnose-positive isolates (non-pestis subspecies or pestoides group) showed that the LcrV proteins analyzed could be classified into five sequence types. This classification was based on major amino acid polymorphisms among LcrV proteins in the four "hot points" of the protein sequences. Some additional minor polymorphisms were found throughout these sequence types. The "hot points" corresponded to amino acids 18 (Lys --> Asn), 72 (Lys --> Arg), 273 (Cys --> Ser), and 324-326 (Ser-Gly-Lys --> Arg) in the LcrV sequence of the reference Y. pestis strain CO92. One possible explanation for polymorphism in amino acid sequences of LcrV among different strains is that strain-specific variation resulted from adaptation of the plague pathogen to different rodent and lagomorph hosts.
Purification, identification and preliminary crystallographic studies of an allergenic protein from Lathyrus sativus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Qureshi, Insaf A.; Sethi, Dhruv K.; Salunke, Dinakar M., E-mail: dinakar@nii.res.in

2006-09-01

A 24 kDa protein was purified from the seeds of L. sativus by ammonium sulfate fractionation and ion-exchange chromatography. Crystals were obtained by the hanging-drop vapour-diffusion method. A 24 kDa protein was purified from the seeds of Lathyrus sativus by ammonium sulfate fractionation and ion-exchange chromatography. The N-terminal amino-acid sequence showed significant homology with the 2S albumin class of seed storage proteins. The protein showed 85% sequence homology with the seed albumin of Pisum sativum within the 40 N-terminal residues. Crystals were obtained by the hanging-drop vapour-diffusion method. The crystals belonged to space group P2{sub 1}2{sub 1}2{sub 1}, with unit-cellmore » parameters a = 43.5, b = 82.7, c = 153.4 Å.« less
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses

USDA-ARS?s Scientific Manuscript database

Background: Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected ...
.beta.-glucosidase 5 (BGL5) compositions

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2010-06-01

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
The respective roles of polar/nonpolar binary patterns and amino acid composition in protein regular secondary structures explored exhaustively using hydrophobic cluster analysis.

PubMed

Rebehmed, Joseph; Quintus, Flavien; Mornon, Jean-Paul; Callebaut, Isabelle

2016-05-01

Several studies have highlighted the leading role of the sequence periodicity of polar and nonpolar amino acids (binary patterns) in the formation of regular secondary structures (RSS). However, these were based on the analysis of only a few simple cases, with no direct mean to correlate binary patterns with the limits of RSS. Here, HCA-derived hydrophobic clusters (HC) which are conditioned binary patterns whose positions fit well those of RSS, were considered. All the HC types, defined by unique binary patterns, which were commonly observed in three-dimensional (3D) structures of globular domains, were analyzed. The 180 HC types with preferences for either α-helices or β-strands distinctly contain basic binary units typical of these RSS. Therefore a general trend supporting the "binary pattern preference" assumption was observed. HC for which observed RSS are in disagreement with their expected behavior (discordant HC) were also examined. They were separated in HC types with moderate preferences for RSS, having "weak" binary patterns and versatile RSS and HC types with high preferences for RSS, having "strong" binary patterns and then displaying nonpolar amino acids at the protein surface. It was shown that in both cases, discordant HC could be distinguished from concordant ones by well-differentiated amino acid compositions. The obtained results could, thus, help to complement the currently available methods for the accurate prediction of secondary structures in proteins from the only information of a single amino acid sequence. This can be especially useful for characterizing orphan sequences and for assisting protein engineering and design. © 2016 Wiley Periodicals, Inc.
Microbial Community Structure and Arsenic Biogeochemistry in an Acid Vapor-Formed Spring in Tengchong Geothermal Area, China

PubMed Central

Jiang, Zhou; Li, Ping; Jiang, Dawei; Dai, Xinyue; Zhang, Rui; Wang, Yanhong; Wang, Yanxin

2016-01-01

Arsenic biogeochemistry has been studied extensively in acid sulfate-chloride hot springs, but not in acid sulfate hot springs with low chloride. In this study, Zhenzhuquan in Tengchong geothermal area, a representative acid sulfate hot spring with low chloride, was chosen to study arsenic geochemistry and microbial community structure using Illumina MiSeq sequencing. Over 0.3 million 16S rRNA sequence reads were obtained from 6-paired parallel water and sediment samples along its outflow channel. Arsenic oxidation occurred in the Zhenxhuquan pool, with distinctly high ratios of arsenate to total dissolved arsenic (0.73–0.86). Coupled with iron and sulfur oxidation along the outflow channel, arsenic accumulated in downstream sediments with concentrations up to 16.44 g/kg and appeared to significantly constrain their microbial community diversity. These oxidations might be correlated with the appearance of some putative functional microbial populations, such as Aquificae and Pseudomonas (arsenic oxidation), Sulfolobus (sulfur and iron oxidation), Metallosphaera and Acidicaldus (iron oxidation). Temperature, total organic carbon and dissolved oxygen significantly shaped the microbial community structure of upstream and downstream samples. In the upstream outflow channel region, most microbial populations were microaerophilic/anaerobic thermophiles and hyperthermophiles, such as Sulfolobus, Nocardia, Fervidicoccus, Delftia, and Ralstonia. In the downstream region, aerobic heterotrophic mesophiles and thermophiles were identified, including Ktedonobacteria, Acidicaldus, Chthonomonas and Sphingobacteria. A total of 72.41–95.91% unassigned-genus sequences were derived from the downstream high arsenic sediments 16S rRNA clone libraries. This study could enable us to achieve an integrated understanding on arsenic biogeochemistry in acid hot springs. PMID:26761709
Methods of diagnosing alagille syndrome

DOEpatents

Li, Linheng; Hood, Leroy; Krantz, Ian D.; Spinner, Nancy B.

2004-03-09

The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
Direct effects of casein phosphopeptides on growth and differentiation of in vitro cultured osteoblastic cells (MC3T3-E1).

PubMed

Tulipano, Giovanni; Bulgari, Omar; Chessa, Stefania; Nardone, Alessandro; Cocchi, Daniela; Caroli, Anna

2010-02-25

Casein phosphopeptides (CPPs) obtained by enzymatic hydrolysis in vitro of caseins, have been shown to enhance calcium solubility and to increase the calcification of embryonic rat bones in their diaphyseal area. Little is known about the direct effects of CPPs on cultured osteoblastic cells. Calcium in the microenvironment surrounding bone cells is not only important for the mineralization of the extracellular matrix, but it is believed to provide preosteblasts with a signal that modulates their proliferation and differentiation. The aim of the present study was to investigate the direct effects of four selected casein phosphopeptides on osteoblastic cell (MC3T3-E1 cells) viability and differentiation. The selected peptides have been obtained by chemical synthesis and differed in the number of phosphorylated sites and in the amino acid spacing out two phosphorylated sites, in order to further characterize the relationship between structure and function. The results obtained in this work demonstrated that CPPs may directly affect osteoblast-like cell growth, calcium uptake and ultimately calcium deposition in the extracellular matrix. The effects exerted by distinct CPPs on osteogenesis in vitro can be either stimulatory or inhibitory. Differential short amino acid sequences in their molecules, like the -SpEE- and the -SpTSpEE-motifs, are likely the molecular determinants for their biological activities on osteoblastic cells. Moreover, two genetic variants of CPPs showing one amino acid change in their sequence may profoundly differ in their biological activities. Finally, our data may also suggest important clues about the role of intrinsic phosphorylated peptides derived from endogenous phosphorylated proteins in bone metabolism, apart from extrinsic CPPs. Copyright 2009 Elsevier B.V. All rights reserved.
Isolation, identification and fibrolytic characteristics of rumen fungi grown with indigenous methanogen from yaks (Bos grunniens) grazing on the Qinghai-Tibetan Plateau.

PubMed

Wei, Y-Q; Yang, H-J; Luan, Y; Long, R-J; Wu, Y-J; Wang, Z-Y

2016-03-01

To obtain co-cultures of anaerobic fungi and their indigenously associated methanogens from the rumen of yaks grazing on the Qinghai-Tibetan Plateau and investigate their morphology features and ability to degrade lignocellulose. Twenty fungus-methanogen co-cultures were obtained by Hungate roll-tube technique. The fungi were identified as Orpinomyces, Neocallimastix and Piromyces genera based on the morphological characteristics and internal transcribed spacer 1 sequences analysis. All methanogens were identified as Methanobrevibacter sp. by 16S rRNA gene sequencing. There were four types of co-cultures: Neocallimastix with Methanobrevibacter ruminantium, Orpinomyces with M. ruminantium, Orpinomyces with Methanobrevibacter millerae and Piromyces with M. ruminantium among 20 co-cultures. In vitro studies with wheat straw as substrate showed that the Neocallimastix with M. ruminantium co-cultures and Piromyces with M. ruminantium co-cultures exhibited higher xylanase, filter paper cellulase (FPase), ferulic acid esterase, acetyl esterase activities, in vitro dry matter digestibility, gas, CH4 , acetate production, ferulic acid and p-coumaric acid releases. The Neocallimastix frontalis Yak16 with M. ruminantium co-culture presented the strongest lignocellulose degradation ability among 20 co-cultures. Twenty fungus-methanogen co-cultures were obtained from the rumen of grazing yaks. The N. frontalis with M. ruminantium co-cultures were highly effective combination for developing a fermentative system that bioconverts lignocellulose to high activity fibre-degrading enzyme, CH4 and acetate. The N. frontalis with M. ruminantium co-cultures from yaks grazing on the Qinghai-Tibetan Plateau present great potential in lignocellulose biodegradation industry. © 2015 The Society for Applied Microbiology.
A New Primer to Amplify pmoA Gene From NC10 Bacteria in the Sediments of Dongchang Lake and Dongping Lake.

PubMed

Wang, Shenghui; Liu, Yanjun; Liu, Guofu; Huang, Yaru; Zhou, Yu

2017-08-01

Nitrite-dependent anaerobic methane oxidation (n-damo) is catalyzed by the NC10 phylum bacterium "Candidatus Methylomirabilis oxyfera" (M. oxyfera). Generally, the pmoA gene is applied as a functional marker to test and identify NC10-like bacteria. However, it is difficult to detect the NC10 bacteria from sediments of freshwater lake (Dongchang Lake and Dongping Lake) with the previous pmoA gene primer sets. In this work, a new primer cmo208 was designed and used to amplify pmoA gene of NC10-like bacteria. A newly nested PCR approach was performed using the new primer cmo208 and the previous primers cmo182, cmo682, and cmo568 to detect the NC10 bacteria. The obtained pmoA gene sequences exhibited 85-92% nucleotide identity and 95-97% amino acid sequence identity to pmoA gene of M. oxyfera. The obtained diversity of pmoA gene sequences coincided well with the diversity of 16S rRNA sequences. These results indicated that the newly designed pmoA primer cmo208 could give one more option to detect NC10 bacteria from different environmental samples.
A motif detection and classification method for peptide sequences using genetic programming.

PubMed

Tomita, Yasuyuki; Kato, Ryuji; Okochi, Mina; Honda, Hiroyuki

2008-08-01

An exploration of common rules (property motifs) in amino acid sequences has been required for the design of novel sequences and elucidation of the interactions between molecules controlled by the structural or physical environment. In the present study, we developed a new method to search property motifs that are common in peptide sequence data. Our method comprises the following two characteristics: (i) the automatic determination of the position and length of common property motifs by calculating the physicochemical similarity of amino acids, and (ii) the quick and effective exploration of motif candidates that discriminates the positives and negatives by the introduction of genetic programming (GP). Our method was evaluated by two types of model data sets. First, the intentionally buried property motifs were searched in the artificially derived peptide data containing intentionally buried property motifs. As a result, the expected property motifs were correctly extracted by our algorithm. Second, the peptide data that interact with MHC class II molecules were analyzed as one of the models of biologically active peptides with buried motifs in various lengths. Twofold MHC class II binding peptides were identified with the rule using our method, compared to the existing scoring matrix method. In conclusion, our GP based motif searching approach enabled to obtain knowledge of functional aspects of the peptides without any prior knowledge.
In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation

PubMed Central

Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F.; Sampson, Juliana K.; Khalid, Haniya; Sheth, Nihar U.; Batalo, Michael; Serrano, Myrna G.; Roberts, Catherine H.; Hess, Michael L.; Buck, Gregory A.; Neale, Michael C.; Manjili, Masoud H.; Toor, Amir Ahmed

2014-01-01

Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor–recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential. PMID:25414699
In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation.

PubMed

Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F; Sampson, Juliana K; Khalid, Haniya; Sheth, Nihar U; Batalo, Michael; Serrano, Myrna G; Roberts, Catherine H; Hess, Michael L; Buck, Gregory A; Neale, Michael C; Manjili, Masoud H; Toor, Amir Ahmed

2014-01-01

Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor-recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential.
Proteomic analysis of the venom from the fish eating coral snake Micrurus surinamensis: novel toxins, their function and phylogeny.

PubMed

Olamendi-Portugal, Timoteo; Batista, Cesar V F; Restano-Cassulini, Rita; Pando, Victoria; Villa-Hernandez, Oscar; Zavaleta-Martínez-Vargas, Alfonso; Salas-Arruz, Maria C; Rodríguez de la Vega, Ricardo C; Becerril, Baltazar; Possani, Lourival D

2008-05-01

The protein composition of the soluble venom from the South American fish-eating coral snake Micrurus surinamensis surinamensis, here abbreviated M. surinamensis, was separated by RP-HPLC and 2-DE, and their components were analyzed by automatic Edman degradation, MALDI-TOF and ESI-MS/MS. Approximately 100 different molecules were identified. Sixty-two components possess molecular masses between 6 and 8 kDa, are basically charged molecules, among which are cytotoxins and neurotoxins lethal to fish (Brachidanios rerio). Six new toxins (abbreviated Ms1-Ms5 and Ms11) were fully sequenced. Amino acid sequences similar to the enzymes phospholipase A2 and amino acid oxidase were identified. Over 20 additional peptides were identified by sequencing minor components of the HPLC separation and from 2-DE gels. A functional assessment of the physiological activity of the six toxins was also performed by patch clamp using muscular nicotinic acetylcholine receptor assays. Variable degrees of blockade were observed, most of them reversible. The structural and functional data obtained were used for phylogenetic analysis, providing information on some evolutionary aspects of the venom components of this snake. This contribution increases by a factor of two the total number of alpha-neurotoxins sequenced from the Micrurus genus in currently available literature.
Disoxaril mutants of Coxsackievirus B1: phenotypic characteristics and analysis of the target VP1 gene.

PubMed

Nikolova, Ivanka; Galabov, Angel S; Petkova, Rumena; Chakarov, Stoyan; Atanasov, Boris

2011-01-01

Disoxaril inhibits enterovirus replication by binding to the hydrophobic pocket within the VP1 coat protein, thus stabilizing the virion and blocking its uncoating. Disoxaril-resistant (RES) mutants of the Coxsackievirus B1 (CVB1/RES) were derived from the wild disoxaril-sensitive (SOF) strain (CVB1/SOF) using a selection approach. A disoxaril-dependent (DEP) mutant (CVB1/DEP) was obtained following nine consecutive passages of the disoxaril-resistant mutant in the presence of disoxaril. Phenotypic characteristics of the disoxaril mutants were investigated. A timing-of-addition study of the CVB1/DEP replication demonstrated that in the absence of disoxaril the virus particle assembly stopped. VP1 RNA sequences of disoxaril mutants were compared with the existing Gen Bank CVB1 reference structure. The amino acid sequence of a large VP1 196-258 peptide (disoxaril-binding region) of CVB1/RES was significantly different from that of the CVB1/SOF. Crucially important changes in CVB1/RES were two point mutations, M213H and F237L, both in the ligand-binding pocket. The sequence analysis of the CVB1/DEP showed some reversion to CVB1/SOF. The amino acid sequences of the three VP1 proteins are presented.
Analysis of the beak and feather disease viral genome indicates the existence of several genotypes which have a complex psittacine host specificity.

PubMed

de Kloet, E; de Kloet, S R

2004-12-01

A study was made of the phylogenetic relationships between fifteen complete nucleotide sequences as well as 43 nucleotide sequences of the putative coat protein gene of different strains belonging to the virus species Beak and feather disease virus obtained from 39 individuals of 16 psittacine species. The species included among others, cockatoos ( Cacatuini), African grey parrots ( Psittacus erithacus) and peach-faced lovebirds ( Agapornis roseicollis), which were infected at different geographical locations, within and outside Australia, the native origin of the virus. The derived amino acid sequences of the putative coat protein were highly diverse, with differences between some strains amounting to 50 of the 250 amino acids. Phylogenetic analysis demonstrated that the putative coat gene sequences form six clusters which show a varying degree of psittacine species specificity. Most, but not all strains infecting African grey parrots formed a single cluster as did the strains infecting the cockatoos. Strains infecting the lovebirds clustered with those infecting such Australasian species as Eclectus roratus, Psittacula kramerii and Psephotus haematogaster. Although individual birds included in this study were, where studied, often infected by closely related strains, infection by highly diverged trains was also detected. The possible relationship between BFD viral strains and clinical disease signs is discussed.
Acid–base chemical reaction model for nucleation rates in the polluted atmospheric boundary layer

PubMed Central

Chen, Modi; Titcombe, Mari; Jiang, Jingkun; Jen, Coty; Kuang, Chongai; Fischer, Marc L.; Eisele, Fred L.; Siepmann, J. Ilja; Hanson, David R.; Zhao, Jun; McMurry, Peter H.

2012-01-01

Climate models show that particles formed by nucleation can affect cloud cover and, therefore, the earth's radiation budget. Measurements worldwide show that nucleation rates in the atmospheric boundary layer are positively correlated with concentrations of sulfuric acid vapor. However, current nucleation theories do not correctly predict either the observed nucleation rates or their functional dependence on sulfuric acid concentrations. This paper develops an alternative approach for modeling nucleation rates, based on a sequence of acid–base reactions. The model uses empirical estimates of sulfuric acid evaporation rates obtained from new measurements of neutral molecular clusters. The model predicts that nucleation rates equal the sulfuric acid vapor collision rate times a prefactor that is less than unity and that depends on the concentrations of basic gaseous compounds and preexisting particles. Predicted nucleation rates and their dependence on sulfuric acid vapor concentrations are in reasonable agreement with measurements from Mexico City and Atlanta. PMID:23091030
Detection and isolation of nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1997-01-01

A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.
Detection and isolation of nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1997-04-01

A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.

Detection of nucleic acids by multiple sequential invasive cleavages

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Nucleic acid detection kits

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann; Kwiatkowski, Robert W.; Vavra, Stephanie H.

2005-03-29

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of nucleic acid from various viruses in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages 02

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.

2002-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages

DOEpatents

Hall, Jeff G; Lyamichev, Victor I; Mast, Andrea L; Brow, Mary Ann D

2012-10-16

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Molecular characterization of a nuclear topoisomerase II from Nicotiana tabacum that functionally complements a temperature-sensitive topoisomerase II yeast mutant.

PubMed

Singh, B N; Mudgil, Yashwanti; Sopory, S K; Reddy, M K

2003-07-01

We have successfully expressed enzymatically active plant topoisomerase II in Escherichia coli for the first time, which has enabled its biochemical characterization. Using a PCR-based strategy, we obtained a full-length cDNA and the corresponding genomic clone of tobacco topoisomerase II. The genomic clone has 18 exons interrupted by 17 introns. Most of the 5' and 3' splice junctions follow the typical canonical consensus dinucleotide sequence GU-AG present in other plant introns. The position of introns and phasing with respect to primary amino acid sequence in tobacco TopII and Arabidopsis TopII are highly conserved, suggesting that the two genes are evolved from the common ancestral type II topoisomerase gene. The cDNA encodes a polypeptide of 1482 amino acids. The primary amino acid sequence shows a striking sequence similarity, preserving all the structural domains that are conserved among eukaryotic type II topoisomerases in an identical spatial order. We have expressed the full-length polypeptide in E. coli and purified the recombinant protein to homogeneity. The full-length polypeptide relaxed supercoiled DNA and decatenated the catenated DNA in a Mg(2+)- and ATP-dependent manner, and this activity was inhibited by 4'-(9-acridinylamino)-3'-methoxymethanesulfonanilide (m-AMSA). The immunofluorescence and confocal microscopic studies, with antibodies developed against the N-terminal region of tobacco recombinant topoisomerase II, established the nuclear localization of topoisomerase II in tobacco BY2 cells. The regulated expression of tobacco topoisomerase II gene under the GAL1 promoter functionally complemented a temperature-sensitive TopII(ts) yeast mutant.
Expanding the peptide beta-turn in alphagamma hybrid sequences: 12 atom hydrogen bonded helical and hairpin turns.

PubMed

Chatterjee, Sunanda; Vasudev, Prema G; Raghothama, Srinivasarao; Ramakrishnan, Chandrasekharan; Shamala, Narayanaswamy; Balaram, Padmanabhan

2009-04-29

Hybrid peptide segments containing contiguous alpha and gamma amino acid residues can form C(12) hydrogen bonded turns which may be considered as backbone expanded analogues of C(10) (beta-turns) found in alphaalpha segments. Exploration of the regular hydrogen bonded conformations accessible for hybrid alphagamma sequences is facilitated by the use of a stereochemically constrained gamma amino acid residue gabapentin (1-aminomethylcyclohexaneacetic acid, Gpn), in which the two torsion angles about C(gamma)-C(beta) (theta(1)) and C(beta)-C(alpha) (theta(2)) are predominantly restricted to gauche conformations. The crystal structures of the octapeptides Boc-Gpn-Aib-Gpn-Aib-Gpn-Aib-Gpn-Aib-OMe (1) and Boc-Leu-Phe-Val-Aib-Gpn-Leu-Phe-Val-OMe (2) reveal two distinct conformations for the Aib-Gpn segment. Peptide 1 forms a continuous helix over the Aib(2)-Aib(6) segment, while the peptide 2 forms a beta-hairpin structure stabilized by four cross-strand hydrogen bonds with the Aib-Gpn segment forming a nonhelical C(12) turn. The robustness of the helix in peptide 1 in solution is demonstrated by NMR methods. Peptide 2 is conformationally fragile in solution with evidence of beta-hairpin conformations being obtained in methanol. Theoretical calculations permit delineation of the various C(12) hydrogen bonded structures which are energetically feasible in alphagamma and gammaalpha sequences.
Molecular Characterization of Tomato 3-Dehydroquinate Dehydratase-Shikimate:NADP Oxidoreductase1

PubMed Central

Bischoff, Markus; Schaller, Andreas; Bieri, Fabian; Kessler, Felix; Amrhein, Nikolaus; Schmid, Jürg

2001-01-01

Analysis of cDNAs encoding the bifunctional 3-dehydroquinate dehydratase-shikimate:NADP oxidoreductase (DHQase-SORase) from tomato (Lycopersicon esculentum) revealed two classes of cDNAs that differed by 57 bp within the coding regions, but were otherwise identical. Comparison of these cDNA sequences with the sequence of the corresponding single gene unequivocally proved that the primary transcript is differentially spliced, potentially giving rise to two polypeptides that differ by 19 amino acids. Quantitative real-time polymerase chain reaction revealed that the longer transcript constitutes at most 1% to 2% of DHQase-SORase transcripts. Expression of the respective polypeptides in Escherichia coli mutants lacking the DHQase or the SORase activity gave functional complementation only in case of the shorter polypeptide, indicating that skipping of a potential exon is a prerequisite for the production of an enzymatically active protein. The deduced amino acid sequence revealed that the DHQase-SORase is most likely synthesized as a precursor with a very short (13-amino acid) plastid-specific transit peptide. Like other genes encoding enzymes of the prechorismate pathway in tomato, this gene is elicitor-inducible. Tissue-specific expression resembles the patterns obtained for 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase 2 and dehydroquinate synthase genes. This work completes our studies of the prechorismate pathway in that cDNAs for all seven enzymes (including isozymes) of the prechorismate pathway from tomato have now been characterized. PMID:11299368
Primary structure of rat cardiac beta-adrenergic and muscarinic cholinergic receptors obtained by automated DNA sequence analysis: further evidence for a multigene family.

PubMed

Gocayne, J; Robinson, D A; FitzGerald, M G; Chung, F Z; Kerlavage, A R; Lentes, K U; Lai, J; Wang, C D; Fraser, C M; Venter, J C

1987-12-01

Two cDNA clones, lambda RHM-MF and lambda RHB-DAR, encoding the muscarinic cholinergic receptor and the beta-adrenergic receptor, respectively, have been isolated from a rat heart cDNA library. The cDNA clones were characterized by restriction mapping and automated DNA sequence analysis utilizing fluorescent dye primers. The rat heart muscarinic receptor consists of 466 amino acids and has a calculated molecular weight of 51,543. The rat heart beta-adrenergic receptor consists of 418 amino acids and has a calculated molecular weight of 46,890. The two cardiac receptors have substantial amino acid homology (27.2% identity, 50.6% with favored substitutions). The rat cardiac beta receptor has 88.0% homology (92.5% with favored substitutions) with the human brain beta receptor and the rat cardiac muscarinic receptor has 94.6% homology (97.6% with favored substitutions) with the porcine cardiac muscarinic receptor. The muscarinic cholinergic and beta-adrenergic receptors appear to be as conserved as hemoglobin and cytochrome c but less conserved than histones and are clearly members of a multigene family. These data support our hypothesis, based upon biochemical and immunological evidence, that suggests considerable structural homology and evolutionary conservation between adrenergic and muscarinic cholinergic receptors. To our knowledge, this is the first report utilizing automated DNA sequence analysis to determine the structure of a gene.
High-Throughput Next-Generation Sequencing of Polioviruses

PubMed Central

Montmayeur, Anna M.; Schmidt, Alexander; Zhao, Kun; Magaña, Laura; Iber, Jane; Castro, Christina J.; Chen, Qi; Henderson, Elizabeth; Ramos, Edward; Shaw, Jing; Tatusov, Roman L.; Dybdahl-Sissoko, Naomi; Endegue-Zanga, Marie Claire; Adeniji, Johnson A.; Oberste, M. Steven; Burns, Cara C.

2016-01-01

ABSTRACT The poliovirus (PV) is currently targeted for worldwide eradication and containment. Sanger-based sequencing of the viral protein 1 (VP1) capsid region is currently the standard method for PV surveillance. However, the whole-genome sequence is sometimes needed for higher resolution global surveillance. In this study, we optimized whole-genome sequencing protocols for poliovirus isolates and FTA cards using next-generation sequencing (NGS), aiming for high sequence coverage, efficiency, and throughput. We found that DNase treatment of poliovirus RNA followed by random reverse transcription (RT), amplification, and the use of the Nextera XT DNA library preparation kit produced significantly better results than other preparations. The average viral reads per total reads, a measurement of efficiency, was as high as 84.2% ± 15.6%. PV genomes covering >99 to 100% of the reference length were obtained and validated with Sanger sequencing. A total of 52 PV genomes were generated, multiplexing as many as 64 samples in a single Illumina MiSeq run. This high-throughput, sequence-independent NGS approach facilitated the detection of a diverse range of PVs, especially for those in vaccine-derived polioviruses (VDPV), circulating VDPV, or immunodeficiency-related VDPV. In contrast to results from previous studies on other viruses, our results showed that filtration and nuclease treatment did not discernibly increase the sequencing efficiency of PV isolates. However, DNase treatment after nucleic acid extraction to remove host DNA significantly improved the sequencing results. This NGS method has been successfully implemented to generate PV genomes for molecular epidemiology of the most recent PV isolates. Additionally, the ability to obtain full PV genomes from FTA cards will aid in facilitating global poliovirus surveillance. PMID:27927929
Generation and analysis of expressed sequence tags from the bone marrow of Chinese Sika deer.

PubMed

Yao, Baojin; Zhao, Yu; Zhang, Mei; Li, Juan

2012-03-01

Sika deer is one of the best-known and highly valued animals of China. Despite its economic, cultural, and biological importance, there has not been a large-scale sequencing project for Sika deer to date. With the ultimate goal of sequencing the complete genome of this organism, we first established a bone marrow cDNA library for Sika deer and generated a total of 2,025 reads. After processing the sequences, 2,017 high-quality expressed sequence tags (ESTs) were obtained. These ESTs were assembled into 1,157 unigenes, including 238 contigs and 919 singletons. Comparative analyses indicated that 888 (76.75%) of the unigenes had significant matches to sequences in the non-redundant protein database, In addition to highly expressed genes, such as stearoyl-CoA desaturase, cytochrome c oxidase, adipocyte-type fatty acid-binding protein, adiponectin and thymosin beta-4, we also obtained vascular endothelial growth factor-A and heparin-binding growth-associated molecule, both of which are of great importance for angiogenesis research. There were 244 (21.09%) unigenes with no significant match to any sequence in current protein or nucleotide databases, and these sequences may represent genes with unknown function in Sika deer. Open reading frame analysis of the sequences was performed using the getorf program. In addition, the sequences were functionally classified using the gene ontology hierarchy, clusters of orthologous groups of proteins and Kyoto encyclopedia of genes and genomes databases. Analysis of ESTs described in this paper provides an important resource for the transcriptome exploration of Sika deer, and will also facilitate further studies on functional genomics, gene discovery and genome annotation of Sika deer.
ISOLATION AND CHARACTERIZATION OF AXOLOTL NPDC-1 AND ITS EFFECTS ON RETINOIC ACID RECEPTOR SIGNALING

PubMed Central

Theodosiou, Maria; Monaghan, James R; Spencer, Michael L; Voss, S Randal; Noonan, Daniel J

2009-01-01

Retinoic acid, a key morphogen in early vertebrate development and tissue regeneration, mediates its effects through the binding of receptors that act as ligand-induced transcription factors. These binding events function to recruit an array of transcription co-regulatory proteins to specific gene promoters. One such co-regulatory protein, neuronal proliferation and differentiation control-1 (NPDC-1), is broadly expressed during mammalian development and functions as an in vitro repressor of retinoic acid receptor (RAR)-mediated transcription. To obtain comparative and developmental insights about NPDC-1 function, we cloned the axolotl (Ambystoma mexicanum) orthologue and measured transcript abundances among tissues sampled during the embryonic and juvenile phases of development, and also during spinal cord regeneration. Structurally, the axolotl orthologue of NPDC-1 retained sequence identity to mammalian sequences in all functional domains. Functionally, we observed that axolotl NPDC-1 mRNA expression peaked late in embryogenesis, with highest levels of expression occurring during the time of limb development, a process regulated by retinoic acid signaling. Also similar to what has been observed in mammals, axolotl NPDC-1 directly interacts with axolotl RAR, modulates axolotl RAR DNA binding, and represses cell proliferation and axolotl RAR-mediated gene transcription. These data justify axolotl as a model to further investigate NPDC-1 and its role in regulating retinoic acid signaling. PMID:17331771
Treatability of cheese whey for single-cell protein production in nonsterile systems: Part II. The application of aerobic sequencing batch reactor (aerobic SBR) to produce high biomass of Dioszegia sp. TISTR 5792.

PubMed

Monkoondee, Sarawut; Kuntiya, Ampin; Chaiyaso, Thanongsak; Leksawasdi, Noppol; Techapun, Charin; Kawee-Ai, Arthitaya; Seesuriyachan, Phisit

2016-07-03

This study aimed to investigate the efficiency of an aerobic sequencing batch reactor (aerobic SBR) in a nonsterile system using the application of an experimental design via central composite design (CCD). The acidic whey obtained from lactic acid fermentation by immobilized Lactobacillus plantarum sp. TISTR 2265 was fed into the bioreactor of the aerobic SBR in an appropriate ratio between acidic whey and cheese whey to produce an acidic environment below 4.5 and then was used to support the growth of Dioszegia sp. TISTR 5792 by inhibiting bacterial contamination. At the optimal condition for a high yield of biomass production, the system was run with a hydraulic retention time (HRT) of 4 days, a solid retention time (SRT) of 8.22 days, and an acidic whey concentration of 80% feeding. The chemical oxygen demand (COD) decreased from 25,230 mg/L to 6,928 mg/L, which represented a COD removal of 72.15%. The yield of biomass production and lactose utilization by Dioszegia sp. TISTR 5792 were 13.14 g/L and 33.36%, respectively, with a long run of up to 180 cycles and the pH values of effluent were rose up to 8.32 without any pH adjustment.
An Aspergillus oryzae acetyl xylan esterase: molecular cloning and characteristics of recombinant enzyme expressed in Pichia pastoris.

PubMed

Koseki, Takuya; Miwa, Yozo; Akao, Takeshi; Akita, Osamu; Hashizume, Katsumi

2006-02-10

We screened 20,000 clones of an expressed sequence tag (EST) library from Aspergillus oryzae (http://www.nrib.go.jp/ken/EST/db/index.html) and obtained one cDNA clone encoding a protein with similarity to fungal acetyl xylan esterase. We also cloned the corresponding gene, designated as Aoaxe, from the genomic DNA. The deduced amino acid sequence consisted of a putative signal peptide of 31-amino acids and a mature protein of 276-amino acids. We engineered Aoaxe for heterologous expression in P. pastoris. Recombinant AoAXE (rAoAXE) was secreted by the aid of fused alpha-factor secretion signal peptide and accumulated as an active enzyme in the culture medium to a final level of 190 mg/l after 5 days. Purified rAoAXEA before and after treatment with endoglycosidase H migrated by SDS-PAGE with a molecular mass of 31 and 30 kDa, respectively. Purified rAoAXE displayed the greatest hydrolytic activity toward alpha-naphthylacetate (C2), lower activity toward alpha-naphthylpropionate (C3) and no detectable activity toward acyl-chain substrates containing four or more carbon atoms. The recombinant enzyme catalyzed the release of acetic acid from birchwood xylan. No activity was detectable using methyl esters of ferulic, caffeic or sinapic acids. rAoAXE was thermolabile in comparison to other AXEs from Aspergillus.
Thraustochytrids as production organisms for docosahexaenoic acid (DHA), squalene, and carotenoids.

PubMed

Aasen, Inga Marie; Ertesvåg, Helga; Heggeset, Tonje Marita Bjerkan; Liu, Bin; Brautaset, Trygve; Vadstein, Olav; Ellingsen, Trond E

2016-05-01

Thraustochytrids have been applied for industrial production of the omega-3 fatty acid docosahexaenoic (DHA) since the 1990s. During more than 20 years of research on this group of marine, heterotrophic microorganisms, considerable increases in DHA productivities have been obtained by process and medium optimization. Strains of thraustochytrids also produce high levels of squalene and carotenoids, two other commercially interesting compounds with a rapidly growing market potential, but where yet few studies on process optimization have been reported. Thraustochytrids use two pathways for fatty acid synthesis. The saturated fatty acids are produced by the standard fatty acid synthesis, while DHA is synthesized by a polyketide synthase. However, fundamental knowledge about the relationship between the two pathways is still lacking. In the present review, we extract main findings from the high number of reports on process optimization for DHA production and interpret these in the light of the current knowledge of DHA synthesis in thraustochytrids and lipid accumulation in oleaginous microorganisms in general. We also summarize published reports on squalene and carotenoid production and review the current status on strain improvement, which has been hampered by the yet very few published genome sequences and the lack of tools for gene transfer to the organisms. As more sequences now are becoming available, targets for strain improvement can be identified and open for a system-level metabolic engineering for improved productivities.
Complete nucleotide and derived amino acid sequence of cDNA encoding the mitochondrial uncoupling protein of rat brown adipose tissue: lack of a mitochondrial targeting presequence.

PubMed Central

Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B

1986-01-01

A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. Images PMID:3012461
Method of increasing conversion of a fatty acid to its corresponding dicarboxylic acid

DOEpatents

Craft, David L.; Wilson, C. Ron; Eirich, Dudley; Zhang, Yeyan

2004-09-14

A nucleic acid sequence including a CYP promoter operably linked to nucleic acid encoding a heterologous protein is provided to increase transcription of the nucleic acid. Expression vectors and host cells containing the nucleic acid sequence are also provided. The methods and compositions described herein are especially useful in the production of polycarboxylic acids by yeast cells.
Comparative genomics and transcriptome analysis of Lactobacillus rhamnosus ATCC 11443 and the mutant strain SCT-10-10-60 with enhanced L-lactic acid production capacity.

PubMed

Sun, Liang; Lu, Zhilong; Li, Jianxiu; Sun, Feifei; Huang, Ribo

2018-02-01

Mechanisms for high L-lactic acid production remain unclear in many bacteria. Lactobacillus rhamnosus SCT-10-10-60 was previously obtained from L. rhamnosus ATCC 11443 via mutagenesis and showed improved L-lactic acid production. In this study, the genomes of strains SCT-10-10-60 and ATCC 11443 were sequenced. Both genomes are a circular chromosome, 2.99 Mb in length with a GC content of approximately 46.8%. Eight split genes were identified in strain SCT-10-10-60, including two LytR family transcriptional regulators, two Rex redox-sensing transcriptional repressors, and four ABC transporters. In total, 60 significantly up-regulated genes (log 2 fold-change ≥ 2) and 39 significantly down-regulated genes (log 2 fold-change ≤ - 2) were identified by a transcriptome comparison between strains SCT-10-10-60 and ATCC 11443. KEGG pathway enrichment analysis revealed that "pyruvate metabolism" was significantly different (P < 0.05) between the two strains. The split genes and the differentially expressed genes involved in the "pyruvate metabolism" pathway are probably responsible for the increased L-lactic acid production by SCT-10-10-60. The genome and transcriptome sequencing information and comparison of SCT-10-10-60 with ATCC 11443 provide insights into the anabolism of L-lactic acid and a reference for improving L-lactic acid production using genetic engineering.
Regulatory elements in vivo in the promoter of the abscisic acid responsive gene rab17 from maize.

PubMed

Busk, P K; Jensen, A B; Pagès, M

1997-06-01

The rab17 gene from maize is transcribed in late embryonic development and is responsive to abscisic acid and water stress in embryo and vegetative tissues. In vivo footprinting and transient transformation of rab17 were performed in embryos and vegetative tissues to characterize the cis-elements involved in regulation of the gene. By in vivo footprinting, protein binding was observed to nine elements in the promoter, which correspond to five putative ABREs (abscisic acid responsive elements) and four other sequences. The footprints indicated that distinct proteins interact with these elements in the two developmental stages. In transient transformation, six of the elements were important for high level expression of the rab17 promoter in embryos, whereas only three elements were important in leaves. The cis-acting sequences can be divided in embryo-specific, ABA-specific and leaf-specific elements on the basis of protein binding and the ability to confer expression of rab17. We found one positive, new element, called GRA, with the sequence CACTGGCCGCCC. This element was important for transcription in leaves but not in embryos. Two other non-ABRE elements that stimulated transcription from the rab17 promoter resemble previously described abscisic acid and drought-inducible elements. There were differences in protein binding and function of the five ABREs in the rab17 promoter. The possible reasons for these differences are discussed. The in vivo data obtained suggest that an embryo-specific pathway regulates transcription of the rab genes during development, whereas another pathway is responsible for induction in response to ABA and drought in vegetative tissues.
Characterization of a prototype strain of hepatitis E virus.

PubMed

Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H

1992-01-15

A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia.
Enterocin T, a novel class IIa bacteriocin produced by Enterococcus sp. 812.

PubMed

Chen, Yi-Sheng; Yu, Chi-Rong; Ji, Si-Hua; Liou, Min-Shiuan; Leong, Kun-Hon; Pan, Shwu-Fen; Wu, Hui-Chung; Lin, Yu-Hsuan; Yu, Bi; Yanagida, Fujitoshi

2013-09-01

Enterococcus sp. 812, isolated from fresh broccoli, was previously found to produce a bacteriocin active against a number of Gram-positive bacteria, including Listeria monocytogenes. Bacteriocin activity decreased slightly after autoclaving (121 °C for 15 min), but was inactivated by protease K. Mass spectrometry analysis revealed the bacteriocin mass to be approximately 4,521.34 Da. N-terminal amino acid sequencing yielded a partial sequence, NH2-ATYYGNGVYXDKKKXWVEWGQA, by Edman degradation, which contained the consensus class IIa bacteriocin motif YGNGV in the N-terminal region. The obtained partial sequence showed high homology with some enterococcal bacteriocins; however, no identical peptide or protein was found. This peptide was therefore considered to be a novel bacteriocin produced by Enterococcus sp. 812 and was termed enterocin T.

Evaluation of magnetic resonance signal modification induced by hyaluronic acid therapy in chondromalacia patellae: a preliminary study.

PubMed

Magarelli, N; Palmieri, D; Ottaviano, L; Savastano, M; Barbato, M; Leone, A; Maggialetti, A; Ciampa, F P; Bonomo, L

2008-01-01

Hyaluronic Acid (HA) is an alternative method for the treatment of osteoarthritis (OA), which acts on pain through a double action: anti-inflammatory and synovial fluid (SF) visco-supplementation. Magnetic Resonance Imaging (MRI), utilizing specific sequences, is a valid method for studying the initial phase of chondral damage. The analysis of the data, obtained through the intensity of values taken by positioning Region of Interest (ROIs) within the lesion, determining the differences before and after treatment with HA injected into the knee. The results obtained after six months and one year from the injection were statistically different in respect to those taken before, immediately and after three months of treatment. MRI represents a valid tool to evaluate the grade of chondromalacia patellae and also to follow the cartilage modification induced by HA therapy.
A putative carbohydrate-binding domain of the lactose-binding Cytisus sessilifolius anti-H(O) lectin has a similar amino acid sequence to that of the L-fucose-binding Ulex europaeus anti-H(O) lectin.

PubMed

Konami, Y; Yamamoto, K; Osawa, T; Irimura, T

1995-04-01

The complete amino acid sequence of a lactose-binding Cytisus sessilifolius anti-H(O) lectin II (CSA-II) was determined using a protein sequencer. After digestion of CSA-II with endoproteinase Lys-C or Asp-N, the resulting peptides were purified by reversed-phase high performance liquid chromatography (HPLC) and then subjected to sequence analysis. Comparison of the complete amino acid sequence of CSA-II with the sequences of other leguminous seed lectins revealed regions of extensive homology. The amino acid sequence of a putative carbohydrate-binding domain of CSA-II was found to be similar to those of several anti-H(O) leguminous lectins, especially to that of the L-fucose-binding Ulex europaeus lectin I (UEA-I).
Deep sequencing of the Mexican avocado transcriptome, an ancient angiosperm with a high content of fatty acids.

PubMed

Ibarra-Laclette, Enrique; Méndez-Bravo, Alfonso; Pérez-Torres, Claudia Anahí; Albert, Victor A; Mockaitis, Keithanne; Kilaru, Aruna; López-Gómez, Rodolfo; Cervantes-Luevano, Jacob Israel; Herrera-Estrella, Luis

2015-08-13

Avocado (Persea americana) is an economically important tropical fruit considered to be a good source of fatty acids. Despite its importance, the molecular and cellular characterization of biochemical and developmental processes in avocado is limited due to the lack of transcriptome and genomic information. The transcriptomes of seeds, roots, stems, leaves, aerial buds and flowers were determined using different sequencing platforms. Additionally, the transcriptomes of three different stages of fruit ripening (pre-climacteric, climacteric and post-climacteric) were also analyzed. The analysis of the RNAseqatlas presented here reveals strong differences in gene expression patterns between different organs, especially between root and flower, but also reveals similarities among the gene expression patterns in other organs, such as stem, leaves and aerial buds (vegetative organs) or seed and fruit (storage organs). Important regulators, functional categories, and differentially expressed genes involved in avocado fruit ripening were identified. Additionally, to demonstrate the utility of the avocado gene expression atlas, we investigated the expression patterns of genes implicated in fatty acid metabolism and fruit ripening. A description of transcriptomic changes occurring during fruit ripening was obtained in Mexican avocado, contributing to a dynamic view of the expression patterns of genes involved in fatty acid biosynthesis and the fruit ripening process.
trans-10,cis-12 conjugated linoleic acid alters lipid metabolism of goat mammary epithelial cells by regulation of de novo synthesis and the AMPK signaling pathway.

PubMed

Zhang, T Y; Huang, J T; Tian, H B; Ma, Y; Chen, Z; Wang, J J; Shi, H P; Luo, J

2018-06-01

The trans-10,cis-12 isomer of conjugated linoleic acid (t10c12-CLA) is a biohydrogenation intermediate in the rumen and has been shown to cause milk fat depression in dairy goats. However, few studies have focused on the in vitro molecular mechanisms involved in the response of the goat mammary gland to t10c12-CLA. In the present study, RNA sequencing technology was used to investigate the effects of t10c12-CLA on goat mammary epithelial cells. From the data, 25,153 annotated transcripts were obtained, and differentially expressed genes were selected based on a false discovery rate <0.05. Candidate genes and potent cellular signaling pathways were identified through Gene Ontology (GO) and pathway analysis. Next, real-time quantitative PCR and Western blot analyses were used to verify the results of the RNA sequencing data. The results indicated that t10c12-CLA inhibits fatty acid synthesis through downregulation of genes involved in de novo fatty acid synthesis, and this process is likely correlated with the activation of the AMP-activated protein kinase signaling pathways. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Fast and Non-Toxic In Situ Hybridization without Blocking of Repetitive Sequences

PubMed Central

Matthiesen, Steen H.; Hansen, Charles M.

2012-01-01

Formamide is the preferred solvent to lower the melting point and annealing temperature of nucleic acid strands in in situ hybridization (ISH). A key benefit of formamide is better preservation of morphology due to a lower incubation temperature. However, in fluorescence in situ hybridization (FISH), against unique DNA targets in tissue sections, an overnight hybridization is required to obtain sufficient signal intensity. Here, we identified alternative solvents and developed a new hybridization buffer that reduces the required hybridization time to one hour (IQFISH method). Remarkably, denaturation and blocking against repetitive DNA sequences to prevent non-specific binding is not required. Furthermore, the new hybridization buffer is less hazardous than formamide containing buffers. The results demonstrate a significant increased hybridization rate at a lowered denaturation and hybridization temperature for both DNA and PNA (peptide nucleic acid) probes. We anticipate that these formamide substituting solvents will become the foundation for changes in the understanding and performance of denaturation and hybridization of nucleic acids. For example, the process time for tissue-based ISH for gene aberration tests in cancer diagnostics can be reduced from days to a few hours. Furthermore, the understanding of the interactions and duplex formation of nucleic acid strands may benefit from the properties of these solvents. PMID:22911704
High throughput method to characterize acid-base properties of insoluble drug candidates in water.

PubMed

Benito, D E; Acquaviva, A; Castells, C B; Gagliardi, L G

2018-05-30

In drug design experimental characterization of acidic groups in candidate molecules is one of the more important steps prior to the in-vivo studies. Potentiometry combined with Yasuda-Shedlovsky extrapolation is one of the more important strategy to study drug candidates with low solubility in water, although, it requires a large number of sequences to determine pK a values at different solvent-mixture compositions to, finally, obtain the pK a in water (pwwK a ) by extrapolation. We have recently proposed a method which requires only two sequences of additions to study the effect of organic solvent content in liquid chromatography mobile phases on the acidity of the buffer compounds usually dissolved in it along wide ranges of compositions. In this work we propose to apply this method to study thermodynamic pwwK a of drug candidates with low solubilities in pure water. Using methanol/water solvent mixtures we study six pharmaceutical drugs at 25 °C. Four of them: ibuprofen, salicylic acid, atenolol and labetalol, were chosen as members of carboxylic, amine and phenol families, respectively. Since these compounds have known pwwK a values, they were used to validate the procedure, the accuracy of Yasuda-Shedlovsky and other empirical models to fit the behaviors, and to obtain pwwK a by extrapolation. Finally, the method is applied to determine unknown thermodynamic pwwK a values of two pharmaceutical drugs: atorvastatin calcium and the two dissociation constants of ethambutol. The procedure proved to be simple, very fast and accurate in all of the studied cases. Copyright © 2018 Elsevier B.V. All rights reserved.
Sequence heterogeneity of cannabidiolic- and tetrahydrocannabinolic acid-synthase in Cannabis sativa L. and its relationship with chemical phenotype.

PubMed

Onofri, Chiara; de Meijer, Etienne P M; Mandolino, Giuseppe

2015-08-01

Sequence variants of THCA- and CBDA-synthases were isolated from different Cannabis sativa L. strains expressing various wild-type and mutant chemical phenotypes (chemotypes). Expressed and complete sequences were obtained from mature inflorescences. Each strain was shown to have a different specificity and/or ability to convert the precursor CBGA into CBDA and/or THCA type products. The comparison of the expressed sequences led to the identification of different mutations, all of them due to SNPs. These SNPs were found to relate to the cannabinoid composition of the inflorescence at maturity and are therefore proposed to have a functional significance. The amount of variation was found to be higher within the CBDAS sequence family than in the THCAS family, suggesting a more recent evolution of THCA-forming enzymes from the CBDAS group. We therefore consider CBDAS as the ancestral type of these synthases. Copyright © 2015 Elsevier Ltd. All rights reserved.
Characterization of Austrian koi herpesvirus samples based on the ORF40 region.

PubMed

Marek, A; Schachner, O; Bilic, I; Hess, M

2010-02-17

Using a PCR that amplifies a region of the thymidine kinase (TK) gene, an epidemic spread of koi herpesvirus (KHV) was determined in koi carps in Austria in 2007. A total of 15 virus samples from different locations in Austria were analyzed to determine their genetic relatedness following PCR and nucleic acid sequencing of the open reading frame 40 (ORF40) region of the KHV genome. ORF40-specific PCR amplification products that were obtained from tissue samples shared 100% nucleotide sequence identity with the published sequence of the Japanese strain of KHV. The ORF40 sequence of one isolate from the UK that was included in the present study was 100% identical with the published sequence of an Israeli strain of KHV. This is the first study that used a larger number of samples and a PCR method, which allowed distinguishing all 3 strains of KHV. The present investigation provides information on the epidemiology of KHV infections in Europe and describes a useful molecular tool for epidemiological studies.
Primary structure of prostaglandin G/H synthase from sheep vesicular gland determined from the complementary DNA sequence.

PubMed Central

DeWitt, D L; Smith, W L

1988-01-01

Prostaglandin G/H synthase (8,11,14-icosatrienoate, hydrogen-donor:oxygen oxidoreductase, EC 1.14.99.1) catalyzes the first step in the formation of prostaglandins and thromboxanes, the conversion of arachidonic acid to prostaglandin endoperoxides G and H. This enzyme is the site of action of nonsteroidal anti-inflammatory drugs. We have isolated a 2.7-kilobase complementary DNA (cDNA) encompassing the entire coding region of prostaglandin G/H synthase from sheep vesicular glands. This cDNA, cloned from a lambda gt 10 library prepared from poly(A)+ RNA of vesicular glands, hybridizes with a single 2.75-kilobase mRNA species. The cDNA clone was selected using oligonucleotide probes modeled from amino acid sequences of tryptic peptides prepared from the purified enzyme. The full-length cDNA encodes a protein of 600 amino acids, including a signal sequence of 24 amino acids. Identification of the cDNA as coding for prostaglandin G/H synthase is based on comparison of amino acid sequences of seven peptides comprising 103 amino acids with the amino acid sequence deduced from the nucleotide sequence of the cDNA. The molecular weight of the unglycosylated enzyme lacking the signal peptide is 65,621. The synthase is a glycoprotein, and there are three potential sites for N-glycosylation, two of them in the amino-terminal half of the molecule. The serine reported to be acetylated by aspirin is at position 530, near the carboxyl terminus. There is no significant similarity between the sequence of the synthase and that of any other protein in amino acid or nucleotide sequence libraries, and a heme binding site(s) is not apparent from the amino acid sequence. The availability of a full-length cDNA clone coding for prostaglandin G/H synthase should facilitate studies of the regulation of expression of this enzyme and the structural features important for catalysis and for interaction with anti-inflammatory drugs. Images PMID:3125548
PubDNA Finder: a web database linking full-text articles to sequences of nucleic acids.

PubMed

García-Remesal, Miguel; Cuevas, Alejandro; Pérez-Rey, David; Martín, Luis; Anguita, Alberto; de la Iglesia, Diana; de la Calle, Guillermo; Crespo, José; Maojo, Víctor

2010-11-01

PubDNA Finder is an online repository that we have created to link PubMed Central manuscripts to the sequences of nucleic acids appearing in them. It extends the search capabilities provided by PubMed Central by enabling researchers to perform advanced searches involving sequences of nucleic acids. This includes, among other features (i) searching for papers mentioning one or more specific sequences of nucleic acids and (ii) retrieving the genetic sequences appearing in different articles. These additional query capabilities are provided by a searchable index that we created by using the full text of the 176 672 papers available at PubMed Central at the time of writing and the sequences of nucleic acids appearing in them. To automatically extract the genetic sequences occurring in each paper, we used an original method we have developed. The database is updated monthly by automatically connecting to the PubMed Central FTP site to retrieve and index new manuscripts. Users can query the database via the web interface provided. PubDNA Finder can be freely accessed at http://servet.dia.fi.upm.es:8080/pubdnafinder
[Apply fourier transform infrared spectra coupled with two-dimensional correlation analysis to study the evolution of humic acids during composting].

PubMed

Bu, Gui-jun; Yu, Jing; Di, Hui-hui; Luo, Shi-jia; Zhou, Da-zhai; Xiao, Qiang

2015-02-01

The composition and structure of humic acids formed during composting play an important influence on the quality and mature of compost. In order to explore the composition and evolution mechanism, municipal solid wastes were collected to compost and humic and fulvic acids were obtained from these composted municipal solid wastes. Furthermore, fourier transform infrared spectra and two-dimensional correlation analysis were applied to study the composition and transformation of humic and fulvic acids during composting. The results from fourier transform infrared spectra showed that, the composition of humic acids was complex, and several absorbance peaks were observed at 2917-2924, 2844-2852, 2549, 1662, 1622, 1566, 1454, 1398, 1351, 990-1063, 839 and 711 cm(-1). Compared to humic acids, the composition of fulvci acids was simple, and only three peaks were detected at 1725, 1637 and 990 cm(-1). The appearance of these peaks showed that both humic and fulvic acids comprised the benzene originated from lignin and the polysaccharide. In addition, humic acids comprised a large number of aliphatic and protein which were hardly detected in fulvic acids. Aliphatic, polysaccharide, protein and lignin all were degraded during composting, however, the order of degradation was different between humic and fulvci acids. The result from two-dimensional correlation analysis showed that, organic compounds in humic acids were degraded in the following sequence: aliphatic> protein> polysaccharide and lignin, while that in fulvic acids was as following: protein> polysaccharide and aliphatic. A large number of carboxyl, alcohols and ethers were formed during the degradation process, and the carboxyl was transformed into carbonates. It can be concluded that, fourier transform infrared spectra coupled with two-dimensional correlation analysis not only can analyze the function group composition of humic substances, but also can characterize effectively the degradation sequence of these groups and identified the formation mechanism and dynamics of humic substances during composting.
RNA-seq reveals transcriptome changes in goats following myostatin gene knockout

PubMed Central

Cai, Bei; Zhou, Shiwei; Zhu, Haijing; Qu, Lei; Wang, Xiaolong

2017-01-01

Myostatin (MSTN) is a powerful negative regulator of skeletal muscle mass in mammalian species that is primarily expressed in skeletal muscles, and mutations of its encoding gene can result in the double-muscling trait. In this study, the CRISPR/Cas9 technique was used to edit MSTN in Shaanbei Cashmere goats and generate knockout animals. RNA sequencing was used to determine and compare the transcriptome profiles of the muscles from three wild-type (WT) goats, three fibroblast growth factor 5 (FGF5) knockout goats (FGF5+/- group) and three goats with disrupted expression of both the FGF5 and MSTN genes (FM+/- group). The sequence reads were obtained using the Illumina HiSeq 2000 system and mapped to the Capra hircus reference genome using TopHat (v2.0.9). In total, 68.93, 62.04 and 66.26 million clean sequencing reads were obtained from the WT, FM+/- and FGF5+/- groups, respectively. There were 201 differentially expressed genes (DEGs) between the WT and FGF5+/- groups, with 86 down- and 115 up-regulated genes in the FGF5+/- group. Between the WT and FM+/- groups, 121 DEGs were identified, including 81 down- and 40 up-regulated genes in the FM+/- group. A total of 198 DEGs were detected between the FGF5+/- group and FM+/- group, with 128 down- and 70 up-regulated genes in the FM+/- group. At the transcriptome level, we found substantial changes in genes involved in fatty acid metabolism and the biosynthesis of unsaturated fatty acids, such as stearoyl-CoA dehydrogenase, 3-hydroxyacyl-CoA dehydratase 2, ELOVL fatty acid elongase 6 and fatty acid synthase, suggesting that the expression levels of these genes may be directly regulated by MSTN and that these genes are likely downstream targets of MSTN with potential roles in lipid metabolism in goats. Moreover, five randomly selected DEGs were further validated with qRT-PCR, and the results were consistent with the transcriptome analysis. The present study provides insight into the unique transcriptome profile of the MSTN knockout goat, which is a valuable resource for studying goat genomics. PMID:29228005
Nucleotide sequence analysis of the gene encoding the Deinococcus radiodurans surface protein, derived amino acid sequence, and complementary protein chemical studies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Peters, J.; Peters, M.; Lottspeich, F.

1987-11-01

The complete nucleotide sequence of the gene encoding the surface (hexagonally packed intermediate (HPI))-layer polypeptide of Deinococcus radiodurans Sark was determined and found to encode a polypeptide of 1036 amino acids. Amino acid sequence analysis of about 30% of the residues revealed that the mature polypeptide consists of at least 978 amino acids. The N terminus was blocked to Edman degradation. The results of proteolytic modification of the HPI layer in situ and M/sub r/ estimations of the HPI polypeptide expressed in Escherichia coli indicated that there is a leader sequence. The N-terminal region contained a very high percentage (29%)more » of threonine and serine, including a cluster of nine consecutive serine or threonine residues, whereas a stretch near the C terminus was extremely rich in aromatic amino acids (29%). The protein contained at least two disulfide bridges, as well as tightly bound reducing sugars and fatty acids.« less
Artificial mismatch hybridization

DOEpatents

Guo, Zhen; Smith, Lloyd M.

1998-01-01

An improved nucleic acid hybridization process is provided which employs a modified oligonucleotide and improves the ability to discriminate a control nucleic acid target from a variant nucleic acid target containing a sequence variation. The modified probe contains at least one artificial mismatch relative to the control nucleic acid target in addition to any mismatch(es) arising from the sequence variation. The invention has direct and advantageous application to numerous existing hybridization methods, including, applications that employ, for example, the Polymerase Chain Reaction, allele-specific nucleic acid sequencing methods, and diagnostic hybridization methods.
Immunoglobulin from Antarctic fish species of Rajidae family.

PubMed

Coscia, Maria Rosaria; Cocca, Ennio; Giacomelli, Stefano; Cuccaro, Fausta; Oreste, Umberto

2012-03-01

Immunoglobulins (Ig) of Chondroichthyes have been extensively studied in sharks; in contrast, in skates investigations on Ig remain scarce and fragmentary despite the high occurrence of skates in all of the major oceans of the world. To focus on Rajidae Igμ, the most abundant heavy chain isotype, we have chosen the Antarctic species Bathyraja eatonii, Bathyraja albomaculata, Bathyraja brachyurops, and Amblyraja georgiana which live at high latitudes in the Southern Ocean, and at very low temperatures. We prepared mRNA from the spleen of individuals of each species and performed RT-PCR experiments using two oligonucleotides designed on the alignment of various elasmobranch Igμ heavy chain sequences available in GenBank. The PCR products, about 1400-nt long, were cloned and sequenced. Nucleotide sequence identities calculated for the constant region domains ranged from 88.5% to 97.5% between species, and from 91.1% to 99.7% within species. In a distance tree, including also Raja erinacea sequences, two major branches were obtained, one containing Arhynchobatinae sequences, the other one Rajinae sequences. Four presumptive D gene segments were identified in the region of the VH/D/JH recombination; two different D segments were often found in the same sequence. Moreover, 5-15 genomic fragments of different lengths, carrying the gene locus encoding Igμ chain were revealed by Southern blotting analysis. B. eatonii amino acid sequences were analyzed for the positional diversity by Shannon entropy analysis, showing CH4 as the most conserved domain, and CH3 as the most variable one. B. eatonii CDR3 region length varied between 11 and 15 amino acid residues; the mean length (13.4 aa) was greater than that of Leucoraja eglanteria sequences (7.7 aa). An alignment of representative sequences of Antarctic species and R. erinacea showed that more cysteine residues not involved in the intradomain disulfide bridges were present in Antarctic species. Copyright Â© 2011 Elsevier B.V. All rights reserved.
Detection and isolation of nucleic acid sequences using a bifunctional hybridization probe

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

2000-01-01

A method for detecting and isolating a target sequence in a sample of nucleic acids is provided using a bifunctional hybridization probe capable of hybridizing to the target sequence that includes a detectable marker and a first complexing agent capable of forming a binding pair with a second complexing agent. A kit is also provided for detecting a target sequence in a sample of nucleic acids using a bifunctional hybridization probe according to this method.
Microsatellite analysis in the genome of Acanthaceae: An in silico approach.

PubMed

Kaliswamy, Priyadharsini; Vellingiri, Srividhya; Nathan, Bharathi; Selvaraj, Saravanakumar

2015-01-01

Acanthaceae is one of the advanced and specialized families with conventionally used medicinal plants. Simple sequence repeats (SSRs) play a major role as molecular markers for genome analysis and plant breeding. The microsatellites existing in the complete genome sequences would help to attain a direct role in the genome organization, recombination, gene regulation, quantitative genetic variation, and evolution of genes. The current study reports the frequency of microsatellites and appropriate markers for the Acanthaceae family genome sequences. The whole nucleotide sequences of Acanthaceae species were obtained from National Center for Biotechnology Information database and screened for the presence of SSRs. SSR Locator tool was used to predict the microsatellites and inbuilt Primer3 module was used for primer designing. Totally 110 repeats from 108 sequences of Acanthaceae family plant genomes were identified, and the occurrence of dinucleotide repeats was found to be abundant in the genome sequences. The essential amino acid isoleucine was found rich in all the sequences. We also designed the SSR-based primers/markers for 59 sequences of this family that contains microsatellite repeats in their genome. The identified microsatellites and primers might be useful for breeding and genetic studies of plants that belong to Acanthaceae family in the future.
Mining for class-specific motifs in protein sequence classification

PubMed Central

2013-01-01

Background In protein sequence classification, identification of the sequence motifs or n-grams that can precisely discriminate between classes is a more interesting scientific question than the classification itself. A number of classification methods aim at accurate classification but fail to explain which sequence features indeed contribute to the accuracy. We hypothesize that sequences in lower denominations (n-grams) can be used to explore the sequence landscape and to identify class-specific motifs that discriminate between classes during classification. Discriminative n-grams are short peptide sequences that are highly frequent in one class but are either minimally present or absent in other classes. In this study, we present a new substitution-based scoring function for identifying discriminative n-grams that are highly specific to a class. Results We present a scoring function based on discriminative n-grams that can effectively discriminate between classes. The scoring function, initially, harvests the entire set of 4- to 8-grams from the protein sequences of different classes in the dataset. Similar n-grams of the same size are combined to form new n-grams, where the similarity is defined by positive amino acid substitution scores in the BLOSUM62 matrix. Substitution has resulted in a large increase in the number of discriminatory n-grams harvested. Due to the unbalanced nature of the dataset, the frequencies of the n-grams are normalized using a dampening factor, which gives more weightage to the n-grams that appear in fewer classes and vice-versa. After the n-grams are normalized, the scoring function identifies discriminative 4- to 8-grams for each class that are frequent enough to be above a selection threshold. By mapping these discriminative n-grams back to the protein sequences, we obtained contiguous n-grams that represent short class-specific motifs in protein sequences. Our method fared well compared to an existing motif finding method known as Wordspy. We have validated our enriched set of class-specific motifs against the functionally important motifs obtained from the NLSdb, Prosite and ELM databases. We demonstrate that this method is very generic; thus can be widely applied to detect class-specific motifs in many protein sequence classification tasks. Conclusion The proposed scoring function and methodology is able to identify class-specific motifs using discriminative n-grams derived from the protein sequences. The implementation of amino acid substitution scores for similarity detection, and the dampening factor to normalize the unbalanced datasets have significant effect on the performance of the scoring function. Our multipronged validation tests demonstrate that this method can detect class-specific motifs from a wide variety of protein sequence classes with a potential application to detecting proteome-specific motifs of different organisms. PMID:23496846
The genome sequence of Geobacter metallireducens: features of metabolism, physiology and regulation common and dissimilar to Geobacter sulfurreducens

DOE Office of Scientific and Technical Information (OSTI.GOV)

Aklujkar, Muktak; Krushkal, Julia; DiBartolo, Genevieve

Background. The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results. The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recentlymore » in its evolutionary history. Vestiges of the molybdate (ModE) regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion. The genomic evidence suggests that metabolism, physiology Background. The genome sequence of Geobacter metallireducens is the second to be completed from the metal-respiring genus Geobacter, and is compared in this report to that of Geobacter sulfurreducens in order to understand their metabolic, physiological and regulatory similarities and differences. Results. The experimentally observed greater metabolic versatility of G. metallireducens versus G. sulfurreducens is borne out by the presence of more numerous genes for metabolism of organic acids including acetate, propionate, and pyruvate. Although G. metallireducens lacks a dicarboxylic acid transporter, it has acquired a second succinate dehydrogenase/fumarate reductase complex, suggesting that respiration of fumarate was important until recently in its evolutionary history. Vestiges of the molybdate (ModE) regulon of G. sulfurreducens can be detected in G. metallireducens, which has lost the global regulatory protein ModE but retained some putative ModE-binding sites and multiplied certain genes of molybdenum cofactor biosynthesis. Several enzymes of amino acid metabolism are of different origin in the two species, but significant patterns of gene organization are conserved. Whereas most Geobacteraceae are predicted to obtain biosynthetic reducing equivalents from electron transfer pathways via a ferredoxin oxidoreductase, G. metallireducens can derive them from the oxidative pentose phosphate pathway. In addition to the evidence of greater metabolic versatility, the G. metallireducens genome is also remarkable for the abundance of multicopy nucleotide sequences found in intergenic regions and even within genes. Conclusion. The genomic evidence suggests that metabolism, physiology and regulation of gene expression in G. metallireducens may be dramatically different from other Geobacteraceae.« less
Molecular cloning and characterization of a cDNA encoding the gibberellin biosynthetic enzyme ent-kaurene synthase B from pumpkin (Cucurbita maxima L.).

PubMed

Yamaguchi, S; Saito, T; Abe, H; Yamane, H; Murofushi, N; Kamiya, Y

1996-08-01

The first committed step in the formation of diterpenoids leading to gibberellin (GA) biosynthesis is the conversion of geranylgeranyl diphosphate (GGDP) to ent-kaurene. ent-Kaurene synthase A (KSA) catalyzes the conversion of GGDP to copalyl diphosphate (CDP), which is subsequently converted to ent-kaurene by ent-kaurene synthase B (KSB). A full-length KSB cDNA was isolated from developing cotyledons in immature seeds of pumpkin (Cucurbita maxima L.). Degenerate oligonucleotide primers were designed from the amino acid sequences obtained from the purified protein to amplify a cDNA fragment, which was used for library screening. The isolated full-length cDNA was expressed in Escherichia coli as a fusion protein, which demonstrated the KSB activity to cyclize [3H]CDP to [3H]ent-kaurene. The KSB transcript was most abundant in growing tissues, but was detected in every organ in pumpkin seedlings. The deduced amino acid sequence shares significant homology with other terpene cyclases, including the conserved DDXXD motif, a putative divalent metal ion-diphosphate complex binding site. A putative transit peptide sequence that may target the translated product into the plastids is present in the N-terminal region.

Building toy models of proteins using coevolutionary information

NASA Astrophysics Data System (ADS)

Cheng, Ryan; Raghunathan, Mohit; Onuchic, Jose

2015-03-01

Recent developments in global statistical methodologies have advanced the analysis of large collections of protein sequences for coevolutionary information. Coevolution between amino acids in a protein arises from compensatory mutations that are needed to maintain the stability or function of a protein over the course of evolution. This gives rise to quantifiable correlations between amino acid positions within the multiple sequence alignment of a protein family. Here, we use Direct Coupling Analysis (DCA) to infer a Potts model Hamiltonian governing the correlated mutations in a protein family to obtain the sequence-dependent interaction energies of a toy protein model. We demonstrate that this methodology predicts residue-residue interaction energies that are consistent with experimental mutational changes in protein stabilities as well as other computational methodologies. Furthermore, we demonstrate with several examples that DCA could be used to construct a structure-based model that quantitatively agrees with experimental data on folding mechanisms. This work serves as a potential framework for generating models of proteins that are enriched by evolutionary data that can potentially be used to engineer key functional motions and interactions in protein systems. This research has been supported by the NSF INSPIRE award MCB-1241332 and by the CTBP sponsored by the NSF (Grant PHY-1427654).
Characterization and Screening of Native Scenedesmus sp. Isolates Suitable for Biofuel Feedstock

PubMed Central

Gour, Rakesh Singh; Chawla, Aseem; Singh, Harvinder; Chauhan, Rajinder Singh; Kant, Anil

2016-01-01

In current study isolates of two native microalgae species were screened on the basis of growth kinetics and lipid accumulation potential. On the basis of data obtained on growth parameters and lipid accumulation, it is concluded that Scenedesmus dimorphus has better potential as biofuel feedstock. Two of the isolates of Scenedesmus dimorphus performed better than other isolates with respect to important growth parameters with lipid content of ~30% of dry biomass. Scenedesmus dimorphus was found to be more suitable as biodiesel feedstock candidate on the basis of cumulative occurrence of five important biodiesel fatty acids, relative occurrence of SFA (53.04%), MUFA (23.81%) and PUFA (19.69%), and more importantly that of oleic acid in its total lipids. The morphological observations using light and Scanning Electron Microscope and molecular characterization using amplified 18S rRNA gene sequences of microalgae species under study were also performed. Amplified 18S rRNA gene fragments of the microalgae species were sequenced, annotated at the NCBI website and phylogenetic analysis was done. We have published eight 18S rRNA gene sequences of microalgae species in NCBI GenBank. PMID:27195694
Phenotype-information-phenotype cycle for deconvolution of combinatorial antibody libraries selected against complex systems.

PubMed

Zhang, Hongkai; Torkamani, Ali; Jones, Teresa M; Ruiz, Diana I; Pons, Jaume; Lerner, Richard A

2011-08-16

Use of large combinatorial antibody libraries and next-generation sequencing of nucleic acids are two of the most powerful methods in modern molecular biology. The libraries are screened using the principles of evolutionary selection, albeit in real time, to enrich for members with a particular phenotype. This selective process necessarily results in the loss of information about less-fit molecules. On the other hand, sequencing of the library, by itself, gives information that is mostly unrelated to phenotype. If the two methods could be combined, the full potential of very large molecular libraries could be realized. Here we report the implementation of a phenotype-information-phenotype cycle that integrates information and gene recovery. After selection for phage-encoded antibodies that bind to targets expressed on the surface of Escherichia coli, the information content of the selected pool is obtained by pyrosequencing. Sequences that encode specific antibodies are identified by a bioinformatic analysis and recovered by a stringent affinity method that is uniquely suited for gene isolation from a highly degenerate collection of nucleic acids. This approach can be generalized for selection of antibodies against targets that are present as minor components of complex systems.
Nucleic and amino acid sequences relating to a novel transketolase, and methods for the expression thereof

DOEpatents

Croteau, Rodney Bruce; Wildung, Mark Raymond; Lange, Bernd Markus; McCaskill, David G.

2001-01-01

cDNAs encoding 1-deoxyxylulose-5-phosphate synthase from peppermint (Mentha piperita) have been isolated and sequenced, and the corresponding amino acid sequences have been determined. Accordingly, isolated DNA sequences (SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7) are provided which code for the expression of 1-deoxyxylulose-5-phosphate synthase from plants. In another aspect the present invention provides for isolated, recombinant DXPS proteins, such as the proteins having the sequences set forth in SEQ ID NO:4, SEQ ID NO:6 and SEQ ID NO:8. In other aspects, replicable recombinant cloning vehicles are provided which code for plant 1-deoxyxylulose-5-phosphate synthases, or for a base sequence sufficiently complementary to at least a portion of 1-deoxyxylulose-5-phosphate synthase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding a plant 1-deoxyxylulose-5-phosphate synthase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant 1-deoxyxylulose-5-phosphate synthase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant 1-deoxyxylulose-5-phosphate synthase may be used to obtain expression or enhanced expression of 1-deoxyxylulose-5-phosphate synthase in plants in order to enhance the production of 1-deoxyxylulose-5-phosphate, or its derivatives such as isopentenyl diphosphate (BP), or may be otherwise employed for the regulation or expression of 1-deoxyxylulose-5-phosphate synthase, or the production of its products.
PepLine: a software pipeline for high-throughput direct mapping of tandem mass spectrometry data on genomic sequences.

PubMed

Ferro, Myriam; Tardif, Marianne; Reguer, Erwan; Cahuzac, Romain; Bruley, Christophe; Vermat, Thierry; Nugues, Estelle; Vigouroux, Marielle; Vandenbrouck, Yves; Garin, Jérôme; Viari, Alain

2008-05-01

PepLine is a fully automated software which maps MS/MS fragmentation spectra of trypsic peptides to genomic DNA sequences. The approach is based on Peptide Sequence Tags (PSTs) obtained from partial interpretation of QTOF MS/MS spectra (first module). PSTs are then mapped on the six-frame translations of genomic sequences (second module) giving hits. Hits are then clustered to detect potential coding regions (third module). Our work aimed at optimizing the algorithms of each component to allow the whole pipeline to proceed in a fully automated manner using raw nucleic acid sequences (i.e., genomes that have not been "reduced" to a database of ORFs or putative exons sequences). The whole pipeline was tested on controlled MS/MS spectra sets from standard proteins and from Arabidopsis thaliana envelope chloroplast samples. Our results demonstrate that PepLine competed with protein database searching softwares and was fast enough to potentially tackle large data sets and/or high size genomes. We also illustrate the potential of this approach for the detection of the intron/exon structure of genes.
Optical resolution of phenylthiohydantoin-amino acids by capillary electrophoresis and identification of the phenylthiohydantoin-D-amino acid residue of [D-Ala2]-methionine enkephalin.

PubMed

Kurosu, Y; Murayama, K; Shindo, N; Shisa, Y; Ishioka, N

1996-11-01

This is an initial report to propose a protein sequence analysis system with DL differentiation using capillary electrophoresis (CE). This system consists of a protein sequencer and a CE system. After fractionation of phenyl-thiohydantoin (PTH)-amino acids using a protein sequencer, optical resolution for each PTH-amino acid is performed by CE using some chiral selectors such as digitonin, beta-escin and others. As a model peptide, [D-Ala2]-methionine enkephalin (L-Tyr-D-Ala-Gly-L-Phe-L-Met), was used and the sequence with DL differentiation was determined, with the exception of the fourth amino acid, L-Phe, using our proposed system.
Adaptive molecular evolution of the two-pore channel 1 gene TPC1 in the karst-adapted genus Primulina (Gesneriaceae)

PubMed Central

Tao, Junjie; Feng, Chao; Ai, Bin; Kang, Ming

2016-01-01

Background and Aims Limestone karst areas possess high floral diversity and endemism. The genus Primulina, which contributes to the unique calcicole flora, has high species richness and exhibit specific soil-based habitat associations that are mainly distributed on calcareous karst soils. The adaptive molecular evolutionary mechanism of the genus to karst calcium-rich environments is still not well understood. The Ca2+-permeable channel TPC1 was used in this study to test whether its gene is involved in the local adaptation of Primulina to karst high-calcium soil environments. Methods Specific amplification and sequencing primers were designed and used to amplify the full-length coding sequences of TPC1 from cDNA of 76 Primulina species. The sequence alignment without recombination and the corresponding reconstructed phylogeny tree were used in molecular evolutionary analyses at the nucleic acid level and amino acid level, respectively. Finally, the identified sites under positive selection were labelled on the predicted secondary structure of TPC1. Key Results Seventy-six full-length coding sequences of Primulina TPC1 were obtained. The length of the sequences varied between 2220 and 2286 bp and the insertion/deletion was located at the 5′ end of the sequences. No signal of substitution saturation was detected in the sequences, while significant recombination breakpoints were detected. The molecular evolutionary analyses showed that TPC1 was dominated by purifying selection and the selective pressures were not significantly different among species lineages. However, significant signals of positive selection were detected at both TPC1 codon level and amino acid level, and five sites under positive selective pressure were identified by at least three different methods. Conclusions The Ca2+-permeable channel TPC1 may be involved in the local adaptation of Primulina to karst Ca2+-rich environments. Different species lineages suffered similar selective pressure associated with calcium in karst environments, and episodic diversifying selection at a few sites may play a major role in the molecular evolution of Primulina TPC1. PMID:27582362
Cryptic Hepatitis B and E in Patients With Acute Hepatitis of Unknown Etiology.

PubMed

Ganova-Raeva, Lilia; Punkova, Lili; Campo, David S; Dimitrova, Zoya; Skums, Pavel; Vu, Nga H; Dat, Do T; Dalton, Harry R; Khudyakov, Yury

2015-12-15

Up to 30% of acute viral hepatitis has no known etiology. To determine the disease etiology in patients with acute hepatitis of unknown etiology (HUE), serum specimens were obtained from 38 patients residing in the United Kingdom and Vietnam and from 26 healthy US blood donors. All specimens tested negative for known viral infections causing hepatitis, using commercially available serological and nucleic acid assays. Specimens were processed by sequence-independent complementary DNA amplification and next-generation sequencing (NGS). Sufficient material for individual NGS libraries was obtained from 12 HUE cases and 26 blood donors; the remaining HUE cases were sequenced as a pool. Read mapping was done by targeted and de novo assembly. Sequences from hepatitis B virus (HBV) were detected in 7 individuals with HUE (58.3%) and the pooled library, and hepatitis E virus (HEV) was detected in 2 individuals with HUE (16.7%) and the pooled library. Both HEV-positive cases were coinfected with HBV. HBV sequences belonged to genotypes A, D, or G, and HEV sequences belonged to genotype 3. No known hepatotropic viruses were detected in the tested normal human sera. NGS-based detection of HBV and HEV infections is more sensitive than using commercially available assays. HBV and HEV may be cryptically associated with HUE. Published by Oxford University Press on behalf of the Infectious Diseases Society of America 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Production of antioxidant and ACE-inhibitory peptides from Kluyveromyces marxianus protein hydrolysates: Purification and molecular docking.

PubMed

Mirzaei, Mahta; Mirdamadi, Saeed; Ehsani, Mohamad Reza; Aminlari, Mahmoud

2018-04-01

Kluyveromyces marxianus protein hydrolysates were prepared by two different sonicated-enzymatic (trypsin and chymotrypsin) hydrolysis treatments to obtain antioxidant and ACE-inhibitory peptides. Trypsin and chymotrypsin hydrolysates obtained by 5 h, exhibited the highest antioxidant and ACE-inhibitory activities. After fractionation using ultrafiltration and reverse phase high performance liquid chromatography (RP-HPLC) techniques, two new peptides were identified. One fragment (LL-9, MW = 1180 Da) with the amino acid sequence of Leu-Pro-Glu-Ser-Val-His-Leu-Asp-Lys showed significant ACE inhibitory activity (IC 50 = 22.88 μM) while another peptide fragment (VL-9, MW = 1118 Da) with the amino acid sequence of Val-Leu-Ser-Thr-Ser-Phe-Pro-Pro-Lys showed the highest antioxidant and ACE inhibitory properties (IC 50 = 15.20 μM, 5568 μM TE/mg protein). The molecular docking studies revealed that the ACE inhibitory activities of VL-9 is due to interaction with the S2 (His513, His353, Glu281) and S'1 (Glu162) pockets of ACE and LL-9 can fit perfectly into the S1 (Thr345) and S2 (Tyr520, Lys511, Gln281) pockets of ACE. Copyright © 2017. Published by Elsevier B.V.
C terminal retroviral-type zinc finger domain from the HIV-1 nucleocapsid protein is structurally similar to the N-terminal zinc finger domain

DOE Office of Scientific and Technical Information (OSTI.GOV)

South, T.L.; Blake, P.R.; Hare, D.R.

Two-dimensional NMR spectroscopic and computational methods were employed for the structure determination of an 18-residue peptide with the amino acid sequence of the C-terminal retriviral-type (r.t.) zinc finger domain from the nucleocapsid protein (NCP) of HIV-1 (Zn(HIV1-F2)). Unlike results obtained for the first retroviral-type zinc finger peptide, Zn (HIV1-F1) broad signals indicative of confomational lability were observed in the {sup 1}H NMR spectrum of An(HIV1-F2) at 25 C. The NMR signals narrowed upon cooling to {minus}2 C, enabling complete {sup 1}H NMR signal assignment via standard two-dimensional (2D) NMR methods. Distance restraints obtained from qualitative analysis of 2D nuclear Overhausermore » effect (NOESY) data were sued to generate 30 distance geometry (DG) structures with penalties in the range 0.02-0.03 {angstrom}{sup 2}. All structures were qualitatively consistent with the experimental NOESY spectrum based on comparisons with 2D NOESY back-calculated spectra. These results indicate that the r.t. zinc finger sequences observed in retroviral NCPs, simple plant virus coat proteins, and in a human single-stranded nucleic acid binding protein share a common structural motif.« less
Knocking out the MFE-2 gene of Candida bombicola leads to improved medium-chain sophorolipid production.

PubMed

Van Bogaert, Inge N A; Sabirova, Julia; Develter, Dirk; Soetaert, Wim; Vandamme, Erick J

2009-06-01

The nonpathogenic yeast Candida bombicola synthesizes sophorolipids. These biosurfactants are composed of the disaccharide sophorose linked to a long-chain hydroxy fatty acid and have potential applications in the food, pharmaceutical, cosmetic and cleaning industries. In order to expand the range of application, a shift of the fatty acid moiety towards medium-chain lengths would be recommendable. However, the synthesis of medium-chain sophorolipids by C. bombicola is a challenging objective. First of all, these sophorolipids can only be obtained by fermentations on unconventional carbon sources, which often have a toxic effect on the cells. Furthermore, medium-chain substrates are partially metabolized in the beta-oxidation pathway. In order to redirect unconventional substrates towards sophorolipid synthesis, the beta-oxidation pathway was blocked on the genome level by knocking out the multifunctional enzyme type 2 (MFE-2) gene. The total gene sequence of the C. bombicola MFE-2 (6033 bp) was cloned (GenBank accession number EU371724), and the obtained nucleotide sequence was used to construct a knock-out cassette. Several knock-out mutants with the correct geno- and phenotype were evaluated in a fermentation on 1-dodecanol. All mutants showed a 1.7-2.9 times higher production of sophorolipids, indicating that in those strains the substrate is redirected towards the sophorolipid synthesis.
A Peptide-Based Method for 13C Metabolic Flux Analysis in Microbial Communities

PubMed Central

Ghosh, Amit; Nilmeier, Jerome; Weaver, Daniel; Adams, Paul D.; Keasling, Jay D.; Mukhopadhyay, Aindrila; Petzold, Christopher J.; Martín, Héctor García

2014-01-01

The study of intracellular metabolic fluxes and inter-species metabolite exchange for microbial communities is of crucial importance to understand and predict their behaviour. The most authoritative method of measuring intracellular fluxes, 13C Metabolic Flux Analysis (13C MFA), uses the labeling pattern obtained from metabolites (typically amino acids) during 13C labeling experiments to derive intracellular fluxes. However, these metabolite labeling patterns cannot easily be obtained for each of the members of the community. Here we propose a new type of 13C MFA that infers fluxes based on peptide labeling, instead of amino acid labeling. The advantage of this method resides in the fact that the peptide sequence can be used to identify the microbial species it originates from and, simultaneously, the peptide labeling can be used to infer intracellular metabolic fluxes. Peptide identity and labeling patterns can be obtained in a high-throughput manner from modern proteomics techniques. We show that, using this method, it is theoretically possible to recover intracellular metabolic fluxes in the same way as through the standard amino acid based 13C MFA, and quantify the amount of information lost as a consequence of using peptides instead of amino acids. We show that by using a relatively small number of peptides we can counter this information loss. We computationally tested this method with a well-characterized simple microbial community consisting of two species. PMID:25188426
HBC-Evo: predicting human breast cancer by exploiting amino acid sequence-based feature spaces and evolutionary ensemble system.

PubMed

Majid, Abdul; Ali, Safdar

2015-01-01

We developed genetic programming (GP)-based evolutionary ensemble system for the early diagnosis, prognosis and prediction of human breast cancer. This system has effectively exploited the diversity in feature and decision spaces. First, individual learners are trained in different feature spaces using physicochemical properties of protein amino acids. Their predictions are then stacked to develop the best solution during GP evolution process. Finally, results for HBC-Evo system are obtained with optimal threshold, which is computed using particle swarm optimization. Our novel approach has demonstrated promising results compared to state of the art approaches.
CAPRRESI: Chimera Assembly by Plasmid Recovery and Restriction Enzyme Site Insertion.

PubMed

Santillán, Orlando; Ramírez-Romero, Miguel A; Dávila, Guillermo

2017-06-25

Here, we present chimera assembly by plasmid recovery and restriction enzyme site insertion (CAPRRESI). CAPRRESI benefits from many strengths of the original plasmid recovery method and introduces restriction enzyme digestion to ease DNA ligation reactions (required for chimera assembly). For this protocol, users clone wildtype genes into the same plasmid (pUC18 or pUC19). After the in silico selection of amino acid sequence regions where chimeras should be assembled, users obtain all the synonym DNA sequences that encode them. Ad hoc Perl scripts enable users to determine all synonym DNA sequences. After this step, another Perl script searches for restriction enzyme sites on all synonym DNA sequences. This in silico analysis is also performed using the ampicillin resistance gene (ampR) found on pUC18/19 plasmids. Users design oligonucleotides inside synonym regions to disrupt wildtype and ampR genes by PCR. After obtaining and purifying complementary DNA fragments, restriction enzyme digestion is accomplished. Chimera assembly is achieved by ligating appropriate complementary DNA fragments. pUC18/19 vectors are selected for CAPRRESI because they offer technical advantages, such as small size (2,686 base pairs), high copy number, advantageous sequencing reaction features, and commercial availability. The usage of restriction enzymes for chimera assembly eliminates the need for DNA polymerases yielding blunt-ended products. CAPRRESI is a fast and low-cost method for fusing protein-coding genes.
From cultured to uncultured genome sequences: metagenomics and modeling microbial ecosystems.

PubMed

Garza, Daniel R; Dutilh, Bas E

2015-11-01

Microorganisms and the viruses that infect them are the most numerous biological entities on Earth and enclose its greatest biodiversity and genetic reservoir. With strength in their numbers, these microscopic organisms are major players in the cycles of energy and matter that sustain all life. Scientists have only scratched the surface of this vast microbial world through culture-dependent methods. Recent developments in generating metagenomes, large random samples of nucleic acid sequences isolated directly from the environment, are providing comprehensive portraits of the composition, structure, and functioning of microbial communities. Moreover, advances in metagenomic analysis have created the possibility of obtaining complete or nearly complete genome sequences from uncultured microorganisms, providing important means to study their biology, ecology, and evolution. Here we review some of the recent developments in the field of metagenomics, focusing on the discovery of genetic novelty and on methods for obtaining uncultured genome sequences, including through the recycling of previously published datasets. Moreover we discuss how metagenomics has become a core scientific tool to characterize eco-evolutionary patterns of microbial ecosystems, thus allowing us to simultaneously discover new microbes and study their natural communities. We conclude by discussing general guidelines and challenges for modeling the interactions between uncultured microorganisms and viruses based on the information contained in their genome sequences. These models will significantly advance our understanding of the functioning of microbial ecosystems and the roles of microbes in the environment.
Identification and Analysis of Novel Amino-Acid Sequence Repeats in Bacillus anthracis str. Ames Proteome Using Computational Tools

PubMed Central

Hemalatha, G. R.; Rao, D. Satyanarayana; Guruprasad, L.

2007-01-01

We have identified four repeats and ten domains that are novel in proteins encoded by the Bacillus anthracis str. Ames proteome using automated in silico methods. A “repeat” corresponds to a region comprising less than 55-amino-acid residues that occur more than once in the protein sequence and sometimes present in tandem. A “domain” corresponds to a conserved region with greater than 55-amino-acid residues and may be present as single or multiple copies in the protein sequence. These correspond to (1) 57-amino-acid-residue PxV domain, (2) 122-amino-acid-residue FxF domain, (3) 111-amino-acid-residue YEFF domain, (4) 109-amino-acid-residue IMxxH domain, (5) 103-amino-acid-residue VxxT domain, (6) 84-amino-acid-residue ExW domain, (7) 104-amino-acid-residue NTGFIG domain, (8) 36-amino-acid-residue NxGK repeat, (9) 95-amino-acid-residue VYV domain, (10) 75-amino-acid-residue KEWE domain, (11) 59-amino-acid-residue AFL domain, (12) 53-amino-acid-residue RIDVK repeat, (13) (a) 41-amino-acid-residue AGQF repeat and (b) 42-amino-acid-residue GSAL repeat. A repeat or domain type is characterized by specific conserved sequence motifs. We discuss the presence of these repeats and domains in proteins from other genomes and their probable secondary structure. PMID:17538688
Comparative genome analysis of the candidate functional starter culture strains Lactobacillus fermentum 222 and Lactobacillus plantarum 80 for controlled cocoa bean fermentation processes.

PubMed

Illeghems, Koen; De Vuyst, Luc; Weckx, Stefan

2015-10-12

Lactobacillus fermentum 222 and Lactobacillus plantarum 80, isolates from a spontaneous Ghanaian cocoa bean fermentation process, proved to be interesting functional starter culture strains for cocoa bean fermentations. Lactobacillus fermentum 222 is a thermotolerant strain, able to dominate the fermentation process, thereby converting citrate and producing mannitol. Lactobacillus plantarum 80 is an acid-tolerant and facultative heterofermentative strain that is competitive during cocoa bean fermentation processes. In this study, whole-genome sequencing and comparative genome analysis was used to investigate the mechanisms of these strains to dominate the cocoa bean fermentation process. Through functional annotation and analysis of the high-coverage contigs obtained through 454 pyrosequencing, plantaricin production was predicted for L. plantarum 80. For L. fermentum 222, genes encoding a complete arginine deiminase pathway were attributed. Further, in-depth functional analysis revealed the capacities of these strains associated with carbohydrate and amino acid metabolism, such as the ability to use alternative external electron acceptors, the presence of an extended pyruvate metabolism, and the occurrence of several amino acid conversion pathways. A comparative genome sequence analysis using publicly available genome sequences of strains of the species L. plantarum and L. fermentum revealed unique features of both strains studied. Indeed, L. fermentum 222 possessed genes encoding additional citrate transporters and enzymes involved in amino acid conversions, whereas L. plantarum 80 is the only member of this species that harboured a gene cluster involved in uptake and consumption of fructose and/or sorbose. In-depth genome sequence analysis of the candidate functional starter culture strains L. fermentum 222 and L. plantarum 80 revealed their metabolic capacities, niche adaptations and functionalities that enable them to dominate the cocoa bean fermentation process. Further, these results offered insights into the cocoa bean fermentation ecosystem as a whole and will facilitate the selection of appropriate starter culture strains for controlled cocoa bean fermentation processes.
Genetic Diversity of Hepatitis A Virus in China: VP3-VP1-2A Genes and Evidence of Quasispecies Distribution in the Isolates

PubMed Central

Cao, Jingyuan; Zhou, Wenting; Yi, Yao; Jia, Zhiyuan; Bi, Shengli

2013-01-01

Hepatitis A virus (HAV) is the most common cause of infectious hepatitis throughout the world, spread largely by the fecal-oral route. To characterize the genetic diversity of the virus circulating in China where HAV in endemic, we selected the outbreak cases with identical sequences in VP1-2A junction region and compiled a panel of 42 isolates. The VP3-VP1-2A regions of the HAV capsid-coding genes were further sequenced and analyzed. The quasispecies distribution was evaluated by cloning the VP3 and VP1-2A genes in three clinical samples. Phylogenetic analysis demonstrated that the same genotyping results could be obtained whether using the complete VP3, VP1, or partial VP1-2A genes for analysis in this study, although some differences did exist. Most isolates clustered in sub-genotype IA, and fewer in sub-genotype IB. No amino acid mutations were found at the published neutralizing epitope sites, however, several unique amino acid substitutions in the VP3 or VP1 region were identified, with two amino acid variants closely located to the immunodominant site. Quasispecies analysis showed the mutation frequencies were in the range of 7.22x10-4 -2.33x10-3 substitutions per nucleotide for VP3, VP1, or VP1-2A. When compared with the consensus sequences, mutated nucleotide sites represented the minority of all the analyzed sequences sites. HAV replicated as a complex distribution of closely genetically related variants referred to as quasispecies, and were under negative selection. The results indicate that diverse HAV strains and quasispecies inside the viral populations are presented in China, with unique amino acid substitutions detected close to the immunodominant site, and that the possibility of antigenic escaping mutants cannot be ruled out and needs to be further analyzed. PMID:24069343
37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Form and format for... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... Code for Information Interchange (ASCII) text. No other formats shall be allowed. (3) The computer...
Novel beta-lactamase genes from two environmental isolates of Vibrio harveyi.

PubMed

Teo, J W; Suwanto, A; Poh, C L

2000-05-01

Two ampicillin-resistant (Amp(r)) isolates of Vibrio harveyi, W3B and HB3, were obtained from the coastal waters of the Indonesian island of Java. Strain W3B was isolated from marine water near a shrimp farm in North Java while HB3 was from pristine seawater in South Java. In this study, novel beta-lactamase genes from W3B (bla(VHW-1)) and HB3 (bla(VHH-1)) were cloned and their nucleotide sequences were determined. An open reading frame (ORF) of 870 bp encoding a deduced protein of 290 amino acids (VHW-1) was revealed for the bla gene of strain W3B while an ORF of 849 bp encoding a 283-amino-acid protein (VHH-1) was deduced for bla(VHH-1). At the DNA level, genes for VHW-1 and VHH-1 have a 97% homology, while at the protein level they have a 91% homology of amino acid sequences. Neither gene sequence showed homology to any other beta-lactamases in the databases. The deduced proteins were found to be class A beta-lactamases bearing low levels of homology (<50%) to other beta-lactamases of the same class. The highest level of identity was obtained with beta-lactamases from Pseudomonas aeruginosa, i.e., PSE-1, PSE-4, and CARB-3, and Vibrio cholerae CARB-6. Our study showed that both strains W3B and HB3 possess an endogenous plasmid of approximately 60 kb in size. However, Southern hybridization analysis employing bla(VHW-1) as a gene probe demonstrated that the bla gene was not located in the plasmid. A total of nine ampicillin-resistant V. harveyi strains, including W3B and HB3, were examined by pulsed-field gel electrophoresis of NotI-digested genomic DNA. Despite a high level of intrastrain genetic diversity, the bla(VHW-1) probe hybridized only to an 80- or 160-kb NotI genomic fragment in different isolates.

Cloning, in Vitro expression, and novel phylogenetic classification of a channel catfish estrogen receptor

USGS Publications Warehouse

Xia, Z.; Patino, R.; Gale, W.L.; Maule, A.G.; Densmore, L.D.

1999-01-01

We obtained two channel catfish estrogen receptor (ccER) cDNA from liver of female fish using RT–PCR. The two fragments were identical in sequence except that the smaller one had an out-of-frame deletion in the E domain, suggesting the existence of ccER splice variants. The larger fragment was used to screen a cDNA library from liver of a prepubescent female. A cDNA was obtained that encoded a 581-amino-acid ER with a deduced molecular weight of 63.8 kDa. Extracts of COS-7 cells transfected with ccER cDNA bound estrogen with high affinity (Kd = 4.7 nM) and specificity. Maximum parsimony and Neighbor Joining analyses were used to generate a phylogenetic classification of ccER on the basis of 18 full-length ER sequences. The tree suggested the existence of two major ER branches. One branch contained two clearly divergent clades which included all piscine ER (except Japanese eel ER) and all tetrapod ERα, respectively. The second major branch contained the eel ER and the mammalian ERβ. The high degree of divergence between the eel ER and mammalian ERβ suggested that they also represent distinct piscine and tetrapod ER. These data suggest that ERα and ERβ are present throughout vertebrates and that these two major ER types evolved by duplication of an ancestral ER gene. Sequence alignments with other members of the nuclear hormone receptor superfamily indicated the presence of 8 amino acids in the E domain that align exclusively among ER. Four of these amino acids have not received prior research attention and their function is unknown. The novel finding of putative ER splice variants in a nonmammalian vertebrate and the novel phylogenetic classification of ER offer new perspectives in understanding the diversification and function of ER.
Question 3: The Worlds of the Prebiotic and Never Born Proteins

NASA Astrophysics Data System (ADS)

Chiarabelli, Cristiano; de Lucrezia, Davide

2007-10-01

Starting from the statement that no reliable methods are known to produce high molecular weight polypeptides under prebiotic conditions, a possible approach, at least to understand the differences between extant proteins and the possible large number of never born proteins, could be biological. Using the phage display method a large library of totally random amino acidic sequences was obtained. Consequently, different experiments to directly consider the frequency of stable folds were performed, and the interesting results obtained from such new approach are discussed in terms of contingency, contributing to the discussion on the selection mechanism of extant proteins.
Application of 2D graphic representation of protein sequence based on Huffman tree method.

PubMed

Qi, Zhao-Hui; Feng, Jun; Qi, Xiao-Qin; Li, Ling

2012-05-01

Based on Huffman tree method, we propose a new 2D graphic representation of protein sequence. This representation can completely avoid loss of information in the transfer of data from a protein sequence to its graphic representation. The method consists of two parts. One is about the 0-1 codes of 20 amino acids by Huffman tree with amino acid frequency. The amino acid frequency is defined as the statistical number of an amino acid in the analyzed protein sequences. The other is about the 2D graphic representation of protein sequence based on the 0-1 codes. Then the applications of the method on ten ND5 genes and seven Escherichia coli strains are presented in detail. The results show that the proposed model may provide us with some new sights to understand the evolution patterns determined from protein sequences and complete genomes. Copyright © 2012 Elsevier Ltd. All rights reserved.
Opsin cDNA sequences of a UV and green rhodopsin of the satyrine butterfly Bicyclus anynana.

PubMed

Vanhoutte, K J A; Eggen, B J L; Janssen, J J M; Stavenga, D G

2002-11-01

The cDNAs of an ultraviolet (UV) and long-wavelength (LW) (green) absorbing rhodopsin of the bush brown Bicyclus anynana were partially identified. The UV sequence, encoding 377 amino acids, is 76-79% identical to the UV sequences of the papilionids Papilio glaucus and Papilio xuthus and the moth Manduca sexta. A dendrogram derived from aligning the amino acid sequences reveals an equidistant position of Bicyclus between Papilio and Manduca. The sequence of the green opsin cDNA fragment, which encodes 242 amino acids, represents six of the seven transmembrane regions. At the amino acid level, this fragment is more than 80% identical to the corresponding LW opsin sequences of Dryas, Heliconius, Papilio (rhodopsin 2) and Manduca. Whereas three LW absorbing rhodopsins were identified in the papilionid butterflies, only one green opsin was found in B. anynana.
Complete amino acid sequence of ananain and a comparison with stem bromelain and other plant cysteine proteases.

PubMed Central

Lee, K L; Albee, K L; Bernasconi, R J; Edmunds, T

1997-01-01

The amino acid sequences of ananain (EC3.4.22.31) and stem bromelain (3.4.22.32), two cysteine proteases from pineapple stem, are similar yet ananain and stem bromelain possess distinct specificities towards synthetic peptide substrates and different reactivities towards the cysteine protease inhibitors E-64 and chicken egg white cystatin. We present here the complete amino acid sequence of ananain and compare it with the reported sequences of pineapple stem bromelain, papain and chymopapain from papaya and actinidin from kiwifruit. Ananain is comprised of 216 residues with a theoretical mass of 23464 Da. This primary structure includes a sequence insert between residues 170 and 174 not present in stem bromelain or papain and a hydrophobic series of amino acids adjacent to His-157. It is possible that these sequence differences contribute to the different substrate and inhibitor specificities exhibited by ananain and stem bromelain. PMID:9355753
Efficient analysis of mouse genome sequences reveal many nonsense variants

PubMed Central

Steeland, Sophie; Timmermans, Steven; Van Ryckeghem, Sara; Hulpiau, Paco; Saeys, Yvan; Van Montagu, Marc; Vandenbroucke, Roosmarijn E.; Libert, Claude

2016-01-01

Genetic polymorphisms in coding genes play an important role when using mouse inbred strains as research models. They have been shown to influence research results, explain phenotypical differences between inbred strains, and increase the amount of interesting gene variants present in the many available inbred lines. SPRET/Ei is an inbred strain derived from Mus spretus that has ∼1% sequence difference with the C57BL/6J reference genome. We obtained a listing of all SNPs and insertions/deletions (indels) present in SPRET/Ei from the Mouse Genomes Project (Wellcome Trust Sanger Institute) and processed these data to obtain an overview of all transcripts having nonsynonymous coding sequence variants. We identified 8,883 unique variants affecting 10,096 different transcripts from 6,328 protein-coding genes, which is about 28% of all coding genes. Because only a subset of these variants results in drastic changes in proteins, we focused on variations that are nonsense mutations that ultimately resulted in a gain of a stop codon. These genes were identified by in silico changing the C57BL/6J coding sequences to the SPRET/Ei sequences, converting them to amino acid (AA) sequences, and comparing the AA sequences. All variants and transcripts affected were also stored in a database, which can be browsed using a SPRET/Ei M. spretus variants web tool (www.spretus.org), including a manual. We validated the tool by demonstrating the loss of function of three proteins predicted to be severely truncated, namely Fas, IRAK2, and IFNγR1. PMID:27147605
WebLogo

DOE Office of Scientific and Technical Information (OSTI.GOV)

Crooks, Gavin E.

WebLogo is a web based application designed to make the generation of sequence logos as easy and painless as possible. Sequesnce logos are a graphical representation of an amino acid or nucleic acid multiple sequence alignment developed by Tom Schneider and Mike Stephens. Each logo consists of stacks of symbols, one stack for each position in the sequence. The overall height of the stack indicates the sequence conservation at that position, while the height of symbols within the stack indicates the relative frequency of each amino or nucleic acid at that position. In general, a sequence logo provides a richermore » and more precise description of, for example, a binding site, than would a consensus sequence.« less
High molecular weight glutenin subunits in some durum wheat cultivars investigated by means of mass spectrometric techniques.

PubMed

Muccilli, Vera; Lo Bianco, Marisol; Cunsolo, Vincenzo; Saletti, Rosaria; Gallo, Giulia; Foti, Salvatore

2011-11-23

The primary structures of high molecular weight glutenin subunits (HMW-GS) of 5 Triticum durum Desf. cultivars (Simeto, Svevo, Duilio, Bronte, and Sant'Agata), largely cultivated in the south of Italy, and of 13 populations of the old spring Sicilian durum wheat landrace Timilia (Triticum durum Desf.) (accession nos. 1, 2, 3, 4, 7, 8, 9, 13, 14, 15, SG1, SG2, and SG3) were investigated using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOFMS) and reversed-phase high performance liquid chromatography/nanoelectrospray ionization mass spectrometry (RP-HPLC/nESI-MS/MS). M(r) of the intact proteins determined by MALDI mass spectrometry showed that all the 13 populations of Timilia contained the same two HMW-GS with 75.2 kDa and 86.4 kDa, whereas the other durum wheat cultivars showed the presence of the expected HMW-GS 1By8 and 1Bx7 at 75.1 kDa and 83.1 kDa, respectively. By MALDI mass spectrometry of the tryptic digestion peptides of the isolated HMW-GS of Timilia, the 1Bx and 1By subunits were identified as the NCBInr Acc. No AAQ93629, and AAQ93633, respectively. Sequence verification for HMW-GS 1Bx and 1By both in Simeto and Timilia was obtained by MALDI mass mapping and HPLC/nESI-MSMS of the tryptic peptides. The Bx subunit of Timila presents a sequence similarity of 96% with respect to Simeto, with differences in the insertion of 3 peptides of 5, 9, and 15 amino acids, for a total insertion of 29 amino acids and 25 amino acid substitutions. These differences in the amino acidic sequence account for the determined Δm of 3294 Da between the M(r) of the 1Bx subunits in Timilia and Simeto. Sequence alignment between the two By subunits shows 10 amino acid substitutions and is consistent with the Δm of 148 Da found in the MALDI mass spectra of the intact subunits.
The DynaMine webserver: predicting protein dynamics from sequence.

PubMed

Cilia, Elisa; Pancsa, Rita; Tompa, Peter; Lenaerts, Tom; Vranken, Wim F

2014-07-01

Protein dynamics are important for understanding protein function. Unfortunately, accurate protein dynamics information is difficult to obtain: here we present the DynaMine webserver, which provides predictions for the fast backbone movements of proteins directly from their amino-acid sequence. DynaMine rapidly produces a profile describing the statistical potential for such movements at residue-level resolution. The predicted values have meaning on an absolute scale and go beyond the traditional binary classification of residues as ordered or disordered, thus allowing for direct dynamics comparisons between protein regions. Through this webserver, we provide molecular biologists with an efficient and easy to use tool for predicting the dynamical characteristics of any protein of interest, even in the absence of experimental observations. The prediction results are visualized and can be directly downloaded. The DynaMine webserver, including instructive examples describing the meaning of the profiles, is available at http://dynamine.ibsquare.be. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Short Communication An efficient method for simultaneous extraction of high-quality RNA and DNA from various plant tissues.

PubMed

Oliveira, R R; Viana, A J C; Reátegui, A C E; Vincentz, M G A

2015-12-29

Determination of gene expression is an important tool to study biological processes and relies on the quality of the extracted RNA. Changes in gene expression profiles may be directly related to mutations in regulatory DNA sequences or alterations in DNA cytosine methylation, which is an epigenetic mark. Correlation of gene expression with DNA sequence or epigenetic mark polymorphism is often desirable; for this, a robust protocol to isolate high-quality RNA and DNA simultaneously from the same sample is required. Although commercial kits and protocols are available, they are mainly optimized for animal tissues and, in general, restricted to RNA or DNA extraction, not both. In the present study, we describe an efficient and accessible method to extract both RNA and DNA simultaneously from the same sample of various plant tissues, using small amounts of starting material. The protocol was efficient in the extraction of high-quality nucleic acids from several Arabidopsis thaliana tissues (e.g., leaf, inflorescence stem, flower, fruit, cotyledon, seedlings, root, and embryo) and from other tissues of non-model plants, such as Avicennia schaueriana (Acanthaceae), Theobroma cacao (Malvaceae), Paspalum notatum (Poaceae), and Sorghum bicolor (Poaceae). The obtained nucleic acids were used as templates for downstream analyses, such as mRNA sequencing, quantitative real time-polymerase chain reaction, bisulfite treatment, and others; the results were comparable to those obtained with commercial kits. We believe that this protocol could be applied to a broad range of plant species, help avoid technical and sampling biases, and facilitate several RNA- and DNA-dependent analyses.
A sensitive detection method for MPLW515L or MPLW515K mutation in chronic myeloproliferative disorders with locked nucleic acid-modified probes and real-time polymerase chain reaction.

PubMed

Pancrazzi, Alessandro; Guglielmelli, Paola; Ponziani, Vanessa; Bergamaschi, Gaetano; Bosi, Alberto; Barosi, Giovanni; Vannucchi, Alessandro M

2008-09-01

Acquired mutations in the juxtamembrane region of MPL (W515K or W515L), the receptor for thrombopoietin, have been described in patients with primary myelofibrosis or essential thrombocythemia, which are chronic myeloproliferative disorders. We have developed a real-time polymerase chain reaction assay for the detection and quantification of MPL mutations that is based on locked nucleic acid fluorescent probes. Mutational analysis was performed using DNA from granulocytes. Reference curves were obtained using cloned fragments of MPL containing either the wild-type or mutated sequence; the predicted sensitivity level was at least 0.1% mutant allele in a wild-type background. None of the 60 control subjects presented with a MPLW515L/K mutation. Of 217 patients with myelofibrosis, 19 (8.7%) harbored the MPLW515 mutation, 10 (52.6%) with the W515L allele. In one case, both the W515L and W515K alleles were detected by real-time polymerase chain reaction. By comparing results obtained with conventional sequencing, no erroneous genotype attribution using real-time polymerase chain reaction was found, whereas one patient considered wild type according to sequence analysis actually harbored a low W515L allele burden. This is a simple, sensitive, and cost-effective procedure for large-scale screening of the MPLW515L/K mutation in patients suspected to have a myeloproliferative disorder. It can also provide a quantitative estimate of mutant allele burden that might be useful for both patient prognosis and monitoring response to therapy.
Nucleotide sequence of the phosphoglycerate kinase gene from the extreme thermophile Thermus thermophilus. Comparison of the deduced amino acid sequence with that of the mesophilic yeast phosphoglycerate kinase.

PubMed Central

Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L

1988-01-01

Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. Images Fig. 1. PMID:3052437
Sequence of a cDNA encoding pancreatic preprosomatostatin-22.

PubMed Central

Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E

1982-01-01

We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. Images PMID:6127673
Genomic perspectives of spider silk genes through target capture sequencing: Conservation of stabilization mechanisms and homology-based structural models of spidroin terminal regions.

PubMed

Collin, Matthew A; Clarke, Thomas H; Ayoub, Nadia A; Hayashi, Cheryl Y

2018-07-01

A powerful system for studying protein aggregation, particularly rapid self-assembly, is spider silk. Spider silks are proteinaceous and silk proteins are synthesized and stored within silk glands as liquid dope. As needed, liquid dope is near-instantaneously transformed into solid fibers or viscous adhesives. The dominant constituents of silks are spidroins (spider fibroins) and their terminal domains are vital for the tight control of silk self-assembly. To better understand spidroin termini, we used target capture and deep sequencing to identify spidroin gene sequences from six species representing the araneoid families of Araneidae, Nephilidae, and Theridiidae. We obtained 145 terminal regions, of which 103 are newly annotated here, as well as novel variants within nine diverse spidroin types. Our comparative analyses demonstrated the conservation of acidic, basic, and cysteine amino acid residues across spidroin types that had been proposed to be important for monomer stability, dimer formation, and self-assembly from a limited sampling of spidroins. Computational, protein homology modeling revealed areas of spidroin terminal regions that are highly conserved in three-dimensions despite sequence divergence across spidroin types. Analyses of our dense sampling of terminal regions suggest that most spidroins share stabilization mechanisms, dimer formation, and tertiary structure, despite producing functionally distinct materials. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Recognition of Double Stranded RNA by Guanidine-Modified Peptide Nucleic Acids (GPNA)

PubMed Central

Gupta, Pankaj; Muse, Oluwatoyosi; Rozners, Eriks

2011-01-01

Double helical RNA has become an attractive target for molecular recognition because many non-coding RNAs play important roles in control of gene expression. Recently, we discovered that short peptide nucleic acids (PNA) bind strongly and sequence selectively to a homopurine tract of double helical RNA via triple helix formation. Herein we tested if the molecular recognition of RNA can be enhanced by α-guanidine modification of PNA. Our study was motivated by the discovery of Ly and co-workers that the guanidine modification greatly enhances the cellular delivery of PNA. Isothermal titration calorimetry showed that the guanidine-modified PNA (GPNA) had reduced affinity and sequence selectivity for triple helical recognition of RNA. The data suggested that in contrast to unmodified PNA, which formed a 1:1 PNA-RNA triple helix, GPNA preferred a 2:1 GPNA-RNA triplex-invasion complex. Nevertheless, promising results were obtained for recognition of biologically relevant double helical RNA. Consistent with enhanced strand invasion ability, GPNA derived from D-arginine recognized the transactivation response element (TAR) of HIV-1 with high affinity and sequence selectivity, presumably via Watson-Crick duplex formation. On the other hand, strong and sequence selective triple helices were formed by unmodified and nucelobase-modified PNAs and the purine rich strand of bacterial A-site. These results suggest that appropriate chemical modifications of PNA may enhance molecular recognition of complex non-coding RNAs. PMID:22146072
Phylogenetic Relationship of Necoclí Virus to Other South American Hantaviruses (Bunyaviridae: Hantavirus).

PubMed

Montoya-Ruiz, Carolina; Cajimat, Maria N B; Milazzo, Mary Louise; Diaz, Francisco J; Rodas, Juan David; Valbuena, Gustavo; Fulhorst, Charles F

2015-07-01

The results of a previous study suggested that Cherrie's cane rat (Zygodontomys cherriei) is the principal host of Necoclí virus (family Bunyaviridae, genus Hantavirus) in Colombia. Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences in this study confirmed that Necoclí virus is phylogenetically closely related to Maporal virus, which is principally associated with the delicate pygmy rice rat (Oligoryzomys delicatus) in western Venezuela. In pairwise comparisons, nonidentities between the complete amino acid sequence of the nucleocapsid protein of Necoclí virus and the complete amino acid sequences of the nucleocapsid proteins of other hantaviruses were ≥8.7%. Likewise, nonidentities between the complete amino acid sequence of the glycoprotein precursor of Necoclí virus and the complete amino acid sequences of the glycoprotein precursors of other hantaviruses were ≥11.7%. Collectively, the unique association of Necoclí virus with Z. cherriei in Colombia, results of the Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences, and results of the pairwise comparisons of amino acid sequences strongly support the notion that Necoclí virus represents a novel species in the genus Hantavirus. Further work is needed to determine whether Calabazo virus (a hantavirus associated with Z. brevicauda cherriei in Panama) and Necoclí virus are conspecific.
Identification and characterization of Theileria ovis surface protein (ToSp) resembled TaSp in Theileria annulata.

PubMed

Shayan, P; Jafari, S; Fattahi, R; Ebrahimzade, E; Amininia, N; Changizi, E

2016-05-01

Ovine theileriosis is an important hemoprotozoal disease of sheep and goats in tropical and subtropical regions which caused high economic loses in the livestock industry. Theileria annulata surface protein (TaSp) was used previously as a tool for serological analysis in livestock. Since the amino acid sequences of TaSp is, at least, in part very conserved in T. annulata, Theileria lestoquardi and Theileria china I and II, it is very important to determine the amino acid sequence of this protein in Theileria ovis as well, to avoid false interpretation of serological data based on this protein in small animal. In the present study, the nucleotide sequence and amino acid sequence of T. ovis surface protein (ToSp) were determined. The comparison of the nucleotide sequence of ToSp showed 96, 96, 99, and 86 % homology to the corresponding nucleotide sequence of TaSp genes by T. annulata, T. China I, T. China II and T. lestoquardi, previously registered in GenBank under accession nos. AJ316260.1, AY274329.1, DQ120058.1, and EF092924.1 respectively. The amino acid sequence analysis showed 95, 81, 98 and 70 % homology to the corresponding amino acid sequence of T. annulata, T chinaI, T china II and T. lestoquardi, registered in GenBank under accession nos. CAC87478.1, AAP36993.1, AAZ30365.1 and AAP36999.11, respectively. Interestingly, in contrast to the C terminus, a significant difference in amino acid sequence in the N teminus of the ToSp protein could be determined compared to the other known corresponding TaSp sequences, which make this region attractive for designing of a suitable tool for serological diagnosis.
Vicilin and convicilin are potential major allergens from pea.

PubMed

Sanchez-Monge, R; Lopez-Torrejón, G; Pascual, C Y; Varela, J; Martin-Esteban, M; Salcedo, G

2004-11-01

Allergic reactions to pea (Pisum sativum) ingestion are frequently associated with lentil allergy in the Spanish population. Vicilin have been described as a major lentil allergen. To identify the main IgE binding components from pea seeds and to study their potential cross-reactivity with lentil vicilin. A serum pool or individual sera from 18 patients with pea allergy were used to detect IgE binding proteins from pea seeds by immunodetection and immunoblot inhibition assays. Protein preparations enriched in pea vicilin were obtained by gel filtration chromatography followed by reverse-phase high-performance liquid chromatography (HPLC). IgE binding components were identified by means of N-terminal amino acid sequencing. Complete cDNAs encoding pea vicilin were isolated by PCR, using primers based on the amino acid sequence of the reactive proteins. IgE immunodetection of crude pea extracts revealed that convicilin (63 kDa), as well as vicilin (44 kDa) and one of its proteolytic fragments (32 kDa), reacted with more than 50% of the individual sera tested. Additional proteolytic subunits of vicilin (36, 16 and 13 kDa) bound IgE from approximately 20% of the sera. The lentil vicilin allergen Len c 1 strongly inhibited the IgE binding to all components mentioned above. The characterization of cDNA clones encoding pea vicilin has allowed the deduction of its complete amino acid sequence (90% of sequence identity to Len c 1), as well as those of its reactive proteolytic processed subunits. Vicilin and convicilin are potential major allergens from pea seeds. Furthermore, proteolytic fragments from vicilin are also relevant IgE binding pea components. All these proteins cross-react with the major lentil allergen Len c 1.
Cloning, sequencing, purification, and crystal structure of Grenache (Vitis vinifera) polyphenol oxidase.

PubMed

Virador, Victoria M; Reyes Grajeda, Juan P; Blanco-Labra, Alejandro; Mendiola-Olaya, Elizabeth; Smith, Gary M; Moreno, Abel; Whitaker, John R

2010-01-27

The full-length cDNA sequence (P93622_VITVI) of polyphenol oxidase (PPO) cDNA from grape Vitis vinifera L., cv Grenache, was found to encode a translated protein of 607 amino acids with an expected molecular weight of ca. 67 kDa and a predicted pI of 6.83. The translated amino acid sequence was 99%, identical to that of a white grape berry PPO (1) (5 out of 607 amino acid potential sequence differences). The protein was purified from Grenache grape berries by using traditional methods, and it was crystallized with ammonium acetate by the hanging-drop vapor diffusion method. The crystals were orthorhombic, space group C222(1). The structure was obtained at 2.2 A resolution using synchrotron radiation using the 39 kDa isozyme of sweet potato PPO (PDB code: 1BT1 ) as a phase donor. The basic symmetry of the cell parameters (a, b, and c and alpha, beta, and gamma) as well as in the number of asymmetric units in the unit cell of the crystals of PPO, differed between the two proteins. The structures of the two enzymes are quite similar in overall fold, the location of the helix bundles at the core, and the active site in which three histidines bind each of the two catalytic copper ions, and one of the histidines is engaged in a thioether linkage with a cysteine residue. The possibility that the formation of the Cys-His thioether linkage constitutes the activation step is proposed. No evidence of phosphorylation or glycoslyation was found in the electron density map. The mass of the crystallized protein appears to be only 38.4 kDa, and the processing that occurs in the grape berry that leads to this smaller size is discussed.
Characterization and functional analysis of hypoxia-inducible factor HIF1α and its inhibitor HIF1αn in tilapia.

PubMed

Li, Hong Lian; Gu, Xiao Hui; Li, Bi Jun; Chen, Xiao; Lin, Hao Ran; Xia, Jun Hong

2017-01-01

Hypoxia is a major cause of fish morbidity and mortality in the aquatic environment. Hypoxia-inducible factors are very important modulators in the transcriptional response to hypoxic stress. In this study, we characterized and conducted functional analysis of hypoxia-inducible factor HIF1α and its inhibitor HIF1αn in Nile tilapia (Oreochromis niloticus). By cloning and Sanger sequencing, we obtained the full length cDNA sequences for HIF1α (2686bp) and HIF1αn (1308bp), respectively. The CDS of HIF1α includes 15 exons encoding 768 amino acid residues and the CDS of HIF1αn contains 8 exons encoding 354 amino acid residues. The complete CDS sequences of HIF1α and HIF1αn cloned from tilapia shared very high homology with known genes from other fishes. HIF1α show differentiated expression in different tissues (brain, heart, gill, spleen, liver) and at different hypoxia exposure times (6h, 12h, 24h). HIF1αn expression level under hypoxia is generally increased (6h, 12h, 24h) and shows extremely highly upregulation in brain tissue under hypoxia. A functional determination site analysis in the protein sequences between fish and land animals identified 21 amino acid sites in HIF1α and 2 sites in HIF1αn as significantly associated sites (α = 0.05). Phylogenetic tree-based positive selection analysis suggested 22 sites in HIF1α as positively selected sites with a p-value of at least 95% for fish lineages compared to the land animals. Our study could be important for clarifying the mechanism of fish adaptation to aquatic hypoxia environment.

Characterization and functional analysis of hypoxia-inducible factor HIF1α and its inhibitor HIF1αn in tilapia

PubMed Central

Li, Hong Lian; Gu, Xiao Hui; Li, Bi Jun; Chen, Xiao; Lin, Hao Ran; Xia, Jun Hong

2017-01-01

Hypoxia is a major cause of fish morbidity and mortality in the aquatic environment. Hypoxia-inducible factors are very important modulators in the transcriptional response to hypoxic stress. In this study, we characterized and conducted functional analysis of hypoxia-inducible factor HIF1α and its inhibitor HIF1αn in Nile tilapia (Oreochromis niloticus). By cloning and Sanger sequencing, we obtained the full length cDNA sequences for HIF1α (2686bp) and HIF1αn (1308bp), respectively. The CDS of HIF1α includes 15 exons encoding 768 amino acid residues and the CDS of HIF1αn contains 8 exons encoding 354 amino acid residues. The complete CDS sequences of HIF1α and HIF1αn cloned from tilapia shared very high homology with known genes from other fishes. HIF1α show differentiated expression in different tissues (brain, heart, gill, spleen, liver) and at different hypoxia exposure times (6h, 12h, 24h). HIF1αn expression level under hypoxia is generally increased (6h, 12h, 24h) and shows extremely highly upregulation in brain tissue under hypoxia. A functional determination site analysis in the protein sequences between fish and land animals identified 21 amino acid sites in HIF1α and 2 sites in HIF1αn as significantly associated sites (α = 0.05). Phylogenetic tree-based positive selection analysis suggested 22 sites in HIF1α as positively selected sites with a p-value of at least 95% for fish lineages compared to the land animals. Our study could be important for clarifying the mechanism of fish adaptation to aquatic hypoxia environment. PMID:28278251
Brain cDNA clone for human cholinesterase

DOE Office of Scientific and Technical Information (OSTI.GOV)

McTiernan, C.; Adkins, S.; Chatonnet, A.

1987-10-01

A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum.more » The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes coded for cholinesterase.« less
Transcriptomic Analysis of the Underground Renewal Buds during Dormancy Transition and Release in ‘Hangbaishao’ Peony (Paeonia lactiflora)

PubMed Central

Zhang, Jiaping; Wang, Guanqun; Li, Xin; Xia, Yiping

2015-01-01

Paeonia lactiflora is one of the most famous species of herbaceous peonies with gorgeous flowers. Bud dormancy is a crucial developmental process that allows P. lactiflora to survive unfavorable environmental conditions. However, little information is available on the molecular mechanism of the bud dormancy in P. lactiflora. We performed de novo transcriptome sequencing using the Illumina RNA sequencing platform for the underground renewal buds of P. lactiflora ‘Hangbaishao’ to study the molecular mechanism underlying its bud dormancy transition (the period from endodormancy to ecodormancy) and release (the period from ecodormancy to bud elongation and sprouting). Approximately 300 million high-quality clean reads were generated and assembled into 207,827 (mean length = 828 bp) and 51,481 (mean length = 1250 bp) unigenes using two assembly methods named “Trinity” and “Trinity+PRICE”, respectively. Based on the data obtained by the latter method, 32,316 unigenes were annotated by BLAST against various databases. Approximately 1,251 putative transcription factors were obtained, of which the largest number of unique transcripts belonged to the basic helix-loop-helix protein (bHLH) transcription factor family, and five of the top ten highly expressed transcripts were annotated as dehydrin (DHN). A total of 17,705 simple sequence repeat (SSR) motifs distributed in 13,797 sequences were obtained. The budbreak morphology, levels of indole-3-acetic acid (IAA) and abscisic acid (ABA), and activities of guaiacol peroxidase (POD) and catalase (CAT) were observed. The expression of 20 interested unigenes, which annotated as DHN, heat shock protein (HSP), histone, late elongated hypocotyl (LHY), and phytochrome (PHY), and so on, were also analyzed. These studies were based on morphological, physiological, biochemical, and molecular levels and provide comprehensive insight into the mechanism of dormancy transition and release in P. lactiflora. Transcriptome dataset can be highly valuable for future investigation on gene expression networks in P. lactiflora as well as research on dormancy in other non-model perennial horticultural crops of commercial significance. PMID:25790307
Microbial communities and their predicted metabolic characteristics in deep fracture groundwaters of the crystalline bedrock at Olkiluoto, Finland

NASA Astrophysics Data System (ADS)

Bomberg, Malin; Lamminmäki, Tiina; Itävaara, Merja

2016-11-01

The microbial diversity in oligotrophic isolated crystalline Fennoscandian Shield bedrock fracture groundwaters is high, but the core community has not been identified. Here we characterized the bacterial and archaeal communities in 12 water conductive fractures situated at depths between 296 and 798 m by high throughput amplicon sequencing using the Illumina HiSeq platform. Between 1.7 × 104 and 1.2 × 106 bacterial or archaeal sequence reads per sample were obtained. These sequences revealed that up to 95 and 99 % of the bacterial and archaeal sequences obtained from the 12 samples, respectively, belonged to only a few common species, i.e. the core microbiome. However, the remaining rare microbiome contained over 3- and 6-fold more bacterial and archaeal taxa. The metabolic properties of the microbial communities were predicted using PICRUSt. The approximate estimation showed that the metabolic pathways commonly included fermentation, fatty acid oxidation, glycolysis/gluconeogenesis, oxidative phosphorylation, and methanogenesis/anaerobic methane oxidation, but carbon fixation through the Calvin cycle, reductive TCA cycle, and the Wood-Ljungdahl pathway was also predicted. The rare microbiome is an unlimited source of genomic functionality in all ecosystems. It may consist of remnants of microbial communities prevailing in earlier environmental conditions, but could also be induced again if changes in their living conditions occur.
Predicting protein-protein interactions by combing various sequence- derived features into the general form of Chou's Pseudo amino acid composition.

PubMed

Zhao, Xiao-Wei; Ma, Zhi-Qiang; Yin, Ming-Hao

2012-05-01

Knowledge of protein-protein interactions (PPIs) plays an important role in constructing protein interaction networks and understanding the general machineries of biological systems. In this study, a new method is proposed to predict PPIs using a comprehensive set of 930 features based only on sequence information, these features measure the interactions between residues a certain distant apart in the protein sequences from different aspects. To achieve better performance, the principal component analysis (PCA) is first employed to obtain an optimized feature subset. Then, the resulting 67-dimensional feature vectors are fed to Support Vector Machine (SVM). Experimental results on Drosophila melanogaster and Helicobater pylori datasets show that our method is very promising to predict PPIs and may at least be a useful supplement tool to existing methods.
Characterization of a prototype strain of hepatitis E virus.

PubMed Central

Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H

1992-01-01

A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia. Images PMID:1731327
Genome-wide comparison of medieval and modern Mycobacterium leprae.

PubMed

Schuenemann, Verena J; Singh, Pushpendra; Mendum, Thomas A; Krause-Kyora, Ben; Jäger, Günter; Bos, Kirsten I; Herbig, Alexander; Economou, Christos; Benjak, Andrej; Busso, Philippe; Nebel, Almut; Boldsen, Jesper L; Kjellström, Anna; Wu, Huihai; Stewart, Graham R; Taylor, G Michael; Bauer, Peter; Lee, Oona Y-C; Wu, Houdini H T; Minnikin, David E; Besra, Gurdyal S; Tucker, Katie; Roffey, Simon; Sow, Samba O; Cole, Stewart T; Nieselt, Kay; Krause, Johannes

2013-07-12

Leprosy was endemic in Europe until the Middle Ages. Using DNA array capture, we have obtained genome sequences of Mycobacterium leprae from skeletons of five medieval leprosy cases from the United Kingdom, Sweden, and Denmark. In one case, the DNA was so well preserved that full de novo assembly of the ancient bacterial genome could be achieved through shotgun sequencing alone. The ancient M. leprae sequences were compared with those of 11 modern strains, representing diverse genotypes and geographic origins. The comparisons revealed remarkable genomic conservation during the past 1000 years, a European origin for leprosy in the Americas, and the presence of an M. leprae genotype in medieval Europe now commonly associated with the Middle East. The exceptional preservation of M. leprae biomarkers, both DNA and mycolic acids, in ancient skeletons has major implications for palaeomicrobiology and human pathogen evolution.
Decoding DNA, RNA and peptides with quantum tunnelling

NASA Astrophysics Data System (ADS)

di Ventra, Massimiliano; Taniguchi, Masateru

2016-02-01

Drugs and treatments could be precisely tailored to an individual patient by extracting their cellular- and molecular-level information. For this approach to be feasible on a global scale, however, information on complete genomes (DNA), transcriptomes (RNA) and proteomes (all proteins) needs to be obtained quickly and at low cost. Quantum mechanical phenomena could potentially be of value here, because the biological information needs to be decoded at an atomic level and quantum tunnelling has recently been shown to be able to differentiate single nucleobases and amino acids in short sequences. Here, we review the different approaches to using quantum tunnelling for sequencing, highlighting the theoretical background to the method and the experimental capabilities demonstrated to date. We also explore the potential advantages of the approach and the technical challenges that must be addressed to deliver practical quantum sequencing devices.
Limonoate dehydrogenase from Arthrobacter globiformis: the native enzyme and its N-terminal sequence.

PubMed

Suhayda, C G; Omura, M; Hasegawa, S

1995-09-01

Bitter limonoids in citrus juice lower the quality and value of commercial juices. Limonoate dehydrogenase converts the precursor of bitter limonin, limonoate A-ring lactone, to nonbitter 17-dehydrolimonoate A-ring lactone. This enzyme was isolated from Arthrobacter globiformis cells by a combination of ammonium sulfate fractionation, Cibacron Blue affinity chromatography and DEAE ion exchange HPLC. Using this protocol a 428-fold purification of the enzyme was obtained. Gel filtration HPLC indicated a M(r) of 118,000 for the native enzyme. SDS-PAGE indicated an individual subunit M(r) of 31,000. N-Terminal sequencing of the protein provided a sequence of the first 16 amino acid residues. Since LDH activity in citrus is very low, cloning the gene for this bacterial enzyme into citrus trees should enhance the natural debittering mechanism in citrus fruit.
Two potato proteins, including a novel RING finger protein (HIP1), interact with the potyviral multifunctional protein HCpro.

PubMed

Guo, Deyin; Spetz, Carl; Saarma, Mart; Valkonen, Jari P T

2003-05-01

Potyviral helper-component proteinase (HCpro) is a multifunctional protein exerting its cellular functions in interaction with putative host proteins. In this study, cellular protein partners of the HCpro encoded by Potato virus A (PVA) (genus Potyvirus) were screened in a potato leaf cDNA library using a yeast two-hybrid system. Two cellular proteins were obtained that interact specifically with PVA HCpro in yeast and in the two in vitro binding assays used. Both proteins are encoded by single-copy genes in the potato genome. Analysis of the deduced amino acid sequences revealed that one (HIP1) of the two HCpro interactors is a novel RING finger protein. The sequence of the other protein (HIP2) showed no resemblance to the protein sequences available from databanks and has known biological functions.
Method for altering antibody light chain interactions

DOEpatents

Stevens, Fred J.; Stevens, Priscilla Wilkins; Raffen, Rosemarie; Schiffer, Marianne

2002-01-01

A method for recombinant antibody subunit dimerization including modifying at least one codon of a nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in the interface segment of the light polypeptide variable region, the charged amino acid having a first polarity; and modifying at least one codon of the nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in an interface segment of the heavy polypeptide variable region corresponding to a position in the light polypeptide variable region, the charged amino acid having a second polarity opposite the first polarity. Nucleic acid sequences which code for novel light chain proteins, the latter of which are used in conjunction with the inventive method, are also provided.
Effects of fulvic acid on growth performance and intestinal health of juvenile loach Paramisgurnus dabryanus (Sauvage).

PubMed

Gao, Yang; He, Jie; He, Zhuliu; Li, Zhiwei; Zhao, Bo; Mu, Yi; Lee, Jeong-Yeol; Chu, Zhangjie

2017-03-01

A 60-day feeding trial was conducted to determine the effect of dietary fulvic acid supplements on intestinal digestive activity (enzymatic analysis), antioxidant activity, immune enzyme activity and microflora composition of juvenile loach (initial weight of 6.2 ± 0.1 g) reared in experimental aquaria. Five test diets containing 0, 0.5, 1.0, 1.5, and 2% fulvic acid were randomly assigned to three aquaria, respectively. Elevated growth performance including final weight, weight gain (WG), specific growth rate (SGR) and feed conversion ratio (FCR) was observed in loaches that were fed fulvic acid. Maximal weight gain rates and specific growth rates occurred at the 1.5% additive level. The optimal dietary fulvic requirement for maximal growth of juvenile loach is 16.4 g per kg of the diet based on the quadratic regression analysis of specific growth rate against dietary fulvic acid levels. Furthermore, intestinal protease activity, antioxidant activity, lysozyme activity (LZM), complement 3 (C3) content, immunoglobulin M (IgM) content, acid phosphatase activity (ACP) and alkaline phosphatase activity (AKP) were significantly elevated with concomitant increasing levels of dietary fulvic acid. Following a deep sequencing analysis, a total of 42,058 valid reads and 609 OTUs (operational taxonomic units) obtained from the control group and the group displaying the most optimal growth rate were analyzed. Fulvic acid supplementation resulted in an abundance of Firmicute and Actinobacteria sequences, with a concomitant reduction in the abundance of Proteobacteria. Results indicated that fulvic acid supplementation resulted in a reduction in the relative abundance of Serratia, Acinetobacter, Aeromonas and Edwardsiella, and a relative increase in the abundance of Lactobacillus in the intestine. In conclusion, these results suggest that fulvic acid improves growth performance and intestinal health condition of loach, indicates that fulvic acid could be used as an immunoenhancer in loach culture. Copyright © 2017. Published by Elsevier Ltd.
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2013 CFR

2013-07-01

... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2010 CFR

2010-07-01

... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2012 CFR

2012-07-01

... in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was... ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as “n” or “Xaa... acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter...
A Thermoacidophile-Specific Protein Family, DUF3211, Functions as a Fatty Acid Carrier with Novel Binding Mode

PubMed Central

Miyakawa, Takuya; Sawano, Yoriko; Miyazono, Ken-ichi; Miyauchi, Yumiko; Hatano, Ken-ichi

2013-01-01

STK_08120 is a member of the thermoacidophile-specific DUF3211 protein family from Sulfolobus tokodaii strain 7. Its molecular function remains obscure, and sequence similarities for obtaining functional remarks are not available. In this study, the crystal structure of STK_08120 was determined at 1.79-Å resolution to predict its probable function using structure similarity searches. The structure adopts an α/β structure of a helix-grip fold, which is found in the START domain proteins with cavities for hydrophobic substrates or ligands. The detailed structural features implied that fatty acids are the primary ligand candidates for STK_08120, and binding assays revealed that the protein bound long-chain saturated fatty acids (>C14) and their trans-unsaturated types with an affinity equal to that for major fatty acid binding proteins in mammals and plants. Moreover, the structure of an STK_08120-myristic acid complex revealed a unique binding mode among fatty acid binding proteins. These results suggest that the thermoacidophile-specific protein family DUF3211 functions as a fatty acid carrier with a novel binding mode. PMID:23836863
Characterization of a novel 8R,11S-linoleate diol synthase from Penicillium chrysogenum by identification of its enzymatic products[S

PubMed Central

Shin, Kyung-Chul; Seo, Min-Ju; Oh, Deok-Kun

2016-01-01

To identify novel fatty acid diol synthases, putative candidate sequences from Penicillium species were analyzed, and hydroxy fatty acid production by crude Penicillium enzyme extracts was assessed. Penicillium chrysogenum was found to produce an unknown dihydroxy fatty acid, a candidate gene implicated in this production was cloned and expressed, and the expressed enzyme was purified. The product obtained by the reaction of the purified enzyme with linoleic acid was identified as 8R,11S-dihydroxy-9,12(Z,Z)-octadecadienoic acid (8R,11S-DiHODE). The catalytic efficiency of this enzyme toward linoleic acid was the highest among the unsaturated fatty acids tested, indicating that this enzyme was a novel 8R,11S-linoleate diol synthase (8R,11S-LDS). A sexual stage in the life cycle of P. chrysogenum has recently been discovered, and 8R,11S-DiHODE produced by 8R,11S-LDS may constitute a precocious sexual inducer factor, responsible for regulating the sexual and asexual cycles of this fungus. PMID:26681780
The twilight zone of cis element alignments.

PubMed

Sebastian, Alvaro; Contreras-Moreira, Bruno

2013-02-01

Sequence alignment of proteins and nucleic acids is a routine task in bioinformatics. Although the comparison of complete peptides, genes or genomes can be undertaken with a great variety of tools, the alignment of short DNA sequences and motifs entails pitfalls that have not been fully addressed yet. Here we confront the structural superposition of transcription factors with the sequence alignment of their recognized cis elements. Our goals are (i) to test TFcompare (http://floresta.eead.csic.es/tfcompare), a structural alignment method for protein-DNA complexes; (ii) to benchmark the pairwise alignment of regulatory elements; (iii) to define the confidence limits and the twilight zone of such alignments and (iv) to evaluate the relevance of these thresholds with elements obtained experimentally. We find that the structure of cis elements and protein-DNA interfaces is significantly more conserved than their sequence and measures how this correlates with alignment errors when only sequence information is considered. Our results confirm that DNA motifs in the form of matrices produce better alignments than individual sequences. Finally, we report that empirical and theoretically derived twilight thresholds are useful for estimating the natural plasticity of regulatory sequences, and hence for filtering out unreliable alignments.
The twilight zone of cis element alignments

PubMed Central

Sebastian, Alvaro; Contreras-Moreira, Bruno

2013-01-01

Sequence alignment of proteins and nucleic acids is a routine task in bioinformatics. Although the comparison of complete peptides, genes or genomes can be undertaken with a great variety of tools, the alignment of short DNA sequences and motifs entails pitfalls that have not been fully addressed yet. Here we confront the structural superposition of transcription factors with the sequence alignment of their recognized cis elements. Our goals are (i) to test TFcompare (http://floresta.eead.csic.es/tfcompare), a structural alignment method for protein–DNA complexes; (ii) to benchmark the pairwise alignment of regulatory elements; (iii) to define the confidence limits and the twilight zone of such alignments and (iv) to evaluate the relevance of these thresholds with elements obtained experimentally. We find that the structure of cis elements and protein–DNA interfaces is significantly more conserved than their sequence and measures how this correlates with alignment errors when only sequence information is considered. Our results confirm that DNA motifs in the form of matrices produce better alignments than individual sequences. Finally, we report that empirical and theoretically derived twilight thresholds are useful for estimating the natural plasticity of regulatory sequences, and hence for filtering out unreliable alignments. PMID:23268451
Combinatorial interactions of two amino acids with a single base pair define target site specificity in plant dimeric homeodomain proteins

PubMed Central

Tron, Adriana E.; Bertoncini, Carlos W.; Palena, Claudia M.; Chan, Raquel L.; Gonzalez, Daniel H.

2001-01-01

Four groups of plant homeodomain proteins contain a dimerization motif closely linked to the homeodomain. We here show that two sunflower homeodomain proteins, Hahb-4 and HAHR1, which belong to the Hd-Zip I and GL2/Hd-Zip IV groups, respectively, show different binding preferences at a defined position of a pseudopalindromic DNA-binding site used as a target. HAHR1 shows a preference for the sequence 5′-CATT(A/T)AATG-3′, rather than 5′-CAAT(A/T)ATTG-3′, recognized by Hahb-4. To analyze the molecular basis of this behavior, we have constructed a set of mutants with exchanged residues (Phe→Ile and Ile→Phe) at position 47 of the homeodomain, together with chimeric proteins between HAHR1 and Hahb-4. The results obtained indicate that Phe47, but not Ile47, allows binding to 5′-CATT(A/T)AATG-3′. However, the preference for this sequence is determined, in addition, by amino acids located C-terminal to residue 53 of the HAHR1 homeodomain. A double mutant of Hahb-4 (Ile47→Phe/Ala54→Thr) shows the same binding behavior as HAHR1, suggesting that combinatorial interactions of amino acid residues at positions 47 and 54 of the homeodomain are involved in establishing the affinity and selectivity of plant dimeric homeodomain proteins with different DNA target sequences. PMID:11726696

Some links on this page may take you to non-federal websites. Their policies may differ from this site.