acid sequence determination: Topics by Science.gov

Sample records for acid sequence determination

Solid phase sequencing of double-stranded nucleic acids

DOEpatents

Fu, Dong-Jing; Cantor, Charles R.; Koster, Hubert; Smith, Cassandra L.

2002-01-01

This invention relates to methods for detecting and sequencing of target double-stranded nucleic acid sequences, to nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probe comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include nucleic acids in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated determination of molecular weights and identification of the target sequence.
Solid phase sequencing of biopolymers

DOEpatents

Cantor, Charles; Koster, Hubert

2010-09-28

This invention relates to methods for detecting and sequencing target nucleic acid sequences, to mass modified nucleic acid probes and arrays of probes useful in these methods, and to kits and systems which contain these probes. Useful methods involve hybridizing the nucleic acids or nucleic acids which represent complementary or homologous sequences of the target to an array of nucleic acid probes. These probes comprise a single-stranded portion, an optional double-stranded portion and a variable sequence within the single-stranded portion. The molecular weights of the hybridized nucleic acids of the set can be determined by mass spectroscopy, and the sequence of the target determined from the molecular weights of the fragments. Nucleic acids whose sequences can be determined include DNA or RNA in biological samples such as patient biopsies and environmental samples. Probes may be fixed to a solid support such as a hybridization chip to facilitate automated molecular weight analysis and identification of the target sequence.
Hybridization and sequencing of nucleic acids using base pair mismatches

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2001-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Dipeptide Sequence Determination: Analyzing Phenylthiohydantoin Amino Acids by HPLC

NASA Astrophysics Data System (ADS)

Barton, Janice S.; Tang, Chung-Fei; Reed, Steven S.

2000-02-01

Amino acid composition and sequence determination, important techniques for characterizing peptides and proteins, are essential for predicting conformation and studying sequence alignment. This experiment presents improved, fundamental methods of sequence analysis for an upper-division biochemistry laboratory. Working in pairs, students use the Edman reagent to prepare phenylthiohydantoin derivatives of amino acids for determination of the sequence of an unknown dipeptide. With a single HPLC technique, students identify both the N-terminal amino acid and the composition of the dipeptide. This method yields good precision of retention times and allows use of a broad range of amino acids as components of the dipeptide. Students learn fundamental principles and techniques of sequence analysis and HPLC.
Method of Identifying a Base in a Nucleic Acid

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

1999-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Identifying a base in a nucleic acid

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2005-02-08

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
Probe kit for identifying a base in a nucleic acid

DOEpatents

Fodor, Stephen P. A.; Lipshutz, Robert J.; Huang, Xiaohua

2001-01-01

Devices and techniques for hybridization of nucleic acids and for determining the sequence of nucleic acids. Arrays of nucleic acids are formed by techniques, preferably high resolution, light-directed techniques. Positions of hybridization of a target nucleic acid are determined by, e.g., epifluorescence microscopy. Devices and techniques are proposed to determine the sequence of a target nucleic acid more efficiently and more quickly through such synthesis and detection techniques.
"De-novo" amino acid sequence elucidation of protein G'e by combined "top-down" and "bottom-up" mass spectrometry.

PubMed

Yefremova, Yelena; Al-Majdoub, Mahmoud; Opuni, Kwabena F M; Koy, Cornelia; Cui, Weidong; Yan, Yuetian; Gross, Michael L; Glocker, Michael O

2015-03-01

Mass spectrometric de-novo sequencing was applied to review the amino acid sequence of a commercially available recombinant protein G´ with great scientific and economic importance. Substantial deviations to the published amino acid sequence (Uniprot Q54181) were found by the presence of 46 additional amino acids at the N-terminus, including a so-called "His-tag" as well as an N-terminal partial α-N-gluconoylation and α-N-phosphogluconoylation, respectively. The unexpected amino acid sequence of the commercial protein G' comprised 241 amino acids and resulted in a molecular mass of 25,998.9 ± 0.2 Da for the unmodified protein. Due to the higher mass that is caused by its extended amino acid sequence compared with the original protein G' (185 amino acids), we named this protein "protein G'e." By means of mass spectrometric peptide mapping, the suggested amino acid sequence, as well as the N-terminal partial α-N-gluconoylations, was confirmed with 100% sequence coverage. After the protein G'e sequence was determined, we were able to determine the expression vector pET-28b from Novagen with the Xho I restriction enzyme cleavage site as the best option that was used for cloning and expressing the recombinant protein G'e in E. coli. A dissociation constant (K(d)) value of 9.4 nM for protein G'e was determined thermophoretically, showing that the N-terminal flanking sequence extension did not cause significant changes in the binding affinity to immunoglobulins.
Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

DOEpatents

Studier, F. William

1995-04-18

Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.
Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

DOEpatents

Studier, F.W.

1995-04-18

Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.
Streptococcal phosphoenolpyruvate-sugar phosphotransferase system: amino acid sequence and site of ATP-dependent phosphorylation of HPr

DOE Office of Scientific and Technical Information (OSTI.GOV)

Deutscher, J.; Pevec, B.; Beyreuther, K.

1986-10-21

The amino acid sequence of histidine-containing protein (HPr) from Streptococcus faecalis has been determined by direct Edman degradation of intact HPr and by amino acid sequence analysis of tryptic peptides, V8 proteolyptic peptides, thermolytic peptides, and cyanogen bromide cleavage products. HPr from S. faecalis was found to contain 89 amino acid residues, corresponding to a molecular weight of 9438. The amino acid sequence of HPr from S. faecalis shows extended homology to the primary structure of HPr proteins from other bacteria. Besides the phosphoenolpyruvate-dependent phosphorylation of a histidyl residue in HPr, catalyzed by enzyme I of the bacterial phosphotransferase system,more » HPr was also found to be phosphorylated at a seryl residue in an ATP-dependent protein kinase catalyzed reaction. The site of ATP-dependent phosphorylation in HPr of S faecalis has now been determined. (/sup 32/P)P-Ser-HPr was digested with three different proteases, and in each case, a single labeled peptide was isolated. Following digestion with subtilisin, they obtained a peptide with the sequence -(P)Ser-Ile-Met-. Using chymotrypsin, they isolated a peptide with the sequence -Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-Gly-Val-Met-. The longest labeled peptide was obtained with V8 staphylococcal protease. According to amino acid analysis, this peptide contained 36 out of the 89 amino acid residues of HPr. The following sequence of 12 amino acid residues of the V8 peptide was determined: -Tyr-Lys-Gly-Lys-Ser-Val-Asn-Leu-Lys-(P)Ser-Ile-Met-. Thus, the site of ATP-dependent phosphorylation was determined to be Ser-46 within the primary structure of HPr.« less
Cloning and sequencing of a gene encoding a novel extracellular neutral proteinase from Streptomyces sp. strain C5 and expression of the gene in Streptomyces lividans 1326.

PubMed Central

Lampel, J S; Aphale, J S; Lampel, K A; Strohl, W R

1992-01-01

The gene encoding a novel milk protein-hydrolyzing proteinase was cloned on a 6.56-kb SstI fragment from Streptomyces sp. strain C5 genomic DNA into Streptomyces lividans 1326 by using the plasmid vector pIJ702. The gene encoding the small neutral proteinase (snpA) was located within a 2.6-kb BamHI-SstI restriction fragment that was partially sequenced. The molecular mass of the deduced amino acid sequence of the mature protein was determined to be 15,740, which corresponds very closely with the relative molecular mass of the purified protein (15,500) determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The N-terminal amino acid sequence of the purified neutral proteinase was determined, and the DNA encoding this sequence was found to be located within the sequenced DNA. The deduced amino acid sequence contains a conserved zinc binding site, although secondary ligand binding and active sites typical of thermolysinlike metalloproteinases are absent. The combination of its small size, deduced amino acid sequence, and substrate and inhibition profile indicate that snpA encodes a novel neutral proteinase. Images PMID:1569011
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, M.S.

1998-08-18

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device. 27 figs.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.; Wang, Chunwei; Jevons, Luis C.; Bernhart, Derek H.; Lipshutz, Robert J.

2004-05-11

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

1998-08-18

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

2003-08-19

A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

1999-10-26

A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Computer-aided visualization and analysis system for sequence evaluation

DOEpatents

Chee, Mark S.

2001-06-05

A computer system (1) for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments may be improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area (814) and sample sequences in another area (816) on a display device (3).
Identification and characterization of Theileria ovis surface protein (ToSp) resembled TaSp in Theileria annulata.

PubMed

Shayan, P; Jafari, S; Fattahi, R; Ebrahimzade, E; Amininia, N; Changizi, E

2016-05-01

Ovine theileriosis is an important hemoprotozoal disease of sheep and goats in tropical and subtropical regions which caused high economic loses in the livestock industry. Theileria annulata surface protein (TaSp) was used previously as a tool for serological analysis in livestock. Since the amino acid sequences of TaSp is, at least, in part very conserved in T. annulata, Theileria lestoquardi and Theileria china I and II, it is very important to determine the amino acid sequence of this protein in Theileria ovis as well, to avoid false interpretation of serological data based on this protein in small animal. In the present study, the nucleotide sequence and amino acid sequence of T. ovis surface protein (ToSp) were determined. The comparison of the nucleotide sequence of ToSp showed 96, 96, 99, and 86 % homology to the corresponding nucleotide sequence of TaSp genes by T. annulata, T. China I, T. China II and T. lestoquardi, previously registered in GenBank under accession nos. AJ316260.1, AY274329.1, DQ120058.1, and EF092924.1 respectively. The amino acid sequence analysis showed 95, 81, 98 and 70 % homology to the corresponding amino acid sequence of T. annulata, T chinaI, T china II and T. lestoquardi, registered in GenBank under accession nos. CAC87478.1, AAP36993.1, AAZ30365.1 and AAP36999.11, respectively. Interestingly, in contrast to the C terminus, a significant difference in amino acid sequence in the N teminus of the ToSp protein could be determined compared to the other known corresponding TaSp sequences, which make this region attractive for designing of a suitable tool for serological diagnosis.
Gene encoding a novel extracellular metalloprotease in Bacillus subtilis.

PubMed Central

Sloma, A; Rudolph, C F; Rufo, G A; Sullivan, B J; Theriault, K A; Ally, D; Pero, J

1990-01-01

The gene for a novel extracellular metalloprotease was cloned, and its nucleotide sequence was determined. The gene (mpr) encodes a primary product of 313 amino acids that has little similarity to other known Bacillus proteases. The amino acid sequence of the mature protease was preceded by a signal sequence of approximately 34 amino acids and a pro sequence of 58 amino acids. Four cysteine residues were found in the deduced amino acid sequence of the mature protein, indicating the possible presence of disulfide bonds. The mpr gene mapped in the cysA-aroI region of the chromosome and was not required for growth or sporulation. Images FIG. 2 FIG. 7 PMID:2105291

Identification of Delta5-fatty acid desaturase from the cellular slime mold dictyostelium discoideum.

PubMed

Saito, T; Ochiai, H

1999-10-01

cDNA fragments putatively encoding amino acid sequences characteristic of the fatty acid desaturase were obtained using expressed sequence tag (EST) information of the Dictyostelium cDNA project. Using this sequence, we have determined the cDNA sequence and genomic sequence of a desaturase. The cloned cDNA is 1489 nucleotides long and the deduced amino acid sequence comprised 464 amino acid residues containing an N-terminal cytochrome b5 domain. The whole sequence was 38.6% identical to the initially identified Delta5-desaturase of Mortierella alpina. We have confirmed its function as Delta5-desaturase by over expression mutation in D. discoideum and also the gain of function mutation in the yeast Saccharomyces cerevisiae. Analysis of the lipids from transformed D. discoideum and yeast demonstrated the accumulation of Delta5-desaturated products. This is the first report concering fatty acid desaturase in cellular slime molds.
Molecular Cloning of Adenosinediphosphoribosyl Transferase.

DTIC Science & Technology

1987-09-08

nature of the blocking group is unknown, except its identity with pyroglutamic acid was ruled out by its insensitivity to pyroglutaminase (not shown...AdenosinediphosphoribOSyl Transferase (ADPRT) is: 1) the complete amino acid sequence of this large protein is best determined -from the DNA !equence of the gene, 2...enzyme (I), determination of its peptide structure (II) and application of synthetic DNA probes (III) derived from amino acid sequences, resulting in the
Complete amino acid sequence of bovine colostrum low-Mr cysteine proteinase inhibitor.

PubMed

Hirado, M; Tsunasawa, S; Sakiyama, F; Niinobe, M; Fujii, S

1985-07-01

The complete amino acid sequence of bovine colostrum cysteine proteinase inhibitor was determined by sequencing native inhibitor and peptides obtained by cyanogen bromide degradation, Achromobacter lysylendopeptidase digestion and partial acid hydrolysis of reduced and S-carboxymethylated protein. Achromobacter peptidase digestion was successfully used to isolate two disulfide-containing peptides. The inhibitor consists of 112 amino acids with an Mr of 12787. Two disulfide bonds were established between Cys 66 and Cys 77 and between Cys 90 and Cys 110. A high degree of homology in the sequence was found between the colostrum inhibitor and human gamma-trace, human salivary acidic protein and chicken egg-white cystatin.
N-Terminal Amino Acid Sequence Determination of Proteins by N-Terminal Dimethyl Labeling: Pitfalls and Advantages When Compared with Edman Degradation Sequence Analysis.

PubMed

Chang, Elizabeth; Pourmal, Sergei; Zhou, Chun; Kumar, Rupesh; Teplova, Marianna; Pavletich, Nikola P; Marians, Kenneth J; Erdjument-Bromage, Hediye

2016-07-01

In recent history, alternative approaches to Edman sequencing have been investigated, and to this end, the Association of Biomolecular Resource Facilities (ABRF) Protein Sequencing Research Group (PSRG) initiated studies in 2014 and 2015, looking into bottom-up and top-down N-terminal (Nt) dimethyl derivatization of standard quantities of intact proteins with the aim to determine Nt sequence information. We have expanded this initiative and used low picomole amounts of myoglobin to determine the efficiency of Nt-dimethylation. Application of this approach on protein domains, generated by limited proteolysis of overexpressed proteins, confirms that it is a universal labeling technique and is very sensitive when compared with Edman sequencing. Finally, we compared Edman sequencing and Nt-dimethylation of the same polypeptide fragments; results confirm that there is agreement in the identity of the Nt amino acid sequence between these 2 methods.
The amino acid sequence of Staphylococcus aureus penicillinase.

PubMed Central

Ambler, R P

1975-01-01

The amino acid sequence of the penicillinase (penicillin amido-beta-lactamhydrolase, EC 3.5.2.6) from Staphylococcus aureus strain PC1 was determined. The protein consists of a single polypeptide chain of 257 residues, and the sequence was determined by characterization of tryptic, chymotryptic, peptic and CNBr peptides, with some additional evidence from thermolysin and S. aureus proteinase peptides. A mistake in the preliminary report of the sequence is corrected; residues 113-116 are now thought to be -Lys-Lys-Val-Lys- rather than -Lys-Val-Lys-Lys-. Detailed evidence for the amino acid sequence has been deposited as Supplementary Publication SUP 50056 (91 pages) at the British Library (Lending Division), Boston Spa, Wetherby, West Yorkshire LS23 7BQ, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1975) 145, 5. PMID:1218078
Crotoxin: Structural Studies, Mechanism of Action and Cloning of its Gene

DTIC Science & Technology

1988-03-01

thirteen amino acids being acidic . Sequencing of the three peptides present in the acidic subunit, two of which are blocked by pyroglutamate ...the sequence determination of both the basic and acidic subunits of crotoxin- The acidic * subunit peptides were d!Tfficult, .sfi~n~e two of-ftflý...fluorescence spectroscopy. Results indicate a large conformational change occurs upon) ccmplex formation between the acidic and basic subunits of all four
Complete nucleotide and derived amino acid sequence of cDNA encoding the mitochondrial uncoupling protein of rat brown adipose tissue: lack of a mitochondrial targeting presequence.

PubMed Central

Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B

1986-01-01

A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. Images PMID:3012461
Arrays of nucleic acid probes on biological chips

DOEpatents

Chee, Mark; Cronin, Maureen T.; Fodor, Stephen P. A.; Huang, Xiaohua X.; Hubbell, Earl A.; Lipshutz, Robert J.; Lobban, Peter E.; Morris, MacDonald S.; Sheldon, Edward L.

1998-11-17

DNA chips containing arrays of oligonucleotide probes can be used to determine whether a target nucleic acid has a nucleotide sequence identical to or different from a specific reference sequence. The array of probes comprises probes exactly complementary to the reference sequence, as well as probes that differ by one or more bases from the exactly complementary probes.
Terminal region sequence variations in variola virus DNA.

PubMed

Massung, R F; Loparev, V N; Knight, J C; Totmenin, A V; Chizhikov, V E; Parsons, J M; Safronov, P F; Gutorov, V V; Shchelkunov, S N; Esposito, J J

1996-07-15

Genome DNA terminal region sequences were determined for a Brazilian alastrim variola minor virus strain Garcia-1966 that was associated with an 0.8% case-fatality rate and African smallpox strains Congo-1970 and Somalia-1977 associated with variola major (9.6%) and minor (0.4%) mortality rates, respectively. A base sequence identity of > or = 98.8% was determined after aligning 30 kb of the left- or right-end region sequences with cognate sequences previously determined for Asian variola major strains India-1967 (31% death rate) and Bangladesh-1975 (18.5% death rate). The deduced amino acid sequences of putative proteins of > or = 65 amino acids also showed relatively high identity, although the Asian and African viruses were clearly more related to each other than to alastrim virus. Alastrim virus contained only 10 of 70 proteins that were 100% identical to homologs in Asian strains, and 7 alastrim-specific proteins were noted.
Crotoxin: Structural Studies, Mechanism of Action and Cloning of Its Gene

DTIC Science & Technology

1987-03-01

other venoms and examine their toxin neutral- izing ability. The amino acid sequences of both crotoxin subunits were determined Is a prelude to cloning...be examined for their potential as anti-idiotype vaccines The complete amino acid sequence of the basic subunit and two of the three dic subunit chains...of crotoxin from the venom of C.d. terrificus has been de rmined. Sequence comparison data suggest that the non-toxic, acidic subunit was derived
Optical resolution of phenylthiohydantoin-amino acids by capillary electrophoresis and identification of the phenylthiohydantoin-D-amino acid residue of [D-Ala2]-methionine enkephalin.

PubMed

Kurosu, Y; Murayama, K; Shindo, N; Shisa, Y; Ishioka, N

1996-11-01

This is an initial report to propose a protein sequence analysis system with DL differentiation using capillary electrophoresis (CE). This system consists of a protein sequencer and a CE system. After fractionation of phenyl-thiohydantoin (PTH)-amino acids using a protein sequencer, optical resolution for each PTH-amino acid is performed by CE using some chiral selectors such as digitonin, beta-escin and others. As a model peptide, [D-Ala2]-methionine enkephalin (L-Tyr-D-Ala-Gly-L-Phe-L-Met), was used and the sequence with DL differentiation was determined, with the exception of the fourth amino acid, L-Phe, using our proposed system.
Cloning of the poly(ADP-ribose) Gene from Rat Liver.

DTIC Science & Technology

1986-09-24

Levinson, Ph.D. (Cetus Corp., Berkeley). 5. Amino acid analysis done in UCSF Bioanal. Lab. TABLE OF CONTENTS Page METHOD I...TABLE I ............. ............................... ... 12 Proteolytic degradation, isolation of peptide and amino acid sequences...technique developed for enzyme quantitation in biological materials. The amino- acid sequence of the enzyme has so far been determined because the amino
Nucleic Acid Detection Methods

DOEpatents

Smith, Cassandra L.; Yaar, Ron; Szafranski, Przemyslaw; Cantor, Charles R.

1998-05-19

The invention relates to methods for rapidly determining the sequence and/or length a target sequence. The target sequence may be a series of known or unknown repeat sequences which are hybridized to an array of probes. The hybridized array is digested with a single-strand nuclease and free 3'-hydroxyl groups extended with a nucleic acid polymerase. Nuclease cleaved heteroduplexes can be easily distinguish from nuclease uncleaved heteroduplexes by differential labeling. Probes and target can be differentially labeled with detectable labels. Matched target can be detected by cleaving resulting loops from the hybridized target and creating free 3-hydroxyl groups. These groups are recognized and extended by polymerases added into the reaction system which also adds or releases one label into solution. Analysis of the resulting products using either solid phase or solution. These methods can be used to detect characteristic nucleic acid sequences, to determine target sequence and to screen for genetic defects and disorders. Assays can be conducted on solid surfaces allowing for multiple reactions to be conducted in parallel and, if desired, automated.
A putative carbohydrate-binding domain of the lactose-binding Cytisus sessilifolius anti-H(O) lectin has a similar amino acid sequence to that of the L-fucose-binding Ulex europaeus anti-H(O) lectin.

PubMed

Konami, Y; Yamamoto, K; Osawa, T; Irimura, T

1995-04-01

The complete amino acid sequence of a lactose-binding Cytisus sessilifolius anti-H(O) lectin II (CSA-II) was determined using a protein sequencer. After digestion of CSA-II with endoproteinase Lys-C or Asp-N, the resulting peptides were purified by reversed-phase high performance liquid chromatography (HPLC) and then subjected to sequence analysis. Comparison of the complete amino acid sequence of CSA-II with the sequences of other leguminous seed lectins revealed regions of extensive homology. The amino acid sequence of a putative carbohydrate-binding domain of CSA-II was found to be similar to those of several anti-H(O) leguminous lectins, especially to that of the L-fucose-binding Ulex europaeus lectin I (UEA-I).
Complete cDNA sequence and amino acid analysis of a bovine ribonuclease K6 gene.

PubMed

Pietrowski, D; Förster, M

2000-01-01

The complete cDNA sequence of a ribonuclease k6 gene of Bos Taurus has been determined. It codes for a protein with 154 amino acids and contains the invariant cysteine, histidine and lysine residues as well as the characteristic motifs specific to ribonuclease active sites. The deduced protein sequence is 27 residues longer than other known ribonucleases k6 and shows amino acids exchanges which could reflect a strain specificity or polymorphism within the bovine genome. Based on sequence similarity we have termed the identified gene bovine ribonuclease k6 b (brk6b).
Asymmetric scoring functions for proteins

NASA Astrophysics Data System (ADS)

Lezon, Timothy; Holter, Neal; Maritan, Amos; Banavar, Jayanth

2003-03-01

The protein folding problem entails the prediction of the native state structure of a protein given the sequence of amino acids. In a coarse-grained description of a protein, an important ingredient for attempting this task is the determination of the effective energies of interaction between amino acids. We will discuss a simple approach for determining such interaction potentials from a training set of protein sequences and their experimentally determined native state structures. The key new ingredient in our study is the incorporation of the lack of symmetry in the effective interactions between amino acids. Our results, obtained using a set of 513 proteins, and their implications will be discussed.
Nucleotide sequence of the phosphoglycerate kinase gene from the extreme thermophile Thermus thermophilus. Comparison of the deduced amino acid sequence with that of the mesophilic yeast phosphoglycerate kinase.

PubMed Central

Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L

1988-01-01

Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. Images Fig. 1. PMID:3052437
Composition for nucleic acid sequencing

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2008-08-26

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-06-06

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Method for sequencing nucleic acid molecules

DOEpatents

Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

2006-05-30

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

The complete amino acid sequence of human skeletal-muscle fructose-bisphosphate aldolase.

PubMed Central

Freemont, P S; Dunbar, B; Fothergill-Gilmore, L A

1988-01-01

The complete amino acid sequence of human skeletal-muscle fructose-bisphosphate aldolase, comprising 363 residues, was determined. The sequence was deduced by automated sequencing of CNBr-cleavage, o-iodosobenzoic acid-cleavage, trypsin-digest and staphylococcal-proteinase-digest fragments. Comparison of the sequence with other class I aldolase sequences shows that the mammalian muscle isoenzyme is one of the most highly conserved enzymes known, with only about 2% of the residues changing per 100 million years. Non-mammalian aldolases appear to be evolving at the same rate as other glycolytic enzymes, with about 4% of the residues changing per 100 million years. Secondary-structure predictions are analysed in an accompanying paper [Sawyer, Fothergill-Gilmore & Freemont (1988) Biochem. J. 249, 789-793]. PMID:3355497
Application of 2D graphic representation of protein sequence based on Huffman tree method.

PubMed

Qi, Zhao-Hui; Feng, Jun; Qi, Xiao-Qin; Li, Ling

2012-05-01

Based on Huffman tree method, we propose a new 2D graphic representation of protein sequence. This representation can completely avoid loss of information in the transfer of data from a protein sequence to its graphic representation. The method consists of two parts. One is about the 0-1 codes of 20 amino acids by Huffman tree with amino acid frequency. The amino acid frequency is defined as the statistical number of an amino acid in the analyzed protein sequences. The other is about the 2D graphic representation of protein sequence based on the 0-1 codes. Then the applications of the method on ten ND5 genes and seven Escherichia coli strains are presented in detail. The results show that the proposed model may provide us with some new sights to understand the evolution patterns determined from protein sequences and complete genomes. Copyright © 2012 Elsevier Ltd. All rights reserved.
Nucleotide sequence analysis of the gene encoding the Deinococcus radiodurans surface protein, derived amino acid sequence, and complementary protein chemical studies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Peters, J.; Peters, M.; Lottspeich, F.

1987-11-01

The complete nucleotide sequence of the gene encoding the surface (hexagonally packed intermediate (HPI))-layer polypeptide of Deinococcus radiodurans Sark was determined and found to encode a polypeptide of 1036 amino acids. Amino acid sequence analysis of about 30% of the residues revealed that the mature polypeptide consists of at least 978 amino acids. The N terminus was blocked to Edman degradation. The results of proteolytic modification of the HPI layer in situ and M/sub r/ estimations of the HPI polypeptide expressed in Escherichia coli indicated that there is a leader sequence. The N-terminal region contained a very high percentage (29%)more » of threonine and serine, including a cluster of nine consecutive serine or threonine residues, whereas a stretch near the C terminus was extremely rich in aromatic amino acids (29%). The protein contained at least two disulfide bridges, as well as tightly bound reducing sugars and fatty acids.« less
Nucleic acid detection methods

DOEpatents

Smith, C.L.; Yaar, R.; Szafranski, P.; Cantor, C.R.

1998-05-19

The invention relates to methods for rapidly determining the sequence and/or length a target sequence. The target sequence may be a series of known or unknown repeat sequences which are hybridized to an array of probes. The hybridized array is digested with a single-strand nuclease and free 3{prime}-hydroxyl groups extended with a nucleic acid polymerase. Nuclease cleaved heteroduplexes can be easily distinguish from nuclease uncleaved heteroduplexes by differential labeling. Probes and target can be differentially labeled with detectable labels. Matched target can be detected by cleaving resulting loops from the hybridized target and creating free 3-hydroxyl groups. These groups are recognized and extended by polymerases added into the reaction system which also adds or releases one label into solution. Analysis of the resulting products using either solid phase or solution. These methods can be used to detect characteristic nucleic acid sequences, to determine target sequence and to screen for genetic defects and disorders. Assays can be conducted on solid surfaces allowing for multiple reactions to be conducted in parallel and, if desired, automated. 18 figs.
Cloning and purification of alpha-neurotoxins from king cobra (Ophiophagus hannah).

PubMed

He, Ying-Ying; Lee, Wei-Hui; Zhang, Yun

2004-09-01

Thirteen complete and three partial cDNA sequences were cloned from the constructed king cobra (Ophiophagus hannah) venom gland cDNA library. Phylogenetic analysis of nucleotide sequences of king cobra with those from other snake venoms revealed that obtained cDNAs are highly homologous to snake venom alpha-neurotoxins. Alignment of deduced mature peptide sequences of the obtained clones with those of other reported alpha-neurotoxins from the king cobra venom indicates that our obtained 16 clones belong to long-chain neurotoxins (seven), short-chain neurotoxins (seven), weak toxin (one) and variant (one), respectively. Up to now, two out of 16 newly cloned king cobra alpha-neurotoxins have identical amino acid sequences with CM-11 and Oh-6A/6B, which have been characterized from the same venom. Furthermore, five long-chain alpha-neurotoxins and two short-chain alpha-neurotoxins were purified from crude venom and their N-terminal amino acid sequences were determined. The cDNAs encoding the putative precursors of the purified native peptide were also determined based on the N-terminal amino acid sequencing. The purified alpha-neurotoxins showed different lethal activities on mice.
Molecular cloning and sequence analysis of the gene coding for the 57kDa soluble antigen of the salmonid fish pathogen Renibacterium salmoninarum

USGS Publications Warehouse

Chien, Maw-Sheng; Gilbert , Teresa L.; Huang, Chienjin; Landolt, Marsha L.; O'Hara, Patrick J.; Winton, James R.

1992-01-01

The complete sequence coding for the 57-kDa major soluble antigen of the salmonid fish pathogen, Renibacterium salmoninarum, was determined. The gene contained an opening reading frame of 1671 nucleotides coding for a protein of 557 amino acids with a calculated Mr value of 57190. The first 26 amino acids constituted a signal peptide. The deduced sequence for amino acid residues 27–61 was in agreement with the 35 N-terminal amino acid residues determined by microsequencing, suggesting the protein in synthesized as a 557-amino acid precursor and processed to produce a mature protein of Mr 54505. Two regions of the protein contained imperfect direct repeats. The first region contained two copies of an 81-residue repeat, the second contained five copies of an unrelated 25-residue repeat. Also, a perfect inverted repeat (including three in-frame UAA stop codons) was observed at the carboxyl-terminus of the gene.
alpha-Amylase gene of Streptomyces limosus: nucleotide sequence, expression motifs, and amino acid sequence homology to mammalian and invertebrate alpha-amylases.

PubMed Central

Long, C M; Virolle, M J; Chang, S Y; Chang, S; Bibb, M J

1987-01-01

The nucleotide sequence of the coding and regulatory regions of the alpha-amylase gene (aml) of Streptomyces limosus was determined. High-resolution S1 mapping was used to locate the 5' end of the transcript and demonstrated that the gene is transcribed from a unique promoter. The predicted amino acid sequence has considerable identity to mammalian and invertebrate alpha-amylases, but not to those of plant, fungal, or eubacterial origin. Consistent with this is the susceptibility of the enzyme to an inhibitor of mammalian alpha-amylases. The amino-terminal sequence of the extracellular enzyme was determined, revealing the presence of a typical signal peptide preceding the mature form of the alpha-amylase. Images PMID:3500166
Crotoxin: Structural Studies, Mechanism of Action and Cloning of Its gene

DTIC Science & Technology

1989-12-01

B-chain. Sequencing of the three peptides present in the acidic subunit, two of which are blocked by pyroglutamate , represents a significant...We have completed the sequence determination of both the basic and acidic subunits of crotoxin. The acidic subunit peptides were difficult, since two...of the three peptides were blocked at the amino-terminus by pyroglutamate . Earlier structural studies on crotoxin and related crotalid dimeric
Determination of a mutational spectrum

DOEpatents

Thilly, William G.; Keohavong, Phouthone

1991-01-01

A method of resolving (physically separating) mutant DNA from nonmutant DNA and a method of defining or establishing a mutational spectrum or profile of alterations present in nucleic acid sequences from a sample to be analyzed, such as a tissue or body fluid. The present method is based on the fact that it is possible, through the use of DGGE, to separate nucleic acid sequences which differ by only a single base change and on the ability to detect the separate mutant molecules. The present invention, in another aspect, relates to a method for determining a mutational spectrum in a DNA sequence of interest present in a population of cells. The method of the present invention is useful as a diagnostic or analytical tool in forensic science in assessing environmental and/or occupational exposures to potentially genetically toxic materials (also referred to as potential mutagens); in biotechnology, particularly in the study of the relationship between the amino acid sequence of enzymes and other biologically-active proteins or protein-containing substances and their respective functions; and in determining the effects of drugs, cosmetics and other chemicals for which toxicity data must be obtained.
The TGA codons are present in the open reading frame of selenoprotein P cDNA

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hill, K.E.; Lloyd, R.S.; Read, R.

1991-03-11

The TGA codon in DNA has been shown to direct incorporation of selenocysteine into protein. Several proteins from bacteria and animals contain selenocysteine in their primary structures. Each of the cDNA clones of these selenoproteins contains one TGA codon in the open reading frame which corresponds to the selenocysteine in the protein. A cDNA clone for selenoprotein P (SeP), obtained from a {gamma}ZAP rat liver library, was sequenced by the dideoxy termination method. The correct reading frame was determined by comparison of the deduced amino acid sequence with the amino acid sequence of several peptides from SeP. Using SeP labelledmore » with {sup 75}Se in vivo, the selenocysteine content of the peptides was verified by the collection of carboxymethylated {sup 77}Se-selenocysteine as it eluted from the amino acid analyzer and determination of the radioactivity contained in the collected samples. Ten TGA codons are present in the open reading frame of the cDNA. Peptide fragmentation studies and the deduced sequence indicate that selenium-rich regions are located close to the carboxy terminus. Nine of the 10 selenocysteines are located in the terminal 26% of the sequence with four in the terminal 15 amino acids. The deduced sequence codes for a protein of 385 amino acids. Cleavage of the signal peptide gives the mature protein with 366 amino acids and a calculated mol wt of 41,052 Da. Searches of PIR and SWISSPROT protein databases revealed no similarity with glutathione peroxidase or other selenoproteins.« less
Labeled nucleotide phosphate (NP) probes

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2009-02-03

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Brain cDNA clone for human cholinesterase

DOE Office of Scientific and Technical Information (OSTI.GOV)

McTiernan, C.; Adkins, S.; Chatonnet, A.

1987-10-01

A cDNA library from human basal ganglia was screened with oligonucleotide probes corresponding to portions of the amino acid sequence of human serum cholinesterase. Five overlapping clones, representing 2.4 kilobases, were isolated. The sequenced cDNA contained 207 base pairs of coding sequence 5' to the amino terminus of the mature protein in which there were four ATG translation start sites in the same reading frame as the protein. Only the ATG coding for Met-(-28) lay within a favorable consensus sequence for functional initiators. There were 1722 base pairs of coding sequence corresponding to the protein found circulating in human serum.more » The amino acid sequence deduced from the cDNA exactly matched the 574 amino acid sequence of human serum cholinesterase, as previously determined by Edman degradation. Therefore, our clones represented cholinesterase rather than acetylcholinesterase. It was concluded that the amino acid sequences of cholinesterase from two different tissues, human brain and human serum, were identical. Hybridization of genomic DNA blots suggested that a single gene, or very few genes coded for cholinesterase.« less
Complete covalent structure of statherin, a tyrosine-rich acidic peptide which inhibits calcium phosphate precipitation from human parotid saliva.

PubMed

Schlesinger, D H; Hay, D I

1977-03-10

The complete amino acid sequence of human salivary statherin, a peptide which strongly inhibits precipitation from supersaturated calcium phosphate solutions, and therefore stabilizes supersaturated saliva, has been determined. The NH2-terminal half of this Mr=5380 (43 amino acids) polypeptide was determined by automated Edman degradations (liquid phase) on native statherin. The peptide was digested separately with trypsin, chymotrypsin, and Staphylococcus aureus protease, and the resulting peptides were purified by gel filtration. Manual Edman degradations on purified peptide fragments yielded peptides that completed the amino acid sequence through the penultimate COOH-terminal residue. These analyses, together with carboxypeptidase digestion of native statherin and of peptide fragments of statherin, established the complete sequence of the molecule. The 2 serine residues (positions 2 and 3) in statherin were identified as phosphoserine. The amino acid sequence of human salivary statherin is striking in a number of ways. The NH2-terminal one-third is highly polar and includes three polar dipeptides: H2PO3-Ser-Ser-H2PO3-Arg-Arg-, and Glu-Glu-. The COOH-terminal two-thirds of the molecule is hydrophobic, containing several repeating dipeptides: four of -Gn-Pro-, three of -Tyr-Gln-, two of -Gly-Tyr-, two of-Gln-Tyr-, and two of the tetrapeptide sequence -Pro-Tyr-Gln-Pro-. Unusual cleavage sites in the statherin sequence obtained with chymotrypsin and S. aureus protease were also noted.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Leong, JoAnn Ching

The nucleotide sequence of the IHNV glycoprotein gene has been determined from a cDNA clone containing the entire coding region. The glycoprotein cDNA clone contained a leader sequence of 48 bases, a coding region of 1524 nucleotides, and 39 bases at the 3 foot end. The entire cDNA clone contains 1609 nucleodites and encodes a protein of 508 amino acids. The deduced amino acid sequence gave a translated molecular weight of 56,795 daltons. A hydropathicity profile of the deduced amino acid sequence indicated that there were two major hydrophobic domains: one,at the N-terminus,delineating a signal peptide of 18 amino acidsmore » and the other, at the C-terminus,delineating the region of the transmembrane. Five possible sites of N-linked glyscoylation were identified. Although no nucleic acid homology existed between the IHNV glycoprotein gene and the glycoprotein genes of rabies and VSV, there was significant homology at the amino acid level between all three rhabdovirus glycoproteins.« less
Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection.

PubMed

Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T; Carr, Christopher E

2017-08-01

Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry-dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a "universal" nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars. Key Words: Life-detection instruments-Nucleic acids-Mars-Panspermia. Astrobiology 17, 747-760.
Cloning and characterization of acid invertase genes in the roots of the metallophyte Kummerowia stipulacea (Maxim.) Makino from two populations: Differential expression under copper stress.

PubMed

Zhang, Luan; Xiong, Zhi-ting; Xu, Zhong-rui; Liu, Chen; Cai, Shen-wen

2014-06-01

The roots of metallophytes serve as the key interface between plants and heavy metal-contaminated underground environments. It is known that the roots of metallicolous plants show a higher activity of acid invertase enzymes than those of non-metallicolous plants when under copper stress. To test whether the higher activity of acid invertases is the result of increased expression of acid invertase genes or variations in the amino acid sequences between the two population types, we isolated full cDNAs for acid invertases from two populations of Kummerowia stipulacea (from metalliferous and non-metalliferous soils), determined their nucleotide sequences, expressed them in Pichia pastoris, and conducted real-time PCR to determine differences in transcript levels during Cu stress. Heterologous expression of acid invertase cDNAs in P. pastoris indicated that variations in the amino acid sequences of acid invertases between the two populations played no significant role in determining enzyme characteristics. Seedlings of K. stipulacea were exposed to 0.3µM Cu(2+) (control) and 10µM Cu(2+) for 7 days under hydroponics׳ conditions. The transcript levels of acid invertases in metallicolous plants were significantly higher than in non-metallicolous plants when under copper stress. The results suggest that the expression of acid invertase genes in metallicolous plants of K. stipulacea differed from those in non-metallicolous plants under such conditions. In addition, the sugars may play an important role in regulating the transcript level of acid invertase genes and acid invertase genes may also be involved in root/shoot biomass allocation. Copyright © 2014 Elsevier Inc. All rights reserved.
The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase.

PubMed Central

Haggarty, N W; Dunbar, B; Fothergill, L A

1983-01-01

The complete amino acid sequence of human erythrocyte diphosphoglycerate mutase, comprising 239 residues, was determined. The sequence was deduced from the four cyanogen bromide fragments, and from the peptides derived from these fragments after digestion with a number of proteolytic enzymes. Comparison of this sequence with that of the yeast glycolytic enzyme, phosphoglycerate mutase, shows that these enzymes are 47% identical. Most, but not all, of the residues implicated as being important for the activity of the glycolytic mutase are conserved in the erythrocyte diphosphoglycerate mutase. PMID:6313356
Nucleic acid analysis using terminal-phosphate-labeled nucleotides

DOEpatents

Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY

2008-04-22

The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Amino acid sequences of peptides from a tryptic digest of a urea-soluble protein fraction (U.S.3) from oxidized wool

PubMed Central

Corfield, M. C.; Fletcher, J. C.; Robson, A.

1967-01-01

1. A tryptic digest of the protein fraction U.S.3 from oxidized wool has been separated into 32 peptide fractions by cation-exchange resin chromatography. 2. Most of these fractions have been resolved into their component peptides by a combination of the techniques of cation-exchange resin chromatography, paper chromatography and paper electrophoresis. 3. The amino acid compositions of 58 of the peptides in the digest present in the largest amounts have been determined. 4. The amino acid sequences of 38 of these have been completely elucidated and those of six others partially derived. 5. These findings indicate that the parent protein in wool from which the protein fraction U.S.3 is derived has a minimum molecular weight of 74000. 6. The structures of wool proteins are discussed in the light of the peptide sequences determined, and, in particular, of those sequences in fraction U.S.3 that could not be elucidated. PMID:16742497
Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection

NASA Astrophysics Data System (ADS)

Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T.; Carr, Christopher E.

2017-08-01

Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry-dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a "universal" nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars.

Investigation of the protein osteocalcin of Camelops hesternus: Sequence, structure and phylogenetic implications

NASA Astrophysics Data System (ADS)

Humpula, James F.; Ostrom, Peggy H.; Gandhi, Hasand; Strahler, John R.; Walker, Angela K.; Stafford, Thomas W.; Smith, James J.; Voorhies, Michael R.; George Corner, R.; Andrews, Phillip C.

2007-12-01

Ancient DNA sequences offer an extraordinary opportunity to unravel the evolutionary history of ancient organisms. Protein sequences offer another reservoir of genetic information that has recently become tractable through the application of mass spectrometric techniques. The extent to which ancient protein sequences resolve phylogenetic relationships, however, has not been explored. We determined the osteocalcin amino acid sequence from the bone of an extinct Camelid (21 ka, Camelops hesternus) excavated from Isleta Cave, New Mexico and three bones of extant camelids: bactrian camel ( Camelus bactrianus); dromedary camel ( Camelus dromedarius) and guanaco ( Llama guanacoe) for a diagenetic and phylogenetic assessment. There was no difference in sequence among the four taxa. Structural attributes observed in both modern and ancient osteocalcin include a post-translation modification, Hyp 9, deamidation of Gln 35 and Gln 39, and oxidation of Met 36. Carbamylation of the N-terminus in ancient osteocalcin may result in blockage and explain previous difficulties in sequencing ancient proteins via Edman degradation. A phylogenetic analysis using osteocalcin sequences of 25 vertebrate taxa was conducted to explore osteocalcin protein evolution and the utility of osteocalcin sequences for delineating phylogenetic relationships. The maximum likelihood tree closely reflected generally recognized taxonomic relationships. For example, maximum likelihood analysis recovered rodents, birds and, within hominins, the Homo-Pan-Gorilla trichotomy. Within Artiodactyla, character state analysis showed that a substitution of Pro 4 for His 4 defines the Capra-Ovis clade within Artiodactyla. Homoplasy in our analysis indicated that osteocalcin evolution is not a perfect indicator of species evolution. Limited sequence availability prevented assigning functional significance to sequence changes. Our preliminary analysis of osteocalcin evolution represents an initial step towards a complete character analysis aimed at determining the evolutionary history of this functionally significant protein. We emphasize that ancient protein sequencing and phylogenetic analyses using amino acid sequences must pay close attention to post-translational modifications, amino acid substitutions due to diagenetic alteration and the impacts of isobaric amino acids on mass shifts and sequence alignments.
1,4-Benzoquinone reductase from Phanerochaete chrysosporium: cDNA cloning and regulation of expression

DOE Office of Scientific and Technical Information (OSTI.GOV)

Akileswaran, L.; Brock, B.J.; Cereghino, J.L.

1999-02-01

A cDNA clone encoding a quinone reductase (QR) from the white rot basidiomycete Phanerochaete chrysosporium was isolated and sequenced. The cDNA consisted of 1,007 nucleotides and a poly(A) tail and encoded a deduced protein containing 271 amino acids. The experimentally determined eight-amino-acid N-germinal sequence of the purified QR protein from P. chrysosporium matched amino acids 72 to 79 of the predicted translation product of the cDNA. The M{sub r} of the predicted translation product, beginning with Pro-72, was essentially identical to the experimentally determined M{sub r} of one monomer of the QR dimer, and this finding suggested that QR ismore » synthesized as a proenzyme. The results of in vitro transcription-translation experiments suggested that QR is synthesized as a proenzyme with a 71-amino-acid leader sequence. This leader sequence contains two potential KEX2 cleavage sites and numerous potential cleavage sites for dipeptidyl aminopeptidase. The QR activity in cultures of P. chrysosporium increased following the addition of 2-dimethoxybenzoquinone, vanillic acid, or several other aromatic compounds. An immunoblot analysis indicated that induction resulted in an increase in the amount of QR protein, and a Northern blot analysis indicated that this regulation occurs at the level of the qr mRNA.« less
Comparative In silico Study of Sex-Determining Region Y (SRY) Protein Sequences Involved in Sex-Determining.

PubMed

Vakili Azghandi, Masoume; Nasiri, Mohammadreza; Shamsa, Ali; Jalali, Mohsen; Shariati, Mohammad Mahdi

2016-04-01

The SRY gene (SRY) provides instructions for making a transcription factor called the sex-determining region Y protein. The sex-determining region Y protein causes a fetus to develop as a male. In this study, SRY of 15 spices included of human, chimpanzee, dog, pig, rat, cattle, buffalo, goat, sheep, horse, zebra, frog, urial, dolphin and killer whale were used for determine of bioinformatic differences. Nucleotide sequences of SRY were retrieved from the NCBI databank. Bioinformatic analysis of SRY is done by CLC Main Workbench version 5.5 and ClustalW (http:/www.ebi.ac.uk/clustalw/) and MEGA6 softwares. The multiple sequence alignment results indicated that SRY protein sequences from Orcinus orca (killer whale) and Tursiopsaduncus (dolphin) have least genetic distance of 0.33 in these 15 species and are 99.67% identical at the amino acid level. Homosapiens and Pantroglodytes (chimpanzee) have the next lowest genetic distance of 1.35 and are 98.65% identical at the amino acid level. These findings indicate that the SRY proteins are conserved in the 15 species, and their evolutionary relationships are similar.
Mutations in type 3 reovirus that determine binding to sialic acid are contained in the fibrous tail domain of viral attachment protein sigma1.

PubMed

Chappell, J D; Gunn, V L; Wetzel, J D; Baer, G S; Dermody, T S

1997-03-01

The reovirus attachment protein, sigma1, determines numerous aspects of reovirus-induced disease, including viral virulence, pathways of spread, and tropism for certain types of cells in the central nervous system. The sigma1 protein projects from the virion surface and consists of two distinct morphologic domains, a virion-distal globular domain known as the head and an elongated fibrous domain, termed the tail, which is anchored into the virion capsid. To better understand structure-function relationships of sigma1 protein, we conducted experiments to identify sequences in sigma1 important for viral binding to sialic acid, a component of the receptor for type 3 reovirus. Three serotype 3 reovirus strains incapable of binding sialylated receptors were adapted to growth in murine erythroleukemia (MEL) cells, in which sialic acid is essential for reovirus infectivity. MEL-adapted (MA) mutant viruses isolated by serial passage in MEL cells acquired the capacity to bind sialic acid-containing receptors and demonstrated a dependence on sialic acid for infection of MEL cells. Analysis of reassortant viruses isolated from crosses of an MA mutant virus and a reovirus strain that does not bind sialic acid indicated that the sigma1 protein is solely responsible for efficient growth of MA mutant viruses in MEL cells. The deduced sigma1 amino acid sequences of the MA mutant viruses revealed that each strain contains a substitution within a short region of sequence in the sigma1 tail predicted to form beta-sheet. These studies identify specific sequences that determine the capacity of reovirus to bind sialylated receptors and suggest a location for a sialic acid-binding domain. Furthermore, the results support a model in which type 3 sigma1 protein contains discrete receptor binding domains, one in the head and another in the tail that binds sialic acid.
Phylogenetic Relationship of Necoclí Virus to Other South American Hantaviruses (Bunyaviridae: Hantavirus).

PubMed

Montoya-Ruiz, Carolina; Cajimat, Maria N B; Milazzo, Mary Louise; Diaz, Francisco J; Rodas, Juan David; Valbuena, Gustavo; Fulhorst, Charles F

2015-07-01

The results of a previous study suggested that Cherrie's cane rat (Zygodontomys cherriei) is the principal host of Necoclí virus (family Bunyaviridae, genus Hantavirus) in Colombia. Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences in this study confirmed that Necoclí virus is phylogenetically closely related to Maporal virus, which is principally associated with the delicate pygmy rice rat (Oligoryzomys delicatus) in western Venezuela. In pairwise comparisons, nonidentities between the complete amino acid sequence of the nucleocapsid protein of Necoclí virus and the complete amino acid sequences of the nucleocapsid proteins of other hantaviruses were ≥8.7%. Likewise, nonidentities between the complete amino acid sequence of the glycoprotein precursor of Necoclí virus and the complete amino acid sequences of the glycoprotein precursors of other hantaviruses were ≥11.7%. Collectively, the unique association of Necoclí virus with Z. cherriei in Colombia, results of the Bayesian analyses of complete nucleocapsid protein gene sequences and complete glycoprotein precursor gene sequences, and results of the pairwise comparisons of amino acid sequences strongly support the notion that Necoclí virus represents a novel species in the genus Hantavirus. Further work is needed to determine whether Calabazo virus (a hantavirus associated with Z. brevicauda cherriei in Panama) and Necoclí virus are conspecific.
Molecular cloning and characterization of a gene encoding glutaminase from Aspergillus oryzae.

PubMed

Koibuchi, K; Nagasaki, H; Yuasa, A; Kataoka, J; Kitamoto, K

2000-07-01

A glutaminase from Aspergillus oryzae was purified and its molecular weight was determined to be 82,091 by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Purified glutaminase catalysed the hydrolysis not only of L-glutamine but also of D-glutamine. Both the molecular weight and the substrate specificity of this glutaminase were different from those reported previously [Yano et al. (1998) J Ferment Technol 66: 137-143]. On the basis of its internal amino acid sequences, we have isolated and characterized the glutaminase gene (gtaA) from A. oryzae. The gtaA gene had an open reading frame coding for 690 amino acid residues, including a signal peptide of 20 amino acid residues and a mature protein of 670 amino acid residues. In the 5'-flanking region of the gene, there were three putative CreAp binding sequences and one putative AreAp binding sequence. The gtaA structural gene was introduced into A. oryzae NS4 and a marked increase in activity was detected in comparison with the control strain. The gtaA gene was also isolated from Aspergillus nidulans on the basis of the determined nucleotide sequence of the gtaA gene from A. oryzae.
Amino acid sequences of ribosomal proteins S11 from Bacillus stearothermophilus and S19 from Halobacterium marismortui. Comparison of the ribosomal protein S11 family.

PubMed

Kimura, M; Kimura, J; Hatakeyama, T

1988-11-21

The complete amino acid sequences of ribosomal proteins S11 from the Gram-positive eubacterium Bacillus stearothermophilus and of S19 from the archaebacterium Halobacterium marismortui have been determined. A search for homologous sequences of these proteins revealed that they belong to the ribosomal protein S11 family. Homologous proteins have previously been sequenced from Escherichia coli as well as from chloroplast, yeast and mammalian ribosomes. A pairwise comparison of the amino acid sequences showed that Bacillus protein S11 shares 68% identical residues with S11 from Escherichia coli and a slightly lower homology (52%) with the homologous chloroplast protein. The halophilic protein S19 is more related to the eukaryotic (45-49%) than to the eubacterial counterparts (35%).
Evaluating the efficacy of a structure-derived amino acid substitution matrix in detecting protein homologs by BLAST and PSI-BLAST.

PubMed

Goonesekere, Nalin Cw

2009-01-01

The large numbers of protein sequences generated by whole genome sequencing projects require rapid and accurate methods of annotation. The detection of homology through computational sequence analysis is a powerful tool in determining the complex evolutionary and functional relationships that exist between proteins. Homology search algorithms employ amino acid substitution matrices to detect similarity between proteins sequences. The substitution matrices in common use today are constructed using sequences aligned without reference to protein structure. Here we present amino acid substitution matrices constructed from the alignment of a large number of protein domain structures from the structural classification of proteins (SCOP) database. We show that when incorporated into the homology search algorithms BLAST and PSI-blast, the structure-based substitution matrices enhance the efficacy of detecting remote homologs.
Sequencing proteins with transverse ionic transport in nanochannels.

PubMed

Boynton, Paul; Di Ventra, Massimiliano

2016-05-03

De novo protein sequencing is essential for understanding cellular processes that govern the function of living organisms and all sequence modifications that occur after a protein has been constructed from its corresponding DNA code. By obtaining the order of the amino acids that compose a given protein one can then determine both its secondary and tertiary structures through structure prediction, which is used to create models for protein aggregation diseases such as Alzheimer's Disease. Here, we propose a new technique for de novo protein sequencing that involves translocating a polypeptide through a synthetic nanochannel and measuring the ionic current of each amino acid through an intersecting perpendicular nanochannel. We find that the distribution of ionic currents for each of the 20 proteinogenic amino acids encoded by eukaryotic genes is statistically distinct, showing this technique's potential for de novo protein sequencing.
The primary structure of the thymidine kinase gene of fish lymphocystis disease virus.

PubMed

Schnitzler, P; Handermann, M; Szépe, O; Darai, G

1991-06-01

The DNA nucleotide sequence of the thymidine kinase (TK) gene of fish lymphocystis disease virus (FLDV) which has been localized between the coordinates 0.678 to 0.688 of the viral genome was determined. The analysis of the DNA nucleotide sequence located between the recognition sites of HindIII (0.669 map unit; nucleotide position 1) and AccI (nucleotide position 2032) revealed the presence of an open reading frame of 954 bp on the lower strand of this region between nucleotide positions 1868 (ATG) and 915 (TAA). It encodes for a protein of 318 amino acid residues. The evolutionary relationships of the TK gene of FLDV to the other known TK genes was investigated using the method of progressive sequence alignment. These analyses revealed a high degree of diversity between the protein sequence of FLDV TK gene and the amino acid composition of other TKs tested. However, significant conservations were detected at several regions of amino acid residues of the FLDV TK protein when compared to the amino acid sequence of TKs of African swine fever virus, fowlpox virus, shope fibroma virus, and vaccinia virus and to the amino acid sequences of the cellular cytoplasmic TK of chicken, mouse, and man.
The primary structure of aspartate aminotransferase from pig heart muscle. Partial sequences determined by digestion with thermolysin and elastase

PubMed Central

Bossa, Francesco; Barra, Donatella; Carloni, Massimo; Fasella, Paolo; Riva, Francesca; Doonan, Shawn; Doonan, Hilary J.; Hanford, Robin; Vernon, Charles A.; Walker, John M.

1973-01-01

Peptides produced by thermolytic digestion of aminoethylated aspartate aminotransferase and of the oxidized enzyme were isolated and their amino acid sequences determined. Digestion by elastase of the carboxymethylated enzyme gave peptides representing approximately 40% of the primary structure. Fragments from these digests overlapped with previously reported sequences of peptides obtained by peptic and tryptic digestion (Doonan et al., 1972), giving ten composite peptides containing 395 amino acid residues. The amino acid composition of these composite peptides agrees well with that of the intact enzyme. Confirmatory results for some of the present data have been deposited as Supplementary Publication 50018 at the National Lending Library for Science and Technology, Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies can be obtained on the terms indicated in Biochem. J. (1973) 131, 5. PMID:4748834
Structure and Sequence Search on Aptamer-Protein Docking

NASA Astrophysics Data System (ADS)

Xiao, Jiajie; Bonin, Keith; Guthold, Martin; Salsbury, Freddie

2015-03-01

Interactions between proteins and deoxyribonucleic acid (DNA) play a significant role in the living systems, especially through gene regulation. However, short nucleic acids sequences (aptamers) with specific binding affinity to specific proteins exhibit clinical potential as therapeutics. Our capillary and gel electrophoresis selection experiments show that specific sequences of aptamers can be selected that bind specific proteins. Computationally, given the experimentally-determined structure and sequence of a thrombin-binding aptamer, we can successfully dock the aptamer onto thrombin in agreement with experimental structures of the complex. In order to further study the conformational flexibility of this thrombin-binding aptamer and to potentially develop a predictive computational model of aptamer-binding, we use GPU-enabled molecular dynamics simulations to both examine the conformational flexibility of the aptamer in the absence of binding to thrombin, and to determine our ability to fold an aptamer. This study should help further de-novo predictions of aptamer sequences by enabling the study of structural and sequence-dependent effects on aptamer-protein docking specificity.
Sequence signatures of allosteric proteins towards rational design.

PubMed

Namboodiri, Saritha; Verma, Chandra; Dhar, Pawan K; Giuliani, Alessandro; Nair, Achuthsankar S

2010-12-01

Allostery is the phenomenon of changes in the structure and activity of proteins that appear as a consequence of ligand binding at sites other than the active site. Studying mechanistic basis of allostery leading to protein design with predetermined functional endpoints is an important unmet need of synthetic biology. Here, we screened the amino acid sequence landscape in search of sequence-signatures of allostery using Recurrence Quantitative Analysis (RQA) method. A characteristic vector, comprised of 10 features extracted from RQA was defined for amino acid sequences. Using Principal Component Analysis, four factors were found to be important determinants of allosteric behavior. Our sequence-based predictor method shows 82.6% accuracy, 85.7% sensitivity and 77.9% specificity with the current dataset. Further, we show that Laminarity-Mean-hydrophobicity representing repeated hydrophobic patches is the most crucial indicator of allostery. To our best knowledge this is the first report that describes sequence determinants of allostery based on hydrophobicity. As an outcome of these findings, we plan to explore possibility of inducing allostery in proteins.
The amino acid sequence around the active-site cysteine and histidine residues of stem bromelain

PubMed Central

Husain, S. S.; Lowe, G.

1970-01-01

Stem bromelain that had been irreversibly inhibited with 1,3-dibromo[2-14C]-acetone was reduced with sodium borohydride and carboxymethylated with iodoacetic acid. After digestion with trypsin and α-chymotrypsin three radioactive peptides were isolated chromatographically. The amino acid sequences around the cross-linked cysteine and histidine residues were determined and showed a high degree of homology with those around the active-site cysteine and histidine residues of papain and ficin. PMID:5420046
Amino acid sequence of a trypsin inhibitor from a Spirometra (Spirometra erinaceieuropaei).

PubMed

Sanda, A; Uchida, A; Itagaki, T; Kobayashi, H; Inokuchi, N; Koyama, T; Iwama, M; Ohgi, K; Irie, M

2001-12-01

A trypsin inhibitor that is highly homologous with bovine pancreatic trypsin inhibitor (BPTI) was co-purified along with RNase from Spirometra (Spirometra erinaceieuropaei). The amino acid sequence of this inhibitor (SETI) and the nucleotide sequence of the cDNA encoding this protein were determined by protein chemistry and gene technology. SETI contains 68 amino acid residues and has a molecular mass of 7,798 Da. SETI has 31 amino acid residues that are identical with BPTI's sequence, including 6 half-cystine and 5 aromatic amino acid residues. The active site Lys residue in BPTI is replaced by an Arg residue in SETI. SETI is an effective inhibitor of trypsin and moderately inhibits a-chymotrypsin, but less inhibits elastase or subtilisin. SETI was expressed by E. coli containing a PelB vector carrying the SETI encoding cDNA; an expression yield of 0.68 mg/l was obtained. The phylogenetic relationship of SETI and the other BPTI-like trypsin inhibitors was analyzed using most likelihood inference methods.
A proteomic analysis of leaf sheaths from rice.

PubMed

Shen, Shihua; Matsubae, Masami; Takao, Toshifumi; Tanaka, Naoki; Komatsu, Setsuko

2002-10-01

The proteins extracted from the leaf sheaths of rice seedlings were separated by 2-D PAGE, and analyzed by Edman sequencing and mass spectrometry, followed by database searching. Image analysis revealed 352 protein spots on 2-D PAGE after staining with Coomassie Brilliant Blue. The amino acid sequences of 44 of 84 proteins were determined; for 31 of these proteins, a clear function could be assigned, whereas for 12 proteins, no function could be assigned. Forty proteins did not yield amino acid sequence information, because they were N-terminally blocked, or the obtained sequences were too short and/or did not give unambiguous results. Fifty-nine proteins were analyzed by mass spectrometry; all of these proteins were identified by matching to the protein database. The amino acid sequences of 19 of 27 proteins analyzed by mass spectrometry were similar to the results of Edman sequencing. These results suggest that 2-D PAGE combined with Edman sequencing and mass spectrometry analysis can be effectively used to identify plant proteins.
Amino acid sequences of peptides from a chymotryptic digest of a urea-soluble protein fraction (U.S.3) from oxidized wool

PubMed Central

Corfield, M. C.; Fletcher, J. C.

1969-01-01

1. A chymotryptic digest of the protein fraction U.S.3. from oxidized wool was separated into 51 peptide fractions by chromatography on a column of cation-exchange resin. 2. The less acidic fractions were separated into their component peptides by a combination of cation-exchange-resin chromatography, paper chromatography and paper electrophoresis. 3. The amino acid sequences of 34 of these peptides were elucidated, and those of 14 others partially determined. 4. Overlaps between the tryptic and chymotryptic peptides from fraction U.S.3 have enabled ten extended amino acid sequences to be deduced, the longest containing 20 amino acid residues. 5. The relevance of the results to the structures of the helical and non-helical regions of wool is discussed. PMID:5395876
Cloning and sequence determination of the gene coding for the pyruvate phosphate dikinase of Entamoeba histolytica.

PubMed

Saavedra-Lira, E; Pérez-Montfort, R

1994-05-16

We isolated three overlapping clones from a DNA genomic library of Entamoeba histolytica strain HM1:IMSS, whose translated nucleotide (nt) sequence shows similarities of 51, 48 and 47% with the amino acid (aa) sequences reported for the pyruvate phosphate dikinases from Bacteroides symbiosus, maize and Flaveria trinervia, respectively. The reading frame determined codes for a protein of 886 aa.
Partial amino acid sequence of the branched chain amino acid aminotransferase (TmB) of E. coli JA199 pDU11

DOE Office of Scientific and Technical Information (OSTI.GOV)

Feild, M.J.; Armstrong, F.B.

1987-05-01

E. coli JA199 pDU11 harbors a multicopy plasmid containing the ilv GEDAY gene cluster of S. typhimurium. TmB, gene product of ilv E, was purified, crystallized, and subjected to Edman degradation using a gas phase sequencer. The intact protein yielded an amino terminal 31 residue sequence. Both carboxymethylated apoenzyme and (/sup 3/H)-NaBH-reduced holoenzyme were then subjected to digestion by trypsin. The digests were fractionated using reversed phase HPLC, and the peptides isolated were sequenced. The borohydride-treated holoenzyme was used to isolate the cofactor-binding peptide. The peptide is 27 residues long and a comparison with known sequences of other aminotransferases revealedmore » limited homology. Peptides accounting for 211 of 288 predicted residues have been sequenced, including 9 residues of the carboxyl terminus. Comparison of peptides with the inferred amino acid sequence of the E. coli K-12 enzyme has helped determine the sequence of the amino terminal 59 residues; only two differences between the sequences are noted in this region.« less
A Protocol for Functional Assessment of Whole-Protein Saturation Mutagenesis Libraries Utilizing High-Throughput Sequencing.

PubMed

Stiffler, Michael A; Subramanian, Subu K; Salinas, Victor H; Ranganathan, Rama

2016-07-03

Site-directed mutagenesis has long been used as a method to interrogate protein structure, function and evolution. Recent advances in massively-parallel sequencing technology have opened up the possibility of assessing the functional or fitness effects of large numbers of mutations simultaneously. Here, we present a protocol for experimentally determining the effects of all possible single amino acid mutations in a protein of interest utilizing high-throughput sequencing technology, using the 263 amino acid antibiotic resistance enzyme TEM-1 β-lactamase as an example. In this approach, a whole-protein saturation mutagenesis library is constructed by site-directed mutagenic PCR, randomizing each position individually to all possible amino acids. The library is then transformed into bacteria, and selected for the ability to confer resistance to β-lactam antibiotics. The fitness effect of each mutation is then determined by deep sequencing of the library before and after selection. Importantly, this protocol introduces methods which maximize sequencing read depth and permit the simultaneous selection of the entire mutation library, by mixing adjacent positions into groups of length accommodated by high-throughput sequencing read length and utilizing orthogonal primers to barcode each group. Representative results using this protocol are provided by assessing the fitness effects of all single amino acid mutations in TEM-1 at a clinically relevant dosage of ampicillin. The method should be easily extendable to other proteins for which a high-throughput selection assay is in place.

Sequence fingerprints distinguish erroneous from correct predictions of intrinsically disordered protein regions.

PubMed

Saravanan, Konda Mani; Dunker, A Keith; Krishnaswamy, Sankaran

2017-12-27

More than 60 prediction methods for intrinsically disordered proteins (IDPs) have been developed over the years, many of which are accessible on the World Wide Web. Nearly, all of these predictors give balanced accuracies in the ~65%-~80% range. Since predictors are not perfect, further studies are required to uncover the role of amino acid residues in native IDP as compared to predicted IDP regions. In the present work, we make use of sequences of 100% predicted IDP regions, false positive disorder predictions, and experimentally determined IDP regions to distinguish the characteristics of native versus predicted IDP regions. A higher occurrence of asparagine is observed in sequences of native IDP regions but not in sequences of false positive predictions of IDP regions. The occurrences of certain combinations of amino acids at the pentapeptide level provide a distinguishing feature in the IDPs with respect to globular proteins. The distinguishing features presented in this paper provide insights into the sequence fingerprints of amino acid residues in experimentally determined as compared to predicted IDP regions. These observations and additional work along these lines should enable the development of improvements in the accuracy of disorder prediction algorithm.
Primary structure of prostaglandin G/H synthase from sheep vesicular gland determined from the complementary DNA sequence.

PubMed Central

DeWitt, D L; Smith, W L

1988-01-01

Prostaglandin G/H synthase (8,11,14-icosatrienoate, hydrogen-donor:oxygen oxidoreductase, EC 1.14.99.1) catalyzes the first step in the formation of prostaglandins and thromboxanes, the conversion of arachidonic acid to prostaglandin endoperoxides G and H. This enzyme is the site of action of nonsteroidal anti-inflammatory drugs. We have isolated a 2.7-kilobase complementary DNA (cDNA) encompassing the entire coding region of prostaglandin G/H synthase from sheep vesicular glands. This cDNA, cloned from a lambda gt 10 library prepared from poly(A)+ RNA of vesicular glands, hybridizes with a single 2.75-kilobase mRNA species. The cDNA clone was selected using oligonucleotide probes modeled from amino acid sequences of tryptic peptides prepared from the purified enzyme. The full-length cDNA encodes a protein of 600 amino acids, including a signal sequence of 24 amino acids. Identification of the cDNA as coding for prostaglandin G/H synthase is based on comparison of amino acid sequences of seven peptides comprising 103 amino acids with the amino acid sequence deduced from the nucleotide sequence of the cDNA. The molecular weight of the unglycosylated enzyme lacking the signal peptide is 65,621. The synthase is a glycoprotein, and there are three potential sites for N-glycosylation, two of them in the amino-terminal half of the molecule. The serine reported to be acetylated by aspirin is at position 530, near the carboxyl terminus. There is no significant similarity between the sequence of the synthase and that of any other protein in amino acid or nucleotide sequence libraries, and a heme binding site(s) is not apparent from the amino acid sequence. The availability of a full-length cDNA clone coding for prostaglandin G/H synthase should facilitate studies of the regulation of expression of this enzyme and the structural features important for catalysis and for interaction with anti-inflammatory drugs. Images PMID:3125548
Complementary DNA cloning and molecular evolution of opine dehydrogenases in some marine invertebrates.

PubMed

Kimura, Tomohiro; Nakano, Toshiki; Yamaguchi, Toshiyasu; Sato, Minoru; Ogawa, Tomohisa; Muramoto, Koji; Yokoyama, Takehiko; Kan-No, Nobuhiro; Nagahisa, Eizou; Janssen, Frank; Grieshaber, Manfred K

2004-01-01

The complete complementary DNA sequences of genes presumably coding for opine dehydrogenases from Arabella iricolor (sandworm), Haliotis discus hannai (abalone), and Patinopecten yessoensis (scallop) were determined, and partial cDNA sequences were derived for Meretrix lusoria (Japanese hard clam) and Spisula sachalinensis (Sakhalin surf clam). The primers ODH-9F and ODH-11R proved useful for amplifying the sequences for opine dehydrogenases from the 4 mollusk species investigated in this study. The sequence of the sandworm was obtained using primers constructed from the amino acid sequence of tauropine dehydrogenase, the main opine dehydrogenase in A. iricolor. The complete cDNA sequence of A. iricolor, H. discus hannai, and P. yessoensis encode 397, 400, and 405 amino acids, respectively. All sequences were aligned and compared with published databank sequences of Loligo opalescens, Loligo vulgaris (squid), Sepia officinalis (cuttlefish), and Pecten maximus (scallop). As expected, a high level of homology was observed for the cDNA from closely related species, such as for cephalopods or scallops, whereas cDNA from the other species showed lower-level homologies. A similar trend was observed when the deduced amino acid sequences were compared. Furthermore, alignment of these sequences revealed some structural motifs that are possibly related to the binding sites of the substrates. The phylogenetic trees derived from the nucleotide and amino acid sequences were consistent with the classification of species resulting from classical taxonomic analyses.
Nucleic Acid Extraction from Synthetic Mars Analog Soils for in situ Life Detection

PubMed Central

Mojarro, Angel; Ruvkun, Gary; Zuber, Maria T.

2017-01-01

Abstract Biological informational polymers such as nucleic acids have the potential to provide unambiguous evidence of life beyond Earth. To this end, we are developing an automated in situ life-detection instrument that integrates nucleic acid extraction and nanopore sequencing: the Search for Extra-Terrestrial Genomes (SETG) instrument. Our goal is to isolate and determine the sequence of nucleic acids from extant or preserved life on Mars, if, for example, there is common ancestry to life on Mars and Earth. As is true of metagenomic analysis of terrestrial environmental samples, the SETG instrument must isolate nucleic acids from crude samples and then determine the DNA sequence of the unknown nucleic acids. Our initial DNA extraction experiments resulted in low to undetectable amounts of DNA due to soil chemistry–dependent soil-DNA interactions, namely adsorption to mineral surfaces, binding to divalent/trivalent cations, destruction by iron redox cycling, and acidic conditions. Subsequently, we developed soil-specific extraction protocols that increase DNA yields through a combination of desalting, utilization of competitive binders, and promotion of anaerobic conditions. Our results suggest that a combination of desalting and utilizing competitive binders may establish a “universal” nucleic acid extraction protocol suitable for analyzing samples from diverse soils on Mars. Key Words: Life-detection instruments—Nucleic acids—Mars—Panspermia. Astrobiology 17, 747–760. PMID:28704064
Isolation and determination of the primary structure of a lectin protein from the serum of the American alligator (Alligator mississippiensis).

PubMed

Darville, Lancia N F; Merchant, Mark E; Maccha, Venkata; Siddavarapu, Vivekananda Reddy; Hasan, Azeem; Murray, Kermit K

2012-02-01

Mass spectrometry in conjunction with de novo sequencing was used to determine the amino acid sequence of a 35kDa lectin protein isolated from the serum of the American alligator that exhibits binding to mannose. The protein N-terminal sequence was determined using Edman degradation and enzymatic digestion with different proteases was used to generate peptide fragments for analysis by liquid chromatography tandem mass spectrometry (LC MS/MS). Separate analysis of the protein digests with multiple enzymes enhanced the protein sequence coverage. De novo sequencing was accomplished using MASCOT Distiller and PEAKS software and the sequences were searched against the NCBI database using MASCOT and BLAST to identify homologous peptides. MS analysis of the intact protein indicated that it is present primarily as monomer and dimer in vitro. The isolated 35kDa protein was ~98% sequenced and found to have 313 amino acids and nine cysteine residues and was identified as an alligator lectin. The alligator lectin sequence was aligned with other lectin sequences using DIALIGN and ClustalW software and was found to exhibit 58% and 59% similarity to both human and mouse intelectin-1. The alligator lectin exhibited strong binding affinities toward mannan and mannose as compared to other tested carbohydrates. Copyright © 2011 Elsevier Inc. All rights reserved.
The primary structure of fatty-acid-binding protein from nurse shark liver. Structural and evolutionary relationship to the mammalian fatty-acid-binding protein family.

PubMed

Medzihradszky, K F; Gibson, B W; Kaur, S; Yu, Z H; Medzihradszky, D; Burlingame, A L; Bass, N M

1992-02-01

The primary structure of a fatty-acid-binding protein (FABP) isolated from the liver of the nurse shark (Ginglymostoma cirratum) was determined by high-performance tandem mass spectrometry (employing multichannel array detection) and Edman degradation. Shark liver FABP consists of 132 amino acids with an acetylated N-terminal valine. The chemical molecular mass of the intact protein determined by electrospray ionization mass spectrometry (Mr = 15124 +/- 2.5) was in good agreement with that calculated from the amino acid sequence (Mr = 15121.3). The amino acid sequence of shark liver FABP displays significantly greater similarity to the FABP expressed in mammalian heart, peripheral nerve myelin and adipose tissue (61-53% sequence similarity) than to the FABP expressed in mammalian liver (22% similarity). Phylogenetic trees derived from the comparison of the shark liver FABP amino acid sequence with the members of the mammalian fatty-acid/retinoid-binding protein gene family indicate the initial divergence of an ancestral gene into two major subfamilies: one comprising the genes for mammalian liver FABP and gastrotropin, the other comprising the genes for mammalian cellular retinol-binding proteins I and II, cellular retinoic-acid-binding protein myelin P2 protein, adipocyte FABP, heart FABP and shark liver FABP, the latter having diverged from the ancestral gene that ultimately gave rise to the present day mammalian heart-FABP, adipocyte FABP and myelin P2 protein sequences. The sequence for intestinal FABP from the rat could be assigned to either subfamily, depending on the approach used for phylogenetic tree construction, but clearly diverged at a relatively early evolutionary time point. Indeed, sequences proximately ancestral or closely related to mammalian intestinal FABP, liver FABP, gastrotropin and the retinoid-binding group of proteins appear to have arisen prior to the divergence of shark liver FABP and should therefore also be present in elasmobranchs. The presence in shark liver of an FABP which differs substantially in primary structure from mammalian liver FABP, while being closely related to the FABP expressed in mammalian heart muscle, peripheral nerve myelin and adipocytes, opens a further dimension regarding the question of the existence of structure-dependent and tissue-specific specialization of FABP function in lipid metabolism.
A harmonized immunoassay with liquid chromatography-mass spectrometry analysis in egg allergen determination.

PubMed

Nimata, Masaomi; Okada, Hideki; Kurihara, Kei; Sugimoto, Tsukasa; Honjoh, Tsutomu; Kuroda, Kazuhiko; Yano, Takeo; Tachibana, Hirofumi; Shoji, Masahiro

2018-01-01

Food allergy is a serious health issue worldwide. Implementing allergen labeling regulations is extremely challenging for regulators, food manufacturers, and analytical kit manufacturers. Here we have developed an "amino acid sequence immunoassay" approach to ELISA. The new ELISA comprises of a monoclonal antibody generated via an analyte specific peptide antigen and sodium lauryl sulfate/sulfite solution. This combination enables the antibody to access the epitope site in unfolded analyte protein. The newly developed ELISA recovered 87.1%-106.4% ovalbumin from ovalbumin-incurred model processed foods, thereby demonstrating its applicability as practical egg allergen determination. Furthermore, the comparison of LC-MS/MS and the new ELISA, which targets the amino acid sequence conforming to the LC-MS/MS detection peptide, showed a good agreement. Consequently the harmonization of two methods was demonstrated. The complementary use of the new ELISA and LC-MS analysis can offer a wide range of practical benefits in terms of easiness, cost, accuracy, and efficiency in food allergen analysis. In addition, the new assay is attractive in respect to its easy antigen preparation and predetermined specificity. Graphical abstract The ELISA composing of the monoclonal antibody targeting the amino acid sequence conformed to LC-MS detection peptide, and the protein conformation unfolding reagent was developed. In ovalbumin determination, the developed ELISA showed a good agreement with LC-MS analysis. Consequently the harmonization of immunoassay with LC-MS analysis by using common target amino acid sequence was demonstrated.
The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.

PubMed Central

Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R

1982-01-01

The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. Images PMID:6296791
Identification of a nuclear localization sequence in the polyomavirus capsid protein VP2

NASA Technical Reports Server (NTRS)

Chang, D.; Haynes, J. I. 2nd; Brady, J. N.; Consigli, R. A.; Spooner, B. S. (Principal Investigator)

1992-01-01

A nuclear localization signal (NLS) has been identified in the C-terminal (Glu307-Glu-Asp-Gly-Pro-Gln-Lys-Lys-Lys-Arg-Arg-Leu318) amino acid sequence of the polyomavirus minor capsid protein VP2. The importance of this amino acid sequence for nuclear transport of newly synthesized VP2 was demonstrated by a genetic "subtractive" study using the constructs pSG5VP2 (expressing full-length VP2) and pSG5 delta 3VP2 (expressing truncated VP2, lacking amino acids Glu307-Leu318). These constructs were transfected into COS-7 cells, and the intracellular localization of the VP2 protein was determined by indirect immunofluorescence. These studies revealed that the full-length VP2 was localized in the nucleus, while the truncated VP2 protein was localized in the cytoplasm and not transported to the nucleus. A biochemical "additive" approach was also used to determine whether this sequence could target nonnuclear proteins to the nucleus. A synthetic peptide identical to VP2 amino acids Glu307-Leu318 was cross-linked to the nonnuclear proteins bovine serum albumin (BSA) or immunoglobulin G (IgG). The conjugates were then labeled with fluorescein isothiocyanate and microinjected into the cytoplasm of NIH 3T6 cells. Both conjugates localized in the nucleus of the microinjected cells, whereas unconjugated BSA and IgG remained in the cytoplasm. Taken together, these genetic subtractive and biochemical additive approaches have identified the C-terminal sequence of polyoma-virus VP2 (containing amino acids Glu307-Leu318) as the NLS of this protein.
Nucleotide sequence of the gene determining plasmid-mediated citrate utilization.

PubMed Central

Ishiguro, N; Sato, G

1985-01-01

The citrate utilization determinant from transposon Tn3411 has been cloned and sequenced, and its polypeptide products have been characterized in minicell experiments. The nucleotide sequence was determined for a 2,047-base-pair BglII restriction endonuclease fragment that includes the citrate determinant. This region contains an open reading frame that would encode a 431-amino-acid very hydrophobic polypeptide and which is preceded by a reasonable ribosomal binding site. However, the single polypeptide found in minicell experiments had an apparent molecular weight of 35,000 on sodium dodecyl sulfate-polyacrylamide gel electrophoresis. Images PMID:2999087
Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.

PubMed

Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L

1987-08-01

To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded.
Nanopores and nucleic acids: prospects for ultrarapid sequencing

NASA Technical Reports Server (NTRS)

Deamer, D. W.; Akeson, M.

2000-01-01

DNA and RNA molecules can be detected as they are driven through a nanopore by an applied electric field at rates ranging from several hundred microseconds to a few milliseconds per molecule. The nanopore can rapidly discriminate between pyrimidine and purine segments along a single-stranded nucleic acid molecule. Nanopore detection and characterization of single molecules represents a new method for directly reading information encoded in linear polymers. If single-nucleotide resolution can be achieved, it is possible that nucleic acid sequences can be determined at rates exceeding a thousand bases per second.
Codes in the codons: construction of a codon/amino acid periodic table and a study of the nature of specific nucleic acid-protein interactions.

PubMed

Benyo, B; Biro, J C; Benyo, Z

2004-01-01

The theory of "codon-amino acid coevolution" was first proposed by Woese in 1967. It suggests that there is a stereochemical matching - that is, affinity - between amino acids and certain of the base triplet sequences that code for those amino acids. We have constructed a common periodic table of codons and amino acids, where the nucleic acid table showed perfect axial symmetry for codons and the corresponding amino acid table also displayed periodicity regarding the biochemical properties (charge and hydrophobicity) of the 20 amino acids and the position of the stop signals. The table indicates that the middle (2/sup nd/) amino acid in the codon has a prominent role in determining some of the structural features of the amino acids. The possibility that physical contact between codons and amino acids might exist was tested on restriction enzymes. Many recognition site-like sequences were found in the coding sequences of these enzymes and as many as 73 examples of codon-amino acid co-location were observed in the 7 known 3D structures (December 2003) of endonuclease-nucleic acid complexes. These results indicate that the smallest possible units of specific nucleic acid-protein interaction are indeed the stereochemically compatible codons and amino acids.
Dna Sequencing

DOEpatents

Tabor, Stanley; Richardson, Charles C.

1995-04-25

A method for sequencing a strand of DNA, including the steps off: providing the strand of DNA; annealing the strand with a primer able to hybridize to the strand to give an annealed mixture; incubating the mixture with four deoxyribonucleoside triphosphates, a DNA polymerase, and at least three deoxyribonucleoside triphosphates in different amounts, under conditions in favoring primer extension to form nucleic acid fragments complementory to the DNA to be sequenced; labelling the nucleic and fragments; separating them and determining the position of the deoxyribonucleoside triphosphates by differences in the intensity of the labels, thereby to determine the DNA sequence.
Characterization and mapping of cDNA encoding aspartate aminotransferase in rice, Oryza sativa L.

PubMed

Song, J; Yamamoto, K; Shomura, A; Yano, M; Minobe, Y; Sasaki, T

1996-10-31

Fifteen cDNA clones, putatively identified as encoding aspartate aminotransferase (AST, EC 2.6.1.1.), were isolated and partially sequenced. Together with six previously isolated clones putatively identified to encode ASTs (Sasaki, et al. 1994, Plant Journal 6, 615-624), their sequences were characterized and classified into 4 cDNA species. Two of the isolated clones, C60213 and C2079, were full-length cDNAs, and their complete nucleotide sequences were determined. C60213 was 1612 bp long and its deduced amino acid sequence showed 88% homology with that of Panicum miliaceum L. mitochondrial AST. The C60213-encoded protein had an N-terminal amino acid sequence that was characteristic of a mitochondrial transit peptide. On the other hand, C2079 was 1546 bp long and had 91% amino acid sequence homology with P. miliaceum L. cytosolic AST but lacked in the transit peptide sequence. The homologies of nucleotide sequences and deduced amino acid sequences of C2079 and C60213 were 54% and 52%, respectively. C2079 and C60213 were mapped on chromosomes 1 and 6, respectively, by restriction fragment length polymorphism linkage analysis. Northern blot analysis using C2079 as a probe revealed much higher transcript levels in callus and root than in green and etiolated shoots, suggesting tissue-specific variations of AST gene expression.
Complete genome sequence of a divergent strain of Japanese yam mosaic virus from China

USDA-ARS?s Scientific Manuscript database

A novel strain of Japanese yam mosaic virus (JYMV-CN) was identified in a yam plant with foliar mottle symptoms in China. The complete genomic sequence of JYMV-CN was determined. Its genomic sequence of 9701 nucleotides encodes a polyprotein of 3247 amino acids. Its organization was virtually identi...
Sequence Analysis and Domain Motifs in the Porcine Skin Decorin Glycosaminoglycan Chain*

PubMed Central

Zhao, Xue; Yang, Bo; Solakylidirim, Kemal; Joo, Eun Ji; Toida, Toshihiko; Higashi, Kyohei; Linhardt, Robert J.; Li, Lingyun

2013-01-01

Decorin proteoglycan is comprised of a core protein containing a single O-linked dermatan sulfate/chondroitin sulfate glycosaminoglycan (GAG) chain. Although the sequence of the decorin core protein is determined by the gene encoding its structure, the structure of its GAG chain is determined in the Golgi. The recent application of modern MS to bikunin, a far simpler chondroitin sulfate proteoglycans, suggests that it has a single or small number of defined sequences. On this basis, a similar approach to sequence the decorin of porcine skin much larger and more structurally complex dermatan sulfate/chondroitin sulfate GAG chain was undertaken. This approach resulted in information on the consistency/variability of its linkage region at the reducing end of the GAG chain, its iduronic acid-rich domain, glucuronic acid-rich domain, and non-reducing end. A general motif for the porcine skin decorin GAG chain was established. A single small decorin GAG chain was sequenced using MS/MS analysis. The data obtained in the study suggest that the decorin GAG chain has a small or a limited number of sequences. PMID:23423381
A statistical physics perspective on alignment-independent protein sequence comparison.

PubMed

Chattopadhyay, Amit K; Nasiev, Diar; Flower, Darren R

2015-08-01

Within bioinformatics, the textual alignment of amino acid sequences has long dominated the determination of similarity between proteins, with all that implies for shared structure, function and evolutionary descent. Despite the relative success of modern-day sequence alignment algorithms, so-called alignment-free approaches offer a complementary means of determining and expressing similarity, with potential benefits in certain key applications, such as regression analysis of protein structure-function studies, where alignment-base similarity has performed poorly. Here, we offer a fresh, statistical physics-based perspective focusing on the question of alignment-free comparison, in the process adapting results from 'first passage probability distribution' to summarize statistics of ensemble averaged amino acid propensity values. In this article, we introduce and elaborate this approach. © The Author 2015. Published by Oxford University Press.
Comprehensive analysis of the dynamic structure of nuclear localization signals.

PubMed

Yamagishi, Ryosuke; Okuyama, Takahide; Oba, Shuntaro; Shimada, Jiro; Chaen, Shigeru; Kaneko, Hiroki

2015-12-01

Most transcription and epigenetic factors in eukaryotic cells have nuclear localization signals (NLSs) and are transported to the nucleus by nuclear transport proteins. Understanding the features of NLSs and the mechanisms of nuclear transport might help understand gene expression regulation, somatic cell reprogramming, thus leading to the treatment of diseases associated with abnormal gene expression. Although many studies analyzed the amino acid sequence of NLSs, few studies investigated their three-dimensional structure. Therefore, we conducted a statistical investigation of the dynamic structure of NLSs by extracting the conformation of these sequences from proteins examined by X-ray crystallography and using a quantity defined as conformational determination rate (a ratio between the number of amino acids determining the conformation and the number of all amino acids included in a certain region). We found that determining the conformation of NLSs is more difficult than determining the conformation of other regions and that NLSs may tend to form more heteropolymers than monomers. Therefore, these findings strongly suggest that NLSs are intrinsically disordered regions.
Amino acid sequence and the cellular location of the Na(+)-dependent D-glucose symporters (SGLT1) in the ovine enterocyte and the parotid acinar cell.

PubMed Central

Tarpey, P S; Wood, I S; Shirazi-Beechey, S P; Beechey, R B

1995-01-01

The Na(+)-dependent D-glucose symporter has been shown to be located on the basolateral domain of the plasma membrane of ovine parotid acinar cells. This is in contrast to the apical location of this transporter in the ovine enterocyte. The amino acid sequences of these two proteins have been determined. They are identical. The results indicated that the signals responsible for the differential targeting of these two proteins to the apical and the basal domains of the plasma membrane are not contained within the primary amino acid sequence. Images Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 PMID:7492327

Molecular characterization of two genotypes of a new polerovirus infecting brassicas in China.

PubMed

Xiang, Hai-Ying; Dong, Shu-Wei; Shang, Qiao-Xia; Zhou, Cui-Ji; Li, Da-Wei; Yu, Jia-Lin; Han, Cheng-Gui

2011-12-01

The genomic RNA sequences of two genotypes of a brassica-infecting polerovirus from China were determined. Sequence analysis revealed that the virus was closely related to but significantly different from turnip yellows virus (TuYV). This virus and other poleroviruses, including TuYV, had less than 90% amino acid sequence identity in all gene products except the coat protein. Based on the molecular criterion (>10% amino acid sequence difference) for species demarcation in the genus Polerovirus, the virus represents a distinct species for which the name Brassica yellows virus (BrYV) is proposed. Interestingly, there were two genotypes of BrYV, which mainly differed in the 5'-terminal half of the genome.
Primary and secondary structural analyses of glutathione S-transferase pi from human placenta.

PubMed

Ahmad, H; Wilson, D E; Fritz, R R; Singh, S V; Medh, R D; Nagle, G T; Awasthi, Y C; Kurosky, A

1990-05-01

The primary structure of glutathione S-transferase (GST) pi from a single human placenta was determined. The structure was established by chemical characterization of tryptic and cyanogen bromide peptides as well as automated sequence analysis of the intact enzyme. The structural analysis indicated that the protein is comprised of 209 amino acid residues and gave no evidence of post-translational modifications. The amino acid sequence differed from that of the deduced amino acid sequence determined by nucleotide sequence analysis of a cDNA clone (Kano, T., Sakai, M., and Muramatsu, M., 1987, Cancer Res. 47, 5626-5630) at position 104 which contained both valine and isoleucine whereas the deduced sequence from nucleotide sequence analysis identified only isoleucine at this position. These results demonstrated that in the one individual placenta studied at least two GST pi genes are coexpressed, probably as a result of allelomorphism. Computer assisted consensus sequence evaluation identified a hydrophobic region in GST pi (residues 155-181) that was predicted to be either a buried transmembrane helical region or a signal sequence region. The significance of this hydrophobic region was interpreted in relation to the mode of action of the enzyme especially in regard to the potential involvement of a histidine in the active site mechanism. A comparison of the chemical similarity of five known human GST complete enzyme structures, one of pi, one of mu, two of alpha, and one microsomal, gave evidence that all five enzymes have evolved by a divergent evolutionary process after gene duplication, with the microsomal enzyme representing the most divergent form.
Protein location prediction using atomic composition and global features of the amino acid sequence

DOE Office of Scientific and Technical Information (OSTI.GOV)

Cherian, Betsy Sheena, E-mail: betsy.skb@gmail.com; Nair, Achuthsankar S.

2010-01-22

Subcellular location of protein is constructive information in determining its function, screening for drug candidates, vaccine design, annotation of gene products and in selecting relevant proteins for further studies. Computational prediction of subcellular localization deals with predicting the location of a protein from its amino acid sequence. For a computational localization prediction method to be more accurate, it should exploit all possible relevant biological features that contribute to the subcellular localization. In this work, we extracted the biological features from the full length protein sequence to incorporate more biological information. A new biological feature, distribution of atomic composition is effectivelymore » used with, multiple physiochemical properties, amino acid composition, three part amino acid composition, and sequence similarity for predicting the subcellular location of the protein. Support Vector Machines are designed for four modules and prediction is made by a weighted voting system. Our system makes prediction with an accuracy of 100, 82.47, 88.81 for self-consistency test, jackknife test and independent data test respectively. Our results provide evidence that the prediction based on the biological features derived from the full length amino acid sequence gives better accuracy than those derived from N-terminal alone. Considering the features as a distribution within the entire sequence will bring out underlying property distribution to a greater detail to enhance the prediction accuracy.« less
Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides

NASA Astrophysics Data System (ADS)

McMillen, Chelsea L.; Wright, Patience M.; Cassady, Carolyn J.

2016-05-01

Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.
Negative Ion In-Source Decay Matrix-Assisted Laser Desorption/Ionization Mass Spectrometry for Sequencing Acidic Peptides.

PubMed

McMillen, Chelsea L; Wright, Patience M; Cassady, Carolyn J

2016-05-01

Matrix-assisted laser desorption/ionization (MALDI) in-source decay was studied in the negative ion mode on deprotonated peptides to determine its usefulness for obtaining extensive sequence information for acidic peptides. Eight biological acidic peptides, ranging in size from 11 to 33 residues, were studied by negative ion mode ISD (nISD). The matrices 2,5-dihydroxybenzoic acid, 2-aminobenzoic acid, 2-aminobenzamide, 1,5-diaminonaphthalene, 5-amino-1-naphthol, 3-aminoquinoline, and 9-aminoacridine were used with each peptide. Optimal fragmentation was produced with 1,5-diaminonphthalene (DAN), and extensive sequence informative fragmentation was observed for every peptide except hirudin(54-65). Cleavage at the N-Cα bond of the peptide backbone, producing c' and z' ions, was dominant for all peptides. Cleavage of the N-Cα bond N-terminal to proline residues was not observed. The formation of c and z ions is also found in electron transfer dissociation (ETD), electron capture dissociation (ECD), and positive ion mode ISD, which are considered to be radical-driven techniques. Oxidized insulin chain A, which has four highly acidic oxidized cysteine residues, had less extensive fragmentation. This peptide also exhibited the only charged localized fragmentation, with more pronounced product ion formation adjacent to the highly acidic residues. In addition, spectra were obtained by positive ion mode ISD for each protonated peptide; more sequence informative fragmentation was observed via nISD for all peptides. Three of the peptides studied had no product ion formation in ISD, but extensive sequence informative fragmentation was found in their nISD spectra. The results of this study indicate that nISD can be used to readily obtain sequence information for acidic peptides.
Sequence diversity of hepatitis C virus 6a within the extended interferon sensitivity-determining region correlates with interferon-alpha/ribavirin treatment outcomes.

PubMed

Zhou, Daniel X M; Chan, Paul K S; Zhang, Tiejun; Tully, Damien C; Tam, John S

2010-10-01

Studies on the association between sequence variability of the interferon sensitivity-determining region (ISDR) of hepatitis C virus and the outcome of treatment have reached conflicting results. In this study, 25 patients infected with HCV 6a who had received interferon-alpha/ribavirin combination treatment were analyzed for the sequence variations. 14 of them had the full genome sequences obtained from a previous study, whereas the other 11 samples were sequenced for the extended ISDR (eISDR). This eISDR fragment covers 192 bp (64 amino acids) upstream and 201 bp (67 amino acids) downstream from the ISDR previously defined for HCV 1b. The comparison between interferon-alpha resistance and response groups for the amino acid mutations located in the full genome (6 and 8 patients respectively) as well as the mutations located in the eISDR (10 and 15 patients respectively) showed that the mutations I2160V, I2256V, V2292I (P<0.05) within eISDR were significantly associated with resistance to treatment. However, the extent of amino acid variations within previously defined ISDR was not associated with resistance to treatment as previously reported. Four amino acid variations I248V (P=0.03-0.06) within E1, R445K (P=0.02-0.05) and S747T (P=0.03) within E2, I861V (P=0.01) within NS2 which located outside the eISDR may also associate with treatment outcome as identified by a prescreening of variations within 14 HCV 6a full genomes. (c) 2010 Elsevier B.V. All rights reserved.
The catalytic chain of human complement subcomponent C1r. Purification and N-terminal amino acid sequences of the major cyanogen bromide-cleavage fragments.

PubMed

Arlaud, G J; Gagnon, J; Porter, R R

1982-01-01

1. The a- and b-chains of reduced and alkylated human complement subcomponent C1r were separated by high-pressure gel-permeation chromatography and isolated in good yield and in pure form. 2. CNBr cleavage of C1r b-chain yielded eight major peptides, which were purified by gel filtration and high-pressure reversed-phase chromatography. As determined from the sum of their amino acid compositions, these peptides accounted for a minimum molecular weight of 28 000, close to the value 29 100 calculated from the whole b-chain. 3. N-Terminal sequence determinations of C1r b-chain and its CNBr-cleavage peptides allowed the identification of about two-thirds of the amino acids of C1r b-chain. From our results, and on the basis of homology with other serine proteinases, an alignment of the eight CNBr-cleavage peptides from C1r b-chain is proposed. 4. The residues forming the 'charge-relay' system of the active site of serine proteinases (His-57, Asp-102 and Ser-195 in the chymotrypsinogen numbering) are found in the corresponding regions of C1r b-chain, and the amino acid sequence around these residues has been determined. 5. The N-terminal sequence of C1r b-chain has been extended to residue 60 and reveals that C1r b-chain lacks the 'histidine loop', a disulphide bond that is present in all other known serine proteinases.
Rational Protein Engineering Guided by Deep Mutational Scanning

PubMed Central

Shin, HyeonSeok; Cho, Byung-Kwan

2015-01-01

Sequence–function relationship in a protein is commonly determined by the three-dimensional protein structure followed by various biochemical experiments. However, with the explosive increase in the number of genome sequences, facilitated by recent advances in sequencing technology, the gap between protein sequences available and three-dimensional structures is rapidly widening. A recently developed method termed deep mutational scanning explores the functional phenotype of thousands of mutants via massive sequencing. Coupled with a highly efficient screening system, this approach assesses the phenotypic changes made by the substitution of each amino acid sequence that constitutes a protein. Such an informational resource provides the functional role of each amino acid sequence, thereby providing sufficient rationale for selecting target residues for protein engineering. Here, we discuss the current applications of deep mutational scanning and consider experimental design. PMID:26404267
Amino acid sequences of the ribosomal proteins HL30 and HmaL5 from the archaebacterium Halobacterium marismortui.

PubMed

Hatakeyama, T; Hatakeyama, T

1990-07-06

The complete amino acid sequences of the ribosomal proteins HL30 and HmaL5 from the archaebacterium Halobacterium marismortui were determined. Protein HL30 was found to be acetylated at its N-terminal amino acid and shows homology to the eukaryotic ribosomal proteins YL34 from yeast and RL31 from rat. Protein HmaL5 was homologous to the protein L5 from Escherichia coli and Bacillus stearothermophilus as well as to YL16 from yeast. HmaL5 shows more similarities to its eukaryotic counterpart than to eubacterial ones.
Saccharomyces cerevisiae SSB1 protein and its relationship to nucleolar RNA-binding proteins.

PubMed Central

Jong, A Y; Clark, M W; Gilbert, M; Oehm, A; Campbell, J L

1987-01-01

To better define the function of Saccharomyces cerevisiae SSB1, an abundant single-stranded nucleic acid-binding protein, we determined the nucleotide sequence of the SSB1 gene and compared it with those of other proteins of known function. The amino acid sequence contains 293 amino acid residues and has an Mr of 32,853. There are several stretches of sequence characteristic of other eucaryotic single-stranded nucleic acid-binding proteins. At the amino terminus, residues 39 to 54 are highly homologous to a peptide in calf thymus UP1 and UP2 and a human heterogeneous nuclear ribonucleoprotein. Residues 125 to 162 constitute a fivefold tandem repeat of the sequence RGGFRG, the composition of which suggests a nucleic acid-binding site. Near the C terminus, residues 233 to 245 are homologous to several RNA-binding proteins. Of 18 C-terminal residues, 10 are acidic, a characteristic of the procaryotic single-stranded DNA-binding proteins and eucaryotic DNA- and RNA-binding proteins. In addition, examination of the subcellular distribution of SSB1 by immunofluorescence microscopy indicated that SSB1 is a nuclear protein, predominantly located in the nucleolus. Sequence homologies and the nucleolar localization make it likely that SSB1 functions in RNA metabolism in vivo, although an additional role in DNA metabolism cannot be excluded. Images PMID:2823109
The bglA Gene of Aspergillus kawachii Encodes Both Extracellular and Cell Wall-Bound β-Glucosidases

PubMed Central

Iwashita, Kazuhiro; Nagahara, Tatsuya; Kimura, Hitoshi; Takano, Makoto; Shimoi, Hitoshi; Ito, Kiyoshi

1999-01-01

We cloned the genomic DNA and cDNA of bglA, which encodes β-glucosidase in Aspergillus kawachii, based on a partial amino acid sequence of purified cell wall-bound β-glucosidase CB-1. The nucleotide sequence of the cloned bglA gene revealed a 2,933-bp open reading frame with six introns that encodes an 860-amino-acid protein. Based on the deduced amino acid sequence, we concluded that the bglA gene encodes cell wall-bound β-glucosidase CB-1. The amino acid sequence exhibited high levels of homology with the amino acid sequences of fungal β-glucosidases classified in subfamily B. We expressed the bglA cDNA in Saccharomyces cerevisiae and detected the recombinant β-glucosidase in the periplasm fraction of the recombinant yeast. A. kawachii can produce two extracellular β-glucosidases (EX-1 and EX-2) in addition to the cell wall-bound β-glucosidase. A. kawachii in which the bglA gene was disrupted produced none of the three β-glucosidases, as determined by enzyme assays and a Western blot analysis. Thus, we concluded that the bglA gene encodes both extracellular and cell wall-bound β-glucosidases in A. kawachii. PMID:10584016
Purification and characterization of enantioselective N-acetyl-β-Phe acylases from Burkholderia sp. AJ110349.

PubMed

Imabayashi, Yuki; Suzuki, Shun'ichi; Kawasaki, Hisashi; Nakamatsu, Tsuyoshi

2016-01-01

For the production of enantiopure β-amino acids, enantioselective resolution of N-acyl β-amino acids using acylases, especially those recognizing N-acetyl-β-amino acids, is one of the most attractive methods. Burkholderia sp. AJ110349 had been reported to exhibit either (R)- or (S)-enantiomer selective N-acetyl-β-Phe amidohydrolyzing activity, and in this study, both (R)- and (S)-enantioselective N-acetyl-β-Phe acylases were purified to be electrophoretically pure and determined the sequences, respectively. They were quite different in terms of enantioselectivities and in their amino acids sequences and molecular weights. Although both the purified acylases were confirmed to catalyze N-acetyl hydrolyzing activities, neither of them show sequence similarities to the N-acetyl-α-amino acid acylases reported thus far. Both (R)- and (S)-enantioselective N-acetyl-β-Phe acylase were expressed in Escherichia coli. Using these recombinant strains, enantiomerically pure (R)-β-Phe (>99% ee) and (S)-β-Phe (>99% ee) were obtained from the racemic substrate.
Dictionary-driven protein annotation.

PubMed

Rigoutsos, Isidore; Huynh, Tien; Floratos, Aris; Parida, Laxmi; Platt, Daniel

2002-09-01

Computational methods seeking to automatically determine the properties (functional, structural, physicochemical, etc.) of a protein directly from the sequence have long been the focus of numerous research groups. With the advent of advanced sequencing methods and systems, the number of amino acid sequences that are being deposited in the public databases has been increasing steadily. This has in turn generated a renewed demand for automated approaches that can annotate individual sequences and complete genomes quickly, exhaustively and objectively. In this paper, we present one such approach that is centered around and exploits the Bio-Dictionary, a collection of amino acid patterns that completely covers the natural sequence space and can capture functional and structural signals that have been reused during evolution, within and across protein families. Our annotation approach also makes use of a weighted, position-specific scoring scheme that is unaffected by the over-representation of well-conserved proteins and protein fragments in the databases used. For a given query sequence, the method permits one to determine, in a single pass, the following: local and global similarities between the query and any protein already present in a public database; the likeness of the query to all available archaeal/ bacterial/eukaryotic/viral sequences in the database as a function of amino acid position within the query; the character of secondary structure of the query as a function of amino acid position within the query; the cytoplasmic, transmembrane or extracellular behavior of the query; the nature and position of binding domains, active sites, post-translationally modified sites, signal peptides, etc. In terms of performance, the proposed method is exhaustive, objective and allows for the rapid annotation of individual sequences and full genomes. Annotation examples are presented and discussed in Results, including individual queries and complete genomes that were released publicly after we built the Bio-Dictionary that is used in our experiments. Finally, we have computed the annotations of more than 70 complete genomes and made them available on the World Wide Web at http://cbcsrv.watson.ibm.com/Annotations/.
Methods for determining the genetic affinity of microorganisms and viruses

NASA Technical Reports Server (NTRS)

Fox, George E. (Inventor); Willson, III, Richard C. (Inventor); Zhang, Zhengdong (Inventor)

2012-01-01

Selecting which sub-sequences in a database of nucleic acid such as 16S rRNA are highly characteristic of particular groupings of bacteria, microorganisms, fungi, etc. on a substantially phylogenetic tree. Also applicable to viruses comprising viral genomic RNA or DNA. A catalogue of highly characteristic sequences identified by this method is assembled to establish the genetic identity of an unknown organism. The characteristic sequences are used to design nucleic acid hybridization probes that include the characteristic sequence or its complement, or are derived from one or more characteristic sequences. A plurality of these characteristic sequences is used in hybridization to determine the phylogenetic tree position of the organism(s) in a sample. Those target organisms represented in the original sequence database and sufficient characteristic sequences can identify to the species or subspecies level. Oligonucleotide arrays of many probes are especially preferred. A hybridization signal can comprise fluorescence, chemiluminescence, or isotopic labeling, etc.; or sequences in a sample can be detected by direct means, e.g. mass spectrometry. The method's characteristic sequences can also be used to design specific PCR primers. The method uniquely identifies the phylogenetic affinity of an unknown organism without requiring prior knowledge of what is present in the sample. Even if the organism has not been previously encountered, the method still provides useful information about which phylogenetic tree bifurcation nodes encompass the organism.
Isolation, characterization, and primary structure of rubredoxin from the photosynthetic bacterium, Heliobacillus mobilis

NASA Technical Reports Server (NTRS)

Lee, W. Y.; Brune, D. C.; LoBrutto, R.; Blankenship, R. E.

1995-01-01

Rubredoxin is a small nonheme iron protein that serves as an electron carrier in bacterial systems. Rubredoxin has now been isolated and characterized from the strictly anaerobic phototroph, Heliobacillus mobilis. THe molecular mass (5671.3 Da from the amino acid sequence) was confirmed and partial formylation of the N-terminal methionyl residue was established by matrix-assisted laser desorption mass spectroscopy. The complete 52-amino-acid sequence was determined by a combination of N-terminal sequencing by Edman degradation and C-terminal sequencing by a novel method using carboxypeptidase treatment in conjunction with amino acid analysis and laser desorption time of flight mass spectrometry. The molar absorption coefficient of Hc. mobilis rubredoxin at 490 nm is 6.9 mM-1 cm-1 and the midpoint redox potential at pH 8.0 is -46 mV. The EPR spectrum of the oxidized form shows resonances at g = 9.66 and 4.30 due to a high-spin ferric iron. The amino acid sequence is homologous to those of rubredoxins from other species, in particular, the gram-positive bacteria, and the phototrophic green sulfur bacteria, and the evolutionary implications of this are discussed.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-03-24

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.
Characterization and prediction of residues determining protein functional specificity.

PubMed

Capra, John A; Singh, Mona

2008-07-01

Within a homologous protein family, proteins may be grouped into subtypes that share specific functions that are not common to the entire family. Often, the amino acids present in a small number of sequence positions determine each protein's particular functional specificity. Knowledge of these specificity determining positions (SDPs) aids in protein function prediction, drug design and experimental analysis. A number of sequence-based computational methods have been introduced for identifying SDPs; however, their further development and evaluation have been hindered by the limited number of known experimentally determined SDPs. We combine several bioinformatics resources to automate a process, typically undertaken manually, to build a dataset of SDPs. The resulting large dataset, which consists of SDPs in enzymes, enables us to characterize SDPs in terms of their physicochemical and evolutionary properties. It also facilitates the large-scale evaluation of sequence-based SDP prediction methods. We present a simple sequence-based SDP prediction method, GroupSim, and show that, surprisingly, it is competitive with a representative set of current methods. We also describe ConsWin, a heuristic that considers sequence conservation of neighboring amino acids, and demonstrate that it improves the performance of all methods tested on our large dataset of enzyme SDPs. Datasets and GroupSim code are available online at http://compbio.cs.princeton.edu/specificity/. Supplementary data are available at Bioinformatics online.
Cloning and characterization of the gene for an additional extracellular serine protease of Bacillus subtilis.

PubMed

Sloma, A; Rufo, G A; Theriault, K A; Dwyer, M; Wilson, S W; Pero, J

1991-11-01

We have purified a minor extracellular serine protease from a strain of Bacillus subtilis bearing null mutations in five extracellular protease genes: apr, npr, epr, bpr, and mpr (A. Sloma, C. Rudolph, G. Rufo, Jr., B. Sullivan, K. Theriault, D. Ally, and J. Pero, J. Bacteriol. 172:1024-1029, 1990). During purification, this novel protease (Vpr) was found bound in a complex in the void volume after gel filtration chromatography. The amino-terminal sequence of the purified protein was determined, and an oligonucleotide probe was constructed on the basis of the amino acid sequence. This probe was used to clone the structural gene (vpr) for this protease. The gene encodes a primary product of 806 amino acids. The amino acid sequence of the mature protein was preceded by a signal sequence of approximately 28 amino acids and a prosequence of approximately 132 amino acids. The mature protein has a predicted molecular weight of 68,197; however, the isolated protein has an apparent molecular weight of 28,500, suggesting that Vpr undergoes C-terminal processing or proteolysis. The vpr gene maps in the ctrA-sacA-epr region of the chromosome and is not required for growth or sporulation.
Amino- and carboxyl-terminal amino acid sequences of proteins coded by gag gene of murine leukemia virus

PubMed Central

Oroszlan, Stephen; Henderson, Louis E.; Stephenson, John R.; Copeland, Terry D.; Long, Cedric W.; Ihle, James N.; Gilden, Raymond V.

1978-01-01

The amino- and carboxyl-terminal amino acid sequences of proteins (p10, p12, p15, and p30) coded by the gag gene of Rauscher and AKR murine leukemia viruses were determined. Among these proteins, p15 from both viruses appears to have a blocked amino end. Proline was found to be the common NH2 terminus of both p30s and both p12s, and alanine of both p10s. The amino-terminal sequences of p30s are identical, as are those of p10s, while the p12 sequences are clearly distinctive but also show substantial homology. The carboxyl-terminal amino acids of both viral p30s and p12s are leucine and phenylalanine, respectively. Rauscher leukemia virus p15 has tyrosine as the carboxyl terminus while AKR virus p15 has phenylalanine in this position. The compositional and sequence data provide definite chemical criteria for the identification of analogous gag gene products and for the comparison of viral proteins isolated in different laboratories. On the basis of amino acid sequences and the previously proposed H-p15-p12-p30-p10-COOH peptide sequence in the precursor polyprotein, a model for cleavage sites involved in the post-translational processing of the precursor coded for by the gag gene is proposed. PMID:206897

Implication of the cause of differences in 3D structures of proteins with high sequence identity based on analyses of amino acid sequences and 3D structures.

PubMed

Matsuoka, Masanari; Sugita, Masatake; Kikuchi, Takeshi

2014-09-18

Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem. It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation. The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-07-21

A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.
Poliovirus replication proteins: RNA sequence encoding P3-1b and the sites of proteolytic processing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Semler, B.L.; Anderson, C.W.; Kitamura, N.

1981-06-01

A partial amino-terminal amino acid sequence of each of the major proteins encoded by the replicase region of the poliovirus genome has been determined. A comparison of this sequence information with the amino acid sequence predicted from the RNA sequence that has been determined for the 3' region of the poliovirus genome has allowed us to locate precisely the proteolytic cleavage sites at which the initial polyprotein is processed to create the poliovirus products P3-1b (NCVP1b), P3-2 (NCVP2), P3-4b (NCVP4b), and P3-7c (NCVP7c). For each of these products, as well as for the small genome-linked protein VPg, proteolytic cleavage occursmore » between a glutamine and a glycine residue to create the amino terminus of each protein. This result suggests that a single proteinase may be responsible for all of these cleavages. The sequence data also allow the precise positioning of the genome-linked protein VPg within the precursor P3-1b just proximal to the amino terminus of polypeptide P3-2.« less
The primary structures of ribosomal proteins S14 and S16 from the archaebacterium Halobacterium marismortui. Comparison with eubacterial and eukaryotic ribosomal proteins.

PubMed

Kimura, J; Kimura, M

1987-09-05

The amino acid sequences of two ribosomal proteins, S14 and S16, from the archaebacterium Halobacterium marismortui have been determined. Sequence data were obtained by the manual and solid-phase sequencing of peptides derived from enzymatic digestions with trypsin, chymotrypsin, pepsin, and Staphylococcus aureus protease as well as by chemical cleavage with cyanogen bromide. Proteins S14 and S16 contain 109 and 126 amino acid residues and have Mr values of 11,964 and 13,515, respectively. Comparison of the sequences with those of ribosomal proteins from other organisms demonstrates that S14 has a significant homology with the rat liver ribosomal protein S11 (36% identity) as well as with the Escherichia coli ribosomal protein S17 (37%), and that S16 is related to the yeast ribosomal protein YS22 (40%) and proteins S8 from E. coli (28%) and Bacillus stearothermophilus (30%). A comparison of the amino acid residues in the homologous regions of halophilic and nonhalophilic ribosomal proteins reveals that halophilic proteins have more glutamic acids, asparatic acids, prolines, and alanines, and less lysines, arginines, and isoleucines than their nonhalophilic counterparts. These amino acid substitutions probably contribute to the structural stability of halophilic ribosomal proteins.
The isolation, purification and amino-acid sequence of insulin from the teleost fish Cottus scorpius (daddy sculpin).

PubMed

Cutfield, J F; Cutfield, S M; Carne, A; Emdin, S O; Falkmer, S

1986-07-01

Insulin from the principal islets of the teleost fish, Cottus scorpius (daddy sculpin), has been isolated and sequenced. Purification involved acid/alcohol extraction, gel filtration, and reverse-phase high-performance liquid chromatography to yield nearly 1 mg pure insulin/g wet weight islet tissue. Biological potency was estimated as 40% compared to porcine insulin. The sculpin insulin crystallised in the absence of zinc ions although zinc is known to be present in the islets in significant amounts. Two other hormones, glucagon and pancreatic polypeptide, were copurified with the insulin, and an N-terminal sequence for pancreatic polypeptide was determined. The primary structure of sculpin insulin shows a number of sequence changes unique so far amongst teleost fish. These changes occur at A14 (Arg), A15 (Val), and B2 (Asp). The B chain contains 29 amino acids and there is no N-terminal extension as seen with several other fish. Presumably as a result of the amino acid substitutions, sculpin insulin does not readily form crystals containing zinc-insulin hexamers, despite the presence of the coordinating B10 His.
Molecular cloning of actin genes in Trichomonas vaginalis and phylogeny inferred from actin sequences.

PubMed

Bricheux, G; Brugerolle, G

1997-08-01

The parasitic protozoan Trichomonas vaginalis is known to contain the ubiquitous and highly conserved protein actin. A genomic library and a cDNA library have been screened to identify and clone the actin gene(s) of T. vaginalis. The nucleotide sequence of one gene and its flanking regions have been determined. The open reading frame encodes a protein of 376 amino acids. The sequence is not interrupted by any introns and the promoter could be represented by a 10 bp motif close to a consensus motif also found upstream of most sequenced T. vaginalis genes. The five different clones isolated from the cDNA library have similar sequences and encode three actin proteins differing only by one or two amino acids. A phylogenetic analysis of 31 actin sequences by distance matrix and parsimony methods, using centractin as outgroup, gives congruent trees with Parabasala branching above Diplomonadida.
Improved purification, crystallization and primary structure of pyruvate:ferredoxin oxidoreductase from Halobacterium halobium.

PubMed

Plaga, W; Lottspeich, F; Oesterhelt, D

1992-04-01

An improved purification procedure, including nickel chelate affinity chromatography, is reported which resulted in a crystallizable pyruvate:ferredoxin oxidoreductase preparation from Halobacterium halobium. Crystals of the enzyme were obtained using potassium citrate as the precipitant. The genes coding for pyruvate:ferredoxin oxidoreductase were cloned and their nucleotide sequences determined. The genes of both subunits were adjacent to one another on the halobacterial genome. The derived amino acid sequences were confirmed by partial primary structure analysis of the purified protein. The structural motif of thiamin-diphosphate-binding enzymes was unequivocally located in the deduced amino acid sequence of the small subunit.
Isolation, Cloning, and Expression of an Acid Phosphatase Containing Phosphotyrosyl Phosphatase Activity from Prevotella intermedia

PubMed Central

Chen, Xiaochi; Ansai, Toshihiro; Awano, Shuji; Iida, Toshiya; Barik, Sailen; Takehara, Tadamichi

1999-01-01

A novel acid phosphatase containing phosphotyrosyl phosphatase (PTPase) activity, designated PiACP, from Prevotella intermedia ATCC 25611, an anaerobe implicated in progressive periodontal disease, has been purified and characterized. PiACP, a monomer with an apparent molecular mass of 30 kDa, did not require divalent metal cations for activity and was sensitive to orthovanadate but highly resistant to okadaic acid. The enzyme exhibited substantial activity against tyrosine phosphate-containing peptides derived from the epidermal growth factor receptor. On the basis of N-terminal and internal amino acid sequences of purified PiACP, the gene coding for PiACP was isolated and sequenced. The PiACP gene consisted of 792 bp and coded for a basic protein with an Mr of 29,164. The deduced amino acid sequence exhibited striking similarity (25 to 64%) to those of members of class A bacterial acid phosphatases, including PhoC of Morganella morganii, and involved a conserved phosphatase sequence motif that is shared among several lipid phosphatases and the mammalian glucose-6-phosphatases. The highly conservative motif HCXAGXXR in the active domain of PTPase was not found in PiACP. Mutagenesis of recombinant PiACP showed that His-170 and His-209 were essential for activity. Thus, the class A bacterial acid phosphatases including PiACP may function as atypical PTPases, the biological functions of which remain to be determined. PMID:10559178
The primary structure of rat liver ribosomal protein L37. Homology with yeast and bacterial ribosomal proteins.

PubMed

Lin, A; McNally, J; Wool, I G

1983-09-10

The covalent structure of the rat liver 60 S ribosomal subunit protein L37 was determined. Twenty-four tryptic peptides were purified and the sequence of each was established; they accounted for all 111 residues of L37. The sequence of the first 30 residues of L37, obtained previously by automated Edman degradation of the intact protein, provided the alignment of the first 9 tryptic peptides. Three peptides (CN1, CN2, and CN3) were produced by cleavage of protein L37 with cyanogen bromide. The sequence of CN1 (65 residues) was established from the sequence of secondary peptides resulting from cleavage with trypsin and chymotrypsin. The sequence of CN1 in turn served to order tryptic peptides 1 through 14. The sequence of CN2 (15 residues) was determined entirely by a micromanual procedure and allowed the alignment of tryptic peptides 14 through 18. The sequence of the NH2-terminal 28 amino acids of CN3 (31 residues) was determined; in addition the complete sequences of the secondary tryptic and chymotryptic peptides were done. The sequence of CN3 provided the order of tryptic peptides 18 through 24. Thus the sequence of the three cyanogen bromide peptides also accounted for the 111 residues of protein L37. The carboxyl-terminal amino acids were identified after carboxypeptidase A treatment. There is a disulfide bridge between half-cystinyl residues at positions 40 and 69. Rat liver ribosomal protein L37 is homologous with yeast YP55 and with Escherichia coli L34. Moreover, there is a segment of 17 residues in rat L37 that occurs, albeit with modifications, in yeast YP55 and in E. coli S4, L20, and L34.
The full genome sequences of 8 equine herpesvirus type 4 isolates from horses in Japan.

PubMed

Izume, Satoko; Kirisawa, Rikio; Ohya, Kenji; Ohnuma, Aiko; Kimura, Takashi; Omatsu, Tsutomu; Katayama, Yukie; Mizutani, Tetsuya; Fukushi, Hideto

2017-01-24

Equine herpesvirus type 4 (EHV-4) is one of the most important pathogens in horses. To clarify the key genes of the EHV-4 genome that cause abortion in female horses, we determined the whole genome sequences of a laboratory strain and 7 Japanese EHV-4 isolates that were isolated from 2 aborted fetuses and nasal swabs of 5 horses with respiratory disease. The full genome sequences and predicted amino acid sequences of each gene of these isolates were compared with of the reference EHV-4 strain NS80567 and Australian isolates that were reported in 2015. The EHV-4 isolates clustered in 2 groups which did not reflect their pathogenicity. A comparison of the predicted amino acid sequences of the genes did not reveal any genes that were associated with EHV-4-induced abortion.
Nucleotide sequences of Japanese isolates of citrus vein enation virus.

PubMed

Nakazono-Nagaoka, Eiko; Fujikawa, Takashi; Iwanami, Toru

2017-03-01

The genomic sequences of five Japanese isolates of citrus vein enation virus (CVEV) isolates that induce vein enation were determined and compared with that of the Spanish isolate VE-1. The nucleotide sequences of all Japanese isolates were 5,983 nt in length. The genomic RNA of Japanese isolates had five potential open reading frames (ORF 0, ORF 1, ORF 2, ORF 3, and ORF 5) in the positive-sense strand. The nucleotide sequence identity among the Japanese isolates and Spanish isolate VE-1 ranged from 98.0% to 99.8%. Comparison of the partial amino acid sequences of ten Japanese isolates and three Spanish isolates suggested that four amino acid residues, at positions of 83, 104, and 113 in ORF 2 and position 41 in ORF 5, might be unique to some Japanese isolates.
Synthetic oligonucleotide probes deduced from amino acid sequence data. Theoretical and practical considerations.

PubMed

Lathe, R

1985-05-05

Synthetic probes deduced from amino acid sequence data are widely used to detect cognate coding sequences in libraries of cloned DNA segments. The redundancy of the genetic code dictates that a choice must be made between (1) a mixture of probes reflecting all codon combinations, and (2) a single longer "optimal" probe. The second strategy is examined in detail. The frequency of sequences matching a given probe by chance alone can be determined and also the frequency of sequences closely resembling the probe and contributing to the hybridization background. Gene banks cannot be treated as random associations of the four nucleotides, and probe sequences deduced from amino acid sequence data occur more often than predicted by chance alone. Probe lengths must be increased to confer the necessary specificity. Examination of hybrids formed between unique homologous probes and their cognate targets reveals that short stretches of perfect homology occurring by chance make a significant contribution to the hybridization background. Statistical methods for improving homology are examined, taking human coding sequences as an example, and considerations of codon utilization and dinucleotide frequencies yield an overall homology of greater than 82%. Recommendations for probe design and hybridization are presented, and the choice between using multiple probes reflecting all codon possibilities and a unique optimal probe is discussed.
Determination of the sequences of protein-derived peptides and peptide mixtures by mass spectrometry

PubMed Central

Morris, Howard R.; Williams, Dudley H.; Ambler, Richard P.

1971-01-01

Micro-quantities of protein-derived peptides have been converted into N-acetylated permethyl derivatives, and their sequences determined by low-resolution mass spectrometry without prior knowledge of their amino acid compositions or lengths. A new strategy is suggested for the mass spectrometric sequencing of oligopeptides or proteins, involving gel filtration of protein hydrolysates and subsequent sequence analysis of peptide mixtures. Finally, results are given that demonstrate for the first time the use of mass spectrometry for the analysis of a protein-derived peptide mixture, again without prior knowledge of the protein or components within the mixture. PMID:5158904
Specificity determinants for the abscisic acid response element.

PubMed

Sarkar, Aditya Kumar; Lahiri, Ansuman

2013-01-01

Abscisic acid (ABA) response elements (ABREs) are a group of cis-acting DNA elements that have been identified from promoter analysis of many ABA-regulated genes in plants. We are interested in understanding the mechanism of binding specificity between ABREs and a class of bZIP transcription factors known as ABRE binding factors (ABFs). In this work, we have modeled the homodimeric structure of the bZIP domain of ABRE binding factor 1 from Arabidopsis thaliana (AtABF1) and studied its interaction with ACGT core motif-containing ABRE sequences. We have also examined the variation in the stability of the protein-DNA complex upon mutating ABRE sequences using the protein design algorithm FoldX. The high throughput free energy calculations successfully predicted the ability of ABF1 to bind to alternative core motifs like GCGT or AAGT and also rationalized the role of the flanking sequences in determining the specificity of the protein-DNA interaction.
Biochemical and genetic characterization of enterocin A from Enterococcus faecium, a new antilisterial bacteriocin in the pediocin family of bacteriocins.

PubMed Central

Aymerich, T; Holo, H; Håvarstein, L S; Hugas, M; Garriga, M; Nes, I F

1996-01-01

A new bacteriocin has been isolated from an Enterococcus faecium strain. The bacteriocin, termed enterocin A, was purified to homogeneity as judged by sodium dodecyl sulfate-polyacrylamide gel electrophoresis, N-terminal amino acid sequencing, and mass spectrometry analysis. By combining the data obtained from amino acid and DNA sequencing, the primary structure of enterocin A was determined. It consists of 47 amino acid residues, and the molecular weight was calculated to be 4,829, assuming that the four cysteine residues form intramolecular disulfide bridges. This molecular weight was confirmed by mass spectrometry analysis. The amino acid sequence of enterocin A shared significant homology with a group of bacteriocins (now termed pediocin-like bacteriocins) isolated from a variety of lactic acid-producing bacteria, which include members of the genera Lactobacillus, Pediococcus, Leuconostoc, and Carnobacterium. Sequencing of the structural gene of enterocin A, which is located on the bacterial chromosome, revealed an N-terminal leader sequence of 18 amino acid residues, which was removed during the maturation process. The enterocin A leader belongs to the double-glycine leaders which are found among most other small nonlantibiotic bacteriocins, some lantibiotics, and colicin V. Downstream of the enterocin A gene was located a second open reading frame, encoding a putative protein of 103 amino acid residues. This gene may encode the immunity factor of enterocin A, and it shares 40% identity with a similar open reading frame in the operon of leucocin AUL 187, another pediocin-like bacteriocin. PMID:8633865
Isolation, cloning, and characterization of a partial novel aro A gene in common reed (Phragmites australis).

PubMed

Taravat, Elham; Zebarjadi, Alireza; Kahrizi, Danial; Yari, Kheirollah

2015-05-01

Among the essential amino acids, phenylalanine, tryptophan, and tyrosine are aromatic amino acids which are synthesized by the shikimate pathway in plants and bacteria. Herbicide glyphosate can inhibit the biosynthesis of these amino acids. So, identification of the gene tolerant to glyphosate is very important. It has been shown that the common reed or Phragmites australis Cav. (Poaceae) is relatively tolerant to glyphosate. The aim of the current research is identification, cloning, sequencing, and registering of partial aro A gene of the common reed P. australis. The partial aro A gene of common reed (P. australis) was cloned in Escherichia coli and the amino acid sequence was identified/determined for the first time. This is the first report for isolation, cloning, and sequencing of a part of aro A gene from the common reed. A 670 bp fragment including two introns (86 bp and 289 bp) was obtained. The open reading frame (ORF) region in part of gene was encoded for 98 amino acids. Alignment showed high similarity among this region with Zea mays (L.) (Poaceae) (94.6%), Eleusine indica L. Gaertn (Poaceae) (94.2%), and Zoysia japonica Steud. (Poaceae) (94.2%). The alignment of amino acid sequence of the investigated part of the gene showed a homology with aro A from several other plants. This conserved region forms the enzyme active site. The alignment results of nucleotide and amino acid residues with related sequences showed that there are some differences among them. The relative glyphosate tolerance in the common reed may be related to these differences.
Synthetic signal sequences that enable efficient secretory protein production in the yeast Kluyveromyces marxianus.

PubMed

Yarimizu, Tohru; Nakamura, Mikiko; Hoshida, Hisashi; Akada, Rinji

2015-02-14

Targeting of cellular proteins to the extracellular environment is directed by a secretory signal sequence located at the N-terminus of a secretory protein. These signal sequences usually contain an N-terminal basic amino acid followed by a stretch containing hydrophobic residues, although no consensus signal sequence has been identified. In this study, simple modeling of signal sequences was attempted using Gaussia princeps secretory luciferase (GLuc) in the yeast Kluyveromyces marxianus, which allowed comprehensive recombinant gene construction to substitute synthetic signal sequences. Mutational analysis of the GLuc signal sequence revealed that the GLuc hydrophobic peptide length was lower limit for effective secretion and that the N-terminal basic residue was indispensable. Deletion of the 16th Glu caused enhanced levels of secreted protein, suggesting that this hydrophilic residue defined the boundary of a hydrophobic peptide stretch. Consequently, we redesigned this domain as a repeat of a single hydrophobic amino acid between the N-terminal Lys and C-terminal Glu. Stretches consisting of Phe, Leu, Ile, or Met were effective for secretion but the number of residues affected secretory activity. A stretch containing sixteen consecutive methionine residues (M16) showed the highest activity; the M16 sequence was therefore utilized for the secretory production of human leukemia inhibitory factor protein in yeast, resulting in enhanced secreted protein yield. We present a new concept for the provision of secretory signal sequence ability in the yeast K. marxianus, determined by the number of residues of a single hydrophobic residue located between N-terminal basic and C-terminal acidic amino acid boundaries.
Ultra-deep sequencing reveals high prevalence and broad structural diversity of hepatitis B surface antigen mutations in a global population

PubMed Central

Gencay, Mikael; Hübner, Kirsten; Gohl, Peter; Seffner, Anja; Weizenegger, Michael; Neofytos, Dionysios; Batrla, Richard; Woeste, Andreas; Kim, Hyon-suk; Westergaard, Gaston; Reinsch, Christine; Brill, Eva; Thu Thuy, Pham Thi; Hoang, Bui Huu; Sonderup, Mark; Spearman, C. Wendy; Pabinger, Stephan; Gautier, Jérémie; Brancaccio, Giuseppina; Fasano, Massimo; Santantonio, Teresa; Gaeta, Giovanni B.; Nauck, Markus; Kaminski, Wolfgang E.

2017-01-01

The diversity of the hepatitis B surface antigen (HBsAg) has a significant impact on the performance of diagnostic screening tests and the clinical outcome of hepatitis B infection. Neutralizing or diagnostic antibodies against the HBsAg are directed towards its highly conserved major hydrophilic region (MHR), in particular towards its “a” determinant subdomain. Here, we explored, on a global scale, the genetic diversity of the HBsAg MHR in a large, multi-ethnic cohort of randomly selected subjects with HBV infection from four continents. A total of 1553 HBsAg positive blood samples of subjects originating from 20 different countries across Africa, America, Asia and central Europe were characterized for amino acid variation in the MHR. Using highly sensitive ultra-deep sequencing, we found 72.8% of the successfully sequenced subjects (n = 1391) demonstrated amino acid sequence variation in the HBsAg MHR. This indicates that the global variation frequency in the HBsAg MHR is threefold higher than previously reported. The majority of the amino acid mutations were found in the HBV genotypes B (28.9%) and C (25.4%). Collectively, we identified 345 distinct amino acid mutations in the MHR. Among these, we report 62 previously unknown mutations, which extends the worldwide pool of currently known HBsAg MHR mutations by 22%. Importantly, topological analysis identified the “a” determinant upstream flanking region as the structurally most diverse subdomain of the HBsAg MHR. The highest prevalence of “a” determinant region mutations was observed in subjects from Asia, followed by the African, American and European cohorts, respectively. Finally, we found that more than half (59.3%) of all HBV subjects investigated carried multiple MHR mutations. Together, this worldwide ultra-deep sequencing based genotyping study reveals that the global prevalence and structural complexity of variation in the hepatitis B surface antigen have, to date, been significantly underappreciated. PMID:28472040
Ultra-deep sequencing reveals high prevalence and broad structural diversity of hepatitis B surface antigen mutations in a global population.

PubMed

Gencay, Mikael; Hübner, Kirsten; Gohl, Peter; Seffner, Anja; Weizenegger, Michael; Neofytos, Dionysios; Batrla, Richard; Woeste, Andreas; Kim, Hyon-Suk; Westergaard, Gaston; Reinsch, Christine; Brill, Eva; Thu Thuy, Pham Thi; Hoang, Bui Huu; Sonderup, Mark; Spearman, C Wendy; Pabinger, Stephan; Gautier, Jérémie; Brancaccio, Giuseppina; Fasano, Massimo; Santantonio, Teresa; Gaeta, Giovanni B; Nauck, Markus; Kaminski, Wolfgang E

2017-01-01

The diversity of the hepatitis B surface antigen (HBsAg) has a significant impact on the performance of diagnostic screening tests and the clinical outcome of hepatitis B infection. Neutralizing or diagnostic antibodies against the HBsAg are directed towards its highly conserved major hydrophilic region (MHR), in particular towards its "a" determinant subdomain. Here, we explored, on a global scale, the genetic diversity of the HBsAg MHR in a large, multi-ethnic cohort of randomly selected subjects with HBV infection from four continents. A total of 1553 HBsAg positive blood samples of subjects originating from 20 different countries across Africa, America, Asia and central Europe were characterized for amino acid variation in the MHR. Using highly sensitive ultra-deep sequencing, we found 72.8% of the successfully sequenced subjects (n = 1391) demonstrated amino acid sequence variation in the HBsAg MHR. This indicates that the global variation frequency in the HBsAg MHR is threefold higher than previously reported. The majority of the amino acid mutations were found in the HBV genotypes B (28.9%) and C (25.4%). Collectively, we identified 345 distinct amino acid mutations in the MHR. Among these, we report 62 previously unknown mutations, which extends the worldwide pool of currently known HBsAg MHR mutations by 22%. Importantly, topological analysis identified the "a" determinant upstream flanking region as the structurally most diverse subdomain of the HBsAg MHR. The highest prevalence of "a" determinant region mutations was observed in subjects from Asia, followed by the African, American and European cohorts, respectively. Finally, we found that more than half (59.3%) of all HBV subjects investigated carried multiple MHR mutations. Together, this worldwide ultra-deep sequencing based genotyping study reveals that the global prevalence and structural complexity of variation in the hepatitis B surface antigen have, to date, been significantly underappreciated.

High-Throughput rRNA Gene Sequencing Reveals High and Complex Bacterial Diversity Associated with Brazilian Coffee Bean Fermentation

PubMed Central

Vinícius de Melo, Gilberto

2018-01-01

Summary Coffee bean fermentation is a spontaneous, on-farm process involving the action of different microbial groups, including bacteria and fungi. In this study, high-throughput sequencing approach was employed to study the diversity and dynamics of bacteria associated with Brazilian coffee bean fermentation. The total DNA from fermenting coffee samples was extracted at different time points, and the 16S rRNA gene with segments around the V4 variable region was sequenced by Illumina high-throughput platform. Using this approach, the presence of over eighty bacterial genera was determined, many of which have been detected for the first time during coffee bean fermentation, including Fructobacillus, Pseudonocardia, Pedobacter, Sphingomonas and Hymenobacter. The presence of Fructobacillus suggests an influence of these bacteria on fructose metabolism during coffee fermentation. Temporal analysis showed a strong dominance of lactic acid bacteria with over 97% of read sequences at the end of fermentation, mainly represented by the Leuconostoc and Lactococcus. Metabolism of lactic acid bacteria was associated with the high formation of lactic acid during fermentation, as determined by HPLC analysis. The results reported in this study confirm the underestimation of bacterial diversity associated with coffee fermentation. New microbial groups reported in this study may be explored as functional starter cultures for on-farm coffee processing.
Cloning and baculovirus expression of a desiccation stress gene from the beetle, Tenebrio molitor.

PubMed

Graham, L A; Bendena, W G; Walker, V K

1996-02-01

The cDNA sequence encoding a novel desiccation stress protein (dsp28) found in the hemolymph of the common yellow mealworm beetle, Tenebrio molitor, has been determined. The sequence encodes a 225 amino acid protein containing a 20 amino acid signal peptide. Dsp28 shows no significant similarity to any known nucleic acid or protein sequence. Levels of dsp28 mRNA were found to increase approx 5-fold following desiccation. Dsp28 cDNA has been cloned into a baculovirus expression vector and the expressed protein was compared to native dsp28. Both dsp28 expressed by recombinant baculovirus and native dsp28 are glycosylated and N-terminally processed. Although dsp28 is induced by cold in addition to desiccation stress, it does not contribute to the freezing point depression (thermal hysteresis) observed in Tenebrio hemolymph.
Sex determination: balancing selection in the honey bee.

PubMed

Charlesworth, Deborah

2004-07-27

Sequences of alleles of the honey bee's primary sex-determining gene have extremely high diversity, with many amino acid variants, suggesting that different alleles of this gene have been maintained in populations for very long evolutionary times.
Method for isolating chromosomal DNA in preparation for hybridization in suspension

DOEpatents

Lucas, Joe N.

2000-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. Chromosomal DNA in a sample containing cell debris is prepared for hybridization in suspension by treating the mixture with RNase. The treated DNA can also be fixed prior to hybridization.
Isolation and characterization of a novel acidic matrix protein hic22 from the nacreous layer of the freshwater mussel, Hyriopsis cumingii.

PubMed

Liu, X J; Jin, C; Wu, L M; Dong, S J; Zeng, S M; Li, J L

2016-07-29

Matrix proteins that either weakly acidic or unusually highly acidic have important roles in shell biomineralization. In this study, we have identified and characterized hic22, a weakly acidic matrix protein, from the nacreous layer of Hyriopsis cumingii. Total protein was extracted from the nacre using 5 M EDTA and hic22 was purified using a DEAE-sepharose column. The N-terminal amino acid sequence of hic22 was determined and the complete cDNA encoding hic22 was cloned and sequenced by rapid amplification of cDNA ends-polymerase chain reaction. Finally, the localization and distribution of hic22 was determined by in situ hybridization. Our results revealed that hic22 encodes a 22-kDa protein composed of 185 amino acids. Tissue expression analysis and in situ hybridization indicated that hic22 is expressed in the dorsal epithelial cells of the mantle pallial; moreover, significant expression levels of hic22 were observed after the early formation of the pearl sac (days 19-77), implying that hic22 may play an important role in biomineralization of the nacreous layer.
Molecular cloning and nucleotide sequence of the alpha and beta subunits of allophycocyanin from the cyanelle genome of Cyanophora paradoxa.

PubMed Central

Bryant, D A; de Lorimier, R; Lambert, D H; Dubbs, J M; Stirewalt, V L; Stevens, S E; Porter, R D; Tam, J; Jay, E

1985-01-01

The genes for the alpha- and beta-subunit apoproteins of allophycocyanin (AP) were isolated from the cyanelle genome of Cyanophora paradoxa and subjected to nucleotide sequence analysis. The AP beta-subunit apoprotein gene was localized to a 7.8-kilobase-pair Pst I restriction fragment from cyanelle DNA by hybridization with a tetradecameric oligonucleotide probe. Sequence analysis using that oligonucleotide and its complement as primers for the dideoxy chain-termination sequencing method confirmed the presence of both AP alpha- and beta-subunit genes on this restriction fragment. Additional oligonucleotide primers were synthesized as sequencing progressed and were used to determine rapidly the nucleotide sequence of a 1336-base-pair region of this cloned fragment. This strategy allowed the sequencing to be completed without a detailed restriction map and without extensive and time-consuming subcloning. The sequenced region contains two open reading frames whose deduced amino acid sequences are 81-85% homologous to cyanobacterial and red algal AP subunits whose amino acid sequences have been determined. The two open reading frames are in the same orientation and are separated by 39 base pairs. AP alpha is 5' to AP beta and both coding sequences are preceded by a polypurine, Shine-Dalgarno-type sequence. Sequences upstream from AP alpha closely resemble the Escherichia coli consensus promoter sequences and also show considerable homology to promoter sequences for several chloroplast-encoded psbA genes. A 56-base-pair palindromic sequence downstream from the AP beta gene could play a role in the termination of transcription or translation. The allophycocyanin apoprotein subunit genes are located on the large single-copy region of the cyanelle genome. PMID:2987916
Purification and properties of a third form of anthranilate-5-phosphoribosylpyrophosphate phosphoribosyltransferase from the enterobacteriaceae

DOE Office of Scientific and Technical Information (OSTI.GOV)

Largen, M.; Mills, S.E.; Rowe, J.

1978-01-25

Anthranilate-5-phosphoribosypyrophosphate phosphoribosyltransferase was purified from the bacterium Erwinia carotovora, a member of the Enterobacteriaceae. The enzyme was homogeneous according to the criteria of gel electrophoresis and NH/sub 2/-terminal amino acid sequence analysis. The molecular weight of the enzyme as determined on a calibrated Sephadex G-200 column was 67,000 +- 2,000. Sodium dodecyl sulfate-polyacrylamide gels gave a subunit molecular weight of 40,000 +- 1,000, suggesting that the enzyme was a dimer. A comparison of the NH/sub 2/-terminal sequence of the enzyme with the (previously determined) homologue from Serratia marcescens, a monomer with a molecular weight of 45,000, showed that the largermore » Serratia subunit came into register with amino acid 14 of the Erwinia subunit. The register for the length of the known overlap, 26 amino acids, was highly conserved.« less
Identification of a novel vitivirus from grapevines in New Zealand.

PubMed

Blouin, Arnaud G; Keenan, Sandi; Napier, Kathryn R; Barrero, Roberto A; MacDiarmid, Robin M

2018-01-01

We report a sequence of a novel vitivirus from Vitis vinifera obtained using two high-throughput sequencing (HTS) strategies on RNA. The initial discovery from small-RNA sequencing was confirmed by HTS of the total RNA and Sanger sequencing. The new virus has a genome structure similar to the one reported for other vitiviruses, with five open reading frames (ORFs) coding for the conserved domains described for members of that genus. Phylogenetic analysis of the complete genome sequence confirmed its affiliation to the genus Vitivirus, with the closest described viruses being grapevine virus E (GVE) and Agave tequilana leaf virus (ATLV). However, the virus we report is distinct and shares only 51% amino acid sequence identity with GVE in the replicase polyprotein and 66.8% amino acid sequence identity with ATLV in the coat protein. This is well below the threshold determined by the ICTV for species demarcation, and we propose that this virus represents a new species. It is provisionally named "grapevine virus G".
The N-terminal sequence of ribosomal protein L10 from the archaebacterium Halobacterium marismortui and its relationship to eubacterial protein L6 and other ribosomal proteins.

PubMed

Dijk, J; van den Broek, R; Nasiulas, G; Beck, A; Reinhardt, R; Wittmann-Liebold, B

1987-08-01

The amino-terminal sequence of ribosomal protein L10 from Halobacterium marismortui has been determined up to residue 54, using both a liquid- and a gas-phase sequenator. The two sequences are in good agreement. The protein is clearly homologous to protein HcuL10 from the related strain Halobacterium cutirubrum. Furthermore, a weaker but distinct homology to ribosomal protein L6 from Escherichia coli and Bacillus stearothermophilus can be detected. In addition to 7 identical amino acids in the first 36 residues in all four sequences a number of conservative replacements occurs, of mainly hydrophobic amino acids. In this common region the pattern of conserved amino acids suggests the presence of a beta-alpha fold as it occurs in ribosomal proteins L12 and L30. Furthermore, several potential cases of homology to other ribosomal components of the three ur-kingdoms have been found.
Nucleotide and amino acid variations of tannase gene from different Aspergillus strains.

PubMed

Borrego-Terrazas, J A; Lara-Victoriano, F; Flores-Gallegos, A C; Veana, F; Aguilar, C N; Rodríguez-Herrera, R

2014-08-01

Tannase is an enzyme that catalyses the hydrolysis of ester bonds present in tannins. Most of the scientific reports about this biocatalysis focus on aspects related to tannase production and its recovery; on the other hand, reports assessing the molecular aspects of the tannase gene or protein are scarce. In the present study, a tannase gene fragment from several Aspergillus strains isolated from the Mexican semidesert was sequenced and compared with tannase amino acid sequences reported in NCBI database using bioinformatics tools. The genetic relationship among the different tannase sequences was also determined. A conserved region of 7 amino acids was found with the conserved motif GXSXG common to esterases, in which the active-site serine residue is located. In addition, in Aspergillus niger strains GH1 and PSH, we found an extra codon in the tannase sequences encoding glycine. The tannase gene belonging to semidesert fungal strains followed a neutral evolution path with the formation of 10 haplotypes, of which A. niger GH1 and PSH haplotypes are the oldest.
The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus.

PubMed Central

Gustafson, G; Armour, S L

1986-01-01

The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus (BSMV) has been determined. The sequence is 3289 nucleotides in length and contains four open reading frames (ORFs) which code for proteins of Mr 22,147 (ORF1), Mr 58,098 (ORF2), Mr 17,378 (ORF3), and Mr 14,119 (ORF4). The predicted N-terminal amino acid sequence of the polypeptide encoded by the ORF nearest the 5'-end of the RNA (ORF1) is identical (after the initiator methionine) to the published N-terminal amino acid sequence of BSMV coat protein for 29 of the first 30 amino acids. ORF2 occupies the central portion of the coding region of RNA beta and ORF3 is located at the 3'-end. The ORF4 sequence overlaps the 3'-region of ORF2 and the 5'-region of ORF3 and differs in codon usage from the other three RNA beta ORFs. The coding region of RNA beta is followed by a poly(A) tract and a 238 nucleotide tRNA-like structure which are common to all three BSMV genomic RNAs. Images PMID:3754962
A Score of the Ability of a Three-Dimensional Protein Model to Retrieve Its Own Sequence as a Quantitative Measure of Its Quality and Appropriateness

PubMed Central

Martínez-Castilla, León P.; Rodríguez-Sotres, Rogelio

2010-01-01

Background Despite the remarkable progress of bioinformatics, how the primary structure of a protein leads to a three-dimensional fold, and in turn determines its function remains an elusive question. Alignments of sequences with known function can be used to identify proteins with the same or similar function with high success. However, identification of function-related and structure-related amino acid positions is only possible after a detailed study of every protein. Folding pattern diversity seems to be much narrower than sequence diversity, and the amino acid sequences of natural proteins have evolved under a selective pressure comprising structural and functional requirements acting in parallel. Principal Findings The approach described in this work begins by generating a large number of amino acid sequences using ROSETTA [Dantas G et al. (2003) J Mol Biol 332:449–460], a program with notable robustness in the assignment of amino acids to a known three-dimensional structure. The resulting sequence-sets showed no conservation of amino acids at active sites, or protein-protein interfaces. Hidden Markov models built from the resulting sequence sets were used to search sequence databases. Surprisingly, the models retrieved from the database sequences belonged to proteins with the same or a very similar function. Given an appropriate cutoff, the rate of false positives was zero. According to our results, this protocol, here referred to as Rd.HMM, detects fine structural details on the folding patterns, that seem to be tightly linked to the fitness of a structural framework for a specific biological function. Conclusion Because the sequence of the native protein used to create the Rd.HMM model was always amongst the top hits, the procedure is a reliable tool to score, very accurately, the quality and appropriateness of computer-modeled 3D-structures, without the need for spectroscopy data. However, Rd.HMM is very sensitive to the conformational features of the models' backbone. PMID:20830209
Purification, characterization, gene cloning and nucleotide sequencing of D: -stereospecific amino acid amidase from soil bacterium: Delftia acidovorans.

PubMed

Hongpattarakere, Tipparat; Komeda, Hidenobu; Asano, Yasuhisa

2005-12-01

The D-amino acid amidase-producing bacterium was isolated from soil samples using an enrichment culture technique in medium broth containing D-phenylalanine amide as a sole source of nitrogen. The strain exhibiting the strongest activity was identified as Delftia acidovorans strain 16. This strain produced intracellular D-amino acid amidase constitutively. The enzyme was purified about 380-fold to homogeneity and its molecular mass was estimated to be about 50 kDa, on sodium dodecyl sulfate polyacrylamide gel electrophoresis. The enzyme was active preferentially toward D-amino acid amides rather than their L-counterparts. It exhibited strong amino acid amidase activity toward aromatic amino acid amides including D-phenylalanine amide, D-tryptophan amide and D-tyrosine amide, yet it was not specifically active toward low-molecular-weight D-amino acid amides such as D-alanine amide, L-alanine amide and L-serine amide. Moreover, it was not specifically active toward oligopeptides. The enzyme showed maximum activity at 40 degrees C and pH 8.5 and appeared to be very stable, with 92.5% remaining activity after the reaction was performed at 45 degrees C for 30 min. However, it was mostly inactivated in the presence of phenylmethanesulfonyl fluoride or Cd2+, Ag+, Zn2+, Hg2+ and As3+ . The NH2 terminal and internal amino acid sequences of the enzyme were determined; and the gene was cloned and sequenced. The enzyme gene damA encodes a 466-amino-acid protein (molecular mass 49,860.46 Da); and the deduced amino acid sequence exhibits homology to the D-amino acid amidase from Variovorax paradoxus (67.9% identity), the amidotransferase A subunit from Burkholderia fungorum (50% identity) and other enantioselective amidases.
Complete genome sequence of keunjorong mosaic virus, a potyvirus from Cynanchum wilfordii.

PubMed

Nam, Moon; Lee, Joo-Hee; Choi, Hong Soo; Lim, Hyoun-Sub; Moon, Jae Sun; Lee, Su-Heon

2013-08-01

We have determined the complete genome sequence of keunjorong mosaic virus (KjMV). The KjMV genome is composed of 9,611 nucleotides, excluding the 3'-terminal poly(A) tail. It contains two open reading frames (ORFs), with the large one encoding a polyprotein of 3,070 amino acids and the small overlapping ORF encoding a PIPO protein of 81 amino acids. The KjMV genome shared the highest nucleotide sequence identity (57.5 %) with pepper mottle virus and freesia mosaic virus, two members of the genus Potyvirus. Based on the phylogenetic relatedness to known potyviruses, KjMV appears to be a member of a new species in the genus Potyvirus.
Cloning and characterization of the gene for an additional extracellular serine protease of Bacillus subtilis.

PubMed Central

Sloma, A; Rufo, G A; Theriault, K A; Dwyer, M; Wilson, S W; Pero, J

1991-01-01

We have purified a minor extracellular serine protease from a strain of Bacillus subtilis bearing null mutations in five extracellular protease genes: apr, npr, epr, bpr, and mpr (A. Sloma, C. Rudolph, G. Rufo, Jr., B. Sullivan, K. Theriault, D. Ally, and J. Pero, J. Bacteriol. 172:1024-1029, 1990). During purification, this novel protease (Vpr) was found bound in a complex in the void volume after gel filtration chromatography. The amino-terminal sequence of the purified protein was determined, and an oligonucleotide probe was constructed on the basis of the amino acid sequence. This probe was used to clone the structural gene (vpr) for this protease. The gene encodes a primary product of 806 amino acids. The amino acid sequence of the mature protein was preceded by a signal sequence of approximately 28 amino acids and a prosequence of approximately 132 amino acids. The mature protein has a predicted molecular weight of 68,197; however, the isolated protein has an apparent molecular weight of 28,500, suggesting that Vpr undergoes C-terminal processing or proteolysis. The vpr gene maps in the ctrA-sacA-epr region of the chromosome and is not required for growth or sporulation. Images FIG. 1 PMID:1938892
Molecular characterization of amino acid deletion in VP1 (1D) protein and novel amino acid substitutions in 3D polymerase protein of foot and mouth disease virus subtype A/Iran87.

PubMed

Esmaelizad, Majid; Jelokhani-Niaraki, Saber; Hashemnejad, Khadije; Kamalzadeh, Morteza; Lotfi, Mohsen

2011-12-01

The nucleotide sequence of the VP1 (1D) and partial 3D polymerase (3D(pol)) coding regions of the foot and mouth disease virus (FMDV) vaccine strain A/Iran87, a highly passaged isolate (~150 passages), was determined and aligned with previously published FMDV serotype A sequences. Overall analysis of the amino acid substitutions revealed that the partial 3D(pol) coding region contained four amino acid alterations. Amino acid sequence comparison of the VP1 coding region of the field isolates revealed deletions in the highly passaged Iranian isolate (A/Iran87). The prominent G-H loop of the FMDV VP1 protein contains the conserved arginine-glycine-aspartic acid (RGD) tripeptide, which is a well-known ligand for a specific cell surface integrin. Despite losing the RGD sequence of the VP1 protein and an Asp(26)→Glu substitution in a beta sheet located within a small groove of the 3D(pol) protein, the virus grew in BHK 21 suspension cell cultures. Since this strain has been used as a vaccine strain, it may be inferred that the RGD deletion has no critical role in virus attachment to the cell during the initiation of infection. It is probable that this FMDV subtype can utilize other pathways for cell attachment.
Nucleotide sequence analysis of the L gene of Newcastle disease virus: homologies with Sendai and vesicular stomatitis viruses.

PubMed Central

Yusoff, K; Millar, N S; Chambers, P; Emmerson, P T

1987-01-01

The nucleotide sequence of the L gene of the Beaudette C strain of Newcastle disease virus (NDV) has been determined. The L gene is 6704 nucleotides long and encodes a protein of 2204 amino acids with a calculated molecular weight of 248822. Mung bean nuclease mapping of the 5' terminus of the L gene mRNA indicates that the transcription of the L gene is initiated 11 nucleotides upstream of the translational start site. Comparison with the amino acid sequences of the L genes of Sendai virus and vesicular stomatitis virus (VSV) suggests that there are several regions of homology between the sequences. These data provide further evidence for an evolutionary relationship between the Paramyxoviridae and the Rhabdoviridae. A non-coding sequence of 46 nucleotides downstream of the presumed polyadenylation site of the L gene may be part of a negative strand leader RNA. Images PMID:3035486
5S ribosomal ribonucleic acid sequences in Bacteroides and Fusobacterium: evolutionary relationships within these genera and among eubacteria in general

NASA Technical Reports Server (NTRS)

Van den Eynde, H.; De Baere, R.; Shah, H. N.; Gharbia, S. E.; Fox, G. E.; Michalik, J.; Van de Peer, Y.; De Wachter, R.

1989-01-01

The 5S ribosomal ribonucleic acid (rRNA) sequences were determined for Bacteroides fragilis, Bacteroides thetaiotaomicron, Bacteroides capillosus, Bacteroides veroralis, Porphyromonas gingivalis, Anaerorhabdus furcosus, Fusobacterium nucleatum, Fusobacterium mortiferum, and Fusobacterium varium. A dendrogram constructed by a clustering algorithm from these sequences, which were aligned with all other hitherto known eubacterial 5S rRNA sequences, showed differences as well as similarities with respect to results derived from 16S rRNA analyses. In the 5S rRNA dendrogram, Bacteroides clustered together with Cytophaga and Fusobacterium, as in 16S rRNA analyses. Intraphylum relationships deduced from 5S rRNAs suggested that Bacteroides is specifically related to Cytophaga rather than to Fusobacterium, as was suggested by 16S rRNA analyses. Previous taxonomic considerations concerning the genus Bacteroides, based on biochemical and physiological data, were confirmed by the 5S rRNA sequence analysis.
Frontoxins, three-finger toxins from Micrurus frontalis venom, decrease miniature endplate potential amplitude at frog neuromuscular junction.

PubMed

Moreira, K G; Prates, M V; Andrade, F A C; Silva, L P; Beirão, P S L; Kushmerick, C; Naves, L A; Bloch, C

2010-08-01

Neurotoxicity is a major symptom of envenomation caused by Brazilian coral snake Micrurus frontalis. Due to the small amount of material that can be collected, no neurotoxin has been fully sequenced from this venom. In this work we report six new three-finger like toxins isolated from the venom of the coral snake M. frontalis which we named Frontoxin (FTx) I-VI. Toxins were purified using multiple steps of RP-HPLC. Molecular masses were determined by MALDI-TOF and ESI ion-trap mass spectrometry. The complete amino acid sequence of FTx II, III, IV and V were determined by sequencing of overlapping proteolytic fragments by Edman degradation and by de novo sequencing. The amino acid sequences of FTx I, II, III and VI predict 4 conserved disulphide bonds and structural similarity to previously reported short-chain alpha-neurotoxins. FTx IV and V each contained 10 conserved cysteines and share high similarity with long-chain alpha-neurotoxins. At the frog neuromuscular junction FTx II, III and IV reduced miniature endplate potential amplitudes in a time-and concentration-dependent manner suggesting Frontoxins block nicotinic acetylcholine receptors. Copyright 2010 Elsevier Ltd. All rights reserved.
Gene Cloning and Characterization of the Very Large NAD-Dependent l-Glutamate Dehydrogenase from the Psychrophile Janthinobacterium lividum, Isolated from Cold Soil▿

PubMed Central

Kawakami, Ryushi; Sakuraba, Haruhiko; Ohshima, Toshihisa

2007-01-01

NAD-dependent l-glutamate dehydrogenase (NAD-GDH) activity was detected in cell extract from the psychrophile Janthinobacterium lividum UTB1302, which was isolated from cold soil and purified to homogeneity. The native enzyme (1,065 kDa, determined by gel filtration) is a homohexamer composed of 170-kDa subunits (determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis). Consistent with these findings, gene cloning and sequencing enabled deduction of the amino acid sequence of the subunit, which proved to be comprised of 1,575 amino acids with a combined molecular mass of 169,360 Da. The enzyme from this psychrophile thus appears to belong to the GDH family characterized by very large subunits, like those expressed by Streptomyces clavuligerus and Pseudomonas aeruginosa (about 180 kDa). The entire amino acid sequence of the J. lividum enzyme showed about 40% identity with the sequences from S. clavuligerus and P. aeruginosa enzymes, but the central domains showed higher homology (about 65%). Within the central domain, the residues related to substrate and NAD binding were highly conserved, suggesting that this is the enzyme's catalytic domain. In the presence of NAD, but not in the presence of NADP, this GDH exclusively catalyzed the oxidative deamination of l-glutamate. The stereospecificity of the hydride transfer to NAD was pro-S, which is the same as that of the other known GDHs. Surprisingly, NAD-GDH activity was markedly enhanced by the addition of various amino acids, such as l-aspartate (1,735%) and l-arginine (936%), which strongly suggests that the N- and/or C-terminal domains play regulatory roles and are involved in the activation of the enzyme by these amino acids. PMID:17526698

Gene cloning and characterization of the very large NAD-dependent l-glutamate dehydrogenase from the psychrophile Janthinobacterium lividum, isolated from cold soil.

PubMed

Kawakami, Ryushi; Sakuraba, Haruhiko; Ohshima, Toshihisa

2007-08-01

NAD-dependent l-glutamate dehydrogenase (NAD-GDH) activity was detected in cell extract from the psychrophile Janthinobacterium lividum UTB1302, which was isolated from cold soil and purified to homogeneity. The native enzyme (1,065 kDa, determined by gel filtration) is a homohexamer composed of 170-kDa subunits (determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis). Consistent with these findings, gene cloning and sequencing enabled deduction of the amino acid sequence of the subunit, which proved to be comprised of 1,575 amino acids with a combined molecular mass of 169,360 Da. The enzyme from this psychrophile thus appears to belong to the GDH family characterized by very large subunits, like those expressed by Streptomyces clavuligerus and Pseudomonas aeruginosa (about 180 kDa). The entire amino acid sequence of the J. lividum enzyme showed about 40% identity with the sequences from S. clavuligerus and P. aeruginosa enzymes, but the central domains showed higher homology (about 65%). Within the central domain, the residues related to substrate and NAD binding were highly conserved, suggesting that this is the enzyme's catalytic domain. In the presence of NAD, but not in the presence of NADP, this GDH exclusively catalyzed the oxidative deamination of l-glutamate. The stereospecificity of the hydride transfer to NAD was pro-S, which is the same as that of the other known GDHs. Surprisingly, NAD-GDH activity was markedly enhanced by the addition of various amino acids, such as l-aspartate (1,735%) and l-arginine (936%), which strongly suggests that the N- and/or C-terminal domains play regulatory roles and are involved in the activation of the enzyme by these amino acids.
Fast computational methods for predicting protein structure from primary amino acid sequence

DOEpatents

Agarwal, Pratul Kumar [Knoxville, TN

2011-07-19

The present invention provides a method utilizing primary amino acid sequence of a protein, energy minimization, molecular dynamics and protein vibrational modes to predict three-dimensional structure of a protein. The present invention also determines possible intermediates in the protein folding pathway. The present invention has important applications to the design of novel drugs as well as protein engineering. The present invention predicts the three-dimensional structure of a protein independent of size of the protein, overcoming a significant limitation in the prior art.
Identification and expression analysis of cDNA encoding insulin-like growth factor 2 in horses

PubMed Central

KIKUCHI, Kohta; SASAKI, Keisuke; AKIZAWA, Hiroki; TSUKAHARA, Hayato; BAI, Hanako; TAKAHASHI, Masashi; NAMBO, Yasuo; HATA, Hiroshi; KAWAHARA, Manabu

2017-01-01

Insulin-like growth factor 2 (IGF2) is responsible for a broad range of physiological processes during fetal development and adulthood, but genomic analyses of IGF2 containing the 5ʹ- and 3ʹ-untranslated regions (UTRs) in equines have been limited. In this study, we characterized the IGF2 mRNA containing the UTRs, and determined its expression pattern in the fetal tissues of horses. The complete equine IGF2 mRNA sequence harboring another exon approximately 2.8 kb upstream from the canonical transcription start site was identified as a new transcript variant. As this upstream exon did not contain the start codon, the amino acid sequence was identical to the canonical variant. Analysis of the deduced amino acid sequence revealed that the protein possessed two major domains, IlGF and IGF2_C, and analysis of IGF2 sequence polymorphism in fetal tissues of Hokkaido native horse and Thoroughbreds revealed a single nucleotide polymorphism (T to C transition) at position 398 in Thoroughbreds, which caused an amino acid substitution at position 133 in the IGF2 sequence. Furthermore, the expression pattern of the IGF2 mRNA in the fetal tissues of horses was determined for the first time, and was found to be consistent with those of other species. Taken together, these results suggested that the transcriptional and translational products of the IGF2 gene have conserved functions in the fetal development of mammals, including horses. PMID:29151450
Complete amino acid sequence of the myoglobin from the Pacific sei whale, Balaenoptera borealis.

PubMed

Jones, B N; Rothgeb, T M; England, R D; Gurd, F R

1979-04-25

The complete amino acid sequence of the major component myoglobin from Pacific sei whale, Balaenoptera borealis, was determined by specific cleavage of the protein to obtain large peptides which are readily degraded by the automatic sequencer. The acetimidated apomyoglobin was selectively cleaved at its two methionyl residues with cyanogen bromide and at its three arginyl residues by trypsin. From the sequence analysis of four of these peptides and the apomyoglobin, over 75% of the covalent structure of the protein was obtained. The remainder of the primary structure was determined by the sequence analysis of peptides that resulted from further digestion of the amino-terminal and central cyanogen bromide fragments. The amino-terminal fragment was specifically cleaved at its two tryptophanyl residues with N-chlorosuccinimide and the central cyanogen bromide fragment was cleaved at its glutamyl residues with staphylococcal protease and at its single tyrosyl residue with N-bromosuccinimide. The primary structure of this myoglobin proved identical with that from the gray whale but differs from that of the finback whale at four positions, from that of the minke whale at three positions and from the myoglobin of the humpback whale at one position. The above sequence identities and differences reflect the close taxonomic relationship of these five species of Cetacea.
Complete Amino Acid Sequence of a Copper/Zinc-Superoxide Dismutase from Ginger Rhizome.

PubMed

Nishiyama, Yuki; Fukamizo, Tamo; Yoneda, Kazunari; Araki, Tomohiro

2017-04-01

Superoxide dismutase (SOD) is an antioxidant enzyme protecting cells from oxidative stress. Ginger (Zingiber officinale) is known for its antioxidant properties, however, there are no data on SODs from ginger rhizomes. In this study, we purified SOD from the rhizome of Z. officinale (Zo-SOD) and determined its complete amino acid sequence using N terminal sequencing, amino acid analysis, and de novo sequencing by tandem mass spectrometry. Zo-SOD consists of 151 amino acids with two signature Cu/Zn-SOD motifs and has high similarity to other plant Cu/Zn-SODs. Multiple sequence alignment showed that Cu/Zn-binding residues and cysteines forming a disulfide bond, which are highly conserved in Cu/Zn-SODs, are also present in Zo-SOD. Phylogenetic analysis revealed that plant Cu/Zn-SODs clustered into distinct chloroplastic, cytoplasmic, and intermediate groups. Among them, only chloroplastic enzymes carried amino acid substitutions in the region functionally important for enzymatic activity, suggesting that chloroplastic SODs may have a function distinct from those of SODs localized in other subcellular compartments. The nucleotide sequence of the Zo-SOD coding region was obtained by reverse-translation, and the gene was synthesized, cloned, and expressed. The recombinant Zo-SOD demonstrated pH stability in the range of 5-10, which is similar to other reported Cu/Zn-SODs, and thermal stability in the range of 10-60 °C, which is higher than that for most plant Cu/Zn-SODs but lower compared to the enzyme from a Z. officinale relative Curcuma aromatica.
Dictionary-driven protein annotation

PubMed Central

Rigoutsos, Isidore; Huynh, Tien; Floratos, Aris; Parida, Laxmi; Platt, Daniel

2002-01-01

Computational methods seeking to automatically determine the properties (functional, structural, physicochemical, etc.) of a protein directly from the sequence have long been the focus of numerous research groups. With the advent of advanced sequencing methods and systems, the number of amino acid sequences that are being deposited in the public databases has been increasing steadily. This has in turn generated a renewed demand for automated approaches that can annotate individual sequences and complete genomes quickly, exhaustively and objectively. In this paper, we present one such approach that is centered around and exploits the Bio-Dictionary, a collection of amino acid patterns that completely covers the natural sequence space and can capture functional and structural signals that have been reused during evolution, within and across protein families. Our annotation approach also makes use of a weighted, position-specific scoring scheme that is unaffected by the over-representation of well-conserved proteins and protein fragments in the databases used. For a given query sequence, the method permits one to determine, in a single pass, the following: local and global similarities between the query and any protein already present in a public database; the likeness of the query to all available archaeal/bacterial/eukaryotic/viral sequences in the database as a function of amino acid position within the query; the character of secondary structure of the query as a function of amino acid position within the query; the cytoplasmic, transmembrane or extracellular behavior of the query; the nature and position of binding domains, active sites, post-translationally modified sites, signal peptides, etc. In terms of performance, the proposed method is exhaustive, objective and allows for the rapid annotation of individual sequences and full genomes. Annotation examples are presented and discussed in Results, including individual queries and complete genomes that were released publicly after we built the Bio-Dictionary that is used in our experiments. Finally, we have computed the annotations of more than 70 complete genomes and made them available on the World Wide Web at http://cbcsrv.watson.ibm.com/Annotations/. PMID:12202776
Teaching Biology around Themes: Teach Proteins and DNA Together.

ERIC Educational Resources Information Center

Offner, Susan

1992-01-01

Proposes as a unifying theme for high school biology the question of "how chromosomes determine what we are." Describes a sequence of lessons in which students learn about proteins, enzymes, and amino acids. Includes three dry laboratory exercises to demonstrate the DNA sequences for sickle cell anemia and cystic fibrosis. (MDH)
Transposon Tn10 contains two structural genes with opposite polarity between tetA and IS10R.

PubMed Central

Schollmeier, K; Hillen, W

1984-01-01

The nucleotide sequence of the central part of Tn10 has been determined from the rightmost HindIII site to IS10R. This sequence contains two open reading frames with opposite polarity. The in vivo transcription start points in this sequence have been determined by S1 mapping. These results define one minor and two major promoters. The transcription starts of the two major promoters are only 18 base pairs apart, and the transcripts show different polarity and overlap by 18 base pairs. The nucleotide sequence reveals two regions with palindromic symmetry which may serve as operators. Their possible involvement in the regulation of transcription of both genes is discussed. Taken together these results allow for a maximal coding capacity of 138 amino acids directed toward IS10R and 197 amino acids directed toward tetA. The possible function of these gene products is discussed. The accompanying article (Braus et al., J. Bacteriol. 160:504-509, 1984) presents evidence that these genes are expressed. Images PMID:6094471
Venom characterization of the Amazonian scorpion Tityus metuendus.

PubMed

Batista, C V F; Martins, J G; Restano-Cassulini, R; Coronas, F I V; Zamudio, F Z; Procópio, R; Possani, L D

2018-03-01

The soluble venom from the scorpion Tityus metuendus was characterized by various methods. In vivo experiments with mice showed that it is lethal. Extended electrophysiological recordings using seven sub-types of human voltage gated sodium channels (hNav1.1 to 1.7) showed that it contains both α- and β-scorpion toxin types. Fingerprint analysis by mass spectrometry identified over 200 distinct molecular mass components. At least 60 sub-fractions were recovered from HPLC separation. Five purified peptides were sequenced by Edman degradation, and their complete primary structures were determined. Additionally, three other peptides have had their N-terminal amino acid sequences determined by Edman degradation and reported. Mass spectrometry analysis of tryptic digestion of the soluble venom permitted the identification of the amino acid sequence of 111 different peptides. Search for similarities of the sequences found indicated that they probably are: sodium and potassium channel toxins, metalloproteinases, hyaluronidases, endothelin and angiotensin-converting enzymes, bradykinin-potentiating peptide, hypothetical proteins, allergens, other enzymes, other proteins and peptides. Copyright © 2018 Elsevier Ltd. All rights reserved.
A complementarity-determining region synthetic peptide acts as a miniantibody and neutralizes human immunodeficiency virus type 1 in vitro.

PubMed Central

Levi, M; Sällberg, M; Rudén, U; Herlyn, D; Maruyama, H; Wigzell, H; Marks, J; Wahren, B

1993-01-01

A complementarity-determining region (CDR) of the mouse monoclonal antibody (mAb) F58 was constructed with specificity to a neutralization-inducing region of human immunodeficiency virus type 1 (HIV-1). The mAb has its major reactivity to the amino acid sequence I--GPGRA in the V3 viral envelope region. All CDRs including several framework amino acids were synthesized from the sequence deduced by cloning and sequencing mAb F58 heavy- and light-chain variable domains. Peptides derived from the third heavy-chain domain (CDR-H3) alone or in combination with the other CDR sequences competed with F58 mAb for the V3 region. The CDR-H3 peptide was chemically modified by cyclization and then inhibited HIV-1 replication as well as syncytium formation by infected cells. Both the homologous IIIB viral strain to which the F58 mAb was induced and the heterologous SF2 strain were inhibited. This synthetic peptide had unexpectedly potent antiviral activity and may be a potential tool for treatment of HIV-infected persons. PMID:7685100
Investigating intermolecular forces associated with thrombus initiation using optical tweezers

NASA Astrophysics Data System (ADS)

Arya, Maneesh; Lopez, Jose A.; Romo, Gabriel M.; Dong, Jing-Fei; McIntire, Larry V.; Moake, Joel L.; Anvari, Bahman

2002-05-01

Thrombus formation occurs when a platelet membrane receptor, glycoprotein (GP) Ib-IX-V complex, binds to its ligand, von Willebrand factor (vWf), in the subendothelium or plasma. To determine which GP Ib-IX-V amino acid sequences are critical for bond formation, we have used optical tweezers to measure forces involved in the binding of vWf to GP Ib-IX-V variants. Inasmuch as GP Ib(alpha) subunit is the primary component in human GP Ib-IX-V complex that binds to vWf, and that canine GP Ib(alpha) , on the other hand, does not bind to human vWf, we progressively replaced human GP Ib(alpha) amino acid sequences with canine GP Ib(alpha) sequences to determine the sequences essential for vWf/GP Ib(alpha) binding. After measuring the adhesive forces between optically trapped, vWf-coated beads and GP Ib(alpha) variants expressed on mammalian cells, we determined that leucine- rich repeat 2 of GP Ib(alpha) was necessary for vWf/GP Ib-IX- V bond formation. We also found that deletion of the N- terminal flanking sequence and leucine-rich repeat 1 reduced adhesion strength to vWf but did not abolish binding. While divalent cations are known to influence binding of vWf, addition of 1mM CaCl2 had no effect on measured vWf/GP Ib(alpha) bond strengths.
Improving protein complex classification accuracy using amino acid composition profile.

PubMed

Huang, Chien-Hung; Chou, Szu-Yu; Ng, Ka-Lok

2013-09-01

Protein complex prediction approaches are based on the assumptions that complexes have dense protein-protein interactions and high functional similarity between their subunits. We investigated those assumptions by studying the subunits' interaction topology, sequence similarity and molecular function for human and yeast protein complexes. Inclusion of amino acids' physicochemical properties can provide better understanding of protein complex properties. Principal component analysis is carried out to determine the major features. Adopting amino acid composition profile information with the SVM classifier serves as an effective post-processing step for complexes classification. Improvement is based on primary sequence information only, which is easy to obtain. Copyright © 2013 Elsevier Ltd. All rights reserved.
Characterization of Stearoyl-CoA Desaturases from a Psychrophilic Antarctic Copepod, Tigriopus kingsejongensis.

PubMed

Jung, Woongsic; Kim, Eun Jae; Han, Se Jong; Choi, Han-Gu; Kim, Sanghee

2016-10-01

Stearoyl-CoA desaturase is a key regulator in fatty acid metabolism that catalyzes the desaturation of stearic acid to oleic acid and controls the intracellular levels of monounsaturated fatty acids (MUFAs). Two stearoyl-CoA desaturases (SCD, Δ9 desaturases) genes were identified in an Antarctic copepod, Tigriopus kingsejongensis, that was collected in a tidal pool near the King Sejong Station, King George Island, Antarctica. Full-length complementary DNA (cDNA) sequences of two T. kingsejongensis SCDs (TkSCDs) were obtained from next-generation sequencing and isolated by reverse transcription PCR. DNA sequence lengths of the open reading frames of TkSCD-1 and TkSCD-2 were determined to be 1110 and 681 bp, respectively. The molecular weights deduced from the corresponding genes were estimated to be 43.1 kDa (TkSCD-1) and 26.1 kDa (TkSCD-2). The amino acid sequences were compared with those of fatty acid desaturases and sterol desaturases from various organisms and used to analyze the relationships among TkSCDs. As assessed by heterologous expression of recombinant proteins in Escherichia coli, the enzymatic functions of both stearoyl-CoA desaturases revealed that the amount of C16:1 and C18:1 fatty acids increased by greater than 3-fold after induction with isopropyl β-D-thiogalactopyranoside. In particular, C18:1 fatty acid production increased greater than 10-fold in E. coli expressing TkSCD-1 and TkSCD-2. The results of this study suggest that both SCD genes from an Antarctic marine copepod encode a functional desaturase that is capable of increasing the amounts of palmitoleic acid and oleic acid in a prokaryotic expression system.
Molecular mechanisms of adaptation emerging from the physics and evolution of nucleic acids and proteins.

PubMed

Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N

2014-03-01

DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea.
Cloning and expression of UDP-glucose: flavonoid 7-O-glucosyltransferase from hairy root cultures of Scutellaria baicalensis.

PubMed

Hirotani, M; Kuroda, R; Suzuki, H; Yoshikawa, T

2000-05-01

A cDNA encoding UDP-glucose: baicalein 7-O-glucosyltransferase (UBGT) was isolated from a cDNA library from hairy root cultures of Scutellaria baicalensis Georgi probed with a partial-length cDNA clone of a UDP-glucose: flavonoid 3-O-glucosyltransferase (UFGT) from grape (Vitis vinifera L.). The heterologous probe contained a glucosyltransferase consensus amino acid sequence which was also present in the Scutellaria cDNA clones. The complete nucleotide sequence of the 1688-bp cDNA insert was determined and the deduced amino acid sequences are presented. The nucleotide sequence analysis of UBGT revealed an open reading frame encoding a polypeptide of 476 amino acids with a calculated molecular mass of 53,094 Da. The reaction product for baicalein and UDP-glucose catalyzed by recombinant UBGT in Escherichia coli was identified as authentic baicalein 7-O-glucoside using high-performance liquid chromatography and proton nuclear magnetic resonance spectroscopy. The enzyme activities of recombinant UBGT expressed in E. coli were also detected towards flavonoids such as baicalein, wogonin, apigenin, scutellarein, 7,4'-dihydroxyflavone and kaempferol, and phenolic compounds. The accumulation of UBGT mRNA in hairy roots was in response to wounding or salicylic acid treatments.
Molecular mechanisms of adaptation emerging from the physics and evolution of nucleic acids and proteins

PubMed Central

Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N.

2014-01-01

DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea. PMID:24371267
Amino acid sequence of tyrosinase from Neurospora crassa.

PubMed Central

Lerch, K

1978-01-01

The amino-acid sequence of tyrosinase from Neurospora crassa (monophenol,dihydroxyphenylalanine:oxygen oxidoreductase, EC 1.14.18.1) is reported. This copper-containing oxidase consists of a single polypeptide chain of 407 amino acids. The primary structure was determined by automated and manual sequence analysis on fragments produced by cleavage with cyanogen bromide and on peptides obtained by digestion with trypsin, pepsin, thermolysin, or chymotrypsin. The amino terminus of the protein is acetylated and the single cysteinyl residue 96 is covalently linked via a thioether bridge to histidyl residue 94. The formation and the possible role of this unusual structure in Neurospora tyrosinase is discussed. Dye-sensitized photooxidation of apotyrosinase and active-site-directed inactivation of the native enzyme indicate the possible involvement of histidyl residues 188, 192, 289, and 305 or 306 as ligands to the active-site copper as well as in the catalytic mechanism of this monooxygenase. PMID:151279
Canine Lat1: molecular structure, distribution and its expression in cancer samples.

PubMed

Ochiai, Hideharu; Morishita, Taiki; Onda, Ken; Sugiyama, Hiroki; Maruo, Takuya

2012-07-01

A full-length cDNA sequence of canine L-type amino acid transporter 1 (Lat1) was determined from a canine brain. The sequence was 1828 bp long and was predicted to encode 485 amino acid polypeptides. The deduced amino acid sequence of canine Lat1 showed 93.2% and 91.1% similarities to those of humans and rats, respectively. Northern blot analysis detected Lat1 expression in the cerebellum at 4 kb, and Western blot analysis showed a single band at 40 kDa. RT-PCR analysis revealed a distinct expression of Lat1 in the pancreas and testis in addition to the cerebrum and cerebellum. Notably, Lat1 expression was observed in the tissues of thyroid cancer, melanoma and hemangiopericytoma. Although the cancer samples examined were not enough, Lat1 may serve as a useful biomarker of cancer cells in veterinary clinic.
Nucleotide sequence of the gene encoding the nitrogenase iron protein of Thiobacillus ferrooxidans

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pretorius, I.M.; Rawlings, D.E.; O'Neill, E.G.

1987-01-01

The DNA sequence was determined for the cloned Thiobacillus ferrooxidans nifH and part of the nifD genes. The DNA chains were radiolabeled with (..cap alpha..-/sup 32/P)dCTP (3000 Ci/mmol) or (..cap alpha..-/sup 35/S)dCTP (400 Ci/mmol). A putative T. ferrooxidans nifH promoter was identified whose sequences showed perfect consensus with those of the Klebsiella pneumoniae nif promoter. Two putative consensus upstream activator sequences were also identified. The amino acid sequence was deduced from the DNA sequence. In a comparison of nifH DNA sequences from T. ferrooxidans and eight other nitrogen-fixing microbes, a Rhizobium sp. isolated from Parasponia andersonii showed the greatest homologymore » (74%) and Clostridium pasteurianum (nifH1) showed the least homology (54%). In the comparison of the amino acid sequences of the Fe proteins, the Rhizobium sp. and Rhizobium japonicum showed the greatest homology (both 86%) and C. pasteurianum (nifH1 gene product) demonstrated the least homology (56%) to the T. ferrooxidans Fe protein.« less
Analysis of the regulatory region of the protease III (ptr) gene of Escherichia coli K-12.

PubMed

Claverie-Martin, F; Diaz-Torres, M R; Kushner, S R

1987-01-01

The ptr gene of Escherichia coli encodes protease III (Mr 110,000) and a 50-kDa polypeptide, both of which are found in the periplasmic space. The gene is physically located between the recC and recB loci on the E. coli chromosome. The nucleotide sequence of a 1167-bp EcoRV-ClaI fragment of chromosomal DNA containing the promoter region and 885 bp of the ptr coding sequence has been determined. S1 nuclease mapping analysis showed that the major 5' end of the ptr mRNA was localized 127 bp upstream from the ATG start codon. The open reading frame (ORF), preceded by a Shine-Dalgarno sequence, extends to the end of the sequenced DNA. Downstream from the -35 and -10 regions is a sequence that strongly fits the consensus sequence of known nitrogen-regulated promoters. A signal peptide of 23 amino acids residues is present at the N terminus of the derived amino acid sequence. The cleavage site as well as the ORF were confirmed by sequencing the N terminus of mature protease III.

Surface gene variants of hepatitis B Virus in Saudi Patients.

PubMed

Al-Qudari, Ahmed Y; Amer, Haitham M; Abdo, Ayman A; Hussain, Zahid; Al-Hamoudi, Waleed; Alswat, Khalid; Almajhdi, Fahad N

2016-01-01

Hepatitis B virus (HBV) continues to be one of the most important viral pathogens in humans. Surface (S) protein is the major HBV antigen that mediates virus attachment and entry and determines the virus subtype. Mutations in S gene, particularly in the "a" determinant, can influence virus detection by ELISA and may generate escape mutants. Since no records have documented the S gene mutations in HBV strains circulating in Saudi Arabia, the current study was designed to study sequence variation of S gene in strains circulating in Saudi Arabia and its correlation with clinical and risk factors. A total of 123 HBV-infected patients were recruited for this study. Clinical and biochemical parameters, serological markers, and viral load were determined in all patients. The entire S gene sequence of samples with viral load exceeding 2000 IU/mL was retrieved and exploited in sequence and phylogenetic analysis. A total of 48 mutations (21 unique) were recorded in viral strains in Saudi Arabia, among which 24 (11 unique) changed their respective amino acids. Two amino acid changes were recorded in "a" determinant, including F130L and S135F with no evidence of the vaccine escape mutant G145R in any of the samples. No specific relationship was recognized between the mutation/amino acid change record of HBsAg in strains in Saudi Arabia and clinical or laboratory data. Phylogenetic analysis categorized HBV viral strains in Saudi Arabia as members of subgenotypes D1 and D3. The present report is the first that describes mutation analysis of HBsAg in strains in Saudi Arabia on both nucleotide and amino acid levels. Different substitutions, particularly in major hydrophilic region, may have a potential influence on disease diagnosis, vaccination strategy, and antiviral chemotherapy.
Genome analysis and identification of gelatinase encoded gene in Enterobacter aerogenes

NASA Astrophysics Data System (ADS)

Shahimi, Safiyyah; Mutalib, Sahilah Abdul; Khalid, Rozida Abdul; Repin, Rul Aisyah Mat; Lamri, Mohd Fadly; Bakar, Mohd Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, bioinformatic analysis towards genome sequence of E. aerogenes was done to determine gene encoded for gelatinase. Enterobacter aerogenes was isolated from hot spring water and gelatinase species-specific bacterium to porcine and fish gelatin. This bacterium offers the possibility of enzymes production which is specific to both species gelatine, respectively. Enterobacter aerogenes was partially genome sequenced resulting in 5.0 mega basepair (Mbp) total size of sequence. From pre-process pipeline, 87.6 Mbp of total reads, 68.8 Mbp of total high quality reads and 78.58 percent of high quality percentage was determined. Genome assembly produced 120 contigs with 67.5% of contigs over 1 kilo base pair (kbp), 124856 bp of N50 contig length and 55.17 % of GC base content percentage. About 4705 protein gene was identified from protein prediction analysis. Two candidate genes selected have highest similarity identity percentage against gelatinase enzyme available in Swiss-Prot and NCBI online database. They were NODE_9_length_26866_cov_148.013245_12 containing 1029 base pair (bp) sequence with 342 amino acid sequence and NODE_24_length_155103_cov_177.082458_62 which containing 717 bp sequence with 238 amino acid sequence, respectively. Thus, two paired of primers (forward and reverse) were designed, based on the open reading frame (ORF) of selected genes. Genome analysis of E. aerogenes resulting genes encoded gelatinase were identified.
Purification and characterization of gamma poly glutamic acid from newly Bacillus licheniformis NRC20.

PubMed

Tork, Sanaa E; Aly, Magda M; Alakilli, Saleha Y; Al-Seeni, Madeha N

2015-03-01

γ-poly glutamic acid (γ-PGA) has received considerable attention for pharmaceutical and biomedical applications. γ-PGA from the newly isolate Bacillus licheniformis NRC20 was purified and characterized using diffusion distance agar plate, mass spectrometry and thin layer chromatography. All analysis indicated that γ-PGA is a homopolymer composed of glutamic acid. Its molecular weight was determined to be 1266 kDa. It was composed of L- and D-glutamic acid residues. An amplicon of 3050 represents the γ-PGA-coding genes was obtained, sequenced and submitted in genbank database. Its amino acid sequence showed high similarity with that obtained from B. licheniformis strains. The bacterium NRC 20 was independent of L-glutamic acid but the polymer production enhanced when cultivated in medium containing L-glutamic acid as the sole nitrogen source. Finally we can conclude that γ-PGA production from B. licheniformis NRC20 has many promised applications in medicine, industry and nanotechnology. Copyright © 2014 Elsevier B.V. All rights reserved.
Cloning and characterization of two novel DNases from Streptococcus pyogenes.

PubMed

Hasegawa, Tadao; Torii, Keizo; Hashikawa, Shinnosuke; Iinuma, Yoshitsugu; Ohta, Michio

2002-06-01

The proteins in the culture supernatant (exoproteins) from Streptococcus pyogenes serotype M1 were separated by two-dimensional gel electrophoresis, and their N-terminal amino acid sequences were determined. The amino acid sequences were compared to sequences in the S. pyogenes genome database. The coding sequence showed similarity to sequences of two genes, mf2-v ( mf2 variant) and mf3, which had sequence similarity to genes encoding mitogenic factor (MF); MF has DNase activity. The recombinant genes were expressed in Escherichia coli and the proteins were synthesized. Mf2-v and Mf3 had DNase activity. The activity of Mf2-v was localized to the C-terminal half of the protein. The mf3 gene was shown to be present in most clinically isolated strains of S. pyogenes tested, and the mf2gene was detected in 20% of the isolates. The products of the mf2 and mf3 genes in clinically isolated S. pyogenes strains were thus shown to be DNases.
The sequence of sequencers: The history of sequencing DNA

PubMed Central

Heather, James M.; Chain, Benjamin

2016-01-01

Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. PMID:26554401
Nucleic acid arrays and methods of synthesis

DOEpatents

Sabanayagam, Chandran R.; Sano, Takeshi; Misasi, John; Hatch, Anson; Cantor, Charles

2001-01-01

The present invention generally relates to high density nucleic acid arrays and methods of synthesizing nucleic acid sequences on a solid surface. Specifically, the present invention contemplates the use of stabilized nucleic acid primer sequences immobilized on solid surfaces, and circular nucleic acid sequence templates combined with the use of isothermal rolling circle amplification to thereby increase nucleic acid sequence concentrations in a sample or on an array of nucleic acid sequences.
The primary structures of ribosomal proteins L16, L23 and L33 from the archaebacterium Halobacterium marismortui.

PubMed

Hatakeyama, T; Hatakeyama, T; Kimura, M

1988-11-21

The complete amino acid sequences of ribosomal proteins L16, L23 and L33 from the archaebacterium Halobacterium marismortui were determined. The sequences were established by manual sequencing of peptides produced with several proteases as well as by cleavage with dilute HCl. Proteins L16, L23 and L33 consist of 119, 154 and 69 amino acid residues, and their molecular masses are 13,538, 16,812 and 7620 Da, respectively. The comparison of their sequences with those of ribosomal proteins from other organisms revealed that L23 and L33 are related to eubacterial ribosomal proteins from Escherichia coli and Bacillus stearothermophilus, while protein L16 was found to be homologous to a eukaryotic ribosomal protein from yeast. These results provide information about the special phylogenetic position of archaebacteria.
Stability of monomeric Cro variants: Isoenergetic transformation of a type I' to a type II' beta-hairpin by single amino acid replacements.

PubMed

Mollah, A K M M; Stennis, Rhonda L; Mossing, Michael C

2003-05-01

The thermodynamic stabilities of three monomeric variants of the bacteriophage lambda Cro repressor that differ only in the sequence of two amino acids at the apex of an engineered beta-hairpin have been determined. The sequences of the turns are EVK-XX-EVK, where the two central residues are DG, GG, and GT, respectively. Standard-state unfolding free energies, determined from circular dichroism measurements as a function of urea concentration, range from 2.4 to 2.7 kcal/mole, while those determined from guanidine hydrochloride range from 2.8 to 3.3 kcal/mole for the three proteins. Thermal denaturation yields van't Hoff unfolding enthalpies of 36 to 40 kcal /mole at midpoint temperatures in the range of 53 to 58 degrees C. Extrapolation of the thermal denaturation free energies with heat capacities of 400 to 600 cal/mole deg gives good agreement with the parameters determined in denaturant titrations. As predicted from statistical surveys of amino acid replacements in beta-hairpins, energetic barriers to transformation from a type I' turn (DG) to a type II' turn (GT) can be quite small.
NMR structure determination of a synthetic analogue of bacillomycin Lc reveals the strategic role of L-Asn1 in the natural iturinic antibiotics

NASA Astrophysics Data System (ADS)

Volpon, Laurent; Tsan, Pascale; Majer, Zsuzsa; Vass, Elemer; Hollósi, Miklós; Noguéra, Valérie; Lancelin, Jean-Marc; Besson, Françoise

2007-08-01

Iturins are a group of antifungal produced by Bacillus subtilis. All are cyclic lipopeptides with seven α-amino acids of configuration LDDLLDL and one β-amino fatty acid. The bacillomycin L is a member of this family and its NMR structure was previously resolved using the sequence Asp-Tyr-Asn-Ser-Gln-Ser-Thr. In this work, we carefully examined the NMR spectra of this compound and detected an error in the sequence. In fact, Asp1 and Gln5 need to be changed into Asn1 and Glu5, which therefore makes it identical to bacillomycin Lc. As a consequence, it now appears that all iturinic peptides with antibiotic activity share the common β-amino fatty acid 8- L-Asn1- D-Tyr2- D-Asn3 sequence. To better understand the conformational influence of the acidic residue L-Asp1, present, for example in the inactive iturin C, the NMR structure of the synthetic analogue SCP [cyclo ( L-Asp1- D-Tyr2- D-Asn3- L-Ser4- L-Gln5- D-Ser6- L-Thr7-β-Ala8)] was determined and compared with bacillomycin Lc recalculated with the corrected sequence. In both cases, the conformers obtained were separated into two families of similar energy which essentially differ in the number and type of turns. A detailed analysis of both cyclopeptide structures is presented here. In addition, CD and FTIR spectra were performed and confirmed the conformational differences observed by NMR between both cyclopeptides.
Prediction of cis/trans isomerization in proteins using PSI-BLAST profiles and secondary structure information.

PubMed

Song, Jiangning; Burrage, Kevin; Yuan, Zheng; Huber, Thomas

2006-03-09

The majority of peptide bonds in proteins are found to occur in the trans conformation. However, for proline residues, a considerable fraction of Prolyl peptide bonds adopt the cis form. Proline cis/trans isomerization is known to play a critical role in protein folding, splicing, cell signaling and transmembrane active transport. Accurate prediction of proline cis/trans isomerization in proteins would have many important applications towards the understanding of protein structure and function. In this paper, we propose a new approach to predict the proline cis/trans isomerization in proteins using support vector machine (SVM). The preliminary results indicated that using Radial Basis Function (RBF) kernels could lead to better prediction performance than that of polynomial and linear kernel functions. We used single sequence information of different local window sizes, amino acid compositions of different local sequences, multiple sequence alignment obtained from PSI-BLAST and the secondary structure information predicted by PSIPRED. We explored these different sequence encoding schemes in order to investigate their effects on the prediction performance. The training and testing of this approach was performed on a newly enlarged dataset of 2424 non-homologous proteins determined by X-Ray diffraction method using 5-fold cross-validation. Selecting the window size 11 provided the best performance for determining the proline cis/trans isomerization based on the single amino acid sequence. It was found that using multiple sequence alignments in the form of PSI-BLAST profiles could significantly improve the prediction performance, the prediction accuracy increased from 62.8% with single sequence to 69.8% and Matthews Correlation Coefficient (MCC) improved from 0.26 with single local sequence to 0.40. Furthermore, if coupled with the predicted secondary structure information by PSIPRED, our method yielded a prediction accuracy of 71.5% and MCC of 0.43, 9% and 0.17 higher than the accuracy achieved based on the singe sequence information, respectively. A new method has been developed to predict the proline cis/trans isomerization in proteins based on support vector machine, which used the single amino acid sequence with different local window sizes, the amino acid compositions of local sequence flanking centered proline residues, the position-specific scoring matrices (PSSMs) extracted by PSI-BLAST and the predicted secondary structures generated by PSIPRED. The successful application of SVM approach in this study reinforced that SVM is a powerful tool in predicting proline cis/trans isomerization in proteins and biological sequence analysis.
Peptide Analysis Using Tandem Mass Spectrometry

DTIC Science & Technology

1989-06-01

to give pyroglutamic acid during storage, eliminating ammonia. It is almost absent in the spectrum of a freshly-prepared sample and is not seen in...USING TANDEM MASS SPECTROMETRY INTRODUCTION S The objective of the project was to determine the complete amino acid sequence of the large polypeptide...Ubiquitin by use of fast atom bombardment (FAB) ionization and tandem mass spectrometry. The peptide containing 76 amino acid residues was available
Rattlesnake Neurotoxin Structure, Mechanism of Action, Immunology and Molecular Biology

DTIC Science & Technology

1990-09-01

the three peptides present in the acidic subunit, two of which are blocked by pyroglutamate , represents a significant contribution. Others have...the amino acid sequence studies on these two proteins, except for determination of their disulfide bond arrangements. These arrangemoents should be...lysine-49 phospholipase A with key amino acid differences from active phosPholiPases. Notexin isoforms (scutoxins A ard B) have been isolated and
Isolation, cDNA cloning and gene expression of an antibacterial protein from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros.

PubMed

Yang, J; Yamamoto, M; Ishibashi, J; Taniai, K; Yamakawa, M

1998-08-01

An antibacterial protein, designated rhinocerosin, was purified to homogeneity from larvae of the coconut rhinoceros beetle, Oryctes rhinoceros immunized with Escherichia coli. Based on the amino acid sequence of the N-terminal region, a degenerate primer was synthesized and reverse-transcriptase PCR was performed to clone rhinocerosin cDNA. As a result, a 279-bp fragment was obtained. The complete nucleotide sequence was determined by sequencing the extended rhinocerosin cDNA clone by 5' rapid amplification of cDNA ends. The deduced amino acid sequence of the mature portion of rhinocerosin was composed of 72 amino acids without cystein residues and was shown to be rich in glycine (11.1%) and proline (11.1%) residues. Comparison of the deduced amino acid sequence of rhinocerosin with those of other antibacterial proteins indicated that it has 77.8% and 44.6% identity with holotricin 2 and coleoptrecin, respectively. Rhinocerosin had strong antibacterial activity against E. coli, Streptococcus pyogenes, Staphylococcus aureus but not against Pseudomonas aeruginosa. Results of reverse-transcriptase PCR analysis of gene expression in different tissues indicated that the rhinocerosin gene is strongly expressed in the fat body and the Malpighian tubule, and weakly expressed in hemocytes and midgut. In addition, gene expression was inducible by bacteria in the fat body, the Malpighian tubule and hemocyte but constitutive expression was observed in the midgut.
RoboOligo: software for mass spectrometry data to support manual and de novo sequencing of post-transcriptionally modified ribonucleic acids

PubMed Central

Sample, Paul J.; Gaston, Kirk W.; Alfonzo, Juan D.; Limbach, Patrick A.

2015-01-01

Ribosomal ribonucleic acid (RNA), transfer RNA and other biological or synthetic RNA polymers can contain nucleotides that have been modified by the addition of chemical groups. Traditional Sanger sequencing methods cannot establish the chemical nature and sequence of these modified-nucleotide containing oligomers. Mass spectrometry (MS) has become the conventional approach for determining the nucleotide composition, modification status and sequence of modified RNAs. Modified RNAs are analyzed by MS using collision-induced dissociation tandem mass spectrometry (CID MS/MS), which produces a complex dataset of oligomeric fragments that must be interpreted to identify and place modified nucleosides within the RNA sequence. Here we report the development of RoboOligo, an interactive software program for the robust analysis of data generated by CID MS/MS of RNA oligomers. There are three main functions of RoboOligo: (i) automated de novo sequencing via the local search paradigm. (ii) Manual sequencing with real-time spectrum labeling and cumulative intensity scoring. (iii) A hybrid approach, coined ‘variable sequencing’, which combines the user intuition of manual sequencing with the high-throughput sampling of automated de novo sequencing. PMID:25820423
Nucleotide and deduced amino acid sequence of the envelope gene of the Vasilchenko strain of TBE virus; comparison with other flaviviruses.

PubMed

Gritsun, T S; Frolova, T V; Pogodina, V V; Lashkevich, V A; Venugopal, K; Gould, E A

1993-02-01

A strain of tick-borne encephalitis virus known as Vasilchenko (Vs) exhibits relatively low virulence characteristics in monkeys, Syrian hamsters and humans. The gene encoding the envelope glycoprotein of this virus was cloned and sequenced. Alignment of the sequence with those of other known tick-borne flaviviruses and identification of the recognised amino acid genetic marker EHLPTA confirmed its identity as a member of the TBE complex. However, Vs virus was distinguishable from eastern and western tick-borne serotypes by the presence of the sequence AQQ at amino acid positions 232-234 and also by the presence of other specific amino acid substitutions which may be genetic markers for these viruses and could determine their pathogenetic characteristics. When compared with other tick-borne flaviviruses, Vs virus had 12 unique amino acid substitutions including an additional potential glycosylation site at position (315-317). The Vs virus strain shared closest nucleotide and amino acid homology (84.5% and 95.5% respectively) with western and far eastern strains of tick-borne encephalitis virus. Comparison with the far eastern serotype of tick-borne encephalitis virus, by cross-immunoelectrophoresis of Vs virions and PAGE analysis of the extracted virion proteins, revealed differences in surface charge and virus stability that may account for the different virulence characteristics of Vs virus. These results support and enlarge upon previous data obtained from molecular and serological analysis.
Use of conserved key amino acid positions to morph protein folds.

PubMed

Reddy, Boojala V B; Li, Wilfred W; Bourne, Philip E

2002-07-15

By using three-dimensional (3D) structure alignments and a previously published method to determine Conserved Key Amino Acid Positions (CKAAPs) we propose a theoretical method to design mutations that can be used to morph the protein folds. The original Paracelsus challenge, met by several groups, called for the engineering of a stable but different structure by modifying less than 50% of the amino acid residues. We have used the sequences from the Protein Data Bank (PDB) identifiers 1ROP, and 2CRO, which were previously used in the Paracelsus challenge by those groups, and suggest mutation to CKAAPs to morph the protein fold. The total number of mutations suggested is less than 40% of the starting sequence theoretically improving the challenge results. From secondary structure prediction experiments of the proposed mutant sequence structures, we observe that each of the suggested mutant protein sequences likely folds to a different, non-native potentially stable target structure. These results are an early indicator that analyses using structure alignments leading to CKAAPs of a given structure are of value in protein engineering experiments. Copyright 2002 Wiley Periodicals, Inc.
A retrotransposable element from the mosquito Anopheles gambiae .

PubMed Central

Besansky, N J

1990-01-01

A family of middle repetitive elements from the African malaria vector Anopheles gambiae is described. Approximately 100 copies of the element, designated T1Ag, are dispersed in the genome. Full-length elements are 4.6 kilobase pairs in length, but truncation of the 5' end is common. Nucleotide sequences of one full-length, two 5'-truncated, and two 5' ends of T1Ag elements were determined and aligned to define a consensus sequence. Sequence analysis revealed two long, overlapping open reading frames followed by a polyadenylation signal, AATAAA, and a tail consisting of tandem repetitions of the motif TGAAA. No direct or inverted long terminal repeats (LTRs) were detected. The first open reading frame, 442 amino acids in length, includes a domain resembling that of nucleic acid-binding proteins. The second open reading frame, 975 amino acids long, resembles the reverse transcriptases of a category of retrotransposable elements without LTRs, variously termed class II retrotransposons, class III elements or non-LTR retrotransposons. Similarity at the sequence and structural levels places T1Ag in this category. Images PMID:1689457
Heparin Characterization: Challenges and Solutions

NASA Astrophysics Data System (ADS)

Jones, Christopher J.; Beni, Szabolcs; Limtiaco, John F. K.; Langeslay, Derek J.; Larive, Cynthia K.

2011-07-01

Although heparin is an important and widely prescribed pharmaceutical anticoagulant, its high degree of sequence microheterogeneity and size polydispersity make molecular-level characterization challenging. Unlike nucleic acids and proteins that are biosynthesized through template-driven assembly processes, heparin and the related glycosaminoglycan heparan sulfate are actively remodeled during biosynthesis through a series of enzymatic reactions that lead to variable levels of O- and N-sulfonation and uronic acid epimers. As summarized in this review, heparin sequence information is determined through a bottom-up approach that relies on depolymerization reactions, size- and charge-based separations, and sensitive mass spectrometric and nuclear magnetic resonance experiments to determine the structural identity of component oligosaccharides. The structure-elucidation process, along with its challenges and opportunities for future analytical improvements, is reviewed and illustrated for a heparin-derived hexasaccharide.
The amino acid sequences of carboxypeptidases I and II from Aspergillus niger and their stability in the presence of divalent cations.

PubMed

Svendsen, I; Dal Degan, F

1998-09-08

The amino acid sequences of serine carboxypeptidase I (CPD-I) and II (CPD-II), respectively, from Aspergillus niger have been determined by conventional Edman degradation of the reduced and vinylpyridinated enzymes and peptides hereof generated by cleavage with cyanogen bromide, iodobenzoic acid, glutamic acid cleaving enzyme, AspN-endoproteinase and EndoLysC proteinase. CPD-I consists of a single peptide chain of 471 amino acid residues, three disulfide bridges and nine N-glycosylated asparaginyl residues, while CPD-II consists of a single peptide chain of 481 amino acid residues, has three disulfide bridges, one free cysteinyl residue and nine glycosylated asparaginyl residues. The enzymes are closely related to carboxypeptidase S3 from Penicillium janthinellum. Both Ca2+ and Mg2+ stabilize CPD-I as well as CPD-II, at basic pH values, Ca2+ being most effective, while the divalent ions have no effect on the activity of the two enzymes.
13C NMR spectroscopic analysis of poly(electrolyte) cement liquids.

PubMed

Watts, D C

1979-05-01

13C NMR spectroscopy has been applied to the analysis of carboxylic poly-acid cement liquids. Monomer incorporation, composition ratio, sequence statistics, and stereochemical configuration have been considered theoretically, and determined experimentally, from the spectra. Conventionally polymerized poly(acrylic acid) has an approximately random configuration, but other varieties may be synthesized. Two commercial glass-ionomer cement liquids both contain tartaric acid as a chelating additive but the composition of their poly-acids are different. Itaconic acid units, distributed randomly, constitute 21% of the repeating units in one of these polyelectrolytes.

Molecular cloning of pepsinogens A and C from adult newt (Cynops pyrrhogaster) stomach.

PubMed

Inokuchi, Tomofumi; Ikuzawa, Masayuki; Yamazaki, Shin; Watanabe, Yukari; Shiota, Koushiro; Katoh, Takuma; Kobayashi, Ken-Ichiro

2013-08-01

The full-length cDNAs of three pepsinogens (Pgs) were cloned from the stomach of newt, Cynops pyrrhogaster, and nucleotide sequences of the full-length cDNAs were determined. Molecular phylogenetic analysis showed that two Pgs, named PgC1 and PgC2, belong to the pepsinogen C group, and one Pg, named PgA, belongs to the pepsinogen A group. The sequences contain an open reading frame (ORF) encoding 385 amino acid residues for PgC1, 383 amino acid residues for PgC2 and 377 amino acid residues for PgA. In addition, all of the three amino acid sequences conserve some unique characteristics such as six cysteine residues and putative active site two aspartic acid residues. All of the pepsinogen mRNAs were detected in the stomach by RT-PCR but not in other organs. Although a slight difference at the time of the start of expression was seen among the three pepsinogen genes, all of them were expressed in the larval stage after hatching. This is the first report on cloning of pepsinogens from urodele stomach. Copyright © 2013 Elsevier Inc. All rights reserved.
Characterization of gene encoding amylopullulanase from plant-originated lactic acid bacterium, Lactobacillus plantarum L137.

PubMed

Kim, Jong-Hyun; Sunako, Michihiro; Ono, Hisayo; Murooka, Yoshikatsu; Fukusaki, Eiichiro; Yamashita, Mitsuo

2008-11-01

A starch-hydrolyzing lactic acid bacterium, Lactobacillus plantarum L137, was isolated from traditional fermented food made from fish and rice in the Philippines. A gene (apuA) encoding an amylolytic enzyme from Lactobacillus plantarum L137 was cloned, and its nucleotide sequence was determined. The apuA gene consisted of an open reading frame of 6171 bp encoding a protein of 2056 amino acids, the molecular mass of which was calculated to be 215,625 Da. The catalytic domains of amylase and pullulanase were located in the same region within the middle of the N-terminal region. The deduced amino acid sequence revealed four highly conserved regions that are common among amylolytic enzymes. In the N-terminal region, a six-amino-acid sequence (Asp-Ala/Thr-Ala-Asn-Ser-Thr) is repeated 39 times, and a three-amino-acid sequence (Gln-Pro-Thr) is repeated 50 times in the C-terminal region. The apuA gene was subcloned in L. plantarum NCL21, which is a plasmid-cured derivative of the wild-type L137 strain and has no amylopullulanase activity, and the gene was overexpressed under the control of its own promoter. The ApuA enzyme from this recombinant L. plantarum NCL21 harboring apuA gene was purified. The enzyme has both alpha-amylase and pullulanase activities. The N-terminal sequence of the purified enzyme showed that the signal peptide was cleaved at Ala(36) and the molecular mass of the mature extracellular enzyme is 211,537 Da. The major reaction products from soluble starch were maltotriose (G3) and maltotetraose (G4). Only maltotriose (G3) was produced from pullulan. From these results, we concluded that ApuA is an amylolytic enzyme belonging to the amylopullulanase family.
Array-Based Rational Design of Short Peptide Probe-Derived from an Anti-TNT Monoclonal Antibody.

PubMed

Okochi, Mina; Muto, Masaki; Yanai, Kentaro; Tanaka, Masayoshi; Onodera, Takeshi; Wang, Jin; Ueda, Hiroshi; Toko, Kiyoshi

2017-10-09

Complementarity-determining regions (CDRs) are sites on the variable chains of antibodies responsible for binding to specific antigens. In this study, a short peptide probe for recognition of 2,4,6-trinitrotoluene (TNT), was identified by testing sequences derived from the CDRs of an anti-TNT monoclonal antibody. The major TNT-binding site in this antibody was identified in the heavy chain CDR3 by antigen docking simulation and confirmed by an immunoassay using a spot-synthesis based peptide array comprising amino acid sequences of six CDRs in the variable region. A peptide derived from heavy chain CDR3 (RGYSSFIYWF) bound to TNT with a dissociation constant of 1.3 μM measured by surface plasmon resonance. Substitution of selected amino acids with basic residues increased TNT binding while substitution with acidic amino acids decreased affinity, an isoleucine to arginine change showed the greatest improvement of 1.8-fold. The ability to create simple peptide binders of volatile organic compounds from sequence information provided by the immune system in the creation of an immune response will be beneficial for sensor developments in the future.
Isolation and characterization of a new bacteriocin, termed enterocin M, produced by environmental isolate Enterococcus faecium AL41.

PubMed

Mareková, Mária; Lauková, Andrea; Skaugen, Morten; Nes, Ingolf

2007-08-01

The new bacteriocin, termed enterocin M, produced by Enterococcus faecium AL 41 showed a wide spectrum of inhibitory activity against the indicator organisms from different sources. It was purified by (NH4)2SO4 precipitation, cation-exchange chromatography and reverse phase chromatography (FPLC). The purified peptide was sequenced by N-terminal amino acid Edman degradation and a mass spectrometry analysis was performed. By combining the data obtained from amino acid sequence (39 N-terminal amino acid residues was determined) and the molecular weight (determined to be 4628 Da) it was concluded that the purified enterocin M is a new bacteriocin, which is very similar to enterocin P. However, its molecular weight is different from enterocin P (4701.25). Of the first 39 N-terminal residues of enterocin M, valine was found in position 20 and a lysine in position 35, while enterocin P has tryptophane residues in these positions.
Analysis of Protein Thermostability Enhancing Factors in Industrially Important Thermus Bacteria Species

PubMed Central

Kumwenda, Benjamin; Litthauer, Derek; Bishop, Özlem Tastan; Reva, Oleg

2013-01-01

Elucidation of evolutionary factors that enhance protein thermostability is a critical problem and was the focus of this work on Thermus species. Pairs of orthologous sequences of T. scotoductus SA-01 and T. thermophilus HB27, with the largest negative minimum folding energy (MFE) as predicted by the UNAFold algorithm, were statistically analyzed. Favored substitutions of amino acids residues and their properties were determined. Substitutions were analyzed in modeled protein structures to determine their locations and contribution to energy differences using PyMOL and FoldX programs respectively. Dominant trends in amino acid substitutions consistent with differences in thermostability between orthologous sequences were observed. T. thermophilus thermophilic proteins showed an increase in non-polar, tiny, and charged amino acids. An abundance of alanine substituted by serine and threonine, as well as arginine substituted by glutamine and lysine was observed in T. thermophilus HB27. Structural comparison showed that stabilizing mutations occurred on surfaces and loops in protein structures. PMID:24023508
37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

Code of Federal Regulations, 2011 CFR

2011-07-01

... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...
Structural details (kinks and non-α conformations) in transmembrane helices are intrahelically determined and can be predicted by sequence pattern descriptors

PubMed Central

Rigoutsos, Isidore; Riek, Peter; Graham, Robert M.; Novotny, Jiri

2003-01-01

One of the promising methods of protein structure prediction involves the use of amino acid sequence-derived patterns. Here we report on the creation of non-degenerate motif descriptors derived through data mining of training sets of residues taken from the transmembrane-spanning segments of polytopic proteins. These residues correspond to short regions in which there is a deviation from the regular α-helical character (i.e. π-helices, 310-helices and kinks). A ‘search engine’ derived from these motif descriptors correctly identifies, and discriminates amongst instances of the above ‘non-canonical’ helical motifs contained in the SwissProt/TrEMBL database of protein primary structures. Our results suggest that deviations from α-helicity are encoded locally in sequence patterns only about 7–9 residues long and can be determined in silico directly from the amino acid sequence. Delineation of such variations in helical habit is critical to understanding the complex structure–function relationships of polytopic proteins and for drug discovery. The success of our current methodology foretells development of similar prediction tools capable of identifying other structural motifs from sequence alone. The method described here has been implemented and is available on the World Wide Web at http://cbcsrv.watson.ibm.com/Ttkw.html. PMID:12888523
Structural details (kinks and non-alpha conformations) in transmembrane helices are intrahelically determined and can be predicted by sequence pattern descriptors.

PubMed

Rigoutsos, Isidore; Riek, Peter; Graham, Robert M; Novotny, Jiri

2003-08-01

One of the promising methods of protein structure prediction involves the use of amino acid sequence-derived patterns. Here we report on the creation of non-degenerate motif descriptors derived through data mining of training sets of residues taken from the transmembrane-spanning segments of polytopic proteins. These residues correspond to short regions in which there is a deviation from the regular alpha-helical character (i.e. pi-helices, 3(10)-helices and kinks). A 'search engine' derived from these motif descriptors correctly identifies, and discriminates amongst instances of the above 'non-canonical' helical motifs contained in the SwissProt/TrEMBL database of protein primary structures. Our results suggest that deviations from alpha-helicity are encoded locally in sequence patterns only about 7-9 residues long and can be determined in silico directly from the amino acid sequence. Delineation of such variations in helical habit is critical to understanding the complex structure-function relationships of polytopic proteins and for drug discovery. The success of our current methodology foretells development of similar prediction tools capable of identifying other structural motifs from sequence alone. The method described here has been implemented and is available on the World Wide Web at http://cbcsrv.watson.ibm.com/Ttkw.html.
The sequence of sequencers: The history of sequencing DNA.

PubMed

Heather, James M; Chain, Benjamin

2016-01-01

Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Dipeptide frequency/bias analysis identifies conserved sites of nonrandomness shared by cysteine-rich motifs.

PubMed

Campion, S R; Ameen, A S; Lai, L; King, J M; Munzenmaier, T N

2001-08-15

This report describes the application of a simple computational tool, AAPAIR.TAB, for the systematic analysis of the cysteine-rich EGF, Sushi, and Laminin motif/sequence families at the two-amino acid level. Automated dipeptide frequency/bias analysis detects preferences in the distribution of amino acids in established protein families, by determining which "ordered dipeptides" occur most frequently in comprehensive motif-specific sequence data sets. Graphic display of the dipeptide frequency/bias data revealed family-specific preferences for certain dipeptides, but more importantly detected a shared preference for employment of the ordered dipeptides Gly-Tyr (GY) and Gly-Phe (GF) in all three protein families. The dipeptide Asn-Gly (NG) also exhibited high-frequency and bias in the EGF and Sushi motif families, whereas Asn-Thr (NT) was distinguished in the Laminin family. Evaluation of the distribution of dipeptides identified by frequency/bias analysis subsequently revealed the highly restricted localization of the G(F/Y) and N(G/T) sequence elements at two separate sites of extreme conservation in the consensus sequence of all three sequence families. The similar employment of the high-frequency/bias dipeptides in three distinct protein sequence families was further correlated with the concurrence of these shared molecular determinants at similar positions within the distinctive scaffolds of three structurally divergent, but similarly employed, motif modules.
Cloning, sequencing, and expression of cDNA for human. beta. -glucuronidase

DOE Office of Scientific and Technical Information (OSTI.GOV)

Oshima, A.; Kyle, J.W.; Miller, R.D.

1987-02-01

The authors report here the cDNA sequence for human placental ..beta..-glucuronidase (..beta..-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH/sub 2/-terminal amino acid sequence determined for human spleen ..beta..-glucuronidase agreed with that inferred from the DNAmore » sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human ..beta..-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human ..beta..-glucuronidase, demonstrate the existence of two populations of mRNA for ..beta..-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length.« less
Sequence swapping does not result in conformation swapping for the beta4/beta5 and beta8/beta9 beta-hairpin turns in human acidic fibroblast growth factor.

PubMed

Kim, Jaewon; Lee, Jihun; Brych, Stephen R; Logan, Timothy M; Blaber, Michael

2005-02-01

The beta-turn is the most common type of nonrepetitive structure in globular proteins, comprising ~25% of all residues; however, a detailed understanding of effects of specific residues upon beta-turn stability and conformation is lacking. Human acidic fibroblast growth factor (FGF-1) is a member of the beta-trefoil superfold and contains a total of five beta-hairpin structures (antiparallel beta-sheets connected by a reverse turn). beta-Turns related by the characteristic threefold structural symmetry of this superfold exhibit different primary structures, and in some cases, different secondary structures. As such, they represent a useful system with which to study the role that turn sequences play in determining structure, stability, and folding of the protein. Two turns related by the threefold structural symmetry, the beta4/beta5 and beta8/beta9 turns, were subjected to both sequence-swapping and poly-glycine substitution mutations, and the effects upon stability, folding, and structure were investigated. In the wild-type protein these turns are of identical length, but exhibit different conformations. These conformations were observed to be retained during sequence-swapping and glycine substitution mutagenesis. The results indicate that the beta-turn structure at these positions is not determined by the turn sequence. Structural analysis suggests that residues flanking the turn are a primary structural determinant of the conformation within the turn.
Partial De Novo Sequencing and Unusual CID Fragmentation of a 7 kDa, Disulfide-Bridged Toxin

NASA Astrophysics Data System (ADS)

Medzihradszky, Katalin F.; Bohlen, Christopher J.

2012-05-01

A 7 kDa toxin isolated from the venom of the Texas coral snake ( Micrurus tener tener) was subjected to collision-induced dissociation (CID) and electron-transfer dissociation (ETD) analyses both before and after reduction at low pH. Manual and automated approaches to de novo sequencing are compared in detail. Manual de novo sequencing utilizing the combination of high accuracy CID and ETD data and an acid-related cleavage yielded the N-terminal half of the sequence from the reduced species. The intact polypeptide, containing 3 disulfide bridges produced a series of unusual fragments in ion trap CID experiments: abundant internal amino acid losses were detected, and also one of the disulfide-linkage positions could be determined from fragments formed by the cleavage of two bonds. In addition, internal and c-type fragments were also observed.
The structure of cell adhesion molecule uvomorulin. Insights into the molecular mechanism of Ca2+-dependent cell adhesion.

PubMed Central

Ringwald, M; Schuh, R; Vestweber, D; Eistetter, H; Lottspeich, F; Engel, J; Dölz, R; Jähnig, F; Epplen, J; Mayer, S

1987-01-01

We have determined the amino acid sequence of the Ca2+-dependent cell adhesion molecule uvomorulin as it appears on the cell surface. The extracellular part of the molecule exhibits three internally repeated domains of 112 residues which are most likely generated by gene duplication. Each of the repeated domains contains two highly conserved units which could represent putative Ca2+-binding sites. Secondary structure predictions suggest that the putative Ca2+-binding units are located in external loops at the surface of the protein. The protein sequence exhibits a single membrane-spanning region and a cytoplasmic domain. Sequence comparison reveals extensive homology to the chicken L-CAM. Both uvomorulin and L-CAM are identical in 65% of their entire amino acid sequence suggesting a common origin for both CAMs. Images Fig. 1. Fig. 4. Fig. 7. PMID:3501370
Characterization of a Mutant Diphtheria Toxin that is Defective in Binding to Cell Membrane Receptors on Vero Cells

DTIC Science & Technology

1982-08-13

phospholipid, such as phosphatidyllnositolphosphate or phosphatidic acid , or some other phosphorylated membrane molecule. 34 Clearly, it has not been...metabolize lactic acid (40), all represented secondary changes which followed a rapid and irreversible primary Injury (73, 142). Pappenheimer...CR>1197 is taken as 100. Determined from the amino acid sequence (42), From Holmes (74). and lysogeny was demonstrated (11, 66, reviewed in 6, 8
Isolation of prolactin and growth hormone from the pituitary of the holostean fish Amia calva.

PubMed

Dores, R M; Noso, T; Rand-Weaver, M; Kawauchi, H

1993-06-01

Pituitaries from adult male and female Amia calva (Order Holostei) were acid extracted and fractionated by gel filtration column chromatography and reversed-phase high performance liquid chromatography. This two-step isolation procedure yielded homogeneous pools of Amia prolaction (PRL) and growth hormone (GH). The amino acid composition of both purified polypeptides was determined. Primary sequence analysis of the first 22 positions at the N-terminal of Amia PRL revealed that this region has 63% sequence identity with eel PRL-1. The N-terminal region of Amia PRL lacks the disulfide bridge which is characteristic of tetrapod PRLs. Primary sequence analysis of the first 24 positions at the N-terminal of Amia GH revealed that this region has 62% sequence identity with eel GH and 54% sequence identity with both blue shark GH and sea turtle GH. Based on N-terminal analysis, it appears that Amia PRL and GH are more closely related to teleost PRLs and GHs than they are to tetrapod PRLs and GHs.
Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

DOEpatents

Weier, H.U.G.; Gray, J.W.

1995-06-27

A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers and probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity. 18 figs.
Creation of a data base for sequences of ribosomal nucleic acids and detection of conserved restriction endonucleases sites through computerized processing.

PubMed Central

Patarca, R; Dorta, B; Ramirez, J L

1982-01-01

As part of a project pertaining the organization of ribosomal genes in Kinetoplastidae, we have created a data base for published sequences of ribosomal nucleic acids, with information in Spanish. As a first step in their processing, we have written a computer program which introduces the new feature of determining the length of the fragments produced after single or multiple digestion with any of the known restriction enzymes. With this information we have detected conserved SAU 3A sites: (i) at the 5' end of the 5.8S rRNA and at the 3' end of the small subunit rRNA, both included in similar larger sequences; (ii) in the 5.8S rRNA of vertebrates (a second one), which is not present in lower eukaryotes, showing a clear evolutive divergence; and, (iii) at the 5' terminal of the small subunit rRNA, included in a larger conserved sequence. The possible biological importance of these sequences is discussed. PMID:6278402
Repeat sequence chromosome specific nucleic acid probes and methods of preparing and using

DOEpatents

Weier, Heinz-Ulrich G.; Gray, Joe W.

1995-01-01

A primer directed DNA amplification method to isolate efficiently chromosome-specific repeated DNA wherein degenerate oligonucleotide primers are used is disclosed. The probes produced are a heterogeneous mixture that can be used with blocking DNA as a chromosome-specific staining reagent, and/or the elements of the mixture can be screened for high specificity, size and/or high degree of repetition among other parameters. The degenerate primers are sets of primers that vary in sequence but are substantially complementary to highly repeated nucleic acid sequences, preferably clustered within the template DNA, for example, pericentromeric alpha satellite repeat sequences. The template DNA is preferably chromosome-specific. Exemplary primers ard probes are disclosed. The probes of this invention can be used to determine the number of chromosomes of a specific type in metaphase spreads, in germ line and/or somatic cell interphase nuclei, micronuclei and/or in tissue sections. Also provided is a method to select arbitrarily repeat sequence probes that can be screened for chromosome-specificity.
Complete Genome Sequence of Leuconostoc citreum KM20▿

PubMed Central

Kim, Jihyun F.; Jeong, Haeyoung; Lee, Jung-Sook; Choi, Sang-Haeng; Ha, Misook; Hur, Cheol-Goo; Kim, Ji-Sun; Lee, Soohyun; Park, Hong-Seog; Park, Yong-Ha; Oh, Tae Kwang

2008-01-01

Leuconostoc citreum is one of the most prevalent lactic acid bacteria during the manufacturing process of kimchi, the best-known Korean traditional dish. We have determined the complete genome sequence of L. citreum KM20. It consists of a 1.80-Mb chromosome and four circular plasmids and reveals genes likely involved in kimchi fermentation and its probiotic effects. PMID:18281406

Structure-related statistical singularities along protein sequences: a correlation study.

PubMed

Colafranceschi, Mauro; Colosimo, Alfredo; Zbilut, Joseph P; Uversky, Vladimir N; Giuliani, Alessandro

2005-01-01

A data set composed of 1141 proteins representative of all eukaryotic protein sequences in the Swiss-Prot Protein Knowledge base was coded by seven physicochemical properties of amino acid residues. The resulting numerical profiles were submitted to correlation analysis after the application of a linear (simple mean) and a nonlinear (Recurrence Quantification Analysis, RQA) filter. The main RQA variables, Recurrence and Determinism, were subsequently analyzed by Principal Component Analysis. The RQA descriptors showed that (i) within protein sequences is embedded specific information neither present in the codes nor in the amino acid composition and (ii) the most sensitive code for detecting ordered recurrent (deterministic) patterns of residues in protein sequences is the Miyazawa-Jernigan hydrophobicity scale. The most deterministic proteins in terms of autocorrelation properties of primary structures were found (i) to be involved in protein-protein and protein-DNA interactions and (ii) to display a significantly higher proportion of structural disorder with respect to the average data set. A study of the scaling behavior of the average determinism with the setting parameters of RQA (embedding dimension and radius) allows for the identification of patterns of minimal length (six residues) as possible markers of zones specifically prone to inter- and intramolecular interactions.
Thermostable NADP+-Dependent Medium-Chain Alcohol Dehydrogenase from Acinetobacter sp. Strain M-1: Purification and Characterization and Gene Expression in Escherichia coli

PubMed Central

Tani, Akio; Sakai, Yasuyoshi; Ishige, Takeru; Kato, Nobuo

2000-01-01

NADPH-dependent alkylaldehyde reducing enzyme, which was greatly induced by n-hexadecane, from Acinetobacter sp. strain M-1 was purified and characterized. The purified enzyme had molecular masses of 40 kDa as determined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis and 160 kDa as determined by gel filtration chromatography. The enzyme, which was shown to be highly thermostable, was most active toward n-heptanal and could use n-alkylaldehydes ranging from C2 to C14 and several substituted benzaldehydes, including the industrially important compounds cinnamyl aldehyde and anisaldehyde, as substrates. The alrA gene coding for this enzyme was cloned, and its nucleotide sequence was determined. The deduced amino acid sequence encoded by the alrA gene exhibited homology to the amino acid sequences of zinc-containing alcohol dehydrogenases from various sources. The gene could be highly expressed in Escherichia coli, and the product was purified to homogeneity by simpler procedures from the recombinant than from the original host. Our results show that this enzyme can be used for industrial bioconversion of useful alcohols and aldehydes. PMID:11097895
Virulence and molecular polymorphism of Prunus necrotic ringspot virus isolates.

PubMed

Hammond, R W; Crosslin, J M

1998-07-01

Prunus necrotic ringspot virus (PNRSV) occurs as numerous strains or isolates that vary widely in their pathogenic, biophysical and serological properties. Prior attempts to distinguish pathotypes based upon physical properties have not been successful; our approach was to examine the molecular properties that may distinguish these isolates. The nucleic acid sequence was determined from 1.65 kbp RT-PCR products derived from RNA 3 of seven distinct isolates of PNRSV that differ serologically and in pathology on sweet cherry. Sequence comparisons of ORF 3a (putative movement protein) and ORF 3b (coat protein) revealed single nucleotide and amino acid differences with strong correlations to serology and symptom types (pathotypes). Sequence differences between serotypes and pathotypes were also reflected in the overall phylogenetic relationships between the isolates.
DNA sequence of the lymphotropic variant of minute virus of mice, MVM(i), and comparison with the DNA sequence of the fibrotropic prototype strain.

PubMed

Astell, C R; Gardiner, E M; Tattersall, P

1986-02-01

The sequence of molecular clones of the genome of MVM(i), a lymphotropic variant of minute virus of mice, was determined and compared with that of MVM(p), the fibrotropic prototype strain. At the nucleotide level there are 163 base changes: 129 transitions and 34 transversions. Most nucleotide changes are silent, with only 27 amino acids changes predicted, of which 22 are conservative. Notable differences between the MVM(i) and MVM(p) genomes which may account for the cell specificities of these viruses occur within the 3' nontranslated regions. The differences discussed include the absence of a 65-base-pair direct in MVM(i), the presence of only two polyadenylation sites in MVM(i) compared with four in MVM(p), and sequences that bear a resemblance to enhancer sequences. Also included in this paper is an important correction to the MVM(p) sequence (C.R. Astell, M. Thomson, M. Merchlinsky, and D. C. Ward, Nucleic Acids Res. 11:999-1018, 1983).
Purification and sequence analysis of 4-methyl-5-nitrocatechol oxygenase from Burkholderia sp. strain DNT.

PubMed Central

Haigler, B E; Suen, W C; Spain, J C

1996-01-01

4-Methyl-5-nitrocatechol (MNC) is an intermediate in the degradation of 2,4-dinitrotoluene by Burkholderia sp. strain DNT. In the presence of NADPH and oxygen, MNC monooxygenase catalyzes the removal of the nitro group from MNC to form 2-hydroxy-5-methylquinone. The gene (dntB) encoding MNC monooxygenase has been previously cloned and characterized. In order to examine the properties of MNC monooxygenase and to compare it with other enzymes, we sequenced the gene encoding the MNC monooxygenase and purified the enzyme from strain DNT. dntB was localized within a 2.2-kb ApaI DNA fragment. Sequence analysis of this fragment revealed an open reading frame of 1,644 bp with an N-terminal amino acid sequence identical to that of purified MNC monooxygenase from strain DNT. Comparison of the derived amino acid sequences with those of other genes showed that DntB contains the highly conserved ADP and flavin adenine dinucleotide (FAD) binding motifs characteristic of flavoprotein hydroxylases. MNC monooxygenase was purified to homogeneity from strain DNT by anion exchange and gel filtration chromatography. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis revealed a single protein with a molecular weight of 60,200, which is consistent with the size determined from the gene sequence. The native molecular weight determined by gel filtration was 65,000, which indicates that the native enzyme is a monomer. It used either NADH or NADPH as electron donors, and NADPH was the preferred cofactor. The purified enzyme contained 1 mol of FAD per mol of protein, which is also consistent with the detection of an FAD binding motif in the amino acid sequence of DntB. MNC monooxygenase has a narrow substrate specificity. MNC and 4-nitrocatechol are good substrates whereas 3-methyl-4-nitrophenol, 3-methyl-4-nitrocatechol, 4-nitrophenol, 3-nitrophenol, and 4-chlorocatechol were not. These studies suggest that MNC monooxygenase is a flavoprotein that shares some properties with previously studied nitrophenol oxygenases. PMID:8830701
Molecular cloning, structural analysis, and expression in Escherichia coli of a chitinase gene from Enterobacter agglomerans.

PubMed Central

Chernin, L S; De la Fuente, L; Sobolev, V; Haran, S; Vorgias, C E; Oppenheim, A B; Chet, I

1997-01-01

The gene chiA, which codes for endochitinase, was cloned from a soilborne Enterobacter agglomerans. Its complete sequence was determined, and the deduced amino acid sequence of the enzyme designated Chia_Entag yielded an open reading frame coding for 562 amino acids of a 61-kDa precursor protein with a putative leader peptide at its N terminus. The nucleotide and polypeptide sequences of Chia_Entag showed 86.8 and 87.7% identity with the corresponding gene and enzyme, Chia_Serma, of Serratia marcescens, respectively. Homology modeling of Chia_Entag's three-dimensional structure demonstrated that most amino acid substitutions are at solvent-accessible sites. Escherichia coli JM109 carrying the E. agglomerans chiA gene produced and secreted Chia_Entag. The antifungal activity of the secreted endochitinase was demonstrated in vitro by inhibition of Fusarium oxysporum spore germination. The transformed strain inhibited Rhizoctonia solani growth on plates and the root rot disease caused by this fungus in cotton seedlings under greenhouse conditions. PMID:9055404
A new comprehensive method for detection of livestock-related pathogenic viruses using a target enrichment system.

PubMed

Oba, Mami; Tsuchiaka, Shinobu; Omatsu, Tsutomu; Katayama, Yukie; Otomaru, Konosuke; Hirata, Teppei; Aoki, Hiroshi; Murata, Yoshiteru; Makino, Shinji; Nagai, Makoto; Mizutani, Tetsuya

2018-01-08

We tested usefulness of a target enrichment system SureSelect, a comprehensive viral nucleic acid detection method, for rapid identification of viral pathogens in feces samples of cattle, pigs and goats. This system enriches nucleic acids of target viruses in clinical/field samples by using a library of biotinylated RNAs with sequences complementary to the target viruses. The enriched nucleic acids are amplified by PCR and subjected to next generation sequencing to identify the target viruses. In many samples, SureSelect target enrichment method increased efficiencies for detection of the viruses listed in the biotinylated RNA library. Furthermore, this method enabled us to determine nearly full-length genome sequence of porcine parainfluenza virus 1 and greatly increased Breadth, a value indicating the ratio of the mapping consensus length in the reference genome, in pig samples. Our data showed usefulness of SureSelect target enrichment system for comprehensive analysis of genomic information of various viruses in field samples. Copyright © 2017 Elsevier Inc. All rights reserved.
Genome sequence of the white koji mold Aspergillus kawachii IFO 4308, used for brewing the Japanese distilled spirit shochu.

PubMed

Futagami, Taiki; Mori, Kazuki; Yamashita, Ayaka; Wada, Shotaro; Kajiwara, Yasuhiro; Takashita, Hideharu; Omori, Toshiro; Takegawa, Kaoru; Tashiro, Kosuke; Kuhara, Satoru; Goto, Masatoshi

2011-11-01

The filamentous fungus Aspergillus kawachii has traditionally been used for brewing the Japanese distilled spirit shochu. A. kawachii characteristically hyperproduces citric acid and a variety of polysaccharide glycoside hydrolases. Here the genome sequence of A. kawachii IFO 4308 was determined and annotated. Analysis of the sequence may provide insight into the properties of this fungus that make it superior for use in shochu production, leading to the further development of A. kawachii for industrial applications.
Evolution of DMY, a newly emergent male sex-determination gene of medaka fish.

PubMed

Zhang, Jianzhi

2004-04-01

The Japanese medaka fish Oryzias latipes has an XX/XY sex-determination system. The Y-linked sex-determination gene DMY is a duplicate of the autosomal gene DMRT1, which encodes a DM-domain-containing transcriptional factor. DMY appears to have originated recently within Oryzias, allowing a detailed evolutionary study of the initial steps that led to the new gene and new sex-determination system. Here I analyze the publicly available DMRT1 and DMY gene sequences of Oryzias species and report the following findings. First, the synonymous substitution rate in DMY is 1.73 times that in DMRT1, consistent with the male-driven evolution hypothesis. Second, the ratio of the rate of nonsynonymous nucleotide substitution (d(N)) to that of synonymous substitution (d(S)) is significantly higher in DMY than in DMRT1. Third, in DMRT1, the d(N)/d(S) ratio for the DM domain is lower than that for non-DM regions, as expected from the functional importance of the DM domain. But in DMY, the opposite is observed and the DM domain is likely under positive Darwinian selection. Fourth, only one characteristic amino acid distinguishes all DMY sequences from all DMRT1 sequences, suggesting that a single amino acid change may be largely responsible for the establishment of DMY as the male sex-determination gene in medaka fish.
Secretion of alpha 2-plasmin inhibitor is impaired by amino acid deletion in a small region of the molecule.

PubMed

Toyota, S; Hirosawa, S; Aoki, N

1994-02-01

Alpha 2-plasmin inhibitor (alpha 2PI) deficiency Okinawa results from defective secretion of the inhibitor from the liver and appears to be a direct consequence of the deletion of Glu137 in the amino acid sequence of alpha 2PI. To examine the effects of replacing the amino acid occupying position 137 or deleting its neighboring amino acid on alpha 2PI secretion, we used oligonucleotide-directed mutagenesis of alpha 2PI cDNA to change the codon specifying Glu137 or delete a codon specifying its neighboring amino acid. The effects were determined by pulse-chase experiments and by enzyme-linked immunosorbent assay of media from transiently transfected COS-7 cells. Replacement of Glu137 with an amino acid other than Cys had little effect on alpha 2PI secretion. In contrast, deletion of an amino acid in a region spanning a sequence of less than 30 amino acids including positions 127 and 137 severely impaired the secretion. The results suggest that structural integrity of the region, rather than its component amino acids, is important for the intracellular transport and secretion of alpha 2PI.
Investigation of compounds essential for the origin of life

NASA Technical Reports Server (NTRS)

Dayhoff, M. O.; Hunt, L. T.

1983-01-01

Nucleic acid sequencing as a technique to determine the chemical and biological evolution of certain prokaryotic metabolic pathways is discussed. Protein in data and a microbiological organization of the prokaryotes is included.
[Convergent origin of repeats in genes coding for globular proteins. An analysis of the factors determining the presence of inverted and symmetrical repeats].

PubMed

Solov'ev, V V; Kel', A E; Kolchanov, N A

1989-01-01

The factors, determining the presence of inverted and symmetrical repeats in genes coding for globular proteins, have been analysed. An interesting property of genetical code has been revealed in the analysis of symmetrical repeats: the pairs of symmetrical codons corresponded to pairs of amino acids with mostly similar physical-chemical parameters. This property may explain the presence of symmetrical repeats and palindromes only in genes coding for beta-structural proteins-polypeptides, where amino acids with similar physical-chemical properties occupy symmetrical positions. A stochastic model of evolution of polynucleotide sequences has been used for analysis of inverted repeats. The modelling demonstrated that only limiting of sequences (uneven frequencies of used codons) is enough for arising of nonrandom inverted repeats in genes.
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp.

PubMed

Deng, Peng; Tan, Xiaoqing; Wu, Ying; Bai, Qunhua; Jia, Yan; Xiao, Hong

2015-03-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica , which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function.
Cloning and sequence analysis demonstrate the chromate reduction ability of a novel chromate reductase gene from Serratia sp

PubMed Central

DENG, PENG; TAN, XIAOQING; WU, YING; BAI, QUNHUA; JIA, YAN; XIAO, HONG

2015-01-01

The ChrT gene encodes a chromate reductase enzyme which catalyzes the reduction of Cr(VI). The chromate reductase is also known as flavin mononucleotide (FMN) reductase (FMN_red). The aim of the present study was to clone the full-length ChrT DNA from Serratia sp. CQMUS2 and analyze the deduced amino acid sequence and three-dimensional structure. The putative ChrT gene fragment of Serratia sp. CQMUS2 was isolated by polymerase chain reaction (PCR), according to the known FMN_red gene sequence from Serratia sp. AS13. The flanking sequences of the ChrT gene were obtained by high efficiency TAIL-PCR, while the full-length gene of ChrT was cloned in Escherichia coli for subsequent sequencing. The nucleotide sequence of ChrT was submitted onto GenBank under the accession number, KF211434. Sequence analysis of the gene and amino acids was conducted using the Basic Local Alignment Search Tool, and open reading frame (ORF) analysis was performed using ORF Finder software. The ChrT gene was found to be an ORF of 567 bp that encodes a 188-amino acid enzyme with a calculated molecular weight of 20.4 kDa. In addition, the ChrT protein was hypothesized to be an NADPH-dependent FMN_red and a member of the flavodoxin-2 superfamily. The amino acid sequence of ChrT showed high sequence similarity to the FMN reductase genes of Klebsiella pneumonia and Raoultella ornithinolytica, which belong to the flavodoxin-2 superfamily. Furthermore, ChrT was shown to have a 85.6% similarity to the three-dimensional structure of Escherichia coli ChrR, sharing four common enzyme active sites for chromate reduction. Therefore, ChrT gene cloning and protein structure determination demonstrated the ability of the gene for chromate reduction. The results of the present study provide a basis for further studies on ChrT gene expression and protein function. PMID:25667630
Sequencing of the amylopullulanase (apu) gene of Thermoanaerobacter ethanolicus 39E, and identification of the active site by site-directed mutagenesis.

PubMed

Mathupala, S P; Lowe, S E; Podkovyrov, S M; Zeikus, J G

1993-08-05

The complete nucleotide sequence of the gene encoding the dual active amylopullulanase of Thermoanaerobacter ethanolicus 39E (formerly Clostridium thermohydrosulfuricum) was determined. The structural gene (apu) contained a single open reading frame 4443 base pairs in length, corresponding to 1481 amino acids, with an estimated molecular weight of 162,780. Analysis of the deduced sequence of apu with sequences of alpha-amylases and alpha-1,6 debranching enzymes enabled the identification of four conserved regions putatively involved in substrate binding and in catalysis. The conserved regions were localized within a 2.9-kilobase pair gene fragment, which encoded a M(r) 100,000 protein that maintained the dual activities and thermostability of the native enzyme. The catalytic residues of amylopullulanase were tentatively identified by using hydrophobic cluster analysis for comparison of amino acid sequences of amylopullulanase and other amylolytic enzymes. Asp597, Glu626, and Asp703 were individually modified to their respective amide form, or the alternate acid form, and in all cases both alpha-amylase and pullulanase activities were lost, suggesting the possible involvement of 3 residues in a catalytic triad, and the presence of a putative single catalytic site within the enzyme. These findings substantiate amylopullulanase as a new type of amylosaccharidase.
Cloning and sequencing of the gene coding for alcohol dehydrogenase of Bacillus stearothermophilus and rational shift of the optimum pH.

PubMed

Sakoda, H; Imanaka, T

1992-02-01

Using Bacillus subtilis as a host and pTB524 as a vector plasmid, we cloned the thermostable alcohol dehydrogenase (ADH-T) gene (adhT) from Bacillus stearothermophilus NCA1503 and determined its nucleotide sequence. The deduced amino acid sequence (337 amino acids) was compared with the sequences of ADHs from four different origins. The amino acid residues responsible for the catalytic activity of horse liver ADH had been clarified on the basis of three-dimensional structure. Since those catalytic amino acid residues were fairly conserved in ADH-T and other ADHs, ADH-T was inferred to have basically the same proton release system as horse liver ADH. The putative proton release system of ADH-T was elucidated by introducing point mutations at the catalytic amino acid residues, Cys-38 (cysteine at position 38), Thr-40, and His-43, with site-directed mutagenesis. The mutant enzyme Thr-40-Ser (Thr-40 was replaced by serine) showed a little lower level of activity than wild-type ADH-T did. The result indicates that the OH group of serine instead of threonine can also be used for the catalytic activity. To change the pKa value of the putative system, His-43 was replaced by the more basic amino acid arginine. As a result, the optimum pH of the mutant enzyme His-43-Arg was shifted from 7.8 (wild-type enzyme) to 9.0. His-43-Arg exhibited a higher level of activity than wild-type enzyme at the optimum pH.
Cloning and sequencing of the gene coding for alcohol dehydrogenase of Bacillus stearothermophilus and rational shift of the optimum pH.

PubMed Central

Sakoda, H; Imanaka, T

1992-01-01

Using Bacillus subtilis as a host and pTB524 as a vector plasmid, we cloned the thermostable alcohol dehydrogenase (ADH-T) gene (adhT) from Bacillus stearothermophilus NCA1503 and determined its nucleotide sequence. The deduced amino acid sequence (337 amino acids) was compared with the sequences of ADHs from four different origins. The amino acid residues responsible for the catalytic activity of horse liver ADH had been clarified on the basis of three-dimensional structure. Since those catalytic amino acid residues were fairly conserved in ADH-T and other ADHs, ADH-T was inferred to have basically the same proton release system as horse liver ADH. The putative proton release system of ADH-T was elucidated by introducing point mutations at the catalytic amino acid residues, Cys-38 (cysteine at position 38), Thr-40, and His-43, with site-directed mutagenesis. The mutant enzyme Thr-40-Ser (Thr-40 was replaced by serine) showed a little lower level of activity than wild-type ADH-T did. The result indicates that the OH group of serine instead of threonine can also be used for the catalytic activity. To change the pKa value of the putative system, His-43 was replaced by the more basic amino acid arginine. As a result, the optimum pH of the mutant enzyme His-43-Arg was shifted from 7.8 (wild-type enzyme) to 9.0. His-43-Arg exhibited a higher level of activity than wild-type enzyme at the optimum pH. Images PMID:1735726
Mapping the neutralizing epitopes on the glycoprotein of infectious haematopoietic necrosis virus, a fish rhabdovirus

USGS Publications Warehouse

Huang, C.; Chien, M.S.; Landolt, M.L.; Batts, W.; Winton, J.

1996-01-01

Twelve neutralizing monoclonal antibodies (MAbs) against the fish rhabdovirus, infectious haematopoietic necrosis virus (IHNV), were used to select 20 MAb escape mutants. The nucleotide sequence of the entire glycoprotein (G) gene was determined for six mutants representing differing cross-neutralization patterns and each had a single nucleotide change leading to a single amino acid substitution within one of three regions of the protein. These data were used to design nested PCR primers to amplify portions of the G gene of the 14 remaining mutants. When the PCR products from these mutants were sequenced, they also had single nucleotide substitutions coding for amino acid substitutions at the same, or nearby, locations. Of the 20 mutants for which all or part of the glycoprotein gene was sequenced, two MAbs selected mutants with substitutions at amino acids 230-231 (antigenic site I) and the remaining MAbs selected mutants with substitutions at amino acids 272-276 (antigenic site II). Two MAbs that selected mutants mapping to amino acids 272-276, selected other mutants that mapped to amino acids 78-81, raising the possibility that this portion of the N terminus of the protein was part of a discontinuous epitope defining antigenic site II. CLUSTAL alignment of the glycoproteins of rabies virus, vesicular stomatitis virus and IHNV revealed similarities in the location of the neutralizing epitopes and a high degree of conservation among cysteine residues, indicating that the glycoproteins of three different genera of animal rhabdoviruses may share a similar three-dimensional structure in spite of extensive sequence divergence.
The delta-subunit of murine guanine nucleotide exchange factor eIF-2B. Characterization of cDNAs predicts isoforms differing at the amino-terminal end.

PubMed

Henderson, R A; Krissansen, G W; Yong, R Y; Leung, E; Watson, J D; Dholakia, J N

1994-12-02

Protein synthesis in mammalian cells is regulated at the level of the guanine nucleotide exchange factor, eIF-2B, which catalyzes the exchange of eukaryotic initiation factor 2-bound GDP for GTP. We have isolated and sequenced cDNA clones encoding the delta-subunit of murine eIF-2B. The cDNA sequence encodes a polypeptide of 544 amino acids with molecular mass of 60 kDa. Antibodies against a synthetic polypeptide of 30 amino acids deduced from the cDNA sequence specifically react with the delta-subunit of mammalian eIF-2B. The cDNA-derived amino acid sequence shows significant homology with the yeast translational regulator Gcd2, supporting the hypothesis that Gcd2 may be the yeast homolog of the delta-subunit of mammalian eIF-2B. Primer extension studies and anchor polymerase chain reaction analysis were performed to determine the 5'-end of the transcript for the delta-subunit of eIF-2B. Results of these experiments demonstrate two different mRNAs for the delta-subunit of eIF-2B in murine cells. The isolation and characterization of two different full-length cDNAs also predicts the presence of two alternate forms of the delta-subunit of eIF-2B in murine cells. These differ at their amino-terminal end but have identical nucleotide sequences coding for amino acids 31-544.
Sequence determination and analysis of the NSs genes of two tospoviruses.

PubMed

Hallwass, Mariana; Leastro, Mikhail O; Lima, Mirtes F; Inoue-Nagata, Alice K; Resende, Renato O

2012-03-01

The tospoviruses groundnut ringspot virus (GRSV) and zucchini lethal chlorosis virus (ZLCV) cause severe losses in many crops, especially in solanaceous and cucurbit species. In this study, the non-structural NSs gene and the 5'UTRs of these two biologically distinct tospoviruses were cloned and sequenced. The NSs sequence of GRSV and ZLCV were both 1,404 nucleotides long. Pairwise comparison showed that the NSs amino acid sequence of GRSV shared 69.6% identity with that of ZLCV and 75.9% identity with that of TSWV, while the NSs sequence of ZLCV and TSWV shared 67.9% identity. Phylogenetic analysis based on NSs sequences confirmed that these viruses cluster in the American clade.

Isolation and preliminary characterization of a Cd-binding protein from Tenebrio molitor (Coleoptera).

PubMed

Pedersen, S A; Kristiansen, E; Andersen, R A; Zachariassen, K E

2007-04-01

The effect of cadmium (Cd) exposure on Cd-binding ligands was investigated for the first time in a beetle (Coleoptera), using the mealworm Tenebrio molitor (L) as a model species. Exposure to Cd resulted in an approximate doubling of the Cd-binding capacity of the protein extracts from whole animals. Analysis showed that the increase was mainly explained by the induction of a Cd-binding protein of 7134.5 Da, with non-metallothionein characteristics. Amino acid analysis and de novo sequencing revealed that the protein has an unusually high content of the acidic amino acids aspartic and glutamic acid that may explain how this protein can bind Cd even without cysteine residues. Similarities in the amino acid composition suggest it to belong to a group of little studied proteins often referred to as "Cd-binding proteins without high cysteine content". This is the first report on isolation and peptide sequence determination of such a protein from a coleopteran.
Glucocorticoid Regulation of Rat Renal Sodium Potassium Adenosine Triphosphatase

DTIC Science & Technology

1990-03-29

sequences; restriction enzymes fluorescein isothiocyanate glomerular filtration rate Horseradish Peroxidase immunoglobulin G kllodalton Magnesium...studies were conducted, in this project , to determine whether the observed changes in NaK-ATPase activity occurred after, and possibly as the result of...excitable tissue required for nerve impulse transmission and 6 muscle contraction (Skou, 1957), the functioning of hepatic amino acid and bile acid
Relative stability of major types of beta-turns as a function of amino acid composition: a study based on Ab initio energetic and natural abundance data.

PubMed

Perczel, András; Jákli, Imre; McAllister, Michael A; Csizmadia, Imre G

2003-06-06

Folding properties of small globular proteins are determined by their amino acid sequence (primary structure). This holds both for local (secondary structure) and for global conformational features of linear polypeptides and proteins composed from natural amino acid derivatives. It thus provides the rational basis of structure prediction algorithms. The shortest secondary structure element, the beta-turn, most typically adopts either a type I or a type II form, depending on the amino acid composition. Herein we investigate the sequence-dependent folding stability of both major types of beta-turns using simple dipeptide models (-Xxx-Yyy-). Gas-phase ab initio properties of 16 carefully selected and suitably protected dipeptide models (for example Val-Ser, Ala-Gly, Ser-Ser) were studied. For each backbone fold most probable side-chain conformers were considered. Fully optimized 321G RHF molecular structures were employed in medium level [B3LYP/6-311++G(d,p)//RHF/3-21G] energy calculations to estimate relative populations of the different backbone conformers. Our results show that the preference for beta-turn forms as calculated by quantum mechanics and observed in Xray determined proteins correlates significantly.
RNAi trigger fragment truncation attenuates soybean FAD2-1 transcript suppression and yields intermediate oil phenotypes.

PubMed

Wagner, Nicholas; Mroczka, Andrew; Roberts, Peter D; Schreckengost, William; Voelker, Toni

2011-09-01

Suppression of the microsomal ω6 oleate desaturase during the seed development of soybean (Glycine max) with the 420-bp soybean FAD2-1A intron as RNAi trigger shifts the conventional fatty acid composition of soybean oil from 20% oleic and 60% polyunsaturates to one containing greater than 80% oleic acid and less than 10% polyunsaturates. To determine whether RNAi could be attenuated by reducing the trigger fragment length, transgenic plants were generated to express successively shorter 5' or 3' deletion derivatives of the FAD2-1A intron. We observed a gradual reduction in transcript suppression with shorter trigger fragments. Fatty acid composition was less affected with shorter triggers, and triggers less than 60 bp had no phenotypic effect. No trigger sequences conferring significantly higher or lower suppression efficiencies were found, and the primary determinant of suppression effect was sequence length. The observed relationship of transcript suppression with the induced fatty acid phenotype indicates that RNAi is a saturation process and not a step change between suppressed and nonsuppressed states and intermediate suppression states can be achieved. © 2010 Monsanto. Plant Biotechnology Journal © 2010 Society for Experimental Biology and Blackwell Publishing Ltd.
Unraveling Core Functional Microbiota in Traditional Solid-State Fermentation by High-Throughput Amplicons and Metatranscriptomics Sequencing.

PubMed

Song, Zhewei; Du, Hai; Zhang, Yan; Xu, Yan

2017-01-01

Fermentation microbiota is specific microorganisms that generate different types of metabolites in many productions. In traditional solid-state fermentation, the structural composition and functional capacity of the core microbiota determine the quality and quantity of products. As a typical example of food fermentation, Chinese Maotai-flavor liquor production involves a complex of various microorganisms and a wide variety of metabolites. However, the microbial succession and functional shift of the core microbiota in this traditional food fermentation remain unclear. Here, high-throughput amplicons (16S rRNA gene amplicon sequencing and internal transcribed space amplicon sequencing) and metatranscriptomics sequencing technologies were combined to reveal the structure and function of the core microbiota in Chinese soy sauce aroma type liquor production. In addition, ultra-performance liquid chromatography and headspace-solid phase microextraction-gas chromatography-mass spectrometry were employed to provide qualitative and quantitative analysis of the major flavor metabolites. A total of 10 fungal and 11 bacterial genera were identified as the core microbiota. In addition, metatranscriptomic analysis revealed pyruvate metabolism in yeasts (genera Pichia, Schizosaccharomyces, Saccharomyces , and Zygosaccharomyces ) and lactic acid bacteria (genus Lactobacillus ) classified into two stages in the production of flavor components. Stage I involved high-level alcohol (ethanol) production, with the genus Schizosaccharomyces serving as the core functional microorganism. Stage II involved high-level acid (lactic acid and acetic acid) production, with the genus Lactobacillus serving as the core functional microorganism. The functional shift from the genus Schizosaccharomyces to the genus Lactobacillus drives flavor component conversion from alcohol (ethanol) to acid (lactic acid and acetic acid) in Chinese Maotai-flavor liquor production. Our findings provide insight into the effects of the core functional microbiota in soy sauce aroma type liquor production and the characteristics of the fermentation microbiota under different environmental conditions.
Sequence of the structural gene for granule-bound starch synthase of potato (Solanum tuberosum L.) and evidence for a single point deletion in the amf allele.

PubMed

van der Leij, F R; Visser, R G; Ponstein, A S; Jacobsen, E; Feenstra, W J

1991-08-01

The genomic sequence of the potato gene for starch granule-bound starch synthase (GBSS; "waxy protein") has been determined for the wild-type allele of a monoploid genotype from which an amylose-free (amf) mutant was derived, and for the mutant part of the amf allele. Comparison of the wild-type sequence with a cDNA sequence from the literature and a newly isolated cDNA revealed the presence of 13 introns, the first of which is located in the untranslated leader. The promoter contains a G-box-like sequence. The deduced amino acid sequence of the precursor of GBSS shows a high degree of identity with monocot waxy protein sequences in the region corresponding to the mature form of the enzyme. The transit peptide of 77 amino acids, required for routing of the precursor to the plastids, shows much less identity with the transit peptides of the other waxy preproteins, but resembles the hydropathic distributions of these peptides. Alignment of the amino acid sequences of the four mature starch synthases with the Escherichia coli glgA gene product revealed the presence of at least three conserved boxes; there is no homology with previously proposed starch-binding domains of other enzymes involved in starch metabolism. We report the use of chimeric constructs with wild-type and amf sequences to localize, via complementation experiments, the region of the amf allele in which the mutation resides. Direct sequencing of polymerase chain reaction products confirmed that the amf mutation is a deletion of a single AT basepair in the region coding for the transit peptide.(ABSTRACT TRUNCATED AT 250 WORDS)
A natural mutation-led truncation in one of the two aluminum-activated malate transporter-like genes at the Ma locus is associated with low fruit acidity in apple.

PubMed

Bai, Yang; Dougherty, Laura; Li, Mingjun; Fazio, Gennaro; Cheng, Lailiang; Xu, Kenong

2012-08-01

Acidity levels greatly affect the taste and flavor of fruit, and consequently its market value. In mature apple fruit, malic acid is the predominant organic acid. Several studies have confirmed that the major quantitative trait locus Ma largely controls the variation of fruit acidity levels. The Ma locus has recently been defined in a region of 150 kb that contains 44 predicted genes on chromosome 16 in the Golden Delicious genome. In this study, we identified two aluminum-activated malate transporter-like genes, designated Ma1 and Ma2, as strong candidates of Ma by narrowing down the Ma locus to 65-82 kb containing 12-19 predicted genes depending on the haplotypes. The Ma haplotypes were determined by sequencing two bacterial artificial chromosome clones from G.41 (an apple rootstock of genotype Mama) that cover the two distinct haplotypes at the Ma locus. Gene expression profiling in 18 apple germplasm accessions suggested that Ma1 is the major determinant at the Ma locus controlling fruit acidity as Ma1 is expressed at a much higher level than Ma2 and the Ma1 expression is significantly correlated with fruit titratable acidity (R (2) = 0.4543, P = 0.0021). In the coding sequences of low acidity alleles of Ma1 and Ma2, sequence variations at the amino acid level between Golden Delicious and G.41 were not detected. But the alleles for high acidity vary considerably between the two genotypes. The low acidity allele of Ma1, Ma1-1455A, is mainly characterized by a mutation at base 1455 in the open reading frame. The mutation leads to a premature stop codon that truncates the carboxyl terminus of Ma1-1455A by 84 amino acids compared with Ma1-1455G. A survey of 29 apple germplasm accessions using marker CAPS(1455) that targets the SNP(1455) in Ma1 showed that the CAPS(1455A) allele was associated completely with high pH and highly with low titratable acidity, suggesting that the natural mutation-led truncation is most likely responsible for the abolished function of Ma for low pH or high acidity in apple.
The wheat cytochrome oxidase subunit II gene has an intron insert and three radical amino acid changes relative to maize

PubMed Central

Bonen, Linda; Boer, Poppo H.; Gray, Michael W.

1984-01-01

We have determined the sequence of the wheat mitochondrial gene for cytochrome oxidase subunit II (COII) and find that its derived protein sequence differs from that of maize at only three amino acid positions. Unexpectedly, all three replacements are non-conservative ones. The wheat COII gene has a highly-conserved intron at the same position as in maize, but the wheat intron is 1.5 times longer because of an insert relative to its maize counterpart. Hybridization analysis of mitochondrial DNA from rye, pea, broad bean and cucumber indicates strong sequence conservation of COII coding sequences among all these higher plants. However, only rye and maize mitochondrial DNA show homology with wheat COII intron sequences and rye alone with intron-insert sequences. We find that a sequence identical to the region of the 5' exon corresponding to the transmembrane domain of the COII protein is present at a second genomic location in wheat mitochondria. These variations in COII gene structure and size, as well as the presence of repeated COII sequences, illustrate at the DNA sequence level, factors which contribute to higher plant mitochondrial DNA diversity and complexity. ImagesFig. 3.Fig. 4.Fig. 5. PMID:16453565
[Rubella virus genetic determinant of attenuation].

PubMed

Dmitriev, G V; Borisova, T K; Faizuloev, E B; Desiatskova, R G; Zverev, V V

2014-01-01

Vaccination is the most effective and available way to prevent Rubella. Presently, 9 vaccine strains were registered. Nevertheless, the molecular mechanisms of the attenuation were poorly elucidated for the rubella virus. However, the study of these mechanisms identifying genotypic and phenotypic markers of attenuation, which together with sequence analysis could be used for the genetic stability control of vaccine strains, is still of current interest. Common trends of genetic changes in the process of adaptation to cold were found due to comparison of nucleic acid and amino acid sequences of the Russian strain C-77 with corresponding positions of the known rubella virus strains and its wild type progenitors, if available.
A Unique (3+2) Annulation Reaction between Meldrum's Acid and Nitrones: Mechanistic Insight by ESI-IMS-MS and DFT Studies.

PubMed

Lespes, Nicolas; Pair, Etienne; Maganga, Clisy; Bretier, Marie; Tognetti, Vincent; Joubert, Laurent; Levacher, Vincent; Hubert-Roux, Marie; Afonso, Carlos; Loutelier-Bourhis, Corinne; Brière, Jean-François

2018-03-15

The fragile intermediates of the domino process leading to an isoxazolidin-5-one, triggered by unique reactivity between Meldrum's acid and an N-benzyl nitrone in the presence of a Brønsted base, were determined thanks to the softness and accuracy of electrospray ionization mass spectrometry coupled to ion mobility spectrometry (ESI-IMS-MS). The combined DFT study shed light on the overall organocatalytic sequence that starts with a stepwise (3+2) annulation reaction that is followed by a decarboxylative protonation sequence encompassing a stereoselective pathway issue. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Biochemical and Genetic Characterization of Coagulin, a New Antilisterial Bacteriocin in the Pediocin Family of Bacteriocins, Produced by Bacillus coagulans I4

PubMed Central

Le Marrec, Claire; Hyronimus, Bertrand; Bressollier, Philippe; Verneuil, Bernard; Urdaci, Maria C.

2000-01-01

A plasmid-linked antimicrobial peptide, named coagulin, produced by Bacillus coagulans I4 has recently been reported (B. Hyronimus, C. Le Marrec and M. C. Urdaci, J. Appl. Microbiol. 85:42–50, 1998). In the present study, the complete, unambiguous primary amino acid sequence of the peptide was obtained by a combination of both N-terminal sequencing of purified peptide and the complete sequence deduced from the structural gene harbored by plasmid I4. Data revealed that this peptide of 44 residues has an amino acid sequence similar to that described for pediocins AcH and PA-1, produced by different Pediococcus acidilactici strains and 100% identical. Coagulin and pediocin differed only by a single amino acid at their C terminus. Analysis of the genetic determinants revealed the presence, on the pI4 DNA, of the entire 3.5-kb operon of four genes described for pediocin AcH and PA-1 production. No extended homology was observed between pSMB74 from P. acidilactici and pI4 when analyzing the regions upstream and downstream of the operon. An oppositely oriented gene immediately dowstream of the bacteriocin operon specifies a 474-amino-acid protein which shows homology to Mob-Pre (plasmid recombination enzyme) proteins encoded by several small plasmids extracted from gram-positive bacteria. This is the first report of a pediocin-like peptide appearing naturally in a non-lactic acid bacterium genus. PMID:11097892
Identification of potential platelet alloantigens in the Equidae family by comparison of gene sequences encoding major platelet membrane glycoproteins.

PubMed

Boudreaux, Mary K; Humphries, Drew M

2013-12-01

Platelet alloantigens in horses may play an important role in the development of neonatal alloimmune thrombocytopenia (NAIT). The objective of this study was to evaluate genes encoding major platelet glycoproteins within the Equidae family in an effort to identify potential alloantigens. DNA was isolated from blood samples obtained from Equidae family members, including a Holsteiner-Oldenburg cross, a Quarter horse, a donkey, and a Plains zebra (Equus burchelli). Gene sequences encoding equine platelet membrane glycoproteins IIb, IIIa (integrin subunits αIIb and β3), Ia (integrin subunit α2), and Ibα were determined using PCR. Gene sequences were compared to the equine genome available on GenBank. Polymorphisms that would be predicted to result in amino acid changes on platelet surfaces were documented and compared with known alloantigenic sites documented on human platelets. Amino acid differences were predicted based on nucleotide sequences for all 4 genes. Nine differences were documented for αIIb, 5 differences were documented for β3, 7 differences were documented for α2, and 16 differences were documented for Ibα outside the macroglycopeptide region. This study represents the first effort at identifying potential platelet alloantigens in members of the Equidae Family based on evaluation of gene sequences. The data obtained form the groundwork for identifying potential platelet alloantigens involved in transfusion reactions and neonatal alloimmune thrombocytopenia (NAIT). More work is required to determine whether the predicted amino acid differences documented in this study play a role in alloimmunity, and whether other polymorphisms not detected in this study are present that may result in alloimmunity. © 2013 American Society for Veterinary Clinical Pathology.
Isolation and molecular characterization of partial FSH and LH receptor genes in Arabian camels (Camelus dromedarius)

PubMed Central

Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Bitaraf-Sani, Morteza

2015-01-01

Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene). We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus. Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research (FSHR and LHR) as well as reproduction-linked polymorphisms and breeding programs. PMID:27844002
Isolation and molecular characterization of partial FSH and LH receptor genes in Arabian camels (Camelus dromedarius).

PubMed

Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Bitaraf-Sani, Morteza

2015-06-01

Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene). We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus . Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research ( FSHR and LHR ) as well as reproduction-linked polymorphisms and breeding programs.
Detection of nucleic acid sequences by invader-directed cleavage

DOEpatents

Brow, Mary Ann D.; Hall, Jeff Steven Grotelueschen; Lyamichev, Victor; Olive, David Michael; Prudent, James Robert

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The 5' nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based by charge.
Complete nucleotide sequences of the coat protein messenger RNAs of brome mosaic virus and cowpea chlorotic mottle virus.

PubMed Central

Dasgupta, R; Kaesberg, P

1982-01-01

The nucleotide sequences of the subgenomic coat protein messengers (RNA4's) of two related bromoviruses, brome mosaic virus (BMV) and cowpea chlorotic mottle virus (CCMV), have been determined by direct RNA and CDNA sequencing without cloning. BMV RNA4 is 876 b long including a 5' noncoding region of nine nucleotides and a 3' noncoding region of 300 nucleotides. CCMV RNA 4 is 824 b long, including a 5' noncoding region of 10 nucleotides and a 3' noncoding region of 244 nucleotides. The encoded coat proteins are similar in length (188 amino acids for BMV and 189 amino acids for CCMV) and display about 70% homology in their amino acid sequences. Length difference between the two RNAs is due mostly to a single deletion, in CCMV with respect to BMV, of about 57 b immediately following the coding region. Allowing for this deletion the RNAs are indicate that mutations leading to divergence were constrained in the coding region primarily by the requirement of maintaining a favorable coat protein structure and in the 3' noncoding region primarily by the requirement of maintaining a favorable RNA spatial configuration. PMID:6895941
Studying the evolutionary relationships and phylogenetic trees of 21 groups of tRNA sequences based on complex networks.

PubMed

Wei, Fangping; Chen, Bowen

2012-03-01

To find out the evolutionary relationships among different tRNA sequences of 21 amino acids, 22 networks are constructed. One is constructed from whole tRNAs, and the other 21 networks are constructed from the tRNAs which carry the same amino acids. A new method is proposed such that the alignment scores of any two amino acids groups are determined by the average degree and the average clustering coefficient of their networks. The anticodon feature of isolated tRNA and the phylogenetic trees of 21 group networks are discussed. We find that some isolated tRNA sequences in 21 networks still connect with other tRNAs outside their group, which reflects the fact that those tRNAs might evolve by intercrossing among these 21 groups. We also find that most anticodons among the same cluster are only one base different in the same sites when S ≥ 70, and they stay in the same rank in the ladder of evolutionary relationships. Those observations seem to agree on that some tRNAs might mutate from the same ancestor sequences based on point mutation mechanisms.
Organization of the hao gene cluster of Nitrosomonas europaea: genes for two tetraheme c cytochromes.

PubMed

Bergmann, D J; Arciero, D M; Hooper, A B

1994-06-01

The organization of genes for three proteins involved in ammonia oxidation in Nitrosomonas europaea has been investigated. The amino acid sequence of the N-terminal region and four heme-containing peptides produced by proteolysis of the tetraheme cytochrome c554 of N. europaea were determined by Edman degradation. The gene (cycA) encoding this cytochrome is present in three copies per genome (H. McTavish, F. LaQuier, D. Arciero, M. Logan, G. Mundfrom, J.A. Fuchs, and A. B. Hooper, J. Bacteriol. 175:2445-2447, 1993). Three clones, representing at least two copies of cycA, were isolated and sequenced by the dideoxy-chain termination procedure. In both copies, the sequences of 211 amino acids derived from the gene sequence are identical and include all amino acids predicted by the proteolytic peptides. In two copies, the cycA open reading frame (ORF) is followed closely (three bases in one copy) by a second ORF predicted to encode a 28-kDa tetraheme c cytochrome not previously characterized but similar to the nirT gene product of Pseudomonas stutzeri. In one copy of the cycA gene cluster, the second ORF is absent.
Proteins without unique 3D structures: biotechnological applications of intrinsically unstable/disordered proteins.

PubMed

Uversky, Vladimir N

2015-03-01

Intrinsically disordered proteins (IDPs) and intrinsically disordered protein regions (IDPRs) are functional proteins or regions that do not have unique 3D structures under functional conditions. Therefore, from the viewpoint of their lack of stable 3D structure, IDPs/IDPRs are inherently unstable. As much as structure and function of normal ordered globular proteins are determined by their amino acid sequences, the lack of unique 3D structure in IDPs/IDPRs and their disorder-based functionality are also encoded in the amino acid sequences. Because of their specific sequence features and distinctive conformational behavior, these intrinsically unstable proteins or regions have several applications in biotechnology. This review introduces some of the most characteristic features of IDPs/IDPRs (such as peculiarities of amino acid sequences of these proteins and regions, their major structural features, and peculiar responses to changes in their environment) and describes how these features can be used in the biotechnology, for example for the proteome-wide analysis of the abundance of extended IDPs, for recombinant protein isolation and purification, as polypeptide nanoparticles for drug delivery, as solubilization tools, and as thermally sensitive carriers of active peptides and proteins. Copyright © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Whole-Genome Sequence Analysis of Bombella intestini LMG 28161T, a Novel Acetic Acid Bacterium Isolated from the Crop of a Red-Tailed Bumble Bee, Bombus lapidarius.

PubMed

Li, Leilei; Illeghems, Koen; Van Kerrebroeck, Simon; Borremans, Wim; Cleenwerck, Ilse; Smagghe, Guy; De Vuyst, Luc; Vandamme, Peter

2016-01-01

The whole-genome sequence of Bombella intestini LMG 28161T, an endosymbiotic acetic acid bacterium (AAB) occurring in bumble bees, was determined to investigate the molecular mechanisms underlying its metabolic capabilities. The draft genome sequence of B. intestini LMG 28161T was 2.02 Mb. Metabolic carbohydrate pathways were in agreement with the metabolite analyses of fermentation experiments and revealed its oxidative capacity towards sucrose, D-glucose, D-fructose and D-mannitol, but not ethanol and glycerol. The results of the fermentation experiments also demonstrated that the lack of effective aeration in small-scale carbohydrate consumption experiments may be responsible for the lack of reproducibility of such results in taxonomic studies of AAB. Finally, compared to the genome sequences of its nearest phylogenetic neighbor and of three other insect associated AAB strains, the B. intestini LMG 28161T genome lost 69 orthologs and included 89 unique genes. Although many of the latter were hypothetical they also included several type IV secretion system proteins, amino acid transporter/permeases and membrane proteins which might play a role in the interaction with the bumble bee host.

Mapping of the linear antigenic determinants from the Leishmania infantum histone H2A recognized by sera from dogs with leishmaniasis.

PubMed

Soto, M; Requena, J M; Quijada, L; García, M; Guzman, F; Patarroyo, M E; Alonso, C

1995-12-01

Antibodies reacting against the H2A histone protein were frequently observed in the sera from dogs naturally infected with the protozoan parasite Leishmania infantum. Using synthetic peptides covering the complete sequence of the protein we have identified the amino terminal region, comprising from amino acids 1 to 20, and the carboxyl terminal region, comprising from amino acids 106 to 132, as conforming the antigenic determinants of the protein. Those regions, exposed in the nucleosome surface, are highly divergent in sequence relative to the mammalian H2A histones. The anti-H2A histone antibodies present in the sera of these dogs specifically recognize the L. infantum H2A histone and they do not react with mammalian histones. The present data indicate that, in spite of the evolutionary conservation of the H2A histone protein among eukaryotic organisms, the humoral response against this protein during natural infection is specifically triggered by the parasite protein antigenic determinants.
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2011 CFR

2011-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2013 CFR

2013-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2012 CFR

2012-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2010 CFR

2010-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

Code of Federal Regulations, 2014 CFR

2014-07-01

... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...
The central domain of bovine submaxillary mucin consists of over 50 tandem repeats of 329 amino acids. Chromosomal localization of the BSM1 gene and relations to ovine and porcine counterparts.

PubMed

Jiang, W; Gupta, D; Gallagher, D; Davis, S; Bhavanandan, V P

2000-04-01

We previously elucidated five distinct protein domains (I-V) for bovine submaxillary mucin, which is encoded by two genes, BSM1 and BSM2. Using Southern blot analysis, genomic cloning and sequencing of the BSM1 gene, we now show that the central domain (V) consists of approximately 55 tandem repeats of 329 amino acids and that domains III-V are encoded by a 58.4-kb exon, the largest exon known for all genes to date. The BSM1 gene was mapped by fluorescence in situ hybridization to the proximal half of chromosome 5 at bands q2. 2-q2.3. The amino-acid sequence of six tandem repeats (two full and four partial) were found to have only 92-94% identities. We propose that the variability in the amino-acid sequences of the mucin tandem repeat is important for generating the combinatorial library of saccharides that are necessary for the protective function of mucins. The deduced peptide sequences of the central domain match those determined from the purified bovine submaxillary mucin and also show 68-94% identity to published peptide sequences of ovine submaxillary mucin. This indicates that the core protein of ovine submaxillary mucin is closely related to that of bovine submaxillary mucin and contains similar tandem repeats in the central domain. In contrast, the central domain of porcine submaxillary mucin is reported to consist of 81-amino-acid tandem repeats. However, both bovine submaxillary mucin and porcine submaxillary mucin contain similar N-terminal and C-terminal domains and the corresponding genes are in the conserved linkage regions of the respective genomes.
Cloning and functional characterization of SAD genes in potato.

PubMed

Li, Fei; Bian, Chun Song; Xu, Jian Fei; Pang, Wan Fu; Liu, Jie; Duan, Shao Guang; Lei, Zun-Guo; Jiwan, Palta; Jin, Li-Ping

2015-01-01

Stearoyl-acyl carrier protein desaturase (SAD), locating in the plastid stroma, is an important fatty acid biosynthetic enzyme in higher plants. SAD catalyzes desaturation of stearoyl-ACP to oleyl-ACP and plays a key role in determining the homeostasis between saturated fatty acids and unsaturated fatty acids, which is an important player in cold acclimation in plants. Here, four new full-length cDNA of SADs (ScoSAD, SaSAD, ScaSAD and StSAD) were cloned from four Solanum species, Solanum commersonii, S. acaule, S. cardiophyllum and S. tuberosum, respectively. The ORF of the four SADs were 1182 bp in length, encoding 393 amino acids. A sequence alignment indicated 13 amino acids varied among the SADs of three wild species. Further analysis showed that the freezing tolerance and cold acclimation capacity of S. commersonii are similar to S. acaule and their SAD amino acid sequences were identical but differed from that of S. cardiophyllum, which is sensitive to freezing. Furthermore, the sequence alignments between StSAD and ScoSAD indicated that only 7 different amino acids at residues were found in SAD of S. tuberosum (Zhongshu8) against the protein sequence of ScoSAD. A phylogenetic analysis showed the three wild potato species had the closest genetic relationship with the SAD of S. lycopersicum and Nicotiana tomentosiformis but not S. tuberosum. The SAD gene from S. commersonii (ScoSAD) was cloned into multiple sites of the pBI121 plant binary vector and transformed into the cultivated potato variety Zhongshu 8. A freeze tolerance analysis showed overexpression of the ScoSAD gene in transgenic plants significantly enhanced freeze tolerance in cv. Zhongshu 8 and increased their linoleic acid content, suggesting that linoleic acid likely plays a key role in improving freeze tolerance in potato plants. This study provided some new insights into how SAD regulates in the freezing tolerance and cold acclimation in potato.
Amino acid sequence of the smaller basic protein from rat brain myelin

PubMed Central

Dunkley, Peter R.; Carnegie, Patrick R.

1974-01-01

1. The complete amino acid sequence of the smaller basic protein from rat brain myelin was determined. This protein differs from myelin basic proteins of other species in having a deletion of a polypeptide of 40 amino acid residues from the centre of the molecule. 2. A detailed comparison is made of the constant and variable regions in a group of myelin basic proteins from six species. 3. An arginine residue in the rat protein was found to be partially methylated. The ratio of methylated to unmethylated arginine at this position differed from that found for the human basic protein. 4. Three tryptic peptides were isolated in more than one form. The differences between the two forms of each peptide are discussed in relation to the electrophoretic heterogeneity of myelin basic proteins, which is known to occur at alkaline pH values. 5. Detailed evidence for the amino acid sequence of the protein has been deposited as Supplementary Publication SUP 50029 at the British Library (Lending Division) (formerly the National Lending Library for Science and Technology), Boston Spa, Yorks. LS23 7BQ, U.K., from whom copies may be obtained on the terms given in Biochem. J. (1973) 131, 5. PMID:4141893
Pyrin gene and mutants thereof, which cause familial Mediterranean fever

DOEpatents

Kastner, Daniel L [Bethesda, MD; Aksentijevichh, Ivona [Bethesda, MD; Centola, Michael [Tacoma Park, MD; Deng, Zuoming [Gaithersburg, MD; Sood, Ramen [Rockville, MD; Collins, Francis S [Rockville, MD; Blake, Trevor [Laytonsville, MD; Liu, P Paul [Ellicott City, MD; Fischel-Ghodsian, Nathan [Los Angeles, CA; Gumucio, Deborah L [Ann Arbor, MI; Richards, Robert I [North Adelaide, AU; Ricke, Darrell O [San Diego, CA; Doggett, Norman A [Santa Cruz, NM; Pras, Mordechai [Tel-Hashomer, IL

2003-09-30

The invention provides the nucleic acid sequence encoding the protein associated with familial Mediterranean fever (FMF). The cDNA sequence is designated as MEFV. The invention is also directed towards fragments of the DNA sequence, as well as the corresponding sequence for the RNA transcript and fragments thereof. Another aspect of the invention provides the amino acid sequence for a protein (pyrin) associated with FMF. The invention is directed towards both the full length amino acid sequence, fusion proteins containing the amino acid sequence and fragments thereof. The invention is also directed towards mutants of the nucleic acid and amino acid sequences associated with FMF. In particular, the invention discloses three missense mutations, clustered in within about 40 to 50 amino acids, in the highly conserved rfp (B30.2) domain at the C-terminal of the protein. These mutants include M6801, M694V, K695R, and V726A. Additionally, the invention includes methods for diagnosing a patient at risk for having FMF and kits therefor.
Nucleotide Sequence and Genetic Structure of a Novel Carbaryl Hydrolase Gene (cehA) from Rhizobium sp. Strain AC100

PubMed Central

Hashimoto, Masayuki; Fukui, Mitsuru; Hayano, Kouichi; Hayatsu, Masahito

2002-01-01

Rhizobium sp. strain AC100, which is capable of degrading carbaryl (1-naphthyl-N-methylcarbamate), was isolated from soil treated with carbaryl. This bacterium hydrolyzed carbaryl to 1-naphthol and methylamine. Carbaryl hydrolase from the strain was purified to homogeneity, and its N-terminal sequence, molecular mass (82 kDa), and enzymatic properties were determined. The purified enzyme hydrolyzed 1-naphthyl acetate and 4-nitrophenyl acetate indicating that the enzyme is an esterase. We then cloned the carbaryl hydrolase gene (cehA) from the plasmid DNA of the strain and determined the nucleotide sequence of the 10-kb region containing cehA. No homologous sequences were found by a database homology search using the nucleotide and deduced amino acid sequences of the cehA gene. Six open reading frames including the cehA gene were found in the 10-kb region, and sequencing analysis shows that the cehA gene is flanked by two copies of insertion sequence-like sequence, suggesting that it makes part of a composite transposon. PMID:11872471
Analysis of Endogenous D-Amino Acid-Containing Peptides in Metazoa

PubMed Central

Bai, Lu; Sheeley, Sarah; Sweedler, Jonathan V.

2010-01-01

Peptides are chiral molecules with their structure determined by the composition and configuration of their amino acid building blocks. The naturally occurring amino acids, except glycine, possess two chiral forms. This allows the formation of multiple peptide diastereomers that have the same sequence. Although living organisms use L-amino acids to make proteins, a group of D-amino acid-containing peptides (DAACPs) has been discovered in animals that have at least one of their residues isomerized to the D-form via an enzyme-catalyzed process. In many cases, the biological functions of these peptides are enhanced due to this structural conversion. These DAACPs are different from those known to occur in bacterial cell wall and antibiotic peptides, the latter of which are synthesized in a ribosome-independent manner. DAACPs have now also been identified in a number of distinct groups throughout the Metazoa. Their serendipitous discovery has often resulted from discrepancies observed in bioassays or in chromatographic behavior between natural peptide fractions and peptides synthesized according to a presumed all-L sequence. Because this L-to-D post-translational modification is subtle and not detectable by most sequence determination approaches, it is reasonable to suspect that many studies have overlooked this change; accordingly, DAACPs may be more prevalent than currently thought. Although diastereomer separation techniques developed with synthetic peptides in recent years have greatly aided in the discovery of natural DAACPs, there is a need for new, more robust methods for naturally complex samples. In this review, a brief history of DAACPs in animals is presented, followed by discussion of a variety of analytical methods that have been used for diastereomeric separation and detection of peptides. PMID:20490347
Summary of International Exhibition and Congress (3rd): BIOTECHNICA 󈨛 Hannover Held in Hannover (Germany, F.R.) on 22-24 September 1987

DTIC Science & Technology

1988-01-21

nucleic acids which occur in DNA and seem to play an e Improved theoretical analysis of the important role in determining gene reg’- fntra- and...developed two retroviral vectors, based on the murine new peptide-based animal vaccines which myeloproliferative sarcoma virus (MPSV), are currertly...Structure tides are part of a precursor molecule elucidation is performed by gas-phase composed of 126 amino acids. From a pre- amino acid sequence analysis
Mutations in the E2 and NS5A regions in patients infected with hepatitis C virus genotype 1a and their correlation with response to treatment.

PubMed

Yahoo, Neda; Sabahi, Farzaneh; Shahzamani, Kiana; Malboobi, Mohamad Ali; Jabbari, Hossain; Sharifi, Houshang; Mousavi-Fard, Seyed Hossein; Merat, Shahin

2011-08-01

Heterogeneity of subgenomic regions of hepatitis C virus (HCV) may be associated with response to interferon (IFN) therapy. The amino acid sequences of the PKR/eIF-2α phosphorylation homology domain (pePHD), IFN sensitivity determining region (ISDR), PKR binding domain (PKRBD), and variable region 3 (V3) were studied in 19 patients before and after 4 weeks of treatment. All patients were infected with HCV genotype 1a and were treated with pegylated-IFN and ribavirin. Thirteen patients achieved sustained viral response (responders) and six failed to clear viral RNA (nonresponders). The amino acid sequences in the pePHD and ISDR were identical in responders and nonresponders. However, amino acid substitution at position 2252 of PKRBD was significantly different between responders and nonresponders (P = 0.044). A larger number of mutations were observed in the V3 region of responders (P < 0.001). In this region, the amino acid in position 2364 differed between responders and nonresponders (responders: aspartic acid and serine, nonresponders: asparagine, P = 0.018). The amino acid sequences in the regions which were studied did not change after 4 weeks of treatment. It is concluded that the presence of specific amino acids in position 2252 of PKRBD and position 2364 of V3 might be associated with clinical response to IFN. Copyright © 2011 Wiley-Liss, Inc.
Complete cDNA sequence of SAP-like pentraxin from Limulus polyphemus: implications for pentraxin evolution.

PubMed

Tharia, Hazel A; Shrive, Annette K; Mills, John D; Arme, Chris; Williams, Gwyn T; Greenhough, Trevor J

2002-02-22

The serum amyloid P component (SAP)-like pentraxin Limulus polyphemus SAP is a recently discovered, distinct pentraxin species, of known structure, which does not bind phosphocholine and whose N-terminal sequence has been shown to differ markedly from the highly conserved N terminus of all other known horseshoe crab pentraxins. The complete cDNA sequence of Limulus SAP, and the derived amino acid sequence, the first invertebrate SAP-like pentraxin sequence, have been determined. Two sequences were identified that differed only in the length of the 3' untranslated region. Limulus SAP is synthesised as a precursor protein of 234 amino acid residues, the first 17 residues encoding a signal peptide that is absent from the mature protein. Phylogenetic analysis clusters Limulus SAP pentraxin with the horseshoe crab C-reactive proteins (CRPs) rather than the mammalian SAPs, which are clustered with mammalian CRPs. The deduced amino acid sequence shares 22% identity with both human SAP and CRP, which are 51% identical, and 31-35% with horseshoe crab CRPs. These analyses indicate that gene duplication of CRP (or SAP), followed by sequence divergence and the evolution of CRP and/or SAP function, occurred independently along the chordate and arthropod evolutionary lines rather than in a common ancestor. They further indicate that the CRP/SAP gene duplication event in Limulus occurred before both the emergence of the Limulus CRP variants and the mammalian CRP/SAP gene duplication. Limulus SAP, which does not exhibit the CRP characteristic of calcium-dependent binding to phosphocholine, is established as a pentraxin species distinct from all other known horseshoe crab pentraxins that exist in many variant forms sharing a high level of sequence homology. Copyright 2002 Elsevier Science Ltd.
Sequence diversity within the reovirus S2 gene: reovirus genes reassort in nature, and their termini are predicted to form a panhandle motif.

PubMed Central

Chapell, J D; Goral, M I; Rodgers, S E; dePamphilis, C W; Dermody, T S

1994-01-01

To better understand genetic diversity within mammalian reoviruses, we determined S2 nucleotide and deduced sigma 2 amino acid sequences of nine reovirus strains and compared these sequences with those of prototype strains of the three reovirus serotypes. The S2 gene and sigma 2 protein are highly conserved among the four type 1, one type 2, and seven type 3 strains studied. Phylogenetic analyses based on S2 nucleotide sequences of the 12 reovirus strains indicate that diversity within the S2 gene is independent of viral serotype. Additionally, we found marked topological differences between phylogenetic trees generated from S1 and S2 gene nucleotide sequences of the seven type 3 strains. These results demonstrate that reovirus S1 and S2 genes have distinct evolutionary histories, thus providing phylogenetic evidence for lateral transfer of reovirus genes in nature. When variability among the 12 sigma 2-encoding S2 nucleotide sequences was analyzed at synonymous positions, we found that approximately 60 nucleotides at the 5' terminus and 30 nucleotides at the 3' terminus were markedly conserved in comparison with other sigma 2-encoding regions of S2. Predictions of RNA secondary structures indicate that the more conserved S2 sequences participate in the formation of an extended region of duplex RNA interrupted by a pair of stem-loops. Among the 12 deduced sigma 2 amino acid sequences examined, substitutions were observed at only 11% of amino acid positions. This finding suggests that constraints on the structure or function of sigma 2, perhaps in part because of its location in the virion core, have limited sequence diversity within this protein. PMID:8289378
77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-10-29

... DEPARTMENT OF COMMERCE Patent and Trademark Office Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request... Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of...
Characterization of the Triticum Mosaic Virus Genome and Interactions between Triticum Mosaic Virus and Wheat Streak Mosaic Virus

USDA-ARS?s Scientific Manuscript database

The complete genome sequence of Triticum mosaic virus (TriMV) has been determined to be 10,266 nucleotides encoding a large polyprotein of 3,112 amino acids. The proteins of TriMV possess only 33-44% (with NIb protein) and 15-29% (with P1 protein) amino acid identity with the reported members of Pot...
Cloning and molecular characterization of the betaine aldehyde dehydrogenase involved in the biosynthesis of glycine betaine in white shrimp (Litopenaeus vannamei).

PubMed

Delgado-Gaytán, María F; Rosas-Rodríguez, Jesús A; Yepiz-Plascencia, Gloria; Figueroa-Soto, Ciria G; Valenzuela-Soto, Elisa M

2017-10-01

The enzyme betaine aldehyde dehydrogenase (BADH) catalyzes the irreversible oxidation of betaine aldehyde to glycine betaine (GB), a very efficient osmolyte accumulated during osmotic stress. In this study, we determined the nucleotide sequence of the cDNA for the BADH from the white shrimp Litopenaeus vannamei (LvBADH). The cDNA was 1882 bp long, with a complete open reading frame of 1524 bp, encoding 507 amino acids with a predicted molecular mass of 54.15 kDa and a pI of 5.4. The predicted LvBADH amino acid sequence shares a high degree of identity with marine invertebrate BADHs. Catalytic residues (C-298, E-264 and N-167) and the decapeptide VTLELGGKSP involved in nucleotide binding and highly conserved in BADHs were identified in the amino acid sequence. Phylogenetic analyses classified LvBADH in a clade that includes ALDH9 sequences from marine invertebrates. Molecular modeling of LvBADH revealed that the protein has amino acid residues and sequence motifs essential for the function of the ALDH9 family of enzymes. LvBADH modeling showed three potential monovalent cation binding sites, one site is located in an intra-subunit cavity; other in an inter-subunit cavity and a third in a central-cavity of the protein. The results show that LvBADH shares a high degree of identity with BADH sequences from marine invertebrates and enzymes that belong to the ALDH9 family. Our findings suggest that the LvBADH has molecular mechanisms of regulation similar to those of other BADHs belonging to the ALDH9 family, and that BADH might be playing a role in the osmoregulation capacity of L. vannamei. Copyright © 2017 Elsevier B.V. All rights reserved.
Subset of Kappa and Lambda Germline Sequences Result in Light Chains with a Higher Molecular Mass Phenotype.

PubMed

Barnidge, David R; Lundström, Susanna L; Zhang, Bo; Dasari, Surendra; Murray, David L; Zubarev, Roman A

2015-12-04

In our previous work, we showed that electrospray ionization of intact polyclonal kappa and lambda light chains isolated from normal serum generates two distinct, Gaussian-shaped, molecular mass distributions representing the light-chain repertoire. During the analysis of a large (>100) patient sample set, we noticed a low-intensity molecular mass distribution with a mean of approximately 24 250 Da, roughly 800 Da higher than the mean of the typical kappa molecular-mass distribution mean of 23 450 Da. We also observed distinct clones in this region that did not appear to contain any typical post-translational modifications that would account for such a large mass shift. To determine the origin of the high molecular mass clones, we performed de novo bottom-up mass spectrometry on a purified IgM monoclonal light chain that had a calculated molecular mass of 24 275.03 Da. The entire sequence of the monoclonal light chain was determined using multienzyme digestion and de novo sequence-alignment software and was found to belong to the germline allele IGKV2-30. The alignment of kappa germline sequences revealed ten IGKV2 and one IGKV4 sequences that contained additional amino acids in their CDR1 region, creating the high-molecular-mass phenotype. We also performed an alignment of lambda germline sequences, which showed additional amino acids in the CDR2 region, and the FR3 region of functional germline sequences that result in a high-molecular-mass phenotype. The work presented here illustrates the ability of mass spectrometry to provide information on the diversity of light-chain molecular mass phenotypes in circulation, which reflects the germline sequences selected by the immunoglobulin-secreting B-cell population.

Prevalence, distribution, and sequence diversity of hmwA among commensal and otitis media non-typeable Haemophilus influenzae.

PubMed

Davis, Gregg S; Patel, May; Hammond, James; Zhang, Lixin; Dawid, Suzanne; Marrs, Carl F; Gilsdorf, Janet R

2014-12-01

Nontypeable Haemophilus influenzae (NTHi) are Gram-negative coccobacilli that colonize the human pharynx, their only known natural reservoir. Adherence to the host epithelium facilitates NTHi colonization and marks one of the first steps in NTHi pathogenesis. Epithelial cell attachment is mediated, in part, by a pair of high molecular weight (HMW) adhesins that are highly immunogenic, antigenically diverse, and display a wide range of amino acid diversity both within and between isolates. In this study, the prevalence of hmwA, which encodes the HMW adhesin, was determined for a collection of 170 NTHi isolates recovered from the middle ears of children with otitis media (OM isolates) or throats or nasopharynges of healthy children (commensal isolates) from Finland, Israel, and the U.S. Overall, hmwA was detected in 61% of NTHi isolates and was significantly more prevalent (P=0.004) among OM isolates than among commensal isolates; the prevalence ratio comparing hmwA prevalence among ear isolates with that of commensal isolates was 1.47 (95% CI (1.12, 1.92)). Ninety-five percent (98/103) of the hmwA-positive NTHi isolates possessed two hmw loci. To advance our understanding of hmwA binding sequence diversity, we determined the DNA sequence of the hmwA binding region of 33 isolates from this collection. The average amino acid identity across all hmwA sequences was 62%. Phylogenetic analyses of the hmwA binding revealed four distinct sequence clusters, and the majority of hmwA sequences (83%) belonged to one of two dominant sequence clusters. hmwA sequences did not cluster by chromosomal location, geographic region, or disease status. Copyright © 2014 Elsevier B.V. All rights reserved.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor L.; Brow, Mary Ann D.; Dahlberg, James E.

2007-12-11

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Invasive cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

2002-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow; Mary Ann D.; Dahlberg, James E.

2010-11-09

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Cleavage of nucleic acids

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann D.; Dahlberg, James E.

2000-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Nucleic acid detection assays

DOEpatents

Prudent, James R.; Hall, Jeff G.; Lyamichev, Victor I.; Brow, Mary Ann; Dahlberg, James E.

2005-04-05

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof.
Detection of diastereomer peptides as the intermediates generating D-amino acids during acid hydrolysis of peptides.

PubMed

Miyamoto, Tetsuya; Sekine, Masae; Ogawa, Tetsuhiro; Hidaka, Makoto; Watanabe, Hidenori; Homma, Hiroshi; Masaki, Haruhiko

2016-11-01

In this study, we investigated whether the amino acid residues within peptides were isomerized (and the peptides converted to diastereomers) during the early stages of acid hydrolysis. We demonstrate that the model dipeptides L-Ala-L-Phe and L-Phe-L-Ala are epimerized to produce the corresponding diastereomers at a very early stage, prior to their acid hydrolytic cleavage to amino acids. Furthermore, the sequence-inverted dipeptides were generated via formation of a diketopiperazine during hydrolytic incubation, and these dipeptides were also epimerized. The proportion of diastereomers increased rapidly during incubation for 0.5-2 h. During acid hydrolysis, C-terminal residues of the model dipeptides were isomerized faster than N-terminal residues, consistent with the observation that the D-amino acid values of the C-terminal residues determined by the 0 h-extrapolating method were larger than those of the N-terminal residues. Thus, the artificial D-amino acid contents determined by the 0 h-extrapolating method appear to be products of the isomerization of amino acid residues during acid hydrolysis.
Characterization of a tandemly repeated DNA sequence family originally derived by retroposition of tRNA(Glu) in the newt.

PubMed

Nagahashi, S; Endoh, H; Suzuki, Y; Okada, N

1991-11-20

A previous report from this laboratory showed that in vitro transcription of total genomic DNA of the newt Cynopus pyrrhogaster resulted in a discrete sized 8 S RNA, which represented highly repetitive and transcribable sequences with a glutamic acid tRNA-like structure in the newt genome. We isolated four independent clones from a newt genomic library and determined the complete sequences of three 2000 to 2400 base-pair PstI fragments spanning the 8 S RNA gene. The glutamic acid tRNA-related segment in the 8 S RNA gene contains the CCA sequence expected as the 3' terminus of a tRNA molecule. Further, the 11 nucleotides located 13 nucleotides upstream from one of the two transcription initiation sites of the 8 S RNA were found to be repeated in the region upstream from the termination site, suggesting that the original unit, which is shorter than the 8 S RNA, was retrotransposed via cDNA intermediates from the PolIII transcript. In the upstream region of the 8 S RNA gene, a 360 nucleotide unit containing the glutamic acid tRNA-related segment was found to be duplicated (clones NE1 and NE10) or triplicated (clone NE3). Except for the difference in the number of the 360 nucleotide unit, the three sequences of the 2000 to 2400 base-pair PstI fragment were essentially the same with only a few mutations and minor deletions. Inverse polymerase chain reaction and sequence determination of the products, together with a Southern hybridization experiment, demonstrated that the family consists of a tandemly repeated unit of 3300, 3700 or 4100 base-pairs. Thus during evolution, this family in the newt was created by retroposition via cDNA intermediates, followed by duplication or triplication of the 360 nucleotide unit and multiplication of the 3300 to 4100 base-pair region at the DNA level.
Characterization of the complete mitochondrial genome of Marshallagia marshalli and phylogenetic implications for the superfamily Trichostrongyloidea.

PubMed

Sun, Miao-Miao; Han, Liang; Zhang, Fu-Kai; Zhou, Dong-Hui; Wang, Shu-Qing; Ma, Jun; Zhu, Xing-Quan; Liu, Guo-Hua

2018-01-01

Marshallagia marshalli (Nematoda: Trichostrongylidae) infection can lead to serious parasitic gastroenteritis in sheep, goat, and wild ruminant, causing significant socioeconomic losses worldwide. Up to now, the study concerning the molecular biology of M. marshalli is limited. Herein, we sequenced the complete mitochondrial (mt) genome of M. marshalli and examined its phylogenetic relationship with selected members of the superfamily Trichostrongyloidea using Bayesian inference (BI) based on concatenated mt amino acid sequence datasets. The complete mt genome sequence of M. marshalli is 13,891 bp, including 12 protein-coding genes, 22 transfer RNA genes, and 2 ribosomal RNA genes. All protein-coding genes are transcribed in the same direction. Phylogenetic analyses based on concatenated amino acid sequences of the 12 protein-coding genes supported the monophylies of the families Haemonchidae, Molineidae, and Dictyocaulidae with strong statistical support, but rejected the monophyly of the family Trichostrongylidae. The determination of the complete mt genome sequence of M. marshalli provides novel genetic markers for studying the systematics, population genetics, and molecular epidemiology of M. marshalli and its congeners.
The complete DNA sequence of lymphocystis disease virus.

PubMed

Tidona, C A; Darai, G

1997-04-14

Lymphocystis disease virus (LCDV) is the causative agent of lymphocystis disease, which has been reported to occur in over 100 different fish species worldwide. LCDV is a member of the family Iridoviridae and the type species of the genus Lymphocystivirus. The virions contain a single linear double-stranded DNA molecule, which is circularly permuted, terminally redundant, and heavily methylated at cytosines in CpG sequences. The complete nucleotide sequence of LCDV-1 (flounder isolate) was determined by automated cycle sequencing and primer walking. The genome of LCDV-1 is 102.653 bp in length and contains 195 open reading frames with coding capacities ranging from 40 to 1199 amino acids. Computer-assisted analyses of the deduced amino acid sequences led to the identification of several putative gene products with significant homologies to entries in protein data banks, such as the two major subunits of the viral DNA-dependent RNA polymerase, DNA polymerase, several protein kinases, two subunits of the ribonucleoside diphosphate reductase, DNA methyltransferase, the viral major capsid protein, insulin-like growth factor, and tumor necrosis factor receptor homolog.
Sequences in Influenza A Virus PB2 Protein That Determine Productive Infection for an Avian Influenza Virus in Mouse and Human Cell Lines

PubMed Central

Yao, Yongxiu; Mingay, Louise J.; McCauley, John W.; Barclay, Wendy S.

2001-01-01

Reverse genetics was used to analyze the host range of two avian influenza viruses which differ in their ability to replicate in mouse and human cells in culture. Engineered viruses carrying sequences encoding amino acids 362 to 581 of PB2 from a host range variant productively infect mouse and human cells. PMID:11333926
CHARACTERIZATION AND NUCLEOTIDE SEQUENCE DETERMINATION OF A REPEAT ELEMENT ISOLATED FROM A 2,4,5,-T DEGRADING STRAIN OF PSEUDOMONAS CEPACIA

EPA Science Inventory

Pseudomonas cepacia strain AC1100, capable of growth on 2,4,5-trichlorophenoxyacetic acid (2,4,5-T), was mutated to the 2,4,5-T− strain PT88 by a ColE1 :: Tn5 chromosomal insertion. Using cloned DNA from the region flanking the insertion, a 1477-bp sequence (designated RS1100) wa...
An improved procedure, involving mass spectrometry, for N-terminal amino acid sequence determination of proteins which are N alpha-blocked.

PubMed Central

Rose, K; Kocher, H P; Blumberg, B M; Kolakofsky, D

1984-01-01

A modification to a previously described procedure [Gray & del Valle (1970) Biochemistry 9, 2134-2137; Rose, Simona & Offord (1983) Biochem. J. 215, 261-272] for mass-spectral identification of the N-terminal regions of proteins is shown to be useful in cases where the N-terminus is blocked. Three proteins were studied: vesicular-stomatitis-virus N protein, Sendai-virus NP protein, and a rabbit immunoglobulin lambda-light chain. These proteins, found to be blocked at the N-terminus with either the acetyl group or a pyroglutamic acid residue, had all failed to yield to attempted Edman degradation, in one case even after attempted enzymic removal of the pyroglutamic acid residue. The N-terminal regions of all three proteins were sequenced by using the new procedure. PMID:6421284
Phylogenetic analysis of β-xylanase SRXL1 of Sporisorium reilianum and its relationship with families (GH10 and GH11) of Ascomycetes and Basidiomycetes

PubMed Central

Álvarez-Cervantes, Jorge; Díaz-Godínez, Gerardo; Mercado-Flores, Yuridia; Gupta, Vijai Kumar; Anducho-Reyes, Miguel Angel

2016-01-01

In this paper, the amino acid sequence of the β-xylanase SRXL1 of Sporisorium reilianum, which is a pathogenic fungus of maize was used as a model protein to find its phylogenetic relationship with other xylanases of Ascomycetes and Basidiomycetes and the information obtained allowed to establish a hypothesis of monophyly and of biological role. 84 amino acid sequences of β-xylanase obtained from the GenBank database was used. Groupings analysis of higher-level in the Pfam database allowed to determine that the proteins under study were classified into the GH10 and GH11 families, based on the regions of highly conserved amino acids, 233–318 and 180–193 respectively, where glutamate residues are responsible for the catalysis. PMID:27040368
Direct Calculation of Protein Fitness Landscapes through Computational Protein Design

PubMed Central

Au, Loretta; Green, David F.

2016-01-01

Naturally selected amino-acid sequences or experimentally derived ones are often the basis for understanding how protein three-dimensional conformation and function are determined by primary structure. Such sequences for a protein family comprise only a small fraction of all possible variants, however, representing the fitness landscape with limited scope. Explicitly sampling and characterizing alternative, unexplored protein sequences would directly identify fundamental reasons for sequence robustness (or variability), and we demonstrate that computational methods offer an efficient mechanism toward this end, on a large scale. The dead-end elimination and A∗ search algorithms were used here to find all low-energy single mutant variants, and corresponding structures of a G-protein heterotrimer, to measure changes in structural stability and binding interactions to define a protein fitness landscape. We established consistency between these algorithms with known biophysical and evolutionary trends for amino-acid substitutions, and could thus recapitulate known protein side-chain interactions and predict novel ones. PMID:26745411
Sequence and phylogenetic analysis of chicken anaemia virus obtained from backyard and commercial chickens in Nigeria.

PubMed

Oluwayelu, D O; Todd, D; Olaleye, O D

2008-12-01

This work reports the first molecular analysis study of chicken anaemia virus (CAV) in backyard chickens in Africa using molecular cloning and sequence analysis to characterize CAV strains obtained from commercial chickens and Nigerian backyard chickens. Partial VP1 gene sequences were determined for three CAVs from commercial chickens and for six CAV variants present in samples from a backyard chicken. Multiple alignment analysis revealed that the 6% and 4% nucleotide diversity obtained respectively for the commercial and backyard chicken strains translated to only 2% amino acid diversity for each breed. Overall, the amino acid composition of Nigerian CAVs was found to be highly conserved. Since the partial VP1 gene sequence of two backyard chicken cloned CAV strains (NGR/CI-8 and NGR/CI-9) were almost identical and evolutionarily closely related to the commercial chicken strains NGR-1, and NGR-4 and NGR-5, respectively, we concluded that CAV infections had crossed the farm boundary.
RaptorX server: a resource for template-based protein structure modeling.

PubMed

Källberg, Morten; Margaryan, Gohar; Wang, Sheng; Ma, Jianzhu; Xu, Jinbo

2014-01-01

Assigning functional properties to a newly discovered protein is a key challenge in modern biology. To this end, computational modeling of the three-dimensional atomic arrangement of the amino acid chain is often crucial in determining the role of the protein in biological processes. We present a community-wide web-based protocol, RaptorX server ( http://raptorx.uchicago.edu ), for automated protein secondary structure prediction, template-based tertiary structure modeling, and probabilistic alignment sampling.Given a target sequence, RaptorX server is able to detect even remotely related template sequences by means of a novel nonlinear context-specific alignment potential and probabilistic consistency algorithm. Using the protocol presented here it is thus possible to obtain high-quality structural models for many target protein sequences when only distantly related protein domains have experimentally solved structures. At present, RaptorX server can perform secondary and tertiary structure prediction of a 200 amino acid target sequence in approximately 30 min.
The ura5 gene of the ascomycete Sordaria macrospora: molecular cloning, characterization and expression in Escherichia coli.

PubMed

Le Chevanton, L; Leblon, G

1989-04-15

We cloned the ura5 gene coding for the orotate phosphoribosyl transferase from the ascomycete Sordaria macrospora by heterologous probing of a Sordaria genomic DNA library with the corresponding Podospora anserina sequence. The Sordaria gene was expressed in an Escherichia coli pyrE mutant strain defective for the same enzyme, and expression was shown to be promoted by plasmid sequences. The nucleotide sequence of the 1246-bp DNA fragment encompassing the region of homology with the Podospora gene has been determined. This sequence contains an open reading frame of 699 nucleotides. The deduced amino acid sequence shows 72% similarity with the corresponding Podospora protein.
How Messenger RNA and Nascent Chain Sequences Regulate Translation Elongation.

PubMed

Choi, Junhong; Grosely, Rosslyn; Prabhakar, Arjun; Lapointe, Christopher P; Wang, Jinfan; Puglisi, Joseph D

2018-06-20

Translation elongation is a highly coordinated, multistep, multifactor process that ensures accurate and efficient addition of amino acids to a growing nascent-peptide chain encoded in the sequence of translated messenger RNA (mRNA). Although translation elongation is heavily regulated by external factors, there is clear evidence that mRNA and nascent-peptide sequences control elongation dynamics, determining both the sequence and structure of synthesized proteins. Advances in methods have driven experiments that revealed the basic mechanisms of elongation as well as the mechanisms of regulation by mRNA and nascent-peptide sequences. In this review, we highlight how mRNA and nascent-peptide elements manipulate the translation machinery to alter the dynamics and pathway of elongation.

Method for nucleic acid hybridization using single-stranded DNA binding protein

DOEpatents

Tabor, Stanley; Richardson, Charles C.

1996-01-01

Method of nucleic acid hybridization for detecting the presence of a specific nucleic acid sequence in a population of different nucleic acid sequences using a nucleic acid probe. The nucleic acid probe hybridizes with the specific nucleic acid sequence but not with other nucleic acid sequences in the population. The method includes contacting a sample (potentially including the nucleic acid sequence) with the nucleic acid probe under hybridizing conditions in the presence of a single-stranded DNA binding protein provided in an amount which stimulates renaturation of a dilute solution (i.e., one in which the t.sub.1/2 of renaturation is longer than 3 weeks) of single-stranded DNA greater than 500 fold (i.e., to a t.sub.1/2 less than 60 min, preferably less than 5 min, and most preferably about 1 min.) in the absence of nucleotide triphosphates.
In-Gel Determination of L-Amino Acid Oxidase Activity Based on the Visualization of Prussian Blue-Forming Reaction

PubMed Central

Zhou, Ning; Zhao, Chuntian

2013-01-01

L-amino acid oxidase (LAAO) is attracting increasing attention due to its important functions. Diverse detection methods with their own properties have been developed for characterization of LAAO. In the present study, a simple, rapid, sensitive, cost-effective and reproducible method for quantitative in-gel determination of LAAO activity based on the visualization of Prussian blue-forming reaction is described. Coupled with SDS-PAGE, this Prussian blue agar assay can be directly used to determine the numbers and approximate molecular weights of LAAO in one step, allowing straightforward application for purification and sequence identification of LAAO from diverse samples. PMID:23383337
Nucleotide Sequence Analysis of RNA Synthesized from Rabbit Globin Complementary DNA

PubMed Central

Poon, Raymond; Paddock, Gary V.; Heindell, Howard; Whitcome, Philip; Salser, Winston; Kacian, Dan; Bank, Arthur; Gambino, Roberto; Ramirez, Francesco

1974-01-01

Rabbit globin complementary DNA made with RNA-dependent DNA polymerase (reverse transcriptase) was used as template for in vitro synthesis of 32P-labeled RNA. The sequences of the nucleotides in most of the fragments resulting from combined ribonuclease T1 and alkaline phosphatase digestion have been determined. Several fragments were long enough to fit uniquely with the α or β globin amino-acid sequences. These data demonstrate that the cDNA was copied from globin mRNA and contained no detectable contaminants. Images PMID:4139714
Nucleotide Sequence of the blaRTG-2 (CARB-5) Gene and Phylogeny of a New Group of Carbenicillinases

PubMed Central

Choury, Daniele; Szajnert, Marie-France; Joly-Guillou, Marie-Laure; Azibi, Kemal; Delpech, Marc; Paul, Gérard

2000-01-01

We determined the nucleotide sequence of the bla gene for the Acinetobacter calcoaceticus β-lactamase previously described as CARB-5. Alignment of the deduced amino acid sequence with those of known β-lactamases revealed that CARB-5 possesses an RTG triad in box VII, as described for the Proteus mirabilis GN79 enzyme, instead of the RSG consensus characteristic of the other carbenicillinases. Phylogenetic studies showed that these RTG enzymes constitute a new, separate group, possibly ancestors of the carbenicillinase family. PMID:10722515
Genome Sequence of the White Koji Mold Aspergillus kawachii IFO 4308, Used for Brewing the Japanese Distilled Spirit Shochu

PubMed Central

Futagami, Taiki; Mori, Kazuki; Yamashita, Ayaka; Wada, Shotaro; Kajiwara, Yasuhiro; Takashita, Hideharu; Omori, Toshiro; Takegawa, Kaoru; Tashiro, Kosuke; Kuhara, Satoru; Goto, Masatoshi

2011-01-01

The filamentous fungus Aspergillus kawachii has traditionally been used for brewing the Japanese distilled spirit shochu. A. kawachii characteristically hyperproduces citric acid and a variety of polysaccharide glycoside hydrolases. Here the genome sequence of A. kawachii IFO 4308 was determined and annotated. Analysis of the sequence may provide insight into the properties of this fungus that make it superior for use in shochu production, leading to the further development of A. kawachii for industrial applications. PMID:22045919
Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.

PubMed

Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami

2012-08-01

Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature ( http://hivdb.Stanford.edu ). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if with >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or <0.5% or >15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.
Identification of amino acid changes in the envelope glycoproteins of bovine viral diarrhea viruses isolated from alpaca that may be involved in host adaptation.

PubMed

Neill, John D; Dubovi, Edward J; Ridpath, Julia F

2015-09-30

Bovine viral diarrhea viruses (BVDV) are most commonly associated with infections of cattle. However, BVDV are often isolated from closely related ruminants with a number of BVDV-1b viruses being isolated from alpacas that were both acutely and persistently infected. The complete nucleotide sequence of the open reading frame of eleven alpaca-adapted BVDV isolates and the region encoding the envelope glycoproteins of an additional three isolates were determined. With the exception of one, all alpaca isolates were >99.2% similar at the nucleotide level. The Hercules isolate was more divergent, with 95.7% sequence identity to the other viruses. Sequence similarity of the 14 viruses indicated they were isolates of a single BVDV strain that had adapted to and were circulating through alpaca herds. Hercules was a more distantly related strain that has been isolated only once in Canada and represented a separate adaptation event that possessed the same adaptive changes. Comparison of amino acid sequences of alpaca and bovine-derived BVDV strains revealed three regions with amino acid sequences unique to all alpaca isolates. The first contained two small in-frame deletions near the N-terminus of the E2 glycoprotein. The second was found near the C-terminus of the E2 protein where four altered amino acids were located within a 30 amino acid domain that participates in E2 homodimerization. The third region contained three variable amino acids in the C-terminus of the E(rns) within the amphipathic helix membrane anchor. These changes were found in the polar side of the amphipathic helix and resulted in an increased charge within the polar face. Titration of bovine and alpaca viruses in both bovine and alpaca cells indicated that with increased charge in the amphipathic helix, the ability to infect alpaca cells also increased. Published by Elsevier B.V.
Identification of single amino acid substitutions (SAAS) in neuraminidase from influenza a virus (H1N1) via mass spectrometry analysis coupled with de novo peptide sequencing.

PubMed

Peng, Qisheng; Wang, Zijian; Wu, Donglin; Li, Xiaoou; Liu, Xiaofeng; Sun, Wanchun; Liu, Ning

2016-08-01

Amino acid substitutions in the neuraminidase of the influenza virus are the main cause of the emergence of resistance to zanamivir or oseltamivir during seasonal influenza treatment; they are the result of non-synonymous mutations in the viral genome that can be successfully detected by polymer chain reaction (PCR)-based approaches. There is always an urgent need to detect variation in amino acid sequences directly at the protein level. Mass spectrometry coupled with de novo sequencing has been explored as an alternative and straightforward strategy for detecting amino acid substitutions, as well - this approach is the primary focus of the present study. Influenza virus (A/Puerto Rico/8/1934 H1N1) propagated in embryonated chicken eggs was purified by ultracentrifugation, followed by PNGase F treatment. The deglycosylated virion was lysed and separated by sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). The gel band corresponding to neuraminidase was picked up and subjected to liquid chromatography tandem mass spectrometry (LC-MS/MS) analysis. LC-MS/MS analyses, coupled with manual de novo sequencing, allowed the determination of three amino acid substitutions: R346K, S349 N, and S370I/L, in the neuraminidase from the influenza virus (A/Puerto Rico/8/1934 H1N1), which were located in three mutated peptides of the neuraminidase: YGNGVWIGK, TKNHSSR, and PNGWTETDI/LK, respectively. We found that the amino acid substitutions in the proteins of RNA viruses (including influenza A virus) resulting from non-synonymous gene mutations can indeed be directly analyzed via mass spectrometry, and that manual interpretation of the MS/MS data may be beneficial. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Cloning and molecular characterization of the cDNAs encoding the variable regions of an anti-CD20 monoclonal antibody.

PubMed

Shanehbandi, Dariush; Majidi, Jafar; Kazemi, Tohid; Baradaran, Behzad; Aghebati-Maleki, Leili

2017-01-01

CD20-based targeting of B-cells in hematologic malignancies and autoimmune disorders is associated with outstanding clinical outcomes. Isolation and characterization of VH and VL cDNAs encoding the variable regions of the heavy and light chains of monoclonal antibodies (MAb) is necessary to produce next generation MAbs and their derivatives such as bispecific antibodies (bsAb) and single-chain variable fragments (scFv). This study was aimed at cloning and characterization of the VH and VL cDNAs from a hybridoma cell line producing an anti-CD20 MAb. VH and VL fragments were amplified, cloned and characterized. Furthermore, amino acid sequences of VH, VL and corresponding complementarity-determining regions (CDR) were determined and compared with those of four approved MAbs including Rituximab (RTX), Ibritumomab tiuxetan, Ofatumumab and GA101. The cloned VH and VL cDNAs were found to be functional and follow a consensus pattern. Amino acid sequences corresponding to the VH and VL fragments also indicated noticeable homologies to those of RTX and Ibritumomab. Furthermore, amino acid sequences of the relating CDRs had remarkable similarities to their counterparts in RTX and Ibritumomab. Successful recovery of VH and VL fragments encourages the development of novel CD20 targeting bsAbs, scFvs, antibody conjugates and T-cells armed with chimeric antigen receptors.
Bioinformatics analysis and detection of gelatinase encoded gene in Lysinibacillussphaericus

NASA Astrophysics Data System (ADS)

Repin, Rul Aisyah Mat; Mutalib, Sahilah Abdul; Shahimi, Safiyyah; Khalid, Rozida Mohd.; Ayob, Mohd. Khan; Bakar, Mohd. Faizal Abu; Isa, Mohd Noor Mat

2016-11-01

In this study, we performed bioinformatics analysis toward genome sequence of Lysinibacillussphaericus (L. sphaericus) to determine gene encoded for gelatinase. L. sphaericus was isolated from soil and gelatinase species-specific bacterium to porcine and bovine gelatin. This bacterium offers the possibility of enzymes production which is specific to both species of meat, respectively. The main focus of this research is to identify the gelatinase encoded gene within the bacteria of L. Sphaericus using bioinformatics analysis of partially sequence genome. From the research study, three candidate gene were identified which was, gelatinase candidate gene 1 (P1), NODE_71_length_93919_cov_158.931839_21 which containing 1563 base pair (bp) in size with 520 amino acids sequence; Secondly, gelatinase candidate gene 2 (P2), NODE_23_length_52851_cov_190.061386_17 which containing 1776 bp in size with 591 amino acids sequence; and Thirdly, gelatinase candidate gene 3 (P3), NODE_106_length_32943_cov_169.147919_8 containing 1701 bp in size with 566 amino acids sequence. Three pairs of oligonucleotide primers were designed and namely as, F1, R1, F2, R2, F3 and R3 were targeted short sequences of cDNA by PCR. The amplicons were reliably results in 1563 bp in size for candidate gene P1 and 1701 bp in size for candidate gene P3. Therefore, the results of bioinformatics analysis of L. Sphaericus resulting in gene encoded gelatinase were identified.
Structural requirements for recognition of the HLA-Dw14 class II epitope: A key HLA determinant associated with rheumatoid arthritis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hiraiwa, Akikazu; Yamanaka, Katsuo; Kwok, W.W.

Although HLA genes have been shown to be associated with certain diseases, the basis for this association is unknown. Recent studies, however, have documented patterns of nucleotide sequence variation among some HLA genes associated with a particular disease. For rheumatoid arthritis, HLA genes in most patients have a shared nucleotide sequence encoding a key structural element of an HLA class II polypeptide; this sequence element is critical for the interaction of the HLA molecule with antigenic peptides and with responding T cells, suggestive of a direct role for this sequence element in disease susceptibility. The authors describe the serological andmore » cellular immunologic characteristics encoded by this rheumatoid arthritis-associated sequence element. Site-directed mutagenesis of the DRB1 gene was used to define amino acids critical for antibody and T-cell recognition of this structural element, focusing on residues that distinguish the rheumatoid arthritis-associated alleles Dw4 and Dw14 from a closely related allele, Dw10, not associated with disease. Both the gain and loss of rheumatoid arthritis-associated epitopes were highly dependent on three residues within a discrete domain of the HLA-DR molecule. Recognition was most strongly influenced by the following amino acids (in order): 70 > 71 > 67. Some alloreactive T-cell clones were also influenced by amino acid variation in portions of the DR molecule lying outside the shared sequence element.« less
Molecular characterization and phylogenetic analysis of a yak (Bos grunniens) κ-casein cDNA from lactating mammary gland.

PubMed

Bai, W L; Yin, R H; Dou, Q L; Jiang, W Q; Zhao, S J; Ma, Z J; Luo, G B; Zhao, Z H

2011-04-01

κ-Casein is one of the major proteins in the milk of mammals. It plays an important role in determining the size and specific function of milk micelles. We have previously identified and characterized a genetic variant of yak κ-casein by evaluating genomic DNA. Here, we isolate and characterize a yak κ-casein cDNA harboring the full-length open reading frame (ORF) from lactating mammary gland. Total RNA was extracted from mammary tissue of lactating female yak, and the κ-casein cDNA were synthesized by RT-PCR technique, then cloned and sequenced. The obtained cDNA of 660-bp contained an ORF sufficient to encode the entire amino acid sequence of κ-casein precursor protein consisting of 190 amino acids with a signal peptide of 21 amino acids. Yak κ-casein has a predicted molecular mass of 19,006.588 Da with a calculated isoelectric point of 7.245. Compared with the corresponding sequences in GenBank of cattle, buffalo, sheep, goat, Arabian camel, horse, and rabbit, yak κ-casein sequence had identity of 64.76-98.78% in cDNA, and identity of 44.79-98.42% and similarity of 53.65-98.42% in deduced amino acids, revealing a high homology with the other livestock species. Based on κ-casein cDNA sequences, the phylogenetic analysis indicated that yak κ-casein had a close relationship with that of cattle. This work might be useful in the genetic engineering researches for yak κ-casein.
GAWK, a novel human pituitary polypeptide: isolation, immunocytochemical localization and complete amino acid sequence.

PubMed

Benjannet, S; Leduc, R; Lazure, C; Seidah, N G; Marcinkiewicz, M; Chrétien, M

1985-01-16

During the course of reverse-phase high pressure liquid chromatography (RP-HPLC) purification of a postulated big ACTH (1) from human pituitary gland extracts, a highly purified peptide bearing no resemblance to any known polypeptide was isolated. The complete sequence of this 74 amino acid polypeptide, called GAWK, has been determined. Search on a computer data bank on the possible homology to any known protein or fragment, using a mutation data matrix, failed to reveal any homology greater than 30%. An antibody produced against a synthetic fragment allowed us to detect several immunoreactive forms. The antisera also enabled us to localize the polypeptide, by immunocytochemistry, in the anterior lobe of the pituitary gland.
A novel chaotic based image encryption using a hybrid model of deoxyribonucleic acid and cellular automata

NASA Astrophysics Data System (ADS)

Enayatifar, Rasul; Sadaei, Hossein Javedani; Abdullah, Abdul Hanan; Lee, Malrey; Isnin, Ismail Fauzi

2015-08-01

Currently, there are many studies have conducted on developing security of the digital image in order to protect such data while they are sending on the internet. This work aims to propose a new approach based on a hybrid model of the Tinkerbell chaotic map, deoxyribonucleic acid (DNA) and cellular automata (CA). DNA rules, DNA sequence XOR operator and CA rules are used simultaneously to encrypt the plain-image pixels. To determine rule number in DNA sequence and also CA, a 2-dimension Tinkerbell chaotic map is employed. Experimental results and computer simulations, both confirm that the proposed scheme not only demonstrates outstanding encryption, but also resists various typical attacks.
Molecular identification of catalases from Nicotiana plumbaginifolia (L.).

PubMed

Willekens, H; Villarroel, R; Van Montagu, M; Inzé, D; Van Camp, W

1994-09-19

We have isolated three different catalase cDNAs from Nicotiana plumbaginifolia (cat1, cat2, and cat3) and a partial sequence of a fourth catalase gene (cat4) that shows no discernible expression based on Northern analysis. The catalase sequences were used to determine the similarity with other plant catalases and to study the transcriptional response to paraquat, 3-aminotriazole, and salicylic acid. 3-Aminotriazole induces mRNA levels of cat1, cat2 and cat3, indicating that a reduction in catalase activity positively affects catalase mRNA abundance. Salicylic acid that binds catalase in vitro, had no effect on catalase transcript levels at physiological concentrations. Paraquat resulted in the induction of cat1.
Antimicrobial peptides containing unnatural amino acid exhibit potent bactericidal activity against ESKAPE pathogens.

PubMed

Hicks, R P; Abercrombie, J J; Wong, R K; Leung, K P

2013-01-01

A series of 36 synthetic antimicrobial peptides containing unnatural amino acids were screened to determine their effectiveness to treat Enterococcus faecium, Staphylococcus aureus, Klebsiella pnemoniae, Acinetobacter baumannii, Pseudomonas aeruginosa, and Enterobacter species (ESKAPE) pathogens, which are known to commonly infect chronic wounds. The primary amino acid sequences of these peptides incorporate either three or six dipeptide units consisting of the unnatural amino acids Tetrahydroisoquinolinecarboxylic acid (Tic) and Octahydroindolecarboxylic acid (Oic). The Tic-Oic dipeptide units are separated by SPACER amino acids with specific physicochemical properties that control how these peptides interact with bacterial cell membranes of different chemical compositions. These peptides exhibited minimum inhibitory concentrations (MIC) against these pathogens in the range from >100 to 6.25 μg/mL. The observed diversity of MIC values for these peptides against the various bacterial strains are consistent with our hypothesis that the complementarity of the physicochemical properties of the peptide and the lipid of the bacteria's cell membrane determines the resulting antibacterial activity of the peptide. Published by Elsevier Ltd.
Genomic organization and sequence of the Gus-s/sup a/ allele of the murine. beta. -glucuronidase gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Funkenstein, B.; Leary, S.L.; Stein, J.C.

1988-03-01

The Gus-s/sup ..cap alpha../ allele of the mouse ..beta..-glucuronidase gene exhibits a high degree of inducibility by androgens due to its linkage with the Gus-r/sup ..cap alpha../ regulatory locus. The authors isolated Gus-s/sup ..cap alpha../ on a 28-kilobase pair fragment of mouse chromosome 5 and found that it contains 12 exons and 11 intervening sequences spanning 14 kilobase pairs of this genomic segment. The mRNA cap site was identified by ribonuclease protection and primer extension analyses which revealed an unusually short 5' noncoding sequence of 12 nucleotides. Proximal regulatory sequences in the 5'-flanking DNA and the complete sequence of themore » Gus-s/sup ..cap alpha../ mRNA transcript were also determined. Comparison of the amino acid sequence determined from the Gus-s/sup ..cap alpha../ nucleotide sequence with that of human ..beta..-glucuronidase indicated that the two human mRNA species differ due to alternate splicing of an exon homologous to exon 6 of the mouse gene.« less
Contribution of silent mutations to thermal adaptation of RNA bacteriophage Qβ.

PubMed

Kashiwagi, Akiko; Sugawara, Ryu; Sano Tsushima, Fumie; Kumagai, Tomofumi; Yomo, Tetsuya

2014-10-01

Changes in protein function and other biological properties, such as RNA structure, are crucial for adaptation of organisms to novel or inhibitory environments. To investigate how mutations that do not alter amino acid sequence may be positively selected, we performed a thermal adaptation experiment using the single-stranded RNA bacteriophage Qβ in which the culture temperature was increased from 37.2°C to 41.2°C and finally to an inhibitory temperature of 43.6°C in a stepwise manner in three independent lines. Whole-genome analysis revealed 31 mutations, including 14 mutations that did not result in amino acid sequence alterations, in this thermal adaptation. Eight of the 31 mutations were observed in all three lines. Reconstruction and fitness analyses of Qβ strains containing only mutations observed in all three lines indicated that five mutations that did not result in amino acid sequence changes but increased the amplification ratio appeared in the course of adaptation to growth at 41.2°C. Moreover, these mutations provided a suitable genetic background for subsequent mutations, altering the fitness contribution from deleterious to beneficial. These results clearly showed that mutations that do not alter the amino acid sequence play important roles in adaptation of this single-stranded RNA virus to elevated temperature. Recent studies using whole-genome analysis technology suggested the importance of mutations that do not alter the amino acid sequence for adaptation of organisms to novel environmental conditions. It is necessary to investigate how these mutations may be positively selected and to determine to what degree such mutations that do not alter amino acid sequences contribute to adaptive evolution. Here, we report the roles of these silent mutations in thermal adaptation of RNA bacteriophage Qβ based on experimental evolution during which Qβ showed adaptation to growth at an inhibitory temperature. Intriguingly, four synonymous mutations and one mutation in the untranslated region that spread widely in the Qβ population during the adaptation process at moderately high temperature provided a suitable genetic background to alter the fitness contribution of subsequent mutations from deleterious to beneficial at a higher temperature. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Isolation and distribution of a novel iron-oxidizing crenarchaeon from acidic geothermal springs in Yellowstone National Park.

PubMed

Kozubal, M; Macur, R E; Korf, S; Taylor, W P; Ackerman, G G; Nagy, A; Inskeep, W P

2008-02-01

Novel thermophilic crenarchaea have been observed in Fe(III) oxide microbial mats of Yellowstone National Park (YNP); however, no definitive work has identified specific microorganisms responsible for the oxidation of Fe(II). The objectives of the current study were to isolate and characterize an Fe(II)-oxidizing member of the Sulfolobales observed in previous 16S rRNA gene surveys and to determine the abundance and distribution of close relatives of this organism in acidic geothermal springs containing high concentrations of dissolved Fe(II). Here we report the isolation and characterization of the novel, Fe(II)-oxidizing, thermophilic, acidophilic organism Metallosphaera sp. strain MK1 obtained from a well-characterized acid-sulfate-chloride geothermal spring in Norris Geyser Basin, YNP. Full-length 16S rRNA gene sequence analysis revealed that strain MK1 exhibits only 94.9 to 96.1% sequence similarity to other known Metallosphaera spp. and less than 89.1% similarity to known Sulfolobus spp. Strain MK1 is a facultative chemolithoautotroph with an optimum pH range of 2.0 to 3.0 and an optimum temperature range of 65 to 75 degrees C. Strain MK1 grows optimally on pyrite or Fe(II) sorbed onto ferrihydrite, exhibiting doubling times between 10 and 11 h under aerobic conditions (65 degrees C). The distribution and relative abundance of MK1-like 16S rRNA gene sequences in 14 acidic geothermal springs containing Fe(III) oxide microbial mats were evaluated. Highly related MK1-like 16S rRNA gene sequences (>99% sequence similarity) were consistently observed in Fe(III) oxide mats at temperatures ranging from 55 to 80 degrees C. Quantitative PCR using Metallosphaera-specific primers confirmed that organisms highly similar to strain MK1 comprised up to 40% of the total archaeal community at selected sites. The broad distribution of highly related MK1-like 16S rRNA gene sequences in acidic Fe(III) oxide microbial mats is consistent with the observed characteristics and growth optima of Metallosphaera-like strain MK1 and emphasizes the importance of this newly described taxon in Fe(II) chemolithotrophy in acidic high-temperature environments of YNP.
Nucleotide sequence of a chickpea chlorotic stunt virus relative that infects pea and faba bean in China.

PubMed

Zhou, Cui-Ji; Xiang, Hai-Ying; Zhuo, Tao; Li, Da-Wei; Yu, Jia-Lin; Han, Cheng-Gui

2012-07-01

We determined the genome sequence of a new polerovirus that infects field pea and faba bean in China. Its entire nucleotide sequence (6021 nt) was most closely related (83.3% identity) to that of an Ethiopian isolate of chickpea chlorotic stunt virus (CpCSV-Eth). With the exception of the coat protein (encoded by ORF3), amino acid sequence identities of all gene products of this virus to those of CpCSV-Eth and other poleroviruses were <90%. This suggests that it is a new member of the genus Polerovirus, and the name pea mild chlorosis virus is proposed.

Statistical theory for protein combinatorial libraries. Packing interactions, backbone flexibility, and the sequence variability of a main-chain structure.

PubMed

Kono, H; Saven, J G

2001-02-23

Combinatorial experiments provide new ways to probe the determinants of protein folding and to identify novel folding amino acid sequences. These types of experiments, however, are complicated both by enormous conformational complexity and by large numbers of possible sequences. Therefore, a quantitative computational theory would be helpful in designing and interpreting these types of experiment. Here, we present and apply a statistically based, computational approach for identifying the properties of sequences compatible with a given main-chain structure. Protein side-chain conformations are included in an atom-based fashion. Calculations are performed for a variety of similar backbone structures to identify sequence properties that are robust with respect to minor changes in main-chain structure. Rather than specific sequences, the method yields the likelihood of each of the amino acids at preselected positions in a given protein structure. The theory may be used to quantify the characteristics of sequence space for a chosen structure without explicitly tabulating sequences. To account for hydrophobic effects, we introduce an environmental energy that it is consistent with other simple hydrophobicity scales and show that it is effective for side-chain modeling. We apply the method to calculate the identity probabilities of selected positions of the immunoglobulin light chain-binding domain of protein L, for which many variant folding sequences are available. The calculations compare favorably with the experimentally observed identity probabilities.
Nucleotide sequence analysis establishes the role of endogenous murine leukemia virus DNA segments in formation of recombinant mink cell focus-forming murine leukemia viruses.

PubMed Central

Khan, A S

1984-01-01

The sequence of 363 nucleotides near the 3' end of the pol gene and 564 nucleotides from the 5' terminus of the env gene in an endogenous murine leukemia viral (MuLV) DNA segment, cloned from AKR/J mouse DNA and designated as A-12, was obtained. For comparison, the nucleotide sequence in an analogous portion of AKR mink cell focus-forming (MCF) 247 MuLV provirus was also determined. Sequence features unique to MCF247 MuLV DNA in the 3' pol and 5' env regions were identified by comparison with nucleotide sequences in analogous regions of NFS -Th-1 xenotropic and AKR ecotropic MuLV proviruses. These included (i) an insertion of 12 base pairs encoding four amino acids located 60 base pairs from the 3' terminus of the pol gene and immediately preceding the env gene, (ii) the deletion of 12 base pairs (encoding four amino acids) and the insertion of 3 base pairs (encoding one amino acid) in the 5' portion of the env gene, and (iii) single base substitutions resulting in 2 MCF247 -specific amino acids in the 3' pol and 23 in the 5' env regions. Nucleotide sequence comparison involving the 3' pol and 5' env regions of AKR MCF247 , NFS xenotropic, and AKR ecotropic MuLV proviruses with the cloned endogenous MuLV DNA indicated that MCF247 proviral DNA sequences were conserved in the cloned endogenous MuLV proviral segment. In fact, total nucleotide sequence identity existed between the endogenous MuLV DNA and the MCF247 MuLV provirus in the 3' portion of the pol gene. In the 5' env region, only 4 of 564 nucleotides were different, resulting in three amino acid changes between AKR MCF247 MuLV DNA and the endogenous MuLV DNA present in clone A-12. In addition, nucleotide sequence comparison indicated that Moloney-and Friend-MCF MuLVs were also highly related in the 3' pol and 5' env regions to the cloned endogenous MuLV DNA. These results establish the role of endogenous MuLV DNA segments in generation of recombinant MCF viruses. PMID:6328017
Development of a rapid and simple immunochromatographic assay to identify Vibrio parahaemolyticus.

PubMed

Sakata, Junko; Kawatsu, Kentaro; Iwasaki, Tadashi; Kumeda, Yuko

2015-09-01

To rapidly and simply determine whether or not bacterial colonies growing on agar were Vibrio parahaemolyticus, we developed an immunochromatographic assay (VP-ICA) using two different monoclonal antibodies (designated mAb-VP34 and mAb-VP109) against the delta subunit of V. parahaemolyticus-F0F1 ATP synthase. The epitopes recognized by mAb-VP34 and mAb-VP109 were mapped to sequences of eight ((47)LLTSSFSA(54)) and six amino acid residues ((16)FDFAVD(21)), respectively. An amino acid sequence similarity search of the NCBI database using BLASTP showed that both epitopic amino acid sequences were present together only in V. parahaemolyticus. When 124 V. parahaemolyticus strains and 94 strains of 27 other Vibrio species or 35 non-Vibrio species were tested using the VP-ICA, the VP-ICA identified V. parahaemolyticus with 100% accuracy. The VP-ICA rapidly and simply identified the pathogen directly from a single agar colony within 30 min, indicating that VP-ICA will greatly reduce labor and time required to identify V. parahaemolyticus compared with conventional biochemical tests. Copyright © 2015. Published by Elsevier B.V.
Sequence of the fhuE outer-membrane receptor gene of Escherichia coli K12 and properties of mutants.

PubMed

Sauer, M; Hantke, K; Braun, V

1990-03-01

The fhuE gene of Escherichia coli codes for an outer-membrane receptor protein required for the uptake of iron(III) via coprogen, ferrioxamine B and rhodotorulic acid. The amino acid sequence, deduced from the nucleotide sequence, consisted of 729 residues. The mature form, composed of 693 residues, has a calculated molecular weight of 77,453, which agrees with the molecular weight of 76,000 determined by polyacrylamide gel electrophoresis. The FhuE protein contains four regions of homology with other TonB-dependent receptors. A valine to proline exchange in the 'TonB box' abolished transport activity. Phenotypic revertants with substitutions of arginine, glutamine, or leucine at the valine position exhibited increasing iron-coprogen transport rates. Point mutations resulting in the replacement of glycine (127) in the second homology region with either alanine, aspartate, valine, asparagine or histidine exhibited decreased transport rates (listed in descending order). A truncated FhuE protein lacking 24 amino acids at the C-terminal end was exported to the periplasm but failed to be inserted into the outer membrane.
Determination of Calcium in Dietary Supplements: Statistical Comparison of Methods in the Analytical Laboratory

ERIC Educational Resources Information Center

Garvey, Sarah L.; Shahmohammadi, Golbon; McLain, Derek R.; Dietz, Mark L.

2015-01-01

A laboratory experiment is described in which students compare two methods for the determination of the calcium content of commercial dietary supplement tablets. In a two-week sequence, the sample tablets are first analyzed via complexometric titration with ethylenediaminetetraacetic acid and then, following ion exchange of the calcium ion present…
Using One's Hands for Naming Optical Isomers and Other Stereochemical Positions.

ERIC Educational Resources Information Center

Mezl, Vasek A.

1996-01-01

Presents a method that allows students to use their hands to obtain the stereochemistry of chiral centers without redrawing the structure. Discusses the use of the model in: determining the configurations of amino acids, determining if sugars are D or L isomers, the sequence rule procedure, prochirality, naming the sides of trigonal carbons, and…
HeLa Nucleic Acid Contamination in The Cancer Genome Atlas Leads to the Misidentification of Human Papillomavirus 18

PubMed Central

Cantalupo, Paul G.; Katz, Joshua P.

2015-01-01

ABSTRACT We searched The Cancer Genome Atlas (TCGA) database for viruses by comparing non-human reads present in transcriptome sequencing (RNA-Seq) and whole-exome sequencing (WXS) data to viral sequence databases. Human papillomavirus 18 (HPV18) is an etiologic agent of cervical cancer, and as expected, we found robust expression of HPV18 genes in cervical cancer samples. In agreement with previous studies, we also found HPV18 transcripts in non-cervical cancer samples, including those from the colon, rectum, and normal kidney. However, in each of these cases, HPV18 gene expression was low, and single-nucleotide variants and positions of genomic alignments matched the integrated portion of HPV18 present in HeLa cells. Chimeric reads that match a known virus-cell junction of HPV18 integrated in HeLa cells were also present in some samples. We hypothesize that HPV18 sequences in these non-cervical samples are due to nucleic acid contamination from HeLa cells. This finding highlights the problems that contamination presents in computational virus detection pipelines. IMPORTANCE Viruses associated with cancer can be detected by searching tumor sequence databases. Several studies involving searches of the TCGA database have reported the presence of HPV18, a known cause of cervical cancer, in a small number of additional cancers, including those of the rectum, kidney, and colon. We have determined that the sequences related to HPV18 in non-cervical samples are due to nucleic acid contamination from HeLa cells. To our knowledge, this is the first report of the misidentification of viruses in next-generation sequencing data of tumors due to contamination with a cancer cell line. These results raise awareness of the difficulty of accurately identifying viruses in human sequence databases. PMID:25631090
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2014-02-25

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-05-16

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-04-01

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2010-10-12

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVIII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-05-23

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl8, and the corresponding EGVIII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVIII, recombinant EGVIII proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2010-10-05

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVI endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-06-06

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6, and the corresponding EGVI amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2009-05-05

The present invention provides an endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2013-07-16

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2012-02-14

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
EGVII endoglucanase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2015-04-14

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl7, and the corresponding EGVII amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVII, recombinant EGVII proteins and methods for producing the same.
Physico-chemical foundations underpinning microarray and next-generation sequencing experiments

PubMed Central

Harrison, Andrew; Binder, Hans; Buhot, Arnaud; Burden, Conrad J.; Carlon, Enrico; Gibas, Cynthia; Gamble, Lara J.; Halperin, Avraham; Hooyberghs, Jef; Kreil, David P.; Levicky, Rastislav; Noble, Peter A.; Ott, Albrecht; Pettitt, B. Montgomery; Tautz, Diethard; Pozhitkov, Alexander E.

2013-01-01

Hybridization of nucleic acids on solid surfaces is a key process involved in high-throughput technologies such as microarrays and, in some cases, next-generation sequencing (NGS). A physical understanding of the hybridization process helps to determine the accuracy of these technologies. The goal of a widespread research program is to develop reliable transformations between the raw signals reported by the technologies and individual molecular concentrations from an ensemble of nucleic acids. This research has inputs from many areas, from bioinformatics and biostatistics, to theoretical and experimental biochemistry and biophysics, to computer simulations. A group of leading researchers met in Ploen Germany in 2011 to discuss present knowledge and limitations of our physico-chemical understanding of high-throughput nucleic acid technologies. This meeting inspired us to write this summary, which provides an overview of the state-of-the-art approaches based on physico-chemical foundation to modeling of the nucleic acids hybridization process on solid surfaces. In addition, practical application of current knowledge is emphasized. PMID:23307556
Kit for detecting nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

2001-01-01

A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the target sequence.

G-quadruplex prediction in E. coli genome reveals a conserved putative G-quadruplex-Hairpin-Duplex switch.

PubMed

Kaplan, Oktay I; Berber, Burak; Hekim, Nezih; Doluca, Osman

2016-11-02

Many studies show that short non-coding sequences are widely conserved among regulatory elements. More and more conserved sequences are being discovered since the development of next generation sequencing technology. A common approach to identify conserved sequences with regulatory roles relies on topological changes such as hairpin formation at the DNA or RNA level. G-quadruplexes, non-canonical nucleic acid topologies with little established biological roles, are increasingly considered for conserved regulatory element discovery. Since the tertiary structure of G-quadruplexes is strongly dependent on the loop sequence which is disregarded by the generally accepted algorithm, we hypothesized that G-quadruplexes with similar topology and, indirectly, similar interaction patterns, can be determined using phylogenetic clustering based on differences in the loop sequences. Phylogenetic analysis of 52 G-quadruplex forming sequences in the Escherichia coli genome revealed two conserved G-quadruplex motifs with a potential regulatory role. Further analysis revealed that both motifs tend to form hairpins and G quadruplexes, as supported by circular dichroism studies. The phylogenetic analysis as described in this work can greatly improve the discovery of functional G-quadruplex structures and may explain unknown regulatory patterns. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
A re-evaluation of the final step of vanillin biosynthesis in the orchid Vanilla planifolia.

PubMed

Yang, Hailian; Barros-Rios, Jaime; Kourteva, Galina; Rao, Xiaolan; Chen, Fang; Shen, Hui; Liu, Chenggang; Podstolski, Andrzej; Belanger, Faith; Havkin-Frenkel, Daphna; Dixon, Richard A

2017-07-01

A recent publication describes an enzyme from the vanilla orchid Vanilla planifolia with the ability to convert ferulic acid directly to vanillin. The authors propose that this represents the final step in the biosynthesis of vanillin, which is then converted to its storage form, glucovanillin, by glycosylation. The existence of such a "vanillin synthase" could enable biotechnological production of vanillin from ferulic acid using a "natural" vanilla enzyme. The proposed vanillin synthase exhibits high identity to cysteine proteases, and is identical at the protein sequence level to a protein identified in 2003 as being associated with the conversion of 4-coumaric acid to 4-hydroxybenzaldehyde. We here demonstrate that the recombinant cysteine protease-like protein, whether expressed in an in vitro transcription-translation system, E. coli, yeast, or plants, is unable to convert ferulic acid to vanillin. Rather, the protein is a component of an enzyme complex that preferentially converts 4-coumaric acid to 4-hydroxybenzaldehyde, as demonstrated by the purification of this complex and peptide sequencing. Furthermore, RNA sequencing provides evidence that this protein is expressed in many tissues of V. planifolia irrespective of whether or not they produce vanillin. On the basis of our results, V. planifolia does not appear to contain a cysteine protease-like "vanillin synthase" that can, by itself, directly convert ferulic acid to vanillin. The pathway to vanillin in V. planifolia is yet to be conclusively determined. Copyright © 2017 Elsevier Ltd. All rights reserved.
Y chromosome specific nucleic acid probe and method for determining the Y chromosome in situ

DOEpatents

Gray, Joe W.; Weier, Heinz-Ulrich

1998-01-01

A method for producing a Y chromosome specific probe selected from highly repeating sequences on that chromosome is described. There is little or no nonspecific binding to autosomal and X chromosomes, and a very large signal is provided. Inventive primers allowing the use of PCR for both sample amplification and probe production are described, as is their use in producing large DNA chromosome painting sequences.
Y chromosome specific nucleic acid probe and method for determining the Y chromosome in situ

DOEpatents

Gray, Joe W.; Weier, Heinz-Ulrich

2001-01-01

A method for producing a Y chromosome specific probe selected from highly repeating sequences on that chromosome is described. There is little or no nonspecific binding to autosomal and X chromosomes, and a very large signal is provided. Inventive primers allowing the use of PCR for both sample amplification and probe production are described, as is their use in producing large DNA chromosome painting sequences.
Y chromosome specific nucleic acid probe and method for determining the Y chromosome in situ

DOEpatents

Gray, J.W.; Weier, H.U.

1998-11-24

A method for producing a Y chromosome specific probe selected from highly repeating sequences on that chromosome is described. There is little or no nonspecific binding to autosomal and X chromosomes, and a very large signal is provided. Inventive primers allowing the use of PCR for both sample amplification and probe production are described, as is their use in producing large DNA chromosome painting sequences. 9 figs.
Chip-based sequencing nucleic acids

DOEpatents

Beer, Neil Reginald

2014-08-26

A system for fast DNA sequencing by amplification of genetic material within microreactors, denaturing, demulsifying, and then sequencing the material, while retaining it in a PCR/sequencing zone by a magnetic field. One embodiment includes sequencing nucleic acids on a microchip that includes a microchannel flow channel in the microchip. The nucleic acids are isolated and hybridized to magnetic nanoparticles or to magnetic polystyrene-coated beads. Microreactor droplets are formed in the microchannel flow channel. The microreactor droplets containing the nucleic acids and the magnetic nanoparticles are retained in a magnetic trap in the microchannel flow channel and sequenced.
Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion

PubMed Central

Thomsen, Martin Christen Frølund; Nielsen, Morten

2012-01-01

Seq2Logo is a web-based sequence logo generator. Sequence logos are a graphical representation of the information content stored in a multiple sequence alignment (MSA) and provide a compact and highly intuitive representation of the position-specific amino acid composition of binding motifs, active sites, etc. in biological sequences. Accurate generation of sequence logos is often compromised by sequence redundancy and low number of observations. Moreover, most methods available for sequence logo generation focus on displaying the position-specific enrichment of amino acids, discarding the equally valuable information related to amino acid depletion. Seq2logo aims at resolving these issues allowing the user to include sequence weighting to correct for data redundancy, pseudo counts to correct for low number of observations and different logotype representations each capturing different aspects related to amino acid enrichment and depletion. Besides allowing input in the format of peptides and MSA, Seq2Logo accepts input as Blast sequence profiles, providing easy access for non-expert end-users to characterize and identify functionally conserved/variable amino acids in any given protein of interest. The output from the server is a sequence logo and a PSSM. Seq2Logo is available at http://www.cbs.dtu.dk/biotools/Seq2Logo (14 May 2012, date last accessed). PMID:22638583
RNA-seq based transcriptomic analysis uncovers α-linolenic acid and jasmonic acid biosynthesis pathways respond to cold acclimation in Camellia japonica

PubMed Central

Li, Qingyuan; Lei, Sheng; Du, Kebing; Li, Lizhi; Pang, Xufeng; Wang, Zhanchang; Wei, Ming; Fu, Shao; Hu, Limin; Xu, Lin

2016-01-01

Camellia is a well-known ornamental flower native to Southeast of Asia, including regions such as Japan, Korea and South China. However, most species in the genus Camellia are cold sensitive. To elucidate the cold stress responses in camellia plants, we carried out deep transcriptome sequencing of ‘Jiangxue’, a cold-tolerant cultivar of Camellia japonica, and approximately 1,006 million clean reads were generated using Illumina sequencing technology. The assembly of the clean reads produced 367,620 transcripts, including 207,592 unigenes. Overall, 28,038 differentially expressed genes were identified during cold acclimation. Detailed elucidation of responses of transcription factors, protein kinases and plant hormone signalling-related genes described the interplay of signal that allowed the plant to fine-tune cold stress responses. On the basis of global gene regulation of unsaturated fatty acid biosynthesis- and jasmonic acid biosynthesis-related genes, unsaturated fatty acid biosynthesis and jasmonic acid biosynthesis pathways were deduced to be involved in the low temperature responses in C. japonica. These results were supported by the determination of the fatty acid composition and jasmonic acid content. Our results provide insights into the genetic and molecular basis of the responses to cold acclimation in camellia plants. PMID:27819341
Axolotl hemoglobin: cDNA-derived amino acid sequences of two alpha globins and a beta globin from an adult Ambystoma mexicanum.

PubMed

Shishikura, Fumio; Takeuchi, Hiro-aki; Nagai, Takatoshi

2005-11-01

Erythrocytes of the adult axolotl, Ambystoma mexicanum, have multiple hemoglobins. We separated and purified two kinds of hemoglobin, termed major hemoglobin (Hb M) and minor hemoglobin (Hb m), from a five-year-old male by hydrophobic interaction column chromatography on Alkyl Superose. The hemoglobins have two distinct alpha type globin polypeptides (alphaM and alpham) and a common beta globin polypeptide, all of which were purified in FPLC on a reversed-phase column after S-pyridylethylation. The complete amino acid sequences of the three globin chains were determined separately using nucleotide sequencing with the assistance of protein sequencing. The mature globin molecules were composed of 141 amino acid residues for alphaM globin, 143 for alpham globin and 146 for beta globin. Comparing primary structures of the five kinds of axolotl globins, including two previously established alpha type globins from the same species, with other known globins of amphibians and representatives of other vertebrates, we constructed phylogenetic trees for amphibian hemoglobins and tetrapod hemoglobins. The molecular trees indicated that alphaM, alpham, beta and the previously known alpha major globin were adult types of globins and the other known alpha globin was a larval type. The existence of two to four more globins in the axolotl erythrocyte is predicted.
A novel cry2Ab gene from the indigenous isolate Bacillus thuringiensis subsp. kurstaki.

PubMed

Sevim, Ali; Eryüzlü, Emine; Demirbağ, Zihni; Demir, Ismail

2012-01-01

A novel cry2Ab gene was cloned and sequenced from the indigenous isolate of Bacillus thuringiensis subsp. kurstaki. This gene was designated as cry2Ab25 and its sequence revealed an open reading frame of 1,902 bp encoding a 633 aa protein with calculated molecular mass of 70 kDa and pI value of 8.98. The amino acid sequence of the Cry2Ab25 protein was compared with previously known Cry2Ab toxins, and the phylogenetic relationships among them were determined. The deduced amino acid sequence of the Cry2Ab25 protein showed 99% homology to the known Cry2Ab proteins, except for Cry2Ab10 and Cry2Ab12 with 97% homology, and a variation in one amino acid residue in comparison with all known Cry2Ab proteins. The cry2Ab25 gene was expressed in Escherichia coli BL21(DE3) cells. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) revealed that the Cry2Ab25 protein is about 70 kDa. The toxin expressed in BL21(DE3) exhibited high toxicity against Malacosoma neustria and Rhagoletis cerasi with 73% and 75% mortality after 5 days of treatment, respectively.
The legumin gene family: structure of a B type gene of Vicia faba and a possible legumin gene specific regulatory element.

PubMed Central

Bäumlein, H; Wobus, U; Pustell, J; Kafatos, F C

1986-01-01

The field bean, Vicia faba L. var. minor, possesses two sub-families of 11 S legumin genes named A and B. We isolated from a genomic library a B-type gene (LeB4) and determined its primary DNA sequence. Gene LeB4 codes for a 484 amino acid residue prepropolypeptide, encompassing a signal peptide of 22 amino acid residues, an acidic, very hydrophilic alpha-chain of 281 residues and a basic, somewhat hydrophobic beta-chain of 181 residues. The latter two coding regions are immediately contiguous, but each is interrupted by a short intron. Type A legumin genes from soybean and pea are known to have introns in the same two positions, in addition to an extra intron (within the alpha-coding sequence). Sequence comparisons of legumin genes from these three plants revealed a highly conserved sequence element of at least 28 bp, centered at approximately 100 bp upstream of each cap site. The element is absent from the equivalent position of all non-legumin and other plant and fungal genes examined. We tentatively name this element "legumin box" and suggest that it may have a function in the regulation of legumin gene expression. PMID:3960730
Comparative characterization of random-sequence proteins consisting of 5, 12, and 20 kinds of amino acids

PubMed Central

Tanaka, Junko; Doi, Nobuhide; Takashima, Hideaki; Yanagawa, Hiroshi

2010-01-01

Screening of functional proteins from a random-sequence library has been used to evolve novel proteins in the field of evolutionary protein engineering. However, random-sequence proteins consisting of the 20 natural amino acids tend to aggregate, and the occurrence rate of functional proteins in a random-sequence library is low. From the viewpoint of the origin of life, it has been proposed that primordial proteins consisted of a limited set of amino acids that could have been abundantly formed early during chemical evolution. We have previously found that members of a random-sequence protein library constructed with five primitive amino acids show high solubility (Doi et al., Protein Eng Des Sel 2005;18:279–284). Although such a library is expected to be appropriate for finding functional proteins, the functionality may be limited, because they have no positively charged amino acid. Here, we constructed three libraries of 120-amino acid, random-sequence proteins using alphabets of 5, 12, and 20 amino acids by preselection using mRNA display (to eliminate sequences containing stop codons and frameshifts) and characterized and compared the structural properties of random-sequence proteins arbitrarily chosen from these libraries. We found that random-sequence proteins constructed with the 12-member alphabet (including five primitive amino acids and positively charged amino acids) have higher solubility than those constructed with the 20-member alphabet, though other biophysical properties are very similar in the two libraries. Thus, a library of moderate complexity constructed from 12 amino acids may be a more appropriate resource for functional screening than one constructed from 20 amino acids. PMID:20162614
Codon-Anticodon Recognition in the Bacillus subtilis glyQS T Box Riboswitch

PubMed Central

Caserta, Enrico; Liu, Liang-Chun; Grundy, Frank J.; Henkin, Tina M.

2015-01-01

Many amino acid-related genes in Gram-positive bacteria are regulated by the T box riboswitch. The leader RNA of genes in the T box family controls the expression of downstream genes by monitoring the aminoacylation status of the cognate tRNA. Previous studies identified a three-nucleotide codon, termed the “Specifier Sequence,” in the riboswitch that corresponds to the amino acid identity of the downstream genes. Pairing of the Specifier Sequence with the anticodon of the cognate tRNA is the primary determinant of specific tRNA recognition. This interaction mimics codon-anticodon pairing in translation but occurs in the absence of the ribosome. The goal of the current study was to determine the effect of a full range of mismatches for comparison with codon recognition in translation. Mutations were individually introduced into the Specifier Sequence of the glyQS leader RNA and tRNAGly anticodon to test the effect of all possible pairing combinations on tRNA binding affinity and antitermination efficiency. The functional role of the conserved purine 3′ of the Specifier Sequence was also verifiedin this study. We found that substitutions at the Specifier Sequence resulted in reduced binding, the magnitude of which correlates well with the predicted stability of the RNA-RNA pairing. However, the tolerance for specific mismatches in antitermination was generally different from that during decoding, which reveals a unique tRNA recognition pattern in the T box antitermination system. PMID:26229106
DOE Office of Scientific and Technical Information (OSTI.GOV)

Reiser, Steven E.; Somerville, Chris R.

The present invention relates to bacterial enzymes, in particular to an acyl-CoA reductase and a gene encoding an acyl-CoA reductase, the amino acid and nucleic acid sequences corresponding to the reductase polypeptide and gene, respectively, and to methods of obtaining such enzymes, amino acid sequences and nucleic acid sequences. The invention also relates to the use of such sequences to provide transgenic host cells capable of producing fatty alcohols and fatty aldehydes.
BGL7 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2013-01-29

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2012-10-02

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL5 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-02-28

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL5 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-03-18

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dunn-Coleman, Nigel; Ward, Michael

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2014-03-04

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.

BGL7 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2015-04-14

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL7 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2014-03-25

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl7, and the corresponding BGL7 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL7, recombinant BGL7 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Ward, Michael

2015-08-11

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2007-09-25

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-04-01

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2011-12-06

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL4 .beta.-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-05-16

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2011-06-14

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL6 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Ward, Michael [San Francisco, CA

2009-09-01

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl6, and the corresponding BGL6 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL6, recombinant BGL6 proteins and methods for producing the same.
BGL3 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2012-10-30

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
BGL4 beta-glucosidase and nucleic acids encoding the same

DOEpatents

Dunn-Coleman, Nigel [Los Gatos, CA; Goedegebuur, Frits [Vlaardingen, NL; Ward, Michael [San Francisco, CA; Yao, Jian [Sunnyvale, CA

2008-01-22

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl4, and the corresponding BGL4 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL4, recombinant BGL4 proteins and methods for producing the same.
Mutations that Cause Human Disease: A Computational/Experimental Approach

DOE Office of Scientific and Technical Information (OSTI.GOV)

Beernink, P; Barsky, D; Pesavento, B

International genome sequencing projects have produced billions of nucleotides (letters) of DNA sequence data, including the complete genome sequences of 74 organisms. These genome sequences have created many new scientific opportunities, including the ability to identify sequence variations among individuals within a species. These genetic differences, which are known as single nucleotide polymorphisms (SNPs), are particularly important in understanding the genetic basis for disease susceptibility. Since the report of the complete human genome sequence, over two million human SNPs have been identified, including a large-scale comparison of an entire chromosome from twenty individuals. Of the protein coding SNPs (cSNPs), approximatelymore » half leads to a single amino acid change in the encoded protein (non-synonymous coding SNPs). Most of these changes are functionally silent, while the remainder negatively impact the protein and sometimes cause human disease. To date, over 550 SNPs have been found to cause single locus (monogenic) diseases and many others have been associated with polygenic diseases. SNPs have been linked to specific human diseases, including late-onset Parkinson disease, autism, rheumatoid arthritis and cancer. The ability to predict accurately the effects of these SNPs on protein function would represent a major advance toward understanding these diseases. To date several attempts have been made toward predicting the effects of such mutations. The most successful of these is a computational approach called ''Sorting Intolerant From Tolerant'' (SIFT). This method uses sequence conservation among many similar proteins to predict which residues in a protein are functionally important. However, this method suffers from several limitations. First, a query sequence must have a sufficient number of relatives to infer sequence conservation. Second, this method does not make use of or provide any information on protein structure, which can be used to understand how an amino acid change affects the protein. The experimental methods that provide the most detailed structural information on proteins are X-ray crystallography and NMR spectroscopy. However, these methods are labor intensive and currently cannot be carried out on a genomic scale. Nonetheless, Structural Genomics projects are being pursued by more than a dozen groups and consortia worldwide and as a result the number of experimentally determined structures is rising exponentially. Based on the expectation that protein structures will continue to be determined at an ever-increasing rate, reliable structure prediction schemes will become increasingly valuable, leading to information on protein function and disease for many different proteins. Given known genetic variability and experimentally determined protein structures, can we accurately predict the effects of single amino acid substitutions? An objective assessment of this question would involve comparing predicted and experimentally determined structures, which thus far has not been rigorously performed. The completed research leveraged existing expertise at LLNL in computational and structural biology, as well as significant computing resources, to address this question.« less
Characterization of Austrian koi herpesvirus samples based on the ORF40 region.

PubMed

Marek, A; Schachner, O; Bilic, I; Hess, M

2010-02-17

Using a PCR that amplifies a region of the thymidine kinase (TK) gene, an epidemic spread of koi herpesvirus (KHV) was determined in koi carps in Austria in 2007. A total of 15 virus samples from different locations in Austria were analyzed to determine their genetic relatedness following PCR and nucleic acid sequencing of the open reading frame 40 (ORF40) region of the KHV genome. ORF40-specific PCR amplification products that were obtained from tissue samples shared 100% nucleotide sequence identity with the published sequence of the Japanese strain of KHV. The ORF40 sequence of one isolate from the UK that was included in the present study was 100% identical with the published sequence of an Israeli strain of KHV. This is the first study that used a larger number of samples and a PCR method, which allowed distinguishing all 3 strains of KHV. The present investigation provides information on the epidemiology of KHV infections in Europe and describes a useful molecular tool for epidemiological studies.
Some properties and cDNA cloning of proteinaceous toxins from two species of lionfish (Pterois antennata and Pterois volitans).

PubMed

Kiriake, Aya; Shiomi, Kazuo

2011-11-01

Lionfish, members of the genera Pterois, Parapterois and Dendrochirus, are well known to be venomous, having venomous glandular tissues in dorsal, pelvic and anal spines. The lionfish toxins have been shown to cross-react with the stonefish toxins by neutralization tests using the commercial stonefish antivenom, although their chemical properties including structures have been little characterized. In this study, an antiserum against neoverrucotoxin, the stonefish Synanceia verrucosa toxin, was first raised in a guinea pig and used in immunoblotting and inhibition immunoblotting to confirm that two species of Pterois lionfish (P. antennata and P. volitans) contain a 75kDa protein (corresponding to the toxin subunit) cross-reacting with neoverrucotoxin. Then, the amino acid sequences of the P. antennata and P. volitans toxins were successfully determined by cDNA cloning using primers designed from the highly conserved sequences of the stonefish toxins. Notably, either α-subunits (699 amino acid residues) or β-subunits (698 amino acid residues) of the P. antennata and P. volitans toxins share as high as 99% sequence identity with each other. Furthermore, both α- and β-subunits of the lionfish toxins exhibit high sequence identity (70-80% identity) with each other and also with the β-subunits of the stonefish toxins. As reported for the stonefish toxins, the lionfish toxins also contain a B30.2/SPRY domain (comprising nearly 200 amino acid residues) in the C-terminal region of each subunit. Copyright © 2011 Elsevier Ltd. All rights reserved.
(Mechanisms of inhibition of viral replication in plants)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Not Available

1991-01-01

During the last year we have made a number of important observations in the fields of virology and plant molecular biology. By directly sequencing Tomato Mosaic Virus (ToMV) movement genes, previously undetected sequence alterations common to specific viral strains were found. The difficulty in regenerating transgenic tomato plants containing the Tm-2 gene was overcome. Tobacco plants transformed with Cucumber Mosaic Virus (CMV) are being characterized. Analysis of transgenic tobacco plants expressing CMV coat protein have shown no correlation between coat protein expression and level of resistance. Specific amino acid changes have been found to correlate with CMV resistance breaking andmore » degree of pathogenicity. Satellite RNAs are shown to be too unstable for use as a biological control agent. The aphid transmission domain CMV has been localized to one (or more) of three amino acids; constructs have been made to determine the exact amino acids involved. 15 refs.« less
Cloning and sequencing of the Thermoanaerobacterium saccharolyticum B6A-RI apu gene and purification and characterization of the amylopullulanase from Escherichia coli.

PubMed

Ramesh, M V; Podkovyrov, S M; Lowe, S E; Zeikus, J G

1994-01-01

The amylopullulanase gene (apu) of the thermophilic anaerobic bacterium Thermoanaerobacterium saccharolyticum B6A-RI was cloned into Escherichia coli. The complete nucleotide sequence of the gene was determined. It encoded a protein consisting of 1,288 amino acids with a signal peptide of 35 amino acids. The enzyme purified from E. coli was a monomer with an M(r) of 142,000 +/- 2,000 and had same the catalytic and thermal characteristics as the native glycoprotein from T. saccharolyticum B6A. Linear alignment and the hydrophobic cluster analysis were used to compare this amylopullulanase with other amylolytic enzymes. Both methods revealed strictly conserved amino acid residues among these enzymes, and it is proposed that Asp-594, Asp-700, and Glu-623 are a putative catalytic triad of the T. saccharolyticum B6A-RI amylopullulanase.
Cloning and sequencing of the Thermoanaerobacterium saccharolyticum B6A-RI apu gene and purification and characterization of the amylopullulanase from Escherichia coli.

PubMed Central

Ramesh, M V; Podkovyrov, S M; Lowe, S E; Zeikus, J G

1994-01-01

The amylopullulanase gene (apu) of the thermophilic anaerobic bacterium Thermoanaerobacterium saccharolyticum B6A-RI was cloned into Escherichia coli. The complete nucleotide sequence of the gene was determined. It encoded a protein consisting of 1,288 amino acids with a signal peptide of 35 amino acids. The enzyme purified from E. coli was a monomer with an M(r) of 142,000 +/- 2,000 and had same the catalytic and thermal characteristics as the native glycoprotein from T. saccharolyticum B6A. Linear alignment and the hydrophobic cluster analysis were used to compare this amylopullulanase with other amylolytic enzymes. Both methods revealed strictly conserved amino acid residues among these enzymes, and it is proposed that Asp-594, Asp-700, and Glu-623 are a putative catalytic triad of the T. saccharolyticum B6A-RI amylopullulanase. Images PMID:8117096
High molecular weight glutenin subunits in some durum wheat cultivars investigated by means of mass spectrometric techniques.

PubMed

Muccilli, Vera; Lo Bianco, Marisol; Cunsolo, Vincenzo; Saletti, Rosaria; Gallo, Giulia; Foti, Salvatore

2011-11-23

The primary structures of high molecular weight glutenin subunits (HMW-GS) of 5 Triticum durum Desf. cultivars (Simeto, Svevo, Duilio, Bronte, and Sant'Agata), largely cultivated in the south of Italy, and of 13 populations of the old spring Sicilian durum wheat landrace Timilia (Triticum durum Desf.) (accession nos. 1, 2, 3, 4, 7, 8, 9, 13, 14, 15, SG1, SG2, and SG3) were investigated using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOFMS) and reversed-phase high performance liquid chromatography/nanoelectrospray ionization mass spectrometry (RP-HPLC/nESI-MS/MS). M(r) of the intact proteins determined by MALDI mass spectrometry showed that all the 13 populations of Timilia contained the same two HMW-GS with 75.2 kDa and 86.4 kDa, whereas the other durum wheat cultivars showed the presence of the expected HMW-GS 1By8 and 1Bx7 at 75.1 kDa and 83.1 kDa, respectively. By MALDI mass spectrometry of the tryptic digestion peptides of the isolated HMW-GS of Timilia, the 1Bx and 1By subunits were identified as the NCBInr Acc. No AAQ93629, and AAQ93633, respectively. Sequence verification for HMW-GS 1Bx and 1By both in Simeto and Timilia was obtained by MALDI mass mapping and HPLC/nESI-MSMS of the tryptic peptides. The Bx subunit of Timila presents a sequence similarity of 96% with respect to Simeto, with differences in the insertion of 3 peptides of 5, 9, and 15 amino acids, for a total insertion of 29 amino acids and 25 amino acid substitutions. These differences in the amino acidic sequence account for the determined Δm of 3294 Da between the M(r) of the 1Bx subunits in Timilia and Simeto. Sequence alignment between the two By subunits shows 10 amino acid substitutions and is consistent with the Δm of 148 Da found in the MALDI mass spectra of the intact subunits.
Genetic characterization of L-Zagreb mumps vaccine strain.

PubMed

Ivancic, Jelena; Gulija, Tanja Kosutic; Forcic, Dubravko; Baricevic, Marijana; Jug, Renata; Mesko-Prejac, Majda; Mazuran, Renata

2005-04-01

Eleven mumps vaccine strains, all containing live attenuated virus, have been used throughout the world. Although L-Zagreb mumps vaccine has been licensed since 1972, only its partial nucleotide sequence was previously determined (accession numbers , and ). Therefore, we sequenced the entire genome of L-Zagreb vaccine strain (Institute of Immunology Inc., Zagreb, Croatia). In order to investigate the genetic stability of the vaccine, sequences of both L-Zagreb master seed and currently produced vaccine batch were determined and no difference between them was observed. A phylogenetic analysis based on SH gene sequence has shown that L-Zagreb strain does not belong to any of established mumps genotypes and that it is most similar to old, laboratory preserved European strains (1950s-1970s). L-Zagreb nucleotide and deduced protein sequences were compared with other mumps virus sequences obtained from the GenBank. Emphasis was put on functionally important protein regions and known antigenic epitopes. The extensive comparisons of nucleotide and deduced protein sequences between L-Zagreb vaccine strain and other previously determined mumps virus sequences have shown that while the functional regions of HN, V, and L proteins are well conserved among various mumps strains, there can be a substantial amino acid difference in antigenic epitopes of all proteins and in functional regions of F protein. No molecular pattern was identified that can be used as a distinction marker between virulent and attenuated strains.
Typing of canine parvovirus isolates using mini-sequencing based single nucleotide polymorphism analysis.

PubMed

Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A

2012-05-01

The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.

Unraveling Core Functional Microbiota in Traditional Solid-State Fermentation by High-Throughput Amplicons and Metatranscriptomics Sequencing

PubMed Central

Song, Zhewei; Du, Hai; Zhang, Yan; Xu, Yan

2017-01-01

Fermentation microbiota is specific microorganisms that generate different types of metabolites in many productions. In traditional solid-state fermentation, the structural composition and functional capacity of the core microbiota determine the quality and quantity of products. As a typical example of food fermentation, Chinese Maotai-flavor liquor production involves a complex of various microorganisms and a wide variety of metabolites. However, the microbial succession and functional shift of the core microbiota in this traditional food fermentation remain unclear. Here, high-throughput amplicons (16S rRNA gene amplicon sequencing and internal transcribed space amplicon sequencing) and metatranscriptomics sequencing technologies were combined to reveal the structure and function of the core microbiota in Chinese soy sauce aroma type liquor production. In addition, ultra-performance liquid chromatography and headspace-solid phase microextraction-gas chromatography-mass spectrometry were employed to provide qualitative and quantitative analysis of the major flavor metabolites. A total of 10 fungal and 11 bacterial genera were identified as the core microbiota. In addition, metatranscriptomic analysis revealed pyruvate metabolism in yeasts (genera Pichia, Schizosaccharomyces, Saccharomyces, and Zygosaccharomyces) and lactic acid bacteria (genus Lactobacillus) classified into two stages in the production of flavor components. Stage I involved high-level alcohol (ethanol) production, with the genus Schizosaccharomyces serving as the core functional microorganism. Stage II involved high-level acid (lactic acid and acetic acid) production, with the genus Lactobacillus serving as the core functional microorganism. The functional shift from the genus Schizosaccharomyces to the genus Lactobacillus drives flavor component conversion from alcohol (ethanol) to acid (lactic acid and acetic acid) in Chinese Maotai-flavor liquor production. Our findings provide insight into the effects of the core functional microbiota in soy sauce aroma type liquor production and the characteristics of the fermentation microbiota under different environmental conditions. PMID:28769888
Methods and compositions for efficient nucleic acid sequencing

DOEpatents

Drmanac, Radoje

2006-07-04

Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.
Methods and compositions for efficient nucleic acid sequencing

DOEpatents

Drmanac, Radoje

2002-01-01

Disclosed are novel methods and compositions for rapid and highly efficient nucleic acid sequencing based upon hybridization with two sets of small oligonucleotide probes of known sequences. Extremely large nucleic acid molecules, including chromosomes and non-amplified RNA, may be sequenced without prior cloning or subcloning steps. The methods of the invention also solve various current problems associated with sequencing technology such as, for example, high noise to signal ratios and difficult discrimination, attaching many nucleic acid fragments to a surface, preparing many, longer or more complex probes and labelling more species.
Fatty Acid Profile and Unigene-Derived Simple Sequence Repeat Markers in Tung Tree (Vernicia fordii)

PubMed Central

Zhang, Lin; Jia, Baoguang; Tan, Xiaofeng; Thammina, Chandra S.; Long, Hongxu; Liu, Min; Wen, Shanna; Song, Xianliang; Cao, Heping

2014-01-01

Tung tree (Vernicia fordii) provides the sole source of tung oil widely used in industry. Lack of fatty acid composition and molecular markers hinders biochemical, genetic and breeding research. The objectives of this study were to determine fatty acid profiles and develop unigene-derived simple sequence repeat (SSR) markers in tung tree. Fatty acid profiles of 41 accessions showed that the ratio of α-eleostearic acid was increasing continuously with a parallel trend to the amount of tung oil accumulation while the ratios of other fatty acids were decreasing in different stages of the seeds and that α-eleostearic acid (18∶3) consisted of 77% of the total fatty acids in tung oil. Transcriptome sequencing identified 81,805 unigenes from tung cDNA library constructed using seed mRNA and discovered 6,366 SSRs in 5,404 unigenes. The di- and tri-nucleotide microsatellites accounted for 92% of the SSRs with AG/CT and AAG/CTT being the most abundant SSR motifs. Fifteen polymorphic genic-SSR markers were developed from 98 unigene loci tested in 41 cultivated tung accessions by agarose gel and capillary electrophoresis. Genbank database search identified 10 of them putatively coding for functional proteins. Quantitative PCR demonstrated that all 15 polymorphic SSR-associated unigenes were expressed in tung seeds and some of them were highly correlated with oil composition in the seeds. Dendrogram revealed that most of the 41 accessions were clustered according to the geographic region. These new polymorphic genic-SSR markers will facilitate future studies on genetic diversity, molecular fingerprinting, comparative genomics and genetic mapping in tung tree. The lipid profiles in the seeds of 41 tung accessions will be valuable for biochemical and breeding studies. PMID:25167054
Recombinatorial biases and convergent recombination determine interindividual TCRβ sharing in murine thymocytes.

PubMed

Li, Hanjie; Ye, Congting; Ji, Guoli; Wu, Xiaohui; Xiang, Zhe; Li, Yuanyue; Cao, Yonghao; Liu, Xiaolong; Douek, Daniel C; Price, David A; Han, Jiahuai

2012-09-01

Overlap of TCR repertoires among individuals provides the molecular basis for public T cell responses. By deep-sequencing the TCRβ repertoires of CD4+CD8+ thymocytes from three individual mice, we observed that a substantial degree of TCRβ overlap, comprising ∼10-15% of all unique amino acid sequences and ∼5-10% of all unique nucleotide sequences across any two individuals, is already present at this early stage of T cell development. The majority of TCRβ sharing between individual thymocyte repertoires could be attributed to the process of convergent recombination, with additional contributions likely arising from recombinatorial biases; the role of selection during intrathymic development was negligible. These results indicate that the process of TCR gene recombination is the major determinant of clonotype sharing between individuals.
Human jagged polypeptide, encoding nucleic acids and methods of use

DOEpatents

Li, Linheng; Hood, Leroy

2000-01-01

The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2016-02-16

The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well asmore » the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.« less
Polypeptide having swollenin activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elizabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius

2015-11-04

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius

2015-09-01

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having cellobiohydrolase activity and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-09-15

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having acetyl xylan esterase activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having carbohydrate degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius

2015-08-18

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Comprehensive analysis of the T-cell receptor beta chain gene in rhesus monkey by high throughput sequencing

PubMed Central

Li, Zhoufang; Liu, Guangjie; Tong, Yin; Zhang, Meng; Xu, Ying; Qin, Li; Wang, Zhanhui; Chen, Xiaoping; He, Jiankui

2015-01-01

Profiling immune repertoires by high throughput sequencing enhances our understanding of immune system complexity and immune-related diseases in humans. Previously, cloning and Sanger sequencing identified limited numbers of T cell receptor (TCR) nucleotide sequences in rhesus monkeys, thus their full immune repertoire is unknown. We applied multiplex PCR and Illumina high throughput sequencing to study the TCRβ of rhesus monkeys. We identified 1.26 million TCRβ sequences corresponding to 643,570 unique TCRβ sequences and 270,557 unique complementarity-determining region 3 (CDR3) gene sequences. Precise measurements of CDR3 length distribution, CDR3 amino acid distribution, length distribution of N nucleotide of junctional region, and TCRV and TCRJ gene usage preferences were performed. A comprehensive profile of rhesus monkey immune repertoire might aid human infectious disease studies using rhesus monkeys. PMID:25961410
Computational structural analysis of an anti-l-amino acid antibody and inversion of its stereoselectivity

PubMed Central

Ranieri, Daniel I.; Hofstetter, Heike; Hofstetter, Oliver

2009-01-01

The binding site of a monoclonal anti-l-amino acid antibody was modeled using the program SWISS-MODEL. Docking experiments with the enantiomers of phenylalanine revealed that the antibody interacts with l-phenylalanine via hydrogen bonds and hydrophobic contacts, whereas the d-enantiomer is rejected due to steric hindrance. Comparison of the sequences of this antibody and an anti-d-amino acid antibody indicates that both immunoglobulins derived from the same germline progenitor. Substitution of four amino acids residues, three in the framework and one in the complementarity determining regions, allowed in silico conversion of the anti-l-amino acid antibody into an antibody that stereoselectively binds d-phenylalanine. PMID:19472280
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...
37 CFR 5.31-5.33 - [Reserved

Code of Federal Regulations, 2011 CFR

2011-07-01

... from abandonment 1.135 Amino Acid Sequences. (See Nucleotide and/or Amino Acid Sequences) Appeal to... Appeals and Interference 41.47 Of rejection of an application 1.104(a) Nucleotide and/or Amino Acid...) Symbols for nucleotide and/or amino acid sequence data 1.822 T Tables in patent applications 1.58 Terminal...
37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

Code of Federal Regulations, 2011 CFR

2011-07-01

... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...
Molecular characterization of KGH, the first human isolate of rabies virus in Korea.

PubMed

Park, Jun-Sun; Kim, Chi-Kyeong; Kim, Su Yeon; Ju, Young Ran

2013-04-01

The complete genome sequence of the KGH strain of the first human rabies virus, which was isolated from a skin biopsy of a patient with rabies, whose symptoms developed due to bites from a raccoon dog in 2001. The size of the KGH strain genome was determined to be 11,928 nucleotides (nt) with a leader sequence of 58 nt, nucleoprotein gene of 1,353 nt, phosphoprotein gene of 894 nt, matrix protein gene of 609 nt, glycoprotein gene of 1,575 nt, RNA-dependent RNA polymerase gene of 6,384 nt, and trailer region of 69 nt. Sequence similarity was compared with 39 fully sequenced rabies virus genomes currently available, and the result showed 70.6-91.6 % at the nucleotide level, and 82.8-97.9 % at the amino acid level. The deduced amino acids in the viral protein were compared with those of other rabies viruses, and various functional regions were investigated. As a result, we found that the KGH strain only had a unique amino acid substitution that was identified to be associated either with host immune response and pathogenicity in the N protein, or with a related region regulating STAT1 in the P protein, and related to pathogenicity in G protein. Based on phylogenetic analyses using the complete genome of 39 rabies viruses, the KGH strain was determined to be closely related with the NNV-RAB-H strain and transplant rabies virus serotype 1, which are Indian isolates, and was confirmed to belong to the Arctic-like 2 clade. The KGH strain was most closely related to the SKRRD0204HC and SKRRD0205HC strain when compared with Korean animal isolates, which was separated around the same time and place, and belonged to the Gangwon III subgroup.
A charge-dependent mechanism is responsible for the dynamic accumulation of proteins inside nucleoli.

PubMed

Musinova, Yana R; Kananykhina, Eugenia Y; Potashnikova, Daria M; Lisitsyna, Olga M; Sheval, Eugene V

2015-01-01

The majority of known nucleolar proteins are freely exchanged between the nucleolus and the surrounding nucleoplasm. One way proteins are retained in the nucleoli is by the presence of specific amino acid sequences, namely nucleolar localization signals (NoLSs). The mechanism by which NoLSs retain proteins inside the nucleoli is still unclear. Here, we present data showing that the charge-dependent (electrostatic) interactions of NoLSs with nucleolar components lead to nucleolar accumulation as follows: (i) known NoLSs are enriched in positively charged amino acids, but the NoLS structure is highly heterogeneous, and it is not possible to identify a consensus sequence for this type of signal; (ii) in two analyzed proteins (NF-κB-inducing kinase and HIV-1 Tat), the NoLS corresponds to a region that is enriched for positively charged amino acid residues; substituting charged amino acids with non-charged ones reduced the nucleolar accumulation in proportion to the charge reduction, and nucleolar accumulation efficiency was strongly correlated with the predicted charge of the tested sequences; and (iii) sequences containing only lysine or arginine residues (which were referred to as imitative NoLSs, or iNoLSs) are accumulated in the nucleoli in a charge-dependent manner. The results of experiments with iNoLSs suggested that charge-dependent accumulation inside the nucleoli was dependent on interactions with nucleolar RNAs. The results of this work are consistent with the hypothesis that nucleolar protein accumulation by NoLSs can be determined by the electrostatic interaction of positively charged regions with nucleolar RNAs rather than by any sequence-specific mechanism. Copyright © 2014 Elsevier B.V. All rights reserved.

Differentiation of highly virulent strains of Streptococcus suis serotype 2 according to glutamate dehydrogenase electrophoretic and sequence type.

PubMed

Kutz, Russell; Okwumabua, Ogi

2008-10-01

The glutamate dehydrogenase (GDH) enzymes of 19 Streptococcus suis serotype 2 strains, consisting of 18 swine isolates and 1 human clinical isolate from a geographically varied collection, were analyzed by activity staining on a nondenaturing gel. All seven (100%) of the highly virulent strains tested produced an electrophoretic type (ET) distinct from those of moderately virulent and nonvirulent strains. By PCR and nucleotide sequence determination, the gdh genes of the 19 strains and of 2 highly virulent strains involved in recent Chinese outbreaks yielded a 1,820-bp fragment containing an open reading frame of 1,344 nucleotides, which encodes a protein of 448 amino acid residues with a calculated molecular mass of approximately 49 kDa. The nucleotide sequences contained base pair differences, but most were silent. Cluster analysis of the deduced amino acid sequences separated the isolates into three groups. Group I (ETI) consisted of the seven highly virulent isolates and the two Chinese outbreak strains, containing Ala(299)-to-Ser, Glu(305)-to-Lys, and Glu(330)-to-Lys amino acid substitutions compared with groups II and III (ETII). Groups II and III consisted of moderately virulent and nonvirulent strains, which are separated from each other by Tyr(72)-to-Asp and Thr(296)-to-Ala substitutions. Gene exchange studies resulted in the change of ETI to ETII and vice versa. A spectrophotometric activity assay for GDH did not show significant differences between the groups. These results suggest that the GDH ETs and sequence types may serve as useful markers in predicting the pathogenic behavior of strains of this serotype and that the molecular basis for the observed differences in the ETs was amino acid substitutions and not deletion, insertion, or processing uniqueness.
Chemical property based sequence characterization of PpcA and its homolog proteins PpcB-E: A mathematical approach

PubMed Central

Pal Choudhury, Pabitra

2017-01-01

Periplasmic c7 type cytochrome A (PpcA) protein is determined in Geobacter sulfurreducens along with its other four homologs (PpcB-E). From the crystal structure viewpoint the observation emerges that PpcA protein can bind with Deoxycholate (DXCA), while its other homologs do not. But it is yet to be established with certainty the reason behind this from primary protein sequence information. This study is primarily based on primary protein sequence analysis through the chemical basis of embedded amino acids. Firstly, we look for the chemical group specific score of amino acids. Along with this, we have developed a new methodology for the phylogenetic analysis based on chemical group dissimilarities of amino acids. This new methodology is applied to the cytochrome c7 family members and pinpoint how a particular sequence is differing with others. Secondly, we build a graph theoretic model on using amino acid sequences which is also applied to the cytochrome c7 family members and some unique characteristics and their domains are highlighted. Thirdly, we search for unique patterns as subsequences which are common among the group or specific individual member. In all the cases, we are able to show some distinct features of PpcA that emerges PpcA as an outstanding protein compared to its other homologs, resulting towards its binding with deoxycholate. Similarly, some notable features for the structurally dissimilar protein PpcD compared to the other homologs are also brought out. Further, the five members of cytochrome family being homolog proteins, they must have some common significant features which are also enumerated in this study. PMID:28362850
Using msa-2b as a molecular marker for genotyping Mexican isolates of Babesia bovis.

PubMed

Genis, Alma D; Perez, Jocelin; Mosqueda, Juan J; Alvarez, Antonio; Camacho, Minerva; Muñoz, Maria de Lourdes; Rojas, Carmen; Figueroa, Julio V

2009-12-01

Variable merozoite surface antigens of Babesia bovis are exposed glycoproteins having a role in erythrocyte invasion. Members of this gene family include msa-1 and msa-2 (msa-2c, msa-2a(1), msa-2a(2) and msa-2b). To determine the sequence variation among B. bovis Mexican isolates using msa-2b as a genetic marker, PCR amplicons corresponding to msa-2b were cloned and plasmids carrying the corresponding inserts were purified and sequenced. Comparative analysis of nucleotide and deduced amino acid sequences revealed distinct degrees of variability and identity among the coding gene sequences obtained from 16 geographically different Mexican B. bovis isolates and a reference strain. Clustal-W multiple alignments of the MSA-2b deduced amino acid sequences performed with the 17 B. bovis Mexican isolates, revealed the identification of three genotypes with a distinct set each of amino acid residues present at the variable region: Genotype I represented by the MO7 strain (in vitro culture-derived from the Mexico isolate) as well as RAD, Chiapas-1, Tabasco and Veracruz-3 isolates; Genotype II, represented by the Jalisco, Mexico and Veracruz-2 isolates; and Genotype III comprising the sequences from most of the isolates studied, Tamaulipas-1, Chiapas-2, Guerrero-1, Nayarit, Quintana Roo, Nuevo Leon, Tamaulipas-2, Yucatan and Guerrero-2. Moreover, these three genotypes could be discriminated against each other by using a PCR-RFLP approach. The results suggest that occurrence of indels within the variable region of msa-2b sequences can be useful markers for identifying a particular genotype present in field populations of B. bovis isolated from infected cattle in Mexico.
Effects of a Non-Conservative Sequence on the Properties of β-glucuronidase from Aspergillus terreus Li-20

PubMed Central

Liu, Yanli; Huangfu, Jie; Qi, Feng; Kaleem, Imdad; E, Wenwen; Li, Chun

2012-01-01

We cloned the β-glucuronidase gene (AtGUS) from Aspergillus terreus Li-20 encoding 657 amino acids (aa), which can transform glycyrrhizin into glycyrrhetinic acid monoglucuronide (GAMG) and glycyrrhetinic acid (GA). Based on sequence alignment, the C-terminal non-conservative sequence showed low identity with those of other species; thus, the partial sequence AtGUS(-3t) (1–592 aa) was amplified to determine the effects of the non-conservative sequence on the enzymatic properties. AtGUS and AtGUS(-3t) were expressed in E. coli BL21, producing AtGUS-E and AtGUS(-3t)-E, respectively. At the similar optimum temperature (55°C) and pH (AtGUS-E, 6.6; AtGUS(-3t)-E, 7.0) conditions, the thermal stability of AtGUS(-3t)-E was enhanced at 65°C, and the metal ions Co2+, Ca2+ and Ni2+ showed opposite effects on AtGUS-E and AtGUS(-3t)-E, respectively. Furthermore, Km of AtGUS(-3t)-E (1.95 mM) was just nearly one-seventh that of AtGUS-E (12.9 mM), whereas the catalytic efficiency of AtGUS(-3t)-E was 3.2 fold higher than that of AtGUS-E (7.16 vs. 2.24 mM s−1), revealing that the truncation of non-conservative sequence can significantly improve the catalytic efficiency of AtGUS. Conformational analysis illustrated significant difference in the secondary structure between AtGUS-E and AtGUS(-3t)-E by circular dichroism (CD). The results showed that the truncation of the non-conservative sequence could preferably alter and influence the stability and catalytic efficiency of enzyme. PMID:22347419
An atypical topoisomerase II sequence from the slime mold Physarum polycephalum.

PubMed

Hugodot, Yannick; Dutertre, Murielle; Duguet, Michel

2004-01-21

We have determined the complete nucleotide sequence of the cDNA encoding DNA topoisomerase II from Physarum polycephalum. Using degenerate primers, based on the conserved amino acid sequences of other eukaryotic enzymes, a 250-bp fragment was polymerase chain reaction (PCR) amplified. This fragment was used as a probe to screen a Physarum cDNA library. A partial cDNA clone was isolated that was truncated at the 3' end. Rapid amplification of cDNA ends (RACE)-PCR was employed to isolate the remaining portion of the gene. The complete sequence of 4613 bp contains an open reading frame of 4494 bp that codes for 1498 amino acid residues with a theoretical molecular weight of 167 kDa. The predicted amino acid sequence shares similarity with those of other eukaryotes and shows the highest degree of identity with the enzyme of Dictyostelium discoideum. However, the enzyme of P. polycephalum contains an atypical amino-terminal domain very rich in serine and proline, whose function is unknown. Remarkably, both a mitochondrial targeting sequence and a nuclear localization signal were predicted respectively in the amino and carboxy-terminus of the protein, as in the case of human topoisomerase III alpha. At the Physarum genomic level, the topoisomerase II gene encompasses a region of about 16 kbp suggesting a large proportion of intronic sequences, an unusual situation for a gene of a lower eukaryote, often free of introns. Finally, expression of topoisomerase II mRNA does not appear significantly dependent on the plasmodium cycle stage, possibly due to the lack of G1 phase or (and) to a mitochondrial localization of the enzyme.
Analysis of the vhoGAC and vhtGAC operons from Methanosarcina mazei strain Gö1, both encoding a membrane-bound hydrogenase and a cytochrome b.

PubMed

Deppenmeier, U; Blaut, M; Lentes, S; Herzberg, C; Gottschalk, G

1995-01-15

DNA encompassing the structural genes of two membrane-bound hydrogenases from Methanosarcina mazei Gö1 was cloned and sequenced. The genes, arranged in the order vhoG and vhoA as well as vhtG and vhtA, were identified as those encoding the small and the large subunits of the NiFe hydrogenases [Deppenmeier, U., Blaut, M., Schmidt, B. & Gottschalk, G. (1992) Arch. Microbiol. 157, 505-511]. Northern-blot analysis revealed that the structural genes formed part of two operons, both containing one additional open reading frame (vhoC and vhtC) which codes for a cytochrome b. This conclusion was drawn from the homology of the deduced N-terminal amino acid sequences of vhoC and vhtC and the N-terminus of a 27-kDa cytochrome isolated from Ms. mazei C16. VhoC and VhtC contain four tentative hydrophobic segments which might span the cytoplasmic membrane. Hydropathy plots suggest that His23 and His50 are involved in heme coordination. The comparison of the sequencing data of vhoG and vhtG with the experimentally determined N-terminus of the small subunit indicate the presence of a 48-amino-acid leader peptide in front of the polypeptides. VhoA and VhtA contained the conserved sequence DPCXXC in the C-terminal region, which excludes the presence of a selenocysteine residue in these hydrogenases. Promoter sequences were found upstream of vhoG and vhtG, respectively. Downstream of vhoC, a putative terminator sequence was identified. Alignments of the deduced amino acid sequences of the gene clusters vhoGAC and vhtGAC showed 92-97% identity. Only the C-termini of VhoC and VhtC were not similar.
Characterization of a molt-inhibiting hormone (MIH) of the crayfish, Orconectes limosus, by cDNA cloning and mass spectrometric analysis.

PubMed

Bulau, Patrick; Okuno, Atsuro; Thome, Elke; Schmitz, Tina; Peter-Katalinic, Jasna; Keller, Rainer

2005-11-01

The structure of the precursor of a molt-inhibiting hormone (MIH) of the American crayfish, Orconectes limosus was determined by cloning of a cDNA based on RNA from the neurosecretory perikarya of the X-organ in the eyestalk ganglia. The open reading frame includes the complete precursor sequence, consisting of a signal peptide of 29, and the MIH sequence of 77 amino acids. In addition, the mature peptide was isolated by HPLC from the neurohemal sinus gland and analyzed by ESI-MS and MALDI-TOF-MS peptide mapping. This showed that the mature peptide (Mass 8664.29 Da) consists of only 75 amino acids, having Ala75-NH2 as C-terminus. Thus, C-terminal Arg77 of the precursor is removed during processing, and Gly76 serves as an amide donor. Sequence comparison confirms this peptide as a novel member of the large family, which includes crustacean hyperglycaemic hormone (CHH), MIH and gonad (vitellogenesis)-inhibiting hormone (GIH/VIH). The lack of a CPRP (CHH-precursor related peptide) in the hormone precursor, the size and specific sequence characteristics show that Orl MIH belongs to the MIH/GIH(VIH) subgroup of this larger family. Comparison with the MIH of Procambarus clarkii, the only other MIH that has thus far been identified in freshwater crayfish, shows extremely high sequence conservation. Both MIHs differ in only one amino acid residue ( approximately 99% identity), whereas the sequence identity to several other known MIHs is between 40 and 46%.
Identification of cancer-specific motifs in mimotope profiles of serum antibody repertoire.

PubMed

Gerasimov, Ekaterina; Zelikovsky, Alex; Măndoiu, Ion; Ionov, Yurij

2017-06-07

For fighting cancer, earlier detection is crucial. Circulating auto-antibodies produced by the patient's own immune system after exposure to cancer proteins are promising bio-markers for the early detection of cancer. Since an antibody recognizes not the whole antigen but 4-7 critical amino acids within the antigenic determinant (epitope), the whole proteome can be represented by a random peptide phage display library. This opens the possibility to develop an early cancer detection test based on a set of peptide sequences identified by comparing cancer patients' and healthy donors' global peptide profiles of antibody specificities. Due to the enormously large number of peptide sequences contained in global peptide profiles generated by next generation sequencing, the large number of cancer and control sera is required to identify cancer-specific peptides with high degree of statistical significance. To decrease the number of peptides in profiles generated by nextgen sequencing without losing cancer-specific sequences we used for generation of profiles the phage library enriched by panning on the pool of cancer sera. To further decrease the complexity of profiles we used computational methods for transforming a list of peptides constituting the mimotope profiles to the list motifs formed by similar peptide sequences. We have shown that the amino-acid order is meaningful in mimotope motifs since they contain significantly more peptides than motifs among peptides where amino-acids are randomly permuted. Also the single sample motifs significantly differ from motifs in peptides drawn from multiple samples. Finally, multiple cancer-specific motifs have been identified.
Molecular characterization of a phloem-specific gene encoding the filament protein, phloem protein 1 (PP1), from Cucurbita maxima.

PubMed

Clark, A M; Jacobsen, K R; Bostwick, D E; Dannenhoffer, J M; Skaggs, M I; Thompson, G A

1997-07-01

Sieve elements in the phloem of most angiosperms contain proteinaceous filaments and aggregates called P-protein. In the genus Cucurbita, these filaments are composed of two major proteins: PP1, the phloem filament protein, and PP2, the phloem lactin. The gene encoding the phloem filament protein in pumpkin (Cucurbita maxima Duch.) has been isolated and characterized. Nucleotide sequence analysis of the reconstructed gene gPP1 revealed a continuous 2430 bp protein coding sequence, with no introns, encoding an 809 amino acid polypeptide. The deduced polypeptide had characteristics of PP1 and contained a 15 amino acid sequence determined by N-terminal peptide sequence analysis of PP1. The sequence of PP1 was highly repetitive with four 200 amino acid sequence domains containing structural motifs in common with cysteine proteinase inhibitors. Expression of the PP1 gene was detected in roots, hypocotyls, cotyledons, stems, and leaves of pumpkin plants. PP1 and its mRNA accumulated in pumpkin hypocotyls during the period of rapid hypocotyl elongation after which mRNA levels declined, while protein levels remained elevated. PP1 was immunolocalized in slime plugs and P-protein bodies in sieve elements of the phloem. Occasionally, PP1 was detected in companion cells. PP1 mRNA was localized by in situ hybridization in companion cells at early stages of vascular differentiation. The developmental accumulation and localization of PP1 and its mRNA paralleled the phloem lactin, further suggesting an interaction between these phloem-specific proteins.
Capillary electrophoresis, a method for the determination of nucleic acid ligands covalently attached to quantum dots representing a donor of Förster resonance energy transfer.

PubMed

Datinská, Vladimíra; Klepárník, Karel; Belšánová, Barbora; Minárik, Marek; Foret, František

2018-05-09

The synthesis and determination of the structure of a Förster resonance energy transfer probe intended for the detection of specific nucleic acid sequences are described here. The probe is based on the hybridization of oligonucleotide modified quantum dots with a fluorescently labeled nucleic acid sample resulting in changes of the fluorescence emission due to the energy transfer effect. The stoichiometry distribution of oligonucleotides conjugated to quantum dots was determined by capillary electrophoresis separation. The results indicate that one to four molecules of oligonucleotide are conjugated to the surface of a single nanoparticle. This conclusion is confirmed by the course of the dependence of Förster resonance energy transfer efficiency on the concentration of fluorescently labeled complementary single-stranded nucleic acid, showing saturation. While the energy transfer efficiency of the probe hybridized with complementary nucleic acid strands was 30%, negligible efficiency was observed with a non-complementary strands. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Identification and characterization of a matrix protein (PPP-10) in the periostracum of the pearl oyster, Pinctada fucata.

PubMed

Nakayama, Seiji; Suzuki, Michio; Endo, Hirotoshi; Iimura, Kurin; Kinoshita, Shigeharu; Watabe, Shugo; Kogure, Toshihiro; Nagasawa, Hiromichi

2013-01-01

The periostracum is a layered structure that is formed as a mollusk shell grows. The shell is covered by the periostracum, which consists of organic matrices that prevent decalcification of the shell. In the present study, we discovered the presence of chitin in the periostracum and identified a novel matrix protein, Pinctada fucata periostracum protein named PPP-10. It was purified from the sodium dodecyl sulfate/dithiothreitol-soluble fraction of the periostracum of the Japanese pearl oyster, P. fucata. The deduced amino acid sequence was determined by a combination of amino acid sequence analysis and cDNA cloning. The open reading frame encoded a precursor protein of 112 amino acid residues including a 21-residue signal peptide. The 91 residues following the signal peptide contained abundant Cys and Tyr residues. PPP-10 was expressed on the outer side of the outer fold in the mantle, indicating that PPP-10 was present in the second or third layer of the periostracum. We also determined that the recombinant PPP-10 had chitin-binding activity and could incorporate chitin into the scaffolds of the periostracum. These results shed light on the early steps in mollusk shell formation.
Identification and characterization of a matrix protein (PPP-10) in the periostracum of the pearl oyster, Pinctada fucata☆

PubMed Central

Nakayama, Seiji; Suzuki, Michio; Endo, Hirotoshi; Iimura, Kurin; Kinoshita, Shigeharu; Watabe, Shugo; Kogure, Toshihiro; Nagasawa, Hiromichi

2013-01-01

The periostracum is a layered structure that is formed as a mollusk shell grows. The shell is covered by the periostracum, which consists of organic matrices that prevent decalcification of the shell. In the present study, we discovered the presence of chitin in the periostracum and identified a novel matrix protein, Pinctada fucata periostracum protein named PPP-10. It was purified from the sodium dodecyl sulfate/dithiothreitol-soluble fraction of the periostracum of the Japanese pearl oyster, P. fucata. The deduced amino acid sequence was determined by a combination of amino acid sequence analysis and cDNA cloning. The open reading frame encoded a precursor protein of 112 amino acid residues including a 21-residue signal peptide. The 91 residues following the signal peptide contained abundant Cys and Tyr residues. PPP-10 was expressed on the outer side of the outer fold in the mantle, indicating that PPP-10 was present in the second or third layer of the periostracum. We also determined that the recombinant PPP-10 had chitin-binding activity and could incorporate chitin into the scaffolds of the periostracum. These results shed light on the early steps in mollusk shell formation. PMID:24251105
Complete sequence analysis reveals two distinct poleroviruses infecting cucurbits in China.

PubMed

Xiang, Hai-ying; Shang, Qiao-xia; Han, Cheng-gui; Li, Da-wei; Yu, Jia-lin

2008-01-01

The complete RNA genomes of a Chinese isolate of cucurbit aphid-borne yellows virus (CABYV-CHN) and a new polerovirus tentatively referred to as melon aphid-borne yellows virus (MABYV) were determined. The entire genome of CABYV-CHN shared 89.0% nucleotide sequence identity with the French CABYV isolate. In contrast, nucleotide sequence identities between MABYV and CABYV and other poleroviruses were in the range of 50.7-74.2%, with amino acid sequence identities ranging from 24.8 to 82.9% for individual gene products. We propose that CABYV-CHN is a strain of CABYV and that MABYV is a member of a tentative distinct species within the genus Polerovirus.
The complete genomic sequence of a tentative new polerovirus identified in barley in South Korea.

PubMed

Zhao, Fumei; Lim, Seungmo; Yoo, Ran Hee; Igori, Davaajargal; Kim, Sang-Min; Kwak, Do Yeon; Kim, Sun Lim; Lee, Bong Choon; Moon, Jae Sun

2016-07-01

The complete nucleotide sequence of a new barley polerovirus, tentatively named barley virus G (BVG), which was isolated in Gimje, South Korea, has been determined using an RNA sequencing technique combined with polymerase chain reaction methods. The viral genomic RNA of BVG is 5,620 nucleotides long and contains six typical open reading frames commonly observed in other poleroviruses. Sequence comparisons revealed that BVG is most closely related to maize yellow dwarf virus-RMV, with the highest amino acid identities being less than 90 % for all of the corresponding proteins. These results suggested that BVG is a member of a new species in the genus Polerovirus.
Identification of Bacteriophage N4 Virion RNA Polymerase-Nucleic Acid Interactions in Transcription Complexes*

PubMed Central

Davydova, Elena K.; Kaganman, Irene; Kazmierczak, Krystyna M.; Rothman-Denes, Lucia B.

2009-01-01

Bacteriophage N4 mini-virion RNA polymerase (mini-vRNAP), the 1106-amino acid transcriptionally active domain of vRNAP, recognizes single-stranded DNA template-containing promoters composed of conserved sequences and a 3-base loop–5-base pair stem hairpin structure. The major promoter recognition determinants are a purine located at the center of the hairpin loop (–11G) and a base at the hairpin stem (–8G). Mini-vRNAP is an evolutionarily highly diverged member of the T7 family of RNAPs. A two-plasmid system was developed to measure the in vivo activity of mutant mini-vRNAP enzymes. Five mini-vRNAP derivatives, each containing a pair of cysteine residues separated by ∼100 amino acids and single cysteine-containing enzymes, were generated. These reagents were used to determine the smallest catalytically active polypeptide and to map promoter, substrate, and RNA-DNA hybrid contact sites to single amino acid residues in the enzyme by using end-labeled 5-iododeoxyuridine- and azidophenacyl-substituted oligonucleotides, cross-linkable derivatives of the initiating nucleotide, and RNA products with 5-iodouridine incorporated at specific positions. Localization of functionally important amino acid residues in the recently determined crystal structures of apomini-vRNAP and the mini-vRNAP-promoter complex and comparison with the crystal structures of the T7 RNAP initiation and elongation complexes allowed us to predict major rearrangements in mini-vRNAP in the transition from transcription initiation to elongation similar to those observed in T7 RNAP, a task otherwise precluded by the lack of sequence homology between N4 mini-vRNAP and T7 RNAP. PMID:19015264
Potential Value of Major Antigenic Protein 2 for Serological Diagnosis of Heartwater and Related Ehrlichial Infections

PubMed Central

Bowie, Michael V.; Reddy, G. Roman; Semu, Shalt M.; Mahan, Suman M.; Barbet, Anthony F.

1999-01-01

Cowdria ruminantium is the etiologic agent of heartwater, a disease causing major economic loss in ruminants in sub-Saharan Africa and the Caribbean. Development of a serodiagnostic test is essential for determining the carrier status of animals from regions where heartwater is endemic, but most available tests give false-positive reactions with sera against related Erhlichia species. Current approaches rely on molecular methods to define proteins and epitopes that may allow specific diagnosis. Two major antigenic proteins (MAPs), MAP1 and MAP2, have been examined for their use as antigens in the serodiagnosis of heartwater. The objectives of this study were (i) to determine if MAP2 is conserved among five geographically divergent strains of C. ruminantium and (ii) to determine if MAP2 homologs are present in Ehrlichia canis, the causative agent of canine ehrlichiosis, and Ehrlichia chaffeensis, the organism responsible for human monocytic ehrlichiosis. These two agents are closely related to C. ruminantium. The map2 gene from four strains of C. ruminantium was cloned, sequenced, and compared with the previously reported map2 gene from the Crystal Springs strain. Only 10 nucleic acid differences between the strains were identified, and they translate to only 3 amino acid changes, indicating that MAP2 is highly conserved. Genes encoding MAP2 homologs from E. canis and E. chaffeensis also were cloned and sequenced. Amino acid analysis of MAP2 homologs of E. chaffeensis and E. canis with MAP2 of C. ruminantium revealed 83.4 and 84.4% identities, respectively. Further analysis of MAP2 and its homologs revealed that the whole protein lacks specificity for heartwater diagnosis. The development of epitope-specific assays using this sequence information may produce diagnostic tests suitable for C. ruminantium and also other related rickettsiae. PMID:10066656
Cloning and Expression of a Phloretin Hydrolase Gene from Eubacterium ramulus and Characterization of the Recombinant Enzyme

PubMed Central

Schoefer, Lilian; Braune, Annett; Blaut, Michael

2004-01-01

Phloretin hydrolase catalyzes the hydrolytic C-C cleavage of phloretin to phloroglucinol and 3-(4-hydroxyphenyl)propionic acid during flavonoid degradation in Eubacterium ramulus. The gene encoding the enzyme was cloned by screening a gene library for hydrolase activity. The insert of a clone conferring phloretin hydrolase activity was sequenced. Sequence analysis revealed an open reading frame of 822 bp (phy), a putative promoter region, and a terminating stem-loop structure. The deduced amino acid sequence of phy showed similarities to a putative protein of the 2,4-diacetylphloroglucinol biosynthetic operon from Pseudomonas fluorescens. The phloretin hydrolase was heterologously expressed in Escherichia coli and purified. The molecular mass of the native enzyme was approximately 55 kDa as determined by gel filtration. The results of sodium dodecyl sulfate-polyacrylamide gel electrophoresis and the deduced amino acid sequence of phy indicated molecular masses of 30 and 30.8 kDa, respectively, suggesting that the enzyme is a homodimer. The recombinant phloretin hydrolase catalyzed the hydrolysis of phloretin to equimolar amounts of phloroglucinol and 3-(4-hydroxyphenyl)propionic acid. The optimal temperature and pH of the catalyzed reaction mixture were 37°C and 7.0, respectively. The Km for phloretin was 13 ± 3 μM and the kcat was 10 ± 2 s−1. The enzyme did not transform phloretin-2′-glucoside (phloridzin), neohesperidin dihydrochalcone, 1,3-diphenyl-1,3-propandione, or trans-1,3-diphenyl-2,3-epoxy-propan-1-one. The catalytic activity of the phloretin hydrolase was reduced by N-bromosuccinimide, o-phenanthroline, N-ethylmaleimide, and CuCl2 to 3, 20, 35, and 85%, respectively. Phloroglucinol and 3-(4-hydroxyphenyl)propionic acid reduced the activity to 54 and 70%, respectively. PMID:15466559
Atan1p-an extracellular tannase from the dimorphic yeast Arxula adeninivorans: molecular cloning of the ATAN1 gene and characterization of the recombinant enzyme.

PubMed

Böer, Erik; Bode, Rüdiger; Mock, Hans-Peter; Piontek, Michael; Kunze, Gotthard

2009-06-01

The tannase-encoding Arxula adeninivorans gene ATAN1 was isolated from genomic DNA by PCR, using as primers oligonucleotide sequences derived from peptides obtained after tryptic digestion of the purified tannase protein. The gene harbours an ORF of 1764 bp, encoding a 587-amino acid protein, preceded by an N-terminal secretion sequence comprising 28 residues. The deduced amino acid sequence was similar to those of tannases from Aspergillus oryzae (50% identity), A. niger (48%) and putative tannases from A. fumigatus (52%) and A. nidulans (50%). The sequence contains the consensus pentapeptide motif (-Gly-X-Ser-X-Gly-) which forms part of the catalytic centre of serine hydrolases. Expression of ATAN1 is regulated by the carbon source. Supplementation with tannic acid or gallic acid leads to induction of ATAN1, and accumulation of the native tannase enzyme in the medium. The enzymes recovered from both wild-type and recombinant strains were essentially indistinguishable. A molecular mass of approximately 320 kDa was determined, indicating that the native, glycosylated tannase consists of four identical subunits. The enzyme has a temperature optimum at 35-40 degrees C and a pH optimum at approximately 6.0. The enzyme is able to remove gallic acid from both condensed and hydrolysable tannins. The wild-type strain LS3 secreted amounts of tannase equivalent to 100 U/l under inducing conditions, while the transformant strain, which overexpresses the ATAN1 gene from the strong, constitutively active A. adeninivorans TEF1 promoter, produced levels of up to 400 U/l when grown in glucose medium in shake flasks. Copyright (c) 2009 John Wiley & Sons, Ltd.
Connective tissue activation. XXXII. Structural and biologic characteristics of mesenchymal cell-derived connective tissue activating peptide-V.

PubMed

Cabral, A R; Cole, L A; Walz, D A; Castor, C W

1987-12-01

Connective tissue activating peptide-V (CTAP-V) is a single-chain, mesenchymal cell-derived anionic protein with large and small molecular forms (Mr of 28,000 and 16,000, respectively), as defined by sodium dodecyl sulfate-polyacrylamide gel electrophoresis. The proteins have similar specific activities with respect to stimulation of hyaluronic acid and DNA formation in human synovial fibroblast cultures. S-carboxymethylation or removal of sialic acid residues did not modify CTAP-V biologic activity. Rabbit antibodies raised separately against each of the purified CTAP-V proteins reacted, on immunodiffusion and on Western blot, with each antigen and neutralized mitogenic activity. The amino-terminal amino acid sequence of the CTAP-V proteins, determined by 2 laboratories, confirmed their structural similarities. The amino-terminal sequence through 37 residues was demonstrated for the smaller protein. The first 10 residues of CTAP-V (28 kd) were identical to the N-terminal decapeptide of CTAP-V (16 kd). The C-terminal sequence, determined by carboxypeptidase Y digestion, was the same for both CTAP-V molecular species. The 2 CTAP-V peptides had similar amino acid compositions, whether residues were expressed as a percent of the total or were normalized to mannose. Reduction of native CTAP-V protein released sulfhydryl groups in a protein:disulfide ratio of 1:2; this suggests that CTAP-V contains 2 intramolecular disulfide bonds. Clearly, CTAP-V is a glycoprotein. The carbohydrate content of CTAP-V (16 kd) and CTAP-V (28 kd) is 27% and 25%, respectively. CTAP-V may have significance in relation to autocrine mechanisms for growth regulation of connective tissue cells and other cell types.
Alanine-170 and proline-172 are critical determinants for extracellular CD20 epitopes; heterogeneity in the fine specificity of CD20 monoclonal antibodies is defined by additional requirements imposed by both amino acid sequence and quaternary structure.

PubMed

Polyak, Maria J; Deans, Julie P

2002-05-01

In vivo ablation of malignant B cells can be achieved using antibodies directed against the CD20 antigen. Fine specificity differences among CD20 monoclonal antibodies (mAbs) are assumed not to be a factor in determining their efficacy because evidence from antibody-blocking studies indicates limited epitope diversity with only 2 overlapping extracellular CD20 epitopes. However, in this report a high degree of heterogeneity among antihuman CD20 mAbs is demonstrated. Mutation of alanine and proline at positions 170 and 172 (AxP) (single-letter amino acid codes; x indicates the identical amino acid at the same position in the murine and human CD20 sequences) in human CD20 abrogated the binding of all CD20 mAbs tested. Introduction of AxP into the equivalent positions in the murine sequence, which is not otherwise recognized by antihuman CD20 mAbs, fully reconstituted the epitope recognized by B1, the prototypic anti-CD20 mAb. 2H7, a mAb previously thought to recognize the same epitope as B1, did not recognize the murine AxP mutant. Reconstitution of the 2H7 epitope was achieved with additional mutations replacing VDxxD in the murine sequence for INxxN (positions 162-166 in the human sequence). The integrity of the 2H7 epitope, unlike that of B1, further depends on the maintenance of CD20 in an oligomeric complex. The majority of 16 antihuman CD20 mAbs tested, including rituximab, bound to murine CD20 containing the AxP mutations. Heterogeneity in the fine specificity of these antibodies was indicated by marked differences in their ability to induce homotypic cellular aggregation and translocation of CD20 to a detergent-insoluble membrane compartment previously identified as lipid rafts.

Mutations in the S gene and in the overlapping reverse transcriptase region in chronic hepatitis B Chinese patients with coexistence of HBsAg and anti-HBs.

PubMed

Ding, Feng; Miao, Xi-Li; Li, Yan-Xia; Dai, Jin-Fen; Yu, Hong-Gang

2016-01-01

The mechanism underlying the coexistence of hepatitis B surface antigen and antibodies to HBsAg in chronic hepatitis B patients remains unknown. This research aimed to determine the clinical and virological features of the rare pattern. A total of 32 chronic hepatitis B patients infected by HBV genotype C were included: 15 carrying both HBsAg and anti-HBs (group I) and 17 solely positive for HBsAg (group II). S gene and reverse transcriptase region sequences were amplified, sequenced and compared with the reference sequences. The amino acid variability within major hydrophilic region, especially the "a" determinant region, and within reverse transcriptase for regions overlapping the major hydrophilic region in group I is significantly higher than those in group II. Mutation sI126S/T within the "a" determinant was the most frequent change, and only patients from group I had the sQ129R, sG130N, sF134I, sG145R amino acid changes, which are known to alter immunogenicity. In chronic patients, the concurrent HBsAg/anti-HBs serological profile is associated with an increased aa variability in several key areas of HBV genome. Additional research on these genetic mutants are needed to clarify their biological significance for viral persistence. Copyright © 2015 Elsevier Editora Ltda. All rights reserved.
Thermophilic cellobiohydrolase

DOEpatents

Sapra, Rajat; Park, Joshua I.; Datta, Supratim; Simmons, Blake A.

2017-04-18

The present invention provides for a composition comprising a polypeptide comprising a first amino acid sequence having at least 70% identity with the amino acid sequence of Csac GH5 wherein said first amino acid sequence has a thermostable or thermophilic cellobiohydrolase (CBH) or exoglucanase activity.
Molecular cloning and characterization of beluga whale (Delphinapterus leucas) interleukin-1beta and tumor necrosis factor-alpha.

PubMed Central

Denis, F; Archambault, D

2001-01-01

Interleukin-1beta (IL-1beta) and tumor necrosis factor-alpha (TNF-alpha) are cytokines produced primarily by monocytes and macrophages with regulatory effects in inflammation and multiple aspects of the immune response. As yet, no molecular data have been reported for IL-1beta and TNF-alpha of the beluga whale. In this study, we cloned and determined the entire cDNA sequence encoding beluga whale IL-1beta and TNF-alpha. The genetic relationship of the cytokine sequences was then analyzed with those from several mammalian species, including the human and the pig. The homology of beluga whale IL-1beta nucleic acid and deduced amino acid sequences with those from these mammalian species ranged from 74.6 to 86.0% and 62.7 to 77.1%, respectively, whereas that of TNF-alpha varied from 79.3 to 90.8% and 75.3 to 87.7%, respectively. Phylogenetic analyses based on deduced amino acid sequences showed that the beluga whale IL-1beta and TNF-alpha were most closely related to those of the ruminant species (cattle, sheep, and deer). The beluga whale IL-1beta- and TNF-alpha-encoding sequences were thereafter successfully expressed in Escherichia coli as fusion proteins by using procaryotic expression vectors. The fusion proteins were used to produce beluga whale IL-1beta- and TNF-alpha-specific rabbit antisera. Images Figure 3. Figure 4. Figure 5. PMID:11768130
Isolation and characterization of full-length putative alcohol dehydrogenase genes from polygonum minus

NASA Astrophysics Data System (ADS)

Hamid, Nur Athirah Abd; Ismail, Ismanizan

2013-11-01

Polygonum minus, locally named as Kesum is an aromatic herb which is high in secondary metabolite content. Alcohol dehydrogenase is an important enzyme that catalyzes the reversible oxidation of alcohol and aldehyde with the presence of NAD(P)(H) as co-factor. The main focus of this research is to identify the gene of ADH. The total RNA was extracted from leaves of P. minus which was treated with 150 μM Jasmonic acid. Full-length cDNA sequence of ADH was isolated via rapid amplification cDNA end (RACE). Subsequently, in silico analysis was conducted on the full-length cDNA sequence and PCR was done on genomic DNA to determine the exon and intron organization. Two sequences of ADH, designated as PmADH1 and PmADH2 were successfully isolated. Both sequences have ORF of 801 bp which encode 266 aa residues. Nucleotide sequence comparison of PmADH1 and PmADH2 indicated that both sequences are highly similar at the ORF region but divergent in the 3' untranslated regions (UTR). The amino acid is differ at the 107 residue; PmADH1 contains Gly (G) residue while PmADH2 contains Cys (C) residue. The intron-exon organization pattern of both sequences are also same, with 3 introns and 4 exons. Based on in silico analysis, both sequences contain "classical" short chain alcohol dehydrogenases/reductases ((c) SDRs) conserved domain. The results suggest that both sequences are the members of short chain alcohol dehydrogenase family.
[Taxonomic status of the Tyulek virus (TLKV) (Orthomyxoviridae, Quaranjavirus, Quaranfil group) isolated from the ticks Argas vulgaris Filippova, 1961 (Argasidae) from the birds burrow nest biotopes in the Kyrgyzstan].

PubMed

L'vov, D K; Al'khovskiĭ, S V; Shchelkanov, M Iu; Shchetinin, A M; Deriabin, P G; Aristova, V A; Gitel'man, A K; Samokhvalov, E I; Botikov, A G

2014-01-01

The Tyulek virus (TLKV) was isolated from the ticks Argas vulgaris Filippova, 1961 (Argasidae), collected from the burrow biotopes in multispecies birds colony in the Aksu river floodplain near Tyulek village (northern part of Chu Valley, Kyrgyzstan). Recently, the TLKV was assigned to the Quaranfil group (including the Quaranfil virus (QRFV), Johnston Atoll virus (JAV), Lake Chad virus) that is a novel genus of the Quaranjavirus in the Orthomyxoviridae family. In his work, the complete genome (ID GenBank KJ438647-8) sequence of the TLKV was determined using next-generation sequencing (Illumina platform). Comparison of deduced amino acid sequences shows closed relationship of the TLKV with QRFV and JAV (86% and 84% identity for PB1 and about 70% for PB2 and PA, respectively). The identity level of the TLKV and QRFV in outer glycoprotein GP is 72% and 80% for nucleotide and amino acid sequences, respectively. The phylogenetic analysis showed that the TLKV belongs to the genus of the Quaranjavirus in the family Orthomyxoviridae.
Active site of tripeptidyl peptidase II from human erythrocytes is of the subtilisin type.

PubMed Central

Tomkinson, B; Wernstedt, C; Hellman, U; Zetterqvist, O

1987-01-01

The present report presents evidence that the amino acid sequence around the serine of the active site of human tripeptidyl peptidase II is of the subtilisin type. The enzyme from human erythrocytes was covalently labeled at its active site with [3H]diisopropyl fluorophosphate, and the protein was subsequently reduced, alkylated, and digested with trypsin. The labeled tryptic peptides were purified by gel filtration and repeated reversed-phase HPLC, and their amino-terminal sequences were determined. Residue 9 contained the radioactive label and was, therefore, considered to be the active serine residue. The primary structure of the part of the active site (residues 1-10) containing this residue was concluded to be Xaa-Thr-Gln-Leu-Met-Asx-Gly-Thr-Ser-Met. This amino acid sequence is homologous to the sequence surrounding the active serine of the microbial peptidases subtilisin and thermitase. These data demonstrate that human tripeptidyl peptidase II represents a potentially distinct class of human peptidases and raise the question of an evolutionary relationship between the active site of a mammalian peptidase and that of the subtilisin family of serine peptidases. PMID:3313395
Cell culture compositions

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yiao, Jian

2014-03-18

The present invention provides a novel endoglucanase nucleic acid sequence, designated egl6 (SEQ ID NO:1 encodes the full length endoglucanase; SEQ ID NO:4 encodes the mature form), and the corresponding endoglucanase VI amino acid sequence ("EGVI"; SEQ ID NO:3 is the signal sequence; SEQ ID NO:2 is the mature sequence). The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding EGVI, recombinant EGVI proteins and methods for producing the same.
J-Refocused Coherence Transfer Spectroscopic Imaging at 7 T in Human Brain

PubMed Central

Pan, J.W.; Avdievich, N.; Hetherington, H.P.

2013-01-01

Short echo spectroscopy is commonly used to minimize signal modulation due to J-evolution of the cerebral amino acids. However, short echo acquisitions suffer from high sensitivity to macromolecules which make accurate baseline determination difficult. In this report, we describe implementation at 7 T of a double echo J-refocused coherence transfer sequence at echo time (TE) of 34 msec to minimize J-modulation of amino acids while also decreasing interfering macromolecule signals. Simulation of the pulse sequence at 7 T shows excellent resolution of glutamate, glutamine, and N-acetyl aspartate. B1 sufficiency at 7 T for the double echo acquisition is achieved using a transceiver array with radiofrequency (RF) shimming. Using an alternate RF distribution to minimize receiver phase cancellation in the transceiver, accurate phase determination for the coherence transfer is achieved with rapid single scan calibration. This method is demonstrated in spectroscopic imaging mode with n = 5 healthy volunteers resulting in metabolite values consistent with literature and in a patient with epilepsy. PMID:20648684
Molecular cloning and expression of a gene for a factor which stabilizes formation of inhibitor-mitochondrial ATPase complex from Saccharomyces cerevisiae.

PubMed

Akashi, A; Yoshida, Y; Nakagoshi, H; Kuroki, K; Hashimoto, T; Tagawa, K; Imamoto, F

1988-10-01

Stabilizing factor, a 9 kDa protein, stabilizes and facilitates formation of the complex between mitochondrial ATP synthase and its intrinsic inhibitor protein. A clone containing the gene encoding the 9 kDa protein was selected from a yeast genomic library to determine the structure of its precursor protein. As deduced from the nucleotide sequence, the precursor of the yeast 9 kDa stabilizing factor contains 86 amino acid residues and has a molecular weight of 10,062. From the predicted sequence we infer that the stabilizing factor precursor contains a presequence of 23 amino acid residues at its amino terminus. We also used S1 mapping to determine the initiation site of transcription under glucose-repressed or derepressed conditions. These experiments suggest that transcription of this gene starts at three different sites and that only one of them is not affected by the presence of glucose.
Biosynthesis of Lipoic Acid in Arabidopsis: Cloning and Characterization of the cDNA for Lipoic Acid Synthase1

PubMed Central

Yasuno, Rie; Wada, Hajime

1998-01-01

Lipoic acid is a coenzyme that is essential for the activity of enzyme complexes such as those of pyruvate dehydrogenase and glycine decarboxylase. We report here the isolation and characterization of LIP1 cDNA for lipoic acid synthase of Arabidopsis. The Arabidopsis LIP1 cDNA was isolated using an expressed sequence tag homologous to the lipoic acid synthase of Escherichia coli. This cDNA was shown to code for Arabidopsis lipoic acid synthase by its ability to complement a lipA mutant of E. coli defective in lipoic acid synthase. DNA-sequence analysis of the LIP1 cDNA revealed an open reading frame predicting a protein of 374 amino acids. Comparisons of the deduced amino acid sequence with those of E. coli and yeast lipoic acid synthase homologs showed a high degree of sequence similarity and the presence of a leader sequence presumably required for import into the mitochondria. Southern-hybridization analysis suggested that LIP1 is a single-copy gene in Arabidopsis. Western analysis with an antibody against lipoic acid synthase demonstrated that this enzyme is located in the mitochondrial compartment in Arabidopsis cells as a 43-kD polypeptide. PMID:9808738
UET: a database of evolutionarily-predicted functional determinants of protein sequences that cluster as functional sites in protein structures.

PubMed

Lua, Rhonald C; Wilson, Stephen J; Konecki, Daniel M; Wilkins, Angela D; Venner, Eric; Morgan, Daniel H; Lichtarge, Olivier

2016-01-04

The structure and function of proteins underlie most aspects of biology and their mutational perturbations often cause disease. To identify the molecular determinants of function as well as targets for drugs, it is central to characterize the important residues and how they cluster to form functional sites. The Evolutionary Trace (ET) achieves this by ranking the functional and structural importance of the protein sequence positions. ET uses evolutionary distances to estimate functional distances and correlates genotype variations with those in the fitness phenotype. Thus, ET ranks are worse for sequence positions that vary among evolutionarily closer homologs but better for positions that vary mostly among distant homologs. This approach identifies functional determinants, predicts function, guides the mutational redesign of functional and allosteric specificity, and interprets the action of coding sequence variations in proteins, people and populations. Now, the UET database offers pre-computed ET analyses for the protein structure databank, and on-the-fly analysis of any protein sequence. A web interface retrieves ET rankings of sequence positions and maps results to a structure to identify functionally important regions. This UET database integrates several ways of viewing the results on the protein sequence or structure and can be found at http://mammoth.bcm.tmc.edu/uet/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Three closely related herpesviruses are associated with fibropapillomatosis in marine turtles

USGS Publications Warehouse

Quackenbush, S.L.; Work, Thierry M.; Balazs, George H.; Casey, Rufina N.; Rovnak, J.; Chaves, A.; duToit, L.; Baines, J.D.; Parrish, C.R.; Bowser, Paul R.; Casey, James W.

1998-01-01

Green turtle fibropapillomatosis is a neoplastic disease of increasingly significant threat to the survivability of this species. Degenerate PCR primers that target highly conserved regions of genes encoding herpesvirus DNA polymerases were used to amplify a DNA sequence from fibropapillomas and fibromas from Hawaiian and Florida green turtles. All of the tumors tested (n= 23) were found to harbor viral DNA, whereas no viral DNA was detected in skin biopsies from tumor-negative turtles. The tissue distribution of the green turtle herpesvirus appears to be generally limited to tumors where viral DNA was found to accumulate at approximately two to five copies per cell and is occasionally detected, only by PCR, in some tissues normally associated with tumor development. In addition, herpesviral DNA was detected in fibropapillomas from two loggerhead and four olive ridley turtles. Nucleotide sequencing of a 483-bp fragment of the turtle herpesvirus DNA polymerase gene determined that the Florida green turtle and loggerhead turtle sequences are identical and differ from the Hawaiian green turtle sequence by five nucleotide changes, which results in two amino acid substitutions. The olive ridley sequence differs from the Florida and Hawaiian green turtle sequences by 15 and 16 nucleotide changes, respectively, resulting in four amino acid substitutions, three of which are unique to the olive ridley sequence. Our data suggest that these closely related turtle herpesviruses are intimately involved in the genesis of fibropapillomatosis.
Purification, characterization and sequence analysis of Omp50,a new porin isolated from Campylobacter jejuni.

PubMed Central

Bolla, J M; Dé, E; Dorez, A; Pagès, J M

2000-01-01

A novel pore-forming protein identified in Campylobacter was purified by ion-exchange chromatography and named Omp50 according to both its molecular mass and its outer membrane localization. We observed a pore-forming ability of Omp50 after re-incorporation into artificial membranes. The protein induced cation-selective channels with major conductance values of 50-60 pS in 1 M NaCl. N-terminal sequencing allowed us to identify the predicted coding sequence Cj1170c from the Campylobacter jejuni genome database as the corresponding gene in the NCTC 11168 genome sequence. The gene, designated omp50, consists of a 1425 bp open reading frame encoding a deduced 453-amino acid protein with a calculated pI of 5.81 and a molecular mass of 51169.2 Da. The protein possessed a 20-amino acid leader sequence. No significant similarity was found between Omp50 and porin protein sequences already determined. Moreover, the protein showed only weak sequence identity with the major outer-membrane protein (MOMP) of Campylobacter, correlating with the absence of antigenic cross-reactivity between these two proteins. Omp50 is expressed in C. jejuni and Campylobacter lari but not in Campylobacter coli. The gene, however, was detected in all three species by PCR. According to its conformation and functional properties, the protein would belong to the family of outer-membrane monomeric porins. PMID:11104668
Complete genomic sequence of Powassan virus: evaluation of genetic elements in tick-borne versus mosquito-borne flaviviruses.

PubMed

Mandl, C W; Holzmann, H; Kunz, C; Heinz, F X

1993-05-01

The complete nucleotide sequence of the positive-stranded RNA genome of the tick-borne flavivirus Powassan (10,839 nucleotides) was elucidated and the amino acid sequence of all viral proteins was derived. Based on this sequence as well as serological data, Powassan virus represents the most divergent member of the tick-borne serocomplex within the genus flaviviruses, family Flaviviridae. The primary nucleotide sequence and potential RNA secondary structures of the Powassan virus genome as well as the protein sequences and the reactivities of the virion with a panel of monoclonal antibodies were compared to other tick-borne and mosquito-borne flaviviruses. These analyses corroborated significant differences between tick-borne and mosquito-borne flaviviruses, but also emphasized structural elements that are conserved among both vector groups. The comparisons among tick-borne flaviviruses revealed conserved sequence elements that might represent important determinants of the tick-borne flavivirus phenotype.
Nucleotide sequence determination of guinea-pig casein B mRNA reveals homology with bovine and rat alpha s1 caseins and conservation of the non-coding regions of the mRNA.

PubMed Central

Hall, L; Laird, J E; Craig, R K

1984-01-01

Nucleotide sequence analysis of cloned guinea-pig casein B cDNA sequences has identified two casein B variants related to the bovine and rat alpha s1 caseins. Amino acid homology was largely confined to the known bovine or predicted rat phosphorylation sites and within the 'signal' precursor sequence. Comparison of the deduced nucleotide sequence of the guinea-pig and rat alpha s1 casein mRNA species showed greater sequence conservation in the non-coding than in the coding regions, suggesting a functional and possibly regulatory role for the non-coding regions of casein mRNA. The results provide insight into the evolution of the casein genes, and raise questions as to the role of conserved nucleotide sequences within the non-coding regions of mRNA species. Images Fig. 1. PMID:6548375
Novel rod-shaped viruses isolated from garlic, Allium sativum, possessing a unique genome organization.

PubMed

Sumi, S; Tsuneyoshi, T; Furutani, H

1993-09-01

Rod-shaped flexuous viruses were partially purified from garlic plants (Allium sativum) showing typical mosaic symptoms. The genome was shown to be composed of RNA with a poly(A) tail of an estimated size of 10 kb as shown by denaturing agarose gel electrophoresis. We constructed cDNA libraries and screened four independent clones, which were designated GV-A, GV-B, GV-C and GV-D, using Northern and Southern blot hybridization. Nucleotide sequence determination of the cDNAs, two of which correspond to nearly one-third of the virus genomic RNA, shows that all of these viruses possess an identical genomic structure and that also at least four proteins are encoded in the viral cDNA, their M(r)s being estimated to be 15K, 27K, 40K and 11K. The 15K open reading frame (ORF) encodes the core-like sequence of a zinc finger protein preceded by a cluster of basic amino acid residues. The 27K ORF probably encodes the viral coat protein (CP), based on both the existence of some conserved sequences observed in many other rod-shaped or flexuous virus CPs and an overall amino acid sequence similarity to potexvirus and carlavirus CPs. The 11K ORF shows significant amino acid sequence similarities to the corresponding 12K proteins of the potexviruses and carlaviruses. On the other hand, the 40K ORF product does not resemble any other plant virus gene products reported so far. The genomic organization in the 3' region of the garlic viruses resembles, but clearly differs from, that of carlaviruses. Phylogenetic analysis based upon the amino acid sequence of the viral capsid protein also indicates that the garlic viruses have a unique and distinct domain different from those of the potexvirus and carlavirus groups. The results suggest that the garlic viruses described here belong to an unclassified and new virus group closely related to the carlaviruses.
Quasispecies characters of hepatitis B virus in immunoprophylaxis failure infants.

PubMed

Wang, Xin; Deng, Wanyan; Qian, Keli; Deng, Haijun; Huang, Yong; Tu, Zeng; Huang, Ailong; Long, Quanxin

2018-06-01

Hepatitis B vaccination prevents 80-95% of transmission and reduces the incidence of HBV in children. The variations in the a determinant of HBV surface antigen (HBsAg) have been reported to be the most prevalent cause for vaccine or antibody escape. There is a conflicting evidence on as to whether escape mutants arise de novo in infected infants or whether the mutants, that have preexisted maternally, subsequently undergo selective replication in the infant under immune pressure. Here, we report that nearly 65% (55 of 85) vaccination failure in child patients has no amino acid substitution in a determinant as seen by Sanger sequencing. We further employed an Illumina sequencing platform-based method to detect HBV quasispecies in four immunoprophylaxis failure infants and their mothers. In our data, the substitution rate of amino acid located at a determinant is relatively low (< 10%), I/T126A, C124S, F134Y, K141Q, Q129H, D144A, G145V, and N146K, which showed no statistical difference to their mothers, proving that these vaccine escape mutants preexist maternally as minor variants. Besides that, bioinformatical analysis showed that the binding affinity of high variation epitopes (amino acid divergence in mother and their infants > 20%) to related HLA molecules was generally decreased, these traces of immune escape suggesting that immune pressure was present and was effective in all samples.
[Analysis of structural characteristics of alpha-tubulins in plants with enhanced cold tolerance].

PubMed

Nyporko, A Iu; Demchuk, O N; Blium, Ia B

2003-01-01

The uniqueness of the point substitutions in the sequences of two alpha-tubulin isotypes from psychrophilic alga Chloromonas that can determine the increased cold tolerance of this alga was analyzed. The comparison of all known amino acid sequences of plant alpha-tubulins enabled to ascertain that only M268-->V replacement is unique and may have a significant influence on spatial structure of plant alpha-tubulins. Modeling of molecular surfaces of alpha-tubulins from Chloromonas, Chalmydomonas reinhardtii and goose grass Eleusine indica showed that insertion of the amino acid replacement M268-->V into the sequence of goose grace tubulin led to the likening of this protein surface to the surface of native alpha-tubulin from Chloromonas. Alteration of local hydrophobic properties of alpha-tubulin molecular surface in interdimeric contact zone as a result of the mentioned replacement was shown that may play important role in increasing the level of cold resistance of microtubules. The crucial role of amino acid residue in 268 position for forming the interdimeric contact surface of alpha-tubulin molecule was revealed. The assumption is made about the importance of replacements at this position for plant tolerance to abiotic factors of different nature (cold, herbicides).
Trichoderma .beta.-glucosidase

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2006-01-03

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl3, and the corresponding BGL3 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL3, recombinant BGL3 proteins and methods for producing the same.
Carbohydrate degrading polypeptide and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

Canine Antithrombin-III: Some Biochemical and Biologic Properties

DTIC Science & Technology

1987-06-02

performing amino acid analyses, amino acid sequence analysis, and differential refractometry . I thank Ms. Kerry Singer for her excellent typing and...previously determined by differential refractometry at 546 nm, with a value of 0.186 ml/g for dn/dc (refractive index increment) (69). 20 •, "• 11...J ;,.-0 t.! ’ > ~ +-’ :I c: ... Q) :::J ~ w Protein concentration by refractometry = 8.29 mg/ml O.D. value at 280 nm (1:10 dilution
Automated Sanger Analysis Pipeline (ASAP): A Tool for Rapidly Analyzing Sanger Sequencing Data with Minimum User Interference.

PubMed

Singh, Aditya; Bhatia, Prateek

2016-12-01

Sanger sequencing platforms, such as applied biosystems instruments, generate chromatogram files. Generally, for 1 region of a sequence, we use both forward and reverse primers to sequence that area, in that way, we have 2 sequences that need to be aligned and a consensus generated before mutation detection studies. This work is cumbersome and takes time, especially if the gene is large with many exons. Hence, we devised a rapid automated command system to filter, build, and align consensus sequences and also optionally extract exonic regions, translate them in all frames, and perform an amino acid alignment starting from raw sequence data within a very short time. In full capabilities of Automated Mutation Analysis Pipeline (ASAP), it is able to read "*.ab1" chromatogram files through command line interface, convert it to the FASTQ format, trim the low-quality regions, reverse-complement the reverse sequence, create a consensus sequence, extract the exonic regions using a reference exonic sequence, translate the sequence in all frames, and align the nucleic acid and amino acid sequences to reference nucleic acid and amino acid sequences, respectively. All files are created and can be used for further analysis. ASAP is available as Python 3.x executable at https://github.com/aditya-88/ASAP. The version described in this paper is 0.28.
Isolation and Distribution of a Novel Iron-Oxidizing Crenarchaeon from Acidic Geothermal Springs in Yellowstone National Park▿ †

PubMed Central

Kozubal, M.; Macur, R. E.; Korf, S.; Taylor, W. P.; Ackerman, G. G.; Nagy, A.; Inskeep, W. P.

2008-01-01

Novel thermophilic crenarchaea have been observed in Fe(III) oxide microbial mats of Yellowstone National Park (YNP); however, no definitive work has identified specific microorganisms responsible for the oxidation of Fe(II). The objectives of the current study were to isolate and characterize an Fe(II)-oxidizing member of the Sulfolobales observed in previous 16S rRNA gene surveys and to determine the abundance and distribution of close relatives of this organism in acidic geothermal springs containing high concentrations of dissolved Fe(II). Here we report the isolation and characterization of the novel, Fe(II)-oxidizing, thermophilic, acidophilic organism Metallosphaera sp. strain MK1 obtained from a well-characterized acid-sulfate-chloride geothermal spring in Norris Geyser Basin, YNP. Full-length 16S rRNA gene sequence analysis revealed that strain MK1 exhibits only 94.9 to 96.1% sequence similarity to other known Metallosphaera spp. and less than 89.1% similarity to known Sulfolobus spp. Strain MK1 is a facultative chemolithoautotroph with an optimum pH range of 2.0 to 3.0 and an optimum temperature range of 65 to 75°C. Strain MK1 grows optimally on pyrite or Fe(II) sorbed onto ferrihydrite, exhibiting doubling times between 10 and 11 h under aerobic conditions (65°C). The distribution and relative abundance of MK1-like 16S rRNA gene sequences in 14 acidic geothermal springs containing Fe(III) oxide microbial mats were evaluated. Highly related MK1-like 16S rRNA gene sequences (>99% sequence similarity) were consistently observed in Fe(III) oxide mats at temperatures ranging from 55 to 80°C. Quantitative PCR using Metallosphaera-specific primers confirmed that organisms highly similar to strain MK1 comprised up to 40% of the total archaeal community at selected sites. The broad distribution of highly related MK1-like 16S rRNA gene sequences in acidic Fe(III) oxide microbial mats is consistent with the observed characteristics and growth optima of Metallosphaera-like strain MK1 and emphasizes the importance of this newly described taxon in Fe(II) chemolithotrophy in acidic high-temperature environments of YNP. PMID:18083851
Studies of the structure-activity relationships of peptides and proteins involved in growth and development based on their three-dimensional structures.

PubMed

Nagata, Koji

2010-01-01

Peptides and proteins with similar amino acid sequences can have different biological functions. Knowledge of their three-dimensional molecular structures is critically important in identifying their functional determinants. In this review, I describe the results of our and other groups' structure-based functional characterization of insect insulin-like peptides, a crustacean hyperglycemic hormone-family peptide, a mammalian epidermal growth factor-family protein, and an intracellular signaling domain that recognizes proline-rich sequence.
Human ribosomal protein L37 has motifs predicting serine/threonine phosphorylation and a zinc-finger domain.

PubMed

Barnard, G F; Staniunas, R J; Puder, M; Steele, G D; Chen, L B

1994-08-02

Ribosomal protein L37 mRNA is overexpressed in colon cancer. The nucleotide sequences of human L37 from several tumor and normal, colon and liver cDNA sources were determined to be identical. L37 mRNA was approximately 375 nucleotides long encoding 97 amino acids with M(r) = 11,070, pI = 12.6, multiple potential serine/threonine phosphorylation sites and a zinc-finger domain. The human sequence is compared to other species.
Variation in opsin genes correlates with signaling ecology in North American fireflies

PubMed Central

Sander, Sarah E.; Hall, David W.

2015-01-01

Genes underlying signal reception should evolve to maximize signal detection in a particular environment. In animals, opsins, the protein component of visual pigments, are predicted to evolve according to this expectation. Fireflies are known for their bioluminescent mating signals. The eyes of nocturnal species are expected to maximize detection of conspecific signal colors emitted in the typical low-light environment. This is not expected for species that have transitioned to diurnal activity in bright daytime environments. Here we test the hypothesis that opsin gene sequence plays a role in modifying firefly eye spectral sensitivity. We use genome and transcriptome sequencing in four firefly species, transcriptome sequencing in six additional species, and targeted gene sequencing in 28 other species to identify all opsin genes present in North American fireflies and to elucidate amino acid sites under positive selection. We also determine whether amino acid substitutions in opsins are linked to evolutionary changes in signal mode, signal color, and light environment. We find only two opsins, one long wavelength and one ultraviolet, in all firefly species and identify 25 candidate sites that may be involved in determining spectral sensitivity. In addition, we find elevated rates of evolution at transitions to diurnal activity, and changes in selective constraint on LW opsin associated with changes in light environment. Our results suggest that changes in eye spectral sensitivity are at least partially due to opsin sequence. Fireflies continue to be a promising system in which to investigate the evolution of signals, receptors, and signaling environments. PMID:26289828
Variability of the protein sequences of lcrV between epidemic and atypical rhamnose-positive strains of Yersinia pestis.

PubMed

Anisimov, Andrey P; Panfertsev, Evgeniy A; Svetoch, Tat'yana E; Dentovskaya, Svetlana V

2007-01-01

Sequencing of lcrV genes and comparison of the deduced amino acid sequences from ten Y. pestis strains belonging mostly to the group of atypical rhamnose-positive isolates (non-pestis subspecies or pestoides group) showed that the LcrV proteins analyzed could be classified into five sequence types. This classification was based on major amino acid polymorphisms among LcrV proteins in the four "hot points" of the protein sequences. Some additional minor polymorphisms were found throughout these sequence types. The "hot points" corresponded to amino acids 18 (Lys --> Asn), 72 (Lys --> Arg), 273 (Cys --> Ser), and 324-326 (Ser-Gly-Lys --> Arg) in the LcrV sequence of the reference Y. pestis strain CO92. One possible explanation for polymorphism in amino acid sequences of LcrV among different strains is that strain-specific variation resulted from adaptation of the plague pathogen to different rodent and lagomorph hosts.
THE SMALL ACID SOLUBLE PROTEINS (SASP α and SASP β) OF BACILLUS WEIHENSTEPHANENSIS AND B. MYCOIDES GROUP 2 ARE THE MOST DISTINCT AMONG THE B. CEREUS GROUP

PubMed Central

Callahan, Courtney; Fox, Karen; Fox, Alvin

2009-01-01

The Bacillus cereus group includes Bacillus anthracis, Bacillus cereus, Bacillus thuringiensis, Bacillus mycoides and Bacillus weihenstephanensis. The small acid-soluble spore protein (SASP) β has been previously demonstrated to be among the biomarkers differentiating B. anthracis and B. cereus; SASP β of B. cereus most commonly exhibits one or two amino acid substitutions when compared to B. anthracis. SASP α is conserved in sequence among these two species. Neither SASP α nor β for B. thuringiensis, B. mycoides and B. weihenstephanensis have been previously characterized as taxonomic discriminators. In the current work molecular weight (MW) variation of these SASPs were determined by matrix assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI TOF MS) for representative strains of the 5 species within the B. cereus group. The measured MWs also correlate with calculated MWs of translated amino acid sequences generated from whole genome sequencing projects. SASP α and β demonstrated consistent MW among B. cereus, B. thuringiensis, and B. mycoides strains (group 1). However B. mycoides (group 2) and B. weihenstephanensis SASP α and β were quite distinct making them unique among the B. cereus group. Limited sequence changes were observed in SASP α (at most 3 substitutions and 2 deletions) indicating it is a more conserved protein than SASP β (up to 6 substitutions and a deletion). Another even more conserved SASP, SASP α-β type, was described here for the first time. PMID:19616612
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses

USDA-ARS?s Scientific Manuscript database

Background: Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected ...
Performance of 1,2-indanedione and the need for sequential treatment of fingerprints.

PubMed

Mangle, Milery Figuera; Xu, Xioama; de Puit, M

2015-09-01

The use of 1,2-indanedione-ZnCl2 (IND-Zn) for the visualisation of fingermarks on porous materials has been widely accepted. The use of the reagent in comparison with others has been well described. To what extent IND or IND-Zn reacts with amino acids, in comparison to ninhydrin, has not been described to date. In this technical note we describe the analysis of amino acids with LCMS with the purpose of understanding the reactivity of ninhydrin, IND-Zn and the sequence thereof. The consumption of amino acids by these visualisation reagents is a feature we propose to use for calculations on the reactivity of these reagents. By using recently developed methods for the quantification of amino acids, we determined the consumption of these entities by visualisation reagents. We show that the differences in reactivity between IND and ninhydrin are not as big as the differences between 1,8-diazafluoren-9-one (DFO) and ninhydrin. We also show that it is of great importance to use IND-Zn and ninhydrin in sequence, in order to fully consume the amino acids present in fingermarks. Copyright © 2015 The Chartered Society of Forensic Sciences. Published by Elsevier Ireland Ltd. All rights reserved.
.beta.-glucosidase 5 (BGL5) compositions

DOEpatents

Dunn-Coleman, Nigel; Goedegebuur, Frits; Ward, Michael; Yao, Jian

2010-06-01

The present invention provides a novel .beta.-glucosidase nucleic acid sequence, designated bgl5, and the corresponding BGL5 amino acid sequence. The invention also provides expression vectors and host cells comprising a nucleic acid sequence encoding BGL5, recombinant BGL5 proteins and methods for producing the same.
Evidence of Divergent Amino Acid Usage in Comparative Analyses of R5- and X4-Associated HIV-1 Vpr Sequences

PubMed Central

Antell, Gregory C.; Zhong, Wen; Kercher, Katherine; Passic, Shendra; Williams, Jean; Liu, Yucheng; James, Tony; Jacobson, Jeffrey M.; Szep, Zsofia

2017-01-01

Vpr is an HIV-1 accessory protein that plays numerous roles during viral replication, and some of which are cell type dependent. To test the hypothesis that HIV-1 tropism extends beyond the envelope into the vpr gene, studies were performed to identify the associations between coreceptor usage and Vpr variation in HIV-1-infected patients. Colinear HIV-1 Env-V3 and Vpr amino acid sequences were obtained from the LANL HIV-1 sequence database and from well-suppressed patients in the Drexel/Temple Medicine CNS AIDS Research and Eradication Study (CARES) Cohort. Genotypic classification of Env-V3 sequences as X4 (CXCR4-utilizing) or R5 (CCR5-utilizing) was used to group colinear Vpr sequences. To reveal the sequences associated with a specific coreceptor usage genotype, Vpr amino acid sequences were assessed for amino acid diversity and Jensen-Shannon divergence between the two groups. Five amino acid alphabets were used to comprehensively examine the impact of amino acid substitutions involving side chains with similar physiochemical properties. Positions 36, 37, 41, 89, and 96 of Vpr were characterized by statistically significant divergence across multiple alphabets when X4 and R5 sequence groups were compared. In addition, consensus amino acid switches were found at positions 37 and 41 in comparisons of the R5 and X4 sequence populations. These results suggest an evolutionary link between Vpr and gp120 in HIV-1-infected patients. PMID:28620613
Methods of diagnosing alagille syndrome

DOEpatents

Li, Linheng; Hood, Leroy; Krantz, Ian D.; Spinner, Nancy B.

2004-03-09

The present invention provides an isolated polypeptide exhibiting substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the polypeptide does not have the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. The invention further provides an isolated nucleic acid molecule containing a nucleotide sequence encoding substantially the same amino acid sequence as JAGGED, or an active fragment thereof, provided that the nucleotide sequence does not encode the amino acid sequence of SEQ ID NO:5 or SEQ ID NO:6. Also provided herein is a method of inhibiting differentiation of hematopoietic progenitor cells by contacting the progenitor cells with an isolated JAGGED polypeptide, or active fragment thereof. The invention additionally provides a method of diagnosing Alagille Syndrome in an individual. The method consists of detecting an Alagille Syndrome disease-associated mutation linked to a JAGGED locus.
Kinetics of coffee industrial residue pyrolysis using distributed activation energy model and components separation of bio-oil by sequencing temperature-raising pyrolysis.

PubMed

Chen, Nanwei; Ren, Jie; Ye, Ziwei; Xu, Qizhi; Liu, Jingyong; Sun, Shuiyu

2016-12-01

This study was carried out to investigate the kinetics of coffee industrial residue (CIR) pyrolysis, the effect of pyrolysis factors on yield of bio-oil component and components separation of bio-oil. The kinetics of CIR pyrolysis was analyzed using distributed activation energy model (DAEM), based on the experiments in thermogravimetric analyzer (TGA), and it indicated that the average of activation energy (E) is 187.86kJ·mol -1 . The bio-oils were prepared from CIR pyrolysis in vacuum tube furnace, and its components were determined by gas chromatography/mass spectrometry (GC-MS). Among pyrolysis factors, pyrolysis temperature is the most influential factor on components yield of bio-oil, directly concerned with the volatilization and yield of components (palmitic acid, linoleic acid, oleic acid, octadecanoic acid and caffeine). Furthermore, a new method (sequencing temperature-raising pyrolysis) was put forward and applied to the components separation of bio-oil. Based on experiments, a solution of components separation of bio-oil was come out. Copyright © 2016 Elsevier Ltd. All rights reserved.
Partial molar volumes of proteins: amino acid side-chain contributions derived from the partial molar volumes of some tripeptides over the temperature range 10-90 degrees C.

PubMed

Häckel, M; Hinz, H J; Hedwig, G R

1999-11-15

The partial molar volumes of tripeptides of sequence glycyl-X-glycine, where X is one of the amino acids alanine, leucine, threonine, glutamine, phenylalanine, histidine, cysteine, proline, glutamic acid, and arginine, have been determined in aqueous solution over the temperature range 10-90 degrees C using differential scanning densitometry . These data, together with those reported previously, have been used to derive the partial molar volumes of the side-chains of all 20 amino acids. The side-chain volumes are critically compared with literature values derived using partial molar volumes for alternative model compounds. The new amino acid side-chain volumes, along with that for the backbone glycyl group, were used to calculate the partial specific volumes of several proteins in aqueous solution. The results obtained are compared with those observed experimentally. The new side-chain volumes have also been used to re-determine residue volume changes upon protein folding.
Ancient DNA sequence revealed by error-correcting codes.

PubMed

Brandão, Marcelo M; Spoladore, Larissa; Faria, Luzinete C B; Rocha, Andréa S L; Silva-Filho, Marcio C; Palazzo, Reginaldo

2015-07-10

A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code.
Ancient DNA sequence revealed by error-correcting codes

PubMed Central

Brandão, Marcelo M.; Spoladore, Larissa; Faria, Luzinete C. B.; Rocha, Andréa S. L.; Silva-Filho, Marcio C.; Palazzo, Reginaldo

2015-01-01

A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated sequences do not promote amino acid changes in the deviant genomes through codon reassignment. A Bayesian evolutionary analysis of both code-generated and homologous sequences of the Arabidopsis thaliana malate dehydrogenase gene indicates an approximately 1 MYA divergence time from the MDH code-generated sequence node to its paralogous sequences. The DNA-SGA helps to determine the plesiomorphic state of DNA sequences because a single nucleotide alteration often occurs in distantly related taxa and can be found in the alternative codon patterns of noncanonical genetic codes. As a consequence, the algorithm may reveal an earlier stage of the evolution of the standard code. PMID:26159228
Coarse-grained sequences for protein folding and design.

PubMed

Brown, Scott; Fawzi, Nicolas J; Head-Gordon, Teresa

2003-09-16

We present the results of sequence design on our off-lattice minimalist model in which no specification of native-state tertiary contacts is needed. We start with a sequence that adopts a target topology and build on it through sequence mutation to produce new sequences that comprise distinct members within a target fold class. In this work, we use the alpha/beta ubiquitin fold class and design two new sequences that, when characterized through folding simulations, reproduce the differences in folding mechanism seen experimentally for proteins L and G. The primary implication of this work is that patterning of hydrophobic and hydrophilic residues is the physical origin for the success of relative contact-order descriptions of folding, and that these physics-based potentials provide a predictive connection between free energy landscapes and amino acid sequence (the original protein folding problem). We present results of the sequence mapping from a 20- to the three-letter code for determining a sequence that folds into the WW domain topology to illustrate future extensions to protein design.
Coarse-grained sequences for protein folding and design

PubMed Central

Brown, Scott; Fawzi, Nicolas J.; Head-Gordon, Teresa

2003-01-01

We present the results of sequence design on our off-lattice minimalist model in which no specification of native-state tertiary contacts is needed. We start with a sequence that adopts a target topology and build on it through sequence mutation to produce new sequences that comprise distinct members within a target fold class. In this work, we use the α/β ubiquitin fold class and design two new sequences that, when characterized through folding simulations, reproduce the differences in folding mechanism seen experimentally for proteins L and G. The primary implication of this work is that patterning of hydrophobic and hydrophilic residues is the physical origin for the success of relative contact-order descriptions of folding, and that these physics-based potentials provide a predictive connection between free energy landscapes and amino acid sequence (the original protein folding problem). We present results of the sequence mapping from a 20- to the three-letter code for determining a sequence that folds into the WW domain topology to illustrate future extensions to protein design. PMID:12963815
Detection and isolation of nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1997-01-01

A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided.

Detection and isolation of nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1997-04-01

A method for detecting a target nucleic acid sequence in a sample is provided using hybridization probes which competitively hybridize to a target nucleic acid. According to the method, a target nucleic acid sequence is hybridized to first and second hybridization probes which are complementary to overlapping portions of the target nucleic acid sequence, the first hybridization probe including a first complexing agent capable of forming a binding pair with a second complexing agent and the second hybridization probe including a detectable marker. The first complexing agent attached to the first hybridization probe is contacted with a second complexing agent, the second complexing agent being attached to a solid support such that when the first and second complexing agents are attached, target nucleic acid sequences hybridized to the first hybridization probe become immobilized on to the solid support. The immobilized target nucleic acids are then separated and detected by detecting the detectable marker attached to the second hybridization probe. A kit for performing the method is also provided. 7 figs.
Redesigning Channel-Forming Peptides: Amino Acid Substitutions that Enhance Rates of Supramolecular Self-Assembly and Raise Ion Transport Activity

PubMed Central

Shank, Lalida P.; Broughman, James R.; Takeguchi, Wade; Cook, Gabriel; Robbins, Ashley S.; Hahn, Lindsey; Radke, Gary; Iwamoto, Takeo; Schultz, Bruce D.; Tomich, John M.

2006-01-01

Three series of 22-residue peptides derived from the transmembrane M2 segment of the glycine receptor α1-subunit (M2GlyR) have been designed, synthesized, and tested to determine the plasticity of a channel-forming sequence and to define whether channel pores with enhanced conductive properties could be created. Sixteen sequences were examined for aqueous solubility, solution-association tendency, secondary structure, and half-maximal concentration for supramolecular assembly, channel activity, and ion transport properties across epithelial monolayers. All peptides interact strongly with membranes: associating with, inserting across, and assembling to form homooligomeric bundles when in micromolar concentrations. Single and double amino acid replacements involving arginine and/or aromatic amino acids within the final five C-terminal residues of the peptide cause dramatic effects on the concentration dependence, yielding a range of K1/2 values from 36 ± 5 to 390 ± 220 μM for transport activity. New water/lipid interfacial boundaries were established for the transmembrane segment using charged or aromatic amino acids, thus limiting the peptides' ability to move perpendicularly to the plane of the bilayer. Formation of discrete water/lipid interfacial boundaries appears to be necessary for efficient supramolecular assembly and high anion transport activity. A peptide sequence is identified that may show efficacy in channel replacement therapy for channelopathies such as cystic fibrosis. PMID:16387776
Two Perspectives on the Origin of the Standard Genetic Code

NASA Astrophysics Data System (ADS)

Sengupta, Supratim; Aggarwal, Neha; Bandhu, Ashutosh Vishwa

2014-12-01

The origin of a genetic code made it possible to create ordered sequences of amino acids. In this article we provide two perspectives on code origin by carrying out simulations of code-sequence coevolution in finite populations with the aim of examining how the standard genetic code may have evolved from more primitive code(s) encoding a small number of amino acids. We determine the efficacy of the physico-chemical hypothesis of code origin in the absence and presence of horizontal gene transfer (HGT) by allowing a diverse collection of code-sequence sets to compete with each other. We find that in the absence of horizontal gene transfer, natural selection between competing codes distinguished by differences in the degree of physico-chemical optimization is unable to explain the structure of the standard genetic code. However, for certain probabilities of the horizontal transfer events, a universal code emerges having a structure that is consistent with the standard genetic code.
Identification of the gene for disaggregatase from Methanosarcina mazei.

PubMed

Osumi, Naoki; Kakehashi, Yoshihiro; Matsumoto, Shiho; Nagaoka, Kazunari; Sakai, Junichi; Miyashita, Kiyotaka; Kimura, Makoto; Asakawa, Susumu

2008-12-01

The gene sequences encoding disaggregatase (Dag), the enzyme responsible for dispersion of cell aggregates of Methanosarcina mazei to single cells, were determined for three strains of M. mazei (S-6(T), LYC and TMA). The dag genes of the three strains were 3234 bp in length and had almost the same sequences with 97% amino acid sequence identities. Dag was predicted to comprise 1077 amino acid residues and to have a molecular mass of 120 kDa containing three repeats of the DNRLRE domain in the C terminus, which is specific to the genus Methanosarcina and may be responsible for structural organization and cell wall function. Recombinant Dag was overexpressed in Escherichia coli and preparations of the expressed protein exhibited enzymatic activity. The RT-PCR analysis showed that dag was transcribed to mRNA in M. mazei LYC and indicated that the gene was expressed in vivo. This is the first time the gene involved in the morphological change of Methanosarcina spp. from aggregate to single cells has been identified.
FoxP2 in song-learning birds and vocal-learning mammals.

PubMed

Webb, D M; Zhang, J

2005-01-01

FoxP2 is the first identified gene that is specifically involved in speech and language development in humans. Population genetic studies of FoxP2 revealed a selective sweep in recent human history associated with two amino acid substitutions in exon 7. Avian song learning and human language acquisition share many behavioral and neurological similarities. To determine whether FoxP2 plays a similar role in song-learning birds, we sequenced exon 7 of FoxP2 in multiple song-learning and nonlearning birds. We show extreme conservation of FoxP2 sequences in birds, including unusually low rates of synonymous substitutions. However, no amino acid substitutions are shared between the song-learning birds and humans. Furthermore, sequences from vocal-learning whales, dolphins, and bats do not share the human-unique substitutions. While FoxP2 appears to be under strong functional constraints in mammals and birds, we find no evidence for its role during the evolution of vocal learning in nonhuman animals as in humans.
Cloning and characterization of the ddc homolog encoding L-2,4-diaminobutyrate decarboxylase in Enterobacter aerogenes.

PubMed

Yamamoto, S; Mutoh, N; Tsuzuki, D; Ikai, H; Nakao, H; Shinoda, S; Narimatsu, S; Miyoshi, S I

2000-05-01

L-2,4-diaminobutyrate decarboxylase (DABA DC) catalyzes the formation of 1,3-diaminopropane (DAP) from DABA. In the present study, the ddc gene encoding DABA DC from Enterobacter aerogenes ATCC 13048 was cloned and characterized. Determination of the nucleotide sequence revealed an open reading frame of 1470 bp encoding a 53659-Da protein of 490 amino acids, whose deduced NH2-terminal sequence was identical to that of purified DABA DC from E. aerogenes. The deduced amino acid sequence was highly similar to those of Acinetobacter baumannii and Haemophilus influenzae DABA DCs encoded by the ddc genes. The lysine-307 of the E. aerogenes DABA DC was identified as the pyridoxal 5'-phosphate binding residue by site-directed mutagenesis. Furthermore, PCR analysis revealed the distribution of E. aerogenes ddc homologs in some other species of Enterobacteriaceae. Such a relatively wide occurrence of the ddc homologs implies biological significance of DABA DC and its product DAP.
Physical Model of the Genotype-to-Phenotype Map of Proteins

NASA Astrophysics Data System (ADS)

Tlusty, Tsvi; Libchaber, Albert; Eckmann, Jean-Pierre

2017-04-01

How DNA is mapped to functional proteins is a basic question of living matter. We introduce and study a physical model of protein evolution which suggests a mechanical basis for this map. Many proteins rely on large-scale motion to function. We therefore treat protein as learning amorphous matter that evolves towards such a mechanical function: Genes are binary sequences that encode the connectivity of the amino acid network that makes a protein. The gene is evolved until the network forms a shear band across the protein, which allows for long-range, soft modes required for protein function. The evolution reduces the high-dimensional sequence space to a low-dimensional space of mechanical modes, in accord with the observed dimensional reduction between genotype and phenotype of proteins. Spectral analysis of the space of 1 06 solutions shows a strong correspondence between localization around the shear band of both mechanical modes and the sequence structure. Specifically, our model shows how mutations are correlated among amino acids whose interactions determine the functional mode.
The nucleotide sequence of a segment of Trypanosoma brucei mitochondrial maxi-circle DNA that contains the gene for apocytochrome b and some unusual unassigned reading frames.

PubMed Central

Benne, R; De Vries, B F; Van den Burg, J; Klaver, B

1983-01-01

The nucleotide sequence of a 2.5-kb segment of the maxi-circle of Trypanosoma brucei mtDNA has been determined. The segment contains the gene for apocytochrome b, which displays about 25% homology at the amino acid level to the apocytochrome b gene from fungal and mammalian mtDNAs. Northern blot and S1 nuclease analyses have yielded accurate map positions of an RNA species in an area that coincides with the reading frame. The segment also contains two pairs of overlapping unassigned reading frames, which lack homology with any known mitochondrial gene or URF. The DNA sequence in these areas is AG-rich (70%), resulting in URFs with an unusually high level of glycine and charged amino acids (60%). They may not encode proteins, in spite of their size and the fact that abundant transcripts are mapped in these areas. Images PMID:6314266
Toscana Virus Genome Stability: Data from a Meningoencephalitis Case in Mantua, Italy

PubMed Central

Baggieri, Melissa; Gattuso, Gianni; Fortuna, Claudia; Remoli, Maria Elena; Vaccari, Gabriele; Zaccaria, Guendalina; Marchi, Antonella; Bucci, Paola; Benedetti, Eleonora; Fiorentini, Cristiano; Nicoletti, Loredana

2014-01-01

Abstract In July of 2013, samples from a patient with a neurological syndrome were collected from Mantua hospital and sent to the National Reference Laboratory for Arboviruses (National Institute of Health, Rome). On the basis of the symptoms, serological and molecular assays were performed to diagnose either West Nile virus (WNV) or Toscana virus (TOSV) infection. Molecular and serological tests confirmed TOSV infection. Virus isolation was obtained from cerebrospinal fluid. A full genome sequence was determined from this TOSV strain with next-generation sequencing using Ion Torrent technology. Nucleotide and amino acidic sequences grouped phylogenetically with lineage TOSV A and showed a low genome variability. PMID:25514123
Real-time assays with molecular beacons and other fluorescent nucleic acid hybridization probes.

PubMed

Marras, Salvatore A E; Tyagi, Sanjay; Kramer, Fred Russell

2006-01-01

A number of formats for nucleic acid hybridization have been developed to identify DNA and RNA sequences that are involved in cellular processes and that aid in the diagnosis of genetic and infectious diseases. The introduction of hybridization probes with interactive fluorophore pairs has enabled the development of homogeneous hybridization assays for the direct identification of nucleic acids. A change in the fluorescence of these probes indicates the presence of a target nucleic acid, and there is no need to separate unbound probes from hybridized probes. The advantages of homogeneous hybridization assays are their speed and simplicity. In addition, homogeneous assays can be combined with nucleic acid amplification, enabling the detection of rare target nucleic acids. These assays can be followed in real time, providing quantitative determination of target nucleic acids over a broad range of concentrations.
Predicting membrane protein types by incorporating protein topology, domains, signal peptides, and physicochemical properties into the general form of Chou's pseudo amino acid composition.

PubMed

Chen, Yen-Kuang; Li, Kuo-Bin

2013-02-07

The type information of un-annotated membrane proteins provides an important hint for their biological functions. The experimental determination of membrane protein types, despite being more accurate and reliable, is not always feasible due to the costly laboratory procedures, thereby creating a need for the development of bioinformatics methods. This article describes a novel computational classifier for the prediction of membrane protein types using proteins' sequences. The classifier, comprising a collection of one-versus-one support vector machines, makes use of the following sequence attributes: (1) the cationic patch sizes, the orientation, and the topology of transmembrane segments; (2) the amino acid physicochemical properties; (3) the presence of signal peptides or anchors; and (4) the specific protein motifs. A new voting scheme was implemented to cope with the multi-class prediction. Both the training and the testing sequences were collected from SwissProt. Homologous proteins were removed such that there is no pair of sequences left in the datasets with a sequence identity higher than 40%. The performance of the classifier was evaluated by a Jackknife cross-validation and an independent testing experiments. Results show that the proposed classifier outperforms earlier predictors in prediction accuracy in seven of the eight membrane protein types. The overall accuracy was increased from 78.3% to 88.2%. Unlike earlier approaches which largely depend on position-specific substitution matrices and amino acid compositions, most of the sequence attributes implemented in the proposed classifier have supported literature evidences. The classifier has been deployed as a web server and can be accessed at http://bsaltools.ym.edu.tw/predmpt. Copyright © 2012 Elsevier Ltd. All rights reserved.
Complex alternative splicing of acetylcholinesterase transcripts in Torpedo electric organ; primary structure of the precursor of the glycolipid-anchored dimeric form.

PubMed Central

Sikorav, J L; Duval, N; Anselmet, A; Bon, S; Krejci, E; Legay, C; Osterlund, M; Reimund, B; Massoulié, J

1988-01-01

In this paper, we show the existence of alternative splicing in the 3' region of the coding sequence of Torpedo acetylcholinesterase (AChE). We describe two cDNA structures which both diverge from the previously described coding sequence of the catalytic subunit of asymmetric (A) forms (Schumacher et al., 1986; Sikorav et al., 1987). They both contain a coding sequence followed by a non-coding sequence and a poly(A) stretch. Both of these structures were shown to exist in poly(A)+ RNAs, by S1 mapping experiments. The divergent region encoded by the first sequence corresponds to the precursor of the globular dimeric form (G2a), since it contains the expected C-terminal amino acids, Ala-Cys. These amino acids are followed by a 29 amino acid extension which contains a hydrophobic segment and must be replaced by a glycolipid in the mature protein. Analyses of intact G2a AChE showed that the common domain of the protein contains intersubunit disulphide bonds. The divergent region of the second type of cDNA consists of an adjacent genomic sequence, which is removed as an intron in A and Ga mRNAs, but may encode a distinct, less abundant catalytic subunit. The structures of the cDNA clones indicate that they are derived from minor mRNAs, shorter than the three major transcripts which have been described previously (14.5, 10.5 and 5.5 kb). Oligonucleotide probes specific for the asymmetric and globular terminal regions hybridize with the three major transcripts, indicating that their size is determined by 3'-untranslated regions which are not related to the differential splicing leading to A and Ga forms. Images PMID:3181125
Electron-Transfer Ion/Ion Reactions of Doubly Protonated Peptides: Effect of Elevated Bath Gas Temperature

PubMed Central

Pitteri, Sharon J.; Chrisman, Paul A.; McLuckey, Scott A.

2005-01-01

In this study, the electron-transfer dissociation (ETD) behavior of cations derived from 27 different peptides (22 of which are tryptic peptides) has been studied in a 3D quadrupole ion trap mass spectrometer. Ion/ion reactions between peptide cations and nitrobenzene anions have been examined at both room temperature and in an elevated temperature bath gas environment to form ETD product ions. From the peptides studied, the ETD sequence coverage tends to be inversely related to peptide size. At room temperature, very high sequence coverage (~100%) was observed for small peptides (≤7 amino acids). For medium-sized peptides composed of 8–11 amino acids, the average sequence coverage was 46%. Larger peptides with 14 or more amino acids yielded an average sequence coverage of 23%. Elevated-temperature ETD provided increased sequence coverage over room-temperature experiments for the peptides of greater than 7 residues, giving an average of 67% for medium-sized peptides and 63% for larger peptides. Percent ETD, a measure of the extent of electron transfer, has also been calculated for the peptides and also shows an inverse relation with peptide size. Bath gas temperature does not have a consistent effect on percent ETD, however. For the tryptic peptides, fragmentation is localized at the ends of the peptides suggesting that the distribution of charge within the peptide may play an important role in determining fragmentation sites. A triply protonated peptide has also been studied and shows behavior similar to the doubly charged peptides. These preliminary results suggest that for a given charge state there is a maximum size for which high sequence coverage is obtained and that increasing the bath gas temperature can increase this maximum. PMID:16131079
The glycoprotein genes and gene junctions of the fish rhabdoviruses spring viremia of carp virus and hirame rhabdovirus: Analysis of relationships with other rhabdoviruses

USGS Publications Warehouse

Bjorklund, H.V.; Higman, K.H.; Kurath, G.

1996-01-01

The nucleotide sequences of the glycoprotein genes and all of the internal gene junctions of the fish pathogenic rhabdoviruses spring viremia of carp virus (SVCV) and hirame rhabdovirus (HIRRV) have been determined from cDNA clones generated from viral genomic RNA. The SVCV glycoprotein gene sequence is 1588 nucleotides (nt) long and encodes a 509 amino acid (aa) protein. The HIRRV glycoprotein gene sequence comprises 1612 nt, coding for a 508 aa protein. In sequence comparisons of 15 rhabdovirus glycoproteins, the SVCV glycoprotein gene showed the highest amino acid sequence identity (31.2–33.2%) with vesicular stomatitis New Jersey virus (VSNJV), Chandipura virus (CHPV) and vesicular stomatitis Indiana virus (VSIV). The HIRRV glycoprotein gene showed a very high amino acid sequence identity (74.3%) with the glycoprotein gene of another fish pathogenic rhabdovirus, infectious hematopoietic necrosis virus (IHNV), but no significant similarity with glycoproteins of VSIV or rabies virus (RABV). In phylogenetic analyses SVCV was grouped consistently with VSIV, VSNJV and CHPV in the Vesiculovirus genus of Rhabdoviridae. The fish rhabdoviruses HIRRV, IHNV and viral hemorrhagic septicemia virus (VHSV) showed close relationships with each other, but only very distant relationships with mammalian rhabdoviruses. The gene junctions are highly conserved between SVCV and VSIV, well conserved between IHNV and HIRRV, but not conserved between HIRRV/IHNV and RABV. Based on the combined results we suggest that the fish lyssa-type rhabdoviruses HIRRV, IHNV and VHSV may be grouped in their own genus within the family Rhabdoviridae. Aquarhabdovirus has been proposed for the name of this new genus.
The glycoprotein genes and gene junctions of the fish rhabdoviruses spring viremia of carp virus and hirame rhabdovirus: Analysis of relationships with other rhabdoviruses

USGS Publications Warehouse

Bjorklund, H.V.; Higman, K.H.; Kurath, G.

1996-01-01

The nucleotide sequences of the glycoprotein genes and all of the internal gene junctions of the fish pathogenic rhabdoviruses spring viremia of carp virus (SVCV) and hirame rhabdovirus (HIRRV) have been determined from cDNA clones generated from viral genomic RNA. The SVCV glycoprotein gene sequence is 1588 nucleotides (nt) long and encodes a 509 amino acid (aa) protein. The HIRRV glycoprotein gene sequence comprises 1612 nt, coding for a 508 aa protein. In sequence comparisons of 15 rhabdovirus glycoproteins, the SVCV glycoprotein gene showed the highest amino acid sequence identity (31.2-33.2%) with vesicular stomatitis New Jersey virus (VSNJV), Chandipura virus (CHPV) and vesicular stomatitis Indiana virus (VSIV). The HIRRV glycoprotein gene showed a very high amino acid sequence identity (74.3%) with the glycoprotein gene of another fish pathogenic rhabdovirus, infectious hematopoietic necrosis virus (IHNV), but no significant similarity with glycoproteins of VSIV or rabies virus (RABV). In phylogenetic analyses SVCV was grouped consistently with VSIV, VSNJV and CHPV in the Vesiculovirus genus of Rhabdoviridae. The fish rhabdoviruses HIRRV, IHNV and viral hemorrhagic septicemia virus (VHSV) showed close relationships with each other, but only very distant relationships with mammalian rhabdoviruses. The gene junctions are highly conserved between SVCV and VSIV, well conserved between IHNV and HIRRV, but not conserved between HIRRV/IHNV and RABV. Based on the combined results we suggest that the fish lyssa-type rhabdoviruses HIRRV, IHNV and VHSV may be grouped in their own genus within the family Rhabdoviridae. Aquarhabdovirus has been proposed for the name of this new genus.
Detection of nucleic acids by multiple sequential invasive cleavages

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.

1999-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Nucleic acid detection kits

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann; Kwiatkowski, Robert W.; Vavra, Stephanie H.

2005-03-29

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of nucleic acid from various viruses in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages 02

DOEpatents

Hall, Jeff G.; Lyamichev, Victor I.; Mast, Andrea L.; Brow, Mary Ann D.

2002-01-01

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
Detection of nucleic acids by multiple sequential invasive cleavages

DOEpatents

Hall, Jeff G; Lyamichev, Victor I; Mast, Andrea L; Brow, Mary Ann D

2012-10-16

The present invention relates to means for the detection and characterization of nucleic acid sequences, as well as variations in nucleic acid sequences. The present invention also relates to methods for forming a nucleic acid cleavage structure on a target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. The structure-specific nuclease activity of a variety of enzymes is used to cleave the target-dependent cleavage structure, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. The present invention further relates to methods and devices for the separation of nucleic acid molecules based on charge. The present invention also provides methods for the detection of non-target cleavage products via the formation of a complete and activated protein binding region. The invention further provides sensitive and specific methods for the detection of human cytomegalovirus nucleic acid in a sample.
The mitochondrial genome of Protostrongylus rufescens – implications for population and systematic studies

PubMed Central

2013-01-01

Background Protostrongylus rufescens is a metastrongyloid nematode of small ruminants, such as sheep and goats, causing protostrongylosis. In spite of its importance, the ecology and epidemiology of this parasite are not entirely understood. In addition, genetic data are scant for P. rufescens and related metastrongyloids. Methods The mt genome was amplified from a single adult worm of P. rufescens (from sheep) by long-PCR, sequenced using 454-technology and annotated using bioinformatic tools. Amino acid sequences inferred from individual genes of the mt genomes were concatenated and subjected to phylogenetic analysis using Bayesian inference. Results The circular mitochondrial genome was 13,619 bp in length and contained two ribosomal RNA, 12 protein-coding and 22 transfer RNA genes, consistent with nematodes of the order Strongylida for which mt genomes have been determined. Phylogenetic analysis of the concatenated amino acid sequence data for the 12 mt proteins showed that P. rufescens was closely related to Aelurostrongylus abstrusus, Angiostrongylus vasorum, Angiostrongylus cantonensis and Angiostrongylus costaricensis. Conclusions The mt genome determined herein provides a source of markers for future investigations of P. rufescens. Molecular tools, employing such mt markers, are likely to find applicability in studies of the population biology of this parasite and the systematics of lungworms. PMID:24025317

Modular probes for enriching and detecting complex nucleic acid sequences

NASA Astrophysics Data System (ADS)

Wang, Juexiao Sherry; Yan, Yan Helen; Zhang, David Yu

2017-12-01

Complex DNA sequences are difficult to detect and profile, but are important contributors to human health and disease. Existing hybridization probes lack the capability to selectively bind and enrich hypervariable, long or repetitive sequences. Here, we present a generalized strategy for constructing modular hybridization probes (M-Probes) that overcomes these challenges. We demonstrate that M-Probes can tolerate sequence variations of up to 7 nt at prescribed positions while maintaining single nucleotide sensitivity at other positions. M-Probes are also shown to be capable of sequence-selectively binding a continuous DNA sequence of more than 500 nt. Furthermore, we show that M-Probes can detect genes with triplet repeats exceeding a programmed threshold. As a demonstration of this technology, we have developed a hybrid capture method to determine the exact triplet repeat expansion number in the Huntington's gene of genomic DNA using quantitative PCR.
Method of increasing conversion of a fatty acid to its corresponding dicarboxylic acid

DOEpatents

Craft, David L.; Wilson, C. Ron; Eirich, Dudley; Zhang, Yeyan

2004-09-14

A nucleic acid sequence including a CYP promoter operably linked to nucleic acid encoding a heterologous protein is provided to increase transcription of the nucleic acid. Expression vectors and host cells containing the nucleic acid sequence are also provided. The methods and compositions described herein are especially useful in the production of polycarboxylic acids by yeast cells.
Characterization of the Complete Mitochondrial Genome Sequence of Spirometra erinaceieuropaei (Cestoda: Diphyllobothriidae) from China

PubMed Central

Liu, Guo-Hua; Li, Chun; Li, Jia-Yuan; Zhou, Dong-Hui; Xiong, Rong-Chuan; Lin, Rui-Qing; Zou, Feng-Cai; Zhu, Xing-Quan

2012-01-01

Sparganosis, caused by the plerocercoid larvae of members of the genus Spirometra, can cause significant public health problem and considerable economic losses. In the present study, the complete mitochondrial DNA (mtDNA) sequence of Spirometra erinaceieuropaei from China was determined, characterized and compared with that of S. erinaceieuropaei from Japan. The gene arrangement in the mt genome sequences of S. erinaceieuropaei from China and Japan is identical. The identity of the mt genomes was 99.1% between S. erinaceieuropaei from China and Japan, and the complete mtDNA sequence of S. erinaceieuropaei from China is slightly shorter (2 bp) than that from Japan. Phylogenetic analysis of S. erinaceieuropaei with other representative cestodes using two different computational algorithms [Bayesian inference (BI) and maximum likelihood (ML)] based on concatenated amino acid sequences of 12 protein-coding genes, revealed that S. erinaceieuropaei is closely related to Diphyllobothrium spp., supporting classification based on morphological features. The present study determined the complete mtDNA sequences of S. erinaceieuropaei from China that provides novel genetic markers for studying the population genetics and molecular epidemiology of S. erinaceieuropaei in humans and animals. PMID:22553464
Cell-free fetal nucleic acid testing: a review of the technology and its applications.

PubMed

Sayres, Lauren C; Cho, Mildred K

2011-07-01

Cell-free fetal nucleic acids circulating in the blood of pregnant women afford the opportunity for early, noninvasive prenatal genetic testing. The predominance of admixed maternal genetic material in circulation demands innovative means for identification and analysis of cell-free fetal DNA and RNA. Techniques using polymerase chain reaction, mass spectrometry, and sequencing have been developed for the purposes of detecting fetal-specific sequences, such as paternally inherited or de novo mutations, or determining allelic balance or chromosome dosage. Clinical applications of these methods include fetal sex determination and blood group typing, which are currently available commercially although not offered routinely in the United States. Other uses of cell-free fetal DNA and RNA being explored are the detection of single-gene disorders, chromosomal abnormalities, and inheritance of parental polymorphisms across the whole fetal genome. The concentration of cell-free fetal DNA may also provide predictive capabilities for pregnancy-associated complications. The roles that cell-free fetal nucleic acid testing assume in the existing framework of prenatal screening and invasive diagnostic testing will depend on factors such as costs, clinical validity and utility, and perceived benefit-risk ratios for different applications. As cell-free fetal DNA and RNA testing continues to be developed and translated, significant ethical, legal, and social questions will arise that will need to be addressed by those with a stake in the use of this technology. Obstetricians & Gynecologists and Family Physicians Learning Objectives: After participating in this activity, physicians should be better able to evaluate techniques and tools for analyzing cell-free fetal nucleic acids, assess clinical applications of prenatal testing, using cell-free fetal nucleic acids and barriers to implementation, and distinguish between relevant clinical features of cell-free fetal nucleic acid testing and existing prenatal genetic screening and diagnostic procedures.
Characterization of myosin heavy chain and its gene in Amoeba proteus.

PubMed

Oh, S W; Jeon, K W

1998-01-01

Monoclonal antibodies against the myosin heavy chain of Amoeba proteus were obtained and used to localize myosin inside amoebae and to clone cDNAs encoding myosin. Myosin was found throughout the amoeba cytoplasm but was more concentrated in the ectoplasmic regions as determined by indirect immunofluorescence microscopy. In symbiont-bearing xD amoebae, myosin was also found on the symbiosome membranes, as checked by indirect immunofluorescence microscopy and by immunoelectron microscopy. The open reading frame of a cloned myosin cDNA contained 6,414 nucleotides, coding for a polypeptide of 2,138 amino acids. While the amino-acid sequence of the globular head region of amoeba's myosin had a high degree of similarity with that of myosins from various organisms, the tail region building a coiled-coil structure did not show a significant sequence similarity. There appeared to be at least three different isoforms of myosins in amoebae, with closely related amino acids in the globular head region.
An artificial molecular machine that builds an asymmetric catalyst

NASA Astrophysics Data System (ADS)

De Bo, Guillaume; Gall, Malcolm A. Y.; Kuschel, Sonja; De Winter, Julien; Gerbaux, Pascal; Leigh, David A.

2018-05-01

Biomolecular machines perform types of complex molecular-level tasks that artificial molecular machines can aspire to. The ribosome, for example, translates information from the polymer track it traverses (messenger RNA) to the new polymer it constructs (a polypeptide)1. The sequence and number of codons read determines the sequence and number of building blocks incorporated into the biomachine-synthesized polymer. However, neither control of sequence2,3 nor the transfer of length information from one polymer to another (which to date has only been accomplished in man-made systems through template synthesis)4 is easily achieved in the synthesis of artificial macromolecules. Rotaxane-based molecular machines5-7 have been developed that successively add amino acids8-10 (including β-amino acids10) to a growing peptide chain by the action of a macrocycle moving along a mono-dispersed oligomeric track derivatized with amino-acid phenol esters. The threaded macrocycle picks up groups that block its path and links them through successive native chemical ligation reactions11 to form a peptide sequence corresponding to the order of the building blocks on the track. Here, we show that as an alternative to translating sequence information, a rotaxane molecular machine can transfer the narrow polydispersity of a leucine-ester-derivatized polystyrene chain synthesized by atom transfer radical polymerization12 to a molecular-machine-made homo-leucine oligomer. The resulting narrow-molecular-weight oligomer folds to an α-helical secondary structure13 that acts as an asymmetric catalyst for the Juliá-Colonna epoxidation14,15 of chalcones.
The complete amino acid sequence of echinoidin, a lectin from the coelomic fluid of the sea urchin Anthocidaris crassispina. Homologies with mammalian and insect lectins.

PubMed

Giga, Y; Ikai, A; Takahashi, K

1987-05-05

The complete amino acid sequence of echinoidin, the proposed name for a lectin from the coelomic fluid of the sea urchin Anthocidaris crassispina, has been determined by sequencing the peptides obtained from tryptic, Staphylococcus aureus V8 protease, chymotryptic, and thermolysin digestions. Echinoidin is a multimeric protein (Giga, Y., Sutoh, K., and Ikai, A. (1985) Biochemistry 24, 4461-4467) whose subunit consists of a total of 147 amino acid residues and one carbohydrate chain attached to Ser38. The molecular weight of the polypeptide without carbohydrate was calculated to be 16,671. Each polypeptide chain contains seven half-cystines, and six of them form three disulfide bonds in the single polypeptide chain (Cys3-Cys14, Cys31-Cys141, and Cys116-Cys132), while Cys2 is involved in an interpolypeptide disulfide linkage. From secondary structure prediction by the method of Chou and Fasman (Chou, P. Y., and Fasman, G. D. (1974) Biochemistry 13, 211-222) the protein appears to be rich in beta-sheet and beta-turn structures and poor in alpha-helical structure. The sequence of the COOH-terminal half of echinoidin is highly homologous to those of the COOH-terminal carbohydrate recognition portions of rat liver mannose-binding protein and several other hepatic lectins. This COOH-terminal region of echinoidin is also homologous to the central portion of the lectin from the flesh fly Sarcophaga peregrina. Moreover, echinoidin contains an Arg-Gly-Asp sequence which has been proposed to be a basic functional unit in cellular recognition proteins.
Molecular evolution of Dmrt1 accompanies change of sex-determining mechanisms in reptilia.

PubMed

Janes, Daniel E; Organ, Christopher L; Stiglec, Rami; O'Meally, Denis; Sarre, Stephen D; Georges, Arthur; Graves, Jennifer A M; Valenzuela, Nicole; Literman, Robert A; Rutherford, Kim; Gemmell, Neil; Iverson, John B; Tamplin, Jeffrey W; Edwards, Scott V; Ezaz, Tariq

2014-12-01

In reptiles, sex-determining mechanisms have evolved repeatedly and reversibly between genotypic and temperature-dependent sex determination. The gene Dmrt1 directs male determination in chicken (and presumably other birds), and regulates sex differentiation in animals as distantly related as fruit flies, nematodes and humans. Here, we show a consistent molecular difference in Dmrt1 between reptiles with genotypic and temperature-dependent sex determination. Among 34 non-avian reptiles, a convergently evolved pair of amino acids encoded by sequence within exon 2 near the DM-binding domain of Dmrt1 distinguishes species with either type of sex determination. We suggest that this amino acid shift accompanied the evolution of genotypic sex determination from an ancestral condition of temperature-dependent sex determination at least three times among reptiles, as evident in turtles, birds and squamates. This novel hypothesis describes the evolution of sex-determining mechanisms as turnover events accompanied by one or two small mutations. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
Molecular evolution of Dmrt1 accompanies change of sex-determining mechanisms in reptilia

PubMed Central

Janes, Daniel E.; Organ, Christopher L.; Stiglec, Rami; O'Meally, Denis; Sarre, Stephen D.; Georges, Arthur; Graves, Jennifer A. M.; Valenzuela, Nicole; Literman, Robert A.; Rutherford, Kim; Gemmell, Neil; Iverson, John B.; Tamplin, Jeffrey W.; Edwards, Scott V.; Ezaz, Tariq

2014-01-01

In reptiles, sex-determining mechanisms have evolved repeatedly and reversibly between genotypic and temperature-dependent sex determination. The gene Dmrt1 directs male determination in chicken (and presumably other birds), and regulates sex differentiation in animals as distantly related as fruit flies, nematodes and humans. Here, we show a consistent molecular difference in Dmrt1 between reptiles with genotypic and temperature-dependent sex determination. Among 34 non-avian reptiles, a convergently evolved pair of amino acids encoded by sequence within exon 2 near the DM-binding domain of Dmrt1 distinguishes species with either type of sex determination. We suggest that this amino acid shift accompanied the evolution of genotypic sex determination from an ancestral condition of temperature-dependent sex determination at least three times among reptiles, as evident in turtles, birds and squamates. This novel hypothesis describes the evolution of sex-determining mechanisms as turnover events accompanied by one or two small mutations. PMID:25540158
WEB-server for search of a periodicity in amino acid and nucleotide sequences

NASA Astrophysics Data System (ADS)

E Frenkel, F.; Skryabin, K. G.; Korotkov, E. V.

2017-12-01

A new web server (http://victoria.biengi.ac.ru/splinter/login.php) was designed and developed to search for periodicity in nucleotide and amino acid sequences. The web server operation is based upon a new mathematical method of searching for multiple alignments, which is founded on the position weight matrices optimization, as well as on implementation of the two-dimensional dynamic programming. This approach allows the construction of multiple alignments of the indistinctly similar amino acid and nucleotide sequences that accumulated more than 1.5 substitutions per a single amino acid or a nucleotide without performing the sequences paired comparisons. The article examines the principles of the web server operation and two examples of studying amino acid and nucleotide sequences, as well as information that could be obtained using the web server.
Complete genome sequence and description of Salinispira pacifica gen. nov., sp. nov., a novel spirochaete isolated form a hypersaline microbial mat.

PubMed

Ben Hania, Wajdi; Joseph, Manon; Schumann, Peter; Bunk, Boyke; Fiebig, Anne; Spröer, Cathrin; Klenk, Hans-Peter; Fardeau, Marie-Laure; Spring, Stefan

2015-01-01

During a study of the anaerobic microbial community of a lithifying hypersaline microbial mat of Lake 21 on the Kiritimati atoll (Kiribati Republic, Central Pacific) strain L21-RPul-D2(T) was isolated. The closest phylogenetic neighbor was Spirochaeta africana Z-7692(T) that shared a 16S rRNA gene sequence identity value of 90% with the novel strain and thus was only distantly related. A comprehensive polyphasic study including determination of the complete genome sequence was initiated to characterize the novel isolate. Cells of strain L21-RPul-D2(T) had a size of 0.2 - 0.25 × 8-9 μm, were helical, motile, stained Gram-negative and produced an orange carotenoid-like pigment. Optimal conditions for growth were 35°C, a salinity of 50 g/l NaCl and a pH around 7.0. Preferred substrates for growth were carbohydrates and a few carboxylic acids. The novel strain had an obligate fermentative metabolism and produced ethanol, acetate, lactate, hydrogen and carbon dioxide during growth on glucose. Strain L21-RPul-D2(T) was aerotolerant, but oxygen did not stimulate growth. Major cellular fatty acids were C14:0, iso-C15:0, C16:0 and C18:0. The major polar lipids were an unidentified aminolipid, phosphatidylglycerol, an unidentified phospholipid and two unidentified glycolipids. Whole-cell hydrolysates contained L-ornithine as diagnostic diamino acid of the cell wall peptidoglycan. The complete genome sequence was determined and annotated. The genome comprised one circular chromosome with a size of 3.78 Mbp that contained 3450 protein-coding genes and 50 RNA genes, including 2 operons of ribosomal RNA genes. The DNA G + C content was determined from the genome sequence as 51.9 mol%. There were no predicted genes encoding cytochromes or enzymes responsible for the biosynthesis of respiratory lipoquinones. Based on significant differences to the uncultured type species of the genus Spirochaeta, S. plicatilis, as well as to any other phylogenetically related cultured species it is suggested to place strain L21-RPul-D2(T) (=DSM 27196(T) = JCM 18663(T)) in a novel species and genus, for which the name Salinispira pacifica gen. nov., sp. nov. is proposed.
Molecular characterization of the spike gene of the porcine epidemic diarrhea virus in Mexico, 2013-2016.

PubMed

Lara-Romero, Rocío; Gómez-Núñez, Luis; Cerriteño-Sánchez, José Luis; Márquez-Valdelamar, Laura; Mendoza-Elvira, Susana; Ramírez-Mendoza, Humberto; Rivera-Benítez, José Francisco

2018-04-01

In Mexico, the first outbreaks suggestive of the circulation of the porcine epidemic diarrhea virus (PEDV) were identified at the beginning of July 2013. To identify the molecular characteristics of the PEDV Spike (S) gene in Mexico, 116 samples of the intestine and diarrhea of piglets with clinical signs of porcine epidemic diarrhea (PED) were obtained. Samples were collected from 14 farms located in six states of Mexico (Jalisco, Puebla, Sonora, Veracruz, Guanajuato, and Michoacán) from 2013 to 2016. To identify PEDV, we used real-time RT-PCR to discriminate between non-INDEL and INDEL strains. We chose samples according to state and year to characterize the S gene. After amplification of the S gene, the obtained products were sequenced and assembled. The complete amino acid sequences of the spike protein were used to perform an epitope analysis, which was used to determine null mutations in regions SS2, SS6, and 2C10 compared to the sequences of G2. A phylogenetic analysis determined the circulation of G2b and INDEL strains in Mexico. However, several mutations were recorded in the collagenase equivalent (COE) region that were related to the change in polarity and charge of the amino acid residues. The PEDV strain circulating in Jalisco in 2016 has an insertion of three amino acids ( 232 LGL 234 ) and one change in the antigenic site of the COE region, and strains from the years 2015 and 2016 changed the index of the surface probability, which could be related to the re-emergence of disease outbreaks.
A Next-Generation Sequencing Primer—How Does It Work and What Can It Do?

PubMed Central

Alekseyev, Yuriy O.; Fazeli, Roghayeh; Yang, Shi; Basran, Raveen; Miller, Nancy S.

2018-01-01

Next-generation sequencing refers to a high-throughput technology that determines the nucleic acid sequences and identifies variants in a sample. The technology has been introduced into clinical laboratory testing and produces test results for precision medicine. Since next-generation sequencing is relatively new, graduate students, medical students, pathology residents, and other physicians may benefit from a primer to provide a foundation about basic next-generation sequencing methods and applications, as well as specific examples where it has had diagnostic and prognostic utility. Next-generation sequencing technology grew out of advances in multiple fields to produce a sophisticated laboratory test with tremendous potential. Next-generation sequencing may be used in the clinical setting to look for specific genetic alterations in patients with cancer, diagnose inherited conditions such as cystic fibrosis, and detect and profile microbial organisms. This primer will review DNA sequencing technology, the commercialization of next-generation sequencing, and clinical uses of next-generation sequencing. Specific applications where next-generation sequencing has demonstrated utility in oncology are provided. PMID:29761157
Determining divergence times with a protein clock: update and reevaluation

NASA Technical Reports Server (NTRS)

Feng, D. F.; Cho, G.; Doolittle, R. F.; Bada, J. L. (Principal Investigator)

1997-01-01

A recent study of the divergence times of the major groups of organisms as gauged by amino acid sequence comparison has been expanded and the data have been reanalyzed with a distance measure that corrects for both constraints on amino acid interchange and variation in substitution rate at different sites. Beyond that, the availability of complete genome sequences for several eubacteria and an archaebacterium has had a great impact on the interpretation of certain aspects of the data. Thus, the majority of the archaebacterial sequences are not consistent with currently accepted views of the Tree of Life which cluster the archaebacteria with eukaryotes. Instead, they are either outliers or mixed in with eubacterial orthologs. The simplest resolution of the problem is to postulate that many of these sequences were carried into eukaryotes by early eubacterial endosymbionts about 2 billion years ago, only very shortly after or even coincident with the divergence of eukaryotes and archaebacteria. The strong resemblances of these same enzymes among the major eubacterial groups suggest that the cyanobacteria and Gram-positive and Gram-negative eubacteria also diverged at about this same time, whereas the much greater differences between archaebacterial and eubacterial sequences indicate these two groups may have diverged between 3 and 4 billion years ago.
Nucleic acid molecules encoding isopentenyl monophosphate kinase, and methods of use

DOEpatents

Croteau, Rodney B.; Lange, Bernd M.

2001-01-01

A cDNA encoding isopentenyl monophosphate kinase (IPK) from peppermint (Mentha x piperita) has been isolated and sequenced, and the corresponding amino acid sequence has been determined. Accordingly, an isolated DNA sequence (SEQ ID NO:1) is provided which codes for the expression of isopentenyl monophosphate kinase (SEQ ID NO:2), from peppermint (Mentha x piperita). In other aspects, replicable recombinant cloning vehicles are provided which code for isopentenyl monophosphate kinase, or for a base sequence sufficiently complementary to at least a portion of isopentenyl monophosphate kinase DNA or RNA to enable hybridization therewith. In yet other aspects, modified host cells are provided that have been transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence encoding isopentenyl monophosphate kinase. Thus, systems and methods are provided for the recombinant expression of the aforementioned recombinant isopentenyl monophosphate kinase that may be used to facilitate its production, isolation and purification in significant amounts. Recombinant isopentenyl monophosphate kinase may be used to obtain expression or enhanced expression of isopentenyl monophosphate kinase in plants in order to enhance the production of isopentenyl monophosphate kinase, or isoprenoids derived therefrom, or may be otherwise employed for the regulation or expression of isopentenyl monophosphate kinase, or the production of its products.
Identification of a new phospholipase D in Carica papaya latex.

PubMed

Abdelkafi, Slim; Abousalham, Abdelkarim; Fendri, Imen; Ogata, Hiroyuki; Barouh, Nathalie; Fouquet, Benjamin; Scheirlinckx, Frantz; Villeneuve, Pierre; Carrière, Frédéric

2012-05-15

Phospholipase D (PLD) is a lipolytic enzyme involved in signal transduction, vesicle trafficking and membrane metabolism. It catalyzes the hydrolysis and transphosphatidylation of glycerophospholipids at the terminal phosphodiester bond. The presence of a PLD in the latex of Carica papaya (CpPLD1) was demonstrated by transphosphatidylation of phosphatidylcholine (PtdCho) in the presence of 2% ethanol. Although the protein could not be purified to homogeneity due to its presence in high molecular mass aggregates, a protein band was separated by SDS-PAGE after SDS/chloroform-methanol/TCA-acetone extraction of the latex insoluble fraction. This material was digested with trypsin and the amino acid sequences of the tryptic peptides were determined by micro-LC/ESI/MS/MS. These sequences were used to identify a partial cDNA (723 bp) from expressed sequence tags (ESTs) of C. papaya. Based upon EST sequences, a full-length gene was identified in the genome of C. papaya, with an open reading frame of 2424 bp encoding a protein of 808 amino acid residues, with a theoretical molecular mass of 92.05 kDa. From sequence analysis, CpPLD1 was identified as a PLD belonging to the plant phosphatidylcholine phosphatidohydrolase family. Copyright © 2012 Elsevier B.V. All rights reserved.
Efficient farnesylation of an extended C-terminal C(x)3X sequence motif expands the scope of the prenylated proteome.

PubMed

Blanden, Melanie J; Suazo, Kiall F; Hildebrandt, Emily R; Hardgrove, Daniel S; Patel, Meet; Saunders, William P; Distefano, Mark D; Schmidt, Walter K; Hougland, James L

2018-02-23

Protein prenylation is a post-translational modification that has been most commonly associated with enabling protein trafficking to and interaction with cellular membranes. In this process, an isoprenoid group is attached to a cysteine near the C terminus of a substrate protein by protein farnesyltransferase (FTase) or protein geranylgeranyltransferase type I or II (GGTase-I and GGTase-II). FTase and GGTase-I have long been proposed to specifically recognize a four-amino acid C AAX C-terminal sequence within their substrates. Surprisingly, genetic screening reveals that yeast FTase can modify sequences longer than the canonical C AAX sequence, specifically C( x ) 3 X sequences with four amino acids downstream of the cysteine. Biochemical and cell-based studies using both peptide and protein substrates reveal that mammalian FTase orthologs can also prenylate C( x ) 3 X sequences. As the search to identify physiologically relevant C( x ) 3 X proteins begins, this new prenylation motif nearly doubles the number of proteins within the yeast and human proteomes that can be explored as potential FTase substrates. This work expands our understanding of prenylation's impact within the proteome, establishes the biologically relevant reactivity possible with this new motif, and opens new frontiers in determining the impact of non-canonically prenylated proteins on cell function. © 2018 by The American Society for Biochemistry and Molecular Biology, Inc.
Protein structure and the sequential structure of mRNA: alpha-helix and beta-sheet signals at the nucleotide level.

PubMed

Brunak, S; Engelbrecht, J

1996-06-01

A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
Sequence specificity of single-stranded DNA-binding proteins: a novel DNA microarray approach

PubMed Central

Morgan, Hugh P.; Estibeiro, Peter; Wear, Martin A.; Max, Klaas E.A.; Heinemann, Udo; Cubeddu, Liza; Gallagher, Maurice P.; Sadler, Peter J.; Walkinshaw, Malcolm D.

2007-01-01

We have developed a novel DNA microarray-based approach for identification of the sequence-specificity of single-stranded nucleic-acid-binding proteins (SNABPs). For verification, we have shown that the major cold shock protein (CspB) from Bacillus subtilis binds with high affinity to pyrimidine-rich sequences, with a binding preference for the consensus sequence, 5′-GTCTTTG/T-3′. The sequence was modelled onto the known structure of CspB and a cytosine-binding pocket was identified, which explains the strong preference for a cytosine base at position 3. This microarray method offers a rapid high-throughput approach for determining the specificity and strength of ss DNA–protein interactions. Further screening of this newly emerging family of transcription factors will help provide an insight into their cellular function. PMID:17488853
Molecular characterization of two prunus necrotic ringspot virus isolates from Canada.

PubMed

Cui, Hongguang; Hong, Ni; Wang, Guoping; Wang, Aiming

2012-05-01

We determined the entire RNA1, 2 and 3 sequences of two prunus necrotic ringspot virus (PNRSV) isolates, Chr3 from cherry and Pch12 from peach, obtained from an orchard in the Niagara Fruit Belt, Canada. The RNA1, 2 and 3 of the two isolates share nucleotide sequence identities of 98.6%, 98.4% and 94.5%, respectively. Their RNA1- and 2-encoded amino acid sequences are about 98% identical to the corresponding sequences of a cherry isolate, CH57, the only other PNRSV isolate with complete RNA1 and 2 sequences available. Phylogenetic analysis of the coat protein and movement protein encoded by RNA3 of Pch12 and Chr3 and published PNRSV isolates indicated that Chr3 belongs to the PV96 group and Pch12 belongs to the PV32 group.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.